Posts | Open Lab Room

How RL Agents Behave When Their Actions Are Modified [Distillation post]

Summary

Posted on May 20, 2022 · 19 mins read

An Open Philanthropy grant proposal: Causal representation learning of human preferences

This is a proposal I wrote for the recent Open Philanthropy call on AI Alignment...

Posted on January 11, 2022 · 18 mins read