How RL Agents Behave When Their Actions Are Modified? [Distillation post]
Summary
Posted on May 20, 2022 · 19 mins read

An Open Philanthropy grant proposal: Causal representation learning of human preferences
This is a proposal I wrote for the recent Open Philanthropy call on AI Alignment...
Posted on January 11, 2022 · 18 mins read