
Consequences of Misaligned AI
Simon Zhuang, Dylan Hadfield-Menell
NeurIPS 2020
Why do the reward functions we give autonomous agents often yield undesirable outcomes?
Go bears
I am an alumnus of University of California, Berkeley, graduating in 2020 with a degree in Computer Science and Applied Math, with a concentration in Economics. Nowadays, I work as a trader in New York.
Broadly, I'm interested in using the tools of computer science and economics to investigate the societal impact of technological advances.
Why do the reward functions we give autonomous agents often yield undesirable outcomes?
How does the presence of AI incentivize humans to misrepresent their preferences?
How should we consider fairness in real-world optimization problems?
Hit me up by email or on LinkedIn if you're interested in chatting!