The Necessary Roadblock to Artificial General Intelligence: Corrigibility

Published in AI Matters, Special Interest Group on Artificial Intelligence, Association for Computing Machinery, 2019

Lo, Yat Long, Chung Yu Woo, and Ka Lok Ng. The Necessary Roadblock to Artificial General Intelligence: Corrigibility. AI Matters. 2019. (Winner of 2018 ACM SIGAI Student Essay Contest on Artificial Intelligence Technologies)

[PDF]

Abstract

With the rapid pace of advancement in the field of artificial intelligence (AI), this essay aims to accentuate the importance of corrigibility in AI in order to stimulate and catalyze more effort and focus in this research area. We will first introduce the idea of corrigibility with its properties and describe the expected behavior for a corrigible AI. Afterwards, based on the established meaning of corrigibility, we will showcase the importance of corrigibility by going over some modern and near-futuristic examples that are specifically selected to be relatable and foreseeable. Then, we will explore existing methods of establishing corrigibility in agents and their respective limitations, using the reinforcement learning (RL) framework as a proxy framework to artificial general intelligence (AGI). At last, we will identify the central themes of potential research frontiers that we believe would be crucial to boosting quality research output in corrigibility.