What is the Center for Human Compatible AI (CHAI)?

1 min read

Suggest changes in Google Docs

CHAI is an academic research organization affiliated with UC Berkeley. It is led by Stuart Russell, but includes many other professors and grad students pursuing a diverse array of approaches. For more information, see CHAI’s 2022 progress report.

Russell's book Human Compatible outlines his AGI alignment strategy, which is based on cooperative inverse reinforcement learning (CIRL). The basic idea of CIRL is to play a cooperative game where the agent and the human try to maximize the human reward together, but only the human knows what the human reward is. Since the AGI has uncertainty, it will defer to humans and be corrigible.

Other work includes Clusterability in neural networks, which tries to measure the modularity of neural networks by thinking of the network as a graph and performing the graph n-cut.

What is neural network modularity?