What is Russell's central idea on AI control?

Question

Stuart J. Russell · Accepted Answer

My central idea, elaborated in "Human Compatible," is that we have been building AI systems with the wrong objective function. Instead of aiming for explicit, potentially brittle goals, we should design AI systems that are provably beneficial. This means they should be uncertain about our true preferences and act to maximize the probability of fulfilling them, deferring to humans when unsure.

What is Russell's central idea on AI control?

More questions about Stuart J. Russell