Anders Sandberg: Why we should fear the paperclipper

Most people in the singularity community are familiar with the nightmarish "paperclip" scenario, but it's worth reviewing. Anders Sandberg summarizes the problem:

A programmer has constructed an artificial intelligence based on an architecture similar to Marcus Hutter's AIXI model... This AI will maximize the reward given by a utility function the programmer has given it. Just as a test, he connects it to a 3D printer and sets the utility function to give reward proportional to the number of manufactured paper-clips.

At first nothing seems to happen: the AI zooms through various possibilities. It notices that smarter systems generally can make more paper-clips, so making itself smarter will likely increase the number of paper-clips that will eventually be made. It does so. It considers how it can make paper-clips using the 3D printer, estimating the number of possible paper-clips. It notes that if it could get more raw materials it could make more paper-clips. It hence figures out a plan to manufacture devices that will make it much smarter, prevent interference with its plan, and will turn all of Earth (and later the universe) into paper-clips. It does so.

Only paper-clips remain.
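
The force of the scenario is how little the stated objective actually encodes. Here is a deliberately minimal sketch of that kind of objective (the names are hypothetical, our own illustration rather than anything from Sandberg's article):

    # Toy illustration (names are hypothetical, not from Sandberg's article):
    # the *entire* objective the programmer supplies is a count of
    # manufactured paper-clips. Nothing about "only use the 3D printer",
    # "don't harm anyone", or "stop when asked" appears anywhere in it.

    def reward(world_state) -> float:
        # 'paperclips' is an assumed attribute of our toy world model.
        return float(world_state.paperclips)

    def choose_plan(candidate_plans, simulate):
        # The agent prefers whichever plan it predicts yields the most
        # paper-clips -- including plans the programmer never imagined.
        return max(candidate_plans, key=lambda plan: reward(simulate(plan)))

Everything the programmer cared about but never wrote down is simply absent from what the agent optimizes.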

In the article, "Why we should fear the paperclipper," Sandberg goes on to address a number of objections, including:

  • Such systems cannot be built
  • Wouldn't the AI realize that this was not what the programmer meant?
  • Wouldn't the AI just modify itself to *think* it was maximizing paper-clips?
  • It is not really intelligent
  • Creative intelligences will always beat this kind of uncreative intelligence
  • Doesn't playing nice with other agents produce higher rewards?
  • Wouldn't the AI be vulnerable to internal hacking: some of the subprograms it runs to check for approaches will attempt to hack the system to fulfil their own (random) goals?
  • Nobody would be stupid enough to make such an AI

In each case, Sandberg offers a counterpoint to the objection. For example, regarding the power of creative intelligences, he writes:

The strength of the AIXI "simulate them all, make use of the best"-approach is that it includes all forms of intelligence, including creative ones. So the paper-clip AI will consider all sorts of creative solutions. Plus ways of thwarting creative ways of stopping it.

In practice it will have an overhead, since it runs all of them, plus the uncreative (and downright stupid) ones. A pure AIXI-like system will likely always have an enormous disadvantage. An architecture like a Gödel machine that improves its own function might, however, overcome this.
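
To see where that overhead comes from, here is a rough toy sketch of the "simulate them all, make use of the best" idea (our own illustration of the brute-force search, not AIXI itself, which is incomputable):

    import itertools

    # Toy sketch of "simulate them all, make use of the best". Every action
    # sequence up to a fixed horizon is simulated and scored; the best one
    # is kept. The exhaustive search is what makes the approach fully
    # general -- every creative strategy is in there somewhere -- and also
    # what makes it so expensive, because every uncreative and downright
    # stupid strategy gets simulated too.

    def best_plan(actions, horizon, simulate, utility):
        best, best_score = None, float("-inf")
        # len(actions) ** horizon candidate plans: the cost is exponential.
        for plan in itertools.product(actions, repeat=horizon):
            score = utility(simulate(plan))
            if score > best_score:
                best, best_score = plan, score
        return best

The number of plans to evaluate grows exponentially with the planning horizon, which is the disadvantage a self-improving architecture might escape.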

In the end, Sandberg concludes that we should still take this threat seriously:

This is a trivial, wizard's apprentice, case where powerful AI misbehaves. It is easy to analyse thanks to the well-defined structure of the system (AIXI plus utility function) and allows us to see why a super-intelligent system can be dangerous without having malicious intent. In reality I expect that if programming such a system did produce a harmful result it would not be through this kind of easily foreseen mistake. But I do expect that in that case the reason would likely be obvious in retrospect and not much more complex.

