Re: AI Boxing:

From: Randall Randall (
Date: Tue Jun 03 2008 - 08:00:27 MDT

On Jun 3, 2008, at 8:26 AM, Stathis Papaioannou wrote:

> 2008/6/3 Vladimir Nesov <>:
>> This is not a universal law, in the sense that it doesn't apply to
>> all
>> situations. People can be manipulated to do some things, but not all
>> things, and not all people, and not equally reliably. There just
>> isn't
>> any 2-hour-long essay that will make me shoot myself in the head
> Yes: there must be some possible combination of words that would make
> you do a particular thing in order for a superintelligent being to
> discover what that combination of words is.

The assertion that there is no such combination of words is equivalent
to the assertion that the human brain is perfectly secure. Given that
more complex systems have more vulnerabilities (all else equal) and
that brains were evolved rather than designed, it seems to me to be
wildly implausible that there are no possible exploits for the brain.

It would not surprise me to learn that there were exploits which
required only seconds to perform verbally. It would, however,
surprise me to learn that Eliezer had discovered one; the space of
possibilities is large, and there's no reason to think that a human
could reason their way to such a thing.

Randall Randall <>
"If I can do it in Alabama, then I'm fairly certain you
  can get away with it anywhere." -- Dresden Codak

This archive was generated by hypermail 2.1.5 : Wed Jul 17 2013 - 04:01:03 MDT