Re: Recipe for CEV (was Re: Morality simulator)

From: Nick Tarleton (nickptar@gmail.com)
Date: Sat Nov 24 2007 - 19:16:44 MST

Next message: Matt Mahoney: "Re: What is stability in a FAI? (was Re: UCaRtMaAI paper)"
Previous message: Matt Mahoney: "Re: How to make a slave"
In reply to: Matt Mahoney: "Re: Recipe for CEV (was Re: Morality simulator)"
Next in thread: Matt Mahoney: "Re: Recipe for CEV (was Re: Morality simulator)"
Reply: Matt Mahoney: "Re: Recipe for CEV (was Re: Morality simulator)"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]

On Nov 24, 2007 8:40 PM, Matt Mahoney <matmahoney@yahoo.com> wrote:

> The model P will distinguish between descriptions (in words or pictures)
> of
> friendly and unfriendly behavior by assigning higher probabilities to the
> friendly descriptions. This is different than distinguishing between
> friendly
> and unfriendly behavior. I don't claim that such a thing is possible.

If this worked at all (that is, if a detailed model of a human mind is
actually the best way to compress a human's output AND your search algorithm
can find its way out of all of the only-slightly-worse local minima), why
would the model predict Friendly descriptions rather than human-typical
ones?

Next message: Matt Mahoney: "Re: What is stability in a FAI? (was Re: UCaRtMaAI paper)"
Previous message: Matt Mahoney: "Re: How to make a slave"
In reply to: Matt Mahoney: "Re: Recipe for CEV (was Re: Morality simulator)"
Next in thread: Matt Mahoney: "Re: Recipe for CEV (was Re: Morality simulator)"
Reply: Matt Mahoney: "Re: Recipe for CEV (was Re: Morality simulator)"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]

This archive was generated by hypermail 2.1.5 : Wed Jul 17 2013 - 04:01:01 MDT