I'm slightly on the optimistic side with regards to the overlap between A[GS]I g...

I'm slightly on the optimistic side with regards to the overlap between A[GS]I goals and our own.

While the complete space of things it might want is indeed mostly occupied by things incompatible with human existence, it will also get a substantial bias towards human-like thinking and values in the case of it being trained on human examples.

This is obviously not a 100% guarantee: It isn't necessary for it to be trained on human examples (e.g. AlphaZero doing better without them); and even if it were necessary, the existence of both misanthropes and also sadistic narcissistic sociopaths is an example where the examples of many humans around them isn't sufficient to cause a mind to be friendly.

But we did get ChatGPT to be pretty friendly by asking nicely.