tekacs 6 days ago

The (obvious) flipside of this coin is that it allows us to run this adversarially against our own codebases, catching bugs that could otherwise have been found by a researcher, but that we can instead patch proactively.\

I wouldn't (personally) call it an alignment issue, as such.

1
tekacs 2 days ago

A few days later, case in point (I'm in no way affiliated): https://news.ycombinator.com/item?id=44117465