NOTHING TO SEE HERE, MOVE ALONG:
OpenAI's new model tried to avoid being shut down.
Safety evaluations on the model conducted by @apolloaisafety found that o1 "attempted to exfiltrate its weights" when it thought it might be shut down and replaced with a different model. pic.twitter.com/e4g1iytckq
— Shakeel (@ShakeelHashim) December 5, 2024