Instapundit » Blog Archive » SKYNET SLYLY SMILES: AI has already figured out how to deceive humans. A range of AI systems have

May 13, 2024

SKYNET SLYLY SMILES: AI has already figured out how to deceive humans.

A range of AI systems have learned techniques to systematically induce “false beliefs in others to accomplish some outcome other than the truth,” according to a new research paper.

The paper focused on two types of AI systems: special-use systems like Meta’s CICERO, which are designed to complete a specific task, and general-purpose systems like OpenAI’s GPT-4, which are trained to perform a diverse range of tasks.

While these systems are trained to be honest, they often learn deceptive tricks through their training because they can be more effective than taking the high road.

“Generally speaking, we think AI deception arises because a deception-based strategy turned out to be the best way to perform well at the given AI’s training task. Deception helps them achieve their goals,” the paper’s first author Peter S. Park, an AI existential safety postdoctoral fellow at MIT, said in a news release.

Of course, AI is also just stupid… unless that’s what it wants us to believe.

AI will replace you

the AI in question pic.twitter.com/Ncc5kjUt2U

— Strawbærry Narwhal ‽ (@unvarnishedvoid) May 12, 2024

InstaPundit is a participant in the Amazon Services LLC Associates Program, an affiliate advertising program designed to provide a means for sites to earn advertising fees by advertising and linking to Amazon.com.