Major Models - Search News

Anthropic study: Leading AI models show up to 96% blackmail rate against executives

Researchers at Anthropic have uncovered a disturbing pattern of behavior in artificial intelligence systems: models from every major provider, including OpenAI, Google, Meta, and others, demonstrated ...

Mashable

Major AI models are easily jailbroken and manipulated, new report finds

AI models are still easy targets for manipulation and attacks, especially if you ask them nicely. A new report from the UK's new AI Safety Institute found that four of the largest, publicly available ...

ZDNet

OpenAI's o1 lies more than any major AI model. Why that matters

OpenAI just released the full version of its new o1 model -- and it's dangerously committed to lying. Apollo Research tested six frontier models for "in-context scheming" -- a model's ability to take ...
