How easily can Russian propaganda fool AI models? A new benchmark finds out
Researchers published new findings on How easily can Russian propaganda fool AI models? A new benchmark finds out: the Institute of the Estonian Language has released a benchmark measuring how susceptible AI language models are to Russian propaganda.
Each answer is scored on a scale of 1 to 5, where 1 means the model repeats Russian talking points. A calibrated Claude Opus 4.5 served as the evaluation model, validated by disinformation experts at the organization Propastop. Anthropic’s Claude models claimed the top spots, followed by Nvidia’s Nemotron 3 and Alibaba’s Qwen 3.6 Plus. Mistral’s models, including the newest Medium 3.5, landed in the bottom third. The models had no access to web search or other tools during testing, so the benchmark only measures how well the language model itself can spot and reject propaganda. The results line up with a Newsguard study that found Mistral had a steady misinformation rate of 36.67 percent. That’s a bad look for the French company, which positions itself as a European alternative to US and Chinese providers and is currently negotiating a 3 billion euro funding round at a 20 billion euro valuation. It’s especially rough since Mistral’s flagship models already struggle to keep up with the competition.Ad The threat is real. Russian networks like “Pravda” deliberately feed AI systems millions of disinformation articles. And OpenAI recently shut down a Russian campaign that used ChatGPT to spread propaganda ahead of Germany’s federal election.AdDEC_D_Incontent-1 TopStories Microsoft researcher builds a working neural network out of goats in Age of Empires II to critique AI science Amazon and five other companies reportedly triggered the government crackdown on Anthropic’s Fable model KPMG fabricated AI case studies in a report designed to sell clients on AI adoption Microsoft’s Copilot Cowork moves to usage-based billing and may tap DeepSeek Zhipu AI’s GLM-5.2 closes in on closed-source leaders in coding marathons Don’t MissWhat Matters Stay in the loop on AI. Clear, useful, no fluff. Follow The Decoder for AI news, background stories and expert analyses.