Google Revealed That AI Chatbots Are Less Than 70% Accurate – 80 Level


That's a bit depressing.
AI chatbots' popularity has obvious reasons: why waste time on research when ChatGPT or Gemini can do the hard work for you? However, by now we are all aware of AI hallucinations, which create facts that are simply not true, and the situation is even worse than you might think.
Google created the FACTS Benchmark Suite to test how factually accurate chatbots are, and the results are underwhelming, to say the least.
The suite contains 4 parameters:
Based on Google's research, the most "correct" chatbot overall is Gemini 3 Pro, but even it shows only 68.8% accuracy, which is substantially lower than one expects from a know-it-all system. 
It is followed by Gemini 2.5 Pro (62.1%) and GPT 5 (61.8%). The least accurate model is Grok 4 Fast, with only 36 points in the FACTS Leaderboard. Not surprisingly, Multimodal tasks are the hardest for AI to deal with.
This research shows that you shouldn't blindly trust chatbots to give you correct information: we are still far away from AI having all the answers. The FACTS Leaderboard is a useful tool to measure chatbots' worth, however.
Don't forget to subscribe to our Newsletter and join our 80 Level Talent platform, follow us on TwitterLinkedInTelegram, and Instagram, where we share breakdowns, the latest news, awesome artworks, and more.
Start receiving our weekly newsletter
Facebook
Twitter
YouTube
Instagram
Podcasts
© 2025. 80 level. All rights reserved
We use cookies on this website to make your browsing experience better. By using the site you agree to our use of cookies.Learn more

source

Jesse
https://playwithchatgtp.com