

Why is fact checking important for AI?

There is groundbreaking news in AI fact-checking, and it matters for the future reliability of large generative AI models. Why? That's what this article discusses.


You've likely heard about powerful AI models like ChatGPT, capable of writing papers and solving complex problems. However, ensuring their accuracy has been a challenge, typically requiring manual verification. Enter SAFE (Search-Augmented Factuality Evaluator), an innovative AI-based system developed by Google DeepMind to automatically fact-check the outputs of these models.


SAFE works like a digital detective, breaking down claims made by AI models and using Google Search to find supporting evidence. In testing, it proved remarkably accurate, agreeing with human fact-checkers in 72% of cases and, where they disagreed, being judged correct 76% of the time.


This breakthrough has significant implications for ensuring the reliability of AI-generated content and could lead to greater trust in these technologies. With SAFE, the process of verifying information becomes more efficient and accessible to a wider audience.


What is DeepMind SAFE?


Large language models often make mistakes when answering questions. To measure their accuracy, DeepMind researchers created a question set called LongFact. They then developed a method called SAFE, in which a language model breaks an answer down into individual facts and verifies each one using Google Search.
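
To make that pipeline concrete, here is a minimal sketch of a SAFE-style loop in Python. It is purely illustrative: the helper functions are hypothetical placeholders for the LLM prompts and Google Search API calls described in the paper, not DeepMind's actual code.

```python
# Minimal, illustrative sketch of a SAFE-style fact-checking loop.
# The helpers below are hypothetical placeholders: in the real system,
# splitting and judging are done with LLM prompts, and search_google()
# would call the Google Search API.

import re
from dataclasses import dataclass, field

@dataclass
class Verdict:
    fact: str
    supported: bool
    evidence: list[str] = field(default_factory=list)

def split_into_facts(answer: str) -> list[str]:
    # Placeholder: SAFE prompts an LLM to split a long-form answer into
    # self-contained factual claims; here we simply split on sentences.
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", answer) if s.strip()]

def search_google(query: str) -> list[str]:
    # Placeholder for a Google Search API call returning result snippets.
    return []

def judge_fact(fact: str, evidence: list[str]) -> bool:
    # Placeholder: SAFE asks an LLM whether the snippets support the
    # claim; here we just treat "no evidence found" as unsupported.
    return len(evidence) > 0

def safe_check(answer: str) -> list[Verdict]:
    # Break the answer into facts, gather evidence for each one,
    # and record a supported/unsupported verdict per fact.
    verdicts = []
    for fact in split_into_facts(answer):
        evidence = search_google(fact)  # SAFE first rewrites the fact as a search query
        verdicts.append(Verdict(fact, judge_fact(fact, evidence), evidence))
    return verdicts

if __name__ == "__main__":
    for v in safe_check("The Eiffel Tower is in Paris. It opened in 1889."):
        print(v)
```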


In tests, SAFE held its own against humans, agreeing with them 72% of the time and being judged correct in 76% of the cases where they disagreed.


Additionally, SAFE proved to be more cost-effective, being over 20 times cheaper than human fact-checkers. The team also benchmarked thirteen language models across four model families, finding that larger models generally perform better at providing accurate information on open-ended topics.


What do you think about the role of fact-checking in AI?


Fact checking will become paramount in making sure AI apps and large language models are accurate and don't output fabricated facts and details.


This is important because even humans fact-check: as thoughts rise into consciousness and before we speak, most of us verify (fact-check) first and only then speak (output).


Every company producing AI should work on a fact-checking layer or supporting tool that cleans output before it reaches the end user (see the sketch below). Why? Because scientifically, historically and socially accurate information is important.
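
As a purely hypothetical illustration, such a layer could wrap a model's answer with the safe_check() sketch above, passing through only the claims that were verified and flagging the rest for review:

```python
def cleaned_output(answer: str) -> tuple[str, list[str]]:
    # Hypothetical post-processing layer, building on the safe_check()
    # sketch above: keep only the claims the fact checker marked as
    # supported, and flag the rest for review before anything reaches
    # the end user.
    verdicts = safe_check(answer)
    kept = [v.fact for v in verdicts if v.supported]
    flagged = [v.fact for v in verdicts if not v.supported]
    return " ".join(kept), flagged
```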


But such a layer must maintain unbiased output on political and religious topics and provide a range of answers; perhaps future preferences for wording and level of detail will make it more relevant to the billions of humans holding different religious and political opinions. Otherwise it will push a narrow agenda preferred by that company's programmers. Customisation and fact checking are both essential.


Humans naturally fact-check and tailor their answers to the audience they're talking to, and so should AI models.


How might tools like SAFE shape AI in the future?

SAFE and other such fact-checking AI models will support safe and factual outputs from apps like Gemini, and other companies will likely follow suit so these checks can be used more widely.


This may also bring more job losses as AI models become increasingly accurate, factual and reliable. Governments must start planning for a universal basic income for those who will lose jobs to more powerful generative AI models. There is no way to sugar-coat this.


But on the bright side, you may one day be able to pursue the hobbies and passions you put off for most of your life. I just wonder how capitalism will work under such circumstances, and whether life will be rosy or not. Only time will tell.


Regards


Michael Plis


References


Full Google DeepMind paper, "Long-form factuality in large language models", on arXiv: https://arxiv.org/abs/2403.18802



Image credit for both images: Unsplash / Google DeepMind






About Michael Plis

 

Michael is a technology and cybersecurity professional with over 18 years of experience. He offers unique insights into the benefits and potential risks of technology from a neurodivergent perspective. He believes that technology is a useful servant but a dangerous master. In his blog articles, Michael helps readers better understand and use technology in a beneficial way. He is also a strong supporter of mental health initiatives and advocates for creating business environments that promote good mental health.


Cyberkite blog is your go-to source for smart technology and cybersecurity insights for small business. Stay ahead of the curve with our expert tips and strategies, and join the Cyberkite community by subscribing today!

Disclaimer: Please note that the opinions expressed by Michael or any blog assistants on this blog are their own and may not necessarily reflect the views of Cyberkite. Michael is neurodiverse, so he uses voice typing and AI tools to help him write, edit and complete blog articles. We also use free stock images from Unsplash and Pixabay, and we try to credit the artist of each image. Michael shares his opinions based on his extensive experience in the IT and cybersecurity industry, learning from the world's top subject matter experts and passing on this knowledge to his audience in the hope of benefiting them. If there is a mistake or something needs to be corrected, please message us using the green chat window in the bottom right-hand corner, or contact Michael through social media by searching for Michael Plis blogger.

View our full Site Disclaimer

View our Affiliate Statement
