Artwork

Content provided by Brian Carter. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Brian Carter or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://ro.player.fm/legal.
Player FM - Aplicație Podcast
Treceți offline cu aplicația Player FM !

RIG & RAG: Grounding AI in Reality with Super-Trustworthy Data

7:33
 
Distribuie
 

Manage episode 443721458 series 3605861
Content provided by Brian Carter. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Brian Carter or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://ro.player.fm/legal.

Google Research has developed a new set of open models, known as DataGemma, that aim to ground large language models (LLMs) in real-world data using Google's Data Commons knowledge graph. DataGemma's primary goal is to improve the factuality and trustworthiness of LLMs by mitigating the risk of hallucinations, which occur when LLMs generate incorrect or misleading information. The models leverage Data Commons' natural language interface to access and incorporate real-world data into LLM responses. This is achieved through two methods: Retrieval Interleaved Generation (RIG) and Retrieval Augmented Generation (RAG). RIG fine-tunes the model to identify statistics within its responses and annotate them with a call to Data Commons, while RAG retrieves relevant information from Data Commons before the LLM generates text, providing a factual foundation for its response. DataGemma is an important step towards developing more grounded and reliable AI systems.

Read more here: https://research.google/blog/grounding-ai-in-reality-with-a-little-help-from-data-commons/

  continue reading

38 episoade

Artwork
iconDistribuie
 
Manage episode 443721458 series 3605861
Content provided by Brian Carter. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Brian Carter or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://ro.player.fm/legal.

Google Research has developed a new set of open models, known as DataGemma, that aim to ground large language models (LLMs) in real-world data using Google's Data Commons knowledge graph. DataGemma's primary goal is to improve the factuality and trustworthiness of LLMs by mitigating the risk of hallucinations, which occur when LLMs generate incorrect or misleading information. The models leverage Data Commons' natural language interface to access and incorporate real-world data into LLM responses. This is achieved through two methods: Retrieval Interleaved Generation (RIG) and Retrieval Augmented Generation (RAG). RIG fine-tunes the model to identify statistics within its responses and annotate them with a call to Data Commons, while RAG retrieves relevant information from Data Commons before the LLM generates text, providing a factual foundation for its response. DataGemma is an important step towards developing more grounded and reliable AI systems.

Read more here: https://research.google/blog/grounding-ai-in-reality-with-a-little-help-from-data-commons/

  continue reading

38 episoade

Toate episoadele

×
 
Loading …

Bun venit la Player FM!

Player FM scanează web-ul pentru podcast-uri de înaltă calitate pentru a vă putea bucura acum. Este cea mai bună aplicație pentru podcast și funcționează pe Android, iPhone și pe web. Înscrieți-vă pentru a sincroniza abonamentele pe toate dispozitivele.

 

Ghid rapid de referință