From the course: AWS Certified Generative AI Developer - Professional (AIP-C01) Cert Prep

What is RAG?

Imagine asking a chatbot about a scientific breakthrough that happened last month, only to receive an apologetic response about its knowledge cutoff date. Or picture a company chatbot confidently giving outdated information about your organization's products. Frustrating, right? This is where Retrieval Augmented Generation comes in, the technology that's transforming how AI systems access and use knowledge. In this lecture, we'll explore what Retrieval Augmented Generation is all about.

Retrieval Augmented Generation, or RAG, is an AI framework that integrates an information retrieval component into the generation process of large language models to improve factuality and relevance. The term RAG was coined in a 2020 paper by Lewis et al., with Douwe Kiela as senior author, from Facebook AI Research (now part of Meta), titled "Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks." Unlike standalone large language models (LLMs), which depend solely on their pre-trained parameters, RAG systems fetch external…
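To make the definition above concrete, here is a minimal Python sketch of the retrieve-then-augment flow. It is only an illustration under stated assumptions: the document list, the word-overlap scoring, and the function names (`retrieve`, `build_prompt`) are all hypothetical, and a production system would typically use a vector store or search index plus a real model call instead.

```python
import re
from collections import Counter

# Hypothetical in-memory knowledge base (assumption for illustration only).
DOCUMENTS = [
    "The Model X vacuum was discontinued in March and replaced by the Model Y.",
    "Our support line is open Monday through Friday, 9am to 5pm.",
    "The Model Y vacuum ships with a two-year warranty.",
]

def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())

def score(query, document):
    """Toy relevance score: number of word occurrences shared by query and document."""
    shared = Counter(tokenize(query)) & Counter(tokenize(document))
    return sum(shared.values())

def retrieve(query, documents, k=2):
    """Retrieval step: return the k documents with the highest toy score."""
    ranked = sorted(documents, key=lambda doc: score(query, doc), reverse=True)
    return ranked[:k]

def build_prompt(query, passages):
    """Augmentation step: place the retrieved passages in front of the question."""
    context = "\n".join(f"- {p}" for p in passages)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {query}"
    )

question = "What warranty does the Model Y vacuum have?"
passages = retrieve(question, DOCUMENTS)
prompt = build_prompt(question, passages)
print(prompt)  # This augmented prompt is what would be sent to the language model.
```

The generation step is deliberately left out: the point of the sketch is that the model receives the retrieved passages alongside the question, rather than relying only on what it learned during pre-training.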
