RAG

Introduced by Lewis et al. in Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

Retriever-Augmented Generation, or RAG, is a type of language generation model that combines pre-trained parametric and non-parametric memory for language generation. Specifically, the parametric memory is a pre-trained seq2seq model and the non-parametric memory is a dense vector index of Wikipedia, accessed with a pre-trained neural retriever. For query $x$, Maximum Inner Product Search (MIPS) is used to find the top-K documents $z_{i}$. For final prediction $y$, we treat $z$ as a latent variable and marginalize over seq2seq predictions given different documents.

Source: Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

Read Paper See Code

Papers

Paper	Code	Results	Date	Stars

Tasks

Task	Papers	Share
Retrieval	138	32.09%
Question Answering	56	13.02%
Language Modelling	29	6.74%
Information Retrieval	23	5.35%
Large Language Model	21	4.88%
Open-Domain Question Answering	12	2.79%
Text Generation	12	2.79%
Benchmarking	9	2.09%
Chatbot	6	1.40%

Usage Over Time

This feature is experimental; we are continuously improving our matching algorithm.

Components

Component	Type	Add Remove
BART	Transformers
BERT	Language Models

Categories

Add Remove

Transformers

RAG

Papers

Tasks

Usage Over Time

Components

Categories Edit Add Remove

Categories

Add Remove