Generation

What is generation?

Generation is a core subprocess in our SDK that focuses on producing the final output of an RAG (Retrieval-Augmented Generation) pipeline. This output typically consists of two parts:

Response – the synthesized answer, typically produced by the language model.
References – the relevant sources used to generate the response.

To support this workflow, the GL SDK provides the following components:

Compressor - For packaging many passages into a single prompt context to reduce token count.
Context Enricher - For adding useful context (e.g. metadata) into retrieved chunks before they’re passed to the language model.
Deep Researcher - For performing a deep research operation within an RAG pipeline with ease.
Reference Formatter - For filtering and formatting the references in a clear, standardized format.
Relevance Filter - For filtering context chunks based on their relevance with the user query.
Repacker - For rearranging a list of content chunks into an order that’s more effective for downstream model consumption.
Response Synthesizer - For synthesizing the response based on the provided inputs and contexts.

PreviousChunk Processor NextCompressor

Last updated 1 month ago

Was this helpful?

hashtagWhat is generation?

What is generation?