Generation
What is generation?
Generation is a core subprocess in our SDK that focuses on producing the final output of an RAG (Retrieval-Augmented Generation) pipeline. This output typically consists of two parts:
Response – the synthesized answer, typically produced by the language model.
References – the relevant sources used to generate the response.
To support this workflow, the GL SDK provides the following components:
Compressor - For packaging many passages into a single prompt context to reduce token count.
Context Enricher - For adding useful context (e.g. metadata) into retrieved chunks before they’re passed to the language model.
Deep Researcher - For performing a deep research operation within an RAG pipeline with ease.
Reference Formatter - For filtering and formatting the references in a clear, standardized format.
Relevance Filter - For filtering context chunks based on their relevance with the user query.
Repacker - For rearranging a list of content chunks into an order that’s more effective for downstream model consumption.
Response Synthesizer - For synthesizing the response based on the provided inputs and contexts.
Last updated
Was this helpful?