Query Transformation

This guide will walk you through adding a Query Transformer component to your existing RAG pipeline that automatically rewrites and optimizes user queries for better document retrieval, improving the relevance and accuracy of your search results.

Query transformation enhances your RAG pipeline by intelligently reformulating user queries to improve retrieval performance, helping you find more relevant documents and generate better responses.

This tutorial extends the Your First RAG Pipeline tutorial. Ensure you have followed the instructions to set up your repository.

Prerequisites

This example specifically requires:

Completion of the Your First RAG Pipeline tutorial - this builds directly on top of it
Completion of all setup steps listed on the Prerequisites page
A working OpenAI API key configured in your environment variables

You should be familiar with these concepts and components:

Components in Your First RAG Pipeline- Required foundation
query-transformer

View full project code on GitHub

Installation

# you can use a Conda environment
pip install --extra-index-url "https://oauth2accesstoken:$(gcloud auth print-access-token)@glsdk.gdplabs.id/gen-ai-internal/simple/" gllm-rag gllm-core gllm-generation gllm-inference gllm-pipeline gllm-retrieval gllm-misc gllm-datastore

# you can use a Conda environment
pip install --extra-index-url "https://oauth2accesstoken:$(gcloud auth print-access-token)@glsdk.gdplabs.id/gen-ai-internal/simple/" gllm-rag gllm-core gllm-generation gllm-inference gllm-pipeline gllm-retrieval gllm-misc gllm-datastore

# you can use a Conda environment
FOR /F "tokens=*" %T IN ('gcloud auth print-access-token') DO pip install --extra-index-url "https://oauth2accesstoken:%T@glsdk.gdplabs.id/gen-ai-internal/simple/" gllm-rag gllm-core gllm-generation gllm-inference gllm-pipeline gllm-retrieval gllm-misc gllm-datastore

How to Use this Guide

You can either:

Download or copy the complete guide file(s) to get everything ready instantly by heading to 📂 Complete Guide Files section in the end of this page. You can refer to the guide whenever you need explanation or want to clarify how each part works.
Follow along with each step to recreate the files yourself while learning about the components and how to integrate them.

Both options will work—choose based on whether you prefer speed or learning by doing!

Project Setup

Extend Your RAG Pipeline Project

Start with your completed RAG pipeline project from the Your First RAG Pipeline tutorial. We don't need to add any new file for this tutorial. Therefore, the structure should stay as is:

<project-name>/
├── data/
│   ├── <index>/...                     # preset data index folder
│   ├── chroma.sqlite3                  # preset database file
│   ├── imaginary_animals.csv           # sample data
├── modules/
│   ├── retriever.py
│   └── response_synthesizer.py
├── .env
├── indexer.py                    
└── pipeline.py    # 👈 Will be updated with query transformer

1) Build the Query Transformer Pipeline

Define extended RAG state

Create a custom state that includes the query state:

from gllm_pipeline.pipeline import RAGState

class RAGStateWithQT(RAGState):
    query: str

Create all pipeline steps

Define all steps including the new query transformer step:

from gllm_pipeline.steps import step, transform
from gllm_retrieval.query_transformer.one_to_one_query_transformer import OneToOneQueryTransformer

transform_query_step = step(
    component=OneToOneQueryTransformer(
        lm_request_processor=build_lm_request_processor(
            model_id="openai/gpt-4o-mini",
            system_template="You are a helpful assistant that rewrites queries for better retrieval. Rewrite the following query. Only output the transformed query.",
            user_template="Query: {query}",
        )
    ),
    input_map={"query": "user_query"},
    output_state="queries",
)

flatten_query = transform(
    operation=lambda x: "\n".join(x["queries"]),
    input_states=["queries"],
    output_state="query",
)

Compose the final pipeline

Chain all steps including the query transformer:

e2e_pipeline = (
    query_transformer_step 
    | flatten_query 
    | retriever_step 
    | response_synthesizer_step
)

e2e_pipeline.state_type = RAGStateWithQT

This creates a pipeline that first transforms the user query before retrieving relevant documents, leading to better search results.

🧠 The RAGStateWithQT extends the base RAGState to include the transformed query field.

2) Run the Pipeline

When running the pipeline, you may encounter an error like this:

[2025-08-26T14:36:10+0700.550 chromadb.telemetry.product.posthog ERROR] Failed to send telemetry event CollectionQueryEvent: capture() takes 1 positional argument but 3 were given

Don't worry about this, since we do not use this Chroma feature. Your Pipeline should still work.

Configure and invoke the pipeline

Configure the state and config for direct pipeline invocation:

# Run the pipeline
async def main():
    state = {"user_query": "Give me nocturnal creatures from the dataset"}  # Replace with your actual query
    config = {"top_k": 5}
    result = await e2e_pipeline.invoke(state, config)
    print(f"Pipeline result: {result['response']}")


if __name__ == "__main__":
    asyncio.run(main())

Observe output

If you successfully run all the steps, you will see something like this:

DEBUG    [OneToOneQueryTransformer] [Start 'OneToOneQueryTransformer'] Processing query:     component.py:130
         'Give me nocturnal creatures from the dataset'                                                      
WARNING  [LMRequestProcessor] The `prompt_kwargs` parameter is deprecated and     lm_request_processor.py:160
         will be removed in v0.6. Please pass the prompt kwargs as keyword                                   
         arguments instead.                                                                                  
INFO     [OpenAILMInvoker] Invoking 'OpenAILMInvoker'                                       lm_invoker.py:252
INFO     [LMRequestProcessor] LM invocation result:                               lm_request_processor.py:195
         'List nocturnal animals from the dataset.'                                                          
DEBUG    [OneToOneQueryTransformer] [Finished 'OneToOneQueryTransformer'] Successfully       component.py:130
         produced 1 result(s):                                                                               
         - 'List nocturnal animals from the dataset.'

Extending the Query Transformation System

Multiple Query Transformation Strategies

You can extend the system with different transformation approaches:

def multi_strategy_query_transformer():
    """Creates multiple query variations for better retrieval."""
    lmrp = build_lm_request_processor(
        model_id="openai/gpt-4o-mini",
        credentials=os.getenv("OPENAI_API_KEY"),
        system_template="Generate 3 different variations of the following query for better document retrieval. Output each variation on a new line.",
        user_template="Query: {query}",
    )
    
    return OneToOneQueryTransformer(lm_request_processor=lmrp)

Domain-Specific Query Transformers

Create specialized transformers for different content domains:

def academic_query_transformer():
    """Transforms queries for academic document retrieval."""
    lmrp = build_lm_request_processor(
        model_id="openai/gpt-4o-mini",
        credentials=os.getenv("OPENAI_API_KEY"),
        system_template="Rewrite the following query using academic terminology for better scholarly document retrieval.",
        user_template="Query: {query}",
    )
    
    return OneToOneQueryTransformer(lm_request_processor=lmrp)

def technical_query_transformer():
    """Transforms queries for technical documentation retrieval."""
    lmrp = build_lm_request_processor(
        model_id="openai/gpt-4o-mini",
        credentials=os.getenv("OPENAI_API_KEY"),
        system_template="Rewrite the following query using precise technical terms for better API and documentation retrieval.",
        user_template="Query: {query}",
    )
    
    return OneToOneQueryTransformer(lm_request_processor=lmrp)

Custom Query Transformation Logic

You can implement custom transformation logic:

class CustomRAGState(RAGState):
    original_query: str
    transformed_query: str
    query_intent: str

def intent_aware_query_transformer():
    """Transforms queries based on detected intent."""
    intent_detector = build_lm_request_processor(
        model_id="openai/gpt-4o-mini",
        credentials=os.getenv("OPENAI_API_KEY"),
        system_template="Classify the intent of this query as: factual, comparative, procedural, or exploratory. Output only the classification.",
        user_template="Query: {query}",
    )
    
    query_rewriter = build_lm_request_processor(
        model_id="openai/gpt-4o-mini",
        credentials=os.getenv("OPENAI_API_KEY"),
        system_template="Rewrite this {intent} query for optimal document retrieval.",
        user_template="Query: {query}",
    )
    
    return OneToOneQueryTransformer(lm_request_processor=query_rewriter)

Troubleshooting

Common Issues

Poor query transformations:
- Review and refine your system template for the query transformer
- Ensure the transformation model (GPT-4o-mini) is appropriate for your use case
- Test different system prompts to improve transformation quality
Query transformation taking too long:
- Consider using a faster model for query transformation
- Implement caching for frequently transformed queries
- Set appropriate timeout configurations in your LM request processor
Transformed queries not improving retrieval:
- Analyze the transformed queries to ensure they're more specific
- Test with different transformation strategies
- Consider the quality and indexing of your document corpus
Pipeline state management issues:
- Ensure your custom RAGState class properly extends the base RAGState
- Verify that all state field names match between pipeline steps
- Check that the state_type is properly assigned to your pipeline

Debug Tips

Enable debug mode: Set debug: true in your request to see detailed logs
Log query transformations: Use the log step to see original vs transformed queries
Test transformations in isolation: Test your query transformer component separately
Compare retrieval results: Compare document retrieval with and without query transformation
Monitor transformation quality: Manually review a sample of transformed queries for quality

Congratulations! You've successfully implemented a Query Transformer component in your RAG pipeline. This enhancement improves document retrieval by intelligently rewriting user queries, leading to more relevant search results and better response quality. Your AI system can now understand user intent better and retrieve more appropriate information from your knowledge base.

PreviousSimple Guardrail NextMultimodal Input Handling

Last updated 2 months ago

Was this helpful?

hashtagInstallation

hashtagHow to Use this Guide

hashtagProject Setup

hashtag1) Build the Query Transformer Pipeline

hashtag2) Run the Pipeline

hashtagExtending the Query Transformation System

hashtagMultiple Query Transformation Strategies

hashtagDomain-Specific Query Transformers

hashtagCustom Query Transformation Logic

hashtagTroubleshooting