Inference

What is inference?

Inference is a core process in the SDK, responsible for invoking models and retrieving their outputs. The inference modules provide a unified interface to a wide range of model providers, making it easy to integrate, switch, and extend models in a plug-and-play manner. Inference involves two types of models:

  1. Language Model (LM): A model that understands natural language and generates outputs based on the provided inputs. Often used for tasks such as text generation, summarization, and more. The SDK provides a dedicated set of components for LM inference.

  2. Embedding Model (EM): A model that converts inputs into numerical vector representations. Often used for tasks such as semantic search, clustering, similarity comparison, and more. The SDK provides a dedicated set of components for EM inference.
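The two model types above share the same plug-and-play idea: providers implement a common interface, so callers can swap one model for another without changing their code. The sketch below illustrates that pattern; the class and method names (`LanguageModel`, `EmbeddingModel`, `generate`, `embed`) and the toy providers are illustrative assumptions, not the SDK's actual API.

```python
from abc import ABC, abstractmethod


class LanguageModel(ABC):
    """Unified interface for LM providers (hypothetical)."""

    @abstractmethod
    def generate(self, prompt: str) -> str:
        """Return text generated from the prompt."""


class EmbeddingModel(ABC):
    """Unified interface for EM providers (hypothetical)."""

    @abstractmethod
    def embed(self, text: str) -> list[float]:
        """Return a numerical vector representation of the input."""


class EchoLM(LanguageModel):
    # Stand-in provider: a real implementation would call a model backend.
    def generate(self, prompt: str) -> str:
        return f"Summary of: {prompt}"


class ToyEM(EmbeddingModel):
    # Stand-in provider: deterministic toy vectors, for demonstration only.
    def embed(self, text: str) -> list[float]:
        return [float(ord(c) % 7) for c in text[:4]]


# Callers depend only on the interfaces, so providers are interchangeable.
lm: LanguageModel = EchoLM()
em: EmbeddingModel = ToyEM()
print(lm.generate("a long article"))   # text generation
print(len(em.embed("text")))           # fixed-size vector
```

Swapping in a different provider means subclassing the same interface; no call sites change.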
