Component

What's a Component?

A Component is the basic executable unit in GLLM Core. It wraps a piece of async business logic and standardizes how that logic is:

Discovered via a single async entrypoint (@main or fallbacks).
Executed through a uniform run(**kwargs) method.
Observed via structured input/output events.
Analyzed so pipelines and orchestrators can understand its input contract.

At a high level:

You implement a subclass of Component.
You mark one async method with @main to declare the entrypoint.
Pipelines never call your method directly. They call component.run(**kwargs).
Input schemas can be generated from the @main signature, enabling validation and argument construction.

This gives GLLM Core a uniform abstraction over heterogeneous logic: pipelines don't need to know whether a component is talking to an LLM, a database, an API, or anything else.

Prerequisites

This example specifically requires completion of all setup steps listed on the Prerequisites page.

Installation

# you can use a Conda environment
pip install --extra-index-url https://oauth2accesstoken:$(gcloud auth print-access-token)@glsdk.gdplabs.id/gen-ai-internal/simple/ gllm-core

# you can use a Conda environment
FOR /F "tokens=*" %T IN ('gcloud auth print-access-token') DO pip install --extra-index-url "https://oauth2accesstoken:%T@glsdk.gdplabs.id/gen-ai-internal/simple/"  "gllm-core"

Quickstart

Define Your First Component

from gllm_core.schema import Component, main


class TextFormatter(Component):
    @main
    async def format(self, text: str, uppercase: bool = False, repeat: int = 1) -> str:
        """Format text with options."""
        result = text.upper() if uppercase else text
        return result * repeat

Key points:

Subclass Component.
Define a single async method (here format).
Mark it with @main to tell the system: "this is the entrypoint".

Execute the Component Uniformly

formatter = TextFormatter()

result = await formatter.run(text="hello", uppercase=True, repeat=2)
assert result == "HELLOHELLO"

You never call await formatter.format(...) from orchestration code. Instead, you always call await formatter.run(**kwargs):

The Component base class emits start/finish events.
Logging is performed via the component's _logger.
Pipelines can treat every component the same way.

Use the Generated Input Schema

The design for Components includes an input_params property that exposes a Pydantic model mirroring the @main signature:

formatter = TextFormatter()
ParamsModel = formatter.input_params  # type: ignore[attr-defined]

params = ParamsModel(text="world", repeat=2)
result = await formatter.run(**params.model_dump())

This gives you:

Type-checked construction of arguments.
Easy validation and error reporting.
A single source of truth: the @main signature.

The `@main` Decorator

The @main decorator marks one async method on a Component subclass as the canonical entrypoint. Architecturally, it enables:

Entry-point abstraction: pipelines don't need to know method names.
Schema generation: the @main signature drives input_params.
Future interoperability: the same entrypoint can later be wrapped as an MCP-compliant Tool.

From the docs and specs:

Component.get_main() resolves the entrypoint by honoring @main, __main_method__, or falling back to _run.
Component.input_params generates a Pydantic model from the resolved main method.
Component.run(**kwargs) executes the resolved main coroutine and emits events.

`@main` Method Resolution

The entrypoint resolution is conceptually:

Prefer an explicitly decorated @main method on the subclass.
If none is decorated, look for a class-level __main_method__ override.
As a compatibility fallback, use _run.

This resolution is cached (via a resolver such as MainMethodResolver) so the cost of introspection is paid once per class.

Using `@main` with Abstract Classes

The @main decorator works seamlessly with abstract base classes, allowing you to define a common entrypoint signature that subclasses can implement. This is particularly useful when building component hierarchies with shared interfaces.

Example: Abstract Base Component

from abc import ABC, abstractmethod
from gllm_core.schema import Component, main


class BaseProcessor(Component, ABC):
    """Abstract processor with a defined entrypoint."""
    
    @main
    @abstractmethod
    async def process(self, data: str) -> str:
        """Process data - must be implemented by subclasses."""
        pass


class UpperCaseProcessor(BaseProcessor):
    """Converts text to uppercase."""
    
    async def process(self, data: str) -> str:
        return data.upper()


class LowerCaseProcessor(BaseProcessor):
    """Converts text to lowercase."""
    
    async def process(self, data: str) -> str:
        return data.lower()

Key behaviors:

The @main decorator is inherited: Both UpperCaseProcessor and LowerCaseProcessor inherit the @main marking from BaseProcessor.process.
Subclasses implement the abstract method: Each subclass provides its own implementation of process.
Uniform execution: All subclasses can be executed via run(**kwargs):

upper = UpperCaseProcessor()
lower = LowerCaseProcessor()

result1 = await upper.run(data="hello")  # Returns "HELLO"
result2 = await lower.run(data="WORLD")  # Returns "world"

Shared input schema: Both subclasses generate the same input_params model based on the abstract signature:

# Both have the same parameter structure
assert upper.input_params.model_fields.keys() == lower.input_params.model_fields.keys()

Example: Overriding `@main` in Subclasses

If a subclass needs a different entrypoint, it can define its own @main method:

class AdvancedProcessor(BaseProcessor):
    """Processor with additional parameters."""
    
    async def process(self, data: str) -> str:
        # This implements the abstract method
        return self._transform(data)
    
    @main
    async def transform(self, data: str, mode: str = "upper") -> str:
        """Transform with configurable mode."""
        if mode == "upper":
            return data.upper()
        elif mode == "lower":
            return data.lower()
        else:
            return data
    
    def _transform(self, data: str) -> str:
        return data.upper()


# The subclass uses its own @main method
processor = AdvancedProcessor()
result = await processor.run(data="hello", mode="lower")  # Returns "hello"

# The input_params reflects the new signature
ParamsModel = processor.input_params
assert "mode" in ParamsModel.model_fields

Important notes:

Most derived @main wins: When a subclass defines its own @main method, it takes precedence over inherited @main methods.
Abstract methods must still be implemented: Even if you override @main, you must implement all abstract methods from the parent class.
Schema generation uses the resolved main: The input_params property always reflects the signature of the resolved @main method, not the abstract one.

Component Lifecycle and Runtime Behavior

The Component base class provides a logger and a standard event flow:

run(**kwargs)
1. Formats an input event with the component name and arguments.
2. Logs it via _logger.
3. Optionally emits it through an EventEmitter if one is passed in kwargs.
Calls the resolved main coroutine (or _run in the current implementation).
Formats and logs an output event containing the result.

Binary payloads (e.g., bytes) are handled via binary_handler_factory so logs show sizes or summaries instead of raw bytes.

Designing Good Component APIs

Prefer Clear, Typed Parameters

When defining your @main method:

Use explicit type hints for all parameters.
Provide sensible defaults where appropriate.
Reserve **kwargs for truly open-ended options.

Example:

class DataProcessor(Component):
    @main
    async def process(
        self,
        data: list[dict],
        limit: int = 100,
        **options,
    ) -> dict:
        """Process data with optional filters."""
        processed = data[:limit]
        return {
            "count": len(processed),
            "data": processed,
            "options": options,
        }

With the planned input_params behavior, this will:

Generate a DataProcessorParams model.
Enforce types for data and limit.
Allow extra fields (because of **options) via extra="allow".

When to Use `**kwargs`

Use **kwargs when:

You truly don't know all the options ahead of time.
You want to forward arbitrary parameters to downstream systems.

Avoid it when:

You can name and type your parameters precisely.
You want strict validation and clear API docs.

Backwards Compatibility with Legacy `_run` Components

If you have existing components that only implement _run, they continue to work:

class LegacyComponent(Component):
    async def _run(self, message: str, priority: int = 1) -> str:
        """Legacy component using _run."""
        return f"[P{priority}] {message}"

See Migrating from the old Component for a guide to move from the old _run-style Components to the newer one.

PreviousCore NextTool

Last updated 1 month ago

Was this helpful?

hashtagWhat's a Component?

hashtagInstallation

hashtagQuickstart

hashtagDefine Your First Component

hashtagExecute the Component Uniformly

hashtagUse the Generated Input Schema

hashtagThe @main Decorator

hashtag@main Method Resolution

hashtagUsing @main with Abstract Classes

hashtagExample: Overriding @main in Subclasses

hashtagComponent Lifecycle and Runtime Behavior

hashtagDesigning Good Component APIs

hashtagPrefer Clear, Typed Parameters

hashtagWhen to Use **kwargs

hashtagBackwards Compatibility with Legacy _run Components