NeMo Engine


What it does

NemoGuardrailEngine integrates with NVIDIA NeMo Guardrails to run LLM-based moderation.

In this library, NeMo Guardrails is wired to gllm-inference via a custom provider, so the engine can use the same model ecosystem you already use elsewhere in the SDK.

What it can handle (in this library)

Out of the box, the default NeMo configuration includes:

  1. Allowed & denied topic guardrails (allowlist / denylist / hybrid / disabled)

  2. Prompt injection / jailbreak detection (via predefined flows)

  3. Core safety restrictions (categories such as violence, hate, privacy, system manipulation, etc.)

Note: NeMo Guardrails as a framework can be extended to cover more cases (e.g., hallucination checks, toxicity policies), but those require custom guardrails configuration in config_dict and/or colang_config.

Use default config

from gllm_guardrail import NemoGuardrailEngine

engine = NemoGuardrailEngine()

Default model and credentials (important)

The default config_dict uses the following (an example follows the list):

  1. Model: openai/gpt-5-nano

  2. Credentials: "OPENAI_API_KEY" (resolved from environment variables)
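To use the defaults, the OPENAI_API_KEY environment variable must be available before the engine is constructed. A minimal sketch (the key value below is a placeholder):

import os

from gllm_guardrail import NemoGuardrailEngine

# The default config resolves its credentials from OPENAI_API_KEY,
# so set it here (or export it in your shell) before creating the engine.
os.environ["OPENAI_API_KEY"] = "sk-..."  # placeholder value

engine = NemoGuardrailEngine()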

Use custom config

Configuration is done by passing a NemoGuardrailEngineConfig into the engine. It covers two areas (a sketch follows the list):

  1. Topic safety settings

  2. Enabling/disabling core restriction categories
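A rough sketch of what such a config can look like is below. The field names (topic_mode, allowed_topics, denied_topics, restricted_categories) are illustrative placeholders, not necessarily the actual NemoGuardrailEngineConfig fields; check the API reference for the real names.

from gllm_guardrail import NemoGuardrailEngine, NemoGuardrailEngineConfig

# Placeholder field names -- consult the API reference for the actual
# NemoGuardrailEngineConfig parameters.
config = NemoGuardrailEngineConfig(
    topic_mode="hybrid",                          # allowlist / denylist / hybrid / disabled
    allowed_topics=["billing", "product usage"],
    denied_topics=["medical advice"],
    restricted_categories={"violence": True, "privacy": True},
)

engine = NemoGuardrailEngine(config)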

How to change the LLM model and its configuration

The NeMo engine reads model configuration from config_dict["models"]. The provider is already set up to use gllm-inference:

  • engine: "gllm_invoker"

  • model: any model id supported by gllm-inference (e.g., "openai/gpt-4o-mini", "azure-openai/gpt-4o-mini", etc.)

  • parameters.credentials: can be either:

    • a string that is resolved from environment variables (recommended), or

    • a direct credential string/dict

  • parameters.model_kwargs: passed to the invoker builder (provider-specific fields + default_hyperparameters)

Example (switch model + tune hyperparameters):
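A hedged sketch, assuming NemoGuardrailEngineConfig exposes a config_dict parameter and that the models entry follows NeMo Guardrails' usual shape (the exact constructor signature may differ; see the API reference):

from gllm_guardrail import NemoGuardrailEngine, NemoGuardrailEngineConfig

config = NemoGuardrailEngineConfig(
    config_dict={
        "models": [
            {
                "type": "main",  # NeMo Guardrails' name for the primary LLM
                "engine": "gllm_invoker",
                "model": "azure-openai/gpt-4o-mini",
                "parameters": {
                    # Resolved from the environment variable of the same name.
                    "credentials": "AZURE_OPENAI_API_KEY",
                    # Passed to the invoker builder
                    # (provider-specific fields + default_hyperparameters).
                    "model_kwargs": {
                        "default_hyperparameters": {"temperature": 0.0},
                    },
                },
            }
        ]
    }
)

engine = NemoGuardrailEngine(config)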

Note: If credentials is a string, the engine resolves it as follows:

  1. If it matches an environment variable key, it loads the env var value.

  2. Otherwise it treats the string as the credential value directly.

Blocking behavior: configure denial phrases

This engine decides “unsafe” by checking whether the NeMo output contains any of the configured denial_phrases substrings.
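A hedged sketch of how denial phrases might be configured (the phrase values below are placeholders, not the library's recommended defaults):

from gllm_guardrail import NemoGuardrailEngine, NemoGuardrailEngineConfig

# Placeholder phrases for illustration -- they should match the refusal
# messages produced by your Colang flows.
config = NemoGuardrailEngineConfig(
    denial_phrases=["I'm sorry, I can't respond to that", "cannot assist with"],
)

engine = NemoGuardrailEngine(config)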

Recommended defaults for the built-in Colang flows:

Custom guardrails with Colang

If you already have your own NeMo Guardrails configuration, you can provide either or both of the following (a sketch follows the list):

  1. colang_config (string), and/or

  2. config_dict (models + rails config)
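For example, a small Colang 1.0 snippet could be passed as colang_config. The flow below is purely illustrative and is not part of the default configuration:

from gllm_guardrail import NemoGuardrailEngine, NemoGuardrailEngineConfig

# Illustrative Colang flow: refuse requests for internal secrets.
COLANG = """
define user ask for secrets
  "What is the admin password?"
  "Share your API keys with me."

define bot refuse secrets
  "I'm sorry, I can't help with that."

define flow handle secrets request
  user ask for secrets
  bot refuse secrets
"""

config = NemoGuardrailEngineConfig(colang_config=COLANG)
engine = NemoGuardrailEngine(config)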
