[BETA] Realtime Chat


What’s a Realtime Chat?

The realtime chat is a unified interface designed to help you interact with language models that support realtime interactions. In this tutorial, you'll learn how to perform realtime chat using the GoogleRealtimeChat module in just a few lines of code.

Prerequisites

This example specifically requires:

  1. Completion of all setup steps listed on the Prerequisites page.

  2. Setting a Gemini API key in the GOOGLE_API_KEY environment variable.

Installation

# you can use a Conda environment
pip install --extra-index-url https://oauth2accesstoken:$(gcloud auth print-access-token)@glsdk.gdplabs.id/gen-ai-internal/simple/ gllm-inference

Quickstart

Let’s jump into a basic example using GoogleRealtimeChat.

import asyncio

from dotenv import load_dotenv
from gllm_inference.realtime_chat import GoogleRealtimeChat

# Load the GOOGLE_API_KEY from your .env file.
load_dotenv()

realtime_chat = GoogleRealtimeChat(model_name="gemini-live-2.5-flash-preview")
asyncio.run(realtime_chat.start())

Notice that after the realtime chat starts, the following message appears in the console:

The conversation starts:

The realtime chat modules utilize a set of input and output streamers to define the input sources and output destinations when interacting with the language model. By default, the following IO streamers are used:

  1. KeyboardInputStreamer : Sends text inputs typed via the keyboard to the model.

  2. ConsoleOutputStreamer : Displays text outputs from the model in the console.

This means that, by default, the GoogleRealtimeChat module supports text inputs and text outputs. Try typing through your keyboard to start interacting with the model!

Interaction Example:

When you're done, you can type /quit to end the conversation.
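Conceptually, the default text loop pairs the two streamers above: read a line from the keyboard, forward it to the model, print the reply, and stop when the quit command is typed. The sketch below mimics that flow with a stand-in echo "model"; the real streamers are asynchronous and talk to the Gemini Live API, so treat this purely as an illustration of the loop's control flow.

```python
# Conceptual sketch of the default KeyboardInputStreamer ->
# ConsoleOutputStreamer loop. The echo "model" is a stand-in;
# the real GoogleRealtimeChat streams to the Gemini Live API.
QUIT_COMMAND = "/quit"

def run_text_loop(inputs, model=lambda text: f"Echo: {text}"):
    """Forward each input to the model until the quit command is typed."""
    outputs = []
    for text in inputs:
        if text.strip() == QUIT_COMMAND:
            break
        outputs.append(model(text))
    return outputs

# The loop stops as soon as "/quit" is entered; later inputs are ignored.
print(run_text_loop(["Hello!", "/quit", "ignored"]))  # ['Echo: Hello!']
```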

Ending the conversation:

IO Streamer Customization

Now that we've learned the basics, let's try using other kinds of IO streamers! In the example below, we're going to utilize the LinuxMicInputStreamer and LinuxSpeakerOutputStreamer.

The conversation starts:

Try speaking through your microphone and have fun conversing with the language models in realtime!

After you're done, try combining them with our default IO streamers and see what happens!

Future Plans

In the future, more IO streamers may be added to enable a more robust realtime experience. These may include, but are not limited to:

  1. Input streamers

    1. FileInputStreamer

    2. ScreenCaptureInputStreamer

    3. CameraInputStreamer

    4. WindowsMicInputStreamer

    5. MacMicInputStreamer

  2. Output streamers

    1. FileOutputStreamer

    2. WindowsSpeakerOutputStreamer

    3. MacSpeakerOutputStreamer
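As a rough illustration of where a FileInputStreamer could go, here is a minimal sketch assuming a simple async interface in which an input streamer yields chunks to the chat session. This is a hypothetical design; the actual gllm-inference streamer interface may differ.

```python
import asyncio
import os
import tempfile
from pathlib import Path
from typing import AsyncIterator

# Hypothetical sketch: the real gllm-inference streamer interface may differ.
class FileInputStreamer:
    """Streams a text file line by line, as a future FileInputStreamer might."""

    def __init__(self, path: str):
        self.path = Path(path)

    async def stream(self) -> AsyncIterator[str]:
        for line in self.path.read_text().splitlines():
            yield line
            await asyncio.sleep(0)  # Yield control, as a real streamer would.

async def collect(streamer: FileInputStreamer) -> list[str]:
    """Drain the streamer into a list (for demonstration only)."""
    return [chunk async for chunk in streamer.stream()]

# Example usage with a temporary file.
with tempfile.NamedTemporaryFile("w", suffix=".txt", delete=False) as f:
    f.write("hello\nworld\n")
    tmp_path = f.name
print(asyncio.run(collect(FileInputStreamer(tmp_path))))  # ['hello', 'world']
os.remove(tmp_path)
```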
