📂 Dataset

We provide a BaseDataset class as the foundation, along with several ready-to-use dataset types. These make it simple to load data from different sources in a unified way. A dataset object can also be passed to the evaluate function for end-to-end evaluation.


Available Datasets


📖 DictDataset

Use when: You want to store your dataset directly as a list of dictionaries.

A DictDataset can be created from a JSONL or CSV file.

Example usage:

from gllm_evals.dataset.dict_dataset import DictDataset

# Build the dataset from a CSV file.
csv_path = "path/to/csv/data"
data: DictDataset = DictDataset.from_csv(csv_path)
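
To illustrate what a "list of dictionaries" looks like, here is a minimal sketch that uses only the Python standard library. It does not show how DictDataset is implemented. The helper names (rows_from_csv, rows_from_jsonl) and the column names (query, expected_response) are assumptions chosen for illustration.

```python
import csv
import io
import json
from typing import Any, Dict, List


def rows_from_csv(text: str) -> List[Dict[str, str]]:
    """Parse CSV text into one dictionary per row, keyed by the header row."""
    return [dict(row) for row in csv.DictReader(io.StringIO(text))]


def rows_from_jsonl(text: str) -> List[Dict[str, Any]]:
    """Parse JSONL text: one JSON object per non-empty line."""
    return [json.loads(line) for line in text.splitlines() if line.strip()]


# Hypothetical sample data; the column names are illustrative only.
csv_text = "query,expected_response\nWhat is 2 + 2?,4\n"
jsonl_text = '{"query": "What is 2 + 2?", "expected_response": "4"}\n'

csv_rows = rows_from_csv(csv_text)
jsonl_rows = rows_from_jsonl(jsonl_text)
```

Both sources produce the same shape: a list in which each item is one record. Note that CSV values are always read as strings, whereas JSONL preserves JSON types such as numbers and booleans.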

🤗 HuggingFaceDataset

Use when: You want to load datasets directly from the HuggingFace Hub or from a Python list.

Example usage:


📝 SpreadsheetDataset

Use when: You want to load datasets from Google Sheets.

Example usage:


📊 LangfuseDataset

Use when: You want to manage datasets in Langfuse, or import them from multiple sources (Langfuse itself, dictionaries, Google Sheets, CSV, or JSONL).

Example usage:
