πDataset
We provide a BaseDataset class as the foundation, and several ready-to-use dataset types. These make it simple to load data from different sources in a unified way. This dataset object also can be passed to the evaluate function that will be used for end-to-end evaluation.
Available Datasets
π DictDataset
Use when: You want to store your dataset directly in a list of dictionary format.
It can be created from JSONL or CSV.
Example usage:
from gllm_evals.dataset.dict_dataset import DictDataset
csv_path = "path/to/csv/data"
data: DictDataset = DictDataset.from_csv(csv_path)π€ HuggingFaceDataset
Use when: You want to load datasets directly from the HuggingFace Hub or from a Python list.
Example usage:
π SpreadsheetDataset
Use when: You want to load datasets from Google Sheets.
Example usage:
π LangfuseDataset
Use when: You want to manage datasets in Langfuse or want to import from multiple formats (from Langfuse itself, dictionary, google sheets, CSV, JSONL).
Example usage:
Last updated