disk_dataset.py

HuggingfaceDiskDataset dataclass

Bases: IO[Dataset | DatasetDict]

Loads a dataset from disk, and saves a dataset to disk, using the Hugging Face datasets library.

Example usage:

>>> from ordeq_huggingface import HuggingfaceDiskDataset
>>> ds = HuggingfaceDiskDataset(path="path/to/dataset")  # doctest: +SKIP
>>> data = ds.load()  # doctest: +SKIP
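
The example above covers loading only. As a rough illustration of what a disk-backed IO of this kind could do for both directions, the sketch below wraps `datasets.load_from_disk` and the `save_to_disk` method that `Dataset` and `DatasetDict` both provide. The class name `DiskDatasetSketch` and its `save` signature are assumptions made for illustration; this is not the `ordeq_huggingface` implementation, which may differ.

```python
from dataclasses import dataclass
from typing import Any


@dataclass(frozen=True)
class DiskDatasetSketch:
    """Hypothetical stand-in, not the ordeq_huggingface class."""

    path: str

    def load(self) -> Any:
        # Imported lazily so the class can be defined without `datasets` installed.
        from datasets import load_from_disk

        # Returns a Dataset or a DatasetDict, depending on what was saved at `path`.
        return load_from_disk(self.path)

    def save(self, data: Any) -> None:
        # Dataset and DatasetDict both expose save_to_disk(path).
        data.save_to_disk(self.path)


# Constructing the object does not touch the file system.
sketch = DiskDatasetSketch(path="path/to/dataset")
```

With the `datasets` package installed, a round trip under these assumptions would be `sketch.save(data)` followed by `sketch.load()`, which should return the same kind of object (`Dataset` or `DatasetDict`) that was saved.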