Add type overloads to load_dataset for better static type inference #7888

Aditya2755 · 2025-11-29T06:19:30Z

This PR adds @overload decorators to load_dataset() to help type checkers like Pylance and mypy correctly infer the return type based on the split and streaming parameters.

Changes:

Added typing imports (Literal, overload) to load.py
Added 4 @overload signatures that map argument combinations to specific return types:
- split=None, streaming=False -> DatasetDict
- split specified, streaming=False -> Dataset
- split=None, streaming=True -> IterableDatasetDict
- split specified, streaming=True -> IterableDataset

This resolves the Pylance error where to_csv() was not recognized on Dataset objects returned by load_dataset(..., split='train'), since the type checker previously saw the return type as a Union that included types without to_csv().

No runtime behavior changes - this is purely a static typing improvement.

@overload

Fixes huggingface#7883 This PR adds @overload decorators to load_dataset() to help type checkers like Pylance and mypy correctly infer the return type based on the split and streaming parameters. Changes: - Added typing imports (Literal, overload) to load.py - Added 4 @overload signatures that map argument combinations to specific return types: * split=None, streaming=False -> DatasetDict * split specified, streaming=False -> Dataset * split=None, streaming=True -> IterableDatasetDict * split specified, streaming=True -> IterableDataset This resolves the Pylance error where to_csv() was not recognized on Dataset objects returned by load_dataset(..., split='train'), since the type checker previously saw the return type as a Union that included types without to_csv(). No runtime behavior changes - this is purely a static typing improvement.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add type overloads to load_dataset for better static type inference #7888

Add type overloads to load_dataset for better static type inference #7888

Aditya2755 commented Nov 29, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Add type overloads to load_dataset for better static type inference #7888

Are you sure you want to change the base?

Add type overloads to load_dataset for better static type inference #7888

Conversation

Aditya2755 commented Nov 29, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant