Skip to content

DOC: document missing save_infos and max_depth parameters#8223

Open
kratos0718 wants to merge 1 commit into
huggingface:mainfrom
kratos0718:fix/document-missing-params-load-dataset-flatten
Open

DOC: document missing save_infos and max_depth parameters#8223
kratos0718 wants to merge 1 commit into
huggingface:mainfrom
kratos0718:fix/document-missing-params-load-dataset-flatten

Conversation

@kratos0718
Copy link
Copy Markdown

What this fixes

Two public API functions were missing parameters from their docstrings:

1. load_dataset() — missing save_infos

The save_infos parameter has been in the function signature but was absent from the Args section of the docstring. When save_infos=True, it overrides verification_mode and forces it to VerificationMode.ALL_CHECKS, running full checksums/size/splits verification.

2. Dataset.flatten() — missing max_depth

The max_depth parameter (default 16) controls how many nesting levels are flattened, but was not documented in the Args section.

Files changed

  • src/datasets/load.py — added save_infos to load_dataset docstring
  • src/datasets/arrow_dataset.py — added max_depth to Dataset.flatten docstring

Two undocumented parameters in public API functions:

1. load_dataset() was missing save_infos from its Args section.
   When save_infos=True, verification_mode is forced to ALL_CHECKS
   to run full checksums/size/splits verification.

2. Dataset.flatten() was missing max_depth from its Args section.
   Controls how many nesting levels are flattened; defaults to 16.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant