feat: add HuggingFace Datasets-style load_dataset() API

Implements a familiar load_dataset() interface inspired by HuggingFace
Datasets, adapted for atdata's typed WebDataset approach:
- load_dataset() function with path resolution, split detection, and
  support for brace notation, globs, and explicit data_files mapping
- DatasetDict class for multi-split dataset containers
- Automatic split detection from filenames (train/test/validation)
- streaming parameter to indicate streaming mode explicitly
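
For reviewers, filename-based split detection can be pictured roughly as
below. This is a minimal sketch, not the code in this commit: the names
detect_split and group_by_split, the regex, and the fallback to "train"
when no split name is found are all assumptions made for illustration.

```python
import re
from collections import defaultdict
from pathlib import PurePosixPath

# Hypothetical pattern: a split name that is not embedded in a longer word,
# so "latest-000.tar" is not mistaken for a "test" shard.
_SPLIT_RE = re.compile(r"(?<![a-z])(train|test|validation)(?![a-z])", re.IGNORECASE)


def detect_split(path, default="train"):
    """Guess the split from the file name; fall back to `default` (assumed)."""
    name = PurePosixPath(path).name
    match = _SPLIT_RE.search(name)
    return match.group(1).lower() if match else default


def group_by_split(paths):
    """Group shard paths into a {split_name: [paths]} mapping."""
    groups = defaultdict(list)
    for path in paths:
        groups[detect_split(path)].append(path)
    return dict(groups)
```

A mapping like the one returned by group_by_split is the kind of input a
multi-split container such as DatasetDict could be built from.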

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>