mirror of
https://github.com/huggingface/lerobot.git
synced 2026-06-18 00:37:10 +00:00
66ac901632
The dataset is already rank-disjoint via split_dataset_by_node; accelerate's IterableDatasetShard wrapper kept only every Nth batch of each rank's stream, silently training on 1/N of the data per pass while decoding all of it. The --dummy benchmark path never prepared the loader, so benchmarks were unaffected. Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>