lerobot

mirror of https://github.com/huggingface/lerobot.git synced 2026-06-17 08:17:02 +00:00

Author	SHA1	Message	Date
CarolinePascal	f7010ff66c	fix(typo): fixing typo	2026-06-16 18:02:31 +02:00
CarolinePascal	f7ee453de7	fix(from_video_info): fixing early validation issue in from_video_info	2026-06-16 18:02:31 +02:00
CarolinePascal	ca7168f413	test(cleaning): cleaning up tests	2026-06-16 18:02:28 +02:00
CarolinePascal	ec6264d768	test(aggregate): extending aggregation tests to depth frames	2026-06-16 18:01:55 +02:00
CarolinePascal	d93a58a8b8	feat(tools): adding depth support in LeRobotDataset edition tools	2026-06-16 18:01:50 +02:00
CarolinePascal	92497dfcd8	feat(batched dequantization): optimizing dequantize_depth for torch based batched dequantization	2026-06-16 18:01:18 +02:00
CarolinePascal	263108d6c1	fix(TIFF): add missing quantization and cleanup for TIFF files	2026-06-16 18:01:18 +02:00
CarolinePascal	a925d20ce4	fix(typo): fixing typo	2026-06-16 18:01:18 +02:00
CarolinePascal	1f024ea3bf	fix(normalization): restricting 255 normalization to non depth/uint8 images only	2026-06-16 18:01:18 +02:00
CarolinePascal	d5f67cc7fc	fix(realsense): fixing typo in realsense serial number	2026-06-16 18:01:18 +02:00
CarolinePascal	9ab8c98494	tests(typos): fixing typos in tests	2026-06-16 18:01:18 +02:00
CarolinePascal	a561183442	fix(info): fixing info metadata update when is_depth_map was set	2026-06-16 18:01:18 +02:00
CarolinePascal	305b8d64b2	fix(pre-commit): fixing mutable defautl value	2026-06-16 18:01:18 +02:00
CarolinePascal	0a624a5cf5	feat(refactor): refactor DepthEncoderConfig quantization pipeline, so that the methods do not live in the config class. Add pixel format - channels validation.Move the default pixel format for depth in the config file.	2026-06-16 18:01:18 +02:00
CarolinePascal	d044ead377	feat(pix_fmt channels): use PyAv to check get pixel formats number of channels	2026-06-16 18:01:18 +02:00
CarolinePascal	e425fcb61a	tests(depth): adding new tests for depth integration validation	2026-06-16 18:01:17 +02:00
CarolinePascal	f08a9aea71	test(fix): fixing exisiting tests to still work with latest features	2026-06-16 18:01:17 +02:00
CarolinePascal	7d97b55cc4	chore(typos): fixing typos	2026-06-16 18:01:17 +02:00
CarolinePascal	edbd8c6f82	fix(plumbing): fixing missing parts in the depth maps pipeline	2026-06-16 18:01:17 +02:00
CarolinePascal	615954b80b	fix(stop_event): fixing stop_event race condition in camera classes	2026-06-16 18:01:17 +02:00
CarolinePascal	1c0fdfdb4b	feat(is_depth): simplifying is_depth nested name + legacy support	2026-06-16 18:01:17 +02:00
CarolinePascal	1c3ebd475f	feat(depth shape): ensuring depth maps shape is always including the channel	2026-06-16 18:01:17 +02:00
CarolinePascal	c655814788	chore(format): format code	2026-06-16 18:01:17 +02:00
CarolinePascal	a72ab14f89	feat(depth maps writer): adding support for raw depth maps recording with image writer	2026-06-16 18:01:17 +02:00
CarolinePascal	882074d707	feat(viz): render depth observations as rr.DepthImage in Viridis	2026-06-16 18:01:17 +02:00
CarolinePascal	4ae2f9f375	feat(record): plumb DepthEncoderConfig through lerobot-record	2026-06-16 18:01:17 +02:00
CarolinePascal	26099b6e03	feat(robots/so_follower): emit + populate depth keys when use_depth	2026-06-16 18:01:16 +02:00
CarolinePascal	6b395dfb24	feat(features): route 2D camera shapes to observation.depth.<key>	2026-06-16 18:01:16 +02:00
CarolinePascal	1cbabfe9a4	feat(cameras/realsense): expose async depth in metric meters	2026-06-16 18:01:16 +02:00
CarolinePascal	4744f4b913	feat(depth): wire DatasetReader to decode_depth_frames	2026-06-16 18:01:16 +02:00
CarolinePascal	9568e68b28	feat(depth): wire StreamingVideoEncoder + writer to depth encoder	2026-06-16 18:01:16 +02:00
CarolinePascal	10941c31f6	feat(depth): plumb DepthEncoderConfig through LeRobotDataset and DatasetWriter	2026-06-16 18:01:16 +02:00
CarolinePascal	a6882a048a	feat(depth): extend quantization tools to better fit the encoding/decoding pipeline	2026-06-16 18:01:16 +02:00
CarolinePascal	eb2b7d6dc3	feat(depth): persist depth metadata	2026-06-16 18:01:16 +02:00
CarolinePascal	f7f7b8c7f8	feat(video): add ffv1 to supported codecs	2026-06-16 18:01:16 +02:00
CarolinePascal	d58a324da4	feat(depth): add depth quantization helpers and tests	2026-06-16 18:01:16 +02:00
Caroline Pascal	287c823f13	fix(features copy): adding deepcopy on LeRobot dataset features to avoid shallow copy leaks (#3826 ) * fix(features copy): adding deepcopy on LeRobot dataset features to avoid shallow copy leaks * tests(test): adding new test	2026-06-16 17:58:59 +02:00
Pepijn	58ccc01508	fix(datasets): enforce one parquet row group per episode in v3 data writes (#3807 ) * fix(datasets): enforce one parquet row group per episode in v3 data writes LeRobot v3 data shards must hold exactly one row group per episode so a reader can fetch episode i with pq.ParquetFile(path).read_row_group(i) (a byte-range read) instead of loading the whole shard. The recording writer already does this (one write_table per episode); the aggregate and lerobot-annotate re-write paths instead concatenated many episodes and wrote them in one shot, collapsing the file to a single row group. - io_utils: add write_table_one_row_group_per_episode (one ParquetWriter, one write_table per episode — same pattern as the recording writer); to_parquet_with_hf_images embeds images then writes per-episode row groups; to_parquet_one_row_group_per_episode wraps it for plain frames - aggregate: route non-image data writes through the per-episode writer; leave the episodes-metadata parquet untouched (already one row/episode) - annotate: rewrite shards via the per-episode writer instead of a single bulk pq.write_table - tests: invariant coverage through the aggregate (image + video) and annotate paths No change to on-disk schema, paths, naming, rollover thresholds, or compression. Readers stay backward-compatible (old collapsed files load). * Update src/lerobot/datasets/io_utils.py Co-authored-by: Caroline Pascal <caroline8.pascal@gmail.com> Signed-off-by: Pepijn <138571049+pkooij@users.noreply.github.com> * Update src/lerobot/datasets/io_utils.py Co-authored-by: Caroline Pascal <caroline8.pascal@gmail.com> Signed-off-by: Pepijn <138571049+pkooij@users.noreply.github.com> * fix(datasets): correct indentation and add strict= in row-group helper The web-edited numpy version of write_table_one_row_group_per_episode had an over-indented line (IndentationError, breaking pre-commit + test collection) and a zip() without strict=. Fix both; behaviour unchanged. --------- Signed-off-by: Pepijn <138571049+pkooij@users.noreply.github.com> Co-authored-by: Caroline Pascal <caroline8.pascal@gmail.com>	2026-06-16 12:15:48 +02:00
Caroline Pascal	38327fdc84	fix(images/videos): fixing aggregate_pipeline_dataset_features to avoid unwanted images features deletion (#3783 ) * fix(images/videos): fixing aggregate_pipeline_dataset_features to avoid unwanted images features deletion when videos are not used * fix(docstrings): improving docstrings Signed-off-by: Caroline Pascal <caroline8.pascal@gmail.com> --------- Signed-off-by: Caroline Pascal <caroline8.pascal@gmail.com>	2026-06-15 17:55:52 +02:00
Steven Palma	9555efc02c	chore(dependencies): update uv.lock (#3595 ) Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2026-06-15 16:29:44 +02:00
Steven Palma	d576c59afb	refactor(robots): homogenize bi-manual setups implementations (#3772 ) * chore(robots): homogenize bi setups * feat(robots): split openarm mini into single and bi * refactor(robots): mixin for bi classes * docs: update docs	2026-06-15 16:28:54 +02:00
Altman	8515d456be	fix(datasets): avoid uint8 overflow in image stats (#3697 ) * fix(datasets): avoid uint8 overflow in image stats * fix(datasets): promote stats batches dynamically	2026-06-13 12:09:43 +02:00
Mahbod	30790de178	feat(edit-dataset): add `concatenate_videos` opt-out to merge (#3663 ) * feat(edit-dataset): add `concatenate_videos` opt-out to merge When merging datasets, source mp4s are concatenated into shards capped at `video_files_size_in_mb` (default 200 MB). This is great for dataloader throughput but destroys per-episode (or per-source) video boundaries, which is undesirable when you want to inspect, ship, or reuse the individual mp4s. Add a `concatenate_videos: bool = True` knob plumbed through `MergeConfig` → `merge_datasets` → `aggregate_datasets` → `aggregate_videos`. When False, each source mp4 is copied 1:1 to its own destination mp4 with no re-muxing, so the merge preserves source video boundaries. Usage: lerobot-edit-dataset \ --new_repo_id user/merged \ --operation.type=merge \ --operation.repo_ids "['user/a', 'user/b']" \ --operation.concatenate_videos=false Defaults are unchanged; the dataloader path is unaffected because the `episodes.parquet` `from_timestamp`/`to_timestamp` index keeps working regardless of whether each mp4 holds one or many episodes. * feat(edit-dataset): extend concatenate opt-out to data files Following review, add a concatenate_data flag mirroring concatenate_videos, threaded through MergeConfig, merge_datasets, aggregate_datasets, aggregate_data and append_or_create_parquet_file. Metadata index files still always concatenate. Also trim the verbose docstrings and comments since the names are self-explanatory, and extend the existing merge test to cover data files.	2026-06-12 20:05:04 +02:00
Pepijn	cec8ee0be6	feat: language annotation pipeline (#3471 ) Steerable annotation pipeline (lerobot-annotate) that populates the language_persistent and language_events columns introduced in PR 1 (#3467) directly into data/chunk-/file-.parquet. This is PR 2 of the three-PR plan: PR 1 (Add extensive language support #3467): schema + DSL + rendering, base of this PR PR 2 (this PR): annotation pipeline writing into PR 1's columns PR 3: model with language prediction and runtime A VLM (Qwen-VL family, served on vLLM) watches each episode's video and emits grounded language annotations: subtasks, plans, memory, task rephrasings, interjections + speech, and per-camera VQA. The pipeline is built for production annotation at scale — single-camera grounding, embedded-frame inputs, a describe-then-segment grounding flow, and a deterministic full-episode coverage guarantee — informed by Scale's dense-captioning findings (representation > sampling, rules > reasoning, model capacity is the biggest lever, two-pass systems compound errors)	2026-06-12 15:12:33 +02:00
Nikodem Bartnik	02b315ab6a	Docs/model card improvements (#3634 ) * update policy deployment instruction with rollout * add port and fix formatting * add more base models to generate model card * updated and extended model descriptions * fix bug * improved and extended structure * exclude the templates from config * add images and visualize dataset button * add all policies we have docs for * remove policies without the docs * new fields, improved examples	2026-06-12 13:26:52 +02:00
Pepijn	234c768dfb	feat(datasets): deterministic, resumable shuffling for EpisodeAwareSampler (#3769 ) * fix(datasets): expose a generator on EpisodeAwareSampler for distributed shuffle sync In distributed training, accelerate can only synchronize the shuffle permutation across ranks when the sampler exposes a generator attribute. EpisodeAwareSampler shuffled via the global torch RNG, so disjoint batch shards relied on every rank's global CPU RNG staying in lockstep forever; any rank-asymmetric RNG consumption (e.g. eval rollouts on the main process only) silently desynced the permutations and ranks trained on overlapping/missing samples. Co-Authored-By: Claude Fable 5 <noreply@anthropic.com> * fix(train): seed sampler generator and gate dataset download per node - Pass a generator seeded with cfg.seed to EpisodeAwareSampler so accelerator.prepare registers it as the synchronized RNG and the shuffle order is reproducible. - Gate the initial make_dataset call on is_local_main_process instead of is_main_process: the global main process only exists on node 0, so on every other node all local ranks were downloading the dataset and building the Arrow cache concurrently. Co-Authored-By: Claude Fable 5 <noreply@anthropic.com> * feat(datasets): add DeterministicEpisodeAwareSampler with O(1) memory and sample-exact resume Add a sampler that never materializes frame indices: it stores only per-episode boundaries (numpy, a few bytes per episode) and maps logical positions to frame indices on the fly with searchsorted. Shuffling uses a seeded Feistel permutation over [0, num_frames) (cycle-walking to the exact domain), so the data order is a pure function of (seed, epoch): - no RNG state to synchronize across distributed ranks, - constant memory and zero epoch-boundary cost at any dataset size, - O(1) seek to any position, enabling sample-exact resume. Opt in with --deterministic_sampler=true. On resume, lerobot-train maps the checkpointed step back to (epoch, start_index) via compute_sampler_state and continues at the exact sample where the run left off (up to accelerate's even_batches padding at epoch boundaries). The shuffle is pseudo-random rather than a true uniform permutation, the standard trade-off in large-scale training loaders. Co-Authored-By: Claude Fable 5 <noreply@anthropic.com> * refactor(datasets): fold deterministic mode into EpisodeAwareSampler Instead of a parallel DeterministicEpisodeAwareSampler class, extend the existing EpisodeAwareSampler with a deterministic=True mode (seeded Feistel permutation, epoch auto-advance, state_dict/load_state_dict). The default mode is behavior-identical: same torch.randperm consumption and the same generator contract accelerate synchronizes; the O(N) Python index list is replaced by O(num_episodes) boundary arrays in both modes, with `indices` kept as a back-compat property. Passing a generator together with deterministic=True is rejected, and the state/seek methods raise outside deterministic mode. Co-Authored-By: Claude Fable 5 <noreply@anthropic.com> * feat(train): enable deterministic_sampler by default Deterministic data order (sample-exact resume, no cross-rank RNG sync, O(1) sampler memory) is now the default for map-style training; set deterministic_sampler=false to restore the legacy RNG-based shuffle. Streaming datasets ignore the flag (the sampler path only applies to map-style datasets), replacing the previous hard validation error so streaming configs keep working with the new default. Co-Authored-By: Claude Fable 5 <noreply@anthropic.com> * feat(datasets): default EpisodeAwareSampler to deterministic mode and trim comments deterministic=True is now the class default as well as the training default; the legacy RNG path requires an explicit deterministic=False (the train script's non-deterministic branch passes it). Docstrings and inline comments slimmed down across the changed files. Co-Authored-By: Claude Fable 5 <noreply@anthropic.com> * test(sampler): drain resumed trillion-frame sampler via iter() to avoid list() prealloc list(sampler) calls PyObject_LengthHint -> __len__ (the full 10*12 epoch length) and preallocates that many slots before iterating, OOMing even though the resumed epoch only yields 3 frames. Collect through the iterator (no length hint) so the test exercises the real O(1) seek/drain instead of CPython's list growth heuristic. fix(datasets): guard Feistel cycle-walking loop against non-convergence Replace the unbounded while True in EpisodeAwareSampler._permute with a bounded for loop capped at _MAX_CYCLE_WALK_STEPS (100) and raise RuntimeError if the cycle-walk fails to land in [0, num_frames). The loop is expected to converge in <4 steps on the chosen power-of-two domain, so the bound is a safety net that should never trip in practice but prevents a pathological infinite loop. https://claude.ai/code/session_01HQ15tFrBsHYScjGWosEv22 * fix(datasets): make deterministic-sampler resume robust to world-size changes compute_sampler_state mapped a checkpointed step back to (epoch, start_index) using the current num_processes, but the number of sampler positions a step consumes scales with the world size that produced it. Resuming on a different GPU count therefore landed on the wrong epoch/offset, silently re-seeing or skipping data. Record num_processes in training_step.json at checkpoint time and feed the checkpoint's value into compute_sampler_state on resume, so the data order resumes at the right position regardless of the new world size. Warn when the world size changed (the global offset is correct, but per-rank sample-exactness needs the same topology). Old checkpoints without the field fall back to the current world size. Also document compute_sampler_state's assumptions explicitly: num_processes / batch_size must match the checkpointing run, and accelerate's even_batches=True padding is mirrored by the ceil(... / num_processes) term. Co-Authored-By: Claude Fable 5 <noreply@anthropic.com> Co-authored-by: Cursor <cursoragent@cursor.com> * style: apply ruff-format to lerobot_train.py Collapse the compute_sampler_state(...) call onto one line so the ruff-format pre-commit hook passes (fixes the failing CI check). Co-authored-by: Cursor <cursoragent@cursor.com> * refactor(datasets): use seeded torch.randperm instead of Feistel in EpisodeAwareSampler Drop the Feistel permutation (and its SplitMix64 hash / cycle-walking) in favor of a torch.randperm seeded from (seed, epoch). The deterministic mode keeps its key properties - data order is a pure function of (seed, epoch), so it reproduces on every rank with no global-RNG synchronization, and - state_dict / load_state_dict still resume sample-exactly, now by regenerating the epoch's permutation and slicing from the saved offset. Construction stays O(num_episodes) (only episode boundaries are stored, never a per-frame index list). The trade-off vs Feistel: the per-epoch shuffle is again O(num_frames) memory (the randperm tensor) and no longer O(1)-seekable, in exchange for ~30 fewer LOC and a truly uniform shuffle. Tests updated: the trillion-frame O(1) test is replaced with a boundary-storage check and a scale resume-exactness test. Co-authored-by: Cursor <cursoragent@cursor.com> * refactor(datasets): make EpisodeAwareSampler always deterministic With Feistel gone, deterministic and legacy modes were both just torch.randperm and the deterministic path strictly dominated (reproducible across ranks via the (seed, epoch) seed, no accelerate generator sync, resumable). Collapse to a single path and drop the redundant flag: - remove the `deterministic` and `generator` constructor args, `_iter_default`, and `_require_deterministic`; `set_epoch` / `state_dict` / `load_state_dict` are now unconditional - remove the `deterministic_sampler` train config field and the legacy generator branch in lerobot_train.py (non-streaming map datasets always use the sampler) - drop the now-obsolete generator/legacy tests Note: removes the `generator` kwarg from EpisodeAwareSampler (back-compat break vs main); the order is now a pure function of (seed, epoch), so no cross-rank RNG sync is needed. Co-authored-by: Cursor <cursoragent@cursor.com> * fix(datasets): address sampler review (batch_size resume guard + docs) - Record batch_size in training_step.json alongside num_processes and feed the checkpoint's value into compute_sampler_state on resume; warn when it differs (per-rank sample-exactness needs the same batch size). - Document the set_epoch vs __iter__ auto-advance coupling on EpisodeAwareSampler (callers should rely on exactly one mechanism per run). - Note the broadened (reproducibility-breaking) sampler guard and the no-generator distributed sharding correctness in lerobot_train.py. - Add load_training_batch_size + parallel tests. Co-authored-by: Cursor <cursoragent@cursor.com> * fix(train): download dataset once on the global main process Gate the training dataset download on the global is_main_process (download once to the shared dataset root, barrier, then every other rank reads the already-populated copy) instead of per-node is_local_main_process. LeRobotDataset skips its snapshot_download when try_load() succeeds, so no rank re-downloads. Assumes the dataset root / HF cache is on storage shared across nodes. Co-authored-by: Cursor <cursoragent@cursor.com> * chore(datasets): trim sampler comment and drop duplicate tests Remove the verbose dataloader-guard comment and the two EpisodeAwareSampler tests that duplicated existing validation/warning coverage (no coverage loss). Co-authored-by: Cursor <cursoragent@cursor.com> --------- Co-authored-by: Claude Fable 5 <noreply@anthropic.com> Co-authored-by: Cursor <cursoragent@cursor.com>	2026-06-12 11:47:16 +02:00
Caroline Pascal	0e9bd9e6fb	feat(trim): adding optional trimming option in reencode_video (#3779 ) * feat(trim): adding optional trimming option in reencode_video * tests(trim): add triming test --------- Co-authored-by: Pepijn <138571049+pkooij@users.noreply.github.com>	2026-06-12 11:29:26 +02:00
Steven Palma	87242cfced	chore(dependecies): relax grpc-related bounds (#3777 ) Signed-off-by: Steven Palma <imstevenpmwork@ieee.org>	2026-06-11 19:13:14 +02:00
Steven Palma	1edc83a0ef	feat(training): bump accelerate + use reduction types for tracked metrics in a multi rank setup (#3773 ) * feat(training): bump accelerate + use reduction types for tracked metrics in a multi rank setup * chore: address feedback	2026-06-11 19:07:28 +02:00
Steven Palma	6fbcf67249	chore: update readme (#3774 ) * chore: update readme * chore: update authors in project readme	2026-06-11 18:17:26 +02:00

1 2 3 4 5 ...

1535 Commits