lerobot

mirror of https://github.com/huggingface/lerobot.git synced 2026-06-29 06:07:40 +00:00

Author	SHA1	Message	Date
Caroline Pascal	3dd19d043e	feat(depth maps): adding support for depth in LeRobot (#3644 ) * feat(depth): add depth quantization helpers and tests * feat(video): add ffv1 to supported codecs * feat(depth): persist depth metadata * feat(depth): extend quantization tools to better fit the encoding/decoding pipeline * feat(depth): plumb DepthEncoderConfig through LeRobotDataset and DatasetWriter * feat(depth): wire StreamingVideoEncoder + writer to depth encoder * feat(depth): wire DatasetReader to decode_depth_frames * feat(cameras/realsense): expose async depth in metric meters * feat(features): route 2D camera shapes to observation.depth.<key> * feat(robots/so_follower): emit + populate depth keys when use_depth * feat(record): plumb DepthEncoderConfig through lerobot-record * feat(viz): render depth observations as rr.DepthImage in Viridis * feat(depth maps writer): adding support for raw depth maps recording with image writer * chore(format): format code * feat(depth shape): ensuring depth maps shape is always including the channel * feat(is_depth): simplifying is_depth nested name + legacy support * fix(stop_event): fixing stop_event race condition in camera classes * fix(plumbing): fixing missing parts in the depth maps pipeline * chore(typos): fixing typos * test(fix): fixing exisiting tests to still work with latest features * tests(depth): adding new tests for depth integration validation * feat(pix_fmt channels): use PyAv to check get pixel formats number of channels * feat(refactor): refactor DepthEncoderConfig quantization pipeline, so that the methods do not live in the config class. Add pixel format - channels validation.Move the default pixel format for depth in the config file. * fix(pre-commit): fixing mutable defautl value * fix(info): fixing info metadata update when is_depth_map was set * tests(typos): fixing typos in tests * fix(realsense): fixing typo in realsense serial number * fix(normalization): restricting 255 normalization to non depth/uint8 images only * fix(typo): fixing typo * fix(TIFF): add missing quantization and cleanup for TIFF files * feat(batched dequantization): optimizing dequantize_depth for torch based batched dequantization * feat(tools): adding depth support in LeRobotDataset edition tools * test(aggregate): extending aggregation tests to depth frames * test(cleaning): cleaning up tests * fix(from_video_info): fixing early validation issue in from_video_info * fix(typo): fixing typo * fix(is_depth): adding missing doctrings and is_depth arguments in video decoding functions Co-authored-by: Wensi (Vince) Ai <59036629+wensi-ai@users.noreply.github.com> * fix(depth units): fixing depth units output for the realsense cameras * feat(output unit): adding support for output unit specification at dataset reading/training time Co-authored-by: Wensi (Vince) Ai <59036629+wensi-ai@users.noreply.github.com> * test(depth): cleaning up depth tests * test(depth encoding): updating and cleaning video/depth encoding tests * chore(format): formatting code * docs(depth): improving depth maps docs * test(fix): fixing depth tests * test(dataset tools): adding missing tests for new dataset edition tools features * chore(format): formatting code * fix(pyav check): fixing PyAV option validation for integer codec options by normalizing numeric values before calling `is_integer()` Co-authored-by: Wensi (Vince) Ai <59036629+wensi-ai@users.noreply.github.com> * docs(mermaid): fixing mermaid diagram * fix(rebase): rebase follow up corrections * feat(dataset tools): adding missing docstrings and features for depth fill support in dataset edition tools * docs(docstring): updating docstrings * docs(dataset tools): updating docs * fix(save images): fixing image saving in dataset tools * fix(update video info): fixing update video info logic to match the recording and editing use cases * test(reencode): fixing reencoding monkeypatch * fix(review): add Claude review * chore(format): format code * fix(update video info): ditching the differentiated approahces for video info update - video info are always updated unless for preserved keys. * chore(rebase): fixing rebase merge conflicts * test(visualization): fixing visualization tests * feat(docstrings): adding explicit docstring for encoding parameters. Docstrigns will now show up as description in the CLI --help. * feat(mm as default): adding a global DEFAULT_DEPTH_UNIT variable setting mm as default depth unit * fix(RGB <-> camera): renaming camera_encoder to rgb_encoder for clarity * chore(TODO): removing deprecated TODO * doc(write_u16_plane): improving docstrings for write_u16_plane * feat(units): adding constants for depth frames units (m and mm) * fix(spam): replacing spamming warning but a debug log * feat(leagcy metadata): adding automatic metadata update for legacy 'video.is_depth_map' feature * fix(copy&reindex): fixing metadat reshaping for single channel frames * fix(ImageNet): excluding dpeth frames from ImageNet stats * fix(PyAV container seek): fixing initial PyAV container seek to be robust againsy codec choice * feat(lerobot-dataset-viz): adding support for depth in lerobot-dataset-viz * fix(compress): removing rerun compression for DepthImages * fix(signle channel squeeze): fixing single channel squeezing * chore(format): format code * fix(streaming): adding support for dequantization in streaming_dataset.py * refactor(read depth): factorizing depth reading methods for realsense camera and adding support for depth-only usage * chore(renaming): fixing missed RGBEncoderConfig renamings * docs(renaming): reflecting renamings in a clearer way in the docs * chore(annotation): excluding depth from the annotation pipeline * feat(robots): adding depth support in compatible follower robots * feat(LeSadKiwi): excluding LeKiwi from depth support (for now) * chore(fail): removing misplaced file * chore(fail): removing misplaced file * fix(remove ffv1): removing ffv1 as it does not support MP4 * docs(cheat sheet): adding depth and video encoding to the cheat sheet * fix(lossless): tuning depth encoding parameters for lossless depth storage * test(fix): fixing failing tests * depth(ZMQ): excluding ZMQ from depth support * Revert "depth(ZMQ): excluding ZMQ from depth support" This reverts commit `b95cf4e4c2`. * fix(image transforms): excluding depth frames from images transforms * fix(typo): typo * fix(stats): fixing stats computation for depth frames * fix(TIFF vs. pytorch): adding an extra uint16 to float32 conversion for depth maps stored as raw TIFF images * fix(typos): fixing typos * test(dtype): fixing stats computation typing tests --------- Signed-off-by: Steven Palma <imstevenpmwork@ieee.org> Co-authored-by: Wensi (Vince) Ai <59036629+wensi-ai@users.noreply.github.com> Co-authored-by: Steven Palma <imstevenpmwork@ieee.org> Co-authored-by: Wensi Ai <wsai@stanford.edu>	2026-06-27 14:21:21 +02:00
Mahbod	30790de178	feat(edit-dataset): add `concatenate_videos` opt-out to merge (#3663 ) * feat(edit-dataset): add `concatenate_videos` opt-out to merge When merging datasets, source mp4s are concatenated into shards capped at `video_files_size_in_mb` (default 200 MB). This is great for dataloader throughput but destroys per-episode (or per-source) video boundaries, which is undesirable when you want to inspect, ship, or reuse the individual mp4s. Add a `concatenate_videos: bool = True` knob plumbed through `MergeConfig` → `merge_datasets` → `aggregate_datasets` → `aggregate_videos`. When False, each source mp4 is copied 1:1 to its own destination mp4 with no re-muxing, so the merge preserves source video boundaries. Usage: lerobot-edit-dataset \ --new_repo_id user/merged \ --operation.type=merge \ --operation.repo_ids "['user/a', 'user/b']" \ --operation.concatenate_videos=false Defaults are unchanged; the dataloader path is unaffected because the `episodes.parquet` `from_timestamp`/`to_timestamp` index keeps working regardless of whether each mp4 holds one or many episodes. * feat(edit-dataset): extend concatenate opt-out to data files Following review, add a concatenate_data flag mirroring concatenate_videos, threaded through MergeConfig, merge_datasets, aggregate_datasets, aggregate_data and append_or_create_parquet_file. Metadata index files still always concatenate. Also trim the verbose docstrings and comments since the names are self-explanatory, and extend the existing merge test to cover data files.	2026-06-12 20:05:04 +02:00
Pepijn	cec8ee0be6	feat: language annotation pipeline (#3471 ) Steerable annotation pipeline (lerobot-annotate) that populates the language_persistent and language_events columns introduced in PR 1 (#3467) directly into data/chunk-/file-.parquet. This is PR 2 of the three-PR plan: PR 1 (Add extensive language support #3467): schema + DSL + rendering, base of this PR PR 2 (this PR): annotation pipeline writing into PR 1's columns PR 3: model with language prediction and runtime A VLM (Qwen-VL family, served on vLLM) watches each episode's video and emits grounded language annotations: subtasks, plans, memory, task rephrasings, interjections + speech, and per-camera VQA. The pipeline is built for production annotation at scale — single-camera grounding, embedded-frame inputs, a describe-then-segment grounding flow, and a deterministic full-episode coverage guarantee — informed by Scale's dense-captioning findings (representation > sampling, rules > reasoning, model capacity is the biggest lever, two-pass systems compound errors)	2026-06-12 15:12:33 +02:00
Steven Palma	df0763a2bc	feat(dependencies): minimal default tag install (#3362 )	2026-04-12 20:03:04 +02:00
Caroline Pascal	63dca86df8	fix(dataset edit tools): clarifying `root` argument usage + adding related features (#3049 ) * fix(root): adding proper support for the root and new_root arguments * feat(roots): adding a roots agrument for the merge operation * chore(clean): cleaning up code * chore(doctrings): updating doctrings with new features * fix(repo_id): setting repo_id to None when not needed * fix(roots/repo_ids): making mypy happy by using repo_ids and roots for merge operation * fix(path): fixing path related issues * fix(repo_id): fixing issues related to repo_id * chore(doctrings): updating docstrings + fix typo * chore(clean): cleaning code * fix(split new_repo_id): reverting new_repo_id addition for split operation * docs(dosctrings): completing docstrings * fix(repo_ids/roots): improving checks for repo_ids/roots lengths * fix(repo_ids): making repo_ids optional in MergeConfig but raise if not given * fix(docstrings): fixing docstrings for split operation * fix(hints): updating get_output_path hints to accept paths as strings too * fix(y/N prompts): removing y/N prompts in lerobot_edit_dataset * fix(merge repo_id): fixing merge operation to use new_repo_id instead of repo_id * fix(typo): fixing typo in doctrings	2026-03-03 15:40:46 +01:00
masato-ka	51d3822d75	feat(datasets): Add info operation to lerobot-edit-dataset command (#2917 ) * Add New featrue to lerobot_edit_datset.py that show dataset information. * Fix to draccus error when happen give only --operation.type=info * Updating test and documents regarding lerobot-edit-dataset info function. * Updating documents regarding lerobot-edit-dataset extract function. option name in document is mistake. * feat(datasets): Update to align formatting with pre-commit.(#2917) Update to align formatting by pre-commit. --------- Co-authored-by: Caroline Pascal <caroline8.pascal@gmail.com>	2026-02-17 20:09:42 +01:00
Caroline Pascal	adebbcf090	fix(dataset tools draccus): fixing draccus parsing for dataset edit operation type specification (#2949 ) * fix(edit dataset operation): fixing dataset tools CLI operation type specification * test(edit dataset operation): adding tests for dataset tools operation type specification * chore(format): running pre-commit * chore(backward compatibility): adding a type property in OperationConfig for backward compatibility Signed-off-by: Caroline Pascal <caroline8.pascal@gmail.com>	2026-02-12 18:56:04 +01:00
Simon Alibert	974028bd28	Organize test folders (#856 ) Co-authored-by: Steven Palma <imstevenpmwork@ieee.org>	2025-03-13 14:05:55 +01:00
Steven Palma	5e9473806c	refactor(config): Move device & amp args to PreTrainedConfig (#812 ) Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>	2025-03-06 17:59:28 +01:00
Simon Alibert	659ec4434d	Fix nightly (#775 )	2025-02-26 16:36:03 +01:00
Simon Alibert	c4c2ce04e7	Update pre-commits (#733 )	2025-02-15 15:51:17 +01:00
Simon Alibert	90e099b39f	Remove offline training, refactor `train.py` and logging/checkpointing (#670 ) Co-authored-by: Remi <remi.cadene@huggingface.co>	2025-02-11 10:36:06 +01:00
Remi	638d411cd3	Add Pi0 (#681 ) Co-authored-by: Simon Alibert <simon.alibert@huggingface.co> Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com> Co-authored-by: Pablo <pablo.montalvo.leroux@gmail.com>	2025-02-04 18:01:04 +01:00
Simon Alibert	3c0a209f9f	Simplify configs (#550 ) Co-authored-by: Remi <remi.cadene@huggingface.co> Co-authored-by: HUANG TZU-CHUN <137322177+tc-huang@users.noreply.github.com>	2025-01-31 13:57:37 +01:00
Simon Alibert	32eb0cec8f	Dataset v2.0 (#461 ) Co-authored-by: Remi <remi.cadene@huggingface.co>	2024-11-29 19:04:00 +01:00
Michel Aractingi	eb4c505cff	Support for converting OpenX datasets from RLDS format to LeRobotDataset (#354 ) Signed-off-by: youliangtan <tan_you_liang@hotmail.com> Co-authored-by: Simon Alibert <alibert.sim@gmail.com> Co-authored-by: youliangtan <tan_you_liang@hotmail.com> Co-authored-by: Remi <re.cadene@gmail.com>	2024-08-27 09:07:00 +02:00
Alexander Soare	f8a6574698	Add online training with TD-MPC as proof of concept (#338 )	2024-07-25 11:16:38 +01:00
Simon Alibert	0b21210d72	Convert datasets to av1 encoding (#302 )	2024-07-22 20:08:59 +02:00
Alexander Soare	342f429f1c	Add test to make sure policy dataclass configs match yaml configs (#292 )	2024-06-26 09:09:40 +01:00
Thomas Wolf	48951662f2	Bug fix: missing attention mask in VAE encoder in ACT policy (#279 ) Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>	2024-06-19 12:07:21 +01:00
Marina Barannikov	ff8f6aa6cd	Add data augmentation in LeRobotDataset (#234 ) Co-authored-by: Simon Alibert <alibert.sim@gmail.com> Co-authored-by: Remi Cadene <re.cadene@gmail.com>	2024-06-11 19:20:55 +02:00
Remi	d585c73f9f	Add real-world support for ACT on Aloha/Aloha2 (#228 ) Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>	2024-05-31 15:31:02 +02:00
Remi	01eae09ba6	Fix aloha real-world datasets (#175 )	2024-05-20 13:48:09 +02:00
Simon Alibert	f52f4f2cd2	Add copyrights (#157 )	2024-05-15 12:13:09 +02:00
Alexander Soare	f3bba0270d	Remove EMA model from Diffusion Policy (#134 )	2024-05-05 11:26:12 +01:00
Simon Alibert	c77633c38c	Add regression tests (#119 ) - Add `tests/scripts/save_policy_to_safetensor.py` to generate test artifacts - Add `test_backward_compatibility to test generated outputs from the policies against artifacts	2024-05-04 16:20:30 +02:00
Remi	19812ca470	Add dataset visualization with rerun.io (#131 ) Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>	2024-05-04 16:07:14 +02:00
Remi	b2cda12f87	Add video decoding to LeRobotDataset (#92 )	2024-05-03 00:50:19 +02:00
Remi	e4e739f4f8	Refactor push_dataset_to_hub (#118 )	2024-04-30 14:25:41 +02:00
Adil Zouitine	55dc9f7f51	Refactor the download and publication of the datasets and convert it into CLI script (#95 ) Co-authored-by: Remi <re.cadene@gmail.com>	2024-04-29 00:08:17 +02:00
Remi	659c69a1c0	Refactor datasets into LeRobotDataset (#91 ) Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>	2024-04-25 12:23:12 +02:00
Remi	1030ea0070	Loads episode_data_index and stats during dataset __init__ (#85 ) Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com> Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>	2024-04-23 14:13:25 +02:00
Remi	0928afd37d	Improve dataset examples (#82 ) Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>	2024-04-18 11:43:16 +02:00
Cadene	70aaf1c4cb	test_datasets.py are passing!	2024-04-08 14:16:57 +00:00
Cadene	5af00d0c1e	fix train.py, stats, eval.py (training is running)	2024-04-05 09:31:39 +00:00
Cadene	e799dc5e3f	Improve mock_dataset	2024-03-19 16:38:07 +00:00
Cadene	6a1a29386a	Add replay_buffer directory in pusht datasets + aloha (WIP)	2024-03-19 15:49:45 +00:00
Cadene	f440a681ad	Add mock_dataset.py	2024-03-09 15:36:20 +01:00
Cadene	35bd577deb	Add mock_dataset.py	2024-03-09 15:36:20 +01:00

39 Commits