lerobot

mirror of https://github.com/huggingface/lerobot.git synced 2026-06-29 06:07:40 +00:00

Author	SHA1	Message	Date
Khalil Meftah	6a788fbdb0	Add inline offline validation with train/eval split (#3824 ) * refactor(training): rename eval_freq to env_eval_freq - Rename eval_freq to env_eval_freq to distinguish sim environment evaluation from offline loss evaluation. * feat(training): add inline offline validation with train/eval split - Add eval_split config for balanced per-task holdout - Add eval_steps for periodic inline eval loss computation - Add max_eval_samples to cap eval cost * fix(datasets): remap absolute indices in __getitem__ for filtered datasets * fix(train): vectorize eval subset selection for max_eval_samples * fix(datasets): Move the remapping into EpisodeAwareSampler via absolute_to_relative_idx * fix(validation): add eval_split range check and eval_steps warning Validate eval_split is in [0.0, 1.0) to prevent garbage splits from out-of-range values. Raise when eval_steps > 0 but eval_split is 0.0 since no offline eval will run. * fix(train): prepare eval dataloader with accelerator for multi-GPU Prepare eval_dataloader through accelerator.prepare() so eval data is sharded across ranks instead of duplicated. Reduce eval_loss across ranks with mean reduction for consistent logging. * fix(test): rename eval_freq to env_eval_freq for multi-GPU training	2026-06-25 15:31:24 +02:00
Pepijn	cec8ee0be6	feat: language annotation pipeline (#3471 ) Steerable annotation pipeline (lerobot-annotate) that populates the language_persistent and language_events columns introduced in PR 1 (#3467) directly into data/chunk-/file-.parquet. This is PR 2 of the three-PR plan: PR 1 (Add extensive language support #3467): schema + DSL + rendering, base of this PR PR 2 (this PR): annotation pipeline writing into PR 1's columns PR 3: model with language prediction and runtime A VLM (Qwen-VL family, served on vLLM) watches each episode's video and emits grounded language annotations: subtasks, plans, memory, task rephrasings, interjections + speech, and per-camera VQA. The pipeline is built for production annotation at scale — single-camera grounding, embedded-frame inputs, a describe-then-segment grounding flow, and a deterministic full-episode coverage guarantee — informed by Scale's dense-captioning findings (representation > sampling, rules > reasoning, model capacity is the biggest lever, two-pass systems compound errors)	2026-06-12 15:12:33 +02:00
Jade Choghari	271d92dcaa	feat(sim): add metaworld env (#2088 ) * add metaworld * smol update Signed-off-by: Jade Choghari <chogharijade@gmail.com> * update design * Update src/lerobot/envs/metaworld.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Signed-off-by: Jade Choghari <chogharijade@gmail.com> * update * small changes * iterate on review * small fix * small fix * add docs * update doc * add better gif * smol doc fix * updage gymnasium * add note * depreciate gym-xarm * more changes * update doc * comply with mypy * more fixes * update readme * precommit * update pusht * add pusht instead * changes * style * add changes * update * revert * update v2 * chore(envs): move metaworld config to its own file + remove comments + simplify _format_raw_obs (#2200) * update final changes --------- Signed-off-by: Jade Choghari <chogharijade@gmail.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Co-authored-by: Steven Palma <imstevenpmwork@ieee.org>	2025-10-14 17:21:18 +02:00
Steven Palma	ce3b9f627e	chore(docs): prioritize use of entry points in docs + fix nightly badge (#1692 ) * chore(docs): fix typo in nightly badge * chore(docs): prioritize the use of entrypoints for consistency	2025-08-07 14:25:44 +02:00
Steven Palma	f6ec1d89a5	feat(ci): add release workflow (#1562 )	2025-07-21 19:08:32 +02:00
Simon Alibert	d4ee470b00	Package folder structure (#1417 ) * Move files * Replace imports & paths * Update relative paths * Update doc symlinks * Update instructions paths * Fix imports * Update grpc files * Update more instructions * Downgrade grpc-tools * Update manifest * Update more paths * Update config paths * Update CI paths * Update bandit exclusions * Remove walkthrough section	2025-07-01 16:34:46 +02:00
Pepijn	0b2285d1ec	Feat: Improve hub integration (#1382 ) * feat(policies): Initial setup to push policies to hub with tags and model card * feat: add dataset that is used to train * Add model template summary * fix: Update link model_card template * fix: remove print * fix: change import name * fix: add model summary in template * fix: minor text * fix: comments Lucain * fix: feedback steven * fix: restructure push to hub * fix: remove unneeded changes * fix: import * fix: import 2 * Add MANIFEST.in * fix: feedback pr * Fix tests * tests: Add smolvla end-to-end test * Fix: smolvla test * fix test name * fix policy tests * Add push to hub false policy tests * Do push to hub cleaner * fix(ci): add push_to_hub false in tests --------- Co-authored-by: Steven Palma <steven.palma@huggingface.co>	2025-06-26 14:36:16 +02:00
Steven Palma	5e9473806c	refactor(config): Move device & amp args to PreTrainedConfig (#812 ) Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>	2025-03-06 17:59:28 +01:00
Steven Palma	5d24ce3160	chore(doc): add license header to all files (#818 )	2025-03-05 17:56:51 +01:00
Simon Alibert	fe483b1d0d	Remove `poetry.lock` (#737 ) Co-authored-by: Remi <remi.cadene@huggingface.co>	2025-02-17 12:03:16 +01:00
Simon Alibert	90e099b39f	Remove offline training, refactor `train.py` and logging/checkpointing (#670 ) Co-authored-by: Remi <remi.cadene@huggingface.co>	2025-02-11 10:36:06 +01:00
Remi	638d411cd3	Add Pi0 (#681 ) Co-authored-by: Simon Alibert <simon.alibert@huggingface.co> Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com> Co-authored-by: Pablo <pablo.montalvo.leroux@gmail.com>	2025-02-04 18:01:04 +01:00
Simon Alibert	3c0a209f9f	Simplify configs (#550 ) Co-authored-by: Remi <remi.cadene@huggingface.co> Co-authored-by: HUANG TZU-CHUN <137322177+tc-huang@users.noreply.github.com>	2025-01-31 13:57:37 +01:00
Alexander Soare	f8a6574698	Add online training with TD-MPC as proof of concept (#338 )	2024-07-25 11:16:38 +01:00
Marina Barannikov	c38f535c9f	FIx make_dataset to match transforms config (#264 ) Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>	2024-06-12 19:45:42 +02:00
Simon Alibert	13310681b1	Enable cuda for end-to-end tests (#222 )	2024-05-29 23:02:23 +02:00
Alexander Soare	e3b9f1c19b	Add resume training (#205 ) Co-authored-by: Remi <re.cadene@gmail.com>	2024-05-28 12:04:23 +01:00
Alexander Soare	e67da1d7a6	Add tutorials for using the training script and (#196 ) Co-authored-by: Remi <re.cadene@gmail.com>	2024-05-21 16:47:49 +01:00
Alexander Soare	b6c216b590	Add Automatic Mixed Precision option for training and evaluation. (#199 )	2024-05-20 18:57:54 +01:00
Alexander Soare	2b270d085b	Disable online training (#202 ) Co-authored-by: Remi <re.cadene@gmail.com>	2024-05-20 18:27:54 +01:00
Alexander Soare	e89521dfa0	Enable tests for TD-MPC (#160 )	2024-05-09 13:42:12 +01:00
Alexander Soare	bccee745c3	Refactor eval.py (#127 )	2024-05-03 17:33:16 +01:00
Remi	b2cda12f87	Add video decoding to LeRobotDataset (#92 )	2024-05-03 00:50:19 +02:00
Alexander Soare	d1855a202a	Refactor TD-MPC (#103 ) Co-authored-by: Cadene <re.cadene@gmail.com> Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>	2024-05-01 16:40:04 +01:00
Alexander Soare	a4891095e4	Use PytorchModelHubMixin to save models as safetensors (#125 ) Co-authored-by: Remi <re.cadene@gmail.com>	2024-05-01 16:17:18 +01:00
Alexander Soare	9d60dce6f3	Tidy up yaml configs (#121 )	2024-04-30 16:08:59 +01:00
Quentin Gallouédec	508bd92d03	Remove `update` method from the policy (#99 ) Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>	2024-04-29 12:27:58 +02:00
Simon Alibert	fdf6a0c4e3	More CI cleanup, add style workflow (#107 ) - Changes on the `test.yml` workflow: - Using poetry instead of pip. Contrary to what I wrote in #75, it is possible to use poetry (and have the benefits of shorter install times) without the need for having two separate versions of `pyproject.toml` and `poetry.lock`. - Reduce the trigger scope to only run when files in these directories are modified: - `lerobot/` - `tests/` - `examples/` - `.github/` - Add `style.yml` workflow for doing a `ruff check` pass on the code - More cleanup (removed deprecated workflow)	2024-04-27 09:37:56 +02:00
Simon Alibert	b980c5dd9e	CI nightlies cpu/gpu & cleanup (#75 )	2024-04-25 14:58:39 +02:00

29 Commits