lerobot

mirror of https://github.com/huggingface/lerobot.git synced 2026-07-04 16:47:14 +00:00

Files

Pepijn e275ea3960 LingBot-VA: video-action world model (#3731)

* feat(policies): add LingBot-VA autoregressive video-action world model

Port the LingBot-VA policy (Wan2.2 dual-stream video+action world model) into
LeRobot, following the EO-1 / VLA-JEPA conventions. Covers inference, checkpoint
conversion, and predicted-video saving (training is deferred to a follow-up PR).

- Vendored Wan transformer/attention/flex/VAE/scheduler modules (key names preserved
  for near-identity conversion); torch SDPA default, flashattn/flex lazy-guarded.
- LingBotVAConfig (registered "lingbot_va") + processor with fixed-quantile action
  unnormalization; full dual-stream sampling loop with CFG, two flow-matching
  schedulers and KV cache, mapped onto select_action with observed-keyframe feedback.
- convert_lingbot_va_checkpoints.py (libero/robotwin variants): bundles the ~5B
  transformer, lazy-pulls the frozen VAE+UMT5 from the source repo.
- Predicted-video plumbing in lerobot_eval (predicted_frames_callback; opt-in via
  --policy.save_predicted_video) and ConstantWithWarmupSchedulerConfig.
- pyproject: widen diffusers-dep to <0.37, add lingbot_va + imageio-dep extras,
  add lingbot_va and (missing) eo1 to `all`.
- Factory + policies/__init__ wiring, docs page + toctree, and tests.

Note: the LIBERO success-rate correctness gate must be validated on a CUDA GPU
with the converted checkpoint.

* feat(lingbot_va): RoboTwin eef-pose eval, single-file model, Hub checkpoints

Make the LingBot-VA port runnable on both LIBERO and RoboTwin and clean up the
package to LeRobot conventions.

- Consolidate all vendored Wan2.2 model code (transformer, attention, VAE helpers,
  flow-matching scheduler, grid utils, flex-attention) into a single
  modeling_lingbot_va.py; remove the separate wan_*/schedulers modules.
- Move the fixed action (un)normalization quantiles out of the config and into the
  post-processor (LIBERO 7-DoF + RoboTwin 16-d eef); remove the conversion script in
  favour of ready-to-use LeRobot-format checkpoints on the Hub.
- Fixes found via on-sim validation: undo LIBERO's 180-degree image flip
  (image_hflip), encode obs as a multi-frame streaming-VAE clip, reset the streaming
  VAE cache between episodes, run the transformer in config.dtype, lazy-load frozen
  VAE/UMT5 by subfolder with the text encoder on CPU.
- RoboTwin: add an end-effector-pose action mode to RoboTwinEnv (16-d per-arm
  xyz+quat+gripper deltas composed onto the initial eef pose, executed via CuRobo IK)
  and the robotwin_tshape latent layout (full-res head + half-res wrists via a second
  streaming VAE) with the upstream RoboTwin action quantiles + camera mapping.
- Predicted-video saving works for both benchmarks; docs + tests updated.

* feat(lingbot_va): implement training / fine-tuning (flow-matching loss)

- Implement LingBotVAPolicy.forward(): dual-stream flow-matching training loss
  (latent + action, timestep-weighted, action-masked) ported from upstream train.py;
  VAE-encodes camera clips, UMT5-encodes the task, noises both streams, runs the
  block-causal flex-attention training pass (forward_train).
- training_loss_from_streams() core + _build_training_streams() data prep (action
  scatter into the 30-d space, multi-frame VAE encode incl. robotwin_tshape).
- get_optim_params returns only trainable transformer params (LoRA/PEFT friendly);
  VAE/UMT5 stay frozen. Training needs attn_mode='flex'.
- Add a tiny-config single-training-step test (forward->loss->backward->AdamW) and a
  Training/fine-tuning section in the docs.
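
The loss described above (timestep-weighted, action-masked flow matching) can be illustrated for a single stream with the minimal sketch below. It is not the LingBotVAPolicy.forward() implementation: the helper names, the linear interpolation convention `x_t = (1 - t) * clean + t * noise` with target velocity `noise - clean`, and the scalar `weight` are all assumptions chosen for illustration, and the upstream train.py may use a different convention.

```python
def noised_sample(clean, noise, t):
    """Linearly interpolate between clean data (t=0) and noise (t=1).

    Assumed convention, not taken from the source.
    """
    return [(1.0 - t) * c + t * n for c, n in zip(clean, noise)]


def flow_matching_loss(pred_velocity, clean, noise, weight=1.0, mask=None):
    """Weighted, masked mean squared error between predicted and target velocity.

    `weight` stands in for a per-timestep weight; `mask` zeroes out entries
    that should not contribute (for example unused action channels).
    """
    target = [n - c for n, c in zip(noise, clean)]  # velocity under the assumed convention
    if mask is None:
        mask = [1.0] * len(clean)
    squared_error = [m * (p - v) ** 2 for m, p, v in zip(mask, pred_velocity, target)]
    valid = max(sum(mask), 1.0)  # avoid dividing by zero when everything is masked
    return weight * sum(squared_error) / valid


clean = [0.0, 1.0, 2.0]
noise = [1.0, 1.0, 1.0]
x_t = noised_sample(clean, noise, 0.5)
perfect = flow_matching_loss([1.0, 0.0, -1.0], clean, noise)
masked = flow_matching_loss([0.0, 0.0, 0.0], clean, noise, weight=2.0, mask=[1.0, 0.0, 1.0])
```

In a dual-stream setup, one such term would be computed for the video latents and one for the actions; how the two are combined is not stated in the commit message.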

* fix(lingbot_va): CI quality gate + fast-test collection

- Add tests/policies/lingbot_va/__init__.py so the test files don't clash by basename
  with tests/policies/vla_jepa/* under pytest's default import mode (fast-test collection error).
- Fix vendored typos flagged by the typos hook (pach_scale->patch_scale, total_tolen->
  total_token_len, stablized->stabilized) and a mypy union-attr in RoboTwinEnv._read_eef_pose.
- Apply Prettier formatting to docs/source/lingbot_va.mdx.

* docs(lingbot_va): document EEF action-channel schema + camera order

* Update lingbot_va.mdx

Signed-off-by: Pepijn <138571049+pkooij@users.noreply.github.com>

* Update pyproject.toml

Signed-off-by: Pepijn <138571049+pkooij@users.noreply.github.com>

* Update pyproject.toml

Signed-off-by: Pepijn <138571049+pkooij@users.noreply.github.com>

* refactor(lingbot_va): drop hardcoded action quantiles; source from checkpoint

The LIBERO/RoboTwin action (un)normalization quantiles were hardcoded as module
constants in processor_lingbot_va.py. They are already serialized into each
checkpoint's policy_postprocessor.json (via LingBotVAActionUnnormalizeStep.get_config)
and restored on load by PolicyProcessorPipeline.from_pretrained, so the constants are
dead at eval/load time for the released checkpoints (verified: libero_long/robotwin/base
all carry their quantiles on the Hub).

- Remove LIBERO_ACTION_Q01/Q99, ROBOTWIN_ACTION_Q01/Q99 and _default_action_quantiles.
- make_lingbot_va_pre_post_processors now defaults a fresh (unconverted) build to a
  neutral [-1, 1] mapping (identity rescale); real per-benchmark stats come from the
  saved checkpoint (or postprocessor_overrides), analogous to dataset-stats normalization.
- Update the config doc comment to point at the checkpoint as the source of truth.
- Tests: replace the LIBERO-default assertion with a neutral-default check, and add a
  save_pretrained/from_pretrained round-trip guard for the quantile serialization.

* docs(lingbot_va): trim verbose comments

- configuration_lingbot_va.py: condense multi-line field comments to one-liners
  (keep the ── section headers).
- processor_lingbot_va.py: shorten the action-quantile explanation block.
- modeling_lingbot_va.py: drop the bare "# ----" separator rules, keeping the
  one-line section headers.

No code changes.

* docs(lingbot_va): trim provenance comments; default wan path to base repo

- configuration_lingbot_va.py: drop the "──" decorations and the
  "(from transformer/config.json)" note; default wan_pretrained_path to
  robbyant/lingbot-va-base (has the frozen vae/text_encoder/tokenizer subfolders).
- modeling_lingbot_va.py: remove the vendored-code banner and the
  "(upstream wan_va/...)" section-header provenance/dash decorations; condense the
  transformer-dtype comment to one line.

No code changes.

* refactor(lingbot_va): use built-in UnnormalizerProcessorStep for actions

Replace the bespoke LingBotVAActionUnnormalizeStep with the standard
UnnormalizerProcessorStep in QUANTILES mode, which computes the identical
(action + 1) / 2 * (q99 - q01) + q01 mapping. The per-channel q01/q99 are stored
as the step's saved state (a safetensors file) and restored on load; a fresh build
has no action stats so the step is an identity passthrough.

The 3 Hub checkpoints (lerobot/lingbot_va_{libero_long,robotwin,base}) have been
re-uploaded with the new post-processor (policy_postprocessor.json +
*_unnormalizer_processor.safetensors); reloading from the Hub round-trips q01/q99.

- processor_lingbot_va.py: drop the custom step + registry; build the post-processor
  with UnnormalizerProcessorStep (explicit ACTION->QUANTILES norm_map so the
  preprocessor / training path is unchanged).
- tests: assert the built-in step is used, identity-when-no-stats, correct quantile
  unnormalization, and a save_pretrained/from_pretrained stats round-trip.
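
As a worked illustration of the mapping quoted above, the sketch below applies `(action + 1) / 2 * (q99 - q01) + q01` per channel in plain Python. The function name `unnormalize_quantiles` and the sample quantiles are hypothetical and only meant to show the arithmetic; this is not the UnnormalizerProcessorStep API.

```python
def unnormalize_quantiles(action, q01, q99):
    """Map each normalized channel from [-1, 1] back to [q01, q99]."""
    return [
        (a + 1.0) / 2.0 * (high - low) + low
        for a, low, high in zip(action, q01, q99)
    ]


# Made-up per-channel quantiles, for illustration only.
q01 = [-0.5, 0.0, 2.0]
q99 = [0.5, 4.0, 6.0]

restored = unnormalize_quantiles([-1.0, 0.0, 1.0], q01, q99)

# With q01 = -1 and q99 = 1 the mapping reduces to the identity.
neutral = unnormalize_quantiles([0.25, -0.5], [-1.0, -1.0], [1.0, 1.0])
```

A normalized value of -1 lands on q01, +1 lands on q99, and 0 lands on their midpoint, which is why a neutral [-1, 1] range leaves actions unchanged.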

* docs(lingbot_va): point checkpoint paths at the lerobot org

The LeRobot-format checkpoints moved from pepijn223/* to lerobot/* (libero_long,
robotwin, base). Update the eval/train --policy.path examples accordingly.

* docs(lingbot_va): condense processor normalization comments

* fix(lingbot-va): align RoboTwin evaluation (#3784)

Thank you for the RoboTwin fix and alignment!

* applying fixes

* updating uv lock and linting

* adjusting test to match expected values

* cleaning up deps

* cleaning up top level imports, styling, and deps guards

* cleanup
* moving wan utils and loading utils to `utils.py`
* removing ftfy by replicating the prompt_clean function without it (we don't expect to have weird chars given in the prompt anyway)

* removing unused function

* guarding for scipy dep, renaming test to avoid collision

* adding back accelerate for peak memory usage optim + justifying robotwin description dep

---------

Signed-off-by: Pepijn <138571049+pkooij@users.noreply.github.com>
Co-authored-by: pepijn223 <pepijn223@hf.co>
Co-authored-by: Gangwei XU <gwxu@hust.edu.cn>
Co-authored-by: Maxime Ellerbach <maxime.ellerbach@huggingface.co>

2026-07-03 13:32:38 +02:00

| File | Last commit | Date |
| --- | --- | --- |
| `_toctree.yml` | LingBot-VA: video-action world model (#3731) | 2026-07-03 13:32:38 +02:00 |
| `act.mdx` | fix examples (#3623) | 2026-05-21 22:14:07 +02:00 |
| `action_representations.mdx` | feat(policies): add relative action support for pi0, pi0.5, and pi0_fast (#2970) | 2026-04-01 12:59:12 +02:00 |
| `adding_benchmarks.mdx` | feat(dependencies): minimal default tag install (#3362) | 2026-04-12 20:03:04 +02:00 |
| `annotation_pipeline.mdx` | feat: language annotation pipeline (#3471) | 2026-06-12 15:12:33 +02:00 |
| `async.mdx` | feat(dependencies): minimal default tag install (#3362) | 2026-04-12 20:03:04 +02:00 |
| `backwardcomp.mdx` | feat(dependencies): minimal default tag install (#3362) | 2026-04-12 20:03:04 +02:00 |
| `bring_your_own_policies.mdx` | docs: add policy & compute guide (#3534) | 2026-05-11 15:19:12 +02:00 |
| `cameras.mdx` | feat(depth maps): adding support for depth in LeRobot (#3644) | 2026-06-27 14:21:21 +02:00 |
| `cheat-sheet.mdx` | feat(train): run training remotely on HF Jobs via --job.target (#3856) | 2026-06-29 17:59:33 +02:00 |
| `contributing.md` | Hardware API redesign (#777) | 2025-06-05 17:48:43 +02:00 |
| `damiao.mdx` | feat(motors): add damiao motors & can bus (#2788) | 2026-01-26 17:53:25 +01:00 |
| `debug_processor_pipeline.mdx` | feat(processors): use pipelines across the codebase (#1452) | 2025-09-18 15:25:26 +02:00 |
| `earthrover_mini_plus.mdx` | feat(depth maps): adding support for depth in LeRobot (#3644) | 2026-06-27 14:21:21 +02:00 |
| `env_processor.mdx` | feat(dependencies): minimal default tag install (#3362) | 2026-04-12 20:03:04 +02:00 |
| `envhub_isaaclab_arena.mdx` | feat(dependencies): minimal default tag install (#3362) | 2026-04-12 20:03:04 +02:00 |
| `envhub_leisaac.mdx` | feat(dependencies): minimal default tag install (#3362) | 2026-04-12 20:03:04 +02:00 |
| `envhub.mdx` | feat(dependencies): minimal default tag install (#3362) | 2026-04-12 20:03:04 +02:00 |
| `eo1.mdx` | feat(policies): add EO-1 model (#3403) | 2026-05-06 18:01:16 +02:00 |
| `fastwam.mdx` | feat(policies): Add FastWAM Policy (#3834) | 2026-07-01 14:35:57 +02:00 |
| `feetech.mdx` | Add feetech firmware update docs (#1793) | 2025-08-28 11:18:54 +02:00 |
| `groot.mdx` | feat(depth maps): adding support for depth in LeRobot (#3644) | 2026-06-27 14:21:21 +02:00 |
| `hardware_guide.mdx` | Docs/improve HF jobs documentation (#3909) | 2026-07-03 11:39:16 +02:00 |
| `hil_data_collection.mdx` | refactor(robots): homogenize bi-manual setups implementations (#3772) | 2026-06-15 16:28:54 +02:00 |
| `hilserl_sim.mdx` | chore(rl): move rl related code to its directory at top level (#2002) | 2025-09-23 16:32:34 +02:00 |
| `hilserl.mdx` | Add inline offline validation with train/eval split (#3824) | 2026-06-25 15:31:24 +02:00 |
| `hope_jr.mdx` | feat(depth maps): adding support for depth in LeRobot (#3644) | 2026-06-27 14:21:21 +02:00 |
| `il_robots.mdx` | Docs/improve HF jobs documentation (#3909) | 2026-07-03 11:39:16 +02:00 |
| `implement_your_own_processor.mdx` | feat(processors): use pipelines across the codebase (#1452) | 2025-09-18 15:25:26 +02:00 |
| `index.mdx` | Update pre-commit-config.yaml + pyproject.toml + ceil rerun & transformer dependencies version (#1520) | 2025-07-17 14:30:20 +02:00 |
| `inference.mdx` | refactor(robots): homogenize bi-manual setups implementations (#3772) | 2026-06-15 16:28:54 +02:00 |
| `installation.mdx` | chore(deps): cap torch ceiling at <2.12, pin Linux wheels to cu128 (#3570) | 2026-05-11 19:47:55 +02:00 |
| `integrate_hardware.mdx` | feat(robots): consolidate SO arms implementation (#2763) | 2026-01-08 13:04:30 +01:00 |
| `introduction_processors.mdx` | feat(dependencies): minimal default tag install (#3362) | 2026-04-12 20:03:04 +02:00 |
| `koch.mdx` | fix(docs): update outdated links (#2026) | 2025-09-24 16:17:39 +02:00 |
| `language_and_recipes.mdx` | Add extensive language support (#3467) | 2026-05-19 14:46:11 +02:00 |
| `lekiwi.mdx` | feat(utils): display-independent keyboard controls for recording (Wayland / headless / macOS) (#3875) | 2026-06-25 10:58:39 +02:00 |
| `lelab.mdx` | Docs/add lelab (#3707) | 2026-06-03 14:22:05 +02:00 |
| `lerobot-dataset-v3.mdx` | feat(depth maps): adding support for depth in LeRobot (#3644) | 2026-06-27 14:21:21 +02:00 |
| `libero_plus.mdx` | Add inline offline validation with train/eval split (#3824) | 2026-06-25 15:31:24 +02:00 |
| `libero.mdx` | Add inline offline validation with train/eval split (#3824) | 2026-06-25 15:31:24 +02:00 |
| `lingbot_va.mdx` | LingBot-VA: video-action world model (#3731) | 2026-07-03 13:32:38 +02:00 |
| `metaworld.mdx` | Add inline offline validation with train/eval split (#3824) | 2026-06-25 15:31:24 +02:00 |
| `molmoact2.mdx` | Enable MolmoAct2 rollout on SO-100/101 with calibration correction (#3879) | 2026-06-29 18:52:59 +02:00 |
| `multi_gpu_training.mdx` | Fix ACT policy type examples in docs (#3792) | 2026-06-25 08:59:07 +02:00 |
| `multi_task_dit.mdx` | Add inline offline validation with train/eval split (#3824) | 2026-06-25 15:31:24 +02:00 |
| `notebooks.mdx` | Update pre-commit-config.yaml + pyproject.toml + ceil rerun & transformer dependencies version (#1520) | 2025-07-17 14:30:20 +02:00 |
| `omx.mdx` | fix(robots): update gripper configuration and calibration settings for OMX (#2815) | 2026-01-25 22:29:37 +01:00 |
| `openarm.mdx` | feat(robots): add bi manual openarm follower and leader (#2835) | 2026-01-28 17:25:57 +01:00 |
| `peft_training.mdx` | fix(config): add lora_alpha to PeftConfig (#3573) | 2026-05-13 11:09:19 +02:00 |
| `phone_teleop.mdx` | feat(dependencies): minimal default tag install (#3362) | 2026-04-12 20:03:04 +02:00 |
| `pi0.mdx` | feat(dependencies): minimal default tag install (#3362) | 2026-04-12 20:03:04 +02:00 |
| `pi0fast.mdx` | Fix pi0fast model id in docs (#3855) | 2026-06-24 11:44:03 +02:00 |
| `pi05.mdx` | docs: fix broken dataset script paths (datasets/v30 -> scripts) (#3695) | 2026-06-03 14:48:19 +02:00 |
| `policy_act_README.md` | Update readme (#1570) | 2025-08-01 17:39:39 +02:00 |
| `policy_diffusion_README.md` | Update readme (#1570) | 2025-08-01 17:39:39 +02:00 |
| `policy_fastwam_README.md` | feat(policies): Add FastWAM Policy (#3834) | 2026-07-01 14:35:57 +02:00 |
| `policy_groot_README.md` | feat(policies): add Nvidia Gr00t N1.5 model (#2292) | 2025-10-23 13:50:30 +02:00 |
| `policy_molmoact2_README.md` | Add MolmoAct2 policy (#3604) | 2026-05-27 18:58:37 +02:00 |
| `policy_multi_task_dit_README.md` | Feature/add multitask diffusion transformer policy implementation (#2545) | 2026-03-28 00:41:26 +01:00 |
| `policy_pi0_README.md` | feat(dependencies): minimal default tag install (#3362) | 2026-04-12 20:03:04 +02:00 |
| `policy_pi05_README.md` | chore(docs): no policy readme in src code (#3286) | 2026-04-05 19:25:38 +02:00 |
| `policy_rtc_README.md` | chore(docs): no policy readme in src code (#3286) | 2026-04-05 19:25:38 +02:00 |
| `policy_sarm_README.md` | chore(docs): no policy readme in src code (#3286) | 2026-04-05 19:25:38 +02:00 |
| `policy_smolvla_README.md` | Update readme (#1570) | 2025-08-01 17:39:39 +02:00 |
| `policy_tdmpc_README.md` | Update readme (#1570) | 2025-08-01 17:39:39 +02:00 |
| `policy_vla_jepa_README.md` | feat(policies): add VLA-JEPA (#3568) | 2026-06-04 19:22:51 +02:00 |
| `policy_vqbet_README.md` | Update readme (#1570) | 2025-08-01 17:39:39 +02:00 |
| `policy_walloss_README.md` | fix a bug for kwargs in wallx (#2714) | 2026-01-06 15:13:35 +01:00 |
| `porting_datasets_v3.mdx` | docs: fix broken dataset script paths (datasets/v30 -> scripts) (#3695) | 2026-06-03 14:48:19 +02:00 |
| `processors_robots_teleop.mdx` | chore(docs): update code block syntax to specify python for clarity (#2770) | 2026-01-08 14:45:07 +01:00 |
| `reachy2.mdx` | feat(depth maps): adding support for depth in LeRobot (#3644) | 2026-06-27 14:21:21 +02:00 |
| `rebot_b601.mdx` | feat(robots): natively integrate Seeed Studio reBot B601-DM arm (#3624) | 2026-05-18 19:49:21 +02:00 |
| `rename_map.mdx` | feat(rollout): decouple policy deployment from data recording with new lerobot-rollout CLI (#3413) | 2026-04-28 00:57:35 +02:00 |
| `robocasa.mdx` | Add inline offline validation with train/eval split (#3824) | 2026-06-25 15:31:24 +02:00 |
| `robocerebra.mdx` | feat(envs): add RoboCerebra long-horizon manipulation benchmark (#3314) | 2026-04-20 19:12:15 +02:00 |
| `robometer.mdx` | feat(rewards): add ROBOMETER reward model (#3627) | 2026-05-29 21:45:39 +02:00 |
| `robomme.mdx` | feat(envs): add RoboMME benchmark (#3311) | 2026-04-20 20:21:27 +02:00 |
| `robotwin.mdx` | feat(envs): add RoboTwin 2.0 benchmark (#3315) | 2026-04-20 17:46:39 +02:00 |
| `rtc.mdx` | feat(rollout): decouple policy deployment from data recording with new lerobot-rollout CLI (#3413) | 2026-04-28 00:57:35 +02:00 |
| `sarm.mdx` | Reward models refactor (#3142) | 2026-04-28 17:56:24 +02:00 |
| `smolvla.mdx` | fix examples (#3623) | 2026-05-21 22:14:07 +02:00 |
| `so100.mdx` | feat(robots): consolidate SO arms implementation (#2763) | 2026-01-08 13:04:30 +01:00 |
| `so101.mdx` | Update follower arm description in documentation (#3780) | 2026-06-25 13:58:08 +02:00 |
| `streaming_video_encoding.mdx` | feat(depth maps): adding support for depth in LeRobot (#3644) | 2026-06-27 14:21:21 +02:00 |
| `tools.mdx` | Add extensive language support (#3467) | 2026-05-19 14:46:11 +02:00 |
| `topreward.mdx` | feat(rewards): add TOPReward reward model (#3629) | 2026-05-27 14:24:31 +02:00 |
| `torch_accelerators.mdx` | Add a documentation page with a brief intro to hw backends (#2385) | 2025-12-05 13:32:58 +01:00 |
| `unitree_g1.mdx` | feat(rollout): decouple policy deployment from data recording with new lerobot-rollout CLI (#3413) | 2026-04-28 00:57:35 +02:00 |
| `using_dataset_tools.mdx` | feat(visualization): add foxglove support (#3902) | 2026-07-01 18:39:32 +02:00 |
| `video_encoding_parameters.mdx` | feat(depth maps): adding support for depth in LeRobot (#3644) | 2026-06-27 14:21:21 +02:00 |
| `vla_jepa.mdx` | feat(policies): add VLA-JEPA (#3568) | 2026-06-04 19:22:51 +02:00 |
| `vlabench.mdx` | Add inline offline validation with train/eval split (#3824) | 2026-06-25 15:31:24 +02:00 |
| `walloss.mdx` | chore: remove usernames + use entrypoints in docs, comments & sample commands (#2988) | 2026-02-18 22:46:12 +01:00 |
| `xvla.mdx` | fix xvla docs (#3291) | 2026-04-23 14:50:32 +02:00 |