lerobot

mirror of https://github.com/huggingface/lerobot.git synced 2026-07-07 01:51:47 +00:00

Files

T

Pepijn 4908433f9a chore(training): align smolvla2_hirobot.slurm with what's actually run

Match the operator's current training command for the _tool6 retrain:

  * default DATASET / POLICY_REPO_ID / JOB_NAME point at the tool6
    iteration (super_poulain_full_tool3 → smolvla2_hirobot_super_poulain_tool6)
  * STEPS default 2000 (short enough to iterate; bump to 10k for full)
  * save_freq=$STEPS so the only checkpoint is the final one
  * OUTPUT_DIR includes step count so successive runs don't clobber
  * Drop the wider augmentation envelope I added earlier — back to
    default ColorJitter ranges (brightness ±20% etc) since the
    high_level_subtask recipe fix (current-subtask supervision) is
    expected to fix the LM-head collapse on its own; the augmentation
    is just the standard regulariser, not a load-bearing widener.
  * prompt-dropout fractions stay at the original 0.15 / 0.15 / 0.20.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-05-12 18:45:38 +02:00

smolvla2_hirobot.slurm

chore(training): align smolvla2_hirobot.slurm with what's actually run

2026-05-12 18:45:38 +02:00

train_policy.py

feat(dependencies): minimal default tag install (#3362 )

2026-04-12 20:03:04 +02:00

train_with_streaming.py

feat(dependencies): minimal default tag install (#3362 )

2026-04-12 20:03:04 +02:00