Files
lerobot/tests/rl
Khalil Meftah 0944b84279 feat(rl): consolidate HIL-SERL checkpoint into HF-style components
Make  and  s, add abstract
 /  for algorithm-owned tensors (critics,
target nets, ), and persist them as a sibling
component next to . Replace the pickled
 side-file with an enriched
carrying both  and , so resume restores actor +
critics + target nets + temperature + optimizers + RNG + counters from
plain HF-standard files.
2026-05-08 21:24:23 +02:00
..