lerobot/tests at a164bb97bd53ec9d0b9a2f9d73a8a21b3c5901c5 - lerobot - Gitea: Git with a cup of tea

admin/lerobot

mirror of https://github.com/huggingface/lerobot.git synced 2026-06-18 00:37:10 +00:00

Files

T

History

Pepijn a164bb97bd feat(streaming): native datasets-5 episode batching and worker-split suppression

Allow datasets 5.x (pin >=4.7,<6; lockfile moves to 5.0.0) and use its
Arrow-native batch(by_column="episode_index") (huggingface/datasets#8194
sibling, #8172) for episode admission when available - one Arrow
accumulation per episode instead of one Python dict per row - with the
existing row loop as the 4.x fallback. A parity test asserts both paths
group identically.

Also fixes a latent worker bug this surfaced: `datasets` detects torch
DataLoader workers and re-splits its shards internally (_iter_pytorch),
on top of our explicit per-worker shard assignment. That second split
silently drops data whenever a per-worker stream has fewer internal
shards than there are workers (masked so far by single-file test
fixtures), and on datasets 5.0 it crashes by_column batching outright.
The worker context is now hidden from `datasets` while draining streams
we already partitioned (process-local patch, restored on exit).

The multi-shard shuffle buffer (huggingface/datasets#8194) is
intentionally NOT used: frame-level shuffling upstream of episode
grouping would fragment episodes and break delta windows. Its threaded
multi-source prefetch idea remains a follow-up for episode admission if
fetch timings warrant it.

Verified on both datasets 4.8.5 (fallback) and 5.0.0 (native): 27/27
streaming tests each; full datasets suite 469 passed under 5.0.0.

Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>

2026-06-11 16:10:53 +02:00

..

feat(encoding parameters): adding support for user provided video encoding parameters (#3455 )

2026-05-14 23:46:42 +02:00

async_inference

feat(dependencies): minimal default tag install (#3362 )

2026-04-12 20:03:04 +02:00

test(cameras): skip flaky async_read test (#3106 )

2026-03-08 14:02:33 +01:00

Add extensive language support (#3467 )

2026-05-19 14:46:11 +02:00

feat(streaming): native datasets-5 episode batching and worker-split suppression

2026-06-11 16:10:53 +02:00

feat(envs): add RoboTwin 2.0 benchmark (#3315 )

2026-04-20 17:46:39 +02:00

feat(encoding parameters): adding support for user provided video encoding parameters (#3455 )

2026-05-14 23:46:42 +02:00

feat(dependencies): minimal default tag install (#3362 )

2026-04-12 20:03:04 +02:00

fix(feetech): motor position readings overflow (#3373 )

2026-04-13 22:39:58 +02:00

feat(dependencies): minimal default tag install (#3362 )

2026-04-12 20:03:04 +02:00

feat(policies): add VLA-JEPA (#3568 )

2026-06-04 19:22:51 +02:00

feat(processor): Add in-memory processor pipeline serialization (#3732 )

2026-06-08 11:27:24 +02:00

feat(rewards): add ROBOMETER reward model (#3627 )

2026-05-29 21:45:39 +02:00

RL stack refactoring (#3075 )

2026-05-12 15:49:54 +02:00

feat(robots): natively integrate Seeed Studio reBot B601-DM arm (#3624 )

2026-05-18 19:49:21 +02:00

feat(dependencies): minimal default tag install (#3362 )

2026-04-12 20:03:04 +02:00

feat(robots): natively integrate Seeed Studio reBot B601-DM arm (#3624 )

2026-05-18 19:49:21 +02:00

feat(dependencies): minimal default tag install (#3362 )

2026-04-12 20:03:04 +02:00

feat(dependencies): minimal default tag install (#3362 )

2026-04-12 20:03:04 +02:00

Add extensive language support (#3467 )

2026-05-19 14:46:11 +02:00

__init__.py

chore(doc): add license header to all files (#818 )

2025-03-05 17:56:51 +01:00

conftest.py

feat(dependencies): minimal default tag install (#3362 )

2026-04-12 20:03:04 +02:00

test_available.py

feat(dependencies): minimal default tag install (#3362 )

2026-04-12 20:03:04 +02:00

test_cli_peft.py

feat(rollout): decouple policy deployment from data recording with new lerobot-rollout CLI (#3413 )

2026-04-28 00:57:35 +02:00

test_control_robot.py

feat(rollout): decouple policy deployment from data recording with new lerobot-rollout CLI (#3413 )

2026-04-28 00:57:35 +02:00

test_robomme_env.py

feat(envs): add RoboMME benchmark (#3311 )

2026-04-20 20:21:27 +02:00

test_rollout.py

feat(rollout): adding episodic strategy (#3717 )

2026-06-06 00:32:38 +02:00

test_yaml_policy_path.py

Fix policy.path in YAML configs (PR #3145 followup) (#3597 )

2026-05-26 14:01:19 +02:00

utils.py

feat(dependencies): minimal default tag install (#3362 )

2026-04-12 20:03:04 +02:00