lerobot/tests at bfb8cfb4322a2d7842d4152918eedc8e59d41841 - lerobot - Gitea: Git with a cup of tea

admin/lerobot

mirror of https://github.com/huggingface/lerobot.git synced 2026-07-08 02:22:02 +00:00

Files

T

History

Pepijn bfb8cfb432 fix(smolvla2): flatten say tool_calls into <say> marker before tokenizing

The chat tokenizer passed assistant `tool_calls` straight to
`apply_chat_template`, which renders them as a structured JSON
`<tool_call>` block — so the LM head was trained to emit JSON. But the
inference parser `_split_plan_and_say` looks for a `<say>...</say>`
marker, which the model never saw in training, so the `say` tool never
fired at inference.

`_flatten_say_tool_calls` is the missing training-time serializer (the
one `_split_plan_and_say`'s docstring already assumed existed): it
rewrites a `say` tool call into a `<say>...</say>` marker inside the
content text before the chat template runs, so the template only
tokenizes plain text and the supervised target span trains the model to
emit exactly the marker the runtime parses back (Pi 0.5-style flat
tool-call serialization).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-05-18 10:47:31 +02:00

..

refactor(annotate): drop dataset-level `tools` parquet column

2026-04-30 18:48:36 +02:00

feat(dataset): 2x faster dataloader via parallel decode, uint8 transport, and persistent workers (#3406 )

2026-04-19 00:08:22 +02:00

async_inference

feat(dependencies): minimal default tag install (#3362 )

2026-04-12 20:03:04 +02:00

test(cameras): skip flaky async_read test (#3106 )

2026-03-08 14:02:33 +01:00

feat(language): per-camera tagging on view-dependent styles

2026-04-30 10:48:17 +02:00

fix(datasets): render flow-only low_level recipes instead of dropping them

2026-05-17 13:20:39 +02:00

feat(envs): add RoboTwin 2.0 benchmark (#3315 )

2026-04-20 17:46:39 +02:00

feat(dependencies): minimal default tag install (#3362 )

2026-04-12 20:03:04 +02:00

feat(dependencies): minimal default tag install (#3362 )

2026-04-12 20:03:04 +02:00

fix(feetech): motor position readings overflow (#3373 )

2026-04-13 22:39:58 +02:00

feat(dependencies): minimal default tag install (#3362 )

2026-04-12 20:03:04 +02:00

fix(smolvla2): flatten say tool_calls into <say> marker before tokenizing

2026-05-18 10:47:31 +02:00

feat(language): per-camera tagging on view-dependent styles

2026-04-30 10:48:17 +02:00

feat(dependencies): minimal default tag install (#3362 )

2026-04-12 20:03:04 +02:00

feat(robots): Unitree G1 WBC implementation (#2876 )

2026-03-08 11:33:24 +01:00

feat(dependencies): minimal default tag install (#3362 )

2026-04-12 20:03:04 +02:00

refactor(imports): enforce guard pattern (#3382 )

2026-04-14 22:54:05 +02:00

feat(dependencies): minimal default tag install (#3362 )

2026-04-12 20:03:04 +02:00

feat(dependencies): minimal default tag install (#3362 )

2026-04-12 20:03:04 +02:00

Add extensive language support

2026-04-27 10:56:32 +02:00

__init__.py

chore(doc): add license header to all files (#818 )

2025-03-05 17:56:51 +01:00

conftest.py

feat(dependencies): minimal default tag install (#3362 )

2026-04-12 20:03:04 +02:00

test_available.py

feat(dependencies): minimal default tag install (#3362 )

2026-04-12 20:03:04 +02:00

test_cli_peft.py

feat(dependencies): minimal default tag install (#3362 )

2026-04-12 20:03:04 +02:00

test_control_robot.py

feat(dependencies): minimal default tag install (#3362 )

2026-04-12 20:03:04 +02:00

test_robomme_env.py

feat(envs): add RoboMME benchmark (#3311 )

2026-04-20 20:21:27 +02:00

utils.py

feat(dependencies): minimal default tag install (#3362 )

2026-04-12 20:03:04 +02:00