lerobot

mirror of https://github.com/huggingface/lerobot.git synced 2026-07-08 02:22:02 +00:00

Files

Pepijn Kooijmans 9dfc9084e1 review: decode keyframes via video_utils.decode_video_frames

Addresses three of CarolinePascal's frames.py comments (the fourth, the
subprocess re-encode, waits on #3611):

- replace the bespoke _decode_pyav_direct PyAV decoder with
  lerobot.datasets.video_utils.decode_video_frames (torchcodec backend,
  PyAV fallback) — torchvision's VideoReader removal no longer applies
- frames flow through the provider as torch.Tensor (C, H, W uint8); PIL
  is materialised only at the VLM-message boundary in to_image_blocks /
  to_video_block, where the chat backends need it
- _decode now returns exactly one frame per timestamp (or [] on failure),
  so frames_at pairs them with strict=True

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-05-18 14:00:38 +02:00

__init__.py

feat: language annotation pipeline (PR 2/3)

2026-04-30 18:48:33 +02:00

_helpers.py

feat: language annotation pipeline (PR 2/3)

2026-04-30 18:48:33 +02:00

conftest.py

review: address CarolinePascal feedback

2026-05-18 12:03:25 +02:00

run_e2e_smoke.py

review: address CarolinePascal feedback