lerobot

mirror of https://github.com/huggingface/lerobot.git synced 2026-07-15 14:02:14 +00:00

Files

T

Khalil Meftah e86f5af5bf feat(rewards): add TOPReward reward model (#3629 )

* feat(rewards): add TOPReward reward model

* refactor(rewards): clean up TOPReward processor/model

* fix(rewards/topreward): add missing input keys mm_token_type_ids

* fix(rewards/topreward): fix pyproject extra typo and simplify processor (#3653)

Add lerobot[topreward] extra to all in
pyproject.toml, drop the redundant labels arg in scoring, and
collapse the dead-branch shape check in the encoder processor.

* optmize topreward input processing (#3660)

---------

Co-authored-by: Cole <91766445+jcoleharrison@users.noreply.github.com>
Co-authored-by: Haoming Song <haomingsong24@gmail.com>

2026-05-27 14:24:31 +02:00

__init__.py

Reward models refactor (#3142 )

2026-04-28 17:56:24 +02:00

test_classifier_processor.py

Reward models refactor (#3142 )

2026-04-28 17:56:24 +02:00

test_modeling_classifier.py

RL stack refactoring (#3075 )

2026-05-12 15:49:54 +02:00

test_modeling_topreward.py

feat(rewards): add TOPReward reward model (#3629 )