lerobot

mirror of https://github.com/huggingface/lerobot.git synced 2026-07-19 16:02:11 +00:00

Author	SHA1	Message	Date
Reece O'Mahoney	e14bdf57d0	Convert tensors to scalars (#2903 ) Co-authored-by: Steven Palma <imstevenpmwork@ieee.org>	2026-02-09 14:46:12 +01:00
Reece O'Mahoney	97e7e0f9ed	feat(datasets): improve image transform support (#2885 ) * improve image transform support * add tests * Add stricter transform check and extra test * improve subclass check	2026-02-05 15:39:58 +01:00
jwang078	0f39248445	Small docstring fix in diffusion configuration (#2847 )	2026-02-03 19:19:00 +01:00
Iori Yanokura	a6370dd783	fix(wandb): truncate init tags to 64-character limit (#995 )	2026-02-03 14:17:04 +01:00
Michel Aractingi	14a15f90e7	Add missing RL config options: add_ee_pose_to_observation and gripper_penalty_in_reward (#2873 ) * fix(RL) add missing config arguments * respond to copilot review * fix(revert penalty in reward): reverting gripper penalty addition in reward. This is already done in compute_loss_discrete_critic. --------- Co-authored-by: CarolinePascal <caroline8.pascal@gmail.com>	2026-02-02 22:14:03 +01:00
Hirokazu Ishida	9c24a09665	docs: update document in response to Simplify configs PR (#1596 ) * docs: update document input/output_shapes -> input/output_features * fix inconsistent quote (suggested by copilot reviewer) * docs: shapes => PolicyFeature * docs: relfect normalization_mapping and remove outdated	2026-02-02 20:05:58 +01:00
Jade Choghari	b18cef2e26	feat(dataset): add subtask support (#2860 ) * add subtask * remove folder * add docs * update doc * add testing * update test * update constant naming + doc * more docs	2026-01-30 19:29:37 +01:00
Caroline Pascal	5c6182176f	fix(find zmq): adding a clearer not implemented warning for the ZMQ find_cameras method (#2879 ) Co-authored-by: Martino Russi <77496684+nepyope@users.noreply.github.com>	2026-01-30 16:58:13 +01:00
Michel Aractingi	ec04b7ce3a	Feat(dataset_tools.py) Add modify tasks tool (#2875 ) * feat(datasets): add modify_tasks function for in-place task editing Add a new utility function to modify tasks in LeRobotDataset in-place. This allows users to: - Set a single task for all episodes - Set specific tasks for individual episodes - Combine a default task with per-episode overrides * feat(edit-dataset): add CLI support for modify_tasks operation Integrate the modify_tasks function into lerobot_edit_dataset CLI. Users can now modify dataset tasks via command line: Supports setting a default task, per-episode tasks, or both combined. * test(datasets): add tests for modify_tasks function Add comprehensive test coverage for the modify_tasks utility: - Single task for all episodes - Episode-specific task assignment - Default task with per-episode overrides - Error handling for missing/invalid arguments - Verification of task_index correctness - In-place modification behavior - Metadata preservation * respond to copilot review	2026-01-30 13:19:42 +01:00
Michel Aractingi	04cbf669cf	fix(sac): make temperature a property to fix checkpoint resume bug (#2877 ) * fix(sac): make temperature a property to fix checkpoint resume bug Temperature was stored as a plain float and not restored after loading a checkpoint, causing incorrect loss computations until update_temperature() was called. Changed to a property that always computes from log_alpha, ensuring correct behavior after checkpoint loading. * simplify docstrings	2026-01-30 12:23:22 +01:00
Steven Palma	3409ef0dc2	refactor(cameras): cameras API extension (#2808 ) * feat(cameras): add new read_latest() method * fix(cameras): fix threading bug + clear state * refactor(cameras): multiple improvements * feat(camera): add context manager to camera base class * chore(camera): slight modifications to opencv * test(cameras): update opencv tests according to the changes * refactor(cameras): reflect desing changes to realsense + deal with depth * test(cameras): fix realsense tests accordingly to new changes * refactor(cameras): update reachymini and zmq accordingly * chore: wrap resource sensitive examples into a try/finally * test(cameras): add test for new read_latest * test(cameras): fix problem with image artifact in opencv tests * test(cameras): fix test_read_latest_high_frequency expectations * Apply suggestions from code review 1 Co-authored-by: Caroline Pascal <caroline8.pascal@gmail.com> Signed-off-by: Steven Palma <imstevenpmwork@ieee.org> * chore(cameras): address feedback * feat(cameras): add max_age_ms check in read_latest * test(cameras): fix read_latest tests * chore(redundancies): removing redundancies in Reachy 2 camera class * fix(warmup): replacing the arbitrary time.sleep in by an actual warmup in the RealSense camera class * chore(format): formatting latest changes * chore(warning): adding a "to be implemented" warning for read_latest() in Camera base class * chore(warning): making read_latest() warning message shorter and clearer --------- Signed-off-by: Steven Palma <imstevenpmwork@ieee.org> Co-authored-by: Caroline Pascal <caroline8.pascal@gmail.com>	2026-01-29 11:07:47 +01:00
Steven Palma	4483184875	feat(robots): add bi manual openarm follower and leader (#2835 ) * fix(motors): cleanup imports + fix signatures * feat(motors): add damiao canbus + multiple fixes * fix(motors): address comments -> last_state + different gains + sleep * refactor(motors): reduce duplicated code + adressed some comments in the PR * chore(motors): better timeouts * tests(motors): damiao test and imports * chore(deps): fix space * feat(robot): add openarm leader Co-authored-by: Pepijn <pepijn@huggingface.co> * feat(robot): add openarm follower Co-authored-by: Pepijn <pepijn@huggingface.co> * refactor(robot): remove mechanical compensations and double arm assumption + rename * chore(robots): remove left arm references * refactor(teleop): multiple improvements to leader * refactor(teleop): multiple improvements to leader * feat(robots): add open arm to util CLI * chore(robot): add alias openarm * Apply suggestions from code review Co-authored-by: Pepijn <138571049+pkooij@users.noreply.github.com> Signed-off-by: Steven Palma <imstevenpmwork@ieee.org> * chore(motors): remove normalization tables damiao * fix(motors): imports and signatures * feat(motors): add motor_type_str + recv_id to motor class and _get_motor_recv_id raises if no motor_obj.recv_id * chore(motors): remove normalize from base motor class and damaio * tests(motors): remove bad tests (to be replaced) * chore(motors): updated import check * fix(robots): open arm mirrored config for joint limits * chore(motors): update position_kd gain values * chore(robots): set to 0 if openarm is calibrated at connect time * chore(robots): remove macos in open arm as can doesn't support it * chore(robots): update for motor_type_str in Motor class * chore(robots): no default value for can port in open arms * feat(robots): add bi manual openarm follower and leader * use constant for kp and kd range and check responses in mit_control_batch() * Add docs on setting up canbus and use damiao otor bus, also add lerobot_setup_can.py and log if there is not response from a write command * precommit format * supress bandit as these are intentional cli commands * fix setup-can * add test * skip test in ci * nit precommit * update doc example * dont import can for tests * remove comment * Add openarms docs * format * update purchase link * can to none if nit availabl;e * add canfd option in bus * make handshake logic similar to lerobot-can * type hint * type check * add temp teleop test * remove script * mock class * mock class * ignore linter * pre-commit * Add command for bimanual openarm * fix import * fix import leader * fix import draccus --------- Signed-off-by: Steven Palma <imstevenpmwork@ieee.org> Co-authored-by: Pepijn <pepijn@huggingface.co> Co-authored-by: Pepijn <138571049+pkooij@users.noreply.github.com>	2026-01-28 17:25:57 +01:00
Martino Russi	149628dfd5	add g1 teleoperation (#2791 ) * add gravity compensation * add g1 teleoperation --------- Co-authored-by: Michel Aractingi <michel.aractingi@huggingface.co>	2026-01-28 15:17:38 +01:00
Steven Palma	bf337e716d	feat(robots): add OpenArm robot & teleoperator (#2795 ) * fix(motors): cleanup imports + fix signatures * feat(motors): add damiao canbus + multiple fixes * fix(motors): address comments -> last_state + different gains + sleep * refactor(motors): reduce duplicated code + adressed some comments in the PR * chore(motors): better timeouts * tests(motors): damiao test and imports * chore(deps): fix space * feat(robot): add openarm leader Co-authored-by: Pepijn <pepijn@huggingface.co> * feat(robot): add openarm follower Co-authored-by: Pepijn <pepijn@huggingface.co> * refactor(robot): remove mechanical compensations and double arm assumption + rename * chore(robots): remove left arm references * refactor(teleop): multiple improvements to leader * refactor(teleop): multiple improvements to leader * feat(robots): add open arm to util CLI * chore(robot): add alias openarm * Apply suggestions from code review Co-authored-by: Pepijn <138571049+pkooij@users.noreply.github.com> Signed-off-by: Steven Palma <imstevenpmwork@ieee.org> * chore(motors): remove normalization tables damiao * fix(motors): imports and signatures * feat(motors): add motor_type_str + recv_id to motor class and _get_motor_recv_id raises if no motor_obj.recv_id * chore(motors): remove normalize from base motor class and damaio * tests(motors): remove bad tests (to be replaced) * chore(motors): updated import check * fix(robots): open arm mirrored config for joint limits * chore(motors): update position_kd gain values * chore(robots): set to 0 if openarm is calibrated at connect time * chore(robots): remove macos in open arm as can doesn't support it * chore(robots): update for motor_type_str in Motor class * chore(robots): no default value for can port in open arms * use constant for kp and kd range and check responses in mit_control_batch() * Add docs on setting up canbus and use damiao otor bus, also add lerobot_setup_can.py and log if there is not response from a write command * precommit format * supress bandit as these are intentional cli commands * fix setup-can * add test * skip test in ci * nit precommit * update doc example * dont import can for tests * remove comment * Add openarms docs * format * update purchase link * can to none if nit availabl;e * add canfd option in bus * make handshake logic similar to lerobot-can * type hint * type check * add temp teleop test * remove script * mock class * ignore linter --------- Signed-off-by: Steven Palma <imstevenpmwork@ieee.org> Co-authored-by: Pepijn <pepijn@huggingface.co> Co-authored-by: Pepijn <138571049+pkooij@users.noreply.github.com>	2026-01-28 14:28:51 +01:00
Michel Aractingi	736b43f3cf	Fix(aggregate.py) Aggregation of datasets when sub-datasets are already a result of a previous merge (#2861 ) * Fix aggeregation of datasets when subdatasets are already a result of a previous merge * docstring * respond to copilot review + add regression test * Remove unnecessary int conversion for indicies	2026-01-28 13:31:27 +01:00
Steven Palma	9cfb5ce546	feat(motors): add damiao motors & can bus (#2788 ) * fix(motors): cleanup imports + fix signatures * feat(motors): add damiao canbus + multiple fixes * fix(motors): address comments -> last_state + different gains + sleep * refactor(motors): reduce duplicated code + adressed some comments in the PR * chore(motors): better timeouts * tests(motors): damiao test and imports * chore(deps): fix space * Apply suggestions from code review Co-authored-by: Pepijn <138571049+pkooij@users.noreply.github.com> Signed-off-by: Steven Palma <imstevenpmwork@ieee.org> * chore(motors): remove normalization tables damiao * fix(motors): imports and signatures * feat(motors): add motor_type_str + recv_id to motor class and _get_motor_recv_id raises if no motor_obj.recv_id * chore(motors): remove normalize from base motor class and damaio * tests(motors): remove bad tests (to be replaced) * chore(motors): updated import check * use constant for kp and kd range and check responses in mit_control_batch() * Add docs on setting up canbus and use damiao otor bus, also add lerobot_setup_can.py and log if there is not response from a write command * precommit format * supress bandit as these are intentional cli commands * fix setup-can * add test * skip test in ci * nit precommit * update doc example * dont import can for tests --------- Signed-off-by: Steven Palma <imstevenpmwork@ieee.org> Co-authored-by: Pepijn <138571049+pkooij@users.noreply.github.com> Co-authored-by: Pepijn <pepijn@huggingface.co>	2026-01-26 17:53:25 +01:00
Reece O'Mahoney	366bef915c	add task ids to libero env cfg (#2842 )	2026-01-26 17:26:49 +01:00
Woojin Wie	9e10eb4a77	fix(robots): update gripper configuration and calibration settings for OMX (#2815 )	2026-01-25 22:29:37 +01:00
Steven Palma	0b067df57d	feat(robots): add context managers (#2828 )	2026-01-20 18:02:38 +01:00
sato_shinji	9919b16b36	fix: ensure action tensors are moved to client_device in async training (#2792 ) * feat(async_inference): server always sends CPU tensors, client handles device conversion * fix:fix the type annotation of RawObservation in src/lerobot/async_inference/helpers.py * update the import of robot_client --------- Co-authored-by: Sato shinji <wwwsatoshinji@gmail.com> Co-authored-by: Steven Palma <imstevenpmwork@ieee.org> Co-authored-by: KB <kevin-brian.n-diaye@epita.fr>	2026-01-20 15:17:38 +01:00
Alexis D	13bfee1aa4	Set 10 direction bit for Current Load attribute (#1014 )	2026-01-20 11:20:30 +01:00
Jade Choghari	79688a09f2	improve(dataset-tools): image2video editing tools : Multiple episodes per video file (#2811 ) * improve image2video * add episodes video encoding * fix mypy failing * iterate on review * nit * remove max, and let it be optional * iterate more * update docs * fix test --------- Co-authored-by: Michel Aractingi <michel.aractingi@huggingface.co>	2026-01-20 11:04:22 +01:00
Francesco Capuano	b2ff219624	Fixes aggregation of image datasets (#2717 ) * fix: use features when aggregating image based datasets * add: test asserting for data type * add: features param to writing dataset --------- Co-authored-by: Steven Palma <imstevenpmwork@ieee.org>	2026-01-19 23:36:41 +01:00
Maximilian Ofir	66929c5935	feat: add async server-client streaming support for Groot policy (#2812 )	2026-01-19 22:13:48 +01:00
Steven Palma	5286ef8439	feat(utils): extend import check util (#2820 ) * refactor(utils): is_package_available now differentiate between pkg name and module name * refactor(tests): update require_package decorator	2026-01-19 16:43:11 +01:00
bigmbigk	fe068df711	fix(train): eval env initialization on train script (#2818 ) * fix: eval env initialization on train script Signed-off-by: bigmbigk <bigmbigk@gmail.com> * fix: eval env creation condition --------- Signed-off-by: bigmbigk <bigmbigk@gmail.com>	2026-01-19 14:14:10 +01:00
Sung-Wook Lee	da41646073	fix libero reset logic for correct resetting (#2817 )	2026-01-19 13:18:52 +01:00
Steven Palma	46e19ae579	feat: is connect checks decorators (#2813 )	2026-01-16 18:52:06 +01:00
Alex Tyshka	77dc49b3a3	Fix delta timestamps with episodes filter and add tests (#2612 )	2026-01-16 18:14:54 +01:00
Alex Tyshka	33910673ec	Bugfix: Add tests for image deletion and fix mixed image-video deletion (#2592 ) * Add tests for image deletion and fix mixed-image-video deletion * Fix docstring whitespace * Remove debug print Signed-off-by: Alex Tyshka <atyshka15@gmail.com> * Remove inaccurate comment * Remove batched video test --------- Signed-off-by: Alex Tyshka <atyshka15@gmail.com>	2026-01-16 18:14:15 +01:00
Michel Aractingi	19dce78457	Refactor: Move PEFT config from training script to policy level (#2806 ) * move peft config from `lerobot_train` to policy level * Update src/lerobot/scripts/lerobot_train.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Signed-off-by: Michel Aractingi <michel.aractingi@huggingface.co> * copilot response * Change the polciy function to return targets rather than peft config.`_get_default_peft_targets()` override in PI0, PI0.5, SmolVLA * remove none check when building config dict --------- Signed-off-by: Michel Aractingi <michel.aractingi@huggingface.co>	2026-01-16 17:14:28 +01:00
Steven Palma	1c61b43b15	fix(teleop): add is_connected check to get_action (#2801 )	2026-01-14 17:14:12 +01:00
Steven Palma	15724826dd	chore: use alias & constants (#2785 ) * chore: use alias and constants * fix(rl): solve circular dependecy * chore: nit right constant * chore: pre-commit * chore(script): conflict tokenizer train --------- Signed-off-by: Steven Palma <imstevenpmwork@ieee.org>	2026-01-13 09:49:46 +01:00
Jade Choghari	2cdd9f43f7	fix: train tokenizer CLI entry point (#2784 )	2026-01-13 01:42:53 +01:00
samet-rob	d0f57f58d1	Move cfg.validate() earlier to fix NoneType error with --policy.path (#2782 )	2026-01-12 19:24:19 +01:00
Steven Palma	b8ec1152d4	fix(robots): add reachy2 fixes (#2783 ) * fix(robots): add reachy2 fixes * tests(robots): remove reachy sdk stub	2026-01-12 18:05:16 +01:00
Martino Russi	6b8d4c75a6	Feat/g1 improvements record sim (#2765 ) This PR extends the integration of Unitree g1 with the LeRobot codebase. By converting robot state to a flat dict we can now record and replay episodes (example groot/holosoma scripts need to be adjusted as well). We also improve the simulation integration by calling .step @ _subscribe_motor_state instead of it running in a separate thread. We also add ZMQ camera to lerobot, streaming base64 images over json	2026-01-12 17:31:39 +01:00
Steven Palma	d791a431fe	feat(robots): consolidates bi SO setups (#2780 ) * feat(robots): consolidates bi SO setups * fix(robots): solve circular dependecy * fix(robots): teleop & record working * feat(robots): only one SO * fix(utils): rename bi so * fix(scripts): bi so import * fix(rl): remove imports	2026-01-12 16:01:22 +01:00
Jade Choghari	473f1bd0e0	docs: improve assets (#2777 ) * add assets * add libero results pifast: * update * update * update size * update naems: : * update training tokenizer	2026-01-12 13:33:28 +01:00
Michel Aractingi	91ff9c4975	Fix: Respect policy.device=cpu config in training (#2778 ) * fix cpu training in lerobot_train * Update src/lerobot/scripts/lerobot_train.py Signed-off-by: Michel Aractingi <michel.aractingi@huggingface.co>	2026-01-12 12:19:02 +01:00
Jade Choghari	1d86c9b7f2	feat(policies): add autoregressive VLAs with tokenization PiFast (#2734 )	2026-01-09 23:08:37 +01:00
Leo Tronchon	8b6fc0ae05	feat(datasets): expose video codec option for dataset recording (#2771 ) * expose codec options + add tests * pre-commit run -a	2026-01-08 18:06:39 +01:00
Steven Palma	ccfd609ece	feat(robots): consolidate SO arms implementation (#2763 ) * feat(robots): consolidate SO arms implementation * chore(robots): delete unnecessary init modules	2026-01-08 13:04:30 +01:00
Steven Palma	fbe4c8b94f	Feat/remote rerunviz encoded images (#2767 ) * feat(visualization): allow remote viewer + compress rerun images * fix(tests): allow named argument in mocked rerun * feat(visualization): ip instead or url & cli arg for compressing images --------- Co-authored-by: J4nn1K <jannik@grothusen.de>	2026-01-07 17:38:13 +01:00
Steven Palma	4f7cd8d369	Revert "feat(visualization): allow remote viewer + compress rerun images (#2756 )" (#2766 ) This reverts commit `f844c7a458`.	2026-01-07 17:33:36 +01:00
Steven Palma	f844c7a458	feat(visualization): allow remote viewer + compress rerun images (#2756 ) * feat(visualization): allow remote viewer + compress rerun images * fix(tests): allow named argument in mocked rerun * feat(visualization): ip instead or url & cli arg for compressing images	2026-01-07 17:30:45 +01:00
Martino Russi	7e9d05a799	add holosoma locomotion (#2669 ) Add holosoma locomotion from Amazon-FAR Add reset method to unitree_g1 Format actions as dict Update docs	2026-01-07 16:05:31 +01:00
Steven Palma	e2957d7783	fix: precise_sleep is never called with negative value (#2757 )	2026-01-06 20:09:43 +01:00
Tong Wu	603d44434f	fix a bug for kwargs in wallx (#2714 ) * support wallx * fix bugs in flow * incorporate wallx model into lerobot * update the policy methods * reduce to least config and params & pass lerobot basic test * fixed dtype bugs * add wallx dependencies * update * remove flash-attn requirement && fix bug in inference and fast mode * fix bug for inference * add some small modifications * fix pre-commit errors * remove lerobot[wallx] * fix ci * fix precommit issues * fix: exclude wallx extra properly in CI workflows * fix: add uv conflicts for wallx transformers version * fix: peft test import * pre-commit * only export WallXConfig from wall_x package to avoid peft import in CI * remove torch dep * precommit * add import * update doc files * fix minor errors * fix a bug for kwargs * fix precommit issue --------- Signed-off-by: Pepijn <138571049+pkooij@users.noreply.github.com> Co-authored-by: vincentchen <chenlufang@x2robot.com> Co-authored-by: Geoffrey19 <sympathischmann35@gmail.com> Co-authored-by: Pepijn <138571049+pkooij@users.noreply.github.com> Co-authored-by: Pepijn <pepijn@huggingface.co> Co-authored-by: geoffrey <geoffrey@x2robot.com>	2026-01-06 15:13:35 +01:00
githubnemo	e670ac5daf	Add basic PEFT support to train script + record module (#1411 ) * Add basic support for PEFT adapter methods This changes adds support for training policies with much less parameters by applying adapter methods such as LoRA on specific parts of the policies and therefore possibly higher learning rates / batch sizes. To make this as accessible as possible I thought it useful to provide defaults for `target_modules` and `modules_to_save`. Currently only SmolVLA has such defaults but when we agree that this change is useful I will set out to generate more such defaults. While the user can override these settings, they are expected to only change the peft_method, rank and init_type parameters. * Implement loading of PEFT adapters Loading a PEFT adapter is currently done by initializing a policy with default config and then applying the adapter on the resulting model. This has the obvious drawback that any configurations done during training are not applied in the adapted model. Currently the `use_peft` attribute of `PreTrainedConfig` is only set during loading to signal the following code that it has to deal with a PEFT adapter. However we could imagine a scenario where this is already set at training time and stored alongside the adapter. * Store policy config alongside PEFT checkpoint Before this change the PEFT-wrapped policy did not save the policy's config alongside the adapter config / weights which prevented us from changing the policy config. Now the policy config is saved both in full training and PEFT training. This change makes loading the PEFT policy adapter much easier as well. * Add default config for ACT * Support targets like `all-linear` * Formatting * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix failing tests * Remove PEFT compatibility changes in config We'll wait for the PEFT release that fixes this for good. * Remove `use_peft` parameter from training script Instead we make the PEFT config optional which has the same effect. * Log adapter config to WandB * Better documentation for CLI arguments * Don't unload & merge the PEFT model This can make things hard when using quantized layers (user expects quantized base layers with unquantized adapters for example, merging defaults to upcast the layers leading to higher memory). * Correct way of identifying when to save config * Add CLI end-to-end tests Currently there don't seem to be any way to test the CLI commands. Since this change mostly happens in those I thought it best to add a way to test these commands end-to-end. More integrated commands like `lerobot-record` need patching but standalone commands like training seem to work fine. * Update default targets Removed ACT since it doesn't make sense to fine-tune ACT without having it pretrained beforehand. SmolVLA and Pi0/0.5 are much more senseful targets. * Clean up loading code - Centralized instantiation of the PEFT wrapper in `make_policy` for inference (e.g. in `lerobot-record`) - Training a PEFT policy also sets `cfg.use_peft` so that all inference code loading the policy can rely on that attribute to identify if PEFT loading is needed - Modified RTC example to also include PEFT policies. Mostly because this is an example I'm currently exploring. * Make sure push_to_hub works Since PEFT only wraps `push_to_hub` and not `push_model_to_hub`, the reference to `self` in `policy.push_model_to_hub` is the unwrapped policy which, of course, doesn't know anything about PEFT. To make the upload process aware of PEFT, we pass the unwrapped policy down to `push_model_to_hub` as a kwarg. This is not ideal but I think it is the best way for now. * formatting * Warn when encountering from-scratch-training * Revamp pretrained model loading There were quite a few factors that convinced me that the status quo is able to load pretrained models from the PEFT adapter config but in fact that didn't work. This commit fixes the following things: - policies wrapped in PEFT will now have a `name_or_path` attribute containing the name or path of the pretrained model we're fine-tuning - we further assume that SmolVLA without `pretrained_path` and `load_vlm_weights==False` must be an user-side error - we assume that using PEFT on from-scratch-policies must be an user-side-error * Make it possible to unset policy features This is necessary to train pre-trained policies on new datasets so that the features are inferred from the new dataset and not from the pretrained policy. * Use correct loading for PEFT in RTC example * Make it possible to use PeftModels in eval * Add test checking that PEFT actually reduces params * Adapt state/action projections instead of full-finetuning There doesn't seem to be a benefit to fully fine-tune these layers over just adapting them, so we do that instead. * Disallow PEFT training on non-pretrained policies At first I thought it would make sense to have this feature in case you want to fine-tune a pre-trained section but in the end it makes more trouble than it's worth. It's still possible to allow this in the future when a concrete need arises. * Add basic documentation * Formatting * Add peft as extra dependency, mark tests Fast tests currently fail because of the missing dependency. * Fix pre-commit issues * Add walx <> peft conflict for uv * Exclude peft from pi install for now --------- Co-authored-by: nemo <git@ningu.net> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Pepijn <138571049+pkooij@users.noreply.github.com>	2026-01-05 08:51:26 +01:00

1 2 3 4 5 ...

299 Commits