lerobot

mirror of https://github.com/huggingface/lerobot.git synced 2026-07-24 18:26:11 +00:00

Author	SHA1	Message	Date
Eugene Mironov	700f00c014	[HIL-SERL] Migrate threading to multiprocessing (#759 ) Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2025-03-05 11:19:31 +01:00
pre-commit-ci[bot]	584cad808e	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-03-04 13:38:48 +00:00
AdilZouitine	d8a1758122	Add storage device configuration for SAC policy and replay buffer - Introduce `storage_device` parameter in SAC configuration and training settings - Update learner server to use configurable storage device for replay buffer - Reduce online buffer capacity in ManiSkill configuration - Modify replay buffer initialization to support custom storage device	2025-03-04 13:22:35 +00:00
AdilZouitine	42a038173f	Update ManiSkill configuration and replay buffer to support truncation and dataset handling - Reduced image size in ManiSkill environment configuration from 128 to 64 - Added support for truncation in replay buffer and actor server - Updated SAC policy configuration to use a specific dataset and modify vision encoder settings - Improved dataset conversion process with progress tracking and task naming - Added flexibility for joint action space masking in learner server	2025-02-24 16:53:37 +00:00
Michel Aractingi	546719137a	Added caching function in the learner_server and modeling sac in order to limit the number of forward passes through the pretrained encoder when its frozen. Added tensordict dependencies Updated the version of torch and torchvision Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-02-21 10:13:43 +00:00
Eugene Mironov	3ffe0cf0f4	[Port HIL-SERL] Adjust Actor-Learner architecture & clean up dependency management for HIL-SERL (#722 )	2025-02-21 10:29:00 +01:00
AdilZouitine	ff82367c62	Refactor SAC policy with performance optimizations and multi-camera support - Introduced Ensemble and CriticHead classes for more efficient critic network handling - Added support for multiple camera inputs in observation encoder - Optimized image encoding by batching image processing - Updated configuration for ManiSkill environment with reduced image size and action scaling - Compiled critic networks for improved performance - Simplified normalization and ensemble handling in critic networks Co-authored-by: michel-aractingi <michel.aractingi@gmail.com>	2025-02-20 17:14:27 +00:00
Michel Aractingi	ff47c0b0d3	- Fixed big issue in the loading of the policy parameters sent by the learner to the actor -- pass only the actor to the `update_policy_parameters` and remove `strict=False` - Fixed big issue in the normalization of the actions in the `forward` function of the critic -- remove the `torch.no_grad` decorator in `normalize.py` in the normalization function - Fixed performance issue to boost the optimization frequency by setting the storage device to be the same as the device of learning. Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-02-19 16:22:51 +00:00
AdilZouitine	2f3370e42f	Add maniskill support. Co-authored-by: Michel Aractingi <michel.aractingi@gmail.com>	2025-02-14 19:53:29 +00:00
Michel Aractingi	7ae368e983	Fixed bug in the action scale of the intervention actions and offline dataset actions. (scale by inverse delta) Co-authored-by: Adil Zouitine <adizouitinegm@gmail.com>	2025-02-14 15:17:16 +01:00
Michel Aractingi	36711d766a	Modified crop_dataset_roi interface to automatically write the cropped parameters to a json file in the meta of the dataset Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-02-14 12:32:45 +01:00
Michel Aractingi	0c32008466	Changed bounds for a new so100 robot Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-02-13 15:43:30 +01:00
Michel Aractingi	c462a478c7	Hardcoded some normalization parameters. TODO refactor Added masking actions on the level of the intervention actions and offline dataset Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-02-13 14:27:14 +01:00
Michel Aractingi	459f22ed30	fix log_alpha in modeling_sac: change to nn.parameter added pretrained vision model in policy Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-02-13 11:26:24 +01:00
Michel Aractingi	dc086dc21f	Added logging for interventions to monitor the rate of interventions through time Added an s keyboard command to force success in the case the reward classifier fails Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-02-13 11:04:49 +01:00
Michel Aractingi	b9217b06db	Added possiblity to record and replay delta actions during teleoperation rather than absolute actions Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-02-12 19:25:41 +01:00
Eugene Mironov	a1d16fb400	[Port HIL-SERL] Add resnet-10 as default encoder for HIL-SERL (#696 ) Co-authored-by: Khalil Meftah <kmeftah.khalil@gmail.com> Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com> Co-authored-by: Michel Aractingi <michel.aractingi@huggingface.co> Co-authored-by: Ke Wang <superwk1017@gmail.com>	2025-02-11 11:37:00 +01:00
Michel Aractingi	a7db3959f5	- Added JointMaskingActionSpace wrapper in `gym_manipulator` in order to select which joints will be controlled. For example, we can disable the gripper actions for some tasks. - Added Nan detection mechanisms in the actor, learner and gym_manipulator for the case where we encounter nans in the loop. - changed the non-blocking in the `.to(device)` functions to only work for the case of cuda because they were causing nans when running the policy on mps - Added some joint clipping and limits in the env, robot and policy configs. TODO clean this part and make the limits in one config file only. Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-02-11 11:34:46 +01:00
Michel Aractingi	b5f89439ff	Added sac_real config file in the policym configs dir. Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-02-10 16:08:13 +01:00
Michel Aractingi	d51374ce12	Several fixes to move the actor_server and learner_server code from the maniskill environment to the real robot environment. Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-02-10 16:03:39 +01:00
Eugene Mironov	b63738674c	[HIL-SERL port] Add Reward classifier benchmark tracking to chose best visual encoder (#688 )	2025-02-06 18:39:51 +01:00
Michel Aractingi	12525242ce	- Added `lerobot/scripts/server/gym_manipulator.py` that contains all the necessary wrappers to run a gym-style env around the real robot. - Added `lerobot/scripts/server/find_joint_limits.py` to test the min and max angles of the motion you wish the robot to explore during RL training. - Added logic in `manipulator.py` to limit the maximum possible joint angles to allow motion within a predefined joint position range. The limits are specified in the yaml config for each robot. Checkout the so100.yaml. Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-02-06 16:29:37 +01:00
Michel Aractingi	506821c7df	- Refactor observation encoder in `modeling_sac.py` - added `torch.compile` to the actor and learner servers. - organized imports in `train_sac.py` - optimized the parameters push by not sending the frozen pre-trained encoder. Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-02-03 15:07:58 +00:00
Michel Aractingi	7c89bd1018	Cleaned `learner_server.py`. Added several block function to improve readability. Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-02-03 15:07:58 +00:00
Michel Aractingi	367dfe51c6	Added support for checkpointing the policy. We can save and load the policy state dict, optimizers state, optimization step and interaction step Added functions for converting the replay buffer from and to LeRobotDataset. When we want to save the replay buffer, we convert it first to LeRobotDataset format and save it locally and vice-versa. Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-02-03 15:07:58 +00:00
Michel Aractingi	9aabe212ea	Added missing config files `env/maniskill_example.yaml` and `policy/sac_maniskill.yaml` that are necessary to run the lerobot implementation of sac with the maniskill baselines. Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-02-03 15:07:58 +00:00
Michel Aractingi	42618f4bd6	- Added additional logging information in wandb around the timings of the policy loop and optimization loop. - Optimized critic design that improves the performance of the learner loop by a factor of 2 - Cleaned the code and fixed style issues - Completed the config with actor_learner_config field that contains host-ip and port elemnts that are necessary for the actor-learner servers. Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-02-03 15:07:58 +00:00
Michel Aractingi	36576c958f	FREEDOM, added back the optimization loop code in `learner_server.py` Ran experiment with pushcube env from maniskill. The learning seem to work. Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-02-03 15:07:58 +00:00
AdilZouitine	d75b44f89f	Stable version of rlpd + drq	2025-02-03 15:07:57 +00:00
Michel Aractingi	3bb5ed5e91	Extend reward classifier for multiple camera views (#626 )	2025-01-13 13:57:49 +01:00
Eugene Mironov	c5bca1cf0f	[Port HIL_SERL] Final fixes for the Reward Classifier (#598 )	2025-01-06 11:34:00 +01:00
KeWang1017	22fbc9ea4a	Refine SAC configuration and policy for enhanced performance - Updated standard deviation parameterization in SACConfig to 'softplus' with defined min and max values for improved stability. - Modified action sampling in SACPolicy to use reparameterized sampling, ensuring better gradient flow and log probability calculations. - Cleaned up log probability calculations in TanhMultivariateNormalDiag for clarity and efficiency. - Increased evaluation frequency in YAML configuration to 50000 for more efficient training cycles. These changes aim to enhance the robustness and performance of the SAC implementation during training and inference.	2024-12-29 14:21:49 +00:00
KeWang1017	18a4598986	trying to get sac running	2024-12-29 14:14:13 +00:00
Michel Aractingi	7fcf638c0d	Add human intervention mechanism and eval_robot script to evaluate policy on the robot (#541 ) Co-authored-by: Yoel <yoel.chornton@gmail.com>	2024-12-17 02:41:31 +07:00
Yoel	e35546f58e	Reward classifier and training (#528 ) Co-authored-by: Daniel Ritchie <daniel@brainwavecollective.ai> Co-authored-by: resolver101757 <kelster101757@hotmail.com> Co-authored-by: Jannik Grothusen <56967823+J4nn1K@users.noreply.github.com> Co-authored-by: Remi <re.cadene@gmail.com> Co-authored-by: Michel Aractingi <michel.aractingi@huggingface.co>	2024-12-17 02:41:29 +07:00
KasparSLT	96c7052777	Rename deprecated argument (temporal_ensemble_momentum) (#490 )	2024-11-25 21:05:13 +01:00
Remi	07e8716315	Add FeetechMotorsBus, SO-100, Moss-v1 (#419 ) Co-authored-by: jess-moss <jess.moss@huggingface.co> Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>	2024-10-25 11:23:55 +02:00
Remi	97b1feb0b3	Add policy/act_aloha_real.yaml + env/act_real.yaml (#429 ) Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>	2024-10-10 17:12:45 +02:00
Simon Alibert	1a343c3591	Add support for Stretch (hello-robot) (#409 ) Co-authored-by: Remi <remi.cadene@huggingface.co> Co-authored-by: Remi Cadene <re.cadene@gmail.com>	2024-10-04 18:56:42 +02:00
Alexander Soare	92573486a8	Don't use async envs by default (#448 )	2024-09-20 15:22:52 +02:00
Remi	9ff829a3a1	Add comments for Aloha (#417 ) Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>	2024-09-06 21:07:52 +02:00
Remi	9c9f5cac90	Add IntelRealSenseCamera (#410 ) Co-authored-by: Simon Alibert <simon.alibert@huggingface.co> Co-authored-by: shantanuparab-tr <shantanu@trossenrobotics.com> Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>	2024-09-05 23:59:41 +02:00
Remi	429a463aff	Control aloha robot natively (#316 ) Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>	2024-09-04 19:28:05 +02:00
Jack Vial	27ba2951d1	fix(tdmpc): Add missing save_freq to tdmpc policy config (#404 ) Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>	2024-09-02 19:04:41 +01:00
Remi	1ce418e4a1	Add koch bimanual (#385 )	2024-08-28 00:53:31 +02:00
Alexander Soare	9ce98bb93c	Add safety limits on relative action target (#373 )	2024-08-26 14:30:18 +01:00
Alexander Soare	97086cdcdf	Make gripper_open_degree a config param (#379 )	2024-08-26 12:28:16 +01:00
Zhuoheng Li	a2592a5563	Provide more information to the user (#358 ) Co-authored-by: Alexander Soare <alexander.soare159@gmail.com> Co-authored-by: Remi <re.cadene@gmail.com>	2024-08-23 11:00:35 +01:00
Remi	bbe9057225	Improve control robot ; Add process to configure motor indices (#326 ) Co-authored-by: Simon Alibert <alibert.sim@gmail.com> Co-authored-by: jess-moss <jess.moss@dextrousrobotics.com> Co-authored-by: Marina Barannikov <marina.barannikov@huggingface.co> Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>	2024-08-15 18:11:33 +02:00
Alexander Soare	f8a6574698	Add online training with TD-MPC as proof of concept (#338 )	2024-07-25 11:16:38 +01:00

1 2 3 4

172 Commits