lerobot

mirror of https://github.com/huggingface/lerobot.git synced 2026-05-18 02:00:03 +00:00

Author	SHA1	Message	Date
AdilZouitine	7c2c67fc3c	Enhance SAC configuration and replay buffer with asynchronous prefetching support - Added async_prefetch parameter to SACConfig for improved buffer management. - Implemented get_iterator method in ReplayBuffer to support asynchronous prefetching of batches. - Updated learner_server to utilize the new iterator for online and offline sampling, enhancing training efficiency.	2025-04-18 15:10:22 +02:00
Michel Aractingi	0cce2fe0fa	Added Gripper quantization wrapper and grasp penalty removed complementary info from buffer and learner server removed get_gripper_action function added gripper parameters to `common/envs/configs.py`	2025-04-18 15:10:22 +02:00
pre-commit-ci[bot]	88d26ae976	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-04-18 15:10:22 +02:00
s1lent4gnt	427720426b	Add complementary info in the replay buffer - Added complementary info in the add method - Added complementary info in the sample method	2025-04-18 15:10:22 +02:00
s1lent4gnt	70d418935d	Fix: Prevent Invalid next_state References When optimize_memory=True (#918 )	2025-04-18 15:10:22 +02:00
pre-commit-ci[bot]	eb44a06a9b	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-04-18 15:10:22 +02:00
AdilZouitine	38e8864284	Add task field to frame_dict in ReplayBuffer and simplify save_episode calls - Introduced a new "task" field in frame_dict to meet the requirements of LeRobotDataset. - Removed task_name parameter from save_episode calls for consistency.	2025-04-18 15:09:27 +02:00
pre-commit-ci[bot]	0ea27704f6	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-04-18 15:09:25 +02:00
AdilZouitine	cdcf346061	Update tensor device assignment in ReplayBuffer class - Changed the device assignment for tensors in the ReplayBuffer class from `device` to `storage_device` for consistency and improved resource management.	2025-04-18 15:06:52 +02:00
Eugene Mironov	b6a2200983	[HIL-SERL] Migrate threading to multiprocessing (#759 ) Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2025-04-18 15:06:52 +02:00
pre-commit-ci[bot]	85fe8a3f4e	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-04-18 15:06:51 +02:00
AdilZouitine	ae51c19b3c	Add memory optimization option to ReplayBuffer - Introduce `optimize_memory` parameter to reduce memory usage in replay buffer - Implement simplified memory optimization by not storing duplicate next_states - Update learner server and buffer initialization to use memory optimization by default	2025-04-18 15:04:58 +02:00
AdilZouitine	1d4ec50a58	Refactor ReplayBuffer with tensor-based storage and improved sampling efficiency - Replaced list-based memory storage with pre-allocated tensor storage - Optimized sampling process with direct tensor indexing - Added support for DrQ image augmentation during sampling for offline dataset - Improved dataset conversion with more robust episode handling - Enhanced buffer initialization and state tracking - Added comprehensive testing for buffer conversion and sampling	2025-04-18 15:04:58 +02:00
AdilZouitine	4c73891575	Update ManiSkill configuration and replay buffer to support truncation and dataset handling - Reduced image size in ManiSkill environment configuration from 128 to 64 - Added support for truncation in replay buffer and actor server - Updated SAC policy configuration to use a specific dataset and modify vision encoder settings - Improved dataset conversion process with progress tracking and task naming - Added flexibility for joint action space masking in learner server	2025-04-18 15:04:58 +02:00
Eugene Mironov	e1d55c7a44	[Port HIL-SERL] Adjust Actor-Learner architecture & clean up dependency management for HIL-SERL (#722 )	2025-04-18 15:04:56 +02:00
Michel Aractingi	a16ea283f5	Fixed bug in the action scale of the intervention actions and offline dataset actions. (scale by inverse delta) Co-authored-by: Adil Zouitine <adizouitinegm@gmail.com>	2025-04-18 15:04:44 +02:00
Michel Aractingi	8209a6dfb7	Modified crop_dataset_roi interface to automatically write the cropped parameters to a json file in the meta of the dataset Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-04-18 15:04:44 +02:00
Michel Aractingi	b5fbeb7401	Optimized the replay buffer from the memory side to store data on cpu instead of a gpu device and send the batches to the gpu. Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-04-18 15:04:44 +02:00
Michel Aractingi	5195f40fd3	Hardcoded some normalization parameters. TODO refactor Added masking actions on the level of the intervention actions and offline dataset Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-04-18 15:04:43 +02:00
Michel Aractingi	f1af97dc9c	- Added JointMaskingActionSpace wrapper in `gym_manipulator` in order to select which joints will be controlled. For example, we can disable the gripper actions for some tasks. - Added Nan detection mechanisms in the actor, learner and gym_manipulator for the case where we encounter nans in the loop. - changed the non-blocking in the `.to(device)` functions to only work for the case of cuda because they were causing nans when running the policy on mps - Added some joint clipping and limits in the env, robot and policy configs. TODO clean this part and make the limits in one config file only. Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-04-18 15:04:13 +02:00
Michel Aractingi	9784d8a47f	Several fixes to move the actor_server and learner_server code from the maniskill environment to the real robot environment. Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-04-18 15:04:13 +02:00
Michel Aractingi	273fa2e6e1	fixed bug in crop_dataset_roi.py added missing buffer.pt in server dir Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-04-18 15:04:13 +02:00

22 Commits