Commit Graph

97 Commits

Author SHA1 Message Date
Alexander Soare 94cc22da9e Merge remote-tracking branch 'upstream/main' into refactor_dp 2024-04-11 17:52:10 +01:00
Alexander Soare 976a197f98 backup wip 2024-04-11 17:51:35 +01:00
Cadene c1a618e567 fix pusht images type from float32 to uint8, update gym-pusht dependencies 2024-04-11 14:29:16 +00:00
Cadene 657b27cc8f fix load_data_with_delta_timestamps and add tests 2024-04-11 13:00:09 +00:00
Cadene 3914831585 remove __name__ outside script 2024-04-10 17:16:44 +00:00
Cadene f8c5a2eb10 remove comment 2024-04-10 17:14:02 +00:00
Cadene 9874652c2f enable test_compute_stats
enable test_compute_stats
2024-04-10 17:12:54 +00:00
Cadene e8622154f8 Replace import gym_pusht in pusht dataset by dynamic import 2024-04-10 15:56:18 +00:00
Cadene c08003278e test_examples are passing 2024-04-10 13:45:45 +00:00
Cadene 7c8eb7ff19 Merge remote-tracking branch 'origin/user/rcadene/2024_03_31_remove_torchrl' into user/rcadene/2024_03_31_remove_torchrl 2024-04-10 11:34:51 +00:00
Cadene 06573d7f67 online training works (loss goes down), remove repeat_action, eval_policy outputs episodes data, eval_policy uses max_episodes_rendered 2024-04-10 11:34:01 +00:00
Alexander Soare 50e4c8050c Merge remote-tracking branch 'upstream/user/rcadene/2024_03_31_remove_torchrl' into refactor_act 2024-04-08 17:13:11 +01:00
Alexander Soare 9c96349926 Merge remote-tracking branch 'upstream/user/rcadene/2024_03_31_remove_torchrl' into refactor_act 2024-04-08 15:44:00 +01:00
Simon Alibert 3f6dfa4916 Add gym-aloha, rename simxarm -> xarm, refactor 2024-04-08 16:24:11 +02:00
Cadene 70aaf1c4cb test_datasets.py are passing! 2024-04-08 14:16:57 +00:00
Alexander Soare 863f28ffd8 ready for review 2024-04-08 13:10:19 +01:00
Alexander Soare e982c732f1 Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl 2024-04-08 09:25:45 +01:00
Cadene 4371a5570d Remove latency, tdmpc policy passes tests (TODO: make it work with online RL) 2024-04-07 16:01:22 +00:00
Cadene 44656d2706 test_envs are passing 2024-04-05 23:27:12 +00:00
Alexander Soare 1e71196fe3 backup wip 2024-04-05 17:38:29 +01:00
Alexander Soare 4863e54ce9 Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl 2024-04-05 12:00:31 +01:00
Cadene ad3379a73a fix memory leak due to itertools.cycle 2024-04-05 10:59:32 +00:00
Alexander Soare 9d77f5773d Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl 2024-04-05 11:41:11 +01:00
Cadene 5af00d0c1e fix train.py, stats, eval.py (training is running) 2024-04-05 09:31:39 +00:00
Cadene c93ce35d8c WIP stats (TODO: run tests on stats + cmpute them) 2024-04-04 16:36:03 +00:00
Cadene 1cdfbc8b52 WIP
WIP

WIP train.py works, loss going down

WIP eval.py

Fix

WIP (eval running, TODO: verify results reproduced)

Eval works! (testing reproducibility)

WIP

pretrained model pusht reproduces same results as torchrl

pretrained model pusht reproduces same results as torchrl

Remove AbstractPolicy, Move all queues in select_action

WIP test_datasets passed (TODO: re-enable NormalizeTransform)
2024-04-04 15:31:03 +00:00
Alexander Soare c7d70a8db9 Merge remote-tracking branch 'upstream/main' into refactor_act 2024-04-03 10:08:12 +01:00
Alexander Soare caf4ffcf65 add TODO 2024-04-03 09:56:46 +01:00
Alexander Soare c50a62dd6d clarifying math 2024-04-03 09:47:38 +01:00
Alexander Soare e9eb262293 numerically sound mean computation 2024-04-03 09:44:20 +01:00
Alexander Soare 65ef8c30d0 backup wip 2024-04-02 19:13:49 +01:00
Alexander Soare 2b928eedd4 backup wip 2024-04-02 19:11:53 +01:00
Alexander Soare a6edb85da4 Remove random sampling 2024-04-02 16:52:38 +01:00
Alexander Soare 95293d459d fix stats computation 2024-04-02 16:40:33 +01:00
Alexander Soare 68d02c80cf Remove b/c workaround 2024-03-27 12:03:19 +00:00
Cadene 9ced0cf1fb unskip 2024-03-26 10:45:31 +00:00
Cadene 5a46b8a2a9 fix tests 2024-03-26 10:24:46 +00:00
Alexander Soare 1a1308d62f fix environment seeding
add fixes for reproducibility

only try to start env if it is closed

revision

fix normalization and data type

Improve README

Improve README

Tests are passing, Eval pretrained model works, Add gif

Update gif

Update gif

Update gif

Update gif

Update README

Update README

update minor

Update README.md

Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>

Update README.md

Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>

Address suggestions

Update thumbnail + stats

Update thumbnail + stats

Update README.md

Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>

Add more comments

Add test_examples.py
2024-03-26 10:10:43 +00:00
Simon Alibert c5635b7d94 Minor fixes for #47 2024-03-25 18:50:47 +01:00
Simon Alibert d3adaf1379 Add stat.pth for xarm_lift_medium 2024-03-25 15:55:45 +01:00
Simon Alibert c0833f1c2d Remove simxarm download and preproc hack 2024-03-25 12:41:17 +01:00
Simon Alibert de5c30405e fix wrong version 2024-03-25 12:35:06 +01:00
Simon Alibert 462e7469e8 Add xarm_lift_medium revision 1.0 to hub 2024-03-25 12:28:07 +01:00
Cadene b905111895 fix render issue 2024-03-25 12:28:07 +01:00
Simon Alibert 1c24bbda3f WIP Upgrading simxam from mujoco-py to mujoco python bindings 2024-03-25 12:28:07 +01:00
Cadene d2ef43436c move from cadene to lerobot 2024-03-23 13:34:35 +00:00
Cadene 40f3783fca v1.2 2024-03-23 11:41:56 +00:00
Alexander Soare 8720c568d0 Add ability to eval hub model 2024-03-22 10:26:55 +00:00
Cadene 82e6e01651 v1.1 2024-03-20 17:34:00 +00:00
Alexander Soare d16f6a93b3 Merge remote-tracking branch 'upstream/main' into user/alexander-soare/multistep_policy_and_serial_env 2024-03-20 09:01:45 +00:00