Commit Graph

144 Commits

Author SHA1 Message Date
Cadene a420714ee4 fix: action_is_pad was missing in compute_loss 2024-04-05 11:33:39 +00:00
Cadene ad3379a73a fix memory leak due to itertools.cycle 2024-04-05 10:59:32 +00:00
Cadene 5af00d0c1e fix train.py, stats, eval.py (training is running) 2024-04-05 09:31:39 +00:00
Cadene c93ce35d8c WIP stats (TODO: run tests on stats + cmpute them) 2024-04-04 16:36:03 +00:00
Cadene 1cdfbc8b52 WIP
WIP

WIP train.py works, loss going down

WIP eval.py

Fix

WIP (eval running, TODO: verify results reproduced)

Eval works! (testing reproducibility)

WIP

pretrained model pusht reproduces same results as torchrl

pretrained model pusht reproduces same results as torchrl

Remove AbstractPolicy, Move all queues in select_action

WIP test_datasets passed (TODO: re-enable NormalizeTransform)
2024-04-04 15:31:03 +00:00
Alexander Soare caf4ffcf65 add TODO 2024-04-03 09:56:46 +01:00
Alexander Soare c50a62dd6d clarifying math 2024-04-03 09:47:38 +01:00
Alexander Soare e9eb262293 numerically sound mean computation 2024-04-03 09:44:20 +01:00
Alexander Soare a6edb85da4 Remove random sampling 2024-04-02 16:52:38 +01:00
Alexander Soare 95293d459d fix stats computation 2024-04-02 16:40:33 +01:00
Alexander Soare f1148b8c2d Merge remote-tracking branch 'upstream/main' into finish_examples 2024-04-01 11:31:31 +01:00
Simon Alibert 6bddcb647e Add test_aloha env test 2024-03-28 10:35:11 +01:00
Alexander Soare b7c9c33072 revision 2024-03-27 18:33:48 +00:00
Alexander Soare 120f0aef5c Merge remote-tracking branch 'upstream/main' into finish_examples 2024-03-27 17:52:36 +00:00
Alexander Soare 68d02c80cf Remove b/c workaround 2024-03-27 12:03:19 +00:00
Alexander Soare 011f2d27fe fix tests 2024-03-26 16:40:54 +00:00
Alexander Soare 1ed0110900 finish examples 2 and 3 2024-03-26 16:13:40 +00:00
Cadene 9ced0cf1fb unskip 2024-03-26 10:45:31 +00:00
Cadene 5a46b8a2a9 fix tests 2024-03-26 10:24:46 +00:00
Alexander Soare 1a1308d62f fix environment seeding
add fixes for reproducibility

only try to start env if it is closed

revision

fix normalization and data type

Improve README

Improve README

Tests are passing, Eval pretrained model works, Add gif

Update gif

Update gif

Update gif

Update gif

Update README

Update README

update minor

Update README.md

Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>

Update README.md

Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>

Address suggestions

Update thumbnail + stats

Update thumbnail + stats

Update README.md

Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>

Add more comments

Add test_examples.py
2024-03-26 10:10:43 +00:00
Simon Alibert c5635b7d94 Minor fixes for #47 2024-03-25 18:50:47 +01:00
Simon Alibert bcfdba109f Update pre-commit & run on all files 2024-03-25 17:29:35 +01:00
Simon Alibert 7cdd6d2450 Renamed set_seed -> set_global_seed 2024-03-25 17:19:28 +01:00
Simon Alibert 058ac991eb Add simxarm back into tests 2024-03-25 16:35:46 +01:00
Simon Alibert d3adaf1379 Add stat.pth for xarm_lift_medium 2024-03-25 15:55:45 +01:00
Simon Alibert dc89166bee Upgrade gym to gymnasium 2024-03-25 15:12:21 +01:00
Simon Alibert 5ef813ff1e Remove deprecated code 2024-03-25 13:22:49 +01:00
Simon Alibert c0833f1c2d Remove simxarm download and preproc hack 2024-03-25 12:41:17 +01:00
Simon Alibert de5c30405e fix wrong version 2024-03-25 12:35:06 +01:00
Simon Alibert 462e7469e8 Add xarm_lift_medium revision 1.0 to hub 2024-03-25 12:28:07 +01:00
Simon Alibert 127de1258d WIP 2024-03-25 12:28:07 +01:00
Cadene b905111895 fix render issue 2024-03-25 12:28:07 +01:00
Simon Alibert 0c41675986 fix __init__ import Base 2024-03-25 12:28:07 +01:00
Simon Alibert 1c24bbda3f WIP Upgrading simxam from mujoco-py to mujoco python bindings 2024-03-25 12:28:07 +01:00
Remi f3cfc8b3b4 Merge pull request #46 from huggingface/user/rcadene/2024_03_23_update_stats_v1.2
Fix bug with stats.pth + Move from cadene to lerobot + Update datasets to v1.2
2024-03-24 17:53:32 +01:00
Cadene d2ef43436c move from cadene to lerobot 2024-03-23 13:34:35 +00:00
Cadene 40f3783fca v1.2 2024-03-23 11:41:56 +00:00
Alexander Soare e698d38a35 Merge remote-tracking branch 'upstream/main' into fix_environment_seeding 2024-03-22 15:11:15 +00:00
Alexander Soare 15ff3b3af8 add fixes for reproducibility 2024-03-22 15:06:57 +00:00
Alexander Soare b9047fbdd2 fix environment seeding 2024-03-22 13:25:23 +00:00
Alexander Soare 8720c568d0 Add ability to eval hub model 2024-03-22 10:26:55 +00:00
Alexander Soare 72d3c3120b Merge remote-tracking branch 'upstream/main' into fix_pusht_diffusion 2024-03-21 10:20:52 +00:00
Alexander Soare acf1174447 ready for review 2024-03-21 10:18:50 +00:00
Simon Alibert 1bd50122be Merge pull request #40 from huggingface/user/aliberts/2024_03_20_enable_mps_device
Enable mps backend for Apple silicon devices
2024-03-20 19:33:12 +01:00
Simon Alibert 4631d36c05 Add get_safe_torch_device in policies 2024-03-20 18:38:55 +01:00
Cadene 82e6e01651 v1.1 2024-03-20 17:34:00 +00:00
Alexander Soare d323993569 backup wip 2024-03-20 15:01:27 +00:00
Alexander Soare 4b7ec81dde remove abstracmethods, fix online training 2024-03-20 14:49:41 +00:00
Alexander Soare 32e3f71dd1 backup wip 2024-03-20 09:49:16 +00:00
Alexander Soare 5332766a82 revision 2024-03-20 09:45:45 +00:00