FREEDOM, added back the optimization loop code in learner_server.py

Ran experiment with pushcube env from maniskill. The learning seem to work. Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>
2026-05-24 13:09:43 +00:00 · 2025-01-28 17:25:49 +00:00
parent 508f5d1407
commit 2ae657f568
3 changed files with 85 additions and 80 deletions
@@ -82,7 +82,7 @@ policy:
  temperature_lr: 3e-4
  # critic_target_update_weight: 0.005
  critic_target_update_weight: 0.01
-  utd_ratio: 1
+  utd_ratio: 2


  # # Loss coefficients.