Khalil Meftah | 1ed32210c7 | refactor(rl/sac): consolidate hyperparameter ownership and clean up discrete critic | 2026-04-24 13:18:33 +02:00
Khalil Meftah | 06255996ea | refactor(policies): rename policies/sac → policies/gaussian_actor | 2026-04-23 19:13:18 +02:00
Khalil Meftah | 8065bf15c7 | fix test for flat dict structure | 2026-04-21 12:06:25 +02:00
Khalil Meftah | a4c0c9e358 | update losses names in tests | 2026-04-21 11:53:32 +02:00
Khalil Meftah | a84b0e8132 | refactor(sac): decouple algorithm hyperparameters from policy config | 2026-04-18 16:40:56 +02:00
Khalil Meftah | 7a1c9e74c3 | fix: skip tests that require grpc if not available | 2026-04-15 15:18:04 +02:00
Khalil Meftah | e022207c75 | refactor: RL stack refactoring — RLAlgorithm, RLTrainer, DataMixer, and SAC restructuring | 2026-04-13 11:39:48 +02:00