Khalil Meftah
ef6b3b5b0f
refactor: simplify docstrings for clarity and conciseness across multiple files
2026-04-28 11:11:02 +02:00
Khalil Meftah
9ce9e01469
refactor(rl): make algorithm a nested config so all SAC hyperparameters are JSON-addressable
2026-04-27 13:39:03 +02:00
Khalil Meftah
1ed32210c7
refactor(rl/sac): consolidate hyperparameter ownership and clean up discrete critic
2026-04-24 13:18:33 +02:00
Khalil Meftah
06255996ea
refactor(policies): rename policies/sac → policies/gaussian_actor
2026-04-23 19:13:18 +02:00
Khalil Meftah
a4c0c9e358
update losses names in tests
2026-04-21 11:53:32 +02:00
Khalil Meftah
a5ad273b62
fix(tests): skip tests that require grpc if not available
2026-04-15 16:30:20 +02:00
Khalil Meftah
da6e36fd03
Merge remote-tracking branch 'origin/main' into user/khalil-meftah/2026-02-16-rl-stack-refactor
2026-04-14 17:14:56 +02:00
Khalil Meftah
e022207c75
refactor: RL stack refactoring — RLAlgorithm, RLTrainer, DataMixer, and SAC restructuring
2026-04-13 11:39:48 +02:00
Steven Palma
df0763a2bc
feat(dependencies): minimal default tag install ( #3362 )
2026-04-12 20:03:04 +02:00
Steven Palma
5286ef8439
feat(utils): extend import check util ( #2820 )
...
* refactor(utils): is_package_available now differentiate between pkg name and module name
* refactor(tests): update require_package decorator
2026-01-19 16:43:11 +01:00
Steven Palma
43d878a102
chore: replace hard-coded obs values with constants throughout all the source code ( #2037 )
...
* chore: replace hard-coded OBS values with constants throughout all the source code
* chore(tests): replace hard-coded OBS values with constants throughout all the test code
2025-09-25 15:36:47 +02:00
Steven Palma
d6a32e9742
chore(rl): move rl related code to its directory at top level ( #2002 )
...
* chore(rl): move rl related code to its directory at top level
* chore(style): apply pre-commit to renamed headers
* test(rl): fix rl imports
* docs(rl): update rl headers doc
2025-09-23 16:32:34 +02:00
Simon Alibert
d4ee470b00
Package folder structure ( #1417 )
...
* Move files
* Replace imports & paths
* Update relative paths
* Update doc symlinks
* Update instructions paths
* Fix imports
* Update grpc files
* Update more instructions
* Downgrade grpc-tools
* Update manifest
* Update more paths
* Update config paths
* Update CI paths
* Update bandit exclusions
* Remove walkthrough section
2025-07-01 16:34:46 +02:00
Adil Zouitine
d8079587a2
Port HIL SERL ( #644 )
...
Co-authored-by: Michel Aractingi <michel.aractingi@huggingface.co >
Co-authored-by: Eugene Mironov <helper2424@gmail.com >
Co-authored-by: s1lent4gnt <kmeftah.khalil@gmail.com >
Co-authored-by: Ke Wang <superwk1017@gmail.com >
Co-authored-by: Yoel Chornton <yoel.chornton@gmail.com >
Co-authored-by: imstevenpmwork <steven.palma@huggingface.co >
Co-authored-by: Simon Alibert <simon.alibert@huggingface.co >
2025-06-13 13:15:47 +02:00