Khalil Meftah
ef6b3b5b0f
refactor: simplify docstrings for clarity and conciseness across multiple files
2026-04-28 11:11:02 +02:00
Khalil Meftah
e298474bf3
fix(tests): gate RL tests on the datasets extra
2026-04-27 16:53:34 +02:00
Khalil Meftah
577f14337a
refactor(tests): remove grpc import checks from test files for cleaner code
2026-04-27 16:20:13 +02:00
Khalil Meftah
9ce9e01469
refactor(rl): make algorithm a nested config so all SAC hyperparameters are JSON-addressable
2026-04-27 13:39:03 +02:00
Khalil Meftah
1ed32210c7
refactor(rl/sac): consolidate hyperparameter ownership and clean up discrete critic
2026-04-24 13:18:33 +02:00
Khalil Meftah
06255996ea
refactor(policies): rename policies/sac → policies/gaussian_actor
2026-04-23 19:13:18 +02:00
Khalil Meftah
8065bf15c7
fix test for flat dict structure
2026-04-21 12:06:25 +02:00
Khalil Meftah
a4c0c9e358
update losses names in tests
2026-04-21 11:53:32 +02:00
Khalil Meftah
a84b0e8132
refactor(sac): decouple algorithm hyperparameters from policy config
2026-04-18 16:40:56 +02:00
Khalil Meftah
d7e25c8326
refactor(rl): expose public API in rl/__init__ and use relative imports in sub-packages
2026-04-16 15:46:34 +02:00
Khalil Meftah
a5ad273b62
fix(tests): skip tests that require grpc if not available
2026-04-15 16:30:20 +02:00
Khalil Meftah
7a1c9e74c3
fix: skip tests that require grpc if not available
2026-04-15 15:18:04 +02:00
Khalil Meftah
da6e36fd03
Merge remote-tracking branch 'origin/main' into user/khalil-meftah/2026-02-16-rl-stack-refactor
2026-04-14 17:14:56 +02:00
Khalil Meftah
e022207c75
refactor: RL stack refactoring — RLAlgorithm, RLTrainer, DataMixer, and SAC restructuring
2026-04-13 11:39:48 +02:00
Steven Palma
df0763a2bc
feat(dependencies): minimal default tag install (#3362)
2026-04-12 20:03:04 +02:00
Steven Palma
5286ef8439
feat(utils): extend import check util (#2820)
* refactor(utils): is_package_available now differentiate between pkg name and module name
* refactor(tests): update require_package decorator
2026-01-19 16:43:11 +01:00
pre-commit-ci[bot]
7aedbbf81a
[pre-commit.ci] pre-commit autoupdate (#1563)
* [pre-commit.ci] pre-commit autoupdate
updates:
- [github.com/pre-commit/pre-commit-hooks: v5.0.0 → v6.0.0](https://github.com/pre-commit/pre-commit-hooks/compare/v5.0.0...v6.0.0)
- [github.com/astral-sh/ruff-pre-commit: v0.12.4 → v0.13.0](https://github.com/astral-sh/ruff-pre-commit/compare/v0.12.4...v0.13.0)
- [github.com/adhtruong/mirrors-typos: v1.34.0 → v1.36.2](https://github.com/adhtruong/mirrors-typos/compare/v1.34.0...v1.36.2)
- [github.com/gitleaks/gitleaks: v8.27.2 → v8.28.0](https://github.com/gitleaks/gitleaks/compare/v8.27.2...v8.28.0)
- [github.com/woodruffw/zizmor-pre-commit: v1.11.0 → v1.13.0](https://github.com/woodruffw/zizmor-pre-commit/compare/v1.11.0...v1.13.0)
* chore: update pre-commit versions
---------
Co-authored-by: Steven Palma <imstevenpmwork@ieee.org>
2025-10-18 01:20:45 +02:00
Steven Palma
43d878a102
chore: replace hard-coded obs values with constants throughout all the source code (#2037)
* chore: replace hard-coded OBS values with constants throughout all the source code
* chore(tests): replace hard-coded OBS values with constants throughout all the test code
2025-09-25 15:36:47 +02:00
Steven Palma
170c09e7f6
chore(utils): move queue utils and wandb_utils to their respective modules (#2030)
* chore(utils): move queue utils and wandb_utils to their respective modules
* fix(rl): remove double imports
---------
Signed-off-by: Steven Palma <imstevenpmwork@ieee.org>
2025-09-24 17:10:52 +02:00
Steven Palma
d6a32e9742
chore(rl): move rl related code to its directory at top level (#2002)
* chore(rl): move rl related code to its directory at top level
* chore(style): apply pre-commit to renamed headers
* test(rl): fix rl imports
* docs(rl): update rl headers doc
2025-09-23 16:32:34 +02:00
Simon Alibert
d4ee470b00
Package folder structure (#1417)
* Move files
* Replace imports & paths
* Update relative paths
* Update doc symlinks
* Update instructions paths
* Fix imports
* Update grpc files
* Update more instructions
* Downgrade grpc-tools
* Update manifest
* Update more paths
* Update config paths
* Update CI paths
* Update bandit exclusions
* Remove walkthrough section
2025-07-01 16:34:46 +02:00
Adil Zouitine
d8079587a2
Port HIL SERL (#644)
Co-authored-by: Michel Aractingi <michel.aractingi@huggingface.co>
Co-authored-by: Eugene Mironov <helper2424@gmail.com>
Co-authored-by: s1lent4gnt <kmeftah.khalil@gmail.com>
Co-authored-by: Ke Wang <superwk1017@gmail.com>
Co-authored-by: Yoel Chornton <yoel.chornton@gmail.com>
Co-authored-by: imstevenpmwork <steven.palma@huggingface.co>
Co-authored-by: Simon Alibert <simon.alibert@huggingface.co>
2025-06-13 13:15:47 +02:00