mirror of
https://github.com/huggingface/lerobot.git
synced 2026-05-20 19:19:56 +00:00
note origins of each training objective
This commit is contained in:
@@ -187,7 +187,11 @@ The vision encoder uses a separate learning rate multiplier, where 1/10th is sug
|
|||||||
|
|
||||||
#### 1. Flow Matching with Beta Sampling
|
#### 1. Flow Matching with Beta Sampling
|
||||||
|
|
||||||
Consider switching to flow matching with beta sampling distribution for potentially improved performance:
|
The original diffusion implementation here is based on the work described in [TRI's LBM paper](https://arxiv.org/abs/2507.05331)
|
||||||
|
|
||||||
|
Additionally, we have implemented a flow-matching objective, which is described at a high-level in [Boston Dynamics blog post](https://bostondynamics.com/blog/large-behavior-models-atlas-find-new-footing/).
|
||||||
|
|
||||||
|
Consider testing the flow-matching objective and evaluating performance differences for your task:
|
||||||
|
|
||||||
```bash
|
```bash
|
||||||
--policy.objective=flow_matching \
|
--policy.objective=flow_matching \
|
||||||
|
|||||||
Reference in New Issue
Block a user