Files
Qizhi Chen 09862f8d3d add support for lerobot2rlds (#22)
* add support for lerobot2rlds with beam processing
2025-05-16 10:45:11 +08:00

2.5 KiB
Raw Permalink Blame History

LeRobot to RLDS

RLDS stands for Reinforcement Learning Datasets and it is an ecosystem of tools to store, retrieve and manipulate episodic data in the context of Sequential Decision Making including Reinforcement Learning (RL), Learning for Demonstrations, Offline RL or Imitation Learning.

For more details, please check official repo.

Motivation

Some classic works like OpenVLA, Octo, etc. currently only support reading the RLDS format. To meet the communitys needs, we provide a script that converts the popular LERobot format into the RLDS format.

🚀 What's New in This Script

  • Complete Data Preservation: Retains all original information from the lerobot dataset, including diverse image keys, depth maps, and associated metadata.
  • TFDS Conversion Simplified: Implements the first Python-based workflow to launch TensorFlow Datasets (TFDS) conversions with native support for parallel Beam processing.
  • Customizable RLDS Metadata: Enables flexible customization of RLDS dataset metadata fields (e.g., citations, descriptions, versioning) through a unified configuration interface.

Installation

  1. Install LeRobot:
    Follow instructions in official repo.

  2. Install others: For saving tfds/rlds, we need to install tensorflow-datasets:

    pip install tensorflow
    pip install tensorflow-datasets
    

    If you want to enable beam processing:

    pip install apache-beam
    

Get started

Warning

  • Beam processing is implemented for speed improvements, but may exhibit occasional instability with Apache Beam.
  • If your dataset is small, or you want to safely save all the data, we recommend disabling beam processing.
  • If partial episode loss is acceptable for performance gains, enable beam by adding --enable-beam.

Download source code:

git clone https://github.com/Tavish9/any4lerobot.git

Modify path in convert.sh:

python lerobot2rlds.py \
    --src-dir /path/to/lerobot/dataset \
    --output-dir /path/to/rlds_dir \
    --task-name default_task

Customizing rlds:

    --encoding-format png \
    --version 1.0.0 \
    --citation "@{...}"

For more flags, check python lerobot2rlds.py --help

Execute the script:

cd lerobot2rlds && bash convert.sh