mirror of
https://github.com/Tavish9/any4lerobot.git
synced 2026-05-17 23:19:41 +00:00
723bd71cf2
`table.slice(...).to_pandas()` produces pandas ExtensionArrays for `array[float32]` columns (e.g. `observation.states.end.orientation`) on newer pandas/pyarrow combos, which then fail in `pa.Table.from_pandas` inside `Dataset.from_pandas(...).to_parquet(...)`. Skip the pandas round-trip and wrap the `pa.Table` slice in a `Dataset` directly with `Dataset(episode_table).to_parquet(...)`. This preserves the HuggingFace dataset metadata that `Dataset.to_parquet` writes, while avoiding the ExtensionArray crash. No version pin on datasets/pyarrow needed. Closes #87
LeRobot Dataset v30 to v21
Get started
-
Downgrade datasets:
pip install "datasets<4.0.0"Need to downgrade datasets first since
4.0.0introducesListandColumn. -
Install v3.0 lerobot
git clone https://github.com/huggingface/lerobot.git pip install -e . -
Run the converter:
python convert_dataset_v30_to_v21.py \ --repo-id=your_id \ --root=your_local_dir