mirror of
https://github.com/Tavish9/any4lerobot.git
synced 2026-05-16 06:29:45 +00:00
723bd71cf2
`table.slice(...).to_pandas()` produces pandas ExtensionArrays for `array[float32]` columns (e.g. `observation.states.end.orientation`) on newer pandas/pyarrow combos, which then fail in `pa.Table.from_pandas` inside `Dataset.from_pandas(...).to_parquet(...)`. Skip the pandas round-trip and wrap the `pa.Table` slice in a `Dataset` directly with `Dataset(episode_table).to_parquet(...)`. This preserves the HuggingFace dataset metadata that `Dataset.to_parquet` writes, while avoiding the ExtensionArray crash. No version pin on datasets/pyarrow needed. Closes #87
LeRobot Dataset Version Convert
The LeRobot Dataset has undergone multiple versions over time, with significant improvements in data storage and reading performance in each iteration. The versions of the dataset are as follows: v1.0-v1.6, v2.0, v2.1, v3.0.
| Version | Release Date | Version Conversion Link |
|---|---|---|
| v1.0-v1.6 | 2024-07-23 | --- |
| v2.0 | 2024-11-30 | v1.6 to v2.0, v2.1 to v2.0 |
| v2.1 | 2025-02-25 | v2.0 to v2.1 |
| v3.0 | 2025-09-15 | v2.1 to v3.0 |