7 Commits

Author SHA1 Message Date
Qizhi Chen 2ef2370d66 🐛 fix v30→v21 converter imports (#101)
Co-authored-by: Codex <codex@openai.com>
2026-05-09 22:31:45 -07:00
FennMai 723bd71cf2 🐛 fix v30_to_v21 ArrowTypeError on pandas extension dtypes
`table.slice(...).to_pandas()` produces pandas ExtensionArrays for
`array[float32]` columns (e.g. `observation.states.end.orientation`)
on newer pandas/pyarrow combos, which then fail in
`pa.Table.from_pandas` inside `Dataset.from_pandas(...).to_parquet(...)`.

Skip the pandas round-trip and wrap the `pa.Table` slice in a
`Dataset` directly with `Dataset(episode_table).to_parquet(...)`.
This preserves the HuggingFace dataset metadata that `Dataset.to_parquet`
writes, while avoiding the ExtensionArray crash. No version pin on
datasets/pyarrow needed.

Closes #87
2026-04-30 07:03:03 +00:00
Qizhi Chen 97c278f339 🐛 fix dataset version convert (#75)
* fix v30_to_v21

* sync v21_to_v30 with official
2025-12-03 12:53:59 +08:00
Yihao Liu 01d1df3920 feat: v3.0 to v2.1 (#68)
* feat: v2.1 to v2.0

* [DEV] update to filter new 3.0 fields

* feat: v21tov20 with filters and v30tov21

* fix: gemini-code-assist comments

* remove: global stat copy

* fix: rename root to output-path

* feat: v3.0->v2.1 fields filter

* align v30_to_v21 logic to v21_to_v30

* delete v21_to_v20 logic for this PR

* remove unused code

* update readme

---------

Co-authored-by: Tavish <tavish9.chen@gmail.com>
2025-11-27 12:28:56 +08:00
Qizhi Chen c5d1312a2b add version convert collections (#63)
* v20 to v21

* v21 to v20

* v21 to v30

* v16 to v20

* update dataset version convert readme

* update readme
2025-10-04 16:49:19 +08:00
Qizhi Chen 858cbd582d ⬆️ upgrade lerobot to 0.2.0 (#51) 2025-07-31 08:51:51 +08:00
Tavish 33302f40dc refactor file structure 2025-06-17 19:24:00 +08:00