fix(ci): address PR review feedback for benchmark smoke tests

Security: - Remove "Login to Hugging Face" step — it was a no-op (ephemeral --rm container) that exposed the HF token via CLI argument in docker inspect / /proc/*/cmdline. The eval step already re-authenticates via env var. Functional: - Remove feat/benchmark-ci from push trigger branches (won't exist post-merge). Dockerfiles: - Pin uv to 0.8.0 (was unpinned, fetching whatever latest ships). - Add comment explaining the chmod +x ptxas workaround (Triton packaging bug — ships ptxas without execute bit). Scripts: - parse_eval_metrics.py: add note that it runs on bare host and must stay stdlib-only. - parse_eval_metrics.py: add NaN guard for avg_sum_reward and eval_s (was only guarding pc_success). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-07-08 18:41:54 +00:00 · 2026-04-10 12:47:58 +02:00
parent 58d4ecd304
commit c505a71f78
4 changed files with 20 additions and 13 deletions
@@ -31,7 +31,6 @@ on:

  push:
    branches:
-      - feat/benchmark-ci
      - main
    paths:
      - "src/lerobot/envs/**"
@@ -101,14 +100,6 @@ jobs:
          load: true
          tags: lerobot-benchmark-libero:ci

-      - name: Login to Hugging Face
-        if: env.HF_USER_TOKEN != ''
-        run: |
-          docker run --rm \
-            -e HF_HOME=/tmp/hf \
-            lerobot-benchmark-libero:ci \
-            bash -c "hf auth login --token '$HF_USER_TOKEN' --add-to-git-credential && hf auth whoami"
-
      - name: Run Libero smoke eval (1 episode)
        run: |
          # Named container (no --rm) so we can docker cp artifacts out.