chore(smolvla2-runtime): auto-fire one tick at dry-run startup

Previously the dry-run REPL only ticked on user input (empty Enter just redrew), so the bisection test "does the LM head produce text on start_frame=0?" required typing something arbitrary to drive a tick. Just run ``step_once`` at startup — the obs diagnostic *and* the subtask gen both fire automatically, the diag row populates, and the operator can read the result before pressing any key. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-06-23 11:17:02 +00:00 · 2026-05-12 18:34:42 +02:00
parent 47fb8318b1
commit 731576be80
1 changed files with 12 additions and 6 deletions
@@ -1318,15 +1318,21 @@ def main(argv: list[str] | None = None) -> int:
            initial_task=args.task,
            max_ticks=args.max_ticks,
        )
-    # Fire the observation provider once at startup so the
-    # ``_log_obs_tensors_once`` diagnostic prints before the REPL
-    # blocks on stdin. The REPL otherwise only ticks on user input —
-    # useful for the train/inference tensor-comparison workflow.
+    # Fire one full pipeline tick at startup so the obs diagnostic
+    # *and* the subtask generation actually run before the REPL
+    # blocks on stdin. The REPL otherwise only ticks on user input,
+    # which made the dry-run bisection test (does the LM head produce
+    # text at start_frame=0?) require typing something. Doing
+    # ``step_once`` here means the diag row populates without any
+    # manual interaction.
    if observation_provider is not None:
        try:
-            _ = observation_provider()
+            startup_logs = runtime.step_once()
        except Exception as exc:  # noqa: BLE001
-            logger.warning("startup obs probe failed: %s", exc)
+            logger.warning("startup tick failed: %s", exc)
+            startup_logs = []
+        for line in startup_logs or []:
+            print(f"[smolvla2] {line}", flush=True)
    return _run_repl(runtime, initial_task=args.task, max_ticks=args.max_ticks)