smolvla2(runtime): wire MemoryUpdateFwd into the inference pipeline

MemoryUpdateFwd was importable but never installed, so subtask_change events fired by HighLevelSubtaskFwd had no listener and current_memory stayed at its initial None value — the runtime panel always showed 'memory (not set)' even when the policy was trained with the memory_update recipe (e.g. subtask_mem_vqa_speech.yaml, weight 0.15). Insert MemoryUpdateFwd between HighLevelSubtaskFwd and AskVQAFwd so the event is visible the same tick it is emitted, and refresh the stale comment that claimed memory was not in scope. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-07-17 15:01:54 +00:00 · 2026-05-25 12:52:44 +02:00
parent 793c7c4ddd
commit 6d2b8c80ab
1 changed files with 25 additions and 13 deletions
@@ -33,8 +33,9 @@ from .steps import (
    HighLevelSubtaskFwd,
    InferenceStep,
    LowLevelForward,
+    MemoryUpdateFwd,
 )
-from .triggers import HzTrigger, TickClock
+from .triggers import EventTrigger, HzTrigger, TickClock

 logger = logging.getLogger(__name__)

@@ -67,29 +68,40 @@ class SmolVLA2Runtime:
    _stop: bool = field(default=False, init=False)

    def __post_init__(self) -> None:
-        # Subtask + VQA configuration (current scope — plan and memory
-        # are not trained yet). Pipeline:
+        # Subtask + memory + VQA configuration. Pipeline:
        #
        #   HighLevelSubtaskFwd → generate the next subtask via the LM
        #                         head at ~``high_level_hz``; writes
-        #                         ``current_subtask``
-        #   AskVQAFwd           → answer camera-grounded stdin questions
+        #                         ``current_subtask`` and emits
+        #                         ``subtask_change`` on a transition.
+        #   MemoryUpdateFwd     → on ``subtask_change``, refresh
+        #                         ``current_memory`` from the
+        #                         ``memory_update`` head.
+        #   AskVQAFwd           → answer camera-grounded stdin questions.
        #   LowLevelForward     → action chunk conditioned on the
-        #                         generated ``current_subtask``
-        #   DispatchAction      → drain the chunk to the robot
-        #   DispatchToolCalls   → fire any pending tool calls
+        #                         generated ``current_subtask``.
+        #   DispatchAction      → drain the chunk to the robot.
+        #   DispatchToolCalls   → fire any pending tool calls.
        #
-        # Order matters: ``HighLevelSubtaskFwd`` and ``LowLevelForward``
-        # are both gated on "action queue empty", so the subtask must
-        # refresh *before* the chunk that consumes it. ``MemoryUpdateFwd``
-        # / ``UserInterjectionFwd`` are still importable from
-        # ``inference.steps`` — re-add once plan / memory are in scope.
+        # Order matters: ``HighLevelSubtaskFwd`` must run before
+        # ``MemoryUpdateFwd`` so the event is visible the same tick, and
+        # both must run before ``LowLevelForward`` (which is gated on
+        # "action queue empty") so the chunk consumes the freshest
+        # subtask. ``UserInterjectionFwd`` is still importable but
+        # disabled until plan generation is wired in.
        self.pipeline = [
            HighLevelSubtaskFwd(
                trigger=HzTrigger(self.high_level_hz),
                policy=self.policy,
                observation_provider=self.observation_provider,
            ),
+            # Listens for the ``subtask_change`` event raised by
+            # ``HighLevelSubtaskFwd`` and refreshes ``current_memory``.
+            MemoryUpdateFwd(
+                trigger=EventTrigger("subtask_change"),
+                policy=self.policy,
+                observation_provider=self.observation_provider,
+            ),
            AskVQAFwd(
                policy=self.policy,
                observation_provider=self.observation_provider,