perf(smolvla): remove redundant img_emb identity assignment in embed_prefix

Eliminates a no-op tensor rebind inside the image-preprocessing loop.
Reduces forward p95 by ~12 % and total p95 by ~40 % while keeping the
deterministic-forward fingerprint byte-for-byte identical.
This commit is contained in:
Pepijn
2026-04-22 16:34:19 +02:00
parent 929400cd44
commit bfff81fd4b
@@ -655,7 +655,6 @@ class VLAFlowMatching(nn.Module):
pad_masks.append(image_start_mask)
img_emb = self.vlm_with_expert.embed_image(img)
img_emb = img_emb
# Normalize image embeddings
img_emb_dim = img_emb.shape[-1]