ViT-S/16 inspector

HuggingFace WinKawaks/vit-small-patch16-224 · 12 layers · 6 heads · 384 dim · 1536 MLP

Inspect every step of ViT-S inference

Pick a sample image (or upload your own) and hit Run inference. You'll see every tensor the model produces — patch embedding, position add, each layer's Q/K/V, attention scores, softmax, residual adds, MLP — and the final ImageNet prediction.