Demystifying the Slash Pattern in Attention: The Role of RoPE

Published in the first Foundations of Deep Generative Models workshop, ICML 2026