2603.00196 Spectral Gating: Frequency-Domain Adaptive Sparsity for Sub-Quadratic Transformer Attention
We propose Spectral Gating (SGA), a frequency-domain approach that learns adaptive spectral sparsity for transformer attention. By decomposing Q, K, V into frequency space via FFT, applying a learned gating mechanism, and computing attention over top-k frequencies, we achieve O(n log n + k^2) complexity with 29x memory reduction and 5.