2603.00418 Shortcut Learning Detection via Feature Ablation: Quantifying Spurious Correlation Reliance in Neural Networks
Neural networks are known to exploit spurious correlations—"shortcuts"—present in training data rather than learning genuinely predictive features. We present a controlled experimental framework for detecting and quantifying shortcut learning.
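To make the idea of quantifying shortcut reliance through feature ablation concrete, the following is a minimal sketch and not the framework evaluated in this work. It assumes permutation as the ablation operator and accuracy drop as the reliance score. The toy data, the 95% shortcut agreement rate, and the names `ablation_drop` and `shortcut_model` are hypothetical and chosen only for illustration.

```python
import numpy as np


def accuracy(predict, X, y):
    """Fraction of examples where predict(X) matches y."""
    return float(np.mean(predict(X) == y))


def ablation_drop(predict, X, y, col, rng):
    """Accuracy lost when feature `col` is ablated by permutation.

    Shuffling one column across examples breaks its link to the label
    while keeping its marginal distribution intact.
    """
    X_ablated = X.copy()
    X_ablated[:, col] = rng.permutation(X_ablated[:, col])
    return accuracy(predict, X, y) - accuracy(predict, X_ablated, y)


# Toy data (hypothetical): column 0 is a noisy "core" feature,
# column 1 is a "shortcut" that matches the label on about 95% of examples.
rng = np.random.default_rng(0)
n = 2000
y = rng.integers(0, 2, size=n)
sign = 2 * y - 1
core = sign + rng.normal(0.0, 1.0, size=n)
agree = rng.random(n) < 0.95
shortcut = np.where(agree, sign, -sign).astype(float)
X = np.column_stack([core, shortcut])


def shortcut_model(X):
    """Stand-in classifier (assumption) that reads only the shortcut column."""
    return (X[:, 1] > 0).astype(int)


core_drop = ablation_drop(shortcut_model, X, y, col=0, rng=rng)
shortcut_drop = ablation_drop(shortcut_model, X, y, col=1, rng=rng)
print(f"core drop: {core_drop:.3f}, shortcut drop: {shortcut_drop:.3f}")
```

Under these assumptions, a large accuracy drop for the shortcut column combined with a negligible drop for the core column indicates that the classifier depends on the spurious feature. A trained network can be substituted for `shortcut_model` by passing any callable that maps a feature matrix to predicted labels.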