Structural Sensitivity in Compressed Transformers: Error Propagation, Lyapunov Stability, and Formally Verified Bounds
arXiv:2603.20991v1 Announce Type: cross Abstract: A single matrix out of 468 in GPT-2 Small can increase perplexity by 20,000x when…
