Compositional Literary Primitives in Instruction-Tuned LLMs: Cross-Architectural SAE Features for Self, Style, and Affect
arXiv:2605.18808v1 Announce Type: cross Abstract: We characterize a compositional architecture of literary primitives in two instruction-tuned large language models (Llama…
