TFGN: Task-Free, Replay-Free Continual Pre-Training Without Catastrophic Forgetting at LLM Scale
arXiv:2605.15053v2 Announce Type: replace-cross Abstract: Continually pre-training a large language model on heterogeneous text domains, without replay or task labels,…
