LLM-Meta-SR: In-Context Learning for Evolving Selection Operators in Symbolic Regression

By Admin

Apr 1, 2026

THE AI TODAY

arXiv:2505.18602v3
Abstract: Large language models (LLMs) have revolutionized algorithm development, yet their application in symbolic regression, where algorithms automatically discover symbolic expressions from data, remains limited. In this paper, we propose a meta-learning framework that enables LLMs to automatically design selection operators for evolutionary symbolic regression algorithms. We first identify two key limitations in existing LLM-based algorithm evolution techniques: lack of semantic guidance and code bloat. The absence of semantic awareness can lead to ineffective exchange of useful code components, while bloat results in unnecessarily complex components; both can hinder evolutionary learning progress or reduce the interpretability of the designed algorithm. To address these issues, we enhance the LLM-based evolution framework for meta-symbolic regression with two key innovations: a complementary, semantics-aware selection operator and bloat control. Additionally, we embed domain knowledge into the prompt, enabling the LLM to generate more effective and contextually relevant selection operators. Our experimental results on symbolic regression benchmarks show that LLMs can devise selection operators that outperform nine expert-designed baselines, achieving state-of-the-art performance. Moreover, the evolved operator can further improve a state-of-the-art symbolic regression algorithm, achieving the best performance among 28 symbolic regression and other machine learning algorithms across 116 regression datasets. This demonstrates that LLMs can exceed expert-level algorithm design for symbolic regression.
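To make the terms in the abstract concrete, here is a minimal Python sketch of what a selection operator in evolutionary symbolic regression might look like. It is an illustration only and not the operator evolved in the paper: the function names, the `size_weight` penalty, and the "minimum of per-case errors" complementarity score are all assumptions chosen for clarity. The first function picks a parent by tournament with a simple size penalty (one common way to discourage bloat), and the second picks a mate whose per-case errors complement the first parent's (one simple notion of semantic awareness).

```python
import random
from typing import List


def tournament_select(errors: List[List[float]], sizes: List[int],
                      k: int = 3, size_weight: float = 0.0, rng=random) -> int:
    """Pick k random individuals and return the index of the best one.

    errors[i] holds individual i's error on each training case.
    sizes[i] is the expression size of individual i (hypothetical bloat proxy).
    Score = mean error + size_weight * size; lower is better.
    """
    candidates = rng.sample(range(len(errors)), k)

    def score(i: int) -> float:
        return sum(errors[i]) / len(errors[i]) + size_weight * sizes[i]

    return min(candidates, key=score)


def complementary_select(errors: List[List[float]], first: int) -> int:
    """Return the individual whose per-case errors best complement `first`.

    Complementarity (an assumption of this sketch) is the sum over cases of
    the smaller of the two parents' errors: a low value means that, case by
    case, at least one of the two parents does well.
    """
    def combined(j: int) -> float:
        return sum(min(a, b) for a, b in zip(errors[first], errors[j]))

    others = [j for j in range(len(errors)) if j != first]
    return min(others, key=combined)


if __name__ == "__main__":
    # Toy population: three individuals evaluated on two training cases.
    errors = [[0.0, 1.0], [1.0, 0.0], [0.1, 0.9]]
    sizes = [5, 3, 9]
    first = tournament_select(errors, sizes, k=3, size_weight=0.01)
    mate = complementary_select(errors, first)
    print("parents:", first, mate)
```

In this toy population all three individuals have the same mean error, so the size penalty decides the tournament, and the mate is the individual that succeeds where the first parent fails. A real operator would work on the semantics of full expression trees and a much larger population.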

