arXiv:2510.13857v1 Announce Type: cross
Abstract: The advent of powerful Large Language Models (LLMs) has ushered in an “Age of the Agent,” enabling autonomous systems to tackle complex goals. However, the transition from prototype to production is hindered by a pervasive “crisis of craft,” resulting in agents that are brittle, unpredictable, and ultimately untrustworthy in mission-critical applications. This paper argues that this crisis stems from a fundamental paradigm mismatch: attempting to command inherently probabilistic processors with the deterministic mental models of traditional software engineering. To solve this crisis, we introduce a governance-first paradigm for principled agent engineering, embodied in a formal architecture we call ArbiterOS.
