THE AI TODAY

arXiv:2604.00505v3 Announce Type: replace-cross
Abstract: Overparameterized neural networks often exhibit a benign overfitting property, achieving excellent generalization behavior even though the number of parameters exceeds the number of training examples. A promising direction for explaining benign overfitting is to relate generalization to the norm of the distance from initialization, motivated by the empirical observation that this distance is often significantly smaller than the norm of the weights themselves. However, existing initialization-dependent complexity analyses measure the distance from initialization with the Frobenius norm and often yield vacuous bounds in practice for overparameterized models. In this paper, we develop initialization-dependent complexity bounds for shallow neural networks with general Lipschitz activation functions. Our bounds depend on the path-norm of the distance from initialization and are derived by introducing a new peeling technique that handles the challenge posed by the initialization-dependent constraint. We also derive a lower bound that is tight up to a constant factor. Finally, we conduct empirical comparisons and show that our generalization analysis implies non-vacuous bounds for overparameterized networks.
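To make the central quantity concrete, here is a minimal illustrative sketch (not the paper's exact definition or method): for a one-hidden-layer network f(x) = sum_j v_j * sigma(W[j] @ x), a common form of the path-norm sums the products of absolute weights along every input-to-output path. The sketch applies that quantity to the displacement (W - W0, v - v0) from initialization and contrasts it with the Frobenius distance; the function name `path_norm_of_distance` and the specific initialization scaling are illustrative assumptions.

```python
import numpy as np

def path_norm_of_distance(W, v, W0, v0):
    """Path-norm-style measure of the displacement from initialization
    for a one-hidden-layer network (illustrative definition).

    W, W0 : (hidden, input) hidden-layer weights (current / at initialization)
    v, v0 : (hidden,)       output-layer weights (current / at initialization)
    """
    dW = np.abs(W - W0)               # |W_ji - W0_ji| for each hidden-input edge
    dv = np.abs(v - v0)               # |v_j - v0_j| for each hidden-output edge
    # Sum over all paths (input i -> hidden j -> output) of |v_j - v0_j| * |W_ji - W0_ji|
    return float(dv @ dW.sum(axis=1))

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    hidden, inputs = 512, 20          # many hidden units: an overparameterized regime
    W0 = rng.normal(size=(hidden, inputs)) / np.sqrt(inputs)
    v0 = rng.normal(size=hidden) / np.sqrt(hidden)
    # Suppose training moved the weights only slightly away from (W0, v0).
    W = W0 + 0.01 * rng.normal(size=W0.shape)
    v = v0 + 0.01 * rng.normal(size=v0.shape)

    print("path-norm of distance from init:", path_norm_of_distance(W, v, W0, v0))
    print("Frobenius distance from init  :",
          np.sqrt(np.linalg.norm(W - W0) ** 2 + np.linalg.norm(v - v0) ** 2))
```

The point of the comparison is that both quantities are measured relative to initialization rather than to the origin, which is the property the abstract highlights as the source of non-vacuous bounds.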

By Admin
