Hear: Hierarchically Enhanced Aesthetic Representations For Multidimensional Music Evaluation

ByAdmin

Jan 1, 2026

arXiv:2511.18869v2 Announce Type: replace-cross
Abstract: Evaluating song aesthetics is challenging due to the multidimensional nature of musical perception and the scarcity of labeled data. We propose HEAR, a robust music aesthetic evaluation framework that combines: (1) a multi-source multi-scale representations module to obtain complementary segment- and track-level features, (2) a hierarchical augmentation strategy to mitigate overfitting, and (3) a hybrid training objective that integrates regression and ranking losses for accurate scoring and reliable top-tier song identification. Experiments demonstrate that HEAR consistently outperforms the baseline across all metrics on both tracks of the ICASSP 2026 SongEval benchmark. The code and trained model weights are available at https://github.com/Eps-Acoustic-Revolution-Lab/EAR_HEAR.

Hear: Hierarchically Enhanced Aesthetic Representations For Multidimensional Music Evaluation

ByAdmin

By Admin

Related Post

Interpretation, extrapolation and perturbation of single cells

Advancing single-cell omics and cell-based therapeutics with quantum computing

Let the data speak — single-cell analysis with CellWhisperer

Leave a Reply Cancel reply

You missed

Convergence of machine learning and genomics for precision oncology

Let the data speak — single-cell analysis with CellWhisperer

Advancing single-cell omics and cell-based therapeutics with quantum computing

Interpretation, extrapolation and perturbation of single cells

THE AI TODAY