Preprint
Article

This version is not peer-reviewed.

Latent Phrase-Aware Generative Modeling for Expressive Symbolic Audio Synthesis

Submitted: 02 March 2026

Posted: 04 March 2026

Abstract
This paper presents a generative AI framework for producing structured symbolic sequences with fine-grained expressive control. The approach introduces a compact token representation combined with phrase-aware latent alignment to support coherent generation across variable-length segments. By integrating sequence-level regularization directly into attention, the model balances structural consistency and diversity without relying on explicit post-processing constraints. Empirical analysis shows that the method maintains stable distributional behavior across expressive dimensions, highlighting its suitability for controllable symbolic generation tasks.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permits free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

© 2026 MDPI (Basel, Switzerland) unless otherwise stated