Latent Phrase-Aware Generative Modeling forExpressive Symbolic Audio Synthesis

Apeksha Bhuekar

doi:10.20944/preprints202603.0308.v1

Submitted:

02 March 2026

Posted:

04 March 2026

You are already at the latest version

Abstract

This paper presents a generative AI frameworkfor producing structured symbolic sequences with fine-grainedexpressive control. The approach introduces a compact tokenrepresentation combined with phrase-aware latent alignment tosupport coherent generation across variable-length segments. Byintegrating sequence-level regularization directly into attention,the model balances structural consistency and diversity withoutrelying on explicit post-processing constraints. Empirical analysisshows that the method maintains stable distributional behavioracross expressive dimensions, highlighting its suitability forcontrollable symbolic generation tasks.

Keywords:

controllable text generation

;

symbolic sequence modeling

;

attention mechanisms

;

latent alignment

;

generative AI

Subject:

Computer Science and Mathematics - Artificial Intelligence and Machine Learning

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

Latent Phrase-Aware Generative Modeling forExpressive Symbolic Audio Synthesis

Abstract

Keywords:

Subject:

MDPI Initiatives

Important Links

Subscribe