Improvement of Speech/Music Classification for 3GPP EVS Based on LSTM

Sang-Ick Kang; Sangmin Lee

doi:10.20944/preprints201811.0126.v1

Submitted:

05 November 2018

Posted:

05 November 2018

You are already at the latest version

Abstract

Speech/music classification that facilitates optimized signal processing from classification results has been extensively adapted as an essential part of various electronics applications, such as multi-rate audio codecs, automatic speech recognition, and multimedia document indexing. In this paper, a new technique to improve the robustness of speech/music classifier for 3GPP enhanced voice service (EVS) using long short-term memory (LSTM) is proposed. For effective speech/music classification, feature vectors implemented with the LSTM are chosen from the features of the EVS. Experiments show that LSTM-based speech/music classification produces better results than conventional EVS under a variety of conditions and types of speech/music data.

Keywords:

Speech/Music Classification

;

Enhanced Voice Service

;

Long Short-Term Memory

;

Big Data

Subject:

Engineering - Electrical and Electronic Engineering

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

Improvement of Speech/Music Classification for 3GPP EVS Based on LSTM

Abstract

Keywords:

Subject:

MDPI Initiatives

Important Links

Subscribe