Preprint Article Version 2 Preserved in Portico This version is not peer-reviewed

A Multi-Resolution Approach for Audio Classification

Version 1 : Received: 18 April 2018 / Approved: 20 April 2018 / Online: 20 April 2018 (04:30:33 CEST)
Version 2 : Received: 19 July 2018 / Approved: 19 July 2018 / Online: 19 July 2018 (05:53:20 CEST)

How to cite: Voronin, S.; Grushin, A. A Multi-Resolution Approach for Audio Classification. Preprints 2018, 2018040258. https://doi.org/10.20944/preprints201804.0258.v2 Voronin, S.; Grushin, A. A Multi-Resolution Approach for Audio Classification. Preprints 2018, 2018040258. https://doi.org/10.20944/preprints201804.0258.v2

Abstract

We describe a multi-resolution approach for audio classification and illustrate its application to the open data set for environmental sound classification. The proposed approach utilizes a multi-resolution based ensemble consisting of targeted feature extraction of approximation (coarse scale) and detail (fine scale) portions of the signal under the action of multiple transforms. This is paired with an automatic machine learning engine for algorithm and parameter selection and the LSTM algorithm, capable of mapping several sequences of features to a predicted class membership probability distribution. Initial results show an improvement in multi-class classification accuracy.

Keywords

audio classification; multi-resolution analysis; LSTM; auto-ml

Subject

Computer Science and Mathematics, Information Systems

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.