Preprint Article Version 1 This version is not peer-reviewed

An Efficient Isolated Speech Recognition Based on the Adaptive Rate Processing and Analysis

Version 1 : Received: 30 October 2018 / Approved: 31 October 2018 / Online: 31 October 2018 (08:14:15 CET)

How to cite: MIAN QAISAR, S. An Efficient Isolated Speech Recognition Based on the Adaptive Rate Processing and Analysis. Preprints 2018, 2018100739 (doi: 10.20944/preprints201810.0739.v1). MIAN QAISAR, S. An Efficient Isolated Speech Recognition Based on the Adaptive Rate Processing and Analysis. Preprints 2018, 2018100739 (doi: 10.20944/preprints201810.0739.v1).

Abstract

This paper proposes a novel approach, based on the adaptive rate processing and analysis, for the isolated speech recognition. The idea is to smartly combine the event-driven signal acquisition and windowing along with adaptive rate processing, analysis and classification for realizing an effective isolated speech recognition. The incoming speech signal is digitized with an event-driven A/D converter (EDADC). The output of EDADC is windowed with an activity selection process. These windows are later on resampled uniformly with an adaptive rate interpolator. The resampled windows are de-noised with an adaptive rate filter and their spectrum are computed with an adaptive resolution short time Fourier transform (ARSTFT). Later on, the magnitude, Delta and Delta-Delta spectral coefficients are extracted. The Dynamic Time Warping (DTW) technique is employed to compare these extracted features with the reference templates. The comparison outcomes are used to make the classification decision. The system functionality is tested for a case study and results are presented. An 8.2 times reduction in acquired number of samples is achieved by the devised approach as compared to the classical one. It aptitudes a significant computational gain and power consumption reduction of the proposed system over the counter classical ones. An average subject dependent isolated speech recognition accuracy of 96.8% is achieved. It shows that the proposed approach is a potential candidate for the automatic speech recognition applications like rehabilitation centers, smart call centers, smart homes, etc.

Subject Areas

Event-Driven Processing, Speech recognition, Adaptive Resolution Analysis, Features extraction, Dynamic Time Warping, Classification

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our diversity statement.

Leave a public comment
Send a private comment to the author(s)
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.