2-D psychoacoustic modeling for automatic speech recognition in noisy environment

Desai Sampreeta; Prasad Dattakumar Khandekar; J. Raut Ketan

doi:10.1109/CASP.2016.7746151

Profiles Research Units Publications

Journal

2-D psychoacoustic modeling for automatic speech recognition in noisy environment

Desai Sampreeta, , J. Raut Ketan

Published in

2016

DOI: 10.1109/CASP.2016.7746151

Pages: 129 - 132

Abstract

Powerful automatic speech recognition system (ASR)is matter of commercial importance as many leading companies are sprinting at industry and consumer level production. One of the major reasons for speech quality to hamper is environmental noise. Speech gets obscured by the loud background sound. This adversely affects the performance of automatic speech recognition system. We also know that human auditory system is comparatively more capable of managing noise than the machine. So as to improve the performance of ASR, auditory properties of human system is studied and modeled with the help of psychoacoustic filter. The filter is labeled as 2D P-filter as its parameter has values zero or positive. Also to remove noise, masking effect is implemented where the sounds falling under predetermined masking threshold are modified. Therefore the enhanced set of features are extracted by applying this filter to the Mel filter bank. The novelty of the paper is use of different distance metrics for classification and testing the performance of Automatic speech recognition system. Experiments are carried out on database of recording of rhyming words by articulatory disabled children in a studio. Expected results obtained after testing phase for noisy speech signals would be considerably improved.

Topics: Speech processing (68)%, Voice activity detection (64)%, Acoustic model (62)%, Intelligibility (communication) (61)% and Masking threshold (55)%

View more info for "2-D psychoacoustic modeling for automatic speech recognition in noisy environment"

About the journal

Journal	Conference on Advances in Signal Processing, CASP 2016
Open Access	No

Authors (1)

Prasad Dattakumar Khandekar
- School of Electronics & Communication Engineering
- Engineering and Technology

ABOUT

ACADEMICS

@MIT-WPU

ADMISSIONS/ PLACEMENTS

MISCELLANEOUS