Auditory Features For Speech Recognition And Enhancement available in Paperback
- VDM Verlag
Automatic speech recognition (ASR) is the transformation of an acoustic speech signal, captured by a microphone, a telephone, or another transducer, into a text sequence. It is also known as the recognition of speech by a machine or by some form of artificial intelligence. Despite several decades of focused research in this field, robust and highly reliable speech recognition has not been achieved: performance degrades in the presence of speaker variability, channel mismatch, and noisy environments. The superb ability of the human auditory system has motivated researchers to include features of human perception in the speech recognition process. This book investigates the roles of several psychoacoustic features of human hearing in automatic speech recognition in clean and noisy environments, and aims to determine which perceptual features are relevant for speech recognition applications. The psychoacoustic features investigated are a perceptual filterbank corresponding to the critical bands, synaptic adaptation, two-tone suppression, dynamic range compression, and simultaneous and temporal masking effects.