Improving Acoustic Models for Dysarthric Speech Recognition using Time Delay Neural Networks

Publication Name : 2020 INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS (ICELTICS 2020)

DOI : 10.1109/ICELTICS50595.2020.9315506

Date : 2020

Deep learning approaches have recently been widely used to solve pattern recognition problems, especially in speech recognition. Deep neural network structures have yielded impressive performance in acoustic models for normal speakers. However, building a speech recognition model for dysarthric speakers remains a challenge. This paper investigates the performance of speech recognition models for dysarthric speakers using time delay deep neural networks. We also explore model performance when dysarthric and normal speech corpora are combined. Finally, well-tuned hyperparameters of the deep neural network structures give promising results on Mandarin and English dysarthric speech.

Publication URL

https://www.webofscience.com/wos/woscc/full-record/WOS:000652352900021

Type

Book in series

ISSN

2155-6822

Page

118 - 121

Authors

ALIM MISBULLAH

S1 - Informatika
