Analysis and Optimization of Telephone Speech Command Recognition System Performance in Noisy Environment

dc.contributor.author: Novotny, Jan
dc.contributor.author: Sovka, Pavel
dc.contributor.author: Uhlir, Jan
dc.coverage.issue: 1 [cs]
dc.coverage.volume: 13 [cs]
dc.date.accessioned: 2016-04-26T11:02:58Z
dc.date.available: 2016-04-26T11:02:58Z
dc.date.issued: 2004-04 [cs]
dc.description.abstract: This paper deals with the analysis and optimization of a speech command recognition system (SCRS) trained on the Czech telephone database Speechdat(E) for use in a selected noisy environment. The SCRS is based on hidden Markov models of context-dependent phones (triphones) and mel-frequency cepstral coefficient (MFCC) analysis of speech. The main aim is to analyze and search for the optimal settings of the SCRS with respect to additive noise robustness, without the use of additional techniques for additive noise reduction. The analysis focuses on the appropriate setting of the MFCC computation, the silence model adjustment, and grammar selection possibilities. It is shown that the correct performance of the SCRS strictly depends on an appropriate adjustment of the silence model. The feasibility of silence model adaptation is confirmed. When the SNR is higher than 15 dB, suitable performance of the SCRS can be guaranteed without any modification of the triphone speech models by (1) the optimal setting of the MFCC computation and (2) proper silence model adaptation. The assumption that a speech command recognition system is used in an environment where the SNR is higher than 15 dB is fulfilled in many applications. [en]
dc.format: text [cs]
dc.format.extent: 1-7 [cs]
dc.format.mimetype: application/pdf [en]
dc.identifier.citation: Radioengineering. 2004, vol. 13, č. 1, s. 1-7. ISSN 1210-2512 [cs]
dc.identifier.issn: 1210-2512
dc.identifier.uri: http://hdl.handle.net/11012/58034
dc.language.iso: en [cs]
dc.publisher: Společnost pro radioelektronické inženýrství [cs]
dc.relation.ispartof: Radioengineering [cs]
dc.relation.uri: http://www.radioeng.cz/fulltexts/2004/04_01_01_07.pdf [cs]
dc.rights: Creative Commons Attribution 3.0 Unported License [en]
dc.rights.access: openAccess [en]
dc.rights.uri: http://creativecommons.org/licenses/by/3.0/ [en]
dc.subject: Robust speech recognition [en]
dc.subject: Mel-cepstral analysis [en]
dc.subject: silence model adaptation [en]
dc.subject: parallel model combination [en]
dc.title: Analysis and Optimization of Telephone Speech Command Recognition System Performance in Noisy Environment [en]
dc.type.driver: article [en]
dc.type.status: Peer-reviewed [en]
dc.type.version: publishedVersion [en]
eprints.affiliatedInstitution.faculty: Fakulta elektrotechniky a komunikačních technologií [cs]
Files
Original bundle
Name: 04_01_01_07.pdf
Size: 479.91 KB
Format: Adobe Portable Document Format