MAP Based Speaker Adaptation in Very Large Vocabulary Speech Recognition of Czech

Cerva, Petr; Nouza, Jan

MAP Based Speaker Adaptation in Very Large Vocabulary Speech Recognition of Czech

dc.contributor.author	Cerva, Petr
dc.contributor.author	Nouza, Jan
dc.coverage.issue	3	cs
dc.coverage.volume	13	cs
dc.date.accessioned	2016-04-26T11:04:13Z
dc.date.available	2016-04-26T11:04:13Z
dc.date.issued	2004-09	cs
dc.description.abstract	The paper deals with the problem of efficient adaptation of speech recognition systems to individual users. The goal is to achieve better performance in specific applications where one known speaker is expected. In our approach we adopt the MAP (Maximum A Posteriori) method for this purpose. The MAP based formulae for the adaptation of the HMM (Hidden Markov Model) parameters are described. Several alternative versions of this method have been implemented and experimentally verified in two areas, first in the isolated-word recognition (IWR) task and later also in the large vocabulary continuous speech recognition (LVCSR) system, both developed for the Czech language. The results show that the word error rate (WER) can be reduced by more than 20% for a speaker who provides tens of words (in case of IWR) or tens of sentences (in case of LVCSR) for the adaptation. Recently, we have used the described methods in the design of two practical applications: voice dictation to a PC and automatic transcription of radio and TV news.	en
dc.format	text	cs
dc.format.extent	42-46	cs
dc.format.mimetype	application/pdf	en
dc.identifier.citation	Radioengineering. 2004, vol. 13, č. 3, s. 42-46. ISSN 1210-2512	cs
dc.identifier.issn	1210-2512
dc.identifier.uri	http://hdl.handle.net/11012/58059
dc.language.iso	en	cs
dc.publisher	Společnost pro radioelektronické inženýrství	cs
dc.relation.ispartof	Radioengineering	cs
dc.relation.uri	http://www.radioeng.cz/fulltexts/2004/04_03_42_46.pdf	cs
dc.rights	Creative Commons Attribution 3.0 Unported License	en
dc.rights.access	openAccess	en
dc.rights.uri	http://creativecommons.org/licenses/by/3.0/	en
dc.subject	Speech recognition	en
dc.subject	speaker adaptation	en
dc.subject	maximum a posteriori method	en
dc.subject	hidden Markov models	en
dc.title	MAP Based Speaker Adaptation in Very Large Vocabulary Speech Recognition of Czech	en
dc.type.driver	article	en
dc.type.status	Peer-reviewed	en
dc.type.version	publishedVersion	en
eprints.affiliatedInstitution.faculty	Fakulta eletrotechniky a komunikačních technologií	cs

Files

Original bundle

Now showing 1 - 1 of 1

Name:: 04_03_42_46.pdf
Size:: 418.47 KB
Format:: Adobe Portable Document Format
Description:

Download

Collections

2004/3