Automatic Speech Segmentation Based on HMM
MetadataShow full item record
This contribution deals with the problem of automatic phoneme segmentation using HMMs. Automatization of speech segmentation task is important for applications, where large amount of data is needed to process, so manual segmentation is out of the question. In this paper we focus on automatic segmentation of recordings, which will be used for triphone synthesis unit database creation. For speech synthesis, the speech unit quality is a crucial aspect, so the maximal accuracy in segmentation is needed here. In this work, different kinds of HMMs with various parameters have been trained and their usefulness for automatic segmentation is discussed. At the end of this work, some segmentation accuracy tests of all models are presented.
KeywordsSpeech processing, automatic segmentation, speech database, HMM, monophones, triphones, alignment
Document typePeer reviewed
Document versionFinal PDF
SourceRadioengineering. 2007, vol. 16, č. 2, s. 56-61. ISSN 1210-2512
- 2007/2