Segmentation of Speech and Humming in Vocal Input

dc.contributor.authorSporka, Adam J.
dc.contributor.authorPolacek, Ondrej
dc.contributor.authorHavlik, Jan
dc.coverage.issue3cs
dc.coverage.volume21cs
dc.date.accessioned2015-01-26T07:07:52Z
dc.date.available2015-01-26T07:07:52Z
dc.date.issued2012-09cs
dc.description.abstractNon-verbal vocal interaction (NVVI) is an interaction method in which sounds other than speech produced by a human are used, such as humming. NVVI complements traditional speech recognition systems with continuous control. In order to combine the two approaches (e.g. "volume up, mmm") it is necessary to perform a speech/NVVI segmentation of the input sound signal. This paper presents two novel methods of speech and humming segmentation. The first method is based on classification of MFCC and RMS parameters using a neural network (MFCC method), while the other method computes volume changes in the signal (IAC method). The two methods are compared using a corpus collected from 13 speakers. The results indicate that the MFCC method outperforms IAC in terms of accuracy, precision, and recall.en
dc.formattextcs
dc.format.extent923-929cs
dc.format.mimetypeapplication/pdfen
dc.identifier.citationRadioengineering. 2012, vol. 21, č. 3, s. 923-929. ISSN 1210-2512cs
dc.identifier.issn1210-2512cs
dc.identifier.urihttp://hdl.handle.net/11012/37193
dc.language.isoencs
dc.publisherSpolečnost pro radioelektronické inženýrstvícs
dc.relation.ispartofRadioengineeringcs
dc.relation.urihttp://www.radioeng.cz/fulltexts/2012/12_03_0923_0929.pdfcs
dc.rightsCreative Commons Attribution 3.0 Unported Licenseen
dc.rights.accessopenAccessen
dc.rights.urihttp://creativecommons.org/licenses/by/3.0/en
dc.subjectNon-verbal vocal interactionen
dc.subjectSpeechen
dc.subjectMFCCen
dc.subjectNeural networken
dc.subjectSegmentationen
dc.subjectMulti-layer perceptronen
dc.titleSegmentation of Speech and Humming in Vocal Inputen
dc.type.driverarticleen
dc.type.statusPeer-revieweden
dc.type.versionpublishedVersionen
eprints.affiliatedInstitution.facultyFakulta eletrotechniky a komunikačních technologiícs
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
12_03_0923_0929.pdf
Size:
310.99 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description:
Collections