A BIC Based Initial Training Set Selection Algorithm for Active Learning and Its Application in Audio Detection

dc.contributor.authorLeng, Yan
dc.contributor.authorQi, Guang-hui
dc.contributor.authorXu, Xin-yan
dc.coverage.issue2cs
dc.coverage.volume22cs
dc.date.accessioned2015-01-21T09:56:45Z
dc.date.available2015-01-21T09:56:45Z
dc.date.issued2013-06cs
dc.description.abstractTo construct a classification system or a detection system, large amounts of labeled samples are needed. However, manual labeling is dull and time consuming, so researchers have proposed the active learning technology. The initial training set selection is the first step of an active learning process, but currently there have been few studies on it. Most active learning algorithms adopt random sampling or algorithms like sampling by clustering (SBC) to select the initial training samples. But these two kinds of method would lose their effectiveness in detecting events of small probability. Because sometimes they could not select or select too few samples of the small probability events. To solve this problem, this paper proposes a BIC based initial training set selection algorithm. The BIC based algorithm performs clustering on the whole training set first. Then uses BIC to judge the status of clusters. Finally, it adopts different selection strategies for clusters of different status. Experimental results on two real data sets show that, compared to random sampling and SBC, the proposed BIC based initial training set selection algorithm can efficiently solve the detection problem of small probability events. In the mean time, it has obvious advantages in detecting events of non-small probability.en
dc.formattextcs
dc.format.extent638-649cs
dc.format.mimetypeapplication/pdfen
dc.identifier.citationRadioengineering. 2013, vol. 22, č. 2, s. 638-649. ISSN 1210-2512cs
dc.identifier.issn1210-2512
dc.identifier.urihttp://hdl.handle.net/11012/36896
dc.language.isoencs
dc.publisherSpolečnost pro radioelektronické inženýrstvícs
dc.relation.ispartofRadioengineeringcs
dc.relation.urihttp://www.radioeng.cz/fulltexts/2013/13_02_0638_0649.pdfcs
dc.rightsCreative Commons Attribution 3.0 Unported Licenseen
dc.rights.accessopenAccessen
dc.rights.urihttp://creativecommons.org/licenses/by/3.0/en
dc.subjectInitial training set selectionen
dc.subjectactive learningen
dc.subjectBICen
dc.subjectsubspace sample selectionen
dc.subjectaudio detectionen
dc.titleA BIC Based Initial Training Set Selection Algorithm for Active Learning and Its Application in Audio Detectionen
dc.type.driverarticleen
dc.type.statusPeer-revieweden
dc.type.versionpublishedVersionen
eprints.affiliatedInstitution.facultyFakulta eletrotechniky a komunikačních technologiícs
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
13_02_0638_0649.pdf
Size:
3.37 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description:
Collections