Classification of Overlapped Audio Events Based on AT, PLSA, and the Combination of Them

Leng, Yan; Sun, Chengli; Cheng, Chuanfu; Xu, Xinyan; Li, Si; Wan, Honglin; Fang, Jing; Li, Dengwang

Classification of Overlapped Audio Events Based on AT, PLSA, and the Combination of Them

Files

15_02_0593_0603.pdf(317.52 KB)

Date

2015-06

Authors

Leng, Yan

Sun, Chengli

Cheng, Chuanfu

Xu, Xinyan

Li, Si

Wan, Honglin

Fang, Jing

Li, Dengwang

Publisher

Společnost pro radioelektronické inženýrství

Altmetrics

Abstract

Audio event classification, as an important part of Computational Auditory Scene Analysis, has attracted much attention. Currently, the classification technology is mature enough to classify isolated audio events accurately, but for overlapped audio events, it performs much worse. While in real life, most audio documents would have certain percentage of overlaps, and so the overlap classification problem is an important part of audio classification. Nowadays, the work on overlapped audio event classification is still scarce, and most existing overlap classification systems can only recognize one audio event for an overlap. In this paper, in order to deal with overlaps, we innovatively introduce the author-topic (AT) model which was first proposed for text analysis into audio classification, and innovatively combine it with PLSA (Probabilistic Latent Semantic Analysis). We propose 4 systems, i.e. AT, PLSA, AT-PLSA and PLSA-AT, to classify overlaps. The 4 proposed systems have the ability to recognize two or more audio events for an overlap. The experimental results show that the 4 systems perform well in classifying overlapped audio events, whether it is the overlap in training set or the overlap out of training set. Also they perform well in classifying isolated audio events.

Keywords

Audio event classification, author-topic model, PLSA, overlapped audio event, isolated audio event

Citation

Radioengineering. 2015 vol. 24, č. 2, s. 593-603. ISSN 1210-2512
http://www.radioeng.cz/fulltexts/2015/15_02_0593_0603.pdf

Document type

Peer-reviewed

Document version

Published version

Language of document

en

Document licence

Creative Commons Attribution 3.0 Unported License
http://creativecommons.org/licenses/by/3.0/