A Multi-class Method for Detecting Audio Events in News Broadcasts

Petridis, Sergios; Giannakopoulos, Theodoros; Perantonis, Stavros

doi:10.1007/978-3-642-12842-4_50

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 6040)

Included in the following conference series: Hellenic Conference on Artificial Intelligence

Abstract

We propose a method for detecting audio events in news video streams. Apart from speech, which is the dominant class in such content, the method detects five non-speech audio classes. The main difficulty of the task is that most non-speech audio events occur as background sounds, with speech as the primary sound. We adopt a set of 21 statistics computed on a mid-term basis over 7 audio features. A variation of the One-vs-All classification architecture is used, and each binary classification problem is modeled with a separate probabilistic Support Vector Machine. Experiments show that the proposed method can achieve high precision rates for most of the audio events of interest.
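The two ideas in the abstract, mid-term statistics over short-term features and a One-vs-All decision over per-class probabilities, can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not list the 7 features or the 21 statistics, so mean and standard deviation stand in here as assumed examples, and the rejection threshold is an assumed stand-in for the unspecified One-vs-All variation. The function names and the class labels in the usage note are hypothetical. In a full system, each probability would come from a separate probabilistic SVM.

```python
from statistics import mean, pstdev


def mid_term_statistics(frames, window, step):
    """Summarise short-term feature vectors over mid-term windows.

    frames: list of per-frame feature vectors (each a list of floats).
    Returns one vector per window holding the mean of every feature
    followed by the standard deviation of every feature. These two
    statistics are illustrative assumptions, not the paper's set.
    """
    if window <= 0 or step <= 0:
        raise ValueError("window and step must be positive")
    summaries = []
    for start in range(0, len(frames) - window + 1, step):
        segment = frames[start:start + window]
        columns = list(zip(*segment))  # one tuple of values per feature
        summaries.append(
            [mean(c) for c in columns] + [pstdev(c) for c in columns]
        )
    return summaries


def one_vs_all_decision(probabilities, threshold=0.5):
    """Choose the class whose binary classifier is most confident.

    probabilities: dict mapping class label -> probability estimated by
    that class's own binary classifier. Returns None when no classifier
    reaches the threshold (an assumed rejection rule).
    """
    label, p = max(probabilities.items(), key=lambda item: item[1])
    return label if p >= threshold else None
```

For example, `mid_term_statistics(frames, window=2, step=2)` turns four 2-dimensional frames into two 4-dimensional summary vectors, and `one_vs_all_decision({"speech": 0.9, "class_a": 0.4})` returns the label with the highest probability when that probability reaches the threshold.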

Author information

Authors and Affiliations

Computational Intelligence Laboratory, Institute of Informatics and Telecommunications, National Center of Scientific Research Demokritos
Sergios Petridis, Theodoros Giannakopoulos & Stavros Perantonis

Editor information

Editors and Affiliations

Institute of Informatics and Telecommunications, NCSR Demokritos, Ag. Paraskevi, 15310, Athens, Greece
Stasinos Konstantopoulos, Stavros Perantonis, Vangelis Karkaletsis & Constantine D. Spyropoulos
Department of Information and Communication Systems Engineering, University of the Aegean, 83200, Karlovassi, Samos, Greece
George Vouros

About this paper

Cite this paper

Petridis, S., Giannakopoulos, T., Perantonis, S. (2010). A Multi-class Method for Detecting Audio Events in News Broadcasts. In: Konstantopoulos, S., Perantonis, S., Karkaletsis, V., Spyropoulos, C.D., Vouros, G. (eds) Artificial Intelligence: Theories, Models and Applications. SETN 2010. Lecture Notes in Computer Science, vol 6040. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12842-4_50

DOI: https://doi.org/10.1007/978-3-642-12842-4_50
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12841-7
Online ISBN: 978-3-642-12842-4
eBook Packages: Computer Science, Computer Science (R0)
