Affiliation:
1. University of Trento, Trento, Italy
Abstract
Event recognition is one of the areas in multimedia that is attracting great attention from researchers. Because it applies to a wide range of settings, from personal to collective events, a number of interesting solutions for event recognition using multimedia information sources have been proposed. Meanwhile, following its immense success in classification, object recognition, and detection, deep learning has also been shown to perform well in event recognition tasks. Thus, a large portion of the literature on event analysis now relies on deep learning architectures. In this article, we provide an extensive overview of the existing literature in this field, analyzing how deep features and deep learning architectures have changed the performance of event recognition frameworks. The literature on event-based analysis of multimedia content can be categorized into four groups, namely (i) event recognition in single images; (ii) event recognition in personal photo collections; (iii) event recognition in videos; and (iv) event recognition in audio recordings. We extensively review different deep-learning-based frameworks for event recognition in these four domains. We also review some benchmark datasets made available to the scientific community to validate novel event recognition pipelines. In the final part of the manuscript, we discuss in detail the basic insights gathered from the literature review and identify future trends and challenges.
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Networks and Communications, Hardware and Architecture
Cited by
42 articles.
1. Proposal of a CNN-based Method for Predicting the Number of Clicks on Micro-event Flyer Images;IEEJ Transactions on Electronics, Information and Systems;2024-09-01
2. WaRENet: A Novel Urban Waterlogging Risk Evaluation Network;ACM Transactions on Multimedia Computing, Communications, and Applications;2024-05-16
3. Incomplete Multiview Clustering via Semidiscrete Optimal Transport for Multimedia Data Mining in IoT;ACM Transactions on Multimedia Computing, Communications, and Applications;2023-09-26
4. Performance Evaluation of CNN Models in Urban Acoustic Event Recognition Through MFCC Hyperparameter Search;2023 Congress in Computer Science, Computer Engineering, & Applied Computing (CSCE);2023-07-24
5. Explainable event recognition;Multimedia Tools and Applications;2023-03-30