An Underwater Multi-Label Classification Algorithm Based on a Bilayer Graph Convolution Learning Network with Constrained Codec-Reference-Cited by-同舟云学术

An Underwater Multi-Label Classification Algorithm Based on a Bilayer Graph Convolution Learning Network with Constrained Codec

Published:2024-08-07 Issue:16 Volume:13 Page:3134
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Li Yun¹,Wang Su²,Mo Jiawei¹,Wei Xin¹

Affiliation:

1. School of Information Science and Engineering, Liuzhou Institute of Technology, Liuzhou 545000, China

2. Yangzhou Branch, China Mobile Communications Group Jiangsu Co., Ltd., Yangzhou 225000, China

Abstract

Within the domain of multi-label classification for micro-videos, utilizing terrestrial datasets as a foundation, researchers have embarked on profound endeavors yielding extraordinary accomplishments. The research into multi-label classification based on underwater micro-video datasets is still in the preliminary stage. There are some challenges: the severe color distortion and visual blurring in underwater visual imaging due to water molecular scattering and absorption, the difficulty in acquiring underwater short video datasets, the sparsity of underwater short video modality features, and the formidable task of achieving high-precision underwater multi-label classification. To address these issues, a bilayer graph convolution learning network based on constrained codec (BGCLN) is established in this paper. Specifically, modality-common representation is constructed to complete the representation of common information and specific information based on the constrained codec network. Then, the attention-driven double-layer graph convolutional network module is designed to mine the correlation information between labels and enhance the modality representation. Finally, the combined modality representation fusion and multi-label classification module are used to obtain the category classifier prediction. In the underwater video multi-label classification dataset (UVMCD), the effectiveness and high classification accuracy of the proposed BGCLN have been proved by numerous experiments.

Funder

the National Natural Science Foundation of China

the Intelligent Gateway for Data Exchange in the Lijiang River Basin

the Beidou Navigation System with the Water Network

Publisher

MDPI AG

Link

https://www.mdpi.com/2079-9292/13/16/3134/pdf

Reference34 articles.

1. Nie, L., Wang, X., Zhang, J., He, X., Zhang, H., Hong, R., and Tian, Q. (2017, January 23–27). Enhancing micro-video understanding by harnessing external sounds. Proceedings of the ACM International Conference on Multimedia, Mountain View, CA, USA.

2. Semantic pooling for complex event analysis in untrimmed videos;Chang;IEEE Trans. Pattern Anal. Mach. Intell.,2016

3. Chen, J., Song, X., Nie, L., Wang, X., Zhang, H., and Chua, T.S. (2016, January 15–19). Micro tells macro: Predicting the popularity of microvideos via a transductive model. Proceedings of the ACM International Conference on Multimedia, Amsterdam, The Netherlands.

4. Wei, Y., Wang, X., Nie, L., He, X., Hong, R., and Chua, T.S. (2019, January 21–25). MMGCN: Multi-modal graph convolution network for personalized recommendation of micro-video. Proceedings of the ACM Multimedia, Nice, France.

5. Underwater Image Enhancement Network Based on Multi-channel Hybrid Attention Mechanism;Li;J. Electron. Inf. Technol.,2017