Motion-guided spatiotemporal multitask feature discrimination for self-supervised video representation learning
-
Published:2024-11
Issue:
Volume:155
Page:110713
-
ISSN:0031-3203
-
Container-title:Pattern Recognition
-
language:en
-
Short-container-title:Pattern Recognition
Author:
Bi Shuai,
Hu ZhengpingORCID,
Zhang Hehao,
Di Jirui,
Sun Zhe
Reference40 articles.
1. Global-and local-aware feature augmentation with semantic orthogonality for few-shot image classification;Shi;Pattern Recognit.,2023
2. Video representation learning for temporal action detection using global-local attention;Tang;Pattern Recognit.,2023
3. J. Deng, W. Dong, R. Socher, et al., Imagenet: A large-scale hierarchical image database, in: IEEE Conference on Computer Vision and Pattern Recognition, CVPR, 2009, pp. 248–255.
4. J. Carreira, A. Zisserman, Quo vadis, action recognition? a new model and the kinetics dataset, in: IEEE Conference on Computer Vision and Pattern Recognition, CVPR, 2017, pp. 6299–6308.
5. M. Noroozi, P. Favaro, Unsupervised learning of visual representations by solving jigsaw puzzles, in: European Conference on Computer Vision, ECCV, 2016, pp. 69–84.