PointUR-RL: Unified Self-Supervised Learning Method Based on Variable Masked Autoencoder for Point Cloud Reconstruction and Representation Learning-Reference-Cited by-同舟云学术

PointUR-RL: Unified Self-Supervised Learning Method Based on Variable Masked Autoencoder for Point Cloud Reconstruction and Representation Learning

Published:2024-08-19 Issue:16 Volume:16 Page:3045
ISSN:2072-4292
Container-title:Remote Sensing
language:en
Short-container-title:Remote Sensing

Author:

Li Kang¹^ORCID,Zhu Qiuquan¹,Wang Haoyu¹,Wang Shibo¹,Tian He¹,Zhou Ping²,Cao Xin¹

Affiliation:

1. School of Information Science and Technology, Northwest University, Xi’an 710127, China

2. Emperor Qin Shihuang’s Mausoleum Site Museum, Key Scientific Research Base of Ancient Polychrome Pottery Conservation, Xi’an 710600, China

Abstract

Self-supervised learning has made significant progress in point cloud processing. Currently, the primary tasks of self-supervised learning, which include point cloud reconstruction and representation learning, are trained separately due to their structural differences. This separation inevitably leads to increased training costs and neglects the potential for mutual assistance between tasks. In this paper, a self-supervised method named PointUR-RL is introduced, which integrates point cloud reconstruction and representation learning. The method features two key components: a variable masked autoencoder (VMAE) and contrastive learning (CL). The VMAE is capable of processing input point cloud blocks with varying masking ratios, ensuring seamless adaptation to both tasks. Furthermore, CL is utilized to enhance the representation learning capabilities and improve the separability of the learned representations. Experimental results confirm the effectiveness of the method in training and its strong generalization ability for downstream tasks. Notably, high-accuracy classification and high-quality reconstruction have been achieved with the public datasets ModelNet and ShapeNet, with competitive results also obtained with the ScanObjectNN real-world dataset.

Funder

Key Research and Development Program of Shaanxi Province

National Natural Science Foundation of China

Publisher

MDPI AG

Link

https://www.mdpi.com/2072-4292/16/16/3045/pdf

Reference45 articles.

1. Qi, C.R., Su, H., Mo, K., and Guibas, L.J. (2017, January 21–26). Pointnet: Deep learning on point sets for 3d classification and segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.

2. Deep learning for 3d point clouds: A survey;Guo;IEEE Trans. Pattern Anal. Mach. Intell.,2020

3. Zhang, R., Tan, J., Cao, Z., Xu, L., Liu, Y., Si, L., and Sun, F. (IEEE Trans. Multimed., 2024). Part-Aware Correlation Networks for Few-shot Learning, IEEE Trans. Multimed., Early Access.

4. Self-supervised learning: Generative or contrastive;Liu;IEEE Trans. Knowl. Data Eng.,2021

5. Generative adversarial networks: An overview;Creswell;IEEE Signal Process. Mag.,2018