PointUR-RL: Unified Self-Supervised Learning Method Based on Variable Masked Autoencoder for Point Cloud Reconstruction and Representation Learning
-
Published:2024-08-19
Issue:16
Volume:16
Page:3045
-
ISSN:2072-4292
-
Container-title:Remote Sensing
-
language:en
-
Short-container-title:Remote Sensing
Author:
Li Kang1ORCID, Zhu Qiuquan1, Wang Haoyu1, Wang Shibo1, Tian He1, Zhou Ping2, Cao Xin1
Affiliation:
1. School of Information Science and Technology, Northwest University, Xi’an 710127, China 2. Emperor Qin Shihuang’s Mausoleum Site Museum, Key Scientific Research Base of Ancient Polychrome Pottery Conservation, Xi’an 710600, China
Abstract
Self-supervised learning has made significant progress in point cloud processing. Currently, the primary tasks of self-supervised learning, which include point cloud reconstruction and representation learning, are trained separately due to their structural differences. This separation inevitably leads to increased training costs and neglects the potential for mutual assistance between tasks. In this paper, a self-supervised method named PointUR-RL is introduced, which integrates point cloud reconstruction and representation learning. The method features two key components: a variable masked autoencoder (VMAE) and contrastive learning (CL). The VMAE is capable of processing input point cloud blocks with varying masking ratios, ensuring seamless adaptation to both tasks. Furthermore, CL is utilized to enhance the representation learning capabilities and improve the separability of the learned representations. Experimental results confirm the effectiveness of the method in training and its strong generalization ability for downstream tasks. Notably, high-accuracy classification and high-quality reconstruction have been achieved with the public datasets ModelNet and ShapeNet, with competitive results also obtained with the ScanObjectNN real-world dataset.
Funder
Key Research and Development Program of Shaanxi Province National Natural Science Foundation of China
Reference45 articles.
1. Qi, C.R., Su, H., Mo, K., and Guibas, L.J. (2017, January 21–26). Pointnet: Deep learning on point sets for 3d classification and segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA. 2. Deep learning for 3d point clouds: A survey;Guo;IEEE Trans. Pattern Anal. Mach. Intell.,2020 3. Zhang, R., Tan, J., Cao, Z., Xu, L., Liu, Y., Si, L., and Sun, F. (IEEE Trans. Multimed., 2024). Part-Aware Correlation Networks for Few-shot Learning, IEEE Trans. Multimed., Early Access. 4. Self-supervised learning: Generative or contrastive;Liu;IEEE Trans. Knowl. Data Eng.,2021 5. Generative adversarial networks: An overview;Creswell;IEEE Signal Process. Mag.,2018
|
|