LaANIL: ANIL with Look-Ahead Meta-Optimization and Data Parallelism-Reference-Cited by-同舟云学术

LaANIL: ANIL with Look-Ahead Meta-Optimization and Data Parallelism

Published:2024-04-22 Issue:8 Volume:13 Page:1585
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Tammisetti Vasu¹²^ORCID,Bierzynski Kay¹,Stettinger Georg¹,Morales-Santos Diego P.²^ORCID,Cuellar Manuel Pegalajar²^ORCID,Molina-Solana Miguel²^ORCID

Affiliation:

1. Infineon Technologies AG, 85579 Munich, Germany

2. Department of Computer Science and Artificial Intelligence, University of Granada, 18071 Granada, Spain

Abstract

Meta-few-shot learning algorithms, such as Model-Agnostic Meta-Learning (MAML) and Almost No Inner Loop (ANIL), enable machines to learn complex tasks quickly with limited data and based on previous experience. By maintaining the inner loop head of the neural network, ANIL leads to simpler computations and reduces the complexity of MAML. Despite its benefits, ANIL suffers from issues like accuracy variance, slow initial learning, and overfitting, hardening its adaptation and generalization. This work proposes “Look-Ahead ANIL” (LaANIL), an enhancement to ANIL for better learning. LaANIL reorganizes ANIL’s internal architecture, integrating parallel computing techniques (to process multiple training examples simultaneously across computing units) and incorporating Nesterov momentum (which accelerates convergence by adjusting the learning rate based on past gradient information and extracting informative features for look-ahead gradient computation). These additional features make our model more state-of-the-art capable and better edge-compatible and thus improve few-short learning by enabling models to quickly adapt to new information and tasks. LaANIL’s effectiveness is validated on established meta-few-shot learning datasets, including FC100, CIFAR-FS, Mini-ImageNet, CUBirds-200-2011, and Tiered-ImageNet. The proposed model achieved an increased validation accuracy by 7 ± 0.7% and a variance reduction by 44 ± 4% in two-way two-shot classification as well as increased validation by 5 ± 0.4% and a variance reduction by 18 ± 2% in five-way five-shot classification on the FC100 dataset and similarly performed well on other datasets.

Funder

Infineon Technologies AG

Spanish Ministry of Economic Affairs and Digital Transformation

Publisher

MDPI AG

Link

https://www.mdpi.com/2079-9292/13/8/1585/pdf

Reference42 articles.

1. A literature survey and empirical study of meta-learning for classifier selection;Khan;IEEE Access,2020

2. Chen, Y., Liu, Z., Xu, H., Darrell, T., and Wang, X. (2021, January 11–17). Meta-Baseline: Exploring Simple Meta-Learning for Few-Shot Learning. Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, BC, Canada.

3. Finn, C., Abbeel, P., and Levine, S. (2017, January 6–11). Model-agnostic meta-learning for fast adaptation of deep networks. Proceedings of the International Conference on Machine Learning, PMLR, Sydney, NSW, Australia.

4. Raghu, A., Raghu, M., Bengio, S., and Vinyals, O. (2019). Rapid learning or feature reuse? Towards understanding the effectiveness of maml. arXiv.

5. Consistent meta-regularization for better meta-knowledge in few-shot learning;Tian;IEEE Trans. Neural Netw. Learn. Syst.,2021