LaANIL: ANIL with Look-Ahead Meta-Optimization and Data Parallelism
-
Published:2024-04-22
Issue:8
Volume:13
Page:1585
-
ISSN:2079-9292
-
Container-title:Electronics
-
language:en
-
Short-container-title:Electronics
Author:
Tammisetti Vasu12ORCID, Bierzynski Kay1, Stettinger Georg1, Morales-Santos Diego P.2ORCID, Cuellar Manuel Pegalajar2ORCID, Molina-Solana Miguel2ORCID
Affiliation:
1. Infineon Technologies AG, 85579 Munich, Germany 2. Department of Computer Science and Artificial Intelligence, University of Granada, 18071 Granada, Spain
Abstract
Meta-few-shot learning algorithms, such as Model-Agnostic Meta-Learning (MAML) and Almost No Inner Loop (ANIL), enable machines to learn complex tasks quickly with limited data and based on previous experience. By maintaining the inner loop head of the neural network, ANIL leads to simpler computations and reduces the complexity of MAML. Despite its benefits, ANIL suffers from issues like accuracy variance, slow initial learning, and overfitting, hardening its adaptation and generalization. This work proposes “Look-Ahead ANIL” (LaANIL), an enhancement to ANIL for better learning. LaANIL reorganizes ANIL’s internal architecture, integrating parallel computing techniques (to process multiple training examples simultaneously across computing units) and incorporating Nesterov momentum (which accelerates convergence by adjusting the learning rate based on past gradient information and extracting informative features for look-ahead gradient computation). These additional features make our model more state-of-the-art capable and better edge-compatible and thus improve few-short learning by enabling models to quickly adapt to new information and tasks. LaANIL’s effectiveness is validated on established meta-few-shot learning datasets, including FC100, CIFAR-FS, Mini-ImageNet, CUBirds-200-2011, and Tiered-ImageNet. The proposed model achieved an increased validation accuracy by 7 ± 0.7% and a variance reduction by 44 ± 4% in two-way two-shot classification as well as increased validation by 5 ± 0.4% and a variance reduction by 18 ± 2% in five-way five-shot classification on the FC100 dataset and similarly performed well on other datasets.
Funder
Infineon Technologies AG Spanish Ministry of Economic Affairs and Digital Transformation
Reference42 articles.
1. A literature survey and empirical study of meta-learning for classifier selection;Khan;IEEE Access,2020 2. Chen, Y., Liu, Z., Xu, H., Darrell, T., and Wang, X. (2021, January 11–17). Meta-Baseline: Exploring Simple Meta-Learning for Few-Shot Learning. Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, BC, Canada. 3. Finn, C., Abbeel, P., and Levine, S. (2017, January 6–11). Model-agnostic meta-learning for fast adaptation of deep networks. Proceedings of the International Conference on Machine Learning, PMLR, Sydney, NSW, Australia. 4. Raghu, A., Raghu, M., Bengio, S., and Vinyals, O. (2019). Rapid learning or feature reuse? Towards understanding the effectiveness of maml. arXiv. 5. Consistent meta-regularization for better meta-knowledge in few-shot learning;Tian;IEEE Trans. Neural Netw. Learn. Syst.,2021
|
|