1. Evaluating the reliability of acoustic speech embeddings;Algayres,2020
2. Common voice: A massively-multilingual speech corpus;Ardila,2020
3. Data2vec: A general framework for self-supervised learning in speech, vision and language;Baevski,2022
4. Wav2vec 2.0: A framework for self-supervised learning of speech representations;Baevski,2020
5. Bastianelli, E., Vanzo, A., Swietojanski, P., Rieser, V., 2020. SLURP: A spoken language understanding resource package, EMNLP 2020.