LINGO : Visually Debiasing Natural Language Instructions to Support Task Diversity-Reference-Cited by-同舟云学术

LINGO : Visually Debiasing Natural Language Instructions to Support Task Diversity

Published:2023-06 Issue:3 Volume:42 Page:409-421
ISSN:0167-7055
Container-title:Computer Graphics Forum
language:en
Short-container-title:Computer Graphics Forum

Author:

Arunkumar A.¹,Sharma S.¹,Agrawal R.¹,Chandrasekaran S.¹,Bryan C.¹

Affiliation:

1. Arizona State University Tempe United States

Abstract

AbstractCross‐task generalization is a significant outcome that defines mastery in natural language understanding. Humans show a remarkable aptitude for this, and can solve many different types of tasks, given definitions in the form of textual instructions and a small set of examples. Recent work with pre‐trained language models mimics this learning style: users can define and exemplify a task for the model to attempt as a series of natural language prompts or instructions. While prompting approaches have led to higher cross‐task generalization compared to traditional supervised learning, analyzing ‘bias’ in the task instructions given to the model is a difficult problem, and has thus been relatively unexplored. For instance, are we truly modeling a task, or are we modeling a user's instructions? To help investigate this, we develop LINGO, a novel visual analytics interface that supports an effective, task‐driven workflow to (1) help identify bias in natural language task instructions, (2) alter (or create) task instructions to reduce bias, and (3) evaluate pre‐trained model performance on debiased task instructions. To robustly evaluate LINGO, we conduct a user study with both novice and expert instruction creators, over a dataset of 1,616 linguistic tasks and their natural language instructions, spanning 55 different languages. For both user groups, LINGO promotes the creation of more difficult tasks for pre‐trained models, that contain higher linguistic diversity and lower instruction bias. We additionally discuss how the insights learned in developing and evaluating LINGO can aid in the design of future dashboards that aim to minimize the effort involved in prompt creation across multiple domains.

Funder

National Science Foundation

Publisher

Wiley

Subject

Computer Graphics and Computer-Aided Design

Link

https://onlinelibrary.wiley.com/doi/am-pdf/10.1111/cgf.14840

Reference66 articles.

1. AghajanyanA. GuptaA. ShrivastavaA. ChenX. ZettlemoyerL. GuptaS.: Muppet: Massive multi‐task representations with pre‐finetuning. InProceedings of the 2021 Conference on Empirical Methods in Natural Language Processing(2021) pp.5799–5811. 1

2. Beat the Machine

3. BowmanS. R. AngeliG. PottsC. ManningC. D.:The snli corpus. 2 7

4. Language models are few‐shot learners;Brown T.;Advances in neural information processing systems,2020

5. BachS. SanhV. YongZ. X. WebsonA. RaffelC. NayakN. V. SharmaA. KimT. BariM. S. FévryT. et al.: Promptsource: An integrated development environment and repository for natural language prompts. InProceedings of the 60th Annual Meeting of the Association for Computational Linguistics: System Demonstrations(2022) pp.93–104. 3