Award Abstract # 2147292
FAI: A Human-Centered Approach to Developing Accessible and Reliable Machine Translation

NSF Org: IIS
Division of Information & Intelligent Systems
Recipient: UNIVERSITY OF MARYLAND, COLLEGE PARK
Initial Amendment Date: February 16, 2022
Latest Amendment Date: February 16, 2022
Award Number: 2147292
Award Instrument: Standard Grant
Program Manager: Tatiana Korelsky
IIS
 Division of Information & Intelligent Systems
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: March 1, 2022
End Date: February 28, 2025 (Estimated)
Total Intended Award Amount: $392,993.00
Total Awarded Amount to Date: $392,993.00
Funds Obligated to Date: FY 2022 = $392,993.00
History of Investigator:
  • Marine Carpuat (Principal Investigator)
    marine@cs.umd.edu
  • Niloufar Salehi (Co-Principal Investigator)
  • Ge Gao (Co-Principal Investigator)
Recipient Sponsored Research Office: University of Maryland, College Park
3112 LEE BUILDING
COLLEGE PARK
MD  US  20742-5100
(301)405-6269
Sponsor Congressional District: 04
Primary Place of Performance: University of Maryland, College Park
3112 LEE BLDG 7809 Regents Drive
College Park
MD  US  20742-1800
Primary Place of Performance
Congressional District:
04
Unique Entity Identifier (UEI): NPU8ULVAAS23
Parent UEI: NPU8ULVAAS23
NSF Program(s): Fairness in Artificial Intelli
Primary Program Source: 01002223DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 075Z
Program Element Code(s): 114Y00
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

This Fairness in AI project aims to develop technology to reliably enhance cross-lingual communication in high-stakes contexts, such as when a person needs to communicate with someone who does not speak their language to get health care advice or apply for a job. While machine translation technology is frequently used in these conditions, existing systems often make errors that can have severe consequences for a patient or a job applicant. Further, it is challenging for people to know when automatic translations might be wrong when they do not understand the source or target language for translation. This project addresses this issue by developing accessible and reliable machine translation for lay users. It will provide mechanisms to guide users to recognize and recover from translation errors, and help them make better decisions given imperfect translations. As a result, more people will be able to use machine translation reliably to communicate across language barriers, which can have far-reaching positive consequences on their lives.

Specifically, this project contributes advances in natural language processing and interaction design for a bot that can be added to any text-based conversation, where it can play a role similar to an interpreter. The bot will guide users to write appropriate inputs for machine translation, help users understand outputs, and intervene when it detects miscommunication and conversational breakdowns. The design of the bot will follow a human-centered design process, consisting of need-finding studies, iterative system development and deployment, and user evaluations via controlled experiments. On the back-end, the bot will rely on quality estimation models that automatically detect translation errors to produce useful guidance for end-users. The data, models, and design recommendations generated by this project will advance computational research in multiple ways. It will lead to new machine translation quality estimation techniques that take into account the impact of errors on end-users; it will expand the scope of explainable artificial intelligence research to encompass the considerable risks and harms caused by language generation tools, and it will generate new interface design that assists lay users' sense making of artificial intelligence systems.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Agrawal, Sweta and Carpuat, Marine "Controlling Pre-trained Language Models for Grade-Specific Text Simplification" , 2023 https://doi.org/10.18653/v1/2023.emnlp-main.790 Citation Details
Agrawal, Sweta and Carpuat, Marine "Do Text Simplification Systems Preserve Meaning? A Human Evaluation via Reading Comprehension" Transactions of the Association for Computational Linguistics , v.12 , 2024 https://doi.org/10.1162/tacl_a_00653 Citation Details
Briakou, Eleftheria and Goyal, Navita and Carpuat, Marine "Explaining with Contrastive Phrasal Highlighting: A Case Study in Assisting Humans to Detect Translation Differences" Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing , 2023 https://doi.org/10.18653/v1/2023.emnlp-main.690 Citation Details
Gao, Ge and Zheng, Jian and Choe, Eun Kyoung and Yamashita, Naomi "Taking a Language Detour: How International Migrants Speaking a Minority Language Seek COVID-Related Information in Their Host Countries" Proceedings of the ACM on Human-Computer Interaction , v.6 , 2022 https://doi.org/10.1145/3555600 Citation Details
Han, HyoJung and Boyd-Graber, Jordan and Carpuat, Marine "Bridging Background Knowledge Gaps in Translation with Automatic Explicitation" Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing , 2023 https://doi.org/10.18653/v1/2023.emnlp-main.603 Citation Details
Ki, Dayeon and Carpuat, Marine "Automatic Input Rewriting Improves Translation with Large Language Models" , 2025 https://doi.org/10.18653/v1/2025.naacl-long.542 Citation Details
Ki, Dayeon and Carpuat, Marine "Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations" , 2024 https://doi.org/10.18653/v1/2024.findings-naacl.265 Citation Details
Mehandru, Nikita and Agrawal, Sweta and Xiao, Yimin and Gao, Ge and Khoong, Elaine and Carpuat, Marine and Salehi, Niloufar "Physician Detection of Clinical Harm in Machine Translation: Quality Estimation Aids in Reliance and Backtranslation Identifies Critical Errors" Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing , 2023 https://doi.org/10.18653/v1/2023.emnlp-main.712 Citation Details
Sweta Agrawal, Nikita Mehandru "Quality Estimation via Backtranslation at the WMT 2022 Quality Estimation Task" Proceedings of the Seventh Conference on Machine Translation (WMT) , 2022 Citation Details
Xiao, Yimin "Designing AI-Based Language Tools for Non-Native Speakers' Language Use and Development" , 2025 https://doi.org/10.1145/3706599.3721094 Citation Details

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page