Skip to feedback

Award Abstract # 1750499
CAREER: Scalable Learning and Models for Mapping Instructions to Actions

NSF Org: IIS
Division of Information & Intelligent Systems
Recipient: CORNELL UNIVERSITY
Initial Amendment Date: February 12, 2018
Latest Amendment Date: April 9, 2024
Award Number: 1750499
Award Instrument: Continuing Grant
Program Manager: Tatiana Korelsky
IIS
 Division of Information & Intelligent Systems
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: June 1, 2018
End Date: May 31, 2025 (Estimated)
Total Intended Award Amount: $551,329.00
Total Awarded Amount to Date: $601,329.00
Funds Obligated to Date: FY 2018 = $120,853.00
FY 2019 = $117,435.00

FY 2020 = $107,427.00

FY 2021 = $119,716.00

FY 2022 = $117,898.00

FY 2023 = $8,000.00

FY 2024 = $10,000.00
History of Investigator:
  • Yoav Artzi (Principal Investigator)
    yoav@cs.cornell.edu
Recipient Sponsored Research Office: Cornell University
341 PINE TREE RD
ITHACA
NY  US  14850-2820
(607)255-5014
Sponsor Congressional District: 19
Primary Place of Performance: Cornell University
2 West Loop Road
New York
NY  US  10044-0052
Primary Place of Performance
Congressional District:
12
Unique Entity Identifier (UEI): G56PUALJ3KT5
Parent UEI:
NSF Program(s): Robust Intelligence
Primary Program Source: 01001819DB NSF RESEARCH & RELATED ACTIVIT
01001920DB NSF RESEARCH & RELATED ACTIVIT

01002021DB NSF RESEARCH & RELATED ACTIVIT

01002122DB NSF RESEARCH & RELATED ACTIVIT

01002223DB NSF RESEARCH & RELATED ACTIVIT

01002324DB NSF RESEARCH & RELATED ACTIVIT

01002425DB NSF RESEARCH & RELATED ACTIVIT

01001819DB NSF RESEARCH & RELATED ACTIVIT

01001920DB NSF RESEARCH & RELATED ACTIVIT

01002021DB NSF RESEARCH & RELATED ACTIVIT

01002122DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 1045, 7495, 9251
Program Element Code(s): 749500
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

Robust language understanding has the potential to dramatically improve the quality and accessibility of autonomous systems operating in complex environments. Already today such systems are becoming increasingly common, including self-driving cars, drones, and robots surveying disaster areas. Natural language interfaces open new opportunities for non-expert users to control complex systems and increase the accessibility of current systems. However, existing methods are limited in expressivity and, more often than not, disappoint users. This Faculty Early Career Development Grant will fundamentally transform how this problem is addressed, and provide new avenues to build systems with robust natural language understanding and ability to improve and learn through interaction with users. The project's five-year goal of grounded language understanding directly connects to robotic agents and autonomous cars, and will enable new interdisciplinary applications and research directions.

The goal of the research program is to create a new framework for mapping natural language instructions to actions. Instead of taking a modular approach, this work adopts a single-model view, where input text and raw visual observations are directly mapped to actions. While the approach includes components that can be trained and re-used separately, it does not require any intermediate symbolic representation, and does away with the need for different types of training data, as required to train conventional modular systems. The five-year goal of this project is a continuously learning reflective autonomous agent following natural language instructions in realistic environments. The research will address learning from sparse natural signals, reasoning about complex sequences of instructions, learning continuously from users, and developing interpretable models.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

(Showing: 1 - 10 of 18)
Suhr, Alane and Artzi, Yoav "Continual Learning for Instruction Following from Realtime Feedback" Neural Information Processing Systems , 2023 Citation Details
Anne Wu and Kianté Brantley and Noriyuki Kojima and Yoav Artzi "lilGym: Natural Language Visual Reasoning with Reinforcement Learning" Proceedings of the 61th Annual Meeting of the Association for Computational Linguistics , 2023 Citation Details
Anya Ji and Noriyuki Kojima and Noah Rush and Alane Suhr and Wai Keen Vong and Robert D. Hawkins and Yoav Artzi "Abstract Visual Reasoning with Tangram Shapes" Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP) , 2022 Citation Details
Blukis, Valts and Misra, Dipendra and Knepper, Ross A. and Artzi, Yoav "Mapping Navigation Instructions to Continuous Control Actions with Position-Visitation Prediction" Proceedings of The 2nd Conference on Robot Learning , 2018 Citation Details
Blukis, Valts and Paxton, Chris and Fox, Dieter and Garg, Animesh and Artzi, Yoav "A Persistent Spatial Semantic Representation for High-level Natural Language Instruction Execution" In Proceedings of the Conference on Robot Learning (CoRL) , 2021 Citation Details
Blukis, Valts and Terme, Yannick and Niklasson, Eyvind and Knepper, Ross A and Artzi, Yoav "Learning to Map Natural Language Instructions to Physical Quadcopter Control Using Simulated Flight" Proceedings of the Conference on Robot Learning (CoRL) , 2019 Citation Details
Chen, Howard and Suhr, Alane and Misra, Dipendra and Snavely, Noah and Artzi, Yoav "TOUCHDOWN: Natural Language Navigation and Spatial Reasoning in Visual Street Environments" 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) , 2019 10.1109/CVPR.2019.01282 Citation Details
Cui, Yuqin and Khandelwal, Apoorv and Artzi, Yoav and Snavely, Noah and Averbuch-Elor, Hadar "Who's Waldo? Linking People Across Text and Images" International Conference on Computer Vision (ICCV) , 2021 Citation Details
Effenberger, Anna and Singh, Rhia and Yan, Eva and Suhr, Alane and Artzi, Yoav "Analysis of Language Change in Collaborative Instruction Following" Findings of the Association for Computational Linguistics: EMNLP 2021 , 2021 https://doi.org/10.18653/v1/2021.findings-emnlp.239 Citation Details
Gao, Ge and Chen, Hung-Ting and Artzi, Yoav and Choi, Eunsol "Continually Improving Extractive QA via Human Feedback" Conference on Empirical Methods in Natural Language Processing , 2023 https://doi.org/10.18653/v1/2023.emnlp-main.27 Citation Details
Gardner, Matt and Artzi, Yoav and Basmov, Victoria and Berant, Jonathan and Bogin, Ben and Chen, Sihao and Dasigi, Pradeep and Dua, Dheeru and Elazar, Yanai and Gottumukkala, Ananth and Gupta, Nitish and Hajishirzi, Hannaneh and Ilharco, Gabriel and Khash "Evaluating Models Local Decision Boundaries via Contrast Sets" Findings of Empirical Methods in Natural Language Processing , 2020 https://doi.org/10.18653/v1/2020.findings-emnlp.117 Citation Details
(Showing: 1 - 10 of 18)

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page