NSF Award Search: Award # 1750082

Award Abstract # 1750082

CAREER: Visual Recognition with Knowledge

NSF Org:	IIS Division of Information & Intelligent Systems
Recipient:	ARIZONA STATE UNIVERSITY
Initial Amendment Date:	January 17, 2018
Latest Amendment Date:	July 28, 2022
Award Number:	1750082
Award Instrument:	Continuing Grant
Program Manager:	Jie Yang jyang@nsf.gov (703)292-4768 IIS Division of Information & Intelligent Systems CSE Directorate for Computer and Information Science and Engineering
Start Date:	August 15, 2018
End Date:	July 31, 2025 (Estimated)
Total Intended Award Amount:	$550,000.00
Total Awarded Amount to Date:	$550,000.00
Funds Obligated to Date:	FY 2018 = $101,523.00 FY 2019 = $105,850.00 FY 2020 = $110,133.00 FY 2021 = $114,654.00 FY 2022 = $117,840.00
History of Investigator:	Yezhou Yang (Principal Investigator) yz.yang@asu.edu
Recipient Sponsored Research Office:	Arizona State University 660 S MILL AVENUE STE 204 TEMPE AZ US 85281-3670 (480)965-5479
Sponsor Congressional District:	04
Primary Place of Performance:	Arizona State University PO Box 876011 Tempe AZ US 85281-6011
Primary Place of Performance Congressional District:	04
Unique Entity Identifier (UEI):	NTLHJXM55KZ6
Parent UEI:
NSF Program(s):	Robust Intelligence
Primary Program Source:	01001819DB NSF RESEARCH & RELATED ACTIVIT 01001920DB NSF RESEARCH & RELATED ACTIVIT 01002021DB NSF RESEARCH & RELATED ACTIVIT 01002122DB NSF RESEARCH & RELATED ACTIVIT 01002223DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s):	7495, 1045
Program Element Code(s):	749500
Award Agency Code:	4900
Fund Agency Code:	4900
Assistance Listing Number(s):	47.070

ABSTRACT

This project will address the problem of Visual Recognition with Knowledge (VR-K): a challenging Artificial Intelligence task to enable a seeing machine to identify unknown visible concepts from previous encounters (annotated data samples) and knowledge (other contextual information). For example, consider such a system that has never encountered a zebra, but which has previous visual encounters with "horses" and "black and white striped" patterns. Incorporating the linguistic input that, "A zebra is a horse-like animal with a black and white striped appearance", the machine's task is to formulate a new recognizer for the visual concept "zebra" and to recognize this new concept later. A system that integrates visual and linguistic information in this way can provide the basis for robust personal mobile applications or service robots, such as visual assistants to the vision-impaired, and voice-enable agents for elder care.

Conventional supervised learning techniques have been perfected to perform increasingly well on narrow performance tasks. To enable satisfactory performance in service robots and mobile multimedia applications, this research will integrate background and commonsense knowledge models to enable higher level reasoning together with such high-performance recognizers. This project will develop the VR-K framework focused on enabling more generalizable computer vision algorithms through integration with natural language understanding and grounding in knowledge-based reasoning. The research program will include 1) developing efficient probabilistic reasoning engines to construct recognition models of unseen concepts (object and attribute) without new annotation through probabilistic semantic parsing; 2) setting up new large-scale visual challenges and testbeds as the basis for rigorous performance evaluation of visual recognition with knowledge models and ablation analysis; and 3) prototyping the proposed framework on service robots and mobile devices for evaluation of the proposed framework's performance in complex real-world applications over a variety of user studies. The project will include education and outreach activities advancing AI in undergraduate research, diversity enhancement, Entrepreneurial Mindset (EM) education, and K-12 classrooms, and will include workshops to introduce AI and deep learning to professionals in non-CS professions such as medical research and pathology.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

(Showing: 1 - 10 of 23)

Show All

Aditya, S. and Yang, Y. and Baral, C and Aloimonos, Y "Combining Knowledge and Reasoning through Probabilistic Soft Logic for Image Puzzle Solving" Uncertainty in artificial intelligence , 2018 Citation Details

Aditya, Somak and Saha, Rudra and Yang, Yezhou and Baral, Chitta "Spatial Knowledge Distillation to Aid Visual Reasoning" 2019 IEEE Winter Conference on Applications of Computer Vision (WACV) , 2019 10.1109/WACV.2019.00030 Citation Details

Aditya, Somak and Yang, Yezhou and Baral, Chitta "Integrating Knowledge and Reasoning in Image Understanding" Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence , 2019 https://doi.org/10.24963/ijcai.2019/873 Citation Details

Banerjee, Pratyay and Gokhale, Tejas and Yang, Yezhou and Baral, Chitta "Weakly Supervised Relative Spatial Reasoning for Visual Question Answering" 2021 IEEE/CVF International Conference on Computer Vision (ICCV) , 2021 https://doi.org/10.1109/ICCV48922.2021.00192 Citation Details

Chatterjee, A and Gokhale, T and Baral, C and Yang, Y "On the Robustness of Language Guidance for Low-Level Vision Tasks: Findings from Depth Estimation" , 2024 Citation Details

Fang, Zhiyuan and Kong, Shu and Fowlkes, Charless and Yang, Yezhou "Modularized Textual Grounding for Counterfactual Resilience" 2019 {IEEE}/{CVF} Conference on Computer Vision and Pattern Recognition ({CVPR}) , 2019 10.1109/cvpr.2019.00654 Citation Details

Fang, Zhiyuan and Wang, Jianfeng and Hu, Xiaowei and Liang, Lin and Gan, Zhe and Wang, Lijuan and Yang, Yezhou and Liu, Zicheng "Injecting Semantic Concepts into End-to-End Image Captioning" 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) , 2022 https://doi.org/10.1109/CVPR52688.2022.01748 Citation Details

Farhadi, Mohammad and Ghasemi, Mehdi and Yang, Yezhou "A Novel Design of Adaptive and Hierarchical Convolutional Neural Networks using Partial Reconfiguration on FPGA" 2019 IEEE High Performance Extreme Computing Conference (HPEC) , 2019 https://doi.org/10.1109/HPEC.2019.8916237 Citation Details

Farhadi, Mohammad and Yang, Yezhou "TKD: Temporal Knowledge Distillation for Active Perception" 2020 {IEEE} Winter Conference on Applications of Computer Vision ({WACV}) , 2020 10.1109/wacv45572.2020.9093437 Citation Details

Feinglass, Joshua and Yang, Yezhou "SMURF: SeMantic and linguistic UndeRstanding Fusion for Caption Evaluation via Typicality Analysis" Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing , v.1 , 2021 https://doi.org/10.18653/v1/2021.acl-long.175 Citation Details

Gokhale, T. and Banerjee, P. and Baral, C and Yang, Y. "VQA-LOL: Visual Question Answering Under the Lens of Logic" ECCV 2020: Computer Vision ECCV 2020 , 2020 https://doi.org/10.1007/978-3-030-58589-1 Citation Details

(Showing: 1 - 10 of 23)

Show All

Please report errors in award information by writing to: awardsearch@nsf.gov.

Success

Error