Award Abstract # 1065243
RI: Medium: Collaborative Research: Semantically Discriminative: Guiding Mid-Level Representations for Visual Object Recognition with External Knowledge

NSF Org: IIS
Division of Information & Intelligent Systems
Recipient: UNIVERSITY OF SOUTHERN CALIFORNIA
Initial Amendment Date: March 25, 2011
Latest Amendment Date: July 13, 2017
Award Number: 1065243
Award Instrument: Continuing Grant
Program Manager: Jie Yang
jyang@nsf.gov
 (703)292-4768
IIS
 Division of Information & Intelligent Systems
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: August 1, 2011
End Date: December 31, 2017 (Estimated)
Total Intended Award Amount: $491,289.00
Total Awarded Amount to Date: $491,289.00
Funds Obligated to Date: FY 2011 = $236,996.00
FY 2013 = $125,630.00

FY 2014 = $128,663.00
History of Investigator:
  • Fei Sha (Principal Investigator)
    feisha@usc.edu
Recipient Sponsored Research Office: University of Southern California
3720 S FLOWER ST FL 3
LOS ANGELES
CA  US  90033
(213)740-7762
Sponsor Congressional District: 34
Primary Place of Performance: University of Southern California
3720 S FLOWER ST FL 3
LOS ANGELES
CA  US  90033
Primary Place of Performance
Congressional District:
34
Unique Entity Identifier (UEI): G88KLJR3KYT5
Parent UEI:
NSF Program(s): Robust Intelligence
Primary Program Source: 01001112DB NSF RESEARCH & RELATED ACTIVIT
01001314DB NSF RESEARCH & RELATED ACTIVIT

01001415DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 7495, 7924
Program Element Code(s): 749500
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

This project explores (semi-)automatic ways to create "semantically discriminative" mid-level cues for visual object categorization, by introducing external knowledge of object properties into the statistical learning procedures that learn to distinguish them. In particular, the PIs investigate four key ideas: (1) exploiting taxonomies over object categories to inform feature selection algorithms such that they home in on the most abstract description for a given granularity of label predictions; (2) leveraging inter-object relationships conveyed by the same taxonomies to guide context learning, so that it captures more than simple data-driven co-occurrences; (3) exploring the utility of visual attributes drawn from natural language, both as auxiliary learning problems to bias models for object categorization, as well as ordinal properties that must be teased out using non-traditional human supervision strategies; (4) mining attributes that are both distinctive and human-nameable, moving beyond manually constructed semantics.

The project entails original contributions in both computer vision and machine learning, and is an integral step towards semantically-grounded object categorization. Whereas mainstream approaches reduce human knowledge to mere category labels on exemplars, this work leverages semantically rich knowledge more deeply and earlier in the learning pipeline. The approach results in vision systems that are less prone to overfit incidental visual patterns, and representations that are readily extendible to novel visual learning tasks. Beyond the research community, the work has broader impact through inter-disciplinary training of graduate and undergraduate students, and outreach to pre-college educators and students through workshops and summer camps encouraging young students to pursue science and engineering.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Alber, Maximilian and Kindermans, Pieter-Jan and Sch\"{u}tt, Kristof and M\"{u}ller, Klaus-Robert and Sha, Fei "An Empirical Study on The Properties of Random Bases for Kernel Methods" NIPS , 2017
Boqing Gong, Kristen Grauman, and Fei Sha "Learning Kernels for Unsupervised Domain Adaptation withApplications to Visual Object Recognition." International Journal of Computer Vision (IJCV) , v.109 , 2014
Changpinyo, Soravit and Chao, Wei-Lun and Sha, Fei "Predicting Visual Exemplars of Unseen Classes for Zero-Shot Learning" ICCV , 2017
Chao, Wei-Lun and Hu, Hexiang and Sha, Fei "Being Negative but Constructively: Lessons Learnt from Creating Better Visual Question Answering Datasets" CVPR 2017 Workshop on Negative Results in Computer Vision , 2017
Hexiang Hu and Shiyi Lan and Yuning Jiang and Zhimin Cao and Fei Sha "FastMask: Segment Multi-scale Object Candidates in One Shot" 2017 {IEEE} Conference on Computer Vision and Pattern Recognition, {CVPR} , 2017
Soravit Changpinyo, Weilun Chao and Fei Sha "Predicting Visual Exemplars for Zero-shot Learning" Proc. of ICCV , 2017
Soravit Changpinyo, Weilun Chao, Boqing Gong, and Fei Sha "Synthesized Classifiers for Zero-Shot Learning" Proc. of CVPR , 2016
Weilun Chao, Soravit Chanpinyo, Boqing Gong and Fei Sha "An Empirical Study and Analysis of Generalized Zero-Shot Learning for Object Recognition in the Wild" Proc. of ECCV , 2016

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

This project explores (semi-)automatic ways to create "semantically discriminative" mid-level cues for visual object categorization, by introducing external knowledge of object properties into the statistical learning procedures that learn to distinguish them. In particular, the PIs investigate four key ideas: (1) exploiting taxonomies over object categories to inform feature selection algorithms such that they home in on the most abstract description for a given granularity of label predictions; (2) leveraging inter-object relationships conveyed by the same taxonomies to guide context learning, so that it captures more than simple data-driven co-occurrences; (3) exploring the utility of visual attributes drawn from natural language, both as auxiliary learning problems to bias models for object categorization, as well as ordinal properties that must be teased out using non-traditional human supervision strategies; (4) mining attributes that are both distinctive and human-nameable, moving beyond manually constructed semantics.


The project entails original contributions in both computer vision and machine learning, and is an integral step towards semantically-grounded object categorization. Whereas mainstream approaches reduce human knowledge to mere category labels on exemplars, this work leverages semantically rich knowledge more deeply and earlier in the learning pipeline. The approach results in vision systems that are less prone to overfit incidental visual patterns, and representations that are readily extendible to novel visual learning tasks. Beyond the research community, the work has broader impact through inter-disciplinary training of graduate and undergraduate students, and outreach to pre-college educators and students through workshops and summer camps encouraging young students to pursue science and engineering.


During the lifespan of this project, the support from NSF has generated significant outcomes in several aspects, summarized as follows: (1) interdisciplinary research between machine learning and computer vision in the areas of learning attributes as mid-level representations, addressing domain adaptations, a very challenging problem in computer vision, zero-shot learning for building large-scale and robust computer vision systems that can operate in the wild, preliminary research into visual question and answering for artificial general intelligence; (2) applying reserarch results to real-world problems such as collaboration with government labs and industry research teams; (3) educating next-generation academics including graduating two students who have taken academic teaching and research jobs; (4) training undegraduate and graduate students. 


The project had also catalyzed the collaborative research between us and the research lab led by Prof. Kristen Grauman at U of Texas (Austin). Building on top of the collaborative research conducted in this project, the two labs have formed broader collaborations and have since taken another joint research project under NSF support.  


Last Modified: 04/20/2018
Modified by: Fei Sha

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page