Award Abstract # 1463102
CI-New: Collaborative Research: Federated Data Set Infrastructure for Recognition Problems in Computer Vision

NSF Org: CNS
Division Of Computer and Network Systems
Recipient: REGENTS OF THE UNIVERSITY OF MICHIGAN
Initial Amendment Date: September 17, 2014
Latest Amendment Date: September 17, 2014
Award Number: 1463102
Award Instrument: Standard Grant
Program Manager: Jie Yang
jyang@nsf.gov
 (703)292-4768
CNS
 Division Of Computer and Network Systems
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: October 1, 2014
End Date: September 30, 2017 (Estimated)
Total Intended Award Amount: $150,000.00
Total Awarded Amount to Date: $150,000.00
Funds Obligated to Date: FY 2014 = $150,000.00
History of Investigator:
  • Jason Corso (Principal Investigator)
    jjcorso@umich.edu
Recipient Sponsored Research Office: Regents of the University of Michigan - Ann Arbor
1109 GEDDES AVE STE 3300
ANN ARBOR
MI  US  48109-1015
(734)763-6438
Sponsor Congressional District: 06
Primary Place of Performance: University of Michigan Ann Arbor
MI  US  48109-1274
Primary Place of Performance
Congressional District:
06
Unique Entity Identifier (UEI): GNJ7BBP73WE9
Parent UEI:
NSF Program(s): CCRI-CISE Cmnty Rsrch Infrstrc
Primary Program Source: 01001415DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 7359
Program Element Code(s): 735900
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

Broad access to image and video datasets has been responsible for much of the progress in computer vision recognition problems over the last decade. These common benchmarks have played a leading role in transforming recognition research from a black art into an experimental science. Progress, however, has stagnated; although datasets continue to grow, they are developed and annotated in isolation: e.g., a collection of sporting activities, a set of objects in images, etc. These isolated datasets suffer from task and domain-specific bias, and knowledge transfer across them is extremely limited. This project is investigating and establishing a prototype architecture that federates across various recognition problems and modalities, by establishing a common namespace for entities, events and annotations across the datasets. The project is also establishing a web-portal for the prototype federated dataset architecture and linking two existing recognition datasets into the prototype architecture. The resulting federated structure is truly greater than the sum of its parts, and can support new research that was not previously possible for the computer vision community and other related fields.

As a first test scenario for this federated architecture, this project is investigating and constructing a new federated dataset of images and video annotated with various forms of associated text. Image and video content annotations span both the spatial and temporal dimensions while textual annotations reflecting depicted content range from complete free-form natural language descriptions, to more targeted phrases and referring expressions, to individual keyword lists. This dataset is being constructed to promote and enhance collaboration efforts between the vision and language communities by providing a new multi-modal annotated dataset with associated research competitions.

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

The Federated Data Set Infrastructure for Recognition Problems in Computer Vision was a seed project in the CRI program that led to two significant outcomes.  First, the investigators of the workshop held a widely attended information-gathering workshop at CVPR 2015: The Future of Datasets in Vision.  The workshop hosted four invited speakers, twenty posters from twenty-six institutions, ten countries and eighty-four individual speakers.  During and after the workshop, the investigators gathered and analyzed feedback and interests from the computer vision community to help craft a subsequent full-sized proposal to the CRI program.  Significant lessons learned from the community were that a common infrastructure to store and provide access to the large and growing number of datasets in the community would provide the most utility to the community while the alternative potential of establishing a common namespace was not seen as likely to the fruitful for the community.


This full-sized proposal was funded and is the second outcome of the seed project.  COVE -- Computer Vision Exchange for Data, Annotation and Tools -- is the resulting full-size project that integrates the lessons learned from this seed and from the workshop.  COVE will provide a common index for data sets in the computer vision community, with an index and archive coinciding with the Computer Vision Foundation's open access archive.  The COVE project is ongoing and will reside at http://cove.thecvf.com.


Last Modified: 02/12/2018
Modified by: Jason J Corso

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page