NSF Award Search: Award # 1953052 - Geometric and Semantic Structures for Two- and Three-Dimensional Shape Understanding

Award Abstract # 1953052

Geometric and Semantic Structures for Two- and Three-Dimensional Shape Understanding

NSF Org:	DMS Division Of Mathematical Sciences
Recipient:	OCCIDENTAL COLLEGE
Initial Amendment Date:	March 13, 2020
Latest Amendment Date:	March 13, 2020
Award Number:	1953052
Award Instrument:	Standard Grant
Program Manager:	Yong Zeng yzeng@nsf.gov (703)292-7299 DMS Division Of Mathematical Sciences MPS Directorate for Mathematical and Physical Sciences
Start Date:	June 1, 2020
End Date:	May 31, 2024 (Estimated)
Total Intended Award Amount:	$160,000.00
Total Awarded Amount to Date:	$160,000.00
Funds Obligated to Date:	FY 2020 = $160,000.00
History of Investigator:	Kathryn Leonard (Principal Investigator) kleonardci@gmail.com
Recipient Sponsored Research Office:	Occidental College 1600 CAMPUS RD LOS ANGELES CA US 90041-3314 (323)259-1414
Sponsor Congressional District:	34
Primary Place of Performance:	Occidental College CA US 90041-3314
Primary Place of Performance Congressional District:	34
Unique Entity Identifier (UEI):	DCQQX5TRCYN9
Parent UEI:
NSF Program(s):	TOPOLOGY, CDS&E-MSS
Primary Program Source:	01002021DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s):	9263
Program Element Code(s):	126700, 806900
Award Agency Code:	4900
Fund Agency Code:	4900
Assistance Listing Number(s):	47.049

ABSTRACT

True computer vision will provide end-to-end image analysis, where images are decomposed into objects of interest, those objects are decomposed into parts, and the parts and objects are recognized. Performing integrated tasks with an image, such as shape generation, animation, editing, or partial matching, requires structure-aware shape processing. A full shape structure consists of a decomposition into parts, an understanding of which parts are more significant than others, and an ability to measure similarity of parts moving toward recognition. A pipeline that takes as input two- or three-dimensional images, performs accurate segmentation to determine shapes of interest, extracts a shape structure, then recognizes the parts and the shapes would represent a fundamental step forward in artificial vision. The task is challenging because human visual perception does not follow computational rules. For example, two shapes can both be similar to a third shape without being similar to each other. For another, our understanding of meaning of shapes adds a semantic level to our geometric perception: if someone is seated on an object, we classify that object as a chair regardless of its shape. Any useful shape analysis must explicitly model the interplay between semantics and geometric shape. This project aims to develop the foundational theory of shape structure and provide robust implementations of the resulting techniques while maintaining the connection to human semantic perception through benchmarking to user studies.

The Blum medial axis gives a skeletal decomposition of a closed region in Euclidean space. For spatial dimensions 2 and 3, these regions can be interpreted as 2D and 3D shapes, with the skeletal model providing a lower-dimensional representation of the shape. The skeleton, a Whitney stratified set, is a deformation retract of the shape boundary that captures complete geometric information about the boundary of the shape. This project will introduce functions on the medial axis that encode shape geometry in a way that allows for the determination of a parts decomposition and hierarchy within a shape, as well as similarity between parts, for shapes of any finite genus. Based on that analysis, the research will develop formal measures of shape complexity and benchmark results through human perception studies. Finally, the project aims to connect the new shape structure characterization to current approaches using neural networks for image understanding by developing network architectures that learn the geometry of a shape skeleton from its natural or binary image representation.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Bazazian, Dena and Magland, Bonnie and Grimm, Cindy and Chambers, Erin and Leonard, Kathryn "Perceptually grounded quantification of 2D shape complexity" The Visual Computer , v.38 , 2022 https://doi.org/10.1007/s00371-022-02634-8 Citation Details

Haddock, Jamie and Kassab, Lara and Li, Sixian and Kryshchenko, Alona and Grotheer, Rachel and Sizikova, Elena and Wang, Chuntian and Merkh, Thomas and Madushani, R. W. and Ahn, Miju and Needell, Deanna and Leonard, Kathryn "Semi-supervised Nonnegative Matrix Factorization for Document Classification" 2021 55th Asilomar Conference on Signals, Systems, and Computers , 2021 https://doi.org/10.1109/IEEECONF53345.2021.9723109 Citation Details

Kumar, Yulia and Gordon, Zachary and Alabi, Oluwatunmise and Li, Jenny and Leonard, Kathryn and Ness, Linda and Morreale, Patricia "ChatGPT Translation of Program Code for Image Sketch Abstraction" Applied Sciences , v.14 , 2024 https://doi.org/10.3390/app14030992 Citation Details

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

The development of a visual system for a computer that can function analogously to the human visual system requires a process for understanding shape structure, A full shape structure must include a decomposition of a shape into parts, a hierarchy of importance of those parts, and the relative part similarity (e.g., the shape has four legs). Performing integrated tasks with a shape such as shape generation, animation, editing, or partial matching requires a well-developed shape structure that matches with human perception.

This project developed foundational theoretical results, robust implementations, and links with human perception as part of a shape structure framework based on the medial axis of a shape and the functions defined on it.

The interior Blum medial axis of a shape is a skeleton of the shape together with a distance to the shape boundary for each point in the skeleton. Defining functions on the medial axis that capture the geometry of the shape creates a framework for a shape structure, Our theoretical results demonstrate that different such functions extract the parts, the hierarchy, and the similarity, and do so for 2D and 3D shapes of any finite genus. Our theoretical results also show that these functions are stable functions that behave well for shapes that themselves behave well.

Based on the theoretical results, we developed and tested algorithms for implementation. We have shared robust implementations of all code through github repositories. Finally, through user experiements, we also show that these functions capture important components of human perception of shape complexity.

The project also supported several activities in the broader community, including annual Deep Geometric Learning conferences associated with CVPR, annual SkelNetOn challenges for using deep learning to learn clean skeletal representations of 2D shapes from images (resulting in two successful approaches from the computer vision community), and the Summer Geometry Intensive at MIT, an annual online 8-week research experience for students all over the world to work on a collection of 1-2 week research projects in applied geometry. In addition, the project hosted a research collaboration workshop (WiSH 2021), where 50 participants worked for one week on open research questions in shape modeling at BIRS-Oaxaca (online because of the pandemic).

Last Modified: 09/29/2024
Modified by: Kathryn Leonard

Please report errors in award information by writing to: awardsearch@nsf.gov.

Success

Error