
NSF Org: |
CCF Division of Computing and Communication Foundations |
Recipient: |
|
Initial Amendment Date: | June 6, 2014 |
Latest Amendment Date: | July 21, 2015 |
Award Number: | 1422324 |
Award Instrument: | Standard Grant |
Program Manager: |
Rahul Shah
CCF Division of Computing and Communication Foundations CSE Directorate for Computer and Information Science and Engineering |
Start Date: | September 1, 2014 |
End Date: | August 31, 2018 (Estimated) |
Total Intended Award Amount: | $470,349.00 |
Total Awarded Amount to Date: | $480,349.00 |
Funds Obligated to Date: |
FY 2015 = $10,000.00 |
History of Investigator: |
|
Recipient Sponsored Research Office: |
520 LEE ENTRANCE STE 211 AMHERST NY US 14228-2577 (716)645-2634 |
Sponsor Congressional District: |
|
Primary Place of Performance: |
338 Davis Hall Buffalo NY US 14260-2500 |
Primary Place of
Performance Congressional District: |
|
Unique Entity Identifier (UEI): |
|
Parent UEI: |
|
NSF Program(s): | Algorithmic Foundations |
Primary Program Source: |
01001516DB NSF RESEARCH & RELATED ACTIVIT |
Program Reference Code(s): |
|
Program Element Code(s): |
|
Award Agency Code: | 4900 |
Fund Agency Code: | 4900 |
Assistance Listing Number(s): | 47.070 |
ABSTRACT
Recent progress in biomedicine has heavily relied on computer science technology. As the territory of biomedicine is rapidly enlarging, more powerful computational techniques are needed to foster its continuous growth. This project is for developing efficient computer algorithms for a set of geometric problems arising in several biomedical imaging applications. Particularly, it will design algorithms for three challenging problems (as well as their related problems): (1) generalizations of Voronoi diagram (which is a fundamental structure in geometry) called Clustering Induced Voronoi Diagram (CIVD), (2) general forms of matching in geometric settings based on Earth Mover's Distance (EMD), and (3) uniform framework for various types of constrained clustering problems. Problem (1) is for developing a mathematical model for understanding the interaction of dendritic cells and T cells in the immune system. Problem (2) is motivated by a medical imaging application of combining anatomic and functional images of moving organs for better imaging quality. Problem (3) aims to achieve better accuracy in image classifications by using a priori knowledge.
This project will use computational geometry techniques to develop novel algorithms for the proposed problems. It will introduce several general algorithmic techniques to the area of computational geometry, enriching and prodding its further development. These algorithmic techniques are also likely to be used in other areas, such as machine learning, computer vision, information security, data mining, and pattern recognition, and bring new ideas to these areas. This project could lead to several long term impacts. It could potentially help understanding the immune function and dysregulation in cancer, improving our ability to recovering fast organ motion (e.g., cardiac and lung motion), and achieving more accurate image classification in various applications.
PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH
Note:
When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external
site maintained by the publisher. Some full text articles may not yet be available without a
charge during the embargo (administrative interval).
Some links on this page may take you to non-federal websites. Their policies may differ from
this site.
PROJECT OUTCOMES REPORT
Disclaimer
This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.
This project is for developing algorithmic solutions to three fundamental geometric problems arising in biomedical imaging applications: (1) Clustering Induced Voronoi Diagrams, (2) Earth Mover?s Distance Based Geometric Matching, and (3) Clustering Constrained Data without Locality Property. The first problem is for investigating a significant generalization of the fundamental geometric structure, Voronoi diagram, and is motivated by the biomedical problem of developing a mathematical model for understanding the immune system. The second problem is for studying the most generalized form of matching in geometric settings and is motivated by a medical application of matching images with different modalities. The third problem is for developing unified approach for a large class of constrained clustering problems, and is motivated by an image classification problem with a priori knowledge.
Outcomes related to Intellectual Merits:
The PI of this project and his students have developed state-of-the-art solutions to each of the proposed problem. For the first problem, they have used the concept of joint influence to generalize Voronoi diagram, for the first time, from a distance-based to influence-based structure, and obtained efficient algorithms for constructing two types of diagrams called Clustering Induced Voronoi Diagrams (CIVD) and Influence-based Voronoi Diagrams (IVD) for ungrouped and grouped points, respectively. The associated joint influence techniques also enable them to achieve the best algorithms for several important problems, such as truth discovery, influence estimation, distributed and robust SVM, and fast pattern search. As by-products of these algorithms, a set of general techniques, such as Approximate Influence (AI) Decomposition, Range Cover, and Sum Query, were also obtained, which have the potential to be used for solving other problems.
For the second problem, they have focused on computing earth mover?s distance under rigid transformation. This is a longstanding open problem in the field of computational geometry. Based on a number of new techniques, they have achieved the first near optimal solution (FPTAS) for matching two sets of points in any fixed dimensional space. As a consequence of this breakthrough, they have also obtained the best solutions to several related problems, such as the alignment problem, and matching under other metrics and transformations, such as Hausdorff distance, similarity transformation, and affine transformation.
For the third problem, they have obtained a unified framework called Peeling-and-Enclosing (PnE) for a large class of constrained clustering problems, where the constraints can be very general. This is a rather powerful technique as most of the clustering problems in real world applications are constrained in some ways. Their PnE technique is able to achieve near optimal ((1+epsilon)-approximation)) solution for each of them for two types of clustering, k-means clustering and k-median clustering. An important theoretical result from this framework is a Simplex Lemma which can be used as a general technique for many high dimensional problems.
To obtain the aforementioned algorithms, they have also studied a number of related problems, such as Clustering-Based Collaborative Filtering, Random Gradient Descent Tree, Geometric Approach for Global Alignment of PPI Networks, Online and Distributed SVM, Large Scale Constrained Linear Regression, and Differentially Private Empirical Risk Minimization. A number of quality guaranteed techniques have been obtained.
All the designed algorithmic techniques have been disseminated through publications in peer-reviewed journals and conferences. Due to their fundamental nature, these algorithms have the potential to be used to solve various problems in computer science and other related areas.
Outcomes related to Broader Impacts:
The obtained computer algorithms have been used to analyze several types of biomedical data. Particularly, they have used the obtained algorithms to analyze biomedical data related to skin cancer. This study aims to determine what kinds of mutations are likely caused by sun exposure and will eventually lead to skin cancer. Some interesting results have already been obtained which will be disseminated soon.
The CIVD technique has been used to solve an important truth discovery problem in data science and achieve the first quality guaranteed solution for two popular models of this problem. The obtained truth discovery algorithms have been used for analyzing teaching evaluation data in the PI?s department and thus brings positive impact to the quality of education. One of the truth discovery paper won the best paper award in the 24th International Computing and Combinatorics Conference (COCOON 2018).
This project has provided educational and research opportunity to 9 PhD students (including 2 underrepresented female students). 6 of them have completed their doctoral dissertations. Three of them are now assistant professors in academia, including Michigan State University and Penn State University-Behrend.
Last Modified: 01/02/2019
Modified by: Jinhui Xu
Please report errors in award information by writing to: awardsearch@nsf.gov.