
NSF Org: |
OAC Office of Advanced Cyberinfrastructure (OAC) |
Recipient: |
|
Initial Amendment Date: | August 27, 2018 |
Latest Amendment Date: | August 27, 2018 |
Award Number: | 1808591 |
Award Instrument: | Standard Grant |
Program Manager: |
Tevfik Kosar
OAC Office of Advanced Cyberinfrastructure (OAC) CSE Directorate for Computer and Information Science and Engineering |
Start Date: | September 1, 2018 |
End Date: | August 31, 2021 (Estimated) |
Total Intended Award Amount: | $175,263.00 |
Total Awarded Amount to Date: | $175,263.00 |
Funds Obligated to Date: |
|
History of Investigator: |
|
Recipient Sponsored Research Office: |
200 UNIVERSTY OFC BUILDING RIVERSIDE CA US 92521-0001 (951)827-5535 |
Sponsor Congressional District: |
|
Primary Place of Performance: |
CA US 92521-0001 |
Primary Place of
Performance Congressional District: |
|
Unique Entity Identifier (UEI): |
|
Parent UEI: |
|
NSF Program(s): |
CDS&E-MSS, CDS&E |
Primary Program Source: |
|
Program Reference Code(s): |
|
Program Element Code(s): |
|
Award Agency Code: | 4900 |
Fund Agency Code: | 4900 |
Assistance Listing Number(s): | 47.070 |
ABSTRACT
In modern applications of science and engineering, large volumes of data are collected from diverse sensor modalities, commonly stored in the form of high-order arrays (tensors), and jointly analyzed in order to extract information about underlying phenomena. This joint tensor analysis can exploit inherent dependencies across data modalities and allow for markedly enhanced inference. Standard methods for tensor analysis rely on formulations that are sensitive to heavily corrupted points among the processed data (outliers). To counteract the destructive impact of outliers in modern data analysis (and thereto relying applications), this project will investigate new theory and robust algorithmic methods. The performance benefits of the developed tools will be evaluated in applications from the fields of data analytics, machine learning and computer vision. Thus, this research aspires to increase significantly the reliability of data-enabled research across science and engineering. Combining theoretical explorations, with practical algorithmic solutions for data analysis and experimental evaluations, this project has the potential to build significant future capacity not only for U.S. academic institutions but also for the U.S. government and industry. Thus, apart from promoting the progress of science, this project could contribute to advances in the national prosperity and welfare. In addition, research activities under this project will be integrated with education. Participating students, at both graduate and undergraduate levels, will gain important experience in optimization theory, machine learning, computer vision, and data mining, among other areas. Moreover, the project plan includes multiple STEM outreach activities and supports diversity in STEM by involving?students from underrepresented groups.
In this project, the theoretical underpinnings of L1-norm tensor analysis will be investigated, with a focus on its computational hardness and exact solution. Then, based on these new foundations, efficient/practical algorithms for L1-norm tensor analysis will be explored, together with scalable and distributed software implementations. These theoretical and algorithmic investigations are expected to advance significantly the knowledge in the currently under-explored area of L1-norm tensor analysis and deliver highly impactful methodologies for outlier-resistant multimodal data processing. Next, the PIs will employ the newly developed algorithmic tools in key problems from the fields of data analytics, machine learning and computer vision. In addition, research activities under this project will be integrated with education. Participating students, at both graduate and undergraduate levels, will gain important experience in optimization theory, machine learning, computer vision, and data mining, among other areas. Moreover, the project plan includes multiple STEM outreach activities?and supports diversity in STEM by involving students from underrepresented groups.
This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH
Note:
When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external
site maintained by the publisher. Some full text articles may not yet be available without a
charge during the embargo (administrative interval).
Some links on this page may take you to non-federal websites. Their policies may differ from
this site.
PROJECT OUTCOMES REPORT
Disclaimer
This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.
Tensor methods are powerful data analytic tools from multilinear algebra which are widely used in machine learning and data science. Tensors can represent heterogeneous, multi-modal datasets which come from a wide variety of real-world applications, including social network analysis, computer vision, recommendation systems, and scientific applications. The main focus of the project was to develop and apply robust tensor methods, where the data that are being analyzed are corrupted, noisy, or incomplete, which is very frequently the case in real-world scenarios. The research outcomes of this project fall into two thrusts: (1) Fundamental algorithm development for robust tensor methods, and (2) Real-world applications.
The project advanced the state-of-the-art in robust tensor methods while also innovating in key high-impact applications where the power and generality of tensor methods in obtaining state-of-the-art results was demonstrated. As part of this project, graduate students were trained in the PI’s lab, and the results of the research were broadly disseminated at top scientific venues. Furthermore, as part of fostering a vibrant research community, the PI was actively involved in the organization of symposia and workshops in the area of tensor methods for data science and machine learning.
Highlights for research outcomes are outlined below.
Fundamental algorithm development for robust tensor methods:
The first main thrust of the project was the design and development of fundamental algorithms for tensor analysis, with specific emphasis on robustness. Within that framework, the following two threads were explored:
(a) Robust tensor decomposition and compression: Within this thread, the project developed novel formulations for computing tensor decompositions for data analysis and compression in a manner which is less sensitive to outliers and noise, thus offering higher-quality in terms of data representation, reconstruction, and compression. Furthermore, a compression-based streaming tensor decomposition was developed.
(b) Tensor model order selection and its interplay with compression: Model selection in tensor analysis refers to the selection of the number of hidden patterns in the data, and is of utmost importance, since it dictates the number of insights that can be extracted from the data. The selection of this number is an extremely hard problem. Different methods for model order selection were developed, with specific emphasis on doing so in compressed data, which allows for scalable computations.
Real-world Applications:
The second main thrust of the work was the adaptation and application of tensor methods to high-impact real-world problems, including social media and network analysis, recommendation systems, computer vision, and scientific data analysis.
(a) Social media and network analysis:
WIthin the umbrella of social media and network analysis, the project focused on a number of high-impact and timely problems, such as misinformation detection (studying different aspects of it, including analyzing article content, analyzing visual cues from website screenshots, and detecting DeepFake images), geolocation of social media posts from text and images, humor recognition especially in the cases where limited human annotations are present, and the development of methods for shielding graph neural networks against adversarial attacks, where corrupted network interactions are meant to mislead the classification of certain entities in the network.
(b) Recommendation systems:
One of the main tensor models studied in the first thrust (RESCAL) was successfully applied to the challenging problem of complementary item recommendation in online grocery shopping.
(c) Computer vision:
As part of shielding neural networks that specialize in image classification, the project focused on tensor compression for alleviating adversarially corrupted images, which are meant to fool the classifier while remaining imperceptible to humans. Furthermore, the project developed tensor-based methods for computing representations of hyperspectral images which rival neural network methods in pixel classification.
(d) Scientific data analysis:
Scientific data are inherently multi-dimensional and multi-modal. In this project, specific emphasis was placed in the analysis of data from the Laser Interferometer Gravitational Wave Observatory (LIGO) with tensor methods towards the detection of so-called “glitches”, noise transients that appear similar to meaningful patterns in the data, thus corrupting the scientific data and impacting the accurate detection of gravitational waves.
Last Modified: 12/22/2021
Modified by: Evangelos Papalexakis
Please report errors in award information by writing to: awardsearch@nsf.gov.