Award Abstract # 2319370
Collaborative Research: ATD: Rapid Structure Recovery and Outlier Detection in Multidimensional Data

NSF Org: DMS
Division Of Mathematical Sciences
Recipient: COLORADO STATE UNIVERSITY
Initial Amendment Date: August 9, 2023
Latest Amendment Date: August 9, 2023
Award Number: 2319370
Award Instrument: Standard Grant
Program Manager: Tomek Bartoszynski
tbartosz@nsf.gov
 (703)292-4885
DMS
 Division Of Mathematical Sciences
MPS
 Directorate for Mathematical and Physical Sciences
Start Date: September 15, 2023
End Date: August 31, 2025 (Estimated)
Total Intended Award Amount: $144,726.00
Total Awarded Amount to Date: $144,726.00
Funds Obligated to Date: FY 2023 = $144,726.00
History of Investigator:
  • James Wilson (Principal Investigator)
    james.wilson@colostate.edu
Recipient Sponsored Research Office: Colorado State University
601 S HOWES ST
FORT COLLINS
CO  US  80521-2807
(970)491-6355
Sponsor Congressional District: 02
Primary Place of Performance: Colorado State University
601 S HOWES ST
FORT COLLINS
CO  US  80521-2807
Primary Place of Performance
Congressional District:
02
Unique Entity Identifier (UEI): LT9CXX8L19G1
Parent UEI:
NSF Program(s):
Primary Program Source: 01002324RB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 079Z, 6877
Program Element Code(s):
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.049

ABSTRACT

The project aims to develop and apply tools coming from algebra to identify attacks embedded within legitimate communications streams and online fora. The research will enhance statistical methods and Artificial Intelligence to detect and combat threats. Specifically, the research will work with unlabeled data and will be applicable in environments that cannot support learning on training data, such as new domains or previously unseen threats. The project will provide training opportunities for both undergraduate and graduate students, preparing them for their future careers in STEM fields. The research on detecting outliers in data, recovering missing data, and detecting hidden constraints will have many applications across the sciences.

The project aims to design a self-adaptive linear-time algorithm to separate signals, find hidden constraint equations, and detect similarities in high-dimensional data (tensors). This collaborative research of the three investigators and student participants will focus on three independent tasks. The first will extend signal separation and outlier prediction to a continuous spectrum. The second will refactor algebraic structures into tensor networks for uniform algorithms. The third will devise faster (linear-time) solutions to matrix systems to enhance practical range. Analysis of high-dimensional data often runs afoul of the curse of dimensionality: as the number of independent parameters increases, the time needed to search neighbors grows exponentially. Also, the meaning of outlier becomes blurred as notions of far apart and close together are less distinguishable, and traditional statistics tend to identify large subspaces. The new algebraic markers will detect structure in any dimension and be quickly computable.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page