
NSF Org: |
DMS Division Of Mathematical Sciences |
Recipient: |
|
Initial Amendment Date: | May 16, 2018 |
Latest Amendment Date: | May 16, 2018 |
Award Number: | 1832046 |
Award Instrument: | Standard Grant |
Program Manager: |
Gabor Szekely
DMS Division Of Mathematical Sciences MPS Directorate for Mathematical and Physical Sciences |
Start Date: | August 1, 2017 |
End Date: | June 30, 2020 (Estimated) |
Total Intended Award Amount: | $76,912.00 |
Total Awarded Amount to Date: | $76,912.00 |
Funds Obligated to Date: |
|
History of Investigator: |
|
Recipient Sponsored Research Office: |
1918 F ST NW WASHINGTON DC US 20052-0042 (202)994-0728 |
Sponsor Congressional District: |
|
Primary Place of Performance: |
2121 Eye Street Washington DC US 20052-0001 |
Primary Place of
Performance Congressional District: |
|
Unique Entity Identifier (UEI): |
|
Parent UEI: |
|
NSF Program(s): | STATISTICS |
Primary Program Source: |
|
Program Reference Code(s): |
|
Program Element Code(s): |
|
Award Agency Code: | 4900 |
Fund Agency Code: | 4900 |
Assistance Listing Number(s): | 47.049 |
ABSTRACT
Functional data analysis, which deals with a sample of functions or curves, plays an important role in modern data analysis. Nowadays in the era of "Big Data", multidimensional and multivariate functional data are becoming increasingly common, especially in biological, medical, and engineering applications. There are significant challenges posed by the very large dimension and complex structure of these data. The proposed research will substantially narrow the gap between the increasing demand for handling such data in practice, and the insufficient development of statistical methods and computational tools. This research has applications to neuroscience, climate science, and engineering. It will provide scientists, engineers, and doctors with tools to help understand problems in their area, and enhance interdisciplinary collaborations.
This project offers a comprehensive research plan to advance the understanding and applicability of multidimensional and multivariate functional data. The research will focus on the following three sub-projects: (1) Develop data-adaptive and interpretable representation of the covariance function for multidimensional functional data, (2) Develop a novel model-free procedure to detect dependency between components of multivariate functional data, and (3) Address the modeling and prediction of multivariate functional time series. The resulting methods will be applied to neuroimaging and climate data. The integration of these three sub-projects will foster creative directions and strategies for multidimensional and multivariate functional data.
PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH
Note:
When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external
site maintained by the publisher. Some full text articles may not yet be available without a
charge during the embargo (administrative interval).
Some links on this page may take you to non-federal websites. Their policies may differ from
this site.
PROJECT OUTCOMES REPORT
Disclaimer
This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.
Functional data analysis, which deals with a sample of functions or curves, plays an important role in modern data analysis. Nowadays in the era of "Big Data", multidimensional and multivariate functional data are becoming increasingly common, especially in biological, medical, and engineering applications, but the large dimension and complex structure of these data pose significant challenges.
This project has substantially narrowed the gap between the increasing demand for handling multidimensional and multivariate functional data in practice, and the insufficient development of statistical methods and computational tools. This research has advanced mathematical and computational statistics, and the project outcomes have been applied to neuroimaging, oceanographic, and traffic data. This project has focused on three specific sub-projects:
1. Covariance function estimation for multidimensional functional data: Covariance function estimation is regarded as a crucial step in functional data analysis. However, for multidimensional functional data, there are very few existing estimators due to the fundamental difficulty of multi-dimensionality. We proposed an efficient yet widely applicable modeling which addresses this. Not only do the resulting techniques provide powerful covariance estimators, but they also serve as a building block for subsequent statistical techniques for multidimensional functional data.
2. Independence tests for multivariate functional data: Independence tests are useful statistical tools for detecting relationships among different attributes. When these attributes are functions, we obtain a multivariate functional dataset, e.g., multiple signals measured over time (regarded as functions) from different human brain regions when the corresponding subject is performing a particular task. The corresponding relationship between these brain regions is known as functional connectivity. We proposed a novel and powerful test that can identify the dependency for multivariate functional data. Our work is empirically shown to be useful in detecting functional connectivity patterns.
3. Autoregressive models for multivariate functional time series: Historically functional data are collected from independent sources. Nowadays many functional data are recorded in a sequential manner, called "multivariate functional time series" of which measurements of a few variables are naturally separated by consecutive time intervals, e.g., hourly maximum temperature and hourly maximum wind speed in many days. Since functional time series often arise from a single source, the dependency between curves is usually expected and it is of interest to model this dependency by predicting future measurements using the history. Most existing techniques are based on a truncated procedure whose discrete nature often adversely affects the statistical performance. We developed an estimation that avoids this discrete procedure to improve estimation accuracy, computational stability and predictability.
Graduate students, including women, have participated in this project. They have learned advanced computational and mathematical skills. Some of the above sub-projects have naturally become part of their dissertations. The project outcomes have been disseminated in seminars and conferences. This project has also produced peer-reviewed publications and publicly accessible computer code and packages.
Last Modified: 10/12/2020
Modified by: Xiaoke Zhang
Please report errors in award information by writing to: awardsearch@nsf.gov.