
NSF Org: |
DGE Division Of Graduate Education |
Recipient: |
|
Initial Amendment Date: | September 9, 2016 |
Latest Amendment Date: | September 9, 2016 |
Award Number: | 1633631 |
Award Instrument: | Standard Grant |
Program Manager: |
Vinod Lohani
DGE Division Of Graduate Education EDU Directorate for STEM Education |
Start Date: | September 15, 2016 |
End Date: | August 31, 2021 (Estimated) |
Total Intended Award Amount: | $2,967,150.00 |
Total Awarded Amount to Date: | $2,967,150.00 |
Funds Obligated to Date: |
|
History of Investigator: |
|
Recipient Sponsored Research Office: |
160 ALDRICH HALL IRVINE CA US 92697-0001 (949)824-7295 |
Sponsor Congressional District: |
|
Primary Place of Performance: |
4216 Bren Hall Irvine CA US 92697-0001 |
Primary Place of
Performance Congressional District: |
|
Unique Entity Identifier (UEI): |
|
Parent UEI: |
|
NSF Program(s): | NSF Research Traineeship (NRT) |
Primary Program Source: |
04001617DB NSF Education & Human Resource |
Program Reference Code(s): |
|
Program Element Code(s): |
|
Award Agency Code: | 4900 |
Fund Agency Code: | 4900 |
Assistance Listing Number(s): | 47.076 |
ABSTRACT
Massively parallel computers simulate data about molecular phenomena at previously unimaginable scales, satellites scan the planet capturing vast sets of measurements about ecosystem health, and particle accelerators generate tremendous amounts of data revealing fundamental properties of the smallest building blocks of matter; all with potentially broad societal benefits in areas such as drug discovery, energy conservation, and materials science. To fully realize these benefits will require a workforce with the technical skills to extract useful information from massive scientific data sets, calling for new approaches to graduate student training that emphasize expertise in data-driven science. This National Science Foundation Research Traineeship (NRT) award to the University of California Irvine (UCI) will tackle this challenge by creating a training ecosystem comprised of leading UCI, national-laboratory, and private-sector researchers across particle physics, earth science, chemistry, statistics and machine learning; all bound together by expertise in the emerging Science of Team Science. The project anticipates training over sixty (60) MS and PhD students, including twenty (20) funded trainees, from diverse backgrounds in computational statistics, machine learning, earth science, particle physics, synthetic chemistry, and team science. After graduation, students from this program will have both the technical and team-science skills to be leaders in the emerging field of data-driven science, and to participate in and lead interdisciplinary research teams at national laboratories, in academia, and in industry labs.
The research agenda of the program seeks to create the foundation from which bridges can be built between the traditional scientific route of building interpretable models based on physical principles and data-driven modeling approaches that can provide high fidelity predictions but may lack clear interpretability in terms of the underlying science. The program will involve a number of interrelated research themes across multiple disciplines in the information and physical sciences, including machine learning (e.g. temporal and spatial data modeling, multi-scale models, deep learning, and scalable learning algorithms), particle and astroparticle physics (e.g. accelerator based experiments), earth systems science (e.g. reducing ecosystem response prediction uncertainties), and chemistry (e.g. prediction of physical properties of small molecules). A significant aspect of the program is an emphasis on team science as a core theme. Students will collaborate in small interdisciplinary research teams consisting of students and faculty with different disciplinary skills, and will take part in team-science workshops leading to student-led development of a team-science certificate in years 3 to 5 of the program. Summer internships for student participants, at both national and industry research laboratories, will serve to reinforce the students' academic training via participation in large-scale interdisciplinary data science research projects.
The NSF Research Traineeship (NRT) Program is designed to encourage the development and implementation of bold, new potentially transformative models for STEM graduate education training. The Traineeship Track is dedicated to effective training of STEM graduate students in high priority interdisciplinary research areas, through the comprehensive traineeship model that is innovative, evidence-based, and aligned with changing workforce and research needs.
PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH
Note:
When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external
site maintained by the publisher. Some full text articles may not yet be available without a
charge during the embargo (administrative interval).
Some links on this page may take you to non-federal websites. Their policies may differ from
this site.
PROJECT OUTCOMES REPORT
Disclaimer
This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.
Modern scientific research is increasingly data-driven, with huge volumes of scientific data being generated by telescopes, particle accelerators, satellites, and more. As a result it is critically important that future generations of scienticts have expertise in analyzing data. As part of a five-year Machine Learning and Physical Sciences (MAPS) graduate training program at the University of California, Irvine, 51 PhD students working at the intersection of physical and information sciences were supported in their graduate research by funding from the National Science Foundation Research Traineeship program. In particular, students received training and mentoring with a specific emphasis on developing and harnessing new techniques from data science and machine learning for data-driven scientific discovery.
In the natural sciences, students from areas such as particle physics, climate science, and chemistry gained skills in topics such as machine learning, algorithms, and statistics; and engaged in graduate thesis projects with a significant data science component. On the information science side, students from computer science and statistics focused on particular disciplines within the physical sciences as the application area for their graduate thesis work. A significant additional aspect of the program was an emphasis on student communication and leadership skills in interdisciplinary team science.
The program produced a cohort of graduate students who have the data science skills to explore new research directions at the interface of information science and physical sciences. In addition these students have both the technical and team-science skills to be leaders in the emerging field of data science, and to participate in and lead interdisciplinary research and engineering teams at national laboratories, in academia, and in private industry.
From a research perspective, the program contributed to fundamental new knowledge in the scientific disciplines of particle and astrophysics, synthetic chemistry, and earth and climate science, In addition the program contributed to the development of new techniques and algorithms in machine learning and data science. The PhD students in the program produced over 50 peer-reviewed research papers on their research as well as a variety publicly-available open-source software and research datasets.
In addition, students in the program were actively engaged in community outreach activities during the five years of the program, particularly in terms of increasing awareness about university study and research careers among K-12 students in disadvantaged communities in Southern California.
Last Modified: 12/21/2021
Modified by: Padhraic Smyth
Please report errors in award information by writing to: awardsearch@nsf.gov.