Award Abstract # 1262416
ABI Innovation: A Systematic Computational Approach to Creating Libraries of High-Quality Mass Spectra for Unidentified Metabolites

NSF Org: DBI
Division of Biological Infrastructure
Recipient: UNIVERSITY OF NORTH CAROLINA AT CHARLOTTE
Initial Amendment Date: May 15, 2013
Latest Amendment Date: May 15, 2013
Award Number: 1262416
Award Instrument: Standard Grant
Program Manager: Peter McCartney
DBI
 Division of Biological Infrastructure
BIO
 Directorate for Biological Sciences
Start Date: September 1, 2013
End Date: August 31, 2018 (Estimated)
Total Intended Award Amount: $580,611.00
Total Awarded Amount to Date: $580,611.00
Funds Obligated to Date: FY 2013 = $580,611.00
History of Investigator:
  • Xiuxia Du (Principal Investigator)
    xdu4@uncc.edu
Recipient Sponsored Research Office: University of North Carolina at Charlotte
9201 UNIVERSITY CITY BLVD
CHARLOTTE
NC  US  28223-0001
(704)687-1888
Sponsor Congressional District: 12
Primary Place of Performance: University of North Carolina at Charlotte
9201 University City Blvd
Charlotte
NC  US  28223-0001
Primary Place of Performance
Congressional District:
12
Unique Entity Identifier (UEI): JB33DT84JNA5
Parent UEI: NEYCH3CVBTR6
NSF Program(s): ADVANCES IN BIO INFORMATICS
Primary Program Source: 01001314DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s):
Program Element Code(s): 116500
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.074

ABSTRACT

An award is made to the University of North Carolina at Charlotte to develop a computational workflow
that will create libraries of high-quality mass spectra for unknown metabolites that are observed
repeatedly by mass spectrometers in metabolomics studies. Metabolomics is a rapidly developing field of
?omics? research concerned with the high-throughput identification and quantitation of small molecule
metabolites in the metabolome. Since the metabolome constitutes a wide array of compound classes that
are crucial for the normal functioning of a biological system, the metabolomics approach promises to offer
new insights in many areas of biological investigation. Recent metabolomics research benefited greatly
from advances in mass spectrometry and chromatography. These advances allow researchers to detect
many metabolites that could not be detected previously. However, a sizable fraction of these compounds
are unknown and a new computational infrastructure is required for processing the complex mass
spectral data and identifying and characterizing these metabolites. This project addresses this need by
developing a computational workflow that will create libraries of high-quality mass spectra for unknown
compounds from many samples. These resulting libraries will enable the identification of many currently
unidentified, but commonly observed components by their spectra. Equally important, the workflow will
allow more precise quantitation of metabolites and subsequent differential analysis of metabolic profiles.
The most biologically interesting unknown compounds in the library can then be subjected to further
attempts at structure elucidation.

The project will contribute to the training of postdoctoral fellows and graduate students in bioinformatics
methods. The PI will develop modules covering metabolomics bioinformatics methods for a graduate
course. Materials developed for the class will also be made available online and presented at a
bioinformatics workshop hosted at UNC-Charlotte.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Owen Myers, Susan Sumner, Shuzhao Li, Stephen Barnes, Xiuxia Du "A detailed investigation and com- parison of the XCMS and MZmine 2 chromatogram construction and chromatographic peak detection methods for preprocessing mass spectrometry metabolomics data" Analytical Chemistry , 2017 DOI: 10.1021/acs.analchem.7b01069
Owen Myers, Susan Sumner, Shuzhao Li, Stephen Barnes, Xiuxia Du "One step forward for reducing false positive and false negative compound identifications from mass spectrometry metabolomics data: new algorithms for constructing extracted ion chromatograms and detecting chromatographic peaks" Analytical Chemistry , 2017 DOI: 10.1021/acs.analchem.7b00947
Xiuxia Du, Aleksandr Smirnov, and Wei Jia "A preparatory study of how to construct consensus mass spectra of recurrent unknown metabolites from untargeted GC-MS metabolomics data" International Journal of Mass Spectrometry , 2018

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

1. Overview

Metabolomics is a rapidly developing field of -omics research concerned with the high-throughput identification and quantification of small molecule metabolites in the metabolome. The metabolome constitutes a wide array of compound classes that are crucial for the normal functioning of a biological system. As a result, the metabolomics approach promises to offer new insights in many areas of biological investigation. 

 

Recent metabolomics research benefited greatly from advances in mass spectrometry and chromatography. These advances allow researchers to detect many metabolites that could not be detected previously. However, a sizable fraction of these compounds is unknown, and a new computational infrastructure is required for processing the complex mass spectral data and facilitating the identification of these metabolites. 

 

The PI's group addressed this need through this awarded project by having developed a streamlined computational workflow that can create libraries of high-quality mass spectra for unknown compounds from many biological samples. Furthermore, the PI's group has used this computational workflow and created such a mass spectral library for facilitating prioritization and eventual identification of currently unidentified metabolites. Equally importantly, the developed workflow allows more precise quantitation of metabolites and subsequent differential analysis of metabolic profiles. The most biologically interesting unknown compounds in the library can then be subjected to further attempts at structure elucidation.

 

2. List of grant awards, publications and presentations

With the results from this project, the PI applied for NIH funding and has been awarded two grants from the NIH Common Fund Metabolomics Program with a combined total cost at $1.53 million. In addition, the PI's group has published five journal papers and given about 35 conference/workshop oral presentations and five conference/symposium poster presentations. One journal manuscript is currently near completion and another one is in preparation. NSF funding was acknowledged in all of the publications and presentations.

 

3. Intellectual Merit

The computational workflow that resulted from this project streamlined the process of extracting reliable mass spectra of single compounds from complex raw mass spectral data and creating libraries of unknown spectra. It removed obstacles to exhaustive metabolite profiling, allowed researchers to build spectral libraries representing unknown compounds of potential interest, and paved the way for subsequent characterization and structure elucidation of these unknowns. Eventually, knowledge gained about the unknown compounds enables researchers to: 1) synthesize them, 2) discover new metabolic pathways, 3) conduct further studies to understand the roles of these metabolites in metabolic pathways, 4) discover interactions between the metabolome and other -omes (i.e., genome, transcriptome, and proteome), 5) yield valuable insights into underlying biochemical processes, and 6) ultimately contribute to advances in functional genomics. The computational workflow could especially benefit plant science and related areas because the plant kingdom is estimated to have more than 200,000 metabolites of enormous biochemical diversity, and many of them are unknown. 

 

4. Broader Impacts

The project has enabled the PI to train next-generation bioinformaticians by training two postdoctoral researchers, one Ph.D. student, and four master's students. In addition, the PI has been invited to speak for six consecutive years (2013-2018) at the UAB metabolomics workshop, which was sponsored by the National Institute of General Medical Sciences (NIGMS) as part of the NIH Common Fund Metabolomics Initiative. Each year, a total of about 40 participants attended the workshop. These included graduate students, basic science and clinical fellows, junior and mid-career faculty members, research associates, and directors of facilities. These six workshops trained a large number of scientists who could start using metabolomics in their own research. Last but not the least, the PI has used metabolomics examples and datasets accumulated over the course of the project and used them in teaching undergraduate and graduate courses including Machine Learning for Bioinformatics in Fall 2014, Fall 2015, Fall 2016, Fall 2017, Statistics for Bioinformatics in Spring 2015, Spring 2016, and BINF 4211: Applied Data Mining for Bioinformatics in Spring 2018. 

 


Last Modified: 01/08/2019
Modified by: Xiuxia Du

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page