Award Abstract # 1436827
Accelerator-Rich Architectures with Applications to Healthcare

NSF Org: CCF
Division of Computing and Communication Foundations
Recipient: UNIVERSITY OF CALIFORNIA, LOS ANGELES
Initial Amendment Date: June 17, 2014
Latest Amendment Date: June 14, 2016
Award Number: 1436827
Award Instrument: Continuing Grant
Program Manager: Sankar Basu
sabasu@nsf.gov
 (703)292-7843
CCF
 Division of Computing and Communication Foundations
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: July 1, 2014
End Date: June 30, 2018 (Estimated)
Total Intended Award Amount: $750,000.00
Total Awarded Amount to Date: $899,999.00
Funds Obligated to Date: FY 2014 = $450,000.00
FY 2015 = $300,000.00

FY 2016 = $149,999.00
History of Investigator:
  • Jason Cong (Principal Investigator)
    cong@cs.ucla.edu
  • Mau-Chung Frank Chang (Co-Principal Investigator)
  • Glenn Reinman (Co-Principal Investigator)
  • Eleazar Eskin (Co-Principal Investigator)
  • Alex Bui (Co-Principal Investigator)
Recipient Sponsored Research Office: University of California-Los Angeles
10889 WILSHIRE BLVD STE 700
LOS ANGELES
CA  US  90024-4200
(310)794-0102
Sponsor Congressional District: 36
Primary Place of Performance: University of California-Los Angeles, Computer Science Dept.
BOX 951596, 4731J BH
Los Angeles
CA  US  90095-1596
Primary Place of Performance
Congressional District:
36
Unique Entity Identifier (UEI): RN64EPNH8JC6
Parent UEI:
NSF Program(s): Information Technology Researc,
Expeditions in Computing,
Software & Hardware Foundation
Primary Program Source: 01001415DB NSF RESEARCH & RELATED ACTIVIT
01001516DB NSF RESEARCH & RELATED ACTIVIT

01001617DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 7723, 7945, 8091, 8624
Program Element Code(s): 164000, 772300, 779800
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

Many healthcare applications present significant computational challenges. For example, the computational demand for personalized cancer treatment is prohibitively high for the general-purpose computing technologies, as tumor heterogeneity requires great sequencing depths, structural aberrations are difficult to detect with today's methods, and the tumor has the ability to evolve i.e., the same tumor might be assayed a great many times during the course of treatment. The goal of this project is to apply the domain-specific customized computing techniques developed by the Center for Domain-Specific Computing (CDSC) at UCLA to greatly accelerate computation for some key healthcare applications.

The CDSC, established in 2009 with the support of the NSF, looks beyond parallelization, and focuses on domain-specific customization as the next disruptive technology for power-performance efficiency improvement. In the past four years, CDSC has demonstrated significant performance and energy efficiency with innovation in developing customizable heterogeneous computing technologies. The current proposal under the NSF Innovation Transition program leverages the research results from CDSC, and focuses on key research problems and solutions to make domain-specific customizable computing feasible and practical for innovation transition to the industry, Specifically, the project will develop accelerator-rich architectures along with unified adaptive runtime systems for personalized cancer treatment, medical image processing, and will enable deployment in several energy efficient programmable platforms capable of handling huge volumes of state of the art real time patient data.

The center will continue its already successful outreach program, through a partnership with the UCLA Center for Excellence in Engineering and Diversity, to involve highly diversified high school and undergraduate students for summer research. The success of our project will enable significant advances in medical imaging analysis and personalized cancer treatment, which will greatly improve healthcare quality while reducing cost. The participation of the industrial partner in this InTrans project will greatly facilitate the innovation transition of research results from this project to industry for energy-efficient computing.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

(Showing: 1 - 10 of 30)
[AJHG16H] Farhad Hormozdiari, Eun Yong Kang, Michael Bilow, Eyal Ben-David, Chris Vulpe, Stela McLachlan, Aldons J. Lusis, Buhm Han, Eleazar Eskin "Imputing phenotypes for genome-wide association studies" American Journal of Human Genetics , 2016
[AJHG17H] Hormozdiari, F., Zhu, A., Kichaev, G., Ju, C.J.T., Segrè, A.V., Joo, J.W.J., Won, H., Sankararaman, S., Pasaniuc, B., Shifman, S. and Eskin, E. "Widespread allelic heterogeneity in complex traits" The American Journal of Human Genetics, , v.100 , 2017 , p.789-802
[B16D] Dat Duong, Jennifer Zou, Farhad Hormozdiari, Jae-hoon Sul, Jason Ernst, Buhm Han, Eleazar Eskin "Using genomic annotations increases statistical power to detect eGenes" Bioinformatics , v.32 , 2016 , p.i156-i163
[bioRxiv16M] Mangul, S., Yang, H.T., Strauli, N., Gruhl, F., Daley, T., Christenson, S., Andersen, A.W., Spreafico, R., Rios, C., Eng, C. and Smith, A.D. "Dumpster diving in RNA-sequencing to find the source of every last read" bioRxiv, p.053041 , 2016
[CACM15E] Eleazar Eskin "Discovering Genes Involved in Disease and the Mystery of Missing Heritability" Communications of the ACM , v.55 , 2015 , p.80 10.1145/2817827
[CBM17S] Shen S., Han S.X., Petousis P., Weiss R.E., Meng F., Bui A.A.T., Hsu W. "A Bayesian model for estimating multi-state disease progression." Computers in Biology and Medicine , v.81 , 2017 , p.111-120
Duggan, Nóirín, Egil Bae, Shiwen Shen, William Hsu, Alex Bui, Edward Jones, Martin Glavin, and Luminita Vese "A Technique for Lung Nodule Candidate Detection in CT Using Global Minimization Methods" In Energy Minimization Methods in Computer Vision and Pattern Recognition, Springer International Publishing , 2015 , p.pp. 478-4
[EL16L] Yilei Li, Wei-Han Cho, Yuan Du, Jieqiong Du, Po-Tsang Huang, Sheau Jiung Lee, and Mau-Chung Frank Chang "Carrier Synchronization for Multiband RF Interconnect (MRFI) to Facilitate Chip-to-Chip Wireline Communication" IET Electronics Letters, Apr 2016 , v.52 , 2016 , p.535-537
Furlotte, N. A., & Eskin, E. "Efficient multiple trait association and estimation of genetic correlation using the matrix-variate linear mixed-model" Genetics. doi:10.1534/genetics.114.17144 , 2015
[G15F] Nicholas A. Furlotte, Eleazar Eskin "Efficient Multiple Trait Association and Estimation of Genetic Correlation Using the Matrix-Variate Linear Mixed-Model" Genetics , v.203 , 2015 , p.114 10.1534/genetics.114.171447
[G16J] Jong Wha J Joo, Eun Yong Kang, Elin Org, Nick Furlotte, Brian Parks, Farhad Hormozdiari, Aldons J. Lusis, Eleazar Eskin "Efficient and Accurate MultiplePhenotypeRegression Method for High Dimensional Data Considering Population Structure" Genetics , v.204 , 2016 , p.1379-1390
(Showing: 1 - 10 of 30)

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

This project under the NSF InTrans (Innovation Transition) Program extended the previous project “Customizable Domain-Specific Computing” awarded in 2009 under the NSF Expeditions in Computing Program, which looked beyond parallelization and focused on domain-specific customization as a new disruptive technology for significant power-performance efficiency improvement. The proposed research in both projects explored the extensive use of accelerators, as custom-designed accelerators often provide 10-100X performance/energy efficiency over the general-purpose processors. Such an accelerator-rich architecture (ARA) presents a fundamental departure from the classical von Neumann architecture, which a common pipeline for the execution of different instructions, providing an efficient solution when the computing resource is scarce. In contrast, the ARA features heterogeneity and customization for energy efficiency; this is better suited for today’s technology where the silicon resource is abundant and energy consumption is the limiting constraint. This project greatly influenced the research community and the computing industry in making recent shift towards customizable computing, e.g. evident by the large-scale deployment of FPGAs in Microsoft’s and Amazon’s datacenters.

In this project, the customization is carried out at multiple levels of computing hierarchy, including the chip-level, server-node level, and datacenter level. At the chip-level, the research proposed the use of customizable ARAs, where a sea of heterogeneous accelerators can be programmed and composed for customized acceleration, in junction with a customizable memory hierarchy and network-on-chip (see Figure 1). It demonstrated orders-of-magnitude energy efficient improvement computational kernels in multiple application domains.  At the server-node level, it investigates various CPU+FPGA computing platforms, where off-the-shelf FPGAs (field-programmable gate-arrays) can be used to implement customized accelerators in a flexible and cost-efficient way.  It provided detailed quantitative characterization of multiple commonly used CPU-FPGA platforms, and demonstrated substantially speedups on a wide range of applications, from string matching, to data compression, and to deep learning.   At the datacenter level, it explored multiple heterogeneous datacenter prototypes with FPGA accelerators, including CPU-only clusters with Xeon or Atom processors, a Xeon cluster with FPGA accelerator attached to the PCI-E bus, and a cluster of embedded ARM processors with on-chip FPGA fabrics (see Figure 2).  The research demonstrated several times energy reduction with FPGA acceleration for multiple big-data applications when compared to the baseline CPU-only cluster.

In parallel to the ARA architecture exploration, this research devoted substantial effort to develop the compilation and runtime support for ARAs.  In particular, significant progress has been made on source-to-source transformations and optimizations, such as the automated generation of various micro-architecture templates (e.g. see Figure 3) and automated design space exploration, so that one can make effective use of modern high-level synthesis tools for high-performance accelerator designs with high productivity.   It also developed the efficient runtime systems for deploying FPGA and GPU accelerators into state-of-the-art big-data computing frameworks like Hadoop and Spark. It supports both node-level accelerator management for CPU+FPGA+GPU co-scheduling (see Figure 4) and cluster-level accelerator management for global accelerator sharing/management.

Finally, this project demonstrated the benefits of ARAs in two important healthcare application domains.   One is precision medicine, where the research focused on accelerating the next-generation sequencing alignment and analysis pipeline for discovering the genomic causes of various diseases, which suffered long computation time. The FPGA and GPU accelerators developed in this project lead to 5-25X acceleration for multiple compute-intensive kernels in the pipeline (e.g. see Figure 5).  Another application domain is medical imaging.  In particular, the research focused on the automated detection of lung cancer, including 1) super-resolution enhancement of suspected indeterminate pulmonary nodules (IPNs), denoising the appearance of nodules to better characterize the spatial/visual properties for classification as benign/malignant; and 2) convolutional neural networks for identifying IPNs in low-dose computed tomography studies (see Figure 6). These efforts are helping to advance strategies for making lung cancer screening both more consistent and scalable through computer-aided detection techniques powered by various customized accelerators from this research.

In terms of broader impact, this project involved 31 graduate students and 5 postdocs.   The multi-disciplinary nature of the research enabled these young researchers and engineers to be well positioned as the leaders in the new era of heterogeneous accelerated computing.    Moreover, this project led to one book, over 30 journal publications, over 50 conference publications, many invited talks and keynote speeches, which greatly facilitated the wide dissemination of the knowledge obtained from this research.  It also led to multiple open-source releases, such as the ARA simulator, various accelerator implementations, multiple source-to-source compilation tools and runtime management tools for FPGA and GPU accelerator design and deployment, and several application pipelines.   At the start of this InTrans award, Intel Corporation was the sole co-sponsor.  At the end of this project, there are over ten companies making financial contributions at various levels,  providing another channel of know transfer with the impact to the industry.

 


Last Modified: 10/18/2018
Modified by: Jason Cong

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page