
NSF Org: |
CCF Division of Computing and Communication Foundations |
Recipient: |
|
Initial Amendment Date: | September 5, 2013 |
Latest Amendment Date: | September 5, 2013 |
Award Number: | 1318694 |
Award Instrument: | Standard Grant |
Program Manager: |
Sankar Basu
sabasu@nsf.gov (703)292-7843 CCF Division of Computing and Communication Foundations CSE Directorate for Computer and Information Science and Engineering |
Start Date: | September 15, 2013 |
End Date: | August 31, 2017 (Estimated) |
Total Intended Award Amount: | $250,701.00 |
Total Awarded Amount to Date: | $250,701.00 |
Funds Obligated to Date: |
|
History of Investigator: |
|
Recipient Sponsored Research Office: |
1400 TOWNSEND DR HOUGHTON MI US 49931-1200 (906)487-1885 |
Sponsor Congressional District: |
|
Primary Place of Performance: |
Townsend Dr 1400 Houghton MI US 49931-1295 |
Primary Place of
Performance Congressional District: |
|
Unique Entity Identifier (UEI): |
|
Parent UEI: |
|
NSF Program(s): | DES AUTO FOR MICRO & NANO SYST |
Primary Program Source: |
|
Program Reference Code(s): |
|
Program Element Code(s): |
|
Award Agency Code: | 4900 |
Fund Agency Code: | 4900 |
Assistance Listing Number(s): | 47.070 |
ABSTRACT
Unlike traditional fast SPICE simulation techniques that rely on a variety of approximation approaches to trade off simulation accuracy for greater speed, SPICE-accurate integrated circuit (IC) simulations can truthfully predict circuit electrical behaviors, and therefore become indispensable for design and verification of nanoscale ICs. However, for post-layout nanoscale circuits, using traditional SPICE-accurate simulation techniques to encapsulate multi-million or even multi-billion devices coupled through complex parasitics can be prohibitively expensive, and thus not applicable to large IC designs, since the runtime and memory cost for solving large sparse matrix problems using direct solution methods will increase quickly with the growing circuit sizes and parasitics densities. To achieve greater simulation efficiency and capacity during post-layout simulations, preconditioned iterative solution techniques have been recently proposed to substitute the direct solution methods. However, existing preconditioned methods for post-layout circuit simulations are typically designed with various assumptions and constraints on the circuit and systems to be analyzed, which therefore cannot be effectively and reliably applied to general-purpose SPICE-accurate circuit simulations. In this research project, the PI will study efficient yet robust circuit-oriented preconditioning approaches for scalable SPICE-accurate post-layout IC simulations by leveraging recent graph sparsification research. By systematically sparsifying linear/nonlinear dynamic networks originated from dense parasitics components and complex device elements of post-layout circuits, scalable, and more importantly, parallelizable preconditioned iterative algorithms will be investigated and developed by the PI to enable much greater speed and capacity for SPICE-accurate IC simulations in both time and frequency domains.
The successful completion of this work will immediately benefit the semiconductor industries. The algorithms and methodologies to be developed through this project will be integrated into undergraduate/graduate level VLSI design/CAD courses, while the research results will be broadly disseminated to major semiconductor and EDA companies for potential industrial applications. The CAD tools developed under this research plan will also be exchanged with collaborating industrial partners. The acquired experience in the proposed research plan is also likely to contribute to computing advances in other science and engineering fields, impacting broader research areas that are related to large/complex system modeling and simulation.
PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH
Note:
When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external
site maintained by the publisher. Some full text articles may not yet be available without a
charge during the embargo (administrative interval).
Some links on this page may take you to non-federal websites. Their policies may differ from
this site.
PROJECT OUTCOMES REPORT
Disclaimer
This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.
Prior methods for simulating SPICE-accurate post-layout circuits do not scale well due to the superlinear complexity of existing sparse matrix solution algorithms, such as LU decomposition methods. To address the challenging computational bottlenecks, this research project aims to leverage emerging graph sparsification techniques and heterogeneous parallel computers for scalable SPICE-accurate post-layout integrated circuit simulations in both time and frequency domains. Recent graph sparsification research allows computing ultra-sparse graph proxies for approximating the original undirected graphs, which can immediately accelerate many numerical and graph-related algorithms.
However, integrated circuit networks usually involve not only undirected edges but also directed edges. To effectively accelerate sparse matrix solvers in circuit simulation tasks, three different types of circuits have been investigated during the four-year research study by leveraging recent graph sparsification techniques: 1) time-domain SPICE simulation of interconnect-dominant circuits, 2) time-domain SPICE simulation of transistor-dominant circuits, 3) frequency-domain harmonic balance (HB) analysis of radio-frequency (RF) circuits. We have proposed brand new circuit simulation algorithms that are highly scalable for handling extremely large circuit systems, allowing large-scale post-layout circuit designs to be analyzed in nearly-linear time and space. Our main technical contributions have been summarized as follows:
- During the first two years of this project, we have developed a series graph sparsification techniques for sparsifying large time-domain and frequency domain post-layout integrated circuits (modified nodal analysis matrices) that can be subsequently leveraged for achieving nearly-linear complexity runtime/memory cost performance. Additionally, we successfully developed performance-guided graph sparsification frameworks for time-domain and frequency-domain SPICE-accurate circuit simulations that allow automatically identifying the optimal graph sparsification configuration that can lead to optimal runtime performance.
- We have also developed a highly-parallel graph-theoretic preconditioning method for Harmonic Balance analysis of microwave and RF circuits, which achieved up to 20X speedups compared to the state-of-the-art HB simulation algorithm published in DAC’09 by Berkeley Design Automation. The new method is built upon our recent graph sparsification research as well as a sparse block matrix solver optimized for modern CPU-GPU heterogeneous computing platforms.
- We developed a nearly-linear time yet practically-efficient spectral graph sparsification algorithm that can immediately lead to the development of nearly-linear time sparse symmetric diagonally-dominant (SDD) matrix solvers and spectral graph (data) partitioning (clustering) algorithms. This work can potentially allow constructing circuit proxies with guaranteed high quality and sparsity, which will be critical for future nanoscale IC designs that involve increasingly stronger couplings due to complex parasitics components.
- We have released a preliminary software "GRASS: GRAph Spectral Sparsifier" (https://sites.google.com/mtu.edu/zhuofeng-graphspar/home). GRASS is the first highly efficient graph spectral sparsification engine that scales almost linearly with the size of the input graph Laplacian. The spectrally sparsified graphs computed by GRASS can approximately preserve the distances or effective resistances between vertices so that much faster numerical and graph-related algorithms can be developed based on these sparsified graphs. For example, spectrally sparsified social (data) networks may allow for more efficiently modeling, mining and analysis of large social (data) networks, spectrally sparsified circuit networks may lead to more efficient simulation, optimization and verification of large integrated circuit systems, etc.
- We have been trying hard to disseminate the research outcome of this research in the industrial and academic sectors by publishing in top IEEE/ACM conferences and journals. We also have delivered many research presentations to industrial partners, such as Cadence Design Systems and Intel Corporation, as well as many universities, such as University of California San Diego, Duke University, Carnegie Mellon University, The University of Pittsburgh, University at Buffalo, Lehigh University, University of Houston, Tsinghua University, Peking University, The University of Electronic Science and Technology of China, Shenzhen University, Fudan University, ShanghaiTech University, Sun Yat-Sen University, etc, to disseminate our latest research results.
The research investigation of graph sparsification techniques for IC simulations will immediately benefit the solver developments for large-scale sparse matrix problems involved in other engineering fields that require solving large linear/nonlinear partial differential equations, such as computational fluid dynamics, and computational electromagnetics. It is expected that graph sparsification related research outcome will immediately benefit other research communities that heavily rely on scientific computing, such as machine learning, data mining, computer vision, image processing, network analysis, and computational biology.
By leveraging the research results from this project, people may run much larger simulation/computation tasks on relatively small (personal) computers in the near future without introducing additional cost. Even on small, low-power mobile computing devices, traditionally very costly functions, such as circuit simulations, data clustering, matrix solvers, etc. can be potentially be executed locally using multi-core CPUs and many-core GPUs without leveraging power-consuming cloud computing facilities.
- We have been trying hard to disseminate the research outcome of this research in the industrial and academic sectors. To this end, we have released a preliminary software for graph sparsification, which allows researchers and industrial partners to immediately benefit from this research.
Last Modified: 11/22/2017
Modified by: Zhuo Feng
Please report errors in award information by writing to: awardsearch@nsf.gov.