FAILURE-RESISTANT SYSTEMS (FRS)

A Joint Initiative between NSF and SRC

Synopsis

The National Science Foundation (NSF) and the Semiconductor Research Corporation (SRC) have agreed to embark on a new collaborative research program to address compelling research challenges in failure resistant systems that are of paramount importance to industry, academia, and society at large.

New approaches in the design of electronic circuits and systems are needed for products and services that continue to operate correctly in the presence of transient, permanent, or systematic failures. From large information processing systems supporting communications and computation, to small embedded systems targeting medical and automotive applications, whole industries are facing the challenge of improving the reliability of systems.

Increasing miniaturization and integrated circuit fabrication processes are creating a tension between reliability and efficiency. Higher rates of faults, variation, and degradation due to aging in integrated circuits are forcing systems engineers to assume that devices and circuits may not always perform as designed. More and more, systems are constructed using IP blocks (3rd party Intellectual Property) from different sources, contributing further to unpredictable behavior. Thus behavior under adverse conditions may not be fully known in deployed systems. Current techniques for ensuring reliability, such as voltage and clock rate margins, replication, and disk-based check-pointing will not be able to satisfy the competing requirements for future integrated circuits. These techniques typically operate only at one level of the system stack, yet layers from devices to applications all contribute to system reliability. Such single-layer techniques must be used under worst-case assumptions about the other layers in the stack. This potentially leads to inefficiencies that will make these techniques impractical in future fabrication processes.

A system-level cross-layer approach to reliability, encompassing failure mechanisms of both digital and analog components, has the potential to deliver high reliability with significantly lower power and performance overheads than current single-layer techniques. By distributing reliability across the system design stack, cross-layer approaches can take advantage of the information available at each level, including even application-level knowledge, to efficiently tolerate errors, aging, and variation. This will allow handling of different physical effects at the most efficient stack layer, and can be adapted to varying application needs, operating environments, and changing hardware state.

Fundamental new advances in techniques for designing and developing systems resilient to failure could have a significant impact on multiple industries and boost their competitiveness on a global scale, helping to transform market segments and translate research results into practice.

Program contacts

Name	Email	Phone	Organization
Sankar Basu CISE/CCF Program Director	sabasu@nsf.gov	(703) 292-7843	CISE/CCF
Ahmed Louri CISE/CCF Program Director	alouri@nsf.gov	(703) 292-8238	CISE/CCF
George Haddad ENG/ECCS Program Director	ghaddad@nsf.gov	(703) 292-8897	ENG/ECCS
Anupama B. Kaul ENG/ECCS Program Director	akaul@nsf.gov	(703) 292-8153	ENG/ECCS

Awards made through this program

Browse projects funded by this program

Map of recent awards made through this program

Program status: Archived

Program 12-556 was archived.

Program guidelines

Previous guidelines NSF 12-556

Published:

April 23, 2012

FAILURE-RESISTANT SYSTEMS (FRS)

Status: Archived

Archived funding opportunity

Important information for proposers and award recipients

Synopsis

Program contacts

Awards made through this program

FAILURE-RESISTANT SYSTEMS (FRS)

Status: Archived

Archived funding opportunity

Important information for proposers and award recipients

Synopsis

Program contacts

Awards made through this program

Organization(s)