Award Abstract # 1807809
SemiSynBio: Nucleic Acid Memory

NSF Org: ECCS
Division of Electrical, Communications and Cyber Systems
Recipient: BOISE STATE UNIVERSITY
Initial Amendment Date: July 16, 2018
Latest Amendment Date: April 11, 2022
Award Number: 1807809
Award Instrument: Continuing Grant
Program Manager: Usha Varshney
ECCS
 Division of Electrical, Communications and Cyber Systems
ENG
 Directorate for Engineering
Start Date: July 15, 2018
End Date: June 30, 2023 (Estimated)
Total Intended Award Amount: $1,125,000.00
Total Awarded Amount to Date: $1,125,000.00
Funds Obligated to Date: FY 2018 = $375,000.00
FY 2019 = $375,000.00

FY 2020 = $375,000.00
History of Investigator:
  • Tim Andersen (Principal Investigator)
    tim@cs.boisestate.edu
  • Wan Kuang (Co-Principal Investigator)
  • Elton Graugnard (Co-Principal Investigator)
  • Eric Hayden (Co-Principal Investigator)
  • William Hughes (Former Principal Investigator)
  • Tim Andersen (Former Co-Principal Investigator)
Recipient Sponsored Research Office: Boise State University
1910 UNIVERSITY DR
BOISE
ID  US  83725-0001
(208)426-1574
Sponsor Congressional District: 02
Primary Place of Performance: Boise State University
Boise
ID  US  83725-1135
Primary Place of Performance
Congressional District:
02
Unique Entity Identifier (UEI): HYWTVM5HNFM3
Parent UEI: HYWTVM5HNFM3
NSF Program(s): SemiSynBio - Semicon Synth Bio,
Genetic Mechanisms
Primary Program Source: 01001819DB NSF RESEARCH & RELATED ACTIVIT
01002021DB NSF RESEARCH & RELATED ACTIVIT

01001920DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 7465, 108E, 9150
Program Element Code(s): 061Y00, 111200
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.041

ABSTRACT

The rapid proliferation of cloud computing and the emergence of big data for massive scientific, financial, governmental, and genetic records is creating an information storage crisis. These data, once generated, cascade through the information storage lifecycle -- from primary storage media in the form of hard disks and solid-state drives to archival media such as magnetic tape. While innovations in information density, stability, and energy consumption routinely occur, existing memory materials are approaching their physical and economic finish lines. As imagined by the Semiconductor Synthetic Biology (SemiSynBio) Roadmap, DNA-based massive information storage is a brand new start for memory manufacturing. As a result, the research team proposes to pioneer a cold storage paradigm by designing, building, and testing two accessible, editable, and non-volatile memory technologies made from DNA. Inspired by DNA circuits and made possible by state-of-the-art optical physics, the team will: (1) biologically synthesize DNA molecules, (2) engineer substrates made from said molecules, (3) write digital information onto the substrates using additional DNA molecules, (4) minimize encoding and decoding errors using computer science algorithms, and (5) read, as well as edit digital information onto the substrates using reversible DNA binding. In full support of this interdisciplinary project, the research team includes expertise in: DNA nanotechnology, nanoscale characterization, optical physics, biologically-inspired algorithms, and synthetic biology. Modeled after the faculty collaboration, a new cadre of students will work and study at the confluence of the biological, computational, and engineering sciences in anticipation of the emerging field called Nucleic Acid Memory (NAM). As active participants in and co-owners of a Vertically Integrated Project called NAM, undergraduate and graduate students will enroll into a multi-year and multi-disciplinary research team that provides ongoing course and teaching credit.

The focal points of this proposal are two storage medium prototypes, digital Nucleic Acid (dNAM) and sequence Nucleic Acid Memory (seqNAM). Each offer a novel approach to coding information using DNA, and both use super-resolution microscopy to read information. In dNAM, information is encoded into defined spatial arrangements of DNA sequences on top of addressable DNA origami nanostructures, called NAM storage nodes. DNA origami provides a convenient pathway and a proven approach to high-yield and rapid prototyping of NAM node structures. Staple strands will be extended from the NAM node structures with a unique sequence for site-specific attachment of NAM data strands. When bound, data strands serve as docking sites for complementary data imager strands, which are employed in a DNA-based form of super-resolution microscopy (SRM) called DNA-PAINT. DNA PAINT is a stochastic super-resolution imaging technique that uses repetitive, transient binding of fluorescently labeled data imager strands to circumvent the diffraction limit of light. Thus, data imager strands act as the read head and reveal the state of each site of the NAM storage node with better than 7 nm resolution. Binary states at each data cell can be defined by the presence (1) or absence (0) of the NAM data strand, as determined by SRM. Increasing site-specific bit-density from 1 to 3 bits can be simply achieved by multiple orthogonal sequences. Editing of data strands is performed by either adding a required data strand to a vacant data cell or by removing an existing data strand via toehold-mediated strand displacement. Built upon a similar storage node platform, seqNAM employs two data cells to arrange data strands into ordered arrays. In seqNAM, information is encoded within portions of the data strands that remain single stranded. The sequences of the data strands are read using a multi-color super-resolution sequencing (SRS) process that uses a library of locked nucleic acid imager strands. Editing is performed by removing the target data strands with complementary sequences using toehold-mediated strand invasion and then adding the replacement data strands. seqNAM exceeds dNAM by storing information within DNA sequences at a potentially higher density. In addition, it creates a new enzyme-free sequencing platform.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Dickinson, George D. and Mortuza, Golam Md and Clay, William and Piantanida, Luca and Green, Christopher M. and Watson, Chad and Hayden, Eric J. and Andersen, Tim and Kuang, Wan and Graugnard, Elton and Zadegan, Reza and Hughes, William L. "An alternative approach to nucleic acid memory" Nature Communications , v.12 , 2021 https://doi.org/10.1038/s41467-021-22277-y Citation Details
Green, Christopher M. and Hughes, William L. and Graugnard, Elton and Kuang, Wan "Correlative Super-Resolution and Atomic Force Microscopy of DNA Nanostructures and Characterization of Addressable Site Defects" ACS Nano , 2021 https://doi.org/10.1021/acsnano.1c01976 Citation Details
Mortuza, Golam Md and Guerrero, Jorge and Llewellyn, Shoshanna and Tobiason, Michael D. and Dickinson, George D. and Hughes, William L. and Zadegan, Reza and Andersen, Tim "In-vitro validated methods for encoding digital data in deoxyribonucleic acid (DNA)" BMC Bioinformatics , v.24 , 2023 https://doi.org/10.1186/s12859-023-05264-6 Citation Details
Piantanida, Luca and Hughes, William L. "A PCR-free approach to random access in DNA" Nature Materials , v.20 , 2021 https://doi.org/10.1038/s41563-021-01089-x Citation Details
Tobiason, Michael and Yurke, Bernard and Hughes, William L. "Generation of DNA oligomers with similar chemical kinetics via in-silico optimization" Communications Chemistry , v.6 , 2023 https://doi.org/10.1038/s42004-023-01026-w Citation Details

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

 

In support of the Semiconductor Synthetic Biology Roadmap, this research project designed, built, and tested technologies for archival data storage using DNA in an approach called digital Nucleic Acid Memory or dNAM. dNAM encodes data as the presence or absence of DNA strands that are precisely positioned within a DNA origami breadboard (see image) with nanometer resolution. The goal is to read the encoded information, i.e. the presence or absence of the DNA data strands, using super-resolution microscopy (SRM), by introducing imager strands bound to fluorescent markers that selectively bind to the data strands of the origami structure, and can then be viewed optically using SRM thus revealing the presence or absence of the data strands. 

In developing the dNAM technology, the project addressed several research challenges and developed new knowledge in five key research areas as follows.

(1) The project developed novel error tolerant information encoding and decoding algorithms for storing information on the origami structures. The algorithms integrated information from the SRM image combined with error correction codes to target likely errors and correct up to 9 separate bit errors in a 48-bit self-contained data block with high reliability.

(2) The project developed new understanding of the critical issues impacting the design of DNA sequences that form origami structures and data strands, and created a software utility that automates DNA sequence optimization and design.

(3) The project developed techniques for creating larger origami structures with reduced fabrication costs and high yields, enabling the storage of larger volumes of data.

(4) The project developed machine learning-based algorithms for increasing the resolution of SRM images, and for localizing imager strands on the origami structure, improving the robustness and accuracy of binary data acquisition.

(5) The project achieved sub 5nm resolution for imager strand localization, approaching the physical limit for data density.

These advances taken together were leveraged to create a prototype origami structure capable of storing 80 bits of information per origami. The origami design was validated by encoding a small file across multiple origami structures, and using SRM to recover the encoded data with high reliability, providing a proof of concept.  

The project also supported the creation of a research-based course (called a Vertically Integrated Projects course, or VIP) targeted towards undergraduate students. In this course, several cadres of diverse undergraduate students were trained to work along the interface of semiconductor manufacturing and synthetic biology. The students developed summer research projects performing synthesis, purification, and characterization of dNAM nodes and FRET systems. The students attended the Undergraduate Research Showcase where they presented their work.

 

 

 


Last Modified: 10/28/2023
Modified by: Tim L Andersen

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page