Random access DNA memory using Boolean search in an archival file storage system.
Journal
Nature materials
ISSN: 1476-4660
Abbreviated title: Nat Mater
Country: England
NLM ID: 101155473
Publication information
Publication date:
09 2021
History:
received: 2020-04-20
accepted: 2021-04-26
pubmed: 2021-06-12
medline: 2021-09-28
entrez: 2021-06-11
Status:
ppublish
Abstract
DNA is an ultrahigh-density storage medium that could meet exponentially growing worldwide demand for archival data storage if DNA synthesis costs declined sufficiently and if random access of files within exabyte-to-yottabyte-scale DNA data pools were feasible. Here, we demonstrate a path to overcome the second barrier by encapsulating data-encoding DNA file sequences within impervious silica capsules that are surface labelled with single-stranded DNA barcodes. Barcodes are chosen to represent file metadata, enabling selection of sets of files with Boolean logic directly, without use of amplification. We demonstrate random access of image files from a prototypical 2-kilobyte image database using fluorescence sorting with selection sensitivity of one in 10
Identifiers
pubmed: 34112975
doi: 10.1038/s41563-021-01021-3
pii: 10.1038/s41563-021-01021-3
pmc: PMC8564878
mid: NIHMS1721145
Chemical substances
Silicon Dioxide
7631-86-9
DNA
9007-49-2
Publication types
Journal Article
Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.
Languages
eng
Citation subsets
IM
Pagination
1272-1280
Grants
Agency: NCI NIH HHS
ID: F32 CA236425
Country: United States
Comments and corrections
Type: CommentIn
Copyright information
© 2021. The Author(s), under exclusive licence to Springer Nature Limited.