AUTHOR=Caswell Jacob , Gans Jason D. , Generous Nicholas , Hudson Corey M. , Merkley Eric , Johnson Curtis , Oehmen Christopher , Omberg Kristin , Purvine Emilie , Taylor Karen , Ting Christina L. , Wolinsky Murray , Xie Gary TITLE=Defending Our Public Biological Databases as a Global Critical Infrastructure JOURNAL=Frontiers in Bioengineering and Biotechnology VOLUME=7 YEAR=2019 URL=https://www.frontiersin.org/journals/bioengineering-and-biotechnology/articles/10.3389/fbioe.2019.00058 DOI=10.3389/fbioe.2019.00058 ISSN=2296-4185 ABSTRACT=

Progress in modern biology is being driven, in part, by the large amounts of freely available data in public resources such as the International Nucleotide Sequence Database Collaboration (INSDC), the world's primary database of biological sequence (and related) information. INSDC and similar databases have dramatically increased the pace of fundamental biological discovery and enabled a host of innovative therapeutic, diagnostic, and forensic applications. However, as high-value, openly shared resources with a high degree of assumed trust, these repositories share compelling similarities to the early days of the Internet. Consequently, as public biological databases continue to increase in size and importance, we expect that they will face the same threats as undefended cyberspace. There is a unique opportunity, before a significant breach and loss of trust occurs, to ensure they evolve with quality and security as a design philosophy rather than costly “retrofitted” mitigations. This Perspective surveys some potential quality assurance and security weaknesses in existing open genomic and proteomic repositories, describes methods to mitigate the likelihood of both intentional and unintentional errors, and offers recommendations for risk mitigation based on lessons learned from cybersecurity.