Better together: Elements of successful scientific software development in a distributed collaborative community.


Journal

PLoS computational biology
ISSN: 1553-7358
Titre abrégé: PLoS Comput Biol
Pays: United States
ID NLM: 101238922

Informations de publication

Date de publication:
05 2020
Historique:
entrez: 5 5 2020
pubmed: 5 5 2020
medline: 28 7 2020
Statut: epublish

Résumé

Many scientific disciplines rely on computational methods for data analysis, model generation, and prediction. Implementing these methods is often accomplished by researchers with domain expertise but without formal training in software engineering or computer science. This arrangement has led to underappreciation of sustainability and maintainability of scientific software tools developed in academic environments. Some software tools have avoided this fate, including the scientific library Rosetta. We use this software and its community as a case study to show how modern software development can be accomplished successfully, irrespective of subject area. Rosetta is one of the largest software suites for macromolecular modeling, with 3.1 million lines of code and many state-of-the-art applications. Since the mid 1990s, the software has been developed collaboratively by the RosettaCommons, a community of academics from over 60 institutions worldwide with diverse backgrounds including chemistry, biology, physiology, physics, engineering, mathematics, and computer science. Developing this software suite has provided us with more than two decades of experience in how to effectively develop advanced scientific software in a global community with hundreds of contributors. Here we illustrate the functioning of this development community by addressing technical aspects (like version control, testing, and maintenance), community-building strategies, diversity efforts, software dissemination, and user support. We demonstrate how modern computational research can thrive in a distributed collaborative community. The practices described here are independent of subject area and can be readily adopted by other software development communities.

Identifiants

pubmed: 32365137
doi: 10.1371/journal.pcbi.1007507
pii: PCOMPBIOL-D-19-01408
pmc: PMC7197760
doi:

Types de publication

Journal Article Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov't Research Support, U.S. Gov't, Non-P.H.S. Review

Langues

eng

Sous-ensembles de citation

IM

Pagination

e1007507

Subventions

Organisme : NIGMS NIH HHS
ID : R01 GM099827
Pays : United States
Organisme : NHLBI NIH HHS
ID : R01 HL122010
Pays : United States
Organisme : NIGMS NIH HHS
ID : R01 GM080403
Pays : United States
Organisme : NIGMS NIH HHS
ID : R01 GM117189
Pays : United States
Organisme : Howard Hughes Medical Institute
Pays : United States
Organisme : NIAID NIH HHS
ID : T32 AI007244
Pays : United States
Organisme : NCI NIH HHS
ID : F32 CA189246
Pays : United States
Organisme : NIGMS NIH HHS
ID : R01 GM099842
Pays : United States
Organisme : NIDDK NIH HHS
ID : R01 DK097376
Pays : United States
Organisme : NIGMS NIH HHS
ID : R01 GM127578
Pays : United States
Organisme : NIGMS NIH HHS
ID : R01 GM073151
Pays : United States
Organisme : NIGMS NIH HHS
ID : R35 GM131923
Pays : United States
Organisme : NIGMS NIH HHS
ID : R01 GM092802
Pays : United States
Organisme : NIGMS NIH HHS
ID : R01 GM117968
Pays : United States
Organisme : NIAID NIH HHS
ID : R21 AI121799
Pays : United States
Organisme : NIGMS NIH HHS
ID : R01 GM088277
Pays : United States
Organisme : NIGMS NIH HHS
ID : R01 GM078221
Pays : United States
Organisme : NIGMS NIH HHS
ID : R01 GM098101
Pays : United States
Organisme : NIGMS NIH HHS
ID : RL1 GM084433
Pays : United States
Organisme : NIGMS NIH HHS
ID : T32 GM008403
Pays : United States
Organisme : NIGMS NIH HHS
ID : R35 GM122579
Pays : United States
Organisme : NIGMS NIH HHS
ID : R01 GM110089
Pays : United States
Organisme : NIAID NIH HHS
ID : R01 AI113867
Pays : United States
Organisme : NIGMS NIH HHS
ID : R01 GM121487
Pays : United States
Organisme : NIAID NIH HHS
ID : U19 AI117905
Pays : United States
Organisme : NIAID NIH HHS
ID : UM1 AI100663
Pays : United States
Organisme : NIGMS NIH HHS
ID : R35 GM122517
Pays : United States
Organisme : NCI NIH HHS
ID : RL1 CA133832
Pays : United States
Organisme : NCI NIH HHS
ID : R21 CA219847
Pays : United States
Organisme : NIGMS NIH HHS
ID : R01 GM084453
Pays : United States

Déclaration de conflit d'intérêts

The Rosetta software has been licensed to numerous non-profit and for-profit organizations. Rosetta Licensing is managed by UW CoMotion, and royalty proceeds are managed by the RosettaCommons. Under institutional participation agreements between the University of Washington, acting on behalf of the RosettaCommons, their respective institutions may be entitled to a portion of revenue received on licensing Rosetta software including programs described here. Baker, Malmström, Yarov-Yarovoy, Gront, Meiler, Whitehead, Schueler-Furman, King, Gray, Sgourakis, Lindert, Strauss, Karanicolas, Bonneau, Sammond, Kortemme, and Bradley are unpaid board members of the RosettaCommons. As members of the Scientific Advisory Board of Cyrus Biotechnology, Baker and Gray are granted stock options. Yifan Song, Indigo C. King, Steven M. Lewis, Brandon Frenz, Karen Khar and Ryan Pavlovicz are currently employed at Cyrus Biotechnology with granted stock options. Cyrus Biotechnology distributes the Rosetta software. Brian D. Weitzner and Scott E. Boyken hold equity in Lyell Immunopharma. Vikram K. Mulligan is a co-founder of and shareholder in Menten Biotechnology Labs, Inc. The content of this manuscript is relevant to work performed at Lyell and Menten. Neil P. King is a co-founder and shareholder of Icosavax, Inc., a biotech company developing nanoparticle vaccines. Justin B. Siegel is a co-founder and shareholder of Digestiva, Inc. and PvP Biologics Inc. David Baker is a co-founder, shareholder, or advisor to the following companies: ARZEDA, PvP Biologics, Cyrus Biotechnology, Cue Biopharma, Icosavax, Neoleukin Therapeutics, Lyell Immunotherapeutics, Sana Biotechnology, and A-Alpha Bio.

Références

Proteins. 2007;69 Suppl 8:118-28
pubmed: 17894356
Bioinformatics. 2010 Mar 1;26(5):689-91
pubmed: 20061306
Science. 2017 Dec 15;358(6369):1461-1466
pubmed: 29242347
Proteins. 1999 Jan 1;34(1):82-95
pubmed: 10336385
Methods Enzymol. 2019;614:321-362
pubmed: 30611429
Science. 2016 Dec 9;354(6317):1240-1241
pubmed: 27940837
Nucleic Acids Res. 2009 Jun;37(10):e73
pubmed: 19389725
Biochemistry. 2016 Aug 30;55(34):4748-63
pubmed: 27490953
J Mol Biol. 1997 Apr 25;268(1):209-25
pubmed: 9149153
Structure. 2013 Oct 8;21(10):1735-42
pubmed: 24035711
Proteins. 2009;77 Suppl 9:89-99
pubmed: 19701941
Bioinformatics. 2017 Sep 01;33(17):2765-2767
pubmed: 28481970
PLoS One. 2011;6(8):e23294
pubmed: 21887241
Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W526-31
pubmed: 15215442
Proc Natl Acad Sci U S A. 2018 Mar 13;115(11):2561-2562
pubmed: 29531033
PLoS One. 2011;6(8):e22431
pubmed: 21909349
PeerJ. 2016 Jul 28;4:e2248
pubmed: 27547555
Prog Nucl Magn Reson Spectrosc. 2018 Jun - Aug;106-107:1-25
pubmed: 31047599
Methods Enzymol. 2011;487:545-74
pubmed: 21187238
PLoS One. 2013 May 22;8(5):e63906
pubmed: 23717507
PLoS One. 2011;6(6):e20161
pubmed: 21731610
Nature. 2019 Jul;571(7763):133-134
pubmed: 31263262
Genome Res. 2011 Nov;21(11):1981-94
pubmed: 21824995
Proc Natl Acad Sci U S A. 2011 Nov 22;108(47):18949-53
pubmed: 22065763
Sci STKE. 2004 Feb 03;2004(219):pl2
pubmed: 14872095
Biochem Mol Biol Educ. 2019 Mar;47(2):133-139
pubmed: 30638297
Proteins. 2001;Suppl 5:119-26
pubmed: 11835488
Proteins. 1999;Suppl 3:171-6
pubmed: 10526365
Bioinformatics. 2015 Dec 15;31(24):4023-5
pubmed: 26315900
J Chem Theory Comput. 2017 Jun 13;13(6):3031-3048
pubmed: 28430426
PLoS Comput Biol. 2017 Dec 7;13(12):e1005837
pubmed: 29216185
Cogn Sci. 2018 Jan;42(1):129-167
pubmed: 28213928
Nature. 2010 Aug 5;466(7307):756-60
pubmed: 20686574

Auteurs

Julia Koehler Leman (J)

Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, United States of America.
Dept of Biology, New York University, New York, NY, United States of America.

Brian D Weitzner (BD)

Dept of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, United States of America.
Dept of Biochemistry, University of Washington, Seattle, WA, United States of America.
Institute for Protein Design, University of Washington, Seattle, WA, United States of America.
Lyell Immunopharma, Seattle, WA, United States of America.

P Douglas Renfrew (PD)

Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, United States of America.

Steven M Lewis (SM)

Dept of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States of America.
Dept of Biochemistry, Duke University, Durham, NC, United States of America.
Cyrus Biotechnology, Seattle, WA United States of America.

Rocco Moretti (R)

Dept of Chemistry, Vanderbilt University, Nashville, TN, United States of America.

Andrew M Watkins (AM)

Dept of Biochemistry, Stanford University School of Medicine, Stanford CA, United States of America.

Vikram Khipple Mulligan (VK)

Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, United States of America.
Dept of Biochemistry, University of Washington, Seattle, WA, United States of America.
Institute for Protein Design, University of Washington, Seattle, WA, United States of America.

Sergey Lyskov (S)

Dept of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, United States of America.

Jared Adolf-Bryfogle (J)

Dept of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA, United States of America.

Jason W Labonte (JW)

Dept of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, United States of America.
Dept of Chemistry, Franklin & Marshall College, Lancaster, PA, United States of America.

Justyna Krys (J)

Dept of Chemistry, University of Warsaw, Warsaw, Poland.

Christopher Bystroff (C)

Dept of Biological Sciences, Rensselaer Polytechnic Institute, Troy, NY, United States of America.

William Schief (W)

Dept of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA, United States of America.

Dominik Gront (D)

Dept of Chemistry, University of Warsaw, Warsaw, Poland.

Ora Schueler-Furman (O)

Dept of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel.

David Baker (D)

Dept of Biochemistry, University of Washington, Seattle, WA, United States of America.
Institute for Protein Design, University of Washington, Seattle, WA, United States of America.

Philip Bradley (P)

Fred Hutchinson Cancer Research Center, Seattle, WA, United States of America.

Roland Dunbrack (R)

Institute for Cancer Research, Fox Chase Cancer Center, Philadelphia PA, United States of America.

Tanja Kortemme (T)

Dept of Bioengineering and Therapeutic Sciences, University of California San Francisco, CA, United States of America.

Andrew Leaver-Fay (A)

Dept of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States of America.

Charlie E M Strauss (CEM)

Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, United States of America.

Jens Meiler (J)

Depts of Chemistry, Pharmacology and Biomedical Informatics, Vanderbilt University, Nashville, TN, United States of America.
Center for Structural Biology, Vanderbilt University, Nashville, TN, United States of America.
Institute for Chemical Biology, Vanderbilt University, Nashville, TN, United States of America.
Institute for Drug Discovery, Leipzig University, Leipzig, Germany.

Brian Kuhlman (B)

Dept of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States of America.

Jeffrey J Gray (JJ)

Dept of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, United States of America.

Richard Bonneau (R)

Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, United States of America.
Dept of Biology, New York University, New York, NY, United States of America.
Dept of Computer Science, New York University, New York, NY, United States of America.
Center for Data Science, New York University, New York, NY, United States of America.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH