EnzymeML-a data exchange format for biocatalysis and enzymology.

FAIR data principles Python Systems Biology Markup Language XML biocatalysis bioinformatics data exchange enzymology research data management

Journal

The FEBS journal
ISSN: 1742-4658
Titre abrégé: FEBS J
Pays: England
ID NLM: 101229646

Informations de publication

Date de publication:
10 2022
Historique:
revised: 15 11 2021
received: 22 09 2021
accepted: 09 12 2021
pubmed: 11 12 2021
medline: 6 10 2022
entrez: 10 12 2021
Statut: ppublish

Résumé

EnzymeML is an XML-based data exchange format that supports the comprehensive documentation of enzymatic data by describing reaction conditions, time courses of substrate and product concentrations, the kinetic model, and the estimated kinetic constants. EnzymeML is based on the Systems Biology Markup Language, which was extended by implementing the STRENDA Guidelines. An EnzymeML document serves as a container to transfer data between experimental platforms, modeling tools, and databases. EnzymeML supports the scientific community by introducing a standardized data exchange format to make enzymatic data findable, accessible, interoperable, and reusable according to the FAIR data principles. An application programming interface in Python supports the integration of software tools for data acquisition, data analysis, and publication. The feasibility of a seamless data flow using EnzymeML is demonstrated by creating an EnzymeML document from a structured spreadsheet or from a STRENDA DB database entry, by kinetic modeling using the modeling platform COPASI, and by uploading to the enzymatic reaction kinetics database SABIO-RK.

Identifiants

pubmed: 34890097
doi: 10.1111/febs.16318
doi:

Types de publication

Journal Article Review Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

5864-5874

Subventions

Organisme : Biotechnology and Biological Sciences Research Council
ID : BB/S004955/1
Pays : United Kingdom

Informations de copyright

© 2021 The Authors. The FEBS Journal published by John Wiley & Sons Ltd on behalf of Federation of European Biochemical Societies.

Références

Pellis A, Cantone S, Ebert C, Gardossi L. Evolving biocatalysis to meet bioeconomy challenges and opportunities. N Biotechnol. 2018;40:154-69.
Decoene T, De Paepe B, Maertens J, Coussement P, Peters G, De Maeseneire SL, et al. Standardization in synthetic biology: an engineering discipline coming of age. Crit Rev Biotechnol. 2018;38:647-56.
Lapatas V, Stefanidakis M, Jimenez RC, Via A, Schneider MV. Data integration in biological research: an overview. J Biol Res. 2015;22:1-16.
Kettner C, Cornish-Bowden A. Quo Vadis, enzymology data? Introductory remarks. Perspect Sci. 2014;1:1-6.
Swainston N, Golebiewski M, Messiha HL, Malys N, Kania R, Kengne S, et al. Enzyme kinetics informatics: from instrument to browser. FEBS J. 2010;277:3769-79.
Stark PB. No reproducibility without preproducibility. Nature. 2018;557:613.
Baker M, Penny D. Is there a reproducibility crisis? Nature. 2016;533:452-4.
Halling P, Fitzpatrick PF, Raushel FM, Rohwer J, Schnell S, Wittig U, et al. An empirical analysis of enzyme function reporting for experimental reproducibility: missing/incomplete information in published papers. Biophys Chem. 2018;242:22-7.
Rich RL, Papalia GA, Flynn PJ, Furneisen J, Quinn J, Klein JS, et al. A global benchmark study using affinity-based biosensors. Anal Biochem. 2009;386:194-216.
Cannon MJ, Papalia GA, Navratilova I, Fisher RJ, Roberts LR, Worthy KM, et al. Comparative analyses of a small molecule/enzyme interaction by multiple users of Biacore technology. Anal Biochem. 2004;330:98-113.
Myszka DG, Abdiche YN, Arisaka F, Byron O, Eisenstein E, Hensley P, et al. The ABRF-MIRG’02 study: assembly state, thermodynamic, and kinetic analysis of an enzyme/inhibitor interaction. J Biomol Tech. 2003;14:247-69.
McNutt M. Journals unite for reproducibility. Science. 2014;346:679.
Ioannidis JPA. How to make more published research true. PLoS Med. 2014;11:e1001747.
Wulf C, Beller M, Boenisch T, Deutschmann O, Hanf S, Kockmann N, et al. A unified research data infrastructure for catalysis research - challenges and concepts. ChemCatChem. 2021;13:3223-36.
Wilkinson MD, Verborgh R, da Silva Santos LOB, Clark T, Swertz MA, Kelpin FDL, et al. Interoperability and FAIRness through a novel combination of Web technologies. PeerJ Comput Sci. 2017;2017:e110.
Spellman PT, Miller M, Stewart J, Troup C, Sarkans U, Chervitz S, et al. Design and implementation of microarray gene expression markup language (MAGE-ML). Genome Biol. 2002;3:RESEARCH0046.
Pedrioli PGA, Eng JK, Hubley R, Vogelzang M, Deutsch EW, Raught B, et al. A common open representation of mass spectrometry data and its application to proteomics research. Nat Biotechnol. 2004;22:1459-66.
Larralde M, Lawson TN, Weber RJM, Moreno P, Haug K, Rocca-Serra P, et al. mzML2ISA & nmrML2ISA: generating enriched ISA-Tab metadata files from metabolomics XML data. Bioinformatics. 2017;33:2598-600.
Wittig U, Kania R, Bittkowski M, Wetsch E, Shi L, Jong L, et al. Data extraction for the reaction kinetics database SABIO-RK. Perspect Sci. 2014;1:33-40.
Wittig U, Kania R, Golebiewski M, Rey M, Shi L, Jong L, et al. SABIO-RK - database for biochemical reaction kinetics. Nucleic Acids Res. 2011;40:D790-6.
Schomburg I, Chang A, Schomburg D. BRENDA, enzyme data and metabolic information. Nucleic Acids Res. 2002;30:47-9.
Apweiler R, Armstrong R, Bairoch A, Cornish-Bowden A, Halling PJ, Hofmeyr J-HS, et al. A large-scale protein-function database. Nat Chem Biol. 2010;6:785.
Tipton KF, Armstrong RN, Bakker BM, Bairoch A, Cornish-Bowden A, Halling PJ, et al. Standards for reporting enzyme data: the STRENDA Consortium: what it aims to do and why it should be helpful. Perspect Sci. 2014;1:131-7.
Swainston N, Baici A, Bakker BM, Cornish-Bowden A, Fitzpatrick PF, Halling P, et al. STRENDA DB: enabling the validation and sharing of enzyme kinetics data. FEBS J. 2018;285:2193-204.
Dörr M, Fibinger MPC, Last D, Schmidt S, Santos-Aberturas J, Böttcher D, et al. Fully automatized high-throughput enzyme library screening using a robotic platform. Biotechnol Bioeng. 2016;113:1421-32.
Ringborg RH, Toftgaard Pedersen A, Woodley JM. Automated determination of oxygen-dependent enzyme kinetics in a tube-in-tube flow reactor. ChemCatChem. 2017;9:3285-8.
Wilkinson MD, Dumontier M, Aalbersberg IJ, Appleton G, Axton M, Baak A, et al. The FAIR guiding principles for scientific data management and stewardship. Sci Data. 2016;3:160018.
Hucka M, Bergmann FT, Dräger A, Hoops S, Keating SM, Le Novère N, et al. The Systems Biology Markup Language (SBML): language specification for level 3 version 2 core. J Integr Bioinform. 2018;15:20170081. https://doi.org/10.1515/jib-2017-0081
Le NN, Finney A, Hucka M, Bhalla US, Campagne F, Collado-Vides J, et al. Minimum information requested in the annotation of biochemical models (MIRIAM). Nat Biotechnol. 2005;23:1509-15.
Hastings J, Owen G, Dekker A, Ennis M, Kale N, Muthukrishnan V, et al. ChEBI in 2016: improved services and an expanding collection of metabolites. Nucleic Acids Res. 2016;44:D1214-9.
The UniProt Consortium. UniProt: the universal protein knowledgebase. Nucleic Acids Res. 2018;46:2699.
Courtot M, Juty N, Knüpfer C, Waltemath D, Zhukova A, Dräger A, et al. Controlled vocabularies and semantics in systems biology. Mol Syst Biol. 2011;7:543.
Bergmann FT, Adams R, Moodie S, Cooper J, Glont M, Golebiewski M, et al. COMBINE archive and OMEX format: one file to share all information to reproduce a modeling project. BMC Bioinformatics. 2014;15:369.
Hoops S, Sahle S, Gauges R, Lee C, Pahle J, Simus N, et al. COPASI - a complex pathway simulator. Bioinformatics. 2006;22:3067-74.
Buchholz PCF, Ohs R, Spiess AC, Pleiss J. Progress curve analysis within BioCatNet: comparing kinetic models for enzyme-catalyzed self-ligation. Biotechnol J. 2019;14:1-8.
Wolstencroft K, Krebs O, Snoep JL, Stanford NJ, Bacall F, Golebiewski M, et al. FAIRDOMHub: a repository and collaboration environment for sharing systems biology research. Nucleic Acids Res. 2017;45:D404-7.
Crosas M. The dataverse network®: an open-source application for sharing, discovering and preserving data. D-Lib Mag. 2011;17. https://doi.org/10.1045/january2011-crosas
Pleiss J. Standardized data, scalable documentation, sustainable storage - EnzymeML as a basis for FAIR data management in biocatalysis. ChemCatChem. 2021;13:3909-13.
Fernandes P. Miniaturization in biocatalysis. Int J Mol Sci. 2010;11:858-79.
Rabe KS, Müller J, Skoupi M, Niemeyer CM. Cascades in compartments: en route to machine-assisted biotechnology. Angew Chem Int Ed Engl. 2017;56:13574-89.
Barillari C, Ottoz DSM, Fuentes-Serna JM, Ramakrishnan C, Rinn B, Rudolf F. openBIS ELN-LIMS: an open-source database for academic laboratories. Bioinformatics. 2016;32:638-40.
Tremouilhac P, Nguyen A, Huang Y-C, Kotov S, Lütjohann DS, Hübsch F, et al. Chemotion ELN: an Open Source electronic lab notebook for chemists in academia. J Cheminform. 2017;9:54.
Bär H, Hochstrasser R, Papenfuß B. SiLA: basic standards for rapid integration in laboratory automation. J Lab Autom. 2012;17:86-95.
Christensen CD, Hofmeyr JHS, Rohwer JM. PySCeSToolbox: a collection of metabolic pathway analysis tools. Bioinformatics. 2018;34:124-5.

Auteurs

Jan Range (J)

Institute of Biochemistry and Technical Biochemistry, University of Stuttgart, Germany.

Colin Halupczok (C)

Institute of Biochemistry and Technical Biochemistry, University of Stuttgart, Germany.

Jens Lohmann (J)

Institute of Biochemistry and Technical Biochemistry, University of Stuttgart, Germany.

Neil Swainston (N)

Institute of Systems, Molecular and Integrative Biology, University of Liverpool, UK.

Carsten Kettner (C)

Beilstein-Institut, Frankfurt am Main, Germany.

Frank T Bergmann (FT)

BioQUANT/COS, Heidelberg University, Germany.

Andreas Weidemann (A)

Heidelberg Institute for Theoretical Studies, Germany.

Ulrike Wittig (U)

Heidelberg Institute for Theoretical Studies, Germany.

Santiago Schnell (S)

Department of Molecular & Integrative Physiology, University of Michigan Medical School, Ann Arbor, MI, USA.
Department of Computational Medicine & Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA.

Jürgen Pleiss (J)

Institute of Biochemistry and Technical Biochemistry, University of Stuttgart, Germany.

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software
Cephalometry Humans Anatomic Landmarks Software Internet
Humans Algorithms Software Artificial Intelligence Computer Simulation

Classifications MeSH