Automated workflow composition in mass spectrometry-based proteomics.


Journal

Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944

Informations de publication

Date de publication:
15 02 2019
Historique:
received: 29 01 2018
revised: 06 07 2018
accepted: 26 07 2018
pubmed: 31 7 2018
medline: 5 11 2019
entrez: 31 7 2018
Statut: ppublish

Résumé

Numerous software utilities operating on mass spectrometry (MS) data are described in the literature and provide specific operations as building blocks for the assembly of on-purpose workflows. Working out which tools and combinations are applicable or optimal in practice is often hard. Thus researchers face difficulties in selecting practical and effective data analysis pipelines for a specific experimental design. We provide a toolkit to support researchers in identifying, comparing and benchmarking multiple workflows from individual bioinformatics tools. Automated workflow composition is enabled by the tools' semantic annotation in terms of the EDAM ontology. To demonstrate the practical use of our framework, we created and evaluated a number of logically and semantically equivalent workflows for four use cases representing frequent tasks in MS-based proteomics. Indeed we found that the results computed by the workflows could vary considerably, emphasizing the benefits of a framework that facilitates their systematic exploration. The project files and workflows are available from https://github.com/bio-tools/biotoolsCompose/tree/master/Automatic-Workflow-Composition. Supplementary data are available at Bioinformatics online.

Identifiants

pubmed: 30060113
pii: 5060940
doi: 10.1093/bioinformatics/bty646
pmc: PMC6378944
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

656-664

Informations de copyright

© The Author(s) 2018. Published by Oxford University Press.

Références

J Biomed Semantics. 2011 Mar 07;2 Suppl 1:S5
pubmed: 21388574
Bioinformatics. 2012 Sep 15;28(18):i451-i457
pubmed: 22962466
J Proteomics. 2014 Apr 14;101:130-40
pubmed: 24549006
J Proteome Res. 2006 Mar;5(3):619-24
pubmed: 16512677
Brief Bioinform. 2017 May 1;18(3):530-536
pubmed: 27013646
Nat Methods. 2016 Aug 30;13(9):741-8
pubmed: 27575624
BMC Bioinformatics. 2009 Oct 01;10 Suppl 10:S8
pubmed: 19796405
Brief Bioinform. 2007 Jan;8(1):45-59
pubmed: 16772270
Bioinformatics. 2010 Apr 1;26(7):966-8
pubmed: 20147306
Proteomics. 2010 Mar;10(6):1265-9
pubmed: 20077414
Nucleic Acids Res. 2016 Jan 4;44(D1):D38-47
pubmed: 26538599
Bioinformatics. 2016 May 15;32(10):1502-8
pubmed: 26787660
Mol Cell Proteomics. 2014 Jan;13(1):360-71
pubmed: 24187338
BMC Bioinformatics. 2010 Dec 21;11 Suppl 12:S7
pubmed: 21210986
Proc Natl Acad Sci U S A. 1980 Mar;77(3):1632-6
pubmed: 6929513
Nat Commun. 2014 Oct 31;5:5277
pubmed: 25358478
Nat Biotechnol. 2015 Feb;33(2):137-9
pubmed: 25658277
Nucleic Acids Res. 2016 Jul 8;44(W1):W3-W10
pubmed: 27137889
Nucleic Acids Res. 2013 Jul;41(Web Server issue):W557-61
pubmed: 23640334
Bioinformatics. 2013 May 15;29(10):1325-32
pubmed: 23479348
PLoS One. 2016 Jun 22;11(6):e0157989
pubmed: 27331905
Nat Biotechnol. 2008 Dec;26(12):1367-72
pubmed: 19029910
BMC Bioinformatics. 2015 Sep 03;16:284
pubmed: 26335531
BMC Genomics. 2014 Aug 22;15:703
pubmed: 25149441
Bioinformatics. 2011 Apr 15;27(8):1183-4
pubmed: 21325302
J Biomed Inform. 2008 Oct;41(5):837-47
pubmed: 18373957
PLoS One. 2015 Sep 02;10(9):e0137048
pubmed: 26331617
J Proteome Res. 2014 Dec 5;13(12):5898-908
pubmed: 25301683
Proteomics. 2013 Mar;13(6):1036-41
pubmed: 23307401
J Proteome Res. 2011 Dec 2;10(12):5354-62
pubmed: 22073976
Nucleic Acids Res. 2016 Jul 8;44(W1):W83-9
pubmed: 27098042
BMC Bioinformatics. 2011 Mar 08;12:70
pubmed: 21385435

Auteurs

Magnus Palmblad (M)

Center for Proteomics and Metabolomics, Leiden University Medical Center, RC Leiden, The Netherlands.

Anna-Lena Lamprecht (AL)

Department of Information and Computing Sciences, Utrecht University, CC Utrecht, The Netherlands.

Jon Ison (J)

National Life Science Supercomputing Center, Technical University of Denmark, Kongens Lyngby, Denmark.

Veit Schwämmle (V)

Department of Biochemistry and Molecular Biology and VILLUM Center for Bioanalytical Sciences, University of Southern Denmark, Odense, Denmark.

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software
Cephalometry Humans Anatomic Landmarks Software Internet
Humans Colorectal Neoplasms Biomarkers, Tumor Prognosis Gene Expression Regulation, Neoplastic

Classifications MeSH