Large-Scale Automatic Feature Selection for Biomarker Discovery in High-Dimensional OMICs Data.

biomarkers signature feature selection machine learning omics precision medicine

Journal

Frontiers in genetics
ISSN: 1664-8021
Titre abrégé: Front Genet
Pays: Switzerland
ID NLM: 101560621

Informations de publication

Date de publication:
2019
Historique:
received: 23 01 2019
accepted: 30 04 2019
entrez: 4 6 2019
pubmed: 4 6 2019
medline: 4 6 2019
Statut: epublish

Résumé

The identification of biomarker signatures in omics molecular profiling is usually performed to predict outcomes in a precision medicine context, such as patient disease susceptibility, diagnosis, prognosis, and treatment response. To identify these signatures, we have developed a biomarker discovery tool, called BioDiscML. From a collection of samples and their associated characteristics, i.e., the biomarkers (e.g., gene expression, protein levels, clinico-pathological data), BioDiscML exploits various feature selection procedures to produce signatures associated to machine learning models that will predict efficiently a specified outcome. To this purpose, BioDiscML uses a large variety of machine learning algorithms to select the best combination of biomarkers for predicting categorical or continuous outcomes from highly unbalanced datasets. The software has been implemented to automate all machine learning steps, including data pre-processing, feature selection, model selection, and performance evaluation. BioDiscML is delivered as a stand-alone program and is available for download at https://github.com/mickaelleclercq/BioDiscML.

Identifiants

pubmed: 31156708
doi: 10.3389/fgene.2019.00452
pmc: PMC6532608
doi:

Types de publication

Journal Article

Langues

eng

Pagination

452

Références

Proc Natl Acad Sci U S A. 1999 Jun 8;96(12):6745-50
pubmed: 10359783
Nat Med. 2002 Jan;8(1):68-74
pubmed: 11786909
Nature. 2002 Jan 24;415(6870):436-42
pubmed: 11807556
Cancer Cell. 2002 Mar;1(2):203-9
pubmed: 12086878
Ultrasound Med Biol. 2002 Jul;28(7):897-902
pubmed: 12208332
Bioinformatics. 2004 Oct 12;20(15):2429-37
pubmed: 15087314
IEEE Trans Syst Man Cybern B Cybern. 2004 Feb;34(1):629-34
pubmed: 15369099
Bioinformatics. 2007 Mar 1;23(5):651-3
pubmed: 17237069
J Clin Oncol. 2007 Jun 10;25(17):2426-33
pubmed: 17485708
Bioinformatics. 2007 Oct 1;23(19):2507-17
pubmed: 17720704
BMC Bioinformatics. 2008 Feb 05;9:82
pubmed: 18251993
Nucleic Acids Res. 2009 Jul;37(Web Server issue):W305-11
pubmed: 19465376
Bioinformatics. 2009 Nov 1;25(21):2855-6
pubmed: 19706745
BMC Bioinformatics. 2009 Sep 17;10 Suppl 9:S7
pubmed: 19761577
PLoS One. 2010 Jan 27;5(1):e8819
pubmed: 20111698
Comput Biol Chem. 2010 Aug;34(4):215-25
pubmed: 20702140
Nucleic Acids Res. 2011 Mar;39(4):1197-207
pubmed: 20965966
Proc Natl Acad Sci U S A. 2011 Feb 22;108(8):3342-7
pubmed: 21300890
J Neurochem. 2011 Aug;118(3):449-57
pubmed: 21561454
Crit Care. 2011 Jun 20;15(3):R149
pubmed: 21682927
BMC Genomics. 2011 Aug 08;12:399
pubmed: 21824406
Neuroimage. 2012 Jan 16;59(2):895-907
pubmed: 21992749
BMC Bioinformatics. 2012 Feb 03;13:24
pubmed: 22305354
BMC Bioinformatics. 2012 May 01;13:69
pubmed: 22548963
Bioinformatics. 2012 Sep 15;28(18):2404-6
pubmed: 22815360
Med Res Rev. 2014 May;34(3):455-78
pubmed: 23775602
PLoS One. 2013 Sep 03;8(9):e73074
pubmed: 24019896
OMICS. 2013 Dec;17(12):595-610
pubmed: 24116388
Front Hum Neurosci. 2013 Oct 17;7:670
pubmed: 24151458
PLoS One. 2013 Dec 31;8(12):e85190
pubmed: 24391997
Bioinformatics. 2014 May 1;30(9):1325-6
pubmed: 24443376
PLoS Comput Biol. 2014 Jan;10(1):e1003440
pubmed: 24453961
Bioinformatics. 2014 Jun 15;30(12):1782-4
pubmed: 24574115
Comput Struct Biotechnol J. 2013 Jun 08;6:e201303004
pubmed: 24688712
Am J Epidemiol. 2014 Aug 1;180(3):318-24
pubmed: 24966219
Biochim Biophys Acta. 2015 May;1850(5):872-877
pubmed: 25450171
BMC Genomics. 2014;15 Suppl 10:S1
pubmed: 25559769
Comput Struct Biotechnol J. 2014 Nov 15;13:8-17
pubmed: 25750696
BMC Genomics. 2015;16 Suppl 1:S2
pubmed: 25923811
Nat Rev Genet. 2015 Jun;16(6):321-32
pubmed: 25948244
IEEE/ACM Trans Comput Biol Bioinform. 2014 Jul-Aug;11(4):766-72
pubmed: 26356346
PLoS Comput Biol. 2016 Feb 04;12(2):e1004717
pubmed: 26845763
Nucleic Acids Res. 2016 Jul 8;44(W1):W90-7
pubmed: 27141961
PLoS Comput Biol. 2016 Jul 11;12(7):e1004977
pubmed: 27400279
Methods. 2016 Dec 1;111:21-31
pubmed: 27592382
BMC Bioinformatics. 2017 Feb 27;18(1):128
pubmed: 28241739
Bioinformatics. 2017 Aug 1;33(15):2424-2426
pubmed: 28369169
IEEE/ACM Trans Comput Biol Bioinform. 2018 May-Jun;15(3):926-933
pubmed: 28391206
Stem Cells. 2017 Aug;35(8):1867-1880
pubmed: 28589621
PLoS Comput Biol. 2017 Nov 3;13(11):e1005752
pubmed: 29099853
Brief Bioinform. 2017 Nov 29;:null
pubmed: 29194464
BioData Min. 2017 Dec 8;10:35
pubmed: 29234465
Per Med. 2015 Aug;12(4):371-387
pubmed: 29771660

Auteurs

Mickael Leclercq (M)

Centre de Recherche du CHU de Québec-Université Laval, Québec City, QC, Canada.
Département de Médecine Moléculaire, Université Laval, Québec City, QC, Canada.

Benjamin Vittrant (B)

Centre de Recherche du CHU de Québec-Université Laval, Québec City, QC, Canada.
Département de Médecine Moléculaire, Université Laval, Québec City, QC, Canada.

Marie Laure Martin-Magniette (ML)

Institute of Plant Sciences Paris Saclay IPS2, CNRS, INRA, Université Paris-Sud, Université Evry, Université Paris-Saclay, Paris Diderot, Sorbonne Paris-Cité, Orsay, France.
UMR MIA-Paris, AgroParisTech, INRA, Université Paris-Saclay, Paris, France.

Marie Pier Scott Boyer (MP)

Centre de Recherche du CHU de Québec-Université Laval, Québec City, QC, Canada.
Département de Médecine Moléculaire, Université Laval, Québec City, QC, Canada.

Olivier Perin (O)

Digital Sciences Department, L'Oréal Advanced Research, Aulnay-sous-bois, France.

Alain Bergeron (A)

Centre de Recherche du CHU de Québec-Université Laval, Québec City, QC, Canada.
Département de Chirurgie, Oncology Axis, Université Laval, Québec City, QC, Canada.

Yves Fradet (Y)

Centre de Recherche du CHU de Québec-Université Laval, Québec City, QC, Canada.
Département de Chirurgie, Oncology Axis, Université Laval, Québec City, QC, Canada.

Arnaud Droit (A)

Centre de Recherche du CHU de Québec-Université Laval, Québec City, QC, Canada.
Département de Médecine Moléculaire, Université Laval, Québec City, QC, Canada.

Classifications MeSH