BOSO: A novel feature selection algorithm for linear regression with high-dimensional data.


Journal

PLoS computational biology
ISSN: 1553-7358
Abbreviated title: PLoS Comput Biol
Country: United States
NLM ID: 101238922

Publication information

Publication date:
05 2022
History:
received: 23 06 2021
accepted: 07 05 2022
revised: 10 06 2022
pubmed: 1 6 2022
medline: 15 6 2022
entrez: 31 5 2022
Status: epublish

Abstract

With the frenetic growth of high-dimensional datasets in different biomedical domains, there is an urgent need to develop predictive methods able to deal with this complexity. Feature selection is a relevant strategy in machine learning to address this challenge. We introduce a novel feature selection algorithm for linear regression called BOSO (Bilevel Optimization Selector Operator). We conducted a benchmark of BOSO with key algorithms in the literature, finding a superior accuracy for feature selection in high-dimensional datasets. Proof-of-concept of BOSO for predicting drug sensitivity in cancer is presented. A detailed analysis is carried out for methotrexate, a well-studied drug targeting cancer metabolism.

Identifiers

pubmed: 35639775
doi: 10.1371/journal.pcbi.1010180
pii: PCOMPBIOL-D-21-01162
pmc: PMC9187084

Publication types

Journal Article
Research Support, Non-U.S. Gov't

Languages

eng

Citation subsets

IM

Pagination

e1010180

Grants

Agency: Cancer Research UK
ID: C355/A26819
Country: United Kingdom

Conflict of interest statement

The authors have declared that no competing interests exist.

Authors

Luis V Valcárcel (LV)

Universidad de Navarra, Tecnun Escuela de Ingeniería, San Sebastián, Spain.
Universidad de Navarra, CIMA Centro de Investigación de Medicina Aplicada, Pamplona, Spain.

Edurne San José-Enériz (E)

Universidad de Navarra, CIMA Centro de Investigación de Medicina Aplicada, Pamplona, Spain.
CIBERONC Centro de Investigación Biomédica en Red de Cáncer, Pamplona, Spain.

Xabier Cendoya (X)

Universidad de Navarra, Tecnun Escuela de Ingeniería, San Sebastián, Spain.

Ángel Rubio (Á)

Universidad de Navarra, Tecnun Escuela de Ingeniería, San Sebastián, Spain.
Universidad de Navarra, Centro de Ingeniería Biomédica, Pamplona, Spain.
Universidad de Navarra, DATAI Instituto de Ciencia de los Datos e Inteligencia Artificial, Pamplona, Spain.

Xabier Agirre (X)

Universidad de Navarra, CIMA Centro de Investigación de Medicina Aplicada, Pamplona, Spain.
CIBERONC Centro de Investigación Biomédica en Red de Cáncer, Pamplona, Spain.

Felipe Prósper (F)

Universidad de Navarra, CIMA Centro de Investigación de Medicina Aplicada, Pamplona, Spain.
CIBERONC Centro de Investigación Biomédica en Red de Cáncer, Pamplona, Spain.
IdiSNA Instituto de Investigación Sanitaria de Navarra, Pamplona, Spain.
Clínica Universidad de Navarra, Pamplona, Spain.

Francisco J Planes (FJ)

Universidad de Navarra, Tecnun Escuela de Ingeniería, San Sebastián, Spain.
Universidad de Navarra, Centro de Ingeniería Biomédica, Pamplona, Spain.
Universidad de Navarra, DATAI Instituto de Ciencia de los Datos e Inteligencia Artificial, Pamplona, Spain.
