Wide and deep learning for automatic cell type identification.

Classification Deep learning Single cell data

Journal

Computational and structural biotechnology journal
ISSN: 2001-0370
Titre abrégé: Comput Struct Biotechnol J
Pays: Netherlands
ID NLM: 101585369

Informations de publication

Date de publication:
2021
Historique:
received: 30 09 2020
revised: 16 01 2021
accepted: 18 01 2021
entrez: 22 2 2021
pubmed: 23 2 2021
medline: 23 2 2021
Statut: epublish

Résumé

Cell type classification is an important problem in cancer research, especially with the advent of single cell technologies. Correctly identifying cells within the tumor microenvironment can provide oncologists with a snapshot of how a patient's immune system reacts to the tumor. Wide and deep learning (WDL) is an approach to construct a cell-classification prediction model that can learn patterns within high-dimensional data (deep) and ensure that biologically relevant features (wide) remain in the final model. In this paper, we demonstrate that regularization can prevent overfitting and adding a wide component to a neural network can result in a model with better predictive performance. In particular, we observed that a combination of dropout and

Identifiants

pubmed: 33613870
doi: 10.1016/j.csbj.2021.01.027
pii: S2001-0370(21)00031-3
pmc: PMC7878986
doi:

Types de publication

Journal Article

Langues

eng

Pagination

1052-1062

Subventions

Organisme : NCI NIH HHS
ID : R01 CA124515
Pays : United States
Organisme : NCI NIH HHS
ID : R01 CA157664
Pays : United States

Informations de copyright

© 2021 The Authors.

Déclaration de conflit d'intérêts

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Références

Bioinformatics. 2019 Jul 15;35(14):i436-i445
pubmed: 31510660
Nat Immunol. 2019 Feb;20(2):163-172
pubmed: 30643263
Cell. 2015 May 21;161(5):1202-1214
pubmed: 26000488
Nat Med. 2019 Aug;25(8):1251-1259
pubmed: 31359002
Mol Cell. 2019 Jan 3;73(1):130-142.e5
pubmed: 30472192
Nat Commun. 2019 Oct 11;10(1):4667
pubmed: 31604912
Genome Biol. 2019 Sep 9;20(1):194
pubmed: 31500660
Front Genet. 2019 Apr 05;10:317
pubmed: 31024627
Bioinformatics. 2019 Jul 15;35(14):i446-i454
pubmed: 31510656
Nucleic Acids Res. 2019 Sep 19;47(16):e95
pubmed: 31226206
Front Immunol. 2018 Oct 23;9:2425
pubmed: 30405621
Nat Commun. 2017 Jan 16;8:14049
pubmed: 28091601
Drug Discov Today. 2018 Jun;23(6):1241-1250
pubmed: 29366762
Nat Methods. 2014 Feb;11(2):163-6
pubmed: 24363023
Nat Protoc. 2014 Jan;9(1):171-81
pubmed: 24385147
Cell Rep. 2012 Sep 27;2(3):666-73
pubmed: 22939981
Nat Methods. 2015 May;12(5):453-7
pubmed: 25822800
Nat Methods. 2013 Nov;10(11):1096-8
pubmed: 24056875
Cell. 2019 Jun 13;177(7):1888-1902.e21
pubmed: 31178118
Front Genet. 2019 Mar 26;10:214
pubmed: 30972100
Science. 2016 Apr 8;352(6282):189-96
pubmed: 27124452
Nat Commun. 2020 May 11;11(1):2338
pubmed: 32393754

Auteurs

Christopher M Wilson (CM)

Department of Biostatistics and Bioinformatics, H. Lee Moffitt Cancer Center & Research Institute, 12902 USF Magnolia Drive, Tampa, FL 33612, USA.

Brooke L Fridley (BL)

Department of Biostatistics and Bioinformatics, H. Lee Moffitt Cancer Center & Research Institute, 12902 USF Magnolia Drive, Tampa, FL 33612, USA.

José R Conejo-Garcia (JR)

Department of Immunology, H. Lee Moffitt Cancer Center & Research Institute, 12902 USF Magnolia Drive, Tampa, FL 33612, USA.

Xuefeng Wang (X)

Department of Biostatistics and Bioinformatics, H. Lee Moffitt Cancer Center & Research Institute, 12902 USF Magnolia Drive, Tampa, FL 33612, USA.

Xiaoqing Yu (X)

Department of Biostatistics and Bioinformatics, H. Lee Moffitt Cancer Center & Research Institute, 12902 USF Magnolia Drive, Tampa, FL 33612, USA.

Classifications MeSH