A Review of Integrative Imputation for Multi-Omics Datasets.

autoencoders deep learning integrative imputation machine learning multi-omics imputation multi-view matrix factorization single-omics imputation transfer learning

Journal

Frontiers in genetics
ISSN: 1664-8021
Titre abrégé: Front Genet
Pays: Switzerland
ID NLM: 101560621

Informations de publication

Date de publication:
2020
Historique:
received: 07 06 2020
accepted: 16 09 2020
entrez: 16 11 2020
pubmed: 17 11 2020
medline: 17 11 2020
Statut: epublish

Résumé

Multi-omics studies, which explore the interactions between multiple types of biological factors, have significant advantages over single-omics analysis for their ability to provide a more holistic view of biological processes, uncover the causal and functional mechanisms for complex diseases, and facilitate new discoveries in precision medicine. However, omics datasets often contain missing values, and in multi-omics study designs it is common for individuals to be represented for some omics layers but not all. Since most statistical analyses cannot be applied directly to the incomplete datasets, imputation is typically performed to infer the missing values. Integrative imputation techniques which make use of the correlations and shared information among multi-omics datasets are expected to outperform approaches that rely on single-omics information alone, resulting in more accurate results for the subsequent downstream analyses. In this review, we provide an overview of the currently available imputation methods for handling missing values in bioinformatics data with an emphasis on multi-omics imputation. In addition, we also provide a perspective on how deep learning methods might be developed for the integrative imputation of multi-omics datasets.

Identifiants

pubmed: 33193667
doi: 10.3389/fgene.2020.570255
pmc: PMC7594632
doi:

Types de publication

Journal Article Review

Langues

eng

Pagination

570255

Subventions

Organisme : NCRR NIH HHS
ID : M01 RR000585
Pays : United States
Organisme : NIA NIH HHS
ID : R01 AG061917
Pays : United States
Organisme : NIAMS NIH HHS
ID : R01 AR069055
Pays : United States
Organisme : NIAMS NIH HHS
ID : R01 AR027065
Pays : United States
Organisme : NIA NIH HHS
ID : U19 AG055373
Pays : United States
Organisme : NIGMS NIH HHS
ID : P20 GM109036
Pays : United States

Informations de copyright

Copyright © 2020 Song, Greenbaum, Luttrell, Zhou, Wu, Shen, Gong, Zhang and Deng.

Références

Genetics. 2014 Aug;197(4):1081-95
pubmed: 24831820
Sci Data. 2019 Oct 31;6(1):251
pubmed: 31672978
Genome Biol. 2017 Apr 11;18(1):67
pubmed: 28395661
Brief Bioinform. 2018 Mar 1;19(2):325-340
pubmed: 28011753
Pac Symp Biocomput. 2018;23:448-459
pubmed: 29218904
Genetics. 2017 Nov;207(3):893-902
pubmed: 28893853
Bioinformatics. 2001 Jun;17(6):520-5
pubmed: 11395428
Nucleic Acids Res. 2020 Sep 4;48(15):e85
pubmed: 32588900
Bioinformatics. 2020 Apr 1;36(7):2009-2016
pubmed: 31755899
Nat Genet. 2016 Mar;48(3):245-52
pubmed: 26854917
Comput Biol Chem. 2007 Oct;31(5-6):320-7
pubmed: 17920334
Am J Hum Genet. 2008 Feb;82(2):444-52
pubmed: 18252224
Am J Hum Genet. 2007 Nov;81(5):1084-97
pubmed: 17924348
Nat Commun. 2019 Jan 23;10(1):390
pubmed: 30674886
Science. 2016 Apr 29;352(6285):600-4
pubmed: 27126046
Nat Biotechnol. 2017 Oct;35(10):936-939
pubmed: 28854175
Nat Commun. 2018 May 8;9(1):1825
pubmed: 29739930
PLoS One. 2015 Apr 20;10(3):e0121314
pubmed: 25894527
BMC Bioinformatics. 2016 Oct 3;17(1):402
pubmed: 27716030
Genome Biol. 2020 Mar 30;21(1):81
pubmed: 32228704
Genet Epidemiol. 2020 Jul;44(5):425-441
pubmed: 32190932
Annu Rev Genomics Hum Genet. 2018 Aug 31;19:73-96
pubmed: 29799802
Nature. 1970 Aug 8;227(5258):561-3
pubmed: 4913914
Hum Hered. 2008;66(2):87-98
pubmed: 18382088
PLoS Genet. 2013;9(2):e1003264
pubmed: 23408905
Genome Biol. 2018 Nov 12;19(1):196
pubmed: 30419955
J R Soc Interface. 2018 Apr;15(141):
pubmed: 29618526
Am J Hum Genet. 2018 Sep 6;103(3):338-348
pubmed: 30100085
Nat Methods. 2017 Sep;14(9):865-868
pubmed: 28759029
NAR Genom Bioinform. 2020 Dec;2(4):lqaa077
pubmed: 33029585
Nat Genet. 2016 Oct;48(10):1284-1287
pubmed: 27571263
Nat Rev Endocrinol. 2020 Feb;16(2):91-103
pubmed: 31792439
Stat Anal Data Min. 2017 Dec;10(6):363-377
pubmed: 29403567
Am J Hum Genet. 2019 Aug 1;105(2):258-266
pubmed: 31230719
Nat Commun. 2018 Mar 8;9(1):997
pubmed: 29520097
IEEE Trans Pattern Anal Mach Intell. 2019 Oct;41(10):2410-2423
pubmed: 30387725
Genet Epidemiol. 2006 Dec;30(8):718-27
pubmed: 16986160
Bioinformatics. 2004 Apr 12;20(6):917-23
pubmed: 14751970
Cell. 2018 Jul 26;174(3):716-729.e27
pubmed: 29961576
PLoS Genet. 2009 Jun;5(6):e1000529
pubmed: 19543373
Bioinformatics. 2014 May 1;30(9):1266-72
pubmed: 24413527
Genet Epidemiol. 2010 Dec;34(8):816-34
pubmed: 21058334
Genome Biol. 2020 May 11;21(1):111
pubmed: 32393329
IEEE Trans Inf Technol Biomed. 2009 Jan;13(1):131-7
pubmed: 19129032
Am J Hum Genet. 2006 Apr;78(4):629-44
pubmed: 16532393
Genes (Basel). 2019 Jan 28;10(2):
pubmed: 30696086
Nat Commun. 2019 Aug 23;10(1):3834
pubmed: 31444360
Nature. 2015 Feb 19;518(7539):317-30
pubmed: 25693563
Nat Methods. 2018 Dec;15(12):1053-1058
pubmed: 30504886
Bioinformatics. 2006 Mar 1;22(5):566-72
pubmed: 16377613
Nat Biotechnol. 2015 Apr;33(4):364-76
pubmed: 25690853
Nat Genet. 2019 Apr;51(4):592-599
pubmed: 30926968
Nat Commun. 2019 Oct 8;10(1):4576
pubmed: 31594952
Bioinform Biol Insights. 2019 Oct 13;13:1177932219881435
pubmed: 31662603
Bioinformatics. 2019 Apr 15;35(8):1278-1283
pubmed: 30202885
BMC Bioinformatics. 2018 Jun 8;19(1):220
pubmed: 29884114
Nat Rev Genet. 2019 May;20(5):257-272
pubmed: 30696980
Inf Fusion. 2019 Oct;50:71-91
pubmed: 30467459
J Proteome Res. 2015 May 1;14(5):1993-2001
pubmed: 25855118
F1000Res. 2018 Nov 2;7:1740
pubmed: 30906525
Sci Rep. 2018 Nov 5;8(1):16329
pubmed: 30397240
Genes (Basel). 2019 Aug 28;10(9):
pubmed: 31466333
Nat Commun. 2018 Feb 22;9(1):781
pubmed: 29472610
Contemp Oncol (Pozn). 2015;19(1A):A68-77
pubmed: 25691825
Nat Genet. 2019 Mar;51(3):568-576
pubmed: 30804563
Bioinformatics. 2014 Jul 1;30(13):1876-83
pubmed: 24618466
PLoS Genet. 2019 Jan 22;15(1):e1007889
pubmed: 30668570
Genome Biol. 2019 Mar 21;20(1):61
pubmed: 30898142
Front Cell Dev Biol. 2018 Apr 20;6:28
pubmed: 29732369
Am J Hum Genet. 2007 Sep;81(3):559-75
pubmed: 17701901
Cell. 2019 Jun 13;177(7):1888-1902.e21
pubmed: 31178118
Genome Biol. 2020 Aug 27;21(1):218
pubmed: 32854757
Nat Rev Genet. 2014 Jan;15(1):34-48
pubmed: 24296534
Nat Methods. 2018 Jul;15(7):539-542
pubmed: 29941873
Bioinformatics. 2019 Jul 15;35(14):i501-i509
pubmed: 31510700
Nat Methods. 2019 Nov;16(11):1139-1145
pubmed: 31591579
Nat Methods. 2019 Sep;16(9):875-878
pubmed: 31471617
Nat Genet. 2015 Sep;47(9):1091-8
pubmed: 26258848
Brief Bioinform. 2018 Nov 27;19(6):1344-1355
pubmed: 28575146
Genome Biol. 2020 Feb 7;21(1):31
pubmed: 32033589
Front Genet. 2017 Jun 16;8:84
pubmed: 28670325
PLoS Genet. 2019 Dec 23;15(12):e1008500
pubmed: 31869403
BMC Bioinformatics. 2016 Jun 21;17:247
pubmed: 27329642
Nat Commun. 2018 Apr 11;9(1):1402
pubmed: 29643364
Nat Commun. 2020 Jan 31;11(1):651
pubmed: 32005835
Genome Biol. 2019 Oct 18;20(1):211
pubmed: 31627739
Mol Syst Biol. 2018 Jun 20;14(6):e8124
pubmed: 29925568
J Proteome Res. 2016 Apr 1;15(4):1116-25
pubmed: 26906401

Auteurs

Meng Song (M)

School of Computing Sciences and Computer Engineering, University of Southern Mississippi, Hattiesburg, MS, United States.

Jonathan Greenbaum (J)

Tulane Center of Biomedical Informatics and Genomics, School of Medicine, Tulane University, New Orleans, LA, United States.

Joseph Luttrell (J)

School of Computing Sciences and Computer Engineering, University of Southern Mississippi, Hattiesburg, MS, United States.

Weihua Zhou (W)

College of Computing, Michigan Technological University, Houghton, MI, United States.

Chong Wu (C)

Department of Statistics, Florida State University, Tallahassee, FL, United States.

Hui Shen (H)

Tulane Center of Biomedical Informatics and Genomics, School of Medicine, Tulane University, New Orleans, LA, United States.

Ping Gong (P)

Environmental Laboratory, U.S. Army Engineer Research and Development Center, Vicksburg, MS, United States.

Chaoyang Zhang (C)

School of Computing Sciences and Computer Engineering, University of Southern Mississippi, Hattiesburg, MS, United States.

Hong-Wen Deng (HW)

Tulane Center of Biomedical Informatics and Genomics, School of Medicine, Tulane University, New Orleans, LA, United States.

Classifications MeSH