Ten quick tips for avoiding pitfalls in multi-omics data integration analyses.
Journal
PLoS computational biology
ISSN: 1553-7358
Titre abrégé: PLoS Comput Biol
Pays: United States
ID NLM: 101238922
Informations de publication
Date de publication:
Jul 2023
Jul 2023
Historique:
medline:
10
7
2023
pubmed:
6
7
2023
entrez:
6
7
2023
Statut:
epublish
Résumé
Data are the most important elements of bioinformatics: Computational analysis of bioinformatics data, in fact, can help researchers infer new knowledge about biology, chemistry, biophysics, and sometimes even medicine, influencing treatments and therapies for patients. Bioinformatics and high-throughput biological data coming from different sources can even be more helpful, because each of these different data chunks can provide alternative, complementary information about a specific biological phenomenon, similar to multiple photos of the same subject taken from different angles. In this context, the integration of bioinformatics and high-throughput biological data gets a pivotal role in running a successful bioinformatics study. In the last decades, data originating from proteomics, metabolomics, metagenomics, phenomics, transcriptomics, and epigenomics have been labelled -omics data, as a unique name to refer to them, and the integration of these omics data has gained importance in all biological areas. Even if this omics data integration is useful and relevant, due to its heterogeneity, it is not uncommon to make mistakes during the integration phases. We therefore decided to present these ten quick tips to perform an omics data integration correctly, avoiding common mistakes we experienced or noticed in published studies in the past. Even if we designed our ten guidelines for beginners, by using a simple language that (we hope) can be understood by anyone, we believe our ten recommendations should be taken into account by all the bioinformaticians performing omics data integration, including experts.
Identifiants
pubmed: 37410704
doi: 10.1371/journal.pcbi.1011224
pii: PCOMPBIOL-D-23-00210
pmc: PMC10325053
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
e1011224Informations de copyright
Copyright: © 2023 Chicco et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Déclaration de conflit d'intérêts
The authors declare they have no conflict of interest.
Références
IEEE/ACM Trans Comput Biol Bioinform. 2016 Mar-Apr;13(2):248-60
pubmed: 27045825
Methods Mol Biol. 2018;1716:389-408
pubmed: 29222764
PLoS Comput Biol. 2022 Feb 7;18(2):e1009337
pubmed: 35130273
Genome Biol. 2020 May 11;21(1):111
pubmed: 32393329
Methods. 2016 Dec 1;111:3-11
pubmed: 27637471
Biomed Res Int. 2019 Jun 9;2019:8304260
pubmed: 31281846
Arch Toxicol. 2020 Feb;94(2):371-388
pubmed: 32034435
Methods Mol Biol. 2022;2401:187-194
pubmed: 34902129
Bioinformatics. 2023 Feb 3;39(2):
pubmed: 36637211
Comput Biol Med. 2022 Dec;151(Pt A):106244
pubmed: 36343407
Proc Natl Acad Sci U S A. 2020 Aug 4;117(31):18869-18879
pubmed: 32675233
PLoS Genet. 2011 Jun;7(6):e1001393
pubmed: 21695224
J Hum Genet. 2021 Jan;66(1):93-102
pubmed: 32385339
BMC Bioinformatics. 2011 Jun 22;12:253
pubmed: 21693065
Methods Mol Biol. 2023;2553:417-439
pubmed: 36227553
PLoS Comput Biol. 2022 Aug 11;18(8):e1010357
pubmed: 35951526
Nat Biotechnol. 2007 Nov;25(11):1251-5
pubmed: 17989687
Bioinformatics. 2020 Dec 22;36(20):5076-5085
pubmed: 33026062
Bioinformatics. 2022 Sep 30;38(19):4589-4597
pubmed: 35960154
Brief Bioinform. 2022 Jan 17;23(1):
pubmed: 34791014
BMC Bioinformatics. 2016 Jun 06;17 Suppl 5:180
pubmed: 27295212
Nucleic Acids Res. 2020 Jul 2;48(W1):W395-W402
pubmed: 32479607
PLoS Comput Biol. 2018 Dec 27;14(12):e1006472
pubmed: 30589835
BioData Min. 2018 Oct 25;11:22
pubmed: 30386434
PLoS Comput Biol. 2014 Apr 24;10(4):e1003542
pubmed: 24763340
Sci Data. 2016 Mar 15;3:160018
pubmed: 26978244
PLoS Comput Biol. 2022 Dec 15;18(12):e1010718
pubmed: 36520712
Bioinformatics. 2015 Jun 15;31(12):1881-8
pubmed: 25649616
Gigascience. 2021 Sep 16;10(9):
pubmed: 34528664
Nucleic Acids Res. 2003 Jan 1;31(1):51-4
pubmed: 12519945
PLoS One. 2011 Feb 28;6(2):e17238
pubmed: 21386892
Sci Rep. 2020 Jan 20;10(1):703
pubmed: 31959844
Genome Biol. 2014 Feb 20;15(2):403
pubmed: 25001293
Bioinformatics. 2021 Oct 25;37(20):3546-3552
pubmed: 33974036
PLoS Comput Biol. 2023 Jan 5;19(1):e1010778
pubmed: 36602952
Comput Struct Biotechnol J. 2021 Jun 22;19:3735-3746
pubmed: 34285775
PLoS Comput Biol. 2017 Nov 3;13(11):e1005752
pubmed: 29099853
BMC Bioinformatics. 2022 Jul 14;23(Suppl 6):279
pubmed: 35836114
Brief Bioinform. 2018 Jul 20;19(4):693-699
pubmed: 28088754
Genome Biol. 2004;5(10):R80
pubmed: 15461798
PLoS Comput Biol. 2019 Jul 11;15(7):e1007084
pubmed: 31295267
Nat Biotechnol. 2019 Apr;37(4):358-367
pubmed: 30940948
PLoS Comput Biol. 2022 Aug 11;18(8):e1010348
pubmed: 35951505
Ann Appl Stat. 2013 Mar 1;7(1):523-542
pubmed: 23745156
PLoS Comput Biol. 2018 Dec 20;14(12):e1006561
pubmed: 30571677
Bioinformatics. 2018 May 1;34(9):1615-1617
pubmed: 29272348
PLoS Comput Biol. 2022 Dec 8;18(12):e1010675
pubmed: 36480496
Bioinformatics. 2016 Mar 1;32(5):697-704
pubmed: 26519501
Nat Commun. 2021 May 11;12(1):2700
pubmed: 33976213
Nat Biotechnol. 2022 Oct;40(10):1458-1466
pubmed: 35501393
Bioinform Biol Insights. 2020 Jan 31;14:1177932219899051
pubmed: 32076369
Nat Methods. 2018 Jul;15(7):475-476
pubmed: 29967506
Genes (Basel). 2019 Mar 20;10(3):
pubmed: 30897838
Cell Rep. 2022 May 10;39(6):110800
pubmed: 35545044
Nat Biotechnol. 2010 May;28(5):495-501
pubmed: 20436461
Nucleic Acids Res. 2016 May 5;44(8):e71
pubmed: 26704973
Nucleic Acids Res. 2016 Jan 4;44(D1):D463-70
pubmed: 26467476
Methods Mol Biol. 2023;2553:325-393
pubmed: 36227551
BioData Min. 2017 Dec 8;10:35
pubmed: 29234465
Comput Biol Med. 2023 Jan;152:106373
pubmed: 36462367
Comput Struct Biotechnol J. 2020 Mar 05;18:509-517
pubmed: 32206210
PLoS Biol. 2011 Apr;9(4):e1001046
pubmed: 21526222
Genome Biol. 2019 Apr 16;20(1):76
pubmed: 30992073
Genome Biol. 2013 Apr 29;14(4):R34
pubmed: 23618380
Sci Data. 2022 Sep 30;9(1):592
pubmed: 36180441
Front Genet. 2017 Jun 16;8:84
pubmed: 28670325
Front Bioinform. 2022 Oct 26;2:968327
pubmed: 36388843
Genome Med. 2016 Dec 13;8(1):129
pubmed: 27964755
Metab Eng. 2023 Jan;75:181-191
pubmed: 36566974
BMJ. 2016 Oct 10;355:i5295
pubmed: 27758792
PeerJ. 2015 Oct 08;3:e1319
pubmed: 26500826
BMC Med Genomics. 2018 Sep 14;11(Suppl 3):71
pubmed: 30255801
Bioinformatics. 2022 Mar 4;38(6):1761-1763
pubmed: 34935889
Nucleic Acids Res. 2022 Jan 7;50(D1):D1500-D1507
pubmed: 34747489
Genome Med. 2021 Jul 14;13(1):112
pubmed: 34261540
BMC Bioinformatics. 2017 Jan 3;18(1):6
pubmed: 28049410
PLoS Comput Biol. 2015 Sep 10;11(9):e1004385
pubmed: 26356732