Data Integration-Possibilities of Molecular and Clinical Data Fusion on the Example of Thyroid Cancer Diagnostics.
bioinformatics
biomarkers
cancer
classification
data fusion
data integration
thyroid cancer
Journal
International journal of molecular sciences
ISSN: 1422-0067
Abbreviated title: Int J Mol Sci
Country: Switzerland
NLM ID: 101092791
Publication information
Publication date:
06 Oct 2022
History:
received: 03 Aug 2022
revised: 24 Sep 2022
accepted: 28 Sep 2022
entrez: 14 Oct 2022
pubmed: 15 Oct 2022
medline: 18 Oct 2022
Status:
epublish
Abstract
(1) Background: Data from independent gene expression sources may be integrated for the molecular diagnostics of cancer. Multiple approaches have been described so far. Here, we investigated the impact of different data fusion strategies on classification accuracy and feature selection stability, which allow the costs of diagnostic tests to be reduced. (2) Methods: We used molecular features (gene expression) combined with a feature extracted from independent clinical data describing a patient's sample. We considered the dependencies between selected features in two data fusion strategies (early fusion and late fusion) and compared them to classification models based on molecular features only. We compared the most accurate classification models in terms of the number of features, which is connected to the potential cost reduction of the diagnostic classifier. (3) Results: We show that for thyroid cancer, the extracted clinical feature is correlated with (but not redundant to) the molecular data. Data fusion yields a model with similar or even higher classification quality (with a statistically significant accuracy improvement, a
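The distinction between early and late fusion named in the Methods can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the nearest-centroid classifier, the score averaging used for late fusion, the toy values, and the labels "benign" and "malignant" are all assumptions chosen only to make the two strategies concrete.

```python
from math import dist, exp

def fit_centroids(X, y):
    """Return {label: centroid} for the rows of X grouped by label."""
    groups = {}
    for row, label in zip(X, y):
        groups.setdefault(label, []).append(row)
    return {label: [sum(col) / len(rows) for col in zip(*rows)]
            for label, rows in groups.items()}

def class_scores(centroids, row):
    """Soft scores per class: normalized exp(-distance to centroid)."""
    weights = {label: exp(-dist(row, c)) for label, c in centroids.items()}
    total = sum(weights.values())
    return {label: w / total for label, w in weights.items()}

def predict_early(mol_train, clin_train, y, mol_row, clin_value):
    """Early fusion: concatenate both sources, then train one model."""
    X = [m + [c] for m, c in zip(mol_train, clin_train)]
    scores = class_scores(fit_centroids(X, y), mol_row + [clin_value])
    return max(scores, key=scores.get)

def predict_late(mol_train, clin_train, y, mol_row, clin_value):
    """Late fusion: one model per source, then average their scores."""
    s_mol = class_scores(fit_centroids(mol_train, y), mol_row)
    s_clin = class_scores(fit_centroids([[c] for c in clin_train], y),
                          [clin_value])
    fused = {label: (s_mol[label] + s_clin[label]) / 2 for label in s_mol}
    return max(fused, key=fused.get)

# Hypothetical toy data: two "gene expression" values plus one
# clinical feature per sample. None of these numbers come from the study.
mol_train = [[1.0, 0.2], [0.9, 0.1], [0.1, 0.8], [0.2, 1.0]]
clin_train = [0.0, 0.1, 0.9, 1.0]
labels = ["benign", "benign", "malignant", "malignant"]

early = predict_early(mol_train, clin_train, labels, [0.15, 0.9], 0.95)
late = predict_late(mol_train, clin_train, labels, [0.15, 0.9], 0.95)
```

In early fusion the clinical feature enters the same feature space as the molecular features, so its scale relative to them matters; in late fusion each source is modeled separately and only the decisions are combined.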
Identifiers
pubmed: 36233181
pii: ijms231911880
doi: 10.3390/ijms231911880
pmc: PMC9569592
Publication types
Journal Article
Languages
eng
Citation subsets
IM
Grants
Agency: Silesian University of Technology
ID: 02/040/BK_22/1022
Agency: The National Center for Research and Development
ID: STRATEGMED2/267398/4/NCBR/2015
Agency: Polish Ministry of Science and Higher Education, the Implementation Doctorate program at the Silesian University of Technology, Gliwice, Poland
ID: 10/DW/2017/01/1