Optimized network based natural language processing approach to reveal disease comorbidities in COVID-19.


Journal

Scientific reports
ISSN: 2045-2322
Titre abrégé: Sci Rep
Pays: England
ID NLM: 101563288

Informations de publication

Date de publication:
28 Jan 2024
Historique:
received: 23 03 2023
accepted: 24 01 2024
medline: 29 1 2024
pubmed: 29 1 2024
entrez: 28 1 2024
Statut: epublish

Résumé

A novel virus emerged from Wuhan, China, at the end of 2019 and quickly evolved into a pandemic, significantly impacting various industries, especially healthcare. One critical lesson from COVID-19 is the importance of understanding and predicting underlying comorbidities to better prioritize care and pharmacological therapies. Factors like age, race, and comorbidity history are crucial in determining disease mortality. While clinical data from hospitals and cohorts have led to the identification of these comorbidities, traditional approaches often lack a mechanistic understanding of the connections between them. In response, we utilized a deep learning approach to integrate COVID-19 data with data from other diseases, aiming to detect comorbidities with mechanistic insights. Our modified algorithm in the mpDisNet package, based on word-embedding deep learning techniques, incorporates miRNA expression profiles from SARS-CoV-2 infected cell lines and their target transcription factors. This approach is aligned with the emerging field of network medicine, which seeks to define diseases based on distinct pathomechanisms rather than just phenotypes. The main aim is discovery of possible unknown comorbidities by connecting the diseases by their miRNA mediated regulatory interactions. The algorithm can predict the majority of COVID-19's known comorbidities, as well as several diseases that have yet to be discovered to be comorbid with COVID-19. These potentially comorbid diseases should be investigated further to raise awareness and prevention, as well as informing the comorbidity research for the next possible outbreak.

Identifiants

pubmed: 38282038
doi: 10.1038/s41598-024-52819-5
pii: 10.1038/s41598-024-52819-5
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

2325

Informations de copyright

© 2024. The Author(s).

Références

Porcheddu, R., Serra, C., Kelvin, D., Kelvin, N. & Rubino, S. Similarity in Case Fatality Rates (CFR) of COVID-19/SARS-COV-2 in Italy and China. J. Infect. Dev. Ctries. 14(02), 125–128. https://doi.org/10.3855/jidc.12600 (2020).
doi: 10.3855/jidc.12600 pubmed: 32146445
Sadegh, S. et al. Lacking mechanistic disease definitions and corresponding association data hamper progress in network medicine and beyond. Nat. Commun. 14(1), 1–15. https://doi.org/10.1038/s41467-023-37349-4 (2023).
doi: 10.1038/s41467-023-37349-4
Yang, J. et al. Molecular interaction and inhibition of SARS-CoV-2 binding to the ACE2 receptor. Nat. Commun. 11(1), 1–10. https://doi.org/10.1038/s41467-020-18319-6 (2020).
doi: 10.1038/s41467-020-18319-6
Spataro, N., Rodríguez, J. A., Navarro, A. & Bosch, E. Properties of human disease genes and the role of genes linked to Mendelian disorders in complex disease aetiology. Hum. Mol. Genet. 26(3), 489. https://doi.org/10.1093/HMG/DDW405 (2017).
doi: 10.1093/HMG/DDW405 pubmed: 28053046 pmcid: 5409085
Capobianco, E. & Lio, P. Comorbidity: A multidimensional approach. Trends Mol. Med. 19(9), 515–521. https://doi.org/10.1016/j.molmed.2013.07.004 (2013).
doi: 10.1016/j.molmed.2013.07.004 pubmed: 23948386
Keshava Prasad, T. S. et al. Human protein reference database—2009 update. Nucleic Acids Res. 37(Database issue), D767. https://doi.org/10.1093/NAR/GKN892 (2009).
doi: 10.1093/NAR/GKN892 pubmed: 18988627
Lee, D. S. et al. The implications of human metabolic network topology for disease comorbidity. Proc. Natl. Acad. Sci. 105(29), 9880–9885. https://doi.org/10.1073/PNAS.0802208105 (2008).
doi: 10.1073/PNAS.0802208105 pubmed: 18599447 pmcid: 2481357
Jin, S. et al. A network-based approach to uncover microRNA-mediated disease comorbidities and potential pathobiological implications. npj Syst. Biol. Appl. 5(1), 1–11. https://doi.org/10.1038/s41540-019-0115-2 (2019).
doi: 10.1038/s41540-019-0115-2
Jiang, Q. et al. miR2Disease: A manually curated database for microRNA deregulation in human disease. Nucleic Acids Res. 37(suppl_1), D98–D104. https://doi.org/10.1093/NAR/GKN714 (2009).
doi: 10.1093/NAR/GKN714 pubmed: 18927107
Hamosh, A., Scott, A. F., Amberger, J. S., Bocchini, C. A. & McKusick, V. A. Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders. Nucleic Acids Res 33(suppl_1), D514–D517. https://doi.org/10.1093/NAR/GKI033 (2005).
doi: 10.1093/NAR/GKI033 pubmed: 15608251
Hermjakob, H. et al. IntAct: An open source molecular interaction database. Nucleic Acids Res. 32(Database issue), D452. https://doi.org/10.1093/NAR/GKH052 (2004).
doi: 10.1093/NAR/GKH052 pubmed: 14681455 pmcid: 308786
Cowley, M. J. et al. PINA v2.0: Mining interactome modules. Nucleic Acids Res. 40(D1), D862–D865. https://doi.org/10.1093/NAR/GKR967 (2012).
doi: 10.1093/NAR/GKR967 pubmed: 22067443
Marbach, D. et al. Tissue-specific regulatory circuits reveal variable modular perturbations across complex diseases. Nat Methods 13(4), 366–370. https://doi.org/10.1038/NMETH.3799 (2016).
doi: 10.1038/NMETH.3799 pubmed: 26950747 pmcid: 4967716
Rouillard, A. D. et al. The harmonizome: A collection of processed datasets gathered to serve and mine knowledge about genes and proteins. Database https://doi.org/10.1093/DATABASE/BAW100 (2016).
doi: 10.1093/DATABASE/BAW100 pubmed: 27374120 pmcid: 4930834
Ko, Y., Cho, M., Lee, J.-S. & Kim, J. Identification of disease comorbidity through hidden molecular mechanisms. Sci. Rep. https://doi.org/10.1038/SREP39433 (2016).
doi: 10.1038/SREP39433 pubmed: 28009019 pmcid: 5180101
Hidalgo, C. A., Blumm, N., Barabási, A. L. & Christakis, N. A. A dynamic network approach for the study of human phenotypes. PLoS Comput. Biol. 5(4), e1000353. https://doi.org/10.1371/JOURNAL.PCBI.1000353 (2009).
doi: 10.1371/JOURNAL.PCBI.1000353 pubmed: 19360091 pmcid: 2661364
Li, B. et al. Scaling Word2Vec on big corpus. Data Sci. Eng. 4(2), 157–175. https://doi.org/10.1007/S41019-019-0096-6/FIGURES/16 (2019).
doi: 10.1007/S41019-019-0096-6/FIGURES/16
Li, Y. et al. HMDD v2.0: A database for experimentally supported human microRNA and disease associations. Nucleic Acids Res. 42(D1), D1070–D1074. https://doi.org/10.1093/NAR/GKT1023 (2014).
doi: 10.1093/NAR/GKT1023 pubmed: 24194601
Wyler, E. et al. Transcriptomic profiling of SARS-CoV-2 infected human cell lines identifies HSP90 as target for COVID-19 therapy. iScience 24(3), 102151. https://doi.org/10.1016/J.ISCI.2021.102151 (2021).
doi: 10.1016/J.ISCI.2021.102151 pubmed: 33585804 pmcid: 7866843
Mckinney, W. Data Structures for Statistical Computing in Python (2010)..
Waskom, M. L. seaborn: Statistical data visualization. J. Open Source Softw. 6(60), 3021. https://doi.org/10.21105/JOSS.03021 (2021).
doi: 10.21105/JOSS.03021
Margaretten, M., Julian, L., Katz, P. & Yelin, E. Depression in patients with rheumatoid arthritis: Description, causes and mechanisms. Int. J. Clin. Rheumtol. 6(6), 617–623. https://doi.org/10.2217/IJR.11.62 (2011).
doi: 10.2217/IJR.11.62 pubmed: 22211138 pmcid: 3247620
Chiu, H. Y., Hsieh, C. F., Chiang, Y. T., Huang, W. F. & Tsai, T. F. The risk of chronic pancreatitis in patients with psoriasis: A population-based cohort study. PLoS ONE https://doi.org/10.1371/JOURNAL.PONE.0160041 (2016).
doi: 10.1371/JOURNAL.PONE.0160041 pubmed: 27992461 pmcid: 5167320
Baerwald, C., Manger, B. & Hueber, A. Depression as comorbidity of rheumatoid arthritis. Z. Rheumatol. 78(3), 243–248. https://doi.org/10.1007/S00393-018-0568-5 (2019).
doi: 10.1007/S00393-018-0568-5 pubmed: 30488094
Semerdzhiev, S. A., Fakhree, M. A. A., Segers-Nolten, I., Blum, C. & Claessens, M. M. A. E. Interactions between SARS-CoV-2 N-protein and α-synuclein accelerate amyloid formation. ACS Chem. Neurosci. 13(1), 143–150. https://doi.org/10.1021/ACSCHEMNEURO.1C00666/ASSET/IMAGES/LARGE/CN1C00666_0005.JPEG (2022).
doi: 10.1021/ACSCHEMNEURO.1C00666/ASSET/IMAGES/LARGE/CN1C00666_0005.JPEG pubmed: 34860005

Auteurs

Emre Taylan Duman (ET)

Department of Bioengineering, Gebze Technical University, Kocaeli, Turkey. emretaylanduman@gmail.com.
NGS-Core Unit for Integrative Genomics, Institute of Pathology, University Medical Center Göttingen, Göttingen, Germany. emretaylanduman@gmail.com.

Gizem Tuna (G)

Department of Molecular Biology, Gebze Technical University, Kocaeli, Turkey.

Enes Ak (E)

Department of Bioengineering, Gebze Technical University, Kocaeli, Turkey.

Gülben Avsar (G)

Department of Bioengineering, Gebze Technical University, Kocaeli, Turkey.

Pinar Pir (P)

Department of Bioengineering, Gebze Technical University, Kocaeli, Turkey.

Classifications MeSH