Transforming RNA-Seq gene expression to track cancer progression in the multi-stage early to advanced-stage cancer development.


Journal

PloS one
ISSN: 1932-6203
Titre abrégé: PLoS One
Pays: United States
ID NLM: 101285081

Informations de publication

Date de publication:
2023
Historique:
received: 08 06 2022
accepted: 31 03 2023
medline: 26 4 2023
pubmed: 24 4 2023
entrez: 24 04 2023
Statut: epublish

Résumé

Cancer progression can be tracked by gene expression changes that occur throughout early-stage to advanced-stage cancer development. The accumulated genetic changes can be detected when gene expression levels in advanced-stage are less variable but show high variability in early-stage. Normalizing advanced-stage expression samples with early-stage and clustering of the normalized expression samples can reveal cancers with similar or different progression and provide insight into clinical and phenotypic patterns of patient samples within the same cancer. This study aims to investigate cancer progression through RNA-Seq expression profiles across the multi-stage process of cancer development. RNA-sequenced gene expression of Diffuse Large B-cell Lymphoma, Lung cancer, Liver cancer, Cervical cancer, and Testicular cancer were downloaded from the UCSC Xena database. Advanced-stage samples were normalized with early-stage samples to consider heterogeneity differences in the multi-stage cancer progression. WGCNA was used to build a gene network and categorized normalized genes into different modules. A gene set enrichment analysis selected key gene modules related to cancer. The diagnostic capacity of the modules was evaluated after hierarchical clustering. Unnormalized RNA-Seq gene expression failed to segregate advanced-stage samples based on selected cancer cohorts. Normalization with early-stage revealed the true heterogeneous gene expression that accumulates across the multi-stage cancer progression, this resulted in well segregated cancer samples. Cancer-specific pathways were enriched in the normalized WGCNA modules. The normalization method was further able to stratify patient samples based on phenotypic and clinical information. Additionally, the method allowed for patient survival analysis, with the Cox regression model selecting gene MAP4K1 in cervical cancer and Kaplan-Meier confirming that upregulation is favourable. The application of the normalization method further enhanced the accuracy of clustering of cancer samples based on how they progressed. Additionally, genes responsible for cancer progression were discovered.

Sections du résumé

BACKGROUND
Cancer progression can be tracked by gene expression changes that occur throughout early-stage to advanced-stage cancer development. The accumulated genetic changes can be detected when gene expression levels in advanced-stage are less variable but show high variability in early-stage. Normalizing advanced-stage expression samples with early-stage and clustering of the normalized expression samples can reveal cancers with similar or different progression and provide insight into clinical and phenotypic patterns of patient samples within the same cancer.
OBJECTIVE
This study aims to investigate cancer progression through RNA-Seq expression profiles across the multi-stage process of cancer development.
METHODS
RNA-sequenced gene expression of Diffuse Large B-cell Lymphoma, Lung cancer, Liver cancer, Cervical cancer, and Testicular cancer were downloaded from the UCSC Xena database. Advanced-stage samples were normalized with early-stage samples to consider heterogeneity differences in the multi-stage cancer progression. WGCNA was used to build a gene network and categorized normalized genes into different modules. A gene set enrichment analysis selected key gene modules related to cancer. The diagnostic capacity of the modules was evaluated after hierarchical clustering.
RESULTS
Unnormalized RNA-Seq gene expression failed to segregate advanced-stage samples based on selected cancer cohorts. Normalization with early-stage revealed the true heterogeneous gene expression that accumulates across the multi-stage cancer progression, this resulted in well segregated cancer samples. Cancer-specific pathways were enriched in the normalized WGCNA modules. The normalization method was further able to stratify patient samples based on phenotypic and clinical information. Additionally, the method allowed for patient survival analysis, with the Cox regression model selecting gene MAP4K1 in cervical cancer and Kaplan-Meier confirming that upregulation is favourable.
CONCLUSION
The application of the normalization method further enhanced the accuracy of clustering of cancer samples based on how they progressed. Additionally, genes responsible for cancer progression were discovered.

Identifiants

pubmed: 37093793
doi: 10.1371/journal.pone.0284458
pii: PONE-D-22-16548
pmc: PMC10124877
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

e0284458

Informations de copyright

Copyright: © 2023 Livesey et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Déclaration de conflit d'intérêts

The authors have declared that no competing interests exist.

Références

AIDS. 2008 Oct 18;22(16):2135-41
pubmed: 18832877
Curr Drug Targets. 2009 Jul;10(7):581-9
pubmed: 19601762
Biometrics. 1949 Jun;5(2):99-114
pubmed: 18151955
Mol Cell Biol. 2019 Jul 29;39(16):
pubmed: 31138663
Ther Adv Med Oncol. 2019 Jul 12;11:1758835919861914
pubmed: 31320937
Memo. 2017;10(4):248-254
pubmed: 29250206
Front Oncol. 2021 Apr 29;11:675353
pubmed: 33996608
Nucleic Acids Res. 2018 Jan 4;46(D1):D661-D667
pubmed: 29136241
J Stat Softw. 2011 Mar;39(5):1-13
pubmed: 27065756
Nat Rev Clin Oncol. 2018 Jun;15(6):353-365
pubmed: 29599476
J Acquir Immune Defic Syndr. 2009 Dec;52(5):611-22
pubmed: 19770804
Acta Oncol. 2019 Aug;58(8):1170-1177
pubmed: 31131659
J Stat Softw. 2010;33(1):1-22
pubmed: 20808728
J Biomed Inform. 2018 Sep;85:80-92
pubmed: 30041017
Nucleic Acids Res. 2013 Jul;41(Web Server issue):W77-83
pubmed: 23703215
BMC Bioinformatics. 2007 Jan 24;8:22
pubmed: 17250769
Cancer Med. 2021 Sep;10(18):6428-6441
pubmed: 34337875
Nucleic Acids Res. 2019 Jul 2;47(W1):W199-W205
pubmed: 31114916
Int J Clin Exp Pathol. 2019 Jan 01;12(1):1-20
pubmed: 31933717
Nucleic Acids Res. 2017 Jul 3;45(W1):W130-W137
pubmed: 28472511
Nucleic Acids Res. 2019 Jul 2;47(W1):W212-W224
pubmed: 31114921
PLoS One. 2009 Jul 30;4(7):e6447
pubmed: 19649250
Nat Biotechnol. 2020 Jun;38(6):675-678
pubmed: 32444850
Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W741-8
pubmed: 15980575
Nature. 2017 Oct 11;550(7675):204-213
pubmed: 29022597
J Cancer. 2020 Feb 10;11(9):2496-2508
pubmed: 32201520
Nucleic Acids Res. 2017 Jan 4;45(D1):D353-D361
pubmed: 27899662
Nat Rev Clin Oncol. 2018 Feb;15(2):81-94
pubmed: 29115304
Nat Rev Genet. 2009 Jan;10(1):57-63
pubmed: 19015660
Int J Mol Sci. 2021 May 22;22(11):
pubmed: 34067485
Science. 2017 Aug 18;357(6352):
pubmed: 28818916
Insights Imaging. 2015 Dec;6(6):679-92
pubmed: 26443451
Database (Oxford). 2016 Jun 23;2016:
pubmed: 27337980
PLoS Med. 2016 Feb 16;13(2):e1001961
pubmed: 26881778
Stat Appl Genet Mol Biol. 2005;4:Article17
pubmed: 16646834
J Gastrointest Oncol. 2021 Feb;12(1):153-173
pubmed: 33708433
BMC Bioinformatics. 2008 Dec 29;9:559
pubmed: 19114008
Nucleic Acids Res. 2015 Jul 1;43(W1):W589-98
pubmed: 25897122
J R Stat Soc Series B Stat Methodol. 2012 Mar;74(2):245-266
pubmed: 25506256
Expert Rev Anticancer Ther. 2013 Jan;13(1):29-42
pubmed: 23259425
BMC Bioinformatics. 2014;15 Suppl 2:S2
pubmed: 24564555
J Biopharm Stat. 2010 Mar;20(2):281-300
pubmed: 20309759
Am J Cancer Res. 2018 Aug 01;8(8):1343-1355
pubmed: 30210909
Cancer Epidemiol Biomarkers Prev. 2007 Jun;16(6):1266-9
pubmed: 17548695
Sci Rep. 2022 Jun 1;12(1):9099
pubmed: 35650297
J Cell Mol Med. 2020 Jan;24(1):418-430
pubmed: 31730267

Auteurs

Michelle Livesey (M)

South African Medical Research Council Bioinformatics Unit, South African National Bioinformatics Institute, University of the Western Cape, Cape Town, South Africa.

Sophia Catherine Rossouw (SC)

South African Medical Research Council Bioinformatics Unit, South African National Bioinformatics Institute, University of the Western Cape, Cape Town, South Africa.

Renette Blignaut (R)

Department of Statistics and Population Studies, University of the Western Cape, Cape Town, South Africa.

Alan Christoffels (A)

South African Medical Research Council Bioinformatics Unit, South African National Bioinformatics Institute, University of the Western Cape, Cape Town, South Africa.

Hocine Bendou (H)

South African Medical Research Council Bioinformatics Unit, South African National Bioinformatics Institute, University of the Western Cape, Cape Town, South Africa.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH