Identification of a Transcriptomic Prognostic Signature by Machine Learning Using a Combination of Small Cohorts of Prostate Cancer.
RNA-seq
biochemical recurrence
machine learning
predictive signature
prostate cancer
random forest
Journal
Frontiers in genetics
ISSN: 1664-8021
Titre abrégé: Front Genet
Pays: Switzerland
ID NLM: 101560621
Informations de publication
Date de publication:
2020
2020
Historique:
received:
05
06
2020
accepted:
29
10
2020
entrez:
16
12
2020
pubmed:
17
12
2020
medline:
17
12
2020
Statut:
epublish
Résumé
Determining which treatment to provide to men with prostate cancer (PCa) is a major challenge for clinicians. Currently, the clinical risk-stratification for PCa is based on clinico-pathological variables such as Gleason grade, stage and prostate specific antigen (PSA) levels. But transcriptomic data have the potential to enable the development of more precise approaches to predict evolution of the disease. However, high quality RNA sequencing (RNA-seq) datasets along with clinical data with long follow-up allowing discovery of biochemical recurrence (BCR) biomarkers are small and rare. In this study, we propose a machine learning approach that is robust to batch effect and enables the discovery of highly predictive signatures despite using small datasets. Gene expression data were extracted from three RNA-Seq datasets cumulating a total of 171 PCa patients. Data were re-analyzed using a unique pipeline to ensure uniformity. Using a machine learning approach, a total of 14 classifiers were tested with various parameters to identify the best model and gene signature to predict BCR. Using a random forest model, we have identified a signature composed of only three genes (JUN, HES4, PPDPF) predicting BCR with better accuracy [74.2%, balanced error rate (BER) = 27%] than the clinico-pathological variables (69.2%, BER = 32%) currently in use to predict PCa evolution. This score is in the range of the studies that predicted BCR in single-cohort with a higher number of patients. We showed that it is possible to merge and analyze different small and heterogeneous datasets altogether to obtain a better signature than if they were analyzed individually, thus reducing the need for very large cohorts. This study demonstrates the feasibility to regroup different small datasets in one larger to identify a predictive genomic signature that would benefit PCa patients.
Identifiants
pubmed: 33324443
doi: 10.3389/fgene.2020.550894
pmc: PMC7723980
doi:
Types de publication
Journal Article
Langues
eng
Pagination
550894Informations de copyright
Copyright © 2020 Vittrant, Leclercq, Martin-Magniette, Collins, Bergeron, Fradet and Droit.
Références
Oncotarget. 2017 May 16;8(20):32990-33001
pubmed: 28380430
Ther Adv Med Oncol. 2017 Aug;9(8):565-573
pubmed: 28794807
Oncogene. 1990 Jul;5(7):1055-8
pubmed: 2165232
J Mol Med (Berl). 2005 Dec;83(12):1014-24
pubmed: 16211407
Nature. 2010 Apr 15;464(7291):993-8
pubmed: 20393554
Prostate Cancer Prostatic Dis. 2016 Dec;19(4):395-397
pubmed: 27431496
Contemp Oncol (Pozn). 2015;19(1A):A68-77
pubmed: 25691825
Methods Mol Biol. 2010;593:25-48
pubmed: 19957143
PLoS Biol. 2015 Jul 07;13(7):e1002195
pubmed: 26151137
Prostate. 2013 Mar;73(4):382-90
pubmed: 22926970
Clin Cancer Res. 2008 Jul 1;14(13):4059-66
pubmed: 18593982
PLoS Genet. 2018 Apr 16;14(4):e1007355
pubmed: 29659569
Am J Cancer Res. 2018 Aug 01;8(8):1403-1413
pubmed: 30210912
Lab Invest. 2005 Jan;85(1):154-9
pubmed: 15543203
Lancet Oncol. 2018 May;19(5):705-714
pubmed: 29606586
PLoS One. 2017 Oct 4;12(10):e0184741
pubmed: 28977016
Eur Urol. 2017 Jul;72(1):22-31
pubmed: 27815082
Biomark Cancer. 2016 May 05;8(Suppl 2):15-33
pubmed: 27168728
PLoS One. 2014 Dec 31;9(12):e115892
pubmed: 25551575
Br J Cancer. 2009 May 19;100(10):1603-7
pubmed: 19401683
Adv Cancer Res. 1990;55:1-35
pubmed: 2166997
Oncotarget. 2016 May 24;7(21):30760-71
pubmed: 27120795
J Clin Oncol. 2012 May 20;30(15):1857-63
pubmed: 22508816
Bioinformatics. 2014 Aug 1;30(15):2114-20
pubmed: 24695404
Nat Biotechnol. 2016 May;34(5):525-7
pubmed: 27043002
Genome Biol. 2014 Aug 26;15(8):426
pubmed: 25155515
Carcinogenesis. 2020 May 14;41(3):267-273
pubmed: 31408512
Cytotechnology. 2011 Dec;63(6):645-54
pubmed: 21850463
Oncol Lett. 2019 Nov;18(5):4907-4915
pubmed: 31612001
J Interferon Cytokine Res. 2016 Dec;36(12):698-705
pubmed: 27726464
Nucleic Acids Res. 2012 Jan;40(Database issue):D1060-6
pubmed: 22110038
PeerJ. 2020 Jan 3;8:e8312
pubmed: 31921517
Comput Struct Biotechnol J. 2014 Nov 15;13:8-17
pubmed: 25750696
J Cancer. 2018 Apr 30;9(11):1989-2002
pubmed: 29896284
Endocr Relat Cancer. 2018 May;25(5):569-581
pubmed: 29592867
Oncogenesis. 2013 Apr 08;2:e43
pubmed: 23567620
F1000Res. 2015 Dec 30;4:1521
pubmed: 26925227
J Clin Oncol. 2003 Apr 1;21(7):1232-7
pubmed: 12663709
Oral Oncol. 2019 Jul;94:115-120
pubmed: 31178206
Cancer Res. 1980 Jul;40(7):2428-32
pubmed: 7388802
Proc Natl Acad Sci U S A. 1987 May;84(9):2848-52
pubmed: 3033666
Int J Mol Sci. 2018 May 04;19(5):
pubmed: 29734647
Adv Bioinformatics. 2015;2015:198363
pubmed: 26170834
Ann Surg Oncol. 2010 Jun;17(6):1471-4
pubmed: 20180029
Cancer Cell. 2007 Apr;11(4):361-74
pubmed: 17418412
Nucleic Acids Res. 2019 Jan 8;47(D1):D607-D613
pubmed: 30476243
Tumour Biol. 2015 Aug;36(8):5891-9
pubmed: 25712376
Oncotarget. 2017 Mar 14;8(11):17862-17872
pubmed: 28160568
Tumour Biol. 2015 Jan;36(1):219-25
pubmed: 25230788
N Engl J Med. 2004 Oct 7;351(15):1502-12
pubmed: 15470213
BJU Int. 2008 Aug 5;102(5):628-32
pubmed: 18410441
Int J Cancer. 2015 Mar 15;136(6):E569-77
pubmed: 25220908
Cell. 2015 Nov 5;163(4):1011-25
pubmed: 26544944
Oncotarget. 2016 Oct 25;7(43):69991-69999
pubmed: 27588397
Nucleic Acids Res. 2015 Jul 1;43(W1):W589-98
pubmed: 25897122
J Clin Oncol. 2003 Jun 1;21(11):2163-72
pubmed: 12775742
Biostatistics. 2012 Jul;13(3):539-52
pubmed: 22101192
Nature. 2013 Jun 13;498(7453):255-60
pubmed: 23765498
Artif Intell Med. 2012 May;55(1):25-35
pubmed: 22206941
Lancet Oncol. 2014 Dec;15(13):1521-1532
pubmed: 25456371
J Cancer. 2016 Sep 27;7(14):1960-1967
pubmed: 27877211
Sci Rep. 2018 Apr 27;8(1):6653
pubmed: 29703916
Biomark Insights. 2015 Nov 30;10:103-12
pubmed: 26648682
Front Oncol. 2019 Nov 12;9:1243
pubmed: 31803620
J Cancer Res Clin Oncol. 2018 May;144(5):883-891
pubmed: 29511883
Brief Bioinform. 2018 Mar 1;19(2):325-340
pubmed: 28011753
Nat Biotechnol. 2014 Sep;32(9):896-902
pubmed: 25150836
Pediatr Blood Cancer. 2017 May;64(5):
pubmed: 27786411
CA Cancer J Clin. 2017 Jan;67(1):7-30
pubmed: 28055103
Evol Comput. 2012 Summer;20(2):249-75
pubmed: 22339368
Nat Rev Cancer. 2019 Mar;19(3):133-150
pubmed: 30755690
Sci Rep. 2017 Jul 17;7(1):5517
pubmed: 28717245
Aging (Albany NY). 2016 Oct 5;8(11):2702-2712
pubmed: 27705925
PLoS One. 2018 Mar 27;13(3):e0194889
pubmed: 29584784
Artif Intell Med. 2000 Aug;20(1):59-75
pubmed: 11185421
Sci Rep. 2018 Aug 13;8(1):12054
pubmed: 30104757
Nucleic Acids Res. 2009 Jul;37(Web Server issue):W305-11
pubmed: 19465376
Cancer Res. 2010 Feb 15;70(4):1469-78
pubmed: 20145124
Database (Oxford). 2011 Jul 23;2011:bar030
pubmed: 21785142
Dev Biol. 1999 Nov 15;215(2):264-77
pubmed: 10545236
Anticancer Res. 2018 Mar;38(3):1471-1477
pubmed: 29491074
Cancer Res. 2014 Jun 15;74(12):3228-37
pubmed: 24713434