Walking pathways with positive feedback loops reveal DNA methylation biomarkers of colorectal cancer.
Binding Sites
/ genetics
Biomarkers, Tumor
/ genetics
Colorectal Neoplasms
/ diagnosis
CpG Islands
/ genetics
DNA Methylation
/ genetics
Epigenesis, Genetic
Feedback, Physiological
Female
Gene Expression Profiling
Gene Expression Regulation, Neoplastic
Humans
Male
Middle Aged
Neoplasm Staging
Signal Transduction
/ genetics
Transcription Factors
/ metabolism
Circulating DNA
Colorectal cancer
DNA methylation
Genetic algorithm
Multi-omics analysis
Prognostic biomarkers
Signal transduction
Transcription factor binding sites
Journal
BMC bioinformatics
ISSN: 1471-2105
Titre abrégé: BMC Bioinformatics
Pays: England
ID NLM: 100965194
Informations de publication
Date de publication:
18 Apr 2019
18 Apr 2019
Historique:
entrez:
20
4
2019
pubmed:
20
4
2019
medline:
15
6
2019
Statut:
epublish
Résumé
The search for molecular biomarkers of early-onset colorectal cancer (CRC) is an important but still quite challenging and unsolved task. Detection of CpG methylation in human DNA obtained from blood or stool has been proposed as a promising approach to a noninvasive early diagnosis of CRC. Thousands of abnormally methylated CpG positions in CRC genomes are often located in non-coding parts of genes. Novel bioinformatic methods are thus urgently needed for multi-omics data analysis to reveal causative biomarkers with a potential driver role in early stages of cancer. We have developed a method for finding potential causal relationships between epigenetic changes (DNA methylations) in gene regulatory regions that affect transcription factor binding sites (TFBS) and gene expression changes. This method also considers the topology of the involved signal transduction pathways and searches for positive feedback loops that may cause the carcinogenic aberrations in gene expression. We call this method "Walking pathways", since it searches for potential rewiring mechanisms in cancer pathways due to dynamic changes in the DNA methylation status of important gene regulatory regions ("epigenomic walking"). In this paper, we analysed an extensive collection of full genome gene-expression data (RNA-seq) and DNA methylation data of genomic CpG islands (using Illumina methylation arrays) generated from a sample of tumor and normal gut epithelial tissues of 300 patients with colorectal cancer (at different stages of the disease) (data generated in the EU-supported SysCol project). Identification of potential epigenetic biomarkers of DNA methylation was performed using the fully automatic multi-omics analysis web service "My Genome Enhancer" (MGE) (my-genome-enhancer.com). MGE uses the database on gene regulation TRANSFAC®, the signal transduction pathways database TRANSPATH®, and software that employs AI (artificial intelligence) methods for the analysis of cancer-specific enhancers. The identified biomarkers underwent experimental testing on an independent set of blood samples from patients with colorectal cancer. As a result, using advanced methods of statistics and machine learning, a minimum set of 6 biomarkers was selected, which together achieve the best cancer detection potential. The markers include hypermethylated positions in regulatory regions of the following genes: CALCA, ENO1, MYC, PDX1, TCF7, ZNF43.
Sections du résumé
BACKGROUND
BACKGROUND
The search for molecular biomarkers of early-onset colorectal cancer (CRC) is an important but still quite challenging and unsolved task. Detection of CpG methylation in human DNA obtained from blood or stool has been proposed as a promising approach to a noninvasive early diagnosis of CRC. Thousands of abnormally methylated CpG positions in CRC genomes are often located in non-coding parts of genes. Novel bioinformatic methods are thus urgently needed for multi-omics data analysis to reveal causative biomarkers with a potential driver role in early stages of cancer.
METHODS
METHODS
We have developed a method for finding potential causal relationships between epigenetic changes (DNA methylations) in gene regulatory regions that affect transcription factor binding sites (TFBS) and gene expression changes. This method also considers the topology of the involved signal transduction pathways and searches for positive feedback loops that may cause the carcinogenic aberrations in gene expression. We call this method "Walking pathways", since it searches for potential rewiring mechanisms in cancer pathways due to dynamic changes in the DNA methylation status of important gene regulatory regions ("epigenomic walking").
RESULTS
RESULTS
In this paper, we analysed an extensive collection of full genome gene-expression data (RNA-seq) and DNA methylation data of genomic CpG islands (using Illumina methylation arrays) generated from a sample of tumor and normal gut epithelial tissues of 300 patients with colorectal cancer (at different stages of the disease) (data generated in the EU-supported SysCol project). Identification of potential epigenetic biomarkers of DNA methylation was performed using the fully automatic multi-omics analysis web service "My Genome Enhancer" (MGE) (my-genome-enhancer.com). MGE uses the database on gene regulation TRANSFAC®, the signal transduction pathways database TRANSPATH®, and software that employs AI (artificial intelligence) methods for the analysis of cancer-specific enhancers.
CONCLUSIONS
CONCLUSIONS
The identified biomarkers underwent experimental testing on an independent set of blood samples from patients with colorectal cancer. As a result, using advanced methods of statistics and machine learning, a minimum set of 6 biomarkers was selected, which together achieve the best cancer detection potential. The markers include hypermethylated positions in regulatory regions of the following genes: CALCA, ENO1, MYC, PDX1, TCF7, ZNF43.
Identifiants
pubmed: 30999858
doi: 10.1186/s12859-019-2687-7
pii: 10.1186/s12859-019-2687-7
pmc: PMC6471696
doi:
Substances chimiques
Biomarkers, Tumor
0
Transcription Factors
0
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
119Références
Cancer Lett. 2002 Dec 15;188(1-2):115-9
pubmed: 12406556
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D108-10
pubmed: 16381825
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D546-51
pubmed: 16381929
Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W541-5
pubmed: 16845066
BMC Bioinformatics. 2006 Sep 06;7 Suppl 2:S13
pubmed: 17118134
Clin Chem. 2009 Jul;55(7):1337-46
pubmed: 19406918
Expert Rev Mol Diagn. 2010 May;10(4):481-8
pubmed: 20465502
BMC Genomics. 2014 Mar 26;15:119
pubmed: 24669864
Nature. 2014 Aug 7;512(7512):87-90
pubmed: 25079323
Biomark Med. 2014;8(5):755-69
pubmed: 25123042
Genes (Basel). 2014 Sep 16;5(3):821-64
pubmed: 25229548
Endocr Relat Cancer. 2016 Mar;23(3):R157-71
pubmed: 26764421
Epigenomics. 2016 May;8(5):685-703
pubmed: 27102979
Nucleic Acids Res. 2016 Jul 27;44(13):6055-69
pubmed: 27288444
Microarrays (Basel). 2015 May 21;4(2):270-86
pubmed: 27600225
Science. 2017 May 5;356(6337):
pubmed: 28473536
Cell Rep. 2017 May 9;19(6):1268-1280
pubmed: 28494874
BMC Med Genomics. 2018 Feb 13;11(Suppl 1):12
pubmed: 29504919
EuPA Open Proteom. 2016 Sep 09;13:1-13
pubmed: 29900117