Joint analysis of expression levels and histological images identifies genes associated with tissue morphology.


Journal

Nature communications
ISSN: 2041-1723
Titre abrégé: Nat Commun
Pays: England
ID NLM: 101528555

Informations de publication

Date de publication:
11 03 2021
Historique:
received: 21 08 2017
accepted: 05 02 2021
entrez: 12 3 2021
pubmed: 13 3 2021
medline: 25 3 2021
Statut: epublish

Résumé

Histopathological images are used to characterize complex phenotypes such as tumor stage. Our goal is to associate features of stained tissue images with high-dimensional genomic markers. We use convolutional autoencoders and sparse canonical correlation analysis (CCA) on paired histological images and bulk gene expression to identify subsets of genes whose expression levels in a tissue sample correlate with subsets of morphological features from the corresponding sample image. We apply our approach, ImageCCA, to two TCGA data sets, and find gene sets associated with the structure of the extracellular matrix and cell wall infrastructure, implicating uncharacterized genes in extracellular processes. We find sets of genes associated with specific cell types, including neuronal cells and cells of the immune system. We apply ImageCCA to the GTEx v6 data, and find image features that capture population variation in thyroid and in colon tissues associated with genetic variants (image morphology QTLs, or imQTLs), suggesting that genetic variation regulates population variation in tissue morphological traits.

Identifiants

pubmed: 33707455
doi: 10.1038/s41467-021-21727-x
pii: 10.1038/s41467-021-21727-x
pmc: PMC7952575
doi:

Substances chimiques

BRCA1 Protein 0
BRCA1 protein, human 0
Biomarkers, Tumor 0

Types de publication

Journal Article Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov't Research Support, U.S. Gov't, Non-P.H.S.

Langues

eng

Sous-ensembles de citation

IM

Pagination

1609

Subventions

Organisme : NHLBI NIH HHS
ID : R01 HL133218
Pays : United States

Références

IEEE Trans Biomed Eng. 2014 May;61(5):1400-11
pubmed: 24759275
Science. 2015 Jan 23;347(6220):1260419
pubmed: 25613900
Science. 2015 Apr 24;348(6233):aaa6090
pubmed: 25858977
Nature. 2002 Jan 31;415(6871):530-6
pubmed: 11823860
J Am Med Inform Assoc. 2013 Nov-Dec;20(6):1099-108
pubmed: 23959844
Nat Genet. 2000 May;25(1):25-9
pubmed: 10802651
Genome Res. 2017 Feb;27(2):196-207
pubmed: 27864353
Sci Transl Med. 2011 Nov 9;3(108):108ra113
pubmed: 22072638
J Cell Biol. 2010 May 31;189(5):777-82
pubmed: 20513764
J Pathol Inform. 2015 Mar 24;6:15
pubmed: 25838967
PLoS Biol. 2007 May;5(5):e97
pubmed: 17425406
Br J Cancer. 2009 Jul 7;101(1):132-8
pubmed: 19536094
EMBO Mol Med. 2018 Sep;10(9):
pubmed: 30108113
Nature. 2010 Apr 1;464(7289):768-72
pubmed: 20220758
Arch Pathol Lab Med. 2000 Jul;124(7):966-78
pubmed: 10888772
PLoS One. 2013 Jul 29;8(7):e70221
pubmed: 23922958
Bioinformatics. 2012 May 15;28(10):1353-8
pubmed: 22492648
Nature. 2012 Oct 4;490(7418):61-70
pubmed: 23000897
PLoS Comput Biol. 2010 May 06;6(5):e1000770
pubmed: 20463871
Diagn Pathol. 2012 Jun 20;7:42
pubmed: 22515559
Nature. 2017 Oct 11;550(7675):204-213
pubmed: 29022597
Nature. 2013 Oct 17;502(7471):377-80
pubmed: 23995691
BMC Bioinformatics. 2011 Aug 04;12:323
pubmed: 21816040
Sci Rep. 2020 Apr 14;10(1):6423
pubmed: 32286358
BMC Bioinformatics. 2020 Jul 21;21(1):324
pubmed: 32693778
Nat Rev Genet. 2007 Sep;8(9):689-98
pubmed: 17680007
Neuron. 2017 May 17;94(4):752-758.e1
pubmed: 28521130
Nature. 2007 Jun 7;447(7145):661-78
pubmed: 17554300
Int J Mol Sci. 2017 Mar 28;18(4):
pubmed: 28350360
PLoS Genet. 2007 Sep;3(9):1724-35
pubmed: 17907809
N Engl J Med. 2015 Jun 25;372(26):2481-98
pubmed: 26061751
Biostatistics. 2009 Jul;10(3):515-34
pubmed: 19377034
Biomaterials. 2014 Jan;35(3):961-9
pubmed: 24183171

Auteurs

Jordan T Ash (JT)

Department of Computer Science, Princeton University, Princeton, NJ, USA.

Gregory Darnell (G)

Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA.

Daniel Munro (D)

Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA.

Barbara E Engelhardt (BE)

Department of Computer Science, Princeton University, Princeton, NJ, USA. bee@princeton.edu.
Center for Statistics and Machine Learning, Princeton University, Princeton, NJ, USA. bee@princeton.edu.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH