EdgeScaping: Mapping the spatial distribution of pairwise gene expression intensities.


Journal

PloS one
ISSN: 1932-6203
Titre abrégé: PLoS One
Pays: United States
ID NLM: 101285081

Informations de publication

Date de publication:
2019
Historique:
received: 19 02 2019
accepted: 09 07 2019
entrez: 7 8 2019
pubmed: 7 8 2019
medline: 3 3 2020
Statut: epublish

Résumé

Gene co-expression networks (GCNs) are constructed from Gene Expression Matrices (GEMs) in a bottom up approach where all gene pairs are tested for correlation within the context of the input sample set. This approach is computationally intensive for many current GEMs and may not be scalable to millions of samples. Further, traditional GCNs do not detect non-linear relationships missed by correlation tests and do not place genetic relationships in a gene expression intensity context. In this report, we propose EdgeScaping, which constructs and analyzes the pairwise gene intensity network in a holistic, top down approach where no edges are filtered. EdgeScaping uses a novel technique to convert traditional pairwise gene expression data to an image based format. This conversion not only performs feature compression, making our algorithm highly scalable, but it also allows for exploring non-linear relationships between genes by leveraging deep learning image analysis algorithms. Using the learned embedded feature space we implement a fast, efficient algorithm to cluster the entire space of gene expression relationships while retaining gene expression intensity. Since EdgeScaping does not eliminate conventionally noisy edges, it extends the identification of co-expression relationships beyond classically correlated edges to facilitate the discovery of novel or unusual expression patterns within the network. We applied EdgeScaping to a human tumor GEM to identify sets of genes that exhibit conventional and non-conventional interdependent non-linear behavior associated with brain specific tumor sub-types that would be eliminated in conventional bottom-up construction of GCNs. Edgescaping source code is available at https://github.com/bhusain/EdgeScaping under the MIT license.

Identifiants

pubmed: 31386677
doi: 10.1371/journal.pone.0220279
pii: PONE-D-19-04825
pmc: PMC6684082
doi:

Types de publication

Journal Article Research Support, U.S. Gov't, Non-P.H.S.

Langues

eng

Sous-ensembles de citation

IM

Pagination

e0220279

Déclaration de conflit d'intérêts

The authors have declared that no competing interests exist.

Références

Nat Commun. 2014;5:3231
pubmed: 24488081
JAMA. 2016 Dec 13;316(22):2402-2410
pubmed: 27898976
PLoS One. 2013;8(2):e55871
pubmed: 23409071
Bioinformatics. 2013 Jun 15;29(12):1553-61
pubmed: 23613489
Nature. 2017 Jun 28;546(7660):686
pubmed: 28658222
PLoS One. 2015 Jan 20;10(1):e0116776
pubmed: 25602758
Nucleic Acids Res. 2015 Jan;43(Database issue):D1113-6
pubmed: 25361974
Sci Rep. 2017 Aug 17;7(1):8617
pubmed: 28819158
Nat Genet. 2013 Jun;45(6):580-5
pubmed: 23715323
Nucleic Acids Res. 2011 Jan;39(Database issue):D1005-10
pubmed: 21097893
J Biomol Screen. 2014 Jun;19(5):803-16
pubmed: 24518066
Med Image Anal. 2017 Dec;42:60-88
pubmed: 28778026
Nat Genet. 2013 Oct;45(10):1113-20
pubmed: 24071849
BMC Bioinformatics. 2008 Dec 29;9:559
pubmed: 19114008
Proc Natl Acad Sci U S A. 2000 Oct 24;97(22):12182-6
pubmed: 11027309
Nucleic Acids Res. 2009 Jul;37(Web Server issue):W305-11
pubmed: 19465376
BMC Syst Biol. 2016 Aug 01;10 Suppl 2:51
pubmed: 27490697
Carcinogenesis. 2013 Oct;34(10):2300-8
pubmed: 23740839
PLoS Biol. 2007 Jan;5(1):e8
pubmed: 17214507
Proc Natl Acad Sci U S A. 1998 Dec 8;95(25):14863-8
pubmed: 9843981
Nucleic Acids Res. 2002 Apr 1;30(7):1575-84
pubmed: 11917018
J Proteome Res. 2013 Jun 7;12(6):2354-64
pubmed: 23590569
IEEE Trans Med Imaging. 2016 May;35(5):1285-98
pubmed: 26886976
Nucleic Acids Res. 2013 Jan;41(Database issue):D171-6
pubmed: 23203885
IEEE Trans Pattern Anal Mach Intell. 2005 Aug;27(8):1226-38
pubmed: 16119262

Auteurs

Benafsh Husain (B)

Biomedical Data Science and Informatics Program, Clemson University, Clemson, SC United States of America.

F Alex Feltus (FA)

Biomedical Data Science and Informatics Program, Clemson University, Clemson, SC United States of America.
Genetics and Biochemistry Department, Clemson University, Clemson, SC United States of America.
Center for Human Genetics, Clemson University, Clemson, SC United States of America.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH