RNAseqCNV: analysis of large-scale copy number variations from RNA-seq data.


Journal

Leukemia
ISSN: 1476-5551
Titre abrégé: Leukemia
Pays: England
ID NLM: 8704895

Informations de publication

Date de publication:
06 2022
Historique:
received: 12 11 2021
accepted: 11 03 2022
revised: 03 03 2022
pubmed: 31 3 2022
medline: 7 6 2022
entrez: 30 3 2022
Statut: ppublish

Résumé

Transcriptome sequencing (RNA-seq) is widely used to detect gene rearrangements and quantitate gene expression in acute lymphoblastic leukemia (ALL), but its utility and accuracy in identifying copy number variations (CNVs) has not been well described. CNV information inferred from RNA-seq can be highly informative to guide disease classification and risk stratification in ALL due to the high incidence of aneuploid subtypes within this disease. Here we describe RNAseqCNV, a method to detect large scale CNVs from RNA-seq data. We used models based on normalized gene expression and minor allele frequency to classify arm level CNVs with high accuracy in ALL (99.1% overall and 98.3% for non-diploid chromosome arms, respectively), and the models were further validated with excellent performance in acute myeloid leukemia (accuracy 99.8% overall and 99.4% for non-diploid chromosome arms). RNAseqCNV outperforms alternative RNA-seq based algorithms in calling CNVs in the ALL dataset, especially in samples with a high proportion of CNVs. The CNV calls were highly concordant with DNA-based CNV results and more reliable than conventional cytogenetic-based karyotypes. RNAseqCNV provides a method to robustly identify copy number alterations in the absence of DNA-based analyses, further enhancing the utility of RNA-seq to classify ALL subtype.

Identifiants

pubmed: 35351983
doi: 10.1038/s41375-022-01547-8
pii: 10.1038/s41375-022-01547-8
pmc: PMC9177690
mid: NIHMS1788556
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't Research Support, N.I.H., Extramural

Langues

eng

Sous-ensembles de citation

IM

Pagination

1492-1498

Subventions

Organisme : NCI NIH HHS
ID : R35 CA197695
Pays : United States
Organisme : NCI NIH HHS
ID : K99 CA241297
Pays : United States
Organisme : NCI NIH HHS
ID : P30 CA021765
Pays : United States
Organisme : NCI NIH HHS
ID : R00 CA241297
Pays : United States
Organisme : NIGMS NIH HHS
ID : P50 GM115279
Pays : United States

Informations de copyright

© 2022. The Author(s), under exclusive licence to Springer Nature Limited.

Références

Li JF, Dai YT, Lilljebjorn H, Shen SH, Cui BW, Bai L, et al. Transcriptional landscape of B cell precursor acute lymphoblastic leukemia based on an international study of 1223 cases. Proc Natl Acad Sci USA. 2018;115:E11711–E11720. Dec 11
pubmed: 30487223 pmcid: 6294900
Gu Z, Churchman ML, Roberts KG, Moore I, Zhou X, Nakitandwe J, et al. PAX5-driven subtypes of B-progenitor acute lymphoblastic leukemia. Nat Genet. 2019;51:296–307. Jan 14
doi: 10.1038/s41588-018-0315-5
Liu Y, Easton J, Shao Y, Maciaszek J, Wang Z, Wilkinson MR, et al. The genomic landscape of pediatric and young adult T-lineage acute lymphoblastic leukemia. Nature genetics 2017 Jul 03.
Talevich E, Shain AH CNVkit-RNA: Copy number inference from RNA-Sequencing data. bioRxiv 2018: 408534.
Serin Harmanci A, Harmanci AO, Zhou X. CaSpER identifies and visualizes CNV events by integrative analysis of single-cell or bulk RNA-sequencing data. Nat Commun. 2020;11:89. Jan 3
doi: 10.1038/s41467-019-13779-x
Iacobucci I, Mullighan CG. Genetic basis of acute lymphoblastic leukemia. J Clin Oncol: Off J Am Soc Clin Oncol. 2017;35:975–83. Mar 20
doi: 10.1200/JCO.2016.70.7836
Inaba H, Azzato EM, Mullighan CG. Integration of next-generation sequencing to treat acute lymphoblastic leukemia with targetable lesions: The St. Jude Children’s Research Hospital Approach. Front Pediatr. 2017;5:258.
doi: 10.3389/fped.2017.00258
Chen X, Gupta P, Wang J, Nakitandwe J, Roberts K, Dalton JD, et al. CONSERTING: integrating copy-number analysis with structural-variation detection. Nat methods. 2015;12:527–30. Jun
doi: 10.1038/nmeth.3394
Boeva V, Popova T, Bleakley K, Chiche P, Cappo J, Schleiermacher G, et al. Control-FREEC: a tool for assessing copy number and allelic content using next-generation sequencing data. Bioinformatics. 2012;28:423–5. Feb 01
doi: 10.1093/bioinformatics/btr670
Yau C, Mouradov D, Jorissen RN, Colella S, Mirza G, Steers G, et al. A statistical approach for detecting genomic aberrations in heterogeneous tumor samples from single nucleotide polymorphism genotyping data. Genome Biol. 2010;11:R92.
doi: 10.1186/gb-2010-11-9-r92
Wang K, Li M, Hadley D, Liu R, Glessner J, Grant SF, et al. PennCNV: an integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data. Genome Res. 2007;17:1665–74. Nov
doi: 10.1101/gr.6861907
Mayrhofer M, Viklund B, Isaksson A. Rawcopy: improved copy number analysis with Affymetrix arrays. Sci Rep. 2016;6:36158. Oct 31
doi: 10.1038/srep36158
Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550.
doi: 10.1186/s13059-014-0550-8
McLeod C, Gout AM, Zhou X, Thrasher A, Rahbarinia D, Brady SW, et al. St. Jude cloud: a pediatric cancer genomic data-sharing ecosystem. Cancer Discov. 2021;11:1082–99. May
doi: 10.1158/2159-8290.CD-20-1230
Kuhn M. Building predictive models in R using the caret package. J Stat Softw. 2008. 2008 2008-11-10;28: 6.
Flensburg C, Sargeant T, Oshlack A, Majewski IJ. SuperFreq: Integrated mutation detection and clonal tracking in cancer. PLoS computational Biol. 2020;16:e1007603. Feb
doi: 10.1371/journal.pcbi.1007603
Ma SK, Chan GC, Wan TS, Lam CK, Ha SY, Lau YL, et al. Near-haploid common acute lymphoblastic leukaemia of childhood with a second hyperdiploid line: a DNA ploidy and fluorescence in-situ hybridization study. Br J Haematol. 1998;103:750–5. Dec
doi: 10.1046/j.1365-2141.1998.01044.x

Auteurs

Jan Bařinka (J)

Childhood Leukemia Investigation Prague (CLIP), 2nd Faculty of Medicine, Charles University and University Hospital Motol, Prague, Czech Republic.

Zunsong Hu (Z)

Department of Computational and Quantitative Medicine & Systems Biology, Beckman Research Institute of City of Hope, Duarte, CA, 91010, USA.

Lu Wang (L)

Department of Pathology, St. Jude Children's Research Hospital, Memphis, TN, 38105, USA.

David A Wheeler (DA)

Department of Computational Biology, St. Jude Children's Research Hospital, Memphis, TN, 38105, USA.

Delaram Rahbarinia (D)

Department of Computational Biology, St. Jude Children's Research Hospital, Memphis, TN, 38105, USA.

Clay McLeod (C)

Department of Computational Biology, St. Jude Children's Research Hospital, Memphis, TN, 38105, USA.

Zhaohui Gu (Z)

Department of Computational and Quantitative Medicine & Systems Biology, Beckman Research Institute of City of Hope, Duarte, CA, 91010, USA. zgu@coh.org.

Charles G Mullighan (CG)

Department of Pathology, St. Jude Children's Research Hospital, Memphis, TN, 38105, USA. charles.mullighan@stjude.org.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C

Classifications MeSH