Structural variation and fusion detection using targeted sequencing data from circulating cell free DNA.
Journal
Nucleic acids research
ISSN: 1362-4962
Titre abrégé: Nucleic Acids Res
Pays: England
ID NLM: 0411011
Informations de publication
Date de publication:
23 04 2019
23 04 2019
Historique:
accepted:
01
02
2019
revised:
15
12
2018
received:
18
07
2018
pubmed:
14
2
2019
medline:
29
10
2019
entrez:
14
2
2019
Statut:
ppublish
Résumé
Cancer is a complex disease that involves rapidly evolving cells, often forming multiple distinct clones. In order to effectively understand progression of a patient-specific tumor, one needs to comprehensively sample tumor DNA at multiple time points, ideally obtained through inexpensive and minimally invasive techniques. Current sequencing technologies make the 'liquid biopsy' possible, which involves sampling a patient's blood or urine and sequencing the circulating cell free DNA (cfDNA). A certain percentage of this DNA originates from the tumor, known as circulating tumor DNA (ctDNA). The ratio of ctDNA may be extremely low in the sample, and the ctDNA may originate from multiple tumors or clones. These factors present unique challenges for applying existing tools and workflows to the analysis of ctDNA, especially in the detection of structural variations which rely on sufficient read coverage to be detectable. Here we introduce SViCT , a structural variation (SV) detection tool designed to handle the challenges associated with cfDNA analysis. SViCT can detect breakpoints and sequences of various structural variations including deletions, insertions, inversions, duplications and translocations. SViCT extracts discordant read pairs, one-end anchors and soft-clipped/split reads, assembles them into contigs, and re-maps contig intervals to a reference genome using an efficient k-mer indexing approach. The intervals are then joined using a combination of graph and greedy algorithms to identify specific structural variant signatures. We assessed the performance of SViCT and compared it to state-of-the-art tools using simulated cfDNA datasets with properties matching those of real cfDNA samples. The positive predictive value and sensitivity of our tool was superior to all the tested tools and reasonable performance was maintained down to the lowest dilution of 0.01% tumor DNA in simulated datasets. Additionally, SViCT was able to detect all known SVs in two real cfDNA reference datasets (at 0.6-5% ctDNA) and predict a novel structural variant in a prostate cancer cohort. SViCT is available at https://github.com/vpc-ccg/svict. Contact:faraz.hach@ubc.ca.
Identifiants
pubmed: 30759232
pii: 5316733
doi: 10.1093/nar/gkz067
pmc: PMC6468241
doi:
Substances chimiques
Cell-Free Nucleic Acids
0
Circulating Tumor DNA
0
Types de publication
Journal Article
Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
e38Subventions
Organisme : NIGMS NIH HHS
ID : R01 GM108348
Pays : United States
Informations de copyright
© The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.
Références
Nat Methods. 2011 Nov 20;9(1):72-4
pubmed: 22101854
Bioinformatics. 2014 Apr 15;30(8):1064-1072
pubmed: 24389656
Genome Res. 2009 Jul;19(7):1270-8
pubmed: 19447966
Genome Res. 2011 Jan;21(1):56-67
pubmed: 21036922
Clin Cancer Res. 2016 Jun 15;22(12):3067-77
pubmed: 26847055
Br J Haematol. 1994 Apr;86(4):774-9
pubmed: 7918071
BJU Int. 2008 Jan;101(1):113-9
pubmed: 17941920
Oncotarget. 2016 Oct 25;7(43):71013-71035
pubmed: 27589834
Bioinformatics. 2009 Nov 1;25(21):2865-71
pubmed: 19561018
Nat Rev Cancer. 2017 Apr;17(4):223-238
pubmed: 28233803
Curr Protoc Bioinformatics. 2014;45:15.6.1-11
pubmed: 25152801
Mol Biol Cell. 2003 May;14(5):1780-9
pubmed: 12802054
Bioinformatics. 2012 Sep 15;28(18):i333-i339
pubmed: 22962449
Nucleic Acids Res. 2001 Jan 1;29(1):308-11
pubmed: 11125122
Bioinformatics. 2010 Mar 1;26(5):589-95
pubmed: 20080505
Bioinformatics. 2017 Jul 15;33(14):i161-i169
pubmed: 28881988
Transl Oncol. 2013 Jun 01;6(3):319-28
pubmed: 23730412
Genome Biol. 2014 Jun 26;15(6):R84
pubmed: 24970577
Bioinformatics. 2013 Apr 15;29(8):1076-7
pubmed: 23413434
JAMA Oncol. 2016 Dec 1;2(12):1598-1606
pubmed: 27148695
Cancer Res. 2006 Oct 15;66(20):9829-36
pubmed: 17047043
Genome Res. 2017 Dec;27(12):2050-2060
pubmed: 29097403
Cancer Epidemiol Biomarkers Prev. 1994 Jan-Feb;3(1):67-71
pubmed: 8118388
Oncology. 1989;46(5):318-22
pubmed: 2779946
Cell. 2016 Jan 14;164(1-2):57-68
pubmed: 26771485
C R Seances Soc Biol Fil. 1948 Feb;142(3-4):241-3
pubmed: 18875018
Theranostics. 2017 Apr 2;7(6):1437-1446
pubmed: 28529628
Bioinformatics. 2019 Jun 1;35(11):1829-1836
pubmed: 30351359
Nature. 2007 Aug 2;448(7153):595-9
pubmed: 17671502
Genome Res. 2017 Mar;27(3):491-499
pubmed: 28100584
Nature. 2010 Mar 11;464(7286):302-5
pubmed: 20220849
Genes Dev. 2013 Dec 1;27(23):2513-30
pubmed: 24298051
Nucleic Acids Res. 2014 Jan;42(Database issue):D986-92
pubmed: 24174537
J Mol Endocrinol. 2003 Aug;31(1):169-83
pubmed: 12914534
Curr Protoc Hum Genet. 2016 Oct 11;91:10.11.1-10.11.37
pubmed: 27727438
Ann Oncol. 2017 Sep 1;28(9):2149-2159
pubmed: 28911069
Nucleic Acids Res. 2016 Jun 20;44(11):e108
pubmed: 27060149
Nat Med. 2014 Apr;20(4):430-5
pubmed: 24658074
Cancer Metastasis Rev. 2016 Sep;35(3):347-76
pubmed: 27392603
BMC Genomics. 2012 Feb 15;13:74
pubmed: 22336055