Deep whole-genome sequencing of 3 cancer cell lines on 2 sequencing platforms.


Journal

Scientific reports
ISSN: 2045-2322
Titre abrégé: Sci Rep
Pays: England
ID NLM: 101563288

Informations de publication

Date de publication:
13 12 2019
Historique:
received: 04 06 2019
accepted: 30 11 2019
entrez: 15 12 2019
pubmed: 15 12 2019
medline: 15 12 2020
Statut: epublish

Résumé

To test the performance of a new sequencing platform, develop an updated somatic calling pipeline and establish a reference for future benchmarking experiments, we performed whole-genome sequencing of 3 common cancer cell lines (COLO-829, HCC-1143 and HCC-1187) along with their matched normal cell lines to great sequencing depths (up to 278x coverage) on both Illumina HiSeqX and NovaSeq sequencing instruments. Somatic calling was generally consistent between the two platforms despite minor differences at the read level. We designed and implemented a novel pipeline for the analysis of tumor-normal samples, using multiple variant callers. We show that coupled with a high-confidence filtering strategy, the use of combination of tools improves the accuracy of somatic variant calling. We also demonstrate the utility of the dataset by creating an artificial purity ladder to evaluate the somatic pipeline and benchmark methods for estimating purity and ploidy from tumor-normal pairs. The data and results of the pipeline are made accessible to the cancer genomics community.

Identifiants

pubmed: 31836783
doi: 10.1038/s41598-019-55636-3
pii: 10.1038/s41598-019-55636-3
pmc: PMC6911065
doi:

Types de publication

Comparative Study Journal Article Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

19123

Subventions

Organisme : NCI NIH HHS
ID : R21 CA220411
Pays : United States

Références

Cancer Genet Cytogenet. 1993 Sep;69(2):108-12
pubmed: 8402545
Arch Pathol Lab Med. 2015 Apr;139(4):508-17
pubmed: 25356985
Nucleic Acids Res. 2019 Jan 8;47(D1):D941-D947
pubmed: 30371878
Nature. 2012 Mar 28;483(7391):603-7
pubmed: 22460905
Genome Res. 2004 Feb;14(2):287-95
pubmed: 14762065
Nucleic Acids Res. 2017 Mar 17;45(5):e34
pubmed: 27903916
Bioinformatics. 2012 Mar 1;28(5):619-27
pubmed: 22238266
Nat Commun. 2015 Dec 09;6:10001
pubmed: 26647970
Bioinformatics. 2016 Apr 15;32(8):1220-2
pubmed: 26647377
Commun Biol. 2018 Mar 22;1:20
pubmed: 30271907
Nat Commun. 2017 Jan 24;8:14061
pubmed: 28117401
Nature. 2015 Oct 1;526(7571):68-74
pubmed: 26432245
Nat Genet. 2013 Oct;45(10):1113-20
pubmed: 24071849
Genome Res. 2018 Apr;28(4):581-591
pubmed: 29535149
Sci Rep. 2016 Apr 20;6:24607
pubmed: 27094764
Bioinformatics. 2010 Mar 15;26(6):841-2
pubmed: 20110278
Nat Commun. 2020 Sep 2;11(1):4301
pubmed: 32879317
Nucleic Acids Res. 2016 Jul 27;44(13):6274-86
pubmed: 27260798
Nat Biotechnol. 2013 Mar;31(3):213-9
pubmed: 23396013
Nucleic Acids Res. 2001 Jan 1;29(1):308-11
pubmed: 11125122
Int J Cancer. 1998 Dec 9;78(6):766-74
pubmed: 9833771
Genome Res. 2010 Sep;20(9):1297-303
pubmed: 20644199
PLoS One. 2013 Jun 10;8(6):e64991
pubmed: 23762276
Nature. 2010 Jan 14;463(7278):191-6
pubmed: 20016485
Genome Biol. 2016 Jun 06;17(1):122
pubmed: 27268795
Nature. 2016 Aug 17;536(7616):285-91
pubmed: 27535533
Bioinformatics. 2016 Oct 15;32(20):3196-3198
pubmed: 27354699
Genome Biol. 2014 Jun 26;15(6):R84
pubmed: 24970577
Nat Methods. 2018 Aug;15(8):591-594
pubmed: 30013048
Bioinformatics. 2013 Nov 1;29(21):2678-82
pubmed: 24045776
Nature. 2016 Oct 20;538(7625):378-382
pubmed: 27732578
Nucleic Acids Res. 2014 Jan;42(Database issue):D986-92
pubmed: 24174537
J Mol Diagn. 2015 May;17(3):251-64
pubmed: 25801821
Cell Syst. 2015 Sep 23;1(3):210-223
pubmed: 26645048

Auteurs

Kanika Arora (K)

New York Genome Center, New York, NY, 10013, USA.

Minita Shah (M)

New York Genome Center, New York, NY, 10013, USA.

Molly Johnson (M)

New York Genome Center, New York, NY, 10013, USA.

Rashesh Sanghvi (R)

New York Genome Center, New York, NY, 10013, USA.

Jennifer Shelton (J)

New York Genome Center, New York, NY, 10013, USA.

Kshithija Nagulapalli (K)

New York Genome Center, New York, NY, 10013, USA.

Dayna M Oschwald (DM)

New York Genome Center, New York, NY, 10013, USA.

Michael C Zody (MC)

New York Genome Center, New York, NY, 10013, USA.

Soren Germer (S)

New York Genome Center, New York, NY, 10013, USA.

Vaidehi Jobanputra (V)

New York Genome Center, New York, NY, 10013, USA.

Jade Carter (J)

New York Genome Center, New York, NY, 10013, USA.

Nicolas Robine (N)

New York Genome Center, New York, NY, 10013, USA. nrobine@nygenome.org.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C

Classifications MeSH