Sources of erroneous sequences and artifact chimeric reads in next generation sequencing of genomic DNA from formalin-fixed paraffin-embedded samples.


Journal

Nucleic acids research
ISSN: 1362-4962
Titre abrégé: Nucleic Acids Res
Pays: England
ID NLM: 0411011

Informations de publication

Date de publication:
25 01 2019
Historique:
received: 12 03 2018
accepted: 06 11 2018
pubmed: 13 11 2018
medline: 21 8 2019
entrez: 13 11 2018
Statut: ppublish

Résumé

Tissues used in pathology laboratories are typically stored in the form of formalin-fixed, paraffin-embedded (FFPE) samples. One important consideration in repurposing FFPE material for next generation sequencing (NGS) analysis is the sequencing artifacts that can arise from the significant damage to nucleic acids due to treatment with formalin, storage at room temperature and extraction. One such class of artifacts consists of chimeric reads that appear to be derived from non-contiguous portions of the genome. Here, we show that a major proportion of such chimeric reads align to both the 'Watson' and 'Crick' strands of the reference genome. We refer to these as strand-split artifact reads (SSARs). This study provides a conceptual framework for the mechanistic basis of the genesis of SSARs and other chimeric artifacts along with supporting experimental evidence, which have led to approaches to reduce the levels of such artifacts. We demonstrate that one of these approaches, involving S1 nuclease-mediated removal of single-stranded fragments and overhangs, also reduces sequence bias, base error rates, and false positive detection of copy number and single nucleotide variants. Finally, we describe an analytical approach for quantifying SSARs from NGS data.

Identifiants

pubmed: 30418619
pii: 5173669
doi: 10.1093/nar/gky1142
pmc: PMC6344851
doi:

Substances chimiques

Fixatives 0
Formaldehyde 1HG84L3525

Types de publication

Journal Article Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov't

Langues

eng

Pagination

e12

Références

Genome Biol. 2010;11(8):R82
pubmed: 20696054
Nature. 2011 Jul 27;476(7360):298-303
pubmed: 21796119
Clin Chem. 2015 Jan;61(1):64-71
pubmed: 25421801
Bioinformatics. 2012 Oct 15;28(20):2678-9
pubmed: 22914218
Lab Invest. 2013 Jun;93(6):701-10
pubmed: 23568031
Genet Med. 2018 Oct;20(10):1196-1205
pubmed: 29388947
Genome Biol. 2010;11(12):R119
pubmed: 21143862
Bioinformatics. 2016 Oct 1;32(19):3047-8
pubmed: 27312411
Arch Pathol Lab Med. 2014 Nov;138(11):1520-30
pubmed: 25357115
PLoS One. 2017 Jun 1;12(6):e0178706
pubmed: 28570594
Bioinformatics. 2012 Jul 15;28(14):1811-7
pubmed: 22581179
Nat Biotechnol. 2011 Jan;29(1):24-6
pubmed: 21221095
Biotechniques. 2015 Jul 01;59(1):19-25
pubmed: 26156780
Nucleic Acids Res. 2009 Dec;37(22):e148
pubmed: 19815668

Auteurs

Simon Haile (S)

Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, British Columbia, Canada.

Richard D Corbett (RD)

Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, British Columbia, Canada.

Steve Bilobram (S)

Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, British Columbia, Canada.

Morgan H Bye (MH)

Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, British Columbia, Canada.

Heather Kirk (H)

Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, British Columbia, Canada.

Pawan Pandoh (P)

Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, British Columbia, Canada.

Eva Trinh (E)

Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, British Columbia, Canada.

Tina MacLeod (T)

Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, British Columbia, Canada.

Helen McDonald (H)

Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, British Columbia, Canada.

Miruna Bala (M)

Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, British Columbia, Canada.

Diane Miller (D)

Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, British Columbia, Canada.

Karen Novik (K)

Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, British Columbia, Canada.

Robin J Coope (RJ)

Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, British Columbia, Canada.

Richard A Moore (RA)

Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, British Columbia, Canada.

Yongjun Zhao (Y)

Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, British Columbia, Canada.

Andrew J Mungall (AJ)

Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, British Columbia, Canada.

Yussanne Ma (Y)

Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, British Columbia, Canada.

Rob A Holt (RA)

Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, British Columbia, Canada.

Steven J Jones (SJ)

Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, British Columbia, Canada.

Marco A Marra (MA)

Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, British Columbia, Canada.
Department of Medical Genetics, University of British Columbia, Vancouver, British Columbia, Canada.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing
Robotic Surgical Procedures Animals Humans Telemedicine Models, Animal

Odour generalisation and detection dog training.

Lyn Caldicott, Thomas W Pike, Helen E Zulch et al.
1.00
Animals Odorants Dogs Generalization, Psychological Smell
Animals TOR Serine-Threonine Kinases Colorectal Neoplasms Colitis Mice

Classifications MeSH