Accurate and Effective Detection of Recurrent Copy Number Variants in Large SNP Genotype Datasets.
CNVs
SNPs
bioinformatics pipeline
structural variation
Journal
Current protocols
ISSN: 2691-1299
Titre abrégé: Curr Protoc
Pays: United States
ID NLM: 101773894
Informations de publication
Date de publication:
Dec 2022
Dec 2022
Historique:
entrez:
5
12
2022
pubmed:
6
12
2022
medline:
25
2
2023
Statut:
ppublish
Résumé
Structural variations, including recurrent Copy Number Variants (CNVs) at specific genomic loci, have been found to be associated with increased risk of several diseases and syndromes. CNV carrier status can be determined in large collections of samples using SNP arrays and, more recently, sequencing data. Although there is some consensus among researchers about the essential steps required in such analysis (i.e., CNV calling, filtering of putative carriers, and visual validation using intensity data plots of the genomic region), standard methodologies and processes to control the quality and consistency of the results are lacking. Here, we present a comprehensive and user-friendly protocol that we have refined from our extensive research experience in the field. We cover every aspect of the analysis, from input data curation to final results. For each step, we highlight which parameters affect the analysis the most and how different settings may lead to different results. We provide a pipeline to run the complete analysis with effective (but customizable) pre-sets. We present software that we developed to better handle and filter putative CNV carriers and perform visual inspection to validate selected candidates. Finally, we describe methods to evaluate the critical sections and actions to counterbalance potential problems. The current implementation is focused on Illumina SNP array data. All the presented software is freely available and provided in a ready-to-use docker container. © 2022 The Authors. Current Protocols published by Wiley Periodicals LLC. Basic Protocol 1: From raw intensity data files to CNV calls Basic Protocol 2: From CNV calls to validated CNV carriers. Basic Protocol 3: Quality control and quality assessment Basic Protocol 4: Install the necessary software.
Identifiants
pubmed: 36469582
doi: 10.1002/cpz1.621
pmc: PMC9729012
mid: NIHMS1848134
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
e621Subventions
Organisme : Medical Research Council
ID : MR/V004905/1
Pays : United Kingdom
Organisme : Medical Research Council
ID : MR/W014416/1
Pays : United Kingdom
Organisme : NIMH NIH HHS
ID : R01 MH124789
Pays : United States
Informations de copyright
© 2022 The Authors. Current Protocols published by Wiley Periodicals LLC.
Références
Gigascience. 2021 Feb 16;10(2):
pubmed: 33594436
Nucleic Acids Res. 2007;35(6):2013-25
pubmed: 17341461
Nature. 2015 Oct 1;526(7571):75-81
pubmed: 26432246
Cell. 2012 Mar 16;148(6):1223-41
pubmed: 22424231
Nature. 2014 Jan 16;505(7483):361-6
pubmed: 24352232
Gigascience. 2021 Feb 16;10(2):
pubmed: 33590861
Bioinformatics. 2014 Jul 15;30(14):2076-8
pubmed: 24681907
Trends Genet. 2002 Feb;18(2):74-82
pubmed: 11818139
JAMA Psychiatry. 2022 Jan 1;79(1):59-69
pubmed: 34817560
J Med Genet. 2019 Mar;56(3):131-138
pubmed: 30343275
Nature. 2008 Sep 11;455(7210):232-6
pubmed: 18668039
Nat Rev Genet. 2013 Feb;14(2):125-38
pubmed: 23329113
Am J Hum Genet. 2005 Jul;77(1):78-88
pubmed: 15918152
Genome Res. 2007 Nov;17(11):1665-74
pubmed: 17921354
Am J Med Genet. 1992 Sep 15;44(2):261-8
pubmed: 1360769
Nat Genet. 2008 Jan;40(1):90-5
pubmed: 18059269