A robust pipeline with high replication rate for detection of somatic variants in the adaptive immune system as a source of common genetic variation in autoimmune disease.
Adaptive Immunity
/ genetics
Adult
Algorithms
Alleles
Autoimmune Diseases
/ genetics
Computational Biology
/ methods
DNA Mutational Analysis
Female
Genetic Variation
/ genetics
High-Throughput Nucleotide Sequencing
/ methods
Humans
Male
Middle Aged
Mutation
Polymorphism, Single Nucleotide
/ genetics
Reproducibility of Results
Sequence Analysis, DNA
/ methods
Software
Journal
Human molecular genetics
ISSN: 1460-2083
Titre abrégé: Hum Mol Genet
Pays: England
ID NLM: 9208958
Informations de publication
Date de publication:
15 04 2019
15 04 2019
Historique:
received:
09
11
2018
revised:
09
11
2018
accepted:
05
12
2018
pubmed:
13
12
2018
medline:
2
7
2019
entrez:
13
12
2018
Statut:
ppublish
Résumé
The role of somatic variants in diseases beyond cancer is increasingly being recognized, with potential roles in autoinflammatory and autoimmune diseases. However, as mutation rates and allele fractions are lower, studies in these diseases are substantially less tolerant of false positives, and bio-informatics algorithms require high replication rates. We developed a pipeline combining two variant callers, MuTect2 and VarScan2, with technical filtering and prioritization. Our pipeline detects somatic variants with allele fractions as low as 0.5% and achieves a replication rate of >55%. Validation in an independent data set demonstrates excellent performance (sensitivity > 57%, specificity > 98%, replication rate > 80%). We applied this pipeline to the autoimmune disease multiple sclerosis (MS) as a proof-of-principle. We demonstrate that 60% of MS patients carry 2-10 exonic somatic variants in their peripheral blood T and B cells, with the vast majority (80%) occurring in T cells and variants persisting over time. Synonymous variants significantly co-occur with non-synonymous variants. Systematic characterization indicates somatic variants are enriched for being novel or very rare in public databases of germline variants and trend towards being more damaging and conserved, as reflected by higher phred-scaled combined annotation-dependent depletion (CADD) and genomic evolutionary rate profiling (GERP) scores. Our pipeline and proof-of-principle now warrant further investigation of common somatic genetic variation on top of inherited genetic variation in the context of autoimmune disease, where it may offer subtle survival advantages to immune cells and contribute to the capacity of these cells to participate in the autoimmune reaction.
Identifiants
pubmed: 30541027
pii: 5240686
doi: 10.1093/hmg/ddy425
pmc: PMC6452186
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
1369-1380Informations de copyright
© The Author(s) 2018. Published by Oxford University Press.
Références
Sci Rep. 2017 Feb 24;7:43169
pubmed: 28233799
Bioinformatics. 2009 Jul 15;25(14):1754-60
pubmed: 19451168
Nucleic Acids Res. 2015 Jan;43(Database issue):D805-11
pubmed: 25355519
Ann Neurol. 2011 Feb;69(2):292-302
pubmed: 21387374
Nature. 2011 Aug 10;476(7359):214-9
pubmed: 21833088
Haematologica. 2015 Jan;100(1):91-9
pubmed: 25281507
J Clin Invest. 2011 Jan;121(1):106-12
pubmed: 21183795
Blood. 2003 Jun 15;101(12):4775-82
pubmed: 12595306
J Clin Invest. 2011 Jan;121(1):16-9
pubmed: 21183786
Nat Commun. 2017 Jun 21;8:15869
pubmed: 28635960
Mult Scler. 2014 Oct;20(12):1564-8
pubmed: 24740369
Proc Natl Acad Sci U S A. 2015 Nov 3;112(44):13615-20
pubmed: 26483451
N Engl J Med. 2014 Dec 25;371(26):2477-87
pubmed: 25426838
Immunology. 2011 Sep;134(1):17-32
pubmed: 21711350
Nat Rev Immunol. 2015 Sep 15;15(9):545-58
pubmed: 26250739
Science. 1990 Feb 9;247(4943):718-21
pubmed: 1689076
Nat Biotechnol. 2013 Mar;31(3):213-9
pubmed: 23396013
Exp Gerontol. 2016 May;77:87-95
pubmed: 26883338
Alzheimers Dement. 2015 Nov;11(11):1265-76
pubmed: 25937274
Nat Genet. 2013 Nov;45(11):1353-60
pubmed: 24076602
Trends Genet. 2018 Jul;34(7):545-557
pubmed: 29731376
Sci Rep. 2016 Nov 22;6:36540
pubmed: 27874022
Genome Res. 2010 Sep;20(9):1297-303
pubmed: 20644199
Nature. 2017 Sep 21;549(7672):389-393
pubmed: 28854169
Blood. 2017 Aug 10;130(6):742-752
pubmed: 28483762
Arthritis Rheum. 2011 Nov;63(11):3625-32
pubmed: 21702021
J Immunol. 2006 Dec 1;177(11):7802-10
pubmed: 17114451
N Engl J Med. 2012 May 17;366(20):1905-13
pubmed: 22591296
Mult Scler. 2015 Nov;21(13):1670-80
pubmed: 25948629
Nat Commun. 2015 Dec 09;6:10001
pubmed: 26647970
Brain. 2011 Dec;134(Pt 12):3560-77
pubmed: 22058139
Nucleic Acids Res. 2010 Sep;38(16):e164
pubmed: 20601685
Neurol Neuroimmunol Neuroinflamm. 2016 May 10;3(4):e240
pubmed: 27231713
Cell. 2017 Jul 27;170(3):534-547.e23
pubmed: 28753428
Hum Mol Genet. 2004 Jun 15;13(12):1219-24
pubmed: 15115757
Bioinformatics. 2011 Nov 1;27(21):2987-93
pubmed: 21903627
Clin Immunol. 2017 Feb;175:75-81
pubmed: 27932211
Nat Med. 2014 Dec;20(12):1472-8
pubmed: 25326804
Sci Rep. 2017 Apr 6;7(1):663
pubmed: 28386103
Genome Res. 2012 Mar;22(3):568-76
pubmed: 22300766
J Mol Diagn. 2014 Jan;16(1):75-88
pubmed: 24211364
Neurology. 1994 Feb;44(2):311-5
pubmed: 8309581
Bioinformatics. 2009 Aug 15;25(16):2078-9
pubmed: 19505943
J Neuroimmunol. 2007 Feb;183(1-2):168-74
pubmed: 17084910
N Engl J Med. 2014 Dec 25;371(26):2488-98
pubmed: 25426837
Clin Exp Immunol. 1994 Mar;95(3):465-71
pubmed: 7511079
J Immunol. 2007 Jun 15;178(12):7710-9
pubmed: 17548608