A secure SNP panel scheme using homomorphically encrypted K-mers without SNP calling on the user side.
Genomic privacy
Genomic security
Homomorphic encryption
K-mer
SNP panel
Journal
BMC genomics
ISSN: 1471-2164
Titre abrégé: BMC Genomics
Pays: England
ID NLM: 100965258
Informations de publication
Date de publication:
04 Apr 2019
04 Apr 2019
Historique:
entrez:
11
4
2019
pubmed:
11
4
2019
medline:
29
8
2019
Statut:
epublish
Résumé
Single Nucleotide Polymorphism (SNP) in the genome has become crucial information for clinical use. For example, the targeted cancer therapy is primarily based on the information which clinically important SNPs are detectable from the tumor. Many hospitals have developed their own panels that include clinically important SNPs. The genome information exchange between the patient and the hospital has become more popular. However, the genome sequence information is innate and irreversible and thus its leakage has serious consequences. Therefore, protecting one's genome information is critical. On the other side, hospitals may need to protect their own panels. There is no known secure SNP panel scheme to protect both. In this paper, we propose a secure SNP panel scheme using homomorphically encrypted K-mers without requiring SNP calling on the user side and without revealing the panel information to the user. Use of the powerful homomorphic encryption technique is desirable, but there is no known algorithm to efficiently align two homomorphically encrypted sequences. Thus, we designed and implemented a novel secure SNP panel scheme utilizing the computationally feasible equality test on two homomorphically encrypted K-mers. To make the scheme work correctly, in addition to SNPs in the panel, sequence variations at the population level should be addressed. We designed a concept of Point Deviation Tolerance (PDT) level to address the false positives and false negatives. Using the TCGA BRCA dataset, we demonstrated that our scheme works at the level of over a hundred thousand somatic mutations. In addition, we provide a computational guideline for the panel design, including the size of K-mer and the number of SNPs. The proposed method is the first of its kind to protect both the user's sequence and the hospital's panel information using the powerful homomorphic encryption scheme. We demonstrated that the scheme works with a simulated dataset and the TCGA BRCA dataset. In this study, we have shown only the feasibility of the proposed scheme and much more efforts should be done to make the scheme usable for clinical use.
Sections du résumé
BACKGROUND
BACKGROUND
Single Nucleotide Polymorphism (SNP) in the genome has become crucial information for clinical use. For example, the targeted cancer therapy is primarily based on the information which clinically important SNPs are detectable from the tumor. Many hospitals have developed their own panels that include clinically important SNPs. The genome information exchange between the patient and the hospital has become more popular. However, the genome sequence information is innate and irreversible and thus its leakage has serious consequences. Therefore, protecting one's genome information is critical. On the other side, hospitals may need to protect their own panels. There is no known secure SNP panel scheme to protect both.
RESULTS
RESULTS
In this paper, we propose a secure SNP panel scheme using homomorphically encrypted K-mers without requiring SNP calling on the user side and without revealing the panel information to the user. Use of the powerful homomorphic encryption technique is desirable, but there is no known algorithm to efficiently align two homomorphically encrypted sequences. Thus, we designed and implemented a novel secure SNP panel scheme utilizing the computationally feasible equality test on two homomorphically encrypted K-mers. To make the scheme work correctly, in addition to SNPs in the panel, sequence variations at the population level should be addressed. We designed a concept of Point Deviation Tolerance (PDT) level to address the false positives and false negatives. Using the TCGA BRCA dataset, we demonstrated that our scheme works at the level of over a hundred thousand somatic mutations. In addition, we provide a computational guideline for the panel design, including the size of K-mer and the number of SNPs.
CONCLUSIONS
CONCLUSIONS
The proposed method is the first of its kind to protect both the user's sequence and the hospital's panel information using the powerful homomorphic encryption scheme. We demonstrated that the scheme works with a simulated dataset and the TCGA BRCA dataset. In this study, we have shown only the feasibility of the proposed scheme and much more efforts should be done to make the scheme usable for clinical use.
Identifiants
pubmed: 30967116
doi: 10.1186/s12864-019-5473-z
pii: 10.1186/s12864-019-5473-z
pmc: PMC6456943
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
188Références
Nature. 2001 Feb 15;409(6822):860-921
pubmed: 11237011
PLoS Genet. 2008 Aug 29;4(8):e1000167
pubmed: 18769715
IEEE Trans Inf Technol Biomed. 2008 Sep;12(5):606-17
pubmed: 18779075
IEEE Trans Inf Technol Biomed. 2012 Jan;16(1):166-75
pubmed: 22010157
Nature. 2012 Oct 4;490(7418):61-70
pubmed: 23000897
Science. 2013 Jan 18;339(6117):321-4
pubmed: 23329047
Bioinformatics. 2013 Apr 1;29(7):886-93
pubmed: 23413435
Nature. 2013 Oct 17;502(7471):333-339
pubmed: 24132290
J Biomed Inform. 2014 Aug;50:133-41
pubmed: 24509073
Bioinformatics. 2014 Dec 1;30(23):3334-41
pubmed: 25147357
N Engl J Med. 2015 Jun 4;372(23):2243-57
pubmed: 26014596
Nature. 2015 Oct 1;526(7571):68-74
pubmed: 26432245
Bioinformatics. 2016 Jan 15;32(2):211-8
pubmed: 26446135
Am J Hum Genet. 2015 Nov 5;97(5):631-46
pubmed: 26522470
J Priv Confid. 2013;5(1):137-166
pubmed: 26525346
ACM Comput Surv. 2015 Sep;48(1):
pubmed: 26640318
KDD. 2013 Aug;2013:1079-1087
pubmed: 26691928
BMC Med Inform Decis Mak. 2015;15 Suppl 5:S1
pubmed: 26732892
BMC Med Inform Decis Mak. 2015;15 Suppl 5:S3
pubmed: 26733152
BMC Med Inform Decis Mak. 2015;15 Suppl 5:S5
pubmed: 26733391
Bioinformatics. 2016 May 1;32(9):1293-300
pubmed: 26769317
Cell Syst. 2016 Jul;3(1):54-61
pubmed: 27453444
BMC Med Genomics. 2016 Oct 13;9(1):63
pubmed: 27733153
IEEE J Biomed Health Inform. 2017 Sep;21(5):1466-1472
pubmed: 27834660
Hum Mol Genet. 2017 Feb 1;26(3):489-500
pubmed: 28053046
Nat Commun. 2017 May 09;8:15183
pubmed: 28485371
Science. 2017 Aug 18;357(6352):692-695
pubmed: 28818945
Proc Natl Acad Sci U S A. 2017 Sep 19;114(38):10166-10171
pubmed: 28874526
J Geogr Syst. 2017 Jul;19(3):197-220
pubmed: 29085255
Nucleic Acids Res. 2018 Jan 4;46(D1):D762-D769
pubmed: 29106570
IEEE/ACM Trans Comput Biol Bioinform. 2018 Sep-Oct;15(5):1413-1426
pubmed: 30004884