Informative SNP Selection Based on a Fuzzy Clustering and Improved Binary Particle Swarm Optimization Algorithm.
Journal
Computational and mathematical methods in medicine
ISSN: 1748-6718
Titre abrégé: Comput Math Methods Med
Pays: United States
ID NLM: 101277751
Informations de publication
Date de publication:
2022
2022
Historique:
received:
17
03
2022
revised:
14
04
2022
accepted:
30
04
2022
entrez:
27
6
2022
pubmed:
28
6
2022
medline:
29
6
2022
Statut:
epublish
Résumé
Single-nucleotide polymorphism (SNP) involves the replacement of a single nucleotide in a deoxyribonucleic acid (DNA) sequence and is often linked to the development of specific diseases. Although current genotyping methods can tag SNP loci within biological samples to provide accurate genetic information for a disease associated, they have limited prediction accuracy. Furthermore, they are complex to perform and may result in the prediction of an excessive number of tag SNP loci, which may not always be associated with the disease. Therefore in this manuscript, we aimed to evaluate the impact of a newly optimized fuzzy clustering and binary particle swarm optimization algorithm (FCBPSO) on the accuracy and running time of informative SNP selection. Fuzzy clustering and FCBPSO were first applied to identify the equivalence relation and the candidate tag SNP set to reduce the redundancy between loci. The FCBPSO algorithm was then optimized and used to obtain the final tag SNP set. The prediction performance and running time of the newly developed model were compared with other traditional methods, including NMC, SPSO, and MCMR. The prediction accuracy of the FCBPSO algorithm was always higher than that of the other algorithms especially as the number of tag SNPs increased. However, when the number of tag SNPs was low, the prediction accuracy of FCBPSO was slightly lower than that of MCMR (add prediction accuracy values for each algorithm). However, the running time of the FCBPSO algorithm was always lower than that of MCMR. FCBPSO not only reduced the size and dimension of the optimization problem but also simplified the training of the prediction model. This improved the prediction accuracy of the model and reduced the running time when compared with other traditional methods.
Identifiants
pubmed: 35756402
doi: 10.1155/2022/3837579
pmc: PMC9225903
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
3837579Informations de copyright
Copyright © 2022 Zejun Li et al.
Déclaration de conflit d'intérêts
The authors declare that the research was conducted in the absence of any commercial or financial relationships leading to a potential conflict of interest.
Références
Curr Gene Ther. 2021;21(1):2
pubmed: 33586642
Bioinformatics. 2005 Jun;21 Suppl 1:i195-203
pubmed: 15961458
Science. 2002 Jun 21;296(5576):2225-9
pubmed: 12029063
Am J Hum Genet. 2004 Jan;74(1):106-20
pubmed: 14681826
Front Genet. 2019 Feb 14;10:94
pubmed: 30891058
Bioinformatics. 2017 Oct 15;33(20):3195-3201
pubmed: 28637337
Sci Adv. 2020 Nov 27;6(48):
pubmed: 33246949
IEEE/ACM Trans Comput Biol Bioinform. 2012 Sep-Oct;9(5):1529-34
pubmed: 22585142
Innovation (Camb). 2021 May 28;2(2):100116
pubmed: 33997827
Sci Rep. 2016 Mar 10;6:22811
pubmed: 26960563
Bioinformatics. 2017 Sep 15;33(18):2829-2836
pubmed: 28541468
Front Bioeng Biotechnol. 2020 May 19;8:394
pubmed: 32509741
Front Genet. 2018 Dec 20;9:657
pubmed: 30619477
IEEE/ACM Trans Comput Biol Bioinform. 2013 May-Jun;10(3):688-95
pubmed: 24091401
Sci Rep. 2015 Oct 19;5:15145
pubmed: 26477495
Front Genet. 2020 Sep 24;11:1025
pubmed: 33101366
Genome Res. 2004 Aug;14(8):1633-40
pubmed: 15289481
IEEE J Biomed Health Inform. 2021 Sep;25(9):3677-3684
pubmed: 34181562
Front Microbiol. 2018 Oct 23;9:2500
pubmed: 30405563
Mol Ther Nucleic Acids. 2020 Sep 4;21:676-686
pubmed: 32759058
Biochim Biophys Acta Mol Basis Dis. 2020 Nov 1;1866(11):165916
pubmed: 32771416
Bioinformatics. 2018 Jun 1;34(11):1953-1956
pubmed: 29365045
Curr Gene Ther. 2021;21(4):280-289
pubmed: 33045967
Front Cell Dev Biol. 2021 May 03;9:619330
pubmed: 34012960
Nucleic Acids Res. 2020 Jan 8;48(D1):D554-D560
pubmed: 31584099
Nat Genet. 2001 Oct;29(2):229-32
pubmed: 11586305
Front Bioeng Biotechnol. 2020 Aug 05;8:701
pubmed: 32850687
Front Bioeng Biotechnol. 2020 Jul 30;8:817
pubmed: 32850708
Brief Bioinform. 2021 Nov 5;22(6):
pubmed: 34378011
Nucleic Acids Res. 2022 Jan 7;50(D1):D867-D874
pubmed: 34634820
Bioinformatics. 2005 Apr 15;21(8):1735-6
pubmed: 15585525
IEEE Trans Cybern. 2020 Dec;50(12):4862-4875
pubmed: 31613789
Science. 2015 May 8;348(6235):648-60
pubmed: 25954001
mBio. 2013 Jul 02;4(4):
pubmed: 23820391
Curr Gene Ther. 2022;22(2):144-151
pubmed: 33998988
Genome Res. 2004 May;14(5):908-16
pubmed: 15078859