3Cnet: pathogenicity prediction of human variants using multitask learning with evolutionary constraints.
Journal
Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944
Informations de publication
Date de publication:
11 12 2021
11 12 2021
Historique:
received:
02
02
2021
revised:
05
07
2021
accepted:
15
07
2021
medline:
13
4
2023
pubmed:
17
7
2021
entrez:
16
7
2021
Statut:
ppublish
Résumé
Improvements in next-generation sequencing have enabled genome-based diagnosis for patients with genetic diseases. However, accurate interpretation of human variants requires knowledge from a number of clinical cases. In addition, manual analysis of each variant detected in a patient's genome requires enormous time and effort. To reduce the cost of diagnosis, various computational tools have been developed to predict the pathogenicity of human variants, but the shortage and bias of available clinical data can lead to overfitting of algorithms. We developed a pathogenicity predictor, 3Cnet, that uses recurrent neural networks to analyze the amino acid context of human variants. As 3Cnet is trained on simulated variants reflecting evolutionary conservation and clinical data, it can find disease-causing variants in patient genomes with 2.2 times greater sensitivity than currently available tools, more effectively discovering pathogenic variants and thereby improving diagnosis rates. Codes (https://github.com/KyoungYeulLee/3Cnet/) and data (https://zenodo.org/record/4716879#.YIO-xqkzZH1) are freely available to non-commercial users. Supplementary data are available at Bioinformatics online.
Identifiants
pubmed: 34270679
pii: 6322986
doi: 10.1093/bioinformatics/btab529
pmc: PMC8665754
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
4626-4634Informations de copyright
© The Author(s) 2021. Published by Oxford University Press.