A validated heart-specific model for splice-disrupting variants in childhood heart disease.
Congenital Heart Disease
Genomics
Machine Learning
Non-canonical
RNA splicing
Journal
Genome medicine
ISSN: 1756-994X
Titre abrégé: Genome Med
Pays: England
ID NLM: 101475844
Informations de publication
Date de publication:
15 Oct 2024
15 Oct 2024
Historique:
received:
02
12
2023
accepted:
16
09
2024
medline:
15
10
2024
pubmed:
15
10
2024
entrez:
14
10
2024
Statut:
epublish
Résumé
Congenital heart disease (CHD) is the most common congenital anomaly. Almost 90% of isolated cases have an unexplained genetic etiology after clinical testing. Non-canonical splice variants that disrupt mRNA splicing through the loss or creation of exon boundaries are not routinely captured and/or evaluated by standard clinical genetic tests. Recent computational algorithms such as SpliceAI have shown an ability to predict such variants, but are not specific to cardiac-expressed genes and transcriptional isoforms. We used genome sequencing (GS) (n = 1101 CHD probands) and myocardial RNA-Sequencing (RNA-Seq) (n = 154 CHD and n = 43 cardiomyopathy probands) to identify and validate splice disrupting variants, and to develop a heart-specific model for canonical and non-canonical splice variants that can be applied to patients with CHD and cardiomyopathy. Two thousand five hundred seventy GS samples from the Medical Genome Reference Bank were analyzed as healthy controls. Of 8583 rare DNA splice-disrupting variants initially identified using SpliceAI, 100 were associated with altered splice junctions in the corresponding patient myocardium affecting 95 genes. Using strength of myocardial gene expression and genome-wide DNA variant features that were confirmed to affect splicing in myocardial RNA, we trained a machine learning model for predicting cardiac-specific splice-disrupting variants (AUC 0.86 on internal validation). In a validation set of 48 CHD probands, the cardiac-specific model outperformed a SpliceAI model alone (AUC 0.94 vs 0.67 respectively). Application of this model to an additional 947 CHD probands with only GS data identified 1% patients with canonical and 11% patients with non-canonical splice-disrupting variants in CHD genes. Forty-nine percent of predicted splice-disrupting variants were intronic and > 10 bp from existing splice junctions. The burden of high-confidence splice-disrupting variants in CHD genes was 1.28-fold higher in CHD cases compared with healthy controls. A new cardiac-specific in silico model was developed using complementary GS and RNA-Seq data that improved genetic yield by identifying a significant burden of non-canonical splice variants associated with CHD that would not be detectable through panel or exome sequencing.
Sections du résumé
BACKGROUND
BACKGROUND
Congenital heart disease (CHD) is the most common congenital anomaly. Almost 90% of isolated cases have an unexplained genetic etiology after clinical testing. Non-canonical splice variants that disrupt mRNA splicing through the loss or creation of exon boundaries are not routinely captured and/or evaluated by standard clinical genetic tests. Recent computational algorithms such as SpliceAI have shown an ability to predict such variants, but are not specific to cardiac-expressed genes and transcriptional isoforms.
METHODS
METHODS
We used genome sequencing (GS) (n = 1101 CHD probands) and myocardial RNA-Sequencing (RNA-Seq) (n = 154 CHD and n = 43 cardiomyopathy probands) to identify and validate splice disrupting variants, and to develop a heart-specific model for canonical and non-canonical splice variants that can be applied to patients with CHD and cardiomyopathy. Two thousand five hundred seventy GS samples from the Medical Genome Reference Bank were analyzed as healthy controls.
RESULTS
RESULTS
Of 8583 rare DNA splice-disrupting variants initially identified using SpliceAI, 100 were associated with altered splice junctions in the corresponding patient myocardium affecting 95 genes. Using strength of myocardial gene expression and genome-wide DNA variant features that were confirmed to affect splicing in myocardial RNA, we trained a machine learning model for predicting cardiac-specific splice-disrupting variants (AUC 0.86 on internal validation). In a validation set of 48 CHD probands, the cardiac-specific model outperformed a SpliceAI model alone (AUC 0.94 vs 0.67 respectively). Application of this model to an additional 947 CHD probands with only GS data identified 1% patients with canonical and 11% patients with non-canonical splice-disrupting variants in CHD genes. Forty-nine percent of predicted splice-disrupting variants were intronic and > 10 bp from existing splice junctions. The burden of high-confidence splice-disrupting variants in CHD genes was 1.28-fold higher in CHD cases compared with healthy controls.
CONCLUSIONS
CONCLUSIONS
A new cardiac-specific in silico model was developed using complementary GS and RNA-Seq data that improved genetic yield by identifying a significant burden of non-canonical splice variants associated with CHD that would not be detectable through panel or exome sequencing.
Identifiants
pubmed: 39402625
doi: 10.1186/s13073-024-01383-8
pii: 10.1186/s13073-024-01383-8
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
119Subventions
Organisme : CIHR
ID : ENP 161429
Pays : Canada
Organisme : CardioVasculair Onderzoek Nederland (CVON)
ID : 2014-18 CONCOR-genes
Organisme : CardioVasculair Onderzoek Nederland (CVON)
ID : 2014-18 CONCOR-genes
Organisme : National Heart Foundation of Australia
ID : NSW CVRN Career Advancement
Informations de copyright
© 2024. The Author(s).
Références
van der Linde D, Konings EEM, Slager MA, Witsenburg M, Helbing WA, Takkenberg JJM, et al. Birth prevalence of congenital heart disease worldwide: a systematic review and meta-analysis. J Am Coll Cardiol. 2011;58:2241–7.
pubmed: 22078432
doi: 10.1016/j.jacc.2011.08.025
Øyen N, Poulsen G, Boyd HA, Wohlfahrt J, Jensen PKA, Melbye M. Recurrence of congenital heart defects in families. Circulation. 2009;120:295–301.
pubmed: 19597048
doi: 10.1161/CIRCULATIONAHA.109.857987
Blue GM, Kirk EP, Giannoulatou E, Sholler GF, Dunwoodie SL, Harvey RP, et al. Advances in the Genetics of Congenital Heart Disease: A Clinician’s Guide. J Am Coll Cardiol. 2017;69:859–70.
pubmed: 28209227
doi: 10.1016/j.jacc.2016.11.060
Page DJ, Miossec MJ, Williams SG, Monaghan RM, Fotiou E, Cordell HJ, et al. Whole Exome Sequencing Reveals the Major Genetic Contributors to Nonsyndromic Tetralogy of Fallot. Circ Res. 2019;124:553–63.
pubmed: 30582441
pmcid: 6377791
doi: 10.1161/CIRCRESAHA.118.313250
Blue GM, Mekel M, Das D, Troup M, Rath E, Ip E, et al. Whole genome sequencing in transposition of the great arteries and associations with clinically relevant heart, brain and laterality genes. Am Heart J. 2022;244:1–13.
pubmed: 34670123
doi: 10.1016/j.ahj.2021.10.185
Lesurf R, Said A, Akinrinade O, Breckpot J, Delfosse K, Liu T, et al. Whole genome sequencing delineates regulatory, copy number, and cryptic splice variants in early onset cardiomyopathy. NPJ Genom Med. 2022;7:18.
pubmed: 35288587
pmcid: 8921194
doi: 10.1038/s41525-022-00288-y
Škorić-Milosavljević D, Tadros R, Bosada FM, Tessadori F, van Weerd JH, Woudstra OI, et al. Common Genetic Variants Contribute to Risk of Transposition of the Great Arteries. Circ Res. 2022;130:166–80.
pubmed: 34886679
doi: 10.1161/CIRCRESAHA.120.317107
Rowlands C, Thomas HB, Lord J, Wai HA, Arno G, Beaman G, et al. Comparison of in silico strategies to prioritize rare genomic variants impacting RNA splicing for the diagnosis of genomic disorders. Sci Rep. 2021;11:20607.
pubmed: 34663891
pmcid: 8523691
doi: 10.1038/s41598-021-99747-2
Blakes AJM, Wai HA, Davies I, Moledina HE, Ruiz A, Thomas T, et al. A systematic analysis of splicing variants identifies new diagnoses in the 100,000 Genomes Project. Genome Med. 2022;14:79.
pubmed: 35883178
pmcid: 9327385
doi: 10.1186/s13073-022-01087-x
O’Neill MJ, Wada Y, Hall LD, Mitchell DW, Glazer AM, Roden DM. Functional Assays Reclassify Suspected Splice-Altering Variants of Uncertain Significance in Mendelian Channelopathies. Circ Genom Precis Med. 2022;15:e003782.
pubmed: 36197721
pmcid: 9772980
doi: 10.1161/CIRCGEN.122.003782
Jang MY, Patel PN, Pereira AC, Willcox JAL, Haghighi A, Tai AC, et al. Contribution of Previously Unrecognized RNA Splice-Altering Variants to Congenital Heart Disease. Circ Genom Precis Med. 2023;16:224–31.
pubmed: 37165897
pmcid: 10404383
doi: 10.1161/CIRCGEN.122.003924
Singer ES, Crowe J, Holliday M, Isbister JC, Lal S, Nowak N, et al. The burden of splice-disrupting variants in inherited heart disease and unexplained sudden cardiac death. NPJ Genom Med. 2023;8:29.
pubmed: 37821546
pmcid: 10567745
doi: 10.1038/s41525-023-00373-w
Walker LC, de la Hoya M, Wiggins GAR, Lindy A, Vincent LM, Parsons MT, et al. Using the ACMG/AMP framework to capture evidence related to predicted and observed impact on splicing: Recommendations from the ClinGen SVI Splicing Subgroup. Am J Hum Genet. 2023;110:1046–67.
pubmed: 37352859
pmcid: 10357475
doi: 10.1016/j.ajhg.2023.06.002
Guo Y, Long J, He J, Li C-I, Cai Q, Shu X-O, et al. Exome sequencing generates high quality data in non-target regions. BMC Genomics. 2012;13:194.
pubmed: 22607156
pmcid: 3416685
doi: 10.1186/1471-2164-13-194
Zhang L, Shen M, Shu X, Zhou J, Ding J, Zhong C, et al. Intronic position +9 and -9 are potentially splicing sites boundary from intronic variants analysis of whole exome sequencing data. BMC Med Genomics. 2023;16:146.
pubmed: 37365551
pmcid: 10291791
doi: 10.1186/s12920-023-01542-7
Lesurf R, Jeroen B, Jade B, Nour H, Anjali J, Yijing L, et al. Genome sequencing identifies splice-disrupting variants in childhood heart disease [Internet]. European Genome-Phenome Archive (EGA); 2024. Available from: https://ega-archive.org/studies/EGAS50000000586
Pinese M, Lacaze P, Rath EM, Stone A, Brion M-J, Ameur A, et al. The Medical Genome Reference Bank contains whole genome and phenotype data of 2570 healthy elderly. Nat Commun. 2020;11:435.
pubmed: 31974348
pmcid: 6978518
doi: 10.1038/s41467-019-14079-0
Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–20.
pubmed: 24695404
pmcid: 4103590
doi: 10.1093/bioinformatics/btu170
Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–60.
pubmed: 19451168
pmcid: 2705234
doi: 10.1093/bioinformatics/btp324
DePristo MA, Banks E, Poplin R, Garimella KV, Maguire JR, Hartl C, et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet. 2011;43:491–8.
pubmed: 21478889
pmcid: 3083463
doi: 10.1038/ng.806
Trost B, Walker S, Wang Z, Thiruvahindrapuram B, MacDonald JR, Sung WWL, et al. A Comprehensive Workflow for Read Depth-Based Identification of Copy-Number Variation from Whole-Genome Sequence Data. Am J Hum Genet. 2018;102:142–55.
pubmed: 29304372
pmcid: 5777982
doi: 10.1016/j.ajhg.2017.12.007
Zhu M, Need AC, Han Y, Ge D, Maia JM, Zhu Q, et al. Using ERDS to infer copy-number variants in high-coverage genomes. Am J Hum Genet. 2012;91:408–21.
pubmed: 22939633
pmcid: 3511991
doi: 10.1016/j.ajhg.2012.07.004
Abyzov A, Urban AE, Snyder M, Gerstein M. CNVnator: an approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing. Genome Res. 2011;21:974–84.
pubmed: 21324876
pmcid: 3106330
doi: 10.1101/gr.114876.110
Chen X, Schulz-Trieglaff O, Shaw R, Barnes B, Schlesinger F, Källberg M, et al. Manta: rapid detection of structural variants and indels for germline and cancer sequencing applications. Bioinformatics. 2016;32:1220–2.
pubmed: 26647377
doi: 10.1093/bioinformatics/btv710
Rausch T, Zichner T, Schlattl A, Stütz AM, Benes V, Korbel JO. DELLY: structural variant discovery by integrated paired-end and split-read analysis. Bioinformatics. 2012;28:i333–9.
pubmed: 22962449
pmcid: 3436805
doi: 10.1093/bioinformatics/bts378
Pedersen BS, Bhetariya PJ, Brown J, Kravitz SN, Marth G, Jensen RL, et al. Somalier: rapid relatedness estimation for cancer and germline studies using efficient genome sketches. Genome Med. 2020;12:62.
pubmed: 32664994
pmcid: 7362544
doi: 10.1186/s13073-020-00761-2
DRAGEN DNA Pipeline [Internet]. Illumina; Available from: https://support-docs.illumina.com/SW/DRAGEN_v38/Content/SW/DRAGEN/GPipelineIntro_fDG.htm
Picard toolkit [Internet]. Broad Institute; 2019. Available from: http://broadinstitute.github.io/picard/
Richards S, Aziz N, Bale S, Bick D, Das S, Gastier-Foster J, et al. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet Med. 2015;17:405–24.
pubmed: 25741868
pmcid: 4544753
doi: 10.1038/gim.2015.30
Riggs ER, Andersen EF, Cherry AM, Kantarci S, Kearney H, Patel A, et al. Technical standards for the interpretation and reporting of constitutional copy-number variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics (ACMG) and the Clinical Genome Resource (ClinGen). Genet Med. 2020;22:245–57.
pubmed: 31690835
doi: 10.1038/s41436-019-0686-8
Li Q, Wang K. InterVar: Clinical Interpretation of Genetic Variants by the 2015 ACMG-AMP Guidelines. Am J Hum Genet. 2017;100:267–80.
pubmed: 28132688
pmcid: 5294755
doi: 10.1016/j.ajhg.2017.01.004
Stenson PD, Mort M, Ball EV, Evans K, Hayden M, Heywood S, et al. The Human Gene Mutation Database: towards a comprehensive repository of inherited mutation data for medical research, genetic diagnosis and next-generation sequencing studies. Hum Genet. 2017;136:665–77.
pubmed: 28349240
pmcid: 5429360
doi: 10.1007/s00439-017-1779-6
Griffin EL, Nees SN, Morton SU, Wynn J, Patel N, Jobanputra V, et al. Evidence-Based Assessment of Congenital Heart Disease Genes to Enable Returning Results in a Genomic Study. Circ Genom Precis Med. 2023;16:e003791.
pubmed: 36803080
pmcid: 10121846
doi: 10.1161/CIRCGEN.122.003791
Amberger JS, Bocchini CA, Schiettecatte F, Scott AF, Hamosh A. OMIM.org: Online Mendelian Inheritance in Man (OMIM®), an online catalog of human genes and genetic disorders. Nucleic Acids Res. 2015;43:D789–798.
Rehm HL, Berg JS, Brooks LD, Bustamante CD, Evans JP, Landrum MJ, et al. ClinGen–the Clinical Genome Resource. N Engl J Med. 2015;372:2235–42.
pubmed: 26014595
pmcid: 4474187
doi: 10.1056/NEJMsr1406261
Yang A, Alankarage D, Cuny H, Ip EKK, Almog M, Lu J, et al. CHDgene: A Curated Database for Congenital Heart Disease Genes. Circ Genom Precis Med. 2022;15:e003539.
pubmed: 35522174
doi: 10.1161/CIRCGEN.121.003539
Morales J, Pujar S, Loveland JE, Astashyn A, Bennett R, Berry A, et al. A joint NCBI and EMBL-EBI transcript set for clinical genomics and research. Nature. 2022;604:310–5.
pubmed: 35388217
pmcid: 9007741
doi: 10.1038/s41586-022-04558-8
Karczewski KJ, Francioli LC, Tiao G, Cummings BB, Alföldi J, Wang Q, et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature. 2020;581:434–43.
pubmed: 32461654
pmcid: 7334197
doi: 10.1038/s41586-020-2308-7
Smit A, Hubley R, Green P. RepeatMasker Open-4.0. 2013. Available from: http://www.repeatmasker.org
Robinson JT, Thorvaldsdóttir H, Winckler W, Guttman M, Lander ES, Getz G, et al. Integrative genomics viewer. Nat Biotechnol. 2011;29:24–6.
pubmed: 21221095
pmcid: 3346182
doi: 10.1038/nbt.1754
Robinson JT, Thorvaldsdóttir H, Wenger AM, Zehir A, Mesirov JP. Variant Review with the Integrative Genomics Viewer. Cancer Res. 2017;77:e31–4.
pubmed: 29092934
pmcid: 5678989
doi: 10.1158/0008-5472.CAN-17-0337
Koboldt DC. Best practices for variant calling in clinical sequencing. Genome Med. 2020;12:91.
pubmed: 33106175
pmcid: 7586657
doi: 10.1186/s13073-020-00791-w
Landrum MJ, Lee JM, Benson M, Brown G, Chao C, Chitipiralla S, et al. ClinVar: public archive of interpretations of clinically relevant variants. Nucleic Acids Res. 2016;44:D862–868.
pubmed: 26582918
doi: 10.1093/nar/gkv1222
Landrum MJ, Kattman BL. ClinVar at five years: Delivering on the promise. Hum Mutat. 2018;39:1623–30.
pubmed: 30311387
doi: 10.1002/humu.23641
Trost B, Thiruvahindrapuram B, Chan AJS, Engchuan W, Higginbotham EJ, Howe JL, et al. Genomic architecture of autism from comprehensive whole-genome sequence annotation. Cell. 2022;185:4409–4427.e18.
pubmed: 36368308
pmcid: 10726699
doi: 10.1016/j.cell.2022.10.009
Firth HV, Richards SM, Bevan AP, Clayton S, Corpas M, Rajan D, et al. DECIPHER: Database of Chromosomal Imbalance and Phenotype in Humans Using Ensembl Resources. Am J Hum Genet. 2009;84:524–33.
pubmed: 19344873
pmcid: 2667985
doi: 10.1016/j.ajhg.2009.03.010
MacDonald JR, Ziman R, Yuen RKC, Feuk L, Scherer SW. The Database of Genomic Variants: a curated collection of structural variation in the human genome. Nucleic Acids Res. 2014;42:D986–992.
pubmed: 24174537
doi: 10.1093/nar/gkt958
Collins RL, Brand H, Karczewski KJ, Zhao X, Alföldi J, Francioli LC, et al. A structural variation reference for medical and population genetics. Nature. 2020;581:444–51.
pubmed: 32461652
pmcid: 7334194
doi: 10.1038/s41586-020-2287-8
Belyeu JR, Chowdhury M, Brown J, Pedersen BS, Cormier MJ, Quinlan AR, et al. Samplot: a platform for structural variant visual validation and automated filtering. Genome Biol. 2021;22:161.
pubmed: 34034781
pmcid: 8145817
doi: 10.1186/s13059-021-02380-5
Xiang J, Peng J, Baxter S, Peng Z. AutoPVS1: An automatic classification tool for PVS1 interpretation of null variants. Hum Mutat. 2020;41:1488–98.
pubmed: 32442321
doi: 10.1002/humu.24051
Papaz T, Liston E, Zahavich L, Stavropoulos DJ, Jobling RK, Kim RH, et al. Return of genetic and genomic research findings: experience of a pediatric biorepository. BMC Med Genomics. 2019;12:173.
pubmed: 31775751
pmcid: 6882371
doi: 10.1186/s12920-019-0618-0
Jaganathan K, Kyriazopoulou Panagiotopoulou S, McRae JF, Darbandi SF, Knowles D, Li YI, et al. Predicting Splicing from Primary Sequence with Deep Learning. Cell. 2019;176:535–548.e24.
pubmed: 30661751
doi: 10.1016/j.cell.2018.12.015
Danecek P, Bonfield JK, Liddle J, Marshall J, Ohan V, Pollard MO, et al. Twelve years of SAMtools and BCFtools. Gigascience. 2021;10:giab008.
McLaren W, Gil L, Hunt SE, Riat HS, Ritchie GRS, Thormann A, et al. The Ensembl Variant Effect Predictor. Genome Biol. 2016;17:122.
pubmed: 27268795
pmcid: 4893825
doi: 10.1186/s13059-016-0974-4
Mercer TR, Clark MB, Andersen SB, Brunck ME, Haerty W, Crawford J, et al. Genome-wide discovery of human splicing branchpoints. Genome Res. 2015;25:290–303.
pubmed: 25561518
pmcid: 4315302
doi: 10.1101/gr.182899.114
Paggi JM, Bejerano G. A sequence-based, deep learning model accurately predicts RNA splicing branchpoints. RNA. 2018;24:1647–58.
pubmed: 30224349
pmcid: 6239175
doi: 10.1261/rna.066290.118
Rentzsch P, Schubach M, Shendure J, Kircher M. CADD-Splice-improving genome-wide variant effect prediction using deep learning-derived splice scores. Genome Med. 2021;13:31.
pubmed: 33618777
pmcid: 7901104
doi: 10.1186/s13073-021-00835-9
Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29:15–21.
pubmed: 23104886
doi: 10.1093/bioinformatics/bts635
Aken BL, Achuthan P, Akanni W, Amode MR, Bernsdorff F, Bhai J, et al. Ensembl 2017. Nucleic Acids Res. 2017;45:D635–42.
pubmed: 27899575
doi: 10.1093/nar/gkw1104
Li B, Dewey CN. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinformatics. 2011;12:323.
pubmed: 21816040
pmcid: 3163565
doi: 10.1186/1471-2105-12-323
Mertes C, Scheller IF, Yépez VA, Çelik MH, Liang Y, Kremer LS, et al. Detection of aberrant splicing events in RNA-seq data using FRASER. Nat Commun. 2021;12:529.
pubmed: 33483494
pmcid: 7822922
doi: 10.1038/s41467-020-20573-7
Durinck S, Spellman PT, Birney E, Huber W. Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt. Nat Protoc. 2009;4:1184–91.
pubmed: 19617889
pmcid: 3159387
doi: 10.1038/nprot.2009.97
Brechtmann F, Mertes C, Matusevičiūtė A, Yépez VA, Avsec Ž, Herzog M, et al. OUTRIDER: A Statistical Method for Detecting Aberrantly Expressed Genes in RNA Sequencing Data. Am J Hum Genet. 2018;103:907–17.
pubmed: 30503520
pmcid: 6288422
doi: 10.1016/j.ajhg.2018.10.025
Liaw A, Wiener M. Classification and Regression by randomForest. R News. 2002;2:18–22.
Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP. SMOTE: synthetic minority over-sampling technique. J Artif Int Res. 2002;16:321–57.
Millson A, Lewis T, Pesaran T, Salvador D, Gillespie K, Gau C-L, et al. Processed Pseudogene Confounding Deletion/Duplication Assays for SMAD4. J Mol Diagn. 2015;17:576–82.
pubmed: 26165824
doi: 10.1016/j.jmoldx.2015.05.005
Reimand J, Kull M, Peterson H, Hansen J, Vilo J. g:Profiler–a web-based toolset for functional profiling of gene lists from large-scale experiments. Nucleic Acids Res. 2007;35:W193–200.
pubmed: 17478515
pmcid: 1933153
doi: 10.1093/nar/gkm226
Raudvere U, Kolberg L, Kuzmin I, Arak T, Adler P, Peterson H, et al. g:Profiler: a web server for functional enrichment analysis and conversions of gene lists (2019 update). Nucleic Acids Res. 2019;47:W191–8.
pubmed: 31066453
pmcid: 6602461
doi: 10.1093/nar/gkz369
Gargano MA, Matentzoglu N, Coleman B, Addo-Lartey EB, Anagnostopoulos AV, Anderton J, et al. The Human Phenotype Ontology in 2024: phenotypes around the world. Nucleic Acids Res. 2024;52:D1333–46.
pubmed: 37953324
doi: 10.1093/nar/gkad1005
Kent WJ, Sugnet CW, Furey TS, Roskin KM, Pringle TH, Zahler AM, et al. The human genome browser at UCSC. Genome Res. 2002;12:996–1006.
pubmed: 12045153
pmcid: 186604
doi: 10.1101/gr.229102
Frankish A, Diekhans M, Ferreira A-M, Johnson R, Jungreis I, Loveland J, et al. GENCODE reference annotation for the human and mouse genomes. Nucleic Acids Res. 2019;47:D766–73.
pubmed: 30357393
doi: 10.1093/nar/gky955
Wickham H. ggplot2: Elegant Graphics for Data Analysis [Internet]. Springer-Verlag New York; 2016. Available from: https://ggplot2.tidyverse.org
Robin X, Turck N, Hainard A, Tiberti N, Lisacek F, Sanchez J-C, et al. pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics. 2011;12:77.
pubmed: 21414208
pmcid: 3068975
doi: 10.1186/1471-2105-12-77
Wu J, Yang Y, He Y, Li Q, Wang X, Sun C, et al. EFTUD2 gene deficiency disrupts osteoblast maturation and inhibits chondrocyte differentiation via activation of the p53 signaling pathway. Hum Genomics. 2019;13:63.
pubmed: 31806011
pmcid: 6894506
doi: 10.1186/s40246-019-0238-y
Aicher JK, Jewell P, Vaquero-Garcia J, Barash Y, Bhoj EJ. Mapping RNA splicing variations in clinically accessible and nonaccessible tissues to facilitate Mendelian disease diagnosis using RNA-seq. Genet Med. 2020;22:1181–90.
pubmed: 32225167
pmcid: 7335339
doi: 10.1038/s41436-020-0780-y
Lin J-H, Wu H, Zou W-B, Masson E, Fichou Y, Le Gac G, et al. Splicing Outcomes of 5’ Splice Site GT>GC Variants That Generate Wild-Type Transcripts Differ Significantly Between Full-Length and Minigene Splicing Assays. Front Genet. 2021;12:701652.
pubmed: 34422003
pmcid: 8375439
doi: 10.3389/fgene.2021.701652
Hsieh A, Morton SU, Willcox JAL, Gorham JM, Tai AC, Qi H, et al. EM-mosaic detects mosaic point mutations that contribute to congenital heart disease. Genome Med. 2020;12:42.
pubmed: 32349777
pmcid: 7189690
doi: 10.1186/s13073-020-00738-1
Pais LS, Snow H, Weisburd B, Zhang S, Baxter SM, DiTroia S, et al. seqr: A web-based analysis and collaboration tool for rare disease genomics. Hum Mutat. 2022;43:698–707.
pubmed: 35266241
pmcid: 9903206