SpliceVarDB: A comprehensive database of experimentally validated human splicing variants.

canonical splice site genomics pathogenic splice region splice site splicing splicing regulatory element transcriptomics variant interpretation whole-genome sequencing

Journal

American journal of human genetics
ISSN: 1537-6605
Titre abrégé: Am J Hum Genet
Pays: United States
ID NLM: 0370475

Informations de publication

Date de publication:
28 Aug 2024
Historique:
received: 09 10 2023
revised: 03 08 2024
accepted: 06 08 2024
medline: 4 9 2024
pubmed: 4 9 2024
entrez: 3 9 2024
Statut: aheadofprint

Résumé

Variants that alter gene splicing are estimated to comprise up to a third of all disease-causing variants, yet they are hard to predict from DNA sequencing data alone. To overcome this, many groups are incorporating RNA-based analyses, which are resource intensive, particularly for diagnostic laboratories. There are thousands of functionally validated variants that induce mis-splicing; however, this information is not consolidated, and they are under-represented in ClinVar, which presents a barrier to variant interpretation and can result in duplication of validation efforts. To address this issue, we developed SpliceVarDB, an online database consolidating over 50,000 variants assayed for their effects on splicing in over 8,000 human genes. We evaluated over 500 published data sources and established a spliceogenicity scale to standardize, harmonize, and consolidate variant validation data generated by a range of experimental protocols. According to the strength of their supporting evidence, variants were classified as "splice-altering" (∼25%), "not splice-altering" (∼25%), and "low-frequency splice-altering" (∼50%), which correspond to weak or indeterminate evidence of spliceogenicity. Importantly, 55% of the splice-altering variants in SpliceVarDB are outside the canonical splice sites (5.6% are deep intronic). These variants can support the variant curation diagnostic pathway and can be used to provide the high-quality data necessary to develop more accurate in silico splicing predictors. The variants are accessible through an online platform, SpliceVarDB, with additional features for visualization, variant information, in silico predictions, and validation metrics. SpliceVarDB is a very large collection of splice-altering variants and is available at https://splicevardb.org.

Identifiants

pubmed: 39226898
pii: S0002-9297(24)00288-X
doi: 10.1016/j.ajhg.2024.08.002
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Informations de copyright

Copyright © 2024 The Authors. Published by Elsevier Inc. All rights reserved.

Déclaration de conflit d'intérêts

Declaration of interests The authors declare that they have no competing interests.

Auteurs

Patricia J Sullivan (PJ)

Children's Cancer Institute, Lowy Cancer Research Centre, UNSW Sydney, Sydney, NSW, Australia; School of Clinical Medicine, UNSW Medicine & Health, UNSW Sydney, Sydney, NSW, Australia; UNSW Centre for Childhood Cancer Research, UNSW Sydney, Sydney, NSW, Australia.

Julian M W Quinn (JMW)

Children's Cancer Institute, Lowy Cancer Research Centre, UNSW Sydney, Sydney, NSW, Australia.

Weilin Wu (W)

Children's Cancer Institute, Lowy Cancer Research Centre, UNSW Sydney, Sydney, NSW, Australia.

Mark Pinese (M)

Children's Cancer Institute, Lowy Cancer Research Centre, UNSW Sydney, Sydney, NSW, Australia; School of Clinical Medicine, UNSW Medicine & Health, UNSW Sydney, Sydney, NSW, Australia.

Mark J Cowley (MJ)

Children's Cancer Institute, Lowy Cancer Research Centre, UNSW Sydney, Sydney, NSW, Australia. Electronic address: mcowley@ccia.org.au.

Classifications MeSH