Introme accurately predicts the impact of coding and noncoding variants on gene splicing, with clinical applications.

Clinical genetics Deep intronic Genomics Intronic variant Splice region Splice site Splicing Splicing regulatory element Variant interpretation

Journal

Genome biology
ISSN: 1474-760X
Titre abrégé: Genome Biol
Pays: England
ID NLM: 100960660

Informations de publication

Date de publication:
17 05 2023
Historique:
received: 30 03 2022
accepted: 10 04 2023
medline: 19 5 2023
pubmed: 18 5 2023
entrez: 17 5 2023
Statut: epublish

Résumé

Predicting the impact of coding and noncoding variants on splicing is challenging, particularly in non-canonical splice sites, leading to missed diagnoses in patients. Existing splice prediction tools are complementary but knowing which to use for each splicing context remains difficult. Here, we describe Introme, which uses machine learning to integrate predictions from several splice detection tools, additional splicing rules, and gene architecture features to comprehensively evaluate the likelihood of a variant impacting splicing. Through extensive benchmarking across 21,000 splice-altering variants, Introme outperformed all tools (auPRC: 0.98) for the detection of clinically significant splice variants. Introme is available at https://github.com/CCICB/introme .

Identifiants

pubmed: 37198692
doi: 10.1186/s13059-023-02936-7
pii: 10.1186/s13059-023-02936-7
pmc: PMC10190034
doi:

Substances chimiques

RNA Splice Sites 0

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

118

Informations de copyright

© 2023. The Author(s).

Références

Clin Immunol. 2017 Jul;180:33-44
pubmed: 28359783
Nucleic Acids Res. 2020 Jul 27;48(13):7066-7078
pubmed: 32484558
Nature. 2020 May;581(7809):434-443
pubmed: 32461654
Wiley Interdiscip Rev RNA. 2013 Jan-Feb;4(1):61-76
pubmed: 23074130
PLoS Comput Biol. 2018 Aug 17;14(8):e1006360
pubmed: 30118475
Mol Cell. 2019 Jan 3;73(1):183-194.e8
pubmed: 30503770
Bioinformatics. 2019 Nov 1;35(21):4405-4407
pubmed: 30993321
Science. 2015 Jan 9;347(6218):1254806
pubmed: 25525159
Med J Aust. 2018 Aug 3;209(5):197-199
pubmed: 29621958
Hum Mol Genet. 2023 Mar 20;32(7):1127-1136
pubmed: 36322148
Am J Hum Genet. 2019 Sep 5;105(3):573-587
pubmed: 31447096
Eur J Hum Genet. 2021 May;29(5):760-770
pubmed: 33437033
Genome Biol. 2019 Mar 1;20(1):48
pubmed: 30823901
J Comput Biol. 2004;11(2-3):377-94
pubmed: 15285897
Genome Res. 2018 Aug;28(8):1111-1125
pubmed: 30012835
EMBO Rep. 2009 Aug;10(8):810-6
pubmed: 19648957
Adv Bioinformatics. 2016;2016:5614058
pubmed: 27313609
Genome Biol. 2023 May 17;24(1):118
pubmed: 37198692
Nucleic Acids Res. 2014 Dec 16;42(22):13534-44
pubmed: 25416802
Am J Hum Genet. 2017 May 4;100(5):751-765
pubmed: 28475858
Biomed Rep. 2015 Mar;3(2):152-158
pubmed: 25798239
Eur J Hum Genet. 2019 Feb;27(2):308-316
pubmed: 30353151
Am J Med Genet A. 2022 Jul;188(7):2226-2230
pubmed: 35393742
Proc Natl Acad Sci U S A. 2011 Jul 5;108(27):11093-8
pubmed: 21685335
Nucleic Acids Res. 2016 Feb 29;44(4):1483-95
pubmed: 26773057
Genet Med. 2022 Jan;24(1):130-145
pubmed: 34906502
Cell. 2019 Jan 24;176(3):535-548.e24
pubmed: 30661751
Nucleic Acids Res. 2003 Jul 1;31(13):3568-71
pubmed: 12824367
Wiley Interdiscip Rev RNA. 2018 Jan;9(1):
pubmed: 28949076
Nat Genet. 2016 Jan;48(1):4-6
pubmed: 26711108
Nat Commun. 2022 Mar 29;13(1):1655
pubmed: 35351883
Nucleic Acids Res. 2018 Sep 6;46(15):7913-7923
pubmed: 29750258
BMC Biol. 2016 Jul 05;14:54
pubmed: 27380775
Nucleic Acids Res. 2019 Jan 8;47(D1):D886-D894
pubmed: 30371827
Contemp Oncol (Pozn). 2015;19(1A):A68-77
pubmed: 25691825
Nat Med. 2020 Nov;26(11):1742-1753
pubmed: 33020650
Database (Oxford). 2020 Dec 1;2020:
pubmed: 33258967
J Pathol. 2019 Aug;248(4):409-420
pubmed: 30883759
Kidney Int. 2022 Nov;102(5):1167-1177
pubmed: 35870639
Nature. 2018 Oct;562(7726):217-222
pubmed: 30209399
Bioinformatics. 2018 Mar 15;34(6):920-927
pubmed: 29092009
Hum Mutat. 2009 Feb;30(2):221-7
pubmed: 18853456
Genome Biol. 2006;7(1):R1
pubmed: 16507133
Nucleic Acids Res. 2009 May;37(9):e67
pubmed: 19339519
Bioinformatics. 2017 Sep 15;33(18):2938-2940
pubmed: 28645171
Genet Med. 2019 Aug;21(8):1761-1771
pubmed: 30670881
Genome Med. 2021 Feb 22;13(1):31
pubmed: 33618777
Genome Biol. 2016 Jun 01;17(1):118
pubmed: 27250555

Auteurs

Patricia J Sullivan (PJ)

Children's Cancer Institute, Lowy Cancer Research Centre, UNSW Sydney, Sydney, NSW, Australia.
School of Clinical Medicine, UNSW Medicine & Health, UNSW Sydney, Sydney, NSW, Australia.
University of New South Wales Centre for Childhood Cancer Research, UNSW Sydney, Sydney, NSW, Australia.

Velimir Gayevskiy (V)

Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Sydney, Australia.

Ryan L Davis (RL)

Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Sydney, Australia.
Department of Neurogenetics, Kolling Institute, St. Leonards, NSW, Australia.
Sydney Medical School-Northern, Faculty of Medicine and Health, University of Sydney, Sydney, NSW, Australia.

Marie Wong (M)

Children's Cancer Institute, Lowy Cancer Research Centre, UNSW Sydney, Sydney, NSW, Australia.
School of Clinical Medicine, UNSW Medicine & Health, UNSW Sydney, Sydney, NSW, Australia.

Chelsea Mayoh (C)

Children's Cancer Institute, Lowy Cancer Research Centre, UNSW Sydney, Sydney, NSW, Australia.
School of Clinical Medicine, UNSW Medicine & Health, UNSW Sydney, Sydney, NSW, Australia.

Amali Mallawaarachchi (A)

Division of Genomics and Epigenetics, Garvan Institute of Medical Research, Sydney, Australia.
Clinical Genetics Unit, Institute of Precision Medicine and Bioinformatics, Sydney Local Health District, Sydney, Australia.

Yvonne Hort (Y)

Division of Genomics and Epigenetics, Garvan Institute of Medical Research, Sydney, Australia.

Mark J McCabe (MJ)

Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Sydney, Australia.

Sarah Beecroft (S)

Centre for Medical Research, University of Western Australia, Harry Perkins Institute of Medical Research, QEII Medical Centre, Nedlands, WA, Australia.

Matilda R Jackson (MR)

Department of Genetics and Molecular Pathology, Centre for Cancer Biology, An Alliance Between SA Pathology and the University of South Australia, Adelaide, Australia.
Australian Genomics, Parkville, VIC, Australia.

Peer Arts (P)

Department of Genetics and Molecular Pathology, Centre for Cancer Biology, An Alliance Between SA Pathology and the University of South Australia, Adelaide, Australia.

Andrew Dubowsky (A)

Department of Genetics and Molecular Pathology, SA Pathology, Adelaide, Australia.

Nigel Laing (N)

Centre for Medical Research, University of Western Australia, Harry Perkins Institute of Medical Research, QEII Medical Centre, Nedlands, WA, Australia.

Marcel E Dinger (ME)

Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Sydney, Australia.
School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, Australia.

Hamish S Scott (HS)

Department of Genetics and Molecular Pathology, Centre for Cancer Biology, An Alliance Between SA Pathology and the University of South Australia, Adelaide, Australia.
Australian Genomics, Parkville, VIC, Australia.
School of Medicine, University of Adelaide, Adelaide, SA, Australia.
ACRF Cancer Genomics Facility, Centre for Cancer Biology, An Alliance Between SA Pathology and the University of South Australia, Adelaide, SA, Australia.

Emily Oates (E)

School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, Australia.

Mark Pinese (M)

Children's Cancer Institute, Lowy Cancer Research Centre, UNSW Sydney, Sydney, NSW, Australia.
School of Clinical Medicine, UNSW Medicine & Health, UNSW Sydney, Sydney, NSW, Australia.

Mark J Cowley (MJ)

Children's Cancer Institute, Lowy Cancer Research Centre, UNSW Sydney, Sydney, NSW, Australia. MCowley@ccia.org.au.
School of Clinical Medicine, UNSW Medicine & Health, UNSW Sydney, Sydney, NSW, Australia. MCowley@ccia.org.au.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH