Characterising the loss-of-function impact of 5' untranslated region variants in 15,708 individuals.


Journal

Nature communications
ISSN: 2041-1723
Titre abrégé: Nat Commun
Pays: England
ID NLM: 101528555

Informations de publication

Date de publication:
27 05 2020
Historique:
received: 06 03 2019
accepted: 23 05 2019
entrez: 29 5 2020
pubmed: 29 5 2020
medline: 25 8 2020
Statut: epublish

Résumé

Upstream open reading frames (uORFs) are tissue-specific cis-regulators of protein translation. Isolated reports have shown that variants that create or disrupt uORFs can cause disease. Here, in a systematic genome-wide study using 15,708 whole genome sequences, we show that variants that create new upstream start codons, and variants disrupting stop sites of existing uORFs, are under strong negative selection. This selection signal is significantly stronger for variants arising upstream of genes intolerant to loss-of-function variants. Furthermore, variants creating uORFs that overlap the coding sequence show signals of selection equivalent to coding missense variants. Finally, we identify specific genes where modification of uORFs likely represents an important disease mechanism, and report a novel uORF frameshift variant upstream of NF2 in neurofibromatosis. Our results highlight uORF-perturbing variants as an under-recognised functional class that contribute to penetrant human disease, and demonstrate the power of large-scale population sequencing data in studying non-coding variant classes.

Identifiants

pubmed: 32461616
doi: 10.1038/s41467-019-10717-9
pii: 10.1038/s41467-019-10717-9
pmc: PMC7253449
doi:

Substances chimiques

5' Untranslated Regions 0
Proteins 0

Types de publication

Journal Article Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

2523

Subventions

Organisme : British Heart Foundation
ID : RG/18/10/33842
Pays : United Kingdom
Organisme : Medical Research Council
ID : MC_UP_1102/20
Pays : United Kingdom
Organisme : Wellcome Trust
ID : 107469/Z/15/Z
Pays : United Kingdom
Organisme : NIMH NIH HHS
ID : R01 MH085548
Pays : United States
Organisme : Department of Health
ID : IS-BRC-1215-20007
Pays : United Kingdom
Organisme : Medical Research Council
ID : MC_U120085815
Pays : United Kingdom
Organisme : NIDDK NIH HHS
ID : U54 DK105566
Pays : United States
Organisme : NIGMS NIH HHS
ID : R01 GM104371
Pays : United States
Organisme : Department of Health
ID : HICF-R6-373
Pays : United Kingdom
Organisme : British Heart Foundation
ID : CS/14/2/30841
Pays : United Kingdom
Organisme : NIDDK NIH HHS
ID : P30 DK020572
Pays : United States
Organisme : Wellcome Trust
Pays : United Kingdom

Investigateurs

Irina M Armean (IM)
Eric Banks (E)
Louis Bergelson (L)
Kristian Cibulskis (K)
Ryan L Collins (RL)
Kristen M Connolly (KM)
Miguel Covarrubias (M)
Beryl Cummings (B)
Mark J Daly (MJ)
Stacey Donnelly (S)
Yossi Farjoun (Y)
Steven Ferriera (S)
Stacey Gabriel (S)
Laura D Gauthier (LD)
Jeff Gentry (J)
Namrata Gupta (N)
Thibault Jeandet (T)
Diane Kaplan (D)
Kristen M Laricchia (KM)
Christopher Llanwarne (C)
Eric V Minikel (EV)
Ruchi Munshi (R)
Benjamin M Neale (BM)
Sam Novod (S)
Nikelle Petrillo (N)
Timothy Poterba (T)
David Roazen (D)
Valentin Ruano-Rubio (V)
Andrea Saltzman (A)
Kaitlin E Samocha (KE)
Molly Schleicher (M)
Cotton Seed (C)
Matthew Solomonson (M)
Jose Soto (J)
Grace Tiao (G)
Kathleen Tibbetts (K)
Charlotte Tolonen (C)
Christopher Vittal (C)
Gordon Wade (G)
Arcturus Wang (A)
Qingbo Wang (Q)
Nicholas A Watts (NA)
Ben Weisburd (B)
Carlos A Aguilar Salinas (CA)
Tariq Ahmad (T)
Christine M Albert (CM)
Diego Ardissino (D)
Gil Atzmon (G)
John Barnard (J)
Laurent Beaugerie (L)
Emelia J Benjamin (EJ)
Michael Boehnke (M)
Lori L Bonnycastle (LL)
Erwin P Bottinger (EP)
Donald W Bowden (DW)
Matthew J Bown (MJ)
John C Chambers (JC)
Juliana C Chan (JC)
Daniel Chasman (D)
Judy Cho (J)
Mina K Chung (MK)
Bruce Cohen (B)
Adolfo Correa (A)
Dana Dabelea (D)
Mark J Daly (MJ)
Dawood Darbar (D)
Ravindranath Duggirala (R)
Josée Dupuis (J)
Patrick T Ellinor (PT)
Roberto Elosua (R)
Jeanette Erdmann (J)
Tõnu Esko (T)
Martti Färkkilä (M)
Jose Florez (J)
Andre Franke (A)
Gad Getz (G)
Benjamin Glaser (B)
Stephen J Glatt (SJ)
David Goldstein (D)
Clicerio Gonzalez (C)
Leif Groop (L)
Christopher Haiman (C)
Craig Hanis (C)
Matthew Harms (M)
Mikko Hiltunen (M)
Matti M Holi (MM)
Christina M Hultman (CM)
Mikko Kallela (M)
Jaakko Kaprio (J)
Sekar Kathiresan (S)
Bong-Jo Kim (BJ)
Young Jin Kim (YJ)
George Kirov (G)
Jaspal Kooner (J)
Seppo Koskinen (S)
Harlan M Krumholz (HM)
Subra Kugathasan (S)
Soo Heon Kwak (SH)
Markku Laakso (M)
Terho Lehtimäki (T)
Ruth J F Loos (RJF)
Steven A Lubitz (SA)
Ronald C W Ma (RCW)
Jaume Marrugat (J)
Kari M Mattila (KM)
Steven McCarroll (S)
Mark I McCarthy (MI)
Dermot McGovern (D)
Ruth McPherson (R)
James B Meigs (JB)
Olle Melander (O)
Andres Metspalu (A)
Benjamin M Neale (BM)
Peter M Nilsson (PM)
Michael C O'Donovan (MC)
Dost Ongur (D)
Lorena Orozco (L)
Michael J Owen (MJ)
Colin N A Palmer (CNA)
Aarno Palotie (A)
Kyong Soo Park (KS)
Carlos Pato (C)
Ann E Pulver (AE)
Nazneen Rahman (N)
Anne M Remes (AM)
John D Rioux (JD)
Samuli Ripatti (S)
Dan M Roden (DM)
Danish Saleheen (D)
Veikko Salomaa (V)
Nilesh J Samani (NJ)
Jeremiah Scharf (J)
Heribert Schunkert (H)
Moore B Shoemaker (MB)
Pamela Sklar (P)
Hilkka Soininen (H)
Harry Sokol (H)
Tim Spector (T)
Patrick F Sullivan (PF)
Jaana Suvisaari (J)
E Shyong Tai (ES)
Yik Ying Teo (YY)
Tuomi Tiinamaija (T)
Ming Tsuang (M)
Dan Turner (D)
Teresa Tusie-Luna (T)
Erkki Vartiainen (E)
Hugh Watkins (H)
Rinse K Weersma (RK)
Maija Wessman (M)
James G Wilson (JG)
Ramnik J Xavier (RJ)
Marquis P Vawter (MP)

Commentaires et corrections

Type : CommentIn
Type : ErratumIn

Références

Calvo, S. E., Pagliarini, D. J. & Mootha, V. K. Upstream open reading frames cause widespread reduction of protein expression and are polymorphic among humans. Proc. Natl Acad. Sci. USA 106, 7507–7512 (2009).
doi: 10.1073/pnas.0810916106
Johnstone, T. G., Bazzini, A. A. & Giraldez, A. J. Upstream ORFs are prevalent translational repressors in vertebrates. EMBO J. 35, 706–723 (2016).
doi: 10.15252/embj.201592759
Iacono, M., Mignone, F. & Pesole, G. uAUG and uORFs in human and rodent 5’untranslated mRNAs. Gene 349, 97–105 (2005).
doi: 10.1016/j.gene.2004.11.041
Kozak, M. Pushing the limits of the scanning mechanism for initiation of translation. Gene 299, 1–34 (2002).
doi: 10.1016/S0378-1119(02)01056-9
Kozak, M. An analysis of 5’-noncoding sequences from 699 vertebrate messenger RNAs. Nucleic Acids Res. 15, 8125–8148 (1987).
doi: 10.1093/nar/15.20.8125
Noderer, W. L. et al. Quantitative analysis of mammalian translation initiation sites by FACS‐seq. Mol. Syst. Biol. 10, 748 (2014).
doi: 10.15252/msb.20145136
Hinnebusch, A. G., Ivanov, I. P. & Sonenberg, N. Translational control by 5’-untranslated regions of eukaryotic mRNAs. Science 352, 1413–1416 (2016).
doi: 10.1126/science.aad9868
Jousse, C. et al. Inhibition of CHOP translation by a peptide encoded by an open reading frame localized in the chop 5′ UTR. Nucleic Acids Res. 29, 4341–4351 (2001).
doi: 10.1093/nar/29.21.4341
Ji, Z., Song, R., Regev, A. & Struhl, K. Many lncRNAs, 5’UTRs, and pseudogenes are and some are likely to express functional proteins. Elife 4, e08890 (2015).
doi: 10.7554/eLife.08890
Couso, J.-P. & Patraquim, P. Classification and function of small open reading frames. Nat. Rev. Mol. Cell Biol. 18, 575–589 (2017).
doi: 10.1038/nrm.2017.58
Wethmar, K. The regulatory potential of upstream open reading frames in eukaryotic gene expression. Wiley Interdiscip. Rev. RNA 5, 765–778 (2014).
doi: 10.1002/wrna.1245
Wang, X. & Rothnagel, J. A. 5′‐Untranslated regions with multiple upstream AUG codons can support low‐level translation via leaky scanning and reinitiation. Nucleic Acids Res. 32, 1382–1391 (2004).
doi: 10.1093/nar/gkh305
Chugunova, A., Navalayeu, T., Dontsova, O. & Sergiev, P. Mining for small translated ORFs. J. Proteome Res. 17, 1–11 (2018).
doi: 10.1021/acs.jproteome.7b00707
Sample, P. J. et al. Human 5′ UTR design and variant effect prediction from a massively parallel translation assay. Preprint at https://www.biorxiv.org/content/10.1101/310375v1 (2018).
Schulz, J. et al. Loss-of-function uORF mutations in human malignancies. Sci. Rep. 8, 2395 (2018).
doi: 10.1038/s41598-018-19201-8
Liu, L. et al. Mutation of the CDKN2A 5’ UTR creates an aberrant initiation codon and predisposes to melanoma. Nat. Genet. 21, 128–132 (1999).
doi: 10.1038/5082
Wen, Y. et al. Loss-of-function mutations of an inhibitory upstream ORF in the human hairless transcript cause Marie Unna hereditary hypotrichosis. Nat. Genet. 41, 228–233 (2009).
doi: 10.1038/ng.276
Occhi, G. et al. A novel mutation in the upstream open reading frame of the CDKN1B gene causes a MEN4 phenotype. PLoS Genet. 9, e1003350 (2013).
doi: 10.1371/journal.pgen.1003350
Barbosa, C., Peixeiro, I. & Romão, L. Gene expression regulation by upstream open reading frames and human disease. PLoS Genet. 9, e1003529 (2013).
doi: 10.1371/journal.pgen.1003529
Matthes, T. et al. Severe hemochromatosis in a Portuguese family associated with a new mutation in the 5’-UTR of the HAMP gene. Blood 104, 2181–2183 (2004).
doi: 10.1182/blood-2004-01-0332
von Bohlen, A. E. et al. A mutation creating an upstream initiation codon in the SOX9 5’ UTR causes acampomelic campomelic dysplasia. Mol. Genet Genom. Med. 5, 261–268 (2017).
doi: 10.1002/mgg3.282
Karczewski, K. J. et al. Variation across 141,456 human exomes and genomes reveals the spectrum of loss-of-function intolerance across human protein-coding genes. Preprint at https://www.biorxiv.org/content/10.1101/531210v2 (2019).
Lek, M. et al. Analysis of protein-coding genetic variation in 60,706 humans. Nature 536, 285–291 (2016).
doi: 10.1038/nature19057
Olexiouk, V., Crappé, J. & Verbruggen, S. sORFs. org: a repository of small ORFs identified by ribosome profiling. Nucleic Acids 44, D324–D329 (2015).
Pollard, K. S., Hubisz, M. J., Rosenbloom, K. R. & Siepel, A. Detection of nonneutral substitution rates on mammalian phylogenies. Genome Res. 20, 110–121 (2010).
doi: 10.1101/gr.097857.109
Riggs, E. R. et al. Copy number variant discrepancy resolution using the ClinGen dosage sensitivity map results in updated clinical interpretations in ClinVar. Hum. Mutat. 39, 1650–1659 (2018).
doi: 10.1002/humu.23610
Stenson, P. D. et al. The Human Gene Mutation Database: providing a comprehensive central mutation database for molecular diagnostics and personalised genomics. Hum. Genom. 4, 69 (2009).
doi: 10.1186/1479-7364-4-2-69
Landrum, M. J. et al. ClinVar: public archive of relationships among sequence variation and human phenotype. Nucleic Acids Res. 42, D980–D985 (2014).
doi: 10.1093/nar/gkt1113
Evans, D. G. et al. Comprehensive RNA analysis of the NF1 gene in classically affected NF1 affected individuals meeting NIH criteria has high sensitivity and mutation negative testing is reassuring in isolated cases with pigmentary features only. EBioMedicine 7, 212–220 (2016).
doi: 10.1016/j.ebiom.2016.04.005
de Lima, R. L. L. F. et al. Prevalence and nonrandom distribution of exonic mutations in interferon regulatory factor 6 in 307 families with Van der Woude syndrome and 37 families with popliteal pterygium syndrome. Genet. Med. 11, 241–247 (2009).
doi: 10.1097/GIM.0b013e318197a49a
Kondo, S. et al. Mutations in IRF6 cause Van der Woude and popliteal pterygium syndromes. Nat. Genet. 32, 285–289 (2002).
doi: 10.1038/ng985
McLaren, W. et al. The ensembl variant effect predictor. Genome Biol. 17, 122 (2016).
doi: 10.1186/s13059-016-0974-4
Short, P. J. et al. De novo mutations in regulatory elements in neurodevelopmental disorders. Nature 555, 611–616 (2018).
doi: 10.1038/nature25983
Zhang, S. et al. Base-specific mutational intolerance near splice sites clarifies the role of nonessential splice nucleotides. Genome Res. 28, 968–974 (2018).
doi: 10.1101/gr.231902.117
Lord, J. et al. Pathogenicity and selective constraint on variation near splice sites. Genome Res. 29, 159–170 (2019).
doi: 10.1101/gr.238444.118
An, J.-Y. et al. Genome-wide de novo risk score implicates promoter variation in autism spectrum disorder. Science 362, eaat6576 (2018).

Auteurs

Nicola Whiffin (N)

National Heart and Lung Institute and MRC London Institute of Medical Sciences, Imperial College London, Du Cane Road, London, W12 0NN, UK. n.whiffin@imperial.ac.uk.
NIHR Royal Brompton Cardiovascular Research Centre, Royal Brompton and Harefield National Health Service Foundation Trust, Sydney Street, London, SW3 6NP, UK. n.whiffin@imperial.ac.uk.
Medical and Population Genetics, Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, MA, 02142, USA. n.whiffin@imperial.ac.uk.

Konrad J Karczewski (KJ)

Medical and Population Genetics, Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, MA, 02142, USA.
Analytical and Translational Genetics Unit, Massachusetts General Hospital, 55 Fruit Street, Boston, MA, 02114, USA.

Xiaolei Zhang (X)

National Heart and Lung Institute and MRC London Institute of Medical Sciences, Imperial College London, Du Cane Road, London, W12 0NN, UK.
NIHR Royal Brompton Cardiovascular Research Centre, Royal Brompton and Harefield National Health Service Foundation Trust, Sydney Street, London, SW3 6NP, UK.

Sonia Chothani (S)

Program in Cardiovascular and Metabolic Disorders, Duke-NUS Medical School, 8 College Road, Singapore, 169857, Singapore.

Miriam J Smith (MJ)

NW Genomic Laboratory Hub, Centre for Genomic Medicine, Division of Evolution and Genomic Science, St Mary's Hospital, University of Manchester, Oxford Road, Manchester, M13 9WL, UK.

D Gareth Evans (DG)

NW Genomic Laboratory Hub, Centre for Genomic Medicine, Division of Evolution and Genomic Science, St Mary's Hospital, University of Manchester, Oxford Road, Manchester, M13 9WL, UK.

Angharad M Roberts (AM)

National Heart and Lung Institute and MRC London Institute of Medical Sciences, Imperial College London, Du Cane Road, London, W12 0NN, UK.
NIHR Royal Brompton Cardiovascular Research Centre, Royal Brompton and Harefield National Health Service Foundation Trust, Sydney Street, London, SW3 6NP, UK.

Nicholas M Quaife (NM)

National Heart and Lung Institute and MRC London Institute of Medical Sciences, Imperial College London, Du Cane Road, London, W12 0NN, UK.
NIHR Royal Brompton Cardiovascular Research Centre, Royal Brompton and Harefield National Health Service Foundation Trust, Sydney Street, London, SW3 6NP, UK.

Sebastian Schafer (S)

Program in Cardiovascular and Metabolic Disorders, Duke-NUS Medical School, 8 College Road, Singapore, 169857, Singapore.
National Heart Centre Singapore, 5 Hospital Drive, Singapore, 169609, Singapore.

Owen Rackham (O)

Program in Cardiovascular and Metabolic Disorders, Duke-NUS Medical School, 8 College Road, Singapore, 169857, Singapore.

Jessica Alföldi (J)

Medical and Population Genetics, Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, MA, 02142, USA.
Analytical and Translational Genetics Unit, Massachusetts General Hospital, 55 Fruit Street, Boston, MA, 02114, USA.

Anne H O'Donnell-Luria (AH)

Medical and Population Genetics, Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, MA, 02142, USA.
Division of Genetics and Genomics, Boston Children's Hospital, Boston, MA, 02115, USA.
Department of Pediatrics, Harvard Medical School, Boston, MA, 02115, USA.

Laurent C Francioli (LC)

Medical and Population Genetics, Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, MA, 02142, USA.
Analytical and Translational Genetics Unit, Massachusetts General Hospital, 55 Fruit Street, Boston, MA, 02114, USA.

Stuart A Cook (SA)

National Heart and Lung Institute and MRC London Institute of Medical Sciences, Imperial College London, Du Cane Road, London, W12 0NN, UK.
Program in Cardiovascular and Metabolic Disorders, Duke-NUS Medical School, 8 College Road, Singapore, 169857, Singapore.
National Heart Centre Singapore, 5 Hospital Drive, Singapore, 169609, Singapore.

Paul J R Barton (PJR)

National Heart and Lung Institute and MRC London Institute of Medical Sciences, Imperial College London, Du Cane Road, London, W12 0NN, UK.
NIHR Royal Brompton Cardiovascular Research Centre, Royal Brompton and Harefield National Health Service Foundation Trust, Sydney Street, London, SW3 6NP, UK.

Daniel G MacArthur (DG)

Medical and Population Genetics, Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, MA, 02142, USA.
Analytical and Translational Genetics Unit, Massachusetts General Hospital, 55 Fruit Street, Boston, MA, 02114, USA.
Centre for Population Genomics, Garvan Institute of Medical Research, and UNSW Sydney, Sydney, Australia.
Centre for Population Genomics, Murdoch Children's Research Institute, Melbourne, Australia.

James S Ware (JS)

National Heart and Lung Institute and MRC London Institute of Medical Sciences, Imperial College London, Du Cane Road, London, W12 0NN, UK.
NIHR Royal Brompton Cardiovascular Research Centre, Royal Brompton and Harefield National Health Service Foundation Trust, Sydney Street, London, SW3 6NP, UK.
Medical and Population Genetics, Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, MA, 02142, USA.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH