Structural variation of the malaria-associated human glycophorin A-B-E region.
Copy number variation
Erythrocytes
GYPA
GYPB
GYPE
Glycophorin
Immune response
Inversion
Malaria
Structural variation
Journal
BMC genomics
ISSN: 1471-2164
Titre abrégé: BMC Genomics
Pays: England
ID NLM: 100965258
Informations de publication
Date de publication:
29 Jun 2020
29 Jun 2020
Historique:
received:
07
01
2020
accepted:
18
06
2020
entrez:
1
7
2020
pubmed:
1
7
2020
medline:
2
2
2021
Statut:
epublish
Résumé
Approximately 5% of the human genome shows common structural variation, which is enriched for genes involved in the immune response and cell-cell interactions. A well-established region of extensive structural variation is the glycophorin gene cluster, comprising three tandemly-repeated regions about 120 kb in length and carrying the highly homologous genes GYPA, GYPB and GYPE. Glycophorin A (encoded by GYPA) and glycophorin B (encoded by GYPB) are glycoproteins present at high levels on the surface of erythrocytes, and they have been suggested to act as decoy receptors for viral pathogens. They are receptors for the invasion of the protist parasite Plasmodium falciparum, a causative agent of malaria. A particular complex structural variant, called DUP4, creates a GYPB-GYPA fusion gene known to confer resistance to malaria. Many other structural variants exist across the glycophorin gene cluster, and they remain poorly characterised. Here, we analyse sequences from 3234 diploid genomes from across the world for structural variation at the glycophorin locus, confirming 15 variants in the 1000 Genomes project cohort, discovering 9 new variants, and characterising a selection of these variants using fibre-FISH and breakpoint mapping at the sequence level. We identify variants predicted to create novel fusion genes and a common inversion duplication variant at appreciable frequencies in West Africans. We show that almost all variants can be explained by non-allelic homologous recombination and by comparing the structural variant breakpoints with recombination hotspot maps, confirm the importance of a particular meiotic recombination hotspot on structural variant formation in this region. We identify and validate large structural variants in the human glycophorin A-B-E gene cluster which may be associated with different clinical aspects of malaria.
Sections du résumé
BACKGROUND
BACKGROUND
Approximately 5% of the human genome shows common structural variation, which is enriched for genes involved in the immune response and cell-cell interactions. A well-established region of extensive structural variation is the glycophorin gene cluster, comprising three tandemly-repeated regions about 120 kb in length and carrying the highly homologous genes GYPA, GYPB and GYPE. Glycophorin A (encoded by GYPA) and glycophorin B (encoded by GYPB) are glycoproteins present at high levels on the surface of erythrocytes, and they have been suggested to act as decoy receptors for viral pathogens. They are receptors for the invasion of the protist parasite Plasmodium falciparum, a causative agent of malaria. A particular complex structural variant, called DUP4, creates a GYPB-GYPA fusion gene known to confer resistance to malaria. Many other structural variants exist across the glycophorin gene cluster, and they remain poorly characterised.
RESULTS
RESULTS
Here, we analyse sequences from 3234 diploid genomes from across the world for structural variation at the glycophorin locus, confirming 15 variants in the 1000 Genomes project cohort, discovering 9 new variants, and characterising a selection of these variants using fibre-FISH and breakpoint mapping at the sequence level. We identify variants predicted to create novel fusion genes and a common inversion duplication variant at appreciable frequencies in West Africans. We show that almost all variants can be explained by non-allelic homologous recombination and by comparing the structural variant breakpoints with recombination hotspot maps, confirm the importance of a particular meiotic recombination hotspot on structural variant formation in this region.
CONCLUSIONS
CONCLUSIONS
We identify and validate large structural variants in the human glycophorin A-B-E gene cluster which may be associated with different clinical aspects of malaria.
Identifiants
pubmed: 32600246
doi: 10.1186/s12864-020-06849-8
pii: 10.1186/s12864-020-06849-8
pmc: PMC7325229
doi:
Substances chimiques
GYPA protein, human
0
GYPB protein, human
0
Glycophorins
0
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
446Subventions
Organisme : Saudi Arabia Cultural Bureau in London
ID : n/a
Organisme : Wellcome Trust
ID : WT098051
Pays : United Kingdom
Références
Vox Sang. 1964 Jul-Aug;9:502-4
pubmed: 14177440
Mol Biol Evol. 2013 Apr;30(4):772-80
pubmed: 23329690
Parasit Vectors. 2019 Jun 24;12(1):317
pubmed: 31234897
Nat Rev Genet. 2015 Mar;16(3):172-83
pubmed: 25645873
J Immunogenet. 1982 Feb;9(1):51-5
pubmed: 7077083
Nature. 2016 Feb 11;530(7589):177-83
pubmed: 26814963
BMC Genomics. 2014 May 01;15:329
pubmed: 24885186
Bioinformatics. 2010 Mar 15;26(6):841-2
pubmed: 20110278
Science. 2014 Nov 14;346(6211):1256442
pubmed: 25395542
Mol Biol Evol. 2002 Mar;19(3):223-9
pubmed: 11861881
PLoS Genet. 2018 Mar 28;14(3):e1007293
pubmed: 29590102
Proc Natl Acad Sci U S A. 2011 Jul 26;108(30):12378-83
pubmed: 21750151
Nat Genet. 2015 Mar;47(3):296-303
pubmed: 25621458
Hum Mutat. 2003 Jul;22(1):79-85
pubmed: 12815597
Nucleic Acids Res. 2015 Jul 1;43(W1):W580-4
pubmed: 25845596
Nature. 2010 Apr 1;464(7289):704-12
pubmed: 19812545
Am J Hum Genet. 2018 Nov 1;103(5):769-776
pubmed: 30388403
Nature. 2015 Oct 1;526(7571):75-81
pubmed: 26432246
Cell. 1990 Feb 9;60(3):473-85
pubmed: 2406022
Brief Funct Genomics. 2015 Sep;14(5):329-38
pubmed: 26163405
Eur J Biochem. 1990 Aug 17;191(3):619-25
pubmed: 2390989
Nature. 2016 Oct 13;538(7624):201-206
pubmed: 27654912
Genome Res. 2002 Jun;12(6):996-1006
pubmed: 12045153
Nat Commun. 2019 Dec 16;10(1):5732
pubmed: 31844061
Gigascience. 2017 Sep 1;6(9):1-7
pubmed: 28938720
Biotechniques. 2008 Jun;44(7):921-3
pubmed: 18533902
Cad Saude Publica. 2005 Jul-Aug;21(4):1177-85
pubmed: 16021255
Lancet Haematol. 2018 Aug;5(8):e333-e345
pubmed: 30033078
Nature. 2015 Oct 1;526(7571):68-74
pubmed: 26432245
Nucleic Acids Res. 2014 Jan;42(Database issue):D986-92
pubmed: 24174537
Nucleic Acids Res. 2020 Jan 8;48(D1):D941-D947
pubmed: 31584097
Hum Genet. 2014 Oct;133(10):1217-33
pubmed: 25110110
Nat Commun. 2016 Oct 06;7:12989
pubmed: 27708267
Am J Hematol. 1991 May;37(1):57-8
pubmed: 2024643
J Virol. 2004 Sep;78(18):9807-13
pubmed: 15331714
Science. 2017 Jun 16;356(6343):
pubmed: 28522690
Bioinformatics. 2009 Aug 15;25(16):2078-9
pubmed: 19505943
J Biol Chem. 1994 Apr 29;269(17):13013-20
pubmed: 8175721
Genetics. 2016 Apr;202(4):1251-4
pubmed: 27053122
Proc Natl Acad Sci U S A. 2016 Oct 18;113(42):11901-11906
pubmed: 27702888
Brief Bioinform. 2013 Mar;14(2):178-92
pubmed: 22517427
Hum Genet. 1994 Feb;93(2):148-50
pubmed: 8112738