Structural variation of the malaria-associated human glycophorin A-B-E region

2019 
Approximately 5% of the human genome consists of structural variants, which are enriched for genes involved in the immune response and cell-cell interactions. A well-established region of extensive structural variation is the glycophorin gene cluster, comprising three tandemly-repeated regions about 120kb in length, carrying the highly homologous genes GYPA, GYPB and GYPE. Glycophorin A and glycophorin B are glycoproteins present at high levels on the surface of erythrocytes, and they have been suggested to act as decoy receptors for viral pathogens. They act as receptors for invasion of a causative agent of malaria, Plasmodium falciparum. A particular complex structural variant (DUP4) that creates a GYPB/GYPA fusion gene is known to confer resistance to malaria. Many other structural variants exist, and remain poorly characterised. Here, we analyse sequences from 6466 genomes from across the world for structural variation at the glycophorin locus, confirming 15 variants in the 1000 Genomes project cohort, discovering 9 new variants, and characterising a selection using fibre-FISH and breakpoint mapping. We identify variants predicted to create novel fusion genes and a common inversion duplication variant at appreciable frequencies in West Africans. We show that almost all variants can be explained by unequal cross over events (non-allelic homologous recombination, NAHR) and by comparing the structural variant breakpoints with recombination hotspot maps, show the importance of a particular meiotic recombination hotspot on structural variant formation in this region.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    46
    References
    1
    Citations
    NaN
    KQI
    []