Telomere-to-telomere assembly of a complete human X chromosome.

2020 
After two decades of improvements, the current human reference genome (GRCh38) is the most accurate and complete vertebrate genome ever produced. However, no one chromosome has been finished end to end, and hundreds of unresolved gaps persist1,2. Here we present a de novo human genome assembly that surpasses the continuity of GRCh382, along with the first gapless, telomere-to-telomere assembly of a human chromosome. This was enabled by high-coverage, ultra-long-read nanopore sequencing of the complete hydatidiform mole CHM13 genome, combined with complementary technologies for quality improvement and validation. Focusing our efforts on the human X chromosome3, we reconstructed the ~3.1 megabase centromeric satellite DNA array and closed all 29 remaining gaps in the current reference, including new sequence from the human pseudoautosomal regions and cancer-testis ampliconic gene families (CT-X and GAGE). These novel sequences will be integrated into future human reference genome releases. Additionally, a complete chromosome X, combined with the ultra-long nanopore data, allowed us to map methylation patterns across complex tandem repeats and satellite arrays for the first time. Our results demonstrate that finishing the entire human genome is now within reach and the data presented here will enable ongoing efforts to complete the remaining human chromosomes.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    68
    References
    257
    Citations
    NaN
    KQI
    []