Telomere-to-telomere assembly of a complete human X chromosome
- PMID: 32663838
- PMCID: PMC7484160
- DOI: 10.1038/s41586-020-2547-7
Telomere-to-telomere assembly of a complete human X chromosome
Abstract
After two decades of improvements, the current human reference genome (GRCh38) is the most accurate and complete vertebrate genome ever produced. However, no single chromosome has been finished end to end, and hundreds of unresolved gaps persist1,2. Here we present a human genome assembly that surpasses the continuity of GRCh382, along with a gapless, telomere-to-telomere assembly of a human chromosome. This was enabled by high-coverage, ultra-long-read nanopore sequencing of the complete hydatidiform mole CHM13 genome, combined with complementary technologies for quality improvement and validation. Focusing our efforts on the human X chromosome3, we reconstructed the centromeric satellite DNA array (approximately 3.1 Mb) and closed the 29 remaining gaps in the current reference, including new sequences from the human pseudoautosomal regions and from cancer-testis ampliconic gene families (CT-X and GAGE). These sequences will be integrated into future human reference genome releases. In addition, the complete chromosome X, combined with the ultra-long nanopore data, allowed us to map methylation patterns across complex tandem repeats and satellite arrays. Our results demonstrate that finishing the entire human genome is now within reach, and the data presented here will facilitate ongoing efforts to complete the other human chromosomes.
Conflict of interest statement
E.E.E. is on the scientific advisory board of DNAnexus. K.H.M., S.K. and W.T. have received travel funds to speak at symposia organized by Oxford Nanopore. W.T. has two patents licensed to Oxford Nanopore (US patent 8,748,091 and US patent 8,394,584). A.D.S., J.-M.B. and S.S. are employees of Arima Genomics. R.R. shares equity in NanoString Technologies and is the principal investigator on an NIH SBIR subcontract research agreement with TwinStrand Biosciences. All other authors have no competing interests to declare.
Figures













Comment in
-
A long read of the human genome.Nat Rev Genet. 2020 Oct;21(10):577. doi: 10.1038/s41576-020-0273-5. Nat Rev Genet. 2020. PMID: 32747762 No abstract available.
Similar articles
-
The X chromosome from telomere to telomere: key achievements and future opportunities.Fac Rev. 2021 Jul 30;10:63. doi: 10.12703/r-01-000001. eCollection 2021. Fac Rev. 2021. PMID: 35088059 Free PMC article. Review.
-
Centromere reference models for human chromosomes X and Y satellite arrays.Genome Res. 2014 Apr;24(4):697-707. doi: 10.1101/gr.159624.113. Epub 2014 Feb 5. Genome Res. 2014. PMID: 24501022 Free PMC article.
-
Chasing perfection: validation and polishing strategies for telomere-to-telomere genome assemblies.Nat Methods. 2022 Jun;19(6):687-695. doi: 10.1038/s41592-022-01440-3. Epub 2022 Mar 31. Nat Methods. 2022. PMID: 35361931 Free PMC article.
-
Closing in on a complete human genome.Nature. 2021 Feb;590(7847):679-681. doi: 10.1038/d41586-021-00462-9. Nature. 2021. PMID: 33619406 No abstract available.
-
Dark Matter of Primate Genomes: Satellite DNA Repeats and Their Evolutionary Dynamics.Cells. 2020 Dec 18;9(12):2714. doi: 10.3390/cells9122714. Cells. 2020. PMID: 33352976 Free PMC article. Review.
Cited by
-
RAviz: a visualization tool for detecting false-positive alignments in repetitive genomic regions.Hortic Res. 2022 May 20;9:uhac161. doi: 10.1093/hr/uhac161. eCollection 2022. Hortic Res. 2022. PMID: 36204211 Free PMC article. No abstract available.
-
Towards population-scale long-read sequencing.Nat Rev Genet. 2021 Sep;22(9):572-587. doi: 10.1038/s41576-021-00367-3. Epub 2021 May 28. Nat Rev Genet. 2021. PMID: 34050336 Free PMC article. Review.
-
Chromosome-scale assemblies of the male and female Populus euphratica genomes reveal the molecular basis of sex determination and sexual dimorphism.Commun Biol. 2022 Nov 4;5(1):1186. doi: 10.1038/s42003-022-04145-7. Commun Biol. 2022. PMID: 36333427 Free PMC article.
-
Merqury: reference-free quality, completeness, and phasing assessment for genome assemblies.Genome Biol. 2020 Sep 14;21(1):245. doi: 10.1186/s13059-020-02134-9. Genome Biol. 2020. PMID: 32928274 Free PMC article.
-
Centromeric Transcription: A Conserved Swiss-Army Knife.Genes (Basel). 2020 Aug 9;11(8):911. doi: 10.3390/genes11080911. Genes (Basel). 2020. PMID: 32784923 Free PMC article. Review.
References
Publication types
MeSH terms
Substances
Grants and funding
- 2U41HG007234/HG/NHGRI NIH HHS/United States
- MR/S035362/1/MRC_/Medical Research Council/United Kingdom
- T32 GM007445/GM/NIGMS NIH HHS/United States
- MR/M501621/1/MRC_/Medical Research Council/United Kingdom
- R21 HG010548/HG/NHGRI NIH HHS/United States
- R21 CA238758/CA/NCI NIH HHS/United States
- HG002385/NH/NIH HHS/United States
- P30 CA014236/CA/NCI NIH HHS/United States
- R01 HG009190/HG/NHGRI NIH HHS/United States
- R01 HG002385/HG/NHGRI NIH HHS/United States
- R01 GM124041/GM/NIGMS NIH HHS/United States
- U01 1U01HG010971/NH/NIH HHS/United States
- U01 HL137183/HL/NHLBI NIH HHS/United States
- DP2 MH119424/MH/NIMH NIH HHS/United States
- U54 1U54HG007990/NH/NIH HHS/United States
- T32 LM012419/LM/NLM NIH HHS/United States
- U01 1U01HL137183/HL/NHLBI NIH HHS/United States
- 1F32GM134558-01/NH/NIH HHS/United States
- 212965/Z/18/Z/WT_/Wellcome Trust/United Kingdom
- HHMI/Howard Hughes Medical Institute/United States
- F32 GM134558/GM/NIGMS NIH HHS/United States
- R01 HG010169/HG/NHGRI NIH HHS/United States
- R01 GM129263/GM/NIGMS NIH HHS/United States
- U41 HG007234/HG/NHGRI NIH HHS/United States
- R01 CA181308/CA/NCI NIH HHS/United States
- U01 HG010971/HG/NHGRI NIH HHS/United States
- U54 HG007990/HG/NHGRI NIH HHS/United States
- WT_/Wellcome Trust/United Kingdom
- R44 HG008118/HG/NHGRI NIH HHS/United States
- HG010169/NH/NIH HHS/United States
- MR/J014370/1/MRC_/Medical Research Council/United Kingdom
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials
Miscellaneous