Complete genomic and epigenetic maps of human centromeres

doi:10.1126/science.abl4178

Open AccessJournal ArticleDOI

Complete genomic and epigenetic maps of human centromeres

- 01 Apr 2022 -

Science

- Vol. 376, Iss: 6588

Chats0

TLDR

In this paper , a complete, telomere-to-telomere human genome assembly (T2T-CHM13) has enabled the comprehensively characterize pericentromeric and centromeric repeats, which constitute 6.2% of the genome.

Abstract:

Existing human genome assemblies have almost entirely excluded repetitive sequences within and near centromeres, limiting our understanding of their organization, evolution, and functions, which include facilitating proper chromosome segregation. Now, a complete, telomere-to-telomere human genome assembly (T2T-CHM13) has enabled us to comprehensively characterize pericentromeric and centromeric repeats, which constitute 6.2% of the genome (189.9 megabases). Detailed maps of these regions revealed multimegabase structural rearrangements, including in active centromeric repeat arrays. Analysis of centromere-associated sequences uncovered a strong relationship between the position of the centromere and the evolution of the surrounding DNA through layered repeat expansions. Furthermore, comparisons of chromosome X centromeres across a diverse panel of individuals illuminated high degrees of structural, epigenetic, and sequence variation in these complex and rapidly evolving regions.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

The complete sequence of a human genome

Sergey Koren, +6 more

- 01 Apr 2022 -

Science

TL;DR: The T2T-CHM13-T2T Consortium presented a complete 3.055 billion-base pair sequence of a human genome, including gapless assemblies for all chromosomes except Y, corrected errors in the prior references, and introduced nearly 200 million base pairs of sequence containing gene predictions, 99 of which are predicted to be protein coding as discussed by the authors .

...read moreread less

Journal ArticleDOI

A complete reference genome improves analysis of human genetic variation

Justin M. Zook, +8 more

- 01 Apr 2022 -

Science

TL;DR: The T2T-CHM13 reference as discussed by the authors has been shown to universally improve read mapping and variant calling for 3202 and 17 globally diverse samples sequenced with short and long reads, respectively.

...read moreread less

Journal ArticleDOI

From telomere to telomere: The transcriptional and epigenetic state of human repeat elements

- 01 Apr 2022 -

Science

TL;DR: In this paper , a de novo repeat discovery and annotation of the T2T-CHM13 human reference genome was presented, which expanded the catalog of variants and families for repeats and mobile elements, characterized classes of complex composite repeat, and located retroelement transduction events.

...read moreread less

Journal ArticleDOI

A draft human pangenome reference

Wen-Wei Liao, +55 more

- 09 Jul 2022 -

Visual education

TL;DR: The pangenome reference as discussed by the authors contains 47 phased, diploid assemblies from a cohort of genetically diverse individuals and is more than 99% accurate at the structural and base pair levels.

...read moreread less

Journal ArticleDOI

Epigenetic patterns in a complete human genome

- 01 Apr 2022 -

Science

TL;DR: In this article , a high-resolution epigenetic study of previously unresolved sequences was presented, representing entire acrocentric chromosome short arms, gene family expansions, and a diverse collection of repeat classes.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

The Sequence Alignment/Map format and SAMtools

Heng Li, +8 more

- 01 Aug 2009 -

Bioinformatics

TL;DR: SAMtools as discussed by the authors implements various utilities for post-processing alignments in the SAM format, such as indexing, variant caller and alignment viewer, and thus provides universal tools for processing read alignments.

...read moreread less

Journal ArticleDOI

MEGA5: Molecular Evolutionary Genetics Analysis using Maximum Likelihood, Evolutionary Distance, and Maximum Parsimony Methods

Koichiro Tamura, +5 more

- 01 Oct 2011 -

Molecular Biology and Evolution

TL;DR: The newest addition in MEGA5 is a collection of maximum likelihood (ML) analyses for inferring evolutionary trees, selecting best-fit substitution models, inferring ancestral states and sequences, and estimating evolutionary rates site-by-site.

...read moreread less

Journal ArticleDOI

MUSCLE: multiple sequence alignment with high accuracy and high throughput

Robert C. Edgar

- 01 Mar 2004 -

Nucleic Acids Research

TL;DR: MUSCLE is a new computer program for creating multiple alignments of protein sequences that includes fast distance estimation using kmer counting, progressive alignment using a new profile function the authors call the log-expectation score, and refinement using tree-dependent restricted partitioning.

...read moreread less

Journal ArticleDOI

Cutadapt removes adapter sequences from high-throughput sequencing reads

Marcel Martin

- 02 May 2011 -

EMBnet.journal

TL;DR: The command-line tool cutadapt is developed, which supports 454, Illumina and SOLiD (color space) data, offers two adapter trimming algorithms, and has other useful features.

...read moreread less

Journal ArticleDOI

BEDTools: a flexible suite of utilities for comparing genomic features

Aaron R. Quinlan, +1 more

- 15 Mar 2010 -

Bioinformatics

TL;DR: A new software suite for the comparison, manipulation and annotation of genomic features in Browser Extensible Data (BED) and General Feature Format (GFF) format, which allows the user to compare large datasets (e.g. next-generation sequencing data) with both public and custom genome annotation tracks.

...read moreread less