scispace - formally typeset
Search or ask a question

Showing papers by "Marc Jan Bonder published in 2023"


Posted ContentDOI
12 Jun 2023-bioRxiv
TL;DR: In this article , the authors present long-read de novo assemblies of 43 diverse Y chromosomes spanning 180,000 years of human evolution, including two from deep-rooted African Y lineages, and report remarkable complexity and diversity in chromosome size and structure, in contrast with its low level of base substitution variation.
Abstract: The prevalence of highly repetitive sequences within the human Y chromosome has led to its incomplete assembly and systematic omission from genomic analyses. Here, we present long-read de novo assemblies of 43 diverse Y chromosomes spanning 180,000 years of human evolution, including two from deep-rooted African Y lineages, and report remarkable complexity and diversity in chromosome size and structure, in contrast with its low level of base substitution variation. The size of the Y chromosome assemblies varies extensively from 45.2 to 84.9 Mbp and include, on average, 81 kbp of novel sequence per Y chromosome. Half of the male-specific euchromatic region is subject to large inversions with a >2-fold higher recurrence rate compared to inversions in the rest of the human genome. Ampliconic sequences associated with these inversions further show differing mutation rates that are sequence context-dependent and some ampliconic genes show evidence for concerted evolution with the acquisition and purging of lineage-specific pseudogenes. The largest heterochromatic region in the human genome, the Yq12, is composed of alternating arrays of DYZ1 and DYZ2 repeat units that show extensive variation in the number, size and distribution of these arrays, but retain a 1:1 copy number ratio of the monomer repeats, consistent with the notion that functional or evolutionary forces are acting on this chromosomal region. Finally, our data suggests that the boundary between the recombining pseudoautosomal region 1 and the non-recombining portions of the X and Y chromosomes lies 500 kbp distal to the currently established boundary. The availability of sequence-resolved Y chromosomes from multiple individuals provides a unique opportunity for identifying new associations of specific traits with Y-chromosomal variants and garnering novel insights into the evolution and function of complex regions of the human genome.

5 citations


Posted ContentDOI
01 Feb 2023-bioRxiv
TL;DR: In this paper , a single-cell DNA methylation and matched transcriptome dataset from mouse peripheral blood samples, spanning a broad range of ages (10-101 weeks of age).
Abstract: Ageing is the accumulation of changes and overall decline of the function of cells, organs and organisms over time. At the molecular and cellular level, the concept of biological age has been established and biomarkers of biological age have been identified, notably epigenetic DNA-methylation based clocks. With the emergence of single-cell DNA methylation profiling methods, the possibility to study biological age of individual cells has been proposed, and a first proof-of-concept study, based on limited single cell datasets mostly from early developmental origin, indicated the feasibility and relevance of this approach to better understand organismal changes and cellular ageing heterogeneity. Here we generated a large single-cell DNA methylation and matched transcriptome dataset from mouse peripheral blood samples, spanning a broad range of ages (10-101 weeks of age). We observed that the number of genes expressed increased at older ages, but gene specific changes were small. We next developed a robust single cell DNA methylation age predictor (scEpiAge), which can accurately predict age in a broad range of publicly available datasets, including very sparse data and it also predicts age in single cells. Interestingly, the DNA methylation age distribution is wider than technically expected in 19% of single cells, suggesting that epigenetic age heterogeneity is present in vivo and may relate to functional differences between cells. In addition, we observe differences in epigenetic ageing between the major blood cell types. Our work provides a foundation for better single-cell and sparse data epigenetic age predictors and highlights the significance of cellular heterogeneity during ageing. Highlights - Model to estimate DNA methylation age in single cells - Large multi-omics dataset of single cells from murine blood - Epigenetic age deviations from chronological age are greater than technical expected from technical variability - Number of genes expressed increases with chronological and epigenetic age

2 citations


Posted ContentDOI
15 May 2023-bioRxiv
TL;DR: In this paper , the authors developed an integrative Hi-C analysis pipeline to generate a comprehensive catalog of TADs, TAD boundaries, and loops in human genomes to fill the gap of limited resources.
Abstract: The human genome is packaged into the three-dimensional (3D) nucleus and organized into functional units known as topologically associating domains (TADs) and chromatin loops. Recent studies show that the 3D genome can be modified by genome structural variants (SVs) through disrupting higher-order chromatin organizations such as TADs, which play an essential role in insulating genes from aberrant regulation by regulatory elements outside TADs. Here, we have developed an integrative Hi-C analysis pipeline to generate a comprehensive catalog of TADs, TAD boundaries, and loops in human genomes to fill the gap of limited resources. We identified 2,293 TADs and 6,810 sub-TADs missing in the previously released TADs of GM12878. We then quantified the impact of SVs overlapping with TAD boundaries and observed that two SVs could significantly alter chromatin architecture leading to abnormal expression and splicing of genes associated with human diseases.