The single-cell epigenetic regulatory landscape in mammalian perinatal testis development
Summary (4 min read)
Introduction
- Mammalian testis consists of germ cells and distinct somatic cell types that coordinately underpin the maintenance of spermatogenesis and fertility.
- These testicular cells display extensive developmental dynamics during the perinatal period.
- In mice, Sertoli cells actively proliferate during the neonatal period for two weeks (Vergouwen et al. 1991).
- Since all cells share the same genetic information, lineage specification must be regulated by differential chromatin accessibility in a cell type-specific and dynamic manner.
Results
- Single-cell ATAC-Seq captures developmental and cell type-specific heterogeneity in the testis.
- To delineate the dynamic changes on cellular populations in a developing testis, the authors profiled the chromatin accessibility landscapes of mouse perinatal testis across E18.5 and postnatal stages (P0.5, P2.5 and P5.5) by scATAC-Seq (Figure 1A).
- These time points were chosen to represent the diversity of cell type compositions involved in the key developmental events in the testis (Figure 1B).
- Altogether, the authors profiled chromatin accessibility in 25,613 individual cells after stringent quality control filtration and heterotypic doublet removal (Supplementary Figure S1).
- These samples showed no clustering based on covariates such as transcription start site (TSS) enrichment and fragment size (Supplementary Figure S2A).
A. Experimental design. The workflow of testis collection and scATAC-seq to measure
- Shared embedding in which cells were grouped by cell type rather than developmental stage (Korsunsky et al. 2019).
- Prediction of cell types in scATAC-Seq was then performed by directly aligning cells from scATAC-Seq with cells from scRNA-Seq through comparing the ‘query’ gene activity scores matrix with ‘reference’ scRNA-Seq gene expression matrix.
- Taken together, scATAC-Seq allowed the detection and assignment of cell identities in the developing testis.
- Chromatin accessibility defines cell types in developing testis Cell types can be distinguished based on whether differentially accessible chromatin regions (DARs) are ‘open’ or ‘closed’.
- It is The copyright holder for this preprintthis version posted March 17, 2021.
D. TF footprints (average ATAC-seq signal around predicted binding sites) for selected
- Deconvolution of chromatin accessibility by cell types revealed accessible sites are primarily located in the distal and intron region (>3-kb from TSS), suggesting an enrichment of gene regulatory elements (Supplementary Figure S3A).
- To test this, the authors used an analytical framework to link distal peaks to genes in cis, based on the coordination of chromatin accessibility and gene expression levels across cells (Figure 3A).
- The authors analysis also successfully revealed a previously identified functional enhancer as a novel candidate to regulate Sertoli cell marker Wt1 (Figure 3D).
- Notably, the Dlk1 -Gtl2 locus demonstrated preferential accessibility in stromal and PTM cells.
- Dmrt1 prevents spermatogonia from undergoing meiosis by repressing Stra8 transcription (Matson et al. 2010).
B. Heatmaps of differential TF motif activity (left) and gene activity (right) of positive TF
- Trajectory from gonocyte to undifferentiated and then differentiating spermatogonia in P5.5.
- The second path bypassed the undifferentiated state but passed through the unknown populations and directly reached the differentiating state by P5.5.
- The authors observed that the motif binding activity of Id4 was initially high but then declined after birth, while that of ETS and Sp/KLF family members increased in spermatogonia (Supplementary Figure S7D).
- Sohlh2 , which is critical for early spermatogenesis, is more accessible at the late stage (Hao et al. 2008).
- Cluster 3 peak-to-gene links were more accessible predominantly in T1-ProSG, such as T and Fbxo4 .
TF dynamics during perinatal Sertoli cell development
- Re-clustering of Sertoli cells revealed 6 cell clusters (Figure 5A).
- Notably, C1 is largely distinct from other clusters and comprises mainly E18.5, P0.5 and P2.5 cells, suggesting this subpopulation is only present during the embryonic and early neonatal period.
- The authors then correlated the gene score of a TF to its corresponding TF z-score to reveal TFs enriched at different developmental stages (Figure 5C).
- In contrast, HIC1 and CEBPD were upregulated at the later stage (Supplementary Figure S10B).
- Further, induction of C/EBP proteins by cAMP may play a role in FSH-dependent regulation in Sertoli cells (Grønning et al. 1999).
F. Representative confocal images of seminiferous tubules from Oct4-GFP transgenic
- The authors then identified the positive TF regulators in Sertoli cells.
- C1 and C3 showed enrichment in different sets of SLC markers.
- These results suggested there is also large heterogeneity among different TFs in their involvement in different Leydig cell subpopulations.
- TFs in immune cells Re-clustering of all the immune cells generated 4 cell clusters ((Figure 1D, Supplementary Figure S12A), which can be re-grouped as 3 main groups based on the expression of their corresponding marker genes, including T cells/NK cells (C1 and C2), myeloid cells (C3) and dendritic cells (C4) (Supplementary Figure S12B).
- The authors then focused on Sertoli cells and CAD traits.
Discussion
- Understanding the genetic networks that underlie developmental processes requires a comprehensive understanding of the genes involved as well as the regulatory mechanisms that modulate the expression of these genes.
- More importantly, a significant amount of peak-to-gene links are within known testis enhancer regions, indicating the reliability of their analysis strategy.
- In contrast, the subsequent rounds of spermatogenesis are derived from Ngn3 -positive undifferentiated spermatogonia (Yoshida et al. 2006), consistent with the high expression of Ngn3 in the undifferentiated spermatogonia cluster in their dataset.
- A recent study in chicken sex determination suggested a novel stem Sertoli population could be characterized by lower levels of Dmrt1 and Sox9.
- The authors showed that testicular cell type-specific peaks displayed increased heritability enrichment in cell populations consistent with the known biology and revealed new biological insights.
Animals
- All the animal experiments were performed according to the protocols approved by the Animal Experiment Ethics Committee (AEEC) of The Chinese University of Hong Kong (CUHK) and followed the Animals (Control of Experiments) Ordinance (Cap. 340) licensed from the Department of Health, the Government of Hong Kong Special Administrative Region.
- All the mice were housed under a cycle of 12-hour light/dark and kept in ad libitum feeding and controlled the temperature of 22-24°C.
- Oct4-EGFP transgenic mice and C57BL/6J mice were maintained in CUHK Laboratory Animal Services Centre.
Sample collection
- The testes were then digested with 1 mg/ml type 4 collagenase , 1 mg/ml hyaluronidase (Sigma-Aldrich) and 5 µg/ml DNase I (Sigma-Aldrich) at 37°C for 20 min with occasional shaking.
- The suspension was passed through a 40- µm strainer cap (BD Falcon) to yield a uniform single cell suspension.
Cell lysis and tagmentation
- The lysis buffer was diluted with ATAC-Tween buffer that contains 0.1% Tween-20 as a detergent.
- Nuclei were resuspended in tagmentation mix, buffered with 1×PBS supplemented with 0.1% BSA and agitated on a ThermoMixer for 30 min at 37 °C.
- Tagmented nuclei were kept on ice before encapsulation.
- ScATAC-Seq library preparation and sequencing Tagmented nuclei were loaded onto a ddSEQ Single-Cell Isolator (Bio-Rad).
- PCR products were purified using Ampure XP beads and quantified on an Agilent Bioanalyzer (G2939BA, Agilent) using the High-Sensitivity DNA kit (5067-4626, Agilent).
Sequencing reads preprocessing
- Sequencing data were processed using the Bio-Rad ATAC-Seq Analysis Toolkit.
- The reference index was built upon the mouse genome mm10.
- For generation of the fragments file, which contain the start and end genomic coordinates of all aligned sequenced fragments, sorted bam files were further process with “bap-frag” module of BAP (https://github.com/caleblareau/bap ).
- The authors filtered out low-quality nuclei with stringent selection criteria, including read depth per cell (>2,000) and TSS enrichment score (>20%).
- ArchR was used to estimate gene expression for genes and TF motif activity from single cell chromatin accessibility data.
Trajectory analysis
- Trajectory analysis was performed in ArchR. addTrajectory function in ArchR was used to construct trajectory on cisTopic UMAP embedding.
- To perform integrative analyses for identification of positive TF regulators by integration of gene scores with motif accessibility across pseudo-time, the authors used the correlateTrajectories function which takes two SummarizedExperiment objects retrieved from the getTrajectories function.
Footprinting analysis
- Differential transcription factor footprints across cell types were identified using the Regulatory Genomics Toolbox application HINT (Li et al. 2018).
- Aligned BAM files from different cell types were treated as pseudo-bulk ATAC-Seq profiles and then subjected to rgt-hint analysis.
- Based on MACS calling peaks, the authors used HINT-ATAC to predict footprints with the “rgt-hint footprinting” command.
- The authors then identified all binding sites of a particular TF overlapping with footprints by using its motif from JASPAR with “rgt-motifanalysis matching” command.
- Differential motif occupancy was identified with “rgt-hint differential” command and ”–bc” was specified to use the bias-corrected signal.
Statistical analysis
- Assessment of statistical significance was performed using two-tailed unpaired t-tests, one-way ANOVA with Tukey multiple comparisons tests or Chi-squared tests.
- Statistical analysis was performed using GraphPad Prism v8.
Acknowledgments
- The authors thank Tommy Lo for scATAC-Seq experimental assistance and Jesse Xiao for interactive website development.
- The authors also thank the SBS Core Laboratories in The Chinese University of Hong Kong for technical support.
Author Contributions
- JL, HC Suen, RMH and TLL conceived and designed the study.
- JL and HC Suen performed experiments, single-cell sequencing analysis and wrote the manuscript.
- SR, RZ and HC So performed LDSC analysis.
- JL, HC Suen, RMH and TLL analyzed and interpreted data.
- JL, HC Suen, ACSL, AWTL, THTC, MYC, HTC, RMH and TLL reviewed and edited the manuscript.
Did you find this useful? Give us your feedback
References
2,459 citations
1,518 citations
1,452 citations
798 citations
710 citations
Related Papers (5)
Frequently Asked Questions (15)
Q2. What tests were used to assess the significance of the TF footprints?
Assessment of statistical significance was performed using two-tailed unpaired t-tests, one-way ANOVA with Tukey multiple comparisons tests or Chi-squared tests.
Q3. What is the role of enhancers in establishing tissue-specific gene expression patterns?
As enhancers play a critical role in establishing tissue-specific gene expression patterns during development, the authors predicted that active enhancers would be enriched around lineage-specific genes.
Q4. What is the role of T1-ProSG in the development of testicular cells?
Shortly after birth, T1-ProSG resume mitotic activity and begin migrating from the center of the testis cords to the basal lamina of testicular cords, and become T2-prospermatogonia (T2-ProSG).
Q5. What was the cleaning of bin regions?
Bin regions were cleaned by eliminating bins overlapping with ENCODE Blacklist regions, mitochondrial DNA as well as the top 5% of invariant features (house-keeping gene promoters).
Q6. What is the role of testicular cells in the development of sperm?
They are mainly involved in tubule contractions to facilitate the movement of sperm to the epididymis and secrete extracellular matrix materials (Chen et al. 2014).
Q7. What was the process used for generating the fragments file?
For generation of the fragments file, which contain the start and end genomic coordinates of all aligned sequenced fragments, sorted bam files were further process with “bap-frag” module of BAP (https://github.com/caleblareau/bap ).
Q8. What is the role of scATAC-Seq in identifying sper?
Since there is currently a lack of marker genes to accurately identify gonocytes which undergo the first wave of spermatogenesis, their scATAC-Seq data uncovered a list of potential markers that warrants further investigation .
Q9. What are the GO terms associated with the marker genes in C1?
GO terms associated with the marker genes in C1 include extracellular structure organization and connective tissue development (Supplementary Figure S10G).
Q10. What was the rgt-hint command used to identify the differential motif?
Differential motif occupancy was identified with “rgt-hint differential” command and ”–bc” was specified to use the bias-corrected signal.
Q11. How many markers were identified across the clusters?
761 marker genes were identified across the clusters (FDR < 0.1, log2FC > 0.5), with most upregulated in C1 (Supplementary Figure S10F).
Q12. What is the chromatin accessibility of cells in the testis?
Chromatin accessibility defines cell types in developing testisCell types can be distinguished based on whether differentially accessible chromatin regions (DARs) are ‘open’ or ‘closed’.
Q13. What is the significance of the scATAC-Seq data?
the authors showed that similar to scRNA-Seq, the chromatin accessibility information obtained from scATAC-Seq is able to define cell types in the testis.
Q14. What is the role of scATAC-Seq in identifying cell types?
Their scATAC-Seq provides additional information on TF regulation by correlating motif accessibility and predicted gene activity, which allows us to reveal cell type-specific TFs.
Q15. How many gene scores were identified among the clusters?
Based on the inferred gene scores calculated by ArchR, 81 marker genes were identified among the clusters (FDR < 0.1, log 2FC > 0.5) (Supplementary Figure S10A).