A map of transcriptional heterogeneity and regulatory variation in human microglia
Summary (3 min read)
Introduction
- Microglia are tissue resident macrophages of the central nervous system and play critical roles in neurological immune defence, development and homeostasis (Schafer and Stevens 2015; Q. Li and Barres 2018; Salter and Stevens 2017).
- Two populations (C and D) were common in patients with acute brain injury (25-76% of cells) but rare in other pathologies (<5% of cells).
- This analysis retrieved colocalisations at other AD GWAS loci, such as CD33 and CASS4.
- The authors also used the three-way model to evaluate the extent of sharing between the microglia eQTLs, IPSDM or monocyte eQTLs, and AD risk loci.
- The authors identified multiple microglial subpopulations and showed how these populations are shaped by insult, injury and other life history factors.
Tissue sampling
- Human brain tissue was obtained with informed consent under protocol REC 16/LO/2168 approved by the NHS Health Research Authority.
- Adult brain tissue biopsies were taken from the site of neurosurgery resection for the original clinical indication.
- Paired venous blood was sampled at the induction of anaesthesia.
Dissociation of brain tissue
- The prepared mix was spun in HBSS+ (Life Technologies) at 300g for 5 mins and supernatant discarded.
- The digested tissue was rigorously triturated at 4°C and filtered through a 70 m nylon cell strainer to remove large cell debris and undigested tissue.
- Supernatant was discarded and the pellet was re-suspended in ice cold supplemented HALF.
Fluorescence-activated cell sorting
- For single cell smart sequencing, human microglia were using fluorescence-activated cell sorting.
- The isolated cell suspension was incubated with conjugated PE anti-human CD11b antibody for 20 mins at 4°C.
- Cells were washed twice in ice cold supplemented HALF and stained with Helix NP viability marker.
- Cell sorting was performed on BD AriaIII cell sorter (Becton, Dickinson and Company, Franklin Lakes, New Jersey, US) at the University of Cambridge Cell Phenotyping Hub at Cambridge University Hospital, Cambridge, UK.
- Cells were either sorted into 98 well plates, prepared by the Wellcome Trust Sanger Institute for the purposes of single cell sequencing.
Magnetic-activated cell sorting
- To avoid sustained stress on microglia as a result of prolonged sorting times for bulk sequencing magnetic-activated cell sorting was performed on these cells.
- An isolated cell suspension of cells were incubated with anti-CD11b conjugated magnetic beads for 15 mins at 4°C.
- Cells were washed twice with supplemented HALF and passed through an MS column .
- Each sample was washed three times in the column and then extracted.
Blood preparation
- DNA extraction was performed from the venous blood.
- 10 ml of whole blood was washed with 1% phosphate buffered saline (PBS) and layered on pancoll human (PAN biotech) and spun at 500g for 25 mins.
- The white cell component was extracted and transferred to a 1.5ml Eppendorf and stored as a frozen pellet at -80C prior to sequencing.
SNP genotyping
- Genomic DNA was extracted from blood using the QIAamp DNA mini and blood mini kit (Qiagen, 51104).
- IPS cell culture and macrophage differentiation was carried as previously described (Alasoo et al. 2018) but with some minor modifications (see Supplementary Methods for details).
- Tagmentation was quenched with 0.2 % sodium dodecyl sulphate.
- Low-input bulk RNA-seq and ATAC-seq library preparation for primary microglia and iPS-derived macrophages For RNA-seq samples, between 0.3 ng and 10 ng of bulk total RNA from primary microglia cells or iPS-derived macrophage cells was used as input for a modified Smart-seq2 library preparation (Picelli et al. 2014) (see Supplementary Methods for detailed protocol).
Sequencing data preprocessing
- All sequence data sets were aligned to human genome assembly GRCh38.
- All other RNA-seq data were also aligned as same as their RNA-seq data without adapter trimming.
- The authors fit the latent factor linear mixed model in which the three different studies were treated as a random effect (see Supplementary Note Section 1 for details).
- The authors processed the data using the provided R script and obtained the cell type annotation for PBMCs.
- The count data from two studies were joined by gene IDs and converted into CPM (count per million) along with their primary microglia read count data.
Variance component analysis
- A linear mixed model of log(TPM+1) values across genome-wide genes (whose TPM>0 for 10% of total cells) was used to estimate the transcriptional variation.
- The 13 different factors (Patient, the number of expressed genes per cell, pathology, plate ID, ERCC percentage, the number of expressed genes in each cell, 96 well plate position, age of patient, mitochondria RNA percentage, brain region, brain hemisphere, ethnicity and sex) were fitted as random effects with independent variance parameters 𝜙"#.
- The variance explained by the factor k was measured by the intraclass correlation 𝜙"#/(1 + 𝜙"#), where the other 12 factors were fixed constant.
- The standard error of the intraclass correlation was computed by the delta method with the standard error of the variance parameter estimator.
- See Supplementary Note Section 1.1 for details.
Detection of microglia subpopulations
- The authors used the linear mixed model to estimate the latent factors with the 13 known confounding factors (see Supplementary Note Section 1.2 for details).
- There are 2l-1 contrasts which were tested against the null model (removing the focal factor k in the model) to compute Bayes factors.
- The fragment counts were GC corrected as described in (Kumasaka, Knights, and Gaffney 2016), normalised into TPM (transcripts per million) and then log transformed (log of TPM+1).
- 25 principal components (PCs) were calculated and regressed out from the normalised expression levels.
- The authors picked up the minimum BH Q-value for each gene to perform the multiple testing correction genome-wide.
Bayesian hierarchical model
- The authors extended a standard Bayesian hierarchical model (Veyrieras et al. 2008) to jointly map eQTLs in three different cell types.
- It can provide posterior probability that a gene is an eQTL for each cell type.
- See Supplementary Note Section 2 for more details.
- Alzheimer’s disease GWAS summary statistics GWAS of diagnosed AD (Kunkle et al. 2019) and a GWAS for family history of AD that the authors conducted in the UK Biobank (see Liu and Schwartzentruber 2019 for details) across 10,687,126 overlapping variants.
- The authors lumped the true and proxy-cases together (53,042 unique affected individuals, 355,900 controls) and performed association tests using BOLT-LMM (Loh et al., 2015).
URL
- Welch, Joshua D., Velina Kozareva, Ashley Ferreira, Charles Vanderburg, Carly Martin, and Evan Z. Macosko.
- Zhang, Hanrui, Chenyi Xue, Rhia Shah, Kate Bermingham, Christine C. Hinkle, Wenjun Li, Amrith Rodrigues, et al. 2015.
Did you find this useful? Give us your feedback
Citations
189 citations
95 citations
66 citations
62 citations
60 citations
References
43,862 citations
"A map of transcriptional heterogene..." refers methods in this paper
...The ATAC-seq data were aligned using bwa (H. Li and Durbin 2009) (version 0.7.4)....
[...]
30,684 citations
"A map of transcriptional heterogene..." refers methods in this paper
...Both Smart-seq2 and bulk RNA-seq data were aligned using STAR (Dobin et al. 2013) (version 2....
[...]
...Both Smart-seq2 and bulk RNA-seq data were aligned using STAR (Dobin et al. 2013) (version 2.5.3a; see URLs) using ENSEMBL human gene assembly 90 as the reference transcriptome....
[...]
...GTEx: https://www.gtexportal.org/home/datasets PBMC 68k cell data: https://github.com/10XGenomics/single-cell-3prime-paper/tree/master/pbmc68k_analysis RASQUAL (https://github.com/natsuhiko/rasqual) 1000 Genomes Phase III integrated variant set (http://www.internationalgenome.org/data) Beagle 4.0 (https://faculty.washington.edu/browning/beagle/b4_0.html) bwa 0.7.4 (https://sourceforge.net/projects/bio-bwa/files/) skewer 0.1.127 (https://github.com/relipmoc/skewer) STAR (https://github.com/alexdobin/STAR/releases) featureCounts (http://subread.sourceforge.net/)...
[...]
14,103 citations
4,219 citations
"A map of transcriptional heterogene..." refers methods or result in this paper
...We then compared our single cell data to public datasets of 68K PBMCs isolated from a healthy donor (Zheng et al. 2017) and 15K brain cells from 5 GTEx donors (Habib et al....
[...]
...We then compared our single cell data to public datasets of 68K PBMCs isolated from a healthy donor (Zheng et al. 2017) and 15K brain cells from 5 GTEx donors (Habib et al. 2017)....
[...]
...Pink dots show our samples. d. UMAP of single-cell RNA-seq data combined with 68K PBMC scRNA-seq (Zheng et al. 2017) and whole brain DroNc-seq (Habib et al. 2017)....
[...]
...We then performed cell type clustering with other primary single cell RNA-seq of 68k PBMCs and GTEx brain tissues (Zheng et al. 2017; Habib et al. 2017) (see below)....
[...]
3,726 citations
Related Papers (5)
Frequently Asked Questions (8)
Q2. How was the iPS cell culture performed?
Samples were added to a 1.5 ml Eppendorf to which 350 µl of RNAlater (Qiagen) was added, samples were stored at -80°C prior to sequencingDNA extraction was performed from the venous blood.
Q3. How many ng of gDNA was used for input for the SNP array?
200 ng of gDNA was used for input for the SNP array (Infinium Omni2.5-8 v1.4 Kit) and genotyping was performed according to the manufacturer's instructions.
Q4. How many GWAS loci were used to identify independently associated SNPs?
Across 36 associated loci the authors used GCTA to identify independently associated SNPs with a threshold of p < 10-5, based on LD from 10,000 randomly-sampled UK Biobank individuals.
Q5. What did the authors use to keep the posterior genotype dosage identical to the prior genotype dosage?
The authors used --no-posteriorupdate option to keep the posterior genotype dosage identical to the prior genotype dosage, that allowed us to stabilise the convergence of model fitting.
Q6. How was the white cell component extracted?
The white cell component was extracted and transferred to a 1.5ml Eppendorf and stored as a frozen pellet at -80C prior to sequencing.
Q7. How many cells passed the quality control criteria?
In total the authors sequenced 26,496 cells, of which 9,538 cells passed the quality control criteria: the minimum number of sequenced fragments (>10,000 autosomal fragments), the minimum number of expressed genes (>500 autosomal genes), mitochondrial fragment percentage (<20%) and the library complexity (percentage of autosomal fragment counts for the top 100 highly expressed genes<30%).
Q8. What was the GWAS for the BIN1 and PTK2B loci?
For the BIN1 and PTK2B loci, the authors used GCTA --cojo-cond to determine summary statistics for each of the two independent signals at each locus, with a window of +/- 500 kb around each lead SNP.