Cell-type Specific Expression Quantitative Trait Loci Associated with Alzheimer Disease in Blood and Brain Tissue
Summary (3 min read)
INTRODUCTION
- Recent expression quantitative trait locus (eQTL) analysis studies suggest that changes in gene expression have a role in the pathogenesis of AD 1, 2.
- In addition, disease and trait-associated cis-eQTLs were more cell type specific than average cis-eQTLs 7.
- Microglia, monocytes and macrophages share a similar developmental lineage and are all considered to be myeloid cells 9.
Study cohorts
- The FHS is a multigenerational study of health and disease in a prospectively followed community-based and primarily non-Hispanic white sample.
- Requisite information for this study was available for 5,257 participants.
- ROS enrolled older nuns and priests from across the US, without known dementia for longitudinal clinical analysis and brain donation and MAP enrolled older subjects without dementia from retirement homes who agreed to brain donation at the time of death 14, 15.
- Preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
Data processing
- Generation and initial quality control (QC) procedures of the FHS GWAS and expression data are described elsewhere and include all genotype QC and preadjustment of gene expression levels for batch effects and other technical covariates 13.
- ROSMAP gene expression data were log-normalized and adjusted for known and hidden variables detected by surrogate variable analysis (SVA) 17 in order to determine which of these variables should be included as covariates in analysis models for eQTL discovery.
- Additional filtering steps of FHS and ROSMAP GWAS and gene expression data included eliminating subjects with missing data, restricting gene expression data to protein coding genes, and retaining common variants (MAF ≥ 0.05) with good imputation quality (R2 ≥ 0.3).
Cis eQTL mapping
- Cis-eQTL mapping was performed using a genome-wide design (Fig. S1).
- The association of gene expression with SNP genotypes for all cis SNPs within1 Mb of protein-coding genes was evaluated using linear mixed models adjusting for family structure in FHS and linear regression models for unrelated individuals in ROSMAP.
- The linear model for analysis of FHS can data be expressed as follows: preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
- Yi is the expression value for gene i, Gj is the genotype dosage for cis SNP j, Aij, Sij, PMij, S2ij, and SV1ij are the covariates for age, sex, PMI, study and SV1 respectively, ij is the residual error, and the βs are regression coefficients.
Cis ct-eQTL mapping
- Models testing associations with cell type-specific eQTLs (ct-eQTLs) included an interaction term for expression levels of “proxy” genes that represent cell types.
- These proxy genes for cell types in blood were established previously using BLUEPRINT expression data to validate celltype-specific expression in each cell-type module 5 and the proxy genes for brain cell types have been incorporated in several studies 20-22.
- Preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
- The copyright holder for thisthis version posted November 24, 2020.
- Significance of the pathways was determined by the Fisher's Exact test with False discovery rate (FDR) multiple test correction.
Colocalization analyses
- Assessment of causal variants shared by adjacent GWAS and eQTL signals was performed using a Bayesian colocalization approach implemented in the R package coloc 24.
- This analysis incorporated information about significantly associated variants for AD risk obtained from a recent large GWAS 25 and lead eQTL variants each defined as the eSNP showing the strongest association with gene expression.
- Following recommended guidelines, the variants were deemed to be colocalized by a high posterior probability that a single shared variant is responsible for both signals (PP4 > 0.8) 24, 26.
- A lower threshold for statistical significance with a false discovery rate (FDR) < 0.05 for eQTL significant results was applied to maximize detection of colocalized pairs.
- Regional plots were constructed with LocusZoom 27.
RESULTS
- Preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
- The copyright holder for thisthis version posted November 24, 2020.
- Among the AD-associated SNPs at the GWS level, rs9271058 is a significant eSNP for HLA-DRB1 in both blood and brain cell types (the most significant association by p-value was observed in anti-bacterial cells and microglia) and rs9271192 is a significant ct-eQTL for the gene in multiple brain cell types (Table 1).
- Among the significant ct-eQTLs in brain, the cell types with the largest proportion that were also significant in monocytes/macrophages were microglia (1.6%) and neurons (1.3%) (Table 4C).
DISCUSSION
- The authors identified several novel AD-related eQTLs that highlight the importance of cell-type dependent context.
- Preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
- Five established AD genes (CR1, ECHDC3, HLA-DRB1, HLA-DRB5, and WWOX 25, 28) were shared eGenes in brain and blood and could be playing a key role in the systemic AD mechanisms.
- The copyright holder for thisthis version posted November 24, 2020.
- Also, findings in brain may reflect post-mortem changes unrelated to disease or cell-type different expression 36.
Conclusion
- The authors observation of cell-type specific expression patterns for established and potentially novel AD genes, finding of additional evidence for the role of myeloid cells in AD risk, and discovery of potential novel blood and brain AD biomarkers highlight the importance All rights reserved.
- Preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
- The copyright holder for thisthis version posted November 24, 2020.
Did you find this useful? Give us your feedback
Citations
5 citations
4 citations
2 citations
1 citations
1 citations
References
3,726 citations
3,343 citations
3,289 citations
2,454 citations
2,273 citations
Related Papers (5)
Frequently Asked Questions (12)
Q2. What are the future works in "Cell-type specific expression quantitative trait loci associated with alzheimer disease in blood and brain tissue" ?
Future studies that compare cell type specific differential gene expression among AD cases and controls using single cell RNA-sequencing or cell count data, as well as functional experiments, are needed to validate and extend their findings.
Q3. What is the widely used marker to examine activated microglia in normal and diseased?
In addition, the most widely used marker to examine activated microglia in normal and diseased human brains is HLA-DR and microglia activation increases with the progression of AD 49, 50.
Q4. What is the key susceptibility locus in many immunological diseases?
The human leukocyte antigen (HLA) region is the key susceptibility locus in many immunological diseases and many associations have been reported between neurodegenerative diseases and HLA haplotypes 48.
Q5. What is the significance of the ct-eQTLs in the ROSMAP?
Because many AD risk genes are expressed in myeloid cells including microglia 10, the large number of microglia ct-eQTLs is consistent with the high proportion of AD subjects in the ROSMAP dataset.
Q6. What were the filtering steps for FHS and ROSMAP GWAS?
Additional filtering steps of FHS and ROSMAP GWAS and gene expression data included eliminating subjects with missing data, restricting gene expression data to protein coding genes, and retaining common variants (MAF ≥ 0.05) with good imputation quality (R2 ≥ 0.3).
Q7. What is the significant eSNP for rs6557994?
Rs6557994 is also correlated with a GWAS SNP in CLU, located approximately 150 kb from PTK2B, that is not significantly associated with expression of any gene.
Q8. What is the significant eSNP for and located in PTK2B?
Rs6557994 is the most significant eSNP for and located in PTK2B (blood ct-eQTL P=2.58x10-9) and is moderately correlated with the PTK2B GWAS SNP (rs28834970, r2=0.78, P=1.58x10-9).
Q9. What are the common GWS SNPs for AD?
Many GWS SNPs for AD risk are eSNPs affecting expression of the nearest gene, which is usually recognized as the causative gene, but several GWS SNPs target other genes (Table S9).
Q10. What is the significance of ct-eQTLs in brain and blood?
Five established AD genes (CR1, ECHDC3, HLA-DRB1, HLA-DRB5, and WWOX 25, 28) were shared eGenes in brain and blood and could be playing a key role in the systemic AD mechanisms.
Q11. How many cteQTLs are located in microglia?
Approximately 52% of significant cteQTLs in microglia are located in AD regions including five of the 20 most significant cteQTLs in this group (Table 4B).
Q12. What is the link between rs113986870 and ARL17A?
rs113986870 also significantly influenced expression of another gene in this region, ARL17A, that was previously linked to progressive supranuclear palsy by analysis of gene expression and methylation 83.