Identification of genetic variants associated with Huntington's disease progression: a genome-wide association study
Summary (2 min read)
INTRODUCTION
- Huntington’s disease (HD) is a autosomal dominant fatal neurodegenerative condition caused by a CAG repeat expansion in HTT (1).
- The transition from premanifest to manifest HD is gradual (4, 5), making clinical definition challenging, furthermore psychiatric and cognitive changes may not be concurrent with motor onset (6).
- The need for clinical trials close to disease onset has motivated a raft of observational studies (5, 9, 10).
- TRACK-HD represents the most deeply phenotyped cohort of premanifest and symptomatic disease with annual visits involving clinical, cognitive and motor testing alongside detailed brain imaging (5, 6).
- The authors developed a similar measure in subjects from the REGISTRY study to replicate their findings (9).
Study design and participants
- All experiments were performed in accordance with the Declaration of Helsinki and approved by the University College London (UCL)/UCL Hospitals Joint Research Ethics Committee; ethical approval for the REGISTRY analysis is outlined in (8).
- It provides annually collected high quality longitudinal prospective multivariate data over three years (2008-2011) with 243 subjects at baseline (6) .
- Demographic details of these individuals are shown in Supplementary Information.
- REGISTRY(9) was a multisite prospective observational study which collected phenotypic data between 2003 – 2013 on over 13,000 subjects, mostly manifest HD gene carriers.
- Age, CAG repeat length, UHDRS Total Motor Score (TMS) and Total Functional Capacity (TFC); some patients have further assessments such as a cognitive battery (9), also known as The core data include.
Procedures
- For both studies, atypical severity scores were derived with a combination of principal component analysis (PCA) and regression of the predictable effects of the primary gene HTT CAG repeat length.
- Details differed however, due to differences in nature of the two data sets.
- This model regressed the observed values on clinical probability of onset statistic (CPO) derived from CAG repeat length and age, and its interaction with follow-up length.
- Principal Component Analyses (PCA) of the random slopes was then used to study the dimensionality of these age and CAG-length corrected longitudinal changes.
- Further methodological detail, including control for potential demographic confounders, is given in Supplementary Methods and a flow chart is given in Figure 1.
Statistical and genetic analysis
- Data analyses were performed using SAS/STAT 14·0 and 14·1 primarily via the MIXED, FACTOR and GML procedures (11).
- Genotypes for the REGISTRY subjects were obtained from the GeM-HD Consortium (8), where details of their genotyping, quality control, curation and imputation are provided.
- Association analyses were performed with the mixed linear model (MLM) functions included in GCTA v1·26(12).
- Because of the relatively small sample sizes, analyses were restricted to SNPs with minor allele frequency >1%.
- Gene-wide p-values were calculated using MAGMA v1·05, a powerful alternative to SNP-based analyses which aggregates the association signal inside genes while taking linkage disequilibrium (LD) between SNPs into account (15), using a window of 35kb upstream and 10kb downstream of genes (16).
RESULTS
- The authors performed individual PCA of each domain and found that first PC scores were highly correlated between the domains (P < 0·0001 in all cases, Supplementary Information.).
- The first PC of this combined analysis accounted for 23.4% of the joint variance, and was at least moderately correlated (r>0·4) with most of the variables that contributed heavily to each domainspecific first PC (Supplementary Tables 3 and 4).
- Notably, the genic associations at the MSH3 locus in the TRACK-HD sample also remain significant after correcting for AAO (http://hdresearch.ucl.ac.uk/data-resources/), as does the association with rs557874766 (p=6·30x10 -6 ).
- Msh3 is required for both somatic expansion of HTT CAG repeats and for enhancing an early disease phenotype in mouse striatum (32), Msh3 expression level is associated with repeat instability in mouse brain, (whereas DHFR is not) (30) and expansion of CAG and CTG repeats is prevented by msh3Δ in Saccharomyces cerevisiae (33).
- This indicates that either their progression measure developed in TRACK-HD is an excellent reflection of disease pathophysiological progression or that this is a locus with a very large effect size, or, most likely, both.
Author contributions and declarations
- DJHM collected data, undertook analysis, and wrote the first draft of the ms.
- DL undertook the statistical analysis of phenotype, co-wrote the ms.
- LJ helped secure funding, supervised data analyses, co-wrote the ms.
- DL reports grant funding from CHDI via University College London (UCL), and personal fees from Roche Pharmaceutical, Voyager Pharmaceutical, and Teva Pharmaceuticals.
- DJHM, KL, AD, AFP, SM, LJ, RR, PH, and SJT declare no competing interests.
Figure & Table legends
- After establishing that brain imaging, quantitative motor and cognitive variables are correlated and follow a similar trajectory, the authors scored the TRACK-HD subjects using principal component 1 as a Unified progression measure, and used this measure to look for genomewide associations with HD progression.
- Assessing progression in Huntington’s disease (A) Graphical illustration of the trajectory of HD symptoms and signs over time, annotated to show what time period the different measures of onset and progression discussed in this paper cover, also known as Figure 2.
- (B) Manhattan plot of REGISTRY GWA analysis showing suggestive trails on chromosome 15 in the same area as the GeM GWAS significant locus (8), and chromosome 5 in the same area as the TRACK progression GWAS.
- (iii) Repair of the strand break leads to expansion of the CAG repeat.
- The p-values in columns 2 – 4 refer to the association between the pathway indicated and rate of progression described in this paper (TRACK- TRACK-HD study; REGISTRYREGISTRY study; META- meta-analysis).
Did you find this useful? Give us your feedback
Citations
547 citations
255 citations
244 citations
219 citations
Cites background from "Identification of genetic variants ..."
...Disease-risk genes are expected to harbor both common and rare risk variants, and empirical data for height (Kemper et al., 2012; Marouli et al., 2017), type 2 diabetes (Fuchsberger et al., 2016), inflammatory bowel disease (IBD) (Luo et al., 2017), and high-density lipoprotein (HDL) cholesterol (Rosenson et al., 2018) show commonvariant associations variants in genes responsible for related monogenic disorders....
[...]
...For IBD, WES (4,280 cases) identified a single rare variant (in a previously known locus) and an excess of very rare, damaging missense variants in known Crohn’s disease risk genes (including those identified through GWAS) (Luo et al., 2017)....
[...]
..., 2016), inflammatory bowel disease (IBD) (Luo et al., 2017), and high-density lipoprotein (HDL) cholesterol (Rosenson et al....
[...]
...Yet important advances for identification of potential therapeutic targets have been made through GWASs of age of onset (GeM-HD, 2015) and rate of disease progression (HensmanMoss et al., 2017)....
[...]
193 citations
References
35,225 citations
"Identification of genetic variants ..." refers background in this paper
...For both studies, atypical severity scores were derived with a combination of principal component analysis (PCA) and regression of the predictable effects of the primary gene HTT CAG repeat length....
[...]
...The GO and KEGG terms in the first column refer to pathways of biologically related genes in the Gene Ontology Consortium(1) and Kyoto Encyclopedia of Genes and Genomes (2) databases respectively....
[...]
...The colours of the circles are based on r 2 with the lead SNP in TRACK-HD as shown in the bottom of the plot; intensity of colour reflects multiple overlying SNPs. Dashed lines: 5x10 -8 Figure 4: Significant genes are functionally linked and may cause somatic expansion of the HTT CAG repeat tract....
[...]
...Background Huntington’s disease (HD) is a fatal inherited neurodegenerative disease, caused by a CAG repeat expansion in HTT. Age at onset (AAO) has been used as a quantitative phenotype in genetic analysis looking for HD modifiers, but is hard to define and not always available....
[...]
...Huntington’s disease (HD) is a autosomal dominant fatal neurodegenerative condition caused by a CAG repeat expansion in HTT (1)....
[...]
24,024 citations
14,574 citations
8,758 citations
"Identification of genetic variants ..." refers background in this paper
...Loss of or variation in mismatch repair complexes can cause malignancy and thus they are not regarded as ideal drug targets, but MSH3 is not essential as it can tolerate loss of function variation (36) and could provide a therapeutic target in HD....
[...]
8,224 citations
"Identification of genetic variants ..." refers background in this paper
...org/cgi/ accessed October 2016 and January 2017 (37)....
[...]
Related Papers (5)
Frequently Asked Questions (11)
Q2. What is the effect size of Msh3 on the pathological phenotypes?
altering levels of Msh3 in HD mice reduces somatic instability and crossing Msh3 null mice with HD mouse models prevents somatic instability of the HTT CAG repeat and reduces pathological phenotypes.
Q3. What is the significant SNP in the meta-analysis?
The most significant SNP in the meta-analysis is rs1232027, which is genome-wide significant (p=1.12x10 -10 ), with the p-value of rs557874766 being 1.58x10 -8 .
Q4. What is the significance of the progression measure in TRACK-HD?
The progression measures used in this study can be generated in asymptomatic and symptomatic subjects using a subset of the clinically relevant parameters gathered in TRACK-HD.
Q5. What is the significance of the GWAS SNPs in the TRACK-HD gene?
Despite the lack of co-localisation between the TRACK GWAS and MSH3 expression signal, several of the most significant GWAS SNPs were associated with decreased MSH3 expression and slower progression (Supplementary Information).
Q6. What was the primary component used in the TRACK-HD cohort?
The cross-domain first principal component was used as a unified Huntington’s disease progression measure in the TRACK-HD cohort (Figure 1 and 2B).
Q7. What are the genetic modifiers of age at motor onset in Huntington’s disease?
Genetic modifiers of age at motor onset have recently been identified in HD that highlight pathways, which if modulated in people, might delay disease onset.
Q8. What is the evidence for MSH3 in Huntington’s disease?
A complete list of genes in the Pearl et al. (20) pathways is given in http://hdresearch.ucl.ac.uk/data-resources/.DISCUSSIONThe evidence from their study suggests that MSH3 is likely to be a modifier of disease progression in Huntington’s disease.
Q9. What could be the reason for the signal on chromosome 5?
the signal on chromosome 5 could be due to the coding change in MSH3, or to expression changes in MSH3, DHFR or both, and both effects may operate in disease.
Q10. What is the likely association between MSH3 and HD?
MSH3 is a neuronally expressed member of a family of DNA mismatch repair proteins (29); it forms a heteromeric complex with MSH2 to form MutSβ, which recognises insertion-deletion loops of up to 13 nucleotides (30) (Figure 4D).
Q11. What is the likely cause of the changes in CAG repeat size in HD?
Changes in CAG repeat size occur in terminally differentiated neurons in several HD mouse models and in human patient striatum, the brain area most affected in HD, and notably, somatic expansion of the CAG repeat in HD patient brain predicts onset (31).