scispace - formally typeset
Search or ask a question

Showing papers in "American Journal of Human Genetics in 2008"


Journal ArticleDOI
TL;DR: The results further implicate the SHANK3-NLGN4-NRXN1 postsynaptic density genes and also identify novel loci at DPP6-DPP10-PCDH9 (synapse complex), ANKRD11, DPYD, PTCHD1, 15q24, among others, for a role in ASD susceptibility.
Abstract: Structural variation (copy number variation [CNV] including deletion and duplication, translocation, inversion) of chromosomes has been identified in some individuals with autism spectrum disorder (ASD), but the full etiologic role is unknown. We performed genome-wide assessment for structural abnormalities in 427 unrelated ASD cases via single-nucleotide polymorphism microarrays and karyotyping. With microarrays, we discovered 277 unbalanced CNVs in 44% of ASD families not present in 500 controls (and re-examined in another 1152 controls). Karyotyping detected additional balanced changes. Although most variants were inherited, we found a total of 27 cases with de novo alterations, and in three (11%) of these individuals, two or more new variants were observed. De novo CNVs were found in ∼7% and ∼2% of idiopathic families having one child, or two or more ASD siblings, respectively. We also detected 13 loci with recurrent/overlapping CNV in unrelated cases, and at these sites, deletions and duplications affecting the same gene(s) in different individuals and sometimes in asymptomatic carriers were also found. Notwithstanding complexities, our results further implicate the SHANK3-NLGN4-NRXN1 postsynaptic density genes and also identify novel loci at DPP6-DPP10-PCDH9 (synapse complex), ANKRD11, DPYD, PTCHD1, 15q24, among others, for a role in ASD susceptibility. Our most compelling result discovered CNV at 16p11.2 (p = 0.002) (with characteristics of a genomic disorder) at ∼1% frequency. Some of the ASD regions were also common to mental retardation loci. Structural variants were found in sufficiently high frequency influencing ASD to suggest that cytogenetic and microarray analyses be considered in routine clinical workup.

1,716 citations


Journal ArticleDOI
TL;DR: It is shown that the collapsing method, which involves collapsing genotypes across variants and applying a univariate test, is powerful for analyzing rare variants, whereas multivariate analysis is robust against inclusion of noncausal variants.
Abstract: Although whole-genome association studies using tagSNPs are a powerful approach for detecting common variants, they are underpowered for detecting associations with rare variants. Recent studies have demonstrated that common diseases can be due to functional variants with a wide spectrum of allele frequencies, ranging from rare to common. An effective way to identify rare variants is through direct sequencing. The development of cost-effective sequencing technologies enables association studies to use sequence data from candidate genes and, in the future, from the entire genome. Although methods used for analysis of common variants are applicable to sequence data, their performance might not be optimal. In this study, it is shown that the collapsing method, which involves collapsing genotypes across variants and applying a univariate test, is powerful for analyzing rare variants, whereas multivariate analysis is robust against inclusion of noncausal variants. Both methods are superior to analyzing each variant individually with univariate tests. In order to unify the advantages of both collapsing and multiple-marker tests, we developed the Combined Multivariate and Collapsing (CMC) method and demonstrated that the CMC method is both powerful and robust. The CMC method can be applied to either candidate-gene or whole-genome sequence data.

1,500 citations


Journal ArticleDOI
TL;DR: This work presents a method for prioritization of candidate genes by use of a global network distance measure, random walk analysis, for definition of similarities in protein-protein interaction networks, significantly outperforming previous methods based on local distance measures.
Abstract: The identification of genes associated with hereditary disorders has contributed to improving medical care and to a better understanding of gene functions, interactions, and pathways. However, there are well over 1500 Mendelian disorders whose molecular basis remains unknown. At present, methods such as linkage analysis can identify the chromosomal region in which unknown disease genes are located, but the regions could contain up to hundreds of candidate genes. In this work, we present a method for prioritization of candidate genes by use of a global network distance measure, random walk analysis, for definition of similarities in protein-protein interaction networks. We tested our method on 110 disease-gene families with a total of 783 genes and achieved an area under the ROC curve of up to 98% on simulated linkage intervals of 100 genes surrounding the disease gene, significantly outperforming previous methods based on local distance measures. Our results not only provide an improved tool for positional-cloning projects but also add weight to the assumption that phenotypically similar diseases are associated with disturbances of subnetworks within the larger protein interactome that extend beyond the disease proteins themselves.

1,135 citations


Journal ArticleDOI
TL;DR: Using pedigrees to identify individuals with no shared maternal and paternal ancestors in five, and probably at least ten, generations, it is shown that ROHs measuring up to 4 Mb are common in demonstrably outbred individuals.
Abstract: Estimating individual genome-wide autozygosity is important both in the identification of recessive disease variants via homozygosity mapping and in the investigation of the effects of genome-wide homozygosity on traits of biomedical importance. Approaches have tended to involve either single-point estimates or rather complex multipoint methods of inferring individual autozygosity, all on the basis of limited marker data. Now, with the availability of high-density genome scans, a multipoint, observational method of estimating individual autozygosity is possible. Using data from a 300,000 SNP panel in 2618 individuals from two isolated and two more-cosmopolitan populations of European origin, we explore the potential of estimating individual autozygosity from data on runs of homozygosity (ROHs). Termed Froh, this is defined as the proportion of the autosomal genome in runs of homozygosity above a specified length. Mean Froh distinguishes clearly between subpopulations classified in terms of grandparental endogamy and population size. With the use of good pedigree data for one of the populations (Orkney), Froh was found to correlate strongly with the inbreeding coefficient estimated from pedigrees (r = 0.86). Using pedigrees to identify individuals with no shared maternal and paternal ancestors in five, and probably at least ten, generations, we show that ROHs measuring up to 4 Mb are common in demonstrably outbred individuals. Given the stochastic variation in ROH number, length, and location and the fact that ROHs are important whether ancient or recent in origin, approaches such as this will provide a more useful description of genomic autozygosity than has hitherto been possible.

934 citations


Journal ArticleDOI
TL;DR: A Human Phenotype Ontology with over 8000 terms representing individual phenotypic anomalies and all clinical entries in Online Mendelian Inheritance in Man with the terms of the HPO are annotated.
Abstract: There are many thousands of hereditary diseases in humans, each of which has a specific combination of phenotypic features, but computational analysis of phenotypic data has been hampered by lack of adequate computational data structures Therefore, we have developed a Human Phenotype Ontology (HPO) with over 8000 terms representing individual phenotypic anomalies and have annotated all clinical entries in Online Mendelian Inheritance in Man with the terms of the HPO We show that the HPO is able to capture phenotypic similarities between diseases in a useful and highly significant fashion

849 citations


Journal ArticleDOI
TL;DR: In this paper, a two-stage association study tested 2758 SNPs across a 10 Mb 7q35 language-related autism QTL in AGRE (Autism Genetic Resource Exchange) trios and found significant association with Contactin Associated Protein-Like 2 (CNTNAP2), a strong a priori candidate.
Abstract: Autism is a genetically complex neurodevelopmental syndrome in which language deficits are a core feature. We describe results from two complimentary approaches used to identify risk variants on chromosome 7 that likely contribute to the etiology of autism. A two-stage association study tested 2758 SNPs across a 10 Mb 7q35 language-related autism QTL in AGRE (Autism Genetic Resource Exchange) trios1,2 and found significant association with Contactin Associated Protein-Like 2 (CNTNAP2), a strong a priori candidate. Male-only containing families were identified as primarily responsible for this association signal, consistent with the strong male affection bias in ASD and other language-based disorders. Gene-expression analyses in developing human brain further identified CNTNAP2 as enriched in circuits important for language development. Together, these results provide convergent evidence for involvement of CNTNAP2, a Neurexin family member, in autism, and demonstrate a connection between genetic risk for autism and specific brain structures.

789 citations


Journal ArticleDOI
TL;DR: This study uses CpG-island microarrays to identify DNA-methylation changes in the frontal cortex and germline associated with schizophrenia and bipolar disorder and reports evidence for a strong correlation between DNA methylation in the MEK1 gene promoter region and lifetime antipsychotic use in schizophrenia patients.
Abstract: Epigenetic misregulation is consistent with various non-Mendelian features of schizophrenia and bipolar disorder. To date, however, few studies have investigated the role of DNA methylation in major psychosis, and none have taken a genome-wide epigenomic approach. In this study we used CpG-island microarrays to identify DNA-methylation changes in the frontal cortex and germline associated with schizophrenia and bipolar disorder. In the frontal cortex we find evidence for psychosis-associated DNA-methylation differences in numerous loci, including several involved in glutamatergic and GABAergic neurotransmission, brain development, and other processes functionally linked to disease etiology. DNA-methylation changes in a significant proportion of these loci correspond to reported changes of steady-state mRNA level associated with psychosis. Gene-ontology analysis highlighted epigenetic disruption to loci involved in mitochondrial function, brain development, and stress response. Methylome network analysis uncovered decreased epigenetic modularity in both the brain and the germline of affected individuals, suggesting that systemic epigenetic dysfunction may be associated with major psychosis. We also report evidence for a strong correlation between DNA methylation in the MEK1 gene promoter region and lifetime antipsychotic use in schizophrenia patients. Finally, we observe that frontal-cortex DNA methylation in the BDNF gene is correlated with genotype at a nearby nonsynonymous SNP that has been previously associated with major psychosis. Our data are consistent with the epigenetic theory of major psychosis and suggest that DNA-methylation changes are important to the etiology of schizophrenia and bipolar disorder.

762 citations


Journal ArticleDOI
TL;DR: A common polymorphism in contactin-associated protein-like 2 (CNTNAP2), a member of the neurexin superfamily, that is significantly associated with autism susceptibility is identified and displays a parent-of-origin and gender effect recapitulating the inheritance of autism.
Abstract: Autism is a childhood neuropsychiatric disorder that, despite exhibiting high heritability, has largely eluded efforts to identify specific genetic variants underlying its etiology. We performed a two-stage genetic study in which genome-wide linkage and family-based association mapping was followed up by association and replication studies in an independent sample. We identified a common polymorphism in contactin-associated protein-like 2 (CNTNAP2), a member of the neurexin superfamily, that is significantly associated with autism susceptibility. Importantly, the genetic variant displays a parent-of-origin and gender effect recapitulating the inheritance of autism.

578 citations


Journal ArticleDOI
TL;DR: A number of rare sequence variants in the coding region, including two missense changes in conserved residues of the alpha-neurexin 1 leader sequence and of an epidermal growth factor (EGF)-like domain, suggest that even subtle changes in NRXN1 might contribute to susceptibility to ASD.
Abstract: Autism is a neurodevelopmental disorder of complex etiology in which genetic factors play a major role. We have implicated the neurexin 1 (NRXN1) gene in two independent subjects who display an autism spectrum disorder (ASD) in association with a balanced chromosomal abnormality involving 2p16.3. In the first, with karyotype 46,XX,ins(16;2)(q22.1;p16.1p16.3)pat, NRXN1 is directly disrupted within intron 5. Importantly, the father possesses the same chromosomal abnormality in the absence of ASD, indicating that the interruption of α-NRXN1 is not fully penetrant and must interact with other factors to produce ASD. The breakpoint in the second subject, with 46,XY,t(1;2)(q31.3;p16.3)dn, occurs ∼750 kb 5′ to NRXN1 within a 2.6 Mb genomic segment that harbors no currently annotated genes. A scan of the NRXN1 coding sequence in a cohort of ASD subjects, relative to non-ASD controls, revealed that amino acid alterations in neurexin 1 are not present at high frequency in ASD. However, a number of rare sequence variants in the coding region, including two missense changes in conserved residues of the α-neurexin 1 leader sequence and of an epidermal growth factor (EGF)-like domain, respectively, suggest that even subtle changes in NRXN1 might contribute to susceptibility to ASD.

569 citations


Journal ArticleDOI
TL;DR: It is suggested that CNV analysis in phenotypically discordant monozygotic twins may provide a powerful tool for identifying disease-predisposition loci and that caution should be exercised when interpreting disease causality of de novo CNVs found in patients based on analysis of a single tissue in routine disease-related DNA diagnostics.
Abstract: The exploration of copy-number variation (CNV), notably of somatic cells, is an understudied aspect of genome biology. Any differences in the genetic makeup between twins derived from the same zygote represent an irrefutable example of somatic mosaicism. We studied 19 pairs of monozygotic twins with either concordant or discordant phenotype by using two platforms for genome-wide CNV analyses and showed that CNVs exist within pairs in both groups. These findings have an impact on our views of genotypic and phenotypic diversity in monozygotic twins and suggest that CNV analysis in phenotypically discordant monozygotic twins may provide a powerful tool for identifying disease-predisposition loci. Our results also imply that caution should be exercised when interpreting disease causality of de novo CNVs found in patients based on analysis of a single tissue in routine disease-related DNA diagnostics.

567 citations


Journal ArticleDOI
TL;DR: At least one in 200 healthy humans harbors a pathogenic mtDNA mutation that potentially causes disease in the offspring of female carriers, and the exclusive detection of m.14484T→C on haplogroup J implicates the background mtDNA haplotype in mutagenesis.
Abstract: Mitochondrial DNA (mtDNA) mutations are a major cause of genetic disease, but their prevalence in the general population is not known. We determined the frequency of ten mitochondrial point mutations in 3168 neonatal-cord-blood samples from sequential live births, analyzing matched maternal-blood samples to estimate the de novo mutation rate. mtDNA mutations were detected in 15 offspring (0.54%, 95% CI = 0.30-0.89%). Of these live births, 0.00107% (95% CI = 0.00087-0.0127) harbored a mutation not detected in the mother's blood, providing an estimate of the de novo mutation rate. The most common mutation was m.3243A-->G. m.14484T-->C was only found on sub-branches of mtDNA haplogroup J. In conclusion, at least one in 200 healthy humans harbors a pathogenic mtDNA mutation that potentially causes disease in the offspring of female carriers. The exclusive detection of m.14484T-->C on haplogroup J implicates the background mtDNA haplotype in mutagenesis. These findings emphasize the importance of developing new approaches to prevent transmission.

Journal ArticleDOI
TL;DR: The cytogenetic and mutation screening data suggest that rare variants may also contribute to the pathophysiology of ASD, but place limits on the magnitude of this contribution.
Abstract: demonstrate the presence of CNTNAP2 in the synaptic plasma membrane fraction of rat forebrain lysates. We comprehensively resequenced CNTNAP2 in 635 patients and 942 controls. Among patients, we identified a total of 27 nonsynonymous changes; 13 were rare and unique to patients and 8 of these were predicted to be deleterious by bioinformatic approaches and/or altered residues conserved across all species. One variant at a highly conserved position, I869T, was inherited by four affected children in three unrelated families, but was not found in 4010 control chromosomes (p ¼ 0.014). Overall, this resequencing data demonstrated a modest nonsignificant increase in the burden of rare variants in cases versus controls. Nonethless, when viewed in light of two independent studies published in this issue of AJHG showing a relationship between ASD and common CNTNAP2 alleles, 4,5 the cytogenetic and mutation screening data suggest that rare variants may also contribute to the pathophysiology of ASD, but place limits on the magnitude of this contribution. The clinical hallmarks of ASD (MIM 209850) are derangements in reciprocal social interaction, abnormal development of speech and language, and the presence of highly restricted interests and stereotyped behaviors. 6 Fundamental impairment in some but not all of these domains defines a spectrum of conditions that includes Asperger syndrome and Pervasive Developmental Disorder Not Otherwise Specified (PDD-NOS). In the DSM-IV, rare developmental disorders including Rett Syndrome and Childhood Disintegrative Disorder 6 are grouped in the same diagnostic category. A majority of patients with ASD have MR in addition to their social disability and up to one-third suffer from seizures. 6 Individuals with ASD also show an increased burden of chromosomal abnormalities 1 and de novo rare copy number variants. 7

Journal ArticleDOI
TL;DR: The results show that the high-resolution ASO-tiling approach can identify cis-elements that modulate splicing positively or negatively and highlight the therapeutic potential of some of these ASOs in the context of SMA.
Abstract: survival of motor neuron 2, centromeric (SMN2) is a gene that modifies the severity of spinal muscular atrophy (SMA), a motor-neuron disease that is the leading genetic cause of infant mortality. Increasing inclusion of SMN2 exon 7, which is predominantly skipped, holds promise to treat or possibly cure SMA; one practical strategy is the disruption of splicing silencers that impair exon 7 recognition. By using an antisense oligonucleotide (ASO)-tiling method, we systematically screened the proximal intronic regions flanking exon 7 and identified two intronic splicing silencers (ISSs): one in intron 6 and a recently described one in intron 7. We analyzed the intron 7 ISS by mutagenesis, coupled with splicing assays, RNA-affinity chromatography, and protein overexpression, and found two tandem hnRNP A1/A2 motifs within the ISS that are responsible for its inhibitory character. Mutations in these two motifs, or ASOs that block them, promote very efficient exon 7 inclusion. We screened 31 ASOs in this region and selected two optimal ones to test in human SMN2 transgenic mice. Both ASOs strongly increased hSMN2 exon 7 inclusion in the liver and kidney of the transgenic animals. Our results show that the high-resolution ASO-tiling approach can identify cis-elements that modulate splicing positively or negatively. Most importantly, our results highlight the therapeutic potential of some of these ASOs in the context of SMA.

Journal ArticleDOI
TL;DR: This study provides compelling evidence for the existence of at least one previously undescribed AD gene that, like APOE-epsilon4, primarily acts as a modifier of onset age.
Abstract: Alzheimer's disease (AD) is a genetically complex and heterogeneous disorder. To date four genes have been established to either cause early-onset autosomal-dominant AD (APP, PSEN1, and PSEN21–4) or to increase susceptibility for late-onset AD (APOE5). However, the heritability of late-onset AD is as high as 80%,6 and much of the phenotypic variance remains unexplained to date. We performed a genome-wide association (GWA) analysis using 484,522 single-nucleotide polymorphisms (SNPs) on a large (1,376 samples from 410 families) sample of AD families of self-reported European descent. We identified five SNPs showing either significant or marginally significant genome-wide association with a multivariate phenotype combining affection status and onset age. One of these signals (p = 5.7 × 10−14) was elicited by SNP rs4420638 and probably reflects APOE-ɛ4, which maps 11 kb proximal (r2 = 0.78). The other four signals were tested in three additional independent AD family samples composed of nearly 2700 individuals from almost 900 families. Two of these SNPs showed significant association in the replication samples (combined p values 0.007 and 0.00002). The SNP (rs11159647, on chromosome 14q31) with the strongest association signal also showed evidence of association with the same allele in GWA data generated in an independent sample of ∼1,400 AD cases and controls (p = 0.04). Although the precise identity of the underlying locus(i) remains elusive, our study provides compelling evidence for the existence of at least one previously undescribed AD gene that, like APOE-ɛ4, primarily acts as a modifier of onset age.

Journal ArticleDOI
TL;DR: It is concluded that ARVC at locus ARVD5 is a lethal, fully penetrant, sex-influenced morbid disorder and contains a response element for PPAR gamma, which may explain the fibrofatty replacement of the myocardium, a characteristic pathological finding in ARVC.
Abstract: Autosomal-dominant arrhythmogenic right ventricular cardiomyopathy/dysplasia (ARVC/D) causes sudden cardiac death and is characterized by clinical and genetic heterogeneity. Fifteen unrelated ARVC families with a disease-associated haplotype on chromosome 3p (ARVD5) were ascertained from a genetically isolated population. Identification of key recombination events reduced the disease region to a 2.36 Mb interval containing 20 annotated genes. Bidirectional resequencing showed one rare variant in transmembrane protein 43 (TMEM43 1073C→T, S358L), was carried on all recombinant ARVD5 ancestral haplotypes from affected subjects and not found in population controls. The mutation occurs in a highly conserved transmembrane domain of TMEM43 and is predicted to be deleterious. Clinical outcomes in 257 affected and 151 unaffected subjects were compared, and penetrance was determined. We concluded that ARVC at locus ARVD5 is a lethal, fully penetrant, sex-influenced morbid disorder. Median life expectancy was 41 years in affected males compared to 71 years in affected females (relative risk 6.8, 95% CI 1.3–10.9). Heart failure was a late manifestation in survivors. Although little is known about the function of the TMEM43 gene, it contains a response element for PPARγ (an adipogenic transcription factor), which may explain the fibrofatty replacement of the myocardium, a characteristic pathological finding in ARVC.

Journal ArticleDOI
TL;DR: A genome-wide association analysis for commonly measured serum and urine biochemical traits found association between serum urate and SLC2A9, a glucose transporter, and provides a potential biological mechanism for the recent association of this same allele of the same SNP with increased risk of coronary disease.
Abstract: Many common diseases are accompanied by disturbances in biochemical traits. Identifying the genetic determinants could provide novel insights into disease mechanisms and reveal avenues for developing new therapies. Here, we report a genome-wide association analysis for commonly measured serum and urine biochemical traits. As part of the WTCCC, 500,000 SNPs genome wide were genotyped in 1955 hypertensive individuals characterized for 25 serum and urine biochemical traits. For each trait, we assessed association with individual SNPs, adjusting for age, sex, and BMI. Lipid measurements were further examined in a meta-analysis of genome-wide data from a type 2 diabetes scan. The most promising associations were examined in two epidemiological cohorts. We discovered association between serum urate and SLC2A9, a glucose transporter (p = 2 × 10−15) and confirmed this in two independent cohorts, GRAPHIC study (p = 9 × 10−15) and TwinsUK (p = 8 × 10−19). The odds ratio for hyperuricaemia (defined as urate >0.4 mMol/l) is 1.89 (95% CI = 1.36–2.61) per copy of common allele. We also replicated many genes previously associated with serum lipids and found previously recognized association between LDL levels and SNPs close to genes encoding PSRC1 and CELSR2 (p = 1 × 10−7). The common allele was associated with a 6% increase in nonfasting serum LDL. This region showed increased association in the meta-analysis (p = 4 × 10−14). This finding provides a potential biological mechanism for the recent association of this same allele of the same SNP with increased risk of coronary disease.

Journal ArticleDOI
TL;DR: A novel mechanism of action is suggested for PD risk in which the modulation of the susceptibility gene's translation by common variations interfere with the regulation mechanisms of microRNA.
Abstract: Parkinson disease (PD) is a common neurodegenerative disorder caused by environmental and genetic factors. We have previously shown linkage of PD to chromosome 8p. Subsequently, fibroblast growth factor 20 (FGF20) at 8p21.3–22 was identified as a risk factor in several association studies. To identify the risk-conferring polymorphism in FGF20, we performed genetic and functional analysis of single-nucleotide polymorphisms within the gene. In a sample of 729 nuclear families with 1089 affected and 1165 unaffected individuals, the strongest evidence of association came from rs12720208 in the 3′ untranslated region of FGF20. We show in several functional assays that the risk allele for rs12720208 disrupts a binding site for microRNA-433, increasing translation of FGF20 in vitro and in vivo. In a cell-based system and in PD brains, this increase in translation of FGF20 is correlated with increased α-synuclein expression, which has previously been shown to cause PD through both overexpression and point mutations. We suggest a novel mechanism of action for PD risk in which the modulation of the susceptibility gene's translation by common variations interfere with the regulation mechanisms of microRNA. We propose this is likely to be a common mechanism of genetic modulation of individual susceptibility to complex disease.

Journal ArticleDOI
TL;DR: The tree phylogeny and coalescence calculations suggest that Khoisan matrilineal ancestry diverged from the rest of the human mtDNA pool 90,000-150,000 years before present and that at least five additional, currently extant maternal lineages existed during this period in parallel.
Abstract: The quest to explain demographic history during the early part of human evolution has been limited because of the scarce paleoanthropological record from the Middle Stone Age. To shed light on the structure of the mitochondrial DNA (mtDNA) phylogeny at the dawn of Homo sapiens, we constructed a matrilineal tree composed of 624 complete mtDNA genomes from sub-Saharan Hg L lineages. We paid particular attention to the Khoi and San (Khoisan) people of South Africa because they are considered to be a unique relic of hunter-gatherer lifestyle and to carry paternal and maternal lineages belonging to the deepest clades known among modern humans. Both the tree phylogeny and coalescence calculations suggest that Khoisan matrilineal ancestry diverged from the rest of the human mtDNA pool 90,000–150,000 years before present (ybp) and that at least five additional, currently extant maternal lineages existed during this period in parallel. Furthermore, we estimate that a minimum of 40 other evolutionarily successful lineages flourished in sub-Saharan Africa during the period of modern human dispersal out of Africa approximately 60,000–70,000 ybp. Only much later, at the beginning of the Late Stone Age, about 40,000 ybp, did introgression of additional lineages occur into the Khoisan mtDNA pool. This process was further accelerated during the recent Bantu expansions. Our results suggest that the early settlement of humans in Africa was already matrilineally structured and involved small, separately evolving isolated populations.

Journal ArticleDOI
TL;DR: In this article, the authors performed a genome-wide association study of plasma liver-enzyme levels in three populations (total n = 7715) with replication in three additional cohorts (totaln = 4704).
Abstract: Plasma liver-enzyme tests are widely used in the clinic for the diagnosis of liver diseases and for monitoring the response to drug treatment. There is considerable evidence that human genetic variation influences plasma levels of liver enzymes. However, such genetic variation has not been systematically assessed. In the present study, we performed a genome-wide association study of plasma liver-enzyme levels in three populations (total n = 7715) with replication in three additional cohorts (total n = 4704). We identified two loci influencing plasma levels of alanine-aminotransferase (ALT) (CPN1-ERLIN1-CHUK on chromosome 10 and PNPLA3-SAMM50 on chromosome 22), one locus influencing gamma-glutamyl transferase (GGT) levels (HNF1A on chromosome 12), and three loci for alkaline phosphatase (ALP) levels (ALPL on chromosome 1, GPLD1 on chromosome 6, and JMJD1C-REEP3 on chromosome 10). In addition, we confirmed the associations between the GGT1 locus and GGT levels and between the ABO locus and ALP levels. None of the ALP-associated SNPs were associated with other liver tests, suggesting intestine and/or bone specificity. The mechanisms underlying the associations may involve cis- or trans-transcriptional effects (some of the identified variants were associated with mRNA transcription in human liver or lymphoblastoid cells), dysfunction of the encoded proteins (caused by missense variations at the functional domains), or other unknown pathways. These findings may help in the interpretation of liver-enzyme tests and provide candidate genes for liver diseases of viral, metabolic, autoimmune, or toxic origin. The specific associations with ALP levels may point to genes for bone or intestinal diseases.

Journal ArticleDOI
TL;DR: This research was supported in part by grant M01-RR00425 to the Cedars-Sinai GCRC genotyping core, and by NIH grant DK62413 (K.D.I.T).
Abstract: A.L.P. is supported by a Ruth Kirschstein National Research Service Award from the NIH. N.P. is supported by a K-01 career development award from the NIH. D.R. is supported by a Burroughs Wellcome Career Development Award in the Biomedical Sciences. This research was also supported by U-01 award HG004168 from the NIH (D.R. and N.P.), by NIDDK grant PO1DK46763 (J.I.R.), and by the Board of Governor's Chair in Medical Genetics at Cedars-Sinai Medical Center (J.I.R.). Genotyping of the Puerto Rican samples was supported in part by grant M01-RR00425 to the Cedars-Sinai GCRC genotyping core (K.D.T.) and by NIH grant DK62413 (K.D.T).

Journal ArticleDOI
TL;DR: Molecular analysis revealed that Foxg1 might also share common molecular mechanisms with MeCP2 during neuronal development, exhibiting partially overlapping expression domain in postnatal cortex and neuronal subnuclear localization.
Abstract: Rett syndrome is a severe neurodevelopmental disease caused by mutations in the X-linked gene encoding for the methyl-CpG-binding protein MeCP2. Here, we report the identification of FOXG1-truncating mutations in two patients affected by the congenital variant of Rett syndrome. FOXG1 encodes a brain-specific transcriptional repressor that is essential for early development of the telencephalon. Molecular analysis revealed that Foxg1 might also share common molecular mechanisms with MeCP2 during neuronal development, exhibiting partially overlapping expression domain in postnatal cortex and neuronal subnuclear localization.

Journal ArticleDOI
TL;DR: It is demonstrated that a fifth gene, TINF2, is mutated in classical DC and, for the first time, in Revesz syndrome, the first shelterin complex mutation linked to human disease and the role of very short telomeres as a diagnostic test for DC is confirmed.
Abstract: Patients with dyskeratosis congenita (DC), a heterogeneous inherited bone marrow failure syndrome, have abnormalities in telomere biology, including very short telomeres and germline mutations in DKC1, TERC, TERT, or NOP10, but ∼60% of DC patients lack an identifiable mutation. With the very short telomere phenotype and a highly penetrant, rare disease model, a linkage scan was performed on a family with autosomal-dominant DC and no mutations in DKCI, TERC, or TERT. Evidence favoring linkage was found at 2p24 and 14q11.2, and this led to the identification of TINF2 (14q11.2) mutations, K280E, in the proband and her five affected relatives and TINF2 R282H in three additional unrelated DC probands, including one with Revesz syndrome; a fifth DC proband had a R282S mutation. TINF2 mutations were not present in unaffected relatives, DC probands with mutations in DKC1, TERC, or TERT or 298 control subjects. We demonstrate that a fifth gene, TINF2, is mutated in classical DC and, for the first time, in Revesz syndrome. This represents the first shelterin complex mutation linked to human disease and confirms the role of very short telomeres as a diagnostic test for DC.

Journal ArticleDOI
TL;DR: The total genomic content of currently known common human CNVs is likely smaller than previously thought, and approximately 8% of the CNV regions observed in multiple individuals exhibited genomic architectural complexity in the form of smaller CNVs within larger ones and CNVs with interindividual variation in breakpoints.
Abstract: Despite considerable excitement over the potential functional significance of copy-number variants (CNVs), we still lack knowledge of the fine-scale architecture of the large majority of CNV regions in the human genome. In this study, we used a high-resolution array-based comparative genomic hybridization (aCGH) platform that targeted known CNV regions of the human genome at approximately 1 kb resolution to interrogate the genomic DNAs of 30 individuals from four HapMap populations. Our results revealed that 1020 of 1153 CNV loci (88%) were actually smaller in size than what is recorded in the Database of Genomic Variants based on previously published studies. A reduction in size of more than 50% was observed for 876 CNV regions (76%). We conclude that the total genomic content of currently known common human CNVs is likely smaller than previously thought. In addition, approximately 8% of the CNV regions observed in multiple individuals exhibited genomic architectural complexity in the form of smaller CNVs within larger ones and CNVs with interindividual variation in breakpoints. Future association studies that aim to capture the potential influences of CNVs on disease phenotypes will need to consider how to best ascertain this previously uncharacterized complexity.

Journal ArticleDOI
TL;DR: It is concluded that the conserved region around rs12913832 represents a regulatory region controlling constitutive expression of OCA2 and that the C allele at rs129 13832 leads to decreased expression ofOCA2, particularly within iris melanocytes, which is postulate to be the ultimate cause of blue eye color.
Abstract: We have previously demonstrated that haplotypes of three single nucleotide polymorphisms (SNPs) within the first intron of the OCA2 gene are extremely strongly associated with variation in human eye color. In the present work, we describe additional fine association mapping of eye color SNPs in the intergenic region upstream of OCA2 and within the neighboring HERC2 (hect domain and RLD2) gene. We screened an additional 92 SNPs in 300–3000 European individuals and found that a single SNP in intron 86 of HERC2, rs12913832, predicted eye color significantly better (ordinal logistic regression R2 = 0.68, association LOD = 444) than our previous best OCA2 haplotype. Comparison of sequence alignments of multiple species showed that this SNP lies in the center of a short highly conserved sequence and that the blue-eye-associated allele (frequency 78%) breaks up this conserved sequence, part of which forms a consensus binding site for the helicase-like transcription factor (HLTF). We were also able to demonstrate the OCA2 R419Q, rs1800407, coding SNP acts as a penetrance modifier of this new HERC2 SNP for eye color, and somewhat independently, of melanoma risk. We conclude that the conserved region around rs12913832 represents a regulatory region controlling constitutive expression of OCA2 and that the C allele at rs12913832 leads to decreased expression of OCA2, particularly within iris melanocytes, which we postulate to be the ultimate cause of blue eye color.

Journal ArticleDOI
TL;DR: Recent research has focused on the role of neocentromeres in evolution and speciation, as well as in disease development and the understanding of the organization and epigenetic maintenance of the centromere.
Abstract: Since the discovery of the first human neocentromere in 1993, these spontaneous, ectopic centromeres have been shown to be an astonishing example of epigenetic change within the genome. Recent research has focused on the role of neocentromeres in evolution and speciation, as well as in disease development and the understanding of the organization and epigenetic maintenance of the centromere. Here, we review recent progress in these areas of research and the significant insights gained.

Journal ArticleDOI
TL;DR: Overexpression of human wild-type but not patient mutant ARL13B rescued the Arl13b scorpion zebrafish mutant, indicating an evolutionarily conserved role mediating cilia function in multiple organs.
Abstract: Joubert syndrome (JS) and related disorders are a group of autosomal-recessive conditions sharing the “molar tooth sign” on axial brain MRI, together with cerebellar vermis hypoplasia, ataxia, and psychomotor delay. JS is suggested to be a disorder of cilia function and is part of a spectrum of disorders involving retinal, renal, digital, oral, hepatic, and cerebral organs. We identified mutations in ARL13B in two families with the classical form of JS. ARL13B belongs to the Ras GTPase family, and in other species is required for ciliogenesis, body axis formation, and renal function. The encoded Arl13b protein was expressed in developing murine cerebellum and localized to the cilia in primary neurons. Overexpression of human wild-type but not patient mutant ARL13B rescued the Arl13b scorpion zebrafish mutant. Thus, ARL13B has an evolutionarily conserved role mediating cilia function in multiple organs.

Journal ArticleDOI
TL;DR: It is demonstrated that mutations of RPL5, RPL11, or RPS7 in DBA cells is associated with diverse defects in the maturation of ribosomal RNAs in the large or the small ribosome subunit production pathway, expanding the repertoire of ribOSomal RNA processing defects associated with DBA.
Abstract: Diamond-Blackfan anemia (DBA), a congenital bone-marrow-failure syndrome, is characterized by red blood cell aplasia, macrocytic anemia, clinical heterogeneity, and increased risk of malignancy. Although anemia is the most prominent feature of DBA, the disease is also characterized by growth retardation and congenital anomalies that are present in ∼30%–50% of patients. The disease has been associated with mutations in four ribosomal protein (RP) genes, RPS19, RPS24, RPS17, and RPL35A, in about 30% of patients. However, the genetic basis of the remaining 70% of cases is still unknown. Here, we report the second known mutation in RPS17 and probable pathogenic mutations in three more RP genes, RPL5, RPL11, and RPS7. In addition, we identified rare variants of unknown significance in three other genes, RPL36, RPS15, and RPS27A. Remarkably, careful review of the clinical data showed that mutations in RPL5 are associated with multiple physical abnormalities, including craniofacial, thumb, and heart anomalies, whereas isolated thumb malformations are predominantly present in patients carrying mutations in RPL11. We also demonstrate that mutations of RPL5, RPL11, or RPS7 in DBA cells is associated with diverse defects in the maturation of ribosomal RNAs in the large or the small ribosomal subunit production pathway, expanding the repertoire of ribosomal RNA processing defects associated with DBA.

Journal ArticleDOI
TL;DR: It is suggested that slightly deleterious SNPs (sdSNPs) subjected to weak purifying selection are major players in genetic control of susceptibility to common diseases and including rareSNPs in genotyping platforms will advance identification of causal SNPs in case-control association studies, particularly as sample sizes increase.
Abstract: Currently, single-nucleotide polymorphisms (SNPs) with minor allele frequency (MAF) of >5% are preferentially used in case-control association studies of common human diseases. Recent technological developments enable inexpensive and accurate genotyping of a large number of SNPs in thousands of cases and controls, which can provide adequate statistical power to analyze SNPs with MAF <5%. Our purpose was to determine whether evaluating rare SNPs in case-control association studies could help identify causal SNPs for common diseases. We suggest that slightly deleterious SNPs (sdSNPs) subjected to weak purifying selection are major players in genetic control of susceptibility to common diseases. We compared the distribution of MAFs of synonymous SNPs with that of nonsynonymous SNPs (1) predicted to be benign, (2) predicted to be possibly damaging, and (3) predicted to be probably damaging by PolyPhen. Our sources of data were the International HapMap Project, ENCODE, and the SeattleSNPs project. We found that the MAF distribution of possibly and probably damaging SNPs was shifted toward rare SNPs compared with the MAF distribution of benign and synonymous SNPs that are not likely to be functional. We also found an inverse relationship between MAF and the proportion of nsSNPs predicted to be protein disturbing. On the basis of this relationship, we estimated the joint probability that a SNP is functional and would be detected as significant in a case-control study. Our analysis suggests that including rare SNPs in genotyping platforms will advance identification of causal SNPs in case-control association studies, particularly as sample sizes increase.

Journal ArticleDOI
TL;DR: A new method, LAMP (Local Ancestry in adMixed Populations), which infers the ancestry of each individual at every single-nucleotide polymorphism (SNP), which is significantly more accurate and more efficient than existing methods for inferring locus-specific ancestries.
Abstract: Large-scale genotyping of SNPs has shown a great promise in identifying markers that could be linked to diseases. One of the major obstacles involved in performing these studies is that the underlying population substructure could produce spurious associations. Population substructure can be caused by the presence of two distinct subpopulations or a single pool of admixed individuals. In this work, we focus on the latter, which is significantly harder to detect in practice. New advances in this research direction are expected to play a key role in identifying loci that are different among different populations and are still associated with a disease. We evaluated current methods for inference of population substructure in such cases and show that they might be quite inaccurate even in relatively simple scenarios. We therefore introduce a new method, LAMP (Local Ancestry in adMixed Populations), which infers the ancestry of each individual at every single-nucleotide polymorphism (SNP). LAMP computes the ancestry structure for overlapping windows of contiguous SNPs and combines the results with a majority vote. Our empirical results show that LAMP is significantly more accurate and more efficient than existing methods for inferrring locus-specific ancestries, enabling it to handle large-scale datasets. We further show that LAMP can be used to estimate the individual admixture of each individual. Our experimental evaluation indicates that this extension yields a considerably more accurate estimate of individual admixture than state-of-the-art methods such as STRUCTURE or EIGENSTRAT, which are frequently used for the correction of population stratification in association studies.

Journal ArticleDOI
TL;DR: Japanese population structure was examined by "Eigenanalysis," using the genotypes for 140,387 SNPs in 7003 Japanese individuals, along with 60 European, 60 African, and 90 East-Asian individuals, in the HapMap project, and genetic differentiation was observed.
Abstract: Because population stratification can cause spurious associations in case-control studies, understanding the population structure is important. Here, we examined Japanese population structure by “Eigenanalysis,” using the genotypes for 140,387 SNPs in 7003 Japanese individuals, along with 60 European, 60 African, and 90 East-Asian individuals, in the HapMap project. Most Japanese individuals fell into two main clusters, Hondo and Ryukyu; the Hondo cluster includes most of the individuals from the main islands in Japan, and the Ryukyu cluster includes most of the individuals from Okinawa. The SNPs with the greatest frequency differences between the Hondo and Ryukyu clusters were found in the HLA region in chromosome 6. The nonsynonymous SNPs with the greatest frequency differences between the Hondo and Ryukyu clusters were the Val/Ala polymorphism (rs3827760) in the EDAR gene, associated with hair thickness, and the Gly/Ala polymorphism (rs17822931) in the ABCC11 gene, associated with ear-wax type. Genetic differentiation was observed, even among different regions in Honshu Island, the largest island of Japan. Simulation studies showed that the inclusion of different proportions of individuals from different regions of Japan in case and control groups can lead to an inflated rate of false-positive results when the sample sizes are large.