scispace - formally typeset
Search or ask a question

Showing papers in "Genetics in 2015"


Journal ArticleDOI
01 Sep 2015-Genetics
TL;DR: This work reports that direct injection of in vitro–assembled Cas9-CRISPR RNA (crRNA) trans-activating crRNA (tracrRNA) ribonucleoprotein complexes into the gonad of Caenorhabditis elegans yields HDR edits at a high frequency.
Abstract: Homology-directed repair (HDR) of breaks induced by the RNA-programmed nuclease Cas9 has become a popular method for genome editing in several organisms. Most HDR protocols rely on plasmid-based expression of Cas9 and the gene-specific guide RNAs. Here we report that direct injection of in vitro–assembled Cas9-CRISPR RNA (crRNA) trans-activating crRNA (tracrRNA) ribonucleoprotein complexes into the gonad of Caenorhabditis elegans yields HDR edits at a high frequency. Building on our earlier finding that PCR fragments with 35-base homology are efficient repair templates, we developed an entirely cloning-free protocol for the generation of seamless HDR edits without selection. Combined with the co-CRISPR method, this protocol is sufficiently robust for use with low-efficiency guide RNAs and to generate complex edits, including ORF replacement and simultaneous tagging of two genes with fluorescent proteins.

550 citations


Journal ArticleDOI
01 Aug 2015-Genetics
TL;DR: A new selection strategy for producing fluorescent protein (FP) knock-ins using CRISPR/Cas9-triggered homologous recombination using a newly developed self-excising cassette for drug selection is described.
Abstract: A central goal in the development of genome engineering technology is to reduce the time and labor required to produce custom genome modifications. Here we describe a new selection strategy for producing fluorescent protein (FP) knock-ins using CRISPR/Cas9-triggered homologous recombination. We have tested our approach in Caenorhabditis elegans. This approach has been designed to minimize hands-on labor at each step of the procedure. Central to our strategy is a newly developed self-excising cassette (SEC) for drug selection. SEC consists of three parts: a drug-resistance gene, a visible phenotypic marker, and an inducible Cre recombinase. SEC is flanked by LoxP sites and placed within a synthetic intron of a fluorescent protein tag, resulting in an FP-SEC module that can be inserted into any C. elegans gene. Upon heat shock, SEC excises itself from the genome, leaving no exogenous sequences outside the fluorescent protein tag. With our approach, one can generate knock-in alleles in any genetic background, with no PCR screening required and without the need for a second injection step to remove the selectable marker. Moreover, this strategy makes it possible to produce a fluorescent protein fusion, a transcriptional reporter and a strong loss-of-function allele for any gene of interest in a single injection step.

473 citations


Journal ArticleDOI
01 Jan 2015-Genetics
TL;DR: The various tools developed and the status of the TRiP collection, which is currently composed of 11,491 lines and covering 71% of Drosophila genes, are described.
Abstract: To facilitate large-scale functional studies in Drosophila, the Drosophila Transgenic RNAi Project (TRiP) at Harvard Medical School (HMS) was established along with several goals: developing efficient vectors for RNAi that work in all tissues, generating a genome-scale collection of RNAi stocks with input from the community, distributing the lines as they are generated through existing stock centers, validating as many lines as possible using RT-qPCR and phenotypic analyses, and developing tools and web resources for identifying RNAi lines and retrieving existing information on their quality. With these goals in mind, here we describe in detail the various tools we developed and the status of the collection, which is currently composed of 11,491 lines and covering 71% of Drosophila genes. Data on the characterization of the lines either by RT-qPCR or phenotype is available on a dedicated website, the RNAi Stock Validation and Phenotypes Project (RSVP, http://www.flyrnai.org/RSVP.html), and stocks are available from three stock centers, the Bloomington Drosophila Stock Center (United States), National Institute of Genetics (Japan), and TsingHua Fly Center (China).

469 citations


Journal ArticleDOI
01 Jun 2015-Genetics
TL;DR: The organism and the many features that make it an outstanding experimental system, including its small size, rapid life cycle, transparency, and well-annotated genome are introduced.
Abstract: A little over 50 years ago, Sydney Brenner had the foresight to develop the nematode (round worm) Caenorhabditis elegans as a genetic model for understanding questions of developmental biology and neurobiology. Over time, research on C. elegans has expanded to explore a wealth of diverse areas in modern biology including studies of the basic functions and interactions of eukaryotic cells, host-parasite interactions, and evolution. C. elegans has also become an important organism in which to study processes that go awry in human diseases. This primer introduces the organism and the many features that make it an outstanding experimental system, including its small size, rapid life cycle, transparency, and well-annotated genome. We survey the basic anatomical features, common technical approaches, and important discoveries in C. elegans research. Key to studying C. elegans has been the ability to address biological problems genetically, using both forward and reverse genetics, both at the level of the entire organism and at the level of the single, identified cell. These possibilities make C. elegans useful not only in research laboratories, but also in the classroom where it can be used to excite students who actually can see what is happening inside live cells and tissues.

405 citations


Journal ArticleDOI
01 Apr 2015-Genetics
TL;DR: The history behind the multitude of definitions that have been employed since the conception of epigenetics are discussed, the components of these definitions are analyzed, and solutions for clarifying the field and mitigating the problems that have arisen due to these definitional ambiguities are offered.
Abstract: Interest in the field of epigenetics has increased rapidly over the last decade, with the term becoming more identifiable in biomedical research, scientific fields outside of the molecular sciences, such as ecology and physiology, and even mainstream culture. It has become increasingly clear, however, that different investigators ascribe different definitions to the term. Some employ epigenetics to explain changes in gene expression, others use it to refer to transgenerational effects and/or inherited expression states. This disagreement on a clear definition has made communication difficult, synthesis of epigenetic research across fields nearly impossible, and has in many ways biased methodologies and interpretations. This article discusses the history behind the multitude of definitions that have been employed since the conception of epigenetics, analyzes the components of these definitions, and offers solutions for clarifying the field and mitigating the problems that have arisen due to these definitional ambiguities.

370 citations


Journal ArticleDOI
01 Dec 2015-Genetics
TL;DR: This study investigates several modeling extensions to improve the estimation accuracy of the population covariance matrix and all the related measures and defines a robust Bayesian framework to characterize adaptive genetic differentiation across populations.
Abstract: In population genomics studies, accounting for the neutral covariance structure across population allele frequencies is critical to improve the robustness of genome-wide scan approaches. Elaborating on the BayEnv model, this study investigates several modeling extensions (i) to improve the estimation accuracy of the population covariance matrix and all the related measures, (ii) to identify significantly overly differentiated SNPs based on a calibration procedure of the XtX statistics, and (iii) to consider alternative covariate models for analyses of association with population-specific covariables. In particular, the auxiliary variable model allows one to deal with multiple testing issues and, providing the relative marker positions are available, to capture some linkage disequilibrium information. A comprehensive simulation study was carried out to evaluate the performances of these different models. Also, when compared in terms of power, robustness, and computational efficiency to five other state-of-the-art genome-scan methods (BayEnv2, BayScEnv, BayScan, flk, and lfmm), the proposed approaches proved highly effective. For illustration purposes, genotyping data on 18 French cattle breeds were analyzed, leading to the identification of 13 strong signatures of selection. Among these, four (surrounding the KITLG, KIT, EDN3, and ALB genes) contained SNPs strongly associated with the piebald coloration pattern while a fifth (surrounding PLAG1) could be associated to morphological differences across the populations. Finally, analysis of Pool-Seq data from 12 populations of Littorina saxatilis living in two different ecotypes illustrates how the proposed framework might help in addressing relevant ecological issues in nonmodel species. Overall, the proposed methods define a robust Bayesian framework to characterize adaptive genetic differentiation across populations. The BayPass program implementing the different models is available at http://www1.montpellier.inra.fr/CBGP/software/baypass/.

336 citations


Journal ArticleDOI
01 Jan 2015-Genetics
TL;DR: A practical guide to use the CRISPR/Cas9 system for mouse mutagenesis and technical improvements to increase efficiency of RNA-guided genome editing in mouse embryos are provided and practical problems such as mosaicism in founders, which complicates genotyping and phenotyping are addressed.
Abstract: CRISPR/Cas9 system of RNA-guided genome editing is revolutionizing genetics research in a wide spectrum of organisms. Even for the laboratory mouse, a model that has thrived under the benefits of embryonic stem (ES) cell knockout capabilities for nearly three decades, CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats)/Cas9 technology enables one to manipulate the genome with unprecedented simplicity and speed. It allows generation of null, conditional, precisely mutated, reporter, or tagged alleles in mice. Moreover, it holds promise for other applications beyond genome editing. The crux of this system is the efficient and targeted introduction of DNA breaks that are repaired by any of several pathways in a predictable but not entirely controllable manner. Thus, further optimizations and improvements are being developed. Here, we summarize current applications and provide a practical guide to use the CRISPR/Cas9 system for mouse mutagenesis, based on published reports and our own experiences. We discuss critical points and suggest technical improvements to increase efficiency of RNA-guided genome editing in mouse embryos and address practical problems such as mosaicism in founders, which complicates genotyping and phenotyping. We describe a next-generation sequencing strategy for simultaneous characterization of on- and off-target editing in mice derived from multiple CRISPR experiments. Additionally, we report evidence that elevated frequency of precise, homology-directed editing can be achieved by transient inhibition of the Ligase IV-dependent nonhomologous end-joining pathway in one-celled mouse embryos.

313 citations


Journal ArticleDOI
01 Jun 2015-Genetics
TL;DR: This work uses massively parallel assays to measure the effects of nearly 2000 missense substitutions in the RING domain of BRCA1 on its E3 ubiquitin ligase activity and its binding to the BARD1 Ringing domain, and generates a model that outperforms widely used biological-effect prediction algorithms.
Abstract: Interpreting variants of uncertain significance (VUS) is a central challenge in medical genetics. One approach is to experimentally measure the functional consequences of VUS, but to date this approach has been post hoc and low throughput. Here we use massively parallel assays to measure the effects of nearly 2000 missense substitutions in the RING domain of BRCA1 on its E3 ubiquitin ligase activity and its binding to the BARD1 RING domain. From the resulting scores, we generate a model to predict the capacities of full-length BRCA1 variants to support homology-directed DNA repair, the essential role of BRCA1 in tumor suppression, and show that it outperforms widely used biological-effect prediction algorithms. We envision that massively parallel functional assays may facilitate the prospective interpretation of variants observed in clinical sequencing.

283 citations


Journal ArticleDOI
01 Apr 2015-Genetics
TL;DR: An assembly strategy is settled on that utilizes two alignment programs and incorporates both substitutions and short indels to construct an updated reference for a second round of mapping prior to final variant detection, which will greatly facilitate population genomic analysis in this model species by reducing the methodological differences between data sets.
Abstract: Hundreds of wild-derived Drosophila melanogaster genomes have been published, but rigorous comparisons across data sets are precluded by differences in alignment methodology. The most common approach to reference-based genome assembly is a single round of alignment followed by quality filtering and variant detection. We evaluated variations and extensions of this approach and settled on an assembly strategy that utilizes two alignment programs and incorporates both substitutions and short indels to construct an updated reference for a second round of mapping prior to final variant detection. Utilizing this approach, we reassembled published D. melanogaster population genomic data sets and added unpublished genomes from several sub-Saharan populations. Most notably, we present aligned data from phase 3 of the Drosophila Population Genomics Project (DPGP3), which provides 197 genomes from a single ancestral range population of D. melanogaster (from Zambia). The large sample size, high genetic diversity, and potentially simpler demographic history of the DPGP3 sample will make this a highly valuable resource for fundamental population genetic research. The complete set of assemblies described here, termed the Drosophila Genome Nexus, presently comprises 623 consistently aligned genomes and is publicly available in multiple formats with supporting documentation and bioinformatic tools. This resource will greatly facilitate population genomic analysis in this model species by reducing the methodological differences between data sets.

277 citations


Journal ArticleDOI
01 Aug 2015-Genetics
TL;DR: The parent–child pairs revealed a trend toward increasing exogamy over time; the presence in the cohort of individuals endorsing multiple race/ethnicity categories creates interesting challenges and future opportunities for genetic epidemiologic studies.
Abstract: Using genome-wide genotypes, we characterized the genetic structure of 103,006 participants in the Kaiser Permanente Northern California multi-ethnic Genetic Epidemiology Research on Adult Health and Aging Cohort and analyzed the relationship to self-reported race/ethnicity. Participants endorsed any of 23 race/ethnicity/nationality categories, which were collapsed into seven major race/ethnicity groups. By self-report the cohort is 80.8% white and 19.2% minority; 93.8% endorsed a single race/ethnicity group, while 6.2% endorsed two or more. Principal component (PC) and admixture analyses were generally consistent with prior studies. Approximately 17% of subjects had genetic ancestry from more than one continent, and 12% were genetically admixed, considering only nonadjacent geographical origins. Self-reported whites were spread on a continuum along the first two PCs, indicating extensive mixing among European nationalities. Self-identified East Asian nationalities correlated with genetic clustering, consistent with extensive endogamy. Individuals of mixed East Asian–European genetic ancestry were easily identified; we also observed a modest amount of European genetic ancestry in individuals self-identified as Filipinos. Self-reported African Americans and Latinos showed extensive European and African genetic ancestry, and Native American genetic ancestry for the latter. Among 3741 genetically identified parent–child pairs, 93% were concordant for self-reported race/ethnicity; among 2018 genetically identified full-sib pairs, 96% were concordant; the lower rate for parent–child pairs was largely due to intermarriage. The parent–child pairs revealed a trend toward increasing exogamy over time; the presence in the cohort of individuals endorsing multiple race/ethnicity categories creates interesting challenges and future opportunities for genetic epidemiologic studies.

274 citations


Journal ArticleDOI
01 Oct 2015-Genetics
TL;DR: There are many distinct fly models for a range of neurodegenerative diseases; this work focuses on select studies from models of polyglutamine disease and amyotrophic lateral sclerosis that illustrate the type and range of insights that can be gleaned.
Abstract: With the increase in the ageing population, neurodegenerative disease is devastating to families and poses a huge burden on society. The brain and spinal cord are extraordinarily complex: they consist of a highly organized network of neuronal and support cells that communicate in a highly specialized manner. One approach to tackling problems of such complexity is to address the scientific questions in simpler, yet analogous, systems. The fruit fly, Drosophila melanogaster, has been proven tremendously valuable as a model organism, enabling many major discoveries in neuroscientific disease research. The plethora of genetic tools available in Drosophila allows for exquisite targeted manipulation of the genome. Due to its relatively short lifespan, complex questions of brain function can be addressed more rapidly than in other model organisms, such as the mouse. Here we discuss features of the fly as a model for human neurodegenerative disease. There are many distinct fly models for a range of neurodegenerative diseases; we focus on select studies from models of polyglutamine disease and amyotrophic lateral sclerosis that illustrate the type and range of insights that can be gleaned. In discussion of these models, we underscore strengths of the fly in providing understanding into mechanisms and pathways, as a foundation for translational and therapeutic research.

Journal ArticleDOI
01 Apr 2015-Genetics
TL;DR: A strategy is described and validated for Caenorhabditis elegans that reliably achieved a high frequency of genome editing for all targets tested in vivo and combined the 3′ GG guide improvement with a co-CRISPR/co-conversion approach to provide a powerful means to obtain desired genetic changes in an otherwise unaltered genome.
Abstract: Success with genome editing by the RNA-programmed nuclease Cas9 has been limited by the inability to predict effective guide RNAs and DNA target sites. Not all guide RNAs have been successful, and even those that were, varied widely in their efficacy. Here we describe and validate a strategy for Caenorhabditis elegans that reliably achieved a high frequency of genome editing for all targets tested in vivo. The key innovation was to design guide RNAs with a GG motif at the 3' end of their target-specific sequences. All guides designed using this simple principle induced a high frequency of targeted mutagenesis via nonhomologous end joining (NHEJ) and a high frequency of precise DNA integration from exogenous DNA templates via homology-directed repair (HDR). Related guide RNAs having the GG motif shifted by only three nucleotides showed severely reduced or no genome editing. We also combined the 3' GG guide improvement with a co-CRISPR/co-conversion approach. For this co-conversion scheme, animals were only screened for genome editing at designated targets if they exhibited a dominant phenotype caused by Cas9-dependent editing of an unrelated target. Combining the two strategies further enhanced the ease of mutant recovery, thereby providing a powerful means to obtain desired genetic changes in an otherwise unaltered genome.

Journal ArticleDOI
01 Nov 2015-Genetics
TL;DR: This primer describes the organism’s natural history, the features of sequenced genomes within the genus, the wide range of available genetic tools and online resources, the types of biological questions Drosophila can help address, and historical milestones.
Abstract: Fruit flies of the genus Drosophila have been an attractive and effective genetic model organism since Thomas Hunt Morgan and colleagues made seminal discoveries with them a century ago. Work with Drosophila has enabled dramatic advances in cell and developmental biology, neurobiology and behavior, molecular biology, evolutionary and population genetics, and other fields. With more tissue types and observable behaviors than in other short-generation model organisms, and with vast genome data available for many species within the genus, the fly’s tractable complexity will continue to enable exciting opportunities to explore mechanisms of complex developmental programs, behaviors, and broader evolutionary questions. This primer describes the organism’s natural history, the features of sequenced genomes within the genus, the wide range of available genetic tools and online resources, the types of biological questions Drosophila can help address, and historical milestones.

Journal ArticleDOI
01 Jan 2015-Genetics
TL;DR: This work employed electroporation as a means to deliver the CRISPR/Cas9 components into mouse zygotes and recovered live mice with targeted nonhomologous end joining and homology-directed repair mutations with high efficiency.
Abstract: The clustered regularly interspaced short palindromic repeat (CRISPR)/CRISPR-associated protein (Cas) system is an adaptive immune system in bacteria and archaea that has recently been exploited for genome engineering. Mutant mice can be generated in one step through direct delivery of the CRISPR/Cas9 components into a mouse zygote. Although the technology is robust, delivery remains a bottleneck, as it involves manual injection of the components into the pronuclei or the cytoplasm of mouse zygotes, which is technically demanding and inherently low throughput. To overcome this limitation, we employed electroporation as a means to deliver the CRISPR/Cas9 components, including Cas9 messenger RNA, single-guide RNA, and donor oligonucleotide, into mouse zygotes and recovered live mice with targeted nonhomologous end joining and homology-directed repair mutations with high efficiency. Our results demonstrate that mice carrying CRISPR/Cas9-mediated targeted mutations can be obtained with high efficiency by zygote electroporation.

Journal ArticleDOI
01 Oct 2015-Genetics
TL;DR: It is concluded that prediction accuracy can be improved by modeling epistasis for selfing species but may not for outcrossing species, and why the RKHS model based on a Gaussian kernel captures epistatic effects among markers.
Abstract: Modeling epistasis in genomic selection is impeded by a high computational load. The extended genomic best linear unbiased prediction (EG-BLUP) with an epistatic relationship matrix and the reproducing kernel Hilbert space regression (RKHS) are two attractive approaches that reduce the computational load. In this study, we proved the equivalence of EG-BLUP and genomic selection approaches, explicitly modeling epistatic effects. Moreover, we have shown why the RKHS model based on a Gaussian kernel captures epistatic effects among markers. Using experimental data sets in wheat and maize, we compared different genomic selection approaches and concluded that prediction accuracy can be improved by modeling epistasis for selfing species but may not for outcrossing species.

Journal ArticleDOI
01 Jul 2015-Genetics
TL;DR: This study shows that CAVIar and BIMBAM are actually approximately equivalent to each other, and develops a fine-mapping method using marginal test statistics in the Bayesian framework, which is called CAVIAR Bayes factor (CAVIARBF).
Abstract: Two recently developed fine-mapping methods, CAVIAR and PAINTOR, demonstrate better performance over other fine-mapping methods. They also have the advantage of using only the marginal test statistics and the correlation among SNPs. Both methods leverage the fact that the marginal test statistics asymptotically follow a multivariate normal distribution and are likelihood based. However, their relationship with Bayesian fine mapping, such as BIMBAM, is not clear. In this study, we first show that CAVIAR and BIMBAM are actually approximately equivalent to each other. This leads to a fine-mapping method using marginal test statistics in the Bayesian framework, which we call CAVIAR Bayes factor (CAVIARBF). Another advantage of the Bayesian framework is that it can answer both association and fine-mapping questions. We also used simulations to compare CAVIARBF with other methods under different numbers of causal variants. The results showed that both CAVIARBF and BIMBAM have better performance than PAINTOR and other methods. Compared to BIMBAM, CAVIARBF has the advantage of using only marginal test statistics and takes about one-quarter to one-fifth of the running time. We applied different methods on two independent cohorts of the same phenotype. Results showed that CAVIARBF, BIMBAM, and PAINTOR selected the same top 3 SNPs; however, CAVIARBF and BIMBAM had better consistency in selecting the top 10 ranked SNPs between the two cohorts. Software is available at https://bitbucket.org/Wenan/caviarbf.

Journal ArticleDOI
01 Feb 2015-Genetics
TL;DR: The current mechanistic understanding of each step in the process of yeast CME, and the essential roles played by actin polymerization at these sites are described, while providing a historical perspective of how the landscape has changed since the preceding version of the YeastBook was published 17 years ago (1997).
Abstract: Endocytosis, the process whereby the plasma membrane invaginates to form vesicles, is essential for bringing many substances into the cell and for membrane turnover. The mechanism driving clathrin-mediated endocytosis (CME) involves > 50 different protein components assembling at a single location on the plasma membrane in a temporally ordered and hierarchal pathway. These proteins perform precisely choreographed steps that promote receptor recognition and clustering, membrane remodeling, and force-generating actin-filament assembly and turnover to drive membrane invagination and vesicle scission. Many critical aspects of the CME mechanism are conserved from yeast to mammals and were first elucidated in yeast, demonstrating that it is a powerful system for studying endocytosis. In this review, we describe our current mechanistic understanding of each step in the process of yeast CME, and the essential roles played by actin polymerization at these sites, while providing a historical perspective of how the landscape has changed since the preceding version of the YeastBook was published 17 years ago (1997). Finally, we discuss the key unresolved issues and where future studies might be headed.

Journal ArticleDOI
01 Feb 2015-Genetics
TL;DR: It is demonstrated that 60mer oligos with 29 bp of homology drive efficient knock-in of point mutations, and that disabling nonhomologous end joining by RNAi inactivation of the cku-80 gene significantly improves knock- in efficiency.
Abstract: As in other organisms, CRISPR/Cas9 methods provide a powerful approach for genome editing in the nematode Caenorhabditis elegans. Oligonucleotides are excellent repair templates for introducing substitutions and short insertions, as they are cost effective, require no cloning, and appear in other organisms to target changes by homologous recombination at DNA double-strand breaks (DSBs). Here, I describe a methodology in C. elegans to efficiently knock in epitope tags in 8-9 days, using a temperature-sensitive lethal mutation in the pha-1 gene as a co-conversion marker. I demonstrate that 60mer oligos with 29 bp of homology drive efficient knock-in of point mutations, and that disabling nonhomologous end joining by RNAi inactivation of the cku-80 gene significantly improves knock-in efficiency. Homology arms of 35-80 bp are sufficient for efficient editing and DSBs up to 54 bp away from the insertion site produced knock-ins. These findings will likely be applicable for a range of genome editing approaches in C. elegans, which will improve editing efficiency and minimize screening efforts.

Journal ArticleDOI
01 Oct 2015-Genetics
TL;DR: The fission yeast Schizosaccharomyces pombe appears to have evolved less rapidly than S. cerevisiae so that it retains more characteristics of the common ancient yeast ancestor, causing it to share more features with metazoan cells.
Abstract: The fission yeast Schizosaccharomyces pombe is an important model organism for the study of eukaryotic molecular and cellular biology. Studies of S. pombe, together with studies of its distant cousin, Saccharomyces cerevisiae, have led to the discovery of genes involved in fundamental mechanisms of transcription, translation, DNA replication, cell cycle control, and signal transduction, to name but a few processes. However, since the divergence of the two species approximately 350 million years ago, S. pombe appears to have evolved less rapidly than S. cerevisiae so that it retains more characteristics of the common ancient yeast ancestor, causing it to share more features with metazoan cells. This Primer introduces S. pombe by describing the yeast itself, providing a brief description of the origins of fission yeast research, and illustrating some genetic and bioinformatics tools used to study protein function in fission yeast. In addition, a section on some key differences between S. pombe and S. cerevisiae is included for readers with some familiarity with budding yeast research but who may have an interest in developing research projects using S. pombe.

Journal ArticleDOI
01 Jan 2015-Genetics
TL;DR: The high quality and large scale of genotype data created on this cohort, in conjunction with comprehensive longitudinal data from the KP electronic health records of participants, will enable a broad range of highly powered genome-wide association studies on a diversity of traits and conditions.
Abstract: The Kaiser Permanente (KP) Research Program on Genes, Environment and Health (RPGEH), in collaboration with the University of California—San Francisco, undertook genome-wide genotyping of >100,000 subjects that constitute the Genetic Epidemiology Research on Adult Health and Aging (GERA) cohort. The project, which generated >70 billion genotypes, represents the first large-scale use of the Affymetrix Axiom Genotyping Solution. Because genotyping took place over a short 14-month period, creating a near-real-time analysis pipeline for experimental assay quality control and final optimized analyses was critical. Because of the multi-ethnic nature of the cohort, four different ethnic-specific arrays were employed to enhance genome-wide coverage. All assays were performed on DNA extracted from saliva samples. To improve sample call rates and significantly increase genotype concordance, we partitioned the cohort into disjoint packages of plates with similar assay contexts. Using strict QC criteria, the overall genotyping success rate was 103,067 of 109,837 samples assayed (93.8%), with a range of 92.1–95.4% for the four different arrays. Similarly, the SNP genotyping success rate ranged from 98.1 to 99.4% across the four arrays, the variation depending mostly on how many SNPs were included as single copy vs. double copy on a particular array. The high quality and large scale of genotype data created on this cohort, in conjunction with comprehensive longitudinal data from the KP electronic health records of participants, will enable a broad range of highly powered genome-wide association studies on a diversity of traits and conditions.

Journal ArticleDOI
01 Feb 2015-Genetics
TL;DR: Mixed models at the individual plant or plot level produced more realistic heritability estimates, and for simulated traits standard errors were up to 13 times smaller, and genomic prediction was improved by using these mixed models, with up to a 49% increase in accuracy.
Abstract: Heritability is a central parameter in quantitative genetics, from both an evolutionary and a breeding perspective. For plant traits heritability is traditionally estimated by comparing within- and between-genotype variability. This approach estimates broad-sense heritability and does not account for different genetic relatedness. With the availability of high-density markers there is growing interest in marker-based estimates of narrow-sense heritability, using mixed models in which genetic relatedness is estimated from genetic markers. Such estimates have received much attention in human genetics but are rarely reported for plant traits. A major obstacle is that current methodology and software assume a single phenotypic value per genotype, hence requiring genotypic means. An alternative that we propose here is to use mixed models at the individual plant or plot level. Using statistical arguments, simulations, and real data we investigate the feasibility of both approaches and how these affect genomic prediction with the best linear unbiased predictor and genome-wide association studies. Heritability estimates obtained from genotypic means had very large standard errors and were sometimes biologically unrealistic. Mixed models at the individual plant or plot level produced more realistic estimates, and for simulated traits standard errors were up to 13 times smaller. Genomic prediction was also improved by using these mixed models, with up to a 49% increase in accuracy. For genome-wide association studies on simulated traits, the use of individual plant data gave almost no increase in power. The new methodology is applicable to any complex trait where multiple replicates of individual genotypes can be scored. This includes important agronomic crops, as well as bacteria and fungi.

Journal ArticleDOI
01 Jun 2015-Genetics
TL;DR: It is shown that the clades coexisted for >6000 generations before one went extinct and evolved a frequency-dependent interaction, which prevented the immediate competitive exclusion of either clade, but which collapsed as beneficial mutations accumulated in the clade that prevailed.
Abstract: Twelve replicate populations of Escherichia coli have been evolving in the laboratory for >25 years and 60,000 generations. We analyzed bacteria from whole-population samples frozen every 500 generations through 20,000 generations for one well-studied population, called Ara−1. By tracking 42 known mutations in these samples, we reconstructed the history of this population’s genotypic evolution over this period. The evolutionary dynamics of Ara−1 show strong evidence of selective sweeps as well as clonal interference between competing lineages bearing different beneficial mutations. In some cases, sets of several mutations approached fixation simultaneously, often conveying no information about their order of origination; we present several possible explanations for the existence of these mutational cohorts. Against a backdrop of rapid selective sweeps both earlier and later, two genetically diverged clades coexisted for >6000 generations before one went extinct. In that time, many additional mutations arose in the clade that eventually prevailed. We show that the clades evolved a frequency-dependent interaction, which prevented the immediate competitive exclusion of either clade, but which collapsed as beneficial mutations accumulated in the clade that prevailed. Clonal interference and frequency dependence can occur even in the simplest microbial populations. Furthermore, frequency dependence may generate dynamics that extend the period of coexistence that would otherwise be sustained by clonal interference alone.

Journal ArticleDOI
01 Mar 2015-Genetics
TL;DR: Drosophila is an excellent model organism for studies that have translational impact for genetic disease and for other medical implications such as vector-borne illnesses, and a better collaboration between DrosophILA geneticists/biologists and human geneticist/bioinformaticians/clinicians is promoted.
Abstract: Many scientists complain that the current funding situation is dire. Indeed, there has been an overall decline in support in funding for research from the National Institutes of Health and the National Science Foundation. Within the Drosophila field, some of us question how long this funding crunch will last as it demotivates principal investigators and perhaps more importantly affects the long-term career choice of many young scientists. Yet numerous very interesting biological processes and avenues remain to be investigated in Drosophila, and probing questions can be answered fast and efficiently in flies to reveal new biological phenomena. Moreover, Drosophila is an excellent model organism for studies that have translational impact for genetic disease and for other medical implications such as vector-borne illnesses. We would like to promote a better collaboration between Drosophila geneticists/biologists and human geneticists/bioinformaticians/clinicians, as it would benefit both fields and significantly impact the research on human diseases.

Journal ArticleDOI
01 Oct 2015-Genetics
TL;DR: Techniques for studying synapses in Drosophila are discussed, with a focus on the larval neuromuscular junction (NMJ), a well-established model glutamatergic synapse.
Abstract: Chemical synapses are sites of contact and information transfer between a neuron and its partner cell. Each synapse is a specialized junction, where the presynaptic cell assembles machinery for the release of neurotransmitter, and the postsynaptic cell assembles components to receive and integrate this signal. Synapses also exhibit plasticity, during which synaptic function and/or structure are modified in response to activity. With a robust panel of genetic, imaging, and electrophysiology approaches, and strong evolutionary conservation of molecular components, Drosophila has emerged as an essential model system for investigating the mechanisms underlying synaptic assembly, function, and plasticity. We will discuss techniques for studying synapses in Drosophila, with a focus on the larval neuromuscular junction (NMJ), a well-established model glutamatergic synapse. Vesicle fusion, which underlies synaptic release of neurotransmitters, has been well characterized at this synapse. In addition, studies of synaptic assembly and organization of active zones and postsynaptic densities have revealed pathways that coordinate those events across the synaptic cleft. We will also review modes of synaptic growth and plasticity at the fly NMJ, and discuss how pre- and postsynaptic cells communicate to regulate plasticity in response to activity.

Journal ArticleDOI
01 Aug 2015-Genetics
TL;DR: A linkage disequilibrium-based test for deducing the QTL allele was developed, and how it was used to produce IPN-resistant salmon, leading to a 75% decrease in the number of IPN outbreaks in the salmon farming industry.
Abstract: Infectious pancreatic necrosis virus (IPNV) is the cause of one of the most prevalent diseases in farmed Atlantic salmon (Salmo salar). A quantitative trait locus (QTL) has been found to be responsible for most of the genetic variation in resistance to the virus. Here we describe how a linkage disequilibrium-based test for deducing the QTL allele was developed, and how it was used to produce IPN-resistant salmon, leading to a 75% decrease in the number of IPN outbreaks in the salmon farming industry. Furthermore, we describe how whole-genome sequencing of individuals with deduced QTL genotypes was used to map the QTL down to a region containing an epithelial cadherin (cdh1) gene. In a coimmunoprecipitation assay, the Cdh1 protein was found to bind to IPNV virions, strongly indicating that the protein is part of the machinery used by the virus for internalization. Immunofluorescence revealed that the virus colocalizes with IPNV in the endosomes of homozygous susceptible individuals but not in the endosomes of homozygous resistant individuals. A putative causal single nucleotide polymorphism was found within the full-length cdh1 gene, in phase with the QTL in all observed haplotypes except one; the absence of a single, all-explaining DNA polymorphism indicates that an additional causative polymorphism may contribute to the observed QTL genotype patterns. Cdh1 has earlier been shown to be necessary for the internalization of certain bacteria and fungi, but this is the first time the protein is implicated in internalization of a virus.

Journal ArticleDOI
01 Jul 2015-Genetics
TL;DR: An oligonucleotide (oligo)-based chromosome painting technique in cucumber that will be applicable in any plant species with a sequenced genome is developed and precisely map the pairing between cucumber chromosome 7 and chromosome 1 of Cucumis hystrix in a F1 hybrid.
Abstract: Chromosome-specific painting is a powerful technique in molecular cytogenetic and genome research. We developed an oligonucleotide (oligo)-based chromosome painting technique in cucumber (Cucumis sativus) that will be applicable in any plant species with a sequenced genome. Oligos specific to a single chromosome of cucumber were identified using a newly developed bioinformatic pipeline and then massively synthesized de novo in parallel. The synthesized oligos were amplified and labeled with biotin or digoxigenin for use in fluorescence in situ hybridization (FISH). We developed three different probes with each containing 23,000-27,000 oligos. These probes spanned 8.3-17 Mb of DNA on targeted cucumber chromosomes and had the densities of 1.5-3.2 oligos per kilobases. These probes produced FISH signals on a single cucumber chromosome and were used to paint homeologous chromosomes in other Cucumis species diverged from cucumber for up to 12 million years. The bulked oligo probes allowed us to track a single chromosome in early stages during meiosis. We were able to precisely map the pairing between cucumber chromosome 7 and chromosome 1 of Cucumis hystrix in a F1 hybrid. These two homeologous chromosomes paired in 71% of prophase I cells but only 25% of metaphase I cells, which may provide an explanation of the higher recombination rates compared to the chiasma frequencies between homeologous chromosomes reported in plant hybrids.

Journal ArticleDOI
01 Jan 2015-Genetics
TL;DR: CRISPR/Cas9-mediated genome engineering is employed to create AKH and AKH plus APRP-specific mutants in the model insect Drosophila melanogaster to provide evidence for a negative autoregulatory loop in Akh gene regulation.
Abstract: Maintenance of biological functions under negative energy balance depends on mobilization of storage lipids and carbohydrates in animals. In mammals, glucagon and glucocorticoid signaling mobilizes energy reserves, whereas adipokinetic hormones (AKHs) play a homologous role in insects. Numerous studies based on AKH injections and correlative studies in a broad range of insect species established the view that AKH acts as master regulator of energy mobilization during development, reproduction, and stress. In contrast to AKH, the second peptide, which is processed from the Akh encoded prohormone [termed “adipokinetic hormone precursor-related peptide” (APRP)] is functionally orphan. APRP is discussed as ecdysiotropic hormone or as scaffold peptide during AKH prohormone processing. However, as in the case of AKH, final evidence for APRP functions requires genetic mutant analysis. Here we employed CRISPR/Cas9-mediated genome engineering to create AKH and AKH plus APRP-specific mutants in the model insect Drosophila melanogaster. Lack of APRP did not affect any of the tested steroid-dependent processes. Similarly, Drosophila AKH signaling is dispensable for ontogenesis, locomotion, oogenesis, and homeostasis of lipid or carbohydrate storage until up to the end of metamorphosis. During adulthood, however, AKH regulates body fat content and the hemolymph sugar level as well as nutritional and oxidative stress responses. Finally, we provide evidence for a negative autoregulatory loop in Akh gene regulation.

Journal ArticleDOI
01 Sep 2015-Genetics
TL;DR: The results demonstrate that starvation affects a variety of life-history traits in the exposed animals and their descendants, some presumably reflecting fitness costs but others potentially adaptive.
Abstract: Starvation during early development can have lasting effects that influence organismal fitness and disease risk. We characterized the long-term phenotypic consequences of starvation during early larval development in Caenorhabditis elegans to determine potential fitness effects and develop it as a model for mechanistic studies. We varied the amount of time that larvae were developmentally arrested by starvation after hatching (“L1 arrest”). Worms recovering from extended starvation grew slowly, taking longer to become reproductive, and were smaller as adults. Fecundity was also reduced, with the smallest individuals most severely affected. Feeding behavior was impaired, possibly contributing to deficits in growth and reproduction. Previously starved larvae were more sensitive to subsequent starvation, suggesting decreased fitness even in poor conditions. We discovered that smaller larvae are more resistant to heat, but this correlation does not require passage through L1 arrest. The progeny of starved animals were also adversely affected: Embryo quality was diminished, incidence of males was increased, progeny were smaller, and their brood size was reduced. However, the progeny and grandprogeny of starved larvae were more resistant to starvation. In addition, the progeny, grandprogeny, and great-grandprogeny were more resistant to heat, suggesting epigenetic inheritance of acquired resistance to starvation and heat. Notably, such resistance was inherited exclusively from individuals most severely affected by starvation in the first generation, suggesting an evolutionary bet-hedging strategy. In summary, our results demonstrate that starvation affects a variety of life-history traits in the exposed animals and their descendants, some presumably reflecting fitness costs but others potentially adaptive.

Journal ArticleDOI
01 Jun 2015-Genetics
TL;DR: A system for one-generation multiplex conditional mutagenesis in zebrafish using transgenic expression of both cas9 and multiple single guide RNAs and sgRNAs is demonstrated.
Abstract: Determining the mechanism of gene function is greatly enhanced using conditional mutagenesis. However, generating engineered conditional alleles is inefficient and has only been widely used in mice. Importantly, multiplex conditional mutagenesis requires extensive breeding. Here we demonstrate a system for one-generation multiplex conditional mutagenesis in zebrafish (Danio rerio) using transgenic expression of both cas9 and multiple single guide RNAs (sgRNAs). We describe five distinct zebrafish U6 promoters for sgRNA expression and demonstrate efficient multiplex biallelic inactivation of tyrosinase and insulin receptor a and b, resulting in defects in pigmentation and glucose homeostasis. Furthermore, we demonstrate temporal and tissue-specific mutagenesis using transgenic expression of Cas9. Heat-shock-inducible expression of cas9 allows temporal control of tyr mutagenesis. Liver-specific expression of cas9 disrupts insulin receptor a and b, causing fasting hypoglycemia and postprandial hyperglycemia. We also show that delivery of sgRNAs targeting ascl1a into the eye leads to impaired damage-induced photoreceptor regeneration. Our findings suggest that CRISPR/Cas9-based conditional mutagenesis in zebrafish is not only feasible but rapid and straightforward.

Journal ArticleDOI
01 Jan 2015-Genetics
TL;DR: The results indicate that GC-biased gene conversion does not play a major role in shaping the nucleotide composition of the S. pombe genome and suggest that the mechanisms of DNA maintenance may have diverged significantly between fission and budding yeasts.
Abstract: The rate at which new mutations arise in the genome is a key factor in the evolution and adaptation of species. Here we describe the rate and spectrum of spontaneous mutations for the fission yeast Schizosaccharomyces pombe, a key model organism with many similarities to higher eukaryotes. We undertook an ∼1700-generation mutation accumulation (MA) experiment with a haploid S. pombe, generating 422 single-base substitutions and 119 insertion-deletion mutations (indels) across the 96 replicates. This equates to a base-substitution mutation rate of 2.00 × 10(-10) mutations per site per generation, similar to that reported for the distantly related budding yeast Saccharomyces cerevisiae. However, these two yeast species differ dramatically in their spectrum of base substitutions, the types of indels (S. pombe is more prone to insertions), and the pattern of selection required to counteract a strong AT-biased mutation rate. Overall, our results indicate that GC-biased gene conversion does not play a major role in shaping the nucleotide composition of the S. pombe genome and suggest that the mechanisms of DNA maintenance may have diverged significantly between fission and budding yeasts. Unexpectedly, CpG sites appear to be excessively liable to mutation in both species despite the likely absence of DNA methylation.