Showing papers by "Cold Spring Harbor Laboratory published in 2014"
••
TL;DR: The Reactome Knowledgebase provides molecular details of signal transduction, transport, DNA replication, metabolism and other cellular processes as an ordered network of molecular transformations—an extended version of a classic metabolic map, in a single consistent data model.
Abstract: The Reactome Knowledgebase (www.reactome.org) provides molecular details of signal transduction, transport, DNA replication, metabolism and other cellular processes as an ordered network of molecular transformations-an extended version of a classic metabolic map, in a single consistent data model. Reactome functions both as an archive of biological processes and as a tool for discovering unexpected functional relationships in data such as gene expression pattern surveys or somatic mutation catalogues from tumour cells. Over the last two years we redeveloped major components of the Reactome web interface to improve usability, responsiveness and data visualization. A new pathway diagram viewer provides a faster, clearer interface and smooth zooming from the entire reaction network to the details of individual reactions. Tool performance for analysis of user datasets has been substantially improved, now generating detailed results for genome-wide expression datasets within seconds. The analysis module can now be accessed through a RESTFul interface, facilitating its inclusion in third party applications. A new overview module allows the visualization of analysis results on a genome-wide Reactome pathway hierarchy using a single screen page. The search interface now provides auto-completion as well as a faceted search to narrow result lists efficiently.
5,065 citations
••
TL;DR: It is estimated that LGD mutation in about 400 genes can contribute to the joint class of affected females and males of lower IQ, with an overlapping and similar number of genes vulnerable to contributory missense mutation.
Abstract: Whole exome sequencing has proven to be a powerful tool for understanding the genetic architecture of human disease. Here we apply it to more than 2,500 simplex families, each having a child with an autistic spectrum disorder. By comparing affected to unaffected siblings, we show that 13% of de novo missense mutations and 43% of de novo likely gene-disrupting (LGD) mutations contribute to 12% and 9% of diagnoses, respectively. Including copy number variants, coding de novo mutations contribute to about 30% of all simplex and 45% of female diagnoses. Almost all LGD mutations occur opposite wild-type alleles. LGD targets in affected females significantly overlap the targets in males of lower intelligence quotient (IQ), but neither overlaps significantly with targets in males of higher IQ. We estimate that LGD mutation in about 400 genes can contribute to the joint class of affected females and males of lower IQ, with an overlapping and similar number of genes vulnerable to contributory missense mutation. LGD targets in the joint class overlap with published targets for intellectual disability and schizophrenia, and are enriched for chromatin modifiers, FMRP-associated genes and embryonically expressed genes. Most of the significance for the latter comes from affected females.
2,124 citations
••
Alistair R. R. Forrest, Hideya Kawaji, Michael Rehli1, J Kenneth Baillie2 +277 more•Institutions (63)
TL;DR: For example, the authors mapped transcription start sites (TSSs) and their usage in human and mouse primary cells, cell lines and tissues to produce a comprehensive overview of mammalian gene expression across the human body.
Abstract: Regulated transcription controls the diversity, developmental pathways and spatial organization of the hundreds of cell types that make up a mammal Using single-molecule cDNA sequencing, we mapped transcription start sites (TSSs) and their usage in human and mouse primary cells, cell lines and tissues to produce a comprehensive overview of mammalian gene expression across the human body We find that few genes are truly 'housekeeping', whereas many mammalian promoters are composite entities composed of several closely separated TSSs, with independent cell-type-specific expression profiles TSSs specific to different cell types evolve at different rates, whereas promoters of broadly expressed genes are the most conserved Promoter-based expression analysis reveals key transcription factors defining cell states and links them to binding-site motifs The functions of identified novel transcripts can be predicted by coexpression and sample ontology enrichment analyses The functional annotation of the mammalian genome 5 (FANTOM5) project provides comprehensive expression profiles and functional annotation of mammalian cell-type-specific transcriptomes with wide applications in biomedical research
1,715 citations
••
University of California, San Diego1, Pennsylvania State University2, Stanford University3, University of Washington4, University of Michigan5, Florida State University6, New College of Florida7, Cold Spring Harbor Laboratory8, California Institute of Technology9, University of Vienna10, Emory University11, Fred Hutchinson Cancer Research Center12, Massachusetts Institute of Technology13, Broad Institute14, University of California, Irvine15, University of California, Santa Cruz16, University of California, San Francisco17, Yale University18, University of Florida19, Johns Hopkins University20, University College London21, University of Oxford22, Cornell University23, Memorial Sloan Kettering Cancer Center24, Harvard University25, University of Iowa26, Yeshiva University27, University of Pennsylvania28, Washington University in St. Louis29, National Institutes of Health30, University of North Carolina at Chapel Hill31
TL;DR: The mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications and replication domains throughout the mouse genome in diverse cell and tissue types as mentioned in this paper.
Abstract: The laboratory mouse shares the majority of its protein-coding genes with humans, making it the premier model organism in biomedical research, yet the two mammals differ in significant ways To gain greater insights into both shared and species-specific transcriptional and cellular regulatory programs in the mouse, the Mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications and replication domains throughout the mouse genome in diverse cell and tissue types By comparing with the human genome, we not only confirm substantial conservation in the newly annotated potential functional sequences, but also find a large degree of divergence of sequences involved in transcriptional regulation, chromatin state and higher order chromatin organization Our results illuminate the wide range of evolutionary forces acting on genes and their regulatory regions, and provide a general resource for research into mammalian biology and mechanisms of human diseases
1,335 citations
••
TL;DR: This perspective emphasizes that the ultimate goal is to dispense with classification criteria and directly define interneuron types by function, and views them as elaborations of a much more finite group of developmentally specified cardinal classes that become further specialized as they mature.
Abstract: Understanding brain circuits begins with an appreciation of their component parts - the cells. Although GABAergic interneurons are a minority population within the brain, they are crucial for the control of inhibition. Determining the diversity of these interneurons has been a central goal of neurobiologists, but this amazing cell type has so far defied a generalized classification system. Interneuron complexity within the telencephalon could be simplified by viewing them as elaborations of a much more finite group of developmentally specified cardinal classes that become further specialized as they mature. Our perspective emphasizes that the ultimate goal is to dispense with classification criteria and directly define interneuron types by function.
927 citations
••
TL;DR: These findings establish a cortical circuit for the enhancement of visual response by locomotion and provide a potential common Circuit for the modulation of sensory processing by behavioral state.
841 citations
••
Salk Institute for Biological Studies1, Cold Spring Harbor Laboratory2, University of California, San Francisco3, Leidos4, University of California, Los Angeles5, Tohoku University6, University of New South Wales7, Garvan Institute of Medical Research8, University of Sydney9, Westmead Hospital10, Hospital of the University of Pennsylvania11, University of Pennsylvania12, Howard Hughes Medical Institute13
TL;DR: It is shown that VDR acts as a master transcriptional regulator of PSCs to reprise the quiescent state, resulting in induced stromal remodeling, increased intratumoral gemcitabine, reduced tumor volume, and a 57% increase in survival compared to chemotherapy alone.
827 citations
••
TL;DR: The current understanding of molecular mechanisms that underlie the promising therapeutic effects of BET bromodomain inhibition are reviewed.
698 citations
••
Massachusetts Institute of Technology1, California Institute of Technology2, Stanford University3, Harvard University4, Broad Institute5, Duke University6, University of Massachusetts Medical School7, National Institutes of Health8, University of Southern California9, Yale University10, Florida State University11, Cold Spring Harbor Laboratory12, Wellcome Trust Sanger Institute13, University of California, Santa Cruz14, Princeton University15, University of California, San Diego16, University of Washington17, University of Chicago18, Pennsylvania State University19
TL;DR: The strengths and limitations of biochemical, evolutionary, and genetic approaches for defining functional DNA segments, potential sources for the observed differences in estimated genomic coverage, and the biological implications of these discrepancies are reviewed.
Abstract: With the completion of the human genome sequence, attention turned to identifying and annotating its functional DNA elements. As a complement to genetic and comparative genomics approaches, the Encyclopedia of DNA Elements Project was launched to contribute maps of RNA transcripts, transcriptional regulator binding sites, and chromatin states in many cell types. The resulting genome-wide data reveal sites of biochemical activity with high positional resolution and cell type specificity that facilitate studies of gene regulation and interpretation of noncoding variants associated with human disease. However, the biochemically active regions cover a much larger fraction of the genome than do evolutionarily conserved regions, raising the question of whether nonconserved but biochemically active regions are truly functional. Here, we review the strengths and limitations of biochemical, evolutionary, and genetic approaches for defining functional DNA segments, potential sources for the observed differences in estimated genomic coverage, and the biological implications of these discrepancies. We also analyze the relationship between signal intensity, genomic coverage, and evolutionary conservation. Our results reinforce the principle that each approach provides complementary information and that we need to use combinations of all three to elucidate genome function in human biology and disease.
691 citations
••
TL;DR: A comprehensive review on the fundamental role of cancer-associated fibroblasts in shaping the tumor microenvironment and promoting tumor initiation and progression is provided.
Abstract: Fibroblasts regulate the structure and function of healthy tissues, participate transiently in tissue repair after acute inflammation, and assume an aberrant stimulatory role during chronic inflammatory states including cancer. Such cancer-associated fibroblasts (CAFs) modulate the tumor microenvironment and influence the behavior of neoplastic cells in either a tumor-promoting or tumor-inhibiting manner. These pleiotropic functions highlight the inherent plasticity of fibroblasts and may provide new avenues to understand and therapeutically intervene in malignancies. We discuss the emerging themes of CAF biology in the context of tumorigenesis and therapy.
646 citations
••
TL;DR: The CRISPR/Cas9 system is highly efficient at generating targeted mutations in stable transgenic tomato plants, and homozygous deletions of a desired size can be created in the first generation.
Abstract: During the past 12 years, there has been rapid development of genome-editing strategies that make it possible to directly target regions of genes in a DNA sequence-specific manner. Two of these strategies, zinc finger nucleases ([Urnov et al., 2010][1]) and transcription activator-like nucleases ([
••
TL;DR: It is found that a small set of mostly neural-specific genes has the potential to encode thousands of transcripts each through extensive alternative promoter usage and RNA splicing in Drosophila melanogaster.
Abstract: Animal transcriptomes are dynamic, with each cell type, tissue and organ system expressing an ensemble of transcript isoforms that give rise to substantial diversity. Here we have identified new genes, transcripts and proteins using poly(A)+ RNA sequencing from Drosophila melanogaster in cultured cell lines, dissected organ systems and under environmental perturbations. We found that a small set of mostly neural-specific genes has the potential to encode thousands of transcripts each through extensive alternative promoter usage and RNA splicing. The magnitudes of splicing changes are larger between tissues than between developmental stages, and most sex-specific splicing is gonad-specific. Gonads express hundreds of previously unknown coding and long non-coding RNAs (lncRNAs), some of which are antisense to protein-coding genes and produce short regulatory RNAs. Furthermore, previously identified pervasive intergenic transcription occurs primarily within newly identified introns. The fly transcriptome is substantially more complex than previously recognized, with this complexity arising from combinatorial usage of promoters, splice sites and polyadenylation sites.
••
TL;DR: It is found that during preparation, while the monkey holds still, changes in motor cortical activity cancel out at the level of these population readouts, and motor cortex can thereby prepare the movement without prematurely causing it.
Abstract: Neural circuits must perform computations and then selectively output the results to other circuits. Yet synapses do not change radically at millisecond timescales. A key question then is: how is communication between neural circuits controlled? In motor control, brain areas directly involved in driving movement are active well before movement begins. Muscle activity is some readout of neural activity, yet it remains largely unchanged during preparation. Here we find that during preparation, while the monkey holds still, changes in motor cortical activity cancel out at the level of these population readouts. Motor cortex can thereby prepare the movement without prematurely causing it. Further, we found evidence that this mechanism also operates in dorsal premotor cortex, largely accounting for how preparatory activity is attenuated in primary motor cortex. Selective use of 'output-null' vs. 'output-potent' patterns of activity may thus help control communication to the muscles and between these brain areas.
•
TL;DR: An easy-to-use guideline evaluating histomorphology in mouse models of intestinal inflammation is provided, to facilitate classification and rating of new relevant models, and transfer of functional findings to comparable histopathologies in human disease.
Abstract: Histomorphology remains a powerful routine evaluating intestinal inflammation in animal models. Emphasizing the focus of a given animal study, histopathology can overstate differences between established models. We aimed to systematize histopathological evaluation of intestinal inflammation in mouse models facilitating inter-study comparisons. Samples of all parts of the intestinal tract from well-established mouse models of intestinal inflammation were evaluated from hematoxylin/eosin-stained sections and specific observations confirmed by subsequent immunohistochemistry. Three main categories sufficiently reflected the severity of histopathology independent of the localization and the overall extent of an inflammation: (i) quality and dimension of inflammatory cell infiltrates, (ii) epithelial changes and (iii) overall mucosal architecture. Scoring schemata were defined along specified criteria for each of the three categories. The direction of the initial hit proved crucial for the comparability of histological changes. Chemical noxes, infection with intestinal parasites or other models where the barrier was disturbed from outside, the luminal side, showed high levels of similarity and distinct differences to changes in the intestinal balance resulting from inside events like altered cytokine responses or disruption of the immune cell homeostasis. With a high degree of generalisation and maximum scores from 4-8 suitable scoring schemata accounted specific histopathological hallmarks. Truly integrating demands and experiences of gastroenterologists, mouse researchers, microbiologists and pathologists we provide an easy-to-use guideline evaluating histomorphology in mouse models of intestinal inflammation. Standard criteria and definitions facilitate classification and rating of new relevant models, allow comparison in animal studies and transfer of functional findings to comparable histopathologies in human disease.
••
TL;DR: It is argued that equitability is properly formalized by a self-consistency condition closely related to Data Processing Inequality, and shown that estimating mutual information provides a natural and practical method for equitably quantifying associations in large datasets.
Abstract: How should one quantify the strength of association between two random variables without bias for relationships of a specific form? Despite its conceptual simplicity, this notion of statistical "equitability" has yet to receive a definitive mathematical formalization. Here we argue that equitability is properly formalized by a self-consistency condition closely related to Data Processing Inequality. Mutual information, a fundamental quantity in information theory, is shown to satisfy this equitability criterion. These findings are at odds with the recent work of Reshef et al. [Reshef DN, et al. (2011) Science 334(6062):1518-1524], which proposed an alternative definition of equitability and introduced a new statistic, the "maximal information coefficient" (MIC), said to satisfy equitability in contradistinction to mutual information. These conclusions, however, were supported only with limited simulation evidence, not with mathematical arguments. Upon revisiting these claims, we prove that the mathematical definition of equitability proposed by Reshef et al. cannot be satisfied by any (nontrivial) dependence measure. We also identify artifacts in the reported simulation evidence. When these artifacts are removed, estimates of mutual information are found to be more equitable than estimates of MIC. Mutual information is also observed to have consistently higher statistical power than MIC. We conclude that estimating mutual information provides a natural (and often practical) way to equitably quantify statistical associations in large datasets.
••
TL;DR: In this article, the crystal structure of the intact heterotetrameric GluN1-GluN2B NMDA receptor ion channel at 4 angstroms was shown.
Abstract: N-Methyl-D-aspartate (NMDA) receptors belong to the family of ionotropic glutamate receptors, which mediate most excitatory synaptic transmission in mammalian brains. Calcium permeation triggered by activation of NMDA receptors is the pivotal event for initiation of neuronal plasticity. Here, we show the crystal structure of the intact heterotetrameric GluN1-GluN2B NMDA receptor ion channel at 4 angstroms. The NMDA receptors are arranged as a dimer of GluN1-GluN2B heterodimers with the twofold symmetry axis running through the entire molecule composed of an amino terminal domain (ATD), a ligand-binding domain (LBD), and a transmembrane domain (TMD). The ATD and LBD are much more highly packed in the NMDA receptors than non-NMDA receptors, which may explain why ATD regulates ion channel activity in NMDA receptors but not in non-NMDA receptors.
••
TL;DR: This work evaluated rat PPC neurons recorded during multisensory decisions and revealed that the network explored different dimensions during decision and movement, suggesting that a single network of neurons can support the evolving behavioral demands of decision-making.
Abstract: The posterior parietal cortex (PPC) receives diverse inputs and is involved in a dizzying array of behaviors These many behaviors could rely on distinct categories of neurons specialized to represent particular variables or could rely on a single population of PPC neurons that is leveraged in different ways To distinguish these possibilities, we evaluated rat PPC neurons recorded during multisensory decisions Newly designed tests revealed that task parameters and temporal response features were distributed randomly across neurons, without evidence of categories This suggests that PPC neurons constitute a dynamic network that is decoded according to the animal's present needs To test for an additional signature of a dynamic network, we compared moments when behavioral demands differed: decision and movement Our new state-space analysis revealed that the network explored different dimensions during decision and movement These observations suggest that a single network of neurons can support the evolving behavioral demands of decision-making
••
TL;DR: The CLARITY Challenge provides a comprehensive assessment of current practices for using genome sequencing to diagnose and report genetic diseases and reveals a general convergence of practices on most elements of the analysis and interpretation process.
Abstract: Background
There is tremendous potential for genome sequencing to improve clinical diagnosis and care once it becomes routinely accessible, but this will require formalizing research methods into clinical best practices in the areas of sequence data generation, analysis, interpretation and reporting. The CLARITY Challenge was designed to spur convergence in methods for diagnosing genetic disease starting from clinical case history and genome sequencing data. DNA samples were obtained from three families with heritable genetic disorders and genomic sequence data were donated by sequencing platform vendors. The challenge was to analyze and interpret these data with the goals of identifying disease-causing variants and reporting the findings in a clinically useful format. Participating contestant groups were solicited broadly, and an independent panel of judges evaluated their performance.
••
TL;DR: A flexible and modular approach is used to target intersectionally specified populations of inhibitory interneurons in mammalian hippocampus and neurons of the ventral tegmental area defined by both genetic and wiring properties, which may expand the application of genetically encoded interventional and observational tools for intact-systems biology.
Abstract: Precisely defining the roles of specific cell types is an intriguing frontier in the study of intact biological systems and has stimulated the rapid development of genetically encoded tools for observation and control. However, targeting these tools with adequate specificity remains challenging: most cell types are best defined by the intersection of two or more features such as active promoter elements, location and connectivity. Here we have combined engineered introns with specific recombinases to achieve expression of genetically encoded tools that is conditional upon multiple cell-type features, using Boolean logical operations all governed by a single versatile vector. We used this approach to target intersectionally specified populations of inhibitory interneurons in mammalian hippocampus and neurons of the ventral tegmental area defined by both genetic and wiring properties. This flexible and modular approach may expand the application of genetically encoded interventional and observational tools for intact-systems biology.
••
TL;DR: Intriguingly, the inheritance of lncRNA expression patterns in 105 recombinant inbred lines reveals apparent transgressive segregation, and maize lncRNAs are less affected by cis- than by trans-genetic factors.
Abstract: Background: Long non-coding RNAs (lncRNAs) are transcripts that are 200 bp or longer, do not encode proteins, and potentially play important roles in eukaryotic gene regulation. However, the number, characteristics and expression inheritance pattern of lncRNAs in maize are still largely unknown. Results: By exploiting available public EST databases, maize whole genome sequence annotation and RNA-seq datasets from 30 different experiments, we identified 20,163 putative lncRNAs. Of these lncRNAs, more than 90% are predicted to be the precursors of small RNAs, while 1,704 are considered to be high-confidence lncRNAs. High confidence lncRNAs have an average transcript length of 463 bp and genes encoding them contain fewer exons than annotated genes. By analyzing the expression pattern of these lncRNAs in 13 distinct tissues and 105 maize recombinant inbred lines, we show that more than 50% of the high confidence lncRNAs are expressed in a tissue-specific manner, a result that is supported by epigenetic marks. Intriguingly, the inheritance of lncRNA expression patterns in 105 recombinant inbred lines reveals apparent transgressive segregation, and maize lncRNAs are less affected by cis- than by trans-genetic factors. Conclusions: We integrate all available transcriptomic datasets to identify a comprehensive set of maize lncRNAs, provide a unique annotation resource of the maize genome and a genome-wide characterization of maize lncRNAs, and explore the genetic control of their expression using expression quantitative trait locus mapping. Background While the central dogma defines the primary role for RNA as a messenger molecule in the process of gene expression, there is ample evidence for additional functions of RNA molecules. These RNA molecules include small nuclear RNAs (snRNAs), small nucleolar RNAs (snoRNAs; mainly tRNAs and rRNAs), signal recognition particle (7SL/SRP) RNAs, microRNAs (miRNAs), small interfering RNAs (siRNAs), piwi RNAs (piRNAs) and trans-acting siRNAs (ta-siRNAs), natural cis-acting siRNAs and long noncoding RNAs (lncRNAs). lncRNAs have been arbitrarily defined as non-protein coding RNAs more than 200 bp in length, distinguishing them from short noncoding RNAs such as miRNAs and
••
French Institute of Health and Medical Research1, Katholieke Universiteit Leuven2, Ben-Gurion University of the Negev3, German Cancer Research Center4, Istituto Superiore di Sanità5, University of Turin6, Georgia Regents University7, University of Oxford8, Cardiff University9, Oregon Health & Science University10, Yale University11, Cold Spring Harbor Laboratory12, University of Pennsylvania13, University of Burgundy14, Georgetown University15, National Institutes of Health16, University of Miami17, Icahn School of Medicine at Mount Sinai18, GlaxoSmithKline19, University of Paris20, Centre national de la recherche scientifique21, Heidelberg University22, University of Pittsburgh23, Karolinska Institutet24, Hamad Medical Corporation25, University of Sheffield26, Centre Hospitalier Universitaire de Toulouse27, University of Lausanne28, University of Milan29, University of Navarra30, Leiden University31, University of Texas Health Science Center at Houston32, Istituto Giannina Gaslini33, Roswell Park Cancer Institute34, University of California, San Francisco35, Northwestern University36, Johns Hopkins University37, Thomas Jefferson University38, Main Line Health39, University of Buenos Aires40, Memorial Sloan Kettering Cancer Center41, University of Chicago42, Martin Luther University of Halle-Wittenberg43, Mie University44, University of Lisbon45, University of Queensland46, QIMR Berghofer Medical Research Institute47, Ludwig Institute for Cancer Research48, University of Connecticut49, University of Nebraska Medical Center50, Université catholique de Louvain51, Mayo Clinic52, Technische Universität München53, University of South Florida54, Cornell University55, University of Michigan56
TL;DR: A critical, integrated classification of anticancer immunotherapies is proposed and the clinical relevance of these approaches is discussed.
Abstract: During the past decades, anticancer immunotherapy has evolved from a promising therapeutic option to a robust clinical reality. Many immunotherapeutic regimens are now approved by the US Food and Drug Administration and the European Medicines Agency for use in cancer patients, and many others are being investigated as standalone therapeutic interventions or combined with conventional treatments in clinical studies. Immunotherapies may be subdivided into "passive" and "active" based on their ability to engage the host immune system against cancer. Since the anticancer activity of most passive immunotherapeutics (including tumor-targeting monoclonal antibodies) also relies on the host immune system, this classification does not properly reflect the complexity of the drug-host-tumor interaction. Alternatively, anticancer immunotherapeutics can be classified according to their antigen specificity. While some immunotherapies specifically target one (or a few) defined tumor-associated antigen(s), others operate in a relatively non-specific manner and boost natural or therapy-elicited anticancer immune responses of unknown and often broad specificity. Here, we propose a critical, integrated classification of anticancer immunotherapies and discuss the clinical relevance of these approaches.
••
TL;DR: The number of genetic loci that underlie ASDs and the relative contributions from different mutational classes are considered, and possible mechanisms by which these mutations might lead to dysfunction are discussed.
Abstract: In the past few years, there have been rapid advances in the identification of the genetic components of autism spectrum disorders, particularly in the form of de novo mutations. Here, the authors review these developments in light of genetic models for autism spectrum disorders.
••
TL;DR: This is a peak time for clinical onset of most mental illnesses and the highest single source of global economic burden in the world for mental health costs.
Abstract: The adolescent brain is more “plastic” than it will ever be again, capable of remarkable adaptability in light of the many social, physical, sexual, and intellectual challenges that this developmental phase brings. This is also a peak time for clinical onset of most mental illnesses (see the chart) ( 1 ). One in five adolescents have a mental illness that will persist into adulthood ( 2 ). Mental illnesses that emerge before adulthood impose a 10-fold higher cost than those that emerge later in life ( 3 ). Mental health costs are the highest single source of global economic burden in the world ( 4 ).
••
TL;DR: Al Alteration of Rb fox targets in some autistic brains is correlated with downregulation of all three Rbfox proteins, supporting the potential clinical relevance of the splicing-regulatory network.
••
TL;DR: It is suggested that perturbations in genes, which function in the epigenetic regulation of brain development and cognition, could have a central role in the susceptibility to, pathogenesis and treatment of mental disorders.
Abstract: Schizophrenia is a serious psychiatric disorder with a broadly undiscovered genetic etiology. Recent studies of de novo mutations (DNMs) in schizophrenia and autism have reinforced the hypothesis that rare genetic variation contributes to risk. We carried out exome sequencing on 57 trios with sporadic or familial schizophrenia. In sporadic trios, we observed a ~3.5-fold increase in the proportion of nonsense DNMs (0.101 vs 0.031, empirical P=0.01, Benjamini-Hochberg-corrected P=0.044). These mutations were significantly more likely to occur in genes with highly ranked probabilities of haploinsufficiency (P=0.0029, corrected P=0.006). DNMs of potential functional consequence were also found to occur in genes predicted to be less tolerant to rare variation (P=2.01 × 10(-)(5), corrected P=2.1 × 10(-)(3)). Genes with DNMs overlapped with genes implicated in autism (for example, AUTS2, CHD8 and MECP2) and intellectual disability (for example, HUWE1 and TRAPPC9), supporting a shared genetic etiology between these disorders. Functionally CHD8, MECP2 and HUWE1 converge on epigenetic regulation of transcription suggesting that this may be an important risk mechanism. Our results were consistent in an analysis of additional exome-based sequencing studies of other neurodevelopmental disorders. These findings suggest that perturbations in genes, which function in the epigenetic regulation of brain development and cognition, could have a central role in the susceptibility to, pathogenesis and treatment of mental disorders.
••
TL;DR: High-throughput sequencing assays on the transcriptome and epigenome reveal that, in general, differences dominate similarities between the two species, and indicate that there is considerable RNA expression diversity between humans and mice.
Abstract: Although the similarities between humans and mice are typically highlighted, morphologically and genetically, there are many differences. To better understand these two species on a molecular level, we performed a comparison of the expression profiles of 15 tissues by deep RNA sequencing and examined the similarities and differences in the transcriptome for both protein-coding and -noncoding transcripts. Although commonalities are evident in the expression of tissue-specific genes between the two species, the expression for many sets of genes was found to be more similar in different tissues within the same species than between species. These findings were further corroborated by associated epigenetic histone mark analyses. We also find that many noncoding transcripts are expressed at a low level and are not detectable at appreciable levels across individuals. Moreover, the majority lack obvious sequence homologs between species, even when we restrict our attention to those which are most highly reproducible across biological replicates. Overall, our results indicate that there is considerable RNA expression diversity between humans and mice, well beyond what was described previously, likely reflecting the fundamental physiological differences between these two organisms.
••
TL;DR: The advantages and challenges associated with the current techniques for studying the in vivo function of lncRNAs in the light of different models of lNCRNA molecular mechanism are discussed, and the design of experiments to mutate lncRNA loci is reflected on.
Abstract: Although a small number of the vast array of animal long non-coding RNAs (lncRNAs) have known effects on cellular processes examined in vitro, the extent of their contributions to normal cell processes throughout development, differentiation and disease for the most part remains less clear. Phenotypes arising from deletion of an entire genomic locus cannot be unequivocally attributed either to the loss of the lncRNA per se or to the associated loss of other overlapping DNA regulatory elements. The distinction between cis- or trans-effects is also often problematic. We discuss the advantages and challenges associated with the current techniques for studying the in vivo function of lncRNAs in the light of different models of lncRNA molecular mechanism, and reflect on the design of experiments to mutate lncRNA loci. These considerations should assist in the further investigation of these transcriptional products of the genome.
••
Yale University1, Dartmouth College2, University of California, Berkeley3, Lawrence Berkeley National Laboratory4, Cold Spring Harbor Laboratory5, University of Washington6, University of California, Los Angeles7, University of Connecticut Health Center8, Pompeu Fabra University9, Harvard University10, Indiana University11, Tsinghua University12, National Institutes of Health13, Wellcome Trust Sanger Institute14, University of Lausanne15, Swiss Institute of Bioinformatics16, King's College London17, Kettering University18, Carnegie Mellon University19, Vanderbilt University20, University of California, Irvine21, Howard Hughes Medical Institute22, European Bioinformatics Institute23, University of Vienna24, CAS-MPG Partner Institute for Computational Biology25, The Chinese University of Hong Kong26
TL;DR: It is found in all three organisms that the gene-expression levels, both coding and non-coding, can be quantitatively predicted from chromatin features at the promoter using a ‘universal model’ based on a single set of organism-independent parameters.
Abstract: The transcriptome is the readout of the genome. Identifying common features in it across distant species can reveal fundamental principles. To this end, the ENCODE and modENCODE consortia have generated large amounts of matched RNA-sequencing data for human, worm and fly. Uniform processing and comprehensive annotation of these data allow comparison across metazoan phyla, extending beyond earlier within-phylum transcriptome comparisons and revealing ancient, conserved features. Specifically, we discover co-expression modules shared across animals, many of which are enriched in developmental genes. Moreover, we use expression patterns to align the stages in worm and fly development and find a novel pairing between worm embryo and fly pupae, in addition to the embryo-to-embryo and larvae-to-larvae pairings. Furthermore, we find that the extent of non-canonical, non-coding transcription is similar in each organism, per base pair. Finally, we find in all three organisms that the gene-expression levels, both coding and non-coding, can be quantitatively predicted from chromatin features at the promoter using a 'universal model' based on a single set of organism-independent parameters.
••
TL;DR: It is shown that easiRNAs in Arabidopsis thaliana resemble secondary siRNAs, in that thousands of transposon transcripts are specifically targeted by more than 50 miRNAs for cleavage and processing by RDR6.
Abstract: In plants, post-transcriptional gene silencing (PTGS) is mediated by DICER-LIKE 1 (DCL1)-dependent microRNAs (miRNAs), which also trigger 21-nucleotide secondary short interfering RNAs (siRNAs) via RNA-DEPENDENT RNA POLYMERASE 6 (RDR6), DCL4 and ARGONAUTE 1 (AGO1), whereas transcriptional gene silencing (TGS) of transposons is mediated by 24-nucleotide heterochromatic (het)siRNAs, RDR2, DCL3 and AGO4 (ref. 4). Transposons can also give rise to abundant 21-nucleotide 'epigenetically activated' small interfering RNAs (easiRNAs) in DECREASED DNA METHYLATION 1 (ddm1) and DNA METHYLTRANSFERASE 1 (met1) mutants, as well as in the vegetative nucleus of pollen grains and in dedifferentiated plant cell cultures. Here we show that easiRNAs in Arabidopsis thaliana resemble secondary siRNAs, in that thousands of transposon transcripts are specifically targeted by more than 50 miRNAs for cleavage and processing by RDR6. Loss of RDR6, DCL4 or DCL1 in a ddm1 background results in loss of 21-nucleotide easiRNAs and severe infertility, but 24-nucleotide hetsiRNAs are partially restored, supporting an antagonistic relationship between PTGS and TGS. Thus miRNA-directed easiRNA biogenesis is a latent mechanism that specifically targets transposon transcripts, but only when they are epigenetically reactivated during reprogramming of the germ line. This ancient recognition mechanism may have been retained both by transposons to evade long-term heterochromatic silencing and by their hosts for genome defence.
••
TL;DR: A novel mechanism of allosteric inhibition that targets the C-terminal, non-catalytic segment of PTP1B, and presents the first ensemble structure of P TP1B containing this intrinsically disordered segment, within which a binding site for the small molecule inhibitor, MSI-1436 is identified.
Abstract: PTP1B, a validated therapeutic target for diabetes and obesity, has a critical positive role in HER2 signaling in breast tumorigenesis. Efforts to develop therapeutic inhibitors of PTP1B have been frustrated by the chemical properties of the active site. We define a new mechanism of allosteric inhibition that targets the C-terminal, noncatalytic segment of PTP1B. We present what is to our knowledge the first ensemble structure of PTP1B containing this intrinsically disordered segment, within which we identified a binding site for the small-molecule inhibitor MSI-1436. We demonstrate binding to a second site close to the catalytic domain, with cooperative effects between the two sites locking PTP1B in an inactive state. MSI-1436 antagonized HER2 signaling, inhibited tumorigenesis in xenografts and abrogated metastasis in the NDL2 mouse model of breast cancer, validating inhibition of PTP1B as a therapeutic strategy in breast cancer. This new approach to inhibition of PTP1B emphasizes the potential of disordered segments of proteins as specific binding sites for therapeutic small molecules.