Showing papers by "Michael Snyder published in 2014"

PDF

Open Access

Journal Article•DOI•

A comparative encyclopedia of DNA elements in the mouse genome

[...]

Feng Yue¹, Feng Yue², Yong Cheng³, Alessandra Breschi, Jeff Vierstra⁴, Weisheng Wu⁵, Weisheng Wu¹, Tyrone Ryba⁶, Tyrone Ryba⁷, Richard Sandstrom⁴, Zhihai Ma³, Carrie A. Davis⁸, Benjamin D. Pope⁶, Yin Shen², Dmitri D. Pervouchine, Sarah Djebali, Robert E. Thurman⁴, Rajinder Kaul⁴, Eric Rynes⁴, Anthony Kirilusha⁹, Georgi K. Marinov⁹, Brian A. Williams⁹, Diane Trout⁹, Henry Amrhein⁹, Katherine I. Fisher-Aylor⁹, Igor Antoshechkin⁹, Gilberto DeSalvo⁹, Lei Hoon See⁸, Meagan Fastuca⁸, Jorg Drenkow⁸, Chris Zaleski⁸, Alexander Dobin⁸, Pablo Prieto, Julien Lagarde, Giovanni Bussotti, Andrea Tanzer¹⁰, Olgert Denas¹¹, Kanwei Li¹¹, M. A. Bender¹², M. A. Bender⁴, Miaohua Zhang¹², Rachel Byron¹², Mark Groudine⁴, Mark Groudine¹², David McCleary², Long Pham², Zhen Ye², Samantha Kuan², Lee Edsall², Yi-Chieh Wu¹³, Matthew D. Rasmussen¹³, Mukul S. Bansal¹³, Manolis Kellis¹³, Manolis Kellis¹⁴, Cheryl A. Keller¹, Christapher S. Morrissey¹, Tejaswini Mishra¹, Deepti Jain¹, Nergiz Dogan¹, Robert S. Harris¹, Philip Cayting³, Trupti Kawli³, Alan P. Boyle³, Alan P. Boyle⁵, Ghia Euskirchen³, Anshul Kundaje³, Shin Lin³, Yiing Lin³, Camden Jansen¹⁵, Venkat S. Malladi³, Melissa S. Cline¹⁶, Drew T. Erickson³, Vanessa M. Kirkup¹⁶, Katrina Learned¹⁶, Cricket A. Sloan³, Kate R. Rosenbloom¹⁶, Beatriz Lacerda de Sousa¹⁷, Kathryn Beal, Miguel Pignatelli, Paul Flicek, Jin Lian¹⁸, Tamer Kahveci¹⁹, Dongwon Lee²⁰, W. James Kent¹⁶, Miguel Santos¹⁷, Javier Herrero²¹, Cedric Notredame, Audra K. Johnson⁴, Shinny Vong⁴, Kristen Lee⁴, Daniel Bates⁴, Fidencio Neri⁴, Morgan Diegel⁴, Theresa K. Canfield⁴, Peter J. Sabo⁴, Matthew S. Wilken⁴, Thomas A. Reh⁴, Erika Giste⁴, Anthony Shafer⁴, Tanya Kutyavin⁴, Eric Haugen⁴, Douglas Dunn⁴, Alex Reynolds⁴, Shane Neph⁴, Richard Humbert⁴, R. Scott Hansen⁴, Marella F. T. R. de Bruijn²², Licia Selleri²³, Alexander Y. Rudensky²⁴, Steven Z. Josefowicz²⁴, Robert M. Samstein²⁴, Evan E. Eichler⁴, Stuart H. Orkin²⁵, Dana N. Levasseur²⁶, Thalia Papayannopoulou⁴, Kai Hsin Chang⁴, Arthur I. Skoultchi²⁷, Srikanta Gosh²⁷, Christine M. Disteche⁴, Piper M. Treuting⁴, Yanli Wang¹, Mitchell J. Weiss, Gerd A. Blobel²⁸, Xiaoyi Cao², Sheng Zhong², Ting Wang²⁹, Peter J. Good³⁰, Rebecca F. Lowdon³⁰, Rebecca F. Lowdon²⁹, Leslie B. Adams³⁰, Leslie B. Adams³¹, Xiao Qiao Zhou³⁰, Michael J. Pazin³⁰, Elise A. Feingold³⁰, Barbara J. Wold⁹, James Taylor¹¹, Ali Mortazavi¹⁵, Sherman M. Weissman¹⁸, John A. Stamatoyannopoulos⁴, Michael Snyder³, Roderic Guigó, Thomas R. Gingeras⁸, David M. Gilbert⁶, Ross C. Hardison¹, Michael A. Beer²⁰, Bing Ren² - Show less +142 more•Institutions (31)

Pennsylvania State University¹, University of California, San Diego², Stanford University³, University of Washington⁴, University of Michigan⁵, Florida State University⁶, New College of Florida⁷, Cold Spring Harbor Laboratory⁸, California Institute of Technology⁹, University of Vienna¹⁰, Emory University¹¹, Fred Hutchinson Cancer Research Center¹², Massachusetts Institute of Technology¹³, Broad Institute¹⁴, University of California, Irvine¹⁵, University of California, Santa Cruz¹⁶, University of California, San Francisco¹⁷, Yale University¹⁸, University of Florida¹⁹, Johns Hopkins University²⁰, University College London²¹, University of Oxford²², Cornell University²³, Memorial Sloan Kettering Cancer Center²⁴, Harvard University²⁵, University of Iowa²⁶, Yeshiva University²⁷, University of Pennsylvania²⁸, Washington University in St. Louis²⁹, National Institutes of Health³⁰, University of North Carolina at Chapel Hill³¹

20 Nov 2014-Nature

TL;DR: The mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications and replication domains throughout the mouse genome in diverse cell and tissue types as mentioned in this paper.

...read moreread less

Abstract: The laboratory mouse shares the majority of its protein-coding genes with humans, making it the premier model organism in biomedical research, yet the two mammals differ in significant ways To gain greater insights into both shared and species-specific transcriptional and cellular regulatory programs in the mouse, the Mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications and replication domains throughout the mouse genome in diverse cell and tissue types By comparing with the human genome, we not only confirm substantial conservation in the newly annotated potential functional sequences, but also find a large degree of divergence of sequences involved in transcriptional regulation, chromatin state and higher order chromatin organization Our results illuminate the wide range of evolutionary forces acting on genes and their regulatory regions, and provide a general resource for research into mammalian biology and mechanisms of human diseases

...read moreread less

1,335 citations

Journal Article•DOI•

Proteogenomic characterization of human colon and rectal cancer

[...]

Bing Zhang¹, Jing Wang¹, Xiaojing Wang¹, Jing Zhu¹, Qi Liu¹, Zhiao Shi¹, Matthew C. Chambers¹, Lisa J. Zimmerman¹, Kent Shaddox¹, Sangtae Kim², Sherri R. Davies³, Sean Wang⁴, Pei Wang⁵, Christopher R. Kinsinger⁶, Robert Rivers⁶, Henry Rodriguez⁶, R. Reid Townsend³, Matthew J. Ellis³, Steven A. Carr⁷, Steven A. Carr⁸, David L. Tabb¹, Robert J. Coffey¹, Robbert J.C. Slebos¹, Daniel C. Liebler¹, Michael A. Gillette⁸, Karl R. Klauser⁸, Eric Kuhn⁸, D. R. Mani⁸, Philipp Mertins⁸, Karen A. Ketchum, Amanda G. Paulovich⁴, Jeffrey R. Whiteaker⁴, Nathan Edwards⁹, Peter B. McGarvey⁹, Subha Madhavan⁹, Daniel W. Chan¹⁰, Akhilesh Pandey¹⁰, Ie Ming Shih¹⁰, Hui Zhang¹⁰, Zhen Zhang¹⁰, Heng Zhu¹⁰, Gordon Whiteley¹¹, Steven J. Skates⁸, Forest M. White⁷, Douglas A. Levine¹², Emily S. Boja⁶, Tara Hiltke⁶, Mehdi Mesri⁶, Kenna M. Shaw⁶, Stephen E. Stein¹³, David Fenyö¹⁴, Tao Liu², Jason E. McDermott², Samuel H. Payne², Karin D. Rodland², Richard D. Smith², Paul A. Rudnick, Michael Snyder¹⁵, Yingming Zhao¹⁶, Xian Chen¹⁷, David F. Ransohoff¹⁷, Andrew N. Hoofnagle¹⁸, Melinda E. Sanders¹, Yue Wang¹⁹, Li Ding³ - Show less +61 more•Institutions (19)

Vanderbilt University¹, Pacific Northwest National Laboratory², Washington University in St. Louis³, Fred Hutchinson Cancer Research Center⁴, Icahn School of Medicine at Mount Sinai⁵, National Institutes of Health⁶, Massachusetts Institute of Technology⁷, Harvard University⁸, Georgetown University⁹, Johns Hopkins University¹⁰, Leidos¹¹, Memorial Sloan Kettering Cancer Center¹², National Institute of Standards and Technology¹³, New York University¹⁴, Stanford University¹⁵, University of Chicago¹⁶, University of North Carolina at Chapel Hill¹⁷, University of Washington¹⁸, Virginia Tech¹⁹

18 Sep 2014-Nature

TL;DR: Integrated proteogenomic analysis provides functional context to interpret genomic abnormalities and affords a new paradigm for understanding cancer biology.

...read moreread less

Abstract: Extensive genomic characterization of human cancers presents the problem of inference from genomic abnormalities to cancer phenotypes. To address this problem, we analysed proteomes of colon and rectal tumours characterized previously by The Cancer Genome Atlas (TCGA) and perform integrated proteogenomic analyses. Somatic variants displayed reduced protein abundance compared to germline variants. Messenger RNA transcript abundance did not reliably predict protein abundance differences between tumours. Proteomics identified five proteomic subtypes in the TCGA cohort, two of which overlapped with the TCGA 'microsatellite instability/CpG island methylation phenotype' transcriptomic subtype, but had distinct mutation, methylation and protein expression patterns associated with different clinical outcomes. Although copy number alterations showed strong cis- and trans-effects on mRNA abundance, relatively few of these extend to the protein level. Thus, proteomics data enabled prioritization of candidate driver genes. The chromosome 20q amplicon was associated with the largest global changes at both mRNA and protein levels; proteomics data highlighted potential 20q candidates, including HNF4A (hepatocyte nuclear factor 4, alpha), TOMM34 (translocase of outer mitochondrial membrane 34) and SRC (SRC proto-oncogene, non-receptor tyrosine kinase). Integrated proteogenomic analysis provides functional context to interpret genomic abnormalities and affords a new paradigm for understanding cancer biology.

...read moreread less

1,183 citations

Journal Article•DOI•

Topologically associating domains are stable units of replication-timing regulation

[...]

Benjamin D. Pope¹, Tyrone Ryba², Vishnu Dileep¹, Feng Yue³, Weisheng Wu³, Olgert Denas⁴, Daniel L. Vera¹, Yanli Wang³, R. Scott Hansen⁵, Theresa K. Canfield⁵, Robert E. Thurman⁵, Yong Cheng⁶, Günhan Gülsoy⁷, Jonathan H. Dennis¹, Michael Snyder⁶, John A. Stamatoyannopoulos⁵, James Taylor⁴, Ross C. Hardison³, Tamer Kahveci⁷, Bing Ren⁸, David M. Gilbert¹ - Show less +17 more•Institutions (8)

Florida State University¹, New College of Florida², Pennsylvania State University³, Emory University⁴, University of Washington⁵, Stanford University⁶, University of Florida⁷, University of California, San Diego⁸

20 Nov 2014-Nature

TL;DR: It is demonstrated that, collectively, replication domain boundaries share a near one-to-one correlation with TAD boundaries, whereas within a cell type, adjacent TADs that replicate at similar times obscure replicationdomain boundaries, largely accounting for the previously reported lack of alignment.

...read moreread less

Abstract: Eukaryotic chromosomes replicate in a temporal order known as the replication-timing program. In mammals, replication timing is cell-type-specific with at least half the genome switching replication timing during development, primarily in units of 400-800 kilobases ('replication domains'), whose positions are preserved in different cell types, conserved between species, and appear to confine long-range effects of chromosome rearrangements. Early and late replication correlate, respectively, with open and closed three-dimensional chromatin compartments identified by high-resolution chromosome conformation capture (Hi-C), and, to a lesser extent, late replication correlates with lamina-associated domains (LADs). Recent Hi-C mapping has unveiled substructure within chromatin compartments called topologically associating domains (TADs) that are largely conserved in their positions between cell types and are similar in size to replication domains. However, TADs can be further sub-stratified into smaller domains, challenging the significance of structures at any particular scale. Moreover, attempts to reconcile TADs and LADs to replication-timing data have not revealed a common, underlying domain structure. Here we localize boundaries of replication domains to the early-replicating border of replication-timing transitions and map their positions in 18 human and 13 mouse cell types. We demonstrate that, collectively, replication domain boundaries share a near one-to-one correlation with TAD boundaries, whereas within a cell type, adjacent TADs that replicate at similar times obscure replication domain boundaries, largely accounting for the previously reported lack of alignment. Moreover, cell-type-specific replication timing of TADs partitions the genome into two large-scale sub-nuclear compartments revealing that replication-timing transitions are indistinguishable from late-replicating regions in chromatin composition and lamina association and accounting for the reduced correlation of replication timing to LADs and heterochromatin. Our results reconcile cell-type-specific sub-nuclear compartmentalization and replication timing with developmentally stable structural domains and offer a unified model for large-scale chromosome structure and function.

...read moreread less

783 citations

Journal Article•DOI•

Defining functional DNA elements in the human genome

[...]

Manolis Kellis¹, Barbara J. Wold², Michael Snyder³, Bradley E. Bernstein⁴, Anshul Kundaje⁵, Georgi K. Marinov², Lucas D. Ward⁵, Ewan Birney, Gregory E. Crawford⁶, Job Dekker⁷, Ian Dunham, Laura Elnitski⁸, Peggy J. Farnham⁹, Elise A. Feingold⁸, Mark Gerstein¹⁰, Morgan C. Giddings, David M. Gilbert¹¹, Thomas R. Gingeras¹², Eric D. Green⁸, Roderic Guigó, Tim Hubbard¹³, Jim Kent¹⁴, Jason D. Lieb¹⁵, Richard M. Myers, Michael J. Pazin⁸, Bing Ren¹⁶, John A. Stamatoyannopoulos¹⁷, Zhiping Weng⁷, Kevin P. White¹⁸, Ross C. Hardison¹⁹ - Show less +26 more•Institutions (19)

Massachusetts Institute of Technology¹, California Institute of Technology², Stanford University³, Harvard University⁴, Broad Institute⁵, Duke University⁶, University of Massachusetts Medical School⁷, National Institutes of Health⁸, University of Southern California⁹, Yale University¹⁰, Florida State University¹¹, Cold Spring Harbor Laboratory¹², Wellcome Trust Sanger Institute¹³, University of California, Santa Cruz¹⁴, Princeton University¹⁵, University of California, San Diego¹⁶, University of Washington¹⁷, University of Chicago¹⁸, Pennsylvania State University¹⁹

29 Apr 2014-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: The strengths and limitations of biochemical, evolutionary, and genetic approaches for defining functional DNA segments, potential sources for the observed differences in estimated genomic coverage, and the biological implications of these discrepancies are reviewed.

...read moreread less

Abstract: With the completion of the human genome sequence, attention turned to identifying and annotating its functional DNA elements. As a complement to genetic and comparative genomics approaches, the Encyclopedia of DNA Elements Project was launched to contribute maps of RNA transcripts, transcriptional regulator binding sites, and chromatin states in many cell types. The resulting genome-wide data reveal sites of biochemical activity with high positional resolution and cell type specificity that facilitate studies of gene regulation and interpretation of noncoding variants associated with human disease. However, the biochemically active regions cover a much larger fraction of the genome than do evolutionarily conserved regions, raising the question of whether nonconserved but biochemically active regions are truly functional. Here, we review the strengths and limitations of biochemical, evolutionary, and genetic approaches for defining functional DNA segments, potential sources for the observed differences in estimated genomic coverage, and the biological implications of these discrepancies. We also analyze the relationship between signal intensity, genomic coverage, and evolutionary conservation. Our results reinforce the principle that each approach provides complementary information and that we need to use combinations of all three to elucidate genome function in human biology and disease.

...read moreread less

691 citations

Journal Article•DOI•

Landscape and variation of RNA secondary structure across the human transcriptome

[...]

Yue Wan¹, Kun Qu¹, Qiangfeng Cliff Zhang¹, Ryan A. Flynn¹, Ohad Manor², Zhengqing Ouyang¹, Jiajing Zhang¹, Robert C. Spitale¹, Michael Snyder¹, Eran Segal², Howard Y. Chang¹ - Show less +7 more•Institutions (2)

Stanford University¹, Weizmann Institute of Science²

30 Jan 2014-Nature

TL;DR: The initial landscape and variation of RNA secondary structures (RSSs) in a human family trio (mother, father and their child) is reported, which provides a comprehensive RSS map of human coding and non-coding RNAs.

...read moreread less

Abstract: In parallel to the genetic code for protein synthesis, a second layer of information is embedded in all RNA transcripts in the form of RNA structure. RNA structure influences practically every step in the gene expression program. However, the nature of most RNA structures or effects of sequence variation on structure are not known. Here we report the initial landscape and variation of RNA secondary structures (RSSs) in a human family trio (mother, father and their child). This provides a comprehensive RSS map of human coding and non-coding RNAs. We identify unique RSS signatures that demarcate open reading frames and splicing junctions, and define authentic microRNA-binding sites. Comparison of native deproteinized RNA isolated from cells versus refolded purified RNA suggests that the majority of the RSS information is encoded within RNA sequence. Over 1,900 transcribed single nucleotide variants (approximately 15% of all transcribed single nucleotide variants) alter local RNA structure. We discover simple sequence and spacing rules that determine the ability of point mutations to impact RSSs. Selective depletion of 'riboSNitches' versus structurally synonymous variants at precise locations suggests selection for specific RNA shapes at thousands of sites, including 3' untranslated regions, binding sites of microRNAs and RNA-binding proteins genome-wide. These results highlight the potentially broad contribution of RNA structure and its variation to gene regulation.

...read moreread less

512 citations

Journal Article•DOI•

Clinical Interpretation and Implications of Whole-Genome Sequencing

[...]

Frederick E. Dewey, Megan E. Grove, Cuiping Pan¹, Benjamin A. Goldstein¹, Jonathan A. Bernstein¹, Hassan Chaib¹, Jason D. Merker¹, Rachel L. Goldfeder¹, Gregory M. Enns¹, Sean P. David¹, Neda Pakdaman¹, Kelly E. Ormond¹, Colleen Caleshu, Kerry Kingham¹, Teri E. Klein¹, Michelle Whirl-Carrillo¹, Kenneth Sakamoto¹, Matthew T. Wheeler, Atul J. Butte¹, James M. Ford¹, Linda M. Boxer¹, John P. A. Ioannidis, Alan C. Yeung², Alan C. Yeung¹, Russ B. Altman¹, Themistocles L. Assimes², Themistocles L. Assimes¹, Michael Snyder¹, Michael Snyder², Euan A. Ashley, Thomas Quertermous - Show less +27 more•Institutions (2)

Stanford University¹, Cardiovascular Institute of the South²

12 Mar 2014-JAMA

TL;DR: The use of WGS was associated with incomplete coverage of inherited disease genes, low reproducibility of detection of genetic variation with the highest potential clinical effects, and uncertainty about clinically reportable findings.

...read moreread less

Abstract: Importance Whole-genome sequencing (WGS) is increasingly applied in clinical medicine and is expected to uncover clinically significant findings regardless of sequencing indication. Objectives To examine coverage and concordance of clinically relevant genetic variation provided by WGS technologies; to quantitate inherited disease risk and pharmacogenomic findings in WGS data and resources required for their discovery and interpretation; and to evaluate clinical action prompted by WGS findings. Design, Setting, and Participants An exploratory study of 12 adult participants recruited at Stanford University Medical Center who underwent WGS between November 2011 and March 2012. A multidisciplinary team reviewed all potentially reportable genetic findings. Five physicians proposed initial clinical follow-up based on the genetic findings. Main Outcomes and Measures Genome coverage and sequencing platform concordance in different categories of genetic disease risk, person-hours spent curating candidate disease-risk variants, interpretation agreement between trained curators and disease genetics databases, burden of inherited disease risk and pharmacogenomic findings, and burden and interrater agreement of proposed clinical follow-up. Results Depending on sequencing platform, 10% to 19% of inherited disease genes were not covered to accepted standards for single nucleotide variant discovery. Genotype concordance was high for previously described single nucleotide genetic variants (99%-100%) but low for small insertion/deletion variants (53%-59%). Curation of 90 to 127 genetic variants in each participant required a median of 54 minutes (range, 5-223 minutes) per genetic variant, resulted in moderate classification agreement between professionals (Gross κ, 0.52; 95% CI, 0.40-0.64), and reclassified 69% of genetic variants cataloged as disease causing in mutation databases to variants of uncertain or lesser significance. Two to 6 personal disease-risk findings were discovered in each participant, including 1 frameshift deletion in the BRCA1 gene implicated in hereditary breast and ovarian cancer. Physician review of sequencing findings prompted consideration of a median of 1 to 3 initial diagnostic tests and referrals per participant, with fair interrater agreement about the suitability of WGS findings for clinical follow-up (Fleiss κ, 0.24; P Conclusions and Relevance In this exploratory study of 12 volunteer adults, the use of WGS was associated with incomplete coverage of inherited disease genes, low reproducibility of detection of genetic variation with the highest potential clinical effects, and uncertainty about clinically reportable findings. In certain cases, WGS will identify clinically actionable genetic variants warranting early medical intervention. These issues should be considered when determining the role of WGS in clinical medicine.

...read moreread less

413 citations

Journal Article•DOI•

Widespread contribution of transposable elements to the innovation of gene regulatory networks

[...]

Vasavi Sundaram¹, Yong Cheng², Zhihai Ma², Daofeng Li¹, Xiaoyun Xing¹, Peter Edge³, Michael Snyder², Ting Wang¹ - Show less +4 more•Institutions (3)

Washington University in St. Louis¹, Stanford University², University of Minnesota³

01 Dec 2014-Genome Research

TL;DR: Transposable elements have significantly and continuously shaped gene regulatory networks during mammalian evolution, and are an important driving force for regulatory innovation.

...read moreread less

Abstract: Transposable elements (TEs) have been shown to contain functional binding sites for certain transcription factors (TFs). However, the extent to which TEs contribute to the evolution of TF binding sites is not well known. We comprehensively mapped binding sites for 26 pairs of orthologous TFs in two pairs of human and mouse cell lines (representing two cell lineages), along with epigenomic profiles, including DNA methylation and six histone modifications. Overall, we found that 20% of binding sites were embedded within TEs. This number varied across different TFs, ranging from 2% to 40%. We further identified 710 TF–TE relationships in which genomic copies of a TE subfamily contributed a significant number of binding peaks for a TF, and we found that LTR elements dominated these relationships in human. Importantly, TE-derived binding peaks were strongly associated with open and active chromatin signatures, including reduced DNA methylation and increased enhancer-associated histone marks. On average, 66% of TE-derived binding events were cell type-specific with a cell type-specific epigenetic landscape. Most of the binding sites contributed by TEs were species-specific, but we also identified binding sites conserved between human and mouse, the functional relevance of which was supported by a signature of purifying selection on DNA sequences of these TEs. Interestingly, several TFs had significantly expanded binding site landscapes only in one species, which were linked to species-specific gene functions, suggesting that TEs are an important driving force for regulatory innovation. Taken together, our data suggest that TEs have significantly and continuously shaped gene regulatory networks during mammalian evolution.

...read moreread less

388 citations

Journal Article•DOI•

H3K4me3 Breadth Is Linked to Cell Identity and Transcriptional Consistency

[...]

Bérénice A. Benayoun¹, Elizabeth A. Pollina¹, Duygu Ucar¹, Salah Mahmoudi¹, Kalpana Karra¹, Edith D. Wong¹, Keerthana Devarajan¹, Aaron Daugherty¹, Anshul Kundaje¹, Elena Mancini¹, Benjamin C. Hitz¹, Rakhi Gupta¹, Thomas A. Rando², Thomas A. Rando¹, Julie C. Baker¹, Michael Snyder¹, J. Michael Cherry¹, Anne Brunet¹ - Show less +14 more•Institutions (2)

Stanford University¹, VA Palo Alto Healthcare System²

31 Jul 2014-Cell

TL;DR: It is shown that H3K4me3 domains that spread more broadly over genes in a given cell type preferentially mark genes that are essential for the identity and function of that cell type.

...read moreread less

388 citations

Journal Article•DOI•

Comparison of the transcriptional landscapes between human and mouse tissues

[...]

Shin Lin¹, Yiing Lin², Joseph R. Nery³, Mark A. Urich³, Alessandra Breschi⁴, Carrie A. Davis⁵, Alexander Dobin⁵, Chris Zaleski⁵, Michael A. Beer⁶, William C. Chapman², Thomas R. Gingeras⁵, Joseph R. Ecker³, Michael Snyder¹ - Show less +9 more•Institutions (6)

Stanford University¹, Washington University in St. Louis², Salk Institute for Biological Studies³, Pompeu Fabra University⁴, Cold Spring Harbor Laboratory⁵, Johns Hopkins University⁶

02 Dec 2014-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: High-throughput sequencing assays on the transcriptome and epigenome reveal that, in general, differences dominate similarities between the two species, and indicate that there is considerable RNA expression diversity between humans and mice.

...read moreread less

Abstract: Although the similarities between humans and mice are typically highlighted, morphologically and genetically, there are many differences. To better understand these two species on a molecular level, we performed a comparison of the expression profiles of 15 tissues by deep RNA sequencing and examined the similarities and differences in the transcriptome for both protein-coding and -noncoding transcripts. Although commonalities are evident in the expression of tissue-specific genes between the two species, the expression for many sets of genes was found to be more similar in different tissues within the same species than between species. These findings were further corroborated by associated epigenetic histone mark analyses. We also find that many noncoding transcripts are expressed at a low level and are not detectable at appreciable levels across individuals. Moreover, the majority lack obvious sequence homologs between species, even when we restrict our attention to those which are most highly reproducible across biological replicates. Overall, our results indicate that there is considerable RNA expression diversity between humans and mice, well beyond what was described previously, likely reflecting the fundamental physiological differences between these two organisms.

...read moreread less

313 citations

Journal Article•DOI•

Genome-wide map of regulatory interactions in the human genome

[...]

Nastaran Heidari¹, Douglas H. Phanstiel¹, Chao He², Fabian Grubert¹, Fereshteh Jahanbani¹, Maya Kasowski¹, Maya Kasowski³, Michael Q. Zhang², Michael Q. Zhang⁴, Michael Snyder¹ - Show less +6 more•Institutions (4)

Stanford University¹, Tsinghua University², Yale University³, University of Texas at Dallas⁴

16 Sep 2014-Genome Research

TL;DR: New mechanistic and functional insights are revealed into regulatory region organization in the nucleus into cohesin, CTCF, and ZNF143 as key components of three-dimensional chromatin structure and how the distal chromatin state affects gene transcription.

...read moreread less

Abstract: Increasing evidence suggests that interactions between regulatory genomic elements play an important role in regulating gene expression. We generated a genome-wide interaction map of regulatory elements in human cells (ENCODE tier 1 cells, K562, GM12878) using Chromatin Interaction Analysis by Paired-End Tag sequencing (ChIA-PET) experiments targeting six broadly distributed factors. Bound regions covered 80% of DNase I hypersensitive sites including 99.7% of TSS and 98% of enhancers. Correlating this map with ChIP-seq and RNA-seq data sets revealed cohesin, CTCF, and ZNF143 as key components of three-dimensional chromatin structure and revealed how the distal chromatin state affects gene transcription. Comparison of interactions between cell types revealed that enhancer-promoter interactions were highly cell-type-specific. Construction and comparison of distal and proximal regulatory networks revealed stark differences in structure and biological function. Proximal binding events are enriched at genes with housekeeping functions, while distal binding events interact with genes involved in dynamic biological processes including response to stimulus. This study reveals new mechanistic and functional insights into regulatory region organization in the nucleus.

...read moreread less

269 citations

Journal Article•DOI•

Principles of regulatory information conservation between mouse and human

[...]

Yong Cheng¹, Zhihai Ma¹, Bong Hyun Kim², Weisheng Wu³, Weisheng Wu⁴, Philip Cayting¹, Alan P. Boyle, Vasavi Sundaram⁵, Xiaoyun Xing⁵, Nergiz Dogan³, Jingjing Li¹, Ghia Euskirchen¹, Shin Lin¹, Yiing Lin⁵, Yiing Lin¹, Axel Visel⁶, Axel Visel⁷, Axel Visel⁸, Trupti Kawli¹, Xinqiong Yang¹, Dorrelyn Patacsil¹, Cheryl A. Keller, Belinda Giardine³, Anshul Kundaje¹, Ting Wang⁵, Len A. Pennacchio⁷, Len A. Pennacchio⁸, Zhiping Weng², Ross C. Hardison, Michael Snyder¹ - Show less +26 more•Institutions (8)

Stanford University¹, University of Massachusetts Medical School², Pennsylvania State University³, University of Michigan⁴, Washington University in St. Louis⁵, University of California, Merced⁶, Lawrence Berkeley National Laboratory⁷, United States Department of Energy⁸

20 Nov 2014-Nature

TL;DR: Using the genome-wide transcription factor occupancy repertoires, associated epigenetic signals, and co-association patterns, several evolutionary principles of gene regulatory features operating since the mouse and human lineages diverged are deduced.

...read moreread less

Abstract: To broaden our understanding of the evolution of gene regulation mechanisms, we generated occupancy profiles for 34 orthologous transcription factors (TFs) in human-mouse erythroid progenitor, lymphoblast and embryonic stem-cell lines. By combining the genome-wide transcription factor occupancy repertoires, associated epigenetic signals, and co-association patterns, here we deduce several evolutionary principles of gene regulatory features operating since the mouse and human lineages diverged. The genomic distribution profiles, primary binding motifs, chromatin states, and DNA methylation preferences are well conserved for TF-occupied sequences. However, the extent to which orthologous DNA segments are bound by orthologous TFs varies both among TFs and with genomic location: binding at promoters is more highly conserved than binding at distal elements. Notably, occupancy-conserved TF-occupied sequences tend to be pleiotropic; they function in several tissues and also co-associate with many TFs. Single nucleotide variants at sites with potential regulatory functions are enriched in occupancy-conserved TF-occupied sequences.

...read moreread less

Journal Article•DOI•

Defining a personal, allele-specific, and single-molecule long-read transcriptome.

[...]

Hagen Tilgner¹, Fabian Grubert¹, Donald Sharon¹, Michael Snyder¹•Institutions (1)

Stanford University¹

08 Jul 2014-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: This work sequenced the lymphoblastoid transcriptomes of three family members by using a Pacific Biosciences long-read approach complemented with Illumina 101-bp sequencing and found that reads representing all splice sites of a transcript are evident for most sufficiently expressed genes ≤3 kb and often for genes longer than that.

...read moreread less

Abstract: Personal transcriptomes in which all of an individual’s genetic variants (e.g., single nucleotide variants) and transcript isoforms (transcription start sites, splice sites, and polyA sites) are defined and quantified for full-length transcripts are expected to be important for understanding individual biology and disease, but have not been described previously. To obtain such transcriptomes, we sequenced the lymphoblastoid transcriptomes of three family members (GM12878 and the parents GM12891 and GM12892) by using a Pacific Biosciences long-read approach complemented with Illumina 101-bp sequencing and made the following observations. First, we found that reads representing all splice sites of a transcript are evident for most sufficiently expressed genes ≤3 kb and often for genes longer than that. Second, we added and quantified previously unidentified splicing isoforms to an existing annotation, thus creating the first personalized annotation to our knowledge. Third, we determined SNVs in a de novo manner and connected them to RNA haplotypes, including HLA haplotypes, thereby assigning single full-length RNA molecules to their transcribed allele, and demonstrated Mendelian inheritance of RNA molecules. Fourth, we show how RNA molecules can be linked to personal variants on a one-by-one basis, which allows us to assess differential allelic expression (DAE) and differential allelic isoforms (DAI) from the phased full-length isoform reads. The DAI method is largely independent of the distance between exon and SNV—in contrast to fragmentation-based methods. Overall, in addition to improving eukaryotic transcriptome annotation, these results describe, to our knowledge, the first large-scale and full-length personal transcriptome.

...read moreread less

A comparative encyclopedia of DNA elements in the mouse genome

[...]

Feng Yue¹, Feng Yue², Yong Cheng³, Alessandra Breschi, Jeff Vierstra⁴, Weisheng Wu², Weisheng Wu⁵, Tyrone Ryba⁶, Tyrone Ryba⁷, Richard Sandstrom⁴, Zhihai Ma³, Carrie A. Davis⁸, Benjamin D. Pope⁷, Yin Shen¹, Dmitri D. Pervouchine, Sarah Djebali, Robert E. Thurman⁴, Rajinder Kaul⁴, Eric Rynes⁴, Anthony Kirilusha⁹, Georgi K. Marinov⁹, Brian A. Williams⁹, Diane Trout⁹, Henry Amrhein⁹, Katherine I. Fisher-Aylor⁹, Igor Antoshechkin⁹, Gilberto DeSalvo⁹, Lei Hoon See⁸, Meagan Fastuca⁸, Jorg Drenkow⁸, Chris Zaleski⁸, Alexander Dobin⁸, Pablo Prieto, Julien Lagarde, Giovanni Bussotti, Andrea Tanzer¹⁰, Olgert Denas¹¹, Kanwei Li¹¹, M. A. Bender⁴, M. A. Bender¹², Miaohua Zhang¹², Rachel Byron¹², Mark Groudine¹², Mark Groudine⁴, David McCleary¹, Long Pham¹, Zhen Ye¹, Samantha Kuan¹, Lee Edsall¹, Yi-Chieh Wu¹³, Matthew D. Rasmussen¹³, Mukul S. Bansal¹³, Manolis Kellis¹⁴, Manolis Kellis¹³, Cheryl A. Keller², Christapher S. Morrissey², Tejaswini Mishra², Deepti Jain², Nergiz Dogan², Robert S. Harris², Philip Cayting³, Trupti Kawli³, Alan P. Boyle³, Alan P. Boyle⁵, Ghia Euskirchen³, Anshul Kundaje³, Shin Lin³, Yiing Lin³, Camden Jansen¹⁵, Venkat S. Malladi³, Melissa S. Cline¹⁶, Drew T. Erickson³, Vanessa M. Kirkup¹⁶, Katrina Learned¹⁶, Cricket A. Sloan³, Kate R. Rosenbloom¹⁶, Beatriz Lacerda de Sousa¹⁷, Kathryn Beal, Miguel Pignatelli, Paul Flicek, Jin Lian¹⁸, Tamer Kahveci¹⁹, Dongwon Lee²⁰, W. James Kent¹⁶, Miguel Santos¹⁷, Javier Herrero²¹, Cedric Notredame, Audra K. Johnson⁴, Shinny Vong⁴, Kristen Lee⁴, Daniel Bates⁴, Fidencio Neri⁴, Morgan Diegel⁴, Theresa K. Canfield⁴, Peter J. Sabo⁴, Matthew S. Wilken⁴, Thomas A. Reh⁴, Erika Giste⁴, Anthony Shafer⁴, Tanya Kutyavin⁴, Eric Haugen⁴, Douglas Dunn⁴, Alex Reynolds⁴, Shane Neph⁴, Richard Humbert⁴, R. Scott Hansen⁴, Marella F. T. R. de Bruijn²², Licia Selleri²³, Alexander Y. Rudensky²⁴, Steven Z. Josefowicz²⁴, Robert M. Samstein²⁴, Evan E. Eichler⁴, Stuart H. Orkin²⁵, Dana N. Levasseur²⁶, Thalia Papayannopoulou⁴, Kai Hsin Chang⁴, Arthur I. Skoultchi²⁷, Srikanta Gosh²⁷, Christine M. Disteche⁴, Piper M. Treuting⁴, Yanli Wang², Mitchell J. Weiss, Gerd A. Blobel²⁸, Xiaoyi Cao¹, Sheng Zhong¹, Ting Wang²⁹, Peter J. Good³⁰, Rebecca F. Lowdon²⁹, Rebecca F. Lowdon³⁰, Leslie B. Adams³¹, Leslie B. Adams³⁰, Xiao Qiao Zhou³⁰, Michael J. Pazin³⁰, Elise A. Feingold³⁰, Barbara J. Wold⁹, James Taylor¹¹, Ali Mortazavi¹⁵, Sherman M. Weissman¹⁸, John A. Stamatoyannopoulos⁴, Michael Snyder³, Roderic Guigó, Thomas R. Gingeras⁸, David M. Gilbert⁷, Ross C. Hardison², Michael A. Beer²⁰, Bing Ren¹ - Show less +142 more•Institutions (31)

University of California, San Diego¹, Pennsylvania State University², Stanford University³, University of Washington⁴, University of Michigan⁵, New College of Florida⁶, Florida State University⁷, Cold Spring Harbor Laboratory⁸, California Institute of Technology⁹, University of Vienna¹⁰, Emory University¹¹, Fred Hutchinson Cancer Research Center¹², Massachusetts Institute of Technology¹³, Broad Institute¹⁴, University of California, Irvine¹⁵, University of California, Santa Cruz¹⁶, University of California, San Francisco¹⁷, Yale University¹⁸, University of Florida¹⁹, Johns Hopkins University²⁰, University College London²¹, University of Oxford²², Cornell University²³, Memorial Sloan Kettering Cancer Center²⁴, Harvard University²⁵, University of Iowa²⁶, Yeshiva University²⁷, University of Pennsylvania²⁸, Washington University in St. Louis²⁹, National Institutes of Health³⁰, University of North Carolina at Chapel Hill³¹

01 Nov 2014

TL;DR: By comparing with the human genome, this work not only confirms substantial conservation in the newly annotated potential functional sequences, but also finds a large degree of divergence of sequences involved in transcriptional regulation, chromatin state and higher order chromatin organization.

...read moreread less

Abstract: The laboratory mouse shares the majority of its protein-coding genes with humans, making it the premier model organism in biomedical research, yet the two mammals differ in significant ways. To gain greater insights into both shared and species-specific transcriptional and cellular regulatory programs in the mouse, the Mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications and replication domains throughout the mouse genome in diverse cell and tissue types. By comparing with the human genome, we not only confirm substantial conservation in the newly annotated potential functional sequences, but also find a large degree of divergence of sequences involved in transcriptional regulation, chromatin state and higher order chromatin organization. Our results illuminate the wide range of evolutionary forces acting on genes and their regulatory regions, and provide a general resource for research into mammalian biology and mechanisms of human diseases.

...read moreread less

Journal Article•DOI•

Quantitative analysis of RNA-protein interactions on a massively parallel array reveals biophysical and evolutionary landscapes.

[...]

Jason D. Buenrostro¹, Carlos L. Araya¹, Lauren M. Chircus¹, Curtis J. Layton¹, Howard Y. Chang¹, Michael Snyder¹, William J. Greenleaf¹ - Show less +3 more•Institutions (1)

Stanford University¹

01 Jun 2014-Nature Biotechnology

TL;DR: This work repurposes a high-throughput sequencing instrument to quantitatively measure binding and dissociation of a fluorescently labeled protein to >107 RNA targets generated on a flow cell surface by in situ transcription and intermolecular tethering of RNA to DNA.

...read moreread less

Abstract: Repurposing a DNA sequencing instrument for high-throughput analysis of RNA-protein interactions enables detailed analysis of sequence-function relationships.

...read moreread less

Journal Article•DOI•

Whole-genome haplotyping using long reads and statistical methods

[...]

Volodymyr Kuleshov¹, Volodymyr Kuleshov², Dan Xie¹, Rui Chen¹, Dmitry Pushkarev², Zhihai Ma¹, Tim Blauwkamp², Michael Kertesz², Michael Snyder¹ - Show less +5 more•Institutions (2)

Stanford University¹, Illumina²

01 Mar 2014-Nature Biotechnology

TL;DR: Using statistically aided, long-read haplotyping (SLRH), a rapid, accurate method that uses a statistical algorithm to take advantage of the partially phased information contained in long genomic fragments analyzed by short-read sequencing, this work phases 99% of single-nucleotide variants in three human genomes into long haplotype blocks 0.2–1 Mbp in length.

...read moreread less

Abstract: Haplotyping of human genomes is improved by augmenting experimental dilution-based haplotyping with statistical analyses, a strategy known until now only as 'Moleculo.'

...read moreread less

Journal Article•DOI•

Comparative analysis of regulatory information and circuits across distant species

[...]

Alan P. Boyle¹, Carlos L. Araya¹, Cathleen M. Brdlik¹, Philip Cayting¹, Chao Cheng², Yong Cheng¹, Kathryn E. Gardner², LaDeana W. Hillier³, J. Janette², Lixia Jiang¹, Dionna M. Kasper², Trupti Kawli¹, Pouya Kheradpour⁴, Anshul Kundaje⁴, Anshul Kundaje¹, Jingyi Jessica Li⁵, Jingyi Jessica Li⁶, Lijia Ma³, Wei Niu², E. Jay Rehm², Joel Rozowsky⁷, Matthew Slattery², Rebecca Spokony⁷, Robert Terrell⁷, D. Vafeados³, Daifeng Wang², Peter Weisdepp³, Yi-Chieh Wu⁴, Dan Xie¹, Koon-Kiu Yan², Elise A. Feingold⁸, Peter J. Good⁸, Michael J. Pazin⁸, Haiyan Huang⁵, Peter J. Bickel⁵, Steven E. Brenner⁵, Valerie Reinke², Robert H. Waterston³, Mark Gerstein², Kevin P. White⁷, Manolis Kellis⁴, Michael Snyder¹ - Show less +38 more•Institutions (8)

Stanford University¹, Yale University², University of Washington³, Massachusetts Institute of Technology⁴, University of California, Berkeley⁵, University of California, Los Angeles⁶, University of Chicago⁷, National Institutes of Health⁸

28 Aug 2014-Nature

TL;DR: The results suggest that gene-regulatory properties previously observed for individual factors are general principles of metazoan regulation that are remarkably well-preserved despite extensive functional divergence of individual network connections.

...read moreread less

Abstract: Despite the large evolutionary distances between metazoan species, they can show remarkable commonalities in their biology, and this has helped to establish fly and worm as model organisms for human biology. Although studies of individual elements and factors have explored similarities in gene regulation, a large-scale comparative analysis of basic principles of transcriptional regulatory features is lacking. Here we map the genome-wide binding locations of 165 human, 93 worm and 52 fly transcription regulatory factors, generating a total of 1,019 data sets from diverse cell types, developmental stages, or conditions in the three species, of which 498 (48.9%) are presented here for the first time. We find that structural properties of regulatory networks are remarkably conserved and that orthologous regulatory factor families recognize similar binding motifs in vivo and show some similar co-associations. Our results suggest that gene-regulatory properties previously observed for individual factors are general principles of metazoan regulation that are remarkably well-preserved despite extensive functional divergence of individual network connections. The comparative maps of regulatory circuitry provided here will drive an improved understanding of the regulatory underpinnings of model organism biology and how these relate to human biology, development and disease.

...read moreread less

Journal Article•DOI•

Mutations in NGLY1 cause an inherited disorder of the endoplasmic reticulum-associated degradation pathway

[...]

Gregory M. Enns¹, Vandana Shashi², Matthew N. Bainbridge³, Michael J. Gambello⁴, Farah R. Zahir⁵, Thomas Bast, Rebecca Crimian², Kelly Schoch², Julia Platt¹, Rachel Cox¹, Jonathan A. Bernstein¹, Mena Scavina⁶, Rhonda S. Walter⁶, Audrey L. Bibb⁴, Melanie A. Jones⁴, Madhuri Hegde⁴, Brett H. Graham³, Anna C. Need⁷, Angelica Oviedo⁸, Christian P. Schaaf³, Christian P. Schaaf⁹, Sean Michael Boyle¹⁰, Atul J. Butte¹⁰, Rong Chen¹⁰, Michael J. Clark¹⁰, Rajini R Haraksingh¹⁰, Tina M. Cowan¹⁰, Ping He¹¹, Sylvie Langlois⁵, Huda Y. Zoghbi³, Huda Y. Zoghbi⁹, Michael Snyder¹⁰, Richard A. Gibbs³, Hudson H. Freeze¹¹, David Goldstein² - Show less +31 more•Institutions (11)

Lucile Packard Children's Hospital¹, Duke University², Baylor College of Medicine³, Emory University⁴, University of British Columbia⁵, Alfred I. duPont Hospital for Children⁶, Imperial College London⁷, Dalhousie University⁸, Boston Children's Hospital⁹, Stanford University¹⁰, Sanford-Burnham Institute for Medical Research¹¹

01 Oct 2014-Genetics in Medicine

TL;DR: NGLY1 deficiency is a novel autosomal recessive disorder of the endoplasmic reticulum–associated degradation pathway associated with neurological dysfunction, abnormal tear production, and liver disease.

...read moreread less

Journal Article•DOI•

Sushi.R: flexible, quantitative and integrative genomic visualizations for publication-quality multi-panel figures

[...]

Douglas H. Phanstiel¹, Alan P. Boyle¹, Carlos L. Araya¹, Michael Snyder¹•Institutions (1)

Stanford University¹

05 Jun 2014-Bioinformatics

TL;DR: This work presents Sushi.R, an R/Bioconductor package that allows flexible integration of genomic visualizations into highly customizable, publication-ready, multi-panel figures from common genomic data formats including Browser Extensible Data (BED), bedGraph and Browser extensible Data Paired-End (BedPE).

...read moreread less

Abstract: Motivation: Interpretation and communication of genomic data require flexible and quantitative tools to analyze and visualize diverse data types, yet a comprehensive tool to display all common genomic data types in publication quality figures does not exist to date. To address this shortcoming, we present Sushi.R, an R/Bioconductor package that allows flexible integration of genomic visualizations into highly-customizable, publication-ready, multi-panel figures from common genomic data formats including BED, bedGraph, and BEDPE. Sushi.R is open source and made publicly available through GitHub (https://github.com/dphansti/Sushi) and Bioconductor (http://bioconductor.org/packages/release/bioc/html/Sushi.html).

...read moreread less

Comparative analysis of regulatory information and circuits across distant species

[...]

01 Aug 2014

TL;DR: In this article, the genome-wide binding locations of 165 human, 93 worm and 52 fly transcription regulatory factors were mapped for a total of 1,019 data sets from diverse cell types, developmental stages, or conditions in the three species, of which 498 (48.9%) are presented here for the first time.

...read moreread less

Journal Article•DOI•

Gene-centric Meta-analysis in 87,736 Individuals of European Ancestry Identifies Multiple Blood-Pressure-Related Loci

[...]

Vinicius Tragante¹, Michael R. Barnes², Santhi K. Ganesh³, Matthew B. Lanktree⁴, Wei Guo⁵, Nora Franceschini⁶, Erin N. Smith⁷, Toby Johnson², Michael V. Holmes⁸, Sandosh Padmanabhan⁹, Konrad J. Karczewski¹⁰, Berta Almoguera⁸, John Barnard¹¹, Jens Baumert, Yen Pei C. Chang¹², Clara C. Elbers¹, Martin Farrall¹³, Mary E. Fischer¹⁴, Tom R. Gaunt¹⁵, Johannes M.I.H. Gho¹, Christian Gieger, Anuj Goel¹³, Yan Gong¹⁶, Aaron Isaacs¹⁷, Marcus E. Kleber¹⁸, Irene Mateo Leach¹⁹, Caitrin W. McDonough¹⁶, Matthijs F.L. Meijs¹, Olle Melander²⁰, Christopher P. Nelson²¹, Christopher P. Nelson²², Ilja M. Nolte¹⁹, Nathan Pankratz²³, Thomas S. Price, Jonathan A. Shaffer²⁴, Sonia Shah²⁵, Maciej Tomaszewski²¹, Peter J. van der Most¹⁹, Erik P A Van Iperen, Judith M. Vonk¹⁹, Kate Witkowska², Caroline O. L. Wong², Li Zhang¹¹, Amber L. Beitelshees¹², Gerald S. Berenson²⁶, Deepak L. Bhatt²⁷, Morris Brown²⁸, Amber A. Burt²⁹, Rhonda M. Cooper-DeHoff¹⁶, John M. C. Connell³⁰, Karen J. Cruickshanks¹⁴, Sean P. Curtis³¹, George Davey-Smith¹⁵, Christian Delles⁹, Ron T. Gansevoort¹⁹, Xiuqing Guo³², Shen Haiqing¹², Claire E. Hastie⁹, Marten H. Hofker¹⁹, Marten H. Hofker¹, G. Kees Hovingh, Daniel Seung Kim²⁹, Susan Kirkland³³, Barbara E.K. Klein¹⁴, Ronald Klein¹⁴, Yun Li⁸, Steffi Maiwald, Christopher Newton-Cheh²⁷, Eoin O'Brien³⁴, N. Charlotte Onland-Moret¹, Walter Palmas²⁴, Afshin Parsa¹², Brenda W.J.H. Penninx³⁵, Mary Pettinger³⁶, Ramachandran S. Vasan³⁷, Jane E. Ranchalis²⁹, Paul M. Ridker²⁷, Lynda M. Rose²⁷, Peter S. Sever³⁸, Daichi Shimbo²⁴, Laura Steele⁸, Ronald P. Stolk¹⁹, Barbara Thorand, Mieke D. Trip, Cornelia M. van Duijn¹⁷, W M Monique Verschuren¹, Cisca Wijmenga¹⁹, Sharon B. Wyatt³⁹, J. Hunter Young⁴⁰, Aeilko H. Zwinderman, Connie R. Bezzina⁴¹, Eric Boerwinkle⁴², Juan P. Casas⁴³, Mark J. Caulfield², Aravinda Chakravarti⁴⁰, Daniel I. Chasman²⁷, Karina W. Davidson²⁴, Pieter A. Doevendans¹, Anna F. Dominiczak⁹, Garret A. FitzGerald⁸, John G. Gums¹⁶, Myriam Fornage⁴², Hakon Hakonarson⁸, Indrani Halder⁴⁴, Hans L. Hillege¹⁹, Thomas Illig⁴⁵, Gail P. Jarvik³⁸, Julie A. Johnson¹⁶, John J.P. Kastelein, Wolfgang Koenig⁴⁶, Meena Kumari²⁵, Winfried März⁴⁷, Sarah S. Murray⁷, Jeffrey R. O'Connell¹², Albertine J. Oldehinkel¹⁹, James S. Pankow²³, Daniel J. Rader⁸, Susan Redline²⁷, Muredach P. Reilly⁸, Eric E. Schadt⁴⁸, Kandice Kottke-Marchant¹¹, Harold Snieder¹⁹, Michael Snyder¹⁰, Alice Stanton⁴⁹, Martin D. Tobin²¹, André G. Uitterlinden¹⁷, Pim van der Harst¹⁹, Yvonne T. van der Schouw¹, Nilesh J. Samani²², Nilesh J. Samani²¹, Hugh Watkins¹³, Andrew D. Johnson, Alexander P. Reiner³⁶, Xiaofeng Zhu⁵, Paul I.W. de Bakker⁵⁰, Daniel Levy, Folkert W. Asselbergs¹, Folkert W. Asselbergs²⁵, Patricia B. Munroe², Brendan J. Keating⁸ - Show less +136 more•Institutions (50)

Utrecht University¹, Queen Mary University of London², University of Michigan³, McMaster University⁴, Case Western Reserve University⁵, University of North Carolina at Chapel Hill⁶, University of California, San Diego⁷, University of Pennsylvania⁸, University of Glasgow⁹, Stanford University¹⁰, Cleveland Clinic¹¹, University of Maryland, Baltimore¹², University of Oxford¹³, University of Wisconsin-Madison¹⁴, University of Bristol¹⁵, University of Florida¹⁶, Erasmus University Rotterdam¹⁷, Heidelberg University¹⁸, University of Groningen¹⁹, Lund University²⁰, University of Leicester²¹, Glenfield Hospital²², University of Minnesota²³, Columbia University²⁴, University College London²⁵, Tulane University²⁶, Harvard University²⁷, University of Cambridge²⁸, University of Washington²⁹, University of Dundee³⁰, Merck & Co.³¹, Cedars-Sinai Medical Center³², Dalhousie University³³, University College Dublin³⁴, VU University Amsterdam³⁵, Fred Hutchinson Cancer Research Center³⁶, Boston University³⁷, Imperial College London³⁸, University of Mississippi³⁹, Johns Hopkins University⁴⁰, University of Amsterdam⁴¹, University of Texas Health Science Center at Houston⁴², University of London⁴³, University of Pittsburgh⁴⁴, Hannover Medical School⁴⁵, University of Ulm⁴⁶, Medical University of Graz⁴⁷, Icahn School of Medicine at Mount Sinai⁴⁸, Royal College of Surgeons in Ireland⁴⁹, Brigham and Women's Hospital⁵⁰

06 Mar 2014-American Journal of Human Genetics

TL;DR: The findings extend the understanding of genes involved in BP regulation, which may provide new targets for therapeutic intervention or drug response stratification and provide support for a putative role in hypertension of several genes.

...read moreread less

Abstract: Blood pressure (BP) is a heritable risk factor for cardiovascular disease To investigate genetic associations with systolic BP (SBP), diastolic BP (DBP), mean arterial pressure (MAP), and pulse pressure (PP), we genotyped ~50,000 SNPs in up to 87,736 individuals of European ancestry and combined these in a meta-analysis We replicated findings in an independent set of 68,368 individuals of European ancestry Our analyses identified 11 previously undescribed associations in independent loci containing 31 genes including PDE1A, HLA-DQB1, CDK6, PRKAG2, VCL, H19, NUCB2, RELA, HOXC@ complex, FBN1, and NFAT5 at the Bonferroni-corrected array-wide significance threshold (p < 6 × 10(-7)) and confirmed 27 previously reported associations Bioinformatic analysis of the 11 loci provided support for a putative role in hypertension of several genes, such as CDK6 and NUCB2 Analysis of potential pharmacological targets in databases of small molecules showed that ten of the genes are predicted to be a target for small molecules In summary, we identified previously unknown loci associated with BP Our findings extend our understanding of genes involved in BP regulation, which may provide new targets for therapeutic intervention or drug response stratification

...read moreread less

Journal Article•DOI•

Integrated systems analysis reveals a molecular network underlying autism spectrum disorders

[...]

Jingjing Li¹, Minyi Shi¹, Zhihai Ma¹, Shuchun Zhao¹, Ghia Euskirchen¹, Jennifer L. Ziskin¹, Alexander E. Urban¹, Joachim Hallmayer¹, Michael Snyder¹ - Show less +5 more•Institutions (1)

Stanford University¹

01 Dec 2014-Molecular Systems Biology

TL;DR: A systems framework involving the interactome, gene expression and genome sequencing is developed to identify a protein interaction module with members strongly enriched for autism candidate genes that delineates a natural network involved in autism.

...read moreread less

Abstract: Autism is a complex disease whose etiology remains elusive. We integrated previously and newly generated data and developed a systems framework involving the interactome, gene expression and genome sequencing to identify a protein interaction module with members strongly enriched for autism candidate genes. Sequencing of 25 patients confirmed the involvement of this module in autism, which was subsequently validated using an independent cohort of over 500 patients. Expression of this module was dichotomized with a ubiquitously expressed subcomponent and another subcomponent preferentially expressed in the corpus callosum, which was significantly affected by our identified mutations in the network center. RNA-sequencing of the corpus callosum from patients with autism exhibited extensive gene mis-expression in this module, and our immunochemical analysis showed that the human corpus callosum is predominantly populated by oligodendrocyte cells. Analysis of functional genomic data further revealed a significant involvement of this module in the development of oligodendrocyte cells in mouse brain. Our analysis delineates a natural network involved in autism, helps uncover novel candidate genes for this disease and improves our understanding of its molecular pathology.

...read moreread less

Journal Article•DOI•

Regulatory analysis of the C. elegans genome with spatiotemporal resolution

[...]

Carlos L. Araya¹, Trupti Kawli¹, Anshul Kundaje², Lixia Jiang¹, Beijing Wu¹, D. Vafeados³, Robert Terrell³, Peter Weissdepp³, Louis Gevirtzman³, Daniel L. Mace³, Wei Niu⁴, Alan P. Boyle¹, Dan Xie¹, Lijia Ma⁵, John I. Murray⁶, Valerie Reinke⁴, Robert H. Waterston³, Michael Snyder¹ - Show less +14 more•Institutions (6)

Stanford University¹, Massachusetts Institute of Technology², University of Washington³, Yale University⁴, University of Chicago⁵, University of Pennsylvania⁶

28 Aug 2014-Nature

TL;DR: This work determined the genomic distribution of binding sites for 92 transcription factors and regulatory proteins across multiple stages of Caenorhabditis elegans development by performing 241 ChIP-seq (chromatin immunoprecipitation followed by sequencing) experiments and produced a spatiotemporally resolved metazoan transcription factor binding map.

...read moreread less

Abstract: Discovering the structure and dynamics of transcriptional regulatory events in the genome with cellular and temporal resolution is crucial to understanding the regulatory underpinnings of development and disease. We determined the genomic distribution of binding sites for 92 transcription factors and regulatory proteins across multiple stages of Caenorhabditis elegans development by performing 241 ChIP-seq (chromatin immunoprecipitation followed by sequencing) experiments. Integration of regulatory binding and cellular-resolution expression data produced a spatiotemporally resolved metazoan transcription factor binding map. Using this map, we explore developmental regulatory circuits that encode combinatorial logic at the levels of co-binding and co-expression of transcription factors, characterizing the genomic coverage and clustering of regulatory binding, the binding preferences of, and biological processes regulated by, transcription factors, the global transcription factor co-associations and genomic subdomains that suggest shared patterns of regulation, and identifying key transcription factors and transcription factor co-associations for fate specification of individual lineages and cell types.

...read moreread less

Journal Article•DOI•

Identification of STAT5A and STAT5B Target Genes in Human T Cells

[...]

Takahiro Kanai¹, Scott Seki¹, Jennifer A. Jenks¹, Arunima Kohli¹, Trupti Kawli¹, Dorrelyn Patacsil Martin¹, Michael Snyder¹, Rosa Bacchetta¹, Kari C. Nadeau¹ - Show less +5 more•Institutions (1)

Stanford University¹

30 Jan 2014-PLOS ONE

TL;DR: Signal transducer and activator of transcription (STAT) comprises a family of universal transcription factors that help cells sense and respond to environmental signals, and a novel, unique role for STAT5A is found in binding to genes involved in neural development and function, while STAT5B appears to play a distinct role in T cell development andfunction via DOCK8, SNX9, FOXP3 and IL2RA binding.

...read moreread less

Abstract: Signal transducer and activator of transcription (STAT) comprises a family of universal transcription factors that help cells sense and respond to environmental signals. STAT5 refers to two highly related proteins, STAT5A and STAT5B, with critical function: their complete deficiency is lethal in mice; in humans, STAT5B deficiency alone leads to endocrine and immunological problems, while STAT5A deficiency has not been reported. STAT5A and STAT5B show peptide sequence similarities greater than 90%, but subtle structural differences suggest possible non-redundant roles in gene regulation. However, these roles remain unclear in humans. We applied chromatin immunoprecipitation followed by DNA sequencing using human CD4+ T cells to detect candidate genes regulated by STAT5A and/or STAT5B, and quantitative-PCR in STAT5A or STAT5B knock-down (KD) human CD4+ T cells to validate the findings. Our data show STAT5A and STAT5B play redundant roles in cell proliferation and apoptosis via SGK1 interaction. Interestingly, we found a novel, unique role for STAT5A in binding to genes involved in neural development and function (NDRG1, DNAJC6, and SSH2), while STAT5B appears to play a distinct role in T cell development and function via DOCK8, SNX9, FOXP3 and IL2RA binding. Our results also suggest that one or more co-activators for STAT5A and/or STAT5B may play important roles in establishing different binding abilities and gene regulation behaviors. The new identification of these genes regulated by STAT5A and/or STAT5B has major implications for understanding the pathophysiology of cancer progression, neural disorders, and immune abnormalities.

...read moreread less

Journal Article•DOI•

Shared functions of plant and mammalian StAR-related lipid transfer (START) domains in modulating transcription factor activity

[...]

Kathrin Schrick¹, Kathrin Schrick², Michael Bruno³, Aashima Khosla¹, Paige N Cox¹, Sara A Marlatt², Remigio A Roque², Henry C. Nguyen², Cuiwen He², Michael Snyder³, Daljit Singh, Gitanjali Yadav - Show less +8 more•Institutions (3)

Kansas State University¹, Keck Graduate Institute of Applied Life Sciences², Stanford University³

27 Aug 2014-BMC Biology

TL;DR: The data provide evidence for an evolutionarily conserved mechanism by which lipid metabolites can orchestrate transcription in a yeast system and propose a model in which the START domain is used by both plants and mammals to regulate transcription factor activity.

...read moreread less

Abstract: Steroidogenic acute regulatory protein (StAR)-related lipid transfer (START) domains were first identified from mammalian proteins that bind lipid/sterol ligands via a hydrophobic pocket. In plants, predicted START domains are predominantly found in homeodomain leucine zipper (HD-Zip) transcription factors that are master regulators of cell-type differentiation in development. Here we utilized studies of Arabidopsis in parallel with heterologous expression of START domains in yeast to investigate the hypothesis that START domains are versatile ligand-binding motifs that can modulate transcription factor activity. Our results show that deletion of the START domain from Arabidopsis Glabra2 (GL2), a representative HD-Zip transcription factor involved in differentiation of the epidermis, results in a complete loss-of-function phenotype, although the protein is correctly localized to the nucleus. Despite low sequence similarly, the mammalian START domain from StAR can functionally replace the HD-Zip-derived START domain. Embedding the START domain within a synthetic transcription factor in yeast, we found that several mammalian START domains from StAR, MLN64 and PCTP stimulated transcription factor activity, as did START domains from two Arabidopsis HD-Zip transcription factors. Mutation of ligand-binding residues within StAR START reduced this activity, consistent with the yeast assay monitoring ligand-binding. The D182L missense mutation in StAR START was shown to affect GL2 transcription factor activity in maintenance of the leaf trichome cell fate. Analysis of in vivo protein–metabolite interactions by mass spectrometry provided direct evidence for analogous lipid-binding activity in mammalian and plant START domains in the yeast system. Structural modeling predicted similar sized ligand-binding cavities of a subset of plant START domains in comparison to mammalian counterparts. The START domain is required for transcription factor activity in HD-Zip proteins from plants, although it is not strictly necessary for the protein’s nuclear localization. START domains from both mammals and plants are modular in that they can bind lipid ligands to regulate transcription factor function in a yeast system. The data provide evidence for an evolutionarily conserved mechanism by which lipid metabolites can orchestrate transcription. We propose a model in which the START domain is used by both plants and mammals to regulate transcription factor activity.

...read moreread less

Journal Article•DOI•

Allelic Expression of Deleterious Protein-Coding Variants across Human Tissues

[...]

Kimberly R. Kukurba¹, Rui Zhang¹, Xin Li¹, Kevin S. Smith¹, David A. Knowles¹, Meng How Tan¹, Robert Piskol¹, Monkol Lek², Michael Snyder¹, Daniel G. MacArthur², Jin Billy Li¹, Stephen B. Montgomery¹ - Show less +8 more•Institutions (2)

Stanford University¹, Harvard University²

01 May 2014-PLOS Genetics

TL;DR: The potential importance of transcriptome data to the interpretation of pathogenic protein-coding variants is demonstrated, exposing stronger allelic bias for rare stop-gain variants and informing the extent to which rare deleterious coding alleles are consistently expressed across tissues.

...read moreread less

Abstract: Personal exome and genome sequencing provides access to loss-of-function and rare deleterious alleles whose interpretation is expected to provide insight into individual disease burden. However, for each allele, accurate interpretation of its effect will depend on both its penetrance and the trait's expressivity. In this regard, an important factor that can modify the effect of a pathogenic coding allele is its level of expression; a factor which itself characteristically changes across tissues. To better inform the degree to which pathogenic alleles can be modified by expression level across multiple tissues, we have conducted exome, RNA and deep, targeted allele-specific expression (ASE) sequencing in ten tissues obtained from a single individual. By combining such data, we report the impact of rare and common loss-of-function variants on allelic expression exposing stronger allelic bias for rare stop-gain variants and informing the extent to which rare deleterious coding alleles are consistently expressed across tissues. This study demonstrates the potential importance of transcriptome data to the interpretation of pathogenic protein-coding variants.

...read moreread less

Journal Article•DOI•

Personalized sequencing and the future of medicine: discovery, diagnosis and defeat of disease

[...]

Edward D. Esplin¹, Ling Oei¹, Michael Snyder¹•Institutions (1)

Stanford University¹

10 Dec 2014-Pharmacogenomics

TL;DR: The advancing capacity of personalized sequencing is documents, its impact on disease-oriented scientific discovery and anticipates its role in the future of medicine are reviewed.

...read moreread less

Abstract: The potential for personalized sequencing to individually optimize medical treatment in diseases such as cancer and for pharmacogenomic application is just beginning to be realized, and the utility of sequencing healthy individuals for managing health is also being explored. The data produced requires additional advancements in interpretation of variants of unknown significance to maximize clinical benefit. Nevertheless, personalized sequencing, only recently applied to clinical medicine, has already been broadly applied to the discovery and study of disease. It is poised to enable the earlier and more accurate diagnosis of disease risk and occurrence, guide prevention and individualized intervention as well as facilitate monitoring of healthy and treated patients, and play a role in the prevention and recurrence of future disease. This article documents the advancing capacity of personalized sequencing, reviews its impact on disease-oriented scientific discovery and anticipates its role in the future of ...

...read moreread less

Journal Article•DOI•

Transcriptome sequencing from diverse human populations reveals differentiated regulatory architecture.

[...]

Alicia R. Martin¹, Helio A. Costa¹, Tuuli Lappalainen¹, Brenna M. Henn², Jeffrey M. Kidd³, Muh Ching Yee¹, Fabian Grubert¹, Howard M. Cann, Michael Snyder¹, Stephen B. Montgomery¹, Carlos Bustamante¹ - Show less +7 more•Institutions (3)

Stanford University¹, Stony Brook University², University of Michigan³

14 Aug 2014-PLOS Genetics

TL;DR: Together, these results provide a resource for studies analyzing functional differences across populations by estimating the degree of shared gene expression, alternative splicing, and regulatory genetics across populations from the broadest points of human migration history yet sampled.

...read moreread less

Abstract: Large-scale sequencing efforts have documented extensive genetic variation within the human genome. However, our understanding of the origins, global distribution, and functional consequences of this variation is far from complete. While regulatory variation influencing gene expression has been studied within a handful of populations, the breadth of transcriptome differences across diverse human populations has not been systematically analyzed. To better understand the spectrum of gene expression variation, alternative splicing, and the population genetics of regulatory variation in humans, we have sequenced the genomes, exomes, and transcriptomes of EBV transformed lymphoblastoid cell lines derived from 45 individuals in the Human Genome Diversity Panel (HGDP). The populations sampled span the geographic breadth of human migration history and include Namibian San, Mbuti Pygmies of the Democratic Republic of Congo, Algerian Mozabites, Pathan of Pakistan, Cambodians of East Asia, Yakut of Siberia, and Mayans of Mexico. We discover that approximately 25.0% of the variation in gene expression found amongst individuals can be attributed to population differences. However, we find few genes that are systematically differentially expressed among populations. Of this population-specific variation, 75.5% is due to expression rather than splicing variability, and we find few genes with strong evidence for differential splicing across populations. Allelic expression analyses indicate that previously mapped common regulatory variants identified in eight populations from the International Haplotype Map Phase 3 project have similar effects in our seven sampled HGDP populations, suggesting that the cellular effects of common variants are shared across diverse populations. Together, these results provide a resource for studies analyzing functional differences across populations by estimating the degree of shared gene expression, alternative splicing, and regulatory genetics across populations from the broadest points of human migration history yet sampled.

...read moreread less

Journal Article•DOI•

Toward More Transparent and Reproducible Omics Studies Through a Common Metadata Checklist and Data Publications

[...]

Eugene Kolker¹, Vural Ozdemir, Lennart Martens², William S. Hancock³, Gordon A. Anderson⁴, Nathaniel Anderson, Sukru Aynacioglu⁵, Ancha Baranova⁶, Shawn R. Campagna⁷, Rui Chen⁸, John Choiniere, Stephen P. Dearth⁷, Wu-chun Feng⁹, Lynnette R. Ferguson¹⁰, Geoffrey C. Fox¹¹, Dmitrij Frishman¹², Robert L. Grossman¹³, Allison Heath¹³, Roger Higdon, Mara H. Hutz¹⁴, Imre Janko¹⁵, Lihua Jiang⁸, Sanjay Joshi¹⁶, Alexander Kel, Joseph W. Kemnitz¹⁷, Isaac S. Kohane¹⁸, Natali Kolker¹⁵, Doron Lancet¹⁹, Elaine Lee¹⁵, Weizhong Li²⁰, Andrey Lisitsa²¹, Adrián LLerena²², Courtney MacNealy-Koch, Jean-Claude Marshall, Paola Masuzzo², Amanda L. May⁷, George I. Mias⁸, Matthew E. Monroe⁴, Elizabeth Montague, Sean D. Mooney²³, Alexey I. Nesvizhskii²⁴, Santosh Noronha²⁵, Gilbert S. Omenn²⁴, Harsha Rajasimha, Preveen Ramamoorthy²⁶, Jerry Sheehan²⁰, Larry Smarr²⁰, Charles V. Smith¹⁵, Todd M. Smith, Michael Snyder⁸, Srikanth Rapole²⁷, Sanjeeva Srivastava²⁵, Larissa Stanberry, Elizabeth Stewart, Stefano Toppo²⁸, Peter Uetz²⁹, Kenneth Verheggen², Brynn H. Voy³⁰, Louise Warnich³¹, Steven W. Wilhelm⁷, Gregory Yandl - Show less +57 more•Institutions (31)

Seattle Children's Research Institute¹, Ghent University², Northeastern University³, Pacific Northwest National Laboratory⁴, University of Gaziantep⁵, George Mason University⁶, University of Tennessee⁷, Stanford University⁸, Virginia Tech⁹, University of Auckland¹⁰, Indiana University¹¹, Technische Universität München¹², University of Chicago¹³, Universidade Federal do Rio Grande do Sul¹⁴, Boston Children's Hospital¹⁵, EMC Corporation¹⁶, University of Wisconsin-Madison¹⁷, Harvard University¹⁸, Weizmann Institute of Science¹⁹, University of California, San Diego²⁰, Russian Academy²¹, University of Extremadura²², Buck Institute for Research on Aging²³, University of Michigan²⁴, Indian Institute of Technology Bombay²⁵, University of Colorado Denver²⁶, Savitribai Phule Pune University²⁷, University of Padua²⁸, Virginia Commonwealth University²⁹, University Of Tennessee System³⁰, Stellenbosch University³¹

01 Jan 2014-Omics A Journal of Integrative Biology

TL;DR: The proposed omics metadata checklist will serve as a common denominator to guide experimental design, capture important parameters, and be used as a standard format for stand-alone data publications and allow for appropriate attribution to data generators and infrastructure science builders in the post-genomics era.

...read moreread less

Abstract: Biological processes are fundamentally driven by complex interactions between biomolecules. Integrated high-throughput omics studies enable multifaceted views of cells, organisms, or their communities. With the advent of new post-genomics technologies, omics studies are becoming increasingly prevalent; yet the full impact of these studies can only be realized through data harmonization, sharing, meta-analysis, and integrated research. These essential steps require consistent generation, capture, and distribution of metadata. To ensure transparency, facilitate data harmonization, and maximize reproducibility and usability of life sciences studies, we propose a simple common omics metadata checklist. The proposed checklist is built on the rich ontologies and standards already in use by the life sciences community. The checklist will serve as a common denominator to guide experimental design, capture important parameters, and be used as a standard format for stand-alone data publications. The omics metadata checklist and data publications will create efficient linkages between omics data and knowledge-based life sciences innovation and, importantly, allow for appropriate attribution to data generators and infrastructure science builders in the post-genomics era. We ask that the life sciences community test the proposed omics metadata checklist and data publications and provide feedback for their use and improvement.

...read moreread less

Journal Article•DOI•

Extended lifespan and reduced adiposity in mice lacking the FAT10 gene.

[...]

Allon Canaan, Jason DeFuria¹, Eddie Perelman², Vincent Schultz, Montrell Seay, David Tuck, Richard A. Flavell³, Michael Snyder⁴, Martin S. Obin¹, Sherman M. Weissman - Show less +6 more•Institutions (4)

United States Department of Agriculture¹, Ben-Gurion University of the Negev², Yale University³, Stanford University⁴

08 Apr 2014-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: It is shown that FAT10 knockout prevents the development of age-associated obesity in mice while extending lifespan and vigor without the appearance of deleterious developmental effects, and suggest novel roles of FAT10 in immune metabolic regulation that impact aging and chronic disease.

...read moreread less

Abstract: The HLA-F adjacent transcript 10 (FAT10) is a member of the ubiquitin-like gene family that alters protein function/stability through covalent ligation. Although FAT10 is induced by inflammatory mediators and implicated in immunity, the physiological functions of FAT10 are poorly defined. We report the discovery that FAT10 regulates lifespan through pleiotropic actions on metabolism and inflammation. Median and overall lifespan are increased 20% in FAT10ko mice, coincident with elevated metabolic rate, preferential use of fat as fuel, and dramatically reduced adiposity. This phenotype is associated with metabolic reprogramming of skeletal muscle (i.e., increased AMP kinase activity, β-oxidation and -uncoupling, and decreased triglyceride content). Moreover, knockout mice have reduced circulating glucose and insulin levels and enhanced insulin sensitivity in metabolic tissues, consistent with elevated IL-10 in skeletal muscle and serum. These observations suggest novel roles of FAT10 in immune metabolic regulation that impact aging and chronic disease.

...read moreread less

Journal Article•DOI•

Coherent functional modules improve transcription factor target identification, cooperativity prediction, and disease association.

[...]

Konrad J. Karczewski¹, Michael Snyder¹, Russ B. Altman¹, Nicholas P. Tatonetti²•Institutions (2)

Stanford University¹, Columbia University²

06 Feb 2014-PLOS Genetics

TL;DR: An improved mapping of targets is created by integrating ChIP-Seq data with 423 functional modules derived from 9,395 human expression experiments, which identified 5,002 TF-module relationships, significantly improved TF target prediction, and found 30 high-confidence TF-TF associations.

...read moreread less

Abstract: Transcription factors (TFs) are fundamental controllers of cellular regulation that function in a complex and combinatorial manner. Accurate identification of a transcription factor's targets is essential to understanding the role that factors play in disease biology. However, due to a high false positive rate, identifying coherent functional target sets is difficult. We have created an improved mapping of targets by integrating ChIP-Seq data with 423 functional modules derived from 9,395 human expression experiments. We identified 5,002 TF-module relationships, significantly improved TF target prediction, and found 30 high-confidence TF-TF associations, of which 14 are known. Importantly, we also connected TFs to diseases through these functional modules and identified 3,859 significant TF-disease relationships. As an example, we found a link between MEF2A and Crohn's disease, which we validated in an independent expression dataset. These results show the power of combining expression data and ChIP-Seq data to remove noise and better extract the associations between TFs, functional modules, and disease.

...read moreread less