OrthoClust: an orthology-based network framework for clustering data across multiple species
Reads0
Chats0
TLDR
OrthoClust is a computational framework that integrates the co-association networks of individual species by utilizing the orthology relationships of genes between species and outputs optimized modules that are fundamentally cross-species, which can either be conserved or species-specific.Abstract:
Increasingly, high-dimensional genomics data are becoming available for many organisms.Here, we develop OrthoClust for simultaneously clustering data across multiple species. OrthoClust is a computational framework that integrates the co-association networks of individual species by utilizing the orthology relationships of genes between species. It outputs optimized modules that are fundamentally cross-species, which can either be conserved or species-specific. We demonstrate the application of OrthoClust using the RNA-Seq expression profiles of Caenorhabditis elegans and Drosophila melanogaster from the modENCODE consortium. A potential application of cross-species modules is to infer putative analogous functions of uncharacterized elements like non-coding RNAs based on guilt-by-association.read more
Citations
More filters
Journal ArticleDOI
Comparative analysis of the transcriptome across distant species
Mark Gerstein,Joel Rozowsky,Koon-Kiu Yan,Daifeng Wang,Chao Cheng,James B. Brown,James B. Brown,Carrie A. Davis,LaDeana W. Hillier,Cristina Sisu,Jingyi Jessica Li,Jingyi Jessica Li,Baikang Pei,Arif Harmanci,Michael O. Duff,Sarah Djebali,Roger P. Alexander,Burak H. Alver,Raymond K. Auerbach,Kimberly Bell,Peter J. Bickel,Max E. Boeck,Nathan Boley,Nathan Boley,Benjamin W. Booth,Lucy Cherbas,Peter Cherbas,Chao Di,Alexander Dobin,Jorg Drenkow,Brent Ewing,Gang Fang,Megan Fastuca,Elise A. Feingold,Adam Frankish,Guanjun Gao,Peter J. Good,Roderic Guigó,Ann S. Hammonds,Jen Harrow,Roger A. Hoskins,Cédric Howald,Cédric Howald,Long Hu,Haiyan Huang,Tim Hubbard,Tim Hubbard,Chau Huynh,Sonali Jha,Dionna M. Kasper,Masaomi Kato,Thomas C. Kaufman,Robert R. Kitchen,Erik Ladewig,Julien Lagarde,Eric C. Lai,Jing Leng,Zhi Lu,Michael J. MacCoss,Gemma E. May,Gemma E. May,Rebecca McWhirter,Gennifer E. Merrihew,David M. Miller,Ali Mortazavi,Rabi Murad,Brian Oliver,Sara Olson,Peter J. Park,Michael J. Pazin,Norbert Perrimon,Norbert Perrimon,Dmitri D. Pervouchine,Valerie Reinke,Alexandre Reymond,Garrett Robinson,Anastasia Samsonova,Anastasia Samsonova,Gary Saunders,Gary Saunders,Felix Schlesinger,Anurag Sethi,Frank J. Slack,William C. Spencer,Marcus H. Stoiber,Marcus H. Stoiber,Pnina Strasbourger,Andrea Tanzer,Andrea Tanzer,Owen Thompson,Kenneth H. Wan,Guilin Wang,Huaien Wang,Kathie L. Watkins,Jiayu Wen,Kejia Wen,Chenghai Xue,Li Yang,Li Yang,Kevin Y. Yip,Chris Zaleski,Yan Zhang,Henry Zheng,Steven E. Brenner,Brenton R. Graveley,Susan E. Celniker,Thomas R. Gingeras,Robert H. Waterston +107 more
TL;DR: It is found in all three organisms that the gene-expression levels, both coding and non-coding, can be quantitatively predicted from chromatin features at the promoter using a ‘universal model’ based on a single set of organism-independent parameters.
Journal Article
The yeast coexpression network has a small-world, scale-free architecture and can be explained by a simple model
TL;DR: A new model is derived based on the observation that there is a positive correlation between the sequence similarity of paralogues and their probability of coexpression or sharing of transcription factor binding sites (TFBSs) that reproduces the scale‐free, small‐world architecture of the coregulation network and the homology relations between coregulated genes without the need for selection.
Comparative analysis of regulatory information and circuits across distant species
Alan P. Boyle,Carlos L. Araya,Cathleen M. Brdlik,Philip Cayting,Chao Cheng,Yong Cheng,Kathryn E. Gardner,LaDeana W. Hillier,J. Janette,Lixia Jiang,Dionna M. Kasper,Trupti Kawli,Pouya Kheradpour,Anshul Kundaje,Anshul Kundaje,Jingyi Jessica Li,Jingyi Jessica Li,Lijia Ma,Wei Niu,E. Jay Rehm,Joel Rozowsky,Matthew Slattery,Rebecca Spokony,Robert Terrell,D. Vafeados,Daifeng Wang,Peter Weisdepp,Yi-Chieh Wu,Dan Xie,Koon-Kiu Yan,Elise A. Feingold,Peter J. Good,Michael J. Pazin,Haiyan Huang,Peter J. Bickel,Steven E. Brenner,Valerie Reinke,Robert H. Waterston,Mark Gerstein,Kevin P. White,Manolis Kellis,Michael Snyder +41 more
TL;DR: In this article, the genome-wide binding locations of 165 human, 93 worm and 52 fly transcription regulatory factors were mapped for a total of 1,019 data sets from diverse cell types, developmental stages, or conditions in the three species, of which 498 (48.9%) are presented here for the first time.
Journal ArticleDOI
Discerning molecular interactions: A comprehensive review on biomolecular interaction databases and network analysis tools.
TL;DR: The distinctive attribute mentioned in this review is not only to provide an overview of tools and web servers for gene and protein-protein interaction (PPI) network analysis but also to extract useful and meaningful information from the interaction networks.
Journal ArticleDOI
Mapping single-cell atlases throughout Metazoa unravels cell type evolution
Alexander J. Tarashansky,Jacob M. Musser,Margarita Khariton,Pengyang Li,Detlev Arendt,Stephen R. Quake,Bo Wang +6 more
TL;DR: In this paper, a self-assembling manifold (SAM) algorithm is used to robustly reconstruct manifolds from single-cell data and to identify homologous cell types with shared expression programs across distant species within phyla.
References
More filters
Journal ArticleDOI
Cluster analysis and display of genome-wide expression patterns
TL;DR: A system of cluster analysis for genome-wide expression data from DNA microarray hybridization is described that uses standard statistical algorithms to arrange genes according to similarity in pattern of gene expression, finding in the budding yeast Saccharomyces cerevisiae that clustering gene expression data groups together efficiently genes of known similar function.
Journal ArticleDOI
WGCNA: an R package for weighted correlation network analysis.
Peter Langfelder,Steve Horvath +1 more
TL;DR: The WGCNA R software package is a comprehensive collection of R functions for performing various aspects of weighted correlation network analysis that includes functions for network construction, module detection, gene selection, calculations of topological properties, data simulation, visualization, and interfacing with external software.
Journal ArticleDOI
Fast unfolding of communities in large networks
Vincent D. Blondel,Jean-Loup Guillaume,Jean-Loup Guillaume,Renaud Lambiotte,Renaud Lambiotte,Etienne Lefebvre +5 more
TL;DR: This work proposes a heuristic method that is shown to outperform all other known community detection methods in terms of computation time and the quality of the communities detected is very good, as measured by the so-called modularity.
Journal ArticleDOI
Mapping and quantifying mammalian transcriptomes by RNA-Seq.
TL;DR: Although >90% of uniquely mapped reads fell within known exons, the remaining data suggest new and revised gene models, including changed or additional promoters, exons and 3′ untranscribed regions, as well as new candidate microRNA precursors.
Journal ArticleDOI
RNA-Seq: a revolutionary tool for transcriptomics
TL;DR: The RNA-Seq approach to transcriptome profiling that uses deep-sequencing technologies provides a far more precise measurement of levels of transcripts and their isoforms than other methods.
Related Papers (5)
WGCNA: an R package for weighted correlation network analysis.
Peter Langfelder,Steve Horvath +1 more