Comparative analysis of the transcriptome across distant species
Mark Gerstein,Joel Rozowsky,Koon-Kiu Yan,Daifeng Wang,Chao Cheng,James B. Brown,James B. Brown,Carrie A. Davis,LaDeana W. Hillier,Cristina Sisu,Jingyi Jessica Li,Jingyi Jessica Li,Baikang Pei,Arif Harmanci,Michael O. Duff,Sarah Djebali,Roger P. Alexander,Burak H. Alver,Raymond K. Auerbach,Kimberly Bell,Peter J. Bickel,Max E. Boeck,Nathan Boley,Nathan Boley,Benjamin W. Booth,Lucy Cherbas,Peter Cherbas,Chao Di,Alexander Dobin,Jorg Drenkow,Brent Ewing,Gang Fang,Megan Fastuca,Elise A. Feingold,Adam Frankish,Guanjun Gao,Peter J. Good,Roderic Guigó,Ann S. Hammonds,Jen Harrow,Roger A. Hoskins,Cédric Howald,Cédric Howald,Long Hu,Haiyan Huang,Tim Hubbard,Tim Hubbard,Chau Huynh,Sonali Jha,Dionna M. Kasper,Masaomi Kato,Thomas C. Kaufman,Robert R. Kitchen,Erik Ladewig,Julien Lagarde,Eric C. Lai,Jing Leng,Zhi Lu,Michael J. MacCoss,Gemma E. May,Gemma E. May,Rebecca McWhirter,Gennifer E. Merrihew,David M. Miller,Ali Mortazavi,Rabi Murad,Brian Oliver,Sara Olson,Peter J. Park,Michael J. Pazin,Norbert Perrimon,Norbert Perrimon,Dmitri D. Pervouchine,Valerie Reinke,Alexandre Reymond,Garrett Robinson,Anastasia Samsonova,Anastasia Samsonova,Gary Saunders,Gary Saunders,Felix Schlesinger,Anurag Sethi,Frank J. Slack,William C. Spencer,Marcus H. Stoiber,Marcus H. Stoiber,Pnina Strasbourger,Andrea Tanzer,Andrea Tanzer,Owen Thompson,Kenneth H. Wan,Guilin Wang,Huaien Wang,Kathie L. Watkins,Jiayu Wen,Kejia Wen,Chenghai Xue,Li Yang,Li Yang,Kevin Y. Yip,Chris Zaleski,Yan Zhang,Henry Zheng,Steven E. Brenner,Brenton R. Graveley,Susan E. Celniker,Thomas R. Gingeras,Robert H. Waterston +107 more
Reads0
Chats0
TLDR
It is found in all three organisms that the gene-expression levels, both coding and non-coding, can be quantitatively predicted from chromatin features at the promoter using a ‘universal model’ based on a single set of organism-independent parameters.Abstract:
The transcriptome is the readout of the genome. Identifying common features in it across distant species can reveal fundamental principles. To this end, the ENCODE and modENCODE consortia have generated large amounts of matched RNA-sequencing data for human, worm and fly. Uniform processing and comprehensive annotation of these data allow comparison across metazoan phyla, extending beyond earlier within-phylum transcriptome comparisons and revealing ancient, conserved features. Specifically, we discover co-expression modules shared across animals, many of which are enriched in developmental genes. Moreover, we use expression patterns to align the stages in worm and fly development and find a novel pairing between worm embryo and fly pupae, in addition to the embryo-to-embryo and larvae-to-larvae pairings. Furthermore, we find that the extent of non-canonical, non-coding transcription is similar in each organism, per base pair. Finally, we find in all three organisms that the gene-expression levels, both coding and non-coding, can be quantitatively predicted from chromatin features at the promoter using a 'universal model' based on a single set of organism-independent parameters.read more
Citations
More filters
Journal ArticleDOI
RNA mis-splicing in disease
TL;DR: An overview of RNA splicing mechanisms is provided followed by a discussion of disease-associated errors, with an emphasis on recently described mutations that have provided new insights into splicing regulation.
Journal ArticleDOI
Araport11: a complete reannotation of the Arabidopsis thaliana reference genome
Chia Yi Cheng,Vivek Krishnakumar,Agnes P. Chan,Françoise Thibaud-Nissen,Seth Schobel,Christopher D. Town +5 more
TL;DR: This updated Arabidopsis genome annotation with a substantially increased resolution of gene models will not only further the understanding of the biological processes of this plant model but also of other species.
Journal ArticleDOI
A copy number variation map of the human genome
TL;DR: An updated CNV map of the human genome is constructed and found approximately 100 genes that can be completely deleted without producing apparent phenotypic consequences, which will aid the interpretation of new CNV findings for both clinical and research applications.
Evolutionary Dynamics of Gene and Isoform Regulation in Mammalian Tissues
TL;DR: For example, this paper found that while tissue-specific gene expression programs are largely conserved, alternative splicing is well conserved in only a subset of tissues and is frequently lineage-specific.
Journal ArticleDOI
Single-cell triple omics sequencing reveals genetic, epigenetic, and transcriptomic heterogeneity in hepatocellular carcinomas
Yu Hou,Huahu Guo,Chen Cao,Xianlong Li,Boqiang Hu,Ping Zhu,Ping Zhu,Xinglong Wu,Xinglong Wu,Lu Wen,Fuchou Tang,Yanyi Huang,Yanyi Huang,Jirun Peng,Jirun Peng +14 more
TL;DR: A single-cell triple omics sequencing technique that can be used to simultaneously analyze the genomic copy-number variations (CNVs), DNA methylome, and transcriptome of an individual mammalian cell is demonstrated and shows that large-scale CNVs cause proportional changes in RNA expression of genes within the gained or lost genomic regions.
References
More filters
Journal ArticleDOI
The Sequence Alignment/Map format and SAMtools
Heng Li,Bob Handsaker,Alec Wysoker,T. J. Fennell,Jue Ruan,Nils Homer,Gabor T. Marth,Gonçalo R. Abecasis,Richard Durbin +8 more
TL;DR: SAMtools as discussed by the authors implements various utilities for post-processing alignments in the SAM format, such as indexing, variant caller and alignment viewer, and thus provides universal tools for processing read alignments.
Journal ArticleDOI
Optimization by Simulated Annealing
TL;DR: There is a deep and useful connection between statistical mechanics and multivariate or combinatorial optimization (finding the minimum of a given function depending on many parameters), and a detailed analogy with annealing in solids provides a framework for optimization of very large and complex systems.
Journal ArticleDOI
STAR: ultrafast universal RNA-seq aligner
Alexander Dobin,Carrie A. Davis,Felix Schlesinger,Jorg Drenkow,Chris Zaleski,Sonali Jha,Philippe Batut,Mark Chaisson,Thomas R. Gingeras +8 more
TL;DR: The Spliced Transcripts Alignment to a Reference (STAR) software based on a previously undescribed RNA-seq alignment algorithm that uses sequential maximum mappable seed search in uncompressed suffix arrays followed by seed clustering and stitching procedure outperforms other aligners by a factor of >50 in mapping speed.
Journal ArticleDOI
edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.
TL;DR: EdgeR as mentioned in this paper is a Bioconductor software package for examining differential expression of replicated count data, which uses an overdispersed Poisson model to account for both biological and technical variability and empirical Bayes methods are used to moderate the degree of overdispersion across transcripts, improving the reliability of inference.
Journal ArticleDOI
Ultrafast and memory-efficient alignment of short DNA sequences to the human genome
TL;DR: Bowtie extends previous Burrows-Wheeler techniques with a novel quality-aware backtracking algorithm that permits mismatches and can be used simultaneously to achieve even greater alignment speeds.
Related Papers (5)
An integrated encyclopedia of DNA elements in the human genome
edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.
WGCNA: an R package for weighted correlation network analysis.
Peter Langfelder,Steve Horvath +1 more