HTSeq—a Python framework to work with high-throughput sequencing data
TLDR
This work presents HTSeq, a Python library to facilitate the rapid development of custom scripts for high-throughput sequencing data analysis, and presents htseq-count, a tool developed with HTSequ that preprocesses RNA-Seq data for differential expression analysis by counting the overlap of reads with genes.Abstract:
Motivation: A large choice of tools exists for many standard tasks in the analysis of high-throughput sequencing (HTS) data. However, once a project deviates from standard workflows, custom scripts are needed. Results: We present HTSeq, a Python library to facilitate the rapid development of such scripts. HTSeq offers parsers for many common data formats in HTS projects, as well as classes to represent data, such as genomic coordinates, sequences, sequencing reads, alignments, gene model information and variant calls, and provides data structures that allow for querying via genomic coordinates. We also present htseq-count, a tool developed with HTSeq that preprocesses RNA-Seq data for differential expression analysis by counting the overlap of reads with genes. Availability and implementation: HTSeq is released as an opensource software under the GNU General Public Licence and available from http://www-huber.embl.de/HTSeq or from the Python Package Index at https://pypi.python.org/pypi/HTSeq. Contact: sanders@fs.tum.deread more
Citations
More filters
Journal ArticleDOI
Tet2 promotes pathogen infection-induced myelopoiesis through mRNA oxidation
Qicong Shen,Qian Zhang,Qian Zhang,Yang Shi,Qingzhu Shi,Yanyan Jiang,Yan Gu,Zhiqing Li,Xia Li,Kai Zhao,Chunmei Wang,Nan Li,Xuetao Cao,Xuetao Cao +13 more
TL;DR: It is demonstrated that Tet2 promotes infection-induced myelopoiesis in an mRNA oxidation-dependent manner through Adar1-mediated repression of Socs3 expression at the post-transcription level, revealing a previously unknown regulatory role of Tet2 at the epitranscriptomic level.
Journal ArticleDOI
Functional Dynamics of the Gut Microbiome in Elderly People during Probiotic Consumption
Emiley A. Eloe-Fadrosh,Arthur Brady,Jonathan Crabtree,Elliott F. Drabek,Bing Ma,Anup Mahurkar,Jacques Ravel,Miriam Haverkamp,Anne-Maria Fiorino,Christine Botelho,Irina Andreyeva,Patricia L. Hibberd,Claire M. Fraser +12 more
TL;DR: The results provide evidence for the discrete functional effects imparted by a specific single-organism probiotic and challenge the prevailing notion that probiotics substantially modify the resident microbiota within nondiseased individuals in an appreciable fashion.
Journal ArticleDOI
Survival and biomarker analyses from the OpACIN-neo and OpACIN neoadjuvant immunotherapy trials in stage III melanoma
Elisa A. Rozeman,Esmée P. Hoefsmit,Irene L.M. Reijers,Robyn P. M. Saw,Robyn P. M. Saw,Robyn P. M. Saw,Judith M. Versluis,Oscar Krijgsman,Petros Dimitriadis,Karolina Sikorska,B.A. van de Wiel,Hanna Eriksson,Hanna Eriksson,María Jesús González González,A A Torres Acosta,Lindsay G Grijpink-Ongering,Kerwin F. Shannon,Kerwin F. Shannon,J.B.A.G. Haanen,J Stretch,J Stretch,J Stretch,Sydney Ch'ng,Sydney Ch'ng,Sydney Ch'ng,Omgo E. Nieweg,Omgo E. Nieweg,Omgo E. Nieweg,H. Mallo,Sandra Adriaansz,Ron M. Kerkhoven,Sten Cornelissen,Annegien Broeks,Willem M.C. Klop,Charlotte L. Zuur,W.J. van Houdt,Daniel S. Peeper,Andrew J. Spillane,Andrew J. Spillane,Andrew J. Spillane,A.C.J. van Akkooi,Richard A. Scolyer,Richard A. Scolyer,Ton N. Schumacher,A.M. Menzies,A.M. Menzies,Georgina V. Long,Georgina V. Long,Christian U. Blank +48 more
TL;DR: In the OpACIN-neo study, the 2-year estimated relapse-free survival was 84% for all patients, 97% for patients achieving a pathologic response and 36% for nonresponders as discussed by the authors.
Journal ArticleDOI
Hedgehog signaling promotes basal progenitor expansion and the growth and folding of the neocortex.
TL;DR: It is found that constitutively active Sonic hedgehog (Shh) signaling expanded bRGs and IPCs and induced folding in the otherwise smooth mouse neocortex, whereas the loss of Shh signaling decreased the number of bRGS and I PCs and the size of the neocortex.
Journal ArticleDOI
Haplotype-phased genome and evolution of phytonutrient pathways of tetraploid blueberry.
Marivi Colle,Courtney P. Leisner,Ching Man Wai,Shujun Ou,Kevin A. Bird,Jie Wang,Jennifer H. Wisecaver,Alan E. Yocca,Elizabeth I. Alger,Haibao Tang,Zhiyong Xiong,Pete Callow,Gil Ben-Zvi,Avital Brodt,Kobi Baruch,Thomas Swale,Lily Shiue,Guo Qing Song,Kevin L. Childs,Anthony L. Schilmiller,Nicholi Vorsa,C. Robin Buell,Robert VanBuren,Ning Jiang,Patrick P. Edger +24 more
TL;DR: Genome-wide analyses revealed that both polyploidy and tandem gene duplications modified various pathways involved in the biosynthesis of key phytonutrients in highbush blueberry.
References
More filters
Journal ArticleDOI
Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2
TL;DR: This work presents DESeq2, a method for differential analysis of count data, using shrinkage estimation for dispersions and fold changes to improve stability and interpretability of estimates, which enables a more quantitative analysis focused on the strength rather than the mere presence of differential expression.
Journal ArticleDOI
The Sequence Alignment/Map format and SAMtools
Heng Li,Bob Handsaker,Alec Wysoker,T. J. Fennell,Jue Ruan,Nils Homer,Gabor T. Marth,Gonçalo R. Abecasis,Richard Durbin +8 more
TL;DR: SAMtools as discussed by the authors implements various utilities for post-processing alignments in the SAM format, such as indexing, variant caller and alignment viewer, and thus provides universal tools for processing read alignments.
Journal ArticleDOI
Trimmomatic: a flexible trimmer for Illumina sequence data
TL;DR: Timmomatic is developed as a more flexible and efficient preprocessing tool, which could correctly handle paired-end data and is shown to produce output that is at least competitive with, and in many cases superior to, that produced by other tools, in all scenarios tested.
Journal ArticleDOI
edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.
TL;DR: EdgeR as mentioned in this paper is a Bioconductor software package for examining differential expression of replicated count data, which uses an overdispersed Poisson model to account for both biological and technical variability and empirical Bayes methods are used to moderate the degree of overdispersion across transcripts, improving the reliability of inference.
Journal ArticleDOI
BEDTools: a flexible suite of utilities for comparing genomic features
Aaron R. Quinlan,Ira M. Hall +1 more
TL;DR: A new software suite for the comparison, manipulation and annotation of genomic features in Browser Extensible Data (BED) and General Feature Format (GFF) format, which allows the user to compare large datasets (e.g. next-generation sequencing data) with both public and custom genome annotation tracks.