HTSeq—a Python framework to work with high-throughput sequencing data
TLDR
This work presents HTSeq, a Python library to facilitate the rapid development of custom scripts for high-throughput sequencing data analysis, and htseq-count, a tool developed with HTSeq that preprocesses RNA-Seq data for differential expression analysis by counting the overlap of reads with genes.
Abstract:
Motivation: A large choice of tools exists for many standard tasks in the analysis of high-throughput sequencing (HTS) data. However, once a project deviates from standard workflows, custom scripts are needed.
Results: We present HTSeq, a Python library to facilitate the rapid development of such scripts. HTSeq offers parsers for many common data formats in HTS projects, as well as classes to represent data, such as genomic coordinates, sequences, sequencing reads, alignments, gene model information and variant calls, and provides data structures that allow for querying via genomic coordinates. We also present htseq-count, a tool developed with HTSeq that preprocesses RNA-Seq data for differential expression analysis by counting the overlap of reads with genes.
Availability and implementation: HTSeq is released as open-source software under the GNU General Public Licence and is available from http://www-huber.embl.de/HTSeq or from the Python Package Index at https://pypi.python.org/pypi/HTSeq.
Contact: sanders@fs.tum.de
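The idea of counting the overlap of reads with genes can be illustrated with a minimal pure-Python sketch. It does not use HTSeq and is not htseq-count's actual algorithm: the `Interval` tuple, the `overlaps` and `count_reads` helpers, the toy coordinates and the `__no_feature` / `__ambiguous` labels are all assumptions made for illustration. Reads that overlap exactly one gene are counted for that gene; reads that overlap no gene or several genes are tallied separately.

```python
from collections import Counter, namedtuple

# Hypothetical stand-in for a genomic interval (0-based, half-open coordinates).
# HTSeq provides its own classes for this; none of them are used here.
Interval = namedtuple("Interval", ["chrom", "start", "end", "strand"])


def overlaps(a, b):
    """Return True if a and b share at least one base on the same chromosome and strand."""
    return (a.chrom == b.chrom and a.strand == b.strand
            and a.start < b.end and b.start < a.end)


def count_reads(genes, reads):
    """Count reads per gene.

    genes maps a gene ID to a list of exon Intervals; reads is a list of Intervals.
    Reads overlapping no gene or more than one gene go to special categories.
    """
    counts = Counter({gene_id: 0 for gene_id in genes})
    for read in reads:
        hits = {gene_id for gene_id, exons in genes.items()
                if any(overlaps(read, exon) for exon in exons)}
        if len(hits) == 1:
            counts[hits.pop()] += 1
        elif not hits:
            counts["__no_feature"] += 1   # illustrative label
        else:
            counts["__ambiguous"] += 1    # illustrative label
    return counts


# Toy annotation: geneA has two exons, geneB overlaps the second exon of geneA.
genes = {
    "geneA": [Interval("chr1", 100, 200, "+"), Interval("chr1", 300, 400, "+")],
    "geneB": [Interval("chr1", 350, 500, "+")],
}
reads = [
    Interval("chr1", 150, 186, "+"),  # inside geneA exon 1
    Interval("chr1", 360, 396, "+"),  # overlaps both geneA and geneB
    Interval("chr1", 450, 486, "+"),  # inside geneB only
    Interval("chr1", 600, 636, "+"),  # outside every gene
    Interval("chr1", 150, 186, "-"),  # right place, opposite strand
]
counts = count_reads(genes, reads)
print(dict(counts))
```

This sketch scans every gene for every read, which is only practical for toy inputs. A coordinate-indexed data structure of the kind the abstract describes would replace that inner scan with a lookup by position.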
Citations
Journal Article
Big Data Analytics: A Review on Theoretical Contributions and Tools Used in Literature
Purva Grover, Arpan Kumar Kar
TL;DR: Most existing research on big data was found to focus on the consumer discretionary sector, followed by public administration; these studies paid little attention to the tools used for the analysis, a gap this review sets out to address.
Journal Article
The allotetraploid origin and asymmetrical genome evolution of the common carp Cyprinus carpio
Peng Xu, Jian Xu, Guangjian Liu, Lin Chen, Zhixiong Zhou, Wenzhu Peng, Yanliang Jiang, Zixia Zhao, Zhiying Jia, Yonghua Sun, Yidi Wu, Baohua Chen, Fei Pu, Jianxin Feng, Jing Luo, Jing Chai, Hanyuan Zhang, Hui Wang, Hui Wang, Chuanju Dong, Wenkai Jiang, Xiaowen Sun
TL;DR: Gene expression bias is found across surveyed tissues, with subgenome B more dominant in homoeologous expression, and CG methylation in promoter regions may play an important role in altering gene expression in allotetraploid C. carpio.
Journal Article
Genome-wide Association Study of Intraocular Pressure Uncovers New Pathways to Glaucoma
Stuart MacGregor, Jue-Sheng Ong, Jiyuan An, Xikun Han, Tiger Zhou, Owen M. Siggs, Matthew Law, Emmanuelle Souzeau, Shiwani Sharma, David J. Lynn, Jonathan Beesley, Bronwyn Sheldrick, Richard A. Mills, John Landers, Jonathan B Ruddle, Stuart L. Graham, Paul R. Healey, Andrew White, Robert J Casson, Stephen Best, John R. Grigg, Ivan Goldberg, Joseph E. Powell, David C. Whiteman, Graham L. Radford-Smith, Nicholas G. Martin, Grant W. Montgomery, Kathryn P. Burdon, David A. Mackey, Puya Gharahkhani, Jamie E Craig, Alex W. Hewitt
TL;DR: A combined analysis of participants from the UK Biobank and the International Glaucoma Genetic Consortium identifies 85 new loci for intraocular pressure (IOP), and pathway analysis uncovers new pathways associated with both IOP and glaucoma.
Journal Article
Distinct modes of mitochondrial metabolism uncouple T cell differentiation and function
Will Bailis, Justin A. Shyer, Jun Zhao, Juan Carlos Garcia Canaveras, Fatimah J. Al Khazal, Rihao Qu, Holly R. Steach, Piotr Bielecki, Omair Khan, Ruaidhri Jackson, Yuval Kluger, Louis J. Maher, Joshua D. Rabinowitz, Joe Craft, Richard A. Flavell
TL;DR: Genetic, pharmacological and metabolomics experiments reveal that the malate–aspartate shuttle and mitochondrial citrate export support the differentiation of mouse T helper 1 cells, whereas succinate dehydrogenase enforces their terminal effector function, and suggest that transcriptional programming acts together with a parallel biochemical network to enforce cell state.
Journal Article
Notch Signaling Facilitates In Vitro Generation of Cross-Presenting Classical Dendritic Cells
Margaret E. Kirkling, Urszula Cytlak, Colleen M. Lau, Kanako L. Lewis, Anastasia Resteu, Alireza Khodadadi-Jamayran, Christian W. Siebel, Hélène Salmon, Miriam Merad, Aristotelis Tsirigos, Matthew Collin, Venetia Bigley, Boris Reizis
TL;DR: It is reported that OP9 stromal cells expressing the Notch ligand Delta-like 1 (OP9-DL1) optimize FLT3L-driven development of cDC1s from murine immortalized progenitors and primary bone marrow cells, and that Notch signaling optimizes cDC generation in vitro and yields authentic cDC1s for functional studies and translational applications.
References
Journal Article
Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2
TL;DR: This work presents DESeq2, a method for differential analysis of count data, using shrinkage estimation for dispersions and fold changes to improve stability and interpretability of estimates, which enables a more quantitative analysis focused on the strength rather than the mere presence of differential expression.
Journal Article
The Sequence Alignment/Map format and SAMtools
Heng Li, Bob Handsaker, Alec Wysoker, T. J. Fennell, Jue Ruan, Nils Homer, Gabor T. Marth, Gonçalo R. Abecasis, Richard Durbin
TL;DR: SAMtools implements various utilities for post-processing alignments in the SAM format, such as indexing, a variant caller and an alignment viewer, and thus provides universal tools for processing read alignments.
Journal Article
Trimmomatic: a flexible trimmer for Illumina sequence data
TL;DR: Trimmomatic is developed as a more flexible and efficient preprocessing tool that can correctly handle paired-end data, and it is shown to produce output that is at least competitive with, and in many cases superior to, that produced by other tools in all scenarios tested.
Journal Article
edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.
TL;DR: edgeR is a Bioconductor software package for examining differential expression of replicated count data. It uses an overdispersed Poisson model to account for both biological and technical variability, and empirical Bayes methods to moderate the degree of overdispersion across transcripts, improving the reliability of inference.
Journal Article
BEDTools: a flexible suite of utilities for comparing genomic features
Aaron R. Quinlan, Ira M. Hall
TL;DR: BEDTools is a new software suite for the comparison, manipulation and annotation of genomic features in Browser Extensible Data (BED) and General Feature Format (GFF) format, which allows the user to compare large datasets (e.g. next-generation sequencing data) with both public and custom genome annotation tracks.