HTSeq—a Python framework to work with high-throughput sequencing data
TLDR
This work presents HTSeq, a Python library that facilitates the rapid development of custom scripts for high-throughput sequencing data analysis, and htseq-count, a tool developed with HTSeq that preprocesses RNA-Seq data for differential expression analysis by counting the overlap of reads with genes.
Abstract:
Motivation: A large choice of tools exists for many standard tasks in the analysis of high-throughput sequencing (HTS) data. However, once a project deviates from standard workflows, custom scripts are needed.
Results: We present HTSeq, a Python library to facilitate the rapid development of such scripts. HTSeq offers parsers for many common data formats in HTS projects, as well as classes to represent data, such as genomic coordinates, sequences, sequencing reads, alignments, gene model information and variant calls, and provides data structures that allow for querying via genomic coordinates. We also present htseq-count, a tool developed with HTSeq that preprocesses RNA-Seq data for differential expression analysis by counting the overlap of reads with genes.
Availability and implementation: HTSeq is released as open-source software under the GNU General Public Licence and is available from http://www-huber.embl.de/HTSeq or from the Python Package Index at https://pypi.python.org/pypi/HTSeq.
Contact: sanders@fs.tum.de
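The idea of counting the overlap of reads with genes can be illustrated with a minimal sketch. It deliberately does not use HTSeq's own classes or the htseq-count implementation: the toy annotation `GENES`, the read list `READS`, and the helper names `overlapping_genes` and `count_reads` are all hypothetical. It assumes 0-based, half-open intervals, ignores strand, and adopts one plausible rule in which a read is counted only when it overlaps exactly one gene.

```python
from collections import Counter

# Hypothetical toy annotation: gene name -> (chromosome, start, end),
# using 0-based, half-open coordinates. Not read from a real GFF/GTF file.
GENES = {
    "geneA": ("chr1", 100, 200),
    "geneB": ("chr1", 150, 300),
    "geneC": ("chr2", 0, 50),
}


def overlapping_genes(chrom, start, end, genes=GENES):
    """Return the names of all genes whose interval overlaps [start, end)."""
    return {
        name
        for name, (g_chrom, g_start, g_end) in genes.items()
        if g_chrom == chrom and start < g_end and g_start < end
    }


def count_reads(reads, genes=GENES):
    """Count reads per gene; reads hitting zero or several genes are set aside."""
    counts = Counter({name: 0 for name in genes})
    for chrom, start, end in reads:
        hits = overlapping_genes(chrom, start, end, genes)
        if len(hits) == 1:
            counts[next(iter(hits))] += 1   # unambiguous assignment
        elif not hits:
            counts["__no_feature"] += 1     # read overlaps no gene
        else:
            counts["__ambiguous"] += 1      # read overlaps more than one gene
    return counts


# Hypothetical aligned reads as (chromosome, start, end).
READS = [
    ("chr1", 100, 140),  # overlaps geneA only
    ("chr1", 160, 190),  # overlaps geneA and geneB
    ("chr1", 250, 290),  # overlaps geneB only
    ("chr2", 60, 90),    # overlaps no gene
    ("chr2", 10, 40),    # overlaps geneC only
]

counts = count_reads(READS)
for name in sorted(counts):
    print(name, counts[name])
```

The linear scan over all genes keeps the sketch short; for real data sets, a structure that can be queried by genomic coordinates, which the abstract says HTSeq provides, would avoid comparing every read against every gene.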
Citations
Journal ArticleDOI
Elimination of senescent cells by β-galactosidase-targeted prodrug attenuates inflammation and restores physical function in aged mice
Cai Yusheng, Zhou Huanhuan, Yinhua Zhu, Qi Sun, Yin Ji, Anqi Xue, Yuting Wang, Wenhan Chen, Xiaojie Yu, Longteng Wang, Han Chen, Cheng Li, Tuoping Luo, Hongkui Deng
TL;DR: It is demonstrated that lysosomal β-gal can be effectively leveraged to selectively eliminate senescent cells, providing a novel strategy to develop anti-aging interventions.
Journal ArticleDOI
Dissecting the Causal Mechanism of X-Linked Dystonia-Parkinsonism by Integrating Genome and Transcriptome Assembly
Tatsiana Aneichyk, William T. Hendriks, Rachita Yadav, David Shin, Dadi Gao, Christine A. Vaine, Ryan L. Collins, Aloysius Domingo, Benjamin Currall, Alexei Stortchevoi, Trisha Multhaupt-Buell, Ellen B. Penney, Lilian Cruz, Jyotsna Dhakal, Harrison Brand, Carrie Hanscom, Caroline Antolik, Marisela E. Dy, Ashok Ragavendran, Jason G. Underwood, Stuart Cantsilieris, Katherine M. Munson, Evan E. Eichler, Patrick Acuna, Criscely L. Go, R. Dominic G. Jamora, Raymond L. Rosales, Deanna M. Church, Stephen R. Williams, Sarah Garcia, Christine Klein, Ulrich Müller, Kirk C. Wilhelmsen, H. T. Marc Timmers, Yechiam Sapir, Brian J. Wainger, Daniel Henderson, Naoto Ito, Neil I. Weisenfeld, David M. Jaffe, Nutan Sharma, Xandra O. Breakefield, Laurie J. Ozelius, D. Cristopher Bragg, Michael E. Talkowski
TL;DR: The integrated genome and transcriptome assembly technologies suggest an SVA-mediated aberrant transcriptional mechanism associated with XDP and may provide a roadmap for layered technologies and integrated assembly-based analyses for other unsolved Mendelian disorders.
Journal ArticleDOI
Expansion of primitive human hematopoietic stem cells by culture in a zwitterionic hydrogel
Tao Bai, Jianqiang Li, Andrew Sinclair, Suzan Imren, Fabiola Merriam, Fang Sun, Mary Beth O'Kelly, Cynthia Nourigat, Priyesh Jain, Jeffrey J. Delrow, Ryan Basom, Hsiang-Chieh Hung, Peng Zhang, Bowen Li, Shelly Heimfeld, Shaoyi Jiang, Colleen Delaney
TL;DR: Using 3D culture of hematopoietic stem and progenitor cells in a degradable zwitterionic hydrogel, substantial expansion of phenotypically primitive CD34+ cord blood and bone-marrow-derived HSPCs is achieved, and this culture system led to a 73-fold increase in long-term hematopoietic stem cell frequency.
Journal ArticleDOI
Proteogenomics of Non-smoking Lung Cancer in East Asia Delineates Molecular Signatures of Pathogenesis and Progression.
Yi-Ju Chen, Theodoros I. Roumeliotis, Ya Hsuan Chang, Ching-Tai Chen, Chia Li Han, Miao-Hsia Lin, Huei-Wen Chen, Gee-Chen Chang, Yih-Leong Chang, Chen-Tu Wu, Mong-Wei Lin, Min Shu Hsieh, Yu Tai Wang, Yet-Ran Chen, Inge Jonassen, Fatemeh Zamanzad Ghavidel, Ze Shiang Lin, Kuen Tyng Lin, Ching Wen Chen, Pei Yuan Sheu, Chen Ting Hung, Ke Chieh Huang, Hao Chin Yang, Pei-Yi Lin, Ta Chi Yen, Yi Wei Lin, Jen-Hung Wang, Lovely Raghav, Chien-Yu Lin, Yan Si Chen, Pei Shan Wu, Chi Ting Lai, Shao Hsing Weng, Kang-Yi Su, Wei Hung Chang, Pang Yan Tsai, Ana I. Robles, Henry Rodriguez, Yi Jing Hsiao, Wen Hsin Chang, Ting-Yi Sung, Jin-Shing Chen, Sung-Liang Yu, Jyoti S. Choudhary, Hsuan-Yu Chen, Pan-Chyr Yang, Yu-Ju Chen
TL;DR: A deep comprehensive proteogenomic study on a prospectively collected cohort in Taiwan, representing early stage, predominantly female, non-smoking lung adenocarcinoma, revealed the cellular remodeling underpinning clinical trajectories and nominated candidate biomarkers for patient stratification and therapeutic intervention.
Journal ArticleDOI
Single-cell RNA-seq ties macrophage polarization to growth rate of intracellular Salmonella
Antoine-Emmanuel Saliba, Lei Li, Alexander J. Westermann, Silke Appenzeller, Daphne A.C. Stapels, Leon N. Schulte, Sophie Helaine, Jörg Vogel
TL;DR: The data suggest that gene expression variability in infected host cells shapes different cellular environments, some of which may favour a growth arrest of Salmonella facilitating immune evasion and the establishment of a long-term niche, while others allow Salmonella to escape intracellular antimicrobial activity and proliferate.
References
Journal ArticleDOI
Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2
TL;DR: This work presents DESeq2, a method for differential analysis of count data, using shrinkage estimation for dispersions and fold changes to improve stability and interpretability of estimates, which enables a more quantitative analysis focused on the strength rather than the mere presence of differential expression.
Journal ArticleDOI
The Sequence Alignment/Map format and SAMtools
Heng Li, Bob Handsaker, Alec Wysoker, T. J. Fennell, Jue Ruan, Nils Homer, Gabor T. Marth, Gonçalo R. Abecasis, Richard Durbin
TL;DR: SAMtools implements various utilities for post-processing alignments in the SAM format, such as indexing, variant caller and alignment viewer, and thus provides universal tools for processing read alignments.
Journal ArticleDOI
Trimmomatic: a flexible trimmer for Illumina sequence data
TL;DR: Trimmomatic is developed as a more flexible and efficient preprocessing tool, which could correctly handle paired-end data and is shown to produce output that is at least competitive with, and in many cases superior to, that produced by other tools, in all scenarios tested.
Journal ArticleDOI
edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.
TL;DR: edgeR is a Bioconductor software package for examining differential expression of replicated count data. It uses an overdispersed Poisson model to account for both biological and technical variability, and empirical Bayes methods to moderate the degree of overdispersion across transcripts, improving the reliability of inference.
Journal ArticleDOI
BEDTools: a flexible suite of utilities for comparing genomic features
Aaron R. Quinlan, Ira M. Hall
TL;DR: A new software suite for the comparison, manipulation and annotation of genomic features in Browser Extensible Data (BED) and General Feature Format (GFF) format, which allows the user to compare large datasets (e.g. next-generation sequencing data) with both public and custom genome annotation tracks.