RobiNA: a user-friendly, integrated software solution for RNA-Seq-based transcriptomics

doi:10.1093/NAR/GKS540

Open Access · Journal Article

Marc Lohse, +6 more

- 01 Jul 2012 -

Nucleic Acids Research

- Vol. 40, pp 622-627

TLDR

RobiNA is an integrated solution that consolidates all steps of RNA-Seq-based differential gene-expression analysis in one user-friendly, cross-platform application with a rich graphical user interface. It supports quality checking, flexible filtering and statistical analysis of differential gene expression based on state-of-the-art biostatistical methods developed in the R/Bioconductor projects.

Abstract:

Recent rapid advances in next-generation RNA sequencing (RNA-Seq)-based techniques provide researchers with unprecedentedly large data sets and open new perspectives in transcriptomics. Furthermore, RNA-Seq-based transcript profiling can be applied to non-model and newly discovered organisms because it does not require a predefined measuring platform (such as microarrays). However, these novel technologies pose new challenges: the raw data need to be rigorously quality checked and filtered prior to analysis, and proper statistical methods have to be applied to extract biologically relevant information. Given the sheer volume of data, this is no trivial task and requires considerable technical resources along with bioinformatics expertise. To aid the individual researcher, we have developed RobiNA as an integrated solution that consolidates all steps of RNA-Seq-based differential gene-expression analysis in one user-friendly cross-platform application featuring a rich graphical user interface. RobiNA accepts raw FastQ files, SAM/BAM alignment files and counts tables as input. It supports quality checking, flexible filtering and statistical analysis of differential gene expression based on state-of-the-art biostatistical methods developed in the R/Bioconductor projects. In-line help and a step-by-step manual guide users through the analysis. Installer packages for Mac OS X, Windows and Linux are available under the LGPL licence from http://mapman.gabipd.org/web/guest/robin.
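The quality-filtering step that the abstract describes for raw FastQ input can be illustrated with a minimal sketch. It is not RobiNA's implementation: the function names, the Phred+33 quality encoding, and the thresholds (mean quality of at least 20, length of at least 20 bases) are all assumptions chosen for illustration.

```python
from io import StringIO
from typing import Iterable, Iterator, TextIO, Tuple

# (header, sequence, quality string); hypothetical record type for this sketch
FastqRecord = Tuple[str, str, str]


def read_fastq(handle: TextIO) -> Iterator[FastqRecord]:
    """Yield records from a FASTQ stream that uses four lines per record."""
    while True:
        header = handle.readline().rstrip("\n")
        if not header:
            return  # end of input (or a blank line)
        seq = handle.readline().rstrip("\n")
        handle.readline()  # skip the '+' separator line
        qual = handle.readline().rstrip("\n")
        yield header, seq, qual


def mean_phred(qual: str, offset: int = 33) -> float:
    """Mean Phred score of a quality string (Phred+33 encoding assumed)."""
    if not qual:
        return 0.0
    return sum(ord(c) - offset for c in qual) / len(qual)


def filter_reads(
    records: Iterable[FastqRecord],
    min_mean_quality: float = 20.0,  # assumed threshold
    min_length: int = 20,            # assumed threshold
) -> Iterator[FastqRecord]:
    """Keep reads that are long enough and have a high enough mean quality."""
    for header, seq, qual in records:
        if len(seq) >= min_length and mean_phred(qual) >= min_mean_quality:
            yield header, seq, qual


if __name__ == "__main__":
    demo = (
        "@good\n" + "A" * 25 + "\n+\n" + "I" * 25 + "\n"
        "@poor\n" + "C" * 25 + "\n+\n" + "#" * 25 + "\n"
    )
    for header, _, _ in filter_reads(read_fastq(StringIO(demo))):
        print(header)
```

Filtering on a per-read mean is only one possible criterion; windowed trimming or adapter removal would follow the same generator pattern.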

Citations

HTSeq—a Python framework to work with high-throughput sequencing data

Simon Anders, +2 more

- 15 Jan 2015 -

Bioinformatics

TL;DR: This work presents HTSeq, a Python library to facilitate the rapid development of custom scripts for high-throughput sequencing data analysis, along with htseq-count, a tool developed with HTSeq that preprocesses RNA-Seq data for differential expression analysis by counting the overlap of reads with genes.

De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis

Brian J. Haas, +24 more

- 01 Aug 2013 -

Nature Protocols

TL;DR: This protocol provides a workflow for genome-independent transcriptome analysis leveraging the Trinity platform and presents Trinity-supported companion utilities for downstream applications, including RSEM for transcript abundance estimation, R/Bioconductor packages for identifying differentially expressed transcripts across samples and approaches to identify protein-coding genes.

N6-methyladenosine-dependent RNA structural switches regulate RNA–protein interactions

Nian Liu, +5 more

- 26 Feb 2015 -

Nature

TL;DR: It is found that m6A alters the local structure in mRNA and long non-coding RNA (lncRNA) to facilitate binding of heterogeneous nuclear ribonucleoprotein C (HNRNPC), an abundant nuclear RNA-binding protein responsible for pre-mRNA processing.

SOAPnuke: a MapReduce acceleration-supported software for integrated quality control and preprocessing of high-throughput sequencing data.

Chen Yuxin, +14 more

- 01 Jan 2018 -

GigaScience

TL;DR: SOAPnuke is presented as a tool with abundant functions for a “QC-Preprocess-QC” workflow, together with a MapReduce acceleration framework that scales by distributing all processing work across an entire compute cluster.

Skewer: a fast and accurate adapter trimmer for next-generation sequencing paired-end reads.

Hongshan Jiang, +3 more

- 12 Jun 2014 -

BMC Bioinformatics

TL;DR: A novel bit-masked k-difference matching algorithm is devised that runs in O(kn) expected time with O(m) space, where k is the maximum number of differences allowed, n is the read length, and m is the adapter length. It achieves as yet unmatched accuracy for adapter trimming with a low time bound.
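The k-difference matching problem named in the Skewer entry above can be illustrated with the classic dynamic-programming formulation. This is a plain O(mn) sketch, not the bit-masked algorithm from the paper; the function name `find_adapter` and the example sequences are hypothetical.

```python
from typing import Optional, Tuple


def find_adapter(read: str, adapter: str, k: int) -> Optional[Tuple[int, int, int]]:
    """Find the adapter occurrence in `read` with the fewest differences.

    Returns (start, end, differences) for the best occurrence with at most
    k differences (substitutions, insertions, deletions), where read[start:end]
    is the matched span, or None if no such occurrence exists.
    """
    m = len(adapter)
    # prev[i] = (cost, start) for aligning adapter[:i] against a read
    # substring that ends at the previous read position.
    prev = [(i, 0) for i in range(m + 1)]
    best: Optional[Tuple[int, int, int]] = None
    for j in range(1, len(read) + 1):
        # An occurrence may begin at any read position at no cost.
        cur = [(0, j)]
        for i in range(1, m + 1):
            mismatch = adapter[i - 1] != read[j - 1]
            cur.append(min(
                (prev[i - 1][0] + mismatch, prev[i - 1][1]),  # match or substitution
                (prev[i][0] + 1, prev[i][1]),                 # extra base in the read
                (cur[i - 1][0] + 1, cur[i - 1][1]),           # base missing from the read
            ))
        cost, start = cur[m]
        if cost <= k and (best is None or cost < best[2]):
            best = (start, j, cost)
        prev = cur
    return best


if __name__ == "__main__":
    adapter = "AGATCGGAAGAGC"           # example adapter sequence (assumption)
    read = "ACGTACGTTT" + adapter
    hit = find_adapter(read, adapter, k=1)
    if hit is not None:
        start, _, _ = hit
        print(read[:start])             # read with the adapter trimmed off
```

A production trimmer would also handle adapters truncated at the 3' end of the read and paired-end overlap, which this sketch ignores.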

References

edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.

Mark D. Robinson, +2 more

- 01 Jan 2010 -

Bioinformatics

TL;DR: edgeR is a Bioconductor software package for examining differential expression of replicated count data. It uses an overdispersed Poisson model to account for both biological and technical variability, and empirical Bayes methods to moderate the degree of overdispersion across transcripts, improving the reliability of inference.

Ultrafast and memory-efficient alignment of short DNA sequences to the human genome

Ben Langmead, +3 more

- 04 Mar 2009 -

Genome Biology

TL;DR: Bowtie extends previous Burrows-Wheeler techniques with a novel quality-aware backtracking algorithm that permits mismatches. Multiple processor cores can be used simultaneously to achieve even greater alignment speeds.

Differential expression analysis for sequence count data.

Simon Anders, +1 more

- 27 Oct 2010 -

Genome Biology

TL;DR: A method based on the negative binomial distribution, with variance and mean linked by local regression, is proposed and an implementation, DESeq, as an R/Bioconductor package is presented.

Bioconductor: open software development for computational biology and bioinformatics

Robert Gentleman, +24 more

- 15 Sep 2004 -

Genome Biology

TL;DR: Details of the aims and methods of Bioconductor, the collaborative creation of extensible software for computational biology and bioinformatics, and current challenges are described.

Database resources of the National Center for Biotechnology Information

David L. Wheeler, +12 more

- 01 Jan 2004 -

Nucleic Acids Research

TL;DR: In addition to maintaining the GenBank® nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides data analysis and retrieval resources for the data in GenBank and other biological data made available through NCBI’s website.
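The edgeR and DESeq entries in this reference list both model read counts as overdispersed relative to a Poisson distribution. A minimal sketch of that idea follows, using the common negative binomial parameterisation Var = mu + dispersion * mu^2 and a naive method-of-moments dispersion estimate. This simple estimator is an assumption made for illustration; it is neither edgeR's empirical Bayes moderation nor DESeq's local regression, and the function names are hypothetical.

```python
from statistics import mean, variance
from typing import Sequence


def nb_variance(mu: float, dispersion: float) -> float:
    """Variance of a negative binomial count with the given mean and dispersion.

    With dispersion == 0 this reduces to the Poisson case (variance == mean).
    """
    return mu + dispersion * mu ** 2


def moment_dispersion(counts: Sequence[float]) -> float:
    """Naive per-gene dispersion estimate from replicate counts.

    Solves sample_variance = mu + dispersion * mu**2 for the dispersion and
    clips negative estimates to zero. Needs at least two replicates.
    """
    mu = mean(counts)
    if mu <= 0:
        return 0.0
    excess = variance(counts) - mu  # variance beyond the Poisson expectation
    return max(excess, 0.0) / mu ** 2


if __name__ == "__main__":
    replicates = [80, 100, 120]  # made-up counts for one gene
    phi = moment_dispersion(replicates)
    print(phi, nb_variance(mean(replicates), phi))
```

With only a handful of replicates, per-gene estimates like this one are very noisy, which is the motivation for sharing information across genes in the packages listed above.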

Related Papers (5)

The Sequence Alignment/Map format and SAMtools

Heng Li, +8 more

- 01 Aug 2009 -

Bioinformatics

Fast and accurate short read alignment with Burrows–Wheeler transform

Heng Li, +1 more

- 01 Jul 2009 -

Bioinformatics

edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.

Mark D. Robinson, +2 more

- 01 Jan 2010 -

Bioinformatics

Fast gapped-read alignment with Bowtie 2

Ben Langmead, +3 more

- 01 Apr 2012 -

Nature Methods

Full-length transcriptome assembly from RNA-Seq data without a reference genome.
