Journal•ISSN: 1087-0156
Nature Biotechnology
Nature Portfolio
About: Nature Biotechnology is an academic journal. The journal publishes majorly in the area(s): Medicine & Genome. It has an ISSN identifier of 1087-0156. Over the lifetime, 11349 publications have been published receiving 1357247 citations.
Papers published on a yearly basis
Papers
More filters
[...]
TL;DR: The Trinity method for de novo assembly of full-length transcripts and evaluate it on samples from fission yeast, mouse and whitefly, whose reference genome is not yet available, providing a unified solution for transcriptome reconstruction in any sample.
Abstract: Massively parallel sequencing of cDNA has enabled deep and efficient probing of transcriptomes. Current approaches for transcript reconstruction from such data often rely on aligning reads to a reference genome, and are thus unsuitable for samples with a partial or missing reference genome. Here we present the Trinity method for de novo assembly of full-length transcripts and evaluate it on samples from fission yeast, mouse and whitefly, whose reference genome is not yet available. By efficiently constructing and analyzing sets of de Bruijn graphs, Trinity fully reconstructs a large fraction of transcripts, including alternatively spliced isoforms and transcripts from recently duplicated genes. Compared with other de novo transcriptome assemblers, Trinity recovers more full-length transcripts across a broad range of expression levels, with a sensitivity similar to methods that rely on genome alignments. Our approach provides a unified solution for transcriptome reconstruction in any sample, especially in the absence of a reference genome.
12,649 citations
[...]
TL;DR: The results suggest that Cufflinks can illuminate the substantial regulatory flexibility and complexity in even this well-studied model of muscle development and that it can improve transcriptome-based genome annotation.
Abstract: High-throughput mRNA sequencing (RNA-Seq) promises simultaneous transcript discovery and abundance estimation. However, this would require algorithms that are not restricted by prior gene annotations and that account for alternative transcription and splicing. Here we introduce such algorithms in an open-source software program called Cufflinks. To test Cufflinks, we sequenced and analyzed >430 million paired 75-bp RNA-Seq reads from a mouse myoblast cell line over a differentiation time series. We detected 13,692 known transcripts and 3,724 previously unannotated ones, 62% of which are supported by independent expression data or by homologous genes in other species. Over the time series, 330 genes showed complete switches in the dominant transcription start site (TSS) or splice isoform, and we observed more subtle shifts in 1,304 other genes. These results suggest that Cufflinks can illuminate the substantial regulatory flexibility and complexity in even this well-studied model of muscle development and that it can improve transcriptome-based genome annotation.
11,485 citations
[...]
TL;DR: MaxQuant, an integrated suite of algorithms specifically developed for high-resolution, quantitative MS data, detects peaks, isotope clusters and stable amino acid isotope–labeled (SILAC) peptide pairs as three-dimensional objects in m/z, elution time and signal intensity space and achieves mass accuracy in the p.p.b. range.
Abstract: Efficient analysis of very large amounts of raw data for peptide identification and protein quantification is a principal challenge in mass spectrometry (MS)-based proteomics. Here we describe MaxQuant, an integrated suite of algorithms specifically developed for high-resolution, quantitative MS data. Using correlation analysis and graph theory, MaxQuant detects peaks, isotope clusters and stable amino acid isotope-labeled (SILAC) peptide pairs as three-dimensional objects in m/z, elution time and signal intensity space. By integrating multiple mass measurements and correcting for linear and nonlinear mass offsets, we achieve mass accuracy in the p.p.b. range, a sixfold increase over standard techniques. We increase the proportion of identified fragmentation spectra to 73% for SILAC peptide pairs via unambiguous assignment of isotope and missed-cleavage state and individual mass precision. MaxQuant automatically quantifies several hundred thousand peptides per SILAC-proteome experiment and allows statistically robust identification and quantification of >4,000 proteins in mammalian cell lysates.
10,058 citations
[...]
TL;DR: In this article, the authors present an approach for efficient and intuitive visualization tools able to scale to very large data sets and to flexibly integrate multiple data types, including clinical data.
Abstract: Rapid improvements in sequencing and array-based platforms are resulting in a flood of diverse genome-wide data, including data from exome and whole-genome sequencing, epigenetic surveys, expression profiling of coding and noncoding RNAs, single nucleotide polymorphism (SNP) and copy number profiling, and functional assays. Analysis of these large, diverse data sets holds the promise of a more comprehensive understanding of the genome and its relation to human disease. Experienced and knowledgeable human review is an essential component of this process, complementing computational approaches. This calls for efficient and intuitive visualization tools able to scale to very large data sets and to flexibly integrate multiple data types, including clinical data. However, the sheer volume and scope of data pose a significant challenge to the development of such tools.
8,527 citations
[...]
TL;DR: In this paper, a new vector strategy for the insertion of foreign genes into the genomes of gram negative bacteria not closely related to Escherichia coli was developed, which can utilize any gram negative bacterium as a recipient for conjugative DNA transfer.
Abstract: We have developed a new vector strategy for the insertion of foreign genes into the genomes of gram negative bacteria not closely related to Escherichia coli. The system consists of two components: special E. coli donor strains and derivatives of E. coli vector plasmids. The donor strains (called mobilizing strains) carry the transfer genes of the broad host range IncP–type plasmid RP4 integrated in their chromosomes. They can utilize any gram negative bacterium as a recipient for conjugative DNA transfer. The vector plasmids contain the P–type specific recognition site for mobilization (Mob site) and can be mobilized with high frequency from the donor strains. The mobilizable vectors are derived from the commonly used E. coli vectors pACYC184, pACYC177, and pBR325, and are unable to replicate in strains outside the enteric bacterial group. Therefore, they are widely applicable as transposon carrier replicons for random transposon insertion mutagenesis in any strain into which they can be mobilized but not stably maintained. The vectors are especially useful for site–directed transposon mutagenesis and for site–specific gene transfer in a wide variety of gram negative organisms.
7,028 citations