scispace - formally typeset
Search or ask a question
Author

Michelle Schorn

Bio: Michelle Schorn is an academic researcher from University of California, San Diego. The author has contributed to research in topics: Medicine & Biology. The author has an hindex of 11, co-authored 15 publications receiving 2733 citations. Previous affiliations of Michelle Schorn include Life Technologies & Wageningen University and Research Centre.
Topics: Medicine, Biology, Metabolome, Genome, Metagenomics

Papers
More filters
Journal ArticleDOI
21 Jul 2011-Nature
TL;DR: A DNA sequencing technology in which scalable, low-cost semiconductor manufacturing techniques are used to make an integrated circuit able to directly perform non-optical DNA sequencing of genomes, showing its robustness and scalability by producing ion chips with up to 10 times as many sensors and sequencing a human genome.
Abstract: The seminal importance of DNA sequencing to the life sciences, biotechnology and medicine has driven the search for more scalable and lower-cost solutions. Here we describe a DNA sequencing technology in which scalable, low-cost semiconductor manufacturing techniques are used to make an integrated circuit able to directly perform non-optical DNA sequencing of genomes. Sequence data are obtained by directly sensing the ions produced by template-directed DNA polymerase synthesis using all-natural nucleotides on this massively parallel semiconductor-sensing device or ion chip. The ion chip contains ion-sensitive, field-effect transistor-based sensors in perfect register with 1.2 million wells, which provide confinement and allow parallel, simultaneous detection of independent sequencing reactions. Use of the most widely used technology for constructing integrated circuits, the complementary metal-oxide semiconductor (CMOS) process, allows for low-cost, large-scale production and scaling of the device to higher densities and larger array sizes. We show the performance of the system by sequencing three bacterial genomes, its robustness and scalability by producing ion chips with up to 10 times as many sensors and sequencing a human genome.

2,246 citations

Journal ArticleDOI
TL;DR: In this article, the authors report marine bacteria as producers of polybrominated diphenyl ethers (PBDEs) and establish a genetic and molecular foundation for their production that unifies paradigms for the elaboration of bromophenols and bromopyrroles abundant in marine biota.
Abstract: Polybrominated diphenyl ethers (PBDEs) and polybrominated bipyrroles are natural products that bioaccumulate in the marine food chain. PBDEs have attracted widespread attention because of their persistence in the environment and potential toxicity to humans. However, the natural origins of PBDE biosynthesis are not known. Here we report marine bacteria as producers of PBDEs and establish a genetic and molecular foundation for their production that unifies paradigms for the elaboration of bromophenols and bromopyrroles abundant in marine biota. We provide biochemical evidence of marine brominases revealing decarboxylative-halogenation enzymology previously unknown among halogenating enzymes. Biosynthetic motifs discovered in our study were used to mine sequence databases to discover unrealized marine bacterial producers of organobromine compounds.

228 citations

Journal ArticleDOI
TL;DR: The results establish the genetic and molecular foundation for the production of PBDEs in one of the most abundant natural sources of these molecules, further setting the stage for a metagenomic-based inventory of other PBDE sources in the marine environment.
Abstract: Naturally produced polybrominated diphenyl ethers (PBDEs) pervade the marine environment and structurally resemble toxic man-made brominated flame retardants. PBDEs bioaccumulate in marine animals and are likely transferred to the human food chain. However, the biogenic basis for PBDE production in one of their most prolific sources, marine sponges of the order Dysideidae, remains unidentified. Here, we report the discovery of PBDE biosynthetic gene clusters within sponge-microbiome-associated cyanobacterial endosymbionts through the use of an unbiased metagenome-mining approach. Using expression of PBDE biosynthetic genes in heterologous cyanobacterial hosts, we correlate the structural diversity of naturally produced PBDEs to modifications within PBDE biosynthetic gene clusters in multiple sponge holobionts. Our results establish the genetic and molecular foundation for the production of PBDEs in one of the most abundant natural sources of these molecules, further setting the stage for a metagenomic-based inventory of other PBDE sources in the marine environment.

120 citations

Journal ArticleDOI
TL;DR: By indexing the Pseudomonas specialized metabolome, this work reports the molecular-networking-based discovery of four molecules and their evolutionary relationships: a poaeamide analogue and a molecular subfamily of cyclic lipopeptides, bananamides 1, 2 and 3.
Abstract: Pseudomonads are cosmopolitan microorganisms able to produce a wide array of specialized metabolites These molecules allow Pseudomonas to scavenge nutrients, sense population density and enhance or inhibit growth of competing microorganisms However, these valuable metabolites are typically characterized one-molecule-one-microbe at a time, instead of being inventoried in large numbers To index and map the diversity of molecules detected from these organisms, 260 strains of ecologically diverse origins were subjected to mass-spectrometry-based molecular networking Molecular networking not only enables dereplication of molecules, but also sheds light on their structural relationships Moreover, it accelerates the discovery of new molecules Here, by indexing the Pseudomonas specialized metabolome, we report the molecular-networking-based discovery of four molecules and their evolutionary relationships: a poaeamide analogue and a molecular subfamily of cyclic lipopeptides, bananamides 1, 2 and 3 Analysis of their biosynthetic gene cluster shows that it constitutes a distinct evolutionary branch of the Pseudomonas cyclic lipopeptides Through analysis of an additional 370 extracts of wheat-associated Pseudomonas, we demonstrate how the detailed knowledge from our reference index can be efficiently propagated to annotate complex metabolomic data from other studies, akin to the way in which newly generated genomic information can be compared to data from public databases

111 citations

Journal ArticleDOI
TL;DR: The study of tetrabromopyrrole biosynthesis revealed a uniquely adapted halogenase–thioesterase enzyme pair that catalyzes an unprecedented series of halogenations on a pyrrole, providing a biogenetic basis for the biosynthesis of 1 and setting a firm foundation for querying the biosynthetic potential for the production of 1 in marine (meta)genomes.
Abstract: Halogenated pyrroles (halopyrroles) are common chemical moieties found in bioactive bacterial natural products. The halopyrrole moieties of mono- and dihalopyrrole-containing compounds arise from a conserved mechanism in which a proline-derived pyrrolyl group bound to a carrier protein is first halogenated and then elaborated by peptidic or polyketide extensions. This paradigm is broken during the marine pseudoalteromonad bacterial biosynthesis of the coral larval settlement cue tetrabromopyrrole (1), which arises from the substitution of the proline-derived carboxylate by a bromine atom. To understand the molecular basis for decarboxylative bromination in the biosynthesis of 1, we sequenced two Pseudoalteromonas genomes and identified a conserved four-gene locus encoding the enzymes involved in its complete biosynthesis. Through total in vitro reconstitution of the biosynthesis of 1 using purified enzymes and biochemical interrogation of individual biochemical steps, we show that all four bromine atoms in 1 are installed by the action of a single flavin-dependent halogenase: Bmp2. Tetrabromination of the pyrrole induces a thioesterase-mediated offloading reaction from the carrier protein and activates the biosynthetic intermediate for decarboxylation. Insights into the tetrabrominating activity of Bmp2 were obtained from the high-resolution crystal structure of the halogenase contrasted against structurally homologous halogenase Mpy16 that forms only a dihalogenated pyrrole in marinopyrrole biosynthesis. Structure-guided mutagenesis of the proposed substrate-binding pocket of Bmp2 led to a reduction in the degree of halogenation catalyzed. Our study provides a biogenetic basis for the biosynthesis of 1 and sets a firm foundation for querying the biosynthetic potential for the production of 1 in marine (meta)genomes.

73 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.
Abstract: As the rate of sequencing increases, greater throughput is demanded from read aligners. The full-text minute index is often used to make alignment very fast and memory-efficient, but the approach is ill-suited to finding longer, gapped alignments. Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.

37,898 citations

Journal ArticleDOI
TL;DR: The Spliced Transcripts Alignment to a Reference (STAR) software based on a previously undescribed RNA-seq alignment algorithm that uses sequential maximum mappable seed search in uncompressed suffix arrays followed by seed clustering and stitching procedure outperforms other aligners by a factor of >50 in mapping speed.
Abstract: Motivation Accurate alignment of high-throughput RNA-seq data is a challenging and yet unsolved problem because of the non-contiguous transcript structure, relatively short read lengths and constantly increasing throughput of the sequencing technologies. Currently available RNA-seq aligners suffer from high mapping error rates, low mapping speed, read length limitation and mapping biases. Results To align our large (>80 billon reads) ENCODE Transcriptome RNA-seq dataset, we developed the Spliced Transcripts Alignment to a Reference (STAR) software based on a previously undescribed RNA-seq alignment algorithm that uses sequential maximum mappable seed search in uncompressed suffix arrays followed by seed clustering and stitching procedure. STAR outperforms other aligners by a factor of >50 in mapping speed, aligning to the human genome 550 million 2 × 76 bp paired-end reads per hour on a modest 12-core server, while at the same time improving alignment sensitivity and precision. In addition to unbiased de novo detection of canonical junctions, STAR can discover non-canonical splices and chimeric (fusion) transcripts, and is also capable of mapping full-length RNA sequences. Using Roche 454 sequencing of reverse transcription polymerase chain reaction amplicons, we experimentally validated 1960 novel intergenic splice junctions with an 80-90% success rate, corroborating the high precision of the STAR mapping strategy. Availability and implementation STAR is implemented as a standalone C++ code. STAR is free open source software distributed under GPLv3 license and can be downloaded from http://code.google.com/p/rna-star/.

30,684 citations

01 Jun 2012
TL;DR: SPAdes as mentioned in this paper is a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler and on popular assemblers Velvet and SoapDeNovo (for multicell data).
Abstract: The lion's share of bacteria in various environments cannot be cloned in the laboratory and thus cannot be sequenced using existing technologies. A major goal of single-cell genomics is to complement gene-centric metagenomic data with whole-genome assemblies of uncultivated organisms. Assembly of single-cell data is challenging because of highly non-uniform read coverage as well as elevated levels of sequencing errors and chimeric reads. We describe SPAdes, a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler (specialized for single-cell data) and on popular assemblers Velvet and SoapDeNovo (for multicell data). SPAdes generates single-cell assemblies, providing information about genomes of uncultivatable bacteria that vastly exceeds what may be obtained via traditional metagenomics studies. SPAdes is available online ( http://bioinf.spbau.ru/spades ). It is distributed as open source software.

10,124 citations

Journal ArticleDOI
TL;DR: This protocol provides a workflow for genome-independent transcriptome analysis leveraging the Trinity platform and presents Trinity-supported companion utilities for downstream applications, including RSEM for transcript abundance estimation, R/Bioconductor packages for identifying differentially expressed transcripts across samples and approaches to identify protein-coding genes.
Abstract: De novo assembly of RNA-seq data enables researchers to study transcriptomes without the need for a genome sequence; this approach can be usefully applied, for instance, in research on 'non-model organisms' of ecological and evolutionary importance, cancer samples or the microbiome. In this protocol we describe the use of the Trinity platform for de novo transcriptome assembly from RNA-seq data in non-model organisms. We also present Trinity-supported companion utilities for downstream applications, including RSEM for transcript abundance estimation, R/Bioconductor packages for identifying differentially expressed transcripts across samples and approaches to identify protein-coding genes. In the procedure, we provide a workflow for genome-independent transcriptome analysis leveraging the Trinity platform. The software, documentation and demonstrations are freely available from http://trinityrnaseq.sourceforge.net. The run time of this protocol is highly dependent on the size and complexity of data to be analyzed. The example data set analyzed in the procedure detailed herein can be processed in less than 5 h.

6,369 citations

Journal ArticleDOI
TL;DR: The results of this study may be used as a guideline for selecting primer pairs with the best overall coverage and phylum spectrum for specific applications, therefore reducing the bias in PCR-based microbial diversity studies.
Abstract: 16S ribosomal RNA gene (rDNA) amplicon analysis remains the standard approach for the cultivation-independent investigation of microbial diversity. The accuracy of these analyses depends strongly on the choice of primers. The overall coverage and phylum spectrum of 175 primers and 512 primer pairs were evaluated in silico with respect to the SILVA 16S/18S rDNA non-redundant reference dataset (SSURef 108 NR). Based on this evaluation a selection of 'best available' primer pairs for Bacteria and Archaea for three amplicon size classes (100-400, 400-1000, ≥ 1000 bp) is provided. The most promising bacterial primer pair (S-D-Bact-0341-b-S-17/S-D-Bact-0785-a-A-21), with an amplicon size of 464 bp, was experimentally evaluated by comparing the taxonomic distribution of the 16S rDNA amplicons with 16S rDNA fragments from directly sequenced metagenomes. The results of this study may be used as a guideline for selecting primer pairs with the best overall coverage and phylum spectrum for specific applications, therefore reducing the bias in PCR-based microbial diversity studies.

5,346 citations