Home
/
Authors
/
Robert Schmieder

Author

Robert Schmieder

Other affiliations: The Chinese University of Hong Kong

Bio: Robert Schmieder is an academic researcher from San Diego State University. The author has contributed to research in topics: Metagenomics & Genome. The author has an hindex of 26, co-authored 33 publications receiving 6831 citations. Previous affiliations of Robert Schmieder include The Chinese University of Hong Kong.

Topics: Metagenomics, Genome, Human virome, Perl, Viral metagenomics ...read more

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Quality control and preprocessing of metagenomic datasets

[...]

Robert Schmieder¹, Robert Edwards¹•Institutions (1)

San Diego State University¹

01 Mar 2011-Bioinformatics

TL;DR: PRINSEQ is presented for easy and rapid quality control and data preprocessing of genomic and metagenomic datasets and can be used as a stand alone version or accessed online through a user-friendly web interface.

...read moreread less

Abstract: Summary: Here, we present PRINSEQ for easy and rapid quality control and data preprocessing of genomic and metagenomic datasets. Summary statistics of FASTA (and QUAL) or FASTQ files are generated in tabular and graphical form and sequences can be filtered, reformatted and trimmed by a variety of options to improve downstream analysis. Availability and Implementation: This open-source application was implemented in Perl and can be used as a stand alone version or accessed online through a user-friendly web interface. The source code, user help and additional information are available at http://prinseq.sourceforge.net/. Contact:[email protected]; [email protected]

...read moreread less

4,028 citations

Journal Article•DOI•

Fast identification and removal of sequence contamination from genomic and metagenomic datasets.

[...]

Robert Schmieder¹, Robert Edwards¹, Robert Edwards²•Institutions (2)

San Diego State University¹, Argonne National Laboratory²

09 Mar 2011-PLOS ONE

TL;DR: DeconSeq is a robust framework for the rapid, automated identification and removal of sequence contamination in longer-read datasets (150 bp mean read length) and allows scientists to automatically detect and efficiently remove unwanted sequence contamination from their datasets while eliminating critical limitations of current methods.

...read moreread less

Abstract: High-throughput sequencing technologies have strongly impacted microbiology, providing a rapid and cost-effective way of generating draft genomes and exploring microbial diversity. However, sequences obtained from impure nucleic acid preparations may contain DNA from sources other than the sample. Those sequence contaminations are a serious concern to the quality of the data used for downstream analysis, causing misassembly of sequence contigs and erroneous conclusions. Therefore, the removal of sequence contaminants is a necessary and required step for all sequencing projects. We developed DeconSeq, a robust framework for the rapid, automated identification and removal of sequence contamination in longer-read datasets (150 bp mean read length). DeconSeq is publicly available as standalone and web-based versions. The results can be exported for subsequent analysis, and the databases used for the web-based version are automatically updated on a regular basis. DeconSeq categorizes possible contamination sequences, eliminates redundant hits with higher similarity to non-contaminant genomes, and provides graphical visualizations of the alignment results and classifications. Using DeconSeq, we conducted an analysis of possible human DNA contamination in 202 previously published microbial and viral metagenomes and found possible contamination in 145 (72%) metagenomes with as high as 64% contaminating sequences. This new framework allows scientists to automatically detect and efficiently remove unwanted sequence contamination from their datasets while eliminating critical limitations of current methods. DeconSeq's web interface is simple and user-friendly. The standalone version allows offline analysis and integration into existing data processing pipelines. DeconSeq's results reveal whether the sequencing experiment has succeeded, whether the correct sample was sequenced, and whether the sample contains any sequence contamination from DNA preparation or host. In addition, the analysis of 202 metagenomes demonstrated significant contamination of the non-human associated metagenomes, suggesting that this method is appropriate for screening all metagenomes. DeconSeq is available at http://deconseq.sourceforge.net/.

...read moreread less

670 citations

Journal Article•DOI•

Metagenomic analysis of respiratory tract DNA viral communities in cystic fibrosis and non-cystic fibrosis individuals.

[...]

Dana Willner¹, Mike Furlan¹, Matthew Haynes¹, Robert Schmieder¹, Florent E. Angly¹, Joas L. da Silva¹, Sassan Tammadoni¹, Bahador Nosrat¹, Douglas Conrad², Douglas Conrad³, Forest Rohwer¹ - Show less +7 more•Institutions (3)

San Diego State University¹, Veterans Health Administration², University of California, San Diego³

09 Oct 2009-PLOS ONE

TL;DR: Functional metagenomics showed that all Non-CF virome communities were similar, and that CF viromes were enriched in aromatic amino acid metabolism, indicating that therapeutic measures may be more effective if used to change the respiratory environment, as opposed to shifting the taxonomic composition of resident microbiota.

...read moreread less

Abstract: The human respiratory tract is constantly exposed to a wide variety of viruses, microbes and inorganic particulates from environmental air, water and food. Physical characteristics of inhaled particles and airway mucosal immunity determine which viruses and microbes will persist in the airways. Here we present the first metagenomic study of DNA viral communities in the airways of diseased and non-diseased individuals. We obtained sequences from sputum DNA viral communities in 5 individuals with cystic fibrosis (CF) and 5 individuals without the disease. Overall, diversity of viruses in the airways was low, with an average richness of 175 distinct viral genotypes. The majority of viral diversity was uncharacterized. CF phage communities were highly similar to each other, whereas Non-CF individuals had more distinct phage communities, which may reflect organisms in inhaled air. CF eukaryotic viral communities were dominated by a few viruses, including human herpesviruses and retroviruses. Functional metagenomics showed that all Non-CF viromes were similar, and that CF viromes were enriched in aromatic amino acid metabolism. The CF metagenomes occupied two different metabolic states, probably reflecting different disease states. There was one outlying CF virome which was characterized by an over-representation of Guanosine-5′-triphosphate,3′-diphosphate pyrophosphatase, an enzyme involved in the bacterial stringent response. Unique environments like the CF airway can drive functional adaptations, leading to shifts in metabolic profiles. These results have important clinical implications for CF, indicating that therapeutic measures may be more effective if used to change the respiratory environment, as opposed to shifting the taxonomic composition of resident microbiota.

...read moreread less

393 citations

Journal Article•DOI•

Insights into antibiotic resistance through metagenomic approaches

[...]

Robert Schmieder¹, Robert Edwards¹•Institutions (1)

San Diego State University¹

01 Jan 2012-Future Microbiology

TL;DR: Recent findings and future challenges in the study of antibiotic resistance through metagenomic approaches are discussed, especially for the unculturable majority of environmental bacteria.

...read moreread less

Abstract: The consequences of bacterial infections have been curtailed by the introduction of a wide range of antibiotics. However, infections continue to be a leading cause of mortality, in part due to the evolution and acquisition of antibiotic-resistance genes. Antibiotic misuse and overprescription have created a driving force influencing the selection of resistance. Despite the problem of antibiotic resistance in infectious bacteria, little is known about the diversity, distribution and origins of resistance genes, especially for the unculturable majority of environmental bacteria. Functional and sequence-based metagenomics have been used for the discovery of novel resistance determinants and the improved understanding of antibiotic-resistance mechanisms in clinical and natural environments. This review discusses recent findings and future challenges in the study of antibiotic resistance through metagenomic approaches.

...read moreread less

277 citations

Journal Article•DOI•

TagCleaner: Identification and removal of tag sequences from genomic and metagenomic datasets

[...]

Robert Schmieder¹, Yan Wei Lim¹, Forest Rohwer¹, Robert Edwards¹, Robert Edwards² - Show less +1 more•Institutions (2)

San Diego State University¹, Argonne National Laboratory²

23 Jun 2010-BMC Bioinformatics

TL;DR: TagCleaner is a publicly available web application that is able to automatically detect and efficiently remove tag sequences from metagenomic datasets and is easily configurable and provides a user-friendly interface.

...read moreread less

Abstract: Sequencing metagenomes that were pre-amplified with primer-based methods requires the removal of the additional tag sequences from the datasets. The sequenced reads can contain deletions or insertions due to sequencing limitations, and the primer sequence may contain ambiguous bases. Furthermore, the tag sequence may be unavailable or incorrectly reported. Because of the potential for downstream inaccuracies introduced by unwanted sequence contaminations, it is important to use reliable tools for pre-processing sequence data. TagCleaner is a web application developed to automatically identify and remove known or unknown tag sequences allowing insertions and deletions in the dataset. TagCleaner is designed to filter the trimmed reads for duplicates, short reads, and reads with high rates of ambiguous sequences. An additional screening for and splitting of fragment-to-fragment concatenations that gave rise to artificial concatenated sequences can increase the quality of the dataset. Users may modify the different filter parameters according to their own preferences. TagCleaner is a publicly available web application that is able to automatically detect and efficiently remove tag sequences from metagenomic datasets. It is easily configurable and provides a user-friendly interface. The interactive web interface facilitates export functionality for subsequent data processing, and is available at http://edwards.sdsu.edu/tagcleaner .

...read moreread less

225 citations

1
2
3
4
…
5
6
7

Collapse

Cited by

PDF

Open Access

More filters

Modern Applied Statistics With S

[...]

Christina Gloeckner

01 Jan 2016

TL;DR: The modern applied statistics with s is universally compatible with any devices to read, and is available in the digital library an online access to it is set as public so you can download it instantly.

...read moreread less

Abstract: Thank you very much for downloading modern applied statistics with s. As you may know, people have search hundreds times for their favorite readings like this modern applied statistics with s, but end up in harmful downloads. Rather than reading a good book with a cup of coffee in the afternoon, instead they cope with some harmful virus inside their laptop. modern applied statistics with s is available in our digital library an online access to it is set as public so you can download it instantly. Our digital library saves in multiple countries, allowing you to get the most less latency time to download any of our books like this one. Kindly say, the modern applied statistics with s is universally compatible with any devices to read.

...read moreread less

5,249 citations

Journal Article•DOI•

Quality control and preprocessing of metagenomic datasets

[...]

Robert Schmieder¹, Robert Edwards¹•Institutions (1)

San Diego State University¹

01 Mar 2011-Bioinformatics

...read moreread less

4,028 citations

Journal Article•DOI•

The Treatment-Naive Microbiome in New-Onset Crohn’s Disease

[...]

Dirk Gevers¹, Subra Kugathasan², Lee A. Denson³, Yoshiki Vázquez-Baeza⁴, Will Van Treuren⁴, Boyu Ren⁵, Emma Schwager⁵, Dan Knights⁶, Se Jin Song⁴, Moran Yassour¹, Xochitl C. Morgan⁵, Aleksandar Kostic¹, Chengwei Luo¹, Antonio Gonzalez⁴, Daniel McDonald⁴, Yael Haberman³, Thomas D. Walters⁷, Susan S. Baker⁸, Joel R. Rosh⁹, Michael C. Stephens¹⁰, Melvin B. Heyman¹¹, James Markowitz¹², Robert N. Baldassano¹³, Anne M. Griffiths, Francisco A. Sylvester, David R. Mack¹⁴, Sandra C. Kim¹⁵, Wallace Crandall¹⁵, Jeffrey S. Hyams, Curtis Huttenhower¹, Curtis Huttenhower⁵, Rob Knight⁴, Rob Knight¹⁶, Ramnik J. Xavier⁵, Ramnik J. Xavier¹ - Show less +31 more•Institutions (16)

Broad Institute¹, Emory University², Cincinnati Children's Hospital Medical Center³, University of Colorado Boulder⁴, Harvard University⁵, University of Minnesota⁶, University of Toronto⁷, Women & Children's Hospital of Buffalo⁸, Boston Children's Hospital⁹, Mayo Clinic¹⁰, University of California, San Francisco¹¹, Long Island Jewish Medical Center¹², Children's Hospital of Philadelphia¹³, Children's Hospital of Eastern Ontario¹⁴, Nationwide Children's Hospital¹⁵, Howard Hughes Medical Institute¹⁶

12 Mar 2014-Cell Host & Microbe

TL;DR: Comparing the microbial signatures between the ileum, the rectum, and fecal samples indicates that at this early stage of disease, assessing the rectal mucosal-associated microbiome offers unique potential for convenient and early diagnosis of CD.

...read moreread less

2,410 citations

Journal Article•DOI•

NGS QC Toolkit: a toolkit for quality control of next generation sequencing data.

[...]

Ravi K. Patel, Mukesh K. Jain

01 Feb 2012-PLOS ONE

TL;DR: The toolkit is comprised of user-friendly tools for QC of sequencing data generated using Roche 454 and Illumina platforms, and additional tools to aid QC (sequence format converter and trimming tools) and analysis and analysis (statistics tools).

...read moreread less

Abstract: Next generation sequencing (NGS) technologies provide a high-throughput means to generate large amount of sequence data. However, quality control (QC) of sequence data generated from these technologies is extremely important for meaningful downstream analysis. Further, highly efficient and fast processing tools are required to handle the large volume of datasets. Here, we have developed an application, NGS QC Toolkit, for quality check and filtering of high-quality data. This toolkit is a standalone and open source application freely available at http://www.nipgr.res.in/ngsqctoolkit.html. All the tools in the application have been implemented in Perl programming language. The toolkit is comprised of user-friendly tools for QC of sequencing data generated using Roche 454 and Illumina platforms, and additional tools to aid QC (sequence format converter and trimming tools) and analysis (statistics tools). A variety of options have been provided to facilitate the QC at user-defined parameters. The toolkit is expected to be very useful for the QC of NGS data to facilitate better downstream analysis.

...read moreread less

2,387 citations

Journal Article•DOI•

SortMeRNA: Fast and accurate filtering of ribosomal RNAs in metatranscriptomic data.

[...]

Evguenia Kopylova¹, Laurent Noé¹, Hélène Touzet¹•Institutions (1)

Laboratoire d'Informatique Fondamentale de Lille¹

01 Dec 2012-Bioinformatics

TL;DR: SortMeRNA, a new software designed to rapidly filter rRNA fragments from metatranscriptomic data, is presented, capable of handling large sets of reads and sorting out all fragments matching to the rRNA database with high sensitivity and low running time.

...read moreread less

Abstract: MOTIVATION: The application of Next-Generation Sequencing (NGS) technologies to RNAs directly extracted from a community of organisms yields a mixture of fragments characterizing both coding and non-coding types of RNAs. The tasks to distinguish among these and to further categorize the families of messenger RNAs and ribosomal RNAs is an important step for examining gene expression patterns of an interactive environment and the phylogenetic classification of the constituting species. RESULTS: We present SortMeRNA, a new software designed to rapidly filter ribosomal RNA fragments from metatranscriptomic data. It is capable of handling large sets of reads and sorting out all fragments matching to the rRNA database with high sensitivity and low running time. AVAILABILITY: http://bioinfo.lifl.fr/RNA/sortmerna CONTACT: evguenia.kopylova@lifl.fr SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

...read moreread less

1,868 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse