Author

Michael E. Smoot

Other affiliations: University of Virginia, University of California, Berkeley, Tel Aviv University

Bio: Michael E. Smoot is an academic researcher at the University of California, San Diego. The author has contributed to research on biological networks and visualization. The author has an h-index of 13 and has co-authored 18 publications that have received 12,131 citations. Previous affiliations of Michael E. Smoot include the University of Virginia and the University of California, Berkeley.

Papers

Journal Article•DOI•

Versatile and open software for comparing large genomes

Stefan Kurtz¹, Adam M. Phillippy, Arthur L. Delcher, Michael E. Smoot², Martin Shumway, Corina Antonescu, Steven L. Salzberg

University of Hamburg¹, University of Virginia²

30 Jan 2004-Genome Biology

TL;DR: The newest version of MUMmer easily handles comparisons of large eukaryotic genomes at varying evolutionary distances, as demonstrated by applications to multiple genomes.

Abstract: The newest version of MUMmer easily handles comparisons of large eukaryotic genomes at varying evolutionary distances, as demonstrated by applications to multiple genomes. Two new graphical viewing tools provide alternative ways to analyze genome alignments. The new system is the first version of MUMmer to be released as open-source software. This allows other developers to contribute to the code base and freely redistribute the code. The MUMmer sources are available at http://www.tigr.org/software/mummer.

4,886 citations

Journal Article•DOI•

Cytoscape 2.8

Michael E. Smoot¹, Keiichiro Ono¹, Johannes Ruscheinski¹, Peng-Liang Wang¹, Trey Ideker¹

University of California, San Diego¹

01 Feb 2011-Bioinformatics

TL;DR: Version 2.8 introduces two powerful new features—Custom Node Graphics and Attribute Equations—which can be used jointly to greatly enhance Cytoscape's data integration and visualization capabilities.

Abstract: Summary: Cytoscape is a popular bioinformatics package for biological network visualization and data integration. Version 2.8 introduces two powerful new features—Custom Node Graphics and Attribute Equations—which can be used jointly to greatly enhance Cytoscape's data integration and visualization capabilities. Custom Node Graphics allow an image to be projected onto a node, including images generated dynamically or at remote locations. Attribute Equations provide Cytoscape with spreadsheet-like functionality in which the value of an attribute is computed dynamically as a function of other attributes and network properties. Availability and implementation: Cytoscape is a desktop Java application released under the Library Gnu Public License (LGPL). Binary install bundles and source code for Cytoscape 2.8 are available for download from http://cytoscape.org. Contact: [email protected]

4,186 citations

Journal Article•DOI•

Integration of biological networks and gene expression data using Cytoscape

Melissa S. Cline¹, Michael E. Smoot, Ethan Cerami², Allan Kuchinsky³, Nerius Landys, Christopher T. Workman⁴, Rowan H. Christmas⁵, Iliana Avila-Campilo⁵,⁶, Michael L. Creech, Benjamin Gross², Kristina Hanspers, Ruth Isserlin⁷, Ryan Kelley, Sarah Killcoyne⁵, Samad Lotia, Steven Maere⁸, John H. Morris⁹, Keiichiro Ono, Vuk Pavlovic⁷, Alexander R. Pico, Aditya Vailaya³, Peng-Liang Wang, Annette M. Adler³, Bruce R. Conklin, Leroy Hood⁵, Martin Kuiper⁸, Chris Sander², Ilya Schmulevich⁵, Benno Schwikowski¹, Guy J. Warner, Trey Ideker, Gary D. Bader⁷

Pasteur Institute¹, Memorial Sloan Kettering Cancer Center², Agilent Technologies³, Technical University of Denmark⁴, Institute for Systems Biology⁵, Merck & Co.⁶, University of Toronto⁷, Ghent University⁸, University of California, San Francisco⁹

01 Jan 2007-Nature Protocols

TL;DR: This protocol explains how to use Cytoscape to analyze the results of mRNA expression profiling, and other functional genomics and proteomics experiments, in the context of an interaction network obtained for genes of interest.

Abstract: Cytoscape is a free software package for visualizing, modeling and analyzing molecular and genetic interaction networks. This protocol explains how to use Cytoscape to analyze the results of mRNA expression profiling, and other functional genomics and proteomics experiments, in the context of an interaction network obtained for genes of interest. Five major steps are described: (i) obtaining a gene or protein network, (ii) displaying the network using layout algorithms, (iii) integrating with gene expression and other functional attributes, (iv) identifying putative complexes and functional modules and (v) identifying enriched Gene Ontology annotations in the network. These steps provide a broad sample of the types of analyses performed by Cytoscape.

2,313 citations

Journal Article•DOI•

A travel guide to Cytoscape plugins

Rintaro Saito¹, Michael E. Smoot¹, Keiichiro Ono¹, Johannes Ruscheinski¹, Peng-Liang Wang¹, Samad Lotia², Alexander R. Pico², Gary D. Bader³, Trey Ideker¹

University of California, San Diego¹, Gladstone Institutes², University of Toronto³

01 Nov 2012-Nature Methods

TL;DR: A travel guide to the world of plugins, covering the 152 publicly available plugins for Cytoscape 2.5–2.8 and ongoing efforts to distribute, organize and maintain the quality of the collection.

Abstract: Cytoscape is open-source software for integration, visualization and analysis of biological networks. It can be extended through Cytoscape plugins, enabling a broad community of scientists to contribute useful features. This growth has occurred organically through the independent efforts of diverse authors, yielding a powerful but heterogeneous set of tools. We present a travel guide to the world of plugins, covering the 152 publicly available plugins for Cytoscape 2.5-2.8. We also describe ongoing efforts to distribute, organize and maintain the quality of the collection.

1,250 citations

Journal Article•DOI•

Fast Statistical Alignment

Robert K. Bradley¹, Adam Roberts¹, Michael E. Smoot², Sudeep Juvekar¹, Jaeyoung Do³, Colin N. Dewey³, Ian Holmes¹, Lior Pachter¹

University of California, Berkeley¹, University of California, San Diego², University of Wisconsin-Madison³

29 May 2009-PLOS Computational Biology

TL;DR: The Fast Statistical Alignment program is based on pair hidden Markov models which approximate an insertion/deletion process on a tree and uses a sequence annealing algorithm to combine the posterior probabilities estimated from these models into a multiple alignment.

Abstract: We describe a new program for the alignment of multiple biological sequences that is both statistically motivated and fast enough for problem sizes that arise in practice. Our Fast Statistical Alignment program is based on pair hidden Markov models which approximate an insertion/deletion process on a tree and uses a sequence annealing algorithm to combine the posterior probabilities estimated from these models into a multiple alignment. FSA uses its explicit statistical model to produce multiple alignments which are accompanied by estimates of the alignment accuracy and uncertainty for every column and character of the alignment—previously available only with alignment programs which use computationally-expensive Markov Chain Monte Carlo approaches—yet can align thousands of long sequences. Moreover, FSA utilizes an unsupervised query-specific learning procedure for parameter estimation which leads to improved accuracy on benchmark reference alignments in comparison to existing programs. The centroid alignment approach taken by FSA, in combination with its learning procedure, drastically reduces the amount of false-positive alignment on biological data in comparison to that given by other methods. The FSA program and a companion visualization tool for exploring uncertainty in alignments can be used via a web interface at http://orangutan.math.berkeley.edu/fsa/, and the source code is available at http://fsa.sourceforge.net/.

345 citations

Cited by

Journal Article•DOI•

STAR: ultrafast universal RNA-seq aligner

Alexander Dobin¹, Carrie A. Davis¹, Felix Schlesinger¹, Jorg Drenkow¹, Chris Zaleski¹, Sonali Jha¹, Philippe Batut¹, Mark Chaisson¹, Thomas R. Gingeras¹

Cold Spring Harbor Laboratory¹

01 Jan 2013-Bioinformatics

TL;DR: The Spliced Transcripts Alignment to a Reference (STAR) software based on a previously undescribed RNA-seq alignment algorithm that uses sequential maximum mappable seed search in uncompressed suffix arrays followed by seed clustering and stitching procedure outperforms other aligners by a factor of >50 in mapping speed.

Abstract: Motivation: Accurate alignment of high-throughput RNA-seq data is a challenging and yet unsolved problem because of the non-contiguous transcript structure, relatively short read lengths and constantly increasing throughput of the sequencing technologies. Currently available RNA-seq aligners suffer from high mapping error rates, low mapping speed, read length limitation and mapping biases. Results: To align our large (>80 billion reads) ENCODE Transcriptome RNA-seq dataset, we developed the Spliced Transcripts Alignment to a Reference (STAR) software based on a previously undescribed RNA-seq alignment algorithm that uses sequential maximum mappable seed search in uncompressed suffix arrays followed by seed clustering and stitching procedure. STAR outperforms other aligners by a factor of >50 in mapping speed, aligning to the human genome 550 million 2 × 76 bp paired-end reads per hour on a modest 12-core server, while at the same time improving alignment sensitivity and precision. In addition to unbiased de novo detection of canonical junctions, STAR can discover non-canonical splices and chimeric (fusion) transcripts, and is also capable of mapping full-length RNA sequences. Using Roche 454 sequencing of reverse transcription polymerase chain reaction amplicons, we experimentally validated 1960 novel intergenic splice junctions with an 80-90% success rate, corroborating the high precision of the STAR mapping strategy. Availability and implementation: STAR is implemented as a standalone C++ code. STAR is free open source software distributed under GPLv3 license and can be downloaded from http://code.google.com/p/rna-star/.

30,684 citations

Journal Article•DOI•

Geneious Basic

Matthew Kearse, Richard Moir¹, Amy Wilson¹, Steven Stones-Havas¹, Matthew Cheung¹, Shane Sturrock¹, Simon Buxton¹, Alex Cooper¹, Sidney Markowitz¹, Chris Duran¹, Tobias Thierer¹, Bruce Ashton¹, Peter Meintjes¹, Alexei J. Drummond¹

University of Queensland¹

01 Jun 2012-Bioinformatics

TL;DR: Geneious Basic has been designed to be an easy-to-use and flexible desktop software application framework for the organization and analysis of biological data, with a focus on molecular sequences and related data types.

Abstract: Summary: The two main functions of bioinformatics are the organization and analysis of biological data using computational resources. Geneious Basic has been designed to be an easy-to-use and flexible desktop software application framework for the organization and analysis of biological data, with a focus on molecular sequences and related data types. It integrates numerous industry-standard discovery analysis tools, with interactive visualizations to generate publication-ready images. One key contribution to researchers in the life sciences is the Geneious public application programming interface (API) that affords the ability to leverage the existing framework of the Geneious Basic software platform for virtually unlimited extension and customization. The result is an increase in the speed and quality of development of computation tools for the life sciences, due to the functionality and graphical user interface available to the developer through the public API. Geneious Basic represents an ideal platform for the bioinformatics community to leverage existing components and to integrate their own specific requirements for the discovery, analysis and visualization of biological data. Availability and implementation: Binaries and public API freely available for download at http://www.geneious.com/basic, implemented in Java and supported on Linux, Apple OSX and MS Windows. The software is also available from the Bio-Linux package repository at http://nebc.nerc.ac.uk/news/geneiousonbl. Contact: peter@biomatters.com

15,089 citations

Journal Article•DOI•

Fast, scalable generation of high‐quality protein multiple sequence alignments using Clustal Omega

Fabian Sievers¹, Andreas Wilm², David Dineen¹, Toby J. Gibson, Kevin Karplus³, Weizhong Li⁴, Rodrigo Lopez⁴, Hamish McWilliam⁴, Michael Remmert⁵, Johannes Söding⁵, Julie D. Thompson⁶, Desmond G. Higgins¹

University College Dublin¹, Genome Institute of Singapore², University of California, Santa Cruz³, European Bioinformatics Institute⁴, Ludwig Maximilian University of Munich⁵, University of Strasbourg⁶

01 Jan 2011-Molecular Systems Biology

TL;DR: A new program called Clustal Omega is described, which can align virtually any number of protein sequences quickly and that delivers accurate alignments, and which outperforms other packages in terms of execution time and quality.

Abstract: Multiple sequence alignments are fundamental to many sequence analysis methods. Most alignments are computed using the progressive alignment heuristic. These methods are starting to become a bottleneck in some analysis pipelines when faced with data sets of the size of many thousands of sequences. Some methods allow computation of larger data sets while sacrificing quality, and others produce high-quality alignments, but scale badly with the number of sequences. In this paper, we describe a new program called Clustal Omega, which can align virtually any number of protein sequences quickly and that delivers accurate alignments. The accuracy of the package on smaller test cases is similar to that of the high-quality aligners. On larger data sets, Clustal Omega outperforms other packages in terms of execution time and quality. Clustal Omega also has powerful features for adding sequences to and exploiting information in existing alignments, making use of the vast amount of precomputed information in public databases like Pfam.

12,489 citations

Journal Article•DOI•

An obesity-associated gut microbiome with increased capacity for energy harvest

Peter J. Turnbaugh¹, Ruth E. Ley, Michael A. Mahowald, Vincent Magrini¹, Elaine R. Mardis¹, Jeffrey I. Gordon

Washington University in St. Louis¹

21 Dec 2006-Nature

TL;DR: It is demonstrated through metagenomic and biochemical analyses that changes in the relative abundance of the Bacteroidetes and Firmicutes affect the metabolic potential of the mouse gut microbiota and indicates that the obese microbiome has an increased capacity to harvest energy from the diet.

Abstract: The worldwide obesity epidemic is stimulating efforts to identify host and environmental factors that affect energy balance. Comparisons of the distal gut microbiota of genetically obese mice and their lean littermates, as well as those of obese and lean human volunteers have revealed that obesity is associated with changes in the relative abundance of the two dominant bacterial divisions, the Bacteroidetes and the Firmicutes. Here we demonstrate through metagenomic and biochemical analyses that these changes affect the metabolic potential of the mouse gut microbiota. Our results indicate that the obese microbiome has an increased capacity to harvest energy from the diet. Furthermore, this trait is transmissible: colonization of germ-free mice with an 'obese microbiota' results in a significantly greater increase in total body fat than colonization with a 'lean microbiota'. These results identify the gut microbiota as an additional contributing factor to the pathophysiology of obesity.

10,126 citations

Journal Article•DOI•

FastTree 2 – approximately maximum-likelihood trees for large alignments

Morgan N. Price¹, Paramvir S. Dehal¹, Adam P. Arkin¹,²

Lawrence Berkeley National Laboratory¹, University of California, Berkeley²

10 Mar 2010-PLOS ONE

TL;DR: Improvements to FastTree are described that improve its accuracy without sacrificing scalability, and FastTree 2 allows the inference of maximum-likelihood phylogenies for huge alignments.

Abstract: Background: We recently described FastTree, a tool for inferring phylogenies for alignments with up to hundreds of thousands of sequences. Here, we describe improvements to FastTree that improve its accuracy without sacrificing scalability.

10,010 citations
