Author

Robin B Kodner

Bio: Robin B Kodner is an academic researcher at the University of Washington who has contributed to research on tree data structures and phylogenetic trees. The author has an h-index of 5 and has co-authored 5 publications receiving 1695 citations.

Topics: Tree (data structure), Phylogenetic tree, Time complexity, Botany, Biology ...

Papers

Journal Article

pplacer: linear time maximum-likelihood and Bayesian phylogenetic placement of sequences onto a fixed reference tree

Frederick A. Matsen¹, Robin B Kodner², E. Virginia Armbrust²

Fred Hutchinson Cancer Research Center¹, University of Washington²

30 Oct 2010-BMC Bioinformatics

TL;DR: Pplacer is a software package for phylogenetic placement and subsequent visualization; it can place twenty thousand short reads on a reference tree of one thousand taxa per hour per processor and is easy to run in parallel.

Abstract: Likelihood-based phylogenetic inference is generally considered to be the most reliable classification method for unknown sequences. However, traditional likelihood-based phylogenetic methods cannot be applied to large volumes of short reads from next-generation sequencing due to computational complexity issues and lack of phylogenetic signal. "Phylogenetic placement," where a reference tree is fixed and the unknown query sequences are placed onto the tree via a reference alignment, is a way to bring the inferential power offered by likelihood-based approaches to large data sets. This paper introduces pplacer, a software package for phylogenetic placement and subsequent visualization. The algorithm can place twenty thousand short reads on a reference tree of one thousand taxa per hour per processor, has essentially linear time and memory complexity in the number of reference taxa, and is easy to run in parallel. Pplacer features calculation of the posterior probability of a placement on an edge, which is a statistically rigorous way of quantifying uncertainty on an edge-by-edge basis. It also can inform the user of the positional uncertainty for query sequences by calculating expected distance between placement locations, which is crucial in the estimation of uncertainty with a well-sampled reference tree. The software provides visualizations using branch thickness and color to represent number of placements and their uncertainty. A simulation study using reads generated from 631 COG alignments shows a high level of accuracy for phylogenetic placement over a wide range of alignment diversity, and the power of edge uncertainty estimates to measure placement confidence. Pplacer enables efficient phylogenetic placement and subsequent visualization, making likelihood-based phylogenetics methodology practical for large collections of reads; it is freely available as source code, binaries, and a web service.

851 citations
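
The pplacer abstract above mentions two uncertainty measures: the posterior probability of a placement on an edge, and the expected distance between placement locations. The following is a minimal Python sketch of how such quantities could be computed, not pplacer's own implementation. It assumes a per-edge log marginal likelihood is already available for one query sequence, assumes a uniform prior over edges, and takes the distance between two placement locations from a caller-supplied function. The names `edge_posteriors` and `expected_distance` are hypothetical.

```python
import math


def edge_posteriors(log_marginal_likelihoods):
    """Turn per-edge log marginal likelihoods into posterior probabilities.

    Assumption of this sketch: a uniform prior over edges, so the posterior
    is the normalized marginal likelihood. The maximum is subtracted before
    exponentiating to avoid floating-point underflow.
    """
    largest = max(log_marginal_likelihoods.values())
    weights = {
        edge: math.exp(value - largest)
        for edge, value in log_marginal_likelihoods.items()
    }
    total = sum(weights.values())
    return {edge: weight / total for edge, weight in weights.items()}


def expected_distance(posteriors, distance):
    """Probability-weighted sum of pairwise distances between placements.

    `distance(a, b)` is a hypothetical caller-supplied function returning
    the tree distance between the placement locations on edges a and b.
    """
    return sum(
        posteriors[a] * posteriors[b] * distance(a, b)
        for a in posteriors
        for b in posteriors
    )


if __name__ == "__main__":
    # Illustrative values only: two candidate edges for one query read.
    log_likelihoods = {"edge_1": -1000.0, "edge_2": -1000.0 + math.log(3.0)}
    posteriors = edge_posteriors(log_likelihoods)
    print(posteriors)
    print(expected_distance(posteriors, lambda a, b: 0.0 if a == b else 2.0))
```

A query whose probability mass sits on one edge, or on several edges that are close together, gets a small expected distance. Mass spread over distant edges gives a large one, which is the kind of positional uncertainty the abstract describes.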

Posted Content

pplacer: linear time maximum-likelihood and Bayesian phylogenetic placement of sequences onto a fixed reference tree

Frederick A. Matsen¹, Robin B Kodner², E. Virginia Armbrust²

Fred Hutchinson Cancer Research Center¹, University of Washington²

30 Mar 2010-arXiv: Populations and Evolution

TL;DR: Pplacer enables efficient phylogenetic placement and subsequent visualization, making likelihood-based phylogenetics methodology practical for large collections of reads; it is freely available as source code, binaries, and a web service.

Abstract: Likelihood-based phylogenetic inference is generally considered to be the most reliable classification method for unknown sequences. However, traditional likelihood-based phylogenetic methods cannot be applied to large volumes of short reads from next-generation sequencing due to computational complexity issues and lack of phylogenetic signal. "Phylogenetic placement," where a reference tree is fixed and the unknown query sequences are placed onto the tree via a reference alignment, is a way to bring the inferential power of likelihood-based approaches to large data sets. This paper introduces pplacer, a software package for phylogenetic placement and subsequent visualization. The algorithm can place twenty thousand short reads on a reference tree of one thousand taxa per hour per processor, has essentially linear time and memory complexity in the number of reference taxa, and is easy to run in parallel. Pplacer features calculation of the posterior probability of a placement on an edge, which is a statistically rigorous way of quantifying uncertainty on an edge-by-edge basis. It also can inform the user of the positional uncertainty for query sequences by calculating expected distance between placement locations, which is crucial in the estimation of uncertainty with a well-sampled reference tree. The software provides visualizations using branch thickness and color to represent number of placements and their uncertainty. A simulation study using reads generated from 631 COG alignments shows a high level of accuracy for phylogenetic placement over a wide range of alignment diversity, and the power of edge uncertainty estimates to measure placement confidence. Pplacer enables efficient phylogenetic placement and subsequent visualization, making likelihood-based phylogenetics methodology practical for large collections of reads; it is available as source code, binaries, and a web service.

746 citations

Journal Article

Comparative metatranscriptomics identifies molecular bases for the physiological responses of phytoplankton to varying iron availability

Adrian Marchetti¹, David M. Schruth¹, Colleen A. Durkin¹, Micaela S. Parker¹, Robin B Kodner¹, Chris T. Berthiaume¹, Rhonda Morales¹, Andrew E. Allen², E. Virginia Armbrust¹

University of Washington¹, J. Craig Venter Institute²

07 Feb 2012-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: Oceanic diatoms appear to display a distinctive transcriptional response to iron enrichment that allows chemical reduction of available nitrogen and carbon sources along with a continued dependence on iron-free photosynthetic proteins rather than substituting for iron-containing functional equivalents present within their gene repertoire.

Abstract: In vast expanses of the oceans, growth of large phytoplankton such as diatoms is limited by iron availability. Diatoms respond almost immediately to the delivery of iron and rapidly compose the majority of phytoplankton biomass. The molecular bases underlying the subsistence of diatoms in iron-poor waters and the plankton community dynamics that follow iron resupply remain largely unknown. Here we use comparative metatranscriptomics to identify changes in gene expression associated with iron-stimulated growth of diatoms and other eukaryotic plankton. A microcosm iron-enrichment experiment using mixed-layer waters from the northeastern Pacific Ocean resulted in increased proportions of diatom transcripts and reduced proportions of transcripts from most other taxa within 98 h after iron addition. Hundreds of diatom genes were differentially expressed in the iron-enriched community compared with the iron-limited community; transcripts of diatom genes required for synthesis of photosynthesis and chlorophyll components, nitrate assimilation and the urea cycle, and synthesis of carbohydrate storage compounds were significantly overrepresented. Transcripts of genes encoding rhodopsins in eukaryotic phytoplankton were significantly underrepresented following iron enrichment, suggesting rhodopsins help cells cope with low-iron conditions. Oceanic diatoms appear to display a distinctive transcriptional response to iron enrichment that allows chemical reduction of available nitrogen and carbon sources along with a continued dependence on iron-free photosynthetic proteins rather than substituting for iron-containing functional equivalents present within their gene repertoire. This ability of diatoms to divert their newly acquired iron toward nitrate assimilation may underlie why diatoms consistently dominate iron enrichments in high-nitrate, low-chlorophyll regions.

277 citations

Journal Article

Phylogenetic investigation of the aliphatic, non-hydrolyzable biopolymer algaenan, with a focus on green algae

Robin B Kodner¹, Roger E. Summons², Andrew H. Knoll³

University of Washington¹, Massachusetts Institute of Technology², Harvard University³

01 Aug 2009-Organic Geochemistry

TL;DR: The results suggest that the biopolymer is not widespread ecologically or phylogenetically, is not found abundantly in marine organisms and likely represents a functional description of molecular class, rather than a biomarker for green algae.

77 citations

Journal Article

Identification of G protein-coupled receptor signaling pathway proteins in marine diatoms using comparative genomics

Jesse A. Port¹, Micaela S. Parker¹, Robin B Kodner¹, James C. Wallace¹, E. Virginia Armbrust¹, Elaine M. Faustman¹

University of Washington¹

24 Jul 2013-BMC Genomics

TL;DR: The presence of sequences in all four diatoms that code for the proteins required for a functional mammalian GPCR pathway highlights the highly conserved nature of this pathway and suggests a complex signaling machinery related to environmental perception and response in these unicellular organisms.

Abstract: Background: The G protein-coupled receptor (GPCR) signaling pathway plays an essential role in signal transmission and response to external stimuli in mammalian cells. Protein components of this pathway have been characterized in plants and simpler eukaryotes such as yeast, but their presence and role in unicellular photosynthetic eukaryotes have not been determined. We use a comparative genomics approach using whole genome sequences and gene expression libraries of four diatoms (Pseudo-nitzschia multiseries, Thalassiosira pseudonana, Phaeodactylum tricornutum and Fragilariopsis cylindrus) to search for evidence of GPCR signaling pathway proteins that share sequence conservation to known GPCR pathway proteins. Results: The majority of the core components of GPCR signaling were well conserved in all four diatoms, with protein sequence similarity to GPCRs, human G protein α- and β-subunits and downstream effectors. There was evidence for the Gγ-subunit and thus a full heterotrimeric G protein only in T. pseudonana. Phylogenetic analysis of putative diatom GPCRs indicated similarity but deep divergence to the class C GPCRs, with branches basal to the GABAB receptor subfamily. The extracellular and intracellular regions of these putative diatom GPCR sequences exhibited large variation in sequence length, and seven of these sequences contained the necessary ligand binding domain for class C GPCR activation. Transcriptional data indicated that a number of the putative GPCR sequences are expressed in diatoms under various stress conditions in culture, and that many of the GPCR-activated signaling proteins, including the G protein, are also expressed. Conclusions: The presence of sequences in all four diatoms that code for the proteins required for a functional mammalian GPCR pathway highlights the highly conserved nature of this pathway and suggests a complex signaling machinery related to environmental perception and response in these unicellular organisms. The lack of evidence for some GPCR pathway proteins in one or more of the diatoms, such as the Gγ-subunit, may be due to differences in genome completeness and genome coverage for the four diatoms. The high divergence of putative diatom GPCR sequences to known class C GPCRs suggests these sequences may represent another, potentially ancestral, subfamily of class C GPCRs.

25 citations

Cited by

Journal Article

MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability

Kazutaka Katoh¹, Daron M. Standley¹

Osaka University¹

01 Apr 2013-Molecular Biology and Evolution

TL;DR: This version of MAFFT has several new features, including options for adding unaligned sequences into an existing alignment, adjustment of direction in nucleotide alignment, constrained alignment and parallel processing, which were implemented after the previous major update.

Abstract: We report a major update of the MAFFT multiple sequence alignment program. This version has several new features, including options for adding unaligned sequences into an existing alignment, adjustment of direction in nucleotide alignment, constrained alignment and parallel processing, which were implemented after the previous major update. This report shows actual examples to explain how these features work, alone and in combination. Some examples incorrectly aligned by MAFFT are also shown to clarify its limitations. We discuss how to avoid misalignments, and our ongoing efforts to overcome such limitations.

27,771 citations

Journal Article

CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes

Donovan H. Parks¹, Michael Imelfort¹, Connor T. Skennerton¹, Philip Hugenholtz¹, Gene W. Tyson¹

University of Queensland¹

01 Jul 2015-Genome Research

TL;DR: An objective measure of genome quality is proposed that can be used to select genomes suitable for specific gene- and genome-centric analyses of microbial communities and is shown to provide accurate estimates of genome completeness and contamination and to outperform existing approaches.

Abstract: Large-scale recovery of genomes from isolates, single cells, and metagenomic data has been made possible by advances in computational methods and substantial reductions in sequencing costs. Although this increasing breadth of draft genomes is providing key information regarding the evolutionary and functional diversity of microbial life, it has become impractical to finish all available reference genomes. Making robust biological inferences from draft genomes requires accurate estimates of their completeness and contamination. Current methods for assessing genome quality are ad hoc and generally make use of a limited number of “marker” genes conserved across all bacterial or archaeal genomes. Here we introduce CheckM, an automated method for assessing the quality of a genome using a broader set of marker genes specific to the position of a genome within a reference genome tree and information about the collocation of these genes. We demonstrate the effectiveness of CheckM using synthetic data and a wide range of isolate-, single-cell-, and metagenome-derived genomes. CheckM is shown to provide accurate estimates of genome completeness and contamination and to outperform existing approaches. Using CheckM, we identify a diverse range of errors currently impacting publicly available isolate genomes and demonstrate that genomes obtained from single cells and metagenomic data vary substantially in quality. In order to facilitate the use of draft genomes, we propose an objective measure of genome quality that can be used to select genomes suitable for specific gene- and genome-centric analyses of microbial communities.

5,788 citations

Journal Article

Interactive Tree Of Life (iTOL) v4: recent updates and new developments.

Ivica Letunic, Peer Bork¹

European Bioinformatics Institute¹

02 Jul 2019-Nucleic Acids Research

TL;DR: The current version of iTOL v4 introduces four new dataset types, together with numerous new features, and is the first tool which supports direct visualization of Qiime 2 trees and associated annotations.

Abstract: The Interactive Tree Of Life (https://itol.embl.de) is an online tool for the display, manipulation and annotation of phylogenetic and other trees. It is freely available and open to everyone. The current version introduces four new dataset types, together with numerous new features. Annotation options have been expanded and new control options added for many display elements. An interactive spreadsheet-like editor has been implemented, providing dataset creation and editing directly in the web interface. Font support has been rewritten with full support for UTF-8 character encoding throughout the user interface. Google Web Fonts are now fully supported in the tree text labels. iTOL v4 is the first tool which supports direct visualization of Qiime 2 trees and associated annotations. The user account system has been streamlined and expanded with new navigation options, and currently handles >700 000 trees from more than 40 000 individual users. Full batch access has been implemented allowing programmatic upload and export of trees and annotations.

4,233 citations

Journal Article

Interactive tree of life (iTOL) v3: an online tool for the display and annotation of phylogenetic and other trees

Ivica Letunic, Peer Bork¹

University of Würzburg¹

08 Jul 2016-Nucleic Acids Research

TL;DR: iTOL 3 is the first tool which supports direct visualization of the recently proposed phylogenetic placements format, and its account system has been redesigned to simplify the management of trees in user-defined workspaces and projects.

Abstract: Interactive Tree Of Life (http://itol.embl.de) is a web-based tool for the display, manipulation and annotation of phylogenetic trees. It is freely available and open to everyone. The current version was completely redesigned and rewritten, utilizing current web technologies for speedy and streamlined processing. Numerous new features were introduced and several new data types are now supported. Trees with up to 100,000 leaves can now be efficiently displayed. Full interactive control over precise positioning of various annotation features and an unlimited number of datasets allow the easy creation of complex tree visualizations. iTOL 3 is the first tool which supports direct visualization of the recently proposed phylogenetic placements format. Finally, iTOL's account system has been redesigned to simplify the management of trees in user-defined workspaces and projects, as it is heavily used and currently handles already more than 500,000 trees from more than 10,000 individual users.

4,190 citations

Journal Article

Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation.

Ivica Letunic, Peer Bork¹,²,³

European Bioinformatics Institute¹, University of Würzburg², Yonsei University³

02 Jul 2021-Nucleic Acids Research

TL;DR: The Interactive Tree Of Life (iTOL) is an online tool for the display, manipulation and annotation of phylogenetic and other trees; version 5 adds direct manual annotation, allowing users to draw shapes, labels and other features directly onto the trees.

Abstract: The Interactive Tree Of Life (https://itol.embl.de) is an online tool for the display, manipulation and annotation of phylogenetic and other trees. It is freely available and open to everyone. iTOL version 5 introduces a completely new tree display engine, together with numerous new features. For example, a new dataset type has been added (MEME motifs), while annotation options have been expanded for several existing ones. Node metadata display options have been extended and now also support non-numerical categorical values, as well as multiple values per node. Direct manual annotation is now available, providing a set of basic drawing and labeling tools, allowing users to draw shapes, labels and other features by hand directly onto the trees. Support for tree and dataset scales has been extended, providing fine control over line and label styles. Unrooted tree displays can now use the equal-daylight algorithm, providing much greater display clarity. The user account system has been streamlined and expanded with new navigation options and currently handles >1 million trees from >70 000 individual users.

2,856 citations
