Home
/
Authors
/
Emiley A. Eloe-Fadrosh

Author

Emiley A. Eloe-Fadrosh

Other affiliations: Joint Genome Institute, University of Maryland, Baltimore, United States Department of Energy

Bio: Emiley A. Eloe-Fadrosh is an academic researcher from Lawrence Berkeley National Laboratory. The author has contributed to research in topics: Metagenomics & Genome. The author has an hindex of 31, co-authored 88 publications receiving 4638 citations. Previous affiliations of Emiley A. Eloe-Fadrosh include Joint Genome Institute & University of Maryland, Baltimore.

Topics: Metagenomics, Genome, Medicine, Microbiome, Biology ...read more

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Minimum information about a single amplified genome (MISAG) and a metagenome-assembled genome (MIMAG) of bacteria and archaea

[...]

Robert M. Bowers¹, Nikos C. Kyrpides¹, Ramunas Stepanauskas², Miranda Harmon-Smith¹, Devin F. R. Doud¹, T. B. K. Reddy¹, Frederik Schulz¹, Jessica K. Jarett¹, Adam R. Rivers³, Adam R. Rivers¹, Emiley A. Eloe-Fadrosh¹, Susannah G. Tringe¹, Susannah G. Tringe⁴, Natalia Ivanova¹, Alex Copeland¹, Alicia Clum¹, Eric D. Becraft², Rex R. Malmstrom¹, Bruce W. Birren⁵, Mircea Podar⁶, Peer Bork, George M. Weinstock, George M. Garrity⁷, Jeremy A. Dodsworth⁸, Shibu Yooseph⁹, Granger G. Sutton⁹, Frank Oliver Gloeckner¹⁰, Jack A. Gilbert¹¹, William C. Nelson¹², Steven J. Hallam¹³, Sean P. Jungbluth¹⁴, Sean P. Jungbluth¹, Thijs J. G. Ettema¹⁵, Scott Tighe¹⁶, Konstantinos T. Konstantinidis¹⁷, Wen Tso Liu¹⁸, Brett J. Baker¹⁹, Thomas Rattei²⁰, Jonathan A. Eisen²¹, Brian P. Hedlund²², Katherine D. McMahon²³, Noah Fierer²⁴, Rob Knight²⁵, Robert D. Finn²⁶, Guy Cochrane²⁶, Ilene Karsch-Mizrachi²⁷, Gene W. Tyson²⁸, Christian Rinke²⁸, Alla Lapidus²⁹, Folker Meyer¹¹, Pelin Yilmaz¹⁰, Donovan H. Parks²⁸, A. M. Eren, Lynn M. Schriml, Jillian F. Banfield³⁰, Philip Hugenholtz²⁸, Tanja Woyke¹⁰ - Show less +53 more•Institutions (30)

Joint Genome Institute¹, Bigelow Laboratory For Ocean Sciences², United States Department of Agriculture³, University of California, Merced⁴, Broad Institute⁵, Oak Ridge National Laboratory⁶, Michigan State University⁷, California State University, San Bernardino⁸, J. Craig Venter Institute⁹, Max Planck Society¹⁰, Argonne National Laboratory¹¹, Pacific Northwest National Laboratory¹², University of British Columbia¹³, University of Southern California¹⁴, Science for Life Laboratory¹⁵, University of Vermont¹⁶, Georgia Institute of Technology¹⁷, University of Illinois at Urbana–Champaign¹⁸, University of Texas at Austin¹⁹, University of Vienna²⁰, University of California, Davis²¹, University of Nevada, Las Vegas²², University of Wisconsin-Madison²³, Cooperative Institute for Research in Environmental Sciences²⁴, University of California, San Diego²⁵, European Bioinformatics Institute²⁶, National Institutes of Health²⁷, University of Queensland²⁸, Saint Petersburg State University²⁹, University of California, Berkeley³⁰

01 Jul 2018-Nature Biotechnology

TL;DR: Two standards developed by the Genomic Standards Consortium (GSC) for reporting bacterial and archaeal genome sequences are presented, including the Minimum Information about a Single Amplified Genome (MISAG) and the Minimum information about a Metagenome-Assembled Genomes (MIMAG), including estimates of genome completeness and contamination.

...read moreread less

Abstract: We present two standards developed by the Genomic Standards Consortium (GSC) for reporting bacterial and archaeal genome sequences. Both are extensions of the Minimum Information about Any (x) Sequence (MIxS). The standards are the Minimum Information about a Single Amplified Genome (MISAG) and the Minimum Information about a Metagenome-Assembled Genome (MIMAG), including, but not limited to, assembly quality, and estimates of genome completeness and contamination. These standards can be used in combination with other GSC checklists, including the Minimum Information about a Genome Sequence (MIGS), Minimum Information about a Metagenomic Sequence (MIMS), and Minimum Information about a Marker Gene Sequence (MIMARKS). Community-wide adoption of MISAG and MIMAG will facilitate more robust comparative genomic analyses of bacterial and archaeal diversity.

...read moreread less

1,171 citations

Journal Article•DOI•

Uncovering Earth’s virome

[...]

David Paez-Espino¹, Emiley A. Eloe-Fadrosh¹, Georgios A. Pavlopoulos¹, Alex D. Thomas¹, Marcel Huntemann¹, Natalia Mikhailova¹, Edward M. Rubin¹, Edward M. Rubin², Natalia Ivanova¹, Nikos C. Kyrpides¹ - Show less +6 more•Institutions (2)

Joint Genome Institute¹, Lawrence Berkeley National Laboratory²

17 Aug 2016-Nature

TL;DR: Analysis of viral distribution across diverse ecosystems revealed strong habitat-type specificity for the vast majority of viruses, but also identified some cosmopolitan groups, and detailed insight into viral habitat distribution and host–virus interactions is provided.

...read moreread less

Abstract: Viruses are the most abundant biological entities on Earth, but challenges in detecting, isolating, and classifying unknown viruses have prevented exhaustive surveys of the global virome. Here we analysed over 5 Tb of metagenomic sequence data from 3,042 geographically diverse samples to assess the global distribution, phylogenetic diversity, and host specificity of viruses. We discovered over 125,000 partial DNA viral genomes, including the largest phage yet identified, and increased the number of known viral genes by 16-fold. Half of the predicted partial viral genomes were clustered into genetically distinct groups, most of which included genes unrelated to those in known viruses. Using CRISPR spacers and transfer RNA matches to link viral groups to microbial host(s), we doubled the number of microbial phyla known to be infected by viruses, and identified viruses that can infect organisms from different phyla. Analysis of viral distribution across diverse ecosystems revealed strong habitat-type specificity for the vast majority of viruses, but also identified some cosmopolitan groups. Our results highlight an extensive global viral diversity and provide detailed insight into viral habitat distribution and host–virus interactions.

...read moreread less

778 citations

Journal Article•DOI•

IMG/M v.5.0: an integrated data management and comparative analysis system for microbial genomes and microbiomes.

[...]

I-Min A. Chen¹, Ken Chu¹, Krishna Palaniappan¹, Manoj Pillay¹, Anna Ratner¹, Jinghua Huang¹, Marcel Huntemann¹, Neha Varghese¹, James R. White, Rekha Seshadri¹, Tatyana Smirnova¹, Edward Kirton¹, Sean P. Jungbluth¹, Tanja Woyke¹, Emiley A. Eloe-Fadrosh¹, Natalia Ivanova¹, Nikos C. Kyrpides¹ - Show less +13 more•Institutions (1)

Joint Genome Institute¹

08 Jan 2019-Nucleic Acids Research

TL;DR: The Integrated Microbial Genomes & Microbiomes system v.5.0 has a new and more powerful genome search feature, new statistical tools, and supports metagenome binning.

...read moreread less

Abstract: The Integrated Microbial Genomes & Microbiomes system v.5.0 (IMG/M: https://img.jgi.doe.gov/m/) contains annotated datasets categorized into: archaea, bacteria, eukarya, plasmids, viruses, genome fragments, metagenomes, cell enrichments, single particle sorts, and metatranscriptomes. Source datasets include those generated by the DOE's Joint Genome Institute (JGI), submitted by external scientists, or collected from public sequence data archives such as NCBI. All submissions are typically processed through the IMG annotation pipeline and then loaded into the IMG data warehouse. IMG's web user interface provides a variety of analytical and visualization tools for comparative analysis of isolate genomes and metagenomes in IMG. IMG/M allows open access to all public genomes in the IMG data warehouse, while its expert review (ER) system (IMG/MER: https://img.jgi.doe.gov/mer/) allows registered users to access their private genomes and to store their private datasets in workspace for sharing and for further analysis. IMG/M data content has grown by 60% since the last report published in the 2017 NAR Database Issue. IMG/M v.5.0 has a new and more powerful genome search feature, new statistical tools, and supports metagenome binning.

...read moreread less

667 citations

Journal Article•DOI•

A genomic catalog of Earth’s microbiomes

[...]

Stephen Nayfach¹, Simon Roux¹, Rekha Seshadri¹, Daniel W. Udwary¹, Neha Varghese¹, Frederik Schulz¹, Dongying Wu¹, David Paez-Espino¹, I-Min Chen¹, Marcel Huntemann¹, Krishna Palaniappan¹, Joshua Ladau¹, Supratim Mukherjee¹, T. B. K. Reddy¹, Torben Nielsen¹, Edward Kirton¹, José P. Faria², Janaka N. Edirisinghe², Christopher S. Henry², Sean P. Jungbluth¹, Dylan Chivian³, Paramvir S. Dehal³, Elisha M. Wood-Charlson³, Adam P. Arkin³, Susannah G. Tringe¹, Axel Visel¹, Tanja Woyke¹, Nigel J Mouncey¹, Natalia Ivanova¹, Nikos C. Kyrpides¹, Emiley A. Eloe-Fadrosh¹ - Show less +27 more•Institutions (3)

Joint Genome Institute¹, Argonne National Laboratory², Lawrence Berkeley National Laboratory³

01 Apr 2021-Nature Biotechnology

TL;DR: The utility of this collection of >10,000 metagenomes collected from diverse habitats covering all of Earth’s continents and oceans is demonstrated for understanding secondary-metabolite biosynthetic potential and for resolving thousands of new host linkages to uncultivated viruses.

...read moreread less

Abstract: The reconstruction of bacterial and archaeal genomes from shotgun metagenomes has enabled insights into the ecology and evolution of environmental and host-associated microbiomes. Here we applied this approach to >10,000 metagenomes collected from diverse habitats covering all of Earth’s continents and oceans, including metagenomes from human and animal hosts, engineered environments, and natural and agricultural soils, to capture extant microbial, metabolic and functional potential. This comprehensive catalog includes 52,515 metagenome-assembled genomes representing 12,556 novel candidate species-level operational taxonomic units spanning 135 phyla. The catalog expands the known phylogenetic diversity of bacteria and archaea by 44% and is broadly available for streamlined comparative analyses, interactive exploration, metabolic modeling and bulk download. We demonstrate the utility of this collection for understanding secondary-metabolite biosynthetic potential and for resolving thousands of new host linkages to uncultivated viruses. This resource underscores the value of genome-centric approaches for revealing genomic properties of uncultivated microorganisms that affect ecosystem processes.

...read moreread less

378 citations

Journal Article•DOI•

CheckV assesses the quality and completeness of metagenome-assembled viral genomes.

[...]

Stephen Nayfach¹, Antonio P. Camargo², Frederik Schulz¹, Emiley A. Eloe-Fadrosh¹, Simon Roux¹, Nikos C. Kyrpides¹ - Show less +2 more•Institutions (2)

Lawrence Berkeley National Laboratory¹, State University of Campinas²

01 May 2021-Nature Biotechnology

TL;DR: CheckV as discussed by the authors is an automated pipeline for identifying closed closed viral genomes, estimating the completeness of genome fragments and removing flanking host regions from integrated proviruses, which significantly improves the accuracy of identification of auxiliary metabolic genes and interpretation of viral-encoded functions.

...read moreread less

Abstract: Millions of new viral sequences have been identified from metagenomes, but the quality and completeness of these sequences vary considerably. Here we present CheckV, an automated pipeline for identifying closed viral genomes, estimating the completeness of genome fragments and removing flanking host regions from integrated proviruses. CheckV estimates completeness by comparing sequences with a large database of complete viral genomes, including 76,262 identified from a systematic search of publicly available metagenomes, metatranscriptomes and metaviromes. After validation on mock datasets and comparison to existing methods, we applied CheckV to large and diverse collections of metagenome-assembled viral sequences, including IMG/VR and the Global Ocean Virome. This revealed 44,652 high-quality viral genomes (that is, >90% complete), although the vast majority of sequences were small fragments, which highlights the challenge of assembling viral genomes from short-read metagenomes. Additionally, we found that removal of host contamination substantially improved the accurate identification of auxiliary metabolic genes and interpretation of viral-encoded functions. The quality of viral genomes assembled from metagenome data is assessed by CheckV.

...read moreread less

368 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23

Collapse

Cited by

PDF

Open Access

More filters

SPAdes, a new genome assembly algorithm and its applications to single-cell sequencing ( 7th Annual SFAF Meeting, 2012)

[...]

Glenn Tesler

01 Jun 2012

TL;DR: SPAdes as mentioned in this paper is a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler and on popular assemblers Velvet and SoapDeNovo (for multicell data).

...read moreread less

Abstract: The lion's share of bacteria in various environments cannot be cloned in the laboratory and thus cannot be sequenced using existing technologies. A major goal of single-cell genomics is to complement gene-centric metagenomic data with whole-genome assemblies of uncultivated organisms. Assembly of single-cell data is challenging because of highly non-uniform read coverage as well as elevated levels of sequencing errors and chimeric reads. We describe SPAdes, a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler (specialized for single-cell data) and on popular assemblers Velvet and SoapDeNovo (for multicell data). SPAdes generates single-cell assemblies, providing information about genomes of uncultivatable bacteria that vastly exceeds what may be obtained via traditional metagenomics studies. SPAdes is available online ( http://bioinf.spbau.ru/spades ). It is distributed as open source software.

...read moreread less

10,124 citations

Journal Article•

Fast Tree: Computing Large Minimum-Evolution Trees with Profiles instead of a Distance Matrix

[...]

Morgan N. Price, Paramvir S. Dehal, Adam P. Arkin

18 Jun 2009-Lawrence Berkeley National Laboratory

TL;DR: FastTree as mentioned in this paper uses sequence profiles of internal nodes in the tree to implement neighbor-joining and uses heuristics to quickly identify candidate joins, then uses nearest-neighbor interchanges to reduce the length of the tree.

...read moreread less

Abstract: Gene families are growing rapidly, but standard methods for inferring phylogenies do not scale to alignments with over 10,000 sequences. We present FastTree, a method for constructing large phylogenies and for estimating their reliability. Instead of storing a distance matrix, FastTree stores sequence profiles of internal nodes in the tree. FastTree uses these profiles to implement neighbor-joining and uses heuristics to quickly identify candidate joins. FastTree then uses nearest-neighbor interchanges to reduce the length of the tree. For an alignment with N sequences, L sites, and a different characters, a distance matrix requires O(N^2) space and O(N^2 L) time, but FastTree requires just O( NLa + N sqrt(N) ) memory and O( N sqrt(N) log(N) L a ) time. To estimate the tree's reliability, FastTree uses local bootstrapping, which gives another 100-fold speedup over a distance matrix. For example, FastTree computed a tree and support values for 158,022 distinct 16S ribosomal RNAs in 17 hours and 2.4 gigabytes of memory. Just computing pairwise Jukes-Cantor distances and storing them, without inferring a tree or bootstrapping, would require 17 hours and 50 gigabytes of memory. In simulations, FastTree was slightly more accurate than neighbor joining, BIONJ, or FastME; on genuine alignments, FastTree's topologies had higher likelihoods. FastTree is available at http://microbesonline.org/fasttree.

...read moreread less

2,436 citations

Journal Article•DOI•

The Treatment-Naive Microbiome in New-Onset Crohn’s Disease

[...]

Dirk Gevers¹, Subra Kugathasan², Lee A. Denson³, Yoshiki Vázquez-Baeza⁴, Will Van Treuren⁴, Boyu Ren⁵, Emma Schwager⁵, Dan Knights⁶, Se Jin Song⁴, Moran Yassour¹, Xochitl C. Morgan⁵, Aleksandar Kostic¹, Chengwei Luo¹, Antonio Gonzalez⁴, Daniel McDonald⁴, Yael Haberman³, Thomas D. Walters⁷, Susan S. Baker⁸, Joel R. Rosh⁹, Michael C. Stephens¹⁰, Melvin B. Heyman¹¹, James Markowitz¹², Robert N. Baldassano¹³, Anne M. Griffiths, Francisco A. Sylvester, David R. Mack¹⁴, Sandra C. Kim¹⁵, Wallace Crandall¹⁵, Jeffrey S. Hyams, Curtis Huttenhower⁵, Curtis Huttenhower¹, Rob Knight⁴, Rob Knight¹⁶, Ramnik J. Xavier⁵, Ramnik J. Xavier¹ - Show less +31 more•Institutions (16)

Broad Institute¹, Emory University², Cincinnati Children's Hospital Medical Center³, University of Colorado Boulder⁴, Harvard University⁵, University of Minnesota⁶, University of Toronto⁷, Women & Children's Hospital of Buffalo⁸, Boston Children's Hospital⁹, Mayo Clinic¹⁰, University of California, San Francisco¹¹, Long Island Jewish Medical Center¹², Children's Hospital of Philadelphia¹³, Children's Hospital of Eastern Ontario¹⁴, Nationwide Children's Hospital¹⁵, Howard Hughes Medical Institute¹⁶

12 Mar 2014-Cell Host & Microbe

TL;DR: Comparing the microbial signatures between the ileum, the rectum, and fecal samples indicates that at this early stage of disease, assessing the rectal mucosal-associated microbiome offers unique potential for convenient and early diagnosis of CD.

...read moreread less

2,410 citations

Journal Article•DOI•

High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries.

[...]

Chirag Jain¹, Luis M. Rodriguez-R¹, Adam M. Phillippy², Konstantinos T. Konstantinidis¹, Srinivas Aluru¹ - Show less +1 more•Institutions (2)

Georgia Institute of Technology¹, National Institutes of Health²

30 Nov 2018-Nature Communications

TL;DR: FastANI is developed, a method to compute ANI using alignment-free approximate sequence mapping, and it is shown 95% ANI is an accurate threshold for demarcating prokaryotic species by analyzing about 90,000 proKaryotic genomes.

...read moreread less

Abstract: A fundamental question in microbiology is whether there is continuum of genetic diversity among genomes, or clear species boundaries prevail instead. Whole-genome similarity metrics such as Average Nucleotide Identity (ANI) help address this question by facilitating high resolution taxonomic analysis of thousands of genomes from diverse phylogenetic lineages. To scale to available genomes and beyond, we present FastANI, a new method to estimate ANI using alignment-free approximate sequence mapping. FastANI is accurate for both finished and draft genomes, and is up to three orders of magnitude faster compared to alignment-based approaches. We leverage FastANI to compute pairwise ANI values among all prokaryotic genomes available in the NCBI database. Our results reveal clear genetic discontinuity, with 99.8% of the total 8 billion genome pairs analyzed conforming to >95% intra-species and <83% inter-species ANI values. This discontinuity is manifested with or without the most frequently sequenced species, and is robust to historic additions in the genome databases. Average Nucleotide Identity (ANI) is a robust and useful measure to gauge genetic relatedness between two genomes. Here, the authors develop FastANI, a method to compute ANI using alignment-free approximate sequence mapping, and show 95% ANI is an accurate threshold for demarcating prokaryotic species by analyzing about 90,000 prokaryotic genomes.

...read moreread less

2,176 citations

Journal Article•DOI•

A standardized bacterial taxonomy based on genome phylogeny substantially revises the tree of life

[...]

Donovan H. Parks¹, Maria Chuvochina¹, David W. Waite¹, Christian Rinke¹, Adam Skarshewski¹, Pierre-Alain Chaumeil¹, Philip Hugenholtz¹ - Show less +3 more•Institutions (1)

University of Queensland¹

27 Aug 2018-Nature Biotechnology

TL;DR: This work used a concatenated protein phylogeny as the basis for a bacterial taxonomy that conservatively removes polyphyletic groups and normalizes taxonomic ranks on the basis of relative evolutionary divergence.

...read moreread less

Abstract: Taxonomy is an organizing principle of biology and is ideally based on evolutionary relationships among organisms. Development of a robust bacterial taxonomy has been hindered by an inability to obtain most bacteria in pure culture and, to a lesser extent, by the historical use of phenotypes to guide classification. Culture-independent sequencing technologies have matured sufficiently that a comprehensive genome-based taxonomy is now possible. We used a concatenated protein phylogeny as the basis for a bacterial taxonomy that conservatively removes polyphyletic groups and normalizes taxonomic ranks on the basis of relative evolutionary divergence. Under this approach, 58% of the 94,759 genomes comprising the Genome Taxonomy Database had changes to their existing taxonomy. This result includes the description of 99 phyla, including six major monophyletic units from the subdivision of the Proteobacteria, and amalgamation of the Candidate Phyla Radiation into a single phylum. Our taxonomy should enable improved classification of uncultured bacteria and provide a sound basis for ecological and evolutionary studies.

...read moreread less

2,098 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse