Author

Philippe Lemey

Other affiliations: University of Oxford, University of Southampton, Catholic University of Leuven ...

Bio: Philippe Lemey is an academic researcher at Katholieke Universiteit Leuven. The author has contributed to research on topics including population and phylogenetic trees. The author has an h-index of 77 and has co-authored 357 publications that have received 26,102 citations. Previous affiliations of Philippe Lemey include the University of Oxford and the University of Southampton.

Topics: Population, Phylogenetic tree, Medicine, Biological dispersal, Genome ...

[Chart: papers published on a yearly basis; years shown: 2001 to 2023, plus a 1948 entry]

Papers

Journal Article

Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10

Marc A. Suchard¹, Philippe Lemey², Guy Baele², Daniel L. Ayres³, Alexei J. Drummond⁴, Andrew Rambaut⁵

University of California, Los Angeles¹, Katholieke Universiteit Leuven², University of Maryland, College Park³, University of Auckland⁴, University of Edinburgh⁵

01 Jan 2018-Virus Evolution

TL;DR: The BEAST software package unifies molecular phylogenetic reconstruction with complex discrete and continuous trait evolution, divergence-time dating, and coalescent demographic models in an efficient statistical inference engine using Markov chain Monte Carlo integration.

Abstract: The Bayesian Evolutionary Analysis by Sampling Trees (BEAST) software package has become a primary tool for Bayesian phylogenetic and phylodynamic inference from genetic sequence data. BEAST unifies molecular phylogenetic reconstruction with complex discrete and continuous trait evolution, divergence-time dating, and coalescent demographic models in an efficient statistical inference engine using Markov chain Monte Carlo integration. A convenient, cross-platform, graphical user interface allows the flexible construction of complex evolutionary analyses.

2,184 citations

Journal Article

RDP3: a flexible and fast computer program for analyzing recombination

Darren P. Martin¹, Philippe Lemey¹, Martin Lott¹, Vincent Moulton¹, David Posada¹, Pierre Lefeuvre¹

University of East Anglia¹

01 Oct 2010-Bioinformatics

TL;DR: RDP3 is a new version of the RDP program for characterizing recombination events in DNA-sequence alignments that includes four new recombination analysis methods, new tests for recombination hot-spots, and a range of matrix methods for visualizing over-all patterns of recombination within datasets and recombination-aware ancestral sequence reconstruction.

Abstract: RDP3 is a computer program for statistical identification and characterization of historical recombination events. Given a set of aligned nucleotide sequences, RDP3 will rapidly analyze these with a range of powerful non-parametric recombination detection methods (including BOOTSCAN, MAXCHI, CHIMAERA, 3SEQ, GENECONV, SISCAN, PHYLPRO and VISRD; Boni et al., 2007; Gibbs et al., 2000; Lemey et al., 2009; Padidam et al., 1999; Posada and Crandall, 2001; Weiller, 1998). It will provide a detailed breakdown of recombination breakpoint locations, and the identities of recombinant and parental sequences. For further downstream analyses, the program enables users to save edited sequence alignments with (i) recombinant sequences removed; (ii) recombinationally derived tracts of sequence removed; or (iii) recombinant sequences split into their constituent parts. An important strength of RDP3 that makes it applicable to a variety of recombination analysis problems is that, unlike many other recombination detection programs such as SIMPLOT (Lole et al., 1999), DUAL BROTHERS (Minin et al., 2005), JPHMM (Schultz et al., 2006) or SCUEAL (Kosakovsky et al., 2009), it does not screen predefined sets of potentially recombinant (or query) sequences against other predefined sets of non-recombinant (or reference) sequences. RDP3 instead treats every sequence within an input alignment as a potential recombinant and systematically screens large numbers of sequence triplets and/or quartets to identify sets of three or four sequences that contain a recombinant and two sequences resembling its parents. Such an approach means that RDP3 can simultaneously detect the entire scope of recombination evident within a dataset (i.e. not just that occurring between the reference strains or species), enabling its use in the characterization of complex recombinants such as those derived through recombination between parental sequences that were themselves recombinant. The drawback of such a flexible, exploratory framework is that it can often be difficult to assess the uncertainty associated with inferred recombination patterns. However, with its wide range of cross-checking tools, RDP3 is complementary to probabilistic recombination analysis approaches.

1,655 citations

Journal Article

Bayesian phylogeography finds its roots.

Philippe Lemey¹, Andrew Rambaut², Alexei J. Drummond³, Marc A. Suchard

Katholieke Universiteit Leuven¹, University of Edinburgh², University of Auckland³

25 Sep 2009-PLOS Computational Biology

TL;DR: It is concluded that the Bayesian phylogeographic framework will make an important asset in molecular epidemiology that can be easily generalized to infer biogeography from genetic data for many organisms.

Abstract: As a key factor in endemic and epidemic dynamics, the geographical distribution of viruses has been frequently interpreted in the light of their genetic histories. Unfortunately, inference of historical dispersal or migration patterns of viruses has mainly been restricted to model-free heuristic approaches that provide little insight into the temporal setting of the spatial dynamics. The introduction of probabilistic models of evolution, however, offers unique opportunities to engage in this statistical endeavor. Here we introduce a Bayesian framework for inference, visualization and hypothesis testing of phylogeographic history. By implementing character mapping in a Bayesian software that samples time-scaled phylogenies, we enable the reconstruction of timed viral dispersal patterns while accommodating phylogenetic uncertainty. Standard Markov model inference is extended with a stochastic search variable selection procedure that identifies the parsimonious descriptions of the diffusion process. In addition, we propose priors that can incorporate geographical sampling distributions or characterize alternative hypotheses about the spatial dynamics. To visualize the spatial and temporal information, we summarize inferences using virtual globe software. We describe how Bayesian phylogeography compares with previous parsimony analysis in the investigation of the influenza A H5N1 origin and H5N1 epidemiological linkage among sampling localities. Analysis of rabies in West African dog populations reveals how virus diffusion may enable endemic maintenance through continuous epidemic cycles. From these analyses, we conclude that our phylogeographic framework will make an important asset in molecular epidemiology that can be easily generalized to infer biogeography from genetic data for many organisms.

1,535 citations

Journal Article

Improving the Accuracy of Demographic and Molecular Clock Model Comparison While Accommodating Phylogenetic Uncertainty

Guy Baele¹, Philippe Lemey¹, Trevor Bedford², Andrew Rambaut², Marc A. Suchard³, Alexander V. Alekseyenko⁴

Katholieke Universiteit Leuven¹, University of Edinburgh², University of California, Los Angeles³, New York University⁴

01 Sep 2012-Molecular Biology and Evolution

TL;DR: It is shown that PS and SS sampling substantially outperform these estimators and adjust the conclusions made concerning previous analyses for the three real-world data sets that were reanalyzed.

Abstract: Recent developments in marginal likelihood estimation for model selection in the field of Bayesian phylogenetics and molecular evolution have emphasized the poor performance of the harmonic mean estimator (HME). Although these studies have shown the merits of new approaches applied to standard normally distributed examples and small real-world data sets, not much is currently known concerning the performance and computational issues of these methods when fitting complex evolutionary and population genetic models to empirical real-world data sets. Further, these approaches have not yet seen widespread application in the field due to the lack of implementations of these computationally demanding techniques in commonly used phylogenetic packages. We here investigate the performance of some of these new marginal likelihood estimators, specifically, path sampling (PS) and stepping-stone (SS) sampling for comparing models of demographic change and relaxed molecular clocks, using synthetic data and real-world examples for which unexpected inferences were made using the HME. Given the drastically increased computational demands of PS and SS sampling, we also investigate a posterior simulation-based analogue of Akaike’s information criterion (AIC) through Markov chain Monte Carlo (MCMC), a model comparison approach that shares with the HME the appealing feature of having a low computational overhead over the original MCMC analysis. We confirm that the HME systematically overestimates the marginal likelihood and fails to yield reliable model classification and show that the AICM performs better and may be a useful initial evaluation of model choice but that it is also, to a lesser degree, unreliable. We show that PS and SS sampling substantially outperform these estimators and adjust the conclusions made concerning previous analyses for the three real-world data sets that we reanalyzed. 
The methods used in this article are now available in BEAST, a powerful user-friendly software package to perform Bayesian evolutionary analyses.

988 citations

Journal Article

Genomics and epidemiology of the P.1 SARS-CoV-2 lineage in Manaus, Brazil.

Nuno R. Faria, Thomas A. Mellan¹, Charles Whittaker¹, Ingra Morales Claro², Darlan da Silva Candido²,³, Swapnil Mishra¹, Myuki A E Crispim, Flavia C. S. Sales², Iwona Hawryluk¹, John T. McCrone⁴, Ruben J.G. Hulswit³, Lucas A M Franco², Mariana S. Ramundo², Jaqueline Goes de Jesus², Pamela S Andrade², Thais M. Coletti², Giulia M. Ferreira⁵, Camila A. M. Silva², Erika R. Manuli², Rafael Henrique Moraes Pereira, Pedro S. Peixoto², Moritz U. G. Kraemer³, Nelson Gaburo, Cecilia da C. Camilo, Henrique Hoeltgebaum¹, William Marciel de Souza², Esmenia C. Rocha², Leandro Marques de Souza², Mariana C. Pinho², Leonardo José Tadeu de Araújo⁶, Frederico S V Malta, Aline B. de Lima, Joice do P. Silva, Danielle A G Zauli, Alessandro C. S. Ferreira, Ricardo P Schnekenberg³, Daniel J Laydon¹, Patrick G T Walker¹, Hannah M. Schlüter¹, Ana L. P. dos Santos, Maria S. Vidal, Valentina S. Del Caro, Rosinaldo M. F. Filho, Helem M. dos Santos, Renato Santana Aguiar⁷, José Luiz Proença-Módena⁸, Bruce Walker Nelson⁹, James A. Hay¹⁰, Melodie Monod¹, Xenia Miscouridou¹, Helen Coupland¹, Raphael Sonabend¹, Michaela A. C. Vollmer¹, Axel Gandy¹, Carlos A. Prete², Vitor H. Nascimento², Marc A. Suchard¹¹, Thomas A. Bowden³, Sergei L Kosakovsky Pond¹², Chieh-Hsi Wu¹³, Oliver Ratmann¹, Neil M. Ferguson¹, Christopher Dye³, Nicholas J. Loman¹⁴, Philippe Lemey¹⁵, Andrew Rambaut⁴, Nelson Abrahim Fraiji, Maria Perpétuo Socorro Sampaio Carvalho, Oliver G. Pybus³,¹⁶, Seth Flaxman¹, Samir Bhatt¹,¹⁷, Ester Cerdeira Sabino²

Imperial College London¹, University of São Paulo², University of Oxford³, University of Edinburgh⁴, Federal University of Uberlandia⁵, Instituto Adolfo Lutz⁶, Universidade Federal de Minas Gerais⁷, State University of Campinas⁸, National Institute of Amazonian Research⁹, Harvard University¹⁰, University of California, Los Angeles¹¹, Temple University¹², University of Southampton¹³, University of Birmingham¹⁴, Katholieke Universiteit Leuven¹⁵, Royal Veterinary College¹⁶, University of Copenhagen¹⁷

21 May 2021-Science

TL;DR: Using a two-category dynamical model that integrates genomic and mortality data, the authors estimate that P.1 may be 1.7- to 2.4-fold more transmissible and that previous (non-P.1) infection provides 54 to 79% of the protection against infection with P.1 that it provides against non-P.1 lineages.

Abstract: Cases of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection in Manaus, Brazil, resurged in late 2020 despite previously high levels of infection. Genome sequencing of viruses sampled in Manaus between November 2020 and January 2021 revealed the emergence and circulation of a novel SARS-CoV-2 variant of concern. Lineage P.1 acquired 17 mutations, including a trio in the spike protein (K417T, E484K, and N501Y) associated with increased binding to the human ACE2 (angiotensin-converting enzyme 2) receptor. Molecular clock analysis shows that P.1 emergence occurred around mid-November 2020 and was preceded by a period of faster molecular evolution. Using a two-category dynamical model that integrates genomic and mortality data, we estimate that P.1 may be 1.7- to 2.4-fold more transmissible and that previous (non-P.1) infection provides 54 to 79% of the protection against infection with P.1 that it provides against non-P.1 lineages. Enhanced global genomic surveillance of variants of concern, which may exhibit increased transmissibility and/or immune evasion, is critical to accelerate pandemic responsiveness.

985 citations

Cited by

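Changes in var gene switching rates of Plasmodium lead to antigenic variation [in English] / Paul H, Robert P, Christodoulou Z, et al. // Proc Natl Acad Sci U S A

宁北芳, 朱淮民

28 Jul 2005

TL;DR: PfEMP1 interacts with one or more receptors on infected erythrocytes, dendritic cells, and the placenta, and plays a key role in adhesion and immune evasion.

Abstract: Antigenic variation allows many pathogenic microorganisms to evade the host immune response. Plasmodium falciparum erythrocyte membrane protein 1 (PfEMP1), expressed on the surface of infected erythrocytes, interacts with one or more receptors on infected erythrocytes, endothelial cells, dendritic cells, and the placenta, and plays a key role in adhesion and immune evasion. The var gene family encodes about 60 members per haploid genome, and initiating transcription of different var gene variants provides the molecular basis for antigenic variation.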

18,940 citations

Journal Article

BEAST: Bayesian evolutionary analysis by sampling trees

Alexei J. Drummond¹, Andrew Rambaut²

University of Auckland¹, University of Edinburgh²

08 Nov 2007-BMC Evolutionary Biology

TL;DR: BEAST is a fast, flexible software architecture for Bayesian analysis of molecular sequences related by an evolutionary tree that provides models for DNA and protein sequence evolution, highly parametric coalescent analysis, relaxed clock phylogenetics, non-contemporaneous sequence data, statistical alignment and a wide range of options for prior distributions.

Abstract: The evolutionary analysis of molecular sequence variation is a statistical enterprise. This is reflected in the increased use of probabilistic models for phylogenetic inference, multiple sequence alignment, and molecular population genetics. Here we present BEAST: a fast, flexible software architecture for Bayesian analysis of molecular sequences related by an evolutionary tree. A large number of popular stochastic models of sequence evolution are provided and tree-based models suitable for both within- and between-species sequence data are implemented. BEAST version 1.4.6 consists of 81000 lines of Java source code, 779 classes and 81 packages. It provides models for DNA and protein sequence evolution, highly parametric coalescent analysis, relaxed clock phylogenetics, non-contemporaneous sequence data, statistical alignment and a wide range of options for prior distributions. BEAST source code is object-oriented, modular in design and freely available at http://beast-mcmc.googlecode.com/ under the GNU LGPL license. BEAST is a powerful and flexible evolutionary analysis package for molecular sequence variation. It also provides a resource for the further development of new models and statistical methods of evolutionary analysis.

11,916 citations

Journal Article

Statistical method for testing the neutral mutation hypothesis by DNA polymorphism.

Fumio Tajima¹

Kyushu University¹

30 Oct 1989-Genetics

TL;DR: It is suggested that the natural selection against large insertion/deletion is so weak that a large amount of variation is maintained in a population.

11,521 citations

SPAdes, a new genome assembly algorithm and its applications to single-cell sequencing (7th Annual SFAF Meeting, 2012)

Glenn Tesler

01 Jun 2012

TL;DR: SPAdes is a new assembler for both single-cell and standard (multicell) assembly that improves on the recently released E+V-SC assembler and on the popular assemblers Velvet and SoapDeNovo (for multicell data).

Abstract: The lion's share of bacteria in various environments cannot be cloned in the laboratory and thus cannot be sequenced using existing technologies. A major goal of single-cell genomics is to complement gene-centric metagenomic data with whole-genome assemblies of uncultivated organisms. Assembly of single-cell data is challenging because of highly non-uniform read coverage as well as elevated levels of sequencing errors and chimeric reads. We describe SPAdes, a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler (specialized for single-cell data) and on popular assemblers Velvet and SoapDeNovo (for multicell data). SPAdes generates single-cell assemblies, providing information about genomes of uncultivatable bacteria that vastly exceeds what may be obtained via traditional metagenomics studies. SPAdes is available online ( http://bioinf.spbau.ru/spades ). It is distributed as open source software.

10,124 citations

Journal Article

A new coronavirus associated with human respiratory disease in China.

Fan Wu¹, Su Zhao², Bin Yu³, Yan-Mei Chen¹, Wen Wang³, Zhi-Gang Song¹, Yi Hu², Zhao-Wu Tao², Jun-Hua Tian³, Yuan-Yuan Pei¹, Ming-Li Yuan², Yu-Ling Zhang¹, Fa-Hui Dai¹, Yi Liu¹, Qi-Min Wang¹, Jiao-Jiao Zheng¹, Lin Xu¹, Edward C. Holmes¹,⁴, Yong-Zhen Zhang¹,³

Fudan University¹, Huazhong University of Science and Technology², Centers for Disease Control and Prevention³, University of Sydney⁴

03 Feb 2020-Nature

TL;DR: Phylogenetic and metagenomic analyses of the complete viral genome of a new coronavirus from the family Coronaviridae reveal that the virus is closely related to a group of SARS-like coronaviruses found in bats in China.

Abstract: Emerging infectious diseases, such as severe acute respiratory syndrome (SARS) and Zika virus disease, present a major threat to public health [1–3]. Despite intense research efforts, how, when and where new diseases appear are still a source of considerable uncertainty. A severe respiratory disease was recently reported in Wuhan, Hubei province, China. As of 25 January 2020, at least 1,975 cases had been reported since the first patient was hospitalized on 12 December 2019. Epidemiological investigations have suggested that the outbreak was associated with a seafood market in Wuhan. Here we study a single patient who was a worker at the market and who was admitted to the Central Hospital of Wuhan on 26 December 2019 while experiencing a severe respiratory syndrome that included fever, dizziness and a cough. Metagenomic RNA sequencing [4] of a sample of bronchoalveolar lavage fluid from the patient identified a new RNA virus strain from the family Coronaviridae, which is designated here ‘WH-Human 1’ coronavirus (and has also been referred to as ‘2019-nCoV’). Phylogenetic analysis of the complete viral genome (29,903 nucleotides) revealed that the virus was most closely related (89.1% nucleotide similarity) to a group of SARS-like coronaviruses (genus Betacoronavirus, subgenus Sarbecovirus) that had previously been found in bats in China [5]. This outbreak highlights the ongoing ability of viral spill-over from animals to cause severe disease in humans.

9,231 citations
