Home
/
Authors
/
Thomas H. Jukes

Author

Thomas H. Jukes

Other affiliations: Columbia University, University of California, Santa Barbara

Bio: Thomas H. Jukes is an academic researcher from University of California, Berkeley. The author has contributed to research in topics: Genetic code & Transfer RNA. The author has an hindex of 37, co-authored 226 publications receiving 16017 citations. Previous affiliations of Thomas H. Jukes include Columbia University & University of California, Santa Barbara.

Papers published on a yearly basis

2007
2001
1997
1996
1995
1994
1993
1992
1991
1990
1989
1988
1987
1986
1985
1984
1983
1982
1981
1980
1979
1978
1977
1976
1975
1974
1973
1972
1971
1970
1969
1968
1967
1966
1965
1964
1948
1947
1945
1940
1938
1936
1935
1934

Papers

PDF

Open Access

More filters

Book Chapter•DOI•

CHAPTER 24 – Evolution of Protein Molecules

[...]

Thomas H. Jukes¹•Institutions (1)

University of California, Berkeley¹

01 Jan 1969

10,262 citations

Journal Article•DOI•

Non-Darwinian Evolution

[...]

Jack Lester King, Thomas H. Jukes

16 May 1969-Science

TL;DR: NonDarwinian evolution of protein and DNA, comparing expectations of evolution models for protein and amino acid changes is compared.

...read moreread less

Abstract: NonDarwinian evolution of protein and DNA, comparing expectations of evolution models for protein and amino acid changes

...read moreread less

1,480 citations

Journal Article•DOI•

“Homology” in proteins and nucleic acids: A terminology muddle and a way out of it

[...]

Gerald R. Reeck¹, Christoph de Haën², David C. Teller², Russell F. Doolittle³, Walter M. Fitch⁴, Richard E. Dickerson⁵, Pierre Chambon, A.D. McLachlan⁶, Emanuel Margoliash⁷, Thomas H. Jukes⁸, Emile Zuckerkandl⁹ - Show less +7 more•Institutions (9)

Kansas State University¹, University of Washington², University of California, San Diego³, University of Southern California⁴, University of California, Los Angeles⁵, Laboratory of Molecular Biology⁶, Northwestern University⁷, University of California, Berkeley⁸, Linus Pauling Institute⁹

28 Aug 1987-Cell

TL;DR: It is believed that the concepts of evolutionary homology and sequence or three-dimensional similarity can be kept distinct only if they are referred to with different words.

...read moreread less

403 citations

Journal Article•DOI•

Codon reassignment (codon capture) in evolution

[...]

Syozo Osawa¹, Thomas H. Jukes²•Institutions (2)

Nagoya University¹, University of California, Berkeley²

01 Apr 1989-Journal of Molecular Evolution

TL;DR: The role of directional mutationpressure in the pressure in the events is discussed, and the possibility that such events have taken place during early evolution of the genetic code and can occur during its present evolution is described.

...read moreread less

Abstract: The genetic code, once thought to be "frozen," shows variations from the universal code. Variations are found in mitochondria, Mycoplasma, and ciliated protozoa. The variations result from reassignment of codons, especially stop codons. The reassignments take place by disappearance of a codon from coding sequences, followed by its reappearance in a new role. Simultaneously, a changed anticodon must appear. We discuss the role of directional mutation pressure in the events, and we also describe the possibility that such events have taken place during early evolution of the genetic code and can occur during its present evolution.

...read moreread less

227 citations

Journal Article•DOI•

The welfare state

[...]

Thomas H. Jukes¹•Institutions (1)

University of California, Berkeley¹

12 Nov 1992-Nature

180 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

The neighbor-joining method: a new method for reconstructing phylogenetic trees.

[...]

Naruya Saitou¹, Masatoshi Nei•Institutions (1)

University of Texas Health Science Center at Houston¹

01 Jul 1987-Molecular Biology and Evolution

TL;DR: The neighbor-joining method and Sattath and Tversky's method are shown to be generally better than the other methods for reconstructing phylogenetic trees from evolutionary distance data.

...read moreread less

Abstract: A new method called the neighbor-joining method is proposed for reconstructing phylogenetic trees from evolutionary distance data. The principle of this method is to find pairs of operational taxonomic units (OTUs [= neighbors]) that minimize the total branch length at each stage of clustering of OTUs starting with a starlike tree. The branch lengths as well as the topology of a parsimonious tree can quickly be obtained by using this method. Using computer simulation, we studied the efficiency of this method in obtaining the correct unrooted tree in comparison with that of five other tree-making methods: the unweighted pair group method of analysis, Farris's method, Sattath and Tversky's method, Li's method, and Tateno et al.'s modified Farris method. The new, neighbor-joining method and Sattath and Tversky's method are shown to be generally better than the other methods.

...read moreread less

57,055 citations

Journal Article•DOI•

A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences.

[...]

Motoo Kimura¹•Institutions (1)

National Institute of Genetics¹

01 Dec 1980-Journal of Molecular Evolution

TL;DR: Some examples were worked out using reported globin sequences to show that synonymous substitutions occur at much higher rates than amino acid-altering substitutions in evolution.

...read moreread less

Abstract: Some simple formulae were obtained which enable us to estimate evolutionary distances in terms of the number of nucleotide substitutions (and, also, the evolutionary rates when the divergence times are known). In comparing a pair of nucleotide sequences, we distinguish two types of differences; if homologous sites are occupied by different nucleotide bases but both are purines or both pyrimidines, the difference is called type I (or “transition” type), while, if one of the two is a purine and the other is a pyrimidine, the difference is called type II (or “transversion” type). Letting P and Q be respectively the fractions of nucleotide sites showing type I and type II differences between two sequences compared, then the evolutionary distance per site is K = — (1/2) ln {(1 — 2P — Q) }. The evolutionary rate per year is then given by k = K/(2T), where T is the time since the divergence of the two sequences. If only the third codon positions are compared, the synonymous component of the evolutionary base substitutions per site is estimated by K'S = — (1/2) ln (1 — 2P — Q). Also, formulae for standard errors were obtained. Some examples were worked out using reported globin sequences to show that synonymous substitutions occur at much higher rates than amino acid-altering substitutions in evolution.

...read moreread less

26,016 citations

Journal Article•DOI•

MODELTEST: testing the model of DNA substitution.

[...]

David Posada¹, Keith A. Crandall•Institutions (1)

Brigham Young University¹

01 Jan 1998-Bioinformatics

TL;DR: The program MODELTEST uses log likelihood scores to establish the model of DNA evolution that best fits the data.

...read moreread less

Abstract: Summary: The program MODELTEST uses log likelihood scores to establish the model of DNA evolution that best fits the data. Availability: The MODELTEST package, including the source code and some documentation is available at http://bioag.byu.edu/zoology/crandall―lab/modeltest.html. Contact: dp47@email.byu.edu.

...read moreread less

20,105 citations

Journal Article•DOI•

A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood.

[...]

Stéphane Guindon¹, Olivier Gascuel¹•Institutions (1)

Centre national de la recherche scientifique¹

01 Oct 2003-Systematic Biology

TL;DR: This work has used extensive and realistic computer simulations to show that the topological accuracy of this new method is at least as high as that of the existing maximum-likelihood programs and much higher than the performance of distance-based and parsimony approaches.

...read moreread less

Abstract: The increase in the number of large data sets and the complexity of current probabilistic sequence evolution models necessitates fast and reliable phylogeny reconstruction methods. We describe a new approach, based on the maximum- likelihood principle, which clearly satisfies these requirements. The core of this method is a simple hill-climbing algorithm that adjusts tree topology and branch lengths simultaneously. This algorithm starts from an initial tree built by a fast distance-based method and modifies this tree to improve its likelihood at each iteration. Due to this simultaneous adjustment of the topology and branch lengths, only a few iterations are sufficient to reach an optimum. We used extensive and realistic computer simulations to show that the topological accuracy of this new method is at least as high as that of the existing maximum-likelihood programs and much higher than the performance of distance-based and parsimony approaches. The reduction of computing time is dramatic in comparison with other maximum-likelihood packages, while the likelihood maximization ability tends to be higher. For example, only 12 min were required on a standard personal computer to analyze a data set consisting of 500 rbcL sequences with 1,428 base pairs from plant plastids, thus reaching a speed of the same order as some popular distance-based and parsimony algorithms. This new method is implemented in the PHYML program, which is freely available on our web page: http://www.lirmm.fr/w3ifa/MAAS/. (Algorithm; computer simulations; maximum likelihood; phylogeny; rbcL; RDPII project.) The size of homologous sequence data sets has in- creased dramatically in recent years, and many of these data sets now involve several hundreds of taxa. More- over, current probabilistic sequence evolution models (Swofford et al., 1996 ; Page and Holmes, 1998 ), notably those including rate variation among sites (Uzzell and Corbin, 1971 ; Jin and Nei, 1990 ; Yang, 1996 ), require an increasing number of calculations. Therefore, the speed of phylogeny reconstruction methods is becoming a sig- nificant requirement and good compromises between speed and accuracy must be found. The maximum likelihood (ML) approach is especially accurate for building molecular phylogenies. Felsenstein (1981) brought this framework to nucleotide-based phy- logenetic inference, and it was later also applied to amino acid sequences (Kishino et al., 1990). Several vari- ants were proposed, most notably the Bayesian meth- ods (Rannala and Yang 1996; and see below), and the discrete Fourier analysis of Hendy et al. (1994), for ex- ample. Numerous computer studies (Huelsenbeck and Hillis, 1993; Kuhner and Felsenstein, 1994; Huelsenbeck, 1995; Rosenberg and Kumar, 2001; Ranwez and Gascuel, 2002) have shown that ML programs can recover the cor- rect tree from simulated data sets more frequently than other methods can. Another important advantage of the ML approach is the ability to compare different trees and evolutionary models within a statistical framework (see Whelan et al., 2001, for a review). However, like all optimality criterion-based phylogenetic reconstruction approaches, ML is hampered by computational difficul- ties, making it impossible to obtain the optimal tree with certainty from even moderate data sets (Swofford et al., 1996). Therefore, all practical methods rely on heuristics that obtain near-optimal trees in reasonable computing time. Moreover, the computation problem is especially difficult with ML, because the tree likelihood not only depends on the tree topology but also on numerical pa- rameters, including branch lengths. Even computing the optimal values of these parameters on a single tree is not an easy task, particularly because of possible local optima (Chor et al., 2000). The usual heuristic method, implemented in the pop- ular PHYLIP (Felsenstein, 1993 ) and PAUP ∗ (Swofford, 1999 ) packages, is based on hill climbing. It combines stepwise insertion of taxa in a growing tree and topolog- ical rearrangement. For each possible insertion position and rearrangement, the branch lengths of the resulting tree are optimized and the tree likelihood is computed. When the rearrangement improves the current tree or when the position insertion is the best among all pos- sible positions, the corresponding tree becomes the new current tree. Simple rearrangements are used during tree growing, namely "nearest neighbor interchanges" (see below), while more intense rearrangements can be used once all taxa have been inserted. The procedure stops when no rearrangement improves the current best tree. Despite significant decreases in computing times, no- tably in fastDNAml (Olsen et al., 1994 ), this heuristic becomes impracticable with several hundreds of taxa. This is mainly due to the two-level strategy, which sepa- rates branch lengths and tree topology optimization. In- deed, most calculations are done to optimize the branch lengths and evaluate the likelihood of trees that are finally rejected. New methods have thus been proposed. Strimmer and von Haeseler (1996) and others have assembled four- taxon (quartet) trees inferred by ML, in order to recon- struct a complete tree. However, the results of this ap- proach have not been very satisfactory to date (Ranwez and Gascuel, 2001 ). Ota and Li (2000, 2001) described

...read moreread less

16,261 citations

Journal Article•DOI•

New Algorithms and Methods to Estimate Maximum-Likelihood Phylogenies: Assessing the Performance of PhyML 3.0

[...]

Stéphane Guindon, Jean-François Dufayard, Vincent Lefort, Maria Anisimova, Wim Hordijk, Olivier Gascuel - Show less +2 more

29 Mar 2010-Systematic Biology

TL;DR: A new algorithm to search the tree space with user-defined intensity using subtree pruning and regrafting topological moves and a new test to assess the support of the data for internal branches of a phylogeny are introduced.

...read moreread less

Abstract: PhyML is a phylogeny software based on the maximum-likelihood principle. Early PhyML versions used a fast algorithm performing nearest neighbor interchanges to improve a reasonable starting tree topology. Since the original publication (Guindon S., Gascuel O. 2003. A simple, fast and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst. Biol. 52:696-704), PhyML has been widely used (>2500 citations in ISI Web of Science) because of its simplicity and a fair compromise between accuracy and speed. In the meantime, research around PhyML has continued, and this article describes the new algorithms and methods implemented in the program. First, we introduce a new algorithm to search the tree space with user-defined intensity using subtree pruning and regrafting topological moves. The parsimony criterion is used here to filter out the least promising topology modifications with respect to the likelihood function. The analysis of a large collection of real nucleotide and amino acid data sets of various sizes demonstrates the good performance of this method. Second, we describe a new test to assess the support of the data for internal branches of a phylogeny. This approach extends the recently proposed approximate likelihood-ratio test and relies on a nonparametric, Shimodaira-Hasegawa-like procedure. A detailed analysis of real alignments sheds light on the links between this new approach and the more classical nonparametric bootstrap method. Overall, our tests show that the last version (3.0) of PhyML is fast, accurate, stable, and ready to use. A Web server and binary files are available from http://www.atgc-montpellier.fr/phyml/.

...read moreread less

14,385 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse