scispace - formally typeset
Search or ask a question

Showing papers in "Journal of Molecular Evolution in 1985"


Journal ArticleDOI
TL;DR: A new statistical method for estimating divergence dates of species from DNA sequence data by a molecular clock approach is developed, and this dating may pose a problem for the widely believed hypothesis that the bipedal creatureAustralopithecus afarensis, which lived some 3.7 million years ago, was ancestral to man and evolved after the human-ape splitting.
Abstract: A new statistical method for estimating divergence dates of species from DNA sequence data by a molecular clock approach is developed. This method takes into account effectively the information contained in a set of DNA sequence data. The molecular clock of mitochondrial DNA (mtDNA) was calibrated by setting the date of divergence between primates and ungulates at the Cretaceous-Tertiary boundary (65 million years ago), when the extinction of dinosaurs occurred. A generalized least-squares method was applied in fitting a model to mtDNA sequence data, and the clock gave dates of 92.3 +/- 11.7, 13.3 +/- 1.5, 10.9 +/- 1.2, 3.7 +/- 0.6, and 2.7 +/- 0.6 million years ago (where the second of each pair of numbers is the standard deviation) for the separation of mouse, gibbon, orangutan, gorilla, and chimpanzee, respectively, from the line leading to humans. Although there is some uncertainty in the clock, this dating may pose a problem for the widely believed hypothesis that the pipedal creature Australopithecus afarensis, which lived some 3.7 million years ago at Laetoli in Tanzania and at Hadar in Ethiopia, was ancestral to man and evolved after the human-ape splitting. Another likelier possibility is that mtDNA was transferred through hybridization between a proto-human and a proto-chimpanzee after the former had developed bipedalism.

8,124 citations


Journal ArticleDOI
TL;DR: Evidence is summarized that supports the hypothesis that A and T nucleotides are favored at all locations in the D. yakuba mtDNA molecule where these nucleotide are compatible with function.
Abstract: The sequence of the 16,019 nucleotide-pair mitochondrial DNA (mtDNA) molecule ofDrosophila yakuba is presented. This molecule contains the genes for two rRNAs, 22 tRNAs, six identified proteins [cytochrome b, cytochrome c oxidase subunits I, II, and III (COI-III), and ATPase subunits 6 and 8] and seven presumptive proteins (URF1-6 and URF4L). Replication originates within a region of 1077 nucleotides that is 92.8% A+T and lacks any open reading frame larger than 123 nucleotides. An equivalent to the sequence found in all mammalian mtDNAs that is associated with initiation of second-strand DNA synthesis is not present inD. yakuba mtDNA. Introns are absent fromD. yakuba mitochondrial genes and there are few (0–31) intergenic nucleotides. The genes found inD. yakuba and mammalian mtDNAs are the same, but there are differences in their arrangement and in the relative proportions of the complementary strands of the molecule that serve as templates for transcription. Although theD. yakuba small and large mitochondrial rRNA genes are exceptionally low in G and C and are shorter than any other metazoan rRNA genes reported, they can be folded into secondary structures remarkably similar to the secondary structures proposed for mammalian mitochondrial rRNAs.D. yakuba mitochondrial tRNA genes, like their mammalian counterparts, are more variable in sequence than nonorganelle tRNAs. In mitochrondrial protein genes ATG, ATT, ATA, and in one case (COI) ATAA appear to be used as translation initiation codons. The only termination codon found in these genes is TAA. In theD. yakuba mitochondrial genetic code, AGA, ATA, and TGA specify serine, isoleucine, and tryptophan, respectively. Fifty-nine types of sense codon are used in theD. yakuba mitochondrial protein genes, but 93.8% of all codons end in A or T. Codon-anticodon interactions may include both G-A and C-A pairing in the wobble position. Evidence is summarized that supports the hypothesis that A and T nucleotides are favored at all locations in theD. yakuba mtDNA molecule where these nucleotides are compatible with function.

1,436 citations


Journal ArticleDOI
TL;DR: On the average, the Dayhoff LOM approach was the most effective in verifying distant relationships, as judged by an empirical “jumbling test.”
Abstract: We examined two extensive families of protein sequences using four different alignment schemes that employ various degrees of “weighting” in order to determine which approach is most sensitive in establishing relationships. All alignments used a similarity approach based on a general algorithm devised by Needleman and Wunsch. The approaches included a simple program, UM (unitary matrix), whereby only identities are scored; a scheme in which the genetic code is used as a basis for weighting (GC); another that employs a matrix based on structural similarity of amino acids taken together with the genetic basis of mutation (SG); and a fourth that uses the empirical log-odds matrix (LOM) developed by Dayhoff on the basis of observed amino acid replacements. The two sequence families examined were (a) nine different globins and (b) nine different tyrosine kinase-like proteins. It was assumed a priori that all members of a family share common ancestry. In cases where two sequences were more than 30% identical, alignments by all four methods were almost always the same. In cases where the percentage identity was less than 20%, however, there were often significant differences in the alignments. On the average, the Dayhoff LOM approach was the most effective in verifying distant relationships, as judged by an empirical “jumbling test.” This was not universally the case, however, and in some instances the simple UM was actually as good or better. Trees constructed on the basis of the various alignments differed with regard to their limb lengths, but had essentially the same branching orders. We suggest some reasons for the different effectivenesses of the four approaches in the two different sequence settings, and offer some rules of thumb for assessing the significance of sequence relationships.

378 citations


Journal ArticleDOI
TL;DR: A tentative model of the secondary structure of soybean 18S rRNA is presented and discussed in the context of the functions of the various conserved regions within the sequence, and the short basepaired sequences defining the four structural and functional domains of all 18s rRNAs are seen to be well conserved.
Abstract: We present the sequence of the nuclearencoded ribosomal small-subunit RNA from soybean. The soybean 18S rRNA sequence of 1807 nucleotides (nt) is contained in a gene family of approximately 800 closely related members per haploid genome. This sequence is compared with the ribosomal small-subunit RNAs of maize (1805 nt), yeast (1789 nt),Xenopus (1825 nt), rat (1869 nt), andEscherichia coli (1541 nt). Significant sequence homology is observed among the eukaryotic small-subunit rRNAs examined, and some sequence homology is observed between eukaryotic and prokaryotic small-subunit rRNAs. Conserved regions are found to be interspersed among highly diverged sequences. The significance of these comparisons is evaluated using computer simulation of a random sequence model. A tentative model of the secondary structure of soybean 18S rRNA is presented and discussed in the context of the functions of the various conserved regions within the sequence. On the basis of this model, the short basepaired sequences defining the four structural and functional domains of all 18S rRNAs are seen to be well conserved. The potential roles of other conserved soybean 18S rRNA sequences in protein synthesis are discussed.

150 citations


Journal ArticleDOI
TL;DR: In the maximum-likelihood trees for both large- and small-subunit rRNAs, Animalia and Fungi were the most closely related eukaryotic kingdoms, and Plantae is the nextmost closely related kingdom, although other branching orders among Plantae, AnimalIA, and F Bungi were not excluded by this work.
Abstract: Phylogenetic trees among eukaryotic kingdoms were inferred for large- and small-subunit rRNAs by using a maximum-likelihood method developed by Felsenstein. Although Felsenstein's method assumes equal evolutionary rates for transitions and transversions, this is apparently not the case for these data. Therefore, only transversiontype substitutions were taken into account. The molecules used were large-subunit rRNAs fromXenopus laevis (Animalia), rice (Plantae),Saccharomyces cerevisiae (Fungi),Dictyostelium discoideum (Protista), andPhysarum polycephalum (Protista); and small-subunit rRNAs from maize (Plantae),S. cerevisiae, X. laevis, rat (Animalia), andD. discoideum. Only conservative regions of the nucleotide sequences were considered for this study. In the maximum-likelihood trees for both large- and small-subunit rRNAs, Animalia and Fungi were the most closely related eukaryotic kingdoms, and Plantae is the next most closely related kingdom, although other branching orders among Plantae, Animalia, and Fungi were not excluded by this work. These three eukaryotic kingdoms apparently shared a common ancestor after the divergence of the two species of Protista,D. discoideum andP. polycephalum. These two species of Protista do not form a clade, andP. polycephalum diverged first andD. discoideum second from the line leading to the common ancestor of Plantae, Animalia, and Fungi. The sequence data indicate that a drastic change occurred in the nucleotide sequences of rRNAs during the evolutionary separation between prokaryote and eukaryote.

146 citations


Journal ArticleDOI
TL;DR: Mycoplasmas are actually tachytelic bacteria, and the unusual evolutionary characteristics of their ribosomal RNAs are the imprints of their rapid evolution.
Abstract: In phenotype the mycoplasmas are very different from ordinary bacteria. However, genotypically (i.e., phylogenetically) they are not. On the basis of ribosomal RNA homologies the mycoplasmas belong with the clostridia, and indeed have specific clostridial relatives. Mycoplasmas are, however, unlike almost all other bacteria in the evolutionary characteristics of their ribosomal RNAs. These RNAs contain relatively few of the highly conserved oligonucleotide sequences characteristic of normal eubacterial ribosomal RNAs. This is interpreted to be a reflection of an elevated mutation rate in mycoplasma lines of descent. A general consequence of this would be that the variation associated with a mycoplasma population is augmented both in number and kind, which in turn would lead to an unusual evolutionary course, one unique in all respects. Mycoplasmas, then, are actually tachytelic bacteria. The unusual evolutionary characteristics of their ribosomal RNAs are the imprints of their rapid evolution.

141 citations


Journal ArticleDOI
TL;DR: Comparisons of goose-type, chicken- type, and phage-type lysozymes are made to suggest that all three classes of lysozyme diverged from a common evolutionary precursor, even though their amino acid sequences appear to be unrelated.
Abstract: The three-dimensional structure of goose-type lysozyme (GEWL), determined by x-ray crystallography and refined at high resolution, has similarities to the structures of hen (chicken) eggwhite lysozyme (HEWL) and bacteriophage T4 lysozyme (T4L). The nature of the structural correspondence suggests that all three classes of lysozyme diverged from a common evolutionary precursor, even though their amino acid sequences appear to be unrelated (Grutter et al. 1983).

114 citations


Journal ArticleDOI
TL;DR: Three different relationships have been found: one for prokaryotes and viruses, one for lower eukaryotes, and one for vertebrates, which are linearly correlated with the GC levels of the corresponding genomes.
Abstract: The GC levels of codon third positions from 49 genomes coveering a wide phylogenetic range are linearly correlated with the GC levels of the corresponding genomes. Three different relationships have been found: one for prokaryotes and viruses, one for lower eukaryotes, and one for vertebrates. All points not fitting the first relationship can be brought into quasi coincidence with it when plotted against GC levels of coding sequences.

106 citations


Journal ArticleDOI
TL;DR: It is shown that the rate of amino acid substitution of a protein is correlated with its amino acid composition, and the content of glycine residues is negatively correlated with the rates of substitution.
Abstract: Based on the rates of amino acid substitution for 60 mammalian genes of 50 codons or more, it is shown that the rate of amino acid substitution of a protein is correlated with its amino acid composition. In particular, the content of glycine residues is negatively correlated with the rate of amino acid substitution, and this content alone explains about 38% of the total variation in amino acid substitution rates among different protein families. The propensity of a polypeptide to evolve fast or slowly may be predicted from an index or indices of protein mutability directly derivable from the amino acid composition. The propensity of an amino acid to remain conserved during evolutionary times depends not so much on its being features prominently in active sites, but on its stability index, defined as the mean chemical distance [R. Grantham (1974) Science 185∶862–864] between the amino acid and its mutational derivatives produced by single-nucleotide substitutions. Functional constraints related to active and binding sites of proteins play only a minor role in determining the overall rate of amino acid substitution. The importance of amino acid composition in determining rates of substitution is illustrated with examples involving cytochrome c, cytochrome b5,ras-related genes, the calmodulin protein family, and fibrinopeptides.

99 citations


Journal ArticleDOI
TL;DR: Close structural resemblances between several mammalian highly or moderately repetitive families and some specific tRNAs were detected, suggesting that these repetitive families may be generated by nonhomologous recombination between a t RNA gene and a tRNA-unrelated block.
Abstract: Close structural resemblances between several mammalian highly or moderately repetitive families and some specific tRNAs were detected. The rodent type 2 Alu family, rat identifier (ID) sequences, rabbit C family, and bovine or goat 73-bp repeat are most homologous with lysine tRNA5, phenylalanine tRNA, glycine tRNA, and glycine tRNA, respectively. The homologies extend to secondary structures, and the homologous nucleotides are located on nearly the same secondary structures. The repetitive families mentioned have a common structural organization, with a tRNA-like sequence devoid of an aminoacyl stem region. These features suggest that these repetitive families may be generated by nonhomologous recombination between a tRNA gene and a tRNA-unrelated block.

98 citations


Journal ArticleDOI
TL;DR: The S.S. solfataricus sequence is specifically related to those of the other archaebacteria, to the exclusion of the eukaryotic and eubacterial sequences, when examined either by evolutionary distance matrix analyses or by the criterion of minimum change (maximum parsimony).
Abstract: The sequence of the small-subunit rRNA from the thermoacidophilic archaebacterium Sulfolobus solfataricus has been determined and compared with its counterparts from halophilic and methanogenic archaebacteria, eukaryotes, and eubacteria. The S. solfataricus sequence is specifically related to those of the other archaebacteria, to the exclusion of the eukaryotic and eubacterial sequences, when examined either by evolutionary distance matrix analyses or by the criterion of minimum change (maximum parsimony). The archaebacterial 16S rRNA sequences all conform to a common secondary structure, with the S. solfataricus structure containing a higher proportion of canonical base pairs and fewer helical irregularities than the rRNAs from the mesophilic archaebacteria. S. solfataricus is unusual in that its 16S rRNA-23S rRNA intergenic spacer lacks a tRNA gene.

Journal ArticleDOI
TL;DR: The results show that a relationship exists between the physical-chemical properties of the amino acids and which of the A (U), U (A), or C (G) nucleotide is used in the second codon (anticodon) position.
Abstract: The 20 naturally occurring amino acids are characterized by 20 variables: pKNH2, pKCOOH, pI, molecular weight, substituent van der Waals volume, seven 1H and 13C nuclear magnetic resonance shift variables, and eight hydrophobicity-hydrophilicity scales. The 20-dimensional data set is reduced to a few new dimensions by principal components analysis. The three first principal components reveal relationships between the properties of the amino acids and the genetic code. Thus the amino acids coded for by adenosine (A), uracil (U), or cytosine (C) in their second codon position (corresponding to U, A, or G in the second anticodon position) are grouped in these components. No grouping was detected for the amino acids coded for by guanine (G) in the second codon position (corresponding to C in the second anticodon position). The results show that a relationship exists between the physical-chemical properties of the amino acids and which of the A (U), U (A), or C (G) nucleotide is used in the second codon (anticodon) position. The amino acids coded for by G (C) in the second codon (anticodon) position do not participate in this relationship.

Journal ArticleDOI
TL;DR: A considerable sample of eucaryotic nuclear DNA sequences have significant local structure over subsequences of three to five contiguous bases, and that this structure occurs throughout the total length of the sequence.
Abstract: Sixty-four eucaryotic nuclear DNA sequences, half of them coding and half noncoding, have been examined as expressions of first-, second-, or third-order Markov chains. Standard statistical tests found that most of the sequences required at least second-order Markov chains for their representation, and some required chains of third order. For all 64 sequences the observed one-step second-order transition count matrices were effective in predicting the two-step transition count matrices, and 56 of 64 were effective in predicting the three-step transition count matrices. The departure from random expectation of the observed first- and second-order transition count matrices meant that a considerable sample of eucaryotic nuclear DNA sequences, both protein coding and noncoding, have significant local structure over subsequences of three to five contiguous bases, and that this structure occurs throughout the total length of the sequence. These results suggested that present DNA sequences may have arisen from the duplication, concatenation, and gradual modification of very early short sequences.

Journal ArticleDOI
TL;DR: The relationships among the bacterial ferredoxins were reexamined and an evolutionary tree consistent with this new understanding was derived, concluding that the amino-terminal domain of theAzotobacter-type ferredoxin, which contains the novel 3Fe∶3S cluster binding site, is homologous with the carboxyl-terminals of the fer redoxins from the anaerobic photosynthetic bacteria.
Abstract: Recent evidence indicates that a gene transposition event occurred during the evolution of the bacterial ferredoxins subsequent to the ancestral intrasequence gene duplication. In light of this new information, the relationships among the bacterial ferredoxins were reexamined and an evolutionary tree consistent with this new understanding was derived. The bacterial ferredoxins can be divided into several groups based on their sequence properties; these include the clostridial-type ferredoxins, theAzotobacter-type ferredoxins, and a group containing the ferredoxins from the anaerobic, green, and purple sulfur bacteria. Based on sequence comparison, it was concluded that the amino-terminal domain of theAzotobacter-type ferredoxins, which contains the novel 3Fe∶3S cluster binding site, is homologous with the carboxyl-terminal domain of the ferredoxins from the anaerobic photosynthetic bacteria. A number of ferredoxin sequences do not fit into any of the groups described above. Based on sequence properties, these sequences can be separated into three groups: a group containingMethanosarcina barkeri ferredoxin andDesulfovibrio desulfuricans ferredoxin II, a group containingDesulfovibrio gigas ferredoxin andClostridium thermoaceticum ferredoxin, and a group containingDesulfovibrio africanus ferredoxin I andBacillus stearothermophilus ferredoxin. The last two groups differ from all of the other bacterial ferredoxins in that they bind only one Fe∶S cluster per polypeptide, whereas the others bind two. Sequence examination indicates that the second binding site has been either partially or completely lost from these ferredoxins.Methanosarcina barkeri ferredoxin andDesulfovibrio desulfuricans ferredoxin II are of interest because, of all the ferredoxins whose sequences are presently known, they show the strongest evidence of internal gene duplication. However, the derived evolutionary tree indicates that they diverged from theAzotobacter-type ferredoxins well after the ancestral internal gene duplication. This apparent discrepancy is explained by postulating a duplication of one halfchain sequence and a deletion of the other halfchain. TheClostridium thermoaceticum andBacillus stearothermophilus groups diverged from this line and subsequently lost one of the Fe∶S binding sites. It has recently become apparent that gene duplication is ubiquitous among the ferredoxins. Several organisms are now known to have a variety of ferredoxins with widely divergent properties. Unfortunately, in only one case are the sequences of more than one ferredoxin from the same organism known. Thus, although the major features of the bacterial ferredoxin tree are now understood, a complete bacterial phylogeny cannot be inferred until more sequence information is available.

Journal ArticleDOI
TL;DR: Codon usage in the 50 genes of T7 is nonrandom, both over the whole code and among groups of synonymous codons, and there is a great excess of purineany base-pyrimidine (RNY) codons.
Abstract: We searched the complete 39,936 base DNA sequence of bacteriophage T7 for nonrandomness that might be attributed to natural selection. Codon usage in the 50 genes of T7 is nonrandom, both over the whole code and among groups of synonymous codons. There is a great excess of purineany base-pyrimidine (RNY) codons. Codon usage varies between genes, but from the pooled data for the whole genome (12,145 codons) certain putative selective constraints can be identified. Codon usage appears to be influenced by host tRNA abundance (particularly in highly expressed genes), tRNA-mRNA interactions (one such interaction being perhaps responsible for maintaining the excess of RNY codons) and a lack of short palindromes. This last constraint is probably due to selection against host restriction enzyme recognition sites; this is the first report of an effect of this kind on codon usage. Selection against susceptibility to mutational damage does not appear to have been involved.

Journal ArticleDOI
TL;DR: The sequences of seven pairs of chimpanzee and human Alu repeats are compared, finding that the identical Alu repeat is located at identical sites in the human and chimpanzee genomes.
Abstract: The DNA sequences of three members of the Alu family of repeated sequences located 5′ to the chimpanzee α2 gene have been determined. The base sequences of the three corresponding human Alu family repeats have been previously determined, permitting the comparison of identical Alu family members in human and chimpanzee. Here we compare the sequences of seven pairs of chimpanzee and human Alu repeats. In each case, with the exception of minor sequence differences, the identical Alu repeat is located at identical sites in the human and chimpanzee genomes. The Alu repeats diverge at the rate expected for nonselected sequences. Sequence conversion has not replaced any of these 14 Alu family members since the divergence between chimpanzee and human.

Journal ArticleDOI
TL;DR: This model implies that proton translocation in a closed-membrane system preceded photochemical or electron transport mechanisms and that chemically transferable metabolic energy was needed at a much earlier stage in the development of life than has usually been assumed.
Abstract: It is proposed that the first entity capable of adaptive Darwinian evolution consisted of a liposome vesicle formed of (1) abiotically produced phospholipidlike molecules; (2) a very few informational macromolecules; and (3) some abiogenic, lipid-soluble, organic molecule serving as a symporter for phosphate and protons and as a means of high-energy-bond generation. The genetic material had functions that led to the production of phospholipidlike materials (leading to growth and division of the primitive cells) and of the carrier needed for energy transduction. It is suggested that the most primitive exploitable energy source was the donation of 2H++2e− at the external face of the primitive cell. The electrons were transferred (by metal impurities) to internal sinks of organic material, thus creating, via a deficit, a protonmotive force that could drive both the active transport of phosphate and high-energy-bond formation.

Journal ArticleDOI
TL;DR: The phylogenetic analyses of the D. melanogaster sequences show that the fastslow distinction is not perfect, and suggest that intragenic recombination or gene conversion occurred in the evolution of this locus, which is roughly compatible with the neutral theory of molecular evolution.
Abstract: Recent sequencing of over 2300 nucleotides containing the alcohol dehydrogenase (Adh) locus in each of 11 Drosophila melanogaster lines makes it possible to estimate the approximate age of the electrophoretic "fast-slow" polymorphism. Our estimates, based on various possible patterns of evolution, range from 610,000 to 3,500,000 years, with 1,000,000 years as a reasonable point estimate. Furthermore, comparison of these sequences with those of the homologous region of D. simulans and D. mauritiana allows us to infer the pattern of evolutionary change of the D. melanogaster sequences. The integrity of the Adh-f electrophoretic alleles as a single lineage is supported by both unweighted pair-group method (UPGMA) and parsimony analyses. However, considerable divergence among the Adh-s lines seems to have preceded the origin of the Adh-f allele. Comparisons of the sequences of D. melanogaster genes with those of D. simulans and D. mauritiana genes suggest that the split between the latter two species occurred more recently than the divergence of some of the present-day Adh-s genes in D. melanogaster. The phylogenetic analyses of the D. melanogaster sequences show that the fast-slow distinction is not perfect, and suggest that intragenic recombination or gene conversion occurred in the evolution of this locus. We extended conventional phylogenetic analyses by using a statistical technique for detecting and characterizing recombination events. We show that the pattern of differentiation of DNA sequences in D. melanogaster is roughly compatible with the neutral theory of molecular evolution.

Journal ArticleDOI
TL;DR: Comparative mapping of truncated elements indicates that a specific endonucleolytic activity might bei involved in illegitimate (nonhomologous) recombination events, and sequence divergence analyses provide insights into the origin and molecular evolution of these elements.
Abstract: We present approximately 7.0 kb of composite DNA sequence of a long interspersed middle repetitive element (LINE1) present in high copy number in the rat genome. The family of these repeats, which includes transcribing members, is the rat homologue of the mouse MIF-Bam-R and human Kpn I LINEs. Sequence alignments between specimens from these three species define the length of a putative unidentified open reading frame, and document extensive recombination events that, in conjunction with retroposition, have generated this large family of pseudogenes and pseudogene fragments. Comparative mapping of truncated elements indicates that a specific endonucleolytic activity might be involved in illegitimate (nonhomologous) recombination events. Sequence divergence analyses provide insights into the origin and molecular evolution of these elements.

Journal ArticleDOI
TL;DR: A method for estimating quantitatively the influence of point mutations and selection on the frequencies of codons and amino acids is outlined and a notable suppression of replacements leading to tryptophan, glutamate, lysine, and methionine is revealed.
Abstract: We outline a method for estimating quantitatively the influence of point mutations and selection on the frequencies of codons and amino acids. We show how the mutation rate, i.e., the rate of amino acid replacement due to point mutation, can be affected by the codon usage as well as by the rates of the involved base exchanges. A comparison of the mutation rates calculated from reliable values of codon usage and base exchange probabilities with those that would be expected on the basis of chance reveals a notable suppression of replacements leading to tryptophan, glutamate, lysine, and methionine, and particularly of those leading to the termination codons.

Journal ArticleDOI
TL;DR: Comparisons show that the vertebrate muscle creatine kinases constitute a remarkably conserved protein family with a unit evolutionary period of 30 and retain marked sequence similarity with the more distantly related invertebrate guanidino kinases.
Abstract: The nucleotide sequence of cloned DNA corresponding to full-length mouse muscle creatine kinase mRNA has been determined. This 1415 base pair DNA sequence and the deduced 381 amino acid sequence of the protein have been compared to creatine kinase sequences from other vertebrate species and to invertebrate guanidino kinase sequences. These comparisons show that the vertebrate muscle creatine kinases constitute a remarkably conserved protein family with a unit evolutionary period of 30. The creatine kinases also retain marked sequence similarity with the more distantly related invertebrate guanidino kinases. A portion of the sequence, presumably part of the ATP binding site, shows similarity to other nucleotide binding proteins with diverse functions. Comparisons of the untranslated regions of the creatine kinase cDNA sequences show that the 5′ untranslated regions are more highly conserved than are the 3′ untranslated regions; this may point to some regulatory function in the 5′ region.

Journal ArticleDOI
TL;DR: A significantly elevated rate of change of synonymouscodons among the adjacent codons 5′ to amino acid replacement positions provides further support for the idea that there are contextual constraints on the choice of synonymous codons in eukaryotes.
Abstract: We examined the codon usages in wellconserved and less-well-conserved regions of vertebrate protein genes and found them to be similar. Despite this similarity, there is a statistically significant decrease in codon bias in the less-well-conserved regions. Our analysis suggests that although those codon changes initially fixed under amino acid replacements tend to follow the overall codon usage pattern, they also reduce the bias in codon usage. This decrease in codon bias leads one to predict that the rate of change of synonymous codons should be greater in those regions that are less well conserved at the amino acid level than in the better-conserved regions. Our analysis supports this prediction. Furthermore, we demonstrate a significantly elevated rate of change of synonymous codons among the adjacent codons 5′ to amino acid replacement positions. This provides further support for the idea that there are contextual constraints on the choice of synonymous codons in eukaryotes.

Journal ArticleDOI
TL;DR: The results support the conclusion that although duplication and specialization of tubulin genes in metazoans may have led to distinct types of tubulins, the axonemal one has remained highly conserved.
Abstract: In spite of their overall evolutionary conservation, the tubulins of ciliates display electrophoretic and structural particularities. We show here that antibodies raised against Paramecium and Tetrahymena ciliary tubulins fail to recognize the cytoplasmic tubulins of all the metazoans tested. Immunoblotting of peptide maps of ciliate tubulins reveals that these antibodies react with one or very few ciliate-specific epitopes, in contrast to polyclonal antibodies against vertebrate tubulins, which are equivalent to autoantibodies and recognize several epitopes in both ciliate and vertebrate tubulins. Furthermore, we show that the anti-ciliate antibodies recognize ciliary and flagellar tubulins of metazoans ranging from sea urchin to mammals (with the exception of humans). The results support the conclusion that although duplication and specialization of tubulin genes in metazoans may have led to distinct types of tubulins, the axonemal one has remained highly conserved.

Journal ArticleDOI
TL;DR: Comparisons are drawn between this H1 gene and the coding sequences of other vertebrate H1 genes from chicken and Xenopus, and a strong homology is seen in the region of amino acids 22–101, which form the hydrophobic “head” of the H1 molecule.
Abstract: A 1.7-kbp DNA region from the 10.2-kb cluster containing the five rainbow trout histone genes has been subcloned in pBR322 and completely sequenced. It contains a trout histone H1 gene together with its 5′ and 3′ flanking sequences. This H1 gene codes for a H1 variant different from the major trout testis H1 previously sequenced by Macleod et al. (1977). Northern blots of total RNA from trout testis, kidney, and liver indicate that this H1 gene is expressed in all three tissues but that the level of H1 mRNA is much higher in testis than in other tissues. The lack of heterogeneity in the sizes and 5′ initiation sites of trout H1 mRNAs is surprising in view of the substantial heterogeneity of H1 variant proteins observed previously. The coding sequence of the H1 gene shows strong evidence of repeated partial duplications of a hexapeptide motif of the form Ala.Ala.Ala.Lys.Lys.Pro and of a pentapeptide phosphorylation-site sequence, Lys.Ser.Pro.Lys.Lys, during its evolution. Comparisons are drawn between this gene and the coding sequences of other vertebrate H1 genes from chicken andXenopus, and a strong homology is seen in the region of amino acids 22–101, which form the hydrophobic “head” of the H1 molecule. The 5′ and 3′ regulatory signals in the trout H1 are also compared with those of H1 genes from other sequences.

Journal ArticleDOI
TL;DR: The sequence homology suggests a pathway for the evolution of the four-domain molecule from a single-domain dimer that can be correlated with simultaneous development of the saccharide-binding sites.
Abstract: We compared the homologous amino acid sequences of hevein and each of the four domains (A, B, C, and D) of wheat germ agglutinin and used them to construct a pseudophylogenetic tree relating these sequences to a hypothetical common ancestor sequence. In the crystal structure of the wheat germ agglutinin dimer, six pseudo-twofold rotational symmetry axes have previously been located in addition to the true twofold axis. Four of these relate two nonidentical domains to each other in each of the four possible pairs constituting the sugar-binding sites (A1D2, A2D1, B1C2, and B2C1). The remaining two relate contiguous unique pairs of sugar-binding sites to each other (A1D2 to B1C2, and A2D1 to B2C1). These latter two sets of pairs are related to each other by the true twofold axis. Side chains that mediate sugar binding in the interfaces of each of the four pairs were found to be largely conserved. The sequence homology, taken together with these pseudo-symmetry elements in the dimer structure, suggests a pathway for the evolution of the four-domain molecule from a single-domain dimer that can be correlated with simultaneous development of the saccharide-binding sites.

Journal ArticleDOI
TL;DR: Mycoplasma capricolum was previously found to use UGA instead of UGG as its codon for tryptophan and to contain 75%A+ T in its DNA, but the codon change could have been due to mutational pressure to replace C+G by A+T.
Abstract: Mycoplasma capricolum was previously found to use UGA instead of UGG as its codon for tryptophan and to contain 75 percent A + T in its DNA. The codon change could have been due to mutational pressure to replace C + G by A + T, resulting in the replacement of UGA stop codons by UAA, change of the anticodon in tryptophan tRNA from CCA to UCA, and replacement of UGG tryptophan codons by UGA. None of these changes should have been deleterious.

Journal ArticleDOI
TL;DR: A 1.6-kb fragment of DNA from the thermophilic, methane-producing, anaerobic archaebacteriumMethanobacterium thermoautotrophicum ΔH has been cloned and sequenced and predicts that complementation in E. coli results from the synthesis of a polypeptide with a molecular weight of 36,249.
Abstract: A 1.6-kb fragment of DNA from the thermophilic, methane-producing, anaerobic archaebacteriumMethanobacterium thermoautotrophicum ΔH has been cloned and sequenced. This DNA complements mutations in both the purE1 and purE2 loci ofEscherichia coli. The sequence of theM. thermoautotrophicum DNA predicts that complementation inE. coli results from the synthesis of a polypeptide with a molecular weight of 36,249. A polypeptide apparently of this molecular weight is synthesized inE. coli minicells containing recombinant plasmids that carry the cloned fragment of methanogen DNA. We have previously cloned and sequenced a purE-complementing gene from the mesophilic methanogenMethanobrevibacter smithii. The two methanogen-derived purE-complementing genes are 53% homologous and encode polypeptides that are 45% homologous in their amino acid sequences but would be 74% homologous if conservative amino acid substitutions were considered as maintaining sequence homology. The genome ofM. thermoautotrophicum has a molar G+C content of 49.7%, whereas the genome ofM. smithii is 30.6% G+C. Conservation of encoded amino acids while accommodating the very different G+C contents is accomplished by use of different codons that encode the same amino acid. The majority of base changes occur at the third codon position. The intergenic regions of the clonedM. thermoautotrophicum DNA contain sequences previously identified as ribosome binding sites and as putative methanogen promoters. Although the two purE-complementing genes are apparently derived from a common ancestor, only the gene fromM. smithii maintains a codon usage that conforms to the RNY rule.

Journal ArticleDOI
TL;DR: It is shown that adsorption on ferric hydroxide and on several other minerals has no effect, under the conditions studies, on the template-directed oligomerization of guanylic acid on polycytidylic acid.
Abstract: Ferric hydroxide, a plausible prebiotic material, strongly adsorbs polynucleotides. We show that adsorption on ferric hydroxide and on several other minerals has no effect, under the conditions studies, on the template-directed oligomerization of guanylic acid on polycytidylic acid.

Journal ArticleDOI
TL;DR: The low level of GATC in enterobacteriophages is probably due not to6mA hypermutability, but to selection against GATCs in order to bypass a GATc-mediated host function.
Abstract: Postreplicative methylation of adenine inEscherichia coli DNA to produce G6m ATC (where6mA is 6-methyladenine) has been associated with preferential daughter-strand repair and possibly regulation of replication. An analysis was undertaken to determine if these, or other, as yet unknown roles of GATC, have had an effect on the frequency of GATC inE. coli or bacteriophage DNA. It was first ascertained that the most accurate predictions of GATC frequency were based on the observed frequencies of GAT and ATC, which would be expected since these predictors take into account preferences in codon usage. The predicted frequencies were compared with observed GATC frequencies in all available bacterial and phage nucleotide sequences. The frequency of GATC was close to the predicted frequency in most genes ofE. coli and its RNA bacteriophages and in the genes of nonenteric bacteria and their bacteriophages. However, for DNA enterobacteriophages the observed frequency of DNA enterobacteriophages the observed frequency of GATC was generally significantly lower than predicted when assessed by the chi square test. No elevation in the rate of mutation of6mA in GATC relative to other bases was found when pairs of DNA sequences from closely related phages or pairs of homologous genes from enterobacteria were compared, nor was any preferred pathway for mutation of6mA evident in theE. coli DNA bacteriophages. This situation contrasts with that of 5-methylcytosine, which is hypermutable, with a preferred pathway to thymine. Thus, the low level of GATC in enterobacteriophages is probably due not to6mA hypermutability, but to selection against GATC in order to bypass a GATC-mediated host function.

Journal ArticleDOI
TL;DR: Life on Earth may have begun about 4×109 years (4 Ga) ago, with rapid spreading at mid-ocean ridges, a komatiitic oceanic crust, active volcanic arcs and the development of extensional basins on continental crust.
Abstract: Life on Earth may have begun about 4×109 years (4 Ga) ago. Plate tectonics probably operated in the early Archaean, with rapid spreading at mid-ocean ridges, a komatiitic (magnesium-rich) oceanic crust, active volcanic arcs and the development of extensional basins on continental crust. Shallow water environments would have been more restricted and probably shorter-lived than in later geological times; however, extensive shallow seas existed in the later phases of the development of extensional basins. Bacterial communities-presumably photosynthetic-have probably existed in such shallow-water settings and probably at shallow depths in the oceans for at least 3.5 Ga. Because the mid-ocean ridges were probably subaqueous, hydrothermal systems would have been very vigorous and would have offered suitable habitats for early chemo-autotrophic bacterial communities. Early life forms probably also occupied vesicles in lavas, pumice and volcanic breccias, and pores in soft sediments, living in the constant flux of fluid flushing through permeable strata. Other, similar habitats would have existed in volcanic island arcs and in extensional basins.