A universal trend of amino acid gain and loss in protein evolution

doi:10.1038/NATURE03306

Journal ArticleDOI

A universal trend of amino acid gain and loss in protein evolution

I. King Jordan, +6 more

- 10 Feb 2005 -

Nature

- Vol. 433, Iss: 7026, pp 633-638

Chats0

TLDR

Comparison of sets of orthologous proteins encoded by triplets of closely related genomes from 15 taxa representing all three domains of life and phylogenies to polarize amino acid substitutions shows expansion of initially under-represented amino acids apparently continues to this day.

Abstract:

A comparison of corresponding sets of proteins encoded by closely related genes from organisms representing all three domains of life (Bacteria, Archaea and Eukaryota) suggests that the order in which the genetic code was assembled over 3.5 billion years ago continues to influence the evolution of proteins today. Across these diverse genomes, evolving proteins have accumulated Cys, Met, His, Ser and Phe, and lost many of their Pro, Ala, Glu and Gly residues. The same nine amino acids are currently accrued or lost in human proteins as shown by analysis of nucleotide polymorphisms. The amino acids with declining frequencies were probably among the first incorporated into the genetic code, and most of those with increasing frequencies were probably recruited late. Amino acid composition of proteins varies substantially between taxa and, thus, can evolve. For example, proteins from organisms with (G + C)-rich (or (A + T)-rich) genomes contain more (or fewer) amino acids encoded by (G + C)-rich codons1,2,3,4. However, no universal trends in ongoing changes of amino acid frequencies have been reported. We compared sets of orthologous proteins encoded by triplets of closely related genomes from 15 taxa representing all three domains of life (Bacteria, Archaea and Eukaryota), and used phylogenies to polarize amino acid substitutions. Cys, Met, His, Ser and Phe accrue in at least 14 taxa, whereas Pro, Ala, Glu and Gly are consistently lost. The same nine amino acids are currently accrued or lost in human proteins, as shown by analysis of non-synonymous single-nucleotide polymorphisms. All amino acids with declining frequencies are thought to be among the first incorporated into the genetic code; conversely, all amino acids with increasing frequencies, except Ser, were probably recruited late5,6,7. Thus, expansion of initially under-represented amino acids, which began over 3,400 million years ago8,9, apparently continues to this day.

A universal trend of amino acid gain and loss in protein evolution

Citations

A protein evolution model with independent sites that reproduces site-specific amino acid distributions from the Protein Data Bank.

Protein evolution: causes of trends in amino-acid gain and loss.

Extended HP model for protein structure prediction.

Evolutionary patterns in the sequence and structure of transfer RNA: early origins of archaea and viruses.

Hybridization Probe for Femtomolar Quantification of Selected Nucleic Acid Sequences on a Disposable Electrode

References

Clustal w: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice

A Simple Sequentially Rejective Multiple Test Procedure

Floral dip: a simplified method for Agrobacterium-mediated transformation of Arabidopsis thaliana

A genomic perspective on protein families

Efficient transformation of rice (Oryza sativa L.) mediated by Agrobacterium and sequence analysis of the boundaries of the T-DNA.

Related Papers (5)

The origin of the genetic code.

A Production of Amino Acids Under Possible Primitive Earth Conditions

A Co-Evolution Theory of the Genetic Code

Basic Local Alignment Search Tool

Enantiomeric Excesses in Meteoritic Amino Acids