scispace - formally typeset
Search or ask a question

Showing papers on "Ancestral reconstruction published in 2010"


Journal ArticleDOI
TL;DR: Count performs ancestral reconstruction, and infers family- and lineage-specific characteristics along the evolutionary tree, and implements popular methods employed in gene content analysis such as Dollo and Wagner parsimony, propensity for gene loss, as well as probabilistic methods involving a phylogenetic birth-and-death model.
Abstract: Summary: Count is a software package for the analysis of numerical profiles on a phylogeny. It is primarily designed to deal with profiles derived from the phyletic distribution of homologous gene families, but is suited to study any other integer-valued evolutionary characters. Count performs ancestral reconstruction, and infers family- and lineage-specific characteristics along the evolutionary tree. It implements popular methods employed in gene content analysis such as Dollo and Wagner parsimony, propensity for gene loss, as well as probabilistic methods involving a phylogenetic birth-and-death model. Availability: Count is available as a stand-alone Java application, as well as an application bundle for MacOS X, at the web site http://www.iro.umontreal.ca/∼csuros/gene_content/count.html. It can also be launched using Java Webstart from the same site. The software is distributed under a BSD-style license. Source code is available upon request from the author. Contact: csuros@iro.umontreal.ca.

367 citations


Journal ArticleDOI
TL;DR: Phylogenetic reconstruction from data revealed correlated substitutions at Y120/Q125 or N157/S169 repeated across multiple lineages in every host, indicating convergent within-host evolution shaped by epistatic interactions.
Abstract: Rapidly evolving viruses such as HIV-1 display extensive sequence variation in response to host-specific selection, while simultaneously maintaining functions that are critical to replication and infectivity. This apparent conflict between diversifying and purifying selection may be resolved by an abundance of epistatic interactions such that the same functional requirements can be met by highly divergent sequences. We investigate this hypothesis by conducting an extensive characterization of sequence variation in the HIV-1 nef gene that encodes a highly variable multifunctional protein. Population-based sequences were obtained from 686 patients enrolled in the HOMER cohort in British Columbia, Canada, from which the distribution of nonsynonymous substitutions in the phylogeny was reconstructed by maximum likelihood. We used a phylogenetic comparative method on these data to identify putative epistatic interactions between residues. Two interactions (Y120/Q125 and N157/S169) were chosen to further investigate within-host evolution using HIV-1 RNA extractions from plasma samples from eight patients. Clonal sequencing confirmed strong linkage between polymorphisms at these sites in every case. We used massively parallel pyrosequencing (MPP) to reconstruct within-host evolution in these patients. Experimental error associated with MPP was quantified by performing replicates at two different stages of the protocol, which were pooled prior to analysis to reduce this source of variation. Phylogenetic reconstruction from these data revealed correlated substitutions at Y120/Q125 or N157/S169 repeated across multiple lineages in every host, indicating convergent within-host evolution shaped by epistatic interactions.

76 citations


Journal ArticleDOI
TL;DR: The results introduce a powerful experimental approach for comparative analysis of functional specificity in protein families even in the cases of pronounced epistasis, provide foundation for the detailed studies of evolutionary trajectories leading to novelty and complexity, and will help rational modification of existing fluorescent labels.
Abstract: Proteins of the green fluorescent protein family represent a convenient experimental model to study evolution of novelty at the molecular level. Here, we focus on the origin of Kaede-like red fluorescent proteins characteristic of the corals of the Faviina suborder. We demonstrate, using an original approach involving resurrection and analysis of the library of possible evolutionary intermediates, that it takes on the order of 12 mutations, some of which strongly interact epistatically, to fully recapitulate the evolution of a red fluorescent phenotype from the ancestral green. Five of the identified mutations would not have been found without the help of ancestral reconstruction, because the corresponding site states are shared between extant red and green proteins due to their recent descent from a dual-function common ancestor. Seven of the 12 mutations affect residues that are not in close contact with the chromophore and thus must exert their effect indirectly through adjustments of the overall protein fold; the relevance of these mutations could not have been anticipated from the purely theoretical analysis of the protein's structure. Our results introduce a powerful experimental approach for comparative analysis of functional specificity in protein families even in the cases of pronounced epistasis, provide foundation for the detailed studies of evolutionary trajectories leading to novelty and complexity, and will help rational modification of existing fluorescent labels.

71 citations


Journal ArticleDOI
TL;DR: Using a phylogenetic tree of concatenated ribosomal proteins, the analysis of amino acid compositional bias detects a strong and unique signal associated with the early expansion of the genetic code, placing the root of the translation machinery along the bacterial branch.
Abstract: The origin of the genetic code and the rooting of the tree of life (ToL) are two of the most challenging problems in the study of life’s early evolution. Although both have been the focus of numerous investigations utilizing a variety of methods, until now, each problem has been addressed independently. Typically, attempts to root the ToL have relied on phylogenies of genes with ancient duplications, which are subject to artifacts of tree reconstruction and horizontal gene transfer, or specific physiological characters believed to be primitive, which are often based on subjective criteria. Here, we demonstrate a unique method for rooting based on the identification of amino acid usage biases comprising the residual signature of a more primitive genetic code. Using a phylogenetic tree of concatenated ribosomal proteins, our analysis of amino acid compositional bias detects a strong and unique signal associated with the early expansion of the genetic code, placing the root of the translation machinery along the bacterial branch.

69 citations


Journal ArticleDOI
TL;DR: It is revealed that the patterns of morphological evolution in Polypteridae are similar to those seen in other osteichthyans, thus implying the underlying genetic and developmental mechanisms were established early in the evolutionary history of Osteichthyes.
Abstract: The family Polypteridae, commonly known as "bichirs", is a lineage that diverged early in the evolutionary history of Actinopterygii (ray-finned fish), but has been the subject of far less evolutionary study than other members of that clade. Uncovering patterns of morphological change within Polypteridae provides an important opportunity to evaluate if the mechanisms underlying morphological evolution are shared among actinoptyerygians, and in fact, perhaps the entire osteichthyan (bony fish and tetrapods) tree of life. However, the greatest impediment to elucidating these patterns is the lack of a well-resolved, highly-supported phylogenetic tree of Polypteridae. In fact, the interrelationships of polypterid species have never been subject to molecular phylogenetic analysis. Here, we infer the first molecular phylogeny of bichirs, including all 12 recognized species and multiple subspecies using Bayesian analyses of 16S and cyt-b mtDNA. We use this mitochondrial phylogeny, ancestral state reconstruction, and geometric morphometrics to test whether patterns of morphological evolution, including the evolution of body elongation, pelvic fin reduction, and craniofacial morphology, are shared throughout the osteichthyan tree of life. Our molecular phylogeny reveals 1) a basal divergence between Erpetoichthys and Polypterus, 2) polyphyly of P. endlicheri and P. palmas, and thus 3) the current taxonomy of Polypteridae masks its underlying genetic diversity. Ancestral state reconstructions suggest that pelvic fins were lost independently in Erpetoichthys, and unambiguously estimate multiple independent derivations of body elongation and shortening. Our mitochondrial phylogeny suggested species that have lower jaw protrusion and up-righted orbit are closely related to each other, indicating a single transformation of craniofacial morphology. The mitochondrial phylogeny of polypterid fish provides a strongly-supported phylogenetic framework for future comparative evolutionary, physiological, ecological, and genetic analyses. Indeed, ancestral reconstruction and geometric morphometric analyses revealed that the patterns of morphological evolution in Polypteridae are similar to those seen in other osteichthyans, thus implying the underlying genetic and developmental mechanisms responsible for those patterns were established early in the evolutionary history of Osteichthyes. We propose developmental and genetic mechanisms to be tested under the light of this new phylogenetic framework.

22 citations


Journal ArticleDOI
TL;DR: Evidence is found that Tribolonotus originated on either Greater Bougainville or in New Guinea, and subsequently colonized surrounding islands via multiple colonization events, and it is argued that character displacement, combined with ecological differences between New Guinean species (T. gracilis and T. novaeguineae), best explains the evolution of body size in the genus Tribolonotsus.

16 citations


Journal ArticleDOI
09 Oct 2010
TL;DR: The consistency problem for different gene cluster models on sequences with restricted gene multiplicities is discussed and results range from linear-time algorithms for the simple model of adjacencies to NP completeness for more complex models like common intervals.
Abstract: In comparative genomics, various combinatorial models can be used to specify gene clusters--groups of genes that are co-located in a set of genomes. Several approaches have been proposed to reconstruct putative ancestral gene clusters based on the gene order of contemporary species. One prevalent and natural reconstruction criterion is consistency: For a set of reconstructed gene clusters, there should exist a gene order that comprises all given clusters. In this paper, we discuss the consistency problem for different gene cluster models on sequences with restricted gene multiplicities. Our results range from linear-time algorithms for the simple model of adjacencies to NP completeness for more complex models like common intervals.

12 citations


Journal ArticleDOI
TL;DR: A comprehensive probabilistic model to describe the evolution of binary characters on a bifurcating phylogenetic tree is presented and a fast software tool is provided, using maximum likelihood to estimate the parameters of the model and to reconstruct ancestral states and events.
Abstract: Evolutionary binary characters are features of species or genes, indicating the absence (value zero) or presence (value one) of some property. Examples include eukaryotic gene architecture (the presence or absence of an intron in a particular locus), gene content, and morphological characters. In many studies, the acquisition of such binary characters is assumed to represent a rare evolutionary event, and consequently, their evolution is analyzed using various flavors of parsimony. However, when gain and loss of the character are not rare enough, a probabilistic analysis becomes essential. Here, we present a comprehensive probabilistic model to describe the evolution of binary characters on a bifurcating phylogenetic tree. A fast software tool, EREM, is provided, using maximum likelihood to estimate the parameters of the model and to reconstruct ancestral states (presence and absence in internal nodes) and events (gain and loss events along branches).

7 citations


Book ChapterDOI
01 Jan 2010
TL;DR: This work considers the less-studied problem of computing the expected reconstruction error of a maximum likelihood reconstruction, given the phylogenetic tree and model of evolution, but not the extant sequences.
Abstract: The ancestral sequence reconstruction problem asks to predict the DNA or protein sequence of an ancestral species, given the sequences of extant species. Such reconstructions are fundamental to comparative genomics, as they provide information about extant genomes and the process of evolution that gave rise to them. Arguably the best method for ancestral reconstruction is maximum likelihood estimation. Many eective algorithms for accurately computing the most likely ancestral sequence have been proposed. We consider the less-studied problem of computing the expected reconstruction error of a maximum likelihood reconstruction, given the phylogenetic tree and model of evolution, but not the extant sequences. This situation can arise, for example, when deciding which genomes to sequence for a reconstruction project given a gene-tree phylogeny (The Taxon Selection Problem). In most applications, the reconstruction error is necessarily very small, making Monte Carlo simulations very inecient for accurate estimation. We present the first practical algorithm for this problem and demonstrate how it can be used to quickly and accurately estimate the reconstruction accuracy. We then use our method as a kernel in a heuristic algorithm for the taxon selection problem. The implementation is available at http://www.mcb.mcgill.ca/ blanchem/mlerror

2 citations