Inference of Ancient Whole-Genome Duplications and the Evolution of Gene Duplication and Loss Rates
read more
Citations
A rooted phylogeny resolves early bacterial evolution.
The Origin of Land Plants Is Rooted in Two Bursts of Genomic Novelty.
Asterid Phylogenomics/Phylotranscriptomics Uncover Morphological Evolutionary Histories and Support Phylogenetic Placement for Numerous Whole-Genome Duplications
Distinct Expression and Methylation Patterns for Genes with Different Fates following a Single Whole-Genome Duplication in Flowering Plants
A rooted phylogeny resolves early bacterial evolution
References
MrBayes 3.2: Efficient Bayesian Phylogenetic Inference and Model Choice across a Large Model Space
The evolutionary fate and consequences of duplicate genes
The genome of black cottonwood, Populus trichocarpa (Torr. & Gray)
Maximum likelihood phylogenetic estimation from DNA sequences with variable rates over sites: approximate methods
MCMC using Hamiltonian dynamics
Related Papers (5)
Ancestral polyploidy in seed plants and angiosperms
RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies.
Frequently Asked Questions (9)
Q2. What have the authors stated for future works in "Inference of ancient whole genome duplications and the evolution of the gene duplication and loss rate" ?
Accounting for these complexities in a probabilistic framework is another challenge for future research, and would require more sophisticated models that explicitly model the polyploid phase of the lineage under consideration. In particular, their model does not account for incomplete lineage sorting, and incorporating the multi-species coalescent in their framework to account for the possibility of deep coalescence would be an interesting future development. The authors believe these might be fruitful further research directions. In particular genome-scale molecular dating would be a promising avenue, where the temporal signal from both the gene family and sequence evolution process could be employed using relaxed clock priors on both duplication, loss and substitution rates to date species divergence times and WGDs in an integrative fashion.
Q3. What are the main methods used to uncover evidence for ancient WGDs?
Especially heuristic gene tree - species tree reconciliation methods have been widely employed to unveil evidence for ancient WGDs (e.g. Jiao et al.
Q4. What is the merit of the amalgamation approach?
The authors note that, besides being very efficient, the amalgamation approach has the merit that it only requires a sample from the posterior distribution over gene tree topologies.
Q5. What is the reason for the conflicting signals for the putative gymnosperm WGD?
A possible explanation for the conflicting signals for the putative gymnosperm WGD in the nine-taxon and five-taxon analyses may be that it is an artifact due to a strong drop in duplication rate in the Ginkgo lineage compared to the gymnosperm stem and the lineage leading to P. abies.
Q6. What is the effect of a DL model on the number of duplicates?
To assess whether a particular number of duplicates corresponds to a significant increase in the number of duplications (possibly stemming from a WGD) they simulated gene tree topologies under the species tree of interest using a constant-rates DL model, with four sets of duplication and loss rates, which are estimated using gene count data.
Q7. What is the effect of assuming a very low prior probability on multiple genes at the root?
As expected, assuming a very low prior probability on multiple genes at the root (1/η ≈ 1) leads to overestimation of λ and underestimation of µ.
Q8. What are the special considerations needed when handling the root of S?
The authors next consider the special considerations needed when handling the root of S and the ubiquitous clade Γ.Prior on the number of lineages at the root and conditioningA fundamental issue in probabilistic gene tree - species tree reconciliation is that an explicit or implicit assumption on the number of lineages present at the root of the species tree is required.
Q9. What is the probability of a lineage leaving no descendants at the end of the time slice?
The propagation probability is the probability that a single lineage entering a time slice at time t ‘propagates’ through the time slice to generate exactly one lineage at the end of the time slice (time t′) which has observed descendants at the present (t0 = 0).