scispace - formally typeset
Search or ask a question
Author

Laura Cano

Bio: Laura Cano is an academic researcher from University of Edinburgh. The author has contributed to research in topics: Synonymous substitution & Selection (genetic algorithm). The author has an hindex of 3, co-authored 4 publications receiving 51 citations.

Papers
More filters
Journal ArticleDOI
TL;DR: An evolutionarily informed approach to attenuation is proposed that, unusually, seeks to increase usage of the already most common synonymous codons in SARS-CoV-2 genes.
Abstract: Large-scale re-engineering of synonymous sites is a promising strategy to generate vaccines either through synthesis of attenuated viruses or via codon optimized genes in DNA vaccines. Attenuation typically relies on de-optimisation of codon pairs and maximization of CpG dinucleotide frequencies. So as to formulate evolutionarily-informed attenuation strategies that aim to force nucleotide usage against the direction favoured by selection, here we examine available whole-genome sequences of SARS-CoV-2 to infer patterns of mutation and selection on synonymous sites. Analysis of mutational profiles indicates a strong mutation bias towards U. In turn, analysis of observed synonymous site composition implicates selection against U. Accounting for dinucleotide effects reinforces this conclusion, observed UU content being a quarter of that expected under neutrality. Possible mechanisms of selection against U mutations includes selection for higher expression, for high mRNA stability or lower immunogenicity of viral genes. Consistent with gene-specific selection against CpG dinucleotides, we observe systematic differences of CpG content between SARS-CoV-2 genes. We propose an evolutionarily-informed approach to attenuation that, unusually, seeks to increase usage of the already most common synonymous codons. Comparable analysis of H1N1 and Ebola finds that GC3 deviated from neutral equilibrium is not a universal feature, cautioning against generalization of results.

68 citations

Journal ArticleDOI
TL;DR: In this article, it was shown that the SARS-CoV-2 mutation rate is at least 49-67% higher than would be estimated based on the rate of appearance of variants in sampled genomes.
Abstract: Owing to a lag between a deleterious mutation's appearance and its selective removal, gold-standard methods for mutation rate estimation assume no meaningful loss of mutations between parents and offspring. Indeed, from analysis of closely related lineages, in SARS-CoV-2 the Ka/Ks ratio was previously estimated as 1.008, suggesting no within-host selection. By contrast, we find a higher number of observed SNPs at 4-fold degenerate sites than elsewhere and, allowing for the virus's complex mutational and compositional biases, estimate that the mutation rate is at least 49-67% higher than would be estimated based on the rate of appearance of variants in sampled genomes. Given the high Ka/Ks one might assume that the majority of such intra-host selection is the purging of nonsense mutations. However, we estimate that selection against nonsense mutations accounts for only ∼10% of all the "missing" mutations. Instead, classical protein-level selective filters (against chemically disparate amino acids and those predicted to disrupt protein functionality) account for many missing mutations. It is less obvious why for an intracellular parasite, amino acid cost parameters, notably amino acid decay rate, are also significant. Perhaps most surprisingly, we also find evidence for real time selection against synonymous mutations that move codon usage away from that of humans. We conclude that there is common intra-host selection on SARS-CoV-2 that acts on nonsense, missense and possibly synonymous mutations. This has implications for methods of mutation rate estimation, for determining times to common ancestry and the potential for intra-host evolution including vaccine escape.

27 citations

Journal ArticleDOI
TL;DR: This paper analyzed the patterns of codon usage in 1,520 vertebrate-infecting viruses, focusing on parameters known to be under selection and associated with gene regulation, and found that GC content, dinucleotide content, and splicing and m6A modification-related sequence motifs are associated with the type of genetic material (DNA or RNA), strandedness, and replication compartment of viruses.
Abstract: The nucleotide composition, dinucleotide composition, and codon usage of many viruses differs from their hosts. These differences arise because viruses are subject to unique mutation and selection pressures that do not apply to host genomes; however, the molecular mechanisms that underlie these evolutionary forces are unclear. Here, we analysed the patterns of codon usage in 1,520 vertebrate-infecting viruses, focusing on parameters known to be under selection and associated with gene regulation. We find that GC content, dinucleotide content, and splicing and m6A modification-related sequence motifs are associated with the type of genetic material (DNA or RNA), strandedness, and replication compartment of viruses. In an experimental follow-up, we find that the effects of GC content on gene expression depend on whether the genetic material is delivered to the cell as DNA or mRNA, whether it is transcribed by endogenous or exogenous RNA polymerase, and whether transcription takes place in the nucleus or cytoplasm. Our results suggest that viral codon usage cannot be explained by a simple adaptation to the codon usage of the host - instead, it reflects the combination of multiple selective and mutational pressures, including the need for efficient transcription, export, and immune evasion.

16 citations

Posted ContentDOI
11 May 2020-bioRxiv
TL;DR: An evolutionarily informed gene-bespoke approach to attenuation that, unusually, seeks to increase usage of the already most common synonymous codons in relation to SARS-CoV2 genes.
Abstract: Large-scale re-engineering of synonymous sites is a promising strategy to generate attenuated viruses for vaccines. Attenuation typically relies on de-optimisation of codon pairs and maximization of CpG dinculeotide frequencies. So as to formulate evolutionarily-informed attenuation strategies, that aim to force nucleotide usage against the estimated direction favoured by selection, here we examine available whole-genome sequences of SARS-CoV2 to infer patterns of mutation and selection on synonymous sites. Analysis of mutational profiles indicates a strong mutation bias towards T with concomitant selection against T. Accounting for dinucleotide effects reinforces this conclusion, observed TT content being a quarter of that expected under neutrality. A significantly different mutational profile at CDS sites that are not 4-fold degenerate is consistent with contemporaneous selection against T mutations more widely. Although selection against CpG dinucleotides is expected to drive synonymous site G+C content below mutational equilibrium, observed G+C content is slightly above equilibrium, possibly because of selection for higher expression. Consistent with gene-specific selection against CpG dinucleotides, we observe systematic differences of CpG content between SARS-CoV2 genes. We propose an evolutionarily informed gene-bespoke approach to attenuation that, unusually, seeks to increase usage of the already most common synonymous codons. Comparable analysis of H1N1 and Ebola finds that GC3 deviated from neutral equilibrium is not a universal feature, cautioning against generalization of results.

12 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: It is found that there is no evidence for significantly more transmissible lineages of SARS-CoV-2 due to recurrent mutations, and recurrent mutations currently in circulation appear to be evolutionary neutral and primarily induced by the human immune system via RNA editing, rather than being signatures of adaptation.
Abstract: COVID-19 is caused by the coronavirus SARS-CoV-2, which jumped into the human population in late 2019 from a currently uncharacterised animal reservoir. Due to this recent association with humans, SARS-CoV-2 may not yet be fully adapted to its human host. This has led to speculations that SARS-CoV-2 may be evolving towards higher transmissibility. The most plausible mutations under putative natural selection are those which have emerged repeatedly and independently (homoplasies). Here, we formally test whether any homoplasies observed in SARS-CoV-2 to date are significantly associated with increased viral transmission. To do so, we develop a phylogenetic index to quantify the relative number of descendants in sister clades with and without a specific allele. We apply this index to a curated set of recurrent mutations identified within a dataset of 46,723 SARS-CoV-2 genomes isolated from patients worldwide. We do not identify a single recurrent mutation in this set convincingly associated with increased viral transmission. Instead, recurrent mutations currently in circulation appear to be evolutionary neutral and primarily induced by the human immune system via RNA editing, rather than being signatures of adaptation. At this stage we find no evidence for significantly more transmissible lineages of SARS-CoV-2 due to recurrent mutations.

269 citations

Journal ArticleDOI
Yang Wang1, Ziqi Zhang1, Jingwen Luo1, Xuejiao Han1, Yuquan Wei1, Xiawei Wei1 
TL;DR: In this article, the molecular biology of mRNA vaccines and underlying anti-virus and anti-tumor mechanisms, with an introduction of their immunological phenomena, delivery strategies, their importance on Corona Virus Disease 2019 (COVID-19) and related clinical trials against cancer and viral diseases.
Abstract: mRNA vaccines have tremendous potential to fight against cancer and viral diseases due to superiorities in safety, efficacy and industrial production. In recent decades, we have witnessed the development of different kinds of mRNAs by sequence optimization to overcome the disadvantage of excessive mRNA immunogenicity, instability and inefficiency. Based on the immunological study, mRNA vaccines are coupled with immunologic adjuvant and various delivery strategies. Except for sequence optimization, the assistance of mRNA-delivering strategies is another method to stabilize mRNAs and improve their efficacy. The understanding of increasing the antigen reactiveness gains insight into mRNA-induced innate immunity and adaptive immunity without antibody-dependent enhancement activity. Therefore, to address the problem, scientists further exploited carrier-based mRNA vaccines (lipid-based delivery, polymer-based delivery, peptide-based delivery, virus-like replicon particle and cationic nanoemulsion), naked mRNA vaccines and dendritic cells-based mRNA vaccines. The article will discuss the molecular biology of mRNA vaccines and underlying anti-virus and anti-tumor mechanisms, with an introduction of their immunological phenomena, delivery strategies, their importance on Corona Virus Disease 2019 (COVID-19) and related clinical trials against cancer and viral diseases. Finally, we will discuss the challenge of mRNA vaccines against bacterial and parasitic diseases.

126 citations

Journal ArticleDOI
01 Jan 2021-Genomics
TL;DR: The most critical findings related to the genetics of the SARS-CoV-2 are reviewed, with a specific focus on genetic diversity and reported mutations, molecular-based diagnosis assays, using interfering RNA technology for the treatment of patients, and genetic-related vaccination strategies.

123 citations

Journal ArticleDOI
TL;DR: A toolkit is presented to compare, analyze and combine SARS-CoV-2 phylogenies, find and remove potential sequencing errors and establish a widely shared, stable clade structure for a more accurate scientific inference and discourse.
Abstract: The SARS-CoV-2 pandemic has led to unprecedented, nearly real-time genetic tracing due to the rapid community sequencing response. Researchers immediately leveraged these data to infer the evolutionary relationships among viral samples and to study key biological questions, including whether host viral genome editing and recombination are features of SARS-CoV-2 evolution. This global sequencing effort is inherently decentralized and must rely on data collected by many labs using a wide variety of molecular and bioinformatic techniques. There is thus a strong possibility that systematic errors associated with lab-or protocol-specific practices affect some sequences in the repositories. We find that some recurrent mutations in reported SARS-CoV-2 genome sequences have been observed predominantly or exclusively by single labs, co-localize with commonly used primer binding sites and are more likely to affect the protein-coding sequences than other similarly recurrent mutations. We show that their inclusion can affect phylogenetic inference on scales relevant to local lineage tracing, and make it appear as though there has been an excess of recurrent mutation or recombination among viral lineages. We suggest how samples can be screened and problematic variants removed, and we plan to regularly inform the scientific community with our updated results as more SARS-CoV-2 genome sequences are shared (https://virological.org/t/issues-with-sars-cov-2-sequencing-data/473 and https://virological.org/t/masking-strategies-for-sars-cov-2-alignments/480). We also develop tools for comparing and visualizing differences among very large phylogenies and we show that consistent clade- and tree-based comparisons can be made between phylogenies produced by different groups. These will facilitate evolutionary inferences and comparisons among phylogenies produced for a wide array of purposes. Building on the SARS-CoV-2 Genome Browser at UCSC, we present a toolkit to compare, analyze and combine SARS-CoV-2 phylogenies, find and remove potential sequencing errors and establish a widely shared, stable clade structure for a more accurate scientific inference and discourse.

91 citations

Journal ArticleDOI
TL;DR: Three human defense mechanisms, the apolipoprotein B mRNA editing catalytic polypeptide-like proteins (APOBEC), adenosine deaminase acting on RNA proteins (ADAR), and reactive oxygen species (ROS), are described and their potential implications on SARS-CoV-2 evolution are discussed.

71 citations