scispace - formally typeset
Search or ask a question

Showing papers in "bioRxiv in 2021"


Posted ContentDOI
04 Oct 2021-bioRxiv
TL;DR: In this article, an AlphaFold model trained specifically for multimeric inputs of known stoichiometry was proposed, which significantly increases the accuracy of predicted multimimeric interfaces over input-adapted single-chain AlphaFolds.
Abstract: While the vast majority of well-structured single protein chains can now be predicted to high accuracy due to the recent AlphaFold [1] model, the prediction of multi-chain protein complexes remains a challenge in many cases. In this work, we demonstrate that an AlphaFold model trained specifically for multimeric inputs of known stoichiometry, which we call AlphaFold-Multimer, significantly increases accuracy of predicted multimeric interfaces over input-adapted single-chain AlphaFold while maintaining high intra-chain accuracy. On a benchmark dataset of 17 heterodimer proteins without templates (introduced in [2]) we achieve at least medium accuracy (DockQ [3] [≥] 0.49) on 14 targets and high accuracy (DockQ [≥] 0.8) on 6 targets, compared to 9 targets of at least medium accuracy and 4 of high accuracy for the previous state of the art system (an AlphaFold-based system from [2]). We also predict structures for a large dataset of 4,433 recent protein complexes, from which we score all non-redundant interfaces with low template identity. For heteromeric interfaces we successfully predict the interface (DockQ [≥] 0.23) in 67% of cases, and produce high accuracy predictions (DockQ [≥] 0.8) in 23% of cases, an improvement of +25 and +11 percentage points over the flexible linker modification of AlphaFold [4] respectively. For homomeric interfaces we successfully predict the interface in 69% of cases, and produce high accuracy predictions in 34% of cases, an improvement of +5 percentage points in both instances.

1,023 citations


Posted ContentDOI
26 Jan 2021-bioRxiv
TL;DR: In this paper, the authors report that B.1.7 is refractory to neutralization by most mAbs to the N-terminal domain (NTD) of spike and relatively resistant to a number of mabs to the receptor-binding domain (RBD).
Abstract: The Covid-19 pandemic has ravaged the globe, and its causative agent, SARS-CoV-2, continues to rage. Prospects of ending this pandemic rest on the development of effective interventions. Two monoclonal antibody (mAb) therapeutics have received emergency use authorization, and more are in the pipeline. Furthermore, multiple vaccine constructs have shown promise, including two with ~95% protective efficacy against Covid-19. However, these interventions were directed toward the initial SARS-CoV-2 that emerged in 2019. Considerable viral evolution has occurred since, including variants with a D614G mutation that have become dominant. Viruses with this mutation alone do not appear to be antigenically distinct, however. Recent emergence of new SARS-CoV-2 variants B.1.1.7 in the UK and B.1.351 in South Africa is of concern because of their purported ease of transmission and extensive mutations in the spike protein. We now report that B.1.1.7 is refractory to neutralization by most mAbs to the N-terminal domain (NTD) of spike and relatively resistant to a number of mAbs to the receptor-binding domain (RBD). It is modestly more resistant to convalescent plasma (~3 fold) and vaccinee sera (~2 fold). Findings on B.1.351 are more worrisome in that this variant is not only refractory to neutralization by most NTD mAbs but also by multiple potent mAbs to the receptor-binding motif on RBD, largely due to an E484K mutation. Moreover, B.1.351 is markedly more resistant to neutralization by convalescent plasma (~11-33 fold) and vaccinee sera (~6.5-8.6 fold). B.1.351 and emergent variants with similar spike mutations present new challenges for mAb therapy and threaten the protective efficacy of current vaccines.

677 citations


Posted ContentDOI
03 Jun 2021-bioRxiv
TL;DR: For example, EggNOG-mapper v2 as mentioned in this paper is a tool for functional annotation based on precomputed orthology assignments, optimized for vast (meta)genomic data sets.
Abstract: Even though automated functional annotation of genes represents a fundamental step in most genomic and metagenomic workflows, it remains challenging at large scales. Here, we describe a major upgrade to eggNOG-mapper, a tool for functional annotation based on precomputed orthology assignments, now optimized for vast (meta)genomic data sets. Improvements in version 2 include a full update of both the genomes and functional databases underlying eggNOG v5, as well as several efficiency enhancements and new features. Most notably, eggNOG-mapper v2 now allows: (i) de novo gene prediction from raw contigs, (ii) built-in pairwise orthology prediction, (iii) fast protein domain discovery, and (iv) automated GFF decoration. eggNOG-mapper v2 is available as a standalone tool or as an online service at http://emapperdev.compgenomics.org.

623 citations


Posted ContentDOI
12 Feb 2021-bioRxiv
TL;DR: In this paper, the authors report that B.1.7 is refractory to neutralization by most mAbs to the N-terminal domain (NTD) of spike and relatively resistant to a few mabs to the receptor-binding domain (RBD).
Abstract: The COVID-19 pandemic has ravaged the globe, and its causative agent, SARS-CoV-2, continues to rage. Prospects of ending this pandemic rest on the development of effective interventions. Single and combination monoclonal antibody (mAb) therapeutics have received emergency use authorization1-3, with more in the pipeline4-7. Furthermore, multiple vaccine constructs have shown promise8, including two with ~95% protective efficacy against COVID-199,10. However, these interventions were directed toward the initial SARS-CoV-2 that emerged in 2019. The recent emergence of new SARS-CoV-2 variants B.1.1.7 in the UK11 and B.1.351 in South Africa12 is of concern because of their purported ease of transmission and extensive mutations in the spike protein. We now report that B.1.1.7 is refractory to neutralization by most mAbs to the N-terminal domain (NTD) of spike and relatively resistant to a few mAbs to the receptor-binding domain (RBD). It is not more resistant to convalescent plasma or vaccinee sera. Findings on B.1.351 are more worrisome in that this variant is not only refractory to neutralization by most NTD mAbs but also by multiple individual mAbs to the receptor-binding motif on RBD, largely due to an E484K mutation. Moreover, B.1.351 is markedly more resistant to neutralization by convalescent plasma (9.4 fold) and vaccinee sera (10.3-12.4 fold). B.1.351 and emergent variants13,14 with similar spike mutations present new challenges for mAb therapy and threaten the protective efficacy of current vaccines.

551 citations


Posted ContentDOI
25 Jan 2021-bioRxiv
TL;DR: In this paper, the authors assessed the neutralizing capacity of sera from human subjects or non-human primates (NHPs) that received mRNA-1273 vaccination, using two orthogonal VSV and lentivirus PsVN assays expressing spike variants of 20E (EU1), 20A.7, and B.1.351.
Abstract: Severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) is the causative infection of a global pandemic that has led to more than 2 million deaths worldwide. The Moderna mRNA-1273 vaccine has demonstrated ~94% efficacy in a Phase 3 study and has been approved under Emergency Use Authorization. The emergence of SARS-CoV-2 variants with mutations in the spike protein, most recently circulating isolates from the United Kingdom (B.1.1.7) and Republic of South Africa (B.1.351), has led to lower neutralization from convalescent serum by pseudovirus neutralization (PsVN) assays and resistance to certain monoclonal antibodies. Here, using two orthogonal VSV and lentivirus PsVN assays expressing spike variants of 20E (EU1), 20A.EU2, D614G-N439, mink cluster 5, B.1.1.7, and B.1.351 variants, we assessed the neutralizing capacity of sera from human subjects or non-human primates (NHPs) that received mRNA-1273. No significant impact on neutralization against the B.1.1.7 variant was detected in either case, however reduced neutralization was measured against the mutations present in B.1.351. Geometric mean titer (GMT) of human sera from clinical trial participants in VSV PsVN assay using D614G spike was 1/1852. VSV pseudoviruses with spike containing K417N-E484K-N501Y-D614G and full B.1.351 mutations resulted in 2.7 and 6.4-fold GMT reduction, respectively, when compared to the D614G VSV pseudovirus. Importantly, the VSV PsVN GMT of these human sera to the full B.1.351 spike variant was still 1/290, with all evaluated sera able to fully neutralize. Similarly, sera from NHPs immunized with 30 or 100μg of mRNA-1273 had VSV PsVN GMTs of ~ 1/323 or 1/404, respectively, against the full B.1.351 spike variant with a ~ 5 to 10-fold reduction compared to D614G. Individual mutations that are characteristic of the B.1.1.7 and B.1.351 variants had a similar impact on neutralization when tested in VSV or in lentivirus PsVN assays. Despite the observed decreases, the GMT of VSV PsVN titers in human vaccinee sera against the B.1.351 variant remained at ~1/300. Taken together these data demonstrate reduced but still significant neutralization against the full B.1.351 variant following mRNA-1273 vaccination.

374 citations


Posted ContentDOI
04 Jan 2021-bioRxiv
TL;DR: The memory B cell response to SARS-CoV-2 evolves between 1.3 and 6.2 months after infection in a manner that is consistent with antigen persistence, and the antibodies they express have greater somatic hypermutation, increased potency and resistance to RBD mutations, indicative of continued evolution of the humoral response.
Abstract: Severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) has infected 78 million individuals and is responsible for over 1.7 million deaths to date. Infection is associated with development of variable levels of antibodies with neutralizing activity that can protect against infection in animal models. Antibody levels decrease with time, but the nature and quality of the memory B cells that would be called upon to produce antibodies upon re-infection has not been examined. Here we report on the humoral memory response in a cohort of 87 individuals assessed at 1.3 and 6.2 months after infection. We find that IgM, and IgG anti-SARS-CoV-2 spike protein receptor binding domain (RBD) antibody titers decrease significantly with IgA being less affected. Concurrently, neutralizing activity in plasma decreases by five-fold in pseudotype virus assays. In contrast, the number of RBD-specific memory B cells is unchanged. Memory B cells display clonal turnover after 6.2 months, and the antibodies they express have greater somatic hypermutation, increased potency and resistance to RBD mutations, indicative of continued evolution of the humoral response. Analysis of intestinal biopsies obtained from asymptomatic individuals 4 months after coronavirus disease-2019 (COVID-19) onset, using immunofluorescence, or polymerase chain reaction, revealed persistence of SARS-CoV-2 nucleic acids and immunoreactivity in the small bowel of 7 out of 14 volunteers. We conclude that the memory B cell response to SARS-CoV-2 evolves between 1.3 and 6.2 months after infection in a manner that is consistent with antigen persistence.

362 citations


Posted ContentDOI
19 Jan 2021-bioRxiv
TL;DR: In this paper, the antibody and memory B cell responses in a cohort of 20 volunteers who received either the Moderna (mRNA-1273) or Pfizer-BioNTech (BNT162b2) vaccines were reported.
Abstract: To date severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) has infected over 100 million individuals resulting in over two million deaths. Many vaccines are being deployed to prevent coronavirus disease 2019 (COVID-19) including two novel mRNA-based vaccines 1,2 . These vaccines elicit neutralizing antibodies and appear to be safe and effective, but the precise nature of the elicited antibodies is not known 3-6 . Here we report on the antibody and memory B cell responses in a cohort of 20 volunteers who received either the Moderna (mRNA-1273) or Pfizer-BioNTech (BNT162b2) vaccines. Consistent with prior reports, 8 weeks after the second vaccine injection volunteers showed high levels of IgM, and IgG anti-SARS-CoV-2 spike protein (S) and receptor binding domain (RBD) binding titers 3,5,6 . Moreover, the plasma neutralizing activity, and the relative numbers of RBD-specific memory B cells were equivalent to individuals who recovered from natural infection 7,8 . However, activity against SARS-CoV-2 variants encoding E484K or N501Y or the K417N:E484K:N501Y combination was reduced by a small but significant margin. Consistent with these findings, vaccine-elicited monoclonal antibodies (mAbs) potently neutralize SARS-CoV-2, targeting a number of different RBD epitopes in common with mAbs isolated from infected donors. Structural analyses of mAbs complexed with S trimer suggest that vaccine- and virus-encoded S adopts similar conformations to induce equivalent anti-RBD antibodies. However, neutralization by 14 of the 17 most potent mAbs tested was reduced or abolished by either K417N, or E484K, or N501Y mutations. Notably, the same mutations were selected when recombinant vesicular stomatitis virus (rVSV)/SARS-CoV-2 S was cultured in the presence of the vaccine elicited mAbs. Taken together the results suggest that the monoclonal antibodies in clinical use should be tested against newly arising variants, and that mRNA vaccines may need to be updated periodically to avoid potential loss of clinical efficacy.

288 citations


Posted ContentDOI
19 Jan 2021-bioRxiv
TL;DR: SARS-CoV-2 501Y.V2 (B.1.351), a lineage of coronavirus causing COVID-19, contains substitutions in two immunodominant domains of the spike protein this article.
Abstract: SARS-CoV-2 501Y.V2 (B.1.351), a novel lineage of coronavirus causing COVID-19, contains substitutions in two immunodominant domains of the spike protein. Here, we show that pseudovirus expressing 501Y.V2 spike protein completely escapes three classes of therapeutically relevant antibodies. This pseudovirus also exhibits substantial to complete escape from neutralization, but not binding, by convalescent plasma. These data highlight the prospect of reinfection with antigenically distinct variants and foreshadows reduced efficacy of spike-based vaccines.

269 citations


Posted ContentDOI
24 Apr 2021-bioRxiv
TL;DR: In this paper, a whole genome sequencing and analysis of spike protein mutations using sequence and structural approaches was undertaken to identify possible new variants and gauge the fitness of current circulating strains of SARS-CoV-2.
Abstract: As the global severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic expands, genomic epidemiology and whole genome sequencing are being constantly used to investigate its transmissions and evolution. In the backdrop of the global emergence of “variants of concern” (VOCs) during December 2020 and an upsurge in a state in the western part of India since January 2021, whole genome sequencing and analysis of spike protein mutations using sequence and structural approaches was undertaken to identify possible new variants and gauge the fitness of current circulating strains. Phylogenetic analysis revealed that the predominant clade in circulation was a distinct newly identified lineage B.1.617 possessing common signature mutations D111D, G142D, L452R, E484Q, D614G and P681R, in the spike protein including within the receptor binding domain (RBD). Of these, the mutations at residue positions 452, 484 and 681 have been reported in other globally circulating lineages. The structural analysis of RBD mutations L452R and E484Q along with P681R in the furin cleavage site, may possibly result in increased ACE2 binding and rate of S1-S2 cleavage resulting in better transmissibility. The same two RBD mutations indicated decreased binding to selected monoclonal antibodies (mAbs) and may affect their neutralization potential. Experimental validation is warranted for accessing both ACE2 binding and the effectiveness of commonly elicited neutralizing mAbs for the strains of lineage B.1.617. The emergence of such local variants through the accumulation of convergent mutations during the COVID-19 second wave needs to be further investigated for their public health impact in the rest of the country and its possibility of becoming a VOC.

222 citations


Posted ContentDOI
27 May 2021-bioRxiv
TL;DR: The SARS-CoV-2 B.1.617 lineage emerged in October 2020 in India1-6.7.1 and has since then become dominant in some indian regions and further spread to many countries.
Abstract: The SARS-CoV-2 B.1.617 lineage emerged in October 2020 in India1-6. It has since then become dominant in some indian regions and further spread to many countries. The lineage includes three main subtypes (B1.617.1, B.1617.2 and B.1.617.3), which harbour diverse Spike mutations in the N-terminal domain (NTD) and the receptor binding domain (RBD) which may increase their immune evasion potential. B.1.617.2 is believed to spread faster than the other versions. Here, we isolated infectious B.1.617.2 from a traveller returning from India. We examined its sensitivity to monoclonal antibodies (mAbs) and to antibodies present in sera from COVID-19 convalescent individuals or vaccine recipients, in comparison to other viral lineages. B.1.617.2 was resistant to neutralization by some anti-NTD and anti-RBD mAbs, including Bamlanivimab, which were impaired in binding to the B.1.617.2 Spike. Sera from convalescent patients collected up to 12 months post symptoms and from Pfizer Comirnaty vaccine recipients were 3 to 6 fold less potent against B.1.617.2, relative to B.1.1.7. Sera from individuals having received one dose of AstraZeneca Vaxzevria barely inhibited B.1.617.2. Thus, B.1.617.2 spread is associated with an escape to antibodies targeting non-RBD and RBD Spike epitopes.

200 citations


Posted ContentDOI
30 Jun 2021-bioRxiv
TL;DR: The CellProfiler 4 as mentioned in this paper is a new version of this software which has been ported to the Python 3 language based on user feedback, which has made several user interface refinements to improve the usability of the software.
Abstract: CellProfiler is a free, open source image analysis program which enables researchers to generate modular pipelines with which to process microscopy images into interpretable measurements. Here we describe CellProfiler 4, a new version of this software which has been ported to the Python 3 language. Based on user feedback, we have made several user interface refinements to improve the usability of the software. We introduced new modules to expand the capabilities of the software. We also evaluated performance and made targeted optimisations to reduce the time and cost associated with running common large-scale analysis pipelines. This release will ensure that researchers will have continued access to CellProfilers powerful computational tools in the coming years.

Posted ContentDOI
13 Aug 2021-bioRxiv
TL;DR: In this paper, the Delta spike mutation P681R plays a key role in the Alpha-to-Delta variant replacement and the Delta SARS-CoV-2 Delta variant has rapidly replaced the Alpha variant around the world.
Abstract: SARS-CoV-2 Delta variant has rapidly replaced the Alpha variant around the world. The mechanism that drives this global replacement has not been defined. Here we report that Delta spike mutation P681R plays a key role in the Alpha-to-Delta variant replacement. In a replication competition assay, Delta SARS-CoV-2 efficiently outcompeted the Alpha variant in human lung epithelial cells and primary human airway tissues. Delta SARS-CoV-2 bearing the Alpha-spike glycoprotein replicated less efficiently than the wild-type Delta variant, suggesting the importance of Delta spike in enhancing viral replication. The Delta spike has accumulated mutation P681R located at a furin cleavage site that separates the spike 1 (S1) and S2 subunits. Reverting the P681R mutation to wild-type P681 significantly reduced the replication of Delta variant, to a level lower than the Alpha variant. Mechanistically, the Delta P681R mutation enhanced the cleavage of the full-length spike to S1 and S2, leading to increased infection via cell surface entry. In contrast, the Alpha spike also has a mutation at the same amino acid (P681H), but the spike cleavage from purified Alpha virions was reduced compared to the Delta spike. Collectively, our results indicate P681R as a key mutation in enhancing Delta variant replication via increased S1/S2 cleavage. Spike mutations that potentially affect furin cleavage efficiency must be closely monitored for future variant surveillance.

Posted ContentDOI
15 Feb 2021-bioRxiv
TL;DR: This article introduced a protein language model which takes as input a set of sequences in the form of a multiple sequence alignment and interleaves row and column attention across the input sequences and is trained with a variant of the masked language modeling objective across many protein families.
Abstract: Unsupervised protein language models trained across millions of diverse sequences learn structure and function of proteins. Protein language models studied to date have been trained to perform inference from individual sequences. The longstanding approach in computational biology has been to make inferences from a family of evolutionarily related sequences by fitting a model to each family independently. In this work we combine the two paradigms. We introduce a protein language model which takes as input a set of sequences in the form of a multiple sequence alignment. The model interleaves row and column attention across the input sequences and is trained with a variant of the masked language modeling objective across many protein families. The performance of the model surpasses current state-of-the-art unsupervised structure learning methods by a wide margin, with far greater parameter efficiency than prior state-of-the-art protein language models.

Posted ContentDOI
10 Mar 2021-bioRxiv
TL;DR: In this article, dual action monoclonal antibodies (mAbs) targeting the spike glycoprotein of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) were derived from a parent antibody (S309) isolated from memory B cells of a 2003 SARS CoV survivor.
Abstract: VIR-7831 and VIR-7832 are dual action monoclonal antibodies (mAbs) targeting the spike glycoprotein of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). VIR-7831 and VIR-7832 were derived from a parent antibody (S309) isolated from memory B cells of a 2003 severe acute respiratory syndrome coronavirus (SARS-CoV) survivor. Both mAbs contain an “LS” mutation in the Fc region to prolong serum half-life and potentially enhance distribution to the respiratory mucosa. In addition, VIR-7832 encodes an Fc GAALIE mutation that has been shown previously to evoke CD8+ T-cells in the context of an in vivo viral respiratory infection. VIR-7831 and VIR-7832 potently neutralize wild-type and variant authentic virus in vitro as well as variant pseudotyped viruses. In addition, they retain activity against monoclonal antibody resistance mutations conferring reduced susceptibility to currently authorized mAbs. The VIR-7831/VIR-7832 epitope does not overlap with mutational sites in current variants of concern and continues to be highly conserved among circulating sequences consistent with the high barrier to resistance observed in vitro. Furthermore, both mAbs can recruit effector mechanisms in vitro that may contribute to clinical efficacy via elimination of infected host cells. In vitro studies with these mAbs demonstrated no enhancement of infection. In a Syrian Golden hamster proof-of concept wildtype SARS-CoV-2 infection model, animals treated with VIR-7831 had less weight loss, and significantly decreased total viral load and infectious virus levels in the lung compared to a control mAb. Taken together, these data indicate that VIR-7831 and VIR-7832 are promising new agents in the fight against COVID-19.

Posted ContentDOI
13 Jan 2021-bioRxiv
TL;DR: In this article, a simulation of the spike receptor binding domain (S RBD) and its binding to human angiotensin-converting enzyme 2 (hACE2) was used to predict alterations in molecular interactions resulting from the presence of the E484K, K417N, and N501Y variants found in the South African 501Y.1.V2 strain.
Abstract: Rapidly spreading SARS-CoV-2 variants present not only an increased threat to human health due to the confirmed greater transmissibility of several of these new strains but, due to conformational changes induced by the mutations, may render first-wave SARS-CoV-2 convalescent sera, vaccine-induced antibodies, or recombinant neutralizing antibodies (nAbs) ineffective. To be able to assess the risk of viral escape from neutralization by first-wave antibodies, we leveraged our capability for Molecular Dynamic (MD) simulation of the spike receptor binding domain (S RBD) and its binding to human angiotensin-converting enzyme 2 (hACE2) to predict alterations in molecular interactions resulting from the presence of the E484K, K417N, and N501Y variants found in the South African 501Y.V2 strain – alone and in combination. We report here the combination of E484K, K417N and N501Y results in the highest degree of conformational alterations of S RBD when bound to hACE2, compared to either E484K or N501Y alone. Both E484K and N501Y increase affinity of S RBD for hACE2 and E484K in particular switches the charge on the flexible loop region of RBD which leads to the formation of novel favorable contacts. Enhanced affinity of S RBD for hACE2 very likely underpins the greater transmissibility conferred by the presence of either E484K or N501Y; while the induction of conformational changes may provide an explanation for evidence that the 501Y.V2 variant, distinguished from the B.1.1.7 UK variant by the presence of E484K, is able to escape neutralization by existing first-wave anti-SARS-CoV-2 antibodies and re-infect COVID-19 convalescent individuals.

Posted ContentDOI
10 Jul 2021-bioRxiv
TL;DR: This paper used zero-shot inference to capture the functional effects of sequence variation, and achieved state-of-the-art performance on protein language models without any supervision from experimental data or additional training.
Abstract: Modeling the effect of sequence variation on function is a fundamental problem for understanding and designing proteins. Since evolution encodes information about function into patterns in protein sequences, unsupervised models of variant effects can be learned from sequence data. The approach to date has been to fit a model to a family of related sequences. The conventional setting is limited, since a new model must be trained for each prediction task. We show that using only zero-shot inference, without any supervision from experimental data or additional training, protein language models capture the functional effects of sequence variation, performing at state-of-the-art.

Posted ContentDOI
04 May 2021-bioRxiv
TL;DR: In this paper, the authors trained two auto-regressive models (Transformer-XL, XLNet) and four auto-encoder models (BERT, Albert, Electra, T5) on data from UniRef and BFD containing up to 393 billion amino acids.
Abstract: Computational biology and bioinformatics provide vast data gold-mines from protein sequences, ideal for Language Models taken from NLP. These LMs reach for new prediction frontiers at low inference costs. Here, we trained two auto-regressive models (Transformer-XL, XLNet) and four auto-encoder models (BERT, Albert, Electra, T5) on data from UniRef and BFD containing up to 393 billion amino acids. The LMs were trained on the Summit supercomputer using 5616 GPUs and TPU Pod up-to 1024 cores. Dimensionality reduction revealed that the raw protein LM-embeddings from unlabeled data captured some biophysical features of protein sequences. We validated the advantage of using the embeddings as exclusive input for several subsequent tasks. The first was a per-residue prediction of protein secondary structure (3-state accuracy Q3=81%-87%); the second were per-protein predictions of protein sub-cellular localization (ten-state accuracy: Q10=81%) and membrane vs. water-soluble (2-state accuracy Q2=91%). For the per-residue predictions the transfer of the most informative embeddings (ProtT5) for the first time outperformed the state-of-the-art without using evolutionary information thereby bypassing expensive database searches. Taken together, the results implied that protein LMs learned some of the grammar of the language of life. To facilitate future work, we released our models at https://github.com/agemagician/ProtTrans.

Posted ContentDOI
02 Mar 2021-bioRxiv
TL;DR: The P.1 trimer as mentioned in this paper adopts a conformation in which one of the receptor-binding domains is in the "up" position, with the functional impact of mutations appearing to arise from local changes instead of global conformational alterations.
Abstract: The relative resistance of SARS-CoV-2 variants B.1.1.7 and B.1.351 to antibody neutralization has been described recently. We now report that another emergent variant from Brazil, P.1, is not only refractory to multiple neutralizing monoclonal antibodies, but also more resistant to neutralization by convalescent plasma (3.4 fold) and vaccinee sera (3.8-4.8 fold). The cryo-electron microscopy structure of a soluble prefusion-stabilized spike reveals the P.1 trimer to adopt exclusively a conformation in which one of the receptor-binding domains is in the "up" position, with the functional impact of mutations appearing to arise from local changes instead of global conformational alterations. The P.1 variant threatens current antibody therapies but less so the protective efficacy of our vaccines.

Posted ContentDOI
14 Jan 2021-bioRxiv
TL;DR: In this paper, the authors describe 41 human monoclonal antibodies derived from memory B cells, which recognize the SARS-CoV-2 S N-terminal domain (NTD) and show that a subset of them neutralize SARS CoV2 ultrapotently.
Abstract: SARS-CoV-2 entry into host cells is orchestrated by the spike (S) glycoprotein that contains an immunodominant receptor-binding domain (RBD) targeted by the largest fraction of neutralizing antibodies (Abs) in COVID-19 patient plasma. Little is known about neutralizing Abs binding to epitopes outside the RBD and their contribution to protection. Here, we describe 41 human monoclonal Abs (mAbs) derived from memory B cells, which recognize the SARS-CoV-2 S N-terminal domain (NTD) and show that a subset of them neutralize SARS-CoV-2 ultrapotently. We define an antigenic map of the SARS-CoV-2 NTD and identify a supersite recognized by all known NTD-specific neutralizing mAbs. These mAbs inhibit cell-to-cell fusion, activate effector functions, and protect Syrian hamsters from SARS-CoV-2 challenge. SARS-CoV-2 variants, including the 501Y.V2 and B.1.1.7 lineages, harbor frequent mutations localized in the NTD supersite suggesting ongoing selective pressure and the importance of NTD-specific neutralizing mAbs to protective immunity.

Posted ContentDOI
30 Apr 2021-bioRxiv
TL;DR: The LY-CoV1404 (also known as bebtelovimab) is a highly potent, neutralizing, SARS CoV-2 spike glycoprotein receptor binding domain (RBD)-specific antibody identified from a convalescent COVID-19 patient sample, obtained approximately 60 days after symptom onset as mentioned in this paper.
Abstract: LY-CoV1404 (also known as bebtelovimab) is a highly potent, neutralizing, SARS-CoV-2 spike glycoprotein receptor binding domain (RBD)-specific antibody identified from a convalescent COVID-19 patient sample, obtained approximately 60 days after symptom onset. In pseudovirus studies, LY-CoV1404 retains potent neutralizing activity against numerous variants including B.1.617.2, B.1.1.7, B.1.351, B.1.427/B.1.429, P.1, and B.1.526, binding to these variants in the presence of their underlying RBD mutations (which include K417N, L452R, E484K, and N501Y). LY-CoV1404 also neutralizes multiple isolates of the authentic SARS-CoV-2 virus in two different assays. The RBD positions comprising the LY-CoV1404 epitope are highly conserved, with the exception of N439 and N501; notably the binding and neutralizing activity of LY-CoV1404 is unaffected by the most common mutations at these positions (N439K and N501Y). New variant-resistant treatments such as LY-CoV1404 are desperately needed, given that some of the existing therapeutic antibodies are less effective or ineffective against certain variants and the impact of variants on vaccine efficacy is still poorly understood. The breadth of variant binding, potent neutralizing activity and the relatively conserved epitope suggest that LY-CoV1404 is one in a panel of well-characterized, clinically developable antibodies that could be deployed as potentially long-term solutions to address current and emerging variants. In Brief LY-CoV1404 is a potent SARS-CoV-2-binding antibody that neutralizes all known variants of concern and whose epitope is rarely mutated. Highlights LY-CoV1404 potently neutralizes SARS-CoV-2 authentic virus and all known variants of concern including the B.1.617.2 (Delta) variant No loss of potency against currently circulating variants Binding epitope on RBD of SARS-CoV-2 is rarely mutated in GISAID database Breadth of neutralizing activity and potency supports clinical development

Posted ContentDOI
01 Mar 2021-bioRxiv
TL;DR: In this article, the authors performed a comprehensive analysis of SARS-CoV-2-specific CD4+ and CD8+ T cell responses from COVID-19 convalescent subjects recognizing the ancestral strain.
Abstract: The emergence of SARS-CoV-2 variants highlighted the need to better understand adaptive immune responses to this virus. It is important to address whether also CD4+ and CD8+ T cell responses are affected, because of the role they play in disease resolution and modulation of COVID-19 disease severity. Here we performed a comprehensive analysis of SARS-CoV-2-specific CD4+ and CD8+ T cell responses from COVID-19 convalescent subjects recognizing the ancestral strain, compared to variant lineages B.1.1.7, B.1.351, P.1, and CAL.20C as well as recipients of the Moderna (mRNA-1273) or Pfizer/BioNTech (BNT162b2) COVID-19 vaccines. Similarly, we demonstrate that the sequences of the vast majority of SARS-CoV-2 T cell epitopes are not affected by the mutations found in the variants analyzed. Overall, the results demonstrate that CD4+ and CD8+ T cell responses in convalescent COVID-19 subjects or COVID-19 mRNA vaccinees are not substantially affected by mutations found in the SARS-CoV-2 variants.

Posted ContentDOI
30 Sep 2021-bioRxiv
TL;DR: In this article, the authors describe new tools for the processing of electron cryo-microscopy (cryo-EM) images in the fourth major release of the RELION software.
Abstract: 1We describe new tools for the processing of electron cryo-microscopy (cryo-EM) images in the fourth major release of the RELION software. In particular, we introduce VDAM, a Variable-metric gradient Descent algorithm with Adaptive Moments estimation, for image refinement; a convolutional neural network for unsupervised selection of 2D classes; and a flexible framework for the design and execution of multiple jobs in pre-defined workflows. In addition, we present a stand-alone utility called MDCatch that links the execution of jobs within this framework with metadata gathering during microscope data acquisition. The new tools are aimed at providing fast and robust procedures for unsupervised cryo-EM structure determination, with potential applications for on-the-fly processing and the development of flexible, high-throughput structure determination pipelines. We illustrate their potential on twelve publicly available cryo-EM data sets.

Posted ContentDOI
22 Feb 2021-bioRxiv
TL;DR: In this article, all mutations to the SARS-CoV-2 spike receptor binding domain (RBD) that escape binding by a leading monoclonal antibody, LYCoV555, and its cocktail combination with LY CoV016 are completely mapped to the RBD.
Abstract: Monoclonal antibodies and antibody cocktails are a promising therapeutic and prophylaxis for COVID-19. However, ongoing evolution of SARS-CoV-2 can render monoclonal antibodies ineffective. Here we completely map all mutations to the SARS-CoV-2 spike receptor binding domain (RBD) that escape binding by a leading monoclonal antibody, LY-CoV555, and its cocktail combination with LY-CoV016. Individual mutations that escape binding by each antibody are combined in the circulating B.1.351 and P.1 SARS-CoV-2 lineages (E484K escapes LY-CoV555, K417N/T escape LY-CoV016). Additionally, the L452R mutation in the B.1.429 lineage escapes LY-CoV555. Furthermore, we identify single amino acid changes that escape the combined LY-CoV555+LY-CoV016 cocktail. We suggest that future efforts should diversify the epitopes targeted by antibodies and antibody cocktails to make them more resilient to antigenic evolution of SARS-CoV-2.

Posted ContentDOI
30 May 2021-bioRxiv
TL;DR: In this paper, the authors used computational methods to render the three-dimensional structure containing 57,216 cells, hundreds of millions of neurites and 133.7 million synaptic connections from the temporal lobe of the cerebral cortex.
Abstract: We acquired a rapidly preserved human surgical sample from the temporal lobe of the cerebral cortex. We stained a 1 mm3 volume with heavy metals, embedded it in resin, cut more than 5000 slices at ~30 nm and imaged these sections using a high-speed multibeam scanning electron microscope. We used computational methods to render the three-dimensional structure containing 57,216 cells, hundreds of millions of neurites and 133.7 million synaptic connections. The 1.4 petabyte electron microscopy volume, the segmented cells, cell parts, blood vessels, myelin, inhibitory and excitatory synapses, and 104 manually proofread cells are available to peruse online. Many interesting and unusual features were evident in this dataset. Glia outnumbered neurons 2:1 and oligodendrocytes were the most common cell type in the volume. Excitatory spiny neurons comprised 69% of the neuronal population, and excitatory synapses also were in the majority (76%). The synaptic drive onto spiny neurons was biased more strongly toward excitation (70%) than was the case for inhibitory interneurons (48%). Despite incompleteness of the automated segmentation caused by split and merge errors, we could automatically generate (and then validate) connections between most of the excitatory and inhibitory neuron types both within and between layers. In studying these neurons we found that deep layer excitatory cell types can be classified into new subsets, based on structural and connectivity differences, and that chandelier interneurons not only innervate excitatory neuron initial segments as previously described, but also each others initial segments. Furthermore, among the thousands of weak connections established on each neuron, there exist rarer highly powerful axonal inputs that establish multi-synaptic contacts (up to ~20 synapses) with target neurons. Our analysis indicates that these strong inputs are specific, and allow small numbers of axons to have an outsized role in the activity of some of their postsynaptic partners.

Posted ContentDOI
04 May 2021-bioRxiv
TL;DR: The LY-CoV1404 as discussed by the authors antibody is a highly potent, neutralizing, SARS CoV-2 spike glycoprotein receptor binding domain (RBD)-specific antibody identified from a convalescent COVID-19 patient approximately 60 days after symptom onset.
Abstract: Summary LY-CoV1404 is a highly potent, neutralizing, SARS-CoV-2 spike glycoprotein receptor binding domain (RBD)-specific antibody identified from a convalescent COVID-19 patient approximately 60 days after symptom onset. In pseudovirus studies, LY-CoV1404 retains potent neutralizing activity against numerous variants including B.1.1.7, B.1.351, B.1.427/B.1.429, P.1, and B.1.526 and binds to these variants in the presence of their underlying RBD mutations (which include K417N, L452R, E484K, and N501Y). LY-CoV1404 also neutralizes authentic SARS-CoV-2 in two different assays against multiple isolates. The RBD positions comprising the LY-CoV1404 epitope are highly conserved, with the exception of N439 and N501; notably the binding and neutralizing activity of LY-CoV1404 is unaffected by the most common mutations at these positions (N439K and N501Y). The breadth of variant binding, potent neutralizing activity and the relatively conserved epitope suggest that LY-CoV1404 is one in a panel of well-characterized, clinically developable antibodies that could be deployed rapidly to address current and emerging variants. New variant-resistant treatments such as LY-CoV1404 are desperately needed, given that some of the existing therapeutic antibodies are less effective or ineffective against certain variants and the impact of variants on vaccine efficacy is still poorly understood. In Brief LY-CoV1404 is a potent SARS-CoV-2-binding antibody that neutralizes all known variants of concern and whose epitope is rarely mutated. Highlights LY-CoV1404 potently neutralizes SARS-CoV-2 authentic virus and all known variants of concern No loss of potency against current variants Binding epitope on RBD of SARS-CoV-2 is rarely mutated in GISAID database Breadth of neutralizing activity and potency supports clinical development

Posted ContentDOI
04 Jan 2021-bioRxiv
TL;DR: In this article, the authors map how convalescent serum antibodies are impacted by all mutations to the spike9s receptor-binding domain (RBD), the main target of serum neutralizing activity.
Abstract: The evolution of SARS-CoV-2 could impair recognition of the virus by human antibody-mediated immunity. To facilitate prospective surveillance for such evolution, we map how convalescent serum antibodies are impacted by all mutations to the spike9s receptor-binding domain (RBD), the main target of serum neutralizing activity. Binding by polyclonal serum antibodies is affected by mutations in three main epitopes in the RBD, but there is substantial variation in the impact of mutations both among individuals and within the same individual over time. Despite this inter- and intra-person heterogeneity, the mutations that most reduce antibody binding usually occur at just a few sites in the RBD9s receptor binding motif. The most important site is E484, where neutralization by some sera is reduced >10-fold by several mutations, including one in emerging viral lineages in South Africa and Brazil. Going forward, these serum escape maps can inform surveillance of SARS-CoV-2 evolution.

Posted ContentDOI
07 Jan 2021-bioRxiv
TL;DR: The N501 and Y501 variants of SARS-CoV-2 share the spike N501Y substitution, which is of particular concern because it is located in the viral receptor binding site for cell entry and increases binding to the receptor (angiotensin converting enzyme 2).
Abstract: Rapidly spreading variants of SARS-CoV-2 that have arisen in the United Kingdom and South Africa share the spike N501Y substitution, which is of particular concern because it is located in the viral receptor binding site for cell entry and increases binding to the receptor (angiotensin converting enzyme 2). We generated isogenic N501 and Y501 SARS-CoV-2. Sera of 20 participants in a previously reported trial of the mRNA-based COVID-19 vaccine BNT162b2 had equivalent neutralizing titers to the N501 and Y501 viruses.

Posted ContentDOI
17 Feb 2021-bioRxiv
TL;DR: In this paper, the authors used chronic two-photon microscopy to image longitudinally the apical dendritic spines of layer 5 pyramidal neurons in the mouse medial frontal cortex and found that a single dose of psilocybin led to ∼10% increases in spine size and density.
Abstract: Summary Psilocybin is a serotonergic psychedelic with untapped therapeutic potential. There are hints that the use of psychedelics can produce neural adaptations, although the extent and time scale of the impact in a mammalian brain are unknown. In this study, we used chronic two-photon microscopy to image longitudinally the apical dendritic spines of layer 5 pyramidal neurons in the mouse medial frontal cortex. We found that a single dose of psilocybin led to ∼10% increases in spine size and density, driven by an elevated spine formation rate. The structural remodeling occurred quickly within 24 hours and was persistent 1 month later. Psilocybin also ameliorated stress-related behavioral deficit and elevated excitatory neurotransmission. Overall, the results demonstrate that psilocybin-evoked synaptic rewiring in the cortex is fast and enduring, potentially providing a structural trace for long-term integration of experiences and lasting beneficial actions.

Posted ContentDOI
15 Mar 2021-bioRxiv
TL;DR: In this paper, the antigenic and structural impacts of the S protein mutations from four variants, one that was involved in transmission between minks and humans, and three that rapidly spread in human populations and originated in the United Kingdom, Brazil or South Africa.
Abstract: New SARS-CoV-2 variants that have accumulated multiple mutations in the spike (S) glycoprotein enable increased transmission and resistance to neutralizing antibodies. Here, we study the antigenic and structural impacts of the S protein mutations from four variants, one that was involved in transmission between minks and humans, and three that rapidly spread in human populations and originated in the United Kingdom, Brazil or South Africa. All variants either retained or improved binding to the ACE2 receptor. The B.1.1.7 (UK) and B.1.1.28 (Brazil) spike variants showed reduced binding to neutralizing NTD and RBD antibodies, respectively, while the B.1.351 (SA) variant showed reduced binding to both NTD- and RBD-directed antibodies. Cryo-EM structural analyses revealed allosteric effects of the mutations on spike conformations and revealed mechanistic differences that either drive inter-species transmission or promotes viral escape from dominant neutralizing epitopes. HighlightsO_LICryo-EM structures reveal changes in SARS-CoV-2 S protein during inter-species transmission or immune evasion. C_LIO_LIAdaptation to mink resulted in increased ACE2 binding and spike destabilization. C_LIO_LIB.1.1.7 S mutations reveal an intricate balance of stabilizing and destabilizing effects that impact receptor and antibody binding. C_LIO_LIE484K mutation in B.1.351 and B.1.1.28 S proteins drives immune evasion by altering RBD conformation. C_LIO_LIS protein uses different mechanisms to converge upon similar solutions for altering RBD up/down positioning. C_LI

Posted ContentDOI
11 Mar 2021-bioRxiv
TL;DR: In this article, the authors investigated genetic variations in a 414-583 amino acid region of the Spike protein, partially encompassing the ACE2 receptor-binding domain (RBD), across a subset of 570 nasopharyngeal samples isolated between April 2020 and February 2021, from Washington, California, Arizona, Colorado, Minnesota and Illinois.
Abstract: The recent rise in mutational variants of SARS-CoV-2, especially with changes in the Spike protein, is of significant concern due to the potential ability for these mutations to increase viral infectivity, virulence and/or ability to escape protective antibodies. Here, we investigated genetic variations in a 414-583 amino acid region of the Spike protein, partially encompassing the ACE2 receptor-binding domain (RBD), across a subset of 570 nasopharyngeal samples isolated between April 2020 and February 2021, from Washington, California, Arizona, Colorado, Minnesota and Illinois. We found that samples isolated since November have an increased number of amino acid mutations in the region, with L452R being the dominant mutation. This mutation is associated with a recently discovered CAL.20C viral variant from clade 20C, lineage B.1.429, that since November-December 2020 is associated with multiple outbreaks and is undergoing massive expansion across California. In some samples, however, we found a distinct L452R-carrying variant of the virus that, upon detailed analysis of the GISAID database genomes, is also circulating primarily in California, but emerged even more recently. The newly identified variant derives from the clade 20A (lineage B.1.232) and is named CAL.20A. We also found that the SARS-CoV-2 strain that caused the only recorded case of infection in an ape - gorillas in the San Diego Zoo, reported in January 2021 - is CAL.20A. In contrast to CAL.20C that carries two additional to L452R mutations in the Spike protein, L452R is the only mutation found in CAL.20A. According to the phylogenetic analysis, however, emergence of CAL.20C was also specifically triggered by acquisition of the L452R mutation. Further analysis of GISAID-deposited genomes revealed that several independent L452R-carrying lineages have recently emerged across the globe, with over 90% of the isolates reported between December 2020 - February 2021. Taken together, these results indicate that the L452R mutation alone is of significant adaptive value to SARS-CoV-2 and, apparently, the positive selection for this mutation became particularly strong only recently, possibly reflecting viral adaptation to the containment measures or increasing population immunity. While the functional impact of L452R has not yet been extensively evaluated, leucine-452 is positioned in the receptor-binding motif of RBD, in the interface of direct contact with the ACE2 receptor. Its replacement with arginine is predicted to result in both a much stronger binding to the receptor and escape from neutralizing antibodies. If true, this in turn might lead to significantly increased infectivity of the L452R variants, warranting their close surveillance and in-depth functional studies.