scispace - formally typeset
Open accessJournal ArticleDOI: 10.1038/S41598-021-84637-4

Potential neutralizing antibodies discovered for novel corona virus using machine learning.

04 Mar 2021-Scientific Reports (Nature Publishing Group)-Vol. 11, Iss: 1, pp 5261-5261
Abstract: The fast and untraceable virus mutations take lives of thousands of people before the immune system can produce the inhibitory antibody. The recent outbreak of COVID-19 infected and killed thousands of people in the world. Rapid methods in finding peptides or antibody sequences that can inhibit the viral epitopes of SARS-CoV-2 will save the life of thousands. To predict neutralizing antibodies for SARS-CoV-2 in a high-throughput manner, in this paper, we use different machine learning (ML) model to predict the possible inhibitory synthetic antibodies for SARS-CoV-2. We collected 1933 virus-antibody sequences and their clinical patient neutralization response and trained an ML model to predict the antibody response. Using graph featurization with variety of ML methods, like XGBoost, Random Forest, Multilayered Perceptron, Support Vector Machine and Logistic Regression, we screened thousands of hypothetical antibody sequences and found nine stable antibodies that potentially inhibit SARS-CoV-2. We combined bioinformatics, structural biology, and Molecular Dynamics (MD) simulations to verify the stability of the candidate antibodies that can inhibit SARS-CoV-2.

... read more


25 results found

Open accessJournal ArticleDOI: 10.1109/ACCESS.2020.3009328
01 Jan 2020-IEEE Access
Abstract: The very first infected novel coronavirus case (COVID-19) was found in Hubei, China in Dec. 2019. The COVID-19 pandemic has spread over 214 countries and areas in the world, and has significantly affected every aspect of our daily lives. At the time of writing this article, the numbers of infected cases and deaths still increase significantly and have no sign of a well-controlled situation, e.g., as of 13 July 2020, from a total number of around 13.1 million positive cases, 571,527 deaths were reported in the world. Motivated by recent advances and applications of artificial intelligence (AI) and big data in various areas, this paper aims at emphasizing their importance in responding to the COVID-19 outbreak and preventing the severe effects of the COVID-19 pandemic. We firstly present an overview of AI and big data, then identify the applications aimed at fighting against COVID-19, next highlight challenges and issues associated with state-of-the-art solutions, and finally come up with recommendations for the communications to effectively control the COVID-19 situation. It is expected that this paper provides researchers and communities with new insights into the ways AI and big data improve the COVID-19 situation, and drives further studies in stopping the COVID-19 outbreak.

... read more

123 Citations

Open accessJournal ArticleDOI: 10.1109/ACCESS.2021.3093633
01 Jan 2021-IEEE Access
Abstract: The beginning of 2020 has seen the emergence of coronavirus outbreak caused by a novel virus called SARS-CoV-2. The sudden explosion and uncontrolled worldwide spread of COVID-19 show the limitations of existing healthcare systems in timely handling public health emergencies. In such contexts, innovative technologies such as blockchain and Artificial Intelligence (AI) have emerged as promising solutions for fighting coronavirus epidemic. In particular, blockchain can combat pandemics by enabling early detection of outbreaks, ensuring the ordering of medical data, and ensuring reliable medical supply chain during the outbreak tracing. Moreover, AI provides intelligent solutions for identifying symptoms caused by coronavirus for treatments and supporting drug manufacturing. Therefore, we present an extensive survey on the use of blockchain and AI for combating COVID-19 epidemics. First, we introduce a new conceptual architecture which integrates blockchain and AI for fighting COVID-19. Then, we survey the latest research efforts on the use of blockchain and AI for fighting COVID-19 in various applications. The newly emerging projects and use cases enabled by these technologies to deal with coronavirus pandemic are also presented. A case study is also provided using federated AI for COVID-19 detection. Finally, we point out challenges and future directions that motivate more research efforts to deal with future coronavirus-like epidemics.

... read more

21 Citations

Open accessJournal ArticleDOI: 10.1109/ACCESS.2021.3068276
Yudi Dong1, Yu-Dong Yao1Institutions (1)
23 Mar 2021-IEEE Access
Abstract: As a result of the worldwide transmission of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), coronavirus disease 2019 (COVID-19) has evolved into an unprecedented pandemic. Currently, with unavailable pharmaceutical treatments and low vaccination rates, this novel coronavirus results in a great impact on public health, human society, and global economy, which is likely to last for many years. One of the lessons learned from the COVID-19 pandemic is that a long-term system with non-pharmaceutical interventions for preventing and controlling new infectious diseases is desirable to be implemented. Internet of things (IoT) platform is preferred to be utilized to achieve this goal, due to its ubiquitous sensing ability and seamless connectivity. IoT technology is changing our lives through smart healthcare, smart home, and smart city, which aims to build a more convenient and intelligent community. This paper presents how the IoT could be incorporated into the epidemic prevention and control system. Specifically, we demonstrate a potential fog-cloud combined IoT platform that can be used in the systematic and intelligent COVID-19 prevention and control, which involves five interventions including COVID-19 Symptom Diagnosis, Quarantine Monitoring, Contact Tracing & Social Distancing, COVID-19 Outbreak Forecasting, and SARS-CoV-2 Mutation Tracking. We investigate and review the state-of-the-art literatures of these five interventions to present the capabilities of IoT in countering against the current COVID-19 pandemic or future infectious disease epidemics.

... read more

Topics: Smart city (54%)

20 Citations

Open accessJournal ArticleDOI: 10.1613/JAIR.1.12162
Abstract: COVID-19, the disease caused by the SARS-CoV-2 virus, has been declared a pandemic by the World Health Organization, with over 2.5 million confirmed cases as of April 23, 2020. In this review, we present an overview of recent studies using Machine Learning and, more broadly, Artificial Intelligence, to tackle many aspects of the COVID-19 crisis at different scales including molecular, clinical, and societal applications. We also review datasets, tools, and resources needed to facilitate AI research. Finally, we discuss strategic considerations related to the operational implementation of projects, multidisciplinary partnerships, and open science. We highlight the need for international cooperation to maximize the potential of AI in this and future pandemics.

... read more

18 Citations

Open accessPosted Content
19 Feb 2021-arXiv: Learning
Abstract: Molecular machine learning bears promise for efficient molecule property prediction and drug discovery. However, due to the limited labeled data and the giant chemical space, machine learning models trained via supervised learning perform poorly in generalization. This greatly limits the applications of machine learning methods for molecular design and discovery. In this work, we present MolCLR: Molecular Contrastive Learning of Representations via Graph Neural Networks (GNNs), a self-supervised learning framework for large unlabeled molecule datasets. Specifically, we first build a molecular graph, where each node represents an atom and each edge represents a chemical bond. A GNN is then used to encode the molecule graph. We propose three novel molecule graph augmentations: atom masking, bond deletion, and subgraph removal. A contrastive estimator is utilized to maximize the agreement of different graph augmentations from the same molecule. Experiments show that molecule representations learned by MolCLR can be transferred to multiple downstream molecular property prediction tasks. Our method thus achieves state-of-the-art performance on many challenging datasets. We also prove the efficiency of our proposed molecule graph augmentations on supervised molecular classification tasks.

... read more

11 Citations


119 results found

Open accessJournal ArticleDOI: 10.1023/A:1010933404324
Leo Breiman1Institutions (1)
01 Oct 2001-
Abstract: Random forests are a combination of tree predictors such that each tree depends on the values of a random vector sampled independently and with the same distribution for all trees in the forest. The generalization error for forests converges a.s. to a limit as the number of trees in the forest becomes large. The generalization error of a forest of tree classifiers depends on the strength of the individual trees in the forest and the correlation between them. Using a random selection of features to split each node yields error rates that compare favorably to Adaboost (Y. Freund & R. Schapire, Machine Learning: Proceedings of the Thirteenth International conference, aaa, 148–156), but are more robust with respect to noise. Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the splitting. Internal estimates are also used to measure variable importance. These ideas are also applicable to regression.

... read more

Topics: Random forest (63%), Multivariate random variable (57%), Random subspace method (57%) ... show more

58,232 Citations

Open accessJournal ArticleDOI: 10.1093/NAR/28.1.235
Abstract: The Protein Data Bank (PDB; ) is the single worldwide archive of structural data of biological macromolecules. This paper describes the goals of the PDB, the systems in place for data deposition and access, how to obtain further information, and near-term plans for the future development of the resource.

... read more

30,190 Citations

Open accessJournal ArticleDOI: 10.1063/1.464397
Tom Darden1, Darrin M. York1, Lee G. Pedersen1Institutions (1)
Abstract: An N⋅log(N) method for evaluating electrostatic energies and forces of large periodic systems is presented. The method is based on interpolation of the reciprocal space Ewald sums and evaluation of the resulting convolutions using fast Fourier transforms. Timings and accuracies are presented for three large crystalline ionic systems.

... read more

Topics: Ewald summation (69%), P3M (64%), Particle Mesh (51%)

20,639 Citations

Journal ArticleDOI: 10.1038/256495A0
G Köhler1, C. Milstein1Institutions (1)
07 Aug 1975-Nature
Abstract: THE manufacture of predefined specific antibodies by means of permanent tissue culture cell lines is of general interest. There are at present a considerable number of permanent cultures of myeloma cells1,2 and screening procedures have been used to reveal antibody activity in some of them. This, however, is not a satisfactory source of monoclonal antibodies of predefined specificity. We describe here the derivation of a number of tissue culture cell lines which secrete anti-sheep red blood cell (SRBC) antibodies. The cell lines are made by fusion of a mouse myeloma and mouse spleen cells from an immunised donor. To understand the expression and interactions of the Ig chains from the parental lines, fusion experiments between two known mouse myeloma lines were carried out.

... read more

Topics: Hybridoma technology (63%), Monoclonal antibody (57%), Cell culture (55%) ... show more

18,616 Citations

Open accessProceedings ArticleDOI: 10.1145/2939672.2939785
Tianqi Chen1, Carlos Guestrin1Institutions (1)
13 Aug 2016-
Abstract: Tree boosting is a highly effective and widely used machine learning method. In this paper, we describe a scalable end-to-end tree boosting system called XGBoost, which is used widely by data scientists to achieve state-of-the-art results on many machine learning challenges. We propose a novel sparsity-aware algorithm for sparse data and weighted quantile sketch for approximate tree learning. More importantly, we provide insights on cache access patterns, data compression and sharding to build a scalable tree boosting system. By combining these insights, XGBoost scales beyond billions of examples using far fewer resources than existing systems.

... read more

Topics: Incremental decision tree (64%), Gradient boosting (61%), ID3 algorithm (60%) ... show more

10,428 Citations