scispace - formally typeset
Search or ask a question
Book ChapterDOI

Application of support vector machines in viral biology

TL;DR: This chapter provides lucid and easy to understand details of SVM algorithms along with applications in virology, one such robust tool based rigorously on statistical learning theory.
Abstract: Novel experimental and sequencing techniques have led to an exponential explosion and spiraling of data in viral genomics. To analyse such data, rapidly gain information, and transform this information to knowledge, interdisciplinary approaches involving several different types of expertise are necessary. Machine learning has been in the forefront of providing models with increasing accuracy due to development of newer paradigms with strong fundamental bases. Support Vector Machines (SVM) is one such robust tool, based rigorously on statistical learning theory. SVM provides very high quality and robust solutions to classification and regression problems. Several studies in virology employ high performance tools including SVM for identification of potentially important gene and protein functions. This is mainly due to the highly beneficial aspects of SVM. In this chapter we briefly provide lucid and easy to understand details of SVM algorithms along with applications in virology.

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI
TL;DR: In this paper, the authors used machine learning techniques to characterize and predict the developing spontaneous activity in mouse cortical neurons on microelectrode arrays (MEAs) during the first three weeks in vitro.
Abstract: Synchronization and bursting activity are intrinsic electrophysiological properties of in vivo and in vitro neural networks. During early development, cortical cultures exhibit a wide repertoire of synchronous bursting dynamics whose characterization may help to understand the parameters governing the transition from immature to mature networks. Here we used machine learning techniques to characterize and predict the developing spontaneous activity in mouse cortical neurons on microelectrode arrays (MEAs) during the first three weeks in vitro. Network activity at three stages of early development was defined by 18 electrophysiological features of spikes, bursts, synchrony, and connectivity. The variability of neuronal network activity during early development was investigated by applying k-means and self-organizing map (SOM) clustering analysis to features of bursts and synchrony. These electrophysiological features were predicted at the third week in vitro with high accuracy from those at earlier times using three machine learning models: Multivariate Adaptive Regression Splines, Support Vector Machines, and Random Forest. Our results indicate that initial patterns of electrical activity during the first week in vitro may already predetermine the final development of the neuronal network activity. The methodological approach used here may be applied to explore the biological mechanisms underlying the complex dynamics of spontaneous activity in developing neuronal cultures.

7 citations

Journal ArticleDOI
TL;DR: In this article , the authors demonstrate the possibility of applying surface-enhanced Raman spectroscopy (SERS) combined with machine learning technology to detect and differentiate influenza type A and B viruses in a buffer environment.

2 citations

Posted ContentDOI
26 Jul 2021-bioRxiv
TL;DR: In this paper, the authors use olfactory behaviour data from the nematode C. elegans, which has isogenic populations in a laboratory setting, and employ machine learning approaches for a binary classification task predicting whether or not the worm will be attracted to a given monomolecular odorant.
Abstract: The chemical basis of smell remains an unsolved problem, with ongoing studies mapping perceptual descriptor data from human participants to the chemical structures using computational methods. These approaches are, however, limited by linguistic capabilities and inter-individual differences in participants. We use olfactory behaviour data from the nematode C. elegans, which has isogenic populations in a laboratory setting, and employ machine learning approaches for a binary classification task predicting whether or not the worm will be attracted to a given monomolecular odorant. Among others, we use architectures based on Natural Language Processing methods on the SMILES representation of chemicals for molecular descriptor generation and show that machine learning algorithms trained on the descriptors give robust prediction results. We further show, by data augmentation, that increasing the number of samples increases the accuracy of the models. From this detailed analysis, we are able to achieve accuracies comparable to that in human studies and infer that there exists a non trivial relationship between the features of chemical structures and the nematode9s behaviour.

1 citations

References
More filters
Journal ArticleDOI
TL;DR: It is demonstrated that pocket profiles are nearly independent of the remaining HLA-DR cleft, and this approach has implications for the development of epitope-based vaccines.
Abstract: Generation of tissue-specific and promiscuous HLA ligand databases using DNA microarrays and virtual HLA class II matrices

805 citations

Journal ArticleDOI
TL;DR: This book is an introduction to support vector machines and related kernel methods in supervised learning, whose task is to estimate an input-output functional relationship from a training set of examples.
Abstract: This book is an introduction to support vector machines and related kernel methods in supervised learning, whose task is to estimate an input-output functional relationship from a training set of examples. A learning problem is referred to as classification if its output take discrete values in a set of possible categories and regression if it has continuous real-valued output.

689 citations

Journal ArticleDOI
TL;DR: The use of high-throughput sequencing during an outbreak of disease facilitated the identification of a new arenavirus transmitted through solid-organ transplantation, related to lymphocytic choriomeningitis viruses.
Abstract: High-throughput sequencing yielded 103,632 sequences, of which 14 represented an Old World arenavirus. Additional sequence analysis showed that this new arenavirus was related to lymphocytic choriomeningitis viruses. Specific PCR assays based on a unique sequence confirmed the presence of the virus in the kidneys, liver, blood, and cerebrospinal fluid of the recipients. Immunohistochemical analysis revealed arenavirus antigen in the liver and kidney transplants in the recipients. IgM and IgG antiviral antibodies were detected in the serum of the donor. Seroconversion was evident in serum specimens obtained from one recipient at two time points.

642 citations

Journal ArticleDOI
TL;DR: BCPred, a novel method for predicting linear B‐cell epitopes using the subsequence kernel, is proposed and it is shown that the predictive performance of BCPred outperforms 11 SVM‐based classifiers developed and evaluated in the authors' experiments as well as the implementation of AAP (AUC = 0.7).
Abstract: The identification and characterization of B-cell epitopes play an important role in vaccine design, immunodiagnostic tests, and antibody production Therefore, computational tools for reliably predicting linear B-cell epitopes are highly desirable We evaluated Support Vector Machine (SVM) classifiers trained utilizing five different kernel methods using fivefold cross-validation on a homology-reduced data set of 701 linear B-cell epitopes, extracted from Bcipep database, and 701 non-epitopes, randomly extracted from SwissProt sequences Based on the results of our computational experiments, we propose BCPred, a novel method for predicting linear B-cell epitopes using the subsequence kernel We show that the predictive performance of BCPred (AUC = 0758) outperforms 11 SVM-based classifiers developed and evaluated in our experiments as well as our implementation of AAP (AUC = 07), a recently proposed method for predicting linear B-cell epitopes using amino acid pair antigenicity Furthermore, we compared BCPred with AAP and ABCPred, a method that uses recurrent neural networks, using two data sets of unique B-cell epitopes that had been previously used to evaluate ABCPred Analysis of the data sets used and the results of this comparison show that conclusions about the relative performance of different B-cell epitope prediction methods drawn on the basis of experiments using data sets of unique B-cell epitopes are likely to yield overly optimistic estimates of performance of evaluated methods This argues for the use of carefully homology-reduced data sets in comparing B-cell epitope prediction methods to avoid misleading conclusions about how different methods compare to each other Our homology-reduced data set and implementations of BCPred as well as the APP method are publicly available through our web-based server, BCPREDS, at: http://ailabcsiastateedu/bcpreds/

623 citations

Journal ArticleDOI
TL;DR: An improved rolling circle amplification (RCA) technique to isolate circular DNA viral genomes from human skin swabs identified two previously unknown polyomavirus species that are named human polyomvirus-6 (HPyV6) and HPyV7 and indicate that infection or coinfection with these three skin-tropicpolyomaviruses is very common.

538 citations

Trending Questions (1)
What are the potential applications of machine learning in virology?

Machine learning, specifically Support Vector Machines (SVM), is used in virology for tasks such as epitope prediction, virus-host interaction detection, and influenza host prediction.