Application of support vector machines in viral biology

doi:10.1007/978-3-030-29022-1_12

Home
/
Papers
/
Application of support vector machines in viral biology

Book Chapter•DOI•

Application of support vector machines in viral biology

Sonal Modak, Swati Mehta, Deepak Sehgal, Jayaraman Valadi¹•Institutions (1)

Savitribai Phule Pune University¹

23 Nov 2019-pp 361-403

TL;DR: This chapter provides lucid and easy to understand details of SVM algorithms along with applications in virology, one such robust tool based rigorously on statistical learning theory.

read less

Abstract: Novel experimental and sequencing techniques have led to an exponential explosion and spiraling of data in viral genomics. To analyse such data, rapidly gain information, and transform this information to knowledge, interdisciplinary approaches involving several different types of expertise are necessary. Machine learning has been in the forefront of providing models with increasing accuracy due to development of newer paradigms with strong fundamental bases. Support Vector Machines (SVM) is one such robust tool, based rigorously on statistical learning theory. SVM provides very high quality and robust solutions to classification and regression problems. Several studies in virology employ high performance tools including SVM for identification of potentially important gene and protein functions. This is mainly due to the highly beneficial aspects of SVM. In this chapter we briefly provide lucid and easy to understand details of SVM algorithms along with applications in virology.

...read moreread less

Content maybe subject to copyright Report

Citations

PDF

Open Access

More filters

Journal Article•DOI•

Early prediction of developing spontaneous activity in cultured neuronal networks.

[...]

David Cabrera-García¹, David Cabrera-García², Davide Warm³, Davide Warm², Pablo de la Fuente², M. Teresa Fernández-Sánchez², Antonello Novelli², Joaquín Villanueva Balsera² - Show less +4 more•Institutions (3)

Netherlands Institute for Neuroscience¹, University of Oviedo², University of Mainz³

14 Oct 2021-Scientific Reports

TL;DR: In this paper, the authors used machine learning techniques to characterize and predict the developing spontaneous activity in mouse cortical neurons on microelectrode arrays (MEAs) during the first three weeks in vitro.

...read moreread less

Abstract: Synchronization and bursting activity are intrinsic electrophysiological properties of in vivo and in vitro neural networks. During early development, cortical cultures exhibit a wide repertoire of synchronous bursting dynamics whose characterization may help to understand the parameters governing the transition from immature to mature networks. Here we used machine learning techniques to characterize and predict the developing spontaneous activity in mouse cortical neurons on microelectrode arrays (MEAs) during the first three weeks in vitro. Network activity at three stages of early development was defined by 18 electrophysiological features of spikes, bursts, synchrony, and connectivity. The variability of neuronal network activity during early development was investigated by applying k-means and self-organizing map (SOM) clustering analysis to features of bursts and synchrony. These electrophysiological features were predicted at the third week in vitro with high accuracy from those at earlier times using three machine learning models: Multivariate Adaptive Regression Splines, Support Vector Machines, and Random Forest. Our results indicate that initial patterns of electrical activity during the first week in vitro may already predetermine the final development of the neuronal network activity. The methodological approach used here may be applied to explore the biological mechanisms underlying the complex dynamics of spontaneous activity in developing neuronal cultures.

...read moreread less

7 citations

Journal Article•DOI•

Detection of A and B Influenza Viruses by Surface-Enhanced Raman Scattering Spectroscopy and Machine Learning

[...]

A.T. Tabarov, Vladimir Vitkin, O.V. Andreeva, A.A. Shemanaeva, E.Y. Popov, A.A. Dobroslavin, Valeria Kurikova, O.B. Kuznetsova, Konstantin Grigorenko, Ivan Tzibizov, Anton Kovalev, Vitaliy Savchenko, A.S. Zheltuhina, Andrey Gorshkov, Daria Danilenko - Show less +11 more

23 Nov 2022-Biosensors

TL;DR: In this article , the authors demonstrate the possibility of applying surface-enhanced Raman spectroscopy (SERS) combined with machine learning technology to detect and differentiate influenza type A and B viruses in a buffer environment.

...read moreread less

2 citations

Posted Content•DOI•

A machine learning based analysis to probe the relationship between odorant structure and olfactory behaviour in C. elegans

[...]

Aayushi Vishnoi¹, Rati Sharma¹•Institutions (1)

Indian Institute of Science¹

26 Jul 2021-bioRxiv

TL;DR: In this paper, the authors use olfactory behaviour data from the nematode C. elegans, which has isogenic populations in a laboratory setting, and employ machine learning approaches for a binary classification task predicting whether or not the worm will be attracted to a given monomolecular odorant.

...read moreread less

Abstract: The chemical basis of smell remains an unsolved problem, with ongoing studies mapping perceptual descriptor data from human participants to the chemical structures using computational methods. These approaches are, however, limited by linguistic capabilities and inter-individual differences in participants. We use olfactory behaviour data from the nematode C. elegans, which has isogenic populations in a laboratory setting, and employ machine learning approaches for a binary classification task predicting whether or not the worm will be attracted to a given monomolecular odorant. Among others, we use architectures based on Natural Language Processing methods on the SMILES representation of chemicals for molecular descriptor generation and show that machine learning algorithms trained on the descriptors give robust prediction results. We further show, by data augmentation, that increasing the number of samples increases the accuracy of the models. From this detailed analysis, we are able to achieve accuracies comparable to that in human studies and infer that there exists a non trivial relationship between the features of chemical structures and the nematode9s behaviour.

...read moreread less

1 citations

References

PDF

Open Access

More filters

Journal Article•DOI•

Generation of tissue-specific and promiscuous HLA ligand databases using DNA microarrays and virtual HLA class II matrices

[...]

Tiziana Sturniolo¹, Elisa Bono¹, Jiayi Ding¹, Laura Raddrizzani¹, Oezlem Tuereci², Ugur Sahin², Michael Braxenthaler¹, Fabio Gallazzi¹, Maria Pia Protti, Francesco Sinigaglia¹, Juergen Hammer¹ - Show less +7 more•Institutions (2)

Hoffmann-La Roche¹, Saarland University²

01 Jun 1999-Nature Biotechnology

TL;DR: It is demonstrated that pocket profiles are nearly independent of the remaining HLA-DR cleft, and this approach has implications for the development of epitope-based vaccines.

...read moreread less

Abstract: Generation of tissue-specific and promiscuous HLA ligand databases using DNA microarrays and virtual HLA class II matrices

...read moreread less

805 citations

Journal Article•DOI•

An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods

[...]

Tong Zhang

15 Jun 2001-Ai Magazine

TL;DR: This book is an introduction to support vector machines and related kernel methods in supervised learning, whose task is to estimate an input-output functional relationship from a training set of examples.

...read moreread less

Abstract: This book is an introduction to support vector machines and related kernel methods in supervised learning, whose task is to estimate an input-output functional relationship from a training set of examples. A learning problem is referred to as classification if its output take discrete values in a set of possible categories and regression if it has continuous real-valued output.

...read moreread less

689 citations

Journal Article•DOI•

A new arenavirus in a cluster of fatal transplant-associated diseases.

[...]

Gustavo Palacios¹, Julian Druce, Lei Du, Thomas Tran, Chris Birch, Thomas Briese, Sean Conlan, Phuong-Lan Quan, Jeffrey Hui, John C. Marshall, Jan Fredrik Simons, Michael Egholm, Christopher D. Paddock, Wun-Ju Shieh, Cynthia S. Goldsmith, Sherif R. Zaki, Mike Catton, W. Ian Lipkin - Show less +14 more•Institutions (1)

Columbia University¹

06 Mar 2008-The New England Journal of Medicine

TL;DR: The use of high-throughput sequencing during an outbreak of disease facilitated the identification of a new arenavirus transmitted through solid-organ transplantation, related to lymphocytic choriomeningitis viruses.

...read moreread less

Abstract: High-throughput sequencing yielded 103,632 sequences, of which 14 represented an Old World arenavirus. Additional sequence analysis showed that this new arenavirus was related to lymphocytic choriomeningitis viruses. Specific PCR assays based on a unique sequence confirmed the presence of the virus in the kidneys, liver, blood, and cerebrospinal fluid of the recipients. Immunohistochemical analysis revealed arenavirus antigen in the liver and kidney transplants in the recipients. IgM and IgG antiviral antibodies were detected in the serum of the donor. Seroconversion was evident in serum specimens obtained from one recipient at two time points.

...read moreread less

642 citations

Journal Article•DOI•

Predicting linear B-cell epitopes using string kernels.

[...]

Yasser EL-Manzalawy, Drena Dobbs¹, Vasant Honavar•Institutions (1)

Iowa State University¹

01 Jul 2008-Journal of Molecular Recognition

TL;DR: BCPred, a novel method for predicting linear B‐cell epitopes using the subsequence kernel, is proposed and it is shown that the predictive performance of BCPred outperforms 11 SVM‐based classifiers developed and evaluated in the authors' experiments as well as the implementation of AAP (AUC = 0.7).

...read moreread less

Abstract: The identification and characterization of B-cell epitopes play an important role in vaccine design, immunodiagnostic tests, and antibody production Therefore, computational tools for reliably predicting linear B-cell epitopes are highly desirable We evaluated Support Vector Machine (SVM) classifiers trained utilizing five different kernel methods using fivefold cross-validation on a homology-reduced data set of 701 linear B-cell epitopes, extracted from Bcipep database, and 701 non-epitopes, randomly extracted from SwissProt sequences Based on the results of our computational experiments, we propose BCPred, a novel method for predicting linear B-cell epitopes using the subsequence kernel We show that the predictive performance of BCPred (AUC = 0758) outperforms 11 SVM-based classifiers developed and evaluated in our experiments as well as our implementation of AAP (AUC = 07), a recently proposed method for predicting linear B-cell epitopes using amino acid pair antigenicity Furthermore, we compared BCPred with AAP and ABCPred, a method that uses recurrent neural networks, using two data sets of unique B-cell epitopes that had been previously used to evaluate ABCPred Analysis of the data sets used and the results of this comparison show that conclusions about the relative performance of different B-cell epitope prediction methods drawn on the basis of experiments using data sets of unique B-cell epitopes are likely to yield overly optimistic estimates of performance of evaluated methods This argues for the use of carefully homology-reduced data sets in comparing B-cell epitope prediction methods to avoid misleading conclusions about how different methods compare to each other Our homology-reduced data set and implementations of BCPred as well as the APP method are publicly available through our web-based server, BCPREDS, at: http://ailabcsiastateedu/bcpreds/

...read moreread less

623 citations

Journal Article•DOI•

Merkel Cell Polyomavirus and Two Previously Unknown Polyomaviruses Are Chronically Shed from Human Skin

[...]

Rachel M. Schowalter, Diana V. Pastrana, Katherine A. Pumphrey, Adam L. Moyer, Christopher B. Buck - Show less +1 more

17 Jun 2010-Cell Host & Microbe

TL;DR: An improved rolling circle amplification (RCA) technique to isolate circular DNA viral genomes from human skin swabs identified two previously unknown polyomavirus species that are named human polyomvirus-6 (HPyV6) and HPyV7 and indicate that infection or coinfection with these three skin-tropicpolyomaviruses is very common.

...read moreread less

538 citations