Conference

Pacific Symposium on Biocomputing

About: Pacific Symposium on Biocomputing is an academic conference. The conference publishes majorly in the area(s): Population & Gene. Over the lifetime, 1287 publications have been published by the conference receiving 48728 citations.

...read moreread less

Topics: Population, Gene, Protein structure, Genome, Ontology (information science) ...read more

Papers published on a yearly basis

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

The spectrum kernel: a string kernel for SVM protein classification.

[...]

Christina S. Leslie¹, Eleazar Eskin¹, William Stafford Noble¹•Institutions (1)

Columbia University¹

01 Dec 2001

TL;DR: A new sequence-similarity kernel, the spectrum kernel, is introduced for use with support vector machines (SVMs) in a discriminative approach to the protein classification problem and performs well in comparison with state-of-the-art methods for homology detection.

...read moreread less

Abstract: We introduce a new sequence-similarity kernel, the spectrum kernel, for use with support vector machines (SVMs) in a discriminative approach to the protein classification problem. Our kernel is conceptually simple and efficient to compute and, in experiments on the SCOP database, performs well in comparison with state-of-the-art methods for homology detection. Moreover, our method produces an SVM classifier that allows linear time classification of test sequences. Our experiments provide evidence that string-based kernels, in conjunction with SVMs, could offer a viable and computationally efficient alternative to other methods of protein classification and homology detection.

...read moreread less

1,077 citations

Proceedings Article•DOI•

Mutual information relevance networks: functional genomic clustering using pairwise entropy measurements.

[...]

Atul J. Butte¹, Isaac S. Kohane•Institutions (1)

Boston Children's Hospital¹

01 Dec 1999

TL;DR: A technique that computes comprehensive pair-wise mutual information for all genes in such a data set and shows how this technique was used on a public data set of 79 RNA expression measurements of 2,467 genes to construct 22 clusters, or Relevance Networks.

...read moreread less

Abstract: Increasing numbers of methodologies are available to find functional genomic clusters in RNA expression data. We describe a technique that computes comprehensive pair-wise mutual information for all genes in such a data set. An association with a high mutual information means that one gene is non-randomly associated with another; we hypothesize this means the two are related biologically. By picking a threshold mutual information and using only associations at or above the threshold, we show how this technique was used on a public data set of 79 RNA expression measurements of 2,467 genes to construct 22 clusters, or Relevance Networks. The biological significance of each Relevance Network is explained.

...read moreread less

1,056 citations

Proceedings Article•

Reveal, a general reverse engineering algorithm for inference of genetic network architectures

[...]

Shoudan Liang¹, Stefanie Fuhrman¹, Roland Somogyi¹•Institutions (1)

Ames Research Center¹

01 Jan 1998

TL;DR: This study investigates the possibility of completely infer a complex regulatory network architecture from input/output patterns of its variables using binary models of genetic networks, and finds the problem to be tractable within the conditions tested so far.

...read moreread less

Abstract: Given the immanent gene expression mapping covering whole genomes during development, health and disease, we seek computational methods to maximize functional inference from such large data sets. Is it possible, in principle, to completely infer a complex regulatory network architecture from input/output patterns of its variables? We investigated this possibility using binary models of genetic networks. Trajectories, or state transition tables of Boolean nets, resemble time series of gene expression. By systematically analyzing the mutual information between input states and output states, one is able to infer the sets of input elements controlling each element or gene in the network. This process is unequivocal and exact for complete state transition tables. We implemented this REVerse Engineering ALgorithm (REVEAL) in a C program, and found the problem to be tractable within the conditions tested so far. For n = 50 (elements) and k = 3 (inputs per element), the analysis of incomplete state transition tables (100 state transition pairs out of a possible 10(exp 15)) reliably produced the original rule and wiring sets. While this study is limited to synchronous Boolean networks, the algorithm is generalizable to include multi-state models, essentially allowing direct application to realistic biological data sets. The ability to adequately solve the inverse problem may enable in-depth analysis of complex dynamic systems in biology and other fields.

...read moreread less

1,031 citations

Proceedings Article•DOI•

BioProspector: discovering conserved DNA motifs in upstream regulatory regions of co-expressed genes.

[...]

X. Liu¹, Douglas L. Brutlag, Jun Liu•Institutions (1)

Stanford University¹

01 Dec 2000

TL;DR: BioProspector, a C program using a Gibbs sampling strategy, examines the upstream region of genes in the same gene expression pattern group and looks for regulatory sequence motifs, showing preliminary success in finding the binding motifs for Saccharomyces cerevisiae RAP1, Bacillus subtilis RNA polymerase, and Escherichia coli CRP.

...read moreread less

Abstract: The development of genome sequencing and DNA microarray analysis of gene expression gives rise to the demand for data-mining tools. BioProspector, a C program using a Gibbs sampling strategy, examines the upstream region of genes in the same gene expression pattern group and looks for regulatory sequence motifs. BioProspector uses zero to third-order Markov background models whose parameters are either given by the user or estimated from a specified sequence file. The significance of each motif found is judged based on a motif score distribution estimated by a Monte Carlo method. In addition, BioProspector modifies the motif model used in the earlier Gibbs samplers to allow for the modeling of gapped motifs and motifs with palindromic patterns. All these modifications greatly improve the performance of the program. Although testing and development are still in progress, the program has shown preliminary success in finding the binding motifs for Saccharomyces cerevisiae RAP1, Bacillus subtilis RNA polymerase, and Escherichia coli CRP. We are currently working on combining BioProspector with a clustering program to explore gene expression networks and regulatory mechanisms.

...read moreread less

893 citations

Proceedings Article•DOI•

Principal components analysis to summarize microarray experiments: application to sporulation time series

[...]

Soumya Raychaudhuri¹, Joshua M. Stuart, Russ B. Altman•Institutions (1)

Stanford University¹

01 Dec 1999

TL;DR: This work shows that application of PCA to expression data allows us to summarize the ways in which gene responses vary under different conditions, and suggests that much of the observed variability in the experiment can be summarized in just 2 components.

...read moreread less

Abstract: A series of microarray experiments produces observations of differential expression for thousands of genes across multiple conditions. It is often not clear whether a set of experiments are measuring fundamentally different gene expression states or are measuring similar states created through different mechanisms. It is useful, therefore, to define a core set of independent features for the expression states that allow them to be compared directly. Principal components analysis (PCA) is a statistical technique for determining the key variables in a multidimensional data set that explain the differences in the observations, and can be used to simplify the analysis and visualization of multidimensional data sets. We show that application of PCA to expression data (where the experimental conditions are the variables, and the gene expression measurements are the observations) allows us to summarize the ways in which gene responses vary under different conditions. Examination of the components also provides insight into the underlying factors that are measured in the experiments. We applied PCA to the publicly released yeast sporulation data set (Chu et al. 1998). In that work, 7 different measurements of gene expression were made over time. PCA on the time-points suggests that much of the observed variability in the experiment can be summarized in just 2 components--i.e. 2 variables capture most of the information. These components appear to represent (1) overall induction level and (2) change in induction level over time. We also examined the clusters proposed in the original paper, and show how they are manifested in principal component space. Our results are available on the internet at http:?www.smi.stanford.edu/project/helix/PCArray .

...read moreread less

815 citations

Collapse

Performance

Metrics

1,287

Papers

48,727

Citations

No. of papers from the Conference in previous years
Year	Papers
2020	32
2019	63
2018	98
2017	56
2016	53
2015	2