Journal Article
DiscoveryLink: a system for integrated access to life sciences data sources
TL;DR: The DiscoveryLink offering is described, focusing on two key elements, the wrapper architecture and the query optimizer, and how it can be used to integrate the access to life sciences data from heterogeneous data sources.
Abstract:
Vast amounts of life sciences data reside today in specialized data sources, with specialized query processing capabilities. Data from one source often must be combined with data from other sources to give users the information they desire. There are database middleware systems that extract data from multiple sources in response to a single query. IBM's DiscoveryLink is one such system, targeted to applications from the life sciences industry. DiscoveryLink provides users with a virtual database to which they can pose arbitrarily complex queries, even though the actual data needed to answer the query may originate from several different sources, and none of those sources, by itself, is capable of answering the query. We describe the DiscoveryLink offering, focusing on two key elements, the wrapper architecture and the query optimizer, and illustrate how it can be used to integrate the access to life sciences data from heterogeneous data sources.
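The idea in the abstract, a single query answered from several sources that cannot answer it alone, can be illustrated with a minimal sketch. This is not DiscoveryLink's actual wrapper interface: the `Wrapper` class, the `federated_join` function, the two in-memory sources, and all field names are hypothetical, chosen only to show filters being pushed to each source while the middleware performs the join.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

Row = Dict[str, object]


@dataclass
class Wrapper:
    """Hypothetical wrapper: hides one data source behind a uniform scan interface."""
    name: str
    rows: List[Row]

    def scan(self, predicate: Callable[[Row], bool] = lambda r: True) -> List[Row]:
        # A real wrapper would translate the predicate into the source's
        # native query language; here the source is an in-memory list.
        return [r for r in self.rows if predicate(r)]


def federated_join(left: Wrapper, right: Wrapper, key: str,
                   left_pred: Callable[[Row], bool] = lambda r: True,
                   right_pred: Callable[[Row], bool] = lambda r: True) -> List[Row]:
    """Hash join done in the middleware over rows fetched through two wrappers."""
    index: Dict[object, List[Row]] = {}
    for row in left.scan(left_pred):
        index.setdefault(row[key], []).append(row)
    joined: List[Row] = []
    for row in right.scan(right_pred):
        for match in index.get(row[key], []):
            joined.append({**match, **row})
    return joined


# Two illustrative sources: neither one alone can answer the combined question.
proteins = Wrapper("protein_source", [
    {"protein_id": "P1", "family": "kinase"},
    {"protein_id": "P2", "family": "protease"},
])
assays = Wrapper("assay_source", [
    {"protein_id": "P1", "compound": "C7", "ic50_nm": 12.0},
    {"protein_id": "P2", "compound": "C9", "ic50_nm": 450.0},
    {"protein_id": "P1", "compound": "C3", "ic50_nm": 900.0},
])

# "Which compounds are potent (IC50 < 100 nM) against a kinase?"
result = federated_join(
    proteins, assays, "protein_id",
    left_pred=lambda r: r["family"] == "kinase",
    right_pred=lambda r: r["ic50_nm"] < 100,
)
print(result)
```

In this sketch each filter is evaluated by the wrapper that owns the data, and only the join runs in the middleware. A query optimizer of the kind the abstract mentions would choose between such plans based on what each source can do and at what cost.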
Citations
Journal Article
FlyMine: an integrated database for Drosophila and Anopheles genomics
Rachel Lyne, Richard J.H. Smith, Kim Rutherford, Matthew Wakeling, Andrew Varley, Francois Guillier, Hilde Janssens, Wenyan Ji, Peter McLaren, Philip North, Debashis Rana, Tom Riley, Julie Sullivan, Xavier Watkins, Mark Woodbridge, Kathryn S. Lilley, Steve Russell, Michael Ashburner, Kenji Mizuguchi, Gos Micklem
TL;DR: FlyMine is a data warehouse that addresses one of the important challenges of modern biology: how to integrate and make use of the diversity and volume of current biological data.
Journal Article
BioWarehouse: a bioinformatics database warehouse toolkit
Thomas J. Lee, Yannick Pouliot, Valerie A. Wagner, Priyanka Gupta, David W J Stringer-Calvert, Jessica D. Tenenbaum, Peter D. Karp
TL;DR: BioWarehouse integrates its component databases into a common representational framework within a single database management system, thus enabling multi-database queries using the Structured Query Language (SQL) but also facilitating a variety of database integration tasks such as comparative analysis and data mining.
Journal Article
A suite of daml+oil ontologies to describe bioinformatics web services and data
TL;DR: A description-logic approach, expressed in the web ontology language DAML+OIL, that uses property-based service descriptions and is designed to formally capture at least some of this knowledge within a virtual workbench and middleware framework, to assist a wider range of biologists in utilizing bioinformatics resources.
Journal Article
Methodological Review: Data integration and genomic medicine
TL;DR: The opportunities of genomic medicine are discussed as well as the informatics challenges in this domain and concepts and methodologies in the field of data integration are reviewed to identify potential solutions.
Journal Article
Integration of biological sources: current systems and challenges ahead
TL;DR: The pros and cons of the current approaches and systems are identified and what an integration system for biologists ought to be are discussed.
References
Journal Article
Basic Local Alignment Search Tool
TL;DR: A new approach to rapid sequence comparison, basic local alignment search tool (BLAST), directly approximates alignments that optimize a measure of local similarity, the maximal segment pair (MSP) score.
Journal Article
SMILES, a chemical language and information system. 1. introduction to methodology and encoding rules
TL;DR: SMILES (Simplified Molecular Input Line Entry System) is a chemical notation system designed for modern chemical information processing; this paper introduces its methodology and encoding rules.
Journal Article
The SWISS-PROT protein sequence data bank and its supplement TrEMBL in 1999.
Amos Marc Bairoch, Rolf Apweiler
TL;DR: The Human Proteomics Initiative (HPI), a major project to annotate all known human sequences according to the quality standards of SWISS-PROT, is described.
Proceedings Article
Access path selection in a relational database management system
TL;DR: System R is an experimental database management system developed to carry out research on the relational model of data; its optimizer chooses access paths for both simple (single-relation) queries and complex queries (such as joins), given a user specification of the desired data as a boolean expression of predicates.
Journal Article
Structure-based strategies for drug design and discovery.
TL;DR: The combination of molecular structure determination and computation is emerging as an important tool for drug development and will be applied to acquired immunodeficiency syndrome (AIDS) and bacterial drug resistance.
Related Papers (5)
SRS: information retrieval system for molecular biology data banks.
Gene Ontology: tool for the unification of biology
M Ashburner, Catherine A. Ball, Judith A. Blake, David Botstein, Heather Butler, J. M. Cherry, Allan Peter Davis, Kara Dolinski, Selina S. Dwight, J.T. Eppig, Midori A. Harris, David P. Hill, Laurie Issel-Tarver, Andrew Kasarskis, Suzanna E. Lewis, John C. Matese, Joel E. Richardson, M. Ringwald, Gerald M. Rubin, Gavin Sherlock