scispace - formally typeset
Open Access

Database Mining in the Human Genome Initiative

Reads0
Chats0
TLDR
Improvements in genome, gene expression and proteome database mining algorithms will enable the prediction of protein function in the context of higher order processes such as the regulation of gene expression, metabolic pathways and signalling cascades and the elucidation of high-resolution structural and functional maps of the human genome.
Abstract
The Human Genome Initiative is an international research program for the creation of detailed genetic and physical maps of the human genome. Genome research projects generate enormous quantities of data. Database mining is the process of finding and extracting useful information from raw datasets. Computational genomics has identified a classification of three successive levels for the management and analysis of genetic data in scientific databases: Genomics. 1. Gene expression. 2. Proteomics. 3. Genome database mining is the identification of the protein-encoding regions of a genome and the assignment of functions to these genes on the basis of sequence similarity homologies against other genes of known function. Gene expression database mining is the identification of intrinsic patterns and relationships in transcriptional expression data generated by large-scale gene expression experiments. Proteome database mining is the identification of intrinsic patterns and relationships in translational expression data generated by large-scale proteomics experiments. Improvements in genome, gene expression and proteome database mining algorithms will enable the prediction of protein function in the context of higher order processes such as the regulation of gene expression, metabolic pathways and signalling cascades. Thus, the final objective of such higher-level functional analysis will be the elucidation of high-resolution structural and functional maps of the human genome. Contents

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Comparison of measures of marker informativeness for ancestry and admixture mapping

TL;DR: Various methods for selecting ancestry informative SNPs using simulations as well as SNP genotype data from samples of admixed populations are compared and it is shown that the In measure estimates ancestry proportion with lower bias and mean square error.
Journal ArticleDOI

Mapping genes that predict treatment outcome in admixed populations.

TL;DR: The current review explores analytical strategies that leverage population structure to more fully characterize genetic determinants of outcome in large clinical practice-based cohorts.
Dissertation

The geographical dimension of genetic diversity : a GIScience contribution for the conservation of animal genetic resources

TL;DR: In this paper, a combination of GIScience with molecular biology leads to the development of a spatial analysis method to detect signatures of natural selection within the genome, which are identical to the ones detected by a standard approach developed in population genomics.
Journal ArticleDOI

AncestrySNPminer: a bioinformatics tool to retrieve and develop ancestry informative SNP panels.

TL;DR: AncestrySNPminer is presented, the first web-based bioinformatics tool specifically designed to retrieve Ancestry Informative Markers (AIMs) from genomic data sets and link these informative markers to genes and ontological annotation classes.
References
More filters
Journal ArticleDOI

Basic Local Alignment Search Tool

TL;DR: A new approach to rapid sequence comparison, basic local alignment search tool (BLAST), directly approximates alignments that optimize a measure of local similarity, the maximal segment pair (MSP) score.
Journal ArticleDOI

Improved tools for biological sequence comparison.

TL;DR: Three computer programs for comparisons of protein and DNA sequences can be used to search sequence data bases, evaluate similarity scores, and identify periodic structures based on local sequence similarity.
Journal ArticleDOI

Identification of common molecular subsequences.

TL;DR: This letter extends the heuristic homology algorithm of Needleman & Wunsch (1970) to find a pair of segments, one from each of two long sequences, such that there is no other Pair of segments with greater similarity (homology).
Journal ArticleDOI

Rapid and sensitive protein similarity searches

TL;DR: An algorithm was developed which facilitates the search for similarities between newly determined amino acid sequences and sequences already available in databases and increases sensitivity by giving high scores to those amino acid replacements which occur frequently in evolution.
Journal ArticleDOI

Prediction of Complete Gene Structures in Human Genomic DNA

TL;DR: A general probabilistic model of the gene structure of human genomic sequences which incorporates descriptions of the basic transcriptional, translational and splicing signals, as well as length distributions and compositional features of exons, introns and intergenic regions is introduced.
Related Papers (5)