Database Mining in the Human Genome Initiative

Open Access



Abstract:

The Human Genome Initiative is an international research program for the creation of detailed genetic and physical maps of the human genome. Genome research projects generate enormous quantities of data. Database mining is the process of finding and extracting useful information from raw datasets. Computational genomics has identified three successive levels for the management and analysis of genetic data in scientific databases: (1) genomics, (2) gene expression and (3) proteomics. Genome database mining is the identification of the protein-encoding regions of a genome and the assignment of functions to these genes on the basis of sequence similarity to other genes of known function. Gene expression database mining is the identification of intrinsic patterns and relationships in transcriptional expression data generated by large-scale gene expression experiments. Proteome database mining is the identification of intrinsic patterns and relationships in translational expression data generated by large-scale proteomics experiments. Improvements in genome, gene expression and proteome database mining algorithms will enable the prediction of protein function in the context of higher order processes such as the regulation of gene expression, metabolic pathways and signalling cascades. The final objective of such higher-level functional analysis will be the elucidation of high-resolution structural and functional maps of the human genome.
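As an illustration of the genome-level step described above (assigning function on the basis of sequence similarity to genes of known function), the following is a minimal sketch and not the method of the paper. It scores a query against a small annotated set with a Smith-Waterman style local alignment and transfers the label of the best hit. The scoring values, the `min_score` threshold, the function names and the toy sequences are all assumptions chosen for illustration; production searches use substitution matrices and statistical significance estimates rather than a fixed score cutoff.

```python
def local_alignment_score(a, b, match=2, mismatch=-1, gap=-2):
    """Best local alignment score (Smith-Waterman, linear gap penalty).

    The match/mismatch/gap values are illustrative assumptions.
    """
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A local alignment may restart anywhere, so scores never drop below 0.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


def assign_function(query, annotated, min_score=8):
    """Transfer the label of the best-scoring annotated sequence.

    Returns (label, score); label is None when no hit reaches min_score
    (a hypothetical cutoff, not a statistically derived one).
    """
    best_label, best_score = None, 0
    for label, seq in annotated.items():
        score = local_alignment_score(query, seq)
        if score > best_score:
            best_label, best_score = label, score
    return (best_label if best_score >= min_score else None), best_score


# Toy, made-up sequences and labels used only to exercise the functions.
annotated = {"kinase": "MKVLAAGIVG", "transporter": "MSTNPKPQRK"}
label, score = assign_function("KVLAAGI", annotated)
print(label, score)
```

The cutoff matters in this sketch: a query with no meaningful similarity to any annotated sequence is left unlabeled instead of inheriting the function of a weak hit.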



