Human genome

About: Human genome is a research topic. Over its lifetime, 11,571 publications have been published within this topic, receiving 1,018,402 citations. The topic is also known as: genome, human.


Papers
Journal Article
26 Sep 2008-Science
TL;DR: Pancreatic cancers are found to contain an average of 63 genetic alterations, the majority of which are point mutations; these alterations define a core set of 12 cellular signaling pathways and processes that were each genetically altered in 67 to 100% of the tumors.
Abstract: There are currently few therapeutic options for patients with pancreatic cancer, and new insights into the pathogenesis of this lethal disease are urgently needed. Toward this end, we performed a comprehensive genetic analysis of 24 pancreatic cancers. We first determined the sequences of 23,219 transcripts, representing 20,661 protein-coding genes, in these samples. Then, we searched for homozygous deletions and amplifications in the tumor DNA by using microarrays containing probes for approximately 10^6 single-nucleotide polymorphisms. We found that pancreatic cancers contain an average of 63 genetic alterations, the majority of which are point mutations. These alterations defined a core set of 12 cellular signaling pathways and processes that were each genetically altered in 67 to 100% of the tumors. Analysis of these tumors' transcriptomes with next-generation sequencing-by-synthesis technologies provided independent evidence for the importance of these pathways and processes. Our data indicate that genetically altered core pathways and regulatory processes only become evident once the coding regions of the genome are analyzed in depth. Dysregulation of these core pathways and processes through mutation can explain the major features of pancreatic tumorigenesis.

3,721 citations

Journal Article
TL;DR: A comprehensive search for conserved elements in vertebrate genomes is conducted on genome-wide multiple alignments of five vertebrate species (human, mouse, rat, chicken, and Fugu rubripes), using a two-state phylogenetic hidden Markov model (phylo-HMM).
Abstract: We have conducted a comprehensive search for conserved elements in vertebrate genomes, using genome-wide multiple alignments of five vertebrate species (human, mouse, rat, chicken, and Fugu rubripes). Parallel searches have been performed with multiple alignments of four insect species (three species of Drosophila and Anopheles gambiae), two species of Caenorhabditis, and seven species of Saccharomyces. Conserved elements were identified with a computer program called phastCons, which is based on a two-state phylogenetic hidden Markov model (phylo-HMM). PhastCons works by fitting a phylo-HMM to the data by maximum likelihood, subject to constraints designed to calibrate the model across species groups, and then predicting conserved elements based on this model. The predicted elements cover roughly 3%-8% of the human genome (depending on the details of the calibration procedure) and substantially higher fractions of the more compact Drosophila melanogaster (37%-53%), Caenorhabditis elegans (18%-37%), and Saccharomyces cerevisiae (47%-68%) genomes. From yeasts to vertebrates, in order of increasing genome size and general biological complexity, increasing fractions of conserved bases are found to lie outside of the exons of known protein-coding genes. In all groups, the most highly conserved elements (HCEs), by log-odds score, are hundreds or thousands of bases long. These elements share certain properties with ultraconserved elements, but they tend to be longer and less perfectly conserved, and they overlap genes of somewhat different functional categories. In vertebrates, HCEs are associated with the 3' UTRs of regulatory genes, stable gene deserts, and megabase-sized regions rich in moderately conserved noncoding sequences. Noncoding HCEs also show strong statistical evidence of an enrichment for RNA secondary structure.

3,719 citations

Journal Article
TL;DR: A general probabilistic model of the gene structure of human genomic sequences which incorporates descriptions of the basic transcriptional, translational and splicing signals, as well as length distributions and compositional features of exons, introns and intergenic regions is introduced.

3,709 citations

Journal Article
01 Mar 1985-Nature
TL;DR: A probe based on a tandem-repeat of the core sequence can detect many highly variable loci simultaneously and can provide an individual-specific DNA ‘fingerprint’ of general use in human genetic analysis.
Abstract: The human genome contains many dispersed tandem-repetitive 'minisatellite' regions detected via a shared 10-15-base pair 'core' sequence similar to the generalized recombination signal (chi) of Escherichia coli. Many minisatellites are highly polymorphic due to allelic variation in repeat copy number in the minisatellite. A probe based on a tandem-repeat of the core sequence can detect many highly variable loci simultaneously and can provide an individual-specific DNA 'fingerprint' of general use in human genetic analysis.

3,552 citations

Journal Article
Piero Carninci, Takeya Kasukawa, Shintaro Katayama, Julian Gough, +194 more (36 institutions)
02 Sep 2005-Science
TL;DR: Detailed polling of transcription start and termination sites and analysis of previously unidentified full-length complementary DNAs derived from the mouse genome provide a comprehensive platform for the comparative analysis of mammalian transcriptional regulation in differentiation and development.
Abstract: This study describes comprehensive polling of transcription start and termination sites and analysis of previously unidentified full-length complementary DNAs derived from the mouse genome. We identify the 5' and 3' boundaries of 181,047 transcripts with extensive variation in transcripts arising from alternative promoter usage, splicing, and polyadenylation. There are 16,247 new mouse protein-coding transcripts, including 5154 encoding previously unidentified proteins. Genomic mapping of the transcriptome reveals transcriptional forests, with overlapping transcription on both strands, separated by deserts in which few transcripts are observed. The data provide a comprehensive platform for the comparative analysis of mammalian transcriptional regulation in differentiation and development.

3,412 citations


Network Information
Related Topics (5)
Genome
74.2K papers, 3.8M citations
92% related
Gene
211.7K papers, 10.3M citations
90% related
Regulation of gene expression
85.4K papers, 5.8M citations
89% related
Transcription (biology)
56.5K papers, 2.9M citations
89% related
RNA
111.6K papers, 5.4M citations
88% related
Performance
Metrics
No. of papers in the topic in previous years
Year    Papers
2024    1
2023    191
2022    352
2021    346
2020    392
2019    344