scispace - formally typeset
Search or ask a question
Institution

Johns Hopkins University School of Medicine

HealthcareBaltimore, Maryland, United States
About: Johns Hopkins University School of Medicine is a healthcare organization based out in Baltimore, Maryland, United States. It is known for research contribution in the topics: Population & Cancer. The organization has 44277 authors who have published 79222 publications receiving 4788882 citations.


Papers
More filters
Journal ArticleDOI
J. Craig Venter1, Mark Raymond Adams1, Eugene W. Myers1, Peter W. Li1  +269 moreInstitutions (12)
16 Feb 2001-Science
TL;DR: Comparative genomic analysis indicates vertebrate expansions of genes associated with neuronal function, with tissue-specific developmental regulation, and with the hemostasis and immune systems are indicated.
Abstract: A 2.91-billion base pair (bp) consensus sequence of the euchromatic portion of the human genome was generated by the whole-genome shotgun sequencing method. The 14.8-billion bp DNA sequence was generated over 9 months from 27,271,853 high-quality sequence reads (5.11-fold coverage of the genome) from both ends of plasmid clones made from the DNA of five individuals. Two assembly strategies-a whole-genome assembly and a regional chromosome assembly-were used, each combining sequence data from Celera and the publicly funded genome effort. The public data were shredded into 550-bp segments to create a 2.9-fold coverage of those genome regions that had been sequenced, without including biases inherent in the cloning and assembly procedure used by the publicly funded group. This brought the effective coverage in the assemblies to eightfold, reducing the number and size of gaps in the final assembly over what would be obtained with 5.11-fold coverage. The two assembly strategies yielded very similar results that largely agree with independent mapping data. The assemblies effectively cover the euchromatic regions of the human chromosomes. More than 90% of the genome is in scaffold assemblies of 100,000 bp or more, and 25% of the genome is in scaffolds of 10 million bp or larger. Analysis of the genome sequence revealed 26,588 protein-encoding transcripts for which there was strong corroborating evidence and an additional approximately 12,000 computationally derived genes with mouse matches or other weak supporting evidence. Although gene-dense clusters are obvious, almost half the genes are dispersed in low G+C sequence separated by large tracts of apparently noncoding sequence. Only 1.1% of the genome is spanned by exons, whereas 24% is in introns, with 75% of the genome being intergenic DNA. Duplications of segmental blocks, ranging in size up to chromosomal lengths, are abundant throughout the genome and reveal a complex evolutionary history. Comparative genomic analysis indicates vertebrate expansions of genes associated with neuronal function, with tissue-specific developmental regulation, and with the hemostasis and immune systems. DNA sequence comparisons between the consensus sequence and publicly funded genome data provided locations of 2.1 million single-nucleotide polymorphisms (SNPs). A random pair of human haploid genomes differed at a rate of 1 bp per 1250 on average, but there was marked heterogeneity in the level of polymorphism across the genome. Less than 1% of all SNPs resulted in variation in proteins, but the task of determining which SNPs have functional consequences remains an open challenge.

12,098 citations

Journal ArticleDOI
01 Jun 1990-Cell
TL;DR: A model for the genetic basis of colorectal neoplasia that includes the following salient features is presented, which may be applicable to other common epithelial neoplasms, in which tumors of varying stage are more difficult to study.

11,576 citations

Journal ArticleDOI
TL;DR: TopHat2 is described, which incorporates many significant enhancements to TopHat, and combines the ability to identify novel splice sites with direct mapping to known transcripts, producing sensitive and accurate alignments, even for highly repetitive genomes or in the presence of pseudogenes.
Abstract: TopHat is a popular spliced aligner for RNA-sequence (RNA-seq) experiments. In this paper, we describe TopHat2, which incorporates many significant enhancements to TopHat. TopHat2 can align reads of various lengths produced by the latest sequencing technologies, while allowing for variable-length indels with respect to the reference genome. In addition to de novo spliced alignment, TopHat2 can align reads across fusion breaks, which can occur after genomic translocations. TopHat2 combines the ability to identify novel splice sites with direct mapping to known transcripts, producing sensitive and accurate alignments, even for highly repetitive genomes or in the presence of pseudogenes. TopHat2 is available at http://ccb.jhu.edu/software/tophat.

11,380 citations

Journal ArticleDOI
TL;DR: This protocol begins with raw sequencing reads and produces a transcriptome assembly, lists of differentially expressed and regulated genes and transcripts, and publication-quality visualizations of analysis results, which takes less than 1 d of computer time for typical experiments and ∼1 h of hands-on time.
Abstract: Recent advances in high-throughput cDNA sequencing (RNA-seq) can reveal new genes and splice variants and quantify expression genome-wide in a single assay. The volume and complexity of data from RNA-seq experiments necessitate scalable, fast and mathematically principled analysis software. TopHat and Cufflinks are free, open-source software tools for gene discovery and comprehensive expression analysis of high-throughput mRNA sequencing (RNA-seq) data. Together, they allow biologists to identify new genes and new splice variants of known ones, as well as compare gene and transcript expression under two or more conditions. This protocol describes in detail how to use TopHat and Cufflinks to perform such analyses. It also covers several accessory tools and utilities that aid in managing data, including CummeRbund, a tool for visualizing RNA-seq analysis results. Although the procedure assumes basic informatics skills, these tools assume little to no background with RNA-seq analysis and are meant for novices and experts alike. The protocol begins with raw sequencing reads and produces a transcriptome assembly, lists of differentially expressed and regulated genes and transcripts, and publication-quality visualizations of analysis results. The protocol's execution time depends on the volume of transcriptome sequencing data and available computing resources but takes less than 1 d of computer time for typical experiments and ∼1 h of hands-on time.

10,913 citations

Journal ArticleDOI
TL;DR: Preliminary clinical findings with blockers of additional immune-checkpoint proteins, such as programmed cell death protein 1 (PD1), indicate broad and diverse opportunities to enhance antitumour immunity with the potential to produce durable clinical responses.
Abstract: Immune checkpoints refer to the plethora of inhibitory pathways that are crucial to maintaining self-tolerance. Tumour cells induce immune checkpoints to evade immunosurveillance. This Review discusses the progress in targeting immune checkpoints, the considerations for combinatorial therapy and the potential for additional immune-checkpoint targets.

10,602 citations


Authors

Showing all 44754 results

NameH-indexPapersCitations
Robert Langer2812324326306
Bert Vogelstein247757332094
Solomon H. Snyder2321222200444
Steven A. Rosenberg2181204199262
Kenneth W. Kinzler215640243944
Hagop M. Kantarjian2043708210208
Mark P. Mattson200980138033
Stuart H. Orkin186715112182
Paul G. Richardson1831533155912
Aaron R. Folsom1811118134044
Gonçalo R. Abecasis179595230323
Jie Zhang1784857221720
Daniel R. Weinberger177879128450
David Baker1731226109377
Eliezer Masliah170982127818
Network Information
Related Institutions (5)
University of California, San Francisco
186.2K papers, 12M citations

99% related

Baylor College of Medicine
94.8K papers, 5M citations

99% related

University of Texas Southwestern Medical Center
75.2K papers, 4.4M citations

98% related

National Institutes of Health
297.8K papers, 21.3M citations

98% related

University of Alabama at Birmingham
86.7K papers, 3.9M citations

97% related

Performance
Metrics
No. of papers from the Institution in previous years
YearPapers
2023149
2022622
20216,078
20205,107
20194,444
20183,848