scispace - formally typeset
Search or ask a question
Author

Andrew B. Sparks

Bio: Andrew B. Sparks is an academic researcher from Hoffmann-La Roche. The author has contributed to research in topics: Peptide & Tumor suppressor gene. The author has an hindex of 29, co-authored 51 publications receiving 16302 citations. Previous affiliations of Andrew B. Sparks include Johns Hopkins University School of Medicine & University of North Carolina at Chapel Hill.


Papers
More filters
Journal ArticleDOI
04 Sep 1998-Science
TL;DR: The c-MYC oncogene is identified as a target gene in this signaling pathway and shown to be repressed by wild-type APC and activated by beta-catenin, and these effects were mediated through Tcf-4 binding sites in the c- MYC promoter.
Abstract: The adenomatous polyposis coli gene (APC) is a tumor suppressor gene that is inactivated in most colorectal cancers. Mutations of APC cause aberrant accumulation of beta-catenin, which then binds T cell factor-4 (Tcf-4), causing increased transcriptional activation of unknown genes. Here, the c-MYC oncogene is identified as a target gene in this signaling pathway. Expression of c-MYC was shown to be repressed by wild-type APC and activated by beta-catenin, and these effects were mediated through Tcf-4 binding sites in the c-MYC promoter. These results provide a molecular framework for understanding the previously enigmatic overexpression of c-MYC in colorectal cancers.

4,686 citations

Journal ArticleDOI
21 Mar 1997-Science
TL;DR: Results indicate that regulation of β-catenin is critical to APC's tumor suppressive effect and that this regulation can be circumvented by mutations in either APC or β- catenin.
Abstract: Inactivation of the adenomatous polyposis coli (APC) tumor suppressor gene initiates colorectal neoplasia. One of the biochemical activities associated with the APC protein is down-regulation of transcriptional activation mediated by beta-catenin and T cell transcription factor 4 (Tcf-4). The protein products of mutant APC genes present in colorectal tumors were found to be defective in this activity. Furthermore, colorectal tumors with intact APC genes were found to contain activating mutations of beta-catenin that altered functionally significant phosphorylation sites. These results indicate that regulation of beta-catenin is critical to APC's tumor suppressive effect and that this regulation can be circumvented by mutations in either APC or beta-catenin.

3,859 citations

Journal ArticleDOI
01 Jan 2010-Science
TL;DR: A genome sequencing platform that achieves efficient imaging and low reagent consumption with combinatorial probe anchor ligation chemistry to independently assay each base from patterned nanoarrays of self-assembling DNA nanoballs is described.
Abstract: Genome sequencing of large numbers of individuals promises to advance the understanding, treatment, and prevention of human diseases, among other applications. We describe a genome sequencing platform that achieves efficient imaging and low reagent consumption with combinatorial probe anchor ligation chemistry to independently assay each base from patterned nanoarrays of self-assembling DNA nanoballs. We sequenced three human genomes with this platform, generating an average of 45- to 87-fold coverage per genome and identifying 3.2 to 4.5 million sequence variants per genome. Validation of one genome data set demonstrates a sequence accuracy of about 1 false variant per 100 kilobases. The high accuracy, affordable cost of $4400 for sequencing consumables, and scalability of this platform enable complete human genome sequencing for the detection of rare variants in large-scale genetic studies.

1,343 citations

Journal Article
TL;DR: It is suggested that CTNNB1 mutations can uniquely substitute for APC mutations in CR tumors and that β-catenin signaling plays a critical role in CR tumorigenesis.
Abstract: Mutation of the adenomatous polyposis coli ( APC ) tumor suppressor gene initiates the majority of colorectal (CR) cancers. One consequence of this inactivation is constitutive activation of β-catenin/Tcf-mediated transcription. To further explore the role of the APC/β-catenin/Tcf pathway in CR tumorigenesis, we searched for mutations in genes implicated in this pathway in CR tumors lacking APC mutations. No mutations of the γ-catenin ( CTNNG1 ), GSK-3α ( GSK3A ), or GSK-3β ( GSK3B ) genes were detected. In contrast, mutations in the NH 2 -terminal regulatory domain of β-catenin ( CTNNB1 ) were found in 13 of 27 (48%) CR tumors lacking APC mutations. Mutations in the β-catenin regulatory domain and APC were observed to be mutually exclusive, consistent with their equivalent effects on β-catenin stability and Tcf transactivation. In addition, we found that CTNNB1 mutations can occur in the early, adenomatous stage of CR neoplasia, as has been observed previously with APC mutations. These results suggest that CTNNB1 mutations can uniquely substitute for APC mutations in CR tumors and that β-catenin signaling plays a critical role in CR tumorigenesis.

978 citations

Journal ArticleDOI
TL;DR: The development of a method—called long serial analysis of gene expression (LongSAGE), an adaption of the original SAGE approach—that can be used to rapidly identify novel genes and exons is described.
Abstract: A remaining challenge for the human genome project involves the identification and annotation of expressed genes. The public and private sequencing efforts have identified ∼15,000 sequences that meet stringent criteria for genes, such as correspondence with known genes from humans or other species, and have made another ∼10,000–20,000 gene predictions of lower confidence, supported by various types of in silico evidence, including homology studies, domain searches, and ab initio gene predictions1,2. These computational methods have limitations, both because they are unable to identify a significant fraction of genes and exons and because they are unable to provide definitive evidence about whether a hypothetical gene is actually expressed3,4. As the in silico approaches identified a smaller number of genes than anticipated5,6,7,8,9, we wondered whether high-throughput experimental analyses could be used to provide evidence for the expression of hypothetical genes and to reveal previously undiscovered genes. We describe here the development of such a method—called long serial analysis of gene expression (LongSAGE), an adaption of the original SAGE approach10—that can be used to rapidly identify novel genes and exons.

632 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: The GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.
Abstract: Next-generation DNA sequencing (NGS) projects, such as the 1000 Genomes Project, are already revolutionizing our understanding of genetic variation among individuals. However, the massive data sets generated by NGS—the 1000 Genome pilot alone includes nearly five terabases—make writing feature-rich, efficient, and robust analysis tools difficult for even computationally sophisticated individuals. Indeed, many professionals are limited in the scope and the ease with which they can answer scientific questions by the complexity of accessing and manipulating the data produced by these machines. Here, we discuss our Genome Analysis Toolkit (GATK), a structured programming framework designed to ease the development of efficient and robust analysis tools for next-generation DNA sequencers using the functional programming philosophy of MapReduce. The GATK provides a small but rich set of data access patterns that encompass the majority of analysis tool needs. Separating specific analysis calculations from common data management infrastructure enables us to optimize the GATK framework for correctness, stability, and CPU and memory efficiency and to enable distributed and shared memory parallelization. We highlight the capabilities of the GATK by describing the implementation and application of robust, scale-tolerant tools like coverage calculators and single nucleotide polymorphism (SNP) calling. We conclude that the GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.

20,557 citations

28 Jul 2005
TL;DR: PfPMP1)与感染红细胞、树突状组胞以及胎盘的单个或多个受体作用,在黏附及免疫逃避中起关键的作�ly.
Abstract: 抗原变异可使得多种致病微生物易于逃避宿主免疫应答。表达在感染红细胞表面的恶性疟原虫红细胞表面蛋白1(PfPMP1)与感染红细胞、内皮细胞、树突状细胞以及胎盘的单个或多个受体作用,在黏附及免疫逃避中起关键的作用。每个单倍体基因组var基因家族编码约60种成员,通过启动转录不同的var基因变异体为抗原变异提供了分子基础。

18,940 citations

Journal ArticleDOI
TL;DR: A unified analytic framework to discover and genotype variation among multiple samples simultaneously that achieves sensitive and specific results across five sequencing technologies and three distinct, canonical experimental designs is presented.
Abstract: Recent advances in sequencing technology make it possible to comprehensively catalogue genetic variation in population samples, creating a foundation for understanding human disease, ancestry and evolution. The amounts of raw data produced are prodigious and many computational steps are required to translate this output into high-quality variant calls. We present a unified analytic framework to discover and genotype variation among multiple samples simultaneously that achieves sensitive and specific results across five sequencing technologies and three distinct, canonical experimental designs. Our process includes (1) initial read mapping; (2) local realignment around indels; (3) base quality score recalibration; (4) SNP discovery and genotyping to find all potential variants; and (5) machine learning to separate true segregating variation from machine artifacts common to next-generation sequencing technologies. We discuss the application of these tools, instantiated in the Genome Analysis Toolkit (GATK), to deep whole-genome, whole-exome capture, and multi-sample low-pass (~4×) 1000 Genomes Project datasets.

10,056 citations

Journal ArticleDOI
TL;DR: This review discusses recent information on functions and mechanisms of the ubiquitin system and focuses on what the authors know, and would like to know, about the mode of action of ubi...
Abstract: The selective degradation of many short-lived proteins in eukaryotic cells is carried out by the ubiquitin system. In this pathway, proteins are targeted for degradation by covalent ligation to ubiquitin, a highly conserved small protein. Ubiquitin-mediated degradation of regulatory proteins plays important roles in the control of numerous processes, including cell-cycle progression, signal transduction, transcriptional regulation, receptor down-regulation, and endocytosis. The ubiquitin system has been implicated in the immune response, development, and programmed cell death. Abnormalities in ubiquitin-mediated processes have been shown to cause pathological conditions, including malignant transformation. In this review we discuss recent information on functions and mechanisms of the ubiquitin system. Since the selectivity of protein degradation is determined mainly at the stage of ligation to ubiquitin, special attention is focused on what we know, and would like to know, about the mode of action of ubiquitin-protein ligation systems and about signals in proteins recognized by these systems.

7,888 citations

Journal ArticleDOI
01 Nov 2012-Nature
TL;DR: It is shown that evolutionary conservation and coding consequence are key determinants of the strength of purifying selection, that rare-variant load varies substantially across biological pathways, and that each individual contains hundreds of rare non-coding variants at conserved sites, such as motif-disrupting changes in transcription-factor-binding sites.
Abstract: By characterizing the geographic and functional spectrum of human genetic variation, the 1000 Genomes Project aims to build a resource to help to understand the genetic contribution to disease. Here we describe the genomes of 1,092 individuals from 14 populations, constructed using a combination of low-coverage whole-genome and exome sequencing. By developing methods to integrate information across several algorithms and diverse data sources, we provide a validated haplotype map of 38 million single nucleotide polymorphisms, 1.4 million short insertions and deletions, and more than 14,000 larger deletions. We show that individuals from different populations carry different profiles of rare and common variants, and that low-frequency variants show substantial geographic differentiation, which is further increased by the action of purifying selection. We show that evolutionary conservation and coding consequence are key determinants of the strength of purifying selection, that rare-variant load varies substantially across biological pathways, and that each individual contains hundreds of rare non-coding variants at conserved sites, such as motif-disrupting changes in transcription-factor-binding sites. This resource, which captures up to 98% of accessible single nucleotide polymorphisms at a frequency of 1% in related populations, enables analysis of common and low-frequency variants in individuals from diverse, including admixed, populations.

7,710 citations