Author
Sreenath Nampally
Other affiliations: Science Applications International Corporation, University of Maryland, Baltimore
Bio: Sreenath Nampally is an academic researcher from AstraZeneca. The author has contributed to research in topics: Interface (computing) & Workflow management system. The author has an hindex of 2, co-authored 5 publications receiving 5024 citations. Previous affiliations of Sreenath Nampally include Science Applications International Corporation & University of Maryland, Baltimore.
Papers
More filters
••
Drexel University1, Yeshiva University2, Roswell Park Cancer Institute3, Virginia Commonwealth University4, Van Andel Institute5, Science Applications International Corporation6, Massachusetts Institute of Technology7, Harvard University8, University of Miami9, Icahn School of Medicine at Mount Sinai10, University of Chicago11, Howard Hughes Medical Institute12, University of Geneva13, Stanford University14, University of Oxford15, University of North Carolina at Chapel Hill16, National Institutes of Health17
TL;DR: The Genotype-Tissue Expression (GTEx) project is described, which will establish a resource database and associated tissue bank for the scientific community to study the relationship between genetic variation and gene expression in human tissues.
Abstract: Genome-wide association studies have identified thousands of loci for common diseases, but, for the majority of these, the mechanisms underlying disease susceptibility remain unknown. Most associated variants are not correlated with protein-coding changes, suggesting that polymorphisms in regulatory regions probably contribute to many disease phenotypes. Here we describe the Genotype-Tissue Expression (GTEx) project, which will establish a resource database and associated tissue bank for the scientific community to study the relationship between genetic variation and gene expression in human tissues.
6,545 citations
••
TL;DR: A workflow management system named Ergatis that enables users to build, execute and monitor pipelines for computational analysis of genomics data and was designed to be accessible to a broad class of users and provides a user friendly, web-based interface.
Abstract: Motivation: The growth of sequence data has been accompanied by an increasing need to analyze data on distributed computer clusters. The use of these systems for routine analysis requires scalable and robust software for data management of large datasets. Software is also needed to simplify data management and make large-scale bioinformatics analysis accessible and reproducible to a wide class of target users.
Results: We have developed a workflow management system named Ergatis that enables users to build, execute and monitor pipelines for computational analysis of genomics data. Ergatis contains preconfigured components and template pipelines for a number of common bioinformatics tasks such as prokaryotic genome annotation and genome comparisons. Outputs from many of these components can be loaded into a Chado relational database. Ergatis was designed to be accessible to a broad class of users and provides a user friendly, web-based interface. Ergatis supports high-throughput batch processing on distributed compute clusters and has been used for data management in a number of genome annotation and comparative genomics projects.
Availability: Ergatis is an open-source project and is freely available at http://ergatis.sourceforge.net
Contact: ten.egrofecruos.sresu@sivroj
95 citations
••
TL;DR: In this paper, the authors assessed the relationship between hazard ratio (HR) OS and progression-free survival (PFS) at 4 and 6 months, stratified according to the mechanism of action of the investigational product.
Abstract: Early endpoints, such as progression-free survival (PFS), are increasingly used as surrogates for overall survival (OS) to accelerate approval of novel oncology agents. Compiling trial-level data from randomized controlled trials (RCTs) could help to develop a predictive framework to ascertain correlation trends between treatment effects for early and late endpoints. Through trial-level correlation and random-effects meta-regression analysis, we assessed the relationship between hazard ratio (HR) OS and (1) HR PFS and (2) odds ratio (OR) PFS at 4 and 6 months, stratified according to the mechanism of action of the investigational product. Using multiple source databases, we compiled a data set including 81 phase II-IV RCTs (35 drugs and 156 observations) of patients with non-small-cell lung cancer. Low-to-moderate correlations were generally observed between treatment effects for early endpoints (based on PFS) and HR OS across trials of agents with different mechanisms of action. Moderate correlations were seen between treatment effects for HR PFS and HR OS across all trials, and in the programmed cell death-1/programmed cell death ligand-1 and epidermal growth factor receptor trial subsets. Although these results constitute an important step, caution is advised, as there are some limitations to our evaluation, and an additional patient-level analysis would be needed to establish true surrogacy.
7 citations
01 May 2013
TL;DR: In this article, the authors proposed a new method for the detection of cancer using a set of genes extracted from the human brain, which they called LSTM-CRF.
Abstract: National Institutes of Health (U.S.) (US NIH to the Broad Institute of Harvard and MIT, R01 DA006227-17)
2 citations
••
TL;DR: The OSPred tool offers interactive visualization of clinical trial end point correlations with reference to a large pool of historical NSCLC studies and has the potential to digitally transform and accelerate data-driven decision making as part of the drug development process.
Abstract: PURPOSE Overall survival (OS) is the gold standard end point for establishing clinical benefits in phase III oncology trials. However, these trials are associated with low success rates, largely driven by failure to meet the primary end point. Surrogate end points such as progression-free survival (PFS) are increasingly being used as indicators of biologic drug activity and to inform early go/no-go decisions in oncology drug development. We developed OSPred, a digital health aid that combines actual clinical data and machine intelligence approaches to visualize correlation trends between early (PFS-based) and late (OS) end points and provide support for shared decision making in the drug development pipeline. METHODS OSPred is based on a trial-level data set of 81 reports (35 anticancer drugs with various mechanisms of action; 156 observations) in non–small-cell lung cancer (NSCLC). OSPred was developed using R Shiny, with packages ggplot2, metafor, boot, dplyr, and mvtnorm, to analyze and visualize correlation results and predict OS hazard ratio (HR OS) on the basis of user-inputted PFS-based data, namely, HR PFS, or the odds ratio of PFS at 4 (OR PFS4) or 6 (OR PFS6) months. RESULTS The three main features of the tool are as follows: prediction of HR OS on the basis of user-inputted early end point values; visualization of comparisons of the user's investigational drug with other drugs in the NSCLC setting, including by specific MoA; and creation of a probability density chart, providing point prediction and CIs for HR OS. A working version of the tool for download is linked. CONCLUSION The OSPred tool offers interactive visualization of clinical trial end point correlations with reference to a large pool of historical NSCLC studies. Its focused capability has the potential to digitally transform and accelerate data-driven decision making as part of the drug development process.
1 citations
Cited by
More filters
••
TL;DR: GEPIA (Gene Expression Profiling Interactive Analysis) fills in the gap between cancer genomics big data and the delivery of integrated information to end users, thus helping unleash the value of the current data resources.
Abstract: Tremendous amount of RNA sequencing data have been produced by large consortium projects such as TCGA and GTEx, creating new opportunities for data mining and deeper understanding of gene functions. While certain existing web servers are valuable and widely used, many expression analysis functions needed by experimental biologists are still not adequately addressed by these tools. We introduce GEPIA (Gene Expression Profiling Interactive Analysis), a web-based tool to deliver fast and customizable functionalities based on TCGA and GTEx data. GEPIA provides key interactive and customizable functions including differential expression analysis, profiling plotting, correlation analysis, patient survival analysis, similar gene detection and dimensionality reduction analysis. The comprehensive expression analyses with simple clicking through GEPIA greatly facilitate data mining in wide research areas, scientific discussion and the therapeutic discovery process. GEPIA fills in the gap between cancer genomics big data and the delivery of integrated information to end users, thus helping unleash the value of the current data resources. GEPIA is available at http://gepia.cancer-pku.cn/.
5,980 citations
••
TL;DR: This work presents a method named HISAT2 (hierarchical indexing for spliced alignment of transcripts 2) that can align both DNA and RNA sequences using a graph Ferragina Manzini index, and uses it to represent and search an expanded model of the human reference genome.
Abstract: The human reference genome represents only a small number of individuals, which limits its usefulness for genotyping. We present a method named HISAT2 (hierarchical indexing for spliced alignment of transcripts 2) that can align both DNA and RNA sequences using a graph Ferragina Manzini index. We use HISAT2 to represent and search an expanded model of the human reference genome in which over 14.5 million genomic variants in combination with haplotypes are incorporated into the data structure used for searching and alignment. We benchmark HISAT2 using simulated and real datasets to demonstrate that our strategy of representing a population of genomes, together with a fast, memory-efficient search algorithm, provides more detailed and accurate variant analyses than other methods. We apply HISAT2 for HLA typing and DNA fingerprinting; both applications form part of the HISAT-genotype software that enables analysis of haplotype-resolved genes or genomic regions. HISAT-genotype outperforms other computational methods and matches or exceeds the performance of laboratory-based assays. A graph-based genome indexing scheme enables variant-aware alignment of sequences with very low memory requirements.
4,855 citations
••
TL;DR: The landscape of gene expression across tissues is described, thousands of tissue-specific and shared regulatory expression quantitative trait loci (eQTL) variants are cataloged, complex network relationships are described, and signals from genome-wide association studies explained by eQTLs are identified.
Abstract: Understanding the functional consequences of genetic variation, and how it affects complex human disease and quantitative traits, remains a critical challenge for biomedicine. We present an analysi...
4,418 citations
••
TL;DR: The genome sequence of single cells isolated from brain glioblastomas was examined, which revealed shared chromosomal changes but also extensive transcription variation, including genes related to signaling, which represent potential therapeutic targets.
Abstract: Human cancers are complex ecosystems composed of cells with distinct phenotypes, genotypes, and epigenetic states, but current models do not adequately reflect tumor composition in patients. We used single-cell RNA sequencing (RNA-seq) to profile 430 cells from five primary glioblastomas, which we found to be inherently variable in their expression of diverse transcriptional programs related to oncogenic signaling, proliferation, complement/immune response, and hypoxia. We also observed a continuum of stemness-related expression states that enabled us to identify putative regulators of stemness in vivo. Finally, we show that established glioblastoma subtype classifiers are variably expressed across individual cells within a tumor and demonstrate the potential prognostic implications of such intratumoral heterogeneity. Thus, we reveal previously unappreciated heterogeneity in diverse regulatory programs central to glioblastoma biology, prognosis, and therapy.
3,475 citations
••
TL;DR: The remarkable range of discoveriesGWASs has facilitated in population and complex-trait genetics, the biology of diseases, and translation toward new therapeutics are reviewed.
Abstract: Application of the experimental design of genome-wide association studies (GWASs) is now 10 years old (young), and here we review the remarkable range of discoveries it has facilitated in population and complex-trait genetics, the biology of diseases, and translation toward new therapeutics. We predict the likely discoveries in the next 10 years, when GWASs will be based on millions of samples with array data imputed to a large fully sequenced reference panel and on hundreds of thousands of samples with whole-genome sequencing data.
2,669 citations