scispace - formally typeset
Search or ask a question

Showing papers by "Anil K. Madugundu published in 2014"


Journal ArticleDOI
29 May 2014-Nature
TL;DR: A draft map of the human proteome is presented using high-resolution Fourier-transform mass spectrometry to discover a number of novel protein-coding regions, which includes translated pseudogenes, non-c coding RNAs and upstream open reading frames.
Abstract: The availability of human genome sequence has transformed biomedical research over the past decade. However, an equivalent map for the human proteome with direct measurements of proteins and peptides does not exist yet. Here we present a draft map of the human proteome using high-resolution Fourier-transform mass spectrometry. In-depth proteomic profiling of 30 histologically normal human samples, including 17 adult tissues, 7 fetal tissues and 6 purified primary haematopoietic cells, resulted in identification of proteins encoded by 17,294 genes accounting for approximately 84% of the total annotated protein-coding genes in humans. A unique and comprehensive strategy for proteogenomic analysis enabled us to discover a number of novel protein-coding regions, which includes translated pseudogenes, non-coding RNAs and upstream open reading frames. This large human proteome catalogue (available as an interactive web-based resource at http://www.humanproteomemap.org) will complement available human genome and transcriptome data to accelerate biomedical research in health and disease.

1,965 citations



Journal ArticleDOI
TL;DR: A nonredundant catalogue of protein-coding genes for Chromosome 12 (Chr. 12) and further annotated proteins associated with major neurological disorders and their secretability which can be used to drive efforts for clinical applications are generated.
Abstract: In line with the aims of the Chromosome-centric Human Proteome Project (C-HPP) to completely annotate proteins of each chromosome and biology/disease driven HPP (B/D-HPP) to decipher their relation to diseases, we have generated a nonredundant catalogue of protein-coding genes for Chromosome 12 (Chr. 12) and further annotated proteins associated with major neurological disorders. Integrating high level proteomic evidence from four major databases (neXtProt, Global Proteome Machine (GPMdb), PeptideAtlas, and Human Protein Atlas (HPA)) along with Ensembl data resource resulted in the identification of 1066 protein coding genes, of which 171 were defined as “missing proteins” based on the weak or complete absence of experimental evidence. With functional annotations using DAVID and GAD, about 40% of the proteins could be grouped as brain related with implications in cancer or neurological disorders. We used published and unpublished high confidence mass spectrometry data from our group and other literature c...

23 citations


Journal ArticleDOI
TL;DR: A global quantitative proteomic approach adopted in this study generated a comparative proteome profile of brain tissues from toxoplasma encephalitis patients co-infected with HIV, which indicated deregulation of several pathways involved in antigen processing, immune response, neuronal growth, neurotransmitter transport and energy metabolism.
Abstract: Background Toxoplasma encephalitis is caused by the opportunistic protozoan parasite Toxoplasma gondii. Primary infection with T. gondii in immunocompetent individuals remains largely asymptomatic. In contrast, in immunocompromised individuals, reactivation of the parasite results in severe complications and mortality. Molecular changes at the protein level in the host central nervous system and proteins associated with pathogenesis of toxoplasma encephalitis are largely unexplored. We used a global quantitative proteomic strategy to identify differentially regulated proteins and affected molecular networks in the human host during T. gondii infection with HIV co-infection.

18 citations


Journal ArticleDOI
TL;DR: Proteogenomic investigation described here facilitated the validation and refinement of computationally derived gene models in the intron-rich genome of C. neoformans, an important fungal pathogen in humans.
Abstract: Cryptococcus neoformans, a basidiomycetous fungus of universal occurrence, is a significant opportunistic human pathogen causing meningitis. Owing to an increase in the number of immunosuppressed individuals along with emergence of drug-resistant strains, C. neoformans is gaining importance as a pathogen. Although, whole genome sequencing of three varieties of C. neoformans has been completed recently, no global proteomic studies have yet been reported. We performed a comprehensive proteomic analysis of C. neoformans var. grubii (Serotype A), which is the most virulent variety, in order to provide protein-level evidence for computationally predicted gene models and to refine the existing annotations. We confirmed the protein-coding potential of 3,674 genes from a total of 6,980 predicted protein-coding genes. We also identified 4 novel genes and corrected 104 predicted gene models. In addition, our studies led to the correction of translational start site, splice junctions and reading frame used for translation in a number of proteins. Finally, we validated a subset of our novel findings by RT-PCR and sequencing. Proteogenomic investigation described here facilitated the validation and refinement of computationally derived gene models in the intron-rich genome of C. neoformans, an important fungal pathogen in humans.

18 citations


Journal ArticleDOI
TL;DR: A proteogenomic analysis using high resolution Fourier transform MS to characterize the proteome of the GS115 strain led to the discovery of 64 novel genes and correction of 11 predicted gene models, demonstrating the utility of high resolution MS‐derived peptide sequence data to cover near complete proteomes of organisms.
Abstract: Pichia pastoris is a widely used eukaryotic host for production of recombinant proteins. We performed a proteogenomic analysis using high resolution Fourier transform MS to characterize the proteome of the GS115 strain. Our analysis resulted in identification of 46,889 unique peptides mapping to 3914 unique protein groups, which corresponds to ∼ 80% of the predicted genes. In addition, our proteogenomic analysis led to the discovery of 64 novel genes and correction of 11 predicted gene models. The strategy used here demonstrates the utility of high resolution MS-derived peptide sequence data to cover near complete proteomes of organisms. Given the popularity of P. pastoris as a protein expression host, this proteome map should provide a list of contaminants derived from the host to assist in optimization of heterologous protein production. All MS data have been deposited in the ProteomeXchange with identifier PXD000627 (http://proteomecentral.proteomexchange.org/dataset/PXD000627).

15 citations