scispace - formally typeset
Search or ask a question
Author

Amin Moghaddas Gholami

Bio: Amin Moghaddas Gholami is an academic researcher from Technische Universität München. The author has contributed to research in topics: Proteome & Bioconductor. The author has an hindex of 17, co-authored 21 publications receiving 3269 citations. Previous affiliations of Amin Moghaddas Gholami include La Jolla Institute for Allergy and Immunology & German Cancer Research Center.

Papers
More filters
Journal ArticleDOI
29 May 2014-Nature
TL;DR: A mass-spectrometry-based draft of the human proteome and a public, high-performance, in-memory database for real-time analysis of terabytes of big data, called ProteomicsDB are presented, which enables navigation of proteomes, provides biological insight and fosters the development of proteomic technology.
Abstract: Proteomes are characterized by large protein-abundance differences, cell-type- and time-dependent expression patterns and post-translational modifications, all of which carry biological information that is not accessible by genomics or transcriptomics. Here we present a mass-spectrometry-based draft of the human proteome and a public, high-performance, in-memory database for real-time analysis of terabytes of big data, called ProteomicsDB. The information assembled from human tissues, cell lines and body fluids enabled estimation of the size of the protein-coding genome, and identified organ-specific proteins and a large number of translated lincRNAs (long intergenic non-coding RNAs). Analysis of messenger RNA and protein-expression profiles of human tissues revealed conserved control of protein abundance, and integration of drug-sensitivity data enabled the identification of proteins predicting resistance or sensitivity. The proteome profiles also hold considerable promise for analysing the composition and stoichiometry of protein complexes. ProteomicsDB thus enables navigation of proteomes, provides biological insight and fosters the development of proteomic technology.

1,660 citations

Journal ArticleDOI
TL;DR: It is concluded that a HF diet markedly affects the gut bacterial ecosystem at the functional level.
Abstract: The intestinal microbiota is known to regulate host energy homeostasis and can be influenced by high-calorie diets. However, changes affecting the ecosystem at the functional level are still not well characterized. We measured shifts in cecal bacterial communities in mice fed a carbohydrate or high-fat (HF) diet for 12 weeks at the level of the following: (i) diversity and taxa distribution by high-throughput 16S ribosomal RNA gene sequencing; (ii) bulk and single-cell chemical composition by Fourier-transform infrared- (FT-IR) and Raman micro-spectroscopy and (iii) metaproteome and metabolome via high-resolution mass spectrometry. High-fat diet caused shifts in the diversity of dominant gut bacteria and altered the proportion of Ruminococcaceae (decrease) and Rikenellaceae (increase). FT-IR spectroscopy revealed that the impact of the diet on cecal chemical fingerprints is greater than the impact of microbiota composition. Diet-driven changes in biochemical fingerprints of members of the Bacteroidales and Lachnospiraceae were also observed at the level of single cells, indicating that there were distinct differences in cellular composition of dominant phylotypes under different diets. Metaproteome and metabolome analyses based on the occurrence of 1760 bacterial proteins and 86 annotated metabolites revealed distinct HF diet-specific profiles. Alteration of hormonal and anti-microbial networks, bile acid and bilirubin metabolism and shifts towards amino acid and simple sugars metabolism were observed. We conclude that a HF diet markedly affects the gut bacterial ecosystem at the functional level.

550 citations

Journal ArticleDOI
TL;DR: A quantitative proteome and kinome profile of the NCI-60 panel covering, in total, 10,350 proteins (including 375 protein kinases) and including a core cancer proteome of 5,578 proteins that were consistently quantified across all tissue types is presented.

288 citations

Journal ArticleDOI
TL;DR: This work explores dimension reduction techniques as one of the emerging approaches for data integration, and how these can be applied to increase the understanding of biological systems in normal physiological function and disease.
Abstract: State-of-the-art next-generation sequencing, transcriptomics, proteomics and other high-throughput ‘omics' technologies enable the efficient generation of large experimental data sets. These data may yield unprecedented knowledge about molecular pathways in cells and their role in disease. Dimension reduction approaches have been widely used in exploratory analysis of single omics data sets. This review will focus on dimension reduction approaches for simultaneous exploratory analyses of multiple data sets. These methods extract the linear relationships that best explain the correlated structure across data sets, the variability both within and between variables (or observations) and may highlight data issues such as batch effects or outliers. We explore dimension reduction techniques as one of the emerging approaches for data integration, and how these can be applied to increase our understanding of biological systems in normal physiological function and disease.

280 citations

Journal ArticleDOI
TL;DR: Multiple co-inertia analysis (MCIA), an exploratory data analysis method that identifies co-relationships between multiple high dimensional datasets, is described, an attractive method for data integration and visualization of several datasets of multi-omics features observed on the same set of individuals.
Abstract: Background: To leverage the potential of multi-omics studies, exploratory data analysis methods that provide systematic integration and comparison of multiple layers of omics information are required. We describe multiple co-inertia analysis (MCIA), an exploratory data analysis method that identifies co-relationships between multiple high dimensional datasets. Based on a covariance optimization criterion, MCIA simultaneously projects several datasets into the same dimensional space, transforming diverse sets of features onto the same scale, to extract the most variant from each dataset and facilitate biological interpretation and pathway analysis. Results: We demonstrate integration of multiple layers of information using MCIA, applied to two typical “omics” research scenarios. The integration of transcriptome and proteome profiles of cells in the NCI-60 cancer cell line panel revealed distinct, complementary features, which together increased the coverage and power of pathway analysis. Our analysis highlighted the importance of the leukemia extravasation signaling pathway in leukemia that was not highly ranked in the analysis of any individual dataset. Secondly, we compared transcriptome profiles of high grade serous ovarian tumors that were obtained, on two different microarray platforms and next generation RNA-sequencing, to identify the most informative platform and extract robust biomarkers of molecular subtypes. We discovered that the variance of RNA-sequencing data processed using RPKM had greater variance than that with MapSplice and RSEM. We provided novel markers highly associated to tumor molecular subtype combined from four data platforms. MCIA is implemented and available in the R/Bioconductor “omicade4” package. Conclusion: We believe MCIA is an attractive method for data integration and visualization of several datasets of multi-omics features observed on the same set of individuals. The method is not dependent on feature annotation, and thus it can extract important features even when there are not present across all datasets. MCIA provides simple graphical representations for the identification of relationships between large datasets.

227 citations


Cited by
More filters
Journal ArticleDOI
23 Jan 2015-Science
TL;DR: In this paper, a map of the human tissue proteome based on an integrated omics approach that involves quantitative transcriptomics at the tissue and organ level, combined with tissue microarray-based immunohistochemistry, to achieve spatial localization of proteins down to the single-cell level.
Abstract: Resolving the molecular details of proteome variation in the different tissues and organs of the human body will greatly increase our knowledge of human biology and disease. Here, we present a map of the human tissue proteome based on an integrated omics approach that involves quantitative transcriptomics at the tissue and organ level, combined with tissue microarray-based immunohistochemistry, to achieve spatial localization of proteins down to the single-cell level. Our tissue-based analysis detected more than 90% of the putative protein-coding genes. We used this approach to explore the human secretome, the membrane proteome, the druggable proteome, the cancer proteome, and the metabolic functions in 32 different tissues and organs. All the data are integrated in an interactive Web-based database that allows exploration of individual proteins, as well as navigation of global expression patterns, in all major tissues and organs in the human body.

9,745 citations

Journal ArticleDOI
TL;DR: A significant update to one of the tools in this domain called Enrichr, a comprehensive resource for curated gene sets and a search engine that accumulates biological knowledge for further biological discoveries is presented.
Abstract: Enrichment analysis is a popular method for analyzing gene sets generated by genome-wide experiments. Here we present a significant update to one of the tools in this domain called Enrichr. Enrichr currently contains a large collection of diverse gene set libraries available for analysis and download. In total, Enrichr currently contains 180 184 annotated gene sets from 102 gene set libraries. New features have been added to Enrichr including the ability to submit fuzzy sets, upload BED files, improved application programming interface and visualization of the results as clustergrams. Overall, Enrichr is a comprehensive resource for curated gene sets and a search engine that accumulates biological knowledge for further biological discoveries. Enrichr is freely available at: http://amp.pharm.mssm.edu/Enrichr.

6,201 citations

Journal ArticleDOI
TL;DR: The developments in PRIDE resources and related tools are summarized and a brief update on the resources under development 'PRIDE Cluster' and 'PRide Proteomes', which provide a complementary view and quality-scored information of the peptide and protein identification data available inPRIDE Archive are given.
Abstract: The PRoteomics IDEntifications (PRIDE) database is one of the world-leading data repositories of mass spectrometry (MS)-based proteomics data Since the beginning of 2014, PRIDE Archive (http://wwwebiacuk/pride/archive/) is the new PRIDE archival system, replacing the original PRIDE database Here we summarize the developments in PRIDE resources and related tools since the previous update manuscript in the Database Issue in 2013 PRIDE Archive constitutes a complete redevelopment of the original PRIDE, comprising a new storage backend, data submission system and web interface, among other components PRIDE Archive supports the most-widely used PSI (Proteomics Standards Initiative) data standard formats (mzML and mzIdentML) and implements the data requirements and guidelines of the ProteomeXchange Consortium The wide adoption of ProteomeXchange within the community has triggered an unprecedented increase in the number of submitted data sets (around 150 data sets per month) We outline some statistics on the current PRIDE Archive data contents We also report on the status of the PRIDE related stand-alone tools: PRIDE Inspector, PRIDE Converter 2 and the ProteomeXchange submission tool Finally, we will give a brief update on the resources under development 'PRIDE Cluster' and 'PRIDE Proteomes', which provide a complementary view and quality-scored information of the peptide and protein identification data available in PRIDE Archive

3,375 citations

Journal ArticleDOI
25 Jun 2020-Cell
TL;DR: Using HLA class I and II predicted peptide ‘megapools’, circulating SARS-CoV-2−specific CD8+ and CD4+ T cells were identified in ∼70% and 100% of COVID-19 convalescent patients, respectively, suggesting cross-reactive T cell recognition between circulating ‘common cold’ coronaviruses and SARS.

3,043 citations