scispace - formally typeset
Search or ask a question
Author

Zhen Zhang

Bio: Zhen Zhang is an academic researcher from Johns Hopkins University. The author has contributed to research in topics: Medicine & Ovarian cancer. The author has an hindex of 57, co-authored 189 publications receiving 13444 citations. Previous affiliations of Zhen Zhang include Medical University of South Carolina & Children's National Medical Center.


Papers
More filters
Journal ArticleDOI
18 Sep 2014-Nature
TL;DR: Integrated proteogenomic analysis provides functional context to interpret genomic abnormalities and affords a new paradigm for understanding cancer biology.
Abstract: Extensive genomic characterization of human cancers presents the problem of inference from genomic abnormalities to cancer phenotypes. To address this problem, we analysed proteomes of colon and rectal tumours characterized previously by The Cancer Genome Atlas (TCGA) and perform integrated proteogenomic analyses. Somatic variants displayed reduced protein abundance compared to germline variants. Messenger RNA transcript abundance did not reliably predict protein abundance differences between tumours. Proteomics identified five proteomic subtypes in the TCGA cohort, two of which overlapped with the TCGA 'microsatellite instability/CpG island methylation phenotype' transcriptomic subtype, but had distinct mutation, methylation and protein expression patterns associated with different clinical outcomes. Although copy number alterations showed strong cis- and trans-effects on mRNA abundance, relatively few of these extend to the protein level. Thus, proteomics data enabled prioritization of candidate driver genes. The chromosome 20q amplicon was associated with the largest global changes at both mRNA and protein levels; proteomics data highlighted potential 20q candidates, including HNF4A (hepatocyte nuclear factor 4, alpha), TOMM34 (translocase of outer mitochondrial membrane 34) and SRC (SRC proto-oncogene, non-receptor tyrosine kinase). Integrated proteogenomic analysis provides functional context to interpret genomic abnormalities and affords a new paradigm for understanding cancer biology.

1,183 citations

Journal ArticleDOI
TL;DR: These biomarkers demonstrated the potential to improve the detection of early stage ovarian cancer by demonstrating the sensitivity and specificity of a multivariate model combining the three biomarkers.
Abstract: Early detection remains the most promising approach to improve long-term survival of patients with ovarian cancer. In a five-center case-control study, serum proteomic expressions were analyzed on 153 patients with invasive epithelial ovarian cancer, 42 with other ovarian cancers, 166 with benign pelvic masses, and 142 healthy women. Data from patients with early stage ovarian cancer and healthy women at two centers were analyzed independently and the results cross-validated to discover potential biomarkers. The results were validated using the samples from two of the remaining centers. After protein identification, biomarkers for which an immunoassay was available were tested on samples from the fifth center, which included 41 healthy women, 41 patients with ovarian cancer, and 20 each with breast, colon, and prostate cancers. Three biomarkers were identified as follows: (a) apolipoprotein A1 (down-regulated in cancer); (b) a truncated form of transthyretin (down-regulated); and (c) a cleavage fragment of inter-alpha-trypsin inhibitor heavy chain H4 (up-regulated). In independent validation to detect early stage invasive epithelial ovarian cancer from healthy controls, the sensitivity of a multivariate model combining the three biomarkers and CA125 [74% (95% CI, 52-90%)] was higher than that of CA125 alone [65% (95% CI, 43-84%)] at a matched specificity of 97% (95% CI, 89-100%). When compared at a fixed sensitivity of 83% (95% CI, 61-95%), the specificity of the model [94% (95% CI, 85-98%)] was significantly better than that of CA125 alone [52% (95% CI, 39-65%)]. These biomarkers demonstrated the potential to improve the detection of early stage ovarian cancer.

1,000 citations

Journal ArticleDOI
TL;DR: Proteomics approaches such as SELDI mass spectrometry, in conjunction with bioinformatics tools, could greatly facilitate the discovery of new and better biomarkers and show great potential for the early detection of breast cancer.
Abstract: Background: Surface-enhanced laser desorption/ionization (SELDI) is an affinity-based mass spectrometric method in which proteins of interest are selectively adsorbed to a chemically modified surface on a biochip, whereas impurities are removed by washing with buffer. This technology allows sensitive and high-throughput protein profiling of complex biological specimens. Methods: We screened for potential tumor biomarkers in 169 serum samples, including samples from a cancer group of 103 breast cancer patients at different clinical stages [stage 0 (n = 4), stage I (n = 38), stage II (n = 37), and stage III (n = 24)], from a control group of 41 healthy women, and from 25 patients with benign breast diseases. Diluted serum samples were applied to immobilized metal affinity capture Ciphergen ProteinChip® Arrays previously activated with Ni2+. Proteins bound to the chelated metal were analyzed on a ProteinChip Reader Model PBS II. Complex protein profiles of different diagnostic groups were compared and analyzed using the ProPeak software package. Results: A panel of three biomarkers was selected based on their collective contribution to the optimal separation between stage 0–I breast cancer patients and noncancer controls. The same separation was observed using independent test data from stage II–III breast cancer patients. Bootstrap cross-validation demonstrated that a sensitivity of 93% for all cancer patients and a specificity of 91% for all controls were achieved by a composite index derived by multivariate logistic regression using the three selected biomarkers. Conclusions: Proteomics approaches such as SELDI mass spectrometry, in conjunction with bioinformatics tools, could greatly facilitate the discovery of new and better biomarkers. The high sensitivity and specificity achieved by the combined use of the selected biomarkers show great potential for the early detection of breast cancer.

938 citations

Journal ArticleDOI
28 Jul 2016-Cell
TL;DR: A view of how the somatic genome drives the cancer proteome and associations between protein and post-translational modification levels and clinical outcomes in HGSC is provided.

728 citations

Patent
25 Jul 1996
TL;DR: The simultaneous multi access reasoning technology system of as discussed by the authors utilizes both existing knowledge and implicit information that can be numerically extracted from training data to provide a method and apparatus for diagnosing disease and treating a patient.
Abstract: The simultaneous multi access reasoning technology system of the present invention utilizes both existing knowledge and implicit information that can be numerically extracted from training data to provide a method and apparatus for diagnosing disease and treating a patient. This technology further comprises a system for receiving patient data from another location, analyzing the data in a trained neural network, producing a diagnostic value, and optionally transmitting the diagnostic value to another location.

526 citations


Cited by
More filters
28 Jul 2005
TL;DR: PfPMP1)与感染红细胞、树突状组胞以及胎盘的单个或多个受体作用,在黏附及免疫逃避中起关键的作�ly.
Abstract: 抗原变异可使得多种致病微生物易于逃避宿主免疫应答。表达在感染红细胞表面的恶性疟原虫红细胞表面蛋白1(PfPMP1)与感染红细胞、内皮细胞、树突状细胞以及胎盘的单个或多个受体作用,在黏附及免疫逃避中起关键的作用。每个单倍体基因组var基因家族编码约60种成员,通过启动转录不同的var基因变异体为抗原变异提供了分子基础。

18,940 citations

01 Aug 2000
TL;DR: Assessment of medical technology in the context of commercialization with Bioentrepreneur course, which addresses many issues unique to biomedical products.
Abstract: BIOE 402. Medical Technology Assessment. 2 or 3 hours. Bioentrepreneur course. Assessment of medical technology in the context of commercialization. Objectives, competition, market share, funding, pricing, manufacturing, growth, and intellectual property; many issues unique to biomedical products. Course Information: 2 undergraduate hours. 3 graduate hours. Prerequisite(s): Junior standing or above and consent of the instructor.

4,833 citations

Journal ArticleDOI
TL;DR: An international consortium dedicated to large-scale data sharing and analytics across expert groups is formed, showing marked interconnectivity between six independent classification systems coalescing into four consensus molecular subtypes (CMSs) with distinguishing features.
Abstract: Colorectal cancer (CRC) is a frequently lethal disease with heterogeneous outcomes and drug responses. To resolve inconsistencies among the reported gene expression-based CRC classifications and facilitate clinical translation, we formed an international consortium dedicated to large-scale data sharing and analytics across expert groups. We show marked interconnectivity between six independent classification systems coalescing into four consensus molecular subtypes (CMSs) with distinguishing features: CMS1 (microsatellite instability immune, 14%), hypermutated, microsatellite unstable and strong immune activation; CMS2 (canonical, 37%), epithelial, marked WNT and MYC signaling activation; CMS3 (metabolic, 13%), epithelial and evident metabolic dysregulation; and CMS4 (mesenchymal, 23%), prominent transforming growth factor-β activation, stromal invasion and angiogenesis. Samples with mixed features (13%) possibly represent a transition phenotype or intratumoral heterogeneity. We consider the CMS groups the most robust classification system currently available for CRC-with clear biological interpretability-and the basis for future clinical stratification and subtype-based targeted interventions.

3,351 citations

Journal ArticleDOI
TL;DR: This article shows how MCC produces a more informative and truthful score in evaluating binary classifications than accuracy and F1 score, by first explaining the mathematical properties, and then the asset of MCC in six synthetic use cases and in a real genomics scenario.
Abstract: To evaluate binary classifications and their confusion matrices, scientific researchers can employ several statistical rates, accordingly to the goal of the experiment they are investigating. Despite being a crucial issue in machine learning, no widespread consensus has been reached on a unified elective chosen measure yet. Accuracy and F1 score computed on confusion matrices have been (and still are) among the most popular adopted metrics in binary classification tasks. However, these statistical measures can dangerously show overoptimistic inflated results, especially on imbalanced datasets. The Matthews correlation coefficient (MCC), instead, is a more reliable statistical rate which produces a high score only if the prediction obtained good results in all of the four confusion matrix categories (true positives, false negatives, true negatives, and false positives), proportionally both to the size of positive elements and the size of negative elements in the dataset. In this article, we show how MCC produces a more informative and truthful score in evaluating binary classifications than accuracy and F1 score, by first explaining the mathematical properties, and then the asset of MCC in six synthetic use cases and in a real genomics scenario. We believe that the Matthews correlation coefficient should be preferred to accuracy and F1 score in evaluating binary classification tasks by all scientific communities.

2,358 citations