scispace - formally typeset
Search or ask a question
Author

Tao Liu

Bio: Tao Liu is an academic researcher from Pacific Northwest National Laboratory. The author has contributed to research in topics: Proteomics & Proteome. The author has an hindex of 51, co-authored 141 publications receiving 10385 citations. Previous affiliations of Tao Liu include University of California, Los Angeles & Environmental Molecular Sciences Laboratory.


Papers
More filters
Journal ArticleDOI
18 Sep 2014-Nature
TL;DR: Integrated proteogenomic analysis provides functional context to interpret genomic abnormalities and affords a new paradigm for understanding cancer biology.
Abstract: Extensive genomic characterization of human cancers presents the problem of inference from genomic abnormalities to cancer phenotypes. To address this problem, we analysed proteomes of colon and rectal tumours characterized previously by The Cancer Genome Atlas (TCGA) and perform integrated proteogenomic analyses. Somatic variants displayed reduced protein abundance compared to germline variants. Messenger RNA transcript abundance did not reliably predict protein abundance differences between tumours. Proteomics identified five proteomic subtypes in the TCGA cohort, two of which overlapped with the TCGA 'microsatellite instability/CpG island methylation phenotype' transcriptomic subtype, but had distinct mutation, methylation and protein expression patterns associated with different clinical outcomes. Although copy number alterations showed strong cis- and trans-effects on mRNA abundance, relatively few of these extend to the protein level. Thus, proteomics data enabled prioritization of candidate driver genes. The chromosome 20q amplicon was associated with the largest global changes at both mRNA and protein levels; proteomics data highlighted potential 20q candidates, including HNF4A (hepatocyte nuclear factor 4, alpha), TOMM34 (translocase of outer mitochondrial membrane 34) and SRC (SRC proto-oncogene, non-receptor tyrosine kinase). Integrated proteogenomic analysis provides functional context to interpret genomic abnormalities and affords a new paradigm for understanding cancer biology.

1,183 citations

Journal ArticleDOI
28 Jul 2016-Cell
TL;DR: A view of how the somatic genome drives the cancer proteome and associations between protein and post-translational modification levels and clinical outcomes in HGSC is provided.

728 citations

Journal ArticleDOI
TL;DR: The results demonstrate that the concatenated high pH reversed‐phased strategy is an attractive alternative to strong cation exchange for two‐dimensional shotgun proteomic analysis of trypsin‐digested human MCF10A cell sample.
Abstract: In this study, we evaluated a concatenated low pH (pH 3) and high pH (pH 10) reversed-phase liquid chromatography strategy as a first dimension for two-dimensional liquid chromatography tandem mass spectrometry ("shotgun") proteomic analysis of trypsin-digested human MCF10A cell sample. Compared with the more traditional strong cation exchange method, the use of concatenated high pH reversed-phase liquid chromatography as a first-dimension fractionation strategy resulted in 1.8- and 1.6-fold increases in the number of peptide and protein identifications (with two or more unique peptides), respectively. In addition to broader identifications, advantages of the concatenated high pH fractionation approach include improved protein sequence coverage, simplified sample processing, and reduced sample losses. The results demonstrate that the concatenated high pH reversed-phased strategy is an attractive alternative to strong cation exchange for two-dimensional shotgun proteomic analysis.

517 citations

Journal ArticleDOI
TL;DR: A workshop was held at the National Institutes of Health with representatives from the multiple communities developing and employing targeted MS assays and defined three tiers of assays distinguished by their performance and extent of analytical characterization.

476 citations

Journal ArticleDOI
02 May 2019-Cell
TL;DR: Comparative proteomic and phosphoproteomic analysis of paired tumor and normal adjacent tissues produced a catalog of colon cancer-associated proteins and phosphosites, including known and putative new biomarkers, drug targets, and cancer/testis antigens, which suggested glycolysis as a potential target to overcome the resistance of MSI-H tumors to immune checkpoint blockade.

439 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: MaxQuant, an integrated suite of algorithms specifically developed for high-resolution, quantitative MS data, detects peaks, isotope clusters and stable amino acid isotope–labeled (SILAC) peptide pairs as three-dimensional objects in m/z, elution time and signal intensity space and achieves mass accuracy in the p.p.b. range.
Abstract: Efficient analysis of very large amounts of raw data for peptide identification and protein quantification is a principal challenge in mass spectrometry (MS)-based proteomics. Here we describe MaxQuant, an integrated suite of algorithms specifically developed for high-resolution, quantitative MS data. Using correlation analysis and graph theory, MaxQuant detects peaks, isotope clusters and stable amino acid isotope-labeled (SILAC) peptide pairs as three-dimensional objects in m/z, elution time and signal intensity space. By integrating multiple mass measurements and correcting for linear and nonlinear mass offsets, we achieve mass accuracy in the p.p.b. range, a sixfold increase over standard techniques. We increase the proportion of identified fragmentation spectra to 73% for SILAC peptide pairs via unambiguous assignment of isotope and missed-cleavage state and individual mass precision. MaxQuant automatically quantifies several hundred thousand peptides per SILAC-proteome experiment and allows statistically robust identification and quantification of >4,000 proteins in mammalian cell lysates.

12,340 citations

Journal ArticleDOI
TL;DR: This work clarifies the preferred methodology by addressing four issues based on observed decoy hit frequencies: the major assumptions made with this database search strategy are reasonable, concatenated target-decoy database searches are preferable to separate target and decoydatabase searches, and the theoretical error associated with target-Decoy false positive (FP) rate measurements can be estimated.
Abstract: Liquid chromatography and tandem mass spectrometry (LC-MS/MS) has become the preferred method for conducting large-scale surveys of proteomes. Automated interpretation of tandem mass spectrometry (MS/MS) spectra can be problematic, however, for a variety of reasons. As most sequence search engines return results even for 'unmatchable' spectra, proteome researchers must devise ways to distinguish correct from incorrect peptide identifications. The target-decoy search strategy represents a straightforward and effective way to manage this effort. Despite the apparent simplicity of this method, some controversy surrounds its successful application. Here we clarify our preferred methodology by addressing four issues based on observed decoy hit frequencies: (i) the major assumptions made with this database search strategy are reasonable; (ii) concatenated target-decoy database searches are preferable to separate target and decoy database searches; (iii) the theoretical error associated with target-decoy false positive (FP) rate measurements can be estimated; and (iv) alternate methods for constructing decoy databases are similarly effective once certain considerations are taken into account.

3,602 citations

Journal ArticleDOI
TL;DR: An international consortium dedicated to large-scale data sharing and analytics across expert groups is formed, showing marked interconnectivity between six independent classification systems coalescing into four consensus molecular subtypes (CMSs) with distinguishing features.
Abstract: Colorectal cancer (CRC) is a frequently lethal disease with heterogeneous outcomes and drug responses. To resolve inconsistencies among the reported gene expression-based CRC classifications and facilitate clinical translation, we formed an international consortium dedicated to large-scale data sharing and analytics across expert groups. We show marked interconnectivity between six independent classification systems coalescing into four consensus molecular subtypes (CMSs) with distinguishing features: CMS1 (microsatellite instability immune, 14%), hypermutated, microsatellite unstable and strong immune activation; CMS2 (canonical, 37%), epithelial, marked WNT and MYC signaling activation; CMS3 (metabolic, 13%), epithelial and evident metabolic dysregulation; and CMS4 (mesenchymal, 23%), prominent transforming growth factor-β activation, stromal invasion and angiogenesis. Samples with mixed features (13%) possibly represent a transition phenotype or intratumoral heterogeneity. We consider the CMS groups the most robust classification system currently available for CRC-with clear biological interpretability-and the basis for future clinical stratification and subtype-based targeted interventions.

3,351 citations

Journal ArticleDOI
TL;DR: Current understanding of the major factors regulating protein expression is summarized to demonstrate a substantial role for regulatory processes occurring after mRNA is made in controlling steady-state protein abundances.
Abstract: Recent advances in next-generation DNA sequencing and proteomics provide an unprecedented ability to survey mRNA and protein abundances. Such proteome-wide surveys are illuminating the extent to which different aspects of gene expression help to regulate cellular protein abundances. Current data demonstrate a substantial role for regulatory processes occurring after mRNA is made - that is, post-transcriptional, translational and protein degradation regulation - in controlling steady-state protein abundances. Intriguing observations are also emerging in relation to cells following perturbation, single-cell studies and the apparent evolutionary conservation of protein and mRNA abundances. Here, we summarize current understanding of the major factors regulating protein expression.

3,308 citations

Journal ArticleDOI
TL;DR: The basic concept of the nanoparticle corona is reviewed and its structure and composition is highlighted, and how the properties of the corona may be linked to its biological impacts are highlighted.
Abstract: The search for understanding the interactions of nanosized materials with living organisms is leading to the rapid development of key applications, including improved drug delivery by targeting nanoparticles, and resolution of the potential threat of nanotechnological devices to organisms and the environment. Unless they are specifically designed to avoid it, nanoparticles in contact with biological fluids are rapidly covered by a selected group of biomolecules to form a corona that interacts with biological systems. Here we review the basic concept of the nanoparticle corona and its structure and composition, and highlight how the properties of the corona may be linked to its biological impacts. We conclude with a critical assessment of the key problems that need to be resolved in the near future.

2,245 citations