scispace - formally typeset
Search or ask a question
Author

Joshua E. Elias

Bio: Joshua E. Elias is an academic researcher from Stanford University. The author has contributed to research in topics: Proteome & Major histocompatibility complex. The author has an hindex of 42, co-authored 106 publications receiving 16207 citations. Previous affiliations of Joshua E. Elias include Boston Children's Hospital & Harvard University.


Papers
More filters
Journal ArticleDOI
TL;DR: This work clarifies the preferred methodology by addressing four issues based on observed decoy hit frequencies: the major assumptions made with this database search strategy are reasonable, concatenated target-decoy database searches are preferable to separate target and decoydatabase searches, and the theoretical error associated with target-Decoy false positive (FP) rate measurements can be estimated.
Abstract: Liquid chromatography and tandem mass spectrometry (LC-MS/MS) has become the preferred method for conducting large-scale surveys of proteomes. Automated interpretation of tandem mass spectrometry (MS/MS) spectra can be problematic, however, for a variety of reasons. As most sequence search engines return results even for 'unmatchable' spectra, proteome researchers must devise ways to distinguish correct from incorrect peptide identifications. The target-decoy search strategy represents a straightforward and effective way to manage this effort. Despite the apparent simplicity of this method, some controversy surrounds its successful application. Here we clarify our preferred methodology by addressing four issues based on observed decoy hit frequencies: (i) the major assumptions made with this database search strategy are reasonable; (ii) concatenated target-decoy database searches are preferable to separate target and decoy database searches; (iii) the theoretical error associated with target-decoy false positive (FP) rate measurements can be estimated; and (iv) alternate methods for constructing decoy databases are similarly effective once certain considerations are taken into account.

3,602 citations

Journal ArticleDOI
TL;DR: The combination of strong cation exchange (SCX) and reversed-phase (RP) chromatography to achieve two-dimensional separation prior to MS/MS and 1,504 yeast proteins were unambiguously identified in this single analysis.
Abstract: Highly complex protein mixtures can be directly analyzed after proteolysis by liquid chromatography coupled with tandem mass spectrometry (LC−MS/MS). In this paper, we have utilized the combination of strong cation exchange (SCX) and reversed-phase (RP) chromatography to achieve two-dimensional separation prior to MS/MS. One milligram of whole yeast protein was proteolyzed and separated by SCX chromatography (2.1 mm i.d.) with fraction collection every minute during an 80-min elution. Eighty fractions were reduced in volume and then re-injected via an autosampler in an automated fashion using a vented-column (100 μm i.d.) approach for RP-LC−MS/MS analysis. More than 162 000 MS/MS spectra were collected with 26 815 matched to yeast peptides (7537 unique peptides). A total of 1504 yeast proteins were unambiguously identified in this single analysis. We present a comparison of this experiment with a previously published yeast proteome analysis by Yates and colleagues (Washburn, M. P.; Wolters, D.; Yates, J. ...

1,654 citations

Journal ArticleDOI
TL;DR: A proteomics approach to enrich, recover, and identify ubiquitin conjugates from Saccharomyces cerevisiae lysate provides a general tool for the large-scale analysis and characterization of protein ubiquitination.
Abstract: There is a growing need for techniques that can identify and characterize protein modifications on a large or global scale. We report here a proteomics approach to enrich, recover, and identify ubiquitin conjugates from Saccharomyces cerevisiae lysate. Ubiquitin conjugates from a strain expressing 6xHis-tagged ubiquitin were isolated, proteolyzed with trypsin and analyzed by multidimensional liquid chromatography coupled with tandem mass spectrometry (LC/LC-MS/MS) for amino acid sequence determination. We identified 1,075 proteins from the sample. In addition, we detected 110 precise ubiquitination sites present in 72 ubiquitin-protein conjugates. Finally, ubiquitin itself was found to be modified at seven lysine residues providing evidence for unexpected diversity in polyubiquitin chain topology in vivo. The methodology described here provides a general tool for the large-scale analysis and characterization of protein ubiquitination.

1,626 citations

Journal ArticleDOI
23 Dec 2010-Cell
TL;DR: The data suggest that the "typical" phosphoprotein is widely expressed yet displays variable, often tissue-specific phosphorylation that tunes protein activity to the specific needs of each tissue, and is offered as an online resource for the biological research community.

1,520 citations

Journal ArticleDOI
TL;DR: Using a strategy based on strong cation exchange chromatography, phosphopeptides were enriched from the nuclear fraction of HeLa cell lysate and determined 2,002 phosphorylation sites, an unprecedented large collection of sites permitted a detailed accounting of known and unknown kinase motifs and substrates.
Abstract: Determining the site of a regulatory phosphorylation event is often essential for elucidating specific kinase–substrate relationships, providing a handle for understanding essential signaling pathways and ultimately allowing insights into numerous disease pathologies. Despite intense research efforts to elucidate mechanisms of protein phosphorylation regulation, efficient, large-scale identification and characterization of phosphorylation sites remains an unsolved problem. In this report we describe an application of existing technology for the isolation and identification of phosphorylation sites. By using a strategy based on strong cation exchange chromatography, phosphopeptides were enriched from the nuclear fraction of HeLa cell lysate. From 967 proteins, 2,002 phosphorylation sites were determined by tandem MS. This unprecedented large collection of sites permitted a detailed accounting of known and unknown kinase motifs and substrates.

1,415 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.
Abstract: Machine Learning is the study of methods for programming computers to learn. Computers are applied to a wide range of tasks, and for most of these it is relatively easy for programmers to design and implement the necessary software. However, there are many tasks for which this is difficult or impossible. These can be divided into four general categories. First, there are problems for which there exist no human experts. For example, in modern automated manufacturing facilities, there is a need to predict machine failures before they occur by analyzing sensor readings. Because the machines are new, there are no human experts who can be interviewed by a programmer to provide the knowledge necessary to build a computer system. A machine learning system can study recorded data and subsequent machine failures and learn prediction rules. Second, there are problems where human experts exist, but where they are unable to explain their expertise. This is the case in many perceptual tasks, such as speech recognition, hand-writing recognition, and natural language understanding. Virtually all humans exhibit expert-level abilities on these tasks, but none of them can describe the detailed steps that they follow as they perform them. Fortunately, humans can provide machines with examples of the inputs and correct outputs for these tasks, so machine learning algorithms can learn to map the inputs to the outputs. Third, there are problems where phenomena are changing rapidly. In finance, for example, people would like to predict the future behavior of the stock market, of consumer purchases, or of exchange rates. These behaviors change frequently, so that even if a programmer could construct a good predictive computer program, it would need to be rewritten frequently. A learning program can relieve the programmer of this burden by constantly modifying and tuning a set of learned prediction rules. Fourth, there are applications that need to be customized for each computer user separately. Consider, for example, a program to filter unwanted electronic mail messages. Different users will need different filters. It is unreasonable to expect each user to program his or her own rules, and it is infeasible to provide every user with a software engineer to keep the rules up-to-date. A machine learning system can learn which mail messages the user rejects and maintain the filtering rules automatically. Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis. Statistics focuses on understanding the phenomena that have generated the data, often with the goal of testing different hypotheses about those phenomena. Data mining seeks to find patterns in the data that are understandable by people. Psychological studies of human learning aspire to understand the mechanisms underlying the various learning behaviors exhibited by people (concept learning, skill acquisition, strategy change, etc.).

13,246 citations

Journal ArticleDOI
TL;DR: MaxQuant, an integrated suite of algorithms specifically developed for high-resolution, quantitative MS data, detects peaks, isotope clusters and stable amino acid isotope–labeled (SILAC) peptide pairs as three-dimensional objects in m/z, elution time and signal intensity space and achieves mass accuracy in the p.p.b. range.
Abstract: Efficient analysis of very large amounts of raw data for peptide identification and protein quantification is a principal challenge in mass spectrometry (MS)-based proteomics. Here we describe MaxQuant, an integrated suite of algorithms specifically developed for high-resolution, quantitative MS data. Using correlation analysis and graph theory, MaxQuant detects peaks, isotope clusters and stable amino acid isotope-labeled (SILAC) peptide pairs as three-dimensional objects in m/z, elution time and signal intensity space. By integrating multiple mass measurements and correcting for linear and nonlinear mass offsets, we achieve mass accuracy in the p.p.b. range, a sixfold increase over standard techniques. We increase the proportion of identified fragmentation spectra to 73% for SILAC peptide pairs via unambiguous assignment of isotope and missed-cleavage state and individual mass precision. MaxQuant automatically quantifies several hundred thousand peptides per SILAC-proteome experiment and allows statistically robust identification and quantification of >4,000 proteins in mammalian cell lysates.

12,340 citations

Journal ArticleDOI
13 Mar 2003-Nature
TL;DR: The ability of mass spectrometry to identify and, increasingly, to precisely quantify thousands of proteins from complex samples can be expected to impact broadly on biology and medicine.
Abstract: Recent successes illustrate the role of mass spectrometry-based proteomics as an indispensable tool for molecular and cellular biology and for the emerging field of systems biology. These include the study of protein-protein interactions via affinity-based isolations on a small and proteome-wide scale, the mapping of numerous organelles, the concurrent description of the malaria parasite genome and proteome, and the generation of quantitative protein profiles from diverse species. The ability of mass spectrometry to identify and, increasingly, to precisely quantify thousands of proteins from complex samples can be expected to impact broadly on biology and medicine.

6,597 citations

Journal ArticleDOI
TL;DR: In a recent study, this article showed that low cerebrospinal fluid (CSF) Aβ42 and amyloid-PET positivity precede other AD manifestations by many years.
Abstract: Despite continuing debate about the amyloid β‐protein (or Aβ hypothesis, new lines of evidence from laboratories and clinics worldwide support the concept that an imbalance between production and clearance of Aβ42 and related Aβ peptides is a very early, often initiating factor in Alzheimer9s disease (AD). Confirmation that presenilin is the catalytic site of γ‐secretase has provided a linchpin: all dominant mutations causing early‐onset AD occur either in the substrate (amyloid precursor protein, APP) or the protease (presenilin) of the reaction that generates Aβ. Duplication of the wild‐type APP gene in Down9s syndrome leads to Aβ deposits in the teens, followed by microgliosis, astrocytosis, and neurofibrillary tangles typical of AD. Apolipoprotein E4, which predisposes to AD in > 40% of cases, has been found to impair Aβ clearance from the brain. Soluble oligomers of Aβ42 isolated from AD patients9 brains can decrease synapse number, inhibit long‐term potentiation, and enhance long‐term synaptic depression in rodent hippocampus, and injecting them into healthy rats impairs memory. The human oligomers also induce hyperphosphorylation of tau at AD‐relevant epitopes and cause neuritic dystrophy in cultured neurons. Crossing human APP with human tau transgenic mice enhances tau‐positive neurotoxicity. In humans, new studies show that low cerebrospinal fluid (CSF) Aβ42 and amyloid‐PET positivity precede other AD manifestations by many years. Most importantly, recent trials of three different Aβ antibodies (solanezumab, crenezumab, and aducanumab) have suggested a slowing of cognitive decline in post hoc analyses of mild AD subjects. Although many factors contribute to AD pathogenesis, Aβ dyshomeostasis has emerged as the most extensively validated and compelling therapeutic target.

3,824 citations

Journal ArticleDOI
TL;DR: This work clarifies the preferred methodology by addressing four issues based on observed decoy hit frequencies: the major assumptions made with this database search strategy are reasonable, concatenated target-decoy database searches are preferable to separate target and decoydatabase searches, and the theoretical error associated with target-Decoy false positive (FP) rate measurements can be estimated.
Abstract: Liquid chromatography and tandem mass spectrometry (LC-MS/MS) has become the preferred method for conducting large-scale surveys of proteomes. Automated interpretation of tandem mass spectrometry (MS/MS) spectra can be problematic, however, for a variety of reasons. As most sequence search engines return results even for 'unmatchable' spectra, proteome researchers must devise ways to distinguish correct from incorrect peptide identifications. The target-decoy search strategy represents a straightforward and effective way to manage this effort. Despite the apparent simplicity of this method, some controversy surrounds its successful application. Here we clarify our preferred methodology by addressing four issues based on observed decoy hit frequencies: (i) the major assumptions made with this database search strategy are reasonable; (ii) concatenated target-decoy database searches are preferable to separate target and decoy database searches; (iii) the theoretical error associated with target-decoy false positive (FP) rate measurements can be estimated; and (iv) alternate methods for constructing decoy databases are similarly effective once certain considerations are taken into account.

3,602 citations