scispace - formally typeset
Journal ArticleDOI

Database searching and accounting of multiplexed precursor and product ion spectra from the data independent analysis of simple and complex peptide mixtures

TLDR
A novel database search algorithm is presented for the qualitative identification of proteins over a wide dynamic range, both in simple and complex biological samples, providing approximately 20% more protein identifications, compared to a more conventional data directed approach using the same identification criteria.
Abstract
A novel database search algorithm is presented for the qualitative identification of proteins over a wide dynamic range, both in simple and complex biological samples. The algorithm has been designed for the analysis of data originating from data independent acquisitions, whereby multiple precursor ions are fragmented simultaneously. Measurements used by the algorithm include retention time, ion intensities, charge state, and accurate masses on both precursor and product ions from LC-MS data. The search algorithm uses an iterative process whereby each iteration incrementally increases the selectivity, specificity, and sensitivity of the overall strategy. Increased specificity is obtained by utilizing a subset database search approach, whereby for each subsequent stage of the search, only those peptides from securely identified proteins are queried. Tentative peptide and protein identifications are ranked and scored by their relative correlation to a number of models of known and empirically derived physicochemical attributes of proteins and peptides. In addition, the algorithm utilizes decoy database techniques for automatically determining the false positive identification rates. The search algorithm has been tested by comparing the search results from a four-protein mixture, the same four-protein mixture spiked into a complex biological background, and a variety of other "system" type protein digest mixtures. The method was validated independently by data dependent methods, while concurrently relying on replication and selectivity. Comparisons were also performed with other commercially and publicly available peptide fragmentation search algorithms. The presented results demonstrate the ability to correctly identify peptides and proteins from data independent acquisition strategies with high sensitivity and specificity. They also illustrate a more comprehensive analysis of the samples studied; providing approximately 20% more protein identifications, compared to a more conventional data directed approach using the same identification criteria, with a concurrent increase in both sequence coverage and the number of modified peptides.

read more

Citations
More filters
Journal ArticleDOI

Targeted Data Extraction of the MS/MS Spectra Generated by Data-independent Acquisition: A New Concept for Consistent and Accurate Proteome Analysis

TL;DR: A new strategy that systematically queries sample sets for the presence and quantity of essentially any protein of interest is presented, using the information available in fragment ion spectral libraries to mine the complete fragment ion maps generated using a data-independent acquisition method.
Journal ArticleDOI

Quantitative mass spectrometry in proteomics: critical review update from 2007 to the present

TL;DR: The notion that mass spectrometry has the potential to eventually retire the western blot is no longer in the realm of science fiction and major technical and conceptual developments since 2007 are focused on.
Journal ArticleDOI

OpenSWATH enables automated, targeted analysis of data-independent acquisition MS data

TL;DR: This research presents a novel and scalable approaches that allow for real-time measurement of the response of the immune system to treatments such as chemotherapy and immunotherapy.
Journal ArticleDOI

Data‐independent acquisition‐based SWATH ‐ MS for quantitative proteomics: a tutorial

TL;DR: This tutorial provides guidelines on how to set up and plan a SWATH‐MS experiment, how to perform the mass spectrometric measurement and how to analyse SWath‐MS data using peptide‐centric scoring.
Journal ArticleDOI

DIA-Umpire: comprehensive computational framework for data-independent acquisition proteomics

TL;DR: DIA-Umpire enables targeted extraction of quantitative information based on peptides initially identified in only a subset of the samples, resulting in more consistent quantification across multiple samples.
References
More filters
Journal ArticleDOI

Probability-based protein identification by searching sequence databases using mass spectrometry data.

TL;DR: A new computer program, Mascot, is presented, which integrates all three types of search for protein identification by searching a sequence database using mass spectrometry data, and the scoring algorithm is probability based.
Journal ArticleDOI

Mass spectrometry-based proteomics

TL;DR: The ability of mass spectrometry to identify and, increasingly, to precisely quantify thousands of proteins from complex samples can be expected to impact broadly on biology and medicine.
Journal ArticleDOI

An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database.

TL;DR: The approach described in this manuscript provides a convenient method to interpret tandem mass spectra with known sequences in a protein database.
Journal ArticleDOI

Large-scale analysis of the yeast proteome by multidimensional protein identification technology.

TL;DR: MudPIT was applied to the proteome of the Saccharomyces cerevisiae strain BJ5460 grown to mid-log phase and yielded the largest proteome analysis to date, identifying 131 proteins with three or more predicted transmembrane domains which allowed us to map the soluble domains of many of the integral membrane proteins.
Journal ArticleDOI

Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry

TL;DR: This work clarifies the preferred methodology by addressing four issues based on observed decoy hit frequencies: the major assumptions made with this database search strategy are reasonable, concatenated target-decoy database searches are preferable to separate target and decoydatabase searches, and the theoretical error associated with target-Decoy false positive (FP) rate measurements can be estimated.
Related Papers (5)