The Perseus computational platform for comprehensive analysis of (prote)omics data.
read more
Citations
The MaxQuant computational platform for mass spectrometry-based shotgun proteomics.
Singular Value Decomposition for Genome-Wide Expression Data Processing and Modeling
Mass-spectrometric exploration of proteome structure and function
Proteomics of SARS-CoV-2-infected host cells reveals therapy targets.
Papain-like protease regulates SARS-CoV-2 viral spread and innate immunity.
References
Controlling the false discovery rate: a practical and powerful approach to multiple testing
LIBSVM: A library for support vector machines
The Nature of Statistical Learning Theory
Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles
Cytoscape: A Software Environment for Integrated Models of Biomolecular Interaction Networks
Related Papers (5)
MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification.
Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources.
Frequently Asked Questions (18)
Q2. What have the authors stated for future works in "The perseus computational platform for comprehensive analysis of (prote)omics data" ?
Their guiding principle was to put the expertise of bioinformatics scientists in the hands of all life science researchers, allowing them to focus on their biological questions while benefitting from both powerful statistical tools and cutting edge scalable analytic possibilities without depending on often unavailable specialists. In the future, metabolomics data with relative quantification profiles for a global set of metabolites over several samples, which is similar to label free quantification proteomics data, will be accommodated by Perseus with only slight adaptations such as customization of the annotation of molecular species. One major challenge and opportunity that will drive the future development of Perseus is to bridge the currently existing gap between large-scale proteomics data generation and modeling of signaling pathways and biochemical reactions. As the experimental designs become more and more complex, the functionality of Perseus will be enriched accordingly, building upon its extensible architecture to offer more tools and to support future data types.
Q3. What is the learning plug-in for perseus?
The Learning plug-in in Perseus provides implementation of classification and regression analyses and implements various feature selection methods.
Q4. What is the purpose of the machine learning section of Perseus?
The machine learning section of Perseus has a cross validation structure for the purpose of measuring how the prediction performance of classification or regression will generalize to independent data that have not been used for model building, thereby avoiding notorious problems such as over-fitting61.
Q5. What is the function of the time series set of plug-ins?
The time series set of plug-ins of Perseus contains a periodicity analysis component that allows detection of periodic oscillations in protein expression over time.
Q6. What is the forerunner of many methods for analyzing molecular profiling data?
25 GSEA is the forerunner of many methods for analyzing molecular profiling data to determine which sets of genes or proteins are correlated with a phenotypic class distinction.
Q7. What are the other numerical values that serve as annotations?
Other numerical values that serve as annotations such as sequence length, number of identified peptides or posterior error probabilities are stored in ‘Numerical columns’.
Q8. What is the way to determine the length of the cycle?
To derive the length of the cycle from the data, a Fourier-based periodicity analysis can be performed that determines the base frequency of periodic expression changes and also allows screening for possible other cycle lengths (e.g. harmonics of the base frequency).
Q9. What is the hope of the authors?
Their hope is that this novel platform will contribute to better communication between disciplines and more effective application of computational tools.
Q10. What is the laborious step in data analysis?
Perseus platform for proteomics data18Box 3. Data integrationOne of the most laborious and error-prone steps in data analysis is matching and integration of different data types.
Q11. What is the main reason why Perseus is important to the scientific community?
The authors believe the latter feature is crucial for the scientific community as it fosters transparency and reproducibility of the reported results.
Q12. What is the common way to make a plugin available?
Once users have programmed a new plugin they can make it available through the Perseus pluginPerseus platform for proteomics data7store (www.perseus-framework.org/plugins).
Q13. What is the core set of plugins in perseus?
The authors provide a core set of plugins containing more than 100 activities that are bundled with the standard Perseus download and that can also be re-used in newly developed activities (SupplementaryTable 1).
Q14. What is the importance of quantitative information for understanding the functional role of the modification sites?
In addition to scores reflecting the reliability of identification and the confidence in the localization of each site in the protein sequence28, 29, quantitative information is crucial for understanding the functional role of the modification sites.
Q15. How is the amplitude and phase determined by the software?
(a) The amplitude (expression level) and phase (up- or downregulation) are determined by the software by optimizing a cosine function fit to the data.
Q16. What is the common way to analyze protein clusters?
Once an interesting cluster of proteins has been identified, enrichment analysis25 of biological processes, complexes or pathways is done in a variety of ways, for instance with the Fisher’s exact test checking for contingency between cluster membership and the property of interest.
Q17. What is the way to filter out phosphorylation site errors?
The phosphorylation site table is another example, in which such filtering is desirable, as sites with occupancy errors larger than a fixed threshold can be filtered out using a ‘Quality matrix’ containing the site-specific errors.
Q18. What is the free version of Perseus?
Perseus can be downloaded for free from www.perseus-framework.org under acceptance of their freeware license agreement and user account registration.