Analyzing and interpreting genome data at the network level with ConsensusPathDB.
read more
Citations
STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets.
Metascape provides a biologist-oriented resource for the analysis of systems-level datasets.
Interactome networks and human disease
The BioGRID interaction database: 2019 update
Temporal Control of Mammalian Cortical Neurogenesis by m6A Methylation
References
Controlling the false discovery rate: a practical and powerful approach to multiple testing
Hallmarks of cancer: the next generation.
Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles
Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources.
Related Papers (5)
Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2
Frequently Asked Questions (16)
Q2. How long does it take to load the full list of interactions?
For highly connected biomolecules such as EGFR, it may take 10–30 s until the full list of interactions is loaded, depending on the Internet connection.
Q3. How many databases are used to integrate the interactome?
information on molecular interactions is scattered across > 500 different databases worldwide11, which necessitates the integration of as many resources as possible into meta-databases such as ConsensusPathDB (Box 1).
Q4. What is the main limitation of ConsensusPathDB?
For some of its functionality, ConsensusPathDB already offers web services in order to allow the integration of analysis steps into automated workflows and stand-alone tools.
Q5. How long does it take to execute the protocol?
The time required to execute the above protocol is strongly dependent on the size of the analyzed data set, the load generated from the number of users on the local servers and in general on the network traffic.
Q6. What is the induced network graph algorithm?
ConsensusPathDB allows the generation of a network that connects as many members of an inputgene list (seed genes) as possible with intermediate nodes using the induced network graph algorithm.
Q7. What can be used as a resource for the generation of molecular interaction gene sets?
In addition to these analyses, the tool can be used as a resource for the generation of molecular interaction gene sets, which themselves can be used as predictive signatures.
Q8. What can be done to reveal network-level information for biomarkers of interest?
The interaction neighborhood of a single molecule can be inferred and a corresponding network can be generated; this can be done, for example, to reveal network-level information (i.e., interaction partners) for biomarkers of interest.
Q9. What are the three approaches to genome analysis?
All three approaches aim at enriching genome analysis with mechanistic network information, which enables an understanding of the underlying biological processes.
Q10. What are the common pathways annotated by ConsensusPathDB?
These sets comprise metabolic, signaling and gene regulatory pathways annotated by 12 source databases for human (4 each for mouse and yeast).
Q11. What is the consensus score for a binary PPI?
In ConsensusPathDB, all binary PPIs have an aggregated confidence score, range [0,1], that was computed as a consensus score across the six methods described above.
Q12. What is the way to perform a metabolite set analysis?
As in the case ofgenes/proteins, submit a list of metabolites of interest and perform over-representation with predefined functional sets such as pathways and GO associations.
Q13. What is the Z score for a network?
To judge the significance of the intermediate node, a Z score value is computed using a binomial proportions test as follows:Za c b db db dd= − − 1Here, a equals the number of links from the intermediate node being examined to nodes from the input seed list, b equals the number of total links for the intermediate node in the consolidated background reference network, c is the number of total links in the output subnetwork and d is the number of total links in the consolidated background reference network.
Q14. What is the way to map a metabolite to a database entry?
To map these names to database entries, it is usually necessary to convert them into the corresponding identifiers; otherwise, the system cannot match the entry.
Q15. What is the WSDL file needed for connecting to the web page?
A WSDL file needed for connecting to the SOAP/WSDL interface is provided in the ‘download / data access’ section on the web page.
Q16. Who is grateful to the authors for their contributions?
The authors are grateful to all scientists who provided annotation of the original molecular interaction data and are allowing automated access to their databases.