scispace - formally typeset
Journal ArticleDOI

A General Framework for Weighted Gene Co-Expression Network Analysis

Reads0
Chats0
TLDR
A general framework for `soft' thresholding that assigns a connection weight to each gene pair is described and several node connectivity measures are introduced and provided empirical evidence that they can be important for predicting the biological significance of a gene.
Abstract
Gene co-expression networks are increasingly used to explore the system-level functionality of genes. The network construction is conceptually straightforward: nodes represent genes and nodes are connected if the corresponding genes are significantly co-expressed across appropriately chosen tissue samples. In reality, it is tricky to define the connections between the nodes in such networks. An important question is whether it is biologically meaningful to encode gene co-expression using binary information (connected=1, unconnected=0). We describe a general framework for ;soft' thresholding that assigns a connection weight to each gene pair. This leads us to define the notion of a weighted gene co-expression network. For soft thresholding we propose several adjacency functions that convert the co-expression measure to a connection weight. For determining the parameters of the adjacency function, we propose a biologically motivated criterion (referred to as the scale-free topology criterion). We generalize the following important network concepts to the case of weighted networks. First, we introduce several node connectivity measures and provide empirical evidence that they can be important for predicting the biological significance of a gene. Second, we provide theoretical and empirical evidence that the ;weighted' topological overlap measure (used to define gene modules) leads to more cohesive modules than its ;unweighted' counterpart. Third, we generalize the clustering coefficient to weighted networks. Unlike the unweighted clustering coefficient, the weighted clustering coefficient is not inversely related to the connectivity. We provide a model that shows how an inverse relationship between clustering coefficient and connectivity arises from hard thresholding. We apply our methods to simulated data, a cancer microarray data set, and a yeast microarray data set.

read more

Citations
More filters
Journal ArticleDOI

A protein network descriptor server and its use in studying protein, disease, metabolic and drug targeted networks

TL;DR: The usefulness of the PROFEAT computed network descriptors is illustrated by their literature-reported applications in studying the protein–protein, gene regulatory, gene co-expression, protein–drug and metabolic networks.
Journal ArticleDOI

A set of genes previously implicated in the hypoxia response might be an important modulator in the rat ear tissue response to mechanical stretch

TL;DR: It appears that the hypoxia pathway may be an important modulator of response of soft tissue to forces, and insights are given into clinical interventions that could be designed to mimic within wounded tissue the effects of forces without all the negative effects that forces themselves create.
Journal ArticleDOI

Identification of Novel Potentially Pleiotropic Variants Associated With Osteoporosis and Obesity Using the cFDR Method.

TL;DR: This study identified seven potentially pleiotropic genes associated with osteoporosis and obesity that may provide new insights into a potential genetic determination and codetermination mechanism of arthritis and obesity.
Journal ArticleDOI

Comparative transcriptome profiling of longissimus muscle tissues from Qianhua Mutton Merino and Small Tail Han sheep.

TL;DR: The results suggested that some DEGs, including MRFs, GXP1 and STAC3, play crucial roles in muscle growth and development processes, and genome-wide transcriptome analysis of QHMM and STH muscle is reported for the first time.
Journal ArticleDOI

Integration of Metabolomic and Other Omics Data in Population-Based Study Designs: An Epidemiological Perspective.

TL;DR: In this review, epidemiologic principles of study design, including selection of biospecimen source(s) and the implications of the timing of sample collection, are discussed in the context of a multi-omic investigation, and the strengths and limitations of various techniques of data integration across multi-omics data types are discussed.
References
More filters
Journal ArticleDOI

Emergence of Scaling in Random Networks

TL;DR: A model based on these two ingredients reproduces the observed stationary scale-free distributions, which indicates that the development of large networks is governed by robust self-organizing phenomena that go beyond the particulars of the individual systems.
Journal ArticleDOI

Statistical mechanics of complex networks

TL;DR: In this paper, a simple model based on the power-law degree distribution of real networks was proposed, which was able to reproduce the power law degree distribution in real networks and to capture the evolution of networks, not just their static topology.
Journal ArticleDOI

Cluster analysis and display of genome-wide expression patterns

TL;DR: A system of cluster analysis for genome-wide expression data from DNA microarray hybridization is described that uses standard statistical algorithms to arrange genes according to similarity in pattern of gene expression, finding in the budding yeast Saccharomyces cerevisiae that clustering gene expression data groups together efficiently genes of known similar function.
Book

Finding Groups in Data: An Introduction to Cluster Analysis

TL;DR: An electrical signal transmission system, applicable to the transmission of signals from trackside hot box detector equipment for railroad locomotives and rolling stock, wherein a basic pulse train is transmitted whereof the pulses are of a selected first amplitude and represent a train axle count.
Journal ArticleDOI

R: A Language for Data Analysis and Graphics

TL;DR: In this article, the authors discuss their experience designing and implementing a statistical computing language, which combines what they felt were useful features from two existing computer languages, and they feel that the new language provides advantages in the areas of portability, computational efficiency, memory management, and scope.
Related Papers (5)