scispace - formally typeset
Search or ask a question
Institution

Institute for Systems Biology

NonprofitSeattle, Washington, United States
About: Institute for Systems Biology is a nonprofit organization based out in Seattle, Washington, United States. It is known for research contribution in the topics: Population & Proteomics. The organization has 1277 authors who have published 2777 publications receiving 353165 citations.


Papers
More filters
Journal ArticleDOI
TL;DR: A method of computing P-values based on tail approximation, where the tail of the distribution of permutation values is approximated by a generalized Pareto distribution, and a good fit and thus accurate P-value estimates can be obtained with a drastically reduced number of permutations when compared with the standard empirical way of computingP-values.
Abstract: Motivation: Permutation tests have become a standard tool to assess the statistical significance of an event under investigation. The statistical significance, as expressed in a P-value, is calculated as the fraction of permutation values that are at least as extreme as the original statistic, which was derived from non-permuted data. This empirical method directly couples both the minimal obtainable P-value and the resolution of the P-value to the number of permutations. Thereby, it imposes upon itself the need for a very large number of permutations when small P-values are to be accurately estimated. This is computationally expensive and often infeasible. Results: A method of computing P-values based on tail approximation is presented. The tail of the distribution of permutation values is approximated by a generalized Pareto distribution. A good fit and thus accurate P-value estimates can be obtained with a drastically reduced number of permutations when compared with the standard empirical way of computing P-values. Availability: The Matlab code can be obtained from the corresponding author on request. Contact: tknijnenburg@systemsbiology.org Supplementary information: Supplementary data are available at Bioinformatics online.

216 citations

Journal ArticleDOI
Diane Lefaudeux1, Bertrand De Meulder1, Matthew J. Loza2, Nancy Peffer2  +219 moreInstitutions (21)
TL;DR: Clustering based on clinicophysiologic parameters yielded 4 stable and reproducible clusters of asthmatic patients that associate with different pathobiological pathways.
Abstract: Background Asthma is a heterogeneous disease in which there is a differential response to asthma treatments. This heterogeneity needs to be evaluated so that a personalized management approach can be provided. Objectives We stratified patients with moderate-to-severe asthma based on clinicophysiologic parameters and performed an omics analysis of sputum. Methods Partition-around-medoids clustering was applied to a training set of 266 asthmatic participants from the European Unbiased Biomarkers for the Prediction of Respiratory Diseases Outcomes (U-BIOPRED) adult cohort using 8 prespecified clinic-physiologic variables. This was repeated in a separate validation set of 152 asthmatic patients. The clusters were compared based on sputum proteomics and transcriptomics data. Results Four reproducible and stable clusters of asthmatic patients were identified. The training set cluster T1 consists of patients with well-controlled moderate-to-severe asthma, whereas cluster T2 is a group of patients with late-onset severe asthma with a history of smoking and chronic airflow obstruction. Cluster T3 is similar to cluster T2 in terms of chronic airflow obstruction but is composed of nonsmokers. Cluster T4 is predominantly composed of obese female patients with uncontrolled severe asthma with increased exacerbations but with normal lung function. The validation set exhibited similar clusters, demonstrating reproducibility of the classification. There were significant differences in sputum proteomics and transcriptomics between the clusters. The severe asthma clusters (T2, T3, and T4) had higher sputum eosinophilia than cluster T1, with no differences in sputum neutrophil counts and exhaled nitric oxide and serum IgE levels. Conclusion Clustering based on clinicophysiologic parameters yielded 4 stable and reproducible clusters that associate with different pathobiological pathways.

216 citations

Journal ArticleDOI
TL;DR: A novel method, Lirnet, is presented that automatically learns a regulatory potential for each sequence polymorphism, estimating how likely it is to have a significant effect on gene expression, and produces significantly better regulatory programs than other recent approaches.
Abstract: Genome-wide RNA expression data provide a detailed view of an organism's biological state; hence, a dataset measuring expression variation between genetically diverse individuals (eQTL data) may provide important insights into the genetics of complex traits. However, with data from a relatively small number of individuals, it is difficult to distinguish true causal polymorphisms from the large number of possibilities. The problem is particularly challenging in populations with significant linkage disequilibrium, where traits are often linked to large chromosomal regions containing many genes. Here, we present a novel method, Lirnet, that automatically learns a regulatory potential for each sequence polymorphism, estimating how likely it is to have a significant effect on gene expression. This regulatory potential is defined in terms of “regulatory features”—including the function of the gene and the conservation, type, and position of genetic polymorphisms—that are available for any organism. The extent to which the different features influence the regulatory potential is learned automatically, making Lirnet readily applicable to different datasets, organisms, and feature sets. We apply Lirnet both to the human HapMap eQTL dataset and to a yeast eQTL dataset and provide statistical and biological results demonstrating that Lirnet produces significantly better regulatory programs than other recent approaches. We demonstrate in the yeast data that Lirnet can correctly suggest a specific causal sequence variation within a large, linked chromosomal region. In one example, Lirnet uncovered a novel, experimentally validated connection between Puf3—a sequence-specific RNA binding protein—and P-bodies—cytoplasmic structures that regulate translation and RNA stability—as well as the particular causative polymorphism, a SNP in Mkt1, that induces the variation in the pathway.

215 citations

Journal ArticleDOI
TL;DR: The isotope-coded affinity tag (ICAT) technique of quantitative mass spectrometry is used to compare proteins interacting with NF-E2p18/MafK during differentiation and suggests that the LCR promotes an active repression of β-globin transcription in committed cells before terminal differentiation.
Abstract: Dynamic changes in transcription factor complexes during erythroid differentiation revealed by quantitative proteomics

214 citations

Journal ArticleDOI
TL;DR: The generation of haploid mouse ESC lines from parthenogenetic embryos is reported, opening the possibility of combining the power of a haploid genome with pluripotency of embryonic stem cells to uncover fundamental biological processes in defined cell types at a genomic scale.

214 citations


Authors

Showing all 1292 results

NameH-indexPapersCitations
Younan Xia216943175757
Ruedi Aebersold182879141881
David Haussler172488224960
Steven P. Gygi172704129173
Nahum Sonenberg167647104053
Leroy Hood158853128452
Mark H. Ellisman11763755289
Wei Zhang112118993641
John Ralph10944239238
Eric H. Davidson10645447058
James R. Heath10342558548
Alan Aderem9924646682
Anne-Claude Gingras9733640714
Trey Ideker9730672276
Michael H. Gelb9450634714
Performance
Metrics
No. of papers from the Institution in previous years
YearPapers
20233
202260
2021216
2020204
2019188
2018168