Constraint-based models predict metabolic and associated cellular functions

doi:10.1038/NRG3643

Journal Article•DOI•

Aarash Bordbar¹, Jonathan M. Monk¹, Zachary A. King¹, Bernhard O. Palsson¹

University of California, San Diego¹

01 Feb 2014-Nature Reviews Genetics (Nature Research)-Vol. 15, Iss: 2, pp 107-120

TL;DR: This work states that an increasing number of studies have recently combined models with high-throughput data sets for prospective experimentation, leading to validation of increasingly important and relevant biological predictions.

Abstract: The prediction of cellular function from a genotype is a fundamental goal in biology. For metabolism, constraint-based modelling methods systematize biochemical, genetic and genomic knowledge into a mathematical framework that enables a mechanistic description of metabolic physiology. The use of constraint-based approaches has evolved over ~30 years, and an increasing number of studies have recently combined models with high-throughput data sets for prospective experimentation. These studies have led to validation of increasingly important and relevant biological predictions. As reviewed here, these recent successes have tangible implications in the fields of microbial evolution, interaction networks, genetic engineering and drug discovery.

Citations

Journal Article•DOI•

Gut microbiota functions: metabolism of nutrients and other food components

Ian Rowland¹, Glenn R. Gibson¹, Almut Heinken², Karen P. Scott³, Jonathan R. Swann⁴, Ines Thiele², Kieran Tuohy

University of Reading¹, University of Luxembourg², University of Aberdeen³, Imperial College London⁴

01 Feb 2018-European Journal of Nutrition

TL;DR: This review discusses the main gut microorganisms, particularly bacteria, and microbial pathways associated with the metabolism of dietary carbohydrates, proteins, plant polyphenols, bile acids, and vitamins, and the methodologies, existing and novel, that can be employed to explore gut microbial pathways of metabolism.

Abstract: The diverse microbial community that inhabits the human gut has an extensive metabolic repertoire that is distinct from, but complements the activity of mammalian enzymes in the liver and gut mucosa and includes functions essential for host digestion. As such, the gut microbiota is a key factor in shaping the biochemical profile of the diet and, therefore, its impact on host health and disease. The important role that the gut microbiota appears to play in human metabolism and health has stimulated research into the identification of specific microorganisms involved in different processes, and the elucidation of metabolic pathways, particularly those associated with metabolism of dietary components and some host-generated substances. In the first part of the review, we discuss the main gut microorganisms, particularly bacteria, and microbial pathways associated with the metabolism of dietary carbohydrates (to short chain fatty acids and gases), proteins, plant polyphenols, bile acids, and vitamins. The second part of the review focuses on the methodologies, existing and novel, that can be employed to explore gut microbial pathways of metabolism. These include mathematical models, omics techniques, isolated microbes, and enzyme assays.

1,294 citations

Cites background from "Constraint-based models predict met..."

..., biomass production, metabolic fluxes through the network that satisfy this objective are predicted [146]....
[...]

Journal Article•DOI•

Creation and analysis of biochemical constraint-based models using the COBRA Toolbox v.3.0

Laurent Heirendt¹, Sylvain Arreckx¹, Thomas Pfau¹, Sebastián N. Mendoza², Anne Richelle³, Almut Heinken¹, Hulda S. Haraldsdóttir¹, Jacek Wachowiak¹, Sarah M. Keating⁴, Vanja Vlasov¹, Stefania Magnusdottir¹, Chiam Yu Ng⁵, German Preciat¹, Alise Žagare¹, Siu Hung Joshua Chan⁵, Maike K. Aurich¹, Catherine M. Clancy¹, Jennifer Modamio¹, John T. Sauls³, Alberto Noronha¹, Aarash Bordbar, Benjamin Cousins⁶, Diana C. El Assal¹, Luis Vitores Valcárcel⁷, Iñigo Apaolaza⁷, Susan Ghaderi¹, Masoud Ahookhosh¹, Marouen Ben Guebila¹, Andrejs Kostromins⁸, Nicolas Sompairac⁹, Hoai M. Le¹, Ding Ma¹⁰, Yuekai Sun¹¹, Lin Wang⁵, James T. Yurkovich³, Miguel A.P. Oliveira¹, Phan Tu Vuong¹, Lemmer P. El Assal¹, Inna Kuperstein⁹, Andrei Zinovyev⁹, H. Scott Hinton¹², William A. Bryant¹³, Francisco J. Aragón Artacho¹⁴, Francisco J. Planes⁷, Egils Stalidzans⁸, Alejandro Maass², Santosh Vempala⁶, Michael Hucka¹⁵, Michael A. Saunders¹⁰, Costas D. Maranas⁵, Nathan E. Lewis³, Thomas Sauter¹, Bernhard O. Palsson³, Bernhard O. Palsson¹⁶, Ines Thiele¹, Ronan M. T. Fleming¹, Ronan M. T. Fleming¹⁷

University of Luxembourg¹, University of Chile², University of California, San Diego³, European Bioinformatics Institute⁴, Pennsylvania State University⁵, Georgia Institute of Technology⁶, University of Navarra⁷, University of Latvia⁸, PSL Research University⁹, Stanford University¹⁰, University of Michigan¹¹, Utah State University¹², Imperial College London¹³, University of Alicante¹⁴, California Institute of Technology¹⁵, Technical University of Denmark¹⁶, Leiden University¹⁷

01 Mar 2019-Nature Protocols

TL;DR: This protocol provides an overview of all new features of the COBRA Toolbox and can be adapted to generate and analyze constraint-based models in a wide variety of scenarios.

Abstract: Constraint-based reconstruction and analysis (COBRA) provides a molecular mechanistic framework for integrative analysis of experimental molecular systems biology data and quantitative prediction of physicochemically and biochemically feasible phenotypic states. The COBRA Toolbox is a comprehensive desktop software suite of interoperable COBRA methods. It has found widespread application in biology, biomedicine, and biotechnology because its functions can be flexibly combined to implement tailored COBRA protocols for any biochemical network. This protocol is an update to the COBRA Toolbox v.1.0 and v.2.0. Version 3.0 includes new methods for quality-controlled reconstruction, modeling, topological analysis, strain and experimental design, and network visualization, as well as network integration of chemoinformatic, metabolomic, transcriptomic, proteomic, and thermochemical data. New multi-lingual code integration also enables an expansion in COBRA application scope via high-precision, high-performance, and nonlinear numerical optimization solvers for multi-scale, multi-cellular, and reaction kinetic modeling, respectively. This protocol provides an overview of all these new features and can be adapted to generate and analyze constraint-based models in a wide variety of scenarios. The COBRA Toolbox v.3.0 provides an unparalleled depth of COBRA methods.

719 citations

Journal Article•DOI•

BiGG Models: A platform for integrating, standardizing and sharing genome-scale models.

Zachary A. King¹, Justin S. Lu¹, Andreas Dräger², Andreas Dräger¹, Philip C. Miller¹, Stephen Federowicz¹, Joshua A. Lerman¹, Ali Ebrahim¹, Bernhard O. Palsson¹, Nathan E. Lewis¹

University of California, San Diego¹, University of Tübingen²

04 Jan 2016-Nucleic Acids Research

TL;DR: BiGG Models is presented, a completely redesigned Biochemical, Genetic and Genomic knowledge base that contains more than 75 high-quality, manually-curated genome-scale metabolic models that will facilitate diverse systems biology studies and support knowledge-based analysis of diverse experimental data.

Abstract: Genome-scale metabolic models are mathematically-structured knowledge bases that can be used to predict metabolic pathway usage and growth phenotypes. Furthermore, they can generate and test hypotheses when integrated with experimental data. To maximize the value of these models, centralized repositories of high-quality models must be established, models must adhere to established standards and model components must be linked to relevant databases. Tools for model visualization further enhance their utility. To meet these needs, we present BiGG Models (http://bigg.ucsd.edu), a completely redesigned Biochemical, Genetic and Genomic knowledge base. BiGG Models contains more than 75 high-quality, manually-curated genome-scale metabolic models. On the website, users can browse, search and visualize models. BiGG Models connects genome-scale models to genome annotations and external databases. Reaction and metabolite identifiers have been standardized across models to conform to community standards and enable rapid comparison across models. Furthermore, BiGG Models provides a comprehensive application programming interface for accessing BiGG Models with modeling and analysis tools. As a resource for highly curated, standardized and accessible models of metabolism, BiGG Models will facilitate diverse systems biology studies and support knowledge-based analysis of diverse experimental data.

690 citations

Cites background or methods from "Constraint-based models predict met..."

...GEMs can be used to predict cellular phenotypes (8), contextualize omics data (9–11), design cell factories (12,13) and understand evolutionary trajectories (14)....
[...]
...The GEMs in BiGG Models can be analyzed using the many available Constraint-Based Reconstruction and Analysis (COBRA) methods (8,9,32) or any software that reads SBML....
[...]
...The next generation of models can eventually be included in BiGG; these models incorporate expression networks, increased spatial resolution, regulation and protein structures into GEMs (8,12,31)....
[...]

Journal Article•DOI•

Insights from 20 years of bacterial genome sequencing

Miriam Land¹, Loren Hauser¹, Se-Ran Jun¹, Intawat Nookaew¹, Michael R. Leuze¹, Tae-Hyuk Ahn¹, Tatiana Karpinets¹, Ole Lund², Guruprased H. Kora¹, Trudy M. Wassenaar, Suresh Poudel¹, David W. Ussery

Oak Ridge National Laboratory¹, Technical University of Denmark²

27 Feb 2015-Functional & Integrative Genomics

TL;DR: A series of questions are explored to highlight some insights that comparative genomics has produced and how it could revolutionize medicine in terms of speed and accuracy of finding pathogens and knowing how to treat them.

Abstract: Since the first two complete bacterial genome sequences were published in 1995, the science of bacteria has dramatically changed. Using third-generation DNA sequencing, it is possible to completely sequence a bacterial genome in a few hours and identify some types of methylation sites along the genome as well. Sequencing of bacterial genome sequences is now a standard procedure, and the information from tens of thousands of bacterial genomes has had a major impact on our views of the bacterial world. In this review, we explore a series of questions to highlight some insights that comparative genomics has produced. To date, there are genome sequences available from 50 different bacterial phyla and 11 different archaeal phyla. However, the distribution is quite skewed towards a few phyla that contain model organisms. But the breadth is continuing to improve, with projects dedicated to filling in less characterized taxonomic groups. The clustered regularly interspaced short palindromic repeats (CRISPR)-Cas system provides bacteria with immunity against viruses, which outnumber bacteria by tenfold. How fast can we go? Second-generation sequencing has produced a large number of draft genomes (close to 90 % of bacterial genomes in GenBank are currently not complete); third-generation sequencing can potentially produce a finished genome in a few hours, and at the same time provide methylation sites along the entire chromosome. The diversity of bacterial communities is extensive as is evident from the genome sequences available from 50 different bacterial phyla and 11 different archaeal phyla. Genome sequencing can help in classifying an organism, and in the case where multiple genomes of the same species are available, it is possible to calculate the pan- and core genomes; comparison of more than 2000 Escherichia coli genomes finds an E. coli core genome of about 3100 gene families and a total of about 89,000 different gene families.
Why do we care about bacterial genome sequencing? There are many practical applications, such as genome-scale metabolic modeling, biosurveillance, bioforensics, and infectious disease epidemiology. In the near future, high-throughput sequencing of patient metagenomic samples could revolutionize medicine in terms of speed and accuracy of finding pathogens and knowing how to treat them.

577 citations

Cites background from "Constraint-based models predict met..."

...This has enabled several developments on large-scale network analysis (McCloskey et al. 2013) that can have several applications (Bordbar et al. 2014)....
[...]
...Applying this framework and its derivatives, several studies in microbial evolution, metabolic engineering, biomedical applications, etc. have been highly successful (Bordbar et al. 2014; Monk and Palsson 2014)....
[...]

Journal Article•DOI•

Using Genome-scale Models to Predict Biological Capabilities.

Edward J. O’Brien¹, Jonathan M. Monk¹, Bernhard O. Palsson¹, Bernhard O. Palsson²

University of California, San Diego¹, Technical University of Denmark²

21 May 2015-Cell

TL;DR: This Primer will get you started in constraint-based reconstruction and analysis at the genome scale for metabolic engineering, antibiotic design, and organismal and enzyme evolution.

576 citations

Cites background or methods from "Constraint-based models predict met..."

...Recapitulation Given its simplicity and utility, FBA has become one of the most widely employed computational techniques for the systems-level analysis of living organisms (Bordbar et al., 2014; Lewis et al., 2012)....
[...]
...The first GEM was created for Haemophilus influenza and appeared shortly after this first genome was sequenced (Edwards and Palsson, 1999), and GEMs have now grown to the level where they enable predictive biology (Bordbar et al., 2014; McCloskey et al., 2013; Oberhardt et al., 2009)....
[...]

References

Journal Article•DOI•

Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: the Keio collection.

Tomoya Baba¹, Takeshi Ara¹, Miki Hasegawa¹, Yuki Takai¹, Yoshiko Okumura¹, Miki Baba¹, Kirill A. Datsenko², Masaru Tomita¹, Barry L. Wanner², Hirotada Mori³, Hirotada Mori¹

Keio University¹, Purdue University², Nara Institute of Science and Technology³

01 Jan 2006-Molecular Systems Biology

TL;DR: These mutants—the ‘Keio collection’—provide a new resource not only for systematic analyses of unknown gene functions and gene regulatory networks but also for genome‐wide testing of mutational effects in a common strain background, E. coli K‐12 BW25113.

Abstract: We have systematically made a set of precisely defined, single-gene deletions of all nonessential genes in Escherichia coli K-12. Open-reading frame coding regions were replaced with a kanamycin cassette flanked by FLP recognition target sites by using a one-step method for inactivation of chromosomal genes and primers designed to create in-frame deletions upon excision of the resistance cassette. Of 4288 genes targeted, mutants were obtained for 3985. To alleviate problems encountered in high-throughput studies, two independent mutants were saved for every deleted gene. These mutants, the 'Keio collection', provide a new resource not only for systematic analyses of unknown gene functions and gene regulatory networks but also for genome-wide testing of mutational effects in a common strain background, E. coli K-12 BW25113. We were unable to disrupt 303 genes, including 37 of unknown function, which are candidates for essential genes. Distribution is being handled via GenoBase (http://ecoli.aist-nara.ac.jp/).

7,428 citations

Journal Article•DOI•

Whole-genome random sequencing and assembly of Haemophilus influenzae Rd.

R. D. Fleischmann¹, M. D. Adams², Owen White², Rebecca A. Clayton², Ewen F. Kirkness², Anthony R. Kerlavage², Carol J. Bult², J. F. Tomb¹, Brian Dougherty¹, J. M. Merrick³

Johns Hopkins University School of Medicine¹, The Institute for Genomic Research², State University of New York System³

28 Jul 1995-Science

TL;DR: An approach for genome analysis based on sequencing and assembly of unselected pieces of DNA from the whole chromosome has been applied to obtain the complete nucleotide sequence of the genome from the bacterium Haemophilus influenzae Rd.

Abstract: An approach for genome analysis based on sequencing and assembly of unselected pieces of DNA from the whole chromosome has been applied to obtain the complete nucleotide sequence (1,830,137 base pairs) of the genome from the bacterium Haemophilus influenzae Rd. This approach eliminates the need for initial mapping efforts and is therefore applicable to the vast array of microbial species for which genome maps are unavailable. The H. influenzae Rd genome sequence (Genome Sequence DataBase accession number L42023) represents the only complete genome sequence from a free-living organism.

5,944 citations

Journal Article•DOI•

A comprehensive analysis of protein–protein interactions in Saccharomyces cerevisiae

Peter Uetz¹, Loic Giot, Gerard Cagney, Traci A. Mansfield, Richard S. Judson, James R. Knight, Daniel Lockshon, Vaibhav A. Narayan, Maithreyan Srinivasan, Pascale Pochart, Alia Qureshi-Emili¹, Ying Li, Brian C. Godwin, Diana Conover¹, Theodore S. Kalbfleisch, Govindan Vijayadamodar, Meijia Yang, Mark Johnston², Stanley Fields¹, Jonathan M. Rothberg

University of Washington¹, Washington University in St. Louis²

10 Feb 2000-Nature

TL;DR: Examination of large-scale yeast two-hybrid screens reveals interactions that place functionally unclassified proteins in a biological context, interactions between proteins involved in the same biological function, and interactions that link biological functions together into larger cellular processes.

Abstract: Two large-scale yeast two-hybrid screens were undertaken to identify protein-protein interactions between full-length open reading frames predicted from the Saccharomyces cerevisiae genome sequence. In one approach, we constructed a protein array of about 6,000 yeast transformants, with each transformant expressing one of the open reading frames as a fusion to an activation domain. This array was screened by a simple and automated procedure for 192 yeast proteins, with positive responses identified by their positions in the array. In a second approach, we pooled cells expressing one of about 6,000 activation domain fusions to generate a library. We used a high-throughput screening procedure to screen nearly all of the 6,000 predicted yeast proteins, expressed as Gal4 DNA-binding domain fusion proteins, against the library, and characterized positives by sequence analysis. These approaches resulted in the detection of 957 putative interactions involving 1,004 S. cerevisiae proteins. These data reveal interactions that place functionally unclassified proteins in a biological context, interactions between proteins involved in the same biological function, and interactions that link biological functions together into larger cellular processes. The results of these screens are shown here.

4,877 citations

Journal Article•DOI•

KEGG for integration and interpretation of large-scale molecular data sets

Minoru Kanehisa¹, Susumu Goto², Yoko Sato², Miho Furumichi², Mao Tanabe²

Kyoto University¹, University of Tokyo²

01 Jan 2012-Nucleic Acids Research

TL;DR: This paper reports KEGG Mapper, a collection of tools for KEGG PATHWAY, BRITE and MODULE mapping that enables integration and interpretation of large-scale data sets, and describes recent enhancements to the KEGG content, especially the incorporation of disease and drug information used in practice and in society, to support translational bioinformatics.

Abstract: Kyoto Encyclopedia of Genes and Genomes (KEGG, http://www.genome.jp/kegg/ or http://www.kegg.jp/) is a database resource that integrates genomic, chemical and systemic functional information. In particular, gene catalogs from completely sequenced genomes are linked to higher-level systemic functions of the cell, the organism and the ecosystem. Major efforts have been undertaken to manually create a knowledge base for such systemic functions by capturing and organizing experimental knowledge in computable forms; namely, in the forms of KEGG pathway maps, BRITE functional hierarchies and KEGG modules. Continuous efforts have also been made to develop and improve the cross-species annotation procedure for linking genomes to the molecular networks through the KEGG Orthology system. Here we report KEGG Mapper, a collection of tools for KEGG PATHWAY, BRITE and MODULE mapping, enabling integration and interpretation of large-scale data sets. We also report a variant of the KEGG mapping procedure to extend the knowledge base, where different types of data and knowledge, such as disease genes and drug targets, are integrated as part of the KEGG molecular networks. Finally, we describe recent enhancements to the KEGG content, especially the incorporation of disease and drug information used in practice and in society, to support translational bioinformatics.

4,259 citations

Journal Article•DOI•

Using Bayesian networks to analyze expression data

Nir Friedman¹, Michal Linial, Iftach Nachman, Dana Pe'er

Hebrew University of Jerusalem¹

01 Jan 2000-Journal of Computational Biology

TL;DR: A new framework for discovering interactions between genes based on multiple expression measurements is proposed and a method for recovering gene interactions from microarray data is described using tools for learning Bayesian networks.

Abstract: DNA hybridization arrays simultaneously measure the expression level for thousands of genes. These measurements provide a "snapshot" of transcription levels within the cell. A major challenge in computational biology is to uncover, from such measurements, gene/protein interactions and key biological features of cellular systems. In this paper, we propose a new framework for discovering interactions between genes based on multiple expression measurements. This framework builds on the use of Bayesian networks for representing statistical dependencies. A Bayesian network is a graph-based model of joint multivariate probability distributions that captures properties of conditional independence between variables. Such models are attractive for their ability to describe complex stochastic processes and because they provide a clear methodology for learning from (noisy) observations. We start by showing how Bayesian networks can describe interactions between genes. We then describe a method for recovering gene interactions from microarray data using tools for learning Bayesian networks. Finally, we demonstrate this method on the S. cerevisiae cell-cycle measurements of Spellman et al. (1998).

3,507 citations