Home
/
Authors
/
Johannes Andries Roubos

Author

Johannes Andries Roubos

Other affiliations: Delft University of Technology, Netherlands Bioinformatics Centre, Wageningen University and Research Centre

Bio: Johannes Andries Roubos is an academic researcher from DSM. The author has contributed to research in topics: Fuzzy logic & CRISPR. The author has an hindex of 27, co-authored 82 publications receiving 4111 citations. Previous affiliations of Johannes Andries Roubos include Delft University of Technology & Netherlands Bioinformatics Centre.

Topics: Fuzzy logic, CRISPR, Aspergillus niger, Genome, Peptide sequence ...read more

Papers published on a yearly basis

2023
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2003
2002
2001
1999
1998
1997

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Genome sequencing and analysis of the versatile cell factory Aspergillus niger CBS 513.88

[...]

Herman Jan Pel¹, Johannes H. de Winde¹, Johannes H. de Winde², David B. Archer³, Paul S. Dyer³, Gerald Hofmann⁴, Peter J. Schaap⁵, Geoffrey Turner⁶, Ronald P. de Vries⁷, Richard Albang⁸, Kaj Albermann⁸, Mikael Rørdam Andersen⁴, Jannick Dyrløv Bendtsen⁹, Jacques A.E. Benen⁵, Marco A. van den Berg¹, Stefaan Breestraat¹, Mark X. Caddick¹⁰, Roland Contreras¹¹, Michael Cornell¹², Pedro M. Coutinho¹³, Etienne Danchin¹³, Alfons J. M. Debets⁵, Peter J. T. Dekker¹, Piet W.M. van Dijck¹, Alard Van Dijk¹, Lubbert Dijkhuizen¹⁴, Arnold J. M. Driessen¹⁴, Christophe d'Enfert¹⁵, Steven Geysens¹¹, Coenie Goosen¹⁴, Gert S.P. Groot¹, Piet W. J. de Groot¹⁶, Thomas Guillemette¹⁷, Bernard Henrissat¹³, Marga Herweijer¹, Johannes Petrus Theodorus Wilhelmus Van Den Hombergh¹, Cees A. M. J. J. van den Hondel¹⁸, René T. J. M. van der Heijden¹⁹, Rachel M. van der Kaaij¹⁴, Frans M. Klis¹⁶, Harrie J. Kools⁵, Christian P. Kubicek, Patricia Ann van Kuyk¹⁸, Jürgen Lauber, Xin Lu, Marc J. E. C. van der Maarel, Rogier Meulenberg¹, Hildegard Henna Menke¹, Martin Mortimer¹⁰, Jens Nielsen⁴, Stephen G. Oliver¹², Maurien M.A. Olsthoorn¹, K. Pal²⁰, K. Pal⁵, Noël Nicolaas Maria Elisabeth Van Peij¹, Arthur F. J. Ram¹⁸, Ursula Rinas, Johannes Andries Roubos¹, Cornelis Maria Jacobus Sagt¹, Monika Schmoll, Jibin Sun, David W. Ussery⁴, János Varga²⁰, Wouter Vervecken¹¹, Peter J.J. Van De Vondervoort¹⁸, Holger Wedler, Han A. B. Wösten⁷, An-Ping Zeng, Albert J. J. van Ooyen¹, Jaap Visser, Hein Stam¹ - Show less +67 more•Institutions (20)

DSM¹, Delft University of Technology², University of Nottingham³, Technical University of Denmark⁴, Wageningen University and Research Centre⁵, University of Sheffield⁶, Utrecht University⁷, Biomax Informatics AG⁸, CLC bio⁹, University of Liverpool¹⁰, Ghent University¹¹, University of Manchester¹², University of Provence¹³, University of Groningen¹⁴, Pasteur Institute¹⁵, University of Amsterdam¹⁶, University of Angers¹⁷, Leiden University¹⁸, Radboud University Nijmegen¹⁹, University of Szeged²⁰

01 Feb 2007-Nature Biotechnology

TL;DR: The filamentous fungus Aspergillus niger is widely exploited by the fermentation industry for the production of enzymes and organic acids, particularly citric acid, and the sequenced genome revealed a large number of major facilitator superfamily transporters and fungal zinc binuclear cluster transcription factors.

...read moreread less

Abstract: The filamentous fungus Aspergillus niger is widely exploited by the fermentation industry for the production of enzymes and organic acids, particularly citric acid. We sequenced the 33.9-megabase genome of A. niger CBS 513.88, the ancestor of currently used enzyme production strains. A high level of synteny was observed with other aspergilli sequenced. Strong function predictions were made for 6,506 of the 14,165 open reading frames identified. A detailed description of the components of the protein secretion pathway was made and striking differences in the hydrolytic enzyme spectra of aspergilli were observed. A reconstructed metabolic network comprising 1,069 unique reactions illustrates the versatile metabolism of A. niger. Noteworthy is the large number of major facilitator superfamily transporters and fungal zinc binuclear cluster transcription factors, and the presence of putative gene clusters for fumonisin and ochratoxin A synthesis.

...read moreread less

1,161 citations

Journal Article•DOI•

Genome sequencing and analysis of the filamentous fungus Penicillium chrysogenum

[...]

Marco A. van den Berg¹, Richard Albang², Kaj Albermann², Jonathan H. Badger³, Jean-Marc Daran⁴, Arnold J. M. Driessen⁵, Carlos García-Estrada, Natalie D. Fedorova³, Diana M. Harris⁴, Wilbert H. M. Heijne¹, Vinita Joardar³, Jan A.K.W. Kiel, Andriy Kovalchuk⁵, Juan F. Martín⁶, William C. Nierman⁷, William C. Nierman³, Jeroen G. Nijland⁵, Jack T. Pronk⁴, Johannes Andries Roubos¹, Ida J. van der Klei, Noël Nicolaas Maria Elisabeth Van Peij¹, Marten Veenhuis, Hans von Döhren, Christian Wagner², Jennifer R. Wortman³, Roel A. L. Bovenberg¹ - Show less +22 more•Institutions (7)

DSM¹, Biomax Informatics AG², J. Craig Venter Institute³, Delft University of Technology⁴, University of Groningen⁵, University of León⁶, George Washington University⁷

01 Oct 2008-Nature Biotechnology

TL;DR: Genes predicted to encode transporters were strongly overrepresented among the genes transcriptionally upregulated under conditions that stimulate penicillinG production, illustrating potential for future genomics-driven metabolic engineering.

...read moreread less

Abstract: Industrial penicillin production with the filamentous fungus Penicillium chrysogenum is based on an unprecedented effort in microbial strain improvement. To gain more insight into penicillin synthesis, we sequenced the 32.19 Mb genome of P. chrysogenum Wisconsin54-1255 and identified numerous genes responsible for key steps in penicillin production. DNA microarrays were used to compare the transcriptomes of the sequenced strain and a penicillinG high-producing strain, grown in the presence and absence of the side-chain precursor phenylacetic acid. Transcription of genes involved in biosynthesis of valine, cysteine and alpha-aminoadipic acid-precursors for penicillin biosynthesis-as well as of genes encoding microbody proteins, was increased in the high-producing strain. Some gene products were shown to be directly controlling beta-lactam output. Many key cellular transport processes involving penicillins and intermediates remain to be characterized at the molecular level. Genes predicted to encode transporters were strongly overrepresented among the genes transcriptionally upregulated under conditions that stimulate penicillinG production, illustrating potential for future genomics-driven metabolic engineering.

...read moreread less

457 citations

Journal Article•DOI•

Comparative genomics of citric-acid-producing Aspergillus niger ATCC 1015 versus enzyme-producing CBS 513.88

[...]

Mikael Rørdam Andersen¹, Margarita Salazar, Peter J. Schaap, Peter J. I. van de Vondervoort², David E. Culley³, Jette Thykaer, Jens Christian Frisvad, Kristian Fog Nielsen, Richard Albang⁴, Kaj Albermann⁴, Randy M. Berka⁵, Gerhard H. Braus⁶, Susanna A. Braus-Stromeyer⁶, Luis M. Corrochano⁷, Ziyu Dai³, Piet W.M. van Dijck², Gerald Hofmann⁵, Linda L. Lasure³, Jon K. Magnuson³, Hildegard Henna Menke², M. Meijer, Susan Lisette Meijer, Jakob Blæsbjerg Nielsen, Michael Lynge Nielsen, Albert J. J. van Ooyen², Herman Jan Pel², Lars Kongsbak Poulsen, Rob Samson, Hein Stam², Adrian Tsang⁸, Johannes Maarten Van Den Brink⁹, Alex Atkins¹⁰, Andrea Aerts¹⁰, Harris Shapiro¹⁰, Jasmyn Pangilinan¹⁰, Asaf Salamov¹⁰, Yigong Lou¹⁰, Erika Lindquist¹⁰, Susan Lucas¹⁰, Jane Grimwood¹¹, Igor V. Grigoriev¹⁰, Christian P. Kubicek¹², Diego Martinez¹³, Noël Nicolaas Maria Elisabeth Van Peij², Johannes Andries Roubos², Jens Nielsen, Scott E. Baker³ - Show less +43 more•Institutions (13)

Technical University of Denmark¹, DSM², Pacific Northwest National Laboratory³, Biomax Informatics AG⁴, Novozymes⁵, University of Göttingen⁶, University of Seville⁷, Concordia University⁸, Chr. Hansen⁹, United States Department of Energy¹⁰, Stanford University¹¹, Vienna University of Technology¹², Los Alamos National Laboratory¹³

01 Jun 2011-Genome Research

TL;DR: In this article, the authors performed whole-genome sequencing of the Aspergillus niger wild-type strain (ATCC 1015) and produced a genome sequence of very high quality.

...read moreread less

Abstract: The filamentous fungus Aspergillus niger exhibits great diversity in its phenotype. It is found globally, both as marine and terrestrial strains, produces both organic acids and hydrolytic enzymes in high amounts, and some isolates exhibit pathogenicity. Although the genome of an industrial enzyme-producing A. niger strain (CBS 513.88) has already been sequenced, the versatility and diversity of this species compel additional exploration. We therefore undertook whole-genome sequencing of the acidogenic A. niger wild-type strain (ATCC 1015) and produced a genome sequence of very high quality. Only 15 gaps are present in the sequence, and half the telomeric regions have been elucidated. Moreover, sequence information from ATCC 1015 was used to improve the genome sequence of CBS 513.88. Chromosome-level comparisons uncovered several genome rearrangements, deletions, a clear case of strain-specific horizontal gene transfer, and identification of 0.8 Mb of novel sequence. Single nucleotide polymorphisms per kilobase (SNPs/kb) between the two strains were found to be exceptionally high (average: 7.8, maximum: 160 SNPs/kb). High variation within the species was confirmed with exo-metabolite profiling and phylogenetics. Detailed lists of alleles were generated, and genotypic differences were observed to accumulate in metabolic pathways essential to acid production and protein synthesis. A transcriptome analysis supported up-regulation of genes associated with biosynthesis of amino acids that are abundant in glucoamylase A, tRNA-synthases, and protein transporters in the protein producing CBS 513.88 strain. Our results and data sets from this integrative systems biology analysis resulted in a snapshot of fungal evolution and will support further optimization of cell factories based on filamentous fungi.

...read moreread less

308 citations

Comparative genomics of citric-acid producing Aspergillus niger ATCC 1015 versus enzyme-producing CBS 513.88 [working title: Aspergillus niger strain evolution]

[...]

Susan Lucas, Igor V. Grigoriev, Harris Shapiro, Andrea Aerts, Alex Atkins, Asaf Salamov, Erika Lindquist, Jasmyn Pangilinan, Yunian Lou, Jane Grimwood, Mikael Rørdam Andersen, Margarita Salazar, Peter J. Schaap, P de Vondervoot, David E. Culley, Scott E. Baker, Jette Thykaer, Jens Christian Frisvad, Kristian Fog Nielsen, Richard Albang, Kaj Albermann, Randy M. Berka, Gerhard H. Braus, Susanna A. Braus-Stromeyer, Luis M. Corrochano, Ziyu Dai, P van Dijck, Gerald Hofmann, Linda L. Lasure, J Magnusson, Susan Lisette Meijer, Jens Nielsen, Michael Lynge Nielsen, A van Ooyen, K Panther, Herman Jan Pel, Lars Kongsbak Poulsen, Rob Samson, Hein Stam, Adrian Tsang, J den Brink, Christian P. Kubicek, Diego Martinez, N van Peij, Johannes Andries Roubos - Show less +41 more

29 Apr 2011

TL;DR: In this paper, the authors performed whole-genome sequencing of the Aspergillus niger wild-type strain (ATCC 1015) and produced a genome sequence of very high quality.

...read moreread less

306 citations

Journal Article•DOI•

Learning fuzzy classification rules from labeled data

[...]

Johannes Andries Roubos¹, M. Setnes, János Abonyi•Institutions (1)

Delft University of Technology¹

01 Mar 2003

TL;DR: An iterative approach for developing fuzzy classifiers is proposed and the initial model is derived from the data and subsequently, feature selection and rule-base simplification are applied to reduce the model, while a genetic algorithm is used for parameter optimization.

...read moreread less

Abstract: The automatic design of fuzzy rule-based classification systems based on labeled data is considered. It is recognized that both classification performance and interpretability are of major importance and effort is made to keep the resulting rule bases small and comprehensible. For this purpose, an iterative approach for developing fuzzy classifiers is proposed. The initial model is derived from the data and subsequently, feature selection and rule-base simplification are applied to reduce the model, while a genetic algorithm is used for parameter optimization. An application to the Wine data classification problem is shown.

...read moreread less

193 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17

Collapse

Cited by

PDF

Open Access

More filters

SPAdes, a new genome assembly algorithm and its applications to single-cell sequencing ( 7th Annual SFAF Meeting, 2012)

[...]

Glenn Tesler

01 Jun 2012

TL;DR: SPAdes as mentioned in this paper is a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler and on popular assemblers Velvet and SoapDeNovo (for multicell data).

...read moreread less

Abstract: The lion's share of bacteria in various environments cannot be cloned in the laboratory and thus cannot be sequenced using existing technologies. A major goal of single-cell genomics is to complement gene-centric metagenomic data with whole-genome assemblies of uncultivated organisms. Assembly of single-cell data is challenging because of highly non-uniform read coverage as well as elevated levels of sequencing errors and chimeric reads. We describe SPAdes, a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler (specialized for single-cell data) and on popular assemblers Velvet and SoapDeNovo (for multicell data). SPAdes generates single-cell assemblies, providing information about genomes of uncultivatable bacteria that vastly exceeds what may be obtained via traditional metagenomics studies. SPAdes is available online ( http://bioinf.spbau.ru/spades ). It is distributed as open source software.

...read moreread less

10,124 citations

Integrative Genomics Viewer

[...]

James T. Robinson¹, Helga Thorvaldsdottir¹, Wendy Winckler¹, Mitchell Guttman¹, Eric S. Lander¹, Eric S. Lander², Gad Getz¹, Jill P. Mesirov¹ - Show less +4 more•Institutions (2)

Massachusetts Institute of Technology¹, Harvard University²

01 Jan 2011

TL;DR: The sheer volume and scope of data posed by this flood of data pose a significant challenge to the development of efficient and intuitive visualization tools able to scale to very large data sets and to flexibly integrate multiple data types, including clinical data.

...read moreread less

Abstract: Rapid improvements in sequencing and array-based platforms are resulting in a flood of diverse genome-wide data, including data from exome and whole-genome sequencing, epigenetic surveys, expression profiling of coding and noncoding RNAs, single nucleotide polymorphism (SNP) and copy number profiling, and functional assays. Analysis of these large, diverse data sets holds the promise of a more comprehensive understanding of the genome and its relation to human disease. Experienced and knowledgeable human review is an essential component of this process, complementing computational approaches. This calls for efficient and intuitive visualization tools able to scale to very large data sets and to flexibly integrate multiple data types, including clinical data. However, the sheer volume and scope of data pose a significant challenge to the development of such tools.

...read moreread less

2,187 citations

Journal Article•DOI•

Genome sequencing and analysis of the biomass-degrading fungus Trichoderma reesei (syn. Hypocrea jecorina).

[...]

Diego Martinez¹, Diego Martinez², Randy M. Berka³, Bernard Henrissat⁴, Markku Saloheimo⁵, Mikko Arvas⁵, Scott E. Baker⁶, Jarod Chapman⁷, Olga Chertkov², Pedro M. Coutinho⁴, Dan Cullen⁸, Etienne Danchin⁴, Igor V. Grigoriev⁷, Paul Harris³, Melissa Jackson², Christian P. Kubicek⁹, Cliff Han², Isaac Ho⁷, Luis F. Larrondo¹⁰, Alfredo Lopez de Leon³, Jon K. Magnuson⁶, Sandy Merino³, Monica Misra², Beth Nelson³, Nicholas H. Putnam⁷, Barbara Robbertse¹¹, Asaf Salamov⁷, Monika Schmoll⁹, Astrid Terry⁷, Nina Thayer², Ann Westerholm-Parvinen⁵, Conrad L. Schoch¹¹, Jian Yao¹², Ravi D. Barabote², Mary Anne Nelson¹, Chris Detter², David Bruce², Cheryl R. Kuske², Gary Xie², Paul G. Richardson⁷, Daniel S. Rokhsar⁷, Susan Lucas⁷, Edward M. Rubin⁷, Nigel Dunn-Coleman, Michael Ward¹², Thomas Brettin⁷ - Show less +42 more•Institutions (12)

University of New Mexico¹, Los Alamos National Laboratory², Novozymes³, University of Provence⁴, VTT Technical Research Centre of Finland⁵, Pacific Northwest National Laboratory⁶, Joint Genome Institute⁷, United States Department of Agriculture⁸, Vienna University of Technology⁹, Pontifical Catholic University of Chile¹⁰, Oregon State University¹¹, Genencor¹²

01 May 2008-Nature Biotechnology

TL;DR: This work assembled 89 scaffolds to generate 34 Mbp of nearly contiguous T. reesei genome sequence comprising 9,129 predicted gene models, providing a roadmap for constructing enhanced T.Reesei strains for industrial applications such as biofuel production.

...read moreread less

Abstract: Trichoderma reesei is the main industrial source of cellulases and hemicellulases used to depolymerize biomass to simple sugars that are converted to chemical intermediates and biofuels, such as ethanol. We assembled 89 scaffolds (sets of ordered and oriented contigs) to generate 34 Mbp of nearly contiguous T. reesei genome sequence comprising 9,129 predicted gene models. Unexpectedly, considering the industrial utility and effectiveness of the carbohydrate-active enzymes of T. reesei, its genome encodes fewer cellulases and hemicellulases than any other sequenced fungus able to hydrolyze plant cell wall polysaccharides. Many T. reesei genes encoding carbohydrate-active enzymes are distributed nonrandomly in clusters that lie between regions of synteny with other Sordariomycetes. Numerous genes encoding biosynthetic pathways for secondary metabolites may promote survival of T. reesei in its competitive soil habitat, but genome analysis provided little mechanistic insight into its extraordinary capacity for protein secretion. Our analysis, coupled with the genome sequence data, provides a roadmap for constructing enhanced T. reesei strains for industrial applications such as biofuel production.

...read moreread less

1,085 citations

Journal Article•DOI•

Data-driven smart manufacturing

[...]

Fei Tao¹, Qinglin Qi¹, Ang Liu², Andrew Kusiak³•Institutions (3)

Beihang University¹, University of New South Wales², University of Iowa³

01 Jul 2018-Journal of Manufacturing Systems

TL;DR: The role of big data in supporting smart manufacturing is discussed, a historical perspective to data lifecycle in manufacturing is overviewed, and a conceptual framework proposed in the paper is proposed.

...read moreread less

937 citations

Journal Article•DOI•

Genetic circuit design automation

[...]

Alec A. K. Nielsen¹, Bryan S. Der², Bryan S. Der¹, Jonghyeon Shin¹, Prashant Vaidyanathan², Vanya Paralanov³, Elizabeth A. Strychalski³, David J. Ross³, Douglas Densmore², Christopher A. Voigt¹ - Show less +6 more•Institutions (3)

Massachusetts Institute of Technology¹, Boston University², National Institute of Standards and Technology³

01 Apr 2016-Science

TL;DR: Electronic design automation principles from EDA are applied to enable increased circuit complexity and to simplify the incorporation of synthetic gene regulation into genetic engineering projects, and it is demonstrated that engineering principles can be applied to identify and suppress errors that complicate the compositions of larger systems.

...read moreread less

Abstract: INTRODUCTION Cells respond to their environment, make decisions, build structures, and coordinate tasks. Underlying these processes are computational operations performed by networks of regulatory proteins that integrate signals and control the timing of gene expression. Harnessing this capability is critical for biotechnology projects that require decision-making, control, sensing, or spatial organization. It has been shown that cells can be programmed using synthetic genetic circuits composed of regulators organized to generate a desired operation. However, the construction of even simple circuits is time-intensive and unreliable. RATIONALE Electronic design automation (EDA) was developed to aid engineers in the design of semiconductor-based electronics. In an effort to accelerate genetic circuit design, we applied principles from EDA to enable increased circuit complexity and to simplify the incorporation of synthetic gene regulation into genetic engineering projects. We used the hardware description language Verilog to enable a user to describe a circuit function. The user also specifies the sensors, actuators, and “user constraints file” (UCF), which defines the organism, gate technology, and valid operating conditions. Cello (www.cellocad.org) uses this information to automatically design a DNA sequence encoding the desired circuit. This is done via a set of algorithms that parse the Verilog text, create the circuit diagram, assign gates, balance constraints to build the DNA, and simulate performance. RESULTS Cello designs circuits by drawing upon a library of Boolean logic gates. Here, the gate technology consists of NOT/NOR logic based on repressors. Gate connection is simplified by defining the input and output signals as RNA polymerase (RNAP) fluxes. We found that the gates need to be insulated from their genetic context to function reliably in the context of different circuits. Each gate is isolated using strong terminators to block RNAP leakage, and input interchangeability is improved using ribozymes and promoter spacers. These parts are varied for each gate to avoid breakage due to recombination. Measuring the load of each gate and incorporating this into the optimization algorithms further reduces evolutionary pressure. Cello was applied to the design of 60 circuits for Escherichia coli , where the circuit function was specified using Verilog code and transformed to a DNA sequence. The DNA sequences were built as specified with no additional tuning, requiring 880,000 base pairs of DNA assembly. Of these, 45 circuits performed correctly in every output state (up to 10 regulators and 55 parts). Across all circuits, 92% of the 412 output states functioned as predicted. CONCLUSION Our work constitutes a hardware description language for programming living cells. This required the co-development of design algorithms with gates that are sufficiently simple and robust to be connected by automated algorithms. We demonstrate that engineering principles can be applied to identify and suppress errors that complicate the compositions of larger systems. This approach leads to highly repetitive and modular genetics, in stark contrast to the encoding of natural regulatory networks. The use of a hardware-independent language and the creation of additional UCFs will allow a single design to be transformed into DNA for different organisms, genetic endpoints, operating conditions, and gate technologies.

...read moreread less

813 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse