scispace - formally typeset
Journal ArticleDOI

Identification of novel biomass‐degrading enzymes from genomic dark matter: Populating genomic sequence space with functional annotation

TLDR
A gene context‐based approach can be used to assign function to genes that are otherwise categorized as “genomic dark matter” and to identify biomass‐degrading enzymes that have little sequence similarity to already known cellulases.
Abstract
Although recent nucleotide sequencing technologies have significantly enhanced our understanding of microbial genomes, the function of ∼35% of genes identified in a genome currently remains unknown. To improve the understanding of microbial genomes and consequently of microbial processes it will be crucial to assign a function to this “genomic dark matter.” Due to the urgent need for additional carbohydrate-active enzymes for improved production of transportation fuels from lignocellulosic biomass, we screened the genomes of more than 5,500 microorganisms for hypothetical proteins that are located in the proximity of already known cellulases. We identified, synthesized and expressed a total of 17 putative cellulase genes with insufficient sequence similarity to currently known cellulases to be identified as such using traditional sequence annotation techniques that rely on significant sequence similarity. The recombinant proteins of the newly identified putative cellulases were subjected to enzymatic activity assays to verify their hydrolytic activity towards cellulose and lignocellulosic biomass. Eleven (65%) of the tested enzymes had significant activity towards at least one of the substrates. This high success rate highlights that a gene context-based approach can be used to assign function to genes that are otherwise categorized as “genomic dark matter” and to identify biomass-degrading enzymes that have little sequence similarity to already known cellulases. The ability to assign function to genes that have no related sequence representatives with functional annotation will be important to enhance our understanding of microbial processes and to identify microbial proteins for a wide range of applications. Biotechnol. Bioeng. 2014;111: 1550–1565. © 2014 Wiley Periodicals, Inc.

read more

Citations
More filters
Journal ArticleDOI

Destructuring plant biomass: Focus on fungal and extremophilic cell wall hydrolases

TL;DR: The present review discusses the current research trends on fungal, as well as extremophilic cell wall hydrolases that can withstand extreme physico-chemical conditions required in efficient industrial processes and suggests further research avenues for improving saccharification.
Journal ArticleDOI

The importance of sourcing enzymes from non-conventional fungi for metabolic engineering and biomass breakdown

TL;DR: Select classes of fungal enzymes that are currently in biotechnological use are discussed, and more basal, non-conventional fungi and their underexploited biomass-degrading mechanisms are explored as promising agents in the transition towards a bio-based society.
Journal ArticleDOI

Defining functional diversity for lignocellulose degradation in a microbial community using multi-omics studies.

TL;DR: A combination of mass spectrometry-based proteomics coupled with metatranscriptomics has enabled the identification of a large number of lignocellulose degrading enzymes that can now be further explored for the development of improved enzyme cocktails for the treatment of plant-based feedstocks.
Journal ArticleDOI

Biocatalysts for biomass deconstruction from environmental genomics.

TL;DR: Recent progress in recovering biological devices from environmental genomes from uncultivated microbial communities are reviewed and how this information can be used to build better biorefining ecosystems are considered.
Journal ArticleDOI

Roles of intestinal Parabacteroides in human health and diseases.

TL;DR: The stability of gut microbiota is essential for the host health and Parabacteroides spp.
References
More filters
Journal ArticleDOI

Clustal W and Clustal X version 2.0

TL;DR: The Clustal W and ClUSTal X multiple sequence alignment programs have been completely rewritten in C++ to facilitate the further development of the alignment algorithms in the future and has allowed proper porting of the programs to the latest versions of Linux, Macintosh and Windows operating systems.
Journal ArticleDOI

The Carbohydrate-Active EnZymes database (CAZy): an expert resource for Glycogenomics

TL;DR: The Carbohydrate-Active Enzyme (CAZy) database is a knowledge-based resource specialized in the enzymes that build and breakdown complex carbohydrates and glycoconjugates and has been used to improve the quality of functional predictions of a number genome projects by providing expert annotation.
Journal ArticleDOI

An overview of statistical learning theory

TL;DR: How the abstract learning theory established conditions for generalization which are more general than those discussed in classical statistical paradigms are demonstrated and how the understanding of these conditions inspired new algorithmic approaches to function estimation problems are demonstrated.
Journal ArticleDOI

A genomic perspective on protein families

TL;DR: Comparison of proteins encoded in seven complete genomes from five major phylogenetic lineages and elucidation of consistent patterns of sequence similarities allowed the delineation of 720 clusters of orthologous groups (COGs), which comprise a framework for functional and evolutionary genome analysis.
Journal ArticleDOI

Locating proteins in the cell using TargetP, SignalP and related tools

TL;DR: The properties of three well-known N-terminal sequence motifs directing proteins to the secretory pathway, mitochondria and chloroplasts are described and a brief history of methods to predict subcellular localization based on these sorting signals and other sequence properties are sketched.
Related Papers (5)