scispace - formally typeset
Open AccessJournal ArticleDOI

HMMER web server: interactive sequence similarity searching

TLDR
This work has focused on minimizing search times and the ability to rapidly display tabular results, regardless of the number of matches found, developing graphical summaries of the search results to provide quick, intuitive appraisement of them.
Abstract
HMMER is a software suite for protein sequence similarity searches using probabilistic methods Previously, HMMER has mainly been available only as a computationally intensive UNIX command-line tool, restricting its use Recent advances in the software, HMMER3, have resulted in a 100-fold speed gain relative to previous versions It is now feasible to make efficient profile hidden Markov model (profile HMM) searches via the web A HMMER web server (http://hmmerjaneliaorg) has been designed and implemented such that most protein database searches return within a few seconds Methods are available for searching either a single protein sequence, multiple protein sequence alignment or profile HMM against a target sequence database, and for searching a protein sequence against Pfam The web server is designed to cater to a range of different user expertise and accepts batch uploading of multiple queries at once All search methods are also available as RESTful web services, thereby allowing them to be readily integrated as remotely executed tasks in locally scripted workflows We have focused on minimizing search times and the ability to rapidly display tabular results, regardless of the number of matches found, developing graphical summaries of the search results to provide quick, intuitive appraisement of them

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

MAFFT online service: Multiple sequence alignment, interactive sequence choice and visualization

TL;DR: The Web interface for recently developed options for large data and interactive usage to refine sequence data sets and MSAs for multiple sequence alignment are explained.
Journal ArticleDOI

Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses

TL;DR: It is found that lincRNA expression is strikingly tissue-specific compared with coding genes, and that l incRNAs are typically coexpressed with their neighboring genes, albeit to an extent similar to that of pairs of neighboring protein-coding genes.
Journal ArticleDOI

JPred4: a protein secondary structure prediction server

TL;DR: JPred4 as discussed by the authors is the latest version of the popular JPred protein secondary structure prediction server which provides predictions by the JNet algorithm, one of the most accurate methods for secondary structure predictions.
Journal ArticleDOI

MEROPS: the database of proteolytic enzymes, their substrates and inhibitors

TL;DR: The MEROPS database has been expanded to include proteolytic enzymes other than peptidases, and the inclusion of small-molecule inhibitors in the tables of peptidase–inhibitor interactions is included.
Journal ArticleDOI

dbCAN2: a meta server for automated carbohydrate-active enzyme annotation

TL;DR: The dbCAN2 (http://cys.bios.niu.edu/dbCAN2) as mentioned in this paper is an updated meta server, which integrates three state-of-the-art tools for CAZome annotation: (i) HMMER search against the dbCAN HMM (hidden Markov model) database; (ii) DIAMOND search against CAZy pre-annotated CAZY sequence database and (iii) Hotpep search against conserved CAZys short peptide database.
References
More filters
Journal ArticleDOI

Basic Local Alignment Search Tool

TL;DR: A new approach to rapid sequence comparison, basic local alignment search tool (BLAST), directly approximates alignments that optimize a measure of local similarity, the maximal segment pair (MSP) score.
Journal ArticleDOI

Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

TL;DR: A new criterion for triggering the extension of word hits, combined with a new heuristic for generating gapped alignments, yields a gapped BLAST program that runs at approximately three times the speed of the original.
Journal ArticleDOI

The Pfam protein families database

TL;DR: The definition and use of family-specific, manually curated gathering thresholds are explained and some of the features of domains of unknown function (also known as DUFs) are discussed, which constitute a rapidly growing class of families within Pfam.
Journal ArticleDOI

Database resources of the National Center for Biotechnology Information

TL;DR: In addition to maintaining the GenBank(R) nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides data analysis and retrieval resources for the data in GenBank and other biological data made available through NCBI’s website.
Journal ArticleDOI

Pfam: the protein families database.

TL;DR: Pfam as discussed by the authors is a widely used database of protein families, containing 14 831 manually curated entries in the current version, version 27.0, and has been updated several times since 2012.
Related Papers (5)