scispace - formally typeset
Open AccessJournal ArticleDOI

BLAST+: architecture and applications.

TLDR
The new BLAST command-line applications, compared to the current BLAST tools, demonstrate substantial speed improvements for long queries as well as chromosome length database sequences.
Abstract
Sequence similarity searching is a very important bioinformatics task. While Basic Local Alignment Search Tool (BLAST) outperforms exact methods through its use of heuristics, the speed of the current BLAST software is suboptimal for very long queries or database sequences. There are also some shortcomings in the user-interface of the current command-line applications. We describe features and improvements of rewritten BLAST software and introduce new command-line applications. Long query sequences are broken into chunks for processing, in some cases leading to dramatically shorter run times. For long database sequences, it is possible to retrieve only the relevant parts of the sequence, reducing CPU time and memory usage for searches of short queries against databases of contigs or chromosomes. The program can now retrieve masking information for database sequences from the BLAST databases. A new modular software library can now access subject sequence data from arbitrary data sources. We introduce several new features, including strategy files that allow a user to save and reuse their favorite set of options. The strategy files can be uploaded to and downloaded from the NCBI BLAST web site. The new BLAST command-line applications, compared to the current BLAST tools, demonstrate substantial speed improvements for long queries as well as chromosome length database sequences. We have also improved the user interface of the command-line applications.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Analysis of long non-coding RNA expression profiles in gastric cancer

TL;DR: A set of lncRNAs differentially expressed in gastric cancer is identified, providing useful information for discovery of new biomarkers and therapeutic targets in Gastric cancer.
Journal ArticleDOI

Distinct Processing of lncRNAs Contributes to Non-conserved Functions in Stem Cells.

TL;DR: In this paper, the authors reported differing subcellular localization of lncRNAs in human and mouse embryonic stem cells (ESCs) and showed that a significantly higher fraction of lnRNAs is localized in the cytoplasm of hESCs than in mESCs.
Journal ArticleDOI

A high quality assembly of the Nile Tilapia ( Oreochromis niloticus ) genome reveals the structure of two sex determination regions

TL;DR: A significantly improved assembly of the tilapia genome is developed using the latest genome sequencing methods and it is shown how it improves the characterization of two sex determination regions in twotilapia species.
Journal ArticleDOI

Using machine learning to predict antimicrobial MICs and associated genomic features for nontyphoidal Salmonella

TL;DR: A collection of 5,278 nontyphoidal Salmonella genomes was used to generate extreme gradient boosting (XGBoost)-based machine learning models for predicting MICs for 15 antibiotics, showing that highly accurate MIC prediction models can be generated with less than 500 genomes.
Journal ArticleDOI

Unique features of a global human ectoparasite identified through sequencing of the bed bug genome

Joshua B. Benoit, +82 more
TL;DR: Genome sequencing and annotation establish a solid foundation for future research on mechanisms of insecticide resistance, human-bed bug and symbiont–bed bug associations, and unique features of bed bug biology that contribute to the unprecedented success of C. lectularius as a human ectoparasite.
References
More filters
Journal ArticleDOI

Basic Local Alignment Search Tool

TL;DR: A new approach to rapid sequence comparison, basic local alignment search tool (BLAST), directly approximates alignments that optimize a measure of local similarity, the maximal segment pair (MSP) score.
Journal ArticleDOI

Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

TL;DR: A new criterion for triggering the extension of word hits, combined with a new heuristic for generating gapped alignments, yields a gapped BLAST program that runs at approximately three times the speed of the original.
Journal ArticleDOI

BLAT—The BLAST-Like Alignment Tool

TL;DR: How BLAT was optimized is described, which is more accurate and 500 times faster than popular existing tools for mRNA/DNA alignments and 50 times faster for protein alignments at sensitivity settings typically used when comparing vertebrate sequences.
Journal ArticleDOI

Initial sequencing and comparative analysis of the mouse genome.

Robert H. Waterston, +222 more
- 05 Dec 2002 - 
TL;DR: The results of an international collaboration to produce a high-quality draft sequence of the mouse genome are reported and an initial comparative analysis of the Mouse and human genomes is presented, describing some of the insights that can be gleaned from the two sequences.
Journal ArticleDOI

A greedy algorithm for aligning DNA sequences.

TL;DR: A new greedy alignment algorithm is introduced with particularly good performance and it is shown that it computes the same alignment as does a certain dynamic programming algorithm, while executing over 10 times faster on appropriate data.
Related Papers (5)
Trending Questions (1)
Apakah fungsi dari Basic Local Alignment Search Tool?

The function of the Basic Local Alignment Search Tool (BLAST) is to perform sequence similarity searches by comparing a query sequence against a sequence database to find matches.