Prokka: Rapid Prokaryotic Genome Annotation

doi:10.1093/BIOINFORMATICS/BTU153

Open Access Journal Article

Torsten Seemann

- 15 Jul 2014 -

Bioinformatics

- Vol. 30, Iss: 14, pp 2068-2069

TL;DR

Prokka is a command-line software tool that fully annotates a draft bacterial genome in about 10 min on a typical desktop computer and produces standards-compliant output files for further analysis or viewing in genome browsers.

Abstract:

The multiplex capability and high yield of current-day DNA-sequencing instruments have made bacterial whole genome sequencing a routine affair. The subsequent de novo assembly of reads into contigs has been well addressed. The final step of annotating all relevant genomic features on those contigs can be achieved slowly using existing web- and email-based systems, but these are not applicable for sensitive data or for integration into computational pipelines. Here we introduce Prokka, a command-line software tool to fully annotate a draft bacterial genome in about 10 min on a typical desktop computer. It produces standards-compliant output files for further analysis or viewing in genome browsers.

Availability and implementation: Prokka is implemented in Perl and is freely available under an open source GPLv2 license from http://vicbioinformatics.com/.

Citations

Journal Article

NCBI prokaryotic genome annotation pipeline

Tatiana Tatusova, +9 more

- 19 Aug 2016 -

Nucleic Acids Research

TL;DR: NCBI's new Prokaryotic Genome Annotation Pipeline (PGAP) relies more on sequence similarity when confident comparative data are available, while it relies more on statistical predictions in the absence of external evidence.

Journal Article

Roary: Rapid large-scale prokaryote pan genome analysis

Andrew J. Page, +9 more

- 15 Nov 2015 -

Bioinformatics

TL;DR: Roary is a tool that rapidly builds large-scale pan genomes, identifying the core and accessory genes. It makes construction of the pan genome of thousands of prokaryote samples possible on a standard desktop without compromising on the accuracy of results.

Journal Article

RASTtk: A modular and extensible implementation of the RAST algorithm for building custom annotation pipelines and annotating batches of genomes

Thomas Brettin, +21 more

- 10 Feb 2015 -

Scientific Reports

TL;DR: The RAST tool kit (RASTtk) is a modular version of RAST that enables researchers to build custom annotation pipelines. It offers a choice of software for identifying and annotating genomic features, as well as the ability to add custom features to an annotation job.

Journal Article

A taxonomic note on the genus Lactobacillus: Description of 23 novel genera, emended description of the genus Lactobacillus Beijerinck 1901, and union of Lactobacillaceae and Leuconostocaceae.

Jinshui Zheng, +15 more

- 15 Apr 2020 -

International Journal of Systematic and ...

TL;DR: This study evaluated the taxonomy of Lactobacillaceae and Leuconostocaceae on the basis of whole genome sequences. The proposed reclassification reflects the phylogenetic position of the micro-organisms and groups lactobacilli into robust clades with shared ecological and metabolic properties.

Journal Article

Open-access bacterial population genomics: BIGSdb software, the PubMLST.org website and their applications.

Keith A. Jolley, +2 more

TL;DR: This article describes developments in the BIGSdb software made from publication to June 2018 and shows how the platform realises microbial population genomics for a wide range of applications.

References

Journal Article

The Pfam protein families database

Marco Punta, +15 more

- 01 Jan 2012 -

Nucleic Acids Research

TL;DR: The definition and use of family-specific, manually curated gathering thresholds are explained and some of the features of domains of unknown function (also known as DUFs) are discussed, which constitute a rapidly growing class of families within Pfam.

Journal Article

BLAST+: architecture and applications.

Christiam Camacho, +6 more

- 15 Dec 2009 -

BMC Bioinformatics

TL;DR: The new BLAST command-line applications, compared to the current BLAST tools, demonstrate substantial speed improvements for long queries as well as chromosome length database sequences.

Journal Article

Pfam: the protein families database.

Robert D. Finn, +12 more

- 01 Jan 2014 -

Nucleic Acids Research

TL;DR: Pfam is a widely used database of protein families, containing 14 831 manually curated entries in the current version, version 27.0, and has been updated several times since 2012.

Journal Article

The RAST Server: Rapid Annotations using Subsystems Technology

Ramy K. Aziz, +31 more

- 08 Feb 2008 -

BMC Genomics

TL;DR: The RAST Server is a fully automated service for annotating bacterial and archaeal genomes. It identifies protein-encoding, rRNA and tRNA genes, assigns functions to the genes, predicts which subsystems are represented in the genome, uses this information to reconstruct the metabolic network and makes the output easily downloadable for the user.

Journal Article

SignalP 4.0: discriminating signal peptides from transmembrane regions

Thomas Nordahl Petersen, +5 more

- 01 Oct 2011 -

Nature Methods

TL;DR: SignalP 4.0 was the best signal-peptide predictor for all three organism types but was not in all cases as good as SignalP 3.0 according to cleavage-site sensitivity or signal-peptide correlation when there are no transmembrane proteins present.