scispace - formally typeset
Journal ArticleDOI

GenomeTools: A Comprehensive Software Library for Efficient Processing of Structured Genome Annotations

TLDR
The GenomeTools, a convenient and efficient software library and associated software tools for developing bioinformatics software intended to create, process or convert annotation graphs, strictly follow the annotation graph approach, offering a unified graph-based representation.
Abstract
Genome annotations are often published as plain text files describing genomic features and their subcomponents by an implicit annotation graph. In this paper, we present the GenomeTools, a convenient and efficient software library and associated software tools for developing bioinformatics software intended to create, process or convert annotation graphs. The GenomeTools strictly follow the annotation graph approach, offering a unified graph-based representation. This gives the developer intuitive and immediate access to genomic features and tools for their manipulation. To process large annotation sets with low memory overhead, we have designed and implemented an efficient pull-based approach for sequential processing of annotations. This allows to handle even the largest annotation sets, such as a complete catalogue of human variations. Our object-oriented C-based software library enables a developer to conveniently implement their own functionality on annotation graphs and to integrate it into larger workflows, simultaneously accessing compressed sequence data if required. The careful C implementation of the GenomeTools does not only ensure a light-weight memory footprint while allowing full sequential as well as random access to the annotation graph, but also facilitates the creation of bindings to a variety of script programming languages (like Python and Ruby) sharing the same interface.

read more

Citations
More filters
Posted ContentDOI

Genome dynamics in mosses: Extensive synteny coexists with a highly dynamic gene space

TL;DR: It is suggested that genome dynamics in mosses involves the coexistence of a collinear homologous and a highly dynamic species-specific gene sets, as well as its significance for understanding genome evolution.
Journal ArticleDOI

Near Chromosome-Level Genome Assembly and Annotation of Rhodotorula babjevae Strains Reveals High Intraspecific Divergence

TL;DR: In this article , the genomes of two R. babjevae strains, CBS 7808 (type strain) and DBVPG 8058, at the chromosomal level were resolved.
Journal ArticleDOI

The highly continuous reference genome of a leaf-chimeric red pineapple (Ananas comosus var. bracteatus f. tricolor) provides insights into elaboration of leaf color

TL;DR: The GL1 genome assembly and its comparisons with other 3 pineapple genome assemblies provide a valuable resource for the genetic improvement of pineapple and serve as a model for understanding the genomic basis of important traits in different pineapple varieties and other pan-cereal crops.
Journal ArticleDOI

Genome Sequence of the Diploid Yeast Debaryomyces hansenii TMW 3.1188

TL;DR: Debaryomyces hansenii TMW 3.1188 is a halotolerant diploid yeast that was isolated from lupine moromi fermentation and the 24.77-Mbp genome of a diploids strain of the species D. h Hansenii is reported on.
Book ChapterDOI

Finding and Characterizing Repeats in Plant Genomes

TL;DR: A detailed survey of the available software that can help biologists to scan automatically for repeated structures in sequence data or check hypothetical models intended to characterize their structures can be found in this article .
References
More filters
Book

Design Patterns: Elements of Reusable Object-Oriented Software

TL;DR: The book is an introduction to the idea of design patterns in software engineering, and a catalog of twenty-three common patterns, which most experienced OOP designers will find out they've known about patterns all along.
Book

Introduction to Algorithms

TL;DR: The updated new edition of the classic Introduction to Algorithms is intended primarily for use in undergraduate or graduate courses in algorithms or data structures and presents a rich variety of algorithms and covers them in considerable depth while making their design and analysis accessible to all levels of readers.
Journal ArticleDOI

BEDTools: a flexible suite of utilities for comparing genomic features

TL;DR: A new software suite for the comparison, manipulation and annotation of genomic features in Browser Extensible Data (BED) and General Feature Format (GFF) format, which allows the user to compare large datasets (e.g. next-generation sequencing data) with both public and custom genome annotation tracks.
Journal ArticleDOI

The variant call format and VCFtools

TL;DR: VCFtools is a software suite that implements various utilities for processing VCF files, including validation, merging, comparing and also provides a general Perl API.
Book ChapterDOI

Introduction to Algorithms

Xin-She Yang
TL;DR: This chapter provides an overview of the fundamentals of algorithms and their links to self-organization, exploration, and exploitation.
Related Papers (5)