Ensembl Genomes 2018: an integrated omics infrastructure for non-vertebrate species.

doi:10.1093/NAR/GKX1011

Open AccessJournal ArticleDOI

Ensembl Genomes 2018: an integrated omics infrastructure for non-vertebrate species.

Paul J. Kersey, +46 more

- 04 Jan 2018 -

Nucleic Acids Research

- Vol. 46

TLDR

This paper provides an update to the previous publications about the Ensembl Genomes resource, with a focus on recent developments and expansions, including the incorporation of almost 20 000 additional genome sequences and over 35 000 tracks of RNA-Seq data.

Abstract:

Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent set of programmatic and interactive interfaces to a rich range of data including genome sequence, gene models, transcript sequence, genetic variation, and comparative analysis. This paper provides an update to the previous publications about the resource, with a focus on recent developments and expansions. These include the incorporation of almost 20 000 additional genome sequences and over 35 000 tracks of RNA-Seq data, which have been aligned to genomic sequence and made available for visualization. Other advances since 2015 include the release of the database in Resource Description Framework (RDF) format, a large increase in community-derived curation, a new high-performance protein sequence search, additional cross-references, improved annotation of non-protein-coding genes, and the launch of pre-release and archival sites. Collectively, these changes are part of a continuing response to the increasing quantity of publicly-available genome-scale data, and the consequent need to archive, integrate, annotate and disseminate these using automated, scalable methods.

Ensembl Genomes 2018: an integrated omics infrastructure for non-vertebrate species.

Citations

The EMBL-EBI search and sequence analysis tools APIs in 2019

HMMER web server: 2018 update.

The gasdermins, a protein family executing cell death and inflammation

Ensembl variation resources

Genenames.org: the HGNC and VGNC resources in 2019.

References

Basic Local Alignment Search Tool

tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence.

UniProt: the Universal Protein knowledgebase

InterProScan 5: genome-scale protein function classification

Accelerated Profile HMM Searches

Related Papers (5)

The Sequence Alignment/Map format and SAMtools

Trimmomatic: a flexible trimmer for Illumina sequence data

MEGA7: Molecular Evolutionary Genetics Analysis version 7.0 for bigger datasets

Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2

MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability