Open Access Journal Article

SorGSD: updating and expanding the sorghum genome science database with new contents and tools.

Abstract
As the fifth major cereal crop, originating in Africa, sorghum (Sorghum bicolor) has become a key C4 model organism for energy plant research. With the development of high-throughput detection technologies for various omics data, a large amount of multi-dimensional, multi-omics information has accumulated for sorghum. Integrating this information may accelerate genetic research and improve molecular breeding for sorghum agronomic traits. We updated the Sorghum Genome SNP Database (SorGSD) by adding new data and new features, and renamed it the Sorghum Genome Science Database (SorGSD). The original version of SorGSD contained SNPs from 48 sorghum accessions mapped to the reference genome BTx623 (v2.1); the new version has been expanded to 289 sorghum lines with both single nucleotide polymorphisms (SNPs) and small insertions/deletions (INDELs), aligned to the newly assembled and annotated sorghum genome BTx623 (v3.1). Moreover, phenotypic data and panicle pictures of critical accessions are provided in the new version. We implemented new tools for analysis, including ID Conversion, Homologue Search and Genome Browser, and updated the general information related to sorghum research, such as online sorghum resources and literature references. In addition, we deployed a new database infrastructure and redesigned the user interface as one of the Genome Variation Map databases. The new version of SorGSD is freely accessible online at http://ngdc.cncb.ac.cn/sorgsd/. SorGSD comprehensively integrates large-scale genomic variation and phenotypic information, and incorporates online data analysis tools for data mining, genome navigation and analysis. We hope that SorGSD can provide a valuable resource for sorghum researchers to find variations they are interested in and to generate customized high-throughput datasets for further analysis.

Citations

Database Resources of the National Genomics Data Center, China National Center for Bioinformation in 2022.

TL;DR: The National Genomics Data Center (NGDC), part of the China National Center for Bioinformation (CNCB), provides a family of database resources to support global research in both academia and industry.

Genomic footprints of sorghum domestication and breeding selection for multiple end uses.

TL;DR: In this article, population genomics analyses were performed on a worldwide collection of 445 sorghum accessions, covering wild and four end-use subpopulations with diverse agronomic traits.

TeaPVs: a comprehensive genomic variation database for tea plant (Camellia sinensis)

TL;DR: Wang et al. constructed the first tea tree variation web service database, TeaPVs (http://47.106.91:8025/ and http://liushang.top.