Database Resources of the National Genomics Data Center, China National Center for Bioinformation in 2022.
TLDR
The National Genomics Data Center (NGDC), part of the China National Center for Bioinformation (CNCB), provides a family of database resources to support global research in both academia and industry.Abstract:
The National Genomics Data Center (NGDC), part of the China National Center for Bioinformation (CNCB), provides a family of database resources to support global research in both academia and industry. With the explosively accumulated multi-omics data at ever-faster rates, CNCB-NGDC is constantly scaling up and updating its core database resources through big data archive, curation, integration and analysis. In the past year, efforts have been made to synthesize the growing data and knowledge, particularly in single-cell omics and precision medicine research, and a series of resources have been newly developed, updated and enhanced. Moreover, CNCB-NGDC has continued to daily update SARS-CoV-2 genome sequences, variants, haplotypes and literature. Particularly, OpenLB, an open library of bioscience, has been established by providing easy and open access to a substantial number of abstract texts from PubMed, bioRxiv and medRxiv. In addition, Database Commons is significantly updated by cataloguing a full list of global databases, and BLAST tools are newly deployed to provide online sequence search services. All these resources along with their services are publicly accessible at https://ngdc.cncb.ac.cn.read more
Citations
More filters
Journal ArticleDOI
KaKs_Calculator 3.0: Calculating Selective Pressure on Coding and Non-coding Sequences
TL;DR: KaKs_Calculator 3.0 as mentioned in this paper is an updated toolkit that is capable of calculating selective pressure on both coding and non-coding sequences, similar to the nonsynonymous/synonymous substitution rate ratio for coding sequences.
Journal ArticleDOI
EWAS Open Platform: integrated data, knowledge and toolkit for epigenome-wide association study.
Zhuang Xiong,Zhuang Xiong,Fei Yang,Fei Yang,Mengwei Li,Yingke Ma,Wei Zhao,Wei Zhao,Guo-Liang Wang,Guo-Liang Wang,Zhaohua Li,Zhaohua Li,Xinchang Zheng,Dong Zou,Wenting Zong,Wenting Zong,Hongen Kang,Hongen Kang,Yaokai Jia,Rujiao Li,Zhang Zhang,Zhang Zhang,Yiming Bao,Yiming Bao +23 more
TL;DR: In this paper, the authors present EWAS Open Platform (https://ngdc.cncb.ac.cn/ewas) that includes EWAS Atlas, EWAS Data Hub and the newly developed EWAS Toolkit.
Journal ArticleDOI
The genome of the rice variety LTH provides insight into its universal susceptibility mechanism to worldwide rice blast fungal strains
Lei Yang,Mengfei Zhao,Gan Sha,Qi Long Sun,Qiuwen Gong,Qun Yang,Kabin Xie,Meng Yuan,Jennifer C. Mortimer,Weibo Xie,Tong Wei,Zhensheng Kang,Guotian Li +12 more
TL;DR: Wang et al. as mentioned in this paper found that weak effector-trigger immunity (ETI)-mediated primarily by Pi genes but not PTI results in the universal susceptibility of Lijiangxintuanheigu to rice blast.
Journal ArticleDOI
Exploring the cellular landscape of circular RNAs using full-length single-cell RNA sequencing
TL;DR: In this article , a collection of 171 full-length single-cell RNA-seq datasets is presented to explore the cellular landscape of circRNAs in human and mouse tissues, and the authors identify a total of 139,643 human and 214,747 mouse circRNA in these scRNA-seq libraries.
Journal ArticleDOI
LncBook 2.0: integrating human long non-coding RNAs with multi-omics annotations
TL;DR: Li et al. as mentioned in this paper presented LncBook 2.0 (https://ngdc.cncb.ac.cn/lncbook), which incorporated 119 722 new transcripts, 9632 new genes, and gene structure update of 21 305 lncRNAs.
References
More filters
Journal ArticleDOI
Basic Local Alignment Search Tool
TL;DR: A new approach to rapid sequence comparison, basic local alignment search tool (BLAST), directly approximates alignments that optimize a measure of local similarity, the maximal segment pair (MSP) score.
Journal ArticleDOI
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.
Stephen F. Altschul,Thomas L. Madden,Alejandro A. Schäffer,Jinghui Zhang,Zheng Zhang,Webb Miller,David J. Lipman +6 more
TL;DR: A new criterion for triggering the extension of word hits, combined with a new heuristic for generating gapped alignments, yields a gapped BLAST program that runs at approximately three times the speed of the original.
Journal ArticleDOI
Highly accurate protein structure prediction with AlphaFold
John M. Jumper,Richard O. Evans,Alexander Pritzel,Tim Green,Michael Figurnov,Olaf Ronneberger,Kathryn Tunyasuvunakool,Russell Bates,Augustin Žídek,Anna Potapenko,Alex Bridgland,Clemens Meyer,Simon A. A. Kohl,Andrew J. Ballard,Andrew Cowie,Bernardino Romera-Paredes,Stanislav Nikolov,R. D. Jain,Jonas Adler,Trevor Back,Stig Petersen,David Reiman,Ellen Clancy,Michal Zielinski,Martin Steinegger,Michalina Pacholska,Tamas Berghammer,Sebastian Bodenstein,David L. Silver,Oriol Vinyals,Andrew W. Senior,Koray Kavukcuoglu,Pushmeet Kohli,Demis Hassabis +33 more
TL;DR: For example, AlphaFold as mentioned in this paper predicts protein structures with an accuracy competitive with experimental structures in the majority of cases using a novel deep learning architecture. But the accuracy is limited by the fact that no homologous structure is available.
Journal ArticleDOI
The EMBL-EBI search and sequence analysis tools APIs in 2019
Fábio Madeira,Youngmi Park,Joon Lee,Nicola Buso,Tamer Gur,Nandana Madhusoodanan,Prasad Basutkar,Adrian R N Tivey,Simon C. Potter,Robert D. Finn,Rodrigo Lopez +10 more
TL;DR: The latest improvements made to the frameworks which enhance the interconnectivity between public EMBL-EBI resources and ultimately enhance biological data discoverability, accessibility, interoperability and reusability are described.
Journal ArticleDOI
A dynamic nomenclature proposal for SARS-CoV-2 lineages to assist genomic epidemiology.
Andrew Rambaut,Edward C. Holmes,Áine O'Toole,Verity Hill,John T. McCrone,Christopher Ruis,Louis du Plessis,Oliver G. Pybus +7 more
TL;DR: A rational and dynamic virus nomenclature that uses a phylogenetic framework to identify those lineages that contribute most to active spread and is designed to provide a real-time bird’s-eye view of the diversity of the hundreds of thousands of genome sequences collected worldwide.