scispace - formally typeset
Search or ask a question
Author

Junwei Zhu

Bio: Junwei Zhu is an academic researcher from Beijing Institute of Genomics. The author has contributed to research in topics: Population & Information repository. The author has an hindex of 13, co-authored 26 publications receiving 1116 citations.

Papers
More filters
Journal ArticleDOI
TL;DR: In the era of big data, GSA is not only an important complement to existing INSDC members by alleviating the increasing burdens of handling sequence data deluge, but also takes the significant responsibility for global big data archive and provides free unrestricted access to all publicly available data in support of research activities throughout the world.

485 citations

Journal ArticleDOI
TL;DR: The Genome Sequence Archive (GSA) as discussed by the authors is a data repository for archiving raw sequence data, which provides data storage and sharing services for worldwide scientific communities, considering explosive data growth with diverse data types, here we present the GSA family by expanding into a set of resources for raw data archive with different purposes.

269 citations

Journal ArticleDOI
TL;DR: In the past year, efforts for update have been mainly devoted to BioProject, BioSample, GSA, GWH, GVM, NONCODE, LncBook, EWAS Atlas and IC4R, while four web services have been significantly improved and enhanced.
Abstract: The National Genomics Data Center (NGDC) provides a suite of database resources to support worldwide research activities in both academia and industry. With the rapid advancements in higher-throughput and lower-cost sequencing technologies and accordingly the huge volume of multi-omics data generated at exponential scales and rates, NGDC is continually expanding, updating and enriching its core database resources through big data integration and value-added curation. In the past year, efforts for update have been mainly devoted to BioProject, BioSample, GSA, GWH, GVM, NONCODE, LncBook, EWAS Atlas and IC4R. Newly released resources include three human genome databases (PGG. SNV, PGG. Han and CGVD), eLMSG, EWAS Data Hub, GWAS Atlas, iSheep and PADS Arsenal. In addition, four web services, namely, eGPS Cloud, BIG Search, BIG Submission and BIG SSO, have been significantly improved and enhanced.

181 citations

DOI
20 Feb 2020
TL;DR: Comprehensive descriptions on data deposition, management, release and utility in 2019nCoVR are provided, laying important foundations in aid of studies on virus classification and origin, genome variation and evolution, fast detection, drug development and pneumonia precision prevention and therapy.
Abstract: An ongoing outbreak of a novel coronavirus infection in Wuhan, China since December 2019 has led to 31,516 infected persons and 638 deaths across 25 countries (till 16:00 on February 7, 2020). The virus causing this pneumonia was then named as the 2019 novel coronavirus (2019-nCoV) by the World Health Organization. To promote the data sharing and make all relevant information of 2019-nCoV publicly available, we construct the 2019 Novel Coronavirus Resource (2019nCoVR, https://bigd.big.ac.cn/ncov). 2019nCoVR features comprehensive integration of genomic and proteomic sequences as well as their metadata information from the Global Initiative on Sharing All Influenza Data, National Center for Biotechnology Information, China National GeneBank, National Microbiology Data Center and China National Center for Bioinformation (CNCB)/National Genomics Data Center (NGDC). It also incorporates a wide range of relevant information including scientific literatures, news, and popular articles for science dissemination, and provides visualization functionalities for genome variation analysis results based on all collected 2019-nCoV strains. Moreover, by linking seamlessly with related databases in CNCB/NGDC, 2019nCoVR offers virus data submission and sharing services for raw sequence reads and assembled sequences. In this report, we provide comprehensive descriptions on data deposition, management, release and utility in 2019nCoVR, laying important foundations in aid of studies on virus classification and origin, genome variation and evolution, fast detection, drug development and pneumonia precision prevention and therapy.

168 citations

Journal ArticleDOI
TL;DR: The BIG Data Center at Beijing Institute of Genomics provides a suite of database resources in support of worldwide research activities in both academia and industry, and the Open Biodiversity and Health Big Data initiative is introduced.
Abstract: The BIG Data Center at Beijing Institute of Genomics (BIG) of the Chinese Academy of Sciences provides a suite of database resources in support of worldwide research activities in both academia and industry. With the vast amounts of multi-omics data generated at unprecedented scales and rates, the BIG Data Center is continually expanding, updating and enriching its core database resources through big data integration and value-added curation. Resources with significant updates in the past year include BioProject (a biological project library), BioSample (a biological sample library), Genome Sequence Archive (GSA, a data repository for archiving raw sequence reads), Genome Warehouse (GWH, a centralized resource housing genome-scale data), Genome Variation Map (GVM, a public repository of genome variations), Science Wikis (a catalog of biological knowledge wikis for community annotations) and IC4R (Information Commons for Rice). Newly released resources include EWAS Atlas (a knowledgebase of epigenome-wide association studies), iDog (an integrated omics data resource for dog) and RNA editing resources (for editome-disease associations and plant RNA editosome, respectively). To promote biodiversity and health big data sharing around the world, the Open Biodiversity and Health Big Data (BHBD) initiative is introduced. All of these resources are publicly accessible at http://bigd.big.ac.cn.

147 citations


Cited by
More filters
Journal Article
Fumio Tajima1
30 Oct 1989-Genomics
TL;DR: It is suggested that the natural selection against large insertion/deletion is so weak that a large amount of variation is maintained in a population.

11,521 citations

Journal ArticleDOI
TL;DR: The basic virology of SARS-CoV-2 is described, including genomic characteristics and receptor use, highlighting its key difference from previously known coronaviruses.
Abstract: Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a highly transmissible and pathogenic coronavirus that emerged in late 2019 and has caused a pandemic of acute respiratory disease, named ‘coronavirus disease 2019’ (COVID-19), which threatens human health and public safety. In this Review, we describe the basic virology of SARS-CoV-2, including genomic characteristics and receptor use, highlighting its key difference from previously known coronaviruses. We summarize current knowledge of clinical, epidemiological and pathological features of COVID-19, as well as recent progress in animal models and antiviral treatment approaches for SARS-CoV-2 infection. We also discuss the potential wildlife hosts and zoonotic origin of this emerging virus in detail. In this Review, Shi and colleagues summarize the exceptional amount of research that has characterized acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and coronavirus disease 2019 (COVID-19) since this virus has swept around the globe. They discuss what we know so far about the emergence and virology of SARS-CoV-2 and the pathogenesis and treatment of COVID-19.

2,904 citations

Journal ArticleDOI
TL;DR: The results reveal distinct host inflammatory cytokine profiles to SARS-CoV-2 infection in patients, and highlight the association between COVID-19 pathogenesis and excessive cytokine release such as CCL2/MCP-1, CXCL10/IP-10, CCL3/MIP-1A, and CCL4/Mip1B.
Abstract: Circulating in China and 158 other countries and areas, the ongoing COVID-19 outbreak has caused devastating mortality and posed a great threat to public health. However, efforts to identify effectively supportive therapeutic drugs and treatments has been hampered by our limited understanding of host immune response for this fatal disease. To characterize the transcriptional signatures of host inflammatory response to SARS-CoV-2 (HCoV-19) infection, we carried out transcriptome sequencing of the RNAs isolated from the bronchoalveolar lavage fluid (BALF) and peripheral blood mononuclear cells (PBMC) specimens of COVID-19 patients. Our results reveal distinct host inflammatory cytokine profiles to SARS-CoV-2 infection in patients, and highlight the association between COVID-19 pathogenesis and excessive cytokine release such as CCL2/MCP-1, CXCL10/IP-10, CCL3/MIP-1A, and CCL4/MIP1B. Furthermore, SARS-CoV-2 induced activation of apoptosis and P53 signalling pathway in lymphocytes may be the cause of patients' lymphopenia. The transcriptome dataset of COVID-19 patients would be a valuable resource for clinical guidance on anti-inflammatory medication and understanding the molecular mechansims of host response.

918 citations

Journal ArticleDOI
TL;DR: The expression of proinflammatory genes, especially chemokines, was markedly elevated in COVID-19 cases compared to community-acquired pneumonia patients and healthy controls, suggesting that SARS-CoV-2 infection causes hypercytokinemia.

767 citations

Journal ArticleDOI
22 Feb 2020-Viruses
TL;DR: The goal of this work is to systematically review and compare between SARS-CoV and SARS -CoV in the context of their virus incubation, originations, diagnosis and treatment methods, genomic and proteomic sequences, and pathogenic mechanisms.
Abstract: After the outbreak of the severe acute respiratory syndrome (SARS) in the world in 2003, human coronaviruses (HCoVs) have been reported as pathogens that cause severe symptoms in respiratory tract infections. Recently, a new emerged HCoV isolated from the respiratory epithelium of unexplained pneumonia patients in the Wuhan seafood market caused a major disease outbreak and has been named the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). This virus causes acute lung symptoms, leading to a condition that has been named as “coronavirus disease 2019” (COVID-19). The emergence of SARS-CoV-2 and of SARS-CoV caused widespread fear and concern and has threatened global health security. There are some similarities and differences in the epidemiology and clinical features between these two viruses and diseases that are caused by these viruses. The goal of this work is to systematically review and compare between SARS-CoV and SARS-CoV-2 in the context of their virus incubation, originations, diagnosis and treatment methods, genomic and proteomic sequences, and pathogenic mechanisms.

559 citations