scispace - formally typeset
Search or ask a question
Author

Xuanting Jiang

Bio: Xuanting Jiang is an academic researcher. The author has contributed to research in topics: Genome & Genome evolution. The author has an hindex of 14, co-authored 16 publications receiving 4859 citations.

Papers
More filters
Journal ArticleDOI
04 Oct 2012-Nature
TL;DR: The sequencing and assembly of the oyster genome using short reads and a fosmid-pooling strategy and transcriptomes of development and stress response and the proteome of the shell are reported, showing that shell formation in molluscs is more complex than currently understood and involves extensive participation of cells and their exosomes.
Abstract: The Pacific oyster Crassostrea gigas belongs to one of the most species-rich but genomically poorly explored phyla, the Mollusca. Here we report the sequencing and assembly of the oyster genome using short reads and a fosmid-pooling strategy, along with transcriptomes of development and stress response and the proteome of the shell. The oyster genome is highly polymorphic and rich in repetitive sequences, with some transposable elements still actively shaping variation. Transcriptome studies reveal an extensive set of genes responding to environmental stress. The expansion of genes coding for heat shock protein 70 and inhibitors of apoptosis is probably central to the oyster's adaptation to sessile life in the highly stressful intertidal zone. Our analyses also show that shell formation in molluscs is more complex than currently understood and involves extensive participation of cells and their exosomes. The oyster genome sequence fills a void in our understanding of the Lophotrochozoa.

1,806 citations

Journal ArticleDOI
12 May 2016-Nature
TL;DR: It is found that genes that were retained as duplicates after the teleost-specific whole-genome duplication 320 million years ago were not more likely to be retained after the Ss4R, and that the duplicate retention was not influenced to a great extent by the nature of the predicted protein interactions of the gene products.
Abstract: The whole-genome duplication 80 million years ago of the common ancestor of salmonids (salmonid-specific fourth vertebrate whole-genome duplication, Ss4R) provides unique opportunities to learn about the evolutionary fate of a duplicated vertebrate genome in 70 extant lineages. Here we present a high-quality genome assembly for Atlantic salmon (Salmo salar), and show that large genomic reorganizations, coinciding with bursts of transposon-mediated repeat expansions, were crucial for the post-Ss4R rediploidization process. Comparisons of duplicate gene expression patterns across a wide range of tissues with orthologous genes from a pre-Ss4R outgroup unexpectedly demonstrate far more instances of neofunctionalization than subfunctionalization. Surprisingly, we find that genes that were retained as duplicates after the teleost-specific whole-genome duplication 320 million years ago were not more likely to be retained after the Ss4R, and that the duplicate retention was not influenced to a great extent by the nature of the predicted protein interactions of the gene products. Finally, we demonstrate that the Atlantic salmon assembly can serve as a reference sequence for the study of other salmonids for a range of purposes.

852 citations

Journal ArticleDOI
25 Jan 2013-Science
TL;DR: An unexpected concentration of positively selected genes in the DNA damage checkpoint and nuclear factor κB pathways that may be related to the origin of flight, as well as expansion and contraction of important gene families are discovered.
Abstract: Bats are the only mammals capable of sustained flight and are notorious reservoir hosts for some of the world's most highly pathogenic viruses, including Nipah, Hendra, Ebola, and severe acute respiratory syndrome (SARS). To identify genetic changes associated with the development of bat-specific traits, we performed whole-genome sequencing and comparative analyses of two distantly related species, fruit bat Pteropus alecto and insectivorous bat Myotis davidii. We discovered an unexpected concentration of positively selected genes in the DNA damage checkpoint and nuclear factor κB pathways that may be related to the origin of flight, as well as expansion and contraction of important gene families. Comparison of bat genomes with other mammalian species has provided new insights into bat biology and evolution.

514 citations

Journal ArticleDOI
TL;DR: A draft 6.5 Gb genome sequence of Locusta migratoria is presented, which is the largest animal genome sequenced so far, and complex regulatory mechanisms involved in microtubule dynamic-mediated synapse plasticity during phase change are revealed.
Abstract: Locusts are one of the world's most destructive agricultural pests and represent a useful model system in entomology. Here we present a draft 6.5 Gb genome sequence of Locusta migratoria, which is the largest animal genome sequenced so far. Our findings indicate that the large genome size of L. migratoria is likely to be because of transposable element proliferation combined with slow rates of loss for these elements. Methylome and transcriptome analyses reveal complex regulatory mechanisms involved in microtubule dynamic-mediated synapse plasticity during phase change. We find significant expansion of gene families associated with energy consumption and detoxification, consistent with long-distance flight capacity and phytophagy. We report hundreds of potential insecticide target genes, including cys-loop ligand-gated ion channels, G-protein-coupled receptors and lethal genes. The L. migratoria genome sequence offers new insights into the biology and sustainable management of this pest species, and will promote its wide use as a model system.

431 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: The approach to utilizing available RNA-Seq and other data types in the authors' manual curation process for vertebrate, plant, and other species is summarized, and a new direction for prokaryotic genomes and protein name management is described.
Abstract: The RefSeq project at the National Center for Biotechnology Information (NCBI) maintains and curates a publicly available database of annotated genomic, transcript, and protein sequence records (http://www.ncbi.nlm.nih.gov/refseq/). The RefSeq project leverages the data submitted to the International Nucleotide Sequence Database Collaboration (INSDC) against a combination of computation, manual curation, and collaboration to produce a standard set of stable, non-redundant reference sequences. The RefSeq project augments these reference sequences with current knowledge including publications, functional features and informative nomenclature. The database currently represents sequences from more than 55,000 organisms (>4800 viruses, >40,000 prokaryotes and >10,000 eukaryotes; RefSeq release 71), ranging from a single record to complete genomes. This paper summarizes the current status of the viral, prokaryotic, and eukaryotic branches of the RefSeq project, reports on improvements to data access and details efforts to further expand the taxonomic representation of the collection. We also highlight diverse functional curation initiatives that support multiple uses of RefSeq data including taxonomic validation, genome annotation, comparative genomics, and clinical testing. We summarize our approach to utilizing available RNA-Seq and other data types in our manual curation process for vertebrate, plant, and other species, and describe a new direction for prokaryotic genomes and protein name management.

4,104 citations

01 Jan 2011
TL;DR: The sheer volume and scope of data posed by this flood of data pose a significant challenge to the development of efficient and intuitive visualization tools able to scale to very large data sets and to flexibly integrate multiple data types, including clinical data.
Abstract: Rapid improvements in sequencing and array-based platforms are resulting in a flood of diverse genome-wide data, including data from exome and whole-genome sequencing, epigenetic surveys, expression profiling of coding and noncoding RNAs, single nucleotide polymorphism (SNP) and copy number profiling, and functional assays. Analysis of these large, diverse data sets holds the promise of a more comprehensive understanding of the genome and its relation to human disease. Experienced and knowledgeable human review is an essential component of this process, complementing computational approaches. This calls for efficient and intuitive visualization tools able to scale to very large data sets and to flexibly integrate multiple data types, including clinical data. However, the sheer volume and scope of data pose a significant challenge to the development of such tools.

2,187 citations

Journal ArticleDOI
TL;DR: This study evaluated the taxonomy of Lactobacillaceae and Leuconostocaceae on the basis of whole genome sequences and proposed reclassification reflects the phylogenetic position of the micro-organisms, and groups lactobacilli into robust clades with shared ecological and metabolic properties.
Abstract: The genus Lactobacillus comprises 261 species (at March 2020) that are extremely diverse at phenotypic, ecological and genotypic levels. This study evaluated the taxonomy of Lactobacillaceae and Leuconostocaceae on the basis of whole genome sequences. Parameters that were evaluated included core genome phylogeny, (conserved) pairwise average amino acid identity, clade-specific signature genes, physiological criteria and the ecology of the organisms. Based on this polyphasic approach, we propose reclassification of the genus Lactobacillus into 25 genera including the emended genus Lactobacillus, which includes host-adapted organisms that have been referred to as the Lactobacillus delbrueckii group, Paralactobacillus and 23 novel genera for which the names Holzapfelia, Amylolactobacillus, Bombilactobacillus, Companilactobacillus, Lapidilactobacillus, Agrilactobacillus, Schleiferilactobacillus, Loigolactobacilus, Lacticaseibacillus, Latilactobacillus, Dellaglioa, Liquorilactobacillus, Ligilactobacillus, Lactiplantibacillus, Furfurilactobacillus, Paucilactobacillus, Limosilactobacillus, Fructilactobacillus, Acetilactobacillus, Apilactobacillus, Levilactobacillus, Secundilactobacillus and Lentilactobacillus are proposed. We also propose to emend the description of the family Lactobacillaceae to include all genera that were previously included in families Lactobacillaceae and Leuconostocaceae. The generic term 'lactobacilli' will remain useful to designate all organisms that were classified as Lactobacillaceae until 2020. This reclassification reflects the phylogenetic position of the micro-organisms, and groups lactobacilli into robust clades with shared ecological and metabolic properties, as exemplified for the emended genus Lactobacillus encompassing species adapted to vertebrates (such as Lactobacillus delbrueckii, Lactobacillus iners, Lactobacillus crispatus, Lactobacillus jensensii, Lactobacillus johnsonii and Lactobacillus acidophilus) or invertebrates (such as Lactobacillus apis and Lactobacillus bombicola).

1,496 citations

Journal ArticleDOI
22 May 2013-Nature
TL;DR: The draft assembly of the 20-gigabase genome of Norway spruce (Picea abies), the first available for any gymnosperm, is presented, revealing numerous long (>10,000 base pairs) introns, gene-like fragments, uncharacterized long non-coding RNAs and short RNAs, which opens up new genomic avenues for conifer forestry and breeding.
Abstract: Conifers have dominated forests for more than 200 million years and are of huge ecological and economic importance. Here we present the draft assembly of the 20-gigabase genome of Norway spruce (Picea abies), the first available for any gymnosperm. The number of well-supported genes (28,354) is similar to the >100 times smaller genome of Arabidopsis thaliana, and there is no evidence of a recent whole-genome duplication in the gymnosperm lineage. Instead, the large genome size seems to result from the slow and steady accumulation of a diverse set of long-terminal repeat transposable elements, possibly owing to the lack of an efficient elimination mechanism. Comparative sequencing of Pinus sylvestris, Abies sibirica, Juniperus communis, Taxus baccata and Gnetum gnemon reveals that the transposable element diversity is shared among extant conifers. Expression of 24-nucleotide small RNAs, previously implicated in transposable element silencing, is tissue-specific and much lower than in other plants. We further identify numerous long (>10,000 base pairs) introns, gene-like fragments, uncharacterized long non-coding RNAs and short RNAs. This opens up new genomic avenues for conifer forestry and breeding.

1,299 citations

Journal ArticleDOI
TL;DR: This work reviews the synthetic and electronic design strategies that have been employed thus far for producing frameworks with permanent porosity and long-range charge transport properties and selected applications for this subclass of MOFs.
Abstract: Owing to their outstanding structural, chemical, and functional diversity, metal-organic frameworks (MOFs) have attracted considerable attention over the last two decades in a variety of energy-related applications. Notably missing among these, until recently, were applications that required good charge transport coexisting with porosity and high surface area. Although most MOFs are electrical insulators, several materials in this class have recently demonstrated excellent electrical conductivity and high charge mobility. Herein we review the synthetic and electronic design strategies that have been employed thus far for producing frameworks with permanent porosity and long-range charge transport properties. In addition, key experiments that have been employed to demonstrate electrical transport, as well as selected applications for this subclass of MOFs, will be discussed.

1,279 citations