scispace - formally typeset
Search or ask a question
Author

Yao Ming

Bio: Yao Ming is an academic researcher. The author has contributed to research in topics: Genome & Biology. The author has an hindex of 1, co-authored 1 publications receiving 1560 citations.

Papers
More filters
Journal ArticleDOI
04 Oct 2012-Nature
TL;DR: The sequencing and assembly of the oyster genome using short reads and a fosmid-pooling strategy and transcriptomes of development and stress response and the proteome of the shell are reported, showing that shell formation in molluscs is more complex than currently understood and involves extensive participation of cells and their exosomes.
Abstract: The Pacific oyster Crassostrea gigas belongs to one of the most species-rich but genomically poorly explored phyla, the Mollusca. Here we report the sequencing and assembly of the oyster genome using short reads and a fosmid-pooling strategy, along with transcriptomes of development and stress response and the proteome of the shell. The oyster genome is highly polymorphic and rich in repetitive sequences, with some transposable elements still actively shaping variation. Transcriptome studies reveal an extensive set of genes responding to environmental stress. The expansion of genes coding for heat shock protein 70 and inhibitors of apoptosis is probably central to the oyster's adaptation to sessile life in the highly stressful intertidal zone. Our analyses also show that shell formation in molluscs is more complex than currently understood and involves extensive participation of cells and their exosomes. The oyster genome sequence fills a void in our understanding of the Lophotrochozoa.

1,806 citations

Journal ArticleDOI
TL;DR:
Abstract: Microsatellites are one of the important genome characterizations that can be a valuable resource for variety identification, genetic diversity, phylogenetic analysis, as well as comparative and conservation genomics research. Here, we developed comprehensive microsatellites through genome-wide mining for the threatened cetacean Indo-Pacific humpback dolphin (Sousa chinensis). We found 87,757 microsatellites with 2–6 bp nucleotide motifs, showing that about 32.5 microsatellites per megabase comprises microsatellites sequences. Approximately 97.8% of the markers developed in this study were consistent with the published identified markers. About 75.3% microsatellites were with dinucleotide motifs, followed by tetranucleotide motifs (17.4%), sharing the same composition pattern as other cetaceans. The microsatellites were not evenly distributed in the S. chinensis genome, mainly in non-coding regions, with only about 0.5% of the markers located in coding regions. The microsatellite-containing genes were mainly functionally enriched in the methylation process, probably demonstrating the potential impacts of microsatellites on biological functions. Polymorphic microsatellites were developed between different genders of S. chinensis, which was expected to lay the foundation for genetic diversity investigation in cetaceans. The specific markers for a male Indo-Pacific humpback dolphin will provide comprehensive and representative male candidate markers for sex identification, providing a potential biomolecular tool for further analysis of population structure and social behavior of wild populations, population trend evaluation, and species conservation management.

1 citations

Journal ArticleDOI
TL;DR: In this paper , the authors employed the pearl oyster, Pinctada fucata marntensii, a widely used experimental model for biomineralization in invertebrates, to perform whole-genome resequencing of 878 individuals from wild and breeding populations.
Abstract: Biomineralization‐controlled exo‐/endoskeleton growth contributes to body growth and body size diversity. Molluscan shells undergo ectopic biomineralization to form the exoskeleton and biocalcified “pearl” involved in invading defence. Notably, exo‐/endoskeletons have a common ancestral origin, but their regulation and body growth are largely unknown. This study employed the pearl oyster, Pinctada fucata marntensii, a widely used experimental model for biomineralization in invertebrates, to perform whole‐genome resequencing of 878 individuals from wild and breeding populations. This study characterized the genetic architecture of biomineralization‐controlled growth and ectopic biomineralization. The insulin‐like growth factor (IGF) endocrine signal interacted with ancient single‐copy transcription factors to form the regulatory network. Moreover, the “cross‐phylum” regulation of key long noncoding RNA (lncRNA) in bivalves and mammals indicated the conserved genetic and epigenetic regulation in exo‐/endoskeleton growth. Thyroid hormone signal and apoptosis regulation in pearl oysters affected ectopic biomineralization in pearl oyster. These findings provide insights into the mechanism underlying the evolution and regulation of biomineralization in exo‐/endoskeleton animals and ectopic biomineralization.
Journal ArticleDOI
TL;DR: In this paper , a reference genome of Ramie (Boehmeria nivea L), belonging to Urticaceae, is generated, in which approximately 90.2% of the assembled sequences have been anchored to 14 pseudochromosomes.
Abstract: ABSTRACT Ramie (Boehmeria nivea L.), belonging to Urticaceae, is principally used for fabric production. It is a well-known natural fiber material for ancient clothing. Despite its important position and application value, the understanding on genetic regulation mechanism of fiber quality is limited. Here, we generate a chromosome-scale, high-quality reference genome of ramie, in which, approximately 90.2% of the assembled sequences have been anchored to 14 pseudochromosomes. Totally 27,664 protein-coding genes are predicted which cover 268.24 Mb region of the genome. Comparative genomic analysis reveals that 2,047 and 796 gene clusters expand and contract, respectively, underlying significant genes in plant hormone signal transduction and cellulose/lignin biosynthesis pathways. An integrative analysis combining quantitative trait loci (QTL), comparative transcriptomic data, and cytological experiments unravels the molecular regulatory mechanism of ramie fiber fineness, especially the critical regulating role of ethylene. This study would lay a solid foundation for the research of molecular biology in ramie and provide valuable reference for the improvement of high-quality fiber varieties.
Posted ContentDOI
17 Jun 2023-bioRxiv
TL;DR: In this article , the first high-quality chromosome-level genome for the complex octoploid Acipenser sinensis (Chinese sturgeon), a critically endangered species that also represents a poorly understood ploidy group in Acipensensiformes, was presented.
Abstract: The Order Acipenseriformes, which include sturgeons and paddlefishes, represent “living fossils” with complex genomes that are good models for understanding whole genome duplication (WGD) and ploidy evolution in fishes. Here we sequenced and assembled the first high-quality chromosome-level genome for the complex octoploid Acipenser sinensis (Chinese sturgeon), a critically endangered species that also represents a poorly understood ploidy group in Acipenseriformes. Our results show that A. sinensis is a complex autooctoploid species containing four kinds of octovalents (8 n), a hexavalent (6 n), two tetravalents (4 n), and a divalent (2 n). We propose based on an analysis taking into account delayed rediploidization that its octoploid genome composition results from two rounds of homologous whole genome duplications (WGDs), and further provide insight into the timing of its ploidy evolution. This study provides the first octoploid genome resource of Acipenseriformes for understanding ploidy composition and evolutionary trajectories of polyploidy fishes.

Cited by
More filters
Journal ArticleDOI
TL;DR: The approach to utilizing available RNA-Seq and other data types in the authors' manual curation process for vertebrate, plant, and other species is summarized, and a new direction for prokaryotic genomes and protein name management is described.
Abstract: The RefSeq project at the National Center for Biotechnology Information (NCBI) maintains and curates a publicly available database of annotated genomic, transcript, and protein sequence records (http://www.ncbi.nlm.nih.gov/refseq/). The RefSeq project leverages the data submitted to the International Nucleotide Sequence Database Collaboration (INSDC) against a combination of computation, manual curation, and collaboration to produce a standard set of stable, non-redundant reference sequences. The RefSeq project augments these reference sequences with current knowledge including publications, functional features and informative nomenclature. The database currently represents sequences from more than 55,000 organisms (>4800 viruses, >40,000 prokaryotes and >10,000 eukaryotes; RefSeq release 71), ranging from a single record to complete genomes. This paper summarizes the current status of the viral, prokaryotic, and eukaryotic branches of the RefSeq project, reports on improvements to data access and details efforts to further expand the taxonomic representation of the collection. We also highlight diverse functional curation initiatives that support multiple uses of RefSeq data including taxonomic validation, genome annotation, comparative genomics, and clinical testing. We summarize our approach to utilizing available RNA-Seq and other data types in our manual curation process for vertebrate, plant, and other species, and describe a new direction for prokaryotic genomes and protein name management.

4,104 citations

Journal ArticleDOI
22 May 2013-Nature
TL;DR: The draft assembly of the 20-gigabase genome of Norway spruce (Picea abies), the first available for any gymnosperm, is presented, revealing numerous long (>10,000 base pairs) introns, gene-like fragments, uncharacterized long non-coding RNAs and short RNAs, which opens up new genomic avenues for conifer forestry and breeding.
Abstract: Conifers have dominated forests for more than 200 million years and are of huge ecological and economic importance. Here we present the draft assembly of the 20-gigabase genome of Norway spruce (Picea abies), the first available for any gymnosperm. The number of well-supported genes (28,354) is similar to the >100 times smaller genome of Arabidopsis thaliana, and there is no evidence of a recent whole-genome duplication in the gymnosperm lineage. Instead, the large genome size seems to result from the slow and steady accumulation of a diverse set of long-terminal repeat transposable elements, possibly owing to the lack of an efficient elimination mechanism. Comparative sequencing of Pinus sylvestris, Abies sibirica, Juniperus communis, Taxus baccata and Gnetum gnemon reveals that the transposable element diversity is shared among extant conifers. Expression of 24-nucleotide small RNAs, previously implicated in transposable element silencing, is tissue-specific and much lower than in other plants. We further identify numerous long (>10,000 base pairs) introns, gene-like fragments, uncharacterized long non-coding RNAs and short RNAs. This opens up new genomic avenues for conifer forestry and breeding.

1,299 citations

Journal ArticleDOI
TL;DR: Evidence is provided that micro-PS cause feeding modifications and reproductive disruption in oysters, with significant impacts on offspring, providing ground-breaking data on microplastic impacts in an invertebrate model, helping to predict ecological impact in marine ecosystems.
Abstract: Plastics are persistent synthetic polymers that accumulate as waste in the marine environment. Microplastic (MP) particles are derived from the breakdown of larger debris or can enter the environment as microscopic fragments. Because filter-feeder organisms ingest MP while feeding, they are likely to be impacted by MP pollution. To assess the impact of polystyrene microspheres (micro-PS) on the physiology of the Pacific oyster, adult oysters were experimentally exposed to virgin micro-PS (2 and 6 µm in diameter; 0.023 mg·L−1) for 2 mo during a reproductive cycle. Effects were investigated on ecophysiological parameters; cellular, transcriptomic, and proteomic responses; fecundity; and offspring development. Oysters preferentially ingested the 6-µm micro-PS over the 2-µm-diameter particles. Consumption of microalgae and absorption efficiency were significantly higher in exposed oysters, suggesting compensatory and physical effects on both digestive parameters. After 2 mo, exposed oysters had significant decreases in oocyte number (−38%), diameter (−5%), and sperm velocity (−23%). The D-larval yield and larval development of offspring derived from exposed parents decreased by 41% and 18%, respectively, compared with control offspring. Dynamic energy budget modeling, supported by transcriptomic profiles, suggested a significant shift of energy allocation from reproduction to structural growth, and elevated maintenance costs in exposed oysters, which is thought to be caused by interference with energy uptake. Molecular signatures of endocrine disruption were also revealed, but no endocrine disruptors were found in the biological samples. This study provides evidence that micro-PS cause feeding modifications and reproductive disruption in oysters, with significant impacts on offspring.

1,164 citations

Journal ArticleDOI
TL;DR: Genomes assembled de novo from short reads are highly fragmented relative to the finished chromosomes of Homo sapiens and key model organisms generated by the Human Genome Project, so genome-wide chromatin interaction data sets, such as those generated by Hi-C, are a rich source of long-range information for assigning, ordering and orienting genomic sequences to chromosomes, including across centromeres.
Abstract: Genomes assembled de novo from short reads are highly fragmented relative to the finished chromosomes of Homo sapiens and key model organisms generated by the Human Genome Project. To address this problem, we need scalable, cost-effective methods to obtain assemblies with chromosome-scale contiguity. Here we show that genome-wide chromatin interaction data sets, such as those generated by Hi-C, are a rich source of long-range information for assigning, ordering and orienting genomic sequences to chromosomes, including across centromeres. To exploit this finding, we developed an algorithm that uses Hi-C data for ultra-long-range scaffolding of de novo genome assemblies. We demonstrate the approach by combining shotgun fragment and short jump mate-pair sequences with Hi-C data to generate chromosome-scale de novo assemblies of the human, mouse and Drosophila genomes, achieving--for the human genome--98% accuracy in assigning scaffolds to chromosome groups and 99% accuracy in ordering and orienting scaffolds within chromosome groups. Hi-C data can also be used to validate chromosomal translocations in cancer genomes.

1,032 citations

Journal ArticleDOI
TL;DR: Platanus provides a novel and efficient approach for the assembly of gigabase-sized highly heterozygous genomes and is an attractive alternative to the existing assemblers designed for genomes of lower heterozygosity.
Abstract: Although many de novo genome assembly projects have recently been conducted using high-throughput sequencers, assembling highly heterozygous diploid genomes is a substantial challenge due to the increased complexity of the de Bruijn graph structure predominantly used. To address the increasing demand for sequencing of nonmodel and/or wild-type samples, in most cases inbred lines or fosmid-based hierarchical sequencing methods are used to overcome such problems. However, these methods are costly and time consuming, forfeiting the advantages of massive parallel sequencing. Here, we describe a novel de novo assembler, Platanus, that can effectively manage high-throughput data from heterozygous samples. Platanus assembles DNA fragments (reads) into contigs by constructing de Bruijn graphs with automatically optimized k-mer sizes followed by the scaffolding of contigs based on paired-end information. The complicated graph structures that result from the heterozygosity are simplified during not only the contig assembly step but also the scaffolding step. We evaluated the assembly results on eukaryotic samples with various levels of heterozygosity. Compared with other assemblers, Platanus yields assembly results that have a larger scaffold NG50 length without any accompanying loss of accuracy in both simulated and real data. In addition, Platanus recorded the largest scaffold NG50 values for two of the three low-heterozygosity species used in the de novo assembly contest, Assemblathon 2. Platanus therefore provides a novel and efficient approach for the assembly of gigabase-sized highly heterozygous genomes and is an attractive alternative to the existing assemblers designed for genomes of lower heterozygosity.

924 citations