scispace - formally typeset
Search or ask a question
Author

Fanyue Sun

Bio: Fanyue Sun is an academic researcher from Auburn University. The author has contributed to research in topics: Catfish & Genome. The author has an hindex of 21, co-authored 31 publications receiving 1969 citations. Previous affiliations of Fanyue Sun include Liaoning Normal University & University of Connecticut Health Center.

Papers
More filters
Journal ArticleDOI
TL;DR: A draft genome of domesticated C. carpio (strain Songpu) is presented, whose current assembly contains 52,610 protein-coding genes and approximately 92.3% coverage of its paleotetraploidized genome (2n = 100).
Abstract: The common carp, Cyprinus carpio, is one of the most important cyprinid species and globally accounts for 10% of freshwater aquaculture production. Here we present a draft genome of domesticated C. carpio (strain Songpu), whose current assembly contains 52,610 protein-coding genes and approximately 92.3% coverage of its paleotetraploidized genome (2n = 100). The latest round of whole-genome duplication has been estimated to have occurred approximately 8.2 million years ago. Genome resequencing of 33 representative individuals from worldwide populations demonstrates a single origin for C. carpio in 2 subspecies (C. carpio Haematopterus and C. carpio carpio). Integrative genomic and transcriptomic analyses were used to identify loci potentially associated with traits including scaling patterns and skin color. In combination with the high-resolution genetic map, the draft genome paves the way for better molecular studies and improved genome-assisted breeding of C. carpio and other closely related species.

515 citations

Journal ArticleDOI
TL;DR: A high-quality reference genome sequence of channel catfish is reported, providing evidence that lack of secretory calcium-binding phosphoproteins accounts for the evolutionary loss of scales in catfish and demonstrates the power of comparative subtraction of candidate genes for traits of structural significance.
Abstract: Catfish represent 12% of teleost or 6.3% of all vertebrate species, and are of enormous economic value. Here we report a high-quality reference genome sequence of channel catfish (Ictalurus punctatus), the major aquaculture species in the US. The reference genome sequence was validated by genetic mapping of 54,000 SNPs, and annotated with 26,661 predicted protein-coding genes. Through comparative analysis of genomes and transcriptomes of scaled and scaleless fish and scale regeneration experiments, we address the genomic basis for the most striking physical characteristic of catfish, the evolutionary loss of scales and provide evidence that lack of secretory calcium-binding phosphoproteins accounts for the evolutionary loss of scales in catfish. The channel catfish reference genome sequence, along with two additional genome sequences and transcriptomes of scaled catfishes, provide crucial resources for evolutionary and biological studies. This work also demonstrates the power of comparative subtraction of candidate genes for traits of structural significance. Catfish represent 6.3% of all vertebrate species, and occupy a phylogenetic position close to the common ancestor of bony fish. Liu et al. present a reference genome of the channel catfish, and reveal a genomic basis for the evolutionary loss of scales in these species.

237 citations

Journal ArticleDOI
TL;DR: The first transcriptomic profiling of host responses to columnaris following an experimental challenge is conducted, highlighting several putative immune pathways and individual candidate genes deserving of further investigation in the context of development of therapeutic regimens and laying the foundation for selection of resistant catfish lines against columnaris.
Abstract: Outbreaks of columnaris disease (Flavobacterium columnare) are common in wild and cultured freshwater fish worldwide. Disease occurrences, particularly those caused by virulent genomovar II isolates, in aquaculture species such as channel catfish can be devastating. In contrast to other important aquaculture pathogens, little is known about host immune responses to columnaris. Adhesion of F. columnare to gill tissue has been correlated in some previous studies to virulence and host susceptibility. Here, therefore, we conducted the first transcriptomic profiling of host responses to columnaris following an experimental challenge. We utilized Illumina-based RNA-seq expression profiling to examine transcript profiles at three timepoints (4h, 24h, and 48h) in catfish gill after bath immersion infection. Enrichment and pathway analyses of the differentially expressed genes revealed several central signatures following infection. These included the dramatic upregulation of a rhamnose-binding lectin, with putative roles in bacterial attachment and aggregation, suppression of NF-κB signalling via IκBs, BCL-3, TAX1BP1, and olfactomedin 4, and strong induction of IFN-inducible responses including iNOS2b, IFI44, and VHSV genes. Fifteen differentially expressed genes with varying expression profiles by RNA-seq, were validated by QPCR (correlation coefficients 0.85-0.94, p-value <0.001). Our results highlight several putative immune pathways and individual candidate genes deserving of further investigation in the context of development of therapeutic regimens and laying the foundation for selection of resistant catfish lines against columnaris.

156 citations

Journal ArticleDOI
TL;DR: The SNPs identified in this report will provide a much needed resource for genetic studies in catfish and will contribute to the development of a high-density SNP array.
Abstract: Single nucleotide polymorphisms (SNPs) have become the marker of choice for genome-wide association studies. In order to provide the best genome coverage for the analysis of performance and production traits, a large number of relatively evenly distributed SNPs are needed. Gene-associated SNPs may fulfill these requirements of large numbers and genome wide distribution. In addition, gene-associated SNPs could themselves be causative SNPs for traits. The objective of this project was to identify large numbers of gene-associated SNPs using high-throughput next generation sequencing. Transcriptome sequencing was conducted for channel catfish and blue catfish using Illumina next generation sequencing technology. Approximately 220 million reads (15.6 Gb) for channel catfish and 280 million reads (19.6 Gb) for blue catfish were obtained by sequencing gene transcripts derived from various tissues of multiple individuals from a diverse genetic background. A total of over 35 billion base pairs of expressed short read sequences were generated. Over two million putative SNPs were identified from channel catfish and almost 2.5 million putative SNPs were identified from blue catfish. Of these putative SNPs, a set of filtered SNPs were identified including 342,104 intra-specific SNPs for channel catfish, 366,269 intra-specific SNPs for blue catfish, and 420,727 inter-specific SNPs between channel catfish and blue catfish. These filtered SNPs are distributed within 16,562 unique genes in channel catfish and 17,423 unique genes in blue catfish. For aquaculture species, transcriptome analysis of pooled RNA samples from multiple individuals using Illumina sequencing technology is both technically efficient and cost-effective for generating expressed sequences. Such an approach is most effective when coupled to existing EST resources generated using traditional sequencing approaches because the reference ESTs facilitate effective assembly of the expressed short reads. When multiple individuals with different genetic backgrounds are used, RNA-Seq is very effective for the identification of SNPs. The SNPs identified in this report will provide a much needed resource for genetic studies in catfish and will contribute to the development of a high-density SNP array. Validation and testing of these SNPs using SNP arrays will form the material basis for genome association studies and whole genome-based selection in catfish.

146 citations

Journal ArticleDOI
TL;DR: This is the first RNA-Seq-based expression study in catfish in response to heat stress, and the candidate genes identified should be valuable for further targeted studies on heat tolerance, thereby assisting the development of heat-tolerant catfish lines for aquaculture.
Abstract: Temperature is one of the most prominent abiotic factors affecting ectotherms. Most fish species, as ectotherms, have extraordinary ability to deal with a wide range of temperature changes. While the molecular mechanism underlying temperature adaptation has long been of interest, it is still largely unexplored with fish. Understanding of the fundamental mechanisms conferring tolerance to temperature fluctuations is a topic of increasing interest as temperature may continue to rise as a result of global climate change. Catfish have a wide natural habitat and possess great plasticity in dealing with environmental variations in temperature. However, no studies have been conducted at the transcriptomic level to determine heat stress-induced gene expression. In the present study, we conducted an RNA-Seq analysis to identify heat stress-induced genes in catfish at the transcriptome level. Expression analysis identified a total of 2,260 differentially expressed genes with a cutoff of twofold change. qRT-PCR validation suggested the high reliability of the RNA-Seq results. Gene ontology, enrichment, and pathway analyses were conducted to gain insight into physiological and gene pathways. Specifically, genes involved in oxygen transport, protein folding and degradation, and metabolic process were highly induced, while general protein synthesis was dramatically repressed in response to the lethal temperature stress. This is the first RNA-Seq-based expression study in catfish in response to heat stress. The candidate genes identified should be valuable for further targeted studies on heat tolerance, thereby assisting the development of heat-tolerant catfish lines for aquaculture.

134 citations


Cited by
More filters
10 Dec 2007
TL;DR: The experiments on both rice and human genome sequences demonstrate that EVM produces automated gene structure annotation approaching the quality of manual curation.
Abstract: EVidenceModeler (EVM) is presented as an automated eukaryotic gene structure annotation tool that reports eukaryotic gene structures as a weighted consensus of all available evidence. EVM, when combined with the Program to Assemble Spliced Alignments (PASA), yields a comprehensive, configurable annotation system that predicts protein-coding genes and alternatively spliced isoforms. Our experiments on both rice and human genome sequences demonstrate that EVM produces automated gene structure annotation approaching the quality of manual curation.

1,528 citations

01 Jan 2001
TL;DR: Genetical genomics as discussed by the authors combines the power of genomics and genetics in a way that is likely to become instrumental in the further unravelling of metabolic, regulatory and developmental pathways.
Abstract: The recent successes of genome-wide expression profiling in biology tend to overlook the power of genetics. We here propose a merger of genomics and genetics into ‘genetical genomics’. This involves expression profiling and marker-based fingerprinting of each individual of a segregating population, and exploits all the statistical tools used in the analysis of quantitative trait loci. Genetical genomics will combine the power of two different worlds in a way that is likely to become instrumental in the further unravelling of metabolic, regulatory and developmental pathways.

952 citations

Journal ArticleDOI
01 Jan 2016-Database
TL;DR: The Ensembl gene annotation system has been used to annotate over 70 different vertebrate species across a wide range of genome projects and generates the automatic alignment-based annotation for the human and mouse GENCODE gene sets.
Abstract: The Ensembl gene annotation system has been used to annotate over 70 different vertebrate species across a wide range of genome projects. Furthermore, it generates the automatic alignment-based ann ...

849 citations

Journal ArticleDOI
TL;DR: A comprehensive landscape of different modes of gene duplication across the plant kingdom is identified by comparing 141 genomes, which provides a solid foundation for further investigation of the dynamic evolution of duplicate genes.
Abstract: The sharp increase of plant genome and transcriptome data provide valuable resources to investigate evolutionary consequences of gene duplication in a range of taxa, and unravel common principles underlying duplicate gene retention. We survey 141 sequenced plant genomes to elucidate consequences of gene and genome duplication, processes central to the evolution of biodiversity. We develop a pipeline named DupGen_finder to identify different modes of gene duplication in plants. Genes derived from whole-genome, tandem, proximal, transposed, or dispersed duplication differ in abundance, selection pressure, expression divergence, and gene conversion rate among genomes. The number of WGD-derived duplicate genes decreases exponentially with increasing age of duplication events—transposed duplication- and dispersed duplication-derived genes declined in parallel. In contrast, the frequency of tandem and proximal duplications showed no significant decrease over time, providing a continuous supply of variants available for adaptation to continuously changing environments. Moreover, tandem and proximal duplicates experienced stronger selective pressure than genes formed by other modes and evolved toward biased functional roles involved in plant self-defense. The rate of gene conversion among WGD-derived gene pairs declined over time, peaking shortly after polyploidization. To provide a platform for accessing duplicated gene pairs in different plants, we constructed the Plant Duplicate Gene Database. We identify a comprehensive landscape of different modes of gene duplication across the plant kingdom by comparing 141 genomes, which provides a solid foundation for further investigation of the dynamic evolution of duplicate genes.

461 citations