scispace - formally typeset
Search or ask a question
Author

Wenzhu Peng

Bio: Wenzhu Peng is an academic researcher from Xiamen University. The author has contributed to research in topics: Abalone & Genome. The author has an hindex of 12, co-authored 24 publications receiving 412 citations. Previous affiliations of Wenzhu Peng include Chinese Academy of Fishery Sciences & Shanghai Ocean University.

Papers
More filters
Journal ArticleDOI
TL;DR: Gen expression bias across surveyed tissues such that subgenome B is more dominant in homoeologous expression is found, and CG methylation in promoter regions may play an important role in altering gene expression in allotetraploid C. carpio.
Abstract: Common carp (Cyprinus carpio) is an allotetraploid species derived from recent whole genome duplication and provides a model to study polyploid genome evolution in vertebrates. Here, we generate three chromosome-level reference genomes of C. carpio and compare to related diploid Cyprinid genomes. We identify a Barbinae lineage as potential diploid progenitor of C. carpio and then divide the allotetraploid genome into two subgenomes marked by a distinct genome similarity to the diploid progenitor. We estimate that the two diploid progenitors diverged around 23 Mya and merged around 12.4 Mya based on the divergence rates of homoeologous genes and transposable elements in two subgenomes. No extensive gene losses are observed in either subgenome. Instead, we find gene expression bias across surveyed tissues such that subgenome B is more dominant in homoeologous expression. CG methylation in promoter regions may play an important role in altering gene expression in allotetraploid C. carpio.

136 citations

Journal ArticleDOI
TL;DR: The high-density and high-resolution genetic linkage map provides an important tool for QTL fine mapping and positional cloning of economically important traits, and improving common carp genome assembly.
Abstract: High density genetic linkage maps are essential for QTL fine mapping, comparative genomics and high quality genome sequence assembly. In this study, we constructed a high-density and high-resolution genetic linkage map with 28,194 SNP markers on 14,146 distinct loci for common carp based on high-throughput genotyping with the carp 250 K single nucleotide polymorphism (SNP) array in a mapping family. The genetic length of the consensus map was 10,595.94 cM with an average locus interval of 0.75 cM and an average marker interval of 0.38 cM. Comparative genomic analysis revealed high level of conserved syntenies between common carp and the closely related model species zebrafish and medaka. The genome scaffolds were anchored to the high-density linkage map, spanning 1,357 Mb of common carp reference genome. QTL mapping and association analysis identified 22 QTLs for growth-related traits and 7 QTLs for sex dimorphism. Candidate genes underlying growth-related traits were identified, including important regulators such as KISS2, IGF1, SMTLB, NPFFR1 and CPE. Candidate genes associated with sex dimorphism were also identified including 3KSR and DMRT2b. The high-density and high-resolution genetic linkage map provides an important tool for QTL fine mapping and positional cloning of economically important traits, and improving common carp genome assembly.

92 citations

Journal ArticleDOI
TL;DR: A genome scan and comparison of these two populations and a set of genomic regions under selective sweeps that harbor genes involved in ion homoeostasis, acid-base regulation, unfolded protein response, reactive oxygen species elimination, and urea excretion are identified.
Abstract: The Amur ide (Leuciscus waleckii) is a cyprinid fish that is widely distributed in Northeast Asia. The Lake Dali Nur population inhabits one of the most extreme aquatic environments on Earth, with an alkalinity up to 50 mmol/L (pH 9.6), thus providing an exceptional model with which to characterize the mechanisms of genomic evolution underlying adaptation to extreme environments. Here, we developed the reference genome assembly for L. waleckii from Lake Dali Nur. Intriguingly, we identified unusual expanded long terminal repeats (LTRs) with higher nucleotide substitution rates than in many other teleosts, suggesting their more recent insertion into the L. waleckii genome. We also identified expansions in genes encoding egg coat proteins and natriuretic peptide receptors, possibly underlying the adaptation to extreme environmental stress. We further sequenced the genomes of 10 additional individuals from freshwater and 18 from Lake Dali Nur populations, and we detected a total of 7.6 million SNPs from both populations. In a genome scan and comparison of these two populations, we identified a set of genomic regions under selective sweeps that harbor genes involved in ion homoeostasis, acid-base regulation, unfolded protein response, reactive oxygen species elimination, and urea excretion. Our findings provide comprehensive insight into the genomic mechanisms of teleost fish that underlie their adaptation to extreme alkaline environments.

60 citations

Journal ArticleDOI
TL;DR: A high-quality draft genome of C. argus is generated, which will provide a valuable genetic resource for further biomedical investigations of this economically important teleost fish.
Abstract: Background: The Northern snakehead (Channa argus), a member of the Channidae family of the Perciformes, is an economically important freshwater fish native to East Asia. In North America, it has become notorious as an intentionally released invasive species. Its ability to breathe air with gills and migrate short distances over land makes it a good model for bimodal breath research. Therefore, recent research has focused on the identification of relevant candidate genes. Here, we performed whole genome sequencing of C. argus to construct its draft genome, aiming to offer useful information for further functional studies and identification of target genes related to its unusual facultative air breathing. Findings: We assembled the C. argus genome with a total of 140.3 Gb of raw reads, which were sequenced using the Illumina HiSeq2000 platform. The final draft genome assembly was approximately 615.3 Mb, with a contig N50 of 81.4 kb and scaffold N50 of 4.5 Mb. The identified repeat sequences account for 18.9% of the whole genome. The 19 877 protein-coding genes were predicted from the genome assembly, with an average of 10.5 exons per gene. Conclusion: We generated a high-quality draft genome of C. argus, which will provide a valuable genetic resource for further biomedical investigations of this economically important teleost fish.

48 citations

Journal ArticleDOI
TL;DR: A high-density genetic linkage map using double-digest restriction-site associated DNA (ddRAD)-based high-throughput SNP genotyping data of a F1 mapping family, which had been challenged with C. irritans for resistant trait measure, provides an important tool and resource for fine mapping, comparative genome analysis, and molecular selective breeding of large yellow croaker.
Abstract: The large yellow croaker (Larimichthys crocea) is the most economically important marine cage-farming fish in China in the past decade. However, the sustainable development of large yellow croaker aquaculture has been severely hampered by several diseases, of which, the white spot disease caused by ciliate protozoan parasite Cryptocaryon irritans ranks the most damaging disease in large yellow croaker cage farms. To better understand the genetic basis of parasite infection and disease resistance to C. irritans, it is vital to map the traits and localize the underlying candidate genes in L. crocea genome. Here, we constructed a high-density genetic linkage map using double-digest restriction-site associated DNA (ddRAD)-based high-throughput SNP genotyping data of a F1 mapping family, which had been challenged with C. irritans for resistant trait measure. A total of 5261 SNPs was grouped and oriented into 24 linkage groups (LGs), representing 24 chromosomes of L. crocea. The total genetic map length was 1885.67 cM with an average inter-locus distance of 0.36 cM. Quantitative trait loci (QTL) mapping identified seven significant QTLs in four LGs linked to C. irritans disease resistance. Candidate genes underlying disease resistance were identified from the reference genome, including ifnar1, ifngr2, ikbke, and CD112. Comparative genomic analysis between large yellow croaker and the four closely related species revealed high evolutionary conservation of chromosomes, though inter-chromosomal rearrangements do exist. Especially, the croaker genome structure was closer to the medaka genome than stickleback, indicating that the croaker genome might retain the teleost ancestral genome structure. The high-density genetic linkage map provides an important tool and resource for fine mapping, comparative genome analysis, and molecular selective breeding of large yellow croaker.

41 citations


Cited by
More filters
Journal ArticleDOI

3,734 citations

10 Dec 2007
TL;DR: The experiments on both rice and human genome sequences demonstrate that EVM produces automated gene structure annotation approaching the quality of manual curation.
Abstract: EVidenceModeler (EVM) is presented as an automated eukaryotic gene structure annotation tool that reports eukaryotic gene structures as a weighted consensus of all available evidence. EVM, when combined with the Program to Assemble Spliced Alignments (PASA), yields a comprehensive, configurable annotation system that predicts protein-coding genes and alternatively spliced isoforms. Our experiments on both rice and human genome sequences demonstrate that EVM produces automated gene structure annotation approaching the quality of manual curation.

1,528 citations

14 Jan 2013
TL;DR: In this paper, the authors describe an active project that aims to identify and phenotype the disruptive mutations in every zebrafish protein-coding gene, using a well-annotated Zebrafish reference genome sequence, high-throughput sequencing and efficient chemical mutagenesis.
Abstract: Since the publication of the human reference genome, the identities of specific genes associated with human diseases are being discovered at a rapid rate. A central problem is that the biological activity of these genes is often unclear. Detailed investigations in model vertebrate organisms, typically mice, have been essential for understanding the activities of many orthologues of these disease-associated genes. Although gene-targeting approaches and phenotype analysis have led to a detailed understanding of nearly 6,000 protein-coding genes, this number falls considerably short of the more than 22,000 mouse protein-coding genes. Similarly, in zebrafish genetics, one-by-one gene studies using positional cloning, insertional mutagenesis, antisense morpholino oligonucleotides, targeted re-sequencing, and zinc finger and TAL endonucleases have made substantial contributions to our understanding of the biological activity of vertebrate genes, but again the number of genes studied falls well short of the more than 26,000 zebrafish protein-coding genes. Importantly, for both mice and zebrafish, none of these strategies are particularly suited to the rapid generation of knockouts in thousands of genes and the assessment of their biological activity. Here we describe an active project that aims to identify and phenotype the disruptive mutations in every zebrafish protein-coding gene, using a well-annotated zebrafish reference genome sequence, high-throughput sequencing and efficient chemical mutagenesis. So far we have identified potentially disruptive mutations in more than 38% of all known zebrafish protein-coding genes. We have developed a multi-allelic phenotyping scheme to efficiently assess the effects of each allele during embryogenesis and have analysed the phenotypic consequences of over 1,000 alleles. All mutant alleles and data are available to the community and our phenotyping scheme is adaptable to phenotypic analysis beyond embryogenesis.

503 citations

Journal ArticleDOI
TL;DR: An overview of hormones known to regulate food intake in fish is provided, emphasizing on major hormones and the main fish groups studied to date.
Abstract: Fish are the most diversified group of vertebrates and, although progress has been made in the past years, only relatively few fish species have been examined to date, with regards to the endocrine regulation of feeding in fish. In fish, as in mammals, feeding behavior is ultimately regulated by central effectors within feeding centers of the brain, which receive and process information from endocrine signals from both brain and peripheral tissues. Although basic endocrine mechanisms regulating feeding appear to be conserved among vertebrates, major physiological differences between fish and mammals and the diversity of fish, in particular in regard to feeding habits, digestive tract anatomy and physiology, suggest the existence of fish- and species-specific regulating mechanisms. This review provides an overview of hormones known to regulate food intake in fish, emphasizing on major hormones and the main fish groups studied to date.

234 citations

Journal ArticleDOI
TL;DR: Direct genotyping by sequencing (GBS) techniques have underpinned many of the advances in aquaculture genetics and breeding to date, and have been extensively applied to generate population‐level SNP genotype data.
Abstract: Selective breeding is increasingly recognized as a key component of sustainable production of aquaculture species. The uptake of genomic technology in aquaculture breeding has traditionally lagged behind terrestrial farmed animals. However, the rapid development and application of sequencing technologies has allowed aquaculture to narrow the gap, leading to substantial genomic resources for all major aquaculture species. While high-density single-nucleotide polymorphism (SNP) arrays for some species have been developed recently, direct genotyping by sequencing (GBS) techniques have underpinned many of the advances in aquaculture genetics and breeding to date. In particular, restriction-site associated DNA sequencing (RAD-Seq) and subsequent variations have been extensively applied to generate population-level SNP genotype data. These GBS techniques are not dependent on prior genomic information such as a reference genome assembly for the species of interest. As such, they have been widely utilized by researchers and companies focussing on nonmodel aquaculture species with relatively small research communities. Applications of RAD-Seq techniques have included generation of genetic linkage maps, performing genome-wide association studies, improvements of reference genome assemblies and, more recently, genomic selection for traits of interest to aquaculture like growth, sex determination or disease resistance. In this review, we briefly discuss the history of GBS, the nuances of the various GBS techniques, bioinformatics approaches and application of these techniques to various aquaculture species.

186 citations