Author
Shengyi Liu
Other affiliations: Chinese Ministry of Agriculture, Hubei University
Bio: Shengyi Liu is an academic researcher from Crops Research Institute. The author has contributed to research in topics: Genome & Gene. The author has an hindex of 26, co-authored 73 publications receiving 6317 citations. Previous affiliations of Shengyi Liu include Chinese Ministry of Agriculture & Hubei University.
Topics: Genome, Gene, Biology, Medicine, Brassica rapa
Papers
More filters
••
Civil Aviation Authority of Singapore1, Rothamsted Research2, Beijing Institute of Genomics3, University of Copenhagen4, Rural Development Administration5, John Innes Centre6, North China University of Science and Technology7, University of Georgia8, University of California, Berkeley9, University of Missouri10, University of Queensland11, Australian Research Council12, National Research Council13, Bielefeld University14, Australian Centre for Plant Functional Genomics15, University of Rennes16, Wageningen University and Research Centre17, Agriculture and Agri-Food Canada18, Huazhong Agricultural University19, French Alternative Energies and Atomic Energy Commission20, Chungnam National University21, Norwich Research Park22
TL;DR: The annotation and analysis of the draft genome sequence of Brassica rapa accession Chiifu-401-42, a Chinese cabbage, and used Arabidopsis thaliana as an outgroup for investigating the consequences of genome triplication, such as structural and functional evolution.
Abstract: We report the annotation and analysis of the draft genome sequence of Brassica rapa accession Chiifu-401-42, a Chinese cabbage. We modeled 41,174 protein coding genes in the B. rapa genome, which has undergone genome triplication. We used Arabidopsis thaliana as an outgroup for investigating the consequences of genome triplication, such as structural and functional evolution. The extent of gene loss (fractionation) among triplicated genome segments varies, with one of the three copies consistently retaining a disproportionately large fraction of the genes expected to have been present in its ancestor. Variation in the number of members of gene families present in the genome may contribute to the remarkable morphological plasticity of Brassica species. The B. rapa genome sequence provides an important resource for studying the evolution of polyploid genomes and underpins the genetic improvement of Brassica oil and vegetable crops.
1,811 citations
••
University of Évry Val d'Essonne1, Crops Research Institute2, Agriculture and Agri-Food Canada3, Fujian Agriculture and Forestry University4, J. Craig Venter Institute5, Plant Genome Mapping Laboratory6, University of Giessen7, French Alternative Energies and Atomic Energy Commission8, Institut national de la recherche agronomique9, National Research Council10, Australian Centre for Plant Functional Genomics11, University of Cologne12, Purdue University13, University of California, Berkeley14, University of British Columbia15, Fondation Jean Dausset Centre d'Etude du Polymorphisme Humain16, Huazhong Agricultural University17, Hunan Agricultural University18, Chungnam National University19, University of Arizona20, University of York21, University of Missouri22, Southern Cross University23, University of Western Australia24, Centre national de la recherche scientifique25
TL;DR: The polyploid genome of Brassica napus, which originated from a recent combination of two distinct genomes approximately 7500 years ago and gave rise to the crops of rape oilseed, is sequenced.
Abstract: Oilseed rape (Brassica napus L.) was formed ~7500 years ago by hybridization between B. rapa and B. oleracea, followed by chromosome doubling, a process known as allopolyploidy. Together with more ancient polyploidizations, this conferred an aggregate 72× genome multiplication since the origin of angiosperms and high gene content. We examined the B. napus genome and the consequences of its recent duplication. The constituent An and Cn subgenomes are engaged in subtle structural, functional, and epigenetic cross-talk, with abundant homeologous exchanges. Incipient gene loss and expression divergence have begun. Selection in B. napus oilseed types has accelerated the loss of glucosinolate genes, while preserving expansion of oil biosynthesis genes. These processes provide insights into allopolyploid evolution and its relationship with crop domestication and improvement.
1,743 citations
••
Crops Research Institute1, Australian Centre for Plant Functional Genomics2, Agriculture and Agri-Food Canada3, Purdue University4, Plant Genome Mapping Laboratory5, Southwest University6, University of York7, Seoul National University8, Southern Cross University9, University of Missouri10, Centre national de la recherche scientifique11, Huazhong Agricultural University12, Hunan Agricultural University13, University of Queensland14, National Research Council15, Central University, India16, Sahmyook University17, King Abdulaziz University18
TL;DR: A draft genome sequence of Brassica oleracea is described, comparing it with that of its sister species B. rapa to reveal numerous chromosome rearrangements and asymmetrical gene loss in duplicated genomic blocks.
Abstract: Polyploidization has provided much genetic variation for plant adaptive evolution, but the mechanisms by which the molecular evolution of polyploid genomes establishes genetic architecture underlying species differentiation are unclear Brassica is an ideal model to increase knowledge of polyploid evolution Here we describe a draft genome sequence of Brassica oleracea, comparing it with that of its sister species B rapa to reveal numerous chromosome rearrangements and asymmetrical gene loss in duplicated genomic blocks, asymmetrical amplification of transposable elements, differential gene co-retention for specific pathways and variation in gene expression, including alternative splicing, among a large number of paralogous and orthologous genes Genes related to the production of anticancer phytochemicals and morphological variations illustrate consequences of genome duplication and gene divergence, imparting biochemical and morphological variation to B oleracea This study provides insights into Brassica genome evolution and will underpin research into the many important crops in this genus
884 citations
••
TL;DR: BRAD, a new database which focuses on the genetics and genomics of the Brassica plants has been developed, it aims at helping scientists and breeders to fully and efficiently use the information of genome data of Brassicas plants.
Abstract: Brassica species include both vegetable and oilseed crops, which are very important to the daily life of common human beings. Meanwhile, the Brassica species represent an excellent system for studying numerous aspects of plant biology, specifically for the analysis of genome evolution following polyploidy, so it is also very important for scientific research. Now, the genome of Brassica rapa has already been assembled, it is the time to do deep mining of the genome data.
412 citations
••
Fujian Agriculture and Forestry University1, University of Florida2, International Crops Research Institute for the Semi-Arid Tropics3, National Cheng Kung University4, North China University of Science and Technology5, University of Western Australia6, Agricultural Research Service7, Crops Research Institute8, Zhongkai University of Agriculture and Engineering9, Tsinghua University10, Shenzhen University11, Tuskegee University12, Osmania University13, National Tsing Hua University14, Plant Genome Mapping Laboratory15, University of Illinois at Urbana–Champaign16
TL;DR: High-quality genome sequence of cultivated peanut provides insights into genome evolution and the genetic mechanisms underlying seed size and leaf resistance in peanut, providing a cornerstone for functional genomics and peanut improvement.
Abstract: High oil and protein content make tetraploid peanut a leading oil and food legume. Here we report a high-quality peanut genome sequence, comprising 2.54 Gb with 20 pseudomolecules and 83,709 protein-coding gene models. We characterize gene functional groups implicated in seed size evolution, seed oil content, disease resistance and symbiotic nitrogen fixation. The peanut B subgenome has more genes and general expression dominance, temporally associated with long-terminal-repeat expansion in the A subgenome that also raises questions about the A-genome progenitor. The polyploid genome provided insights into the evolution of Arachis hypogaea and other legume chromosomes. Resequencing of 52 accessions suggests that independent domestications formed peanut ecotypes. Whereas 0.42–0.47 million years ago (Ma) polyploidy constrained genetic variation, the peanut genome sequence aids mapping and candidate-gene discovery for traits such as seed size and color, foliar disease resistance and others, also providing a cornerstone for functional genomics and peanut improvement.
320 citations
Cited by
More filters
01 Jan 2000
3,536 citations
•
TL;DR: FastTree as mentioned in this paper uses sequence profiles of internal nodes in the tree to implement neighbor-joining and uses heuristics to quickly identify candidate joins, then uses nearest-neighbor interchanges to reduce the length of the tree.
Abstract: Gene families are growing rapidly, but standard methods for inferring phylogenies do not scale to alignments with over 10,000 sequences. We present FastTree, a method for constructing large phylogenies and for estimating their reliability. Instead of storing a distance matrix, FastTree stores sequence profiles of internal nodes in the tree. FastTree uses these profiles to implement neighbor-joining and uses heuristics to quickly identify candidate joins. FastTree then uses nearest-neighbor interchanges to reduce the length of the tree. For an alignment with N sequences, L sites, and a different characters, a distance matrix requires O(N^2) space and O(N^2 L) time, but FastTree requires just O( NLa + N sqrt(N) ) memory and O( N sqrt(N) log(N) L a ) time. To estimate the tree's reliability, FastTree uses local bootstrapping, which gives another 100-fold speedup over a distance matrix. For example, FastTree computed a tree and support values for 158,022 distinct 16S ribosomal RNAs in 17 hours and 2.4 gigabytes of memory. Just computing pairwise Jukes-Cantor distances and storing them, without inferring a tree or bootstrapping, would require 17 hours and 50 gigabytes of memory. In simulations, FastTree was slightly more accurate than neighbor joining, BIONJ, or FastME; on genuine alignments, FastTree's topologies had higher likelihoods. FastTree is available at http://microbesonline.org/fasttree.
2,436 citations
01 Jan 2011
TL;DR: The sheer volume and scope of data posed by this flood of data pose a significant challenge to the development of efficient and intuitive visualization tools able to scale to very large data sets and to flexibly integrate multiple data types, including clinical data.
Abstract: Rapid improvements in sequencing and array-based platforms are resulting in a flood of diverse genome-wide data, including data from exome and whole-genome sequencing, epigenetic surveys, expression profiling of coding and noncoding RNAs, single nucleotide polymorphism (SNP) and copy number profiling, and functional assays. Analysis of these large, diverse data sets holds the promise of a more comprehensive understanding of the genome and its relation to human disease. Experienced and knowledgeable human review is an essential component of this process, complementing computational approaches. This calls for efficient and intuitive visualization tools able to scale to very large data sets and to flexibly integrate multiple data types, including clinical data. However, the sheer volume and scope of data pose a significant challenge to the development of such tools.
2,187 citations
••
University of Évry Val d'Essonne1, Crops Research Institute2, Agriculture and Agri-Food Canada3, Fujian Agriculture and Forestry University4, J. Craig Venter Institute5, Plant Genome Mapping Laboratory6, University of Giessen7, French Alternative Energies and Atomic Energy Commission8, Institut national de la recherche agronomique9, National Research Council10, Australian Centre for Plant Functional Genomics11, University of Cologne12, Purdue University13, University of California, Berkeley14, University of British Columbia15, Fondation Jean Dausset Centre d'Etude du Polymorphisme Humain16, Huazhong Agricultural University17, Hunan Agricultural University18, Chungnam National University19, University of Arizona20, University of York21, University of Missouri22, Southern Cross University23, University of Western Australia24, Centre national de la recherche scientifique25
TL;DR: The polyploid genome of Brassica napus, which originated from a recent combination of two distinct genomes approximately 7500 years ago and gave rise to the crops of rape oilseed, is sequenced.
Abstract: Oilseed rape (Brassica napus L.) was formed ~7500 years ago by hybridization between B. rapa and B. oleracea, followed by chromosome doubling, a process known as allopolyploidy. Together with more ancient polyploidizations, this conferred an aggregate 72× genome multiplication since the origin of angiosperms and high gene content. We examined the B. napus genome and the consequences of its recent duplication. The constituent An and Cn subgenomes are engaged in subtle structural, functional, and epigenetic cross-talk, with abundant homeologous exchanges. Incipient gene loss and expression divergence have begun. Selection in B. napus oilseed types has accelerated the loss of glucosinolate genes, while preserving expansion of oil biosynthesis genes. These processes provide insights into allopolyploid evolution and its relationship with crop domestication and improvement.
1,743 citations
10 Dec 2007
TL;DR: The experiments on both rice and human genome sequences demonstrate that EVM produces automated gene structure annotation approaching the quality of manual curation.
Abstract: EVidenceModeler (EVM) is presented as an automated eukaryotic gene structure annotation tool that reports eukaryotic gene structures as a weighted consensus of all available evidence. EVM, when combined with the Program to Assemble Spliced Alignments (PASA), yields a comprehensive, configurable annotation system that predicts protein-coding genes and alternatively spliced isoforms. Our experiments on both rice and human genome sequences demonstrate that EVM produces automated gene structure annotation approaching the quality of manual curation.
1,528 citations