scispace - formally typeset
Search or ask a question

Showing papers by "Qijian Song published in 2015"


Journal ArticleDOI
TL;DR: This study identified multiple new loci and refined chromosomal regions of known loci associated with DTF, DTM, DFTM and/or PH in soybean, demonstrating that GWAS is powerful in dissecting complex traits and identifying candidate genes although LD decayed slowly in soy bean.
Abstract: Soybean (Glycine max) is a photoperiod-sensitive and self-pollinated species. Days to flowering (DTF) and maturity (DTM), duration of flowering-to-maturity (DFTM) and plant height (PH) are crucial for soybean adaptability and yield. To dissect the genetic architecture of these agronomically important traits, a population consisting of 309 early maturity soybean germplasm accessions was genotyped with the Illumina Infinium SoySNP50K BeadChip and phenotyped in multiple environments. A genome-wide association study (GWAS) was conducted using a mixed linear model that involves both relative kinship and population structure. The linkage disequilibrium (LD) decayed slowly in soybean, and a substantial difference in LD pattern was observed between euchromatic and heterochromatic regions. A total of 27, 6, 18 and 27 loci for DTF, DTM, DFTM and PH were detected via GWAS, respectively. The Dt1 gene was identified in the locus strongly associated with both DTM and PH. Ten candidate genes homologous to Arabidopsis flowering genes were identified near the peak single nucleotide polymorphisms (SNPs) associated with DTF. Four of them encode MADS-domain containing proteins. Additionally, a pectin lyase-like gene was also identified in a major-effect locus for PH where LD decayed rapidly. This study identified multiple new loci and refined chromosomal regions of known loci associated with DTF, DTM, DFTM and/or PH in soybean. It demonstrates that GWAS is powerful in dissecting complex traits and identifying candidate genes although LD decayed slowly in soybean. The loci and trait-associated SNPs identified in this study can be used for soybean genetic improvement, especially the major-effect loci associated with PH could be used to improve soybean yield potential. The candidate genes may serve as promising targets for studies of molecular mechanisms underlying the related traits in soybean.

201 citations


Journal ArticleDOI
TL;DR: A dramatic reduction of genetic diversity was detected based on linkage disequilibrium and haplotype structure analyses of the wild, landrace, and North American cultivar populations and identified candidate regions associated with domestication and selection imposed by North American breeding.
Abstract: The United States Department of Agriculture, Soybean Germplasm Collection includes 18,480 domesticated soybean and 1168 wild soybean accessions introduced from 84 countries or developed in the United States. This collection was genotyped with the SoySNP50K BeadChip containing greater than 50K single-nucleotide polymorphisms. Redundant accessions were identified in the collection, and distinct genetic backgrounds of soybean from different geographic origins were observed that could be a unique resource for soybean genetic improvement. We detected a dramatic reduction of genetic diversity based on linkage disequilibrium and haplotype structure analyses of the wild, landrace, and North American cultivar populations and identified candidate regions associated with domestication and selection imposed by North American breeding. We constructed the first soybean haplotype block maps in the wild, landrace, and North American cultivar populations and observed that most recombination events occurred in the regions between haplotype blocks. These haplotype maps are crucial for association mapping aimed at the identification of genes controlling traits of economic importance. A case-control association test delimited potential genomic regions along seven chromosomes that most likely contain genes controlling seed weight in domesticated soybean. The resulting dataset will facilitate germplasm utilization, identification of genes controlling important traits, and will accelerate the creation of soybean varieties with improved seed yield and quality.

187 citations


Journal ArticleDOI
TL;DR: The first comprehensive analysis of population structure on the collection of 14,000 soybean accessions using a 50K‐SNP chip provides a fuller understanding of the distribution of genetic variation contained within the USDA soybean collection and how it relates to phenotypic variation for economically important traits.
Abstract: Population structure analyses and genome-wide association studies (GWAS) conducted on crop germplasm collections provide valuable information on the frequency and distribution of alleles governing economically important traits. The value of these analyses is substantially enhanced when the accession numbers can be increased from ∼1,000 to ∼10,000 or more. In this research, we conducted the first comprehensive analysis of population structure on the collection of 14,000 soybean accessions [Glycine max (L.) Merr. and G. soja Siebold & Zucc.] using a 50K-SNP chip. Accessions originating from Japan were relatively homogenous and distinct from the Korean accessions. As a whole, both Japanese and Korean accessions diverged from the Chinese accessions. The ancestry of founders of the American accessions derived mostly from two Chinese subpopulations, which reflects the composition of the American accessions as a whole. A 12,000 accession GWAS conducted on seed protein and oil is the largest reported to date in plants and identified single nucleotide polymorphisms (SNPs) with strong signals on chromosomes 20 and 15. A chromosome 20 region previously reported to be important for protein and oil content was further narrowed and now contains only three plausible candidate genes. The haplotype effects show a strong negative relationship between oil and protein at this locus, indicating negative pleiotropic effects or multiple closely linked loci in repulsion phase linkage. The vast majority of accessions carry the haplotype allele conferring lower protein and higher oil. Our results provide a fuller understanding of the distribution of genetic variation contained within the USDA soybean collection and how it relates to phenotypic variation for economically important traits.

150 citations


Journal ArticleDOI
TL;DR: A total of 992,682 single-nucleotide polymorphisms (SNPs) was identified as ideal for Illumina Infinium II BeadChip design after sequencing a diverse set of 17 common bean varieties with the aid of next-generation sequencing technology.
Abstract: A total of 992,682 single-nucleotide polymorphisms (SNPs) was identified as ideal for Illumina Infinium II BeadChip design after sequencing a diverse set of 17 common bean (Phaseolus vulgaris L) varieties with the aid of next-generation sequencing technology. From these, two BeadChips each with >5000 SNPs were designed. The BARCBean6K_1 BeadChip was selected for the purpose of optimizing polymorphism among market classes and, when possible, SNPs were targeted to sequence scaffolds in the Phaseolus vulgaris 14× genome assembly with sequence lengths >10 kb. The BARCBean6K_2 BeadChip was designed with the objective of anchoring additional scaffolds and to facilitate orientation of large scaffolds. Analysis of 267 F2 plants from a cross of varieties Stampede × Red Hawk with the two BeadChips resulted in linkage maps with a total of 7040 markers including 7015 SNPs. With the linkage map, a total of 432.3 Mb of sequence from 2766 scaffolds was anchored to create the Phaseolus vulgaris v1.0 assembly, which accounted for approximately 89% of the 487 Mb of available sequence scaffolds of the Phaseolus vulgaris v0.9 assembly. A core set of 6000 SNPs (BARCBean6K_3 BeadChip) with high genotyping quality and polymorphism was selected based on the genotyping of 365 dry bean and 134 snap bean accessions with the BARCBean6K_1 and BARCBean6K_2 BeadChips. The BARCBean6K_3 BeadChip is a useful tool for genetics and genomics research and it is widely used by breeders and geneticists in the United States and abroad.

128 citations


Journal ArticleDOI
TL;DR: Using genome-wide association studies, 39 SNP markers likely tagging 21 different loci for carbon isotope ratio (δ13C) were identified in soybean.
Abstract: Key message Using genome-wide association studies, 39 SNP markers likely tagging 21 different loci for carbon isotope ratio (δ13C) were identified in soybean.

84 citations


Journal ArticleDOI
TL;DR: This study indicates that integration of population divergence analysis, genome-wide association study and expression analysis is an efficient approach to identify candidate domestication-related genes.
Abstract: Flowering time and seed size are traits related to domestication. However, identification of domestication-related loci/genes of controlling the traits in soybean is rarely reported. In this study, we identified a total of 48 domestication-related loci based on RAD-seq genotyping of a natural population comprising 286 accessions. Among these, four on chromosome 12 and additional two on chromosomes 11 and 15 were associated with flowering time, and four on chromosomes 11 and 16 were associated with seed size. Of the five genes associated with flowering time and the three genes associated with seed size, three genes Glyma11g18720, Glyma11g15480 and Glyma15g35080 were homologous to Arabidopsis genes, additional five genes were found for the first time to be associated with these two traits. Glyma11g18720 and Glyma05g28130 were co-expressed with five genes homologous to flowering time genes in Arabidopsis, and Glyma11g15480 was co-expressed with 24 genes homologous to seed development genes in Arabidopsis. This study indicates that integration of population divergence analysis, genome-wide association study and expression analysis is an efficient approach to identify candidate domestication-related genes.

58 citations


Journal ArticleDOI
TL;DR: These findings illustrate that comparative genomic analyses, especially when combined with GWAS, are a promising approach to dissect the genetic architecture of complex traits.
Abstract: Crop improvement always involves selection of specific alleles at genes controlling traits of agronomic importance, likely resulting in detectable signatures of selection within the genome of modern soybean (Glycine max L. Merr.). The identification of these signatures of selection is meaningful from the perspective of evolutionary biology and for uncovering the genetic architecture of agronomic traits. To this end, two populations of soybean, consisting of 342 landraces and 1062 improved lines, were genotyped with the SoySNP50K Illumina BeadChip containing 52,041 single nucleotide polymorphisms (SNPs), and systematically phenotyped for 9 agronomic traits. A cross-population composite likelihood ratio (XP-CLR) method was used to screen the signals of selective sweeps. A total of 125 candidate selection regions were identified, many of which harbored genes potentially involved in crop improvement. To further investigate whether these candidate regions were in fact enriched for genes affected by selection, genome-wide association studies (GWAS) were conducted on 7 selection traits targeted in soybean breeding (grain yield, plant height, lodging, maturity date, seed coat color, seed protein and oil content) and 2 non-selection traits (pubescence and flower color). Major genomic regions associated with selection traits overlapped with candidate selection regions, whereas no overlap of this kind occurred for the non-selection traits, suggesting that the selection sweeps identified are associated with traits of agronomic importance. Multiple novel loci and refined map locations of known loci related to these traits were also identified. These findings illustrate that comparative genomic analyses, especially when combined with GWAS, are a promising approach to dissect the genetic architecture of complex traits.

53 citations


Journal ArticleDOI
TL;DR: In this paper, the shoot ureide concentrations in 374 Maturity Group IV soybean accessions were determined and identified genomic regions associated with shoot Ureide concentration, including 53 putative loci on 18 chromosomes.
Abstract: Ureides are the N-rich products of N-fixation that are transported from soybean nodules to the shoot. Ureides are known to accumulate in leaves in response to water-deficit stress, and this has been used to identify genotypes with reduced N-fixation sensitivity to drought. Our objectives in this research were to determine shoot ureide concentrations in 374 Maturity Group IV soybean accessions and to identify genomic regions associated with shoot ureide concentration. The accessions were grown at two locations (Columbia, MO, and Stuttgart, AR) in 2 yr (2009 and 2010) and characterized for ureide concentration at beginning flowering to full bloom. Average shoot ureide concentrations across all four environments (two locations and two years) and 374 accessions ranged from 12.4 to 33.1 µmol g−1 and were comparable to previously reported values. SNP–ureide associations within and across the four environments were assessed using 33,957 SNPs with a MAF ≥0.03. In total, 53 putative loci on 18 chromosomes were identified as associated with ureide concentration. Two of the putative loci were located near previously reported QTL associated with ureide concentration and 30 loci were located near genes associated with ureide metabolism. The remaining putative loci were not near chromosomal regions previously associated with shoot ureide concentration and may mark new genes involved in ureide metabolism. Ultimately, confirmation of these putative loci will provide new sources of variation for use in soybean breeding programs.

39 citations


Journal ArticleDOI
TL;DR: The rr138 rI mapping population was developed from the cross of oSU5446, a susceptible line that meets current snap bean processing industry standards, and rr6950, a root rot resistant dry bean with small brown seeds, for resistance to Fusarium solani f.
Abstract: root rot diseases of bean (Phaseolus vulgaris L.) are a constraint to dry and snap bean production. We developed the rr138 rI mapping population from the cross of oSU5446, a susceptible line that meets current snap bean processing industry standards, and rr6950, a root rot resistant dry bean with small brown seeds. We evaluated the rr138 rI population beginning in the F6 generation for resistance to Fusarium solani f. sp. phaseoli (Burk.) root rot in oregon and Aphanomyces euteiches (Drechsler) root rot in Wisconsin. The population was evaluated for a set of root architecture traits at the oregon location. Fusarium solani root rot resistance was evaluated in three seasons, whereas A. euteiches resistance was evaluated in two seasons. For each disease, rr6950 was resistant and oSU 5446 was susceptible. The recombinant inbred (rI) population was normally distributed for reaction to both diseases. We assembled a high-density linkage map using 1689 single-nucleotide polymorphisms (SNps) from an Illumina 6000-SNp BArCbean6K_3 Beadchip. The map spanned 1196 cM over 11 linkage groups at a density of one SNp per 1.4 cM. Three quantitative trait loci (QTL) associated with A. euteiches were identified, each accounting for 5 to 15% of the total genetic variation, and two QTL associated with F. solani resistance accounted for 9 and 22% of the total genetic variation. A QTL for taproot diameter (TD) and one QTL for basal root angle were identified. The QTL for resistance to the two diseases mapped to different genome locations indicating a different genetic control. C.H. Hagerty, Dep. of Botany and Plant Pathology, Oregon State Univ., Corvallis, OR 97331; J.R. Myers, Dep. of Horticulture, Oregon State Univ., Corvallis, OR 97331; A. Cuesta-Marcos, Dep. of Crop and Soil Science, Oregon State Univ., Corvallis, OR 97331; P.B. Cregan and Q. Song, USDA–ARS, Soybean Genomics and Improvement Lab., Beltsville, MD 20705; P. McClean, Dep. of Plant Sciences, North Dakota State Univ., Fargo, ND 58108; S. Noffsinger, Seneca Foods Corporation, 799 E. Washington Ave., Dayton, WA 99328. Received 29 Nov. 2014. Accepted 11 May 2015. *Corresponding author ( james. myers@oregonstate.edu). Abbreviations: ARR, Aphanomyces euteiches resistance; CIM, composite interval mapping; CTAB, hexadecyltrimethyl ammonium bromide; FRR, Fusarium solani resistance; HARS, Hancock Agricultural Research Station; LOD, logarithm of the odds; LR, likelihood ratio; OSU, Oregon State University; QTL, quantitative trait loci; RAPD, random amplified polymorphic DNA; RCBD, randomized complete block design; RI, recombinant inbred; SNP, single-nucleotide polymorphism; TD, taproot diameter; VRF, Vegetable Research Farm. Published in Crop Sci. 55:1969–1977 (2015). doi: 10.2135/cropsci2014.11.0805 Freely available online through the author-supported open-access option. © Crop Science Society of America | 5585 Guilford Rd., Madison, WI 53711 USA All rights reserved. No part of this periodical may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopying, recording, or any information storage and retrieval system, without permission in writing from the publisher. Permission for printing and for reprinting the material contained herein has been obtained by the publisher. Published August 17, 2015

36 citations


Journal ArticleDOI
TL;DR: This high-density genetic map shows the utility of the BARCSoySNP6K BeadChip array and will lead to a better understanding of the genetic architecture of complex traits and identification of SNPs tightly linked to QTL for many important soybean traits.
Abstract: The objective of this study was to construct a high-density genetic map of soybean (Glycine max L. Merr) using a high-throughput single nucleotide polymorphism (SNP) genotyping on 357 F7 recombinant inbred lines from a cross of ‘Wyandot’ × PI 567301B. Of 5,403 SNP loci scored from the Infinium BARCSoySNP6K BeadChip array, 2,585 (48 %) were polymorphic between the two parents and subsequently 2,563 SNPs remained after filtering for minor allele frequency, heterozygosity, and missing data. A total of 2,545 SNPs were mapped into 20 linkage groups corresponding to the 20 chromosomes of soybean. The total length of the map was 2,346 cM with 2,213 unique SNP loci with 86 to 162 unique loci per chromosome. Average marker interval ranged from 0.9 to 1.3 cM with an overall mean of 1.1 cM, but 22 marker intervals were still greater than 10 cM. Colinear relationship was observed between genetic (cM) and physical positions (Mb) of SNPs for most of the genome, highlighting the improvements in the updated soybean genome assembly Glyma.Wm82.a2 in comparison with the previous Glyma.Wm82.a1. This high-density genetic map shows the utility of the BARCSoySNP6K BeadChip array and will lead to a better understanding of the genetic architecture of complex traits and identification of SNPs tightly linked to QTL for many important soybean traits.

32 citations


Journal ArticleDOI
19 May 2015-PLOS ONE
TL;DR: Based on phenotype, genotype and haplotype analysis results, these two soybean accessions might carry different resistance alleles of the same gene or two different gene(s) that are controlled by a novel gene on chromosome 13 that is different from Rcs3.
Abstract: Frogeye leaf spot (FLS), caused by the fungus Cercospora sojina K. Hara, may cause a significant yield loss to soybean growers in regions with a warm and humid climate. Two soybean accessions, PI 594891 and PI 594774, were identified to carry a high level of resistance similar to that conditioned by the Rcs3 gene in 'Davis'. Previously, we reported that the resistance to FLS in these two plant introductions (PIs) was controlled by a novel gene (s) on chromosome 13 that is different from Rcs3. To fine-map the novel FLS resistance gene(s) in these two PIs, F2: 3 seeds from the crosses between PI 594891 and PI 594774, and the FLS susceptible genotype 'Blackhawk' were genotyped with SNP markers that were designed based on the SoySNP50k iSelect BeadChip data to identify recombinant events and locate candidate genes. Analysis of lines possessing key recombination events helped narrow down the FLS-resistance genomic region in PI 594891 from 3.3 Mb to a 72.6 kb region with five annotated genes. The resistance gene in PI 594774 was fine-mapped into a 540 kb region that encompasses the 72.6 kb region found in PI 594891. Sequencing five candidate genes in PI 594891 identified three genes that have several mutations in the promoter, intron, 5', and 3' UTR regions. qPCR analysis showed a difference in expression levels of these genes in both lines compared to Blackhawk in the presence of C. sojina. Based on phenotype, genotype and haplotype analysis results, these two soybean accessions might carry different resistance alleles of the same gene or two different gene(s). The identified SNPs were used to develop Kompetitive Allele Specific PCR (KASP) assays to detect the resistance alleles on chromosome 13 from the two PIs for marker-assisted selection.

Journal ArticleDOI
TL;DR: Correlation analysis indicated that there was a highly positive correlation between cysteine and methionine contents, and most of the genomic regions linked to the QTLs for the sulfur-containing amino acids were also associated with protein content.
Abstract: Soybean (Glycine max (L.) Merr.) is a major source of plant protein for humans and livestock. Deficiency of sulfur-containing amino acids (cysteine and methionine) in soybean protein is a main limitation of soybean meal as an animal feed ingredient. The objectives of this study were to identify and validate quantitative trait loci (QTLs) associated with cysteine and methionine contents in two recombinant inbred line (RIL) populations, and to analyze the genetic effects of individual QTLs and QTL combinations in soybean. Both the mapping population of SD02-4-59 × A02-381100 and validation population of SD02-911 × SD00-1501 were evaluated for cysteine and methionine contents in multiple environments. Correlation analysis indicated that there was a highly positive correlation between cysteine and methionine contents. Significant positive correlations were also observed between the sulfur-containing amino acid contents and protein content. In the mapping population, eight QTLs for both cysteine and methionine contents were consistently detected in any individual environment and the average data over all three environments. Three of these QTLs were confirmed in the validation population. A comparison with the previous studies indicated that most of the genomic regions linked to the QTLs for the sulfur-containing amino acids were also associated with protein content. Cumulative effects of multiple QTLs for the traits were significant in both populations. This information should be useful for the improvement of the levels of protein and amino acids in soybean seeds.

Journal ArticleDOI
TL;DR: The de novo assembled draft genome sequence of the P. longicolla type strain TWH P74 (ATCC 60325), which was originally isolated by Hobbs et al. from soybean seed in Ohio in 1983, is reported.
Abstract: Phomopsis longicolla is the primary cause of Phomopsis seed decay in soybean. Here, we report the de novo assembled draft genome sequence of the P. longicolla type strain TWH P74 (ATCC 60325), which was originally isolated by Hobbs et al. from soybean seed in Ohio in 1983.