scispace - formally typeset
Search or ask a question
Journal ArticleDOI

Prediction of Total Genetic Value Using Genome-Wide Dense Marker Maps

01 Apr 2001-Genetics (Genetics Society of America)-Vol. 157, Iss: 4, pp 1819-1829
TL;DR: It was concluded that selection on genetic values predicted from markers could substantially increase the rate of genetic gain in animals and plants, especially if combined with reproductive techniques to shorten the generation interval.
Abstract: Recent advances in molecular genetic techniques will make dense marker maps available and genotyping many individuals for these markers feasible. Here we attempted to estimate the effects of ∼50,000 marker haplotypes simultaneously from a limited number of phenotypic records. A genome of 1000 cM was simulated with a marker spacing of 1 cM. The markers surrounding every 1-cM region were combined into marker haplotypes. Due to finite population size (Ne = 100), the marker haplotypes were in linkage disequilibrium with the QTL located between the markers. Using least squares, all haplotype effects could not be estimated simultaneously. When only the biggest effects were included, they were overestimated and the accuracy of predicting genetic values of the offspring of the recorded animals was only 0.32. Best linear unbiased prediction of haplotype effects assumed equal variances associated to each 1-cM chromosomal segment, which yielded an accuracy of 0.73, although this assumption was far from true. Bayesian methods that assumed a prior distribution of the variance associated with each chromosome segment increased this accuracy to 0.85, even when the prior was not correct. It was concluded that selection on genetic values predicted from markers could substantially increase the rate of genetic gain in animals and plants, especially if combined with reproductive techniques to shorten the generation interval.
Citations
More filters
Journal ArticleDOI
08 Oct 2009-Nature
TL;DR: This paper examined potential sources of missing heritability and proposed research strategies, including and extending beyond current genome-wide association approaches, to illuminate the genetics of complex diseases and enhance its potential to enable effective disease prevention or treatment.
Abstract: Genome-wide association studies have identified hundreds of genetic variants associated with complex human diseases and traits, and have provided valuable insights into their genetic architecture. Most variants identified so far confer relatively small increments in risk, and explain only a small proportion of familial clustering, leading many to question how the remaining, 'missing' heritability can be explained. Here we examine potential sources of missing heritability and propose research strategies, including and extending beyond current genome-wide association approaches, to illuminate the genetics of complex diseases and enhance its potential to enable effective disease prevention or treatment.

7,797 citations

Journal ArticleDOI
TL;DR: Efficient methods for processing genomic data were developed to increase reliability of estimated breeding values and to estimate thousands of marker effects simultaneously, and a blend of first- and second-order Jacobi iteration using 2 separate relaxation factors converged well for allele frequencies and effects.

4,196 citations


Cites background or methods from "Prediction of Total Genetic Value U..."

  • ...costs from quadratic with number of markers (Meuwissen et al., 2001) to linear....

    [...]

  • ...Genomic selection increases the rate of genetic improvement and reduces cost of progeny testing by allowing breeders to preselect animals that inherited chromosome segments of greater merit (Meuwissen et al., 2001; Schaeffer, 2006)....

    [...]

  • ...Nonlinear predictions A and B are analogous but not identical to Bayesian A and B methods of Meuwissen et al. (2001), and other prior distributions could fit actual data better....

    [...]

  • ...Genomic selection increases the rate of genetic improvement and reduces cost of progeny testing by allowing breeders to preselect animals that inherited chromosome segments of greater merit (Meuwissen et al., 2001; Schaeffer, 2006)....

    [...]

  • ...That procedure reduced computing Journal of Dairy Science Vol. 91 No. 11, 2008 costs from quadratic with number of markers (Meuwissen et al., 2001) to linear....

    [...]

Journal ArticleDOI
TL;DR: An R package called GAPIT is developed that implements advanced statistical methods including the compressed mixed linear model (CMLM) and CMLM-based genomic prediction and selection and can handle large datasets in excess of 10 000 individuals and 1 million single-nucleotide polymorphisms with minimal computational time.
Abstract: Summary: Software programs that conduct genome-wide association studies and genomic prediction and selection need to use methodologies that maximize statistical power, provide high prediction accuracy and run in a computationally efficient manner. We developed an R package called Genome Association and Prediction Integrated Tool (GAPIT) that implements advanced statistical methods including the compressed mixed linear model (CMLM) and CMLM-based genomic prediction and selection. The GAPIT package can handle large datasets in excess of 10 000 individuals and 1 million single-nucleotide polymorphisms with minimal computational time, while providing user-friendly access and concise tables and graphs to interpret

1,583 citations

Journal ArticleDOI
28 Feb 2012-PLOS ONE
TL;DR: The GBS approach presented here provides a powerful method of developing high-density markers in species without a sequenced genome while providing valuable tools for anchoring and ordering physical maps and whole-genome shotgun sequence.
Abstract: Advancements in next-generation sequencing technology have enabled whole genome re-sequencing in many species providing unprecedented discovery and characterization of molecular polymorphisms. There are limitations, however, to next-generation sequencing approaches for species with large complex genomes such as barley and wheat. Genotyping-by-sequencing (GBS) has been developed as a tool for association studies and genomics-assisted breeding in a range of species including those with complex genomes. GBS uses restriction enzymes for targeted complexity reduction followed by multiplex sequencing to produce high-quality polymorphism data at a relatively low per sample cost. Here we present a GBS approach for species that currently lack a reference genome sequence. We developed a novel two-enzyme GBS protocol and genotyped bi-parental barley and wheat populations to develop a genetically anchored reference map of identified SNPs and tags. We were able to map over 34,000 SNPs and 240,000 tags onto the Oregon Wolfe Barley reference map, and 20,000 SNPs and 367,000 tags on the Synthetic W9784 × Opata85 (SynOpDH) wheat reference map. To further evaluate GBS in wheat, we also constructed a de novo genetic map using only SNP markers from the GBS data. The GBS approach presented here provides a powerful method of developing high-density markers in species without a sequenced genome while providing valuable tools for anchoring and ordering physical maps and whole-genome shotgun sequence. Development of the sequenced reference genome(s) will in turn increase the utility of GBS data enabling physical mapping of genes and haplotype imputation of missing data. Finally, as a result of low per-sample costs, GBS will have broad application in genomics-assisted plant breeding programs.

1,492 citations

Journal ArticleDOI
TL;DR: A new software package for R called rrBLUP, which is a fast maximum‐likelihood algorithm for mixed models with a single variance component besides the residual error, which allows for efficient prediction with unreplicated training data.
Abstract: Many important traits in plant breeding are polygenic and therefore recalcitrant to traditional marker-assisted selection. Genomic selection addresses this complexity by including all markers in the prediction model. A key method for the genomic prediction of breeding values is ridge regression (RR), which is equivalent to best linear unbiased prediction (BLUP) when the genetic covariance between lines is proportional to their similarity in genotype space. This additive model can be broadened to include epistatic effects by using other kernels, such as the Gaussian, which represent inner products in a complex feature space. To facilitate the use of RR and nonadditive kernels in plant breeding, a new software package for R called rrBLUP has been developed. At its core is a fast maximum-likelihood algorithm for mixed models with a single variance component besides the residual error, which allows for effi cient prediction with unreplicated training data. Use of the rrBLUP software is demonstrated through several examples, including the identifi cation of optimal crosses based on superior progeny value. In cross-validation tests, the prediction accuracy with nonadditive kernels was signifi cantly higher than RR for wheat (Triticum aestivum L.) grain yield but equivalent for several maize (Zea mays L.) traits. THE ABILITY TO PREDICT COMPLEX TRAITS from marker data is becoming increasingly important in plant breeding (Bernardo, 2008). Th e earliest attempts, now over 20 years old, involved fi rst identifying signifi cant markers and then combining them in a multiple regression model (Lande and Th ompson, 1990). Th e focus over the last decade has been on genomic selection methods, in which all markers are included in the prediction model (Bernardo and Yu, 2007; Heff ner et al., 2009; Jannink et al., 2010). One of the fi rst methods proposed for genomic selection was ridge regression (RR), which is equivalent to best linear unbiased prediction (BLUP) in the context of mixed models (Whittaker et al., 2000; Meuwissen et al., 2001). Th e basic RR-BLUP model is

1,475 citations


Cites methods from "Prediction of Total Genetic Value U..."

  • ...One of the fi rst methods proposed for genomic selection was ridge regression (RR), which is equivalent to best linear unbiased prediction (BLUP) in the context of mixed models (Whittaker et al., 2000; Meuwissen et al., 2001)....

    [...]

References
More filters
Book
01 Jan 1981
TL;DR: The genetic constitution of a population: Hardy-Weinberg equilibrium and changes in gene frequency: migration mutation, changes of variance, and heritability are studied.
Abstract: Part 1 Genetic constitution of a population: Hardy-Weinberg equilibrium. Part 2 Changes in gene frequency: migration mutation. Part 3 Small populations - changes in gene frequency under simplified conditions. Part 4 Small populations - less simplified conditions. Part 5 Small populations - pedigreed populations and close inbreeding. Part 6 Continuous variation. Part 7 Values and means. Part 8 Variance. Part 9 Resemblance between relatives. Part 10 Heritability. Part 11 Selection - the response and its prediction. Part 12 Selection - the results of experiments. Part 13 Selection - information from relatives. Part 14 Inbreeding and crossbreeding - changes of mean value. Part 15 Inbreeding and crossbreeding - changes of variance. Part 16 Inbreeding and crossbreeding - applications. Part 17 Scale. Part 18 Threshold characters. Part 19 Correlated characters. Part 20 Metric characters under natural selection.

20,288 citations

BookDOI
TL;DR: The Markov Chain Monte Carlo Implementation Results Summary and Discussion MEDICAL MONITORING Introduction Modelling Medical Monitoring Computing Posterior Distributions Forecasting Model Criticism Illustrative Application Discussion MCMC for NONLINEAR HIERARCHICAL MODELS.
Abstract: INTRODUCING MARKOV CHAIN MONTE CARLO Introduction The Problem Markov Chain Monte Carlo Implementation Discussion HEPATITIS B: A CASE STUDY IN MCMC METHODS Introduction Hepatitis B Immunization Modelling Fitting a Model Using Gibbs Sampling Model Elaboration Conclusion MARKOV CHAIN CONCEPTS RELATED TO SAMPLING ALGORITHMS Markov Chains Rates of Convergence Estimation The Gibbs Sampler and Metropolis-Hastings Algorithm INTRODUCTION TO GENERAL STATE-SPACE MARKOV CHAIN THEORY Introduction Notation and Definitions Irreducibility, Recurrence, and Convergence Harris Recurrence Mixing Rates and Central Limit Theorems Regeneration Discussion FULL CONDITIONAL DISTRIBUTIONS Introduction Deriving Full Conditional Distributions Sampling from Full Conditional Distributions Discussion STRATEGIES FOR IMPROVING MCMC Introduction Reparameterization Random and Adaptive Direction Sampling Modifying the Stationary Distribution Methods Based on Continuous-Time Processes Discussion IMPLEMENTING MCMC Introduction Determining the Number of Iterations Software and Implementation Output Analysis Generic Metropolis Algorithms Discussion INFERENCE AND MONITORING CONVERGENCE Difficulties in Inference from Markov Chain Simulation The Risk of Undiagnosed Slow Convergence Multiple Sequences and Overdispersed Starting Points Monitoring Convergence Using Simulation Output Output Analysis for Inference Output Analysis for Improving Efficiency MODEL DETERMINATION USING SAMPLING-BASED METHODS Introduction Classical Approaches The Bayesian Perspective and the Bayes Factor Alternative Predictive Distributions How to Use Predictive Distributions Computational Issues An Example Discussion HYPOTHESIS TESTING AND MODEL SELECTION Introduction Uses of Bayes Factors Marginal Likelihood Estimation by Importance Sampling Marginal Likelihood Estimation Using Maximum Likelihood Application: How Many Components in a Mixture? Discussion Appendix: S-PLUS Code for the Laplace-Metropolis Estimator MODEL CHECKING AND MODEL IMPROVEMENT Introduction Model Checking Using Posterior Predictive Simulation Model Improvement via Expansion Example: Hierarchical Mixture Modelling of Reaction Times STOCHASTIC SEARCH VARIABLE SELECTION Introduction A Hierarchical Bayesian Model for Variable Selection Searching the Posterior by Gibbs Sampling Extensions Constructing Stock Portfolios With SSVS Discussion BAYESIAN MODEL COMPARISON VIA JUMP DIFFUSIONS Introduction Model Choice Jump-Diffusion Sampling Mixture Deconvolution Object Recognition Variable Selection Change-Point Identification Conclusions ESTIMATION AND OPTIMIZATION OF FUNCTIONS Non-Bayesian Applications of MCMC Monte Carlo Optimization Monte Carlo Likelihood Analysis Normalizing-Constant Families Missing Data Decision Theory Which Sampling Distribution? Importance Sampling Discussion STOCHASTIC EM: METHOD AND APPLICATION Introduction The EM Algorithm The Stochastic EM Algorithm Examples GENERALIZED LINEAR MIXED MODELS Introduction Generalized Linear Models (GLMs) Bayesian Estimation of GLMs Gibbs Sampling for GLMs Generalized Linear Mixed Models (GLMMs) Specification of Random-Effect Distributions Hyperpriors and the Estimation of Hyperparameters Some Examples Discussion HIERARCHICAL LONGITUDINAL MODELLING Introduction Clinical Background Model Detail and MCMC Implementation Results Summary and Discussion MEDICAL MONITORING Introduction Modelling Medical Monitoring Computing Posterior Distributions Forecasting Model Criticism Illustrative Application Discussion MCMC FOR NONLINEAR HIERARCHICAL MODELS Introduction Implementing MCMC Comparison of Strategies A Case Study from Pharmacokinetics-Pharmacodynamics Extensions and Discussion BAYESIAN MAPPING OF DISEASE Introduction Hypotheses and Notation Maximum Likelihood Estimation of Relative Risks Hierarchical Bayesian Model of Relative Risks Empirical Bayes Estimation of Relative Risks Fully Bayesian Estimation of Relative Risks Discussion MCMC IN IMAGE ANALYSIS Introduction The Relevance of MCMC to Image Analysis Image Models at Different Levels Methodological Innovations in MCMC Stimulated by Imaging Discussion MEASUREMENT ERROR Introduction Conditional-Independence Modelling Illustrative examples Discussion GIBBS SAMPLING METHODS IN GENETICS Introduction Standard Methods in Genetics Gibbs Sampling Approaches MCMC Maximum Likelihood Application to a Family Study of Breast Cancer Conclusions MIXTURES OF DISTRIBUTIONS: INFERENCE AND ESTIMATION Introduction The Missing Data Structure Gibbs Sampling Implementation Convergence of the Algorithm Testing for Mixtures Infinite Mixtures and Other Extensions AN ARCHAEOLOGICAL EXAMPLE: RADIOCARBON DATING Introduction Background to Radiocarbon Dating Archaeological Problems and Questions Illustrative Examples Discussion Index

7,399 citations

Book
01 Jan 1996
TL;DR: This book discusses the genetic Basis of Quantitative Variation, Properties of Distributions, Covariance, Regression, and Correlation, and Properties of Single Loci, and Sources of Genetic Variation for Multilocus Traits.
Abstract: I. The Genetic Basis of Quantitative Variation - An Overview of Quantitative Genetics - Properties of Distributions - Covariance, Regression, and Correlation - Properties of Single Loci - Sources of Genetic Variation for Multilocus Traits - Sources of Environmental Variation - Resemblance Between Relatives - Introduction to Matrix Algebra and Linear Models - Analysis of Line Crosses - Inbreeding Depression - Matters of Scale - II. Quantitative-Trait Loci - Polygenes and Polygenic Mutation - Detecting Major Genes - Basic Concepts of Marker-Based Analysis - Mapping and Characterizing QTLs: Inbred-Line Crosses - Mapping and Characterizing QTLs: Outbred Populations - III. Estimation Procedures - Parent-Offspring Regression - Sib AnalysisTwins and Clones - Cross-Classified Designs - Correlations Between Characters - Genotype x Environment Interaction - Maternal Effects Sex Linkage and Sexual Dimorphism - Threshold Characters - Estimation of Breeding Values - Variance-Component Estimation with Complex Pedigrees - Appendices - Expectations, Variances and Covariances of Compound Variables - Path Analysis - Matrix Algebra and Linear Models - Maximum Likelihood Estimation and Likelihood-Ratio Tests - Estimation of Power of Statistical Tests -

6,530 citations

Journal ArticleDOI
TL;DR: An introduction to population genetics theory, An introduction to Population Genetics Theory, Population Genetics theory, Population genetics theory as discussed by the authors, Population genetics, population genetics, and population genetics theories, Population Genetic Theory
Abstract: An introduction to population genetics theory , An introduction to population genetics theory , مرکز فناوری اطلاعات و اطلاع رسانی کشاورزی

4,817 citations

Book
01 Jan 1970
TL;DR: An introduction to population genetics theory, An introduction to Population Genetics theory, and more.
Abstract: An introduction to population genetics theory , An introduction to population genetics theory , مرکز فناوری اطلاعات و اطلاع رسانی کشاورزی

4,273 citations


Additional excerpts

  • ...…follows a binomial distribution with probability p yielding E(s2gi|s 5 0) 5 0, and probabilityf(p) 5 K/[p(1 2 p)] (1 2 p) yielding E(s2gi|s 5 1), we have (Crow and Kimura 1970), where p 5 allele frequency, Vs[E(s2gi|s)] 5 p(1 2 p) [E(s2gi|s 5 1)]2 5 4.98 3 1027.and the constant K 5 0.5/ln(2Ne 2 1)....

    [...]