scispace - formally typeset
Search or ask a question

Showing papers in "Genetics in 2008"


Journal ArticleDOI
01 Oct 2008-Genetics
TL;DR: It is shown that the inclusion of isolated populations that underwent a strong bottleneck can lead to a high rate of false positives, and it is demonstrated that it is possible to avoid them by carefully choosing the populations that should be included in the analysis.
Abstract: Identifying loci under natural selection from genomic surveys is of great interest in different research areas. Commonly used methods to separate neutral effects from adaptive effects are based on locus-specific population differentiation coefficients to identify outliers. Here we extend such an approach to estimate directly the probability that each locus is subject to selection using a Bayesian method. We also extend it to allow the use of dominant markers like AFLPs. It has been shown that this model is robust to complex demographic scenarios for neutral genetic differentiation. Here we show that the inclusion of isolated populations that underwent a strong bottleneck can lead to a high rate of false positives. Nevertheless, we demonstrate that it is possible to avoid them by carefully choosing the populations that should be included in the analysis. We analyze two previously published data sets: a human data set of codominant markers and a Littorina saxatilis data set of dominant markers. We also perform a detailed sensitivity study to compare the power of the method using amplified fragment length polymorphism (AFLP), SNP, and microsatellite markers. The method has been implemented in a new software available at our website (http://www-leca.ujf-grenoble.fr/logiciels.htm).

2,366 citations


Journal ArticleDOI
01 Mar 2008-Genetics
TL;DR: A new method, efficient mixed-model association (EMMA), which corrects for population structure and genetic relatedness in model organism association mapping and takes advantage of the specific nature of the optimization problem in applying mixed models for association mapping, which allows for substantially increase the computational speed and reliability of the results.
Abstract: Genomewide association mapping in model organisms such as inbred mouse strains is a promising approach for the identification of risk factors related to human diseases. However, genetic association studies in inbred model organisms are confronted by the problem of complex population structure among strains. This induces inflated false positive rates, which cannot be corrected using standard approaches applied in human association studies such as genomic control or structured association. Recent studies demonstrated that mixed models successfully correct for the genetic relatedness in association mapping in maize and Arabidopsis panel data sets. However, the currently available mixed-model methods suffer from computational inefficiency. In this article, we propose a new method, efficient mixed-model association (EMMA), which corrects for population structure and genetic relatedness in model organism association mapping. Our method takes advantage of the specific nature of the optimization problem in applying mixed models for association mapping, which allows us to substantially increase the computational speed and reliability of the results. We applied EMMA to in silico whole-genome association mapping of inbred mouse strains involving hundreds of thousands of SNPs, in addition to Arabidopsis and maize data sets. We also performed extensive simulation studies to estimate the statistical power of EMMA under various SNP effects, varying degrees of population structure, and differing numbers of multiple measurements per strain. Despite the limited power of inbred mouse association mapping due to the limited number of available inbred strains, we are able to identify significantly associated SNPs, which fall into known QTL or genes identified through previous studies while avoiding an inflation of false positives. An R package implementation and webserver of our EMMA method are publicly available.

1,765 citations


Journal ArticleDOI
01 Jan 2008-Genetics
TL;DR: The genetic and statistical properties of the nested association mapping (NAM) design currently being implemented in maize is investigated to dissect the genetic basis of complex quantitative traits.
Abstract: We investigated the genetic and statistical properties of the nested association mapping (NAM) design currently being implemented in maize (26 diverse founders and 5000 distinct immortal genotypes) to dissect the genetic basis of complex quantitative traits. The NAM design simultaneously exploits the advantages of both linkage analysis and association mapping. We demonstrated the power of NAM for high-power cost-effective genome scans through computer simulations based on empirical marker data and simulated traits with different complexities. With common-parent-specific (CPS) markers genotyped for the founders and the progenies, the inheritance of chromosome segments nested within two adjacent CPS markers was inferred through linkage. Genotyping the founders with additional high-density markers enabled the projection of genetic information, capturing linkage disequilibrium information, from founders to progenies. With 5000 genotypes, 30-79% of the simulated quantitative trait loci (QTL) were precisely identified. By integrating genetic design, natural diversity, and genomics technologies, this new complex trait dissection strategy should greatly facilitate endeavors to link molecular variation with phenotypic variation for various complex traits.

1,012 citations


Journal ArticleDOI
01 Jul 2008-Genetics
TL;DR: The correlation of r values between populations for the same marker pairs was close to 1 for pairs of very close markers and decreased with increasing marker distance and the extent of divergence between the populations, which indicates that genomic selection within cattle breeds with r2 ≥ 0.20 between adjacent markers would require ∼50,000 SNPs.
Abstract: When a genetic marker and a quantitative trait locus (QTL) are in linkage disequilibrium (LD) in one population, they may not be in LD in another population or their LD phase may be reversed. The objectives of this study were to compare the extent of LD and the persistence of LD phase across multiple cattle populations. LD measures r and r(2) were calculated for syntenic marker pairs using genomewide single-nucleotide polymorphisms (SNP) that were genotyped in Dutch and Australian Holstein-Friesian (HF) bulls, Australian Angus cattle, and New Zealand Friesian and Jersey cows. Average r(2) was approximately 0.35, 0.25, 0.22, 0.14, and 0.06 at marker distances 10, 20, 40, 100, and 1000 kb, respectively, which indicates that genomic selection within cattle breeds with r(2) >or= 0.20 between adjacent markers would require approximately 50,000 SNPs. The correlation of r values between populations for the same marker pairs was close to 1 for pairs of very close markers (<10 kb) and decreased with increasing marker distance and the extent of divergence between the populations. To find markers that are in LD with QTL across diverged breeds, such as HF, Jersey, and Angus, would require approximately 300,000 markers.

468 citations


Journal ArticleDOI
01 Jun 2008-Genetics
TL;DR: The history of studies on aneuploidy is reviewed, some of its major characteristics are summarized and speculations as to whether and how aneuPLoidy contributes to tumorigenesis are considered.
Abstract: A change in chromosome number that is not the exact multiple of the haploid karyotype is known as aneuploidy. This condition interferes with growth and development of an organism and is a common characteristic of solid tumors. Here, we review the history of studies on aneuploidy and summarize some of its major characteristics. We will then discuss the molecular basis for the defects caused by aneuploidy and end with speculations as to whether and how aneuploidy, despite its deleterious effects on organismal and cellular fitness, contributes to tumorigenesis.

384 citations


Journal ArticleDOI
01 Jan 2008-Genetics
TL;DR: It was concluded that genomic selection is considerably more accurate than traditional selection, especially for a low-heritability trait.
Abstract: Genomic selection uses total breeding values for juvenile animals, predicted from a large number of estimated marker haplotype effects across the whole genome. In this study the accuracy of predicting breeding values is compared for four different models including a large number of markers, at different marker densities for traits with heritabilities of 50 and 10%. The models estimated the effect of (1) each single-marker allele [single-nucleotide polymorphism (SNP)1], (2) haplotypes constructed from two adjacent marker alleles (SNP2), and (3) haplotypes constructed from 2 or 10 markers, including the covariance between haplotypes by combining linkage disequilibrium and linkage analysis (HAP_IBD2 and HAP_IBD10). Between 119 and 2343 polymorphic SNPs were simulated on a 3-M genome. For the trait with a heritability of 10%, the differences between models were small and none of them yielded the highest accuracies across all marker densities. For the trait with a heritability of 50%, the HAP_IBD10 model yielded the highest accuracies of estimated total breeding values for juvenile and phenotyped animals at all marker densities. It was concluded that genomic selection is considerably more accurate than traditional selection, especially for a low-heritability trait.

381 citations


Journal ArticleDOI
01 Sep 2008-Genetics
TL;DR: In this data set, within- family information was more accurate than across-family information, and populational linkage disequilibrium was not a completely accurate source of information for genetic evaluation, which questions some applications of genomic selection.
Abstract: Selection plans in plant and animal breeding are driven by genetic evaluation. Recent developments suggest using massive genetic marker information, known as “genomic selection.” There is little evidence of its performance, though. We empirically compared three strategies for selection: (1) use of pedigree and phenotypic information, (2) use of genomewide markers and phenotypic information, and (3) the combination of both. We analyzed four traits from a heterogeneous mouse population (http://gscan.well.ox.ac.uk/), including 1884 individuals and 10,946 SNP markers. We used linear mixed models, using extensions of association analysis. Cross-validation techniques were used, providing assumption-free estimates of predictive ability. Sampling of validation and training data sets was carried out across and within families, which allows comparing across- and within-family information. Use of genomewide genetic markers increased predictive ability up to 0.22 across families and up to 0.03 within families. The latter is not statistically significant. These values are roughly comparable to increases of up to 0.57 (across family) and 0.14 (within family) in accuracy of prediction of genetic value. In this data set, within-family information was more accurate than across-family information, and populational linkage disequilibrium was not a completely accurate source of information for genetic evaluation. This fact questions some applications of genomic selection.

379 citations


Journal ArticleDOI
01 Apr 2008-Genetics
TL;DR: It is shown that the statistical specification admits a standard mixed-effects linear model representation, with smoothing parameters treated as variance components, in reproducing kernel Hilbert spaces regression.
Abstract: Reproducing kernel Hilbert spaces regression procedures for prediction of total genetic value for quantitative traits, which make use of phenotypic and genomic data simultaneously, are discussed from a theoretical perspective. It is argued that a nonparametric treatment may be needed for capturing the multiple and complex interactions potentially arising in whole-genome models, i.e., those based on thousands of single-nucleotide polymorphism (SNP) markers. After a review of reproducing kernel Hilbert spaces regression, it is shown that the statistical specification admits a standard mixed-effects linear model representation, with smoothing parameters treated as variance components. Models for capturing different forms of interaction, e.g., chromosome-specific, are presented. Implementations can be carried out using software for likelihood-based or Bayesian inference.

363 citations


Journal ArticleDOI
01 Jan 2008-Genetics
TL;DR: The genetic basis of grain yield, heading date, and plant height was investigated in a durum wheat population of 249 recombinant inbred lines evaluated in 16 environments characterized by a broad range of water availability and GY.
Abstract: Grain yield is a major goal for the improvement of durum wheat, particularly in drought-prone areas. In this study, the genetic basis of grain yield (GY), heading date (HD), and plant height (PH) was investigated in a durum wheat population of 249 recombinant inbred lines evaluated in 16 environments (10 rainfed and 6 irrigated) characterized by a broad range of water availability and GY (from 5.6 to 58.8 q ha−1). Among the 16 quantitative trait loci (QTL) that affected GY, two major QTL on chromosomes 2BL and 3BS showed significant effects in 8 and 7 environments, with R2 values of 21.5 and 13.8% (mean data of all 16 environments), respectively. In both cases, extensive overlap was observed between the LOD profiles of GY and PH, but not with those for HD. QTL specific for PH were identified on chromosomes 1BS, 3AL, and 7AS. Additionally, three major QTL for HD on chromosomes 2AS, 2BL, and 7BS showed limited or no effects on GY. For both PH and GY, notable epistasis between the chromosome 2BL and 3BS QTL was detected across several environments.

354 citations


Journal ArticleDOI
01 Dec 2008-Genetics
TL;DR: The results of the complementation analysis and an evaluation of the resistance specificity of the transgenic lines to blast isolates demonstrated that Pikm-specific resistance is conferred by cooperation of Pikm1-TS and Pikm2-TS.
Abstract: The rice blast resistance gene Pikm was cloned by a map-based cloning strategy. High-resolution genetic mapping and sequencing of the gene region in the Pikm-containing cultivar Tsuyuake narrowed down the candidate region to a 131-kb genomic interval. Sequence analysis predicted two adjacently arranged resistance-like genes, Pikm1-TS and Pikm2-TS, within this candidate region. These genes encoded proteins with a nucleotide-binding site (NBS) and leucine-rich repeats (LRRs) and were considered the most probable candidates for Pikm. However, genetic complementation analysis of transgenic lines individually carrying these two genes negated the possibility that either Pikm1-TS or Pikm2-TS alone was Pikm. Instead, it was revealed that transgenic lines carrying both of these genes expressed blast resistance. The results of the complementation analysis and an evaluation of the resistance specificity of the transgenic lines to blast isolates demonstrated that Pikm-specific resistance is conferred by cooperation of Pikm1-TS and Pikm2-TS. Although these two genes are not homologous with each other, they both contain all the conserved motifs necessary for an NBS-LRR class gene to function independently as a resistance gene.

343 citations


Journal ArticleDOI
01 Jan 2008-Genetics
TL;DR: An improved method for the accurate calculation of mutation rates based on the fluctuation assay is described and a definition for the effective target size of genes (the probability that a mutation inactivates the gene) is proposed that acknowledges that the mutation rate is nonuniform across the genome.
Abstract: Although mutation rates are a key determinant of the rate of evolution they are difficult to measure precisely and global mutations rates (mutations per genome per generation) are often extrapolated from the per-base-pair mutation rate assuming that mutation rate is uniform across the genome. Using budding yeast, we describe an improved method for the accurate calculation of mutation rates based on the fluctuation assay. Our analysis suggests that the per-base-pair mutation rates at two genes differ significantly (3.80 × 10−10 at URA3 and 6.44 × 10−10 at CAN1) and we propose a definition for the effective target size of genes (the probability that a mutation inactivates the gene) that acknowledges that the mutation rate is nonuniform across the genome.

Journal ArticleDOI
01 Jun 2008-Genetics
TL;DR: Several Bayesian hierarchical models for mapping multiple QTL that simultaneously fit and estimate all possible genetic effects associated with all markers are proposed, including a Bayesian version of the popular least absolute shrinkage and selection operator (LASSO) model and the well-known Student's t model.
Abstract: The mapping of quantitative trait loci (QTL) is to identify molecular markers or genomic loci that influence the variation of complex traits. The problem is complicated by the facts that QTL data usually contain a large number of markers across the entire genome and most of them have little or no effect on the phenotype. In this article, we propose several Bayesian hierarchical models for mapping multiple QTL that simultaneously fit and estimate all possible genetic effects associated with all markers. The proposed models use prior distributions for the genetic effects that are scale mixtures of normal distributions with mean zero and variances distributed to give each effect a high probability of being near zero. We consider two types of priors for the variances, exponential and scaled inverse-χ2 distributions, which result in a Bayesian version of the popular least absolute shrinkage and selection operator (LASSO) model and the well-known Student's t model, respectively. Unlike most applications where fixed values are preset for hyperparameters in the priors, we treat all hyperparameters as unknowns and estimate them along with other parameters. Markov chain Monte Carlo (MCMC) algorithms are developed to simulate the parameters from the posteriors. The methods are illustrated using well-known barley data.

Journal ArticleDOI
01 Oct 2008-Genetics
TL;DR: The genetic redundancy suggests that the presence of duplicated copies of phyA genes accounts for the generation of photoperiod insensitivity, while protecting against the deleterious effects of mutation.
Abstract: Gene and genome duplications underlie the origins of evolutionary novelty in plants. Soybean, Glycine max, is considered to be a paleopolyploid species with a complex genome. We found multiple homologs of the phytochrome A gene (phyA) in the soybean genome and determined the DNA sequences of two paralogs designated GmphyA1 and GmphyA2. Analysis of the GmphyA2 gene from the lines carrying a recessive allele at a photoperiod insensitivity locus, E4, revealed that a Ty1/copia-like retrotransposon was inserted in exon 1 of the gene, which resulted in dysfunction of the gene. Mapping studies suggested that GmphyA2 is encoded by E4. The GmphyA1 gene was mapped to a region of linkage group O, which is homeologous to the region harboring E4 in linkage group I. Plants homozygous for the e4 allele were etiolated under continuous far red light, but the de-etiolation occurred partially, indicating that the mutation alone did not cause a complete loss of phyA function. The genetic redundancy suggests that the presence of duplicated copies of phyA genes accounts for the generation of photoperiod insensitivity, while protecting against the deleterious effects of mutation. Thus, this phenomenon provides a link between gene duplication and establishment of an adaptive response of plants to environments.

Journal ArticleDOI
01 May 2008-Genetics
TL;DR: It is shown that tests that rely on haplotype frequencies are the most powerful for detecting expansions on nonrecombining genomic regions and should not be used when recombination levels are unknown, so class I tests, particularly Tajima's D or R2, are recommended.
Abstract: Several tests have been proposed to detect departures of nucleotide variability patterns from neutral expectations. However, very different kinds of evolutionary processes, such as selective events or demographic changes, can produce similar deviations from these tests, thus making interpretation difficult when a significant departure of neutrality is detected. Here we study the effects of demography and recombination upon neutrality tests by analyzing their power under sudden population expansions, sudden contractions, and bottlenecks. We evaluate tests based on the frequency spectrum of mutations and the distribution of haplotypes and explore the consequences of using incorrect estimates of the rates of recombination when testing for neutrality. We show that tests that rely on haplotype frequencies-especially Fs and ZnS, which are based, respectively, on the number of different haplotypes and on the r2 values between all pairs of polymorphic sites-are the most powerful for detecting expansions on nonrecombining genomic regions. Nevertheless, they are strongly affected by misestimations of recombination, so they should not be used when recombination levels are unknown. Instead, class I tests, particularly Tajima's D or R2, are recommended.

Journal ArticleDOI
01 Mar 2008-Genetics
TL;DR: The mixed-model association-mapping approaches using a kinship matrix estimated by REML are more appropriate for association mapping than the recently proposed QK method with respect to (i) the adherence to the nominal α-level and (ii) the adjusted power for detection of quantitative trait loci.
Abstract: Association-mapping methods promise to overcome the limitations of linkage-mapping methods. The main objectives of this study were to (i) evaluate various methods for association mapping in the autogamous species wheat using an empirical data set, (ii) determine a marker-based kinship matrix using a restricted maximum-likelihood (REML) estimate of the probability of two alleles at the same locus being identical in state but not identical by descent, and (iii) compare the results of association-mapping approaches based on adjusted entry means (two-step approaches) with the results of approaches in which the phenotypic data analysis and the association analysis were performed in one step (one-step approaches). On the basis of the phenotypic and genotypic data of 303 soft winter wheat ( Triticum aestivum L.) inbreds, various association-mapping methods were evaluated. Spearman's rank correlation between P -values calculated on the basis of one- and two-stage association-mapping methods ranged from 0.63 to 0.93. The mixed-model association-mapping approaches using a kinship matrix estimated by REML are more appropriate for association mapping than the recently proposed QK method with respect to (i) the adherence to the nominal α-level and (ii) the adjusted power for detection of quantitative trait loci. Furthermore, we showed that our data set could be analyzed by using two-step approaches of the proposed association-mapping method without substantially increasing the empirical type I error rate in comparison to the corresponding one-step approaches.

Journal ArticleDOI
01 Feb 2008-Genetics
TL;DR: The identified genomewide quantitative trait loci can be applied in marker-assisted selection programs to improve the resistance of salmon to IPN and reduce disease-related mortality.
Abstract: Infectious pancreatic necrosis (IPN) is a viral disease currently presenting a major problem in the production of Atlantic salmon (Salmon salar) IPN can cause significant mortality to salmon fry within freshwater hatcheries and to smolts following transfer to seawater, although challenged populations show clear genetic variation in resistance To determine whether this genetic variation includes loci of major effect, a genomewide quantitative trait loci (QTL) scan was performed within 10 full-sib families that had received a natural seawater IPN challenge To utilize the large difference between Atlantic salmon male and female recombination rates, a two-stage mapping strategy was employed Initially, a sire-based QTL analysis was used to detect linkage groups with significant effects on IPN resistance, using two to three microsatellite markers per linkage group A dam-based analysis with additional markers was then used to confirm and position any detected QTL Two genomewide significant QTL and one suggestive QTL were detected in the genome scan The most significant QTL was mapped to linkage group 21 and was significant at the genomewide level in both the sire and the dam-based analyses The identified QTL can be applied in marker-assisted selection programs to improve the resistance of salmon to IPN and reduce disease-related mortality

Journal ArticleDOI
01 Mar 2008-Genetics
TL;DR: Results indicate that ATG10 is essential for ATG 12 conjugation and that the ATG12-ATG5 conjugate is necessary to form autophagic vesicles and for the timely progression of senescence and PCD in plants.
Abstract: Autophagy is an important intracellular recycling system in eukaryotes that utilizes small vesicles to traffic cytosolic proteins and organelles to the vacuole for breakdown. Vesicle formation requires the conjugation of the two ubiquitin-fold polypeptides ATG8 and ATG12 to phosphatidylethanolamine and the ATG5 protein, respectively. Using Arabidopsis thaliana mutants affecting the ATG5 target or the ATG7 E1 required to initiate ligation of both ATG8 and ATG12, we previously showed that the ATG8/12 conjugation pathways together are important when plants encounter nutrient stress and during senescence. To characterize the ATG12 conjugation pathway specifically, we characterized a null mutant eliminating the E2-conjugating enzyme ATG10 that, similar to plants missing ATG5 or ATG7, cannot form the ATG12-ATG5 conjugate. atg10-1 plants are hypersensitive to nitrogen and carbon starvation and initiate senescence and programmed cell death (PCD) more quickly than wild type, as indicated by elevated levels of senescence- and PCD-related mRNAs and proteins during carbon starvation. As detected with a GFP-ATG8a reporter, atg10-1 and atg5-1 mutant plants fail to accumulate autophagic bodies inside the vacuole. These results indicate that ATG10 is essential for ATG12 conjugation and that the ATG12-ATG5 conjugate is necessary to form autophagic vesicles and for the timely progression of senescence and PCD in plants.

Journal ArticleDOI
01 Aug 2008-Genetics
TL;DR: Homing endonuclease genes (HEGs) encode proteins that in the heterozygous state cause double-strand breaks in the homologous chromosome at the precise position opposite the HEG, which represents a powerful genetic drive mechanism that might be used as a tool in managing vector or pest populations.
Abstract: Homing endonuclease genes (HEGs) encode proteins that in the heterozygous state cause double-strand breaks in the homologous chromosome at the precise position opposite the HEG. If the double-strand break is repaired using the homologous chromosome, the HEG becomes homozygous, and this represents a powerful genetic drive mechanism that might be used as a tool in managing vector or pest populations. HEGs may be used to decrease population fitness to drive down population densities (possibly causing local extinction) or, in disease vectors, to knock out a gene required for pathogen transmission. The relative advantages of HEGs that target viability or fecundity, that are active in one sex or both, and whose target is expressed before or after homing are explored. The conditions under which escape mutants arise are also analyzed. A different strategy is to place HEGs on the Y chromosome that cause one or more breaks on the X chromosome and so disrupt sex ratio. This strategy can cause severe sex-ratio biases with efficiencies that depend on the details of sperm competition and zygote mortality. This strategy is probably less susceptible to escape mutants, especially when multiple X shredders are used.

Journal ArticleDOI
01 Jul 2008-Genetics
TL;DR: The sequencing of the 12 genomes of members of the genus Drosophila was taken as an opportunity to reevaluate the genetic and physical maps for 11 of the species, in part to aid in the mapping of assembled scaffolds.
Abstract: The sequencing of the 12 genomes of members of the genus Drosophila was taken as an opportunity to reevaluate the genetic and physical maps for 11 of the species, in part to aid in the mapping of assembled scaffolds Here, we present an overview of the importance of cytogenetic maps to Drosophila biology and to the concepts of chromosomal evolution Physical and genetic markers were used to anchor the genome assembly scaffolds to the polytene chromosomal maps for each species In addition, a computational approach was used to anchor smaller scaffolds on the basis of the analysis of syntenic blocks We present the chromosomal map data from each of the 11 sequenced non-Drosophila melanogaster species as a series of sections Each section reviews the history of the polytene chromosome maps for each species, presents the new polytene chromosome maps, and anchors the genomic scaffolds to the cytological maps using genetic and physical markers The mapping data agree with Muller's idea that the majority of Drosophila genes are syntenic Despite the conservation of genes within homologous chromosome arms across species, the karyotypes of these species have changed through the fusion of chromosomal arms followed by subsequent rearrangement events

Journal ArticleDOI
01 May 2008-Genetics
TL;DR: It is proposed that a new de novo protein-coding gene may have evolved from a previously expressed noncoding sequence in Saccharomyces cerevisiae, and the evidence suggests that BSC4 may be involved in the DNA repair pathway during the stationary phase of S. Cerevisiae and contribute to the robustness of S., when shifted to a nutrient-poor environment.
Abstract: Origination of new genes is an important mechanism generating genetic novelties during the evolution of an organism. Processes of creating new genes using preexisting genes as the raw materials are well characterized, such as exon shuffling, gene duplication, retroposition, gene fusion, and fission. However, the process of how a new gene is de novo created from noncoding sequence is largely unknown. On the basis of genome comparison among yeast species, we have identified a new de novo protein-coding gene, BSC4 in Saccharomyces cerevisiae. The BSC4 gene has an open reading frame (ORF) encoding a 132-amino-acid-long peptide, while there is no homologous ORF in all the sequenced genomes of other fungal species, including its closely related species such as S. paradoxus and S. mikatae. The functional protein-coding feature of the BSC4 gene in S. cerevisiae is supported by population genetics, expression, proteomics, and synthetic lethal data. The evidence suggests that BSC4 may be involved in the DNA repair pathway during the stationary phase of S. cerevisiae and contribute to the robustness of S. cerevisiae, when shifted to a nutrient-poor environment. Because the corresponding noncoding sequences in S. paradoxus, S. mikatae, and S. bayanus also transcribe, we propose that a new de novo protein-coding gene may have evolved from a previously expressed noncoding sequence.

Journal ArticleDOI
01 Mar 2008-Genetics
TL;DR: The results reject the hypotheses that regulation exists to prevent the waste of amino acids in useless protein or the detrimental activity of unnecessary proteins, as a single selective pressure likely acting across the entire E. coli transcriptome.
Abstract: Transcriptional regulatory networks allow bacteria to express proteins only when they are needed. Adaptive hypotheses explaining the evolution of regulatory networks assume that unneeded expression is costly and therefore decreases fitness, but the proximate cause of this cost is not clear. We show that the cost in fitness to Escherichia coli strains constitutively expressing the lactose operon when lactose is absent is associated with the process of making the lac gene products, i.e., associated with the acts of transcription and/or translation. These results reject the hypotheses that regulation exists to prevent the waste of amino acids in useless protein or the detrimental activity of unnecessary proteins. While the cost of the process of protein expression occurs in all of the environments that we tested, the expression of the lactose permease could be costly or beneficial, depending on the environment. Our results identify the basis of a single selective pressure likely acting across the entire E. coli transcriptome.

Journal ArticleDOI
01 Jul 2008-Genetics
TL;DR: The results provide estimated chromosomal evolution rates across this set of species on the basis of whole-genome synteny analysis, which are found to be higher than those previously reported.
Abstract: The availability of 12 complete genomes of various species of genus Drosophila provides a unique opportunity to analyze genome-scale chromosomal rearrangements among a group of closely related species. This article reports on the comparison of gene order between these 12 species and on the fixed rearrangement events that disrupt gene order. Three major themes are addressed: the conservation of syntenic blocks across species, the disruption of syntenic blocks (via chromosomal inversion events) and its relationship to the phylogenetic distribution of these species, and the rate of rearrangement events over evolutionary time. Comparison of syntenic blocks across this large genomic data set confirms that genetic elements are largely (95%) localized to the same Muller element across genus Drosophila species and paracentric inversions serve as the dominant mechanism for shuffling the order of genes along a chromosome. Gene-order scrambling between species is in accordance with the estimated evolutionary distances between them and we find it to approximate a linear process over time (linear to exponential with alternate divergence time estimates). We find the distribution of synteny segment sizes to be biased by a large number of small segments with comparatively fewer large segments. Our results provide estimated chromosomal evolution rates across this set of species on the basis of whole-genome synteny analysis, which are found to be higher than those previously reported. Identification of conserved syntenic blocks across these genomes suggests a large number of conserved blocks with varying levels of embryonic expression correlation in Drosophila melanogaster. On the other hand, an analysis of the disruption of syntenic blocks between species allowed the identification of fixed inversion breakpoints and estimates of breakpoint reuse and lineage-specific breakpoint event segregation.

Journal ArticleDOI
01 Jul 2008-Genetics
TL;DR: It was concluded that epistasis together with all levels of dominance from partial to overdominance is responsible for the expression of heterosis in rapeseed.
Abstract: The main objective in this research was the genetic analysis of heterosis in rapeseed at the QTL level. A linkage map comprising 235 SSR and 144 AFLP markers covering 2045 cM was constructed in a doubled-haploid population from a cross between the cultivar “Express” and the resynthesized line “R53.” In field experiments at four locations in Germany 250 doubled-haploid (DH) lines and their corresponding testcrosses with Express were evaluated for grain yield and three yield components. The heterosis ranged from 30% for grain yield to 0.7% for kernel weight. QTL were mapped using three different data sets, allowing the estimation of additive and dominance effects as well as digenic epistatic interactions. In total, 33 QTL were detected, of which 10 showed significant dominance effects. For grain yield, mainly complete dominance or overdominance was observed, whereas the other traits showed mainly partial dominance. A large number of epistatic interactions were detected. It was concluded that epistasis together with all levels of dominance from partial to overdominance is responsible for the expression of heterosis in rapeseed.

Journal ArticleDOI
01 Sep 2008-Genetics
TL;DR: This study reinforces the notion that linkage disequilibrium is low and decays to negligible levels within a few hundred base pairs in P. tremula.
Abstract: I have studied nucleotide polymorphism and linkage disequilibrium using multilocus data from 77 fragments, with an average length of fragments of 550 bp, in the deciduous tree Populus tremula (Salicaceae). The frequency spectrum across loci showed a modest excess of mutations segregating at low frequency and a marked excess of high-frequency derived mutations at silent sites, relative to neutral expectations. These excesses were also seen at replacement sites, but were not so pronounced for high-frequency derived mutations. There was a marked excess of low-frequency mutations at replacement sites, likely indicating deleterious amino acid-changing mutations that segregate at low frequencies in P. tremula. I used approximate Bayesian computation (ABC) to evaluate a number of different demographic scenarios and to estimate parameters for the best-fitting model. The data were found to be consistent with a historical reduction in the effective population size of P. tremula through a bottleneck. The timing inferred for this bottleneck is largely consistent with geological data and with data from several other long-lived plant species. The results show that P. tremula harbors substantial levels of nucleotide polymorphism with the posterior mode of the scaled mutation rate, theta = 0.0177 across loci. The ABC analyses also provided an estimate of the scaled recombination rate that indicates that recombination rates in P. tremula are likely to be 2-10 times higher than the mutation rate. This study reinforces the notion that linkage disequilibrium is low and decays to negligible levels within a few hundred base pairs in P. tremula.

Journal ArticleDOI
01 Nov 2008-Genetics
TL;DR: The results suggest that phenotypic traits exhibiting clear-cut molecular signatures of selection may represent a biased subset of all adaptive traits.
Abstract: We model selection at a locus affecting a quantitative trait (QTL) in the presence of genetic variance due to other loci. The dynamics at the QTL are related to the initial genotypic value and to the background genetic variance of the trait, assuming that background genetic values are normally distributed, under three different forms of selection on the trait. Approximate dynamics are derived under the assumption of small mutation effect. For similar strengths of selection on the trait (i.e, gradient of directional selection β) the way background variation affects the dynamics at the QTL critically depends on the shape of the fitness function. It generally causes the strength of selection on the QTL to decrease with time. The resulting neutral heterozygosity pattern resembles that of a selective sweep with a constant selection coefficient corresponding to the early conditions. The signature of selection may also be blurred by mutation and recombination in the later part of the sweep. We also study the race between the QTL and its genetic background toward a new optimum and find the conditions for a complete sweep. Overall, our results suggest that phenotypic traits exhibiting clear-cut molecular signatures of selection may represent a biased subset of all adaptive traits.

Journal ArticleDOI
01 Jan 2008-Genetics
TL;DR: This project directly enhances genetics education research by providing a valid and reliable instrument for assessing the Genetics Literacy Assessment Instrument, a 31-item multiple-choice test that addresses 17 concepts identified as central to genetics literacy.
Abstract: There is continued emphasis on increasing and improving genetics education for grades K-12, for medical professionals, and for the general public. Another critical audience is undergraduate students in introductory biology and genetics courses. To improve the learning of genetics, there is a need to first assess students' understanding of genetics concepts and their level of genetics literacy (i.e., genetics knowledge as it relates to, and affects, their lives). We have developed and evaluated a new instrument to assess the genetics literacy of undergraduate students taking introductory biology or genetics courses. The Genetics Literacy Assessment Instrument is a 31-item multiple-choice test that addresses 17 concepts identified as central to genetics literacy. The items were selected and modified on the basis of reviews by 25 genetics professionals and educators. The instrument underwent additional analysis in student focus groups and pilot testing. It has been evaluated using approximately 400 students in eight introductory nonmajor biology and genetics courses. The content validity, discriminant validity, internal reliability, and stability of the instrument have been considered. This project directly enhances genetics education research by providing a valid and reliable instrument for assessing the genetics literacy of undergraduate students.

Journal ArticleDOI
01 Dec 2008-Genetics
TL;DR: A theory of segregation distortion is developed and it is shown that the presence of only a few SDL can cause the entire chromosome to distort from Mendelian segregation.
Abstract: Segregation distortion is a phenomenon that has been observed in many experimental systems. How segregation distortion among markers arises and its impact on mapping studies are the focus of this work. Segregation distortion of markers can be considered to arise from segregation distortion loci (SDL). I develop a theory of segregation distortion and show that the presence of only a few SDL can cause the entire chromosome to distort from Mendelian segregation. Segregation distortion is detrimental to the power of detecting quantitative trait loci (QTL) with dominance effects, but it is not always a detriment to QTL mapping for additive effects. When segregation distortion of a locus is a random event, the SDL is beneficial to QTL mapping ∼44% of the time. If SDL are present and ignored, power loss can be substantial. A dense marker map can be used to ameliorate the situation, and if dense marker information is incorporated, power loss is minimal. However, other situations are less benign. A method that can simultaneously map QTL and SDL is discussed, maximizing both use of mapping resources and use by agricultural and evolutionary biologists.

Journal ArticleDOI
01 Sep 2008-Genetics
TL;DR: The reinterpreted model showed that the optimal lysis time is shorter for phage strains with a high adsorption rate and vice versa, and competition between high- and low-adsorption strains showed that, under current conditions and phenotype configurations, the adsor adaptation rate has a much larger impact on phage relative fitness than the lysisTime.
Abstract: The first step of bacteriophage (phage) infection is the attachment of the phage virion onto a susceptible host cell. This adsorption process is usually described by mass-action kinetics, which implicitly assume an equal influence of host density and adsorption rate on the adsorption process. Therefore, an environment with high host density can be considered as equivalent to a phage endowed with a high adsorption rate, and vice versa. On the basis of this assumption, the effect of adsorption rate on the evolution of phage optimal lysis time can be reinterpreted from previous optimality models on the evolution of optimal lysis time. That is, phage strains with a higher adsorption rate would have a shorter optimal lysis time and vice versa. Isogenic phage λ-strains with different combinations of six different lysis times (ranging from 29.3 to 68 min), two adsorption rates (9.9 × 10−9 and 1.3 × 10−9 phage−1 cell−1 ml−1 min−1), and two markers (resulting in “blue” or “white” plaques) were constructed. Various pairwise competitions among these strains were conducted to test the model prediction. As predicted by the reinterpreted model, the results showed that the optimal lysis time is shorter for phage strains with a high adsorption rate and vice versa. Competition between high- and low-adsorption strains also showed that, under current conditions and phenotype configurations, the adsorption rate has a much larger impact on phage relative fitness than the lysis time.

Journal ArticleDOI
01 May 2008-Genetics
TL;DR: A new method is developed for estimating effective population sizes, Ne, and selection coefficients, s, from time-series data of allele frequencies sampled from a single diallelic locus, based on calculating transition probabilities and assuming independent binomial sampling from this diffusion process at each time point.
Abstract: We develop a new method for estimating effective population sizes, Ne, and selection coefficients, s, from time-series data of allele frequencies sampled from a single diallelic locus. The method is based on calculating transition probabilities, using a numerical solution of the diffusion process, and assuming independent binomial sampling from this diffusion process at each time point. We apply the method in two example applications. First, we estimate selection coefficients acting on the CCR5-Δ32 mutation on the basis of published samples of contemporary and ancient human DNA. We show that the data are compatible with the assumption of s = 0, although moderate amounts of selection acting on this mutation cannot be excluded. In our second example, we estimate the selection coefficient acting on a mutation segregating in an experimental phage population. We show that the selection coefficient acting on this mutation is ∼0.43.

Journal ArticleDOI
01 Nov 2008-Genetics
TL;DR: A systematic pedigree-based analysis of heritable silencing processes resulting from initiation of interference targeted at the C. elegans oocyte maturation factor oma-1 reveals that silencing in the early generations is transmitted independently of the original targeted locus, in a manner indicative of a diffusible epigenetic element.
Abstract: Heritable silencing effects are gene suppression phenomena that can persist for generations after induction In the majority of RNAi experiments conducted in Caenorhabditis elegans, the silencing response results in a hypomorphic phenotype where the effects recede after the F1 generation F2 and subsequent generations revert to the original phenotype Specific examples of transgenerational RNAi in which effects persist to the F2 generation and beyond have been described In this study, we describe a systematic pedigree-based analysis of heritable silencing processes resulting from initiation of interference targeted at the C elegans oocyte maturation factor oma-1 Heritable silencing of oma-1 is a dose-dependent process where the inheritance of the silencing factor is unequally distributed among the population Heritability is not constant over generational time, with silenced populations appearing to undergo a bottleneck three to four generations following microinjection of RNA Transmission of silencing through these generations can be through either maternal or paternal gamete lines and is surprisingly more effective through the male gametic line Genetic linkage tests reveal that silencing in the early generations is transmitted independently of the original targeted locus, in a manner indicative of a diffusible epigenetic element