scispace - formally typeset
Open AccessJournal ArticleDOI

SweeD: Likelihood-Based Detection of Selective Sweeps in Thousands of Genomes

Reads0
Chats0
TLDR
It is shown that an increase of sample size results in more precise detection of positive selection and the ability to analyze substantially larger sample sizes by using SweeD leads to more accurate sweep detection.
Abstract
The advent of modern DNA sequencing technology is the driving force in obtaining complete intra-specific genomes that can be used to detect loci that have been subject to positive selection in the recent past. Based on selective sweep theory, beneficial loci can be detected by examining the single nucleotide polymorphism patterns in intraspecific genome alignments. In the last decade, a plethora of algorithms for identifying selective sweeps have been developed. However, the majority of these algorithms have not been designed for analyzing whole-genome data. We present SweeD (Sweep Detector), an open-source tool for the rapid detection of selective sweeps in whole genomes. It analyzes site frequency spectra and represents a substantial extension of the widely used SweepFinder program. The sequential version of SweeD is up to 22 times faster than SweepFinder and, more importantly, is able to analyze thousands of sequences. We also provide a parallel implementation of SweeD for multi-core processors. Furthermore, we implemented a checkpointing mechanism that allows to deploy SweeD on cluster systems with queue execution time restrictions, as well as to resume long-running analyses after processor failures. In addition, the user can specify various demographic models via the command-line to calculate their theoretically expected site frequency spectra. Therefore, (in contrast to SweepFinder) the neutral site frequencies can optionally be directly calculated from a given demographic model. We show that an increase of sample size results in more precise detection of positive selection. Thus, the ability to analyze substantially larger sample sizes by using SweeD leads to more accurate sweep detection. We validate SweeD via simulations and by scanning the first chromosome from the 1000 human Genomes project for selective sweeps. We compare SweeD results with results from a linkage-disequilibrium-based approach and identify common outliers.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Identifying Potentially Beneficial Genetic Mutations Associated with Monophyletic Selective Sweep and a Proof-of-Concept Study with Viral Genetic Data

TL;DR: In this paper, the authors proposed a method to detect mutations at the single amino acid/nucleotide level that could be responsible for monophyletic selective sweep evolution, which can lead to discovery of previously unknown biological functions and mechanisms related to those mutations.
Posted ContentDOI

The Haplotype-resolved Autotetraploid Genome Assembly Provides Insights into the genomic evolution and fruit divergence in Wax apple (Syzygium samarangense (BI.) Merr.et Perry)

TL;DR: In this paper , a haplotype-resolved autotetraploid genome assembly of the wax apple with size of 1.59 Gb was presented, which revealed three rounds of wholegenome duplication (WGD) events, including two independent WGDs after WGT-γ.
Journal ArticleDOI

Signatures of selection in riverine buffalo populations revealed by genome-wide SNP data.

TL;DR: In this paper , the authors identify signatures of selection in five buffalo populations (Brazilian Murrah, Bulgarian Murrah and Indian Murrah), using Axiom Buffalo 90 K Genotyping Array data, and identify selection signatures in 374 genomic regions, spanning a total of 381 genes and 350 quantitative trait loci (QTLs).
Journal ArticleDOI

A missense mutation in RRM1 contributes to animal tameness

TL;DR: In this paper , the authors established a link between RRM1I241V and tameness, demonstrating that the complex behavioral change can be achieved by mutations under strong selection during animal domestication.
Posted ContentDOI

The evolutionary pathways for local adaptation in mountain hares

TL;DR: In this article, the authors identify genetic signatures of local adaptation in mountain hares (Lepus timidus) from isolated and distinctive habitats of its wide distribution: Ireland, the Alps and Fennoscandia.
References
More filters
Journal ArticleDOI

An integrated map of genetic variation from 1,092 human genomes

TL;DR: It is shown that evolutionary conservation and coding consequence are key determinants of the strength of purifying selection, that rare-variant load varies substantially across biological pathways, and that each individual contains hundreds of rare non-coding variants at conserved sites, such as motif-disrupting changes in transcription-factor-binding sites.
Book

Practical Methods of Optimization

TL;DR: The aim of this book is to provide a Discussion of Constrained Optimization and its Applications to Linear Programming and Other Optimization Problems.
Journal ArticleDOI

The hitch-hiking effect of a favourable gene.

TL;DR: If the selective coefficients at the linked locus are small compared to those at the substituted locus, it is shown that the probability of complete fixation at the links is approximately exp (− Nc), where c is the recombinant fraction and N the population size.
Journal ArticleDOI

Generating samples under a Wright-Fisher neutral model of genetic variation.

TL;DR: A Monte Carlo computer program is available to generate samples drawn from a population evolving according to a Wright-Fisher neutral model, and the samples produced can be used to investigate the sampling properties of any sample statistic under these neutral models.
Related Papers (5)