SweeD: Likelihood-Based Detection of Selective Sweeps in Thousands of Genomes
TLDR
It is shown that an increase of sample size results in more precise detection of positive selection and the ability to analyze substantially larger sample sizes by using SweeD leads to more accurate sweep detection.
Abstract:
The advent of modern DNA sequencing technology is the driving force in obtaining complete intraspecific genomes that can be used to detect loci that have been subject to positive selection in the recent past. Based on selective sweep theory, beneficial loci can be detected by examining the single nucleotide polymorphism patterns in intraspecific genome alignments. In the last decade, a plethora of algorithms for identifying selective sweeps have been developed. However, the majority of these algorithms have not been designed for analyzing whole-genome data. We present SweeD (Sweep Detector), an open-source tool for the rapid detection of selective sweeps in whole genomes. It analyzes site frequency spectra and represents a substantial extension of the widely used SweepFinder program. The sequential version of SweeD is up to 22 times faster than SweepFinder and, more importantly, is able to analyze thousands of sequences. We also provide a parallel implementation of SweeD for multi-core processors. Furthermore, we implemented a checkpointing mechanism that allows SweeD to be deployed on cluster systems with queue execution time restrictions, as well as long-running analyses to be resumed after processor failures. In addition, the user can specify various demographic models via the command line to calculate their theoretically expected site frequency spectra. Therefore, in contrast to SweepFinder, the neutral site frequencies can optionally be calculated directly from a given demographic model. We show that an increase of sample size results in more precise detection of positive selection. Thus, the ability to analyze substantially larger sample sizes by using SweeD leads to more accurate sweep detection. We validate SweeD via simulations and by scanning the first chromosome from the human 1000 Genomes Project for selective sweeps. We compare SweeD results with results from a linkage-disequilibrium-based approach and identify common outliers.
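As a rough illustration of the site frequency spectrum (SFS) mentioned in the abstract, the sketch below counts derived alleles per site in a small 0/1 haplotype matrix and compares the result with the relative SFS expected under the standard neutral model of constant population size, where the class with i derived alleles is proportional to 1/i. This is not SweeD's code: the function names, the toy data, and the 0/1 encoding (1 = derived allele) are assumptions made for the example, and the constant-size expectation is only the simplest case of the demographic models the abstract refers to.

```python
from collections import Counter


def site_frequency_spectrum(haplotypes):
    """Unfolded SFS: entry i-1 is the number of sites with i derived alleles.

    `haplotypes` is a list of equal-length 0/1 lists (hypothetical encoding,
    1 = derived allele). Sites with 0 or n derived alleles are not
    polymorphic in the sample and are left out.
    """
    n = len(haplotypes)
    num_sites = len(haplotypes[0])
    counts = Counter(sum(h[s] for h in haplotypes) for s in range(num_sites))
    return [counts.get(i, 0) for i in range(1, n)]


def expected_neutral_sfs(n):
    """Relative expected SFS for n sequences under the standard neutral model.

    Class i (i derived alleles) has weight 1/i; weights are normalized to sum to 1.
    """
    weights = [1.0 / i for i in range(1, n)]
    total = sum(weights)
    return [w / total for w in weights]


# Toy sample: 4 sequences, 5 sites (made-up data for illustration only).
haplotypes = [
    [0, 1, 0, 0, 1],
    [0, 1, 1, 0, 1],
    [1, 1, 0, 0, 0],
    [0, 1, 0, 0, 0],
]

observed = site_frequency_spectrum(haplotypes)
expected = expected_neutral_sfs(len(haplotypes))
print("observed SFS:", observed)
print("expected neutral proportions:", expected)
```

A region whose observed spectrum departs strongly from the neutral expectation, for example through an excess of low- and high-frequency derived variants, is the kind of pattern that likelihood-based sweep scans look for.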
Citations
Journal Article
ImaGene: a convolutional neural network to quantify natural selection from genomic data
Luis Torada, Lucrezia Lorenzon, Alice E. Beddis, Ulas Isildak, Linda Pattini, Sara Mathieson, Matteo Fumagalli
TL;DR: Deep learning in evolutionary genomics is explored and its potential to detect informative patterns from large-scale genomic data is demonstrated, including methods to process genomic data for deep learning in a user-friendly program called ImaGene.
Journal Article
Modes of Rapid Polygenic Adaptation
Kavita Jain, Wolfgang Stephan
TL;DR: This paper showed that polygenic adaptation may be a rapid process and can proceed via subtle or dramatic changes in the allele frequency depending on the sizes of the phenotypic effects relative to a threshold value.
Journal Article
The population genomics of rapid adaptation: disentangling signatures of selection and demography in white sands lizards
Stefan Laurent, Susanne P. Pfeifer, Matthew L. Settles, Samuel S. Hunter, Kayla M. Hardwick, Louise Ormond, Vitor C. Sousa, Jeffrey D. Jensen, Erica Bree Rosenblum
TL;DR: A number of similarities are identified between the two focal species, including strong evidence of selection in the blanched populations in the Mc1r region, and important differences between the species are found, suggesting different colonization times, different genetic architecture underlying the blanched phenotype and different ages of the beneficial alleles.
Journal Article
Detecting signatures of positive selection in non-model species using genomic data
Hannah Weigand, Florian Leese
Journal Article
Selective sweeps on novel and introgressed variation shape mimicry loci in a butterfly adaptive radiation.
Markus Moest, Steven M. Van Belleghem, Jennifer E. James, Camilo Salazar, Simon H. Martin, Sarah L. Barker, Gilson R. P. Moreira, Claire Mérot, Mathieu Joron, Nicola J. Nadeau, Florian M. Steiner, Chris D. Jiggins
TL;DR: Analysis of high-coverage genome sequence data from 4 major colour pattern loci sampled from nearly 600 individuals in 53 populations reveals a surprisingly dynamic history of colour pattern selection and co-evolution in this adaptive radiation of Heliconius butterflies.
References
Journal Article
An integrated map of genetic variation from 1,092 human genomes
Gonçalo R. Abecasis, Adam Auton, Lisa D. Brooks, Mark A. DePristo, Richard Durbin, Robert E. Handsaker, Hyun Min Kang, Gabor T. Marth, Gil McVean, et al.
TL;DR: It is shown that evolutionary conservation and coding consequence are key determinants of the strength of purifying selection, that rare-variant load varies substantially across biological pathways, and that each individual contains hundreds of rare non-coding variants at conserved sites, such as motif-disrupting changes in transcription-factor-binding sites.
Book
Practical Methods of Optimization
TL;DR: The aim of this book is to provide a discussion of constrained optimization and its applications to linear programming and other optimization problems.
Journal Article
The hitch-hiking effect of a favourable gene.
John Maynard Smith, John Haigh
TL;DR: If the selective coefficients at the linked locus are small compared to those at the substituted locus, it is shown that the probability of complete fixation at the linked locus is approximately exp(−Nc), where c is the recombinant fraction and N the population size.
Journal Article
Generating samples under a Wright-Fisher neutral model of genetic variation.
TL;DR: A Monte Carlo computer program is available to generate samples drawn from a population evolving according to a Wright-Fisher neutral model, and the samples produced can be used to investigate the sampling properties of any sample statistic under these neutral models.