The Unreasonable Effectiveness of Convolutional Neural Networks in Population Genetic Inference.

doi:10.1093/MOLBEV/MSY224

Open AccessJournal ArticleDOI

The Unreasonable Effectiveness of Convolutional Neural Networks in Population Genetic Inference.

Lex E. Flagel, +3 more

- 01 Feb 2019 -

Molecular Biology and Evolution

- Vol. 36, Iss: 2, pp 220-238

Chats0

TLDR

CNNs are capable of outperforming expert-derived statistical methods and offer a new path forward in cases where no likelihood approach exists, and are shown to perform accurate evolutionary model selection and parameter estimation, even on problems that have not received detailed theoretical treatments.

Abstract:

Population-scale genomic data sets have given researchers incredible amounts of information from which to infer evolutionary histories. Concomitant with this flood of data, theoretical and methodological advances have sought to extract information from genomic sequences to infer demographic events such as population size changes and gene flow among closely related populations/species, construct recombination maps, and uncover loci underlying recent adaptation. To date, most methods make use of only one or a few summaries of the input sequences and therefore ignore potentially useful information encoded in the data. The most sophisticated of these approaches involve likelihood calculations, which require theoretical advances for each new problem, and often focus on a single aspect of the data (e.g., only allele frequency information) in the interest of mathematical and computational tractability. Directly interrogating the entirety of the input sequence data in a likelihood-free manner would thus offer a fruitful alternative. Here, we accomplish this by representing DNA sequence alignments as images and using a class of deep learning methods called convolutional neural networks (CNNs) to make population genetic inferences from these images. We apply CNNs to a number of evolutionary questions and find that they frequently match or exceed the accuracy of current methods. Importantly, we show that CNNs perform accurate evolutionary model selection and parameter estimation, even on problems that have not received detailed theoretical treatments. Thus, when applied to population genetic alignments, CNNs are capable of outperforming expert-derived statistical methods and offer a new path forward in cases where no likelihood approach exists.

Citations

PDF

Open Access

More filters

Journal Article

Statistical method for testing the neutral mutation hypothesis by DNA polymorphism.

Fumio Tajima

- 30 Oct 1989 -

Genomics

TL;DR: It is suggested that the natural selection against large insertion/deletion is so weak that a large amount of variation is maintained in a population.

...read moreread less

Artificial neural networks

Andrea Roli

TL;DR: Artificial neural networks (ANNs) constitute a class of flexible nonlinear models designed to mimic biological neural systems as mentioned in this paper, and they have been widely used in computer vision applications.

...read moreread less

Journal ArticleDOI

pixy: Unbiased estimation of nucleotide diversity and divergence in the presence of missing data.

Katharine L Korunes, +1 more

- 01 May 2021 -

Molecular Ecology Resources

TL;DR: Pixy as mentioned in this paper is a UNIX command line utility that generates unbiased estimates of π and dXY in the face of missing data, regardless of the form or amount of data.

...read moreread less

Journal ArticleDOI

Inference and analysis of population-specific fine-scale recombination maps across 26 diverse human populations.

Jeffrey P. Spence, +1 more

- 01 Oct 2019 -

Science Advances

TL;DR: Differences in the recombination landscape across the genome and between populations are driven by variation in the gene that encodes the DNA binding protein PRDM9, and a demography-aware method is developed and applied to 26 diverse human populations, inferring population-specific recombination maps.

...read moreread less

Journal ArticleDOI

Adaptive Introgression: An Untapped Evolutionary Mechanism for Crop Adaptation.

Concetta Burgarella, +11 more

- 01 Feb 2019 -

Frontiers in Plant Science

TL;DR: It is argued that screening the wild introgression already existing in the cultivated gene pool may be an effective strategy for uncovering wild diversity relevant for crop adaptation to current environmental changes and for informing new breeding directions.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings Article

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.

...read moreread less

Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

...read moreread less

Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.

...read moreread less

Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.

...read moreread less

Journal ArticleDOI

Deep learning

Yann LeCun, +4 more

- 28 May 2015 -

Nature

TL;DR: Deep learning is making major advances in solving problems that have resisted the best attempts of the artificial intelligence community for many years, and will have many more successes in the near future because it requires very little engineering by hand and can easily take advantage of increases in the amount of available computation and data.

...read moreread less