diploS/HIC: An Updated Approach to Classifying Selective Sweeps

G3 (Bethesda). 2018 May 31;8(6):1959-1970. doi: 10.1534/g3.118.200262.

Abstract

Identifying selective sweeps in populations that have complex demographic histories remains a difficult problem in population genetics. We previously introduced a supervised machine learning approach, S/HIC, for finding both hard and soft selective sweeps in genomes on the basis of patterns of genetic variation surrounding a window of the genome. While S/HIC was shown to be both powerful and precise, the utility of S/HIC was limited by the use of phased genomic data as input. In this report we describe a deep learning variant of our method, diploS/HIC, that uses unphased genotypes to accurately classify genomic windows. diploS/HIC is shown to be quite powerful even at moderate to small sample sizes.

Keywords: Adaptation; Deep learning; Machine Learning; Selective Sweeps; and Population genetics.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Animals
  • Anopheles / genetics
  • Base Sequence
  • Chromosomes / genetics
  • Genome, Insect
  • Genotype
  • Image Processing, Computer-Assisted
  • Machine Learning*
  • Neural Networks, Computer
  • ROC Curve
  • Recombination, Genetic / genetics
  • Selection, Genetic*