Detecting inversions with PCA in the presence of population structure

PLoS One. 2020 Oct 29;15(10):e0240429. doi: 10.1371/journal.pone.0240429. eCollection 2020.

Abstract

Chromosomal inversions can lead to reproductive isolation and adaptation in insects such as Drosophila melanogaster and the non-model malaria vector Anopheles gambiae. Inversions can be detected and characterized using principal component analysis (PCA) of single nucleotide polymorphisms (SNPs). To aid in developing such methods, we formed a new benchmark derived from three publicly-available insect data. We then used this benchmark to perform an extended validation of our software for inversion analysis (Asaph). Through that process, we identified and characterized several problematic test cases liable to misinterpretation that can help guide PCA-based inversion detection. Lastly, we re-analyzed the 2R chromosome arm of 150 An. gambiae and coluzzii samples and observed two inversions (2Rc and 2Rd) that were previously known but not annotated in these particular individuals. The resulting benchmark data set and methods will be useful for future inversion detection based solely on SNP data.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Animals
  • Anopheles / genetics*
  • Chromosome Inversion
  • Chromosomes, Insect / genetics*
  • Computational Biology / methods*
  • Datasets as Topic
  • Drosophila melanogaster / genetics*
  • Polymorphism, Single Nucleotide
  • Principal Component Analysis
  • Software

Grants and funding

RJN was supported by the National Science Foundation under Grant No. 1947257.