HmmUFOtu: An HMM and phylogenetic placement based ultra-fast taxonomic assignment and OTU picking tool for microbiome amplicon sequencing studies

Genome Biol. 2018 Jun 27;19(1):82. doi: 10.1186/s13059-018-1450-0.

Abstract

Culture-independent analysis of microbial communities frequently relies on amplification and sequencing of the prokaryotic 16S ribosomal RNA gene. Typical analysis pipelines group sequences into operational taxonomic units (OTUs) to infer taxonomic and phylogenetic relationships. Here, we present HmmUFOtu, a novel tool for processing microbiome amplicon sequencing data, which performs rapid per-read phylogenetic placement, followed by phylogenetically informed clustering into OTUs and taxonomy assignment. Compared to standard pipelines, HmmUFOtu more accurately and reliably recapitulates microbial community diversity and composition in simulated and real datasets without relying on heuristics or sacrificing speed or accuracy.

Keywords: 16S rRNA gene; DNA substitution models; Dirichlet models; FM-index; HMM profile alignment; Microbiome; Operational taxonomic unit; Phylogenetic placement; Taxonomic assignment.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Cluster Analysis
  • Computational Biology
  • High-Throughput Nucleotide Sequencing / methods
  • Microbiota / genetics*
  • Phylogeny
  • RNA, Ribosomal, 16S / genetics
  • Sequence Analysis, DNA / methods

Substances

  • RNA, Ribosomal, 16S