SCC: an accurate imputation method for scRNA-seq dropouts based on a mixture model

BMC Bioinformatics. 2021 Jan 6;22(1):5. doi: 10.1186/s12859-020-03878-8.

Abstract

Background: Single-cell RNA sequencing (scRNA-seq) enables the possibility of many in-depth transcriptomic analyses at a single-cell resolution. It's already widely used for exploring the dynamic development process of life, studying the gene regulation mechanism, and discovering new cell types. However, the low RNA capture rate, which cause highly sparse expression with dropout, makes it difficult to do downstream analyses.

Results: We propose a new method SCC to impute the dropouts of scRNA-seq data. Experiment results show that SCC gives competitive results compared to two existing methods while showing superiority in reducing the intra-class distance of cells and improving the clustering accuracy in both simulation and real data.

Conclusions: SCC is an effective tool to resolve the dropout noise in scRNA-seq data. The code is freely accessible at https://github.com/nwpuzhengyan/SCC .

Keywords: Dropouts identification; Gene expression estimation; Mixture model; Noise; ScRNA-seq.

MeSH terms

  • Gene Expression Profiling / methods*
  • Gene Expression Regulation / genetics
  • Genomics / methods
  • Models, Genetic
  • RNA, Small Cytoplasmic / genetics*
  • Single-Cell Analysis / methods*

Substances

  • RNA, Small Cytoplasmic