BMC Bioinformatics. 2018 May 21;19(1):175. doi: 10.1186/s12859-018-2189-z.

SAMSA2: a standalone metatranscriptome analysis pipeline.

Author information

1 Genome Center, University of California, Davis, California, USA.
2 Department of Food Science and Technology, University of California, Davis, California, USA.
3 USDA ARS Western Nutrition Research Center, Davis, CA, USA.
4 Genome Center, University of California, Davis, California, USA. danielle.lemay@ars.usda.gov.
5 USDA ARS Western Nutrition Research Center, Davis, CA, USA. danielle.lemay@ars.usda.gov.

Abstract

BACKGROUND:

Complex microbial communities are an area of growing interest in biology. Metatranscriptomics allows researchers to quantify microbial gene expression in an environmental sample via high-throughput sequencing. Metatranscriptomic experiments are computationally intensive because they generate a large volume of sequence data and each sequence must be compared with reference sequences from thousands of organisms.

RESULTS:

SAMSA2 is an upgrade to the original Simple Annotation of Metatranscriptomes by Sequence Analysis (SAMSA) pipeline that has been redesigned for standalone use on a supercomputing cluster. SAMSA2 is faster due to the use of the DIAMOND aligner, and more flexible and reproducible because it uses local databases. SAMSA2 is available with detailed documentation, example input and output files, and example master scripts for full pipeline execution.

CONCLUSIONS:

SAMSA2 is a rapid and efficient metatranscriptome pipeline for analyzing large RNA-seq datasets in a supercomputing cluster environment. SAMSA2 provides simplified output that can be examined directly or used for further analyses, and its reference databases may be upgraded, altered or customized to fit the needs of any experiment.

KEYWORDS:

Annotation; Bacteria; Bioinformatics; Cluster; Functions; GALAXY; Metagenomics; Metatranscriptome; Metatranscriptomics; Microbiome; Open access; Pipeline; RNA-seq; SAMSA; Software; Tool

PMID: 29783945
PMCID: PMC5963165
DOI: 10.1186/s12859-018-2189-z
[Indexed for MEDLINE]
Free PMC Article
