Format

Send to

Choose Destination
Methods Mol Biol. 2011;722:141-55. doi: 10.1007/978-1-61779-040-9_10.

A bioinformatics pipeline for sequence-based analyses of fungal biodiversity.

Author information

1
Institute of Arctic Biology, University of Alaska, Fairbanks, AK, USA. ltaylor@iab.alaska.edu

Abstract

The internal transcribed spacer (ITS) is the locus of choice with which to characterize fungal diversity in environmental samples. However, methods to analyze ITS datasets have lagged behind the capacity to generate large amounts of sequence information. Here, we describe our bioinformatics pipeline to process large fungal ITS sequence datasets, from raw chromatograms to a spreadsheet of operational taxonomic unit (OTU) abundances across samples. Steps include assembling of reads originating from one clone, identifying primer "barcodes" or "tags," trimming vectors and primers, marking low-quality base calls and removing low-quality sequences, orienting sequences, extracting the ITS region from longer amplicons, and grouping sequences into OTUs. We expect that the principles and tools presented here are relevant to datasets arising from ever-evolving new technologies.

PMID:
21590418
DOI:
10.1007/978-1-61779-040-9_10
[Indexed for MEDLINE]

Supplemental Content

Full text links

Icon for Springer
Loading ...
Support Center