    Bioinformatics. 2012 Mar 1;28(5):614-8. doi: 10.1093/bioinformatics/bts014. Epub 2012 Jan 11.

    PHACTS, a computational approach to classifying the lifestyle of phages.

    Source

    Computational Science Research Center, Department of Mathematics and Statistics, San Diego State University, San Diego, CA 92182, USA. katelyn@rohan.sdsu.edu

    Abstract

    MOTIVATION:

    Bacteriophages have two distinct lifestyles: virulent and temperate. The virulent lifestyle has many implications for phage therapy, genomics and microbiology. The lifestyle of a newly sequenced phage is currently determined using standard culturing techniques. Such laboratory work is not only costly and time-consuming, but also cannot be applied to phage genomes constructed from environmental sequencing. Therefore, a computational method that utilizes the sequence data of phage genomes is needed.

    RESULTS:

    Phage Classification Tool Set (PHACTS) utilizes a novel similarity algorithm and a supervised Random Forest classifier to predict whether the lifestyle of a phage, described by its proteome, is virulent or temperate. The similarity algorithm creates a training set from phages with known lifestyles; this training set, together with the lifestyle annotations, is used to train a Random Forest that classifies the lifestyle of a phage. PHACTS predictions are shown to have a 99% precision rate.
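    To make the idea of classifying a phage from similarity scores concrete, the following is a minimal sketch in Python. It is not the PHACTS implementation (which uses PERL, FASTA and R): it assumes each proteome has already been reduced to a vector of similarity scores against a few reference proteins, and it swaps the full Random Forest for a much simpler ensemble of one-feature decision stumps trained on bootstrap samples and combined by majority vote. The toy data and the names `fit_stump`, `fit_forest` and `predict` are hypothetical.

```python
import random
from collections import Counter

# Hypothetical toy data: each row holds the similarity scores of one phage
# proteome against three reference proteins; labels are the known lifestyles.
TRAIN_X = [
    [0.9, 0.1, 0.2],
    [0.8, 0.2, 0.1],
    [0.7, 0.0, 0.3],
    [0.1, 0.9, 0.8],
    [0.2, 0.8, 0.7],
    [0.0, 0.7, 0.9],
]
TRAIN_Y = ["virulent"] * 3 + ["temperate"] * 3


def fit_stump(rows, labels, feature):
    """Pick the threshold and orientation on one feature with the fewest errors."""
    values = sorted({row[feature] for row in rows})
    # Candidate thresholds are midpoints between consecutive observed values.
    thresholds = [(a + b) / 2 for a, b in zip(values, values[1:])] or values
    best = None
    for threshold in thresholds:
        for low, high in (("virulent", "temperate"), ("temperate", "virulent")):
            errors = sum(
                (low if row[feature] <= threshold else high) != label
                for row, label in zip(rows, labels)
            )
            if best is None or errors < best[0]:
                best = (errors, feature, threshold, low, high)
    return best[1:]


def fit_forest(rows, labels, n_trees=25, seed=0):
    """Train stumps on bootstrap samples, each using one randomly chosen feature."""
    rng = random.Random(seed)
    forest = []
    for _ in range(n_trees):
        # Bootstrap sample; redraw if it happens to contain only one class
        # (a simplification so that every stump sees both lifestyles).
        while True:
            idx = [rng.randrange(len(rows)) for _ in rows]
            if len({labels[i] for i in idx}) == 2:
                break
        feature = rng.randrange(len(rows[0]))
        sample_rows = [rows[i] for i in idx]
        sample_labels = [labels[i] for i in idx]
        forest.append(fit_stump(sample_rows, sample_labels, feature))
    return forest


def predict(forest, row):
    """Return the majority-vote lifestyle and the fraction of stumps agreeing."""
    votes = Counter(
        low if row[feature] <= threshold else high
        for feature, threshold, low, high in forest
    )
    label, count = votes.most_common(1)[0]
    return label, count / len(forest)


if __name__ == "__main__":
    forest = fit_forest(TRAIN_X, TRAIN_Y)
    print(predict(forest, [0.85, 0.1, 0.15]))
    print(predict(forest, [0.1, 0.85, 0.8]))
```

    The fraction of agreeing votes returned by `predict` gives a rough sense of how confident the ensemble is in a call; a real Random Forest grows full decision trees and samples candidate features at every split rather than once per tree.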

    AVAILABILITY AND IMPLEMENTATION:

    PHACTS was implemented in the PERL programming language and utilizes the FASTA program (Pearson and Lipman, 1988) and the R programming language library 'Random Forest' (Liaw and Wiener, 2010). The PHACTS software is open source and is available as a downloadable stand-alone version or can be accessed online through a user-friendly web interface. The source code, help files and online version are available at http://www.phantome.org/PHACTS/.

    SUPPLEMENTARY INFORMATION:

    Supplementary data are available at Bioinformatics online.

    PMID: 22238260 [PubMed - indexed for MEDLINE]
    PMCID: PMC3289917 (Free PMC Article)
