Format

Send to

Choose Destination
Genome Inform. 2007;19:73-82.

Recognition of polyadenylation sites from Arabidopsis genomic sequences.

Author information

1
School of Computing, National University of Singapore, COM1, Law Link, Singapore 117590. kohchuan@comp.nus.edu.sg

Abstract

A polyadenine tail is found at the 3' end of nearly every fully processed eukaryotic mRNA and has been suggested to influence virtually all aspects of mRNA metabolism. The ability to predict polyadenylation site will allow us to define gene boundaries, predict number of genes present in a particular gene locus and perhaps better understand mRNA metabolism. To this end, we built an arabidopsis polyadenylation prediction model. The prediction model uses a machine learning method which consists of four sequential steps: feature generation, feature selection, feature integration and cascade classifier. We have tested our model on public datasets and achieved more than 97% sensitivity and specificity. We have also directly compared with another arabidopsis prediction model, PASS 1.0, and have achieved better results.

PMID:
18546506
[Indexed for MEDLINE]

Supplemental Content

Loading ...
Support Center