Format

Send to

Choose Destination

Prediction of transcription start sites based on feature selection using AMOSA.

Author information

1
Bioinformatics Division, TNLIST and Department of Automation, Tsinghua Univ., Beijing 100084, China.

Abstract

To understand the regulation of the gene expression, the identification of transcription start sites (TSSs) is a primary and important step. With the aim to improve the computational prediction accuracy, we focus on the most challenging task, i.e., to identify the TSSs within 50 bp in non-CpG related promoter regions. Due to the diversity of non-CpG related promoters, a large number of features are extracted. Effective feature selection can minimize the noise, improve the prediction accuracy, and also to discover biologically meaningful intrinsic properties. In this paper, a newly proposed multi-objective simulated annealing based optimization method, Archive Multi-Objective Simulated Annealing (AMOSA), is integrated with Linear Discriminant Analysis (LDA) to yield a combined feature selection and classification system. This system is found to be comparable to, often better than, several existing methods in terms of different quantitative performance measures.

PMID:
17951823
[Indexed for MEDLINE]
Free full text

Supplemental Content

Full text links

Icon for Life Sciences Society
Loading ...
Support Center