Prediction of transcription start sites based on feature selection using AMOSA.

Author information

Bioinformatics Division, TNLIST and Department of Automation, Tsinghua Univ., Beijing 100084, China.


Identifying transcription start sites (TSSs) is a primary and important step toward understanding the regulation of gene expression. To improve the accuracy of computational prediction, we focus on the most challenging task: locating TSSs to within 50 bp in non-CpG related promoter regions. Because non-CpG related promoters are diverse, a large number of features are extracted. Effective feature selection can reduce noise, improve prediction accuracy, and reveal biologically meaningful intrinsic properties. In this paper, a recently proposed optimization method based on multi-objective simulated annealing, Archive Multi-Objective Simulated Annealing (AMOSA), is integrated with Linear Discriminant Analysis (LDA) to yield a combined feature selection and classification system. The system performs comparably to, and often better than, several existing methods on a range of quantitative performance measures.
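The general idea of wrapping an annealing-based, multi-objective feature search around a linear discriminant classifier can be sketched as follows. This is an illustrative toy, not the AMOSA algorithm or the authors' pipeline: it uses a single annealing chain with a plain Pareto archive, a diagonal-covariance variant of LDA in place of full LDA so that it needs only the standard library, synthetic Gaussian data instead of promoter features, and two assumed objectives (hold-out error and fraction of features kept). All function names and parameter values are hypothetical.

```python
import math
import random


def make_data(rng, n=200, n_features=12, n_informative=3):
    """Synthetic two-class data; only the first n_informative features differ by class."""
    X, y = [], []
    for i in range(n):
        label = i % 2
        X.append([rng.gauss(1.5 * label if j < n_informative else 0.0, 1.0)
                  for j in range(n_features)])
        y.append(label)
    return X, y


def dlda_error(X, y, mask, train, test):
    """Hold-out error of diagonal LDA (pooled per-feature variance) on the masked features."""
    feats = [j for j, keep in enumerate(mask) if keep]
    if not feats:
        return 1.0  # assumed penalty for an empty feature set
    stats = []
    for j in feats:
        groups = {0: [], 1: []}
        for i in train:
            groups[y[i]].append(X[i][j])
        m0 = sum(groups[0]) / len(groups[0])
        m1 = sum(groups[1]) / len(groups[1])
        ss = sum((v - m0) ** 2 for v in groups[0]) + sum((v - m1) ** 2 for v in groups[1])
        var = ss / (len(train) - 2) or 1e-9
        stats.append((j, m0, m1, var))
    wrong = 0
    for i in test:
        # Assign to the class whose mean is closer in variance-scaled distance.
        d0 = sum((X[i][j] - m0) ** 2 / var for j, m0, m1, var in stats)
        d1 = sum((X[i][j] - m1) ** 2 / var for j, m0, m1, var in stats)
        wrong += (1 if d1 < d0 else 0) != y[i]
    return wrong / len(test)


def dominates(a, b):
    """True if a is no worse than b in every objective and better in one (minimisation)."""
    return all(p <= q for p, q in zip(a, b)) and any(p < q for p, q in zip(a, b))


def anneal(X, y, rng, steps=300, t_start=1.0, t_end=0.01):
    """Simulated annealing over feature masks, keeping an archive of non-dominated masks."""
    n_features = len(X[0])
    idx = list(range(len(X)))
    rng.shuffle(idx)
    train, test = idx[: len(idx) // 2], idx[len(idx) // 2:]

    def objectives(mask):
        return (dlda_error(X, y, mask, train, test), sum(mask) / n_features)

    current = tuple(rng.random() < 0.5 for _ in range(n_features))
    cur_obj = objectives(current)
    archive = {current: cur_obj}
    for step in range(steps):
        temp = t_start * (t_end / t_start) ** (step / (steps - 1))  # geometric cooling
        flip = rng.randrange(n_features)
        cand = tuple(not b if j == flip else b for j, b in enumerate(current))
        cand_obj = objectives(cand)
        if dominates(cur_obj, cand_obj):
            # Strictly worse move: accept with a temperature-dependent probability.
            worse_by = sum(c - o for c, o in zip(cand_obj, cur_obj))
            accept = rng.random() < math.exp(-worse_by / temp)
        else:
            accept = True
        if accept:
            current, cur_obj = cand, cand_obj
            if not any(dominates(o, cand_obj) for o in archive.values()):
                archive = {m: o for m, o in archive.items() if not dominates(cand_obj, o)}
                archive[cand] = cand_obj
    return archive


if __name__ == "__main__":
    rng = random.Random(0)
    X, y = make_data(rng)
    for mask, (err, frac) in sorted(anneal(X, y, rng).items(), key=lambda kv: kv[1]):
        kept = [j for j, keep in enumerate(mask) if keep]
        print(f"error={err:.3f} fraction_kept={frac:.2f} features={kept}")
```

Each archived mask is a trade-off between classification error and feature count; picking one point from that front is a separate decision that the sketch leaves open.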

[Indexed for MEDLINE]