J Bioinform Comput Biol. 2009 Apr;7(2):373-88.

Finding non-coding RNAs through genome-scale clustering.

Author information

Department of Computer Science & Engineering, University of Washington, Seattle, WA 98195-2350, USA. lachesis@cs.washington.edu

Abstract

Non-coding RNAs (ncRNAs) are transcripts that do not code for proteins. Recent findings have shown that RNA-mediated regulatory mechanisms influence a substantial portion of typical microbial genomes. We present an efficient method for finding potential ncRNAs in bacteria by clustering genomic sequences based on homology inferred from both primary sequence and secondary structure. We evaluate our approach using a set of predominantly Firmicutes sequences. Our results show that, although homology search based on primary sequence alone was inaccurate for diverged ncRNA sequences, the motifs inferred through our clustering method recovered nearly all members of most known ncRNA families. Hence, our method shows promise for discovering new families of ncRNAs.
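
The general idea of grouping sequences by pairwise similarity can be illustrated with a small sketch. This is not the method used in the paper: it assumes a toy similarity measure (Jaccard overlap of k-mers from primary sequence only, with no secondary-structure term) and single-linkage clustering via union-find. The names `kmer_set`, `jaccard`, and `cluster_sequences`, and the values of `threshold` and `k`, are hypothetical choices made for illustration.

```python
from itertools import combinations


def kmer_set(seq, k=3):
    """Return the set of all length-k substrings of seq (toy sequence profile)."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def jaccard(a, b):
    """Jaccard similarity of two sets; 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cluster_sequences(seqs, threshold=0.5, k=3):
    """Single-linkage clustering: link any pair whose similarity >= threshold.

    Assumption: similarity is k-mer Jaccard overlap, a stand-in for a real
    homology score. Returns a sorted list of clusters of sequence indices.
    """
    parent = list(range(len(seqs)))

    def find(i):
        # Follow parent pointers to the root, compressing the path as we go.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    profiles = [kmer_set(s, k) for s in seqs]
    for i, j in combinations(range(len(seqs)), 2):
        if jaccard(profiles[i], profiles[j]) >= threshold:
            parent[find(i)] = find(j)  # merge the two clusters

    clusters = {}
    for i in range(len(seqs)):
        clusters.setdefault(find(i), []).append(i)
    return sorted(clusters.values())
```

Called on a list of RNA strings, `cluster_sequences` returns lists of indices into that list, one list per cluster. A realistic pipeline would replace the k-mer score with an alignment-based or structure-aware score, which this sketch does not attempt.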

PMID: 19340921
PMCID: PMC3417115
[Indexed for MEDLINE]
Free PMC Article
