Using the enrichment sequence to facilitate visual interpretation and in silico boundary mapping. The enrichment sequences for two different clusters from the same deep-sequenced final pool of Hfq Genomic SELEX are visualized here (see Sections 3.2 and 3.3 for description of the analysis). Each read from deep sequencing represents a full-length aptamer recovered from the experiment. (A) The positional enrichment depth can be summarized using an “enrichment sequence”, a sequence of numbers representing the number of high-throughput reads aligned to each base position. Here we see a visualization of one such sequence, represented as a histogram. The first 15 bases occur in four reads, the next two, seven reads, and so on. The escalating number of reads at each position can give clues as to where the consensus sequence lies. In this case, the highest plateau of enrichment in this cluster, underlined in red, contains an instance of a winning motif from the Hfq Genomic SELEX experiment, which was 5′-AAYAAYAA-3′ [23]. This can be seen as an in silico version of the boundary mapping technique, previously used to find the minimal aptamer required for binding [26]. (B) The visualization of read alignments from high-throughput sequencing can take up more space than a screen can hold and is often difficult to interpret. Incorporating the enrichment sequence histogram into a genome browser compacts the visualization, as shown for Cluster 1238 rendered here by GBROWSE [39]. Additionally, the 2D analysis gives better insight into the genomic location of the cluster. In this example, we see the ygcN gene, whose start codon is at the left end of the beige bar, a cluster, represented with a green arrow and its corresponding enrichment sequence, visualized with a histogram. In a regular 1D analysis, the underlying region of the cluster would be deemed to be primarily upstream of the open reading frame. However, by comparing the areas under the histogram within and upstream of the open reading frame, the mass of the enrichment is clearly favoring the open reading frame, and furthermore is quite sharply increasing just downstream of the start codon.