Distribution between gene locations and abundance of annotated tags. To annotate the sequenced tags, we created a database of all possible 17-bp tag sequences next to an NlaIII site from the Ensembl transcriptome database. (a) The bars show the proportion of the total number of tags by location in gene regions. The canonical location refers to the tag originating from the most 3′ NlaIII site in a gene, to which 41% mapped. Overall, 22% of the tags were mapped to exons, which may represent transcript isoforms that are not listed in Ensembl. In all, 3.4% of the tags, which presumably originate from unprocessed pre-mRNA transcripts, mis-spliced transcripts, or unannotated exons, were annotated to intronic gene sequences. Overall, 3.6% of the sequenced tags mapped in an antisense orientation to gene regions in the Ensembl database. The remaining tags had multiple annotations (23%) or were not found in our database of possible tag sequences (7%). No significant difference in the distribution between gene locations was observed for the annotated tags between the ALL subtypes (data not shown). (b) The bars show the proportion of annotated tags at different bins of expression levels. The expression levels are in tags per million (TPM) on a log2- transformed scale on the horizontal axis. Black bars indicate tags annotated to genes in the sense direction and light gray bars indicate tags annotated to genes in the antisense direction.