(A) Sequences of genes (from annotated transcriptional start to poly-adenylation sites) were extracted from 15,050 full-length genes. Each gene sequence was divided into 20 equally sized bins. Because gene lengths differ, bin sizes differ from gene to gene. The x-axis lists these 20 bins 5′ to 3′. For each gene, the number of Mu insertions in each of the 20 bins was determined. Subsequently, the numbers of Mu insertions in each of the 20 bins and the lengths of each of the 20 bins were summed across the 15,050 genes. It was then possible to calculate the number of Mu insertions per Mb (y-axis) for each of the 20 bins. (B) 200-bp sequences around translation start sites (ATG, 200 bp left side and 200 bp right side) from each full-length gene were extracted and were divided into 20 bins, each of which was 20 bp in size. The x-axis lists these 20 bins 5′ to 3′. For each gene, the number of Mu insertions in each of the 20 bins was calculated. Subsequently, the numbers of Mu insertions in each of the 20 bins were summed across the 15,050 genes. The total summed length of each bin is 150,500 bp (20 bp bin length×15,050 genes). Using these data it was then possible to calculate the number of Mu insertions per Mb (y-axis) for each of the 20 bins. (C) 200-bp sequences around transcription start sites (TSS, 200 bp left side and 200 bp right side) from each full-length gene were extracted and were divided into 20 bins, each of which was 20 bp in size. The x-axis lists these 20 bins 5′ to 3′. For each gene, the number of Mu insertions in each of the 20 bins was calculated. Subsequently, the numbers of Mu insertions in each of the 20 bins were summed across the 15,050 genes. The total summed length of each bin is 150,500 bp (20 bp bin length×15,050 genes). Using these data it was then possible to calculate the number of Mu insertions per Mb (y-axis) for each of the 20 bins.