TCR alpha chain rearrangement distribution inferred from sequence data taken from . (a) The log-likelihood of the data given the model saturates as a function of the number of iterations of the Expectation–Maximization algorithm. (b) Shannon entropy of rearrangements (top row) and sequences (middle row). The sequence entropy is lower than the total recombination entropy because of convergent rearrangements. The rearrangement entropy is the sum of entropies of its elementary events (bottom row). (c) Distribution of the number of inserted nucleotides (solid curve). For comparison, the same distribution obtained by the MiXCR software is represented by a dashed line. (d) Distributions of the number of deletions for both V and J genes, averaged over genes. (e) Joint distribution for V and J usage, P(V, J). Genes are ordered by position along the genome. (f) The covariance clearly shows strong correlations for genes that are either close to the separation between the V and J segments, or far from it.
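
As a rough illustration of the quantities in panels (b), (e) and (f), the sketch below computes a Shannon entropy and a V-J usage covariance from a joint distribution P(V, J). The covariance formula used by the authors is not preserved in this caption, so the definition P(V, J) - P(V)P(J) is an assumption, as are the function names and the toy 2x2 distribution, which is not data from the paper.

```python
import math


def shannon_entropy(probs):
    """Shannon entropy in bits of an iterable of probabilities (zeros are skipped)."""
    return -sum(p * math.log2(p) for p in probs if p > 0)


def marginals(joint):
    """Return (P(V), P(J)) from a joint table indexed as joint[v][j]."""
    p_v = [sum(row) for row in joint]
    p_j = [sum(col) for col in zip(*joint)]
    return p_v, p_j


def usage_covariance(joint):
    """Assumed definition: C(V, J) = P(V, J) - P(V) * P(J)."""
    p_v, p_j = marginals(joint)
    return [
        [joint[v][j] - p_v[v] * p_j[j] for j in range(len(p_j))]
        for v in range(len(p_v))
    ]


# Hypothetical toy joint distribution over 2 V genes and 2 J genes.
toy_joint = [
    [0.4, 0.1],
    [0.1, 0.4],
]

p_v, p_j = marginals(toy_joint)
cov = usage_covariance(toy_joint)

# Entropy of the joint distribution versus the sum of marginal entropies;
# the two agree only when V and J usage are independent.
h_joint = shannon_entropy(p for row in toy_joint for p in row)
h_indep = shannon_entropy(p_v) + shannon_entropy(p_j)

print(cov)
print(h_joint, h_indep)
```

With this definition, every row and column of the covariance table sums to zero, and nonzero entries indicate V-J pairs that are used together more or less often than independent gene choice would predict.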