Identification of Intragenomic Loci with Extended Similarity to MIRNA Genes.
(A) Detection of MIRNA-related loci in the A. thaliana and A. lyrata genomes. The expected values for the top four FASTA alignments between each MIRNA foldback and the respective genome sequence are plotted. A. thaliana MIRNAs aligned to the A. thaliana genome are plotted as red dots. A. lyrata MIRNAs aligned to the A. lyrata genome are plotted as black dots. MIRNA are grouped on the x axis based on conservation. The horizontal gray line marks the false discovery rate (FDR) = 0.05 boundary, where points above the line have a FDR < 0.05.
(B) Proportion of A. thaliana and A. lyrata MIRNA families conserved with both Arabidopsis species and P. trichocarpa, conserved between A. thaliana and A. lyrata but not with P. trichocarpa, or MIRNAs unique to either A. thaliana or A. lyrata (inner pie chart). The percentage of MIRNA families with evidence of foldback origination from another locus (see [A]) in each conservation group is shown in the outer ring.
(C) MIRNA formation by inverted duplication. A generic model (top) and three example MIRNA loci that were formed by inverted duplication events.
(D) MIRNA formation by direct duplication of a locus containing a previous intralocus inverted duplication. A generic model (top) and three examples from A. lyrata are shown. In (C) and (D), the MIRNA-related genomic locus is on top and the MIRNA genomic locus is on the bottom of each diagram. Direct and inverted duplications are shown by gray and blue shadings, respectively, connecting the MIRNA-related locus to the MIRNA locus. Annotated features are shown as highlighted boxes in each region (genes = yellow, exons = blue, transposable elements = green, and MIRNA = red boxes). Arrows indicate the duplicated segments in each locus. In (D), sequences produced by inverted intralocus duplication in the MIRNA-related loci are connected by black boxes.
(E) Proportion of A. thaliana and A. lyrata MIRNA families that have detectable MIRNA-related loci in their respective genomes (inner pie chart). The proportion of MIRNA-related loci in each class (inverted or direct duplication) forming transcripts that are predicted or validated targets of the miRNA formed from the duplication event (outer ring).
(F) Proportion of MIRNA-related loci that are exons, introns, intergenic sequences, or MITEs (inner pie chart). Proportion of MIRNA-related loci forming transcripts that are predicted or validated targets of the miRNA formed from the duplication event (outer ring).