The GRB model, the evolutionary scenario to define RCEs and an example of an RCE. (A) A GRB is defined as a genomic region where a target gene (red) receives long-range regulatory inputs from an array of HCNEs (green ovals) that span the entire GRB and often intertwine with exons of unrelated bystander genes (orange). The regulatory elements need to stay in cis to their target gene to function, leading to the conservation of synteny between the target and its long-range regulatory inputs. In the evolutionary scenario illustrated, teleost WGD (red circle) and subsequent rediploidization (yellow fork) resulted in each gene being retained in a single functional copy. However, one exon fragment (blue dashed frame) that overlaps a regulatory element was retained in duplicate, with one copy remaining in conserved synteny with the target gene just like the HCNEs, and the other remaining as part of a functional gene elsewhere in the genome. We named such zebrafish exonic remnants and their vertebrate orthologs RCEs, and named the genes they are or were part of ‘RCE host genes’ (blue). (B) The PROX1 - RPS6KC1 locus. The prospero homeobox protein PROX1, which is essential for early development of the central nervous system (CNS), is an example of a 1-to-1 GRB orthology scenario. PROX1 has a bystander gene RPS6KC1 in the synteny block defined by PROX1 and the HCNEs spanning the locus. RPS6KC1 encodes a ribosomal protein kinase, which has no evidence for involvement in CNS development or for being tightly regulated in general. In this case, RPS6KC1, as the bystander gene, was lost in the zebrafish synteny block, leaving several human–zebrafish HCNEs in the gene desert created by its disappearance. Interestingly, three out of 15 exons were also kept as highly conserved remnants in the zebrafish (referred as RCE 9, 10, 11 in Supplementary Table S1).