PLoS Genet. 2015 Apr 15;11(4):e1005147. doi: 10.1371/journal.pgen.1005147. eCollection 2015 Apr.

Systematic dissection of the sequence determinants of gene 3' end mediated expression control.

Author information

1. Department of Computer Science and Applied Mathematics, The Weizmann Institute of Science, Rehovot, Israel; Department of Molecular Cell Biology, The Weizmann Institute of Science, Rehovot, Israel.
2. Department of Computer Science, Technion, Haifa, Israel; Agilent Laboratories, Tel Aviv, Israel.

Abstract

The 3' end genomic region encodes a wide range of regulatory processes, including mRNA stability, 3' end processing and translation. Here, we systematically investigate the sequence determinants of 3' end mediated expression control by measuring the effect of 13,000 designed 3' end sequence variants on constitutive expression levels in yeast. By including a high-resolution scanning mutagenesis of more than 200 native 3' end sequences in this designed set, we found that most mutations had only a mild effect on expression, and that the vast majority (~90%) of mutations with a strong effect localized to a single positive TA-rich element, similar to a previously described 3' end processing efficiency element, and resulted in up to a ten-fold decrease in expression. Measurements of 3' UTR lengths revealed that these mutations result in mRNAs with aberrantly long 3' UTRs, confirming a role for this element in 3' end processing. Interestingly, we found that other sequence elements previously described in the literature as part of the polyadenylation signal had only a minor effect on expression. We further characterize the sequence specificities of the TA-rich element using additional synthetic 3' end sequences and show that its activity is sensitive to single base pair mutations and strongly depends on the A/T content of the surrounding sequences. Finally, using a computational model, we show that the strength of this element in native 3' end sequences can explain some of their measured expression variability (R = 0.41). Together, our results emphasize the importance of efficient 3' end processing for endogenous protein levels and contribute to an improved understanding of the sequence elements involved in this process.

PMID: 25875337
PMCID: PMC4398552
DOI: 10.1371/journal.pgen.1005147
[Indexed for MEDLINE]
Free PMC Article
