Nat Commun. 2018 Sep 10;9(1):3676. doi: 10.1038/s41467-018-05997-6.

SMRT-Cappable-seq reveals complex operon variants in bacteria.

Author information

New England Biolabs Inc., 240 County Road, Ipswich, MA, 01938, USA.
PacBio, 1305 O'Brien Drive, Menlo Park, CA, 94025, USA.


Current methods for genome-wide analysis of gene expression require fragmentation of the original transcripts into small pieces for short-read sequencing. In bacteria, the resulting fragmented information hides operon complexity. Additionally, in vivo processing of transcripts confounds the accurate identification of the 5' and 3' ends of operons. Here we develop a methodology called SMRT-Cappable-seq that combines the isolation of unfragmented primary transcripts with single-molecule long-read sequencing. Applied to E. coli, this technology yields an accurate definition of the transcriptome, with 34% of known operons from RegulonDB extended by at least one gene. Furthermore, 40% of transcription termination sites show read-through that alters the gene content of the operons. As a result, most bacterial genes are present in multiple operon variants, reminiscent of eukaryotic splicing. By providing such granularity in operon structure, this study represents an important resource for the study of prokaryotic gene networks and regulation.
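The idea of operon variants can be illustrated with a minimal sketch. It is not the authors' pipeline: the gene coordinates, read coordinates, and the function names `genes_covered` and `operon_variants` are all hypothetical, and the sketch assumes a single strand and reads that already represent full-length primary transcripts. Each read is assigned the set of genes it spans completely, and reads sharing the same gene set are counted as one candidate variant.

```python
from collections import Counter

# Hypothetical annotation for illustration only: (name, start, end), one strand.
GENES = [("geneA", 100, 400), ("geneB", 450, 900), ("geneC", 950, 1500)]


def genes_covered(read_start, read_end, genes=GENES):
    """Return the names of genes lying entirely within one full-length read."""
    return tuple(
        name
        for name, g_start, g_end in genes
        if read_start <= g_start and g_end <= read_end
    )


def operon_variants(reads, genes=GENES):
    """Count reads per distinct gene set; each set is one candidate variant."""
    counts = Counter(genes_covered(start, end, genes) for start, end in reads)
    counts.pop((), None)  # discard reads that span no complete gene
    return counts


# Made-up read coordinates (start, end); not data from the study.
reads = [(90, 910), (95, 1510), (90, 905), (440, 1505), (500, 800)]
variants = operon_variants(reads)
for gene_set, n_reads in variants.items():
    print("-".join(gene_set), n_reads)
```

In this toy input, geneB appears in three different gene sets, which is the kind of pattern the abstract describes when read-through at a termination site or an alternative start changes the gene content of a transcript.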
