Format

Send to

Choose Destination
BMC Genomics. 2015 Dec 9;16:1039. doi: 10.1186/s12864-015-2257-y.

Single-molecule real-time transcript sequencing facilitates common wheat genome annotation and grain transcriptome research.

Dong L1, Liu H2, Zhang J3, Yang S4,5, Kong G6, Chu JS7,8, Chen N9,10, Wang D11,12.

Author information

1
The State Key Laboratory of Plant cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China. lldong@genetics.ac.cn.
2
Frasergen, Wuhan East Lake High-tech Zone, Wuhan, 430075, China. liuhongfang@frasergen.com.
3
The State Key Laboratory of Plant cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China. jczhang@genetics.ac.cn.
4
The State Key Laboratory of Plant cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China. sjyang@genetics.ac.cn.
5
University of Chinese Academy of Sciences, Beijing, 100049, China. sjyang@genetics.ac.cn.
6
Frasergen, Wuhan East Lake High-tech Zone, Wuhan, 430075, China. kongguanyi@frasergen.com.
7
Frasergen, Wuhan East Lake High-tech Zone, Wuhan, 430075, China. jeffreychu@frasergen.com.
8
School of Pharmaceutical Sciences, Wuhan University, Wuhan, 430071, China. jeffreychu@frasergen.com.
9
School of Life Science and Technology, Huazhong Agricultural University, Wuhan, 430075, China. chenn@sfu.ca.
10
Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, British Columbia, Canada. chenn@sfu.ca.
11
The State Key Laboratory of Plant cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China. dwwang@genetics.ac.cn.
12
The Collaborative Innovation Center for Grain Crops, Henan Agricultural University, Zhengzhou, 450002, China. dwwang@genetics.ac.cn.

Abstract

BACKGROUND:

The large and complex hexaploid genome has greatly hindered genomics studies of common wheat (Triticum aestivum, AABBDD). Here, we investigated transcripts in common wheat developing caryopses using the emerging single-molecule real-time (SMRT) sequencing technology PacBio RSII, and assessed the resultant data for improving common wheat genome annotation and grain transcriptome research.

RESULTS:

We obtained 197,709 full-length non-chimeric (FLNC) reads, 74.6 % of which were estimated to carry complete open reading frame. A total of 91,881 high-quality FLNC reads were identified and mapped to 16,188 chromosomal loci, corresponding to 13,162 known genes and 3026 new genes not annotated previously. Although some FLNC reads could not be unambiguously mapped to the current draft genome sequence, many of them are likely useful for studying highly similar homoeologous or paralogous loci or for improving chromosomal contig assembly in further research. The 91,881 high-quality FLNC reads represented 22,768 unique transcripts, 9591 of which were newly discovered. We found 180 transcripts each spanning two or three previously annotated adjacent loci, suggesting that they should be merged to form correct gene models. Finally, our data facilitated the identification of 6030 genes differentially regulated during caryopsis development, and full-length transcripts for 72 transcribed gluten gene members that are important for the end-use quality control of common wheat.

CONCLUSIONS:

Our work demonstrated the value of PacBio transcript sequencing for improving common wheat genome annotation through uncovering the loci and full-length transcripts not discovered previously. The resource obtained may aid further structural genomics and grain transcriptome studies of common wheat.

PMID:
26645802
PMCID:
PMC4673716
DOI:
10.1186/s12864-015-2257-y
[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for BioMed Central Icon for PubMed Central
Loading ...
Support Center