Mol Cells. 2014 Jan;37(1):36-42. doi: 10.14348/molcells.2014.2241. Epub 2014 Jan 27.

Genome-wide SNP calling using next generation sequencing data in tomato.

Author information

SEEDERS Inc., Daejeon 305-509, Korea.

Abstract

The tomato (Solanum lycopersicum L.) is a model plant for genome research in the Solanaceae and for studies of crop breeding. Genome-wide single nucleotide polymorphisms (SNPs) are a valuable resource for genetic research and breeding. However, most methods for genome-wide SNP discovery require expensive high-depth sequencing. Here, we describe a method for SNP calling that uses a modified version of SAMtools with improved sensitivity. We analyzed 90 Gb of raw next-generation sequencing data, comprising two resequencing and seven transcriptome data sets from several tomato accessions, and identified 4,812,432 non-redundant SNPs. The SNP calling workflow was further improved by aligning the reference genome with its own raw data. Using this approach, 131,785 SNPs were discovered from the transcriptome data of seven accessions. In addition, 4,680,647 SNPs were identified from the genome of S. pimpinellifolium, more than 60 times the 71,637 SNPs found in the PI212816 transcriptome. We compared the SNP distribution between the whole genome and the transcriptome of S. pimpinellifolium and surveyed the location of SNPs within genic and intergenic regions. Our results indicate that a sufficient number of genome-wide SNP markers, combined with a highly sensitive SNP calling method, enables marker-assisted breeding and genome-wide association studies.
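
The counts quoted in the abstract can be cross-checked with a few lines of arithmetic. The sketch below only uses the figures stated above; the variable names are illustrative, and the abstract does not say how the non-redundant total was derived, so the comparison is a consistency check rather than a reconstruction of the authors' procedure.

```python
# SNP counts as quoted in the abstract (variable names are illustrative).
transcriptome_snps = 131_785    # transcriptome data of seven accessions
genome_snps = 4_680_647         # S. pimpinellifolium whole genome
pi212816_snps = 71_637          # PI212816 transcriptome
reported_total = 4_812_432      # non-redundant SNPs reported overall

# Does the reported total equal the sum of the two quoted subsets?
combined = transcriptome_snps + genome_snps
print("sum matches reported total:", combined == reported_total)

# Fold difference between the genome and the PI212816 transcriptome counts.
fold = genome_snps / pi212816_snps
print(f"genome / PI212816 transcriptome: {fold:.1f}x")
```

The two subsets add up exactly to the reported total, and the fold difference comes out at roughly 65, in line with the "more than 60 times" stated in the abstract.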

PMID: 24552708
PMCID: PMC3907006
DOI: 10.14348/molcells.2014.2241
[Indexed for MEDLINE]
Free PMC Article
