show Abstracthide AbstractSorghum (Sorghum bicolor) is globally produced as a source of food, feed, fibre and fuel. Grain and sweet sorghums differ in a number of important traits including stem sugar and juice accumulation, plant height and production of grain and biomass. The first whole genome sequence of a grain sorghum is available, but additional genome sequences are required to study genome-wide and intraspecies variation for dissecting the genetic basis of these important traits and for tailor-designed breeding of this important C4 crop. We resequenced two sweet and one grain sorghum inbred lines, and identified a set of nearly 1,500 genes differentiating sweet and grain sorghum. In addition, we uncovered 1,057,018 SNPs, 99,948 indels of 1-10bp in length and 16,487 presence/absence variations. In addition, 17,111 CNVs were detected. This is a first report on the identification of genome-wide patterns of genetic variation in sorghum. Because some genes might exist in sorghum but are missed in the currently assembled BTx623 sorghum genome. We assembled unmapped reads with SOAPdenovo and obtained contigs with a total length of 7.2 Mb of sequences. Annotation of these contigs showed 73 putative absent genes with an average length of 409bp (only coding regions were considered). A BLAST search against Arabidopsis, rice and maize genome databases revealed that 33 of these genes showed homology with known proteins (E value < 1e-617 ).