|
|
GEO help: Mouse over screen elements for information. |
|
Status |
Public on Apr 21, 2011 |
Title |
Using RNA-Seq for gene identification, polymorphism detection and transcript profiling in two alfalfa genotypes with divergent cell wall composition in stems |
Organism |
Medicago sativa |
Experiment type |
Expression profiling by high throughput sequencing Genome variation profiling by high throughput sequencing Other
|
Summary |
Alfalfa, [Medicago sativa (L.) sativa], a widely-grown perennial forage has potential for development as a cellulosic ethanol feedstock. The application of genomic approaches would advance development of alfalfa as a cellulosic feedstock. However, the genomics of alfalfa, a non-model species, is still in its infancy. The recent advent of RNA-Seq, a massively parallel sequencing method for transcriptome analysis, provides an opportunity to expand the identification of alfalfa genes and polymorphisms, and conduct in-depth transcript profiling. Cell walls in stems of alfalfa genotype 708 have higher cellulose and lower lignin concentrations compared to cell walls in stems of genotype 773. Using the Illumina GA-II platform, a total of 198,861,304 expression sequence tags (ESTs, 76 bp in length) were generated from cDNA libraries derived from elongating stem (ES) and post-elongation stem (PES) internodes of 708 and 773. These ESTs were de novo assembled into 132,153 unique sequences. By combining the de novo assembled ESTs (132,153 sequences) with our previously identified EST sequences (341,984 sequences, unpublished data), and the ESTs available from GenBank (12,371 sequences), we built the first Alfalfa Gene Index (MSGI 1.0). MSGI 1.0 contains 124,025 unique sequences including 22,729 tentative consensus sequences (TCs), 22,315 singletons and 78,981 pseudo-singletons. We identified a total of 1, 294 simple sequence repeats (SSR) among the sequences in MSGI 1.0. In addition, a total of 10,826 single nucleotide polymorphisms (SNPs) were predicted between the two genotypes. Transcript profiling of stem internodes of genotypes 708 and 773 was conducted by quantifying the number of Illumina EST reads that were mapped to sequences in MSGI 1.0. We identified numerous candidate genes that may play a role in stem development as well as candidate genes that may contribute to the differences in cell wall composition in stems of the two genotypes. Our results demonstrate that RNA-Seq can be successfully used for gene identification, polymorphism detection and transcript profiling in alfalfa, a non-model, allogamous, autotetraploid species. The alfalfa gene index (MSGI 1.0) assembled in this study, and the SNPs, SSRs and candidate genes identified can be used to improve alfalfa as a cellulosic feedstock.
|
|
|
Overall design |
Examination of 2 different tissue types at different developmental stages (Elongating vs. post-elongation stem internodes) in two alfalfa genotypes (708 and 773) with divergent cell wall composition in stems.
|
|
|
Contributor(s) |
Yang SS, Vance CP, Gronwald JW |
Citation(s) |
21504589 |
Submission date |
Jan 20, 2011 |
Last update date |
May 15, 2019 |
Contact name |
Sam Yang |
E-mail(s) |
yangsh38@hotmail.com
|
Phone |
612-626-6582
|
Organization name |
USDA
|
Department |
ARS
|
Lab |
Carroll Vance
|
Street address |
411 Upper Buford Circle
|
City |
St Paul |
State/province |
MN |
ZIP/Postal code |
55108 |
Country |
USA |
|
|
Platforms (2) |
GPL11627 |
Illumina Genome Analyzer II (Medicago sativa) |
GPL11643 |
454 GS (Medicago sativa) |
|
Samples (6)
|
|
Relations |
SRA |
SRP005429 |
BioProject |
PRJNA136349 |
Supplementary file |
Size |
Download |
File type/resource |
GSE26757_Alfalfa_Gene_Index_1.0.fasta.gz |
15.7 Mb |
(ftp)(http) |
FASTA |
GSE26757_Digital_Gene_Expression_Profile_Matrix.txt.gz |
2.2 Mb |
(ftp)(http) |
TXT |
GSE26757_Filtered_708ES_Illumina.fasta.gz |
3.2 Gb |
(ftp)(http) |
FASTA |
GSE26757_Filtered_708PES_Illumina.fasta.gz |
291.8 Mb |
(ftp)(http) |
FASTA |
GSE26757_Filtered_773ES_Illumina.fasta.gz |
2.9 Gb |
(ftp)(http) |
FASTA |
GSE26757_Filtered_773PES_Illumina.fasta.gz |
1.6 Gb |
(ftp)(http) |
FASTA |
GSE26757_MSGI1_0_Annotation.txt.gz |
15.1 Mb |
(ftp)(http) |
TXT |
GSE26757_SNPs_between_708_and_773.txt.gz |
197.9 Kb |
(ftp)(http) |
TXT |
SRA Run Selector |
Raw data are available in SRA |
Processed data are available on Series record |
|
|
|
|
|