NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Sample GSM1015166 Query DataSets for GSM1015166
Status Public on Dec 22, 2012
Title Frog liver
Sample type SRA
 
Source name Frog_liver_pooled
Organism Xenopus tropicalis
Characteristics gender: mixed
age: adult
tissue: liver
molecule subtype: poly-A enriched RNA
Extracted molecule total RNA
Extraction protocol About 20 μg of each tissue was homogenized in 700 μl QIAzol Lysis Reagent (Qiagen) using ceramic beads (Precellys). RNA was extracted according to manufacturer’s recommendations. Briefly, 140 μl of chloroform was added to the homogenate. After phase separation, 450 μl of isopropanol was added to the upper, aqueous phase. The yield and quality of the total RNA were monitored by spectrophotometry at 260, 280, and 230 nm using a Bioanalyzer Eukaryote Total RNA Nano Series II chip (Agilent).
10 μg extracted total RNA was DNase-treated (Turbo DNase, Ambion) and polyadenylated RNA was enriched twice from total RNA using polyATtract mRNA isolation system IV (Promega). RNA was reversed transcribed and converted into double-stranded cDNA (SuperScript cDNA synthesis kit, Invitrogen), sheared by sonication followed by end-repairment, A-tailing, paired end adapter (Illumina) ligation and, prior to PCR amplification, cDNA was UNG-treated to maintain strand-specificity. cDNA was amplified by 15 cycles of PCR and size selected (200-300 bp).
 
Library strategy RNA-Seq
Library source transcriptomic
Library selection cDNA
Instrument model Illumina HiSeq 2000
 
Description Sample 17
Data processing Full genomic sequences for the analyzed species were downloaded from the UCSC Genome Browser database. Full transcriptomic sequences for all species were downloaded from Ensembl. For each gene, a canonical transcript was selected for gene expression analysis based on the hierarchy derived from the BioMart associated transcript names. For the cases in which such information was not available, the longest protein-coding transcript was selected as the gene representative.
For each species, we assembled all canonical transcript (as defined above) sequences. In order to calculate the effective number of unique mappable positions in each transcript (i.e. the effective length) we performed the following steps. For each read length k, we extracted the L-k+1 (L being the transcript length) k-mer sequences from each canonical transcript and then aligned the full set of k-mers against the respective genome using Bowtie, allowing for a maximum of two mismatches. k-mers with no or one unique genomic alignment were then likewise aligned back to the canonical transcriptome. For each transcript, the number of such k-mers having a unique transcriptomic alignment was determined. This corresponds to the transcript’s effective number of unique mappable positions for k-mer mRNA-Seq reads. For each sample, the corresponding mRNA-Seq data were aligned against the respective genome using Bowtie, allowing for a maximum of two mismatches. Reads with one unique genomic alignment were then aligned against the canonical transcriptome and, for each transcript, the number of reads with one unique transcriptomic alignment were counted. Gene expression levels were determined as reads per thousand mappable positions of target transcript sequence per million of reads, where the reads uniquely align to the analyzed transcriptome. This procedure for estimating gene expression levels is a corrected version of the widely used RPKM (reads per kilobase of target transcript sequence per million of total reads) metric, and is referred to as “cRPKM” [Labbe RM et al. Stem Cells (2012)].
Genome_build: mm9, galGal3, anoCar2, xenTro2, tetNig2
Supplementary_files_format_and_content: Tab-delimited text files include cRPKM values (as defined above), as well as the mappability and the total read count for each canonical transcript, for each Sample. To generate bigWig files we aligned the mRNA-Seq data against the respective genomes using TopHat; genomeCoverageBed (part of the BEDTools suite) was run on the resulting BAM files for the quantification of genomic coverage, outputting bedGraph files; bedGraphToBigWig (dowloaded from UCSC) was then used to generate bigWig files from the bedGraph files.
 
Submission date Oct 04, 2012
Last update date May 15, 2019
Contact name Nuno L Barbosa-Morais
E-mail(s) nuno.barbosa.morais@utoronto.ca
Phone 4169784633
Fax 4169465545
URL http://www.utoronto.ca/intron/barbosa-morais.html
Organization name University of Toronto
Department The Donnelly Centre
Lab Blencowe
Street address 160 College Street, Room 908
City Toronto
State/province ON
ZIP/Postal code M5S 3E1
Country Canada
 
Platform ID GPL15472
Series (1)
GSE41338 The evolutionary landscape of alternative splicing in vertebrate species
Relations
SRA SRX191165
BioSample SAMN01758127

Supplementary file Size Download File type/resource
GSM1015166_Frog_liver.GE.transC.txt.gz 260.7 Kb (ftp)(http) TXT
GSM1015166_Frog_liver.bw 66.5 Mb (ftp)(http) BW
SRA Run SelectorHelp
Raw data are available in SRA
Processed data provided as supplementary file

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap