NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Series GSE81861 Query DataSets for GSE81861
Status Public on Mar 20, 2017
Title Reference component analysis of single-cell transcriptomes elucidates cellular heterogeneity in human colorectal tumors
Organism Homo sapiens
Experiment type Expression profiling by high throughput sequencing
Summary Intratumoralmheterogeneity is a major obstacle to cancer treatment and a significant confounding factor in bulk-tumor profiling. We performed an unbiased analysis of transcriptional heterogeneity in colorectal tumors and their microenvironments using single-cell RNA–seq from 11 primary colorectal tumors and matched normal mucosa. To robustly cluster single-cell transcriptomes, we developed reference component analysis (RCA), an algorithm that substantially improves clustering accuracy. Using RCA, we identified two distinct subtypes of cancer-associated fibroblasts (CAFs). Additionally, epithelial–mesenchymal transition (EMT)-related genes were found to be upregulated only in the CAF subpopulation of tumor samples. Notably, colorectal tumors previously assigned to a single subtype on the basis of bulk transcriptomics could be divided into subgroups with divergent survival probability by using single-cell signatures, thus underscoring the prognostic value of our approach. Overall, our results demonstrate that unbiased single-cell RNA–seq profiling of tumor and matched normal samples provides a unique opportunity to characterize aberrant cell states within a tumor.
 
Overall design Two single cell dataset are included: (1) 1,591 single cells from 11 colorectal cancer patients were profiled using Fluidigm based single cell RNA-seq protocol to characterized cellular heterogeneity of colorectal cancer. (2) 630 single cells from 7 cell lines were profiled similarly to benchmark de novo cell type identification algorithms, these include 83 A549 cells, 65 H1437 cells, 55 HCT116 cells, 23 IMR90 cells, 96 K562 cells, and 134 GM12878 cells (38 from batch 1, 96 from batch 2), 174 H1 cells (96 from batch 1, 78 from batch 2).

Please note that [1] only the QC-passed samples are included in the records [2] Raw data is available through EGA [accn: EGAS00001001945; ERP016958] [3] The 'GEO_EGA_ID_match.csv' contains the ERSnnnnnn accession numbers correspoding to each GEO sample raw data.
 
Contributor(s) Li H, Courtois ET
Citation(s) 28319088
Submission date May 25, 2016
Last update date Jul 01, 2020
Contact name Huipeng Li
E-mail(s) lihuipengsmacsb@gmail.com
Organization name Genome Institute of Singapore
Street address 60 Biopolis St, #02-01
City Singapore
ZIP/Postal code 138672
Country Singapore
 
Platforms (1)
GPL11154 Illumina HiSeq 2000 (Homo sapiens)
Samples (1220)
GSM2180156 RHC3546
GSM2180157 RHC3552
GSM2180158 RHC3553
Relations
BioProject PRJNA323703

Download family Format
SOFT formatted family file(s) SOFTHelp
MINiML formatted family file(s) MINiMLHelp
Series Matrix File(s) TXTHelp

Supplementary file Size Download File type/resource
GSE81861_CRC_NM_all_cells_COUNT.csv.gz 3.2 Mb (ftp)(http) CSV
GSE81861_CRC_NM_all_cells_FPKM.csv.gz 4.7 Mb (ftp)(http) CSV
GSE81861_CRC_NM_epithelial_cells_COUNT.csv.gz 2.5 Mb (ftp)(http) CSV
GSE81861_CRC_NM_epithelial_cells_FPKM.csv.gz 4.0 Mb (ftp)(http) CSV
GSE81861_CRC_tumor_all_cells_COUNT.csv.gz 4.3 Mb (ftp)(http) CSV
GSE81861_CRC_tumor_all_cells_FPKM.csv.gz 7.9 Mb (ftp)(http) CSV
GSE81861_CRC_tumor_epithelial_cells_COUNT.csv.gz 3.6 Mb (ftp)(http) CSV
GSE81861_CRC_tumor_epithelial_cells_FPKM.csv.gz 6.5 Mb (ftp)(http) CSV
GSE81861_Cell_Line_COUNT.csv.gz 13.1 Mb (ftp)(http) CSV
GSE81861_Cell_Line_FPKM.csv.gz 28.9 Mb (ftp)(http) CSV
GSE81861_GEO_EGA_ID_match.csv.gz 14.4 Kb (ftp)(http) CSV
Processed data are available on Series record
Raw data not provided for this record

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap