![]() | ![]() |
Formats:
|
||||||||||||||||||||
A collection of breast cancer cell lines for the study of functionally distinct cancer subtypes 1 Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94270 2 Comprehensive Cancer Center, 2340 Sutter Street, University of California, San Francisco, San Francisco, California 94143 3 Department of Statistics, University of California, Berkeley, Berkeley, California 94720 4 Barbara Ann Karmanos Cancer Institute, Wayne State University School of Medicine, Detroit, Michigan 4820 5 University of Texas Southwestern Medical Center, Dallas, Texas 75390 6 Department of Epidemiology and Biostatistics, Division of Biostatistics, University of California, San Francisco, California 94143 7 Lombardi Comprehensive Cancer Center and Georgetown University School of Medicine, Washington, D.C. 20057 8 University of Michigan Medical School, Ann Arbor, Michigan 48109 *Correspondence: Email: rmneve/at/lbl.gov 9These authors contributed equally to this work. The publisher's final edited version of this article is available at Cancer Cell. See commentary "Integrated breast cancer genomics." in Cancer Cell, volume 10 on page 453. See other articles in PMC that cite the published article.Summary Recent studies suggest that thousands of genes may contribute to breast cancer pathophysiologies when deregulated by genomic or epigenomic events. Here, we describe a model “system” to appraise the functional contributions of these genes to breast cancer subsets. In general, the recurrent genomic and transcriptional characteristics of 51 breast cancer cell lines mirror those of 145 primary breast tumors, although some significant differences are documented. The cell lines that comprise the system also exhibit the substantial genomic, transcriptional, and biological heterogeneity found in primary tumors. We show, using Trastuzumab (Herceptin) monotherapy as an example, that the system can be used to identify molecular features that predict or indicate response to targeted therapies or other physiological perturbations. SIGNIFICANCE The description of the in vitro breast cancer cell line system described here allows assessment of similarities and differences between the cell lines and primary human breast tumors. In general, the system seems well suited to assess the functional contributions of genome copy number abnormalities to breast cancer pathophysiologies, since most of the recurrent genomic deregulation of transcription present in primary tumors is retained in the cell lines. The genomically and biologically heterogeneous cell line system also may be used to identify molecular features that predict or indicate response (or lack thereof) to pathway-targeted therapeutic agents. These features may be assessed as candidate response predictors/indicators to guide early-phase clinical trials. Introduction The evolution of a normal, finite-life-span somatic epithelial cell into an immortalized, metastatic cell requires deregulation of multiple cellular processes including genome stability, proliferation, apoptosis, motility, and angiogenesis (Albertson et al., 2003; Hanahan and Weinberg, 2000). Changes in genome copy number and/or structure are particularly important as deregulating events in cancer progression (Hyman et al., 2002; Jeffrey et al., 2005; Kallioniemi et al., 1994; Loo et al., 2004; Pollack et al., 2002; Roylance et al., 1999; Tirkkonen et al., 1998), and elucidation of recurrent aberrations has revealed many important oncogenes and tumor suppressors (Neve et al., 2004). In fact, over a thousand genes have now been reported to be deregulated by recurrent genome aberrations in breast cancer alone (Fridlyand et al., 2006; Hyman et al., 2002; Pollack et al., 2002). Functional assessment of several of these genes in cell lines and xenografts has rovided invaluable insights into the roles they play in cellular physiology (Alimandi et al., 1995; Cheng et al., 2004). However, interpreting these results in the context of breast cancer pathophysiology requires an understanding of the extent to which the cell lines mirror aberrations that are present in primary tumors. To this end, we describe here a comprehensive comparison of the molecular and biological features of a collection of 51 breast cancer cell lines with those measured for primary breast tumors. The central result of the study is a comparison of genome copy number and transcriptional profiles for the cell lines with those measured for primary breast tumors (Fridlyand et al., 2006). We also evaluated protein and phosphoprotein levels for selected genes in signaling pathways that are frequently deregulated in cancer. These analyses show that the cell lines display the same heterogeneity in copy number and expression abnormalities as the primary tumors, and they carry almost all of the recurrent genomic abnormalities associated with clinical outcome in primary tumors. In addition, the breast cancer cell lines cluster into basal-like and luminal expression subsets, as do primary tumors, although the partitioning of genome aberrations between these subsets is somewhat different than that in basal-like and luminal primary tumors. Importantly, the cell line collection exhibits heterogeneous responses to targeted therapeutics paralleling clinical observations. From these studies, we conclude that the cell line collection mirrors most of the important genomic and resulting transcriptional abnormalities found in primary breast tumors and that analysis of the functions of these genes in the ensemble of cell lines will accurately reflect how they contribute to breast cancer pathophysiologies. We also illustrate the possibility that correlative analyses of the heterogeneous responses to treatment with therapeutic agents that attack these genes may allow identification of molecular features that predict response in individual patients. Results Genomic features We performed array CGH using arrays at 1 Mb resolution (see Experimental Procedures). Our analyses of genome copy number abnormalities in 51 cell lines (listed in Table 1) are provided in Table S1 in the Supplemental Data available with this article online (see also http://cancer.lbl.gov/breastcancer/data.php). As with primary tumors, cell lines exhibit pronounced genomic heterogeneity, even between lines with similar transcriptional profiles (e.g., luminal or basal-like) and biological characteristics, although the number of genome abnormalities per cell line is, on average, higher than that in primary tumors. Figure S1 shows genome copy number abnormality profiles for cell lines that exhibit different levels of genome aberration complexity, in agreement with published array CGH for these cell lines (Larramendy et al., 2000; Shadeo and Lam, 2006; Snijders et al., 2001). Several, like SUM159PT, show relatively few abnormalities. Others, like T47D, show many low-level abnormalities, and many, like BT474 and MCF7, show many abnormalities with high-level amplification. A few, like HCC1500, show extraordinary levels of abnormality not typically found in primary tumors.
Figures 1A and 1B
Transcription profiles Hierarchical clustering of our analyses of transcriptional profiles (Table S2; see also http://cancer.lbl.gov/breastcancer/data.php) of the 51 breast cell lines using transcripts showing substantial variation across the samples revealed two major branches (Figure 3A
To identify genes that classify the luminal, Basal A, and Basal B subtypes, PAM analysis was performed (see Experimental Procedures) (Tibshirani et al., 2002). Table S3 lists 305 classifier genes, and Figure 3B In addition to published histological markers, luminal A cell lines also preferentially expressed genes, such as GATA3, TOB1, ERBB3, and SPDEF, that have been associated with a more differentiated, noninvasive phenotype (Beck et al., 2001; Charafe-Jauffret et al., 2006; Feldman et al., 2003; Lim et al., 2000). Basal B cell lines were more clearly distinct from luminal cells than those in the Basal A cluster and preferentially expressed genes such as CD44, MSN, TGFBR2, CAV1/2, VIM, SPARC, and AXL, while CD24 was weakly expressed. Interestingly, MCF10A and MCF12A (two immortal, nontransformed cell lines) share transcriptional characteristics with all other identified subtypes and had many features of basal progenitor cells (Dontu et al., 2003a; Stingl et al., 1998), suggesting that these cells may represent a multipotent lineage. In contrast to Perou et al. (2000), we did not find a distinct HER2 cluster. Rather, HER2-amplified cells were scattered across the luminal cluster and the Basal A cluster. Although the gene expression patterns generally reflected the major transcriptional classes found in primary tumors, the differences in frequency of genome copy number abnormalities between basal-like and luminal cell lines were different than those between luminal and basal-like primary tumors (Fridlyand et al., 2006). For example, luminal tumors (Figure 4A
Genomic deregulation of gene expression Our analyses of gene expression and copy number in the 51 cell lines revealed 1778 gene transcripts whose levels were correlated with genome copy number (Pearson’s correlation ≥ 0.5, Holm-adjusted p value ≤ 0.05), suggesting that expression levels were deregulated by genomic aberrations. Table S4 summarizes the statistically significant genome copy number versus gene expression correlations discovered in this study. A similar analysis in primary breast tumors (Fridlyand et al., 2006) identified 1182 significantly correlated genes (Pearson’s correlation ≥0.5, Holm-adjusted p value ≤0.05). We assessed the agreement between the tumor and cell line correlation data sets and found that 72% of the genes scored as significantly deregulated in primary tumors also were significantly deregulated in the cell line set (odds ratio for agreement of 16 for correlation > 0.7). This indicates that the cell lines retain most of the genome-aberration-mediated gene deregulation present in primary tumors (Table S4). Sixty-six of the deregulated genes in the tumors were in regions of high-level amplification associated with reduced survival duration and so are both markers for tumors that are resistant to current therapies and candidate therapeutic targets. We identified cell lines in which the 66 candidate therapeutic target genes were amplified and overexpressed by clustering the cell lines using probe sets matched to these genes. Figure 5
Protein profiles We measured levels of 49 gene products or posttranslationally modified gene products associated with aspects of signal transduction and cell cycle regulation and/or frequently found to be aberrant in human cancers using western analysis. These analyses revealed cell lines in which these regulatory processes may be aberrant and allowed an initial assessment of the extent to which genome aberrations affected the protein/phosphoprotein levels. These data also allowed us to assess the extent to which the RNA levels in the cell lines reflected protein levels. Western blots for the cell line collection are shown in Figure S4, and semiquantitative measures of protein levels are summarized in Table S5. Figures S3B and S3C show that the degree of concordance between semiquantitative measures of protein levels from the western blots and RNA expression levels from the Affymetrix expression array analyses varied considerably among the genes (Souchelnytskyi, 2002). We found that concordance for 54% was strong (e.g., ESR1, CDKN1B), while 46% showed low or no concordance (e.g., PTEN). This finding is not surprising considering the high degree of posttranslational processing and degradation that occurs in signaling pathways that regulate proliferation and survival. Biological and molecular associations One important use of the cell line collection is identification of molecular events that are associated with biological phenotype. Establishing such associations is a first step in the development of a molecular understanding of the biological phenotype. The molecular diversity between the cell lines allows this to be accomplished in a robust manner. Morphology and invasion One of the clearest associations in the cell line collection is the relationship between the transcriptionally defined subgroups and distinctive biological characteristics such as morphology and invasive potential illustrated in Figure 6
Predictors of therapeutic response One of the promising potential applications of association analysis using the cell line system is identification of molecular signatures that predict responses to therapies that target genes that are deregulated by genome abnormalities. To illustrate this application, we assessed biological responses to Trastuzumab in nine HER2-amplified cell lines and two control cell lines. Genome copy number profiles for the HER2-amplified cell lines are shown in Figure 7A
Indicators of therapeutic response Association studies also identify molecular events that change in response to treatment with targeted therapies. A previous study suggested that regulation of p27KIP1 is critical in mediating response to Trastuzumab (Nahta et al., 2004). However, this study was limited to clonal Trastuzumab-resistant variants of SKBR3 cells. Our analyses of the molecular and biological responses of HER2 amplified cell lines to Trastuzumab, shown in Figures 7B and 7C Discussion Breast cancer is a remarkably heterogeneous disease, but subsets of tumors show recurrent patterns of transcriptional, genomic, and biological abnormality. Understanding how genes in these “patterns” collectively function in an otherwise heterogeneous biological setting to enable progression and modulate response to therapy is critical to improving management of the disease. Association studies in primary tumors provide clues about molecular events that may be important in cancer pathophysiology, but more formal proof requires model systems that mirror both the heterogeneity and recurrent molecular aberrations found in primary tumors and that can be manipulated to test associations. The comparisons between cell lines and primary tumors in this study show that the cell line collection, as a system, mirrors many but not all of the biological and genomic properties of primary tumors. In general, the cell lines mirror both the genomic heterogeneity (Figure S1) and the recurrent genome copy number abnormalities found in primary tumors with high fidelity (Figures 1 However, some aspects of primary tumor cancer genomics will be difficult to study using the current collection. For example, additional cell lines derived from early-stage breast cancers will be needed to study aberration patterns such as the “1q/16q” breast tumor subtype. This may require development of cell culture conditions that are permissive to the growth of these cells. It is noteworthy in this context that the HCC cell lines (Gazdar et al., 1998; Larramendy et al., 2000) preferentially populate the basal-like lineage, while other cell lines (e.g., the SUM cells [Ethier et al., 1993]) are represented across all the lineage subtypes. This suggests the possibility that culture conditions may bias selection of breast tumor subtypes. Alternatively, the laboratory-specific lineage dependence of the derived cell lines may be explained by the tissue origin. For example, the HCC cell lines were typically derived from primary breast tumors, while many of the other cell lines were derived from pleural effusions (Table 1). Other aspects of cancer biology also are more or less accurately represented by the cell line system. For example, the cell lines can be classified into luminal and basal-like subtypes as found in primary tumors (Figure 3 The high fidelity to which genome-aberration-induced transcriptional changes are preserved in the cell lines and the existence of substantial genomic, transcriptional, translational, and biological heterogeneity in the overall system support the idea that assessment of responses to inhibitors of the resulting dominant or dominant-negative genes will reveal molecular events that predict response/resistance. This concept is supported by studies of responses to Iressa of lung cancer cell lines (Tracy et al., 2004; Zhao et al., 2004) and leukemia cells (Carter et al., 2005; Mahon et al., 2000). Our analyses of the subset of HER2-amplified breast cancer cell lines show variable response to treatment with Trastuzumab as observed in the clinical trials (Vogel et al., 2001) and identify molecular features that may allow more precise identification of HER2-positive patients that will respond to therapeutic protocols containing Trastuzumab. Specifically, increased protein levels of ESR1, TYK2, FASN, GRB7, MEK (S217/219), and MAPK1/3 (Thr202/Tyr204) predict Trastuzumab sensitivity, whereas increased SFN, CAV2, GRB2, RB1, and FLNA expression is associated with Trastuzumab resistance. Many of these genes are known signaling targets or signal integrators of HER2 (Hynes and Lane, 2005), and its downstream pathways, PI3K and MAPK; therefore, mutations in these pathways may be responsible for loss of HER2 oncogene “addiction” and may modulate therapeutic response. In support of this, gene expression profiles indicate that increased expression of several insulin/MAPK pathway genes predicts response, whereas increased mTOR, Toll-like receptor, N-Glycan biosynthesis, and inositol-phosphate signaling predicts resistance. These and subsequent studies set the stage for detailed study of mechanisms of resistance, development of markers that predict or indicate response, and potential new therapeutic targets. In sum, we have cataloged the genomic and molecular properties of a panel of cell lines and demonstrated a fidelity to those found in primary breast tumors. Recurrent genome aberrations and the resulting transcriptional changes are well preserved in the cell line collection. Thus, the cell lines seem well suited to assessment of the functional consequences of genome-aberration-mediated gene deregulation and to identification of molecular features that predict resistance/sensitivity to agents that target these aberrations. Continuing characterization of these cell lines, development of more cell lines and more realistic cell culture environments, and assessment of multiple aberration-targeted agents should provide an increasingly useful resource for the assessment of how genome aberrations contribute to breast cancer pathophysiology. This will facilitate our understanding of the mechanisms of tumorigenesis and stimulate development of new therapies targeted to selectively interfere with one or more of these processes. Experimental procedures Cell culture Breast cancer cell lines were obtained from the ATCC or from collections developed in the laboratories of Drs. Steve Ethier and Adi Gazdar. Cell lines were obtained from these sources to avoid errors that occur when obtaining lines through “secondhand” sources. Since we acknowledge the existence of multiple clonal variants of some cell lines throughout the scientific community, all results presented here are reflective of the cell lines we have in our collection. To maintain the collections’ integrity, cell lines have been carefully maintained in culture, and stocks of the earliest-passage cells have been stored. Quality control is maintained by careful analysis and reanalysis of morphology, growth rates, gene expression, and protein levels. Cell lines can be accurately identified by CGH analysis. All extracts were made from subconfluent cells in the exponential phase of growth in full media. Information about the biological characteristics of the cell lines and the culture conditions are summarized in Table 1 and are available at http://cancer.lbl.gov/breastcancer/data.php. Nucleic acid isolation DNA isolation Cells growing exponentially in culture were washed in phosphate buffered saline (PBS), pelleted by centrifugation, resuspended in PBS, and pelleted again. Pellets were either frozen for long-term storage or used to extract genomic DNA directly. Genomic DNA was extracted using the Wizard DNA Purification Kit (Promega), further purified with a phenol/chloroform extraction, and quantified using a fluorimeter. Phenol/chloroform extraction of the resulting DNA increased measurement precision significantly in some experiments, presumably by removing proteins that interfered with DNA labeling and hybridization. RNA isolation Total RNA was extracted from cell lines using Trizol, according to standard protocols (Invitrogen). RNA integrity was assessed by denaturing formaldehyde agarose gel electrophoresis or by microanalysis (Agilent Bioanalyzer, Palo Alto, CA). Cell lysates Protein lysates were prepared from cells at 50%–75% confluency. The cells were washed in ice-cold PBS containing 1 mM phenylmethylsulfonyl fluoride (PMSF) and then with a buffer containing 50 mM HEPES (pH 7.5), 150 mM NaCl, 25 mM β-glycerophosphate, 25 mM NaF, 5 mM EGTA, 1 mM EDTA, 15 mM pyrophosphate, 2 mM sodium orthovanadate, 10 mM sodium molybdate, leupeptin (10 μg/ml), aprotinin (10μg/ml), and 1 mM PMSF. Cells were extracted in the same buffer containing 1% Nonidet-P40. Lysates were then clarified by centrifugation and frozen at −80°C. Protein concentrations were determined using the Bio-Rad protein assay kit. Immunochemical techniques and immunoblot quantification Immunoblot analyses were performed using 20 μg cleared cell lysates. This material was electrophoretically resolved on denaturing sodium doedecyl sulfate (SDS)-polyacrylamide gels (4%–12%), transferred to polyvinylidene difluoride membranes (PVDF; Millipore), and probed with specific antisera using standard techniques. Bound antibodies on immunoblots were detected by either chemiluminescent (ECL, Pierce) or infrared (LiCor, Odyssey) imaging. Images were recorded as TIFF files for quantitation (see below). Immunoblots analysis of each protein was performed at least twice in all cases to ensure reproducibility. Antibodies used in these western analysis are described in Table S7. Protein quantification Protein levels were measured by quantifying emitted chemiluminescence or infrared radiation recorded from labeled antibodies using Scion Image (http://www.scioncorp.com/) or Odyssey software (http://www.licor.com/). For each protein, the blots were made for 4 sets of 11 cell lines, each set including the same pair (SKBR3 and MCF12A) to permit intensity normalization across sets. A basic multiplicative normalization was carried out by fitting a linear mixed-effects model to log intensity values and adjusting within each set to equalize the log intensities of the pair of reference cell lines across the sets. Boyden chamber invasion assays Assays were performed in modified Boyden chambers with 8 μm pore filter inserts for 24-well plates (BD Bioscience). Filters were coated with 12.5 μl of ice-cold 20% basement membrane extract (Matrigel, BD Bioscience). Epithelial cells were added to the upper chamber in 300 μl of serum-free medium. For the invasion assay, 7.5 × 104 cells were seeded on the 20% Matrigel-coated filters and incubated for 24 hr. The lower chamber was filled with 300 μl of full medium. After incubation, epithelial cells on the underside of the filter were fixed with 2.5% glutaraldehyde in PBS and stained with 0.5% toluidine blue in 2% Na2CO3. Cells that remained in the gel or attached to the upper side of the filter were removed with cotton tips. Cells on the underside of the filter were counted using light microscopy. Assays were performed in triplicate or quadruplicate. The results were expressed as an average ± one standard deviation. Comparative genomic hybridization Each sample was analyzed using Scanning and OncoBAC arrays. Scanning arrays were comprised of 2464 BACs selected at approximately megabase intervals along the genome as described previously (Hodgson et al., 2001; Snijders et al., 2001). OncoBAC arrays were comprised of 1860 P1, PAC, or BAC clones. About three-quarters of the clones on the OncoBAC arrays contained genes and STSs implicated in cancer development or progression. All clones were printed in quadruplicate. Data presented are the union of these two data sets. Arrays were prepared as described (Fridlyand et al., 2006; Snijders et al., 2001). Briefly, we random-prime labeled 500~1000 ng of test (cell line) and reference (normal female, Promega) genomic DNA with CY3-dUTP and CY5-dUTP (Amersham), respectively, using Bioprime kit (In-vitrogen). Labeled DNA samples were coprecipitated with 50 μg of human Cot-1 DNA (Invitrogen), denatured, hybridized to BAC arrays for 48–72 hr, washed, and counterstained with DAPI. Most of the data presented are based on the results of a single hybridization. Repeated measurements of genome aberrations in other experiments show that the results are highly reproducible. Data processing Array CGH data image analyses were performed as described previously (Jain et al., 2002). In this process, an array probe was assigned a missing value for an array if there were fewer than two valid replicates or the standard deviation of the replicates exceeded 0.3. Array probes missing in more than 50% of samples in the OncoBAC or scanning array data sets were excluded in the corresponding set. Array probes representing the same DNA sequence were averaged within each data set and then between the two data sets. Finally, the two data sets were combined, and the array probes missing in more than 25% of the samples, unmapped array probes, and probes mapped to chromosome Y were eliminated. The final data set contained 2696 unique probes representing a resolution of 1 Mb. Affymetrix microarray analysis Total RNA was prepared from samples using Trizol reagent (GIBCO BRL Life Technologies), and quality was assessed on the Agilent Bioanalyser 2100. Preparation of in vitro transcription (IVT) products, oligonucleotide array hybridization, and scanning were performed according to Affymetrix (Santa Clara, California) protocols. In brief, 5 μg of total RNA from each breast cancer cell line and T7-linked oligo-dT primers were used for first-strand cDNA synthesis. IVT reactions were performed to generate biotinylated cRNA targets, which were chemically fragmented at 95°C for 35 min. Fragmented biotinylated cRNA (10 μg) was hybridized at 45°C for 16 hr to Affymetrix high-density oligonucleotide array human HG-U133A chip. The arrays were washed and stained with streptavidin-phycoerythrin (SAPE; final concentration 10 μg/ml). Signal amplification was performed using a biotinylated anti-streptavidin antibody. The array was scanned according to the manufacturer’s instructions (2001 Affymetrix Genechip Technical Manual). Scanned images were inspected for the presence of obvious defects (artifacts or scratches) on the array. Defective chips were excluded, and the sample was reanalyzed. Data processing Probe set based gene expression measurements were generated from quantified Affymetrix image files (“.CEL” files) using the RMA algorithm (Irizarry et al., 2003) from the BioConductor (http://www.bioconductor.org/) tools suite. All 51 CEL files were analyzed simultaneously, creating a data matrix of probe sets by cell lines in which each value is the calculated log abundance of each probe set gene for each cell line. Probe sets were annotated with Unigene annotations from the July 2003 mapping of the human genome (http://genome.ucsc.edu/), resulting in 19,764 annotated probe sets representing 13,406 unique unigenes. Gene expression values were centered by subtracting the mean value of each probe set across the cell line set from each measured value. The gene expression data were organized using hierarchical clustering to facilitate visualization of commonalities and differences in gene expression across the set of cell lines. These analyses were restricted to the set of genes that showed substantial variation across the data set by selecting all probe sets that had at least four measurements that varied by more than Log2 1.89. This resulted in 1438 probe sets corresponding to 1213 unigenes. This variation restriction was arbitrary but did not affect the outcome of the eventual analysis. Probe sets corresponding to the same gene were down-weighted inversely proportional to their frequency prior to clustering (Wouters et al., 2003). Agglomerative clustering (Eisen et al., 1998) was applied to probe sets and cell lines using the uncentered Pearson’s correlations. Resulting clusters were visualized using Java TreeView (Saldanha, 2004). All expression data, array CGH data, and cluster files are available at http://cancer.lbl.gov/breastcancer/data.php. PAM analysis Analysis was performed in R (http://www-stat.stanford.edu/%7Etibs/PAM/Rdist/index.html) following the instructions therein (http://www-stat.stanford.edu/%7Etibs/PAM/Rdist/doc/readme.html) (Tibshirani et al., 2002). Three classifiers were defined (luminal, Basal A, and Basal B, as determined from the hierarchical clustering of the cell line expression data). Classifier training, crossvalidation, and calculation of false discovery rates were performed, resulting in 396 genes identified at a threshold of 4.0. Subsequently, a better threshold scaling was calculated, and a threshold of 2.8 chosen based on the false discovery rate resulted in the 305 gene classifier. Association of copy number with expression The presence of an overall dosage effect was assessed by subdividing each chromosomal arm into nonoverlapping 20 Mb bins and computing the average of cross-Pearson’s-correlations for all gene-clone pairs that mapped to that bin. The average cross-correlations between clones and genes mapping to the same bin were significantly higher than those between clones and genes mapping to unlinked bins (p value < 10−16, Wilcoxon rank sum test). Pearson’s correlations and corresponding p values between expression level and copy number also were calculated for each gene. Each gene was assigned an observed copy number of the nearest mapped BAC array probe. Eighty percent of genes had a nearest clone within 1 Mbp, and 50% had a clone within 400 kb. Correlation between expression and copy number was only computed for the mapped genes whose absolute assigned copy number exceeded 0.2 in at least five samples. This was done to avoid spurious correlations in the absence of real copy number changes. The Holm p value adjustment was applied to correct for multiple testing. Genes with an adjusted p value < 0.05 were considered to have expression levels that were significantly affected by gene dosage. This corresponded to a minimum Pearson’s correlation of 0.44. Microarray data The raw data for expression profiling are available at ArrayExpress (http://www.ebi.ac.uk/arrayexpress/) with accession number E-TABM-157. All expression data, array CGH data, and cluster files are also available at the CaBIG repository (http://caarraydb.nci.nih.gov/caarray/publicExperimentDetailAction.do?expId=1015897590151581), at http://cancer.lbl.gov/breastcancer/data.php, and in the Supplemental Data. Acknowledgments This work was supported by the U.S. Department of Energy, Office of Science, Office of Biological and Environmental Research (contract DE-AC03-76SF00098); the National Institutes of Health under grants CA 58207, CA112970, and CA090788; and the Avon Foundation. The content of this publication does not necessarily reflect the views or policies of the Department of Health and Human Services, nor does mention of trade names, commercial products, or organizations imply endorsement by the U.S. Government. For full disclaimer, see http://www-library.lbl.gov/public/tmRco/howto/RcoBerkeleyLabDisclaimer.htm. Footnotes Supplemental data The Supplemental Data include four supplemental figures and seven supplemental tables and can be found with this article online at http://www.cancercell.org/cgi/content/full/10/6/515/DC1/. Accession numbers The raw data for expression profiling are available at ArrayExpress (http://www.ebi.ac.uk/arrayexpress/) with accession number E-TABM-157. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||
Nat Genet. 2003 Aug; 34(4):369-76.
[Nat Genet. 2003]Cell. 2000 Jan 7; 100(1):57-70.
[Cell. 2000]Cancer Res. 2002 Nov 1; 62(21):6240-5.
[Cancer Res. 2002]J Natl Compr Canc Netw. 2005 May; 3(3):291-300.
[J Natl Compr Canc Netw. 2005]Genes Chromosomes Cancer. 1994 Aug; 10(4):231-43.
[Genes Chromosomes Cancer. 1994]BMC Cancer. 2006 Apr 18; 6():96.
[BMC Cancer. 2006]Cancer Genet Cytogenet. 2000 Jun; 119(2):132-8.
[Cancer Genet Cytogenet. 2000]Breast Cancer Res. 2006; 8(1):R9.
[Breast Cancer Res. 2006]Nat Genet. 2001 Nov; 29(3):263-4.
[Nat Genet. 2001]BMC Cancer. 2006 Apr 18; 6():96.
[BMC Cancer. 2006]Cancer Res. 2004 May 1; 64(9):3037-45.
[Cancer Res. 2004]J Pathol. 2004 Jun; 203(2):661-71.
[J Pathol. 2004]Br J Cancer. 1988 Apr; 57(4):353-7.
[Br J Cancer. 1988]Lab Invest. 2002 Nov; 82(11):1525-33.
[Lab Invest. 2002]Clin Cancer Res. 2004 Aug 15; 10(16):5367-74.
[Clin Cancer Res. 2004]Proc Natl Acad Sci U S A. 2002 May 14; 99(10):6567-72.
[Proc Natl Acad Sci U S A. 2002]Cell Growth Differ. 2001 Feb; 12(2):61-83.
[Cell Growth Differ. 2001]Oncogene. 2006 Apr 6; 25(15):2273-84.
[Oncogene. 2006]Cancer Res. 2003 Aug 1; 63(15):4626-31.
[Cancer Res. 2003]Nat Genet. 2000 Jun; 25(2):209-12.
[Nat Genet. 2000]Genes Dev. 2003 May 15; 17(10):1253-70.
[Genes Dev. 2003]BMC Cancer. 2006 Apr 18; 6():96.
[BMC Cancer. 2006]BMC Cancer. 2006 Apr 18; 6():96.
[BMC Cancer. 2006]J Mammary Gland Biol Neoplasia. 2002 Oct; 7(4):359-71.
[J Mammary Gland Biol Neoplasia. 2002]Cancer Res. 2005 Mar 1; 65(5):2044.
[Cancer Res. 2005]Cancer Res. 2004 Jun 1; 64(11):3981-6.
[Cancer Res. 2004]Mol Cell Biol. 2000 May; 20(9):3210-23.
[Mol Cell Biol. 2000]Oncogene. 2000 Mar 23; 19(13):1647-56.
[Oncogene. 2000]BMC Cancer. 2006 Apr 18; 6():96.
[BMC Cancer. 2006]Int J Cancer. 1998 Dec 9; 78(6):766-74.
[Int J Cancer. 1998]Cancer Genet Cytogenet. 2000 Jun; 119(2):132-8.
[Cancer Genet Cytogenet. 2000]Cancer Res. 1993 Feb 1; 53(3):627-35.
[Cancer Res. 1993]Proc Natl Acad Sci U S A. 1999 Aug 3; 96(16):9212-7.
[Proc Natl Acad Sci U S A. 1999]Proc Natl Acad Sci U S A. 2004 Apr 6; 101(14):4966-71.
[Proc Natl Acad Sci U S A. 2004]Science. 2004 Feb 6; 303(5659):775-7.
[Science. 2004]Proc Natl Acad Sci U S A. 2003 Apr 1; 100(7):3983-8.
[Proc Natl Acad Sci U S A. 2003]Cell Prolif. 2003 Oct; 36 Suppl 1():59-72.
[Cell Prolif. 2003]Cancer Res. 2004 Oct 15; 64(20):7241-4.
[Cancer Res. 2004]Cancer Res. 2004 May 1; 64(9):3060-71.
[Cancer Res. 2004]Proc Natl Acad Sci U S A. 2005 Aug 2; 102(31):11011-6.
[Proc Natl Acad Sci U S A. 2005]Blood. 2000 Aug 1; 96(3):1070-9.
[Blood. 2000]Oncology. 2001; 61 Suppl 2():37-42.
[Oncology. 2001]Nat Genet. 2001 Dec; 29(4):459-64.
[Nat Genet. 2001]Nat Genet. 2001 Nov; 29(3):263-4.
[Nat Genet. 2001]BMC Cancer. 2006 Apr 18; 6():96.
[BMC Cancer. 2006]Genome Res. 2002 Feb; 12(2):325-32.
[Genome Res. 2002]Biostatistics. 2003 Apr; 4(2):249-64.
[Biostatistics. 2003]Biometrics. 2003 Dec; 59(4):1131-9.
[Biometrics. 2003]Proc Natl Acad Sci U S A. 1998 Dec 8; 95(25):14863-8.
[Proc Natl Acad Sci U S A. 1998]Bioinformatics. 2004 Nov 22; 20(17):3246-8.
[Bioinformatics. 2004]Proc Natl Acad Sci U S A. 2002 May 14; 99(10):6567-72.
[Proc Natl Acad Sci U S A. 2002]BMC Cancer. 2006 Apr 18; 6():96.
[BMC Cancer. 2006]BMC Cancer. 2006 Apr 18; 6():96.
[BMC Cancer. 2006]BMC Cancer. 2006 Apr 18; 6():96.
[BMC Cancer. 2006]Cancer Cell. 2006 Dec; 10(6):529-41.
[Cancer Cell. 2006]Int J Cancer. 1998 Dec 9; 78(6):766-74.
[Int J Cancer. 1998]