Concordant gene expression in leukemia cells and normal leukocytes is associated with germline cis-SNPs

Deborah French, Wenjian Yang, Leo H Hamilton, Geoffrey Neale, Yiping Fan, James R Downing, Nancy J Cox, Ching-Hon Pui, William E Evans, Mary V Relling, Deborah French, Wenjian Yang, Leo H Hamilton, Geoffrey Neale, Yiping Fan, James R Downing, Nancy J Cox, Ching-Hon Pui, William E Evans, Mary V Relling

Abstract

The degree to which gene expression covaries between different primary tissues within an individual is not well defined. We hypothesized that expression that is concordant across tissues is more likely influenced by genetic variability than gene expression which is discordant between tissues. We quantified expression of 11,873 genes in paired samples of primary leukemia cells and normal leukocytes from 92 patients with acute lymphoblastic leukemia (ALL). Genetic variation at >500,000 single nucleotide polymorphisms (SNPs) was also assessed. The expression of only 176/11,783 (1.5%) genes was correlated (p<0.008, FDR = 25%) in the two tissue types, but expression of a high proportion (20 of these 176 genes) was significantly related to cis-SNP genotypes (adjusted p<0.05). In an independent set of 134 patients with ALL, 14 of these 20 genes were validated as having expression related to cis-SNPs, as were 9 of 20 genes in a second validation set of HapMap cell lines. Genes whose expression was concordant among tissue types were more likely to be associated with germline cis-SNPs than genes with discordant expression in these tissues; genes affected were involved in housekeeping functions (GSTM2, GAPDH and NCOR1) and purine metabolism.

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1. The overall strategy: Left: determination…
Figure 1. The overall strategy: Left: determination of genes whose expression is concordant between diagnostic leukemia cells and normal leukocytes; here the expression of CTSW is depicted as an example.
Middle: discovery of germline cis-SNPs that are significantly related to the level of expression of such concordant genes; here a cis-SNP genotype is related to the expression of an example gene (CTSW). Right: validation of the relationship between these same cis-SNPs and their genes in two independent validation sets; here the same cis-SNP related to the expression of CTSW.
Figure 2
Figure 2
a) At adjusted pcis-SNPs associated with the expression of the concordant genes in the normal leukocytes and the leukemia cells in the discovery set. In the discovery set of normal leukocytes (yellow circle), there are 30 out of 176 genes that have significant cis-SNPs and in the discovery set of leukemia cells (green circle) there are 30 out of 176 genes that have significant cis-SNPs. Additionally, in the validation set of leukemia cells (blue circle) there are 28 out of 176 genes that have significant cis-SNPs. There are 14 out of 176 genes with significant cis-SNPs that are common to all 3 of the populations representing 7.95%* of the 176 genes. b) At adjusted p<0.05, there was no difference in the number of cis-SNPs associated with the expression of all 8853 expressed genes in the normal leukocytes and the leukemia cells in the discovery set. In the discovery set of normal leukocytes (yellow circle), there are 559 genes with significant cis-SNPs and in the discovery set of leukemia cells (green circle), there are 532 genes that have significant cis-SNPs (p = 0.41, Fishers exact test). Additionally, in the validation set of leukemia cells (blue circle), there are 619 genes that have significant cis-SNPs. There are 28 out of 8853 genes with significant cis-SNPs that are common to all 3 populations representing 0.32%* of the 8853 genes.
Figure 3
Figure 3
a) An example of one of the 176 genes (here LRAP) whose expression was concordant between diagnostic leukemia cells and normal leukocytes: Left: expression levels among 92 patients in leukemia cells and normal leukocytes are correlated (p = 1.11×10−14); Middle: expression of this gene (median, quartiles, range) in leukemia cells is associated (adjusted p = 0.0016) with the germline genotype of a particular cis-SNP in this gene; Right: expression of this gene in normal leukocytes is also associated (adjusted p = 0.0016) with the germline genotype at the same cis-SNP. (See Figure S4 for additional examples.) b) An example of one of the 8853 genes (here DST) whose expression was not concordant between diagnostic leukemia cells and normal leukocytes: Left: expression levels among 92 patients in leukemia cells and normal leukocytes are not correlated (p = 0.9384); Middle: expression of this gene in leukemia cells is not associated (adjusted p = 1.0000) with the germline genotype of a particular cis-SNP in this gene; Right: expression of this gene in normal leukocytes is also not associated (adjusted p = 0.9994) with the germline genotype at the same cis-SNP. (See Figure S4 for additional examples.)
Figure 4. The 176 genes (open bars)…
Figure 4. The 176 genes (open bars) whose expression was concordant between diagnostic leukemia cells and normal leukocytes were more likely (p = 3.71×10−18, Fishers exact test) to have cis-SNP genotypes (within 50 Kb of a gene) associated with their gene expression (20/176) than genes (closed bars) whose expression was discordant between the leukemia cells and normal leukocytes (53/8,853).
A high proportion of these 176 genes whose expression was concordant between diagnostic leukemia cells and normal leukocytes in the discovery set also had cis-SNPs affecting gene expression in validation set 1 consisting of leukemia cells (14/20) and in validation set 2 consisting of HapMap lymphoid cell lines (9/20).
Figure 5. Genes whose expression is significantly…
Figure 5. Genes whose expression is significantly associated between two tissue types (diagnostic leukemia cell and normal leukocytes) are overrepresented in specific pathways relative to all genes interrogated on the U133A microarray chip.

References

    1. Shyamsundar R, Kim YH, Higgins JP, Montgomery K, Jorden M, et al. A DNA microarray survey of gene expression in normal human tissues. Genome Biol. 2005;6:R22.
    1. Radich JP, Mao M, Stepaniants S, Biery M, Castle J, et al. Individual-specific variation of gene expression in peripheral blood leukocytes. Genomics. 2004;83:980–988.
    1. Eady JJ, Wortley GM, Wormstone YM, Hughes JC, Astley SB, et al. Variation in gene expression profiles of peripheral blood mononuclear cells from healthy volunteers. Physiol Genomics. 2005;22:402–411.
    1. Stranger BE, Forrest MS, Dunning M, Ingle CE, Beazley C, et al. Relative impact of nucleotide and copy number variation on gene expression phenotypes. Science. 2007;315:848–853.
    1. Redon R, Ishikawa S, Fitch KR, Feuk L, Perry GH, et al. Global variation in copy number in the human genome. Nature. 2006;444:444–454.
    1. Schadt EE, Monks SA, Drake TA, Lusis AJ, Che N, et al. Genetics of gene expression surveyed in maize, mouse and man. Nature. 2003;422:297–302.
    1. Glinsky GV. Integration of HapMap-based SNP pattern analysis and gene expression profiling reveals common SNP profiles for cancer therapy outcome predictor genes. Cell Cycle. 2006;5:2613–2625.
    1. Mullighan CG, Goorha S, Radtke I, Miller CB, Coustan-Smith E, et al. Genome-wide analysis of genetic alterations in acute lymphoblastic leukaemia. Nature. 2007;446:758–764.
    1. Morley M, Molony CM, Weber TM, Devlin JL, Ewens KG, et al. Genetic analysis of genome-wide variation in human gene expression. Nature. 2004;430:743–747.
    1. Monks SA, Leonardson A, Zhu H, Cundiff P, Pietrusiak P, et al. Genetic inheritance of gene expression in human cell lines. Am J Hum Genet. 2004;75:1094–1105.
    1. Stranger BE, Forrest MS, Clark AG, Minichiello MJ, Deutsch S, et al. Genome-Wide Associations of Gene Expression Variation in Humans. PLoS Genet. 2005;1:0695–0704.
    1. Goring HH, Curran JE, Johnson MP, Dyer TD, Charlesworth J, et al. Discovery of expression QTLs using large-scale transcriptional profiling in human lymphocytes. Nat Genet. 2007;39:1208–1216.
    1. Bergen AW, Baccarelli A, McDaniel TK, Kuhn K, Pfeiffer R, et al. cis sequence effects on gene expression. BMC Genomics. 2007;8:296.
    1. Stranger BE, Nica AC, Forrest MS, Dimas A, Bird CP, et al. Population genomics of human gene expression. Nat Genet. 2007;39:1217–1224.
    1. Spielman RS, Bastone LA, Burdick JT, Morley M, Ewens WJ, et al. Common genetic variants account for differences in gene expression among ethnic groups. Nat Genet. 2007;39:226–231.
    1. Cheung VG, Conlin LK, Weber TM, Arcaro M, Jen KY, et al. Natural variation in human gene expression assessed in lymphoblastoid cells. Nat Genet. 2003;33:422–425.
    1. Zhang W, Bleibel WK, Roe CA, Cox NJ, Eileen DM. Gender-specific differences in expression in human lymphoblastoid cell lines. Pharmacogenet Genomics. 2007;17:447–450.
    1. Warrington JA, Nair A, Mahadevappa M, Tsyganskaya M. Comparison of human adult and fetal expression and identification of 535 housekeeping/maintenance genes. Physiol Genomics. 2000;2:143–147.
    1. Su AI, Cooke MP, Ching KA, Hakak Y, Walker JR, et al. Large-scale analysis of the human and mouse transcriptomes. Proc Natl Acad Sci U S A. 2002;99:4465–4470.
    1. Tu Z, Wang L, Xu M, Zhou X, Chen T, et al. Further understanding human disease genes by comparing with housekeeping genes and other genes. BMC Genomics. 2006;7:31.
    1. Hsiao LL, Dangond F, Yoshida T, Hong R, Jensen RV, et al. A compendium of gene expression in normal human tissues. Physiol Genomics. 2001;7:97–104.
    1. Yeoh EJ, Ross ME, Shurtleff SA, Williams WK, Patel D, et al. Classification, subtype discovery, and prediction of outcome in pediatric acute lymphoblastic leukemia by gene expression profiling. Cancer Cell. 2002;1:133–143.
    1. Ross ME, Zhou X, Song G, Shurtleff SA, Girtman K, et al. Classification of pediatric acute lymphoblastic leukemia by gene expression profiling. Blood. 2003;102:2951–2959.
    1. Holleman A, Cheok MH, den Boer ML, Yang W, Veerman AJ, et al. Gene-expression patterns in drug-resistant acute lymphoblastic leukemia cells and response to treatment. N Engl J Med. 2004;351:533–542.
    1. Lugthart S, Cheok MH, den Boer ML, Yang W, Holleman A, et al. Identification of genes associated with chemotherapy crossresistance and treatment response in childhood acute lymphoblastic leukemia. Cancer Cell. 2005;7:375–386.
    1. Whitney AR, Diehn M, Popper SJ, Alizadeh AA, Boldrick JC, et al. Individuality and variation in gene expression patterns in human blood. Proc Natl Acad Sci U S A. 2003;100:1896–1901.
    1. Kishi S, Cheng C, French D, Pei D, Das S, et al. Ancestry and pharmacogenetics of antileukemic drug toxicity. Blood. 2007;109:4151–4157.
    1. Hartford C, Yang W, Cheng C, Fan Y, Liu W, et al. Genome scan implicates adhesion biological pathways in secondary leukemia. Leukemia. 2007;21:2128–2136.
    1. Rocha JC, Cheng C, Liu W, Kishi S, Das S, et al. Pharmacogenetics of outcome in children with acute lymphoblastic leukemia. Blood. 2005;105:4752–4758.
    1. French D, Wilkinson MR, Yang W, de CL, Cook EH, et al. Global gene expression as a function of germline genetic variation. Hum Mol Genet. 2005;14:1621–1629.
    1. Birney E, Stamatoyannopoulos JA, Dutta A, Guigo R, Gingeras TR, et al. Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature. 2007;447:799–816.
    1. Kuehl P, Zhang J, Lin Y, Lamba J, Assem M, et al. Sequence diversity in CYP3A promoters and characterization of the genetic basis of polymorphic CYP3A5 expression. Nat Genet. 2001;27:383–391.
    1. Cheok MH, Evans WE. Acute lymphoblastic leukaemia: a model for the pharmacogenomics of cancer therapy 2. Nat Rev Cancer. 2006;6:117–129.
    1. Pui CH, Relling MV, Sandlund JT, Downing JR, Campana D, et al. Rationale and design of Total Therapy Study XV for newly diagnosed childhood acute lymphoblastic leukemia. Ann Hematol. 2004;83(Suppl 1):S124–S126.
    1. Thomas DJ, Trumbower H, Kern AD, Rhead BL, Kuhn RM, et al. Variation resources at UC Santa Cruz. Nucleic Acids Res. 2007;35:D716–D720.
    1. Dai M, Wang P, Boyd AD, Kostov G, Athey B, et al. Evolving gene/transcript definitions significantly alter the interpretation of GeneChip data. Nucleic Acids Res. 2005;33:e175.
    1. Huang RS, Duan S, Shukla SJ, Kistner EO, Clark TA, et al. Identification of genetic variants contributing to cisplatin-induced cytotoxicity by use of a genomewide approach. Am J Hum Genet. 2007;81:427–437.
    1. Storey JD, Tibshirani R. Statistical significance for genomewide studies. Proc Natl Acad Sci U S A. 2003;100:9440–9445.
    1. Westfall P, Young S. Resampling-Based Multiple Testing: Examples and Methods for P-value Adjustment. New York: Wiley; 1993.
    1. Dennis G, Jr, Sherman BT, Hosack DA, Yang J, Gao W, et al. DAVID: Database for Annotation, Visualization, and Integrated Discovery. Genome Biol. 2003;4:3.
    1. Bailey TL, Williams N, Misleh C, Li WW. MEME: discovering and analyzing DNA and protein sequence motifs. Nucleic Acids Res. 2006;34:W369–W373.

Source: PubMed

3
Se inscrever