Breast cancer PAM50 signature: correlation and concordance between RNA-Seq and digital multiplexed gene expression technologies in a triple negative breast cancer series

A C Picornell, I Echavarria, E Alvarez, S López-Tarruella, Y Jerez, K Hoadley, J S Parker, M Del Monte-Millán, R Ramos-Medina, J Gayarre, I Ocaña, M Cebollero, T Massarrah, F Moreno, J A García Saenz, H Gómez Moreno, A Ballesteros, M Ruiz Borrego, C M Perou, M Martin, A C Picornell, I Echavarria, E Alvarez, S López-Tarruella, Y Jerez, K Hoadley, J S Parker, M Del Monte-Millán, R Ramos-Medina, J Gayarre, I Ocaña, M Cebollero, T Massarrah, F Moreno, J A García Saenz, H Gómez Moreno, A Ballesteros, M Ruiz Borrego, C M Perou, M Martin

Abstract

Background: Full RNA-Seq is a fundamental research tool for whole transcriptome analysis. However, it is too costly and time consuming to be used in routine clinical practice. We evaluated the transcript quantification agreement between RNA-Seq and a digital multiplexed gene expression platform, and the subtype call after running the PAM50 assay in a series of breast cancer patients classified as triple negative by IHC/FISH. The goal of this study is to analyze the concordance between both expression platforms overall, and for calling PAM50 triple negative breast cancer intrinsic subtypes in particular.

Results: The analyses were performed in paraffin-embedded tissues from 96 patients recruited in a multicenter, prospective, non-randomized neoadjuvant triple negative breast cancer trial (NCT01560663). Pre-treatment core biopsies were obtained following clinical practice guidelines and conserved as FFPE for further RNA extraction. PAM50 was performed on both digital multiplexed gene expression and RNA-Seq platforms. Subtype assignment was based on the nearest centroid classification following this procedure for both platforms and it was concordant on 96% of the cases (N = 96). In four cases, digital multiplexed gene expression analysis and RNA-Seq were discordant. The Spearman correlation to each of the centroids and the risk of recurrence were above 0.89 in both platforms while the agreement on Proliferation Score reached up to 0.97. In addition, 82% of the individual PAM50 genes showed a correlation coefficient > 0.80.

Conclusions: In our analysis, the subtype calling in most of the samples was concordant in both platforms and the potential discordances had reduced clinical implications in terms of prognosis. If speed and cost are the main driving forces then the preferred technique is the digital multiplexed platform, while if whole genome patterns and subtype are the driving forces, then RNA-Seq is the preferred method.

Keywords: Breast cancer; Multiplexed gene expression; PAM50; RNA-Seq; Triple negative breast cancer.

Conflict of interest statement

C.M.P is an equity stock holder, consultant, and Board of Director Member, of BioClassifier LLC. C.M.P is also listed an inventor on patent applications on the Breast PAM50 assay. J.S.P is an inventor on patent applications on the Breast PAM50 assay.

Figures

Fig. 1
Fig. 1
PAM50 subtype calls by technique. Barplot represents counts of samples per subtype and technique. The cross table shows in detail the discordances between both platforms
Fig. 2
Fig. 2
Separate centroid correlation when NanoString nCounter® and RNA-Seq platforms are compared. The blue line represents the linear regression. The grey area surrounding it represents the confidence interval
Fig. 3
Fig. 3
Correlation of the correlation to the centroids in both platforms obtained in the PAM50 subtype classifier
Fig. 4
Fig. 4
Correlation of ROR and ROR + PS and their associated Bland-Altman plots in both platforms. The upper/lower dashed lines in the Bland-Altman plots represent the mean difference +/− 1.96 * standard deviation. The central dashed line represents the mean difference
Fig. 5
Fig. 5
Normalized gene expression levels for each gene contained in the PAM50 assay. The log2 normalized counts for RNA-Seq are represented in the X-axis and those for NanoString nCounter® are represented in the Y-axis. The red line represents the LOWESS smoother, which uses locally weighted polynomial regression

References

    1. Perou CM, Sørlie T, Eisen MB, van de Rijn M, Jeffrey SS, Rees CA, et al. Molecular portraits of human breast tumours. Nature. 2000;406(6797):747–752. doi: 10.1038/35021093.
    1. Sørlie T, Perou CM, Tibshirani R, Aas T, Geisler S, Johnsen H, et al. Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications. Proc Natl Acad Sci U S A. 2001;98(19):10869–10874. doi: 10.1073/pnas.191367098.
    1. Hu Z, Fan C, Oh DS, Marron JS, He X, Qaqish BF, et al. The molecular portraits of breast tumors are conserved across microarray platforms. BMC Genomics. 2006;7(1):96. doi: 10.1186/1471-2164-7-96.
    1. Parker JS, Mullins M, Cheang MC, Leung S, Voduc D, Vickery T, et al. Supervised risk predictor of breast cancer based on intrinsic subtypes. J Clin Oncol. 2009;27(8):1160–1167. doi: 10.1200/JCO.2008.18.1370.
    1. Gnant M, Filipits M, Greil R, Stoeger H, Rudas M, Bago-Horvath Z, et al. Predicting distant recurrence in receptor-positive breast cancer patients with limited clinicopathological risk: using the PAM50 risk of recurrence score in 1478 postmenopausal patients of the ABCSG-8 trial treated with adjuvant endocrine therapy alone. Ann Oncol. 2014;25(2):339–345. doi: 10.1093/annonc/mdt494.
    1. Dowsett M, Sestak I, Lopez-Knowles E, Sidhu K, Dunbier AK, Cowens JW, et al. Comparison of PAM50 risk of recurrence score with Oncotype DX and IHC4 for predicting risk of distant recurrence after endocrine therapy. J Clin Oncol. 2013;31(22):2783–2790. doi: 10.1200/JCO.2012.46.1558.
    1. Geiss GK, Bumgarner RE, Birditt B, Dahl T, Dowidar N, Dunaway DL, et al. Direct multiplexed measurement of gene expression with color-coded probe pairs. Nat Biotechnol. 2008;26(3):317–325. doi: 10.1038/nbt1385.
    1. Reis PP, Waldron L, Goswami RS, Xu W, Xuan Y, Perez-Ordonez B, et al. mRNA transcript quantification in archival samples using multiplexed, color-coded probes. BMC Biotechnol. 2011;11(1):46. doi: 10.1186/1472-6750-11-46.
    1. Jovanović B, Sheng Q, Seitz RS, Lawrence KD, Morris SW, Thomas LR, et al. Comparison of triple-negative breast cancer molecular subtyping using RNA from matched fresh-frozen versus formalin-fixed paraffin-embedded tissue. BMC Cancer. 2017;17(1). [cited 2018 Jan 4] Available from: .
    1. Zhao W, He X, Hoadley KA, Parker JS, Hayes DN, Perou CM. Comparison of RNA-Seq by poly (a) capture, ribosomal RNA depletion, and DNA microarray for expression profiling. BMC Genomics. 2014;15(1):1. doi: 10.1186/1471-2164-15-1.
    1. Wallden B, Storhoff J, Nielsen T, Dowidar N, Schaper C, Ferree S, et al. Development and verification of the PAM50-based Prosigna breast cancer gene signature assay. BMC Med Genomics. 2015;8(1). [cited 2016 Sep 5] Available from:
    1. Nielsen TO. Parker JS, Leung S, Voduc D, Ebbert M, Vickery T, et al. A comparison of PAM50 intrinsic subtyping with immunohistochemistry and clinical prognostic factors in tamoxifen-treated estrogen receptor-positive breast Cancer. Clin Cancer Res. 2010;16(21):5222–5232. doi: 10.1158/1078-0432.CCR-10-1282.
    1. Bastien RRL, Rodríguez-Lescure Á, Ebbert MTW, Prat A, Munárriz B, Rowe L, et al. PAM50 breast cancer subtyping by RT-qPCR and concordance with standard clinical molecular markers. BMC Med Genet. 2012;5:44.
    1. Tutt A, Ellis P, Kilburn L, Gilett C, Pinder S, Abraham J. TNT: a randomized phase III trial of carboplatin compared with docetaxel for patients with metastatic or recurrent locally advanced triple negative or BRCA 1/2 breast cancer. San Antonio; 2014.
    1. Cheang MCU, Martin M, Nielsen TO. Prat A, Voduc D, Rodriguez-Lescure A, et al. Defining breast Cancer intrinsic subtypes by quantitative receptor expression. Oncologist. 2015;20(5):474–482. doi: 10.1634/theoncologist.2014-0372.
    1. Prat A, Ellis MJ, Perou CM. Practical implications of gene-expression-based assays for breast oncologists. Nat Rev Clin Oncol. 2012;9(1):48–57. doi: 10.1038/nrclinonc.2011.178.
    1. Chen X, Deane NG, Lewis KB, Li J, Zhu J, Washington MK, et al. Comparison of Nanostring nCounter® data on FFPE Colon Cancer samples and Affymetrix microarray data on matched frozen tissues. Wang X, editor. PLoS One. 2016;11(5):e0153784. doi: 10.1371/journal.pone.0153784.
    1. Veldman-Jones MH, Lai Z, Wappett M, Harbron CG, Barrett JC, Harrington EA, et al. Reproducible, quantitative, and flexible molecular subtyping of clinical DLBCL samples using the NanoString nCounter system. Clin Cancer Res. 2015;21(10):2367–2378. doi: 10.1158/1078-0432.CCR-14-0357.
    1. Vukmirovic M, Herazo-Maya JD, Blackmon J, Skodric-Trifunovic V, Jovanovic D, Pavlovic S, et al. Identification and validation of differentially expressed transcripts by RNA-sequencing of formalin-fixed, paraffin-embedded (FFPE) lung tissue from patients with Idiopathic Pulmonary Fibrosis. BMC Pulm Med. 2017;17(1):15. doi: 10.1186/s12890-016-0356-4.
    1. Wang Z, Gerstein M, Snyder M. RNA-Seq: a revolutionary tool for transcriptomics. Nat Rev Genet. 2009;10(1):57–63. doi: 10.1038/nrg2484.
    1. Sharma P, López-Tarruella S, García-Saenz JA, Ward C, Connor CS, Gómez HL, et al. Efficacy of neoadjuvant carboplatin plus docetaxel in triple-negative breast Cancer: combined analysis of two cohorts. Clin Cancer Res. 2017;23(3):649–657. doi: 10.1158/1078-0432.CCR-16-0162.
    1. Hammond MEH, Hayes DF, Dowsett M, Allred DC, Hagerty KL, Badve S, et al. American Society of Clinical Oncology/College of American Pathologists Guideline Recommendations for Immunohistochemical testing of estrogen and progesterone receptors in breast Cancer. J Clin Oncol. 2010;28(16):2784–2795. doi: 10.1200/JCO.2009.25.6529.
    1. Wolff AC, Hammond MEH, Hicks DG, Dowsett M, McShane LM, Allison KH, et al. Recommendations for human epidermal growth factor receptor 2 testing in breast Cancer: American Society of Clinical Oncology/College of American Pathologists Clinical Practice Guideline Update. J Clin Oncol. 2013;31(31):3997–4013. doi: 10.1200/JCO.2013.50.9984.
    1. Andrews S. FastQC: a quality control tool for high throughput sequence data. 2010. Available from:
    1. Haeussler M, Zweig AS, Tyner C, Speir ML, Rosenbloom KR, Raney BJ, et al. The UCSC genome browser database: 2019 update. Nucleic Acids Res. 2019;47(D1):D853–D858. doi: 10.1093/nar/gky1095.
    1. Wingett S, Andrews S. FastQ screen: a tool for multi-genome mapping and quality control. F1000Research. 2018;7:1338. doi: 10.12688/f1000research.15931.2.
    1. Nickes D, Sandmann T, Ziman R, Bourgon R. NanoStringQCPro: Quality metrics and data processing methods for NanoString mRNA gene expression data. 2018. Available from:
    1. R Core Team . R: a language and environment for statistical computing. Vienna: R Foundation for statistical Computing; 2017.
    1. Shrout PE, Fleiss JL. Intraclass correlations: uses in assessing rater reliability. Psychol Bull. 1979;86(2):420–428. doi: 10.1037/0033-2909.86.2.420.

Source: PubMed

3
S'abonner