Computational biomarker pipeline from discovery to clinical implementation: plasma proteomic biomarkers for cardiac transplantation

Gabriela V Cohen Freue, Anna Meredith, Derek Smith, Axel Bergman, Mayu Sasaki, Karen K Y Lam, Zsuzsanna Hollander, Nina Opushneva, Mandeep Takhar, David Lin, Janet Wilson-McManus, Robert Balshaw, Paul A Keown, Christoph H Borchers, Bruce McManus, Raymond T Ng, W Robert McMaster, Biomarkers in Transplantation and the NCE CECR Prevention of Organ Failure Centre of Excellence Teams, Gabriela V Cohen Freue, Anna Meredith, Derek Smith, Axel Bergman, Mayu Sasaki, Karen K Y Lam, Zsuzsanna Hollander, Nina Opushneva, Mandeep Takhar, David Lin, Janet Wilson-McManus, Robert Balshaw, Paul A Keown, Christoph H Borchers, Bruce McManus, Raymond T Ng, W Robert McMaster, Biomarkers in Transplantation and the NCE CECR Prevention of Organ Failure Centre of Excellence Teams

Abstract

Recent technical advances in the field of quantitative proteomics have stimulated a large number of biomarker discovery studies of various diseases, providing avenues for new treatments and diagnostics. However, inherent challenges have limited the successful translation of candidate biomarkers into clinical use, thus highlighting the need for a robust analytical methodology to transition from biomarker discovery to clinical implementation. We have developed an end-to-end computational proteomic pipeline for biomarkers studies. At the discovery stage, the pipeline emphasizes different aspects of experimental design, appropriate statistical methodologies, and quality assessment of results. At the validation stage, the pipeline focuses on the migration of the results to a platform appropriate for external validation, and the development of a classifier score based on corroborated protein biomarkers. At the last stage towards clinical implementation, the main aims are to develop and validate an assay suitable for clinical deployment, and to calibrate the biomarker classifier using the developed assay. The proposed pipeline was applied to a biomarker study in cardiac transplantation aimed at developing a minimally invasive clinical test to monitor acute rejection. Starting with an untargeted screening of the human plasma proteome, five candidate biomarker proteins were identified. Rejection-regulated proteins reflect cellular and humoral immune responses, acute phase inflammatory pathways, and lipid metabolism biological processes. A multiplex multiple reaction monitoring mass-spectrometry (MRM-MS) assay was developed for the five candidate biomarkers and validated by enzyme-linked immune-sorbent (ELISA) and immunonephelometric assays (INA). A classifier score based on corroborated proteins demonstrated that the developed MRM-MS assay provides an appropriate methodology for an external validation, which is still in progress. Plasma proteomic biomarkers of acute cardiac rejection may offer a relevant post-transplant monitoring tool to effectively guide clinical care. The proposed computational pipeline is highly applicable to a wide range of biomarker proteomic studies.

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Figure 1. Proteomic computational pipeline synopsis.
Figure 1. Proteomic computational pipeline synopsis.
The 3-stage computational pipeline enables an initial untargeted exploration of the plasma proteome resulting in a list of potential biomarkers, followed by the validation of a set of candidate biomarkers that emphasizes the combination of candidate protein biomarkers into a classifier score with clinical utility. The bottom panel outlines the main steps of the computational pipeline that provide a systematic process from discovery to validation to clinical implementation of plasma protein biomarkers.
Figure 2. Plasma protein panel.
Figure 2. Plasma protein panel.
Average Linear Discriminant Analysis classifier score (Classifier score) for all available acute rejection (AR) samples (pink solid point), and non-rejection (NR) samples from NR patients (green solid triangle), at each time point. The score was centered at the LDA cut-off point so that samples with positive and negative scores are classified as “rejection” or “non-rejection”, respectively. Vertical lines represent standard errors. Means and standard error bars can be used to assess differences of the score between groups at any of the studied time points. Sample sizes available at each time point are shown in the bottom table.
Figure 3. Transition plot.
Figure 3. Transition plot.
Linear Discriminant Analysis classifier score (Classifier score) when patients transitioned between non-rejection (NR) and acute rejection (AR) episodes. The first consecutive AR time points were averaged (AR, pink solid point) from 7 AR patients. Non-rejection samples from the same patients, before and after AR (“NR before AR” and “NR after AR”) were averaged (pink solid triangle). The average time trend for these samples is represented with a pink solid line. A control curve (dashed green line) was constructed from 9 NR patients matched to AR patients by available time points (green solid triangle). Vertical lines represent standard errors. The asterisk (*) means that the two-sided t-test p value<0.001.
Figure 4. Technical validation.
Figure 4. Technical validation.
A. For both ELISA/INA and MRM-MS corroboration analysis, p values were calculated by robust eBayes (12 acute rejection (AR) versus 18 non-rejection (NR) samples in ELISA/INA, and 6 AR versus 11 NR in MRM-MS, two-sided test). The correlations among platforms are based on all available common samples, i.e., 25 samples measured by iTRAQ and ELISA/INA, and 23 samples measured by iTRAQ, ELISA/INA, and MRM-MS (Figure S2). B. Validation performance (y-axis) estimated by a 6-fold cross-validation: sensitivity (blue diamond), specificity (brown square), and area under the receiver operating curve (AUC) (red star) for incremental classifier panels. The sensitivity and specificity estimates were calculated using a probability cut-off of 0.5. The x-axis shows three nested classifier panels based on a single candidate marker (B2M), 2 markers (B2M&ADIPOQ) and 3 markers (B2M&ADIPOQ&CP), respectively, measured by MRM-MS. As F10 was not validated in either ELISA/INA or MRM-MS, it was not included in any MRM-based classifier. C. MRM-MS classifier score generated by a 6-fold cross-validation using Linear Discriminant Analysis. Samples with a positive proteomic classifier score are classified as “rejection” and those with a negative score are classified as “non-rejection”.

References

    1. Heeger PS (2003) T-Cell Allorecognition and Transplant Rejection: A Summary and Update. American Journal of Transplantation 3: 525–533.
    1. Stewart S, Winters GL, Fishbein MC, Tazelaar HD, Kobashigawa J, et al. (2005) Revision of the 1990 Working Formulation for the Standardization of Nomenclature in the Diagnosis of Heart Rejection. J Heart Lung Transplant 24: 1710–1720.
    1. Tilkian A, Daily E (1986) Cardiovascular Procedures: diagnostic techniques and therapeutic procedures. St. Louis: Mosby. pp. 180–203.
    1. Atkinson AJ, Colburn WA, DeGruttola VG, DeMets DL, Downing GJ, et al. (2001) Biomarkers and surrogate endpoints: Preferred definitions and conceptual framework*. Clin Pharmacol Ther 69: 89–95.
    1. Anderson NL, Anderson NG (2002) The human plasma proteome: history, character, and diagnostic prospects. Mol Cell Proteomics 1: 845–867.
    1. Anderson NL (2005) The roles of multiple proteomic platforms in a pipeline for new diagnostics. Mol Cell Proteomics 4: 1441–1444.
    1. Paulovich AG, Whiteaker JR, Hoofnagle AN, Wang P (2008) The interface between biomarker discovery and clinical validation: The tar pit of the protein biomarker pipeline. Proteomics Clin Appl 2: 1386–1402.
    1. Rifai N, Gillette MA, Carr SA (2006) Protein biomarker discovery and validation: the long and uncertain path to clinical utility. Nat Biotechnol 24: 971–983.
    1. Addona TA, Shi X, Keshishian H, Mani DR, Burgess M, et al. (2011) A pipeline that integrates the discovery and verification of plasma protein biomarkers reveals candidate markers for cardiovascular disease. Nat Biotechnol 29: 635–643.
    1. Whiteaker JR, Lin C, Kennedy J, Hou L, Trute M, et al. (2011) A targeted proteomics-based pipeline for verification of biomarkers in plasma. Nat Biotechnol 29: 625–634.
    1. Nilsson T, Mann M, Aebersold R, Yates JR, Bairoch A, et al. (2010) Mass spectrometry in high-throughput proteomics: ready for the big time. Nat Meth 7: 681–685.
    1. Käll L, Vitek O (2011) Computational Mass Spectrometry-Based Proteomics. PLoS Comput Biol 7: e1002277.
    1. Noble WS, MacCoss MJ (2012) Computational and Statistical Analysis of Protein Mass Spectrometry Data. PLoS Comput Biol 8: e1002296.
    1. Cannataro M (2008) Computational proteomics: management and analysis of proteomics data. Briefings in Bioinformatics 9: 97–101.
    1. McGuire JN, Overgaard J, Pociot F (2008) Mass spectrometry is only one piece of the puzzle in clinical proteomics. Brief Funct Genomic Proteomic 7: 74–83.
    1. Nesvizhskii AI, Aebersold R (2005) Interpretation of shotgun proteomic data: the protein inference problem. Mol Cell Proteomics 4: 1419–1440.
    1. Bradshaw RA, Burlingame AL, Carr S, Aebersold R (2006) Reporting protein identification data: the next generation of guidelines. Mol Cell Proteomics 5: 787–788.
    1. Shilov IV, Seymour SL, Patel AA, Loboda A, Tang WH, et al. (2007) The Paragon Algorithm, a next generation search engine that uses sequence temperature values and feature probabilities to identify peptides from tandem mass spectra. Mol Cell Proteomics 6: 1638–1655.
    1. Storey JD, Xiao W, Leek JT, Tompkins RG, Davis RW (2005) Significance analysis of time course microarray experiments. Proceedings of the National Academy of Sciences of the United States of America 102: 12837–12842.
    1. Forthofer RN, Lee ES (2007) Biostatistics, Second Edition: A Guide to Design, Analysis and Discovery. London, UK: Elsevier Inc. 528 p.
    1. Chong PK, Wright PC (2005) Identification and characterization of the Sulfolobus solfataricus P2 proteome. J Proteome Res 4: 1789–1798.
    1. Smyth GK (2004) Linear models and empirical bayes methods for assessing differential expression in microarray experiments. Stat Appl Genet Mol Biol 3: Article3.
    1. Hilario M, Kalousis A (2008) Approaches to dimensionality reduction in proteomic biomarker studies. Brief Bioinform 9: 102–118.
    1. Hummel M, Meister R, Mansmann U (2008) GlobalANCOVA: exploration and assessment of gene group effects. Bioinformatics 24: 78–85.
    1. Culhane AC, Quackenbush J (2009) Confounding effects in “A six-gene signature predicting breast cancer lung metastasis”. Cancer Res 69: 7480–7485.
    1. Troyanskaya O, Cantor M, Sherlock G, Brown P, Hastie T, et al. (2001) Missing value estimation methods for DNA microarrays. Bioinformatics 17: 520–525.
    1. Gibbs AL, Keen KJ, Wang L (2011) Case studies in data analysis. Canadian Journal of Statistics 39: 181–217.
    1. Hastie T, Tibshirani R, Friedman J, Franklin J (2005) The elements of statistical learning: data mining, inference and prediction. The Mathematical Intelligencer 27: 83–85.
    1. Robin X, Turck N, Hainard A, Lisacek F, Sanchez JC, et al. (2009) Bioinformatics for protein biomarker panel classification: what is needed to bring biomarker panels into in vitro diagnostics? Expert Rev Proteomics 6: 675–689.
    1. McManus C, Rose M, Dunn M (2006) Proteomics of transplant rejection. Transpl Rev 20: 195–207.
    1. McLachlan GJ (2004) Discriminant analysis and statistical pattern recognition: Wiley-Interscience. 552 p.
    1. Fawcett T (2006) An introduction to ROC analysis. Pattern Recognition Letters 27: 861–874.
    1. Pepe MS, Feng Z, Janes H, Bossuyt PM, Potter JD (2008) Pivotal evaluation of the accuracy of a biomarker used for classification or prediction: standards for study design. J Natl Cancer Inst 100: 1432–1438.
    1. Kerr MK, Churchill GA (2001) Experimental design for gene expression microarrays. Biostatistics 2: 183–201.
    1. Song X, Bandow J, Sherman J, Baker JD, Brown PW, et al. (2008) iTRAQ experimental design for plasma biomarker discovery. J Proteome Res 7: 2952–2958.
    1. Ohlund LB, Hardie DB, Elliott MH, Smith DS, Reid JD, et al. (2010) Standard Operating Procedures and Protocols for the Preparation and Analysis of Plasma Samples using the iTRAQ Methodology. In: Aivanov A, Lazarev A, editors. Sample Preparation in Biological Mass Spectrometry. New York: Springer. pp. 574–624
    1. Cohen Freue G, Sasaki M, Meredith A, Gunther O, Bergman A, et al. (2010) Proteomic signatures in plasma during early acute renal allograft rejection. Mol Cell Proteomics 9: 1954–1967.
    1. Kersey PJ, Duarte J, Williams A, Karavidopoulou Y, Birney E, et al. (2004) The International Protein Index: an integrated database for proteomics experiments. Proteomics 4: 1985–1988.
    1. Kuzyk MA, Parker CE, Borchers CH (2012, in press) Development of MRM based assays for plasma proteins. In: Backvall, H, editor. Methods in Molecular Biology: Humana Press.
    1. Kuzyk M, Smith D, Yang J, Cross T, Jackson A, et al. (2009) MRM-based, Multiplexed, Absolute Quantitation of 45 proteins in human plasma. Molecular and Cellular Proteomics 8: 1860–1877.
    1. R Development Core Team (2005) R: A language and environment for statistical computing. R Foundation for Statistical Computing, version 2.11.1. R Foundation for Statistical Computing, Vienna, Austria. Available: . Accessed 12 February 2013.
    1. Smyth GK (2005) Limma: linear models for microarray data. . In: Gentleman R, Carey V, Dudoit S, Izarry R, Huber W, editors. Bioinformatics and Computational Biology Solutions using R and Bioconductor. New York: Springer. pp. 397–420.
    1. Sing T, Sander O, Beerenwinkel N, Lengauer T (2005) ROCR: visualizing classifier performance in R. Bioinformatics 21: 3940–3941.
    1. Borozdenkova S, Westbrook JA, Patel V, Wait R, Bolad I, et al. (2004) Use of proteomics to discover novel markers of cardiac allograft rejection. J Proteome Res 3: 282–288.
    1. Erez E, Aravot D, Erman A, Sharoni E, Raanani E, et al. (1997) Beta-2 microglobulin in heart transplanted patients. Transplant Proc 29: 2706–2707.
    1. Schmitt F, Myara I, Benoit MO, Guillemain R, Amrein C, et al. (1989) Monitoring of heart allograft rejection by simultaneous measurement of serum beta 2-microglobulin and urinary neopterin. Ann Biol Clin (Paris) 47: 237–241.
    1. Goldman MH, Landwehr DM, Lippman R, Hess M, Wolfgang T, et al. (1982) Beta 2 microglobulins in rejection and cytomegalovirus infection in a cardiac transplant recipient. Transplant Proc 14: 437–439.
    1. Roxe DM, Siddiqui F, Santhanam S, del Greco F, Wolf J (1981) Rationale and application of beta-2-microglobulin measurements to detect acute transplant rejection. Nephron 27: 260–264.
    1. Tang WH, Wu Y, Hartiala J, Fan Y, Stewart AF, et al. (2012) Clinical and genetic association of serum ceruloplasmin with cardiovascular risk. Arterioscler Thromb Vasc Biol 32: 516–522.
    1. Levi M, van der Poll T, Buller HR (2004) Bidirectional relation between inflammation and coagulation. Circulation 109: 2698–2704.
    1. Arora S, Gullestad L, Wergeland R, Simonsen S, Holm T, et al. (2007) Probrain natriuretic peptide and C-reactive protein as markers of acute rejection, allograft vasculopathy, and mortality in heart transplantation. Transplantation 83: 1308–1315.
    1. Magnani G, Carinci V, Magelli C, Potena L, Reggiani LB, et al. (2000) Role of statins in the management of dyslipidemia after cardiac transplant: randomized controlled trial comparing the efficacy and the safety of atorvastatin with pravastatin. J Heart Lung Transplant 19: 710–715.
    1. Okamoto Y, Kihara S, Funahashi T, Matsuzawa Y, Libby P (2006) Adiponectin: a key adipocytokine in metabolic syndrome. Clin Sci 110: 267–278.
    1. Nakano T, Chen C-L, Goto S, Lai C-Y, Hsu L-W, et al. (2007) The immunological role of lipid transfer/metabolic proteins in liver transplantation tolerance. Transplant Immunology 17: 130–136.
    1. Okamoto Y, Christen T, Shimizu K, Asano K, Kihara S, et al. (2009) Adiponectin Inhibits Allograft Rejection in Murine Cardiac Transplantation. Transplantation 88: 879–883.
    1. Horwitz PA, Tsai EJ, Putt ME, Gilmore JM, Lepore JJ, et al. (2004) Detection of cardiac allograft rejection and response to immunosuppressive therapy with peripheral blood gene expression. Circulation 110: 3815–3821.
    1. Deng MC, Eisen HJ, Mehra MR, Billingham M, Marboe CC, et al. (2006) Noninvasive discrimination of rejection in cardiac allograft recipients using gene expression profiling. Am J Transplant 6: 150–160.
    1. Schoels M, Dengler TJ, Richter R, Meuer SC, Giese T (2004) Detection of cardiac allograft rejection by real-time PCR analysis of circulating mononuclear cells. Clin Transplant 18: 513–517.
    1. Lin D, Hollander Z, Ng RT, Imai C, Ignaszewski A, et al. (2009) Whole blood genomic biomarkers of acute cardiac allograft rejection. J Heart Lung Transplant 28: 927–935.

Source: PubMed

3
Abonneren