Pathologists' diagnosis of invasive melanoma and melanocytic proliferations: observer accuracy and reproducibility study

Joann G Elmore, Raymond L Barnhill, David E Elder, Gary M Longton, Margaret S Pepe, Lisa M Reisch, Patricia A Carney, Linda J Titus, Heidi D Nelson, Tracy Onega, Anna N A Tosteson, Martin A Weinstock, Stevan R Knezevich, Michael W Piepkorn, Joann G Elmore, Raymond L Barnhill, David E Elder, Gary M Longton, Margaret S Pepe, Lisa M Reisch, Patricia A Carney, Linda J Titus, Heidi D Nelson, Tracy Onega, Anna N A Tosteson, Martin A Weinstock, Stevan R Knezevich, Michael W Piepkorn

Abstract

Objective To quantify the accuracy and reproducibility of pathologists' diagnoses of melanocytic skin lesions.Design Observer accuracy and reproducibility study.Setting 10 US states.Participants Skin biopsy cases (n=240), grouped into sets of 36 or 48. Pathologists from 10 US states were randomized to independently interpret the same set on two occasions (phases 1 and 2), at least eight months apart.Main outcome measures Pathologists' interpretations were condensed into five classes: I (eg, nevus or mild atypia); II (eg, moderate atypia); III (eg, severe atypia or melanoma in situ); IV (eg, pathologic stage T1a (pT1a) early invasive melanoma); and V (eg, ≥pT1b invasive melanoma). Reproducibility was assessed by intraobserver and interobserver concordance rates, and accuracy by concordance with three reference diagnoses.Results In phase 1, 187 pathologists completed 8976 independent case interpretations resulting in an average of 10 (SD 4) different diagnostic terms applied to each case. Among pathologists interpreting the same cases in both phases, when pathologists diagnosed a case as class I or class V during phase 1, they gave the same diagnosis in phase 2 for the majority of cases (class I 76.7%; class V 82.6%). However, the intraobserver reproducibility was lower for cases interpreted as class II (35.2%), class III (59.5%), and class IV (63.2%). Average interobserver concordance rates were lower, but with similar trends. Accuracy using a consensus diagnosis of experienced pathologists as reference varied by class: I, 92% (95% confidence interval 90% to 94%); II, 25% (22% to 28%); III, 40% (37% to 44%); IV, 43% (39% to 46%); and V, 72% (69% to 75%). It is estimated that at a population level, 82.8% (81.0% to 84.5%) of melanocytic skin biopsy diagnoses would have their diagnosis verified if reviewed by a consensus reference panel of experienced pathologists, with 8.0% (6.2% to 9.9%) of cases overinterpreted by the initial pathologist and 9.2% (8.8% to 9.6%) underinterpreted.Conclusion Diagnoses spanning moderately dysplastic nevi to early stage invasive melanoma were neither reproducible nor accurate in this large study of pathologists in the USA. Efforts to improve clinical practice should include using a standardized classification system, acknowledging uncertainty in pathology reports, and developing tools such as molecular markers to support pathologists' visual assessments.

Conflict of interest statement

Competing interests: All authors have completed the ICMJE uniform disclosure form at www.icmje.org/coi_disclosure.pdf. All authors had financial support from the National Cancer Institute for the submitted work. RLB reports a financial relationship with Myriad Genetics outside of the submitted work, and GML reports grants from Fred Hutchinson Cancer Research Center during the conduct of the study.

Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.

Figures

https://www.ncbi.nlm.nih.gov/pmc/articles/instance/5485913/bin/elmj038616.f1.jpg
Fig 1 Diagnostic terms given to example case by 36 pathologists who each independently interpreted the same glass slide (top image 5× magnification, bottom image 10× magnification)
https://www.ncbi.nlm.nih.gov/pmc/articles/instance/5485913/bin/elmj038616.f2.jpg
Fig 2 Participant interpretive variation on each of 240 cases, with cases organized based on the MPATH-Dx consensus reference diagnosis class
https://www.ncbi.nlm.nih.gov/pmc/articles/instance/5485913/bin/elmj038616.f3.jpg
Fig 3 Comparison of accuracy (discordance rates of over-interpretation rate and under-interpretation rate) by all three reference diagnoses
https://www.ncbi.nlm.nih.gov/pmc/articles/instance/5485913/bin/elmj038616.f4.jpg
Fig 4 Population level predicted proportions of cutaneous melanocytic biopsy interpretations that would be verified by the consensus reference panel or would be classified as over-interpretations or under-interpretations

References

    1. National Academies of Sciences, Engineering, and Medicine. Improving Diagnosis in Health Care.The National Academies Press, 2015.
    1. Feinstein AR. Boolean Algebra and Clinical Taxonomy. N Engl J Med 1963;269:929-38. 10.1056/NEJM196310312691801 .
    1. Ransohoff DF, Feinstein AR. Problems of spectrum and bias in evaluating the efficacy of diagnostic tests. N Engl J Med 1978;299:926-30. 10.1056/NEJM197810262991705 .
    1. Welch HG, Woloshin S, Schwartz LM. Skin biopsy rates and incidence of melanoma: population based ecological study. BMJ 2005;331:481 10.1136/bmj.38516.649537.E0 .
    1. Gerami P, Busam K, Cochran A, et al. Histomorphologic assessment and interobserver diagnostic reproducibility of atypical spitzoid melanocytic neoplasms with long-term follow-up. Am J Surg Pathol 2014;38:934-40. 10.1097/PAS.0000000000000198 .
    1. Duncan LM, Berwick M, Bruijn JA, Byers HR, Mihm MC, Barnhill RL. Histopathologic recognition and grading of dysplastic melanocytic nevi: an interobserver agreement study. J Invest Dermatol 1993;100:318S-21S. 10.1038/jid.1993.55 .
    1. Duray PH, DerSimonian R, Barnhill R, et al. An analysis of interobserver recognition of the histopathologic features of dysplastic nevi from a mixed group of nevomelanocytic lesions. J Am Acad Dermatol 1992;27:741-9. 10.1016/0190-9622(92)70248-E .
    1. Corona R, Mele A, Amini M, et al. Interobserver variability on the histopathologic diagnosis of cutaneous melanoma and other pigmented skin lesions. J Clin Oncol 1996;14:1218-23. 10.1200/JCO.1996.14.4.1218 .
    1. Swerlick RA, Chen S. The melanoma epidemic. Is increased surveillance the solution or the problem?Arch Dermatol 1996;132:881-4. 10.1001/archderm.1996.03890320029004 .
    1. Onega T, Reisch LM, Frederick PD, et al. Use of Digital Whole Slide Imaging in Dermatopathology. J Digit Imaging 2016;29:243-53. 10.1007/s10278-015-9836-y .
    1. Knezevich SR, Barnhill RL, Elder DE, et al. Variability in mitotic figures in serial sections of thin melanomas. J Am Acad Dermatol 2014;71:1204-11. 10.1016/j.jaad.2014.07.056 .
    1. Carney PA, Frederick PD, Reisch LM, et al. How concerns and experiences with medical malpractice affect dermatopathologists’ perceptions of their diagnostic practices when interpreting cutaneous melanocytic lesions. J Am Acad Dermatol 2016;74:317-24, quiz 324.e1-8..
    1. Piepkorn MW, Barnhill RL, Elder DE, et al. The MPATH-Dx reporting schema for melanocytic proliferations and melanoma. J Am Acad Dermatol 2014;70:131-41. 10.1016/j.jaad.2013.07.027 .
    1. Carney PA, Reisch LM, Piepkorn MW, et al. Achieving consensus for the histopathologic diagnosis of melanocytic lesions: use of the modified Delphi method. J Cutan Pathol 2016;43:830-7. 10.1111/cup.12751 .
    1. Edge SB, Byrd DR, Compton CC, Fritz AG, Greene FL, Trotti A, eds. AJCC cancer staging manual (7th ed). Springer; 2010.
    1. Lott JP, Elmore JG, Zhao GA, et al. International Melanoma Pathology Study Group. Evaluation of the Melanocytic Pathology Assessment Tool and Hierarchy for Diagnosis (MPATH-Dx) classification scheme for diagnosis of cutaneous melanocytic neoplasms: Results from the International Melanoma Pathology Study Group. J Am Acad Dermatol 2016;75:356-63. 10.1016/j.jaad.2016.04.052 .
    1. Dalkey NC, Brown B, Cochran N. The Delphi Method, III. Use of Self Ratings to Improve Group Estimates.Rand Corp, 1969.
    1. American Medical Association. Physicians. Secondary Physicians 2011. .
    1. Elmore JG, Nelson HD, Pepe MS, et al. Variability in Pathologists’ Interpretations of Individual Breast Biopsy Slides: A Population Perspective. Ann Intern Med 2016;164:649-55. 10.7326/M15-0964 .
    1. Feinstein AR. A bibliography of publications on observer variability. J Chronic Dis 1985;38:619-32. 10.1016/0021-9681(85)90016-5 .
    1. Elmore JG, Feinstein AR. A bibliography of publications on observer variability (final installment). J Clin Epidemiol 1992;45:567-80. 10.1016/0895-4356(92)90128-A .
    1. Elmore JG, Longton GM, Carney PA, et al. Diagnostic concordance among pathologists interpreting breast biopsy specimens. JAMA 2015;313:1122-32. 10.1001/jama.2015.1405 .
    1. Elmore JG, Wells CK, Lee CH, Howard DH, Feinstein AR. Variability in radiologists’ interpretations of mammograms. N Engl J Med 1994;331:1493-9. 10.1056/NEJM199412013312206 .
    1. Carli P, De Giorgi V, Naldi L, Dosi G. Dermoscopy Panel. Reliability and inter-observer agreement of dermoscopic diagnosis of melanoma and melanocytic naevi. Eur J Cancer Prev 1998;7:397-402. 10.1097/00008469-199810000-00005 .
    1. Krieger N, Hiatt RA, Sagebiel RW, Clark WH Jr, , Mihm MC Jr. Inter-observer variability among pathologists’ evaluation of malignant melanoma: effects upon an analytic study. J Clin Epidemiol 1994;47:897-902. 10.1016/0895-4356(94)90193-7 .
    1. CRC Melanoma Pathology Panel. A nationwide survey of observer variation in the diagnosis of thin cutaneous malignant melanoma including the MIN terminology. J Clin Pathol 1997;50:202-5. 10.1136/jcp.50.3.202 .
    1. Heenan PJ, Matz LR, Blackwell JB, et al. Inter-observer variation between pathologists in the classification of cutaneous malignant melanoma in western Australia. Histopathology 1984;8:717-29. 10.1111/j.1365-2559.1984.tb02388.x .
    1. Murali R, Hughes MT, Fitzgerald P, Thompson JF, Scolyer RA. Interobserver variation in the histopathologic reporting of key prognostic parameters, particularly clark level, affects pathologic staging of primary cutaneous melanoma. Ann Surg 2009;249:641-7. 10.1097/SLA.0b013e31819ed973 .
    1. Shoo BA, Sagebiel RW, Kashani-Sabet M. Discordance in the histopathologic diagnosis of melanoma at a melanoma referral center. J Am Acad Dermatol 2010;62:751-6. 10.1016/j.jaad.2009.09.043 .
    1. Braun RP, Gutkowicz-Krusin D, Rabinovitz H, et al. Agreement of dermatopathologists in the evaluation of clinically difficult melanocytic lesions: how golden is the ‘gold standard’?Dermatology 2012;224:51-8. 10.1159/000336886 .
    1. Gaudi S, Zarandona JM, Raab SS, English JC 3rd, , Jukic DM. Discrepancies in dermatopathology diagnoses: the role of second review policies and dermatopathology fellowship training. J Am Acad Dermatol 2013;68:119-28. 10.1016/j.jaad.2012.06.034 .
    1. Patrawala S, Maley A, Greskovich C, et al. Discordance of histopathologic parameters in cutaneous melanoma: Clinical implications. J Am Acad Dermatol 2016;74:75-80. 10.1016/j.jaad.2015.09.008 .
    1. Niebling MG, Haydu LE, Karim RZ, Thompson JF, Scolyer RA. Pathology review significantly affects diagnosis and treatment of melanoma patients: an analysis of 5011 patients treated at a melanoma treatment center. Ann Surg Oncol 2014;21:2245-51. 10.1245/s10434-014-3682-x .
    1. Farmer ER, Gonin R, Hanna MP. Discordance in the histopathologic diagnosis of melanoma and melanocytic nevi between expert pathologists. Hum Pathol 1996;27:528-31. 10.1016/S0046-8177(96)90157-4 .
    1. Barnhill RL, Argenyi ZB, From L, et al. Atypical Spitz nevi/tumors: lack of consensus for diagnosis, discrimination from melanoma, and prediction of outcome. Hum Pathol 1999;30:513-20. 10.1016/S0046-8177(99)90193-4 .
    1. Meyer LJ, Piepkorn M, Goldgar DE, et al. Interobserver concordance in discriminating clinical atypia of melanocytic nevi, and correlations with histologic atypia. J Am Acad Dermatol 1996;34:618-25. 10.1016/S0190-9622(96)80061-2 .
    1. Ferrara G, Argenziano G, Soyer HP, et al. Dermoscopic and histopathologic diagnosis of equivocal melanocytic skin lesions: an interdisciplinary study on 107 cases. Cancer 2002;95:1094-100. 10.1002/cncr.10768 .
    1. Colloby PS, West KP, Fletcher A. Observer variation in the measurement of Breslow depth and Clark’s level in thin cutaneous malignant melanoma. J Pathol 1991;163:245-50. 10.1002/path.1711630310 .
    1. Cook MG, Clarke TJ, Humphreys S, et al. The evaluation of diagnostic and prognostic criteria and the terminology of thin cutaneous malignant melanoma by the CRC Melanoma Pathology Panel. Histopathology 1996;28:497-512. 10.1046/j.1365-2559.1996.d01-464.x .
    1. Lock-Andersen J, Hou-Jensen K, Hansen JP, Jensen NK, Søgaard H, Andersen PK. Observer variation in histological classification of cutaneous malignant melanoma. Scand J Plast Reconstr Surg Hand Surg 1995;29:141-8. 10.3109/02844319509034330 .
    1. Spatz A, Cook MG, Elder DE, Piepkorn M, Ruiter DJ, Barnhill RL. Interobserver reproducibility of ulceration assessment in primary cutaneous melanomas. Eur J Cancer 2003;39:1861-5. 10.1016/S0959-8049(03)00325-3 .
    1. Wechsler J, Bastuji-Garin S, Spatz A, et al. French Cutaneous Cancerology Group. Reliability of the histopathologic diagnosis of malignant melanoma in childhood. Arch Dermatol 2002;138:625-8. 10.1001/archderm.138.5.625 .
    1. Eriksson H, Frohm-Nilsson M, Hedblad MA, et al. Interobserver variability of histopathological prognostic parameters in cutaneous malignant melanoma: impact on patient management. Acta Derm Venereol 2013;93:411-6. 10.2340/00015555-1517 .
    1. Scolyer RA, Shaw HM, Thompson JF, et al. Interobserver reproducibility of histopathologic prognostic variables in primary cutaneous melanomas. Am J Surg Pathol 2003;27:1571-6. 10.1097/00000478-200312000-00011 .
    1. Brochez L, Verhaeghe E, Grosshans E, et al. Inter-observer variation in the histopathological diagnosis of clinically suspicious pigmented skin lesions. J Pathol 2002;196:459-66. 10.1002/path.1061 .
    1. Boiko PE, Piepkorn MW. Reliability of skin biopsy pathology. J Am Board Fam Pract 1994;7:371-4..
    1. Piepkorn MW, Barnhill RL, Cannon-Albright LA, et al. A multiobserver, population-based analysis of histologic dysplasia in melanocytic nevi. J Am Acad Dermatol 1994;30:707-14. 10.1016/S0190-9622(08)81499-5 .
    1. Weinstock MA, Barnhill RL, Rhodes AR, Brodsky GL. The Dysplastic Nevus Panel. Reliability of the histopathologic diagnosis of melanocytic dysplasia. Arch Dermatol 1997;133:953-8. 10.1001/archderm.1997.03890440019002 .
    1. Lott JP, Piepkorn MW, Elmore JG. Dermatology in an age of fully transparent electronic medical records. JAMA Dermatol 2015;151:477-8. 10.1001/jamadermatol.2014.4362 .
    1. Delbanco T, Walker J, Bell SK, et al. Inviting patients to read their doctors’ notes: a quasi-experimental study and a look ahead. Ann Intern Med 2012;157:461-70. 10.7326/0003-4819-157-7-201210020-00002 .
    1. Gordon GH, Joos SK, Byrne J. Physician expressions of uncertainty during patient encounters. Patient Educ Couns 2000;40:59-65. 10.1016/S0738-3991(99)00069-5 .
    1. Galloway M, Taiyeb T. The interpretation of phrases used to describe uncertainty in pathology reports. Patholog Res Int 2011;2011:656079..
    1. Attanoos RL, Bull AD, Douglas-Jones AG, Fligelstone LJ, Semararo D. Phraseology in pathology reports. A comparative study of interpretation among pathologists and surgeons. J Clin Pathol 1996;49:79-81. 10.1136/jcp.49.1.79 .
    1. McCullough BJ, Johnson GR, Martin BI, Jarvik JG. Lumbar MR imaging and reporting epidemiology: do epidemiologic data in reports affect clinical management?Radiology 2012;262:941-6. 10.1148/radiol.11110618 .
    1. Shain AH, Yeh I, Kovalyshyn I, et al. The Genetic Evolution of Melanoma from Precursor Lesions. N Engl J Med 2015;373:1926-36. 10.1056/NEJMoa1502583 .
    1. Shain AH, Bastian BC. From melanocytes to melanomas. Nat Rev Cancer 2016;16:345-58. 10.1038/nrc.2016.37 .

Source: PubMed

3
Abonnere