Confidence intervals and sample size planning for optimal cutpoints

Christian Thiele, Gerrit Hirschfeld, Christian Thiele, Gerrit Hirschfeld

Abstract

Various methods are available to determine optimal cutpoints for diagnostic measures. Unfortunately, many authors fail to report the precision at which these optimal cutpoints are being estimated and use sample sizes that are not suitable to achieve an adequate precision. The aim of the present study is to evaluate methods to estimate the variance of cutpoint estimations based on published descriptive statistics ('post-hoc') and to discuss sample size planning for estimating cutpoints. We performed a simulation study using widely-used methods to optimize the Youden index (empirical, normal, and transformed normal method) and three methods to determine confidence intervals (the delta method, the parametric bootstrap, and the nonparametric bootstrap). We found that both the delta method and the parametric bootstrap are suitable for post-hoc calculation of confidence intervals, depending on the sample size, the distribution of marker values, and the correctness of model assumptions. On average, the parametric bootstrap in combination with normal-theory-based cutpoint estimation has the best coverage. The delta method performs very well for normally distributed data, except in small samples, and is computationally more efficient. Obviously, not every combination of distributions, cutpoint optimization methods, and optimized metrics can be simulated and a lot of the literature is concerned specifically with cutpoints and confidence intervals for the Youden index. This complicates sample size planning for studies that estimate optimal cutpoints. As a practical tool, we introduce a web-application that allows for running simulations of width and coverage of confidence intervals using the percentile bootstrap with various distributions and cutpoint optimization methods.

Conflict of interest statement

The authors have declared that no competing interests exist.

Copyright: © 2023 Thiele, Hirschfeld. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Figures

Fig 1. Coverage probabilities and widths of…
Fig 1. Coverage probabilities and widths of 95% confidence intervals for the optimal cutpoints on normally distributed data.
Fig 2. Coverage probabilities and widths of…
Fig 2. Coverage probabilities and widths of 95% confidence intervals for the optimal cutpoints on lognormally distributed data.
Fig 3. Jitter plot of the width…
Fig 3. Jitter plot of the width of 95% confidence intervals in terms of the pooled standard deviation in different scenarios with normally distributed data depending on the total sample size, the ratio of the sizes of the two groups and for four ranges of Cohen’s d.

References

    1. Sackett DL, Haynes RB. The architecture of diagnostic research. BMJ. British Medical Journal Publishing Group; 2002;324: 539–541. doi: 10.1136/bmj.324.7336.539
    1. von Glischinski M, von Brachel R, Hirschfeld G. How depressed is depressed? A systematic review and diagnostic meta-analysis of optimal cut points for becks depression inventory-revised (BDI-II). Quality of Life Research. 2019;. doi: 10.1007/s11136-018-2050-x
    1. Leeflang MM, Moons KG, Reitsma JB, Zwinderman AH. Bias in sensitivity and specificity caused by data-driven selection of optimal cutoff values: Mechanisms, magnitude, and solutions. Clinical Chemistry. 2008; 729–738. doi: 10.1373/clinchem.2007.096032
    1. Hirschfeld G, do Brasil PEAA. A simulation study into the performance of optimal diagnostic thresholds in the population: Large effect sizes are not enough. Journal of clinical epidemiology. 2014;67: 449–453. doi: 10.1016/j.jclinepi.2013.07.018
    1. Fluss R, Faraggi D, Reiser B. Estimation of the Youden Index and its associated cutoff point. Biometrical Journal. 2005;47: 458–472. doi: 10.1002/bimj.200410135
    1. Wang D, Tian L, Zhao Y. Smoothed empirical likelihood for the Youden index. Computational Statistics & Data Analysis. 2017;115.
    1. Shan G. Improved Confidence Intervals for the Youden Index. PLoS ONE. 2015;10. doi: 10.1371/journal.pone.0127272
    1. Zhou H, Qin G. New Nonparametric Confidence Intervals for the Youden Index. Journal of Biopharmaceutical Statistics. 2012;22: 1244–1257. doi: 10.1080/10543406.2011.592234
    1. Lai C-Y, Tian L, Schisterman EF. Exact confidence interval estimation for the Youden index and its corresponding optimal cut-point. Computational statistics & data analysis. 2012;56: 1103–1114. doi: 10.1016/j.csda.2010.11.023
    1. Schisterman EF, Perkins N. Confidence intervals for the Youden index and corresponding optimal cut-point. Communications in Statistics Part B: Simulation and Computation. 2007;36: 549–563. doi: 10.1080/03610910701212181
    1. R Core Team. R: A Language and Environment for Statistical Computing [Internet]. Vienna, Austria: R Foundation for Statistical Computing; 2018. Available: .
    1. Canty A, Ripley BD. Boot: Bootstrap r (s-plus) functions [Internet]. 2021. Available: .
    1. Thiele C, Hirschfeld G. Cutpointr: Improved estimation and validation of optimal cutpoints in r. Journal of Statistical Software. 2021;98: 1–27. doi: 10.18637/jss.v098.i11
    1. Pedersen TL. Patchwork: The composer of plots [Internet]. 2020. Available: .
    1. Wickham H, Averick M, Bryan J, Chang W, McGowan LD, François R, et al.. Welcome to the tidyverse. Journal of Open Source Software. 2019;4: 1686. doi: 10.21105/joss.01686
    1. Zou KH, Tempany CM, Fielding JR, Silverman SG. Original smooth receiver operating characteristic curve estimation from continuous data: Statistical meth- ods for analyzing the predictive value of spiral CT of ureteral stones. Academic Radiology. 1998;5: 680–687. doi: 10.1016/S1076-6332(98)80562-X
    1. Atkinson AC, Riani M, Corbellini A. The Box–Cox Transformation: Review and Extensions. Statistical Science. 2021;36: 239–255. doi: 10.1214/20-STS778
    1. Carpenter J, Bithell J. Bootstrap confidence intervals: When, which, what? A practical guide for medical statisticians. Statistics in Medicine. 2000;19: 1141–1164. doi: 10.1002/(SICI)1097-0258(20000515)19:9<1141::AID-SIM479>;2-F
    1. Bantis LE, Nakas CT, Reiser B. Construction of confidence intervals for the maximum of the Youden index and the corresponding cutoff point of a continuous biomarker. Biometrical Journal. 2019;61: 138–156. doi: 10.1002/bimj.201700107
    1. Thiele C. Shiny application for cutpointr. 2022. Available: .

Source: PubMed

3
구독하다