Modeling observations with a detection limit using a truncated normal distribution with censoring

Justin R Williams, Hyung-Woo Kim, Catherine M Crespi, Justin R Williams, Hyung-Woo Kim, Catherine M Crespi

Abstract

Background: When data are collected subject to a detection limit, observations below the detection limit may be considered censored. In addition, the domain of such observations may be restricted; for example, values may be required to be non-negative.

Methods: We propose a method for estimating population mean and variance from censored observations that accounts for known domain restriction. The method finds maximum likelihood estimates assuming an underlying truncated normal distribution.

Results: We show that our method, tcensReg, has lower bias, Type I error rates, and mean squared error than other methods commonly used for data with detection limits such as Tobit regression and single imputation under a range of simulation settings from mild to heavy censoring and truncation. We further demonstrate the consistency of the maximum likelihood estimators. We apply our method to analyze vision quality data collected from ophthalmology clinical trials comparing different types of intraocular lenses implanted during cataract surgery. All of the methods yield similar conclusions regarding non-inferiority, but estimates from the tcensReg method suggest that there may be greater mean differences and overall variability.

Conclusions: In the presence of detection limits, our new method tcensReg provides a way to incorporate known domain restrictions in dependent variables that substantially improves inferences.

Trial registration: ClinicalTrials.gov NCT01510717 NCT01424189.

Keywords: Contrast sensitivity; Limited domain; Visual acuity; limited dependent variables.

Conflict of interest statement

Hyung-Woo Kim is a former employee of Alcon Laboratories, Inc.

Figures

Fig. 1
Fig. 1
CSV-1000E1 Contrast Sensitivity Chart for 12.0 CPD. 1This testing chart is distributed by Vector Vision and was accessed from http://www.vectorvision.com/csv1000-contrast-sensitivity/ on 29NOV2018
Fig. 2
Fig. 2
Marginal Histograms for Monofocal and Multifocal Lens at 12 CPD under Dim Lighting. Log contrast sensitivity scores are converted from contrast sensitivity threshold scores via Table 1. Detection limit for 12 CPD occurs at ν=0.61. Histogram is shown in the background with Gaussian kernel density estimate in the foreground with bandwidth set to 0.2
Fig. 3
Fig. 3
Truncated Normal Distribution with Censoring. Potential density for a left truncated normal distribution with left censoring. The density above was created with μ=0.8,σ=0.5,a=0, and ν=0.61. In our application, a=0 and ν=0.61
Fig. 4
Fig. 4
Performance Metrics for μ from Six Different Estimation Methods in Single Mean Model. GS = Gold Standard, i.e., uncensored observations with truncation adjustment; Uncens NT = Uncensored data with no truncation adjustment; DL = detection limit; Tobit = Tobit censored regression with no truncation adjustment; tcensReg = Censored regression with truncation adjustment
Fig. 5
Fig. 5
Performance Metrics for σ from Six Different Estimation Methods in Single Mean Model. GS = Gold Standard, i.e., uncensored observations with truncation adjustment; Uncens NT = Uncensored data with no truncation adjustment; DL = detection limit; Tobit = Tobit censored regression with no truncation adjustment; tcensReg = Censored regression with truncation adjustment
Fig. 6
Fig. 6
Average Bias for δ from Six Different Estimation Methods in Two Population Model. The vertical dashed black line corresponds to the case when δ=0, i.e., μ1=μ2. GS = Gold Standard, i.e., uncensored observations with truncation adjustment; Uncens NT = Uncensored data with no truncation adjustment; DL = detection limit; Tobit = Tobit censored regression with no truncation adjustment; tcensReg = Censored regression with truncation adjustment
Fig. 7
Fig. 7
Performance of Maximum Likelihood Estimate for μ as Function of Sample Size. The vertical dashed black line on the left figure corresponds to zero bias. GS = Gold Standard, i.e., uncensored observations with truncation adjustment; Uncens NT = Uncensored data with no truncation adjustment; DL = detection limit; Tobit = Tobit censored regression with no truncation adjustment; tcensReg = Censored regression with truncation adjustment
Fig. 8
Fig. 8
95% Confidence Intervals for Separate Standard Deviation for Monofocal vs Multifocal Lens at 12 CPD
Fig. 9
Fig. 9
90% Confidence Intervals for Difference in Monofocal vs Multifocal Lens at 12 CPD. The horizontal dashed line at δ=−0.15 indicates the non-inferiority margin. DL = detection limit; Tobit = Tobit censored regression with no truncation adjustment; tcensReg = Censored regression with truncation adjustment
Fig. 10
Fig. 10
Estimate of Common Standard Deviation in Monofocal vs Multifocal Lens at 12 CPD. DL = detection limit; Tobit = Tobit censored regression with no truncation adjustment; tcensReg = Censored regression with truncation adjustment
Fig. 11
Fig. 11
Average Log Mean Squared Error for δ from Six Different Estimation Methods in Two Population Model. The vertical dashed black line corresponds to the case when δ=0, i.e., μ1=μ2. GS = Gold Standard, i.e., uncensored observations with truncation adjustment; Uncens NT = Uncensored data with no truncation adjustment; DL = detection limit; Tobit = Tobit censored regression with no truncation adjustment; tcensReg = Censored regression with truncation adjustment
Fig. 12
Fig. 12
Performance Metrics for Common σ from Six Different Estimation Methods in Two Population Model. The vertical dashed black line corresponds to the case when δ=0, i.e., μ1=μ2. GS = Gold Standard, i.e., uncensored observations with truncation adjustment; Uncens NT = Uncensored data with no truncation adjustment; DL = detection limit; Tobit = Tobit censored regression with no truncation adjustment; tcensReg = Censored regression with truncation adjustment
Fig. 13
Fig. 13
Performance of Maximum Likelihood Estimate for σ as Function of Sample Size. The vertical dashed black line on the left figure corresponds to zero bias. GS = Gold Standard, i.e., uncensored observations with truncation adjustment; Uncens NT = Uncensored data with no truncation adjustment; DL = detection limit; Tobit = Tobit censored regression with no truncation adjustment; tcensReg = Censored regression with truncation adjustment

References

    1. Hornung RW, Reed LD. Estimation of average concentration in the presence of nondetectable values. Appl Occup Environ Hyg. 1990;5(1):46–51.
    1. Lubin JH, Colt JS, Camann D, Davis S, Cerhan JR, Severson RK, Bernstein L, Hartge P. Epidemiologic evaluation of measurement data in the presence of detection limits. Environ Health Perspect. 2004;112(17):1691–6.
    1. Schisterman EF, Vexler A, Whitcomb BW, Liu A. The limitations due to exposure detection limits for regression models. Am J Epidemiol. 2006;163(4):374–83.
    1. Helsel DR. Less than obvious-statistical treatment of data below the detection limit. Environ Sci Technol. 1990;24(12):1766–74.
    1. Analytical Methods Committee Recommendations for the definition, estimation and use of the detection limit. Analyst. 1987;112(2):199–204.
    1. Zaugg SD, Sandstrom MW, Smith SG, Fehlberg KM. Methods of Analysis by the U.S. Geological Survey National Water Quality Laboratory-Determination of Pesticides in Water by C-18 Solid Phase Extraction and Capillary-Column Gas Chromatography/Mass Spectrometry with Selected-Ion Monitoring. Open-File Report 95-181, U.S. Geological Survey, Denver, Colorado. 1995. .
    1. McDonald JF, Moffitt RA. The uses of tobit analysis. Rev Econ Stat. 1980;62(2):318–21.
    1. Greene WH. Chap. 19. "Limited Dependent Variables, Truncation, Censoring and Sample Selection". United Kingdom: Pearson; 2018. Econometric Analysis.
    1. Hald A. Maximum likelihood estimation of the parameters of a normal distribution which is truncated at a known point. Skandinavisk Aktuarietidskrift. 1949;32:119–34.
    1. Gupta AK. Estimation of the mean and standard deviation of a normal population from a censored sample. Biometrika. 1952;39(3):260–73.
    1. Harter HL, Moore AH. Iterative maximum-likelihood estimation of the parameters of normal population from singly and doubly censored samples. Biometrika. 1966;53(1-2):205–13.
    1. Tiku M. Estimating the mean and standard deviation from a censored normal sample. Biometrika. 1967;54(1):155–65.
    1. Sarhan AE, Greenberg BG. Estimation of location and scale parameters by order statistics from singly and doubly censored samples. Ann Math Stat. 1956;27(2):427–51.
    1. Dixon WJ. Simplified estimation from censored normal samples. Ann Math Stat. 1960;31(2):385–91.
    1. Tobin J. Estimation of relationships for limited dependent variables. Econometrica. 1958;26(1):24–36.
    1. Buckley J, James I. Linear regression with censored data. Biometrika. 1979;66(3):429–36.
    1. Cohen Jr AC. Estimating the mean and variance of normal populations from singly truncated and doubly truncated samples. Ann Math Stat. 1950;21(4):557–69.
    1. Halperin M. Maximum likelihood estimation in truncated samples. Ann Math Stat. 1952;23(2):226–38.
    1. Amemiya T. Regression analysis when the dependent variable is truncated normal. Econometrica. 1973;41(6):997–1016.
    1. Owsley C, Sloane ME. Contrast sensitivity, acuity, and the perception of ‘real-world’ targets. Br J Ophthalmol. 1987;71(10):791–6.
    1. Pelli DG, Bex P. Measuring contrast sensitivity. Vis Res. 2013;90:10–14.
    1. ISO 11979-7 . Ophthalmic implants – Intraocular lenses – Part 7: Clinical investigations of intraocular lenses for the correction of aphakia. Vernier, Geneva, CH: Standard, International Organization for Standardization; 2018.
    1. Lange K. Numerical Analysis for Statisticians. 2nd Edn. New York: Springer; 2010.
    1. Ypma TJ. Historical development of the newton-raphson method. Soc Ind Appl Math. 1995;37(4):531–51.
    1. Broyden CG. The Convergence of a Class of Double-rank Minimization Algorithms 1. General Considerations. IMA J Appl Math. 1970;6(1):76–90.
    1. Fletcher R. A new approach to variable metric algorithms. Comput J. 1970;13(3):317–22.
    1. Goldfarb D. A family of variable-metric methods derived by variational means. Math Comput. 1970;24(109):23–26.
    1. Shanno DF. Conditioning of quasi-Newton methods for function minimization. Math Comput. 1970;24(111):647–56.
    1. Fletcher R, Reeves CM. Function minimization by conjugate gradients. Comput J. 1964;7(2):149–54.
    1. Henningsen A, Toomet O. maxLik: A package for maximum likelihood estimation in R. Comput Stat. 2011;26(3):443–58.
    1. Henningsen A. Estimating Censored Regression Models in R using the censReg Package. R Package vignettes. 2010; 5(2):12.
    1. Croissant Y, Zeileis A. Truncreg: Truncated Gaussian Regression Models. R package version 0.2-5. 2018. .
    1. R Core Team . R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing; 2019.
    1. Burkardt J. The Truncated Normal Distribution [PDF File]. . Accessed 9 June 2020.
    1. Veall MR, Zimmermann KF. Pseudo-r2 measures for some common limited dependent variable models. J Econ Surv. 1996;10(3):241–59.
    1. McKelvey RD, Zavoina W. A statistical model for the analysis of ordinal level dependent variables. J Math Sociol. 1975;4(1):103–20.
    1. Lindstrom MJ, Bates DM. Newton-raphson and em algorithms for linear mixed-effects models for repeated-measures data. J Am Stat Assoc. 1988;83(404):1014–22.
    1. Zhang X, Wan AT, Zhou SZ. Focused information criteria, model selection, and model averaging in a Tobit model with a nonzero threshold. J Bus Econ Stat. 2012;30(1):132–42.

Source: PubMed

3
S'abonner