Measuring Spatial Dependence for Infectious Disease Epidemiology

Justin Lessler, Henrik Salje, M Kate Grabowski, Derek A T Cummings, Justin Lessler, Henrik Salje, M Kate Grabowski, Derek A T Cummings

Abstract

Global spatial clustering is the tendency of points, here cases of infectious disease, to occur closer together than expected by chance. The extent of global clustering can provide a window into the spatial scale of disease transmission, thereby providing insights into the mechanism of spread, and informing optimal surveillance and control. Here the authors present an interpretable measure of spatial clustering, τ, which can be understood as a measure of relative risk. When biological or temporal information can be used to identify sets of potentially linked and likely unlinked cases, this measure can be estimated without knowledge of the underlying population distribution. The greater our ability to distinguish closely related (i.e., separated by few generations of transmission) from more distantly related cases, the more closely τ will track the true scale of transmission. The authors illustrate this approach using examples from the analyses of HIV, dengue and measles, and provide an R package implementing the methods described. The statistic presented, and measures of global clustering in general, can be powerful tools for analysis of spatially resolved data on infectious diseases.

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1
The transmission kernel determines the spatial area where directly transmitted cases may be found (area of elevated risk, red circle in (B)). In addition, due to branching in the transmission chain there may be a larger area of elevated prevalence from more distally related cases (area of elevated prevalence, blue circle in (A) and (B)). (C) Example transmission chains. Each color represents a transmission chain of a different strain. The strains are divided into two serotypes, with the blue and red circles representing one serotype and the green and yellow squares a second serotype. (D) Increasing ability to discriminate between different transmission chains will increase the power of the τ-statistic to identify the spatial extent of elevated prevalence. Where case-pairs are known, we can identify the area of elevated risk (black line).
Fig 2. Overview of simulated epidemics of…
Fig 2. Overview of simulated epidemics of 100 separate transmission chains.
Each transmission chain consisted of cases caused by a single strain, with all strains divided into four serotypes. (A) Distribution of underlying population for simulated data. The black square represents the area of analysis to avoid edge effects from the simulation process. The two black dots represent two ‘surveillance hospitals’ (relevant for the spatially-biased observation scenario set out in Fig 3). (B) Using disease type information (blue) allow the τ-statistic to correctly show no spatial clustering of disease in a spatially clustered population, whereas the pair correlation function would falsely indicate clustering in disease cases. (C) τ-statistic results from simulated epidemics where the cases infected individuals between 0 and 100m away. Each case infected one individual (effective reproductive number of one). (D) As (C) but each case infects two individuals (effective reproductive number of two).
Fig 3. Performance of the τ-statistic under…
Fig 3. Performance of the τ-statistic under different observation scenarios: complete observation (blue solid line), spatially random observation (dashed blue line) and spatially biased observation (purple line).
For the spatially random observation all cases had a 1% probability of being observed. For the partially observed observation, the probability of observation was 0.1xexp(-d), where d was the distance (in km) to the closest ‘surveillance hospital’ in the area (marked by two black dots in Fig 2A).
Fig 4
Fig 4
(A) τ-statistic results from 1,912 dengue cases that presented at a Bangkok hospital between 1995 and 1999. The solid line represents the probability of cases occurring within the same month being of the same serotype, relative to any two cases being of the same serotype and occurring within the same month. The dashed line is the same analysis but with a temporal lag of four months and the dotted line are the results using a temporal lag of 15 months. Confidence intervals for the temporal lags of four months and 15 months are presented in S1 Fig. (B) Smoothed τ-statistic of HIV incident to HIV prevalent individuals in Rakai Uganda. The τ-statistic is calculated within households (i.e, at 0m) and in a sliding 250m window up to the average distance between community households, 3.3km. (C) τ-statistic results from 188 measles cases from Hagelloch, Germany in 1861–1862. The τ-statistic is calculated using a sliding 50m window and compares the risk of observing cases at that distance range for individuals sick within two weeks of each other to all time.

References

    1. Diggle PJ. Statistical analysis of spatial point patterns. London: Academic Press; 1983.
    1. Ali M, Emch M, Donnay JP, Yunus M, Sack RB. The spatial epidemiology of cholera in an endemic area of Bangladesh. Soc Sci Med. 2002;55(6):1015–1024.
    1. Zeman P. Objective assessment of risk maps of tick-borne encephalitis and Lyme borreliosis based on spatial patterns of located cases. Int J Epidemiol. 1997;26(5):1121–1129.
    1. Kulldorff M. A spatial scan statistic. Communications in Statistics-Theory and methods, 1997;26(6): 1481–1496.
    1. Viboud C, Bjørnstad ON, Smith DL, Simonsen L, Miller MA, Grenfell BT. Synchrony, waves, and spatial hierarchies in the spread of influenza. Science. 2006;312(5772):447–451.
    1. Snow J. On the mode of communication of cholera. London: John Churchill; 1855.
    1. Rappole JH, Derrickson SR, Hubálek Z. Migratory birds and spread of West Nile virus in the Western Hemisphere. Emerg Infect Dis. 2000;6(4):319–328.
    1. Salje H, Lessler J, Endy TP, Curriero FC, Gibbons RV, Nisalak A, et al. Revealing the microscale spatial signature of dengue transmission and immunity in an urban population. Proc. Natl. Acad. Sci. U.S.A. 2012;109(24):9535–9538. 10.1073/pnas.1120621109
    1. Ferguson NM, Donnelly CA, Anderson RM. The foot-and-mouth epidemic in Great Britain: pattern of spread and impact of interventions. Science. 2001;292(5519):1155–1160.
    1. Omer SB, Enger KS, Moulton LH, Halsey NA, Stokley S, Salmon DA. Geographic clustering of nonmedical exemptions to school immunization requirements and associations with geographic clustering of pertussis. American Journal of Epidemiology. 2008;168(12):1389–1396. 10.1093/aje/kwn263
    1. Moran P. Notes on continuous stochastic phenomena. Biometrika. 1950;37(1–2):17–23.
    1. Geary RC. The Contiguity Ratio and Statistical Mapping. The Incorporated Statistician. 1954;5(3):115–127+ 129–146.
    1. Cliff AD, Ord JK. Model Building and the Analysis of Spatial Pattern in Human Geography. Journal of the Royal Statistical Society. Series B (Methodological). 1975;37(3):297–348.
    1. Gatrell AC, Bailey TC, Diggle PJ, S Rowlingson BS. Spatial Point Pattern Analysis and Its Application in Geographical Epidemiology. Transactions of the Institute of British Geographers, New Series. 1996;21(1):256–274.
    1. Wiegand T, He F, Hubbell SP. A systematic comparison of summary characteristics for quantifying point patterns in ecology. Ecography. 2013;36:092–103.
    1. Bjornstad ON, Ims RA, Lambin X, Spatial population dynamics: Analysing patterns and processes of population synchrony. Trends in Ecology and Evolution, 1999;11:427–431.
    1. Hall P, Fisher NI, Hoffmann B. On the nonparametric estimation of covariance functions. Annals of Statistics, 1994;22:2115–2134.
    1. Baddeley AJ, Møller J, Waagepetersen R. Non- and semi-parametric estimation of interaction in inhomogeneous point patterns, Stat Neerl, 2000;54(3);329–350.
    1. Baddeley A, Rubak E, Turner R. Spatial point patterns: Methodology and applications in R, CRC Press, 2015.
    1. Stoyan D, Stoyan H. Fractals, random shapes and point fields: methods of geometrical statistics. John Wiley and Sons, 1994.
    1. Halstead SB. Dengue. Imperial College Press, London; 2008.
    1. Grabowski MK, Lessler J, Redd AD, Kagaayi J, Laeyendecker O, Ndyanabo A, et al. The Role of Viral Introductions in Sustaining Community-Based HIV Epidemics in Rural Uganda: Evidence from Spatial Clustering, Phylogenetics, and Egocentric Transmission Models. PLoS Med. 2014;11(3):e1001610–e1001610. 10.1371/journal.pmed.1001610
    1. Höhle M, Meyer S, Paul M. Surveillance: Temporal and Spatio-Temporal Modeling and Monitoring of Epidemic Phenomena. R package, 2015.
    1. Neal PJ. Statistical inference and model selection for the 1861 Hagelloch measles epidemic. Biostatistics, 2004;5(2);249–261.
    1. Salje H, Cauchemez S, Alera MT, Rodriguez-Barraquer I, Thaisomboonsuk B, Srikiatkhachorn A, et al. Reconstruction of 60 years of chikungunya epidemiology in the Philippines demonstrates episodic and focal transmission. Journal of Infectious Diseases, 2016;213(3).
    1. Levy JW, Bhoomiboonchoo P, Simasathien S, Salje H, Huang A, Rangsin R, et al. Elevated transmission of upper respiratory illness among new recruits in military barracks in Thailand. Influenza and Other Respiratory Viruses. 2015;9(6):308–314.
    1. Bhoomiboonchoo P, Gibbons RV, Huang A, Yoon IK, Buddhari D, Nisalak A, et al. The Spatial Dynamics of Dengue Virus in Kamphaeng Phet, Thailand. de Silva AM, ed. PLoS Neglected Tropical Diseases. 2014;8(9):e3138 10.1371/journal.pntd.0003138

Source: PubMed

3
Abonnieren