Predictive symptoms for COVID-19 in the community: REACT-1 study of over 1 million people

Joshua Elliott, Matthew Whitaker, Barbara Bodinier, Oliver Eales, Steven Riley, Helen Ward, Graham Cooke, Ara Darzi, Marc Chadeau-Hyam, Paul Elliott, Joshua Elliott, Matthew Whitaker, Barbara Bodinier, Oliver Eales, Steven Riley, Helen Ward, Graham Cooke, Ara Darzi, Marc Chadeau-Hyam, Paul Elliott

Abstract

Background: Rapid detection, isolation, and contact tracing of community COVID-19 cases are essential measures to limit the community spread of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). We aimed to identify a parsimonious set of symptoms that jointly predict COVID-19 and investigated whether predictive symptoms differ between the B.1.1.7 (Alpha) lineage (predominating as of April 2021 in the US, UK, and elsewhere) and wild type.

Methods and findings: We obtained throat and nose swabs with valid SARS-CoV-2 PCR test results from 1,147,370 volunteers aged 5 years and above (6,450 positive cases) in the REal-time Assessment of Community Transmission-1 (REACT-1) study. This study involved repeated community-based random surveys of prevalence in England (study rounds 2 to 8, June 2020 to January 2021, response rates 22%-27%). Participants were asked about symptoms occurring in the week prior to testing. Viral genome sequencing was carried out for PCR-positive samples with N-gene cycle threshold value < 34 (N = 1,079) in round 8 (January 2021). In univariate analysis, all 26 surveyed symptoms were associated with PCR positivity compared with non-symptomatic people. Stability selection (1,000 penalized logistic regression models with 50% subsampling) among people reporting at least 1 symptom identified 7 symptoms as jointly and positively predictive of PCR positivity in rounds 2-7 (June to December 2020): loss or change of sense of smell, loss or change of sense of taste, fever, new persistent cough, chills, appetite loss, and muscle aches. The resulting model (rounds 2-7) predicted PCR positivity in round 8 with area under the curve (AUC) of 0.77. The same 7 symptoms were selected as jointly predictive of B.1.1.7 infection in round 8, although when comparing B.1.1.7 with wild type, new persistent cough and sore throat were more predictive of B.1.1.7 infection while loss or change of sense of smell was more predictive of the wild type. The main limitations of our study are (i) potential participation bias despite random sampling of named individuals from the National Health Service register and weighting designed to achieve a representative sample of the population of England and (ii) the necessary reliance on self-reported symptoms, which may be prone to recall bias and may therefore lead to biased estimates of symptom prevalence in England.

Conclusions: Where testing capacity is limited, it is important to use tests in the most efficient way possible. We identified a set of 7 symptoms that, when considered together, maximize detection of COVID-19 in the community, including infection with the B.1.1.7 lineage.

Conflict of interest statement

I have read the journal’s policy and the authors of this manuscript have the following competing interests: PE is the director of the MRC Centre of Environment and Health (MR/L01341X/1 and MC/S019669/1) and has no conflict of interest to disclose. M C-H holds shares in the O-SMOSE company and has no conflict of interest to disclose. Consulting activities conducted by the company are independent of the present work. All other authors have no conflict of interest to disclose.

Figures

Fig 1. Flow chart showing numbers of…
Fig 1. Flow chart showing numbers of participants by symptom status and PCR result.
(A) Rounds 2–7 and (B) round 8 of the REACT-1 study.
Fig 2. Results from univariate logistic regression…
Fig 2. Results from univariate logistic regression models of PCR positivity for 26 surveyed symptoms.
Effect size estimates are expressed as odds ratios (95% confidence intervals) in rounds 2–7 (left) and round 8 (right).
Fig 3. Selected symptoms predictive of COVID-19.
Fig 3. Selected symptoms predictive of COVID-19.
Results of LASSO stability selection using 1,000 models (with 50% subsamples of training data from rounds 2–7). Mean (penalized) log odds ratios (log ORs) across all models are shown in the top panel. Positive regression coefficients are presented in teal, and negative in red. Only symptoms selected at least once are displayed. The selection proportions (selection prop.; proportion of 1,000 models that included each symptom) are shown in the middle panel; the horizontal dashed line shows the selection threshold of 50%. Symptoms are ordered according to their selection proportions, and selected symptoms are in black. The bottom panel shows the area under the curve (AUC) of models adding each variable in order of selection proportion (from left to right) in both holdout data from rounds 2–7 (grey) and data from round 8 (red).
Fig 4. Symptoms predictive of B.1.1.7 infection.
Fig 4. Symptoms predictive of B.1.1.7 infection.
LASSO stability selection for symptoms predictive of B.1.1.7 (Alpha) lineage infection versus symptomatic people (aged 5+ years) testing PCR negative in round 8. Mean log odds ratio (Log OR) and selection proportion (selection prop.) are represented for each symptom in the top and bottom panels, respectively. Positive regression coefficients are presented in teal, and negative in red.
Fig 5. B.1.1.7 versus wild-type symptoms.
Fig 5. B.1.1.7 versus wild-type symptoms.
Comparison of symptom profile in B.1.1.7 (Alpha) lineage versus wild-type infection among 1,124 people testing positive in round 8 (other lineages excluded, N = 8). (A) Proportion of people reporting each symptom by lineage (left panel), and the differences in proportions with 95% confidence intervals (right panel). (B) Results of LASSO stability selection (using 1,000 models with 50% subsampling) with B.1.1.7 infection, versus wild-type infection as the outcome, summarized by the mean log odds ratio (Log OR) and selection proportion (selection prop.) for each of the symptoms selected at least once. Positive regression coefficients are presented in teal, and negative in red. The horizontal dashed line represents the selection threshold of 50%.

References

    1. Rannan-Eliya RP, Wijemunige N, Gunawardana JRNA, Amarasinghe SN, Sivagnanam I, Fonseka S, et al.. Increased intensity of PCR testing reduced COVID-19 transmission within countries during the first pandemic wave. Health Aff (Millwood). 2021;40:70–81. doi: 10.1377/hlthaff.2020.01409
    1. World Health Organization. Contact tracing in the context of COVID-19. Geneva: World Health Organization; 2021. [cited 2021 Sep 3]. .
    1. Kerr CC, Mistry D, Stuart RM, Rosenfeld K, Hart GR, Núñez RC, et al.. Controlling COVID-19 via test-trace-quarantine. Nat Commun. 2021;12:2993. doi: 10.1038/s41467-021-23276-9
    1. Hellewell J, Abbott S, Gimma A, Bosse NI, Jarvis CI, Russell TW, et al.. Feasibility of controlling COVID-19 outbreaks by isolation of cases and contacts. Lancet Glob Health. 2020;8:e488–96. doi: 10.1016/S2214-109X(20)30074-7
    1. Centers for Disease Control and Prevention. Symptoms of COVID-19. Atlanta: Centers for Disease Control and Prevention; 2021. [cited 2021 Sep 3]. .
    1. Public Health England. SARS-CoV-2 variants of concern and variants under investigation in England. Technical briefing 8. London: Public Health England; 2021. [cited 2021 Sep 3]. .
    1. . B.1.1.7. ; 2021 [cited 2021 Sep 3]. .
    1. Centers for Disease Control and Prevention. COVID data tracker. Atlanta: Centers for Disease Control and Prevention; 2021. [cited 2021 Sep 3]. .
    1. Riley S, Atchison C, Ashby D, Donnelly CA, Barclay W, Cooke GS, et al.. REal-time Assessment of Community Transmission (REACT) of SARS-CoV-2 virus: study protocol. Wellcome Open Res. 2020;5:200. doi: 10.12688/wellcomeopenres.16228.2
    1. Quick J. nCoV-2019 sequencing protocol v3 (LoCost). Berkeley: ; 2020.
    1. Baker DJ, Aydin A, Le-Viet T, Kay GL, Rudder S, de Oliveira Martins L, et al.. CoronaHiT: high-throughput sequencing of SARS-CoV-2 genomes. Genome Med. 2021;13:21. doi: 10.1186/s13073-021-00839-5
    1. nf-core. nf-core/viralrecon: assembly and intrahost/low-frequency variant calling for viral samples. nf-core; 2021. [cited 2021 Jun 22]. .
    1. GitHub. cov-lineages/pangolin. Github; 2021 [cited 2021 Jun 22]. .
    1. Meinshausen N, Bühlmann P. Stability selection. J R Stat Soc Series B Stat Methodol. 2010;72:417–73.
    1. Shah RD, Samworth RJ. Variable selection with error control: another look at stability selection. J R Stat Soc Series B Stat Methodol. 2013;75:55–80.
    1. Scientific Pandemic Influenza Group on Behaviours. SPI-B: impact of financial and other targeted support on rates of self-isolation or quarantine, 16 September 2020. London: Scientific Advisory Group for Emergencies; 2020. [cited 2021 Sep 3]. .
    1. Caul S. Deaths involving COVID-19 by local area and socioeconomic deprivation: deaths occurring between 1 March and 31 May 2020. Newport: Office for National Statistics; 2020. [cited 2021 Sep 3]. .
    1. Menni C, Valdes AM, Freidin MB, Sudre CH, Nguyen LH, Drew DA, et al.. Real-time tracking of self-reported symptoms to predict potential COVID-19. Nat Med. 2020;26:1037–40. doi: 10.1038/s41591-020-0916-2
    1. Kemp SA, Collier DA, Datir RP, Ferreira IATM, Gayed S, Jahun A, et al.. SARS-CoV-2 evolution during treatment of chronic infection. Nature 2021;592:277–82. doi: 10.1038/s41586-021-03291-y
    1. Starr TN, Greaney AJ, Hilton SK, Ellis D, Crawford KHD, Dingens AS, et al.. Deep mutational scanning of SARS-CoV-2 receptor binding domain reveals constraints on folding and ACE2 binding. Cell. 2020;182:1295–310.e20. doi: 10.1016/j.cell.2020.08.012
    1. Davies NG, Abbott S, Barnard RC, Jarvis CI, Kucharski AJ, Munday JD, et al.. Estimated transmissibility and impact of SARS-CoV-2 lineage B.1.1.7 in England. Science 2021;372:eabg3055. doi: 10.1126/science.abg3055
    1. Challen R, Brooks-Pollock E, Read JM, Dyson L, Tsaneva-Atanasova K, Danon L, et al.. Risk of mortality in patients infected with SARS-CoV-2 variant of concern 202012/1: matched cohort study. BMJ 2021;372:n579. doi: 10.1136/bmj.n579
    1. New and Emerging Respiratory Virus Threats Advisory Group on SARS-CoV-2 variant B.1.1.7. NERVTAG: update note on B.1.1.7 severity, 11 February 2021. London: Scientific Advisory Group for Emergencies; 2021 [cited 2021 Sep 3]. .
    1. Office for National Statistics. Coronavirus (COVID-19) infection survey: characteristics of people testing positive for COVID-19 in England, 27 January 2021. Newport: Office for National Statistics; 2021. [cited 2021 Sep 3]. .
    1. Riley S, Ainslie KEC, Eales O, Walters CE, Wang H, Atchison C, et al.. Resurgence of SARS-CoV-2: detection by community viral surveillance. Science 2021;372:990–5. doi: 10.1126/science.abf0874
    1. Volz E, Mishra S, Chand M, Barrett JC, Johnson R, Geidelberg L, et al.. Assessing transmissibility of SARS-CoV-2 lineage B.1.1.7 in England. Nature 2021;593:266–9. doi: 10.1038/s41586-021-03470-x

Source: PubMed

3
Sottoscrivi