Reassessing Reported Deaths and Estimated Infection Attack Rate during the First 6 Months of the COVID-19 Epidemic, Delhi, India

38
Reassessing Reported Deaths and Estimated Infection Attack Rate during the First 6 Months of the COVID-19 Epidemic, Delhi, India
Reassessing Reported Deaths and Estimated Infection Attack Rate during the First 6 Months of the COVID-19 Epidemic, Delhi, India

Africa-Press – Malawi. Disclaimer: Early release articles are not considered as final versions. Any changes will be reflected in the online version in the month the article is officially released.

Figure 1 Multiple severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) seroprevalence surveys conducted during 2020 in Delhi, one of India’s largest metropolitan areas (20 million residents), offered us an opportunity to assess the completeness of reported COVID-19 deaths and estimate the actual infection attack rate. SARS-CoV-2 transmission in Delhi has led to several waves of infection and death (Figure 1). At the beginning of the epidemic, all SARS-CoV-2 testing relied on reverse transcription PCR (RT-PCR), but after mid-June 2020, use of antigen-based rapid diagnostic tests (Ag-RDTs), which have lower sensitivity, quickly exceeded use of RT-PCR tests (Appendix Figure 1). Three serosurveys conducted in Delhi during 2020 that sampled participants >4 years of age found age- and sex-adjusted seropositivity rates (uncorrected for test sensitivity and specificity) of 22.8% in July, 28.7% in August, and 25.1% in September (Appendix Table 1) (3). The July survey found a difference in seropositivity between residents living inside or outside of slum areas (25.3% vs. 19.2%; p<0.001), but the August survey did not (28.9% vs. 28.8%; p = 0.94), and the September survey did not report this information.

Materials and Methods

Study OverviewWe developed a SARS-CoV-2 transmission model to estimate the incidence of infection and changes in the reproduction number (R) after the start of nonpharmaceutical interventions, including lockdowns (Appendix Table 2, Figure 2). We used Bayesian Markov chain Monte Carlo to fit the model to the 3 seroprevalence surveys and the time-series of reported deaths. We estimated the proportion of COVID-19 deaths reported by comparing reported deaths to the number expected based on the age-adjusted infection-fatality ratio (IFR) we used in the model. We used age-specific IFR estimates based on data from 7 countries in Europe; New York, USA; and Brazil (4) to estimate a median age-adjusted 0.39% IFR (95% prediction interval 0.21%–0.85%) for Delhi; median age-adjusted IFR in high-income countries with older populations, such as the United Kingdom, was ≈1%, based on data through June 2020 (5,6). The age-adjusted IFR for Delhi that we used was very similar to the 0.39% obtained using early data from China (6) and 0.40% from a meta-analysis based on data from advanced economies (as defined by membership in the Organization for Economic Cooperation and Development [https://www.oecd.org]) (7).Epidemiologic and Demographic DataWe obtained data on the number of confirmed SARS-CoV-2 cases and deaths reported daily in Delhi beginning March 14, 2020, from COVID19India (8), a volunteer-driven, crowdsourced initiative that collates data from several sources, including the Ministry of Health and Family Welfare. Cases and deaths that occurred before March 14 were reported as cumulative numbers. Because we did not know specifically when these pre–March 14 cases and deaths occurred, we did not use these data for parameter inference. For our model, we used data from the 3 serosurveys conducted in Delhi (3) on dates of sample collection, number of samples tested, seropositivity rate found, and reported estimates of sensitivity and specificity of the assay used in each of the three serosurveys (Appendix Table 1). We used projections of the 2021 population in Delhi from the National Commission on Population (9) and stratified the population by 10-year age groups.Transmission ModelTo model SARS-CoV-2 transmission, we used a susceptible-exposed-infected-recovered (SEIR) deterministic transmission model (Appendix Figure 4).We did not stratify the population by age for the transmission parameters, assuming random mixing by age, meaning that epidemic growth was equivalent in all age groups in the model. We did not account for births or deaths from causes other than COVID-19 because of the model’s short timeframe.Because epidemic growth rate is determined by the reproductive number and the generation time, Tc (i.e., time interval between infection times of an infector-infectee pair) (10), we fixed the generation time to Tc = 6.5 days on the basis of previous observations (11) and estimated the reproduction number. We split the generation time into the mean durations of the preinfectious (dE = 1/ω) and infectious (dI = 1/γ) periods, so that Tc = dE + dI (10); we fixed dE and dI using information on the duration of the incubation period (i.e., time between infection and onset of symptoms) and the fact that infectiousness starts ≈1 day before symptoms start (12–14). Given an ≈5.5-day incubation (i.e., presymptomatic) period (15,16), to give the correct generation time, we assumed a mean duration of the preinfectious period of dE = 5.5 – 1.0 = 4.5 days and a mean duration of the infectious period of dI = 6.5 – 4.5 = 2 days.Disease Progression and Death ModelWe modeled disease progression and death after infection independent of the transmission process (Appendix Figure 4). Because the model has been used for other purposes, it also included transitions to hospitalizations, but these were not relevant for our work and did not affect the results. We used the 5.5-day mean incubation period and a peaked distribution modeled with an Erlang distribution with shape parameter 6 (15). We assumed that one third of infections were asymptomatic, although there is high variability in the observed proportion of asymptomatic infections across studies (17–19).We separately tracked the proportion of total infections leading to hospitalization and those leading to death; those hospitalized who eventually died were represented in both groups. We age-adjusted the proportion of infections leading to hospitalization with versus without critical care using demographics from Delhi and age-stratified estimates from China (6). That is, we computed a weighted average of the age-stratified estimates, assigning weights by the share of the corresponding age classes. We based the proportion of infections leading to death on estimates of age-stratified IFR (4) applied to the population of Delhi.We set average time from symptom onset to hospitalization as 5.8 days, consistent with observations in China (20). For hospitalization without critical care, we assumed a mean 9.8-day stay; if critical care was required, we assumed 9.8 days in critical care, followed by 3.3 days recovery outside of critical care, based on early estimates from the United Kingdom (21). The average time from symptom onset to death was ≈16 days (6). Using these estimates, we assumed a 10-day mean for time between hospitalization and death. These values might differ for India, but no domestic data were available at that time.Parameter InferenceWe fitted the transmission model to both the seroprevalence data and reported daily COVID-19 deaths (Appendix). We allowed the reproduction number to change at 5 different time points corresponding to changes in interventions (Appendix Table 2). Denoting the basic reproduction number during the first infection period (i.e., before any changes) as R0, the reproduction number after i number of changes as Ri (i in periods 1–5), we computed parameters of Ri as Ri = R0 × (1 + r1) ×…× (1 + ri), where r1,…,r5 measured the relative change in the reproduction number from one period to the previous one.We estimated R0 and the subsequent changes at each time point, r1,…,r5, the initial number of infected [E(0) + I(0)], the reporting, θ, and overdispersion of deaths, k. We assumed February 19, 2020 (28 days before the first 10 cases were reported), as the starting time (t0) for the simulations and estimated the number of infected persons at that time point, [E(0) + I(0)]. To prevent parameter estimates being biased by the earliest phase of the epidemic, when underreporting of deaths might have been greatest, we computed likelihood using data collected from March 29, 2020, when the first COVID-19 death was reported, through September 30, 2020, the end of the 6-month study period.We could not estimate a change in transmission at the first time point (r1), corresponding to the start of the lockdown on March 25, because no deaths were reported during March 15–28; we therefore assumed r1 = 0. We assumed May 4, when the first lockdown relaxations were introduced, as the next time point for change in the reproduction number (r2). Therefore, estimates of the reproduction number during February 19–May 4, 2020, from the beginning of the simulations through r2, implicitly accounted for any effects of the lockdown during that time. Because R0 was highly correlated with the initial number of infected, we estimated the total number of infections just before r2 and back-calculated the initial number of infected persons using a simple exponential growth model to define the relationship between R0 and the epidemic growth rate for a SEIR model (9,22). We performed 100,000 iterations using Markov chain Monte Carlo in the lazymcmc software package (23) and uniform prior distributions to estimate model parameters; we ran 4 chains with different starting values to check convergence. We performed all analyses using R version 4.0.2 (24).

Materials and Methods

We developed a SARS-CoV-2 transmission model to estimate the incidence of infection and changes in the reproduction number (R) after the start of nonpharmaceutical interventions, including lockdowns (Appendix Table 2, Figure 2). We used Bayesian Markov chain Monte Carlo to fit the model to the 3 seroprevalence surveys and the time-series of reported deaths. We estimated the proportion of COVID-19 deaths reported by comparing reported deaths to the number expected based on the age-adjusted infection-fatality ratio (IFR) we used in the model. We used age-specific IFR estimates based on data from 7 countries in Europe; New York, USA; and Brazil (4) to estimate a median age-adjusted 0.39% IFR (95% prediction interval 0.21%–0.85%) for Delhi; median age-adjusted IFR in high-income countries with older populations, such as the United Kingdom, was ≈1%, based on data through June 2020 (5,6). The age-adjusted IFR for Delhi that we used was very similar to the 0.39% obtained using early data from China (6) and 0.40% from a meta-analysis based on data from advanced economies (as defined by membership in the Organization for Economic Cooperation and Development [https://www.oecd.org]) (7).

We obtained data on the number of confirmed SARS-CoV-2 cases and deaths reported daily in Delhi beginning March 14, 2020, from COVID19India (8), a volunteer-driven, crowdsourced initiative that collates data from several sources, including the Ministry of Health and Family Welfare. Cases and deaths that occurred before March 14 were reported as cumulative numbers. Because we did not know specifically when these pre–March 14 cases and deaths occurred, we did not use these data for parameter inference. For our model, we used data from the 3 serosurveys conducted in Delhi (3) on dates of sample collection, number of samples tested, seropositivity rate found, and reported estimates of sensitivity and specificity of the assay used in each of the three serosurveys (Appendix Table 1). We used projections of the 2021 population in Delhi from the National Commission on Population (9) and stratified the population by 10-year age groups.

To model SARS-CoV-2 transmission, we used a susceptible-exposed-infected-recovered (SEIR) deterministic transmission model (Appendix Figure 4).We did not stratify the population by age for the transmission parameters, assuming random mixing by age, meaning that epidemic growth was equivalent in all age groups in the model. We did not account for births or deaths from causes other than COVID-19 because of the model’s short timeframe.

Because epidemic growth rate is determined by the reproductive number and the generation time, Tc (i.e., time interval between infection times of an infector-infectee pair) (10), we fixed the generation time to Tc = 6.5 days on the basis of previous observations (11) and estimated the reproduction number. We split the generation time into the mean durations of the preinfectious (dE = 1/ω) and infectious (dI = 1/γ) periods, so that Tc = dE + dI (10); we fixed dE and dI using information on the duration of the incubation period (i.e., time between infection and onset of symptoms) and the fact that infectiousness starts ≈1 day before symptoms start (12–14). Given an ≈5.5-day incubation (i.e., presymptomatic) period (15,16), to give the correct generation time, we assumed a mean duration of the preinfectious period of dE = 5.5 – 1.0 = 4.5 days and a mean duration of the infectious period of dI = 6.5 – 4.5 = 2 days.

We modeled disease progression and death after infection independent of the transmission process (Appendix Figure 4). Because the model has been used for other purposes, it also included transitions to hospitalizations, but these were not relevant for our work and did not affect the results. We used the 5.5-day mean incubation period and a peaked distribution modeled with an Erlang distribution with shape parameter 6 (15). We assumed that one third of infections were asymptomatic, although there is high variability in the observed proportion of asymptomatic infections across studies (17–19).

We separately tracked the proportion of total infections leading to hospitalization and those leading to death; those hospitalized who eventually died were represented in both groups. We age-adjusted the proportion of infections leading to hospitalization with versus without critical care using demographics from Delhi and age-stratified estimates from China (6). That is, we computed a weighted average of the age-stratified estimates, assigning weights by the share of the corresponding age classes. We based the proportion of infections leading to death on estimates of age-stratified IFR (4) applied to the population of Delhi.

We set average time from symptom onset to hospitalization as 5.8 days, consistent with observations in China (20). For hospitalization without critical care, we assumed a mean 9.8-day stay; if critical care was required, we assumed 9.8 days in critical care, followed by 3.3 days recovery outside of critical care, based on early estimates from the United Kingdom (21). The average time from symptom onset to death was ≈16 days (6). Using these estimates, we assumed a 10-day mean for time between hospitalization and death. These values might differ for India, but no domestic data were available at that time.

We fitted the transmission model to both the seroprevalence data and reported daily COVID-19 deaths (Appendix). We allowed the reproduction number to change at 5 different time points corresponding to changes in interventions (Appendix Table 2). Denoting the basic reproduction number during the first infection period (i.e., before any changes) as R0, the reproduction number after i number of changes as Ri (i in periods 1–5), we computed parameters of Ri as Ri = R0 × (1 + r1) ×…× (1 + ri), where r1,…,r5 measured the relative change in the reproduction number from one period to the previous one.

We estimated R0 and the subsequent changes at each time point, r1,…,r5, the initial number of infected [E(0) + I(0)], the reporting, θ, and overdispersion of deaths, k. We assumed February 19, 2020 (28 days before the first 10 cases were reported), as the starting time (t0) for the simulations and estimated the number of infected persons at that time point, [E(0) + I(0)]. To prevent parameter estimates being biased by the earliest phase of the epidemic, when underreporting of deaths might have been greatest, we computed likelihood using data collected from March 29, 2020, when the first COVID-19 death was reported, through September 30, 2020, the end of the 6-month study period.

We could not estimate a change in transmission at the first time point (r1), corresponding to the start of the lockdown on March 25, because no deaths were reported during March 15–28; we therefore assumed r1 = 0. We assumed May 4, when the first lockdown relaxations were introduced, as the next time point for change in the reproduction number (r2). Therefore, estimates of the reproduction number during February 19–May 4, 2020, from the beginning of the simulations through r2, implicitly accounted for any effects of the lockdown during that time. Because R0 was highly correlated with the initial number of infected, we estimated the total number of infections just before r2 and back-calculated the initial number of infected persons using a simple exponential growth model to define the relationship between R0 and the epidemic growth rate for a SEIR model (9,22). We performed 100,000 iterations using Markov chain Monte Carlo in the lazymcmc software package (23) and uniform prior distributions to estimate model parameters; we ran 4 chains with different starting values to check convergence. We performed all analyses using R version 4.0.2 (24).

Results

Figure 2 Figure 3 Figure 3. Effective reproduction number, Reff, for model of coronavirus disease deaths and severe acute respiratory syndrome coronavirus 2 attack rate, Delhi, India, March 15–September 30, 2020. A) Median and…

Our model fit the data well for both the death time-series (Figure 2, panel A) and seroprevalence survey data (Figure 2, panel B), except for the last serosurvey, in which we estimated an increase in seropositivity from the previous survey, instead of a slight decrease. This difference might have been because the observation model did not account for waning antibodies and the possibility of seroreversion. However, the third serosurvey used a different testing kit, which might also have contributed to this difference. We estimated that the first peak in infection incidence was reached on May 31, at a median of 294,930 (95% credible interval [CrI] 143,271–440,702) new infections per day (Appendix Figure 5). Incidence at the second peak, reached on September 17, was lower, at a median of 79,032 (95% CrI 40,484–109,140) new infections per day. Assuming that changes in transmission occurred beginning at the times of each change in interventions and accounting for the reduction in susceptible persons, we estimated that the effective reproduction number, Reff, increased with the first relaxation of the lockdown introduced May 4 (beginning of phase 3); in June and July, during the first 2 reopening phases, Reff was 1 (Figure 3, panel A), resulting in a median infection attack rate of 48.7% (95% CrI 22.1%–76.8%) by the end of September. After that, Delhi experienced a large third wave of cases and deaths (Figure 1), suggesting that even with approximately half the population having been infected, the herd immunity threshold had not yet been reached at that time. Of interest, a serosurvey conducted in January 2021 found a sex- and age-adjusted seroprevalence of 56.1%, probably indicating a steep increase in the cumulative number of infections, reflecting the effects of this third wave of transmission.

Figure 4 Figure 4. Estimated reporting of coronavirus disease deaths, Delhi, India, March 15–September 30, 2020. Violin plots show the posterior distribution of the estimate of death reporting for 3 different values for the…

Using a 0.39% age-adjusted IFR, we estimated reported deaths to be 15.0% (95% CrI 9.3%–34.0%) of actual deaths (Figure 4; Appendix Figure 6). Repeating the analysis using an age-adjusted IFR of 0.21%, corresponding to the lower bound of the 95% prediction interval for IFR based mostly on age-specific high-income country (HIC) data (4), increased the proportion of reported deaths to 28% (95% CrI 18–59%) of actual deaths (Figure 4).

Figure 5 Figure 5. Reported cases of symptomatic severe acute respiratory syndrome coronavirus 2 infections and estimated actual number of cases, Delhi, India, March 15–September 30, 2020. A) Daily number of newly reported cases…

On the basis of infection incidence determined using our model, we also estimated the probability of detecting COVID-19 cases over time by comparing the number of reported cases to the estimated incidence of symptomatic infections (Figure 5, panel A). The probability of detecting infection quickly increased over the last weeks of March, fluctuated until mid-June, then remained relatively consistent through the end of September; a median of 7.1% of all symptomatic infections was detected during July 1–September 30, 2020 (Figure 5, panel B).

Discussion

The low proportion of reported deaths relative to actual deaths we found is consistent with findings from other cities in India, where seroprevalence surveys suggested substantially greater exposure to infection than predicted on the basis of reported COVID-19 deaths. For example, comparing seroprevalence during the first half of July 2020 in Mumbai (25) with cumulative deaths at that time suggested that only 21% of deaths were reported (Appendix Table 3). Similarly, a large-scale prospective, active-surveillance study conducted in the district of Madurai, Tamil Nadu, India, during the first wave of COVID-19 in summer 2020 found that only 11.0% of deaths were reported, compared with expected deaths based on IFR estimates from other settings (26). This high level of underreporting might reflect incomplete or delayed reporting of deaths and a failure to report COVID-19 as a suspected or confirmed cause of death, particularly in the absence of a SARS-CoV-2 test result.The extent of underreporting might also reflect our use of an age-specific IFR for India derived from mostly HIC data. Age-specific IFR may be lower in India for several reasons. First, the prevalence of underlying medical conditions that increase the risk for severe COVID-19 after infection is somewhat lower in India than in the countries that informed the age-specific IFR estimates for our model (Appendix Figure 7) (27). However, correcting the Delhi IFR to account for the lower prevalence of underlying conditions only marginally reduced the age-adjusted IFR (50% in slum areas for the first half of July 2020 (25), suggesting that infection spread very quickly over the first few months of the epidemic in certain pockets. However, seroprevalence rates <20% in non–slum areas showed that the epidemic was spatially highly heterogeneous. Understanding what brought the number of cases down after the first wave in different parts of India and how to interpret the serosurvey results related to building population immunity are key to understanding and predicting the dynamics of subsequent waves of COVID-19.The SARS-CoV-2 Delta variant emerged in Maharashtra in late 2020 and spread across India during the first few months of 2021, replacing other variants. In vitro data characterizing the Delta variant found that it was less sensitive to serum neutralizing antibodies from persons previously infected with other variants and that it also had higher replication efficiency (31). These findings suggest that the predominance of the Delta variant in the upsurge of SARS-CoV-2 cases seen in India during April and May 2021 resulted from either immune escape in previously infected persons, increased transmissibility, or both. These mechanisms, together with possible waning of population immunity over time, likely explain the increase in SARS-CoV-2 cases in Delhi, despite the high attack rate that we estimated in September 2020 and the high reported seroprevalence (≈56% for both) in the round 5 (January 2021) and 6 (April 2021) cross-sectional serosurveys. Analysis of epidemiologic data is needed to disentangle how these mechanisms contributed to the second nationwide epidemic wave.In conclusion, our analysis found reported COVID-19 deaths in India during the first 6 months of the pandemic were well below the number of actual deaths. Underreporting of deaths might reflect incomplete or delayed documentation or failure to report COVID-19 as a cause of death but may also reflect our use of an age-specific IFR.

Discussion

The low proportion of reported deaths relative to actual deaths we found is consistent with findings from other cities in India, where seroprevalence surveys suggested substantially greater exposure to infection than predicted on the basis of reported COVID-19 deaths. For example, comparing seroprevalence during the first half of July 2020 in Mumbai (25) with cumulative deaths at that time suggested that only 21% of deaths were reported (Appendix Table 3). Similarly, a large-scale prospective, active-surveillance study conducted in the district of Madurai, Tamil Nadu, India, during the first wave of COVID-19 in summer 2020 found that only 11.0% of deaths were reported, compared with expected deaths based on IFR estimates from other settings (26). This high level of underreporting might reflect incomplete or delayed reporting of deaths and a failure to report COVID-19 as a suspected or confirmed cause of death, particularly in the absence of a SARS-CoV-2 test result.

The extent of underreporting might also reflect our use of an age-specific IFR for India derived from mostly HIC data. Age-specific IFR may be lower in India for several reasons. First, the prevalence of underlying medical conditions that increase the risk for severe COVID-19 after infection is somewhat lower in India than in the countries that informed the age-specific IFR estimates for our model (Appendix Figure 7) (27). However, correcting the Delhi IFR to account for the lower prevalence of underlying conditions only marginally reduced the age-adjusted IFR (50% in slum areas for the first half of July 2020 (25), suggesting that infection spread very quickly over the first few months of the epidemic in certain pockets. However, seroprevalence rates <20% in non–slum areas showed that the epidemic was spatially highly heterogeneous. Understanding what brought the number of cases down after the first wave in different parts of India and how to interpret the serosurvey results related to building population immunity are key to understanding and predicting the dynamics of subsequent waves of COVID-19.

The SARS-CoV-2 Delta variant emerged in Maharashtra in late 2020 and spread across India during the first few months of 2021, replacing other variants. In vitro data characterizing the Delta variant found that it was less sensitive to serum neutralizing antibodies from persons previously infected with other variants and that it also had higher replication efficiency (31). These findings suggest that the predominance of the Delta variant in the upsurge of SARS-CoV-2 cases seen in India during April and May 2021 resulted from either immune escape in previously infected persons, increased transmissibility, or both. These mechanisms, together with possible waning of population immunity over time, likely explain the increase in SARS-CoV-2 cases in Delhi, despite the high attack rate that we estimated in September 2020 and the high reported seroprevalence (≈56% for both) in the round 5 (January 2021) and 6 (April 2021) cross-sectional serosurveys. Analysis of epidemiologic data is needed to disentangle how these mechanisms contributed to the second nationwide epidemic wave.

In conclusion, our analysis found reported COVID-19 deaths in India during the first 6 months of the pandemic were well below the number of actual deaths. Underreporting of deaths might reflect incomplete or delayed documentation or failure to report COVID-19 as a cause of death but may also reflect our use of an age-specific IFR. Top

M.P.-S. is a Sir Henry Dale fellow, a program jointly funded by the Wellcome Trust and the Royal Society (grant number 216427/Z/19/Z). M.P.-S., O.J.W., N.F.B., R.V., and N.C.G. acknowledge funding from the MRC Centre for Global Infectious Disease Analysis (MR/R015600/1), which is jointly funded by the U.K. Medical Research Council (MRC) and U.K. Foreign, Commonwealth & Development Office (FCDO), under the MRC/FCDO Concordat agreement, and is also part of the EDCTP2 programme supported by the European Union. M.P.-S., O.J.W., N.F.B., R.V., and N.C.G. also acknowledge funding from Community Jameel.

The conclusions, findings, and opinions expressed by authors contributing to this journal do not necessarily reflect the official position of the U.S. Department of Health and Human Services, the Public Health Service, the Centers for Disease Control and Prevention, or the authors’ affiliated institutions. Use of trade names is for identification only and does not imply endorsement by any of the groups named above. Source : CDC Emerging Infectious Diseases Journal

For More News And Analysis About Malawi Follow Africa-Press

LEAVE A REPLY

Please enter your comment!
Please enter your name here