We developed an age-structured SIRS model to describe the transmission dynamics of RSV (
Fig. 2A). The model assumes individuals are born with protective maternal immunity (
M), which wanes exponentially (with a mean duration of 3–4 months) [65] (
link), leaving the infant susceptible to infection (
Sn, where
n is the number of previous infections). Following infection with RSV, individuals develop partial immunity, which reduces the rate of subsequent infection and the duration and relative infectiousness of such infections, consistent with epidemiological studies and previous models of RSV transmission [26] (
link), [27] (
link), [29] (
link). We assume a progressive build-up of immunity following one, two, and three or more previous infections (
In) [37] (
link), [66] (
link)–[68] (
link). Both age and number of previous infections were assumed to influence the risk of developing severe lower respiratory disease (
D) possibly requiring hospitalization [37] (
link), [66] (
link), [69] (
link). We parameterized the model based on data from cohort studies conducted in the US and Kenya (
Table 2) [37] (
link), . Transmission-relevant contact patterns were assumed to be frequency-dependent and were parameterized based on self-reported data on the number and age of conversational partners from one European study [54] (
link), [55] (
link); no such study has been conducted among a widely representative cohort in the US.
We initially fit our model to the age-stratified hospitalization data from all nine states with complete data from 1989–2009 in order to estimate the mean transmission rate, relative infectiousness of first and second versus subsequent infections, seasonality parameters, and reporting fraction (i.e. proportion of individuals with severe lower respiratory tract disease who are hospitalized, coded as RSV, and reported in our dataset), which are key unknown parameters. We then fixed the relative infectiousness and fit the model to the hospitalization data from each of the nine states plus Florida individually, using the other estimated parameters from the cumulative data fit as our starting conditions to estimate state-specific transmission rates, seasonality parameters, and reporting fractions. For each fit, we seeded the model with one infectious individual in each age group and used a burn-in period of 40 or 41 years, examining the fit using both even- and odd-year burn-in periods to allow for the biennial pattern of epidemics present in some states, and selected the best-fitting model for each state. We also explored longer burn-in periods and examined the model output to ensure that the equilibrium quasi-steady state had been reached.
Seasonality in the instantaneous rate of transmission of RSV was modeled using sinusoidal seasonal forcing with a period of 1 year (52.18 weeks) as follows: , where
β0 is the baseline transmission rate,
b is the amplitude of seasonality, and
φ is a seasonal offset parameter (a measure of timing of peak transmissibility), and
t is the time (in years) [31] (
link), [32] (
link). We constrained
φ to be between −0.5 and 0.5, where
φ = 0 represents January 1 and
φ = −0.5 and
φ = 0.5 both represent July 1. These parameters were estimated by fitting our model to the state-specific data.
We used maximum likelihood to determine the best-fitting models. For each set of parameters, the likelihood of the data given the model was calculated by assuming the number of hospitalizations in each age class (
a) during each week (
w),
xa,w, was Poisson-distributed with a mean equal to the model-predicted number of severe lower respiratory tract infections due to RSV (
Da,w) times the reporting (or hospitalized) fraction (
h), , as has been described previously [27] (
link), [36] (
link). The log-likelihood (log(
L)) of the model was given by the equation:
While this observation model may fail to capture the true variability in the distribution of cases, other observation models (e.g. negative binomial) would require estimating an additional parameter, which we do not feel is justified. We used the “fminsearch” command in MATLAB v7.14 (MathWorks, Natick, MA) to minimize the –log(
L), which employs a direct simplex search method.
Next, we fit the model to the laboratory data on RSV-positive tests from 38 states. The laboratory data did not contain detail on the age of cases; therefore, we could not derive reliable estimates of the baseline transmission rate by fitting our model to these data, since estimates of the baseline transmission rate and reporting fraction are inherently confounded. (The age distribution of cases, pattern of epidemics, and mean incidence rate inform estimates of the baseline transmission rate, while the mean incidence also informs estimates of the reporting fraction.) Instead, we estimated the baseline transmission rate for each state from the relationship we observed between population density and
R0 among the ten states with hospitalization data (
S3 Fig.). We fixed the
R0 for the District of Columbia at the maximum observed
R0 (for New Jersey). We then estimated the amplitude of seasonality, seasonal offset parameter, and reporting fraction by fitting our model to the rescaled laboratory data. We examined the sensitivity of our results to the method we used to correct for changes in testing and reporting effort for RSV over time by also fitting our model to the raw number of RSV-positive tests reported, instead applying the estimated scaling factor to the model output (i.e. dividing the model output by the scaling factor for each week). The log-likelihoods of the fitted models were similar (
S5 Table), and the results were qualitatively the same (
S8 Table).
We examined the correlation between the estimated model parameters for each state and the significant climatic variables from the univariate statistical analyses. We calculated the Pearson's correlation coefficient and associated
p-value for each state-specific parameter estimate and climatic variable of interest. We also examined the ability of the model to capture the biennial pattern of RSV epidemics present in some states by comparing the strength of the biennial cycle in the observed and predicted RSV time series. The strength of the biennial cycle was calculated as the ratio of the biennial to annual Fourier amplitude [73] , [74] . Finally, we examined whether monthly deviations from average climatic conditions could help explain the difference between observed and predicted monthly RSV activity across states.
Pitzer V.E., Viboud C., Alonso W.J., Wilcox T., Metcalf C.J., Steiner C.A., Haynes A.K, & Grenfell B.T. (2015). Environmental Drivers of the Spatiotemporal Dynamics of Respiratory Syncytial Virus in the United States. PLoS Pathogens, 11(1), e1004591.