For each risk factor, we systematically searched for published studies, household surveys, censuses, administrative data, ground monitor data, or remote sensing data that could inform estimates of risk exposure. To estimate mean levels of exposure by age-sex-location-year, specific methods varied across risk factors (appendix 1 sections 2.1, 4). For many risk factors, exposure data were modelled using either spatiotemporal Gaussian process regression or DisMod-MR 2.1,17 (link), 18 (link) which are Bayesian statistical models developed over the past 12 years for GBD analyses. For most risk factors, the distribution of exposure across individuals was estimated by modelling a measure of dispersion, usually the SD, and fitting an ensemble of parametric distributions to the predicted mean and SD. Ensemble distributions for each risk were estimated based on individual-level data. Details for each risk factor modelling for mean, SD, and ensemble distribution are available in appendix 1 (section 4). Because of the strong dependency between birthweight and gestational age, exposure for these risks was modelled as a joint distribution using the copula method.19
In many cases, exposure data were available for the reference method of ascertainment and for alternative methods, such as tobacco surveys reporting daily smoking versus total smoking; in these cases, we estimated the statistical relationship between the reference and alternative methods of ascertainment using network meta-regression and corrected the alternative data using this relationship.
Free full text: Click here