Statistical analysis was performed using SAS 9.2 (Cary, NC, USA). Statistical significance was assessed at the 0.05 level unless otherwise noted. Infection groups were defined by the presence or absence of Ascaris infection and/or malaria infection at each time point, resulting in the following four groups: (1) no infection; (2) infection with malaria only; (3) infection with Ascaris only; and, (4) infection with both Ascaris and malaria (co-infection). Haemoglobin values were analysed as a continuous variable and then categorized into multinomial responses for mild, moderate or severe anaemia.
The categorization of haemoglobin allowed for calculations of odds ratios between infection groups. Univariate analysis was performed to calculate descriptive statistics of the predictors and assess normality of the outcome variables. Due to the highly skewed distribution of parasite intensity in children, the ranks of the data were used in place of the actual values of parasite intensity. The ranks were determined by ordering the data from least to greatest and replacing each observation by its relative position in the order. Assuming no ties, the smallest parasite intensity would receive a rank of 1 while the largest intensity would receive a value of n (the number of observations). The demographics from each infection group were compared at baseline to describe the overall sample of children. Comparisons between the four groups were made using chi-squared tests, analysis of variances models, or Kruskal-Wallis tests. The Tukey-Kramer multiple comparison procedure was used to determine the significance of pair-wise comparisons.
Parasite intensity and haemoglobin levels were analysed using repeated measures linear mixed models using the MIXED procedure in SAS to account for unbalanced data and multiple observations per child. An infection group-by-time variable was initially included in the haemoglobin model but was removed after it failed to retain significance. Similarly, an anaemia severity group-by-time variable was included in the parasite intensity model but was not retained. The original study treated children with albenzadole or placebo according to the study design; however this factor was removed from the model because it failed to show an association with anaemia severity and did not significantly change the estimates in the model. Means and standard deviations were used to describe haemoglobin levels within each group. Medians and ranges were used to describe the parasite intensities within each anaemia severity group.
The effect of infection group on anaemia severity was analysed with generalized linear mixed models using a cumulative logit function with the assumption that the data are from a multinomial distribution. The link function allows for the ordinality of the outcome variable anaemia severity. Thus the parameter estimates represent the log odds of being anaemic vs normal and the estimates have been transformed to odds ratios for ease of interpretation. A random effect for each individual was added to the model to account for the repeated measures study design. The effect of sex, age, and SES were also included in the model. Since hookworm infection is known to cause anaemia [22 (link)], the model adjusted for the effect of hookworm in our initial model; however, this was dropped because this factor conferred little to no change in the odds ratio estimates.
The categorization of haemoglobin allowed for calculations of odds ratios between infection groups. Univariate analysis was performed to calculate descriptive statistics of the predictors and assess normality of the outcome variables. Due to the highly skewed distribution of parasite intensity in children, the ranks of the data were used in place of the actual values of parasite intensity. The ranks were determined by ordering the data from least to greatest and replacing each observation by its relative position in the order. Assuming no ties, the smallest parasite intensity would receive a rank of 1 while the largest intensity would receive a value of n (the number of observations). The demographics from each infection group were compared at baseline to describe the overall sample of children. Comparisons between the four groups were made using chi-squared tests, analysis of variances models, or Kruskal-Wallis tests. The Tukey-Kramer multiple comparison procedure was used to determine the significance of pair-wise comparisons.
Parasite intensity and haemoglobin levels were analysed using repeated measures linear mixed models using the MIXED procedure in SAS to account for unbalanced data and multiple observations per child. An infection group-by-time variable was initially included in the haemoglobin model but was removed after it failed to retain significance. Similarly, an anaemia severity group-by-time variable was included in the parasite intensity model but was not retained. The original study treated children with albenzadole or placebo according to the study design; however this factor was removed from the model because it failed to show an association with anaemia severity and did not significantly change the estimates in the model. Means and standard deviations were used to describe haemoglobin levels within each group. Medians and ranges were used to describe the parasite intensities within each anaemia severity group.
The effect of infection group on anaemia severity was analysed with generalized linear mixed models using a cumulative logit function with the assumption that the data are from a multinomial distribution. The link function allows for the ordinality of the outcome variable anaemia severity. Thus the parameter estimates represent the log odds of being anaemic vs normal and the estimates have been transformed to odds ratios for ease of interpretation. A random effect for each individual was added to the model to account for the repeated measures study design. The effect of sex, age, and SES were also included in the model. Since hookworm infection is known to cause anaemia [22 (link)], the model adjusted for the effect of hookworm in our initial model; however, this was dropped because this factor conferred little to no change in the odds ratio estimates.