Plasma 25(OH)D measurements were available from men and women who served as controls in previous nested case-control studies of chronic diseases. None of the participants had a history of cancer at the time of blood draw. For each cohort, we selected two independent samples: a “training” sample was used to develop the 25(OH)D prediction model and a “test” sample served as a validation dataset. Training samples comprised controls from all completed and on-going nested case-control studies with 25(OH)D assay results when analyses began. Test samples were drawn from more recently established nested case-control studies as this project unfolded and additional plasma 25(OH)D assay results became available. Prior to exclusions for missing data, the training sets consisted of 2246 women in NHS, 1646 women in NHSII, and 1255 men in HPFS. An additional 818 women in NHS, 479 women in NHSII, and 841 men in HPFS were available for the test sets.
In 2000 and 2001, all women in NHS who gave blood in 1989–1990 and were alive were invited to provide a second blood sample. Of the 18 473 women who participated in the second blood collection, 443 women with no history of cancer had measured 25(OH)D available at both time points. These samples were used to assess within-person variability of plasma 25(OH)D concentrations over 10–11 years.
This study was approved by the institutional review boards of the Harvard School of Public Health and Brigham and Women's Hospital. All participants gave written informed consent at enrollment.