The NPBDSS was established in 2006, and data collected by the NPBDSS have been included in the official system of the National Bureau of Statistics of China since 2007 [14] (link). This surveillance system covers 64 counties and districts in thirty provinces, municipalities or municipal districts that fall under the central government. This database represents a wide array of geographical locations and socioeconomic status. Details on data collection and quality control of the NPBDSS were described elsewhere [14] (link). In brief, fetus and neonates of 28 gestational weeks or more born to women living in the surveillance areas for at least one year were recruited and followed. The time period for identifying birth defects was from 28 gestation weeks to 42 days after birth, during which major birth defects (i.e., external malformations and chromosomal aberrations coded according to the International Classification of Diseases 10th edition) diagnosed for the first time were required to be reported. Surveillance staffs at the community, township, or village levels were responsible for birth information collection, verification, and follow-up. By comparing the data with related data from other systems like Birth Certification, Perinatal Death Registry, etc., the information on reported cases or births are checked for accuracy and completeness. In addition, annual surveys are conducted to identify and correct errors and inaccuracies in the collected data. It is required that the under-reporting rate of live births or malformations should be no more than 1% and errors or missing values on the report form should also be no more than 1%.
The gestational age at delivery was calculated in completed weeks from the first day of the last menstrual period (LMP). In the surveillance areas, women with suspected pregnancy have an ultrasound examination for confirmation according to obstetric clinical guidelines. For women with irregular menses and/or bleeding during pregnancy as well as those who could not remember the LMP, gestational ages were estimated based on their ultrasound examination. Birth weight of each neonate was measured by a trained midwife within one hour after birth, recorded to the nearest 5 g, and included in hospital delivery records. The data were then abstracted by trained surveillance staff and entered into a web-based reporting system [14] (link).
From October 2006 through September 2010, a total of 1,153,166 live and still births whose gestation age were equal to or greater than 28 weeks were identified by the NPBDSS. Stillbirth was defined as the delivery of a fetus that has died before birth for which there is no possibility of resuscitation.Figure 1 illustrates the records selection process for current study. Stillbirths (n = 5,337, 4.71‰), infants of foreign origin (n = 69), infants from multiple births (n = 19,914, 1.73%), and infants affected by congenital anomalies (n = 17,650, 1.56%), were first excluded from the analysis. Among the rest of 1,112,443 records, 6,608 (0.51%) with missing gestational age or birth weight or gender, and 545 outliers (0.05%) according to previous inclusion criterion [1] (link), were subsequently removed. Finally the procedure proposed by Alexander et al. [1] (link) was adopted to screen records with implausible combinations of gestational age and birth weight. Specifically, gestational age distributions were examined for each 125 g interval of birth weight for preterm infants aged 28–32 weeks. Gestational age values of +/−2.5 standard deviations from the mean were used as cutoffs for implausible records. Under a normal distribution, the cutoffs roughly correspond to the 1st and 99th percentiles. In Alexander et al. [1] (link), manual adjustments of the gestational age “by a week or more” were conducted for certain birth weight intervals. We did not perform such adjustments, due to the infrequent occurrence of abnormal observations. Following this procedure, a total of 7,319 newborns (0.66%) were removed from downstream analysis, yielding a final sample size of 1,105,214 for this study.
For statistical analysis, we first conducted a linear regression analysis and investigated maternal and infant characteristics that might affect birth weight. Since fitting smooth curves on sample quantiles of segmented age groups may demand a large sample size and lose information from nearby groups, we utilized the lambda-mu-sigma (LMS) method for the primary analysis of birth weight for specific gestational ages. The LMS method, which has been used in multiple reference curve studies, adopts a Box-Cox transformation based semiparametric technique and solves penalized likelihood equations. The centiles can be briefly summarized by the L (Box-Cox power), M (median) and S (coefficient variation), which are natural cubic splines with knots at each Tj (gestation week) as described in Cole and Green's paper [15] (link). The aforementioned analysis was achieved using R package VGAM [16] . To evaluate the impact of employing previous percentiles for the current study cohort, we calculated the relative percentual differences for the 10th, 50th and 90th percentiles between our data and those from other references as:
Relative percentual difference = (Otherperc - Chinaperc)/Chinaperc×100. Here, the Chinaperc represents the percentiles calculated from our study, while Otherperc denotes the percentiles published previously.
The gestational age at delivery was calculated in completed weeks from the first day of the last menstrual period (LMP). In the surveillance areas, women with suspected pregnancy have an ultrasound examination for confirmation according to obstetric clinical guidelines. For women with irregular menses and/or bleeding during pregnancy as well as those who could not remember the LMP, gestational ages were estimated based on their ultrasound examination. Birth weight of each neonate was measured by a trained midwife within one hour after birth, recorded to the nearest 5 g, and included in hospital delivery records. The data were then abstracted by trained surveillance staff and entered into a web-based reporting system [14] (link).
From October 2006 through September 2010, a total of 1,153,166 live and still births whose gestation age were equal to or greater than 28 weeks were identified by the NPBDSS. Stillbirth was defined as the delivery of a fetus that has died before birth for which there is no possibility of resuscitation.
For statistical analysis, we first conducted a linear regression analysis and investigated maternal and infant characteristics that might affect birth weight. Since fitting smooth curves on sample quantiles of segmented age groups may demand a large sample size and lose information from nearby groups, we utilized the lambda-mu-sigma (LMS) method for the primary analysis of birth weight for specific gestational ages. The LMS method, which has been used in multiple reference curve studies, adopts a Box-Cox transformation based semiparametric technique and solves penalized likelihood equations. The centiles can be briefly summarized by the L (Box-Cox power), M (median) and S (coefficient variation), which are natural cubic splines with knots at each Tj (gestation week) as described in Cole and Green's paper [15] (link). The aforementioned analysis was achieved using R package VGAM [16] . To evaluate the impact of employing previous percentiles for the current study cohort, we calculated the relative percentual differences for the 10th, 50th and 90th percentiles between our data and those from other references as:
Relative percentual difference = (Otherperc - Chinaperc)/Chinaperc×100. Here, the Chinaperc represents the percentiles calculated from our study, while Otherperc denotes the percentiles published previously.
Full text: Click here