Different subgroups of auto-scaled data were assessed for GDM prediction through five classification ML techniques: logistic regression (LR), linear support vector machine (L-SVM), partial least squares discriminant analysis (PLS-DA), classification and regression tree (CART) and extreme gradient boosting (XGB). Every model was subjected to leave-one-out cross-validation. LR, PLS-DA and XGBoost were executed in
PLS_Toolbox R8.9.2 (Eigenvector Research Inc, USA). L-SVM and CART were implemented by coding in
MATLAB R2021a (The MathWorks Inc, USA).
The classification predictive performance was assessed by the determination of the models sensitivity (Se), specificity (Sp) and non-error rate (NER) in both calibration and cross-validation. These parameters were calculated as follows:
Where TP, FN, TN and FP are the number of true positives, false negatives, true negatives and false positives, respectively.
The area under the receiver operating characteristic curve (AUC) of the final classification models was determined through the software
GraphPad Prism 9.2.0 (GraphPad Software Inc, USA).
Mennickent D., Ortega-Contreras B., Gutiérrez-Vega S., Castro E., Rodríguez A., Araya J, & Guzmán-Gutiérrez E. (2023). Evaluation of first and second trimester maternal thyroid profile on the prediction of gestational diabetes mellitus and post load glycemia. PLOS ONE, 18(1), e0280513.