Correction: Exploring Predictors of Type 2 Diabetes Within Animal-Sourced and Plant-Based Dietary Patterns with the XGBoost Machine Learning Classifier: NHANES 2013–2016 (Journal of Clinical Medicine, (2025), 14, 2, (458), 10.3390/jcm14020458)

Research output: Contribution to journalComment/debate

Abstract

In the original publication [1], there were mistakes in the Materials and Methods Section and paragraph 7 of the Results Section. In the Materials and Methods Section, the original wording may be interpreted as describing supervised PCA, in which Type 2 Diabetes (T2D) status served as the dependent variable. The authors clarify that, in Section 2.5 (Statistical Analyses), unsupervised principal component analysis (PCA) was conducted to derive new diet–lifestyle features. This clarification is essential to ensure an accurate understanding of the analytic approach, as the use of an unsupervised method indicates that component extraction was independent of T2D outcome status, thereby avoiding potential bias in feature generation. In addition, in the first sentence of paragraph 7 in the Results Section, the reported explained variance from the PCA model may be interpreted as describing variance in the T2D outcome. As the PCA conducted was unsupervised, the explained variance in both locations refers to the proportion of total variance accounted for by the extracted principal components, not variance in T2D status. These corrections do not affect the study’s results, statistical outputs, or conclusions. All analyses and findings remain unchanged. To examine multicollinearity, linear regression was used to derive each predictor’s variance inflation factor (VIF). We used unsupervised principal component analysis (PCA) to analyze diet–lifestyle patterns independently of disease status. Principal components were rotated via direct oblimin (∆ = 0) to maximize interpretability. Regression factor scores from each component were added as new features. The variable pattern loadings on each component were characterized according to the strength of partial correlations for each predictor in the pattern matrix. The PCA resulted in 10 components being extracted, explaining over 75% of the variance (Table 3). Component 1 explained nearly 20% of the variance and was influenced primarily by high unsaturated fatty acids and high total energy intake. Component 2 explained nearly 9% of the variance, with a high factor loading from the female gender. There was an error in the column heading “Median Diff. (%)” in Tables 1 and 2, which implies that the values represent percentage differences between the nutrient intakes of the groups compared. The reported values are, in fact, proportions (decimal form). The column heading will be revised to indicate proportions rather than percentages to ensure accurate representation of the data. The corrected Table 1 and Table 2 appear below. There was an error in the footer for Table 3; the reported explained variance from the PCA model may be interpreted as describing variance in the T2D outcome. As the PCA conducted was unsupervised, the explained variance refers to the proportion of total variance accounted for by the extracted principal components, not variance in T2D status. The correct legend appears below. Pattern matrix for ten principal components, explaining over 75% of the variance in T2D predictors. The table values are regression coefficients that reflect the unique contribution of each variable to the component. Components were rotated via direct oblimin (∆ = 0). Darker colors indicate a stronger coefficient between the predictor and the component. Blue = positive association with the component; red = negative association with the component. There was an error in the Confusion Matrix as published in Figure 3. The labels for the “Positive” and “Negative” classes were inadvertently swapped in the previous version of the figure. The corrected figure now displays the proper class labels for both the True and Predicted axes, accurately representing the performance of the XGBoost classification model. The corrected Figure 3 appears below. The authors state that the scientific conclusions are unaffected. These corrections were approved by the Academic Editor. The original publication has also been updated.

Original languageEnglish
Article number7600
JournalJournal of Clinical Medicine
Volume14
Issue number21
DOIs
StatePublished - Nov 2025

Fingerprint

Dive into the research topics of 'Correction: Exploring Predictors of Type 2 Diabetes Within Animal-Sourced and Plant-Based Dietary Patterns with the XGBoost Machine Learning Classifier: NHANES 2013–2016 (Journal of Clinical Medicine, (2025), 14, 2, (458), 10.3390/jcm14020458)'. Together they form a unique fingerprint.

Cite this