Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Machine learning prediction of early postpartum prediabetes in women with gestational diabetes mellitus

View ORCID ProfileDurga Parkhi, View ORCID ProfileNishanthi Periyathambi, View ORCID ProfileYonas Weldeselassie, View ORCID ProfileVinod Patel, View ORCID ProfileNithya Sukumar, View ORCID ProfileRahul Siddharthan, View ORCID ProfileLeelavati Narlikar, View ORCID ProfileSaravanan Ponnusamy
doi: https://doi.org/10.1101/2023.02.16.23286016
Durga Parkhi
1Populations, Evidence, and Technologies, Division of Health Sciences, University of Warwick, Coventry, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Durga Parkhi
Nishanthi Periyathambi
1Populations, Evidence, and Technologies, Division of Health Sciences, University of Warwick, Coventry, UK
2Department of Diabetes, Endocrinology, and Metabolism, George Eliot Hospital, Nuneaton, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Nishanthi Periyathambi
Yonas Weldeselassie
1Populations, Evidence, and Technologies, Division of Health Sciences, University of Warwick, Coventry, UK
3School of Mathematics and Statistics, The Open University, Milton Keynes, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Yonas Weldeselassie
Vinod Patel
2Department of Diabetes, Endocrinology, and Metabolism, George Eliot Hospital, Nuneaton, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Vinod Patel
Nithya Sukumar
1Populations, Evidence, and Technologies, Division of Health Sciences, University of Warwick, Coventry, UK
2Department of Diabetes, Endocrinology, and Metabolism, George Eliot Hospital, Nuneaton, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Nithya Sukumar
Rahul Siddharthan
4Department of Computational Biology, The Institute of Mathematical Sciences, Chennai, India
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Rahul Siddharthan
Leelavati Narlikar
5Department of Data Science, Indian Institute of Science Education and Research, Pune, India
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Leelavati Narlikar
Saravanan Ponnusamy
1Populations, Evidence, and Technologies, Division of Health Sciences, University of Warwick, Coventry, UK
2Department of Diabetes, Endocrinology, and Metabolism, George Eliot Hospital, Nuneaton, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Saravanan Ponnusamy
  • For correspondence: p.saravanan{at}warwick.ac.uk
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Background Early onset of type 2 diabetes and cardiovascular disease are common complications for women diagnosed with gestational diabetes. About half of the women with gestational diabetes develop postpartum prediabetes within 10 years of the index pregnancy. These women also have double the risk of developing cardiovascular disease than women without a history of gestational diabetes. Currently, there is no accurate way of knowing which women with gestational diabetes are likely to develop postpartum prediabetes. This study aims to predict the risk of postpartum prediabetes in women diagnosed with gestational diabetes.

Methods We build a sparse logistic regression-based machine learning model to learn key variables significant for the prediction of postpartum prediabetes, from antenatal data with maternal anthropometric and biochemical variables as well as neonatal characteristics of 607 UK women diagnosed with gestational diabetes. We evaluate the performance of the proposed model in addition to other more advanced machine learning methods using established metrics such as the area under the receiver operating characteristic curve and specificity for pre-determined values of sensitivity. We use K-L divergence and information graphs to evaluate and compare different thresholds of classification for targeted screening options in resource-constrained settings. We also perform a decision curve analysis to study the net standardized benefit of our model compared to the universal screening approach.

Results Strikingly, our sparse logistic regression approach selects only two variables as relevant but gives an area under the receiver operating characteristic curve of 0.72, outperforming all other methods. It can identify postpartum prediabetes in women with gestational diabetes using the Rule-in test with 92% specificity at an optimal probability threshold of 0.381 and using the Rule-out test with 92% sensitivity at an optimal probability threshold of 0.140.

Conclusion We propose a simple logistic regression model, which needs only the antenatal fasting glucose at OGTT and HbA1c soon after the diagnosis of GDM, to predict, with remarkable accuracy, the probability of postpartum prediabetes in women with gestational diabetes. We envision this to be a practical solution, which coupled with a targeted follow-up of high-risk women, could yield better cardiometabolic outcomes in women with a history of GDM.

Background

Gestational Diabetes Mellitus (GDM) is defined as any degree of prediabetes with onset or first recognition during pregnancy. Women diagnosed with GDM have upto 10-fold higher risk of Type 2 Diabetes Mellitus (T2DM) compared to those without GDM1 and their lifetime risk is around 60% for developing T2DM.2 In addition to T2DM, GDM women have a twofold higher risk of Cardiovascular Disease (CVD), at a younger age, and independent of intercurrent T2DM.3–6 GDM is associated with an increased risk of cardiovascular dysfunction, including rise in cardiovascular risk factors like blood pressure, and adverse changes in cholesterol and triglycerides.7 However, this risk is not the same for all women diagnosed with GDM.

There is some evidence that glucose levels during pregnancy are predictive of prediabetes.8, 9 Retnakaran et al.10 have shown that the risk of dysglycaemia at 12 weeks postpartum increases across the groups from normal Glucose Challenge Test (GCT) and Normal Glucose Tolerance (NGT), to abnormal GCT and NGT, to Gestational Impaired Glucose Tolerance (GIGT), to GDM. This has been supported by other studies.11, 12 Higher fasting glucose shows a high tendency of conversion to T2DM in the postpartum period7, 13 and antenatal fasting glucose > 5.7 mmol/L is considered to be an important antenatal variable for the prediction of postpartum abnormal glucose metabolism.14

Along with glucose values in pregnancy, many studies have proposed the significance of gestational age at the time of diagnosis of GDM, in predicting postpartum prediabetes.15, 16 Specifically, women diagnosed at 24 weeks of gestation or earlier, are at higher risk of having postpartum prediabetes.17 Similarly, the requirement of insulin therapy during pregnancy, ethnicity, gravidity, BMI, weight at the time of delivery, and neonatal weight, are other factors that have been shown to be associated with the risk of prediabetes.18 While there is ample evidence of multiple factors being associated with T2DM onset in GDM-diagnosed women in general, there is no personalized risk score that can predict whether a specific GDM-diagnosed woman is likely to develop prediabetes/T2DM. Indeed, identifying women who are especially at high risk can help in implementing targeted, personalized interventions to delay and prevent the onset of T2DM and its future complications.

Artificial Intelligence has begun to play a dominant role in healthcare, facilitating optimal decision-making as well as personalized treatment. However, its use in the development of predictive models for T2DM onset is still in its nascent stages. Accurate prediabetes risk stratification at or before delivery for GDM women could assist policymakers and clinicians in specifically targeting those at the highest risk, especially in resource-constrained settings. The primary aim of this paper is to investigate the predictive ability of the antenatal variables and derive a model for personalized prediction of prediabetes. We explored the use of logistic regression (LR) and tree-based machine learning algorithms for developing the prognostic model. We report our findings on a multi-ethnic retrospective cohort in the UK. We show that a simple logistic regression model with antenatal fasting glucose and antenatal HbA1c predicts prediabetes in GDM women with a sensitivity of 92% for the Rule-in test and specificity of 92% for the Rule-out test.

Methods

Data Acquisition

A retrospective audit of electronic database records of postpartum screening at 6 to 13 weeks of women diagnosed with GDM, from January 2016 to December 2019, was conducted at an NHS trust hospital in the UK. GDM was diagnosed using NICE 2015 criteria.19 Complete data is available for 607 women for the following variables: age, height, weight, BMI, systolic and diastolic BP at booking, ethnicity, gravida, parity, smoking status, married status, employment status, gestational age at delivery, mode of delivery, birth weight, breastfeeding status, and biochemical variables such as antenatal fasting glucose (A-FG), antenatal postprandial glucose (A-PG), antenatal HbA1c (A-HbA1c), postpartum fasting glucose (P-FG), postpartum postprandial glucose (P-PG) and postpartum HbA1c (P-HbA1c). Postpartum Oral Glucose Tolerance Test (OGTT) was carried out at 6 weeks and following the change in the NICE guidelines, postpartum HbA1c was carried out at 12-13 weeks following delivery. We define prediabetes as: P-FG ≥ 5.6 mmol/l OR P-PG ≥ 7.8 mmol/l OR P-HbA1c ≥ 40 mmol/mol. ppIFG was defined as P-FG ≥ 5.6 mmol/l and ppIGT was defined as P-PG ≥ 7.8 mmol/l, respectively. We define T2DM as: P-FG ≥ 7.0 mmol/l or P-PG ≥ 11.1 mmol/l or P-HbA1c ≥ 48 mmol/mol.20 Normal Glucose Tolerance (NGT) is considered otherwise.

Statistical Power Analysis

We did a power analysis to determine if the available sample size was sufficient to identify the difference in effect between the normal and prediabetes-diagnosed GDM women. We used the statsmodels library and the TTestIndPower class in Python to calculate the power analysis for Student’s t test for independent samples. For a statistical power of 90%, a minimum sample size of 130 (99 normal and 31 prediabetes) is required for the observed effect size calculated using Cohen’s d statistic. We provide the details of power analysis in the Supplementary material.

Machine Learning

We perform Machine Learning (ML) in Python version 3.7. We compare logistic regression with tree-based methods to build the prognostic model for the prediction of early prediabetes in GDM women. These algorithms inherently address the imbalance in the representation for each of the binary classes of prediabetes outcome using the ‘balanced’ parameter. The ‘balanced’ mode uses the values of y to automatically adjust weights inversely proportional to class frequencies in the input data, as the ratio of the total number of samples to the product of the number of classes and the number of occurrences in each class. Mathematically, the class weight is calculated as 1/(2 × fraction of women in the class). We build the tree-based model using a simple decision tree algorithm, whose performance improves using ensemble methods such as bagging and boosting. All these algorithms use hyperparameters that can significantly affect the performance of these methods on an unseen set. We determine the optimal values of these hyperparameters using nested cross-validation. More specifically, we make the entire data undergo leave-one-out cross-validation (CV1) for model evaluation and we perform an internal stratified 4-fold cross-validation (CV2) on the training folds of CV1 for hyperparameter optimization. We impute the missing values with the Multivariate Imputation by Chained Equations (MICE) technique, using the other non-missing covariates. We scale the training data in CV1 using the StandardScaler function and use the saga solver in the logistic regression model. The saga solver is a variant of the stochastic average gradient (sag) solver that also supports the non-smooth L1 penalty, which promotes feature selection. The tree-based algorithms perform feature selection inherently, governed by the optimized hyperparameters in CV2. We perform hyperparameter optimization and model training only on the training folds (n − 1 samples) in CV1, with an independent set (1 sample) exclusively held out for testing. We aggregate the model predictions on each held-out sample across the n training folds of CV1 and plot the Receiver Operating Characteristic (ROC) curve for this aggregated set. We use the area under the ROC curve as a measure of performance. Finally, we apply it in a similar fashion on the full data to obtain the final model for deployment. We provide the details of the different tree-based methods employed in the supplementary materials.

Composite Risk Score calculation

Using the coefficients from the final fitted logistic regression model on the full data, we develop a composite risk scoring system using the best selected antenatal variables to predict the probability of prediabetes in GDM-diagnosed women. We calculate the composite risk score as the probability of class 1 obtained from the logistic regression model. It is given by the expression 1/(1 + e−b), where b = b0 + b1 ·x1 + b2 ·x2 + … + bm ·xm where b0 is the intercept and bm coefficient of mth variable (xm), respectively.

We compute specificity, positive predictive value (PPV), negative predictive value (NPV), accuracy, and the F1 score at five predetermined values of sensitivity (60%, 70%, 75%, 80%, and 90%) for the optimal selected model. We give the definition/formulae for all these in the supplementary section.

Kullback-Leibler (K-L) Divergence and Information Graphs to evaluate and compare diagnostic tests and select optimal cut-point

We use the information theory approach in Lee et al.,21 Samawi et al.,22 and Benish et al.,23 briefly summarized below, to select the optimal probability threshold for accurate prediction of the binary outcome of prediabetes. An important approach followed in medical diagnostics is to predict the ‘Rule-in and Rule-out’ potential of the diagnostic test to safely include the patients in need of treatment and discard those not in need, respectively. At a probability threshold c reported by the ML algorithm, suppose the proportion of the diseased population correctly predicted as diseased is given by g1(c) and that of the non-diseased population correctly predicted as non-diseased is given by g2(c). Both g1(c) and g2(c) are Bernoulli probability distributions and are simply the sensitivity and specificity, respectively at the threshold value of c. The K-L divergence (or relative entropy) measures the separation between these two probability distributions and is given by: Embedded Image Embedded Image By definition, D(g1‖g2) ≥ 0, D(g2‖g1) ≥ 0. The KL divergence is close to 0 when there is little difference between the two distributions. A high D(g1‖g2) value indicates the increase in information of predicting disease onset. We calculate D(g1‖g2) and D(g2‖g1) for 1000 cut points at an interval of 0.001 from 0 to 1. We chose Tin with cut-point cin corresponding to Dmax(g1‖g2) as the diagnostic test with greatest rule-in potential. We chose Tout with cut-point cout corresponding to Dmax(g2‖g1) as the diagnostic test with greatest rule-out potential. We calculate Embedded Image, which is the ratio of post-test odds to the pre-test odds of having the disease for a randomly selected diseased individual. We also calculate Embedded Image, which is the ratio of pre-test odds to the post-test odds of having the disease for a randomly selected non-diseased individual. Pin, Pout ≥ 1.

Next, we calculate the Information Distinguishability measure, Embedded Image and Embedded Image, to study and compare the separation provided by the diagnostic test between the diseased and the non-diseased distributions. We calculate the objective function TKLdiscrete(c) = D(g1‖g2) + D(g2‖g1) and chose the optimal cut-point cin−out corresponding to max(TKLdiscrete(c)) to achieve maximum information for Tin−out with high potential in both rule-in and rule-out situations. Further, we plot information graphs to characterize and compare the performance of our diagnostic tests at different cut-points depending upon the rule-in or rule-out potential. The expected value of the relative entropy provides a measure of the expected diagnostic information and plotting it as a function of the pre-test probabilities yields an information graph. The equations used to plot the information graphs are given as follows: Let Di be the true status and Ti be the diagnostic test result for the patient, respectively, (i = {0, 1}, 0: disease absent, & 1: disease present). If x = Pr(D1), then the diagnostic information obtained from a +ve, and -ve test result (I+(x), I−(x), respectively) and the expected diagnostic information (IE(x)) are given as follows. Embedded Image Embedded Image Embedded Image Embedded Image Embedded Image In addition, we also plot the information graph by representing the total K-L divergence as the discrete Bregman divergence, which is the sum of the vertical distances between the negative Shannon entropy function (see Supplementary material for details) and tangents to it at probabilities p = g1(c) and p = 1 − g2(c).

Decision Curve Analysis

We carry out Decision Curve Analysis (DCA) to evaluate and compare the performance of our model in comparison to the ‘treat all’ and ‘treat none’ approaches. Finally, we compare the correctly identified non-attenders (sensitivity) vs follow-ups avoided (the true negatives + false negatives, obtained from the optimal selected model), to calculate the number of women requiring enhanced care, to maximize targeted postpartum follow-up.

Results

Postpartum glucose status was available for 394 (64.91%) out of the 607 women. 340 (56.01%) women underwent OGTT at 6 weeks and 128 (21.09%) underwent the postpartum HbA1c around 13 weeks. prediabetes is present in 92 (23.35%) women. Of these 47 (51.09%) were abnormal by P-FG, 33 (35.87%) by P-PG, and 39 (42.39%) by P-HbA1c. We show the baseline characteristics of these 394 women in Table 1.

View this table:
  • View inline
  • View popup
Table 1:

Comparison of antenatal, delivery and postnatal characteristics of GDM women with presence and absence of prediabetes

Machine Learning Analysis

The data is imbalanced (as expected), with a 23.35% representation of the positive prediabetes class. We compare simple logistic regression with different classification tree methods for predicting prediabetes from training on this small and imbalanced data set. We use class-weight = balanced in the logistic regression algorithm and ‘balanced’ classification tree-based algorithms from the imbalanced-learn python package for developing the tree-based prognostic models. The predictive performance of our proposed framework improves significantly by applying ensemble methods of bagging and boosting to the base decision tree estimator but remains lower than LR. LR gives the area under the ROC curve of 0.7203 from aggregating the test predictions from the leave-one-out cross-validation (Fig 2a). Using the base decision tree algorithm and leave-one-out cross-validation, the area under the ROC curve for the aggregated test predictions is 0.6210, bagging decision trees improves it to 0.6883. Random forests further improve it to 0.6944 using 4-fold stratified cross-validation in CV1 and the maximum area under the ROC curve from the tree-based algorithms is 0.6991 from balanced bagging using histogram-based gradient boosting tree classification algorithm using 4-fold stratified cross-validation. We use 4-fold stratified cross-validation in CV1 instead of leave-one-out for random forests and the boosting algorithm due to the high time complexity of leave-one-out. We conclude that the simplest prediction algorithm for binary classification, LR, outperforms the advanced tree-based methods in the prediction of prediabetes. Our final composite risk score using the LR model with A-FG and A-HbA1c is highly robust for the prediction of prediabetes in GDM women. Out of the n=394 runs of leave-one-out cross-validation, antenatal fasting glucose and antenatal HbA1c are selected 318 (> 80%) times.

Figure 1:
  • Download figure
  • Open in new tab
Figure 1:

Consort diagram of early postpartum glucose tolerance. Caption: The flow chart displays the proportion of GDM women with and without prediabetes. The diagnosis of prediabetes was made if: FPG ≥ 5.6 or 2-hr glucose ≥ 7.8 at postpartum OGTT or HbA1c ≥ 40 mmol/mol.

Figure 2:
  • Download figure
  • Open in new tab
Figure 2:

A. Estimated ROC for the prediction of postpartum prediabetes following a GDM pregnancy using ML model B. Decision curve analysis for the standardized net benefit obtained for the optimal probability threshold. Caption: Fig (a): AUROC (Area under the receiver operating characteristic) was used to evaluate the performance of our machine learning-based method using the logistic regression model on the validation cohort, n= 394 by aggregating the predictions from the test folds of CV1. The area under ROC was 0.7203. The green dots on the ROC curve represent Tin (cin = 0.381), Tin−out (cin−out = 0.260), and Tout (cout = 0.140), from left to right, respectively. Fig (b): The DCA (Decision curve analysis) showed the net benefit obtained from the ML (blue) prediction model. The net benefit of implementing our model in a clinical setting is larger when compared to the follow-up of all GDM women for prediabetes. DCA was derived from the equation, Net benefit Embedded Image, where TP and FP are the true positives and false positives respectively, pt is the probability threshold, and N is the total number of participants in the validation cohort, n=607.

Composite Risk Score calculation

Based on our proposed final Logistic regression model, we calculate the composite risk score, c (or P(prediabetes)), as Embedded Image

Kullback-Leibler (K-L) Divergence and Information Graphs to evaluate and compare diagnostic tests and select optimal cut-point

Tin with Dmax(g1(c), g2(c)) = 0.30 and cin = 0.381 has high specificity of 92%, in concurrence with the ‘Rule-in-specific-test’ principle and Tout with Dmax(g2(c), g1(c)) = 0.28 and cout = 0.140 has high sensitivity of 92%, again in concurrence with the ‘Rule-out-sensitive-test’ principle. Pin = 1.35 and Pout = 1.23 for Tin, and Pin = 1.21 and Pout = 1.33 for Tout, which is the increase (decrease) in disease odds after the test for a diseased (control) individual. Tin−out with max(TKLdiscrete(c)) = 0.51 for cin−out = 0.260 has Pin = 1.31 and Pout = 1.27. Also, maximum of the Youden’s index, Jmax = 0.34 (J (c) = g1(c)+g2(c)−1), and maximum F1-score = 0.49 occurs at the same cin−out = 0.260. Embedded Image, which implies that positive result obtained by Tin is more likely to be true than positive result obtained by Tout. In other words, Tin is more specific and yields fewer false positives compared to Tout. Similarly, Embedded Image shows that Tin is less sensitive with more false negatives.

We generated the information graphs using the equations for I+(x), I−(x), and IE(x) as a function of x = Pr(D1), as shown in Fig 3a, 3b, and 3c. We can observe that Tin provides the most diagnostic information when the test result is positive, and the pre-test probability of a positive result (Pr(D1)) is low. Tout provides the most diagnostic information when the test result is negative, and the pre-test probability of a positive result is high. For Tin−out, we obtain more diagnostic information when the test yields a positive result than a negative one and we obtain maximum information from a positive result at a lower pre-test probability than that from the negative result. In Fig 3d, we can see the information gained using the discrete Bregman divergence representation of TKLdiscrete by adding the vertical distances from the negative Shannon Entropy function to the tangents drawn at probability p = g1(c) and 1 − g2(c).

Figure 3:
  • Download figure
  • Open in new tab
Figure 3:

Information graphs for comparing Rule-in and Rule-out test potentials of diagnostic tests using different cut-points for predicting a low and high risk of prediabetes in GDM women Caption: Information graphs provide means to distinguish between diagnostic test performance. We compared the diagnostic information obtained from Tout, Tin−out, and Tin defined by the cut-points 0.140, 0.260, 0.381. A positive diagnosis made by the ‘Rule-in-specific-test’ and a negative diagnosis made by the ‘Rule-out-sensitive-test’ gives us the most information, as expected. Fig (a), (b), (c): Maximum information from a positive test diagnosis (blue) is obtained at a lower pre-test probability than the maximum information from a negative test diagnosis (red). The diagnostic test with a lower cut-point gives maximum information when the diagnosis is negative (i.e, the test is very sensitive and we can rule out the negative cases safely) and the diagnostic test with a higher cut-point gives maximum information when the diagnosis is positive (i.e, the test is very specific to the disease and we can rule in the positive cases safely). IE is the expected information from the diagnostic test (x × I+ + (1 − x) × I−, where x is the probability of a positive test diagnosis). Fig (d): The sum of the distances between the tangents to the negative Shannon entropy function at p = g1 (c) and p = 1 − g2 (c) is the discrete Bregman divergence, which represents total K-L divergence.

Using the prognostic model with LR, 15 out of 100 women are above the optimal threshold of 0.381, and focusing on these women could improve the early prediabetes diagnosis. 28 out of 100 women are below the optimal threshold of 0.140, and testing for early prediabetes diagnosis can be safely avoided in this category. The model shows 92% sensitivity for the rule-in test and 92% specificity for the rule-out test, Table 3 shows the sensitivity, specificity, PPV, NPV, F1 score, accuracy, and other measures related to K-L divergence at different probability thresholds.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 2:

Factors associated with postpartum prediabetes by machine learning model

View this table:
  • View inline
  • View popup
Table 3:

Performance of the diagnostic test for postpartum prediabetes at various probability thresholds

Decision Curve Analysis

In the decision curve analysis by comparing the ‘treat all’ and ‘treat none’ approaches, the ML model obtains a higher standardized net benefit as compared to the universal screening of all GDM women for early prediabetes (Fig 2b).

Discussion

In this study, we try to predict at the time of delivery if the women diagnosed with GDM are at high risk of getting diagnosed with postpartum prediabetes at 6-13 weeks postpartum. For this purpose, we employ a variety of machine learning techniques including both logistic regression and advanced tree-based algorithms and train the models using routinely collected antenatal and delivery variables as predictors. Our proposed model using nested cross-validation and logistic regression algorithm can effectively predict prediabetes in GDM women, using only the antenatal predictors fasting glucose and HbA1c, with good sensitivity and specificity.

The use of machine learning for predicting postpartum prediabetes in GDM-diagnosed women has been rarely studied. We are aware of only one study that has made use of machine learning algorithms to predict the occurrence of T2DM post-GDM: Krishnan et al.24 proposed random forest and gaussian naive Bayes algorithms to predict T2DM after GDM, and achieved a modest specificity of 23% at a sensitivity of 88%. It also lacked the use of advanced techniques to deal with imbalanced data. Real-world medical data is scarce due to the different challenges posed in its collection. To the best of our knowledge, there is no larger data collected for studying prediabetes in GDM women than the data in the present study. In our study, we propose a more personalized approach to identifying postpartum prediabetes after GDM, at the antenatal visit itself, by calculating a simple score based on only two easy-to-measure biochemical predictors, obtained using machine learning techniques and a logistic regression algorithm, with good sensitivity and specificity (each of 92% for Rule-out and Rule-in tests, respectively). Further, we suggest different cut-offs for classifying high-risk women depending upon resource availability.

The proposed prediction test needs only the antenatal fasting glucose (at the time of antenatal OGTT) and HbA1c, usually measured soon after the diagnosis of GDM for clinical use. Thus, no additional tests/costs are involved, and is easy to use by healthcare professionals. The information theory analysis proposes different cut-offs for classification according to the requirement of ruling-in or ruling-out the prediabetes condition in GDM-diagnosed women. All women diagnosed with GDM during pregnancy are recommended to have annual screening,19, 25 although the compliance is currently poor.5, 26 Therefore, we can allow for more false positives than false negatives and propose cout = 0.140 as the optimal cut-off for classification. However, in low-resource settings, we can primarily focus on women with P (prediabetes) ≥ cin = 0.381 and then consider women with P (prediabetes) ≥ cin−out = 0.260 in the following step. If resource constraint is not an issue, we can target women with P (prediabetes) ≥ cout = 0.140 as well. Targeting GDM women stepwise according to their risk of developing prediabetes is more personalized than the blanket approach of targeting all women with GDM. This could be a pragmatic approach in settings with limited resources. The desired cut-off out of cin, cout, or cin−out can be chosen depending upon the purpose and setting in which this diagnostic test is used.

Postpartum weight loss has been shown to reduce the risk of incidence of T2DM and recurrent GDM in the subsequent pregnancy.27, 28 However, initiating such lifestyle interventions can be difficult due to lack of personalization and may not produce optimum results due to poor adherence by the women.29 Our approach to identifying women with a high risk of prediabetes (using any ‘c’) can provide an improved understanding of individualized prediabetes risk which can be used to target women for interventions (diet and lifestyle, encourage breastfeeding, etc) for postpartum weight loss. This can in turn improve their T2DM and CVD risk profile. Women are most conducive to interventions during pregnancy and also maintain close contact with healthcare professionals. Identifying the high-risk women during the antenatal visits will help the healthcare professionals to implement necessary interventions throughout the remaining pregnancy period, and also encourage postpartum follow-up.

We believe that the results obtained are supportive for testing and validating our Rule-in and Rule-out composite risk score approach on a larger prospective dataset.

The key strength of our study is the use of a variety of machine learning techniques and the comparison of the LR algorithm with tree-based algorithms for developing the prognostic model for individualized risk prediction of prediabetes following GDM pregnancy. In addition, to the best of our knowledge, this is the first study that used K-L divergence and information graphs for evaluating and comparing different diagnostic tests at different cut-points and explaining their rule-in and rule-out potentials. However, our study has important limitations. First, this is retrospective data and hence other potential variables that could influence the prediabetes status such as gestational weight gain and insulin treatment were not electronically available. Second, postpartum glucose status data was only available in 65.0% of the cohort, although this follow-up rate for postpartum glucose testing was higher than the national average. Finally, while the sample size is small (n = 394 and n = 92 for the prediabetes class) for machine learning analyses, this was adequate based on the substantial predictive performance and the power calculations. In addition, the only available literature to our knowledge that looked at predicting the onset of T2DM following GDM was based on only 77 patient records with 15 variables.24

Conclusions

This study shows that our proposed model using a logistic regression algorithm is effective for the prediction of prediabetes in GDM women by using the already available antenatal fasting glucose and antenatal HbA1c. We believe that this approach is easy for practical use with no additional cost and could be extremely effective for individualized risk stratification of GDM women. This approach could be used for targeted glucose testing during the postpartum period in a resource-constrained setting.

Data Availability

The full dataset is available upon request following the completion of a suitable confidentiality agreement.

Declarations

Funding

PS and YW are partly funded by Medical Research Council, UK (MR/R020981/1); DP is funded by Warwick-Novo Nordisk international Doctoral Training Program and NP’s Ph.D. is funded by Chancellor’s International Scholarship, University of Warwick. RS is supported by his institution, funded by the Department of Atomic Energy, Government of India.

Math variables used in the paper

n: Sample size (n = 607)

x: variable

m: Number of variables (m = 25)

y: class label

Availability of data and materials

The full dataset is available upon request following the completion of a suitable confidentiality agreement.

Ethics approval and consent to participate

This study was carried out as an audit and quality improvement project using an anonymous dataset. The local clinical governance team approved the audit and formal ethical approval was not required.

Competing interests

The authors declare that they have no competing interests.

Consent for publication

Not applicable.

Authors’ contributions

  1. Conceptualization: DP, LN, RS, and PS

  2. Data curation: DP, NP, NS, YW

  3. Formal analysis: DP

  4. Funding acquisition: PS

  5. Investigation: NP, PS

  6. Methodology: DP, PS, LN, RS

  7. Project administration: VP, NS, PS

  8. Resources: VP, NS, PS

  9. Software: DP

  10. Supervision: PS, LN, RS

  11. Writing - original draft: DP

  12. Writing - review & editing: All authors

Acknowledgements

This study was conducted as an audit at the George Eliot Hospital NHS Trust, Nuneaton, UK. We extend our thanks to Mrs. J Wilson, Mrs. J Plester, Mrs. T Ritchie, and Mr. S Selvamoni for their help in providing the list of all the GDM women. Mrs. T Ritchie is the advanced nurse practitioner who facilitates the postpartum testing.

Abbreviations

GDM
Gestational Diabetes Mellitus
T2DM
Type 2 Diabetes Mellitus
CVD
Cardiovascular Disease
OGTT
Oral Glucose Tolerance Test
LR
Logistic Regression
ROC
Receiver Operating Characteristic
DCA
Decision Curve Analysis

References

  1. 1.↵
    Bellamy, L., Casas, J.P., Hingorani, A.D., Williams, D.: Type 2 diabetes mellitus after gestational diabetes: a systematic review and meta-analysis. The Lancet 373, 1773–1779 (2009). doi:10.1016/S0140-6736(09)60731-5
    OpenUrlCrossRefPubMedWeb of Science
  2. 2.↵
    Noctor, E.: Type 2 diabetes after gestational diabetes: The influence of changing diagnostic criteria. World Journal of Diabetes 6, 234 (2015). doi:10.4239/wjd.v6.i2.234
    OpenUrlCrossRefPubMed
  3. 3.↵
    Kramer, C.K., Campbell, S., Retnakaran, R.: Gestational Diabetes and the Risk of Cardiovascular Disease in Women: a Systematic Review and Meta-analysis. doi:10.1007/s00125-019-4840-2
    OpenUrlCrossRefPubMed
  4. 4.
    Carr, D.B., Utzschneider, K.M., Hull, R.L., Tong, J., Wallace, T.M., Kodama, K., Shofer, J.B., Heckbert, S.R., Boyko, E.J., Fujimoto, W.Y., Kahn, S.E.: Gestational diabetes mellitus increases the risk of cardiovascular disease in women with a family history of type 2 diabetes. Diabetes Care 29, 2078–2083 (2006). doi:10.2337/dc05-2482
    OpenUrlAbstract/FREE Full Text
  5. 5.↵
    Daly, B., Toulis, K.A., Thomas, N., Gokhale, K., Martin, J., Webber, J., Keerthy, D., Jolly, K., Saravanan, P., Nirantharakumar, K.: Increased risk of ischemic heart disease, hypertension, and type 2 diabetes in women with previous gestational diabetes mellitus, a target group in general practice for preventive interventions: A population-based cohort study. PLoS Medicine 15 (2018). doi:10.1371/journal.pmed.1002488
    OpenUrlCrossRefPubMed
  6. 6.↵
    Saravanan, P., Magee, L.A., Banerjee, A., Coleman, M.A., Von Dadelszen, P., Denison, F., Farmer, A., Finer, S., Fox-Rushby, J., Holt, R., et al: Gestational diabetes: opportunities for improving maternal and child health. The Lancet Diabetes & Endocrinology 8(9), 793–800 (2020)
    OpenUrl
  7. 7.↵
    Kim, C., Newton, K.M., Knopp, R.H.: Gestational Diabetes and the Incidence of Type 2 Diabetes A Systematic Review
  8. 8.↵
    Retnakaran, R., Qi, Y., Sermer, M., Connelly, P.W., Hanley, A.J.G., Zinman, B.: Glucose intolerance in pregnancy and future risk of pre-diabetes or diabetes (2008). doi:10.2337/d08-0972
    OpenUrlCrossRef
  9. 9.↵
    Wang, T., Zheng, W., Huang, W., Zhang, L., Tian, Z., Zhang, T., Yan, Q., Li, G.: Risk factors for abnormal postpartum glucose outcome in women with gestational diabetes mellitus diagnosed by modified the international association of the diabetes and pregnancy study groups criteria. Journal of Obstetrics and Gynaecology Research 45, 1545–1552 (2019). doi:10.1111/jog.14009
    OpenUrlCrossRef
  10. 10.↵
    Retnakaran, R., Qi, Y., Sermer, M., Connelly, P.W., Hanley, A.J.G., Zinman, B.: β-cell function declines within the first year postpartum in women with recent glucose intolerance in pregnancy. Diabetes Care 33, 1798–1804 (2010). doi:10.2337/dc10-0351
    OpenUrlAbstract/FREE Full Text
  11. 11.↵
    Retnakaran, R., Qi, Y., Sermer, M., Connelly, P.W., Zinman, B., Hanley, A.J.G.: Isolated hyperglycemia at 1 hour on oral glucose tolerance test in pregnancy resembles gestational diabetes mellitus in predicting postpartum metabolic dysfunction. Diabetes Care 31, 1275–1281 (2008). doi:10.2337/dc08-0126
    OpenUrlAbstract/FREE Full Text
  12. 12.↵
    Retnakaran, R., Qi, Y., Connelly, P.W., Sermer, M., Zinman, B., Hanley, A.J.G.: Glucose intolerance in pregnancy and postpartum risk of metabolic syndrome in young women. Journal of Clinical Endocrinology and Metabolism 95, 670–677 (2010). doi:10.1210/jc.2009-1990
    OpenUrlCrossRefPubMed
  13. 13.↵
    Benhalima, K., Crombrugge, P.V., Moyson, C., Verhaeghe, J., Vandeginste, S., Verlaenen, H., Vercammen, C., Maes, T., Dufraimont, E., Block, C.D., Jacquemyn, Y., Mekahli, F., Clippel, K.D., Bruel, A.V.D., Loccufier, A., Laenen, A., Minschart, C., Devlieger, R., Mathieu, C.: Prediction of glucose intolerance in early postpartum in women with gestational diabetes mellitus based on the 2013 who criteria. Journal of Clinical Medicine 8 (2019). doi:10.3390/jcm8030383
    OpenUrlCrossRef
  14. 14.↵
    Tang, L., Xu, S., Li, P., Li, L.: Predictors of insulin treatment during pregnancy and abnormal postpartum glucose metabolism in patients with gestational diabetes mellitus. Diabetes, Metabolic Syndrome and Obesity: Targets and Therapy 12, 2655–2665 (2019). doi:10.2147/DMSO.S233554
    OpenUrlCrossRef
  15. 15.↵
    Noctor, E., Mb, C.C., Carmody, L.A., Msc, B.A., Saunders, J.A., Rn, B.K., O’, A., Phd, D., Phd, P.G., Glynn, L.G., Mcguire, B.E., O’neill, C., O’shea, P.M., Dunne, F.P.: Abnormal Glucose Tolerance Post-gestational Diabetes Mellitus as Defined by International Association of Diabetes and Pregnancy Study Groups Criteria
  16. 16.↵
    Ogonowski, J., Miazgowski, T.: The prevalence of 6 weeks postpartum abnormal glucose tolerance in caucasian women with gestational diabetes. Diabetes Research and Clinical Practice 84, 239–244 (2009). doi:10.1016/j.diabres.2009.04.003
    OpenUrlCrossRefPubMed
  17. 17.↵
    Kjos, S.L., Buchanan, T.A., Greenspoon, J.S., Montoro, M., Bernstein, G.S., Mestman, J.H.: Gestational diabetes mellitus: The prevalence of glucose intolerance and diabetes mellitus in the first two months post partum. American Journal of Obstetrics and Gynecology 163, 93–98 (1990). doi:10.1016/S0002-9378(11)90676-0
    OpenUrlCrossRefPubMedWeb of Science
  18. 18.↵
    Catalano, P.M., Vargo, K.M., Bernstein, I.M., Amini, S.B.: Incidence and risk factors associated with abnormal postpartum glucose tolerance in women with gestational diabetes. American Journal of Obstetrics and Gynecology 165, 914–919 (1991). doi:10.1016/0002-9378(91)90438-W
    OpenUrlCrossRefPubMedWeb of Science
  19. 19.↵
    Webber, J., Charlton, M., Johns, N.: Diabetes in pregnancy: management of diabetes and its complications from preconception to the postnatal period (ng3). British Journal of Diabetes 15(3), 107–111 (2015)
    OpenUrl
  20. 20.↵
    Kitzmiller, J.L., Dang-Kilduff, L., Taslimi, M.M.: Gestational diabetes after delivery: Short-term management and long-term risks. Diabetes Care 30 (2007). doi:10.2337/dc07-s221
    OpenUrlFREE Full Text
  21. 21.↵
    Lee, W.-C.: Selecting diagnostic tests for ruling out or ruling in disease: the use of the kullback-leibler distance. International journal of epidemiology 28(3), 521–525 (1999)
    OpenUrlCrossRefPubMedWeb of Science
  22. 22.↵
    Samawi, H.M., Yin, J., Zhang, X., Yu, L., Rochani, H., Vogel, R., Mo, C.: Kullback-leibler divergence for medical diagnostics accuracy and cut-point selection criterion: How it is related to the youden index (2020). doi:10.37532/jabcb.2020.9(2).168
    OpenUrlCrossRef
  23. 23.↵
    Benish, W.: The use of information graphs to evaluate and compare diagnostic tests. Methods of information in medicine 41(02), 114–118 (2002)
    OpenUrlPubMedWeb of Science
  24. 24.↵
    Krishnan, D.R., Menakath, G.P., Radhakrishnan, A., Himavarshini, Y., Aparna, A., Mukundan, K., Pathinarupothi, R.K., Alangot, B., Mahankali, S., Maddipati, C.: Evaluation of predisposing factors of diabetes mellitus post gestational diabetes mellitus using machine learning techniques. In: 2019 IEEE Student Conference on Research and Development (SCOReD), pp. 81–85 (2019). IEEE
  25. 25.↵
    Association, A.D., et al: 14. management of diabetes in pregnancy: Standards of medical care in diabetes—2020. Diabetes Care 43(Supplement 1), 183–192 (2020)
    OpenUrl
  26. 26.↵
    Periyathambi, N., Parkhi, D., Ghebremichael-Weldeselassie, Y., Patel, V., Sukumar, N., Siddharthan, R., Narlikar, L., Saravanan, P.: Machine learning prediction of non-attendance to postpartum glucose screening and subsequent risk of type 2 diabetes following gestational diabetes. PLOS ONE 17(3), 0264648 (2022). doi:10.1371/journal.pone.0264648
    OpenUrlCrossRef
  27. 27.↵
    Phelan, S.: Pregnancy: a “teachable moment” for weight control and obesity prevention. American Journal of Obstetrics and Gynecology 202, 135–11358 (2010). doi:10.1016/j.ajog.2009.06.008
    OpenUrlCrossRefPubMed
  28. 28.↵
    Phelan, S.: Windows of opportunity for lifestyle interventions to prevent gestational diabetes mellitus. American Journal of Perinatology 33, 1291–1299 (2016). doi:10.1055/s-0036-1586504
    OpenUrlCrossRef
  29. 29.↵
    O’Reilly, S.L., Dunbar, J.A., Versace, V., Janus, E., Best, J.D., Carter, R., Oats, J.J., Skinner, T., Ackland, M., Phillips, P.A., et al: Mothers after gestational diabetes in australia (magda): a randomised controlled trial of a postnatal diabetes prevention program. PLoS medicine 13(7), 1002092 (2016)
    OpenUrl
Back to top
PreviousNext
Posted February 23, 2023.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Machine learning prediction of early postpartum prediabetes in women with gestational diabetes mellitus
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Machine learning prediction of early postpartum prediabetes in women with gestational diabetes mellitus
Durga Parkhi, Nishanthi Periyathambi, Yonas Weldeselassie, Vinod Patel, Nithya Sukumar, Rahul Siddharthan, Leelavati Narlikar, Saravanan Ponnusamy
medRxiv 2023.02.16.23286016; doi: https://doi.org/10.1101/2023.02.16.23286016
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Machine learning prediction of early postpartum prediabetes in women with gestational diabetes mellitus
Durga Parkhi, Nishanthi Periyathambi, Yonas Weldeselassie, Vinod Patel, Nithya Sukumar, Rahul Siddharthan, Leelavati Narlikar, Saravanan Ponnusamy
medRxiv 2023.02.16.23286016; doi: https://doi.org/10.1101/2023.02.16.23286016

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Endocrinology (including Diabetes Mellitus and Metabolic Disease)
Subject Areas
All Articles
  • Addiction Medicine (349)
  • Allergy and Immunology (668)
  • Allergy and Immunology (668)
  • Anesthesia (181)
  • Cardiovascular Medicine (2648)
  • Dentistry and Oral Medicine (316)
  • Dermatology (223)
  • Emergency Medicine (399)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
  • Epidemiology (12228)
  • Forensic Medicine (10)
  • Gastroenterology (759)
  • Genetic and Genomic Medicine (4103)
  • Geriatric Medicine (387)
  • Health Economics (680)
  • Health Informatics (2657)
  • Health Policy (1005)
  • Health Systems and Quality Improvement (985)
  • Hematology (363)
  • HIV/AIDS (851)
  • Infectious Diseases (except HIV/AIDS) (13695)
  • Intensive Care and Critical Care Medicine (797)
  • Medical Education (399)
  • Medical Ethics (109)
  • Nephrology (436)
  • Neurology (3882)
  • Nursing (209)
  • Nutrition (577)
  • Obstetrics and Gynecology (739)
  • Occupational and Environmental Health (695)
  • Oncology (2030)
  • Ophthalmology (585)
  • Orthopedics (240)
  • Otolaryngology (306)
  • Pain Medicine (250)
  • Palliative Medicine (75)
  • Pathology (473)
  • Pediatrics (1115)
  • Pharmacology and Therapeutics (466)
  • Primary Care Research (452)
  • Psychiatry and Clinical Psychology (3432)
  • Public and Global Health (6527)
  • Radiology and Imaging (1403)
  • Rehabilitation Medicine and Physical Therapy (814)
  • Respiratory Medicine (871)
  • Rheumatology (409)
  • Sexual and Reproductive Health (410)
  • Sports Medicine (342)
  • Surgery (448)
  • Toxicology (53)
  • Transplantation (185)
  • Urology (165)