# Comparing the Lifestyle Interventions for Prediabetes: An Integrated Microsimulation and Population Simulation Model

We developed a model to compare the impacts of different lifestyle interventions among prediabetes individuals and to identify the optimal age groups for such interventions. A stochastic simulation was developed to replicate the prediabetes and diabetes trends (1997–2010) in the U.S. adult population. We then simulated the population-wide impacts of three lifestyle diabetes prevention programs, i.e., the Diabetes Prevention Program (DPP), DPP-YMCA, and the Healthy Living Partnerships to Prevent Diabetes (HELP-PD), over a course of 10, 15 and 30 years. Our model replicated the temporal trends of diabetes in the U.S. adult population. Compared to no intervention, the diabetes incidence declined 0.3 per 1,000 by DPP, 0.2 by DPP-YMCA, and 0.4 by HELP-PD over the 15-year period. Our simulations identified HELP-PD as the most cost-effective intervention, which achieved the highest 10-year savings of $38 billion for those aged 25–65, assuming all eligible individuals participate in the intervention and considering intervention achievement rates. Our model simulates the diabetes trends in the U.S. population based on individual-level longitudinal data. However, it may be used to identify the optimal intervention for different subgroups in defined populations. ## Introduction Over 25 million Americans have diabetes, with approximately 2 million new diagnoses annually1. Complications associated with diabetes incur significant costs to individuals, families, and health care systems2. The costs of diabetes in the U.S. was$255 billion in 2012, a 41% increase since 20073. Prediabetes represents a stage when the blood glucose exceeds the optimal level but has not met the diagnosis criteria of diabetes4. Emerging evidence from randomized controlled clinical trials demonstrated that lifestyle interventions for prediabetes individuals can delay the onset of diabetes5,6,7,8,9, and reduce total and cardiovascular mortality10. Thus, individuals with prediabetes represent a unique high-risk population that might benefit from lifestyle interventions without pharmacological treatment. In 2017, the Centers for Disease Control and Prevention (CDC) reported that 84.1 million U.S. adults are estimated to have prediabetes, which often progresses to diabetes within a few years if untreated11. However, interventions for prediabetes require substantial resources and it is not yet clear how to allocate public health and clinical resources to minimize diabetes incidence. Questions remain on whether the benefits of such interventions sustain beyond the active treatment period and whether such interventions can reduce mortality rate. Seeking cost-effective interventions targeted on prediabetes individuals that can be conducted in clinical or community settings is a top priority for fighting the diabetes epidemic.

Computer models simulating the impacts of screening, lifestyle interventions, or different treatments are used to assess the effectiveness, costs, complications, and quality of life improvements. Several studies used Markov/semi-Markov chains to model the progression of diabetes status in an individual or population12,13,14,15. These studies used transition probabilities to model the change in the disease progression, whereas our framework considers a microsimulation model to incorporate heterogeneity, compare efficacy and cost-effectiveness of a variety of lifestyle interventions among the U.S. adult population, as well as identify optimal age groups for each intervention. Our model is different from other diabetes models in the following ways. First, the transition probabilities between disease stages were built upon individual-level data rather than a parameter value extracted from literature. This feature enables us to model the progression of each individual reflecting unique demographic characteristics. Second, our model simulates the entire life history of each individual, which allows us to explore the optimal age range of different interventions. Third, our life course approach estimates the gain in diabetes-free survival time for any given intervention, i.e., the average time for an individual with prediabetes conditions to develop diabetes.

## Methods

We developed a stochastic population-based simulation model to replicate prediabetes and diabetes trends in the U.S. adult population from 1997 to 2010. The stochastic progression of each individual is based on a microsimulation model for which the transition probability matrix (TPM) was estimated with data obtained from the PREMIER study16. The microsimulation and population simulation models were validated in several dimensions. Most importantly, the population simulation model was validated for the U.S. adult population from 1997 to 2010 (see Supplement A). To estimate and validate the efficacy of multiple intervention strategies, we extended the model for a course of 5 (2010 to 2015), 10 (2010 to 2020), and 15 (2010 to 2025) years and simulated the diabetes trends for three lifestyle interventions5,17,18. The efficacy was evaluated by comparing the population progression outcomes in intervention groups with those of a control group for categories such as diabetes incidence, prevalence, and mortality rates. To investigate cost-efficiency, the simulation is extended for a course of 10 (2010 to 2020) and 30 (2010 to 2040) years to evaluate averted cases of death and diabetes, as well as the average delay for diabetes onset (i.e., gain in diabetes-free survival time). We described the population-based simulation and the microsimulation in the following steps. Figure 1 demonstrates an overview of both simulation models, their modules, and the relationship between the population-based simulation model and the microsimulation model.

### Population-based simulation

We developed a population-based model to estimate and compare the impacts (both health and economic outcomes) of lifestyle interventions on prediabetes and diabetes trends. We used object-oriented programming to develop the population-based simulation, where progression of each individual is governed by the microsimulation. The population-based simulation model consists of an Initial Population Module to generate initial population, a Birth Module which simulates the population growth, and a microsimulation model, described in Step 2 (Fig. 1), to simulate the progression of each individual.

#### Initial population module

We simulated U.S. adult (>20) population in 1997 based on distributions of age, gender, body mass index (BMI), and diabetes status19,20,21. We validated this module by comparing the simulated population for each demographic characteristic with the corresponding U.S. population between 1997 and 2010 (Supplementary Table S1).

#### Birth module

Each year, “birth” to the population is computed as the number of those turning 20 years old, calculated from available data sets (Supplementary Table S8). The distribution of newly added individuals in each year (with respect to age, gender, BMI, and diabetes status) corresponds to that of the U.S. population at that year. We validated the population-based simulation model from 1997 to 2010 in five dimensions: total population (Supplementary Table S1), diabetes incidence (Supplementary Table S2), diabetes prevalence (Supplementary Table S3), prediabetes prevalence (Supplementary Table S4), and mortality rate (Supplementary Table S5). Therefore, the overall simulation model describes the complex progression of individuals from prediabetes to diabetes correlating the progression to the distributions of BMI in each age-gender category.

### Microsimulation

Microsimulation is characterized by specifying the probabilistic progression of each individual based on demographic characteristics. Each individual is characterized by age, gender, BMI, and fasting glucose level (FGL). We discretized age to 17 groups: 20–25, 25–30, .…., 100+; BMI to three groups: “Normal weight” (BMI between 20–25 kg/m2), “Overweight” (BMI between 25–30 kg/m2), and “Obese” (BMI greater than 30 kg/m2); FGL to three groups: “No diabetes” (less than 100 mg/dl), “Prediabetes” (between 100–125 mg/dl), and “Diabetes” (greater than 125 mg/dl). The microsimulation model consists of a Mortality Module, which decides on termination of life for each individual, an Intervention Module described in details in Step 3 (Fig. 1), which simulates the interventions’ effects, an Age/BMI Module, which increments age and replicates BMI trends, and a Diabetes Progression Module in which progression between diabetes statuses “No Diabetes,” “Prediabetes,” and “Diabetes” is simulated. To calibrate and/or validate each module, the microsimulation model is run from 1997 to 2010, where for each individual at baseline (1997), age and gender were assigned based on a cumulative density function (CDF) obtained from the 1997 census22. BMI and FGL are assigned based on a CDF obtained from CDC data described in details below23,24.

#### Mortality module

At the beginning of each year, the model determines whether an individual will be alive or dead at the end of that year. To that end, for individuals whose diabetes status is “No diabetes” or “Prediabetes”, a random number between zero and one is generated and calibrated to the CDF of age-gender-specific U.S. adult mortality data25. This module is validated by comparing life expectancies generated by the mortality module and CDC estimations in the same period (Supplementary Table S6). For diabetes patients, a random number between zero and one is generated and calibrated to the mortality rate produced by a validated Cox proportional-hazards model for diabetic individuals26. Note that the original Cox model contained covariates such as cancer history which are not considered in our model. Therefore, the model is adjusted by adding a constant (i.e., changing the baseline hazard) to compensate for the effects of missed factors and is calibrated with respect to the U.S. population data27 (Supplementary Table S7).

#### Age/BMI module

At each time period, age is incremented deterministically, i.e., age at period (t + 1) is age at period (t) + 1. BMI values change stochastically based on a regression model constructed from the reported data of the average BMI trend for each gender in the U.S.19,20,21. The module also decided whether to change the BMI of an individual in each period. Consequently, the amount of change is determined based on a probability distribution whose expected value equals the rate of BMI change produced by the regression model (Supplementary Table S9).

#### Diabetes progression module

The FGL progression in an individual is modeled by a Markov chain with three states: (1) “No diabetes” (less than 100 mg/dl), (2) “Prediabetes” (between 100 and 125 mg/dl), and (3) “Diabetes” (greater than 125 mg/dl). Markov chains are a common modeling technique to describe the probabilistic progression of patients’ health over time28. We constructed TPMs using the longitudinal data from the PREMIER study (see Supplement B).

### Intervention module

Although the model is flexible to simulate a variety of interventions, we simulated the impacts of three proven lifestyle interventions DPP, DPP-YMCA, and HELP-DP (the latter two being the translational versions of DPP) for 10 and 30 years starting from 2010. These interventions were chosen for simulation here because they are U.S.-based diabetes prevention interventions with proven effectiveness and publicly available cost information29,30,31,32. In simulating these interventions, we did not reduce BMI for individuals whose BMI was below 18.5 (underweight). In the first year of the microsimulation model, an intervention is simulated by reducing the BMI of eligible individuals. This reduction is only sustained for one year. After the first year, each individual’s BMI progresses according to an individual-level stochastic model. To have a fair comparison, we considered the same sub-population for all interventions: age 25 to 65, BMI 25+, and an elevated FGL. We also conducted a separate simulation study where each intervention is simulated for the population that is associated with its inclusion criteria (see Supplement C).

#### Diabetes prevention program (DPP)

DPP is a randomized control trial where prediabetes participants are assigned to three groups: placebo, metformin treatment, or lifestyle intervention5. We only simulated the lifestyle intervention which achieved at least 7% BMI reduction for 38% of participants measured some time after the first year of the trial. One may interpret the 38% figure as the achievement rate for the intervention, which is considered for other interventions as well.

#### DPP-young men’s christian association (DPP-YMCA)

DPP-YMCA adapted the DPP to group sessions and delivered them through Young Men’s Christian Association17. We simulated the trial’s lifestyle intervention which achieved at least 5% BMI reduction for 32.4% of participants after one year.

#### Healthy-living partnerships to prevent diabetes (HELP-PD)

HELP-PD is a diabetes education program delivered by community health workers18,33. We simulated the diabetes education intervention, which achieved at least 5% BMI reduction for 58.5% of participants after one year.

## Analysis

We considered three performance measures: (1) Diabetes Incidence, which is the number of new cases of diabetes in each year divided by the population, excluding those who had been previously diagnosed with diabetes; (2) Diabetes Mortality, which is the number of diabetes death cases divided by the population; and (3) Diabetes Prevalence, which is the total number of diabetes cases divided by the population. To estimate and validate efficacy, we simulated the impacts of DPP, DPP-YMCA, and HELP-PD interventions over a course of 5, 10, 15 years in the U.S. adult population starting from 2010 using the same target population for each intervention. The improvement percentage for a particular performance measure is calculated according to $$\frac{({{\rm{rate}}}_{I,15}-{{\rm{rate}}}_{NI,15})}{{{\rm{rate}}}_{NI,15}}$$, where rateI,15 and rateNI,15 denote the performance measure of the intervention and no-intervention groups at the end of 15 years, respectively. To compare cost-effectiveness and test the robustness of the optimal target population over time, we simulated 10-year (short-term) and 30-year (long-term) performance of each intervention in terms of total number of death cases averted, total number of diabetes cases averted, and average gain in diabetes-free survival time in four different age groups (25–65, 35–65, 45–65, and 55–65 years). The aversion in death and diabetes cases refer to the difference in number of death and new diabetes cases in the intervention and control population. We conducted an analysis to identify the optimal age group that produces minimal diabetes cases with maximum cost savings. The optimal target population was defined as the one that achieves maximum cost savings, calculated by the number of diabetes averted multiplied by the benefit of averting diabetes (medical cost associated with diabetes) minus the total cost of implementing the intervention. The cost for each intervention per individual in DPP, DPP-YMCA, and HELP-PD is obtained from literature29,30,31, as are the age-gender weighted average lifetime medical costs associated with diabetes13. We estimated the gain in diabetes-free survival time attributable to lifestyle interventions. We created two identical sets of 2010 U.S. populations in the population-based simulation model and applied each intervention to one of the populations and treated the other population as a control. We then kept track of the times required for an individual to develop diabetes in both populations. In the case an individual developed diabetes in both populations, the time when the individual developed diabetes in the intervention group was subtracted from that of the control group. If an individual developed diabetes in neither of the populations, then the delay is set to zero. In the case an individual developed diabetes in one population and not in the other, if the individual developed diabetes in the control population, the final simulation time was subtracted from the diabetes development time and the difference was reported as an estimate for the gain in diabetes-free survival time. However, if the individual developed diabetes in the intervention population, the time of diabetes development is subtracted from the final simulation time and is reported as an estimate for the loss in diabetes-free survival time. Finally, if an individual dies before developing diabetes in one population, then we exclude the individual from analysis.

Our simulation models are coded using C++ which are available by request. Parameter inputs and calculations are provided in Supplement B, and Supplementary Tables S9S11.

## Results

Our simulated population of U.S. adults (>20 years) corresponded well to the observed population from 1997 to 2010 (see Supplement A). Our Initial Population Module is validated by comparing total U.S. adult population (Supplementary Table S1) and total U.S. population by different age groups, BMI and diabetes status (Supplementary Figs S1S3). The model slightly underestimated the diabetes prevalence rate (Supplement A), which is likely to make conservative contributions to intervention performance. The Birth Module is validated by diabetes incidence, diabetes prevalence, prediabetes prevalence, and death cases over time34 (Figs 2, 3, and Supplementary Tables S2S5). Finally, our Mortality Module is validated by comparing life expectancies with respect to diabetes status (Supplementary Tables S6 and S7).

### Performance measures

Three performance measures (differences in diabetes incidence, diabetes prevalence, and mortality) after applying three interventions for one year as compared with a simulated control population (without intervention) in a course of 5, 10, and 15 years are presented in Table 1. The reduction in diabetes incidence (without intervention - with intervention) over 5, 10, and 15 years was 0.06, 0.03, and 0.03 per 100 by DPP, 0.03, 0.02, and 0.02 per 100 by DPP-YMCA, and 0.06, 0.04, and 0.04 per 100 by HELP-PD. Such reductions are about an improvement in diabetes incidence over 5, 10, and 15 years of 6.25%, 2.96%, and 2.99% by DPP, 3.39%, 1.91%, and 2.16% by DPP-YMCA and 6.79%, 3.38%, and 3.28% by HELP-PD. The reduction in diabetes prevalence over 5, 10, and 15 years were 0.18, 0.28, and 0.33 per 100 by DPP, 0.12, 0.2, and 0.2 per 100 by DPP-YMCA, and 0.2, 0.31, and 0.35 per 100 by HELP-PD. The reduction in mortality rates due to diabetes after 5, 10, and 15 years were 0.01, 0.02, and 0.03 per 100 in DPP, 0.0, 0.01, and 0.01 per 100 in DPP-YMCA, and 0.01, 0.02 and 0.03 per 100 by HELP-PD.

## References

1. 1.

Centers for Disease Control and Prevention. Increasing prevalence of diagnosed diabetes–united states and puerto rico, 1995–2010. Morb. Mortal. Wkly. Rep. 61, 918 (2012).

2. 2.

American Diabetes Association. Economic costs of diabetes in the U.S. in 2012. Diabetes Care 36, 1033–1046, https://doi.org/10.2337/dc12-2625 (2013).

3. 3.

Boyle, J. P., Thompson, T. J., Gregg, E. W., Barker, L. E. & Williamson, D. F. Projection of the year 2050 burden of diabetes in the US adult population: dynamic modeling of incidence, mortality, and Prediabetes prevalence. Popul. Heal. Metrics 8, 1–12, https://doi.org/10.1186/1478-7954-8-29 (2010).

4. 4.

American Diabetes Association. Classification and diagnosis of diabetes. Diabetes Care 40, S11–S24, https://doi.org/10.2337/dc18-S002 (2017).

5. 5.

Diabetes Prevention Program Research Group. Reduction in the incidence of type 2 diabetes with lifestyle intervention or metformin. The New Engl. J. Medicine 346, 93, 10.1056%2FNEJMoa012512 (2002).

6. 6.

Tuomilehto, J. et al. Prevention of type 2 diabetes mellitus by changes in lifestyle among subjects with impaired glucose tolerance. New Engl. J. Medicine 344, 1343–1350, https://doi.org/10.1056/NEJM200105033441801 (2001).

7. 7.

Vermunt, P. et al. A lifestyle intervention to reduce type 2 diabetes risk in dutch primary care: 2.5-year results of a randomized controlled trial. Diabet. Medicine 29, e223–e231, https://doi.org/10.1111/j.1464-5491.2012.03648.x (2012).

8. 8.

Costa, B. et al. Delaying progression to type 2 diabetes among high-risk Spanish individuals is feasible in real-life primary healthcare settings using intensive lifestyle intervention. Diabetol. 55, 1319–1328, https://doi.org/10.1007/s00125-012-2492-6 (2012).

9. 9.

Nilsen, V., Bakke, P. S. & Gallefoss, F. Effects of lifestyle intervention in persons at risk for type 2 diabetes mellitus-results from a randomised, controlled trial. BMC Public Heal. 11, 893–902, https://doi.org/10.1186/1471-2458-11-893 (2011).

10. 10.

Li, G. et al. Cardiovascular mortality, all-cause mortality, and diabetes incidence after lifestyle intervention for people with impaired glucose tolerance in the Da Qing Diabetes Prevention Study: A 23-year follow-up study. The lancet Diabetes & endocrinology 2, 474–480, https://doi.org/10.1016/S2213-8587(14)70057-9 (2014).

11. 11.

Centers for Disease Control and Prevention. National diabetes statistics report, https://www.cdc.gov/diabetes/pdfs/data/statistics/national-diabetes-statistics-report.pdf (2017).

12. 12.

Huang, E. S., Basu, A., O’grady, M. & Capretta, J. C. Projecting the future diabetes population size and related costs for the U.S. Diabetes care 32, 2225–2229, https://doi.org/10.2337/dc09-0459 (2009).

13. 13.

Zhuo, X., Zhang, P. & Hoerger, T. J. Lifetime direct medical costs of treating type 2 diabetes and diabetic complications. Am. J. Prev. Medicine 45, 253–261, https://doi.org/10.1016/j.amepre.2013.04.017 (2013).

14. 14.

Seuring, T., Archangelidi, O. & Suhrcke, M. The economic costs of type 2 diabetes: A global systematic review. Pharmacoeconomics 33, 811–831, https://doi.org/10.1007/s40273-015-0268-9 (2015).

15. 15.

Leal, J., Khurshid, W., Pagano, E. & Feenstra, T. Computer simulation models of Prediabetes populations: A systematic review protocol. BMJ open 7, e014954, https://doi.org/10.1136/bmjopen-2016-014954 (2017).

16. 16.

Svetkey, L. P. et al. Premier: A clinical trial of comprehensive lifestyle modification for blood pressure control: Rationale, design and baseline characteristics. Annals Epidemiol. 13, 462–471, https://doi.org/10.1016/S1047-2797(03)00006-1 (2003).

17. 17.

Ackermann, R. T. et al. A randomized comparative effectiveness trial for preventing type 2 diabetes. Am. J. Public Heal. 105, 2328–2334, https://doi.org/10.2105/AJPH.2015.302641 (2015).

18. 18.

Katula, J. A. et al. One-year results of a community-based translation of the Diabetes Prevention Program: Healthy-Living Partnerships to Prevent Diabetes (HELP PD) Project. Diabetes Care 34, 1451–1457, https://doi.org/10.2337/dc10-2115 (2011).

19. 19.

McDowell, M. A. & National Center for Health Statistics. Anthropometric reference data for children and adults: U.S. population, 1999–2002, https://www.cdc.gov/nchs/data/ad/ad361.pdf (2005).

20. 20.

McDowell, M. A. & National Center for Health Statistics. Anthropometric reference data for children and adults: United states, 2003–2006, https://www.cdc.gov/nchs/data/nhsr/nhsr010.pdf (2008).

21. 21.

Fryar, C. D., Gu, Q., Ogden, C. L. & National Center for Health Statistics. Anthropometric reference data for children and adults: United States, 2007–2010, https://www.cdc.gov/nchs/data/series/sr_11/sr11_252.pdf (2012).

22. 22.

U.S. Census Bureau. Resident population estimates of the united states by age and sex: April 1, 1990 to july 1, 1999, with short-term projection to november 1, 2000, https://www.census.gov/population/estimates/nation/intfile2-1.txt (2001).

23. 23.

Schoenborn, C. A., Adams, P. F., Barnes, P. M. & National Center for Health Statistics. Body weight status of adults: United States, 1997–98, https://www.cdc.gov/nchs/data/ad/ad330.pdf (2002).

24. 24.

Centers for Disease Control and Prevention. Rates of diagnosed diabetes per 100 civilian, non-institutionalized population, by age, united states, 1980–2014, http://www.cdc.gov/diabetes/statistics/prev/national/figbyage.htm (2015).

25. 25.

Anderson, R. N. & National Center for Health Statistics. United states life tables, 1997, https://www.cdc.gov/nchs/data/nvsr/nvsr47/nvs47_28.pdf (1999).

26. 26.

Yang, X. et al. Development and validation of an all-cause mortality risk score in type 2 diabetes: The hong kong diabetes registry. Arch. Intern. Medicine 168, 451–457, https://doi.org/10.1001/archinte.168.5.451 (2008).

27. 27.

Narayan, K. V., Boyle, J. P., Thompson, T. J., Sorensen, S. W. & Williamson, D. F. Lifetime risk for diabetes mellitus in the United States. Jama 290, 1884–1890, https://doi.org/10.1001/jama.290.14.1884 (2003).

28. 28.

Sonnenberg, F. A. & Beck, J. R. Markov models in medical decision making: A practical guide. Med. Decis. Mak. 13, 322–338, https://doi.org/10.1177/0272989X9301300409 (1993).

29. 29.

The Diabetes Prevention Program Research Group and others. Costs associated with the primary prevention of type 2 diabetes mellitus in the diabetes prevention program. Diabetes Care 26, 36–47, https://doi.org/10.2337/diacare.26.1.36 (2003).

30. 30.

Lawlor, M. S. et al. Cost of a group translation of the diabetes prevention program. Am. journal preventive medicine 44, https://doi.org/10.1016/j.amepre.2012.12.016 (2013).

31. 31.

Ackermann, R. T. & Marrero, D. G. Adapting the diabetes prevention program lifestyle intervention for delivery in the community. The Diabetes Educ. 33, 69–78, https://doi.org/10.1177/0145721706297743 (2007).

32. 32.

Vojta, D., Koehler, T. B., Longjohn, M., Lever, J. A. & Caputo, N. F. A coordinated national model for diabetes prevention: linking health systems to an evidence-based community program. Am. journal preventive medicine 44, S301–S306, https://doi.org/10.1016/j.amepre.2012.12.018 (2013).

33. 33.

Katula, J. A. et al. The healthy living partnerships to prevent diabetes study: 2-year outcomes of a randomized controlled trial. Am. journal preventive medicine 44, S324–S332, https://doi.org/10.1016/j.amepre.2012.12.015 (2013).

34. 34.

Murphy, S. L., Xu, J., Kochanek, K. D. & National Center for Health Statistics. Deaths: final data for 2010, https://www.cdc.gov/nchs/data/nvsr/nvsr61/nvsr61_04.pdf (2013).

35. 35.

Thorpe, K. E. & Yang, Z. Enrolling people with prediabetes ages 60–64 in a proven weight loss program could save medicare \$7 billion or more. Heal. Aff. 30, 1673–1679, https://doi.org/10.1377/hlthaff.2010.0944 (2011).

## Acknowledgements

This research is partly funded by the National Science Foundation of the United States award number 1651912 and Clemson University HEHD Interdisciplinary Research Innovations.

## Author information

A.K. study and concept design, paper writing, L.S. study design, paper writing, A.N. simulation design, paper writing, H.N. simulation design, L.C. study and concept design, paper writing. All authors reviewed the manuscript.

## Ethics declarations

### Competing Interests

The authors declare no competing interests.

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

## Rights and permissions

Reprints and Permissions