Figure 1. Conceptual framework for labor and postpartum management of women with gestational diabetes mellitus
The Agency for Healthcare Research and Quality (AHRQ), through its Evidence-Based Practice Centers (EPCs), sponsors the development of evidence reports and technology assessments to assist public- and private-sector organizations in their efforts to improve the quality of health care in the United States. The reports and assessments provide organizations with comprehensive, science-based information on common, costly medical conditions and new health care technologies. The EPCs systematically review the relevant scientific literature on topics assigned to them by AHRQ and conduct additional analyses when appropriate prior to developing their reports and assessments.
To bring the broadest range of experts into the development of evidence reports and health technology assessments, AHRQ encourages the EPCs to form partnerships and enter into collaborations with other medical and research organizations. The EPCs work with these partner organizations to ensure that the evidence reports and technology assessments they produce will become building blocks for health care quality improvement projects throughout the Nation. The reports undergo peer review prior to their release.
AHRQ expects that the EPC evidence reports and technology assessments will inform individual health plans, providers, and purchasers as well as the health care system as a whole by providing important information to help improve health care quality.
We welcome comments on this evidence report. They may be sent by mail to the Task Order Officer named below at: Agency for Healthcare Research and Quality, 540 Gaither Road, Rockville, MD 20850, or by e-mail to epc@ahrq.gov.
Carolyn M. Clancy, M.D.
Director
Agency for Healthcare Research and Quality
Beth A. Collins Sharp, R.N., Ph.D.
Director, EPC Program
Agency for Healthcare Research and Quality
Jean Slutsky, P.A., M.S.P.H.
Director, Center for Outcomes and Evidence
Agency for Healthcare Research and Quality
Shilpa H. Amin, M.D., M.Bsc.
EPC Program Task Order Officer
Agency for Healthcare Research and Quality
The Evidence-based Practice Center thanks Gabriel Lai and Haeseong Park for their assistance with article reviewing and data entry; Karen Robinson for her guidance on the search strategy; Allison Jonas for her assistance with final preparations of the report; Deborah McClellan for copy editing; and Renee Wilson and Michael Oladubu for their assistance with the study. We would also like to acknowledge our technical expert panel:
E. Albert Reece, M.D., Ph.D., M.B.A.
Dean & Vice President for Medical Affairs
University of Maryland School of Medicine
Baltimore, MD
Richard Hellman, M.D.
Clinical Professor of Medicine
University of Missouri - Kansas City School of Medicine
Kansas City, MO
Troy Flint Porter, M.D.
University of Utah School of Medicine and Intermountain Health Care
Salt Lake City, UT
Jean M. Lawrence, Sc.D., M.P.H.
Research Scientist II/Epidemiologist
Research and Evaluation
Kaiser Permanente Southern California
Pasadena, CA
Objectives: We focused on four questions: (1) What are the risks and benefits of an oral diabetes agent (i.e., glyburide), as compared to all types of insulin, for gestational diabetes? (2) What is the evidence that elective labor induction, cesarean delivery, or timing of induction is associated with benefits or harm to the mother and neonate? (3) What risk factors are associated with the development of type 2 diabetes after gestational diabetes? (4) What are the performance characteristics of diagnostic tests for type 2 diabetes in women with gestational diabetes?
Data Sources: We searched electronic databases for studies published through January 2007. Additional articles were identified by searching the table of contents of 13 journals for relevant citations from August 2006 to January 2007 and reviewing the references in eligible articles and selected review articles.
Review Methods: Paired investigators reviewed abstracts and full articles. We included studies that were written in English, reported on human subjects, contained original data, and evaluated women with appropriately diagnosed gestational diabetes. Paired reviewers performed serial abstraction of data from each eligible study. Study quality was assessed independently by each reviewer.
Main Results: The search identified 45 relevant articles. The evidence indicated that (1) maternal glucose levels do not differ substantially in those treated with insulin versus insulin analogues or oral agents; (2) average infant birth weight may be lower in mothers treated with insulin than with glyburide; (3) induction at 38 weeks may reduce the macrosomia rate, with no increase in cesarean delivery rates; (4) anthropometric measures, fasting blood glucose (FBG), and 2-hour glucose value are the strongest risk factors associated with development of type 2 diabetes; (5) FBG had high specificity, but variable sensitivity, when compared to the 75-gm oral glucose tolerance test (OGTT) in the diagnosis of type 2 diabetes after delivery.
Conclusions: The evidence suggests that benefits and a low likelihood of harm are associated with the treatment of gestational diabetes with an oral diabetes agent or insulin. The effect of induction or elective cesarean on outcomes is unclear. The evidence is consistent that anthropometry identifies women at risk of developing subsequent type 2 diabetes; however, no evidence suggested the FBG out-performs the 75-gm OGTT in diagnosing type 2 diabetes after delivery.
Gestational diabetes mellitus (gestational diabetes), one of the most common medical complications of pregnancy, is defined as carbohydrate intolerance of variable degree, with an onset or first recognition occurring during pregnancy. Of the estimated 4 million births annually in the United States, gestational diabetes affects approximately 200,000 (7 percent), depending on the criteria (diagnostic test and threshold values) chosen for diagnosis. Initial diagnostic criteria for gestational diabetes were based on the ability to identify women at risk of developing type 2 diabetes, since 15 to 60 percent of women with gestational diabetes develop type 2 diabetes mellitus within 5 to 15 years of delivery. Therefore, the diagnosis and subsequent management of gestational diabetes after delivery has important implications for the prevention of type 2 diabetes. Questions remain, however, about the optimal ways to assess the postpartum risk of diabetes and to screen women for diabetes after a diagnosis of gestational diabetes has been made.
Equally important, gestational diabetes is associated with both maternal and infant complications, including maternal and neonatal hypoglycemia and complications of macrosomia, such as birth trauma and cesarean delivery. Treatment recommendations for gestational diabetes are based primarily on evidence from early trials suggesting that insulin treatment can reduce the incidence of macrosomia. To date, relatively little work has been done to synthesize more recent evidence regarding the management of maternal glucose or physicians' decisions to recommend elective labor induction or cesarean delivery in women with gestational diabetes.
Furthermore, while there is substantial literature regarding risk factors for type 2 diabetes, there has been no comprehensive review of these risk factors or the relative magnitude of their associations with type 2 diabetes. Finally, little work has been done to investigate the performance of postpartum glucose testing in women with gestational diabetes or to analyze the effect of performing the tests at different time intervals following delivery on the relative performance of current screening modalities.
Because of the broad clinical and public health policy implications of the management of women with gestational diabetes, the American College of Obstetricians and Gynecologists (ACOG) requested an evidence report from the Agency for Healthcare Research and Quality (AHRQ) through the Evidence-based Practice Center program (EPC) to systematically and critically examine the literature on specific aspects of the management of gestational diabetes. We were guided in our key questions and outcomes of interest by the ACOG, the AHRQ, and our panel of technical experts.
Our key questions were:
What is the evidence for the risks and benefits of oral diabetes agents (e.g., second-generation sulfonylureas and metformin), as compared to all types of insulin, for both the mother and neonate in the treatment of women with gestational diabetes?
How does maternal outcome vary based on the level of glucose at the initiation of a medication?
How does neonatal outcome vary based on the level of glucose at the initiation of a medication?
Maternal outcomes
| Neonatal outcomes
|
What is the evidence that elective cesarean delivery or the choice of timing of induction in women with gestational diabetes results in beneficial or harmful maternal and neonatal outcomes?
What is the evidence for elective cesarean delivery at term, as compared to an attempt at vaginal delivery (spontaneous or induced) at term, with regard to beneficial or harmful maternal and neonatal outcomes in gestational diabetes?
cesarean versus spontaneous labor and vaginal delivery
cesarean versus induced labor and vaginal delivery
cesarean versus any attempt at vaginal delivery at term
What is the evidence for labor induction at 40 weeks, as compared to labor induction at an earlier gestational age (less than 40 weeks) or spontaneous labor, with regard to beneficial or harmful maternal and neonatal outcomes in gestational diabetes?
labor induction at less than 40 weeks versus labor induction at 40 weeks
labor induction at 40 weeks versus spontaneous labor
labor induction at less than 40 weeks versus spontaneous labor
How is the estimated fetal weight (EFW) related to outcomes of management of gestational diabetes with elective cesarean delivery or the timing (i.e., gestational age range) of labor induction?
How is gestational age related to outcomes of management of gestational diabetes with elective cesarean delivery or the choice of timing (i.e., gestational age range) of labor induction?
Maternal outcomes
| Neonatal outcomes
|
What risk factors, including but not limited to family history, physical activity, pre-pregnancy weight, and gestational weight gain, are associated with short-term and long-term development of type 2 diabetes following a pregnancy with gestational diabetes?
What are the performance characteristics (sensitivity, specificity, and reproducibility) of tests for diagnosing type 2 diabetes after pregnancy in patients with a history of gestational diabetes? Are there differences in the performance characteristics of the test results based on subgroup analysis?
We identified the primary literature on labor and postpartum management of gestational diabetes and the association with maternal and neonatal outcomes through a comprehensive search plan that included electronic and hand searching. We ran searches of the following databases for the specified periods of time: MEDLINE® (1950 through January 2007), EMBASE® (1974 through January 2007), The Cochrane Central Register of Controlled Trials (CENTRAL; Issue 1, 2007), and the Cumulative Index to Nursing & Allied Health Literature (CINAHL®; 1982 through January 2007). Hand searching for relevant citations took several forms. From our electronic search, we identified the 13 journals (see Appendix B a) that were most likely to publish articles on this topic. We scanned the table of contents of each issue of these journals for relevant articles from August 2006 through January 2007. For the second form of hand searching, reviewers received eligible articles and flagged references of interest for the team to compare to the existing database.
Two independent reviewers conducted title scans in a parallel fashion. If either reviewer thought that a title was potentially eligible, its abstract was reviewed. If the abstract was deemed to meet the inclusion criteria by two reviewers, the abstract was included in our article review. Any differences of opinion were resolved by the two primary reviewers or by a third independent reviewer.
Each eligible article underwent double review by study investigators. A primary reviewer completed all data abstraction forms, and a second reviewer confirmed the first reviewer's data abstraction forms for completeness and accuracy. The reviewers assessed study quality independently. Reviewer pairs were formed to include personnel with both clinical and methodological expertise. A third reviewer re-reviewed a random sample of articles by the first two reviewers to ensure consistency in the abstraction of the articles.
We used several study quality assessment tools, based on the study design of the articles included in the review. Our dual, independent review of article quality judged articles on several aspects of each study type's internal validity. Quality assessment of trials for Key Questions 1 and 2 was based on the Jadad criteria and included: (1) whether the study was randomized, (2) the appropriateness of the randomization scheme, (3) whether the study was blinded, (4) the appropriateness of the blinding, and (5) the description of withdrawals and drop-outs. For each trial, we created a score between 5 (high quality) and 0 (low quality). Quality assessment of observational studies for Key Questions 1, 2, and 3 was designed by selecting key elements from the Standards for Reporting of Observational Studies (STROBE) checklist for reporting observational studies. The STROBE checklist is based on the consensus of 27 participants of the Meta-analysis of Observational Studies in Epidemiology (MOOSE) group and includes recommendations for standards for individual studies with regard to the presentation of the study hypotheses, eligibility criteria, study population, power and sample size calculations, definitions of outcomes, and description of loss to followup and missing data. Quality assessment of the diagnostic test studies for Key Question 4 was designed by selecting key elements from the Standards for Reporting of Diagnostic Accuracy (STARD) Initiative and included items about reporting the sampling design, describing the lost-to-followup, reporting diagnostic accuracy, verifying positive and negative tests equally, interpreting the tests independently, reporting reproducibility, and reporting subgroup analyses.
Based on the quantity, quality, and consistency of the studies, we graded the overall body of evidence for each of the key questions using the evidence-grading scheme recommended by the GRADE Working Group.
We conducted meta-analyses when there were sufficient data (three or more studies) and the studies were homogeneous with respect to key variables (population characteristics, study duration, intervention/exposure/comparison tests, and length of followup). When the data were not sufficient to allow us combine the studies in a meta-analysis, we prepared a qualitative summary of the results.
We retrieved 11,400 unique citations from our original search. After reviewing the titles and abstracts, 552 were deemed eligible for further review, and the full articles were retrieved. A total of 45 articles were ultimately included in this review.
What is the evidence for the risks and benefits of oral diabetes agents (e.g., second-generation sulfonylureas and metformin), as compared to all types of insulin, for both the mother and neonate in the treatment of women with gestational diabetes?
How does maternal outcome vary based on the level of glucose at the initiation of a medication?
How does neonatal outcome vary based on the level of glucose at the initiation of a medication?
We identified eight randomized controlled trials (RCTs) with a total of 845 participants that met our inclusion criteria for review: Three trials compared insulin to glyburide; two trials compared insulin to insulin lispro; one trial compared long-acting to short-acting insulin; one trial compared four-times-daily insulin to two-times daily insulin; and one trial compared diet to insulin.
Two small trials and one large trial (404 women) reported no significant difference in maternal glucose control or rates of cesarean delivery between the insulin and glyburide groups.
A meta-analysis of the three RCTs comparing insulin and glyburide showed that treatment with insulin was associated with a lower mean infant birth weight when compared to glyburide (weighted mean difference: -93 grams [gm]) (95 percent confidence interval [CI]: -191 to 5 gm), but the difference was small and not statistically significant.
The largest trial reported no difference in the proportion of infants with hypoglycemia (9 percent with glyburide as compared to 6 percent with insulin therapy [p = 0.25]). A smaller trial reported a significantly higher percentage of infants with hypoglycemia in the glyburide group than in the insulin or acarbose groups (33 percent compared to 4 percent and 5 percent, respectively; p = 0.006).
Four observational studies (N = 911 women) compared the effects of insulin and glyburide on maternal and neonatal outcomes.
Due to potential selection bias, loss to followup, and the lack of any power analysis to estimate detectable effect sizes, none of the observational studies was deemed strong enough to justify a modification of the conclusions drawn from the RCTs.
We graded the overall evidence comparing insulin and glyburide as very low.
We identified two RCTs that compared insulin lispro to insulin. It appeared that insulin lispro might be associated with tighter maternal glucose control than regular insulin, but there were only limited data to support this conclusion.
Both RCTs reported similar rates of cesarean delivery among women in the insulin lispro group, as compared to the insulin group.
No evidence existed to suggest that neonatal outcomes differ between women treated with insulin lispro and those treated with regular insulin.
We graded the strength of the evidence comparing insulin to insulin lispro as very low.
One RCT (N = 23 women) reported that long-acting insulin was associated with a higher proportion of infants with macrosomia when compared to short-acting insulin. No difference in birth trauma or metabolic abnormalities was found, but this study was not adequately powered to detect differences in these outcomes.
There was insufficient evidence to allow us to draw any conclusions regarding maternal outcomes.
One RCT (N = 274 women) reported that twice-daily insulin was associated with a higher proportion of hypoglycemia (6 percent versus 1 percent; p = 0.002) and hyperbilirubinemia (21 percent versus 11 percent; p = 0.002) when compared to four-times-daily insulin. No evidence existed to suggest a difference in maternal glucose levels or cesarean delivery between twice-daily and four-times-daily use of insulin.
We identified only one RCT (N = 95 women) that reported lower rates of macrosomia (5.9 percent versus 26.5 percent, respectively; p = 0.005) and lower infant birth weights (p = 0.002) for those using insulin plus dietary management versus those treated with diet alone.
We graded the overall evidence regarding comparisons of diet plus insulin to diet alone as very low, given that only one small RCT met our inclusion criteria.
We found no evidence to indicate whether the relative effect of different treatment approaches on maternal and infant outcomes varied with the level of glucose at the initiation of medical therapy.
We expect that the ongoing Hyperglycemia and Adverse Pregnancy Outcomes (HAPO) Study, an observational study of 23,325 pregnant women, should provide data on maternal and neonatal outcomes at various maternal glucose levels and potentially indicate thresholds at which medical therapy should be initiated.
We found no RCTs or observational studies comparing metformin to insulin in gestational diabetics that met our criteria for review. The Metformin in Gestational Diabetes (MiG) Trial is an ongoing randomized trial of over 500 women that should provide future insight into the benefits and risks of metformin use throughout pregnancy.
There were insufficient data regarding the teratogenic effects of intrauterine exposure to metformin or its potential effect on infant growth and motor development.
What is the evidence that elective cesarean delivery or the choice of timing of induction in women with gestational diabetes results in beneficial or harmful maternal and neonatal outcomes?
One RCT and seven observational studies evaluated two of our five maternal outcomes of interest and 11 of the 12 neonatal outcomes of interest.
One RCT of 200 women reported that elective induction at 38 weeks of gestation, as compared to expectant management (induction at 42 weeks' gestation or if the EFW was 4,200 gm or greater), reduced infant birth weight and the rate of macrosomia but did not alter other maternal or neonatal outcomes, including the rate of cesarean delivery.
Two observational studies of low quality also reported a reduction in infant birth weight or rates of macrosomia in women induced at 38 weeks of gestation, as compared to historical controls.
Five additional observational studies examined the effects of various delivery management protocols, but each had serious limitations, including reliance on historical controls, no adjustment for potential confounders, or no adjustment or stratified analysis based on severity of gestational diabetes (class A1 [diet-controlled] versus class A2 [insulin-controlled]). In addition, the studies covered a wide time period (4–19 years), with no adjustment for changes in clinical practice.
We were unable to draw firm conclusions from the limited data available.
We graded the overall strength of the evidence as very low, given the limited number of RCTs and the serious design limitations in the conduct of the observational studies.
What risk factors, including but not limited to family history, physical activity, pre-pregnancy weight, and gestational weight gain, are associated with short-term and long-term development of type 2 diabetes following a pregnancy with gestational diabetes?
We developed an a priori list of risk factors for type 2 diabetes, based on guidance from the AHRQ, the ACOG, and members of our technical panel, and then grouped these risk factors into nine categories:
Family history of type 2 diabetes
Sociodemographics
Lifestyle factors
Parity
Pregnancy-related factors
Postpartum factors
Measures of anthropometry
Oral contraceptives
Physiological measures
Sixteen studies evaluated at least one risk factor and reported adjusted measures of association.
We identified 11 cohort studies that evaluated the relationship between 11 different anthropometric measures and the development of type 2 diabetes; 8 studies reported adjusted measures of association from multivariate models.
Seven of the eight studies that evaluated anthropometric measures (pre-pregnancy body mass index [BMI], pregnancy BMI, weight, waist-to-hip ratio) using multivariate analysis reported that these measures were positively associated with the risk of developing type 2 diabetes.
We graded the evidence on anthropometric measures and the risk of developing type 2 diabetes as moderate because of the inconsistency in the anthropometric measures used across the studies.
We identified five studies that included family history of type 2 diabetes in the multivariate analysis, but only one study reported the actual magnitude of the association of family history with the risk of type 2 diabetes, and this association was not statistically significant (relative risk [RR] = 1.7; 95 percent CI: 0.6 to 4.6).
We graded the evidence on family history as very low because only one study reported the actual measure of association.
Five studies assessed age as a risk factor for developing type 2 diabetes among gestational diabetics, but only one study reported the actual measure of association; women who were 30 years of age and older at diagnosis of gestational diabetes had a higher likelihood of developing type 2 diabetes, but the relative risk was not statistically significant.
We did not identify any studies of lifestyle behaviors that met our criteria for inclusion in this review.
Gestational age at diagnosis of gestational diabetes was inversely associated with a higher likelihood of the development of type 2 diabetes, but the modeling of gestational age varied across studies and therefore limited our ability to synthesize the data.
Two studies evaluated the association between the use of progesterone-only contraception or combination oral contraception (estrogen and progesterone) and the risk of developing type 2 diabetes in women with gestational diabetes. One study reported a two-fold increase in the risk of developing diabetes with the use of progestin-only oral contraceptives as compared to combination oral contraception; one study reported no increased risk in women using depo-medroxyprogesterone acetate as compared to combined oral contraceptives.
FBG, 2-hour glucose value, and the area under the curve from the diagnostic antepartum oral glucose tolerance test (OGTT) were associated with a significantly higher risk of developing type 2 diabetes in women with gestational diabetes.
What are the performance characteristics (sensitivity, specificity, and reproducibility) of tests for diagnosing type 2 diabetes after pregnancy in patients with a history of gestational diabetes? Are there differences in the performance characteristics of the test results based on subgroup analysis?
We identified eight studies that reported 10 evaluations of the performance of a reference test versus a comparison (screening) test for the diagnosis of type 2 diabetes in the postpartum period.
Our review yielded three general comparisons: (1) two different diagnostic threshold values applied to the 75-gm OGTT (the World Health Organization [WHO] 1985 criterion compared with the WHO 1999 criterion), (2) FBG level greater than 7.0 mmol/L (126 mg/dL) (the American Diabetes Association [ADA] 1997) compared to the 75-gm OGTT (WHO 1999), and (3) FBG greater than 7.0 mmol/L (126 mg/dL) (ADA 1997) compared to the 75-gm OGTT (WHO 1985).
The sensitivity for the FBG greater than 7.0 mmol/L (126 mg/dL) alone as compared with a complete OGTT using the same FBG threshold (FBG greater than 7.0 mmol/L (126 mg/dL) or a 2-hr plasma glucose level after 75-g OGTT greater than 11.1 mmol/L (200 mg/dL) varied across the three studies, ranging from 46 to 89 percent.
With a threshold greater than 7.0 mmol/L, the FBG had high specificity when compared to the 75-gm OGTT but had highly variable sensitivity.
No studies included in this review reported measures of reproducibility.
We graded the strength of the evidence regarding postpartum screening for type 2 diabetes as very low because of the limited number of studies within each category of comparisons and the heterogeneity in the study populations.
Although the overall quality of the evidence was very low, we were able to draw some conclusions regarding treatment options for maternal glucose control, the timing and method of delivery, the risk factors for the development of type 2 diabetes, and the performance characteristics of screening tests conducted in the postpartum period to identify those who have developed type 2 diabetes.
When patients ask about the effect of the use of insulin analogues or glyburide as compared to insulin, clinicians should be aware that little clinical difference has been demonstrated in the infant birth weights associated with the use of these three regimens. Clinicians should also be aware that while the use of an alternative to regular insulin is unlikely to result in any adverse maternal or infant outcomes, there were insufficient data to allow us to determine whether insulin analogues or glyburide are more efficacious than regular insulin in achieving maternal glucose targets. Also, there was no evidence supporting a difference in terms of the prevention of episodes of maternal or neonatal hypoglycemia. To date, only insulin has been approved by the Food and Drug Administration for use in gestational diabetes. Because of the limited data available, it was unclear what glucose thresholds should be used to initiate treatment with insulin, insulin analogues, or glyburide in patients who are being treated with diet alone. Furthermore, there were insufficient data regarding the potential benefits or risk of metformin use.
There was also insufficient evidence to permit us to develop guidelines for elective labor induction or cesarean delivery in women with gestational diabetes. Well-designed clinical trials are needed to provide a stronger base of evidence for the management of gestational diabetes.
Based on multivariate models, measures of obesity appeared to be the strongest risk factor for type 2 diabetes in women with gestational diabetes. We have concluded that there are insufficient data to justify recommending alternative tests to the 75-gm OGTT for the detection of type 2 diabetes in women with gestational diabetes. Further studies conducted in diverse populations and high-risk subgroups and incorporating measures of reproducibility will help to move this area of investigation toward the development of clinically acceptable testing guidelines.
This review has several important limitations. First, the heterogeneous nature of the studies prevented a quantitative summary of much of the data. For Key Question 1, we were able to provide a summary measure of the weighted mean difference in infant birth weight in the three RCTs comparing insulin and glyburide. However, the pooled estimate provided data on only one of several important maternal and neonatal outcomes related to medical treatment in gestational diabetics. We were unable to conduct additional analysis because the number of trials comparing similar treatments was very limited. Also, maternal and neonatal outcomes were not consistent across studies. Few of the same outcome measures were included in two or more studies, and the definitions of outcomes varied across studies. Our review of five observational studies comparing of glyburide and insulin did not alter our limited conclusions from the RCTs. We were further limited by a lack of data on the potential risk of glyburide use. We were unable to provide evidence on the potential risks and benefits of metformin because of a lack of published studies that met our inclusion criteria.
We were also unable to draw substantial conclusions from our review of seven observational studies and a single RCT on elective induction and cesarean delivery. The observational studies had serious limitations, with no adjustment for potential confounders, severity of gestational diabetes, or variation in the definitions of major outcomes. There was substantial heterogeneity in the study populations and the time periods of these observational studies.
The lack of multivariate analysis in some studies, as well as inconsistencies in the covariates included in the multivariate models of other studies, made it difficult to compare results across the 16 studies on risk factors for type 2 diabetes. We found limited evidence for the magnitude of association of traditional risk factors (sociodemographics, parity, family history) with the development of type 2 diabetes in women with gestational diabetes. We were also limited by the lack of availability of any studies on the possible relationship of lifestyle behaviors to the risk of type 2 diabetes.
Finally, heterogeneity in the study populations and the time intervals of postpartum testing made it difficult to draw firm conclusions regarding the effectiveness of postpartum glucose screening in women with a history of gestational diabetes.
Researchers should focus on conducting studies that will lead to the development of evidenced-based guidelines for maternal glucose control in gestational diabetes and physician recommendations for labor induction, elective cesarean, or expectant management.
Well-designed RCTs with a priori hypotheses, power analysis, appropriate effect sizes, and intention-to-treat analysis can provide better data on treatment efficacy.
Consistency in the definition and collection of maternal and infant outcome measures is essential to our ability to draw confident conclusions about potential benefits and harms of treatment options among women with gestational diabetes.
The best evidence for delivery management in women with gestational diabetes will be garnered from the conduct of well-designed RCTs comparing elective induction and cesarean delivery to expectant management. Alternatively, observational studies with consistency in outcomes measures and multivariate adjustment for potential confounders can provide important, relevant information.
Those conducting longitudinal studies of women with a history of gestational diabetes should develop and follow standard protocols for retention in an effort to improve followup rates. Future studies should collect data on pertinent covariates and adjust for relevant confounders in multivariate analysis.
Studies measuring the sensitivity, specificity, and reproducibility of screening tests for type 2 diabetes in women with a history of gestational diabetes can help physicians in the early identification of women with type 2 diabetes and avoid potential medical complications of diabetes.
A comparison of screening and reference tests in certain subgroups (i.e., those with a family history type 2 diabetes or prior gestational diabetes) is also warranted.
In order to develop broadly acceptable guidelines for postpartum screening for type 2 diabetes in women with prior gestational diabetes, additional research should be conducted to assess test reproducibility as well as test performance based on varying intervals of postpartum screening.
The American College of Obstetricians and Gynecologists (ACOG) has requested an evidence report from the Agency for Healthcare Research and Quality (AHRQ) through the Evidence-based Practice Center program (EPC) to systematically and critically examine the literature on specific aspects of the management of gestational diabetes mellitus (gestational diabetes). With the ongoing increase in obesity and sedentary lifestyles, the prevalence of diabetes mellitus among reproductive-aged women is rising, both globally and in the United States.1 There are currently 1.85 million reproductive-aged women in the United States with gestational diabetes, type 1 or type 2 diabetes mellitus (type 2 diabetes), or glucose intolerance.2 Gestational diabetes, the most common medical complication of pregnancy, is defined as carbohydrate intolerance of variable degree, with an onset or first recognition occurring during pregnancy. Population-based studies estimate that gestational diabetes affects about 200,000 (7 percent) of the over 4 million births occurring annually in the United States and is associated with both maternal and neonatal complications.3–5 Furthermore, women with gestational diabetes are at high risk for future diabetes; 15 to 60 percent will develop type 2 diabetes mellitus within 5 to 15 years of delivery.6 Therefore, the diagnosis and subsequent management of gestational diabetes after delivery has important implications for the prevention of type 2 diabetes. A systematic review of evidence to guide decisions about glucose management, labor management, postpartum risk assessment, and screening of women with gestational diabetes would be useful for clinicians and public health officials.
In an effort to promote maternal wellbeing and avoid adverse neonatal outcomes, such as macrosomia, birth trauma, and neonatal hypoglycemia, clinical recommendations have been developed by the ACOG7 and the American Diabetes Association (ADA) for the obstetrical management of gestational diabetes. The guidelines emphasize the importance of glucose control to minimize the risk of macrosomia and its associated complications. When dietary management fails to achieve adequate glucose control, an anti-hyperglycemic medication should be used. Traditionally, insulin has been considered the gold standard for management because of the ability to achieve tight maternal glucose control without the risk of transfer of insulin across the placenta.8 However, an oral diabetes medication (i.e., glyburide) is being used increasingly in women with gestational diabetes despite the lack of approval by the Food and Drug Administration for this indication.8–10 Metformin is currently used in the non-pregnant woman with polycystic ovarian syndrome (PCOS) to treat insulin resistance and normalize ovulation.11 Metformin use in women with gestational diabetes is still in the experimental stages. Given the increasing use of different medications for gestational diabetes, it is time for a critical appraisal of the literature regarding the potential benefits and harms associated with the medications that can be used for the treatment of gestational diabetes.
To date, the evidence has been somewhat limited regarding the comparative effectiveness and safety of oral diabetes agents and insulin preparations for women with gestational diabetes. The Cochrane Collaboration has conducted a review of randomized clinical trials comparing the effects of alternative management strategies (e.g., dietary management, insulin, or an oral diabetes agent) in women with impaired glucose tolerance or gestational diabetes.12 The final analysis included three trials involving women with impaired glucose tolerance, but no trials involving women with gestational diabetes. No statistically significant differences were found in terms of cesarean delivery rates, neonatal intensive care unit admissions, or large-for-gestational age (LGA; weight greater than 90th percentile) infants among women with impaired glucose tolerance undergoing intensive treatment with insulin, as compared to those receiving dietary advice alone. Further review is needed to assess the evidence now available on the value of medical therapies for glucose control in gestational diabetes. In the current report, one of our goals was to synthesize current knowledge regarding the medical treatment benefits and harms associated with the metabolic management of gestational diabetes, by comparing insulin therapy to oral diabetes medications, including the sulfonylureas and metformin. Our maternal and neonatal outcomes of interest were chosen on the basis of established measures of maternal and infant morbidity and guidance by our team of technical experts, as described in the Methods chapter.
Both the ACOG and the ADA have provided guidelines for labor management of pregnancies complicated by gestational diabetes. The ACOG states that primary cesarean delivery may be indicated in women with gestational diabetes whose estimated fetal weight (EFW) is 4,500 grams (gm) or greater.7 The ADA recommends delivery during the 38th week, unless obstetric considerations dictate alternative management.13 Many institutions have implemented protocols for labor management of women with gestational diabetes, based largely on anecdotal or individual institutional experience. Variations in clinical management continue because patients and health care providers have differing perceptions of the potential benefits and risks of different management strategies. Neither health care providers nor patients are armed with the knowledge necessary to adequately weigh the potential benefits and harms associated with these strategies. The lack of consensus has led to controversy regarding best practices for labor management. Evidence relating to labor management can provide valuable epidemiological evidence to clinicians in daily practice as well as to professional organizations that seek to make clinical policy recommendations about the optimal delivery of obstetrical care to women with gestational diabetes.
In this report, we have systematically reviewed and summarized the available literature on outcomes associated with a range of labor management strategies, including elective induction of labor, elective cesarean delivery, and expectant management of labor. For the purposes of this report, we refer to “elective” cesarean delivery as a procedure performed after discussion between the provider and patient with the goal of avoiding adverse neonatal outcomes that occur more often in diabetic pregnancy, such as shoulder dystocia, nerve palsy, or fracture. We have also reviewed the evidence regarding the effect of gestational age and EFW on maternal and neonatal outcomes in pregnancies complicated by gestational diabetes.
There is growing interest in the effect of childbearing on the development of chronic medical conditions, including type 2 diabetes. Many studies have examined traditional risk factors for type 2 diabetes, including age, race/ethnicity, and a family history of type 2 diabetes. However, no review to date has systematically examined risk factors for type 2 diabetes in women with a history of gestational diabetes. Such a review is needed and should cover the available data on metabolic or hormonal risk factors in this population as well as emerging data on other risk factors such as homocysteine levels14 15 and glutamic acid decarboxylase (GAD) antibodies.16 A review of this body of evidence could assist policymakers in the development of guidelines targeted at primary prevention of type 2 diabetes. We have therefore systematically reviewed the evidence on risk factors for type 2 diabetes in women with gestational diabetes, assessing the magnitude of individual risk factors and study quality.
Because women with gestational diabetes are at high risk for future diabetes, postpartum testing is crucial for early diagnosis of type 2 diabetes and the prevention or delay of onset of diabetic complications. The ACOG recognizes the increased risk of diabetes in women with gestational diabetes but offers no standard recommendation for postpartum testing.7 The ADA recommends postpartum screening at 6 weeks postpartum using either a fasting blood glucose (FBG) or an oral glucose tolerance test (OGTT).13 17 Women with a normal result should be reassessed every 3 years. Women with impaired fasting glucose or impaired glucose tolerance should receive annual testing. The 4th International Workshop on Gestational Diabetes has recommended that postpartum glucose testing be performed at 6 to 12 weeks postpartum.18
Despite the general recommendation for postpartum screening, no consensus exists regarding the overall performance characteristics of the OGTT or FBG in the postpartum period or in women with a history of gestational diabetes. Emerging data suggest that many women with gestational diabetes do not receive appropriate postpartum testing,19 20 perhaps because of limited knowledge regarding the performance of the screening tests in postpartum women, differences in the recommendations by professional organizations, and the challenges posed by the 2- to 3-hour (hr) timeframe required for an OGTT for a busy new mother. Knowledge of the performance of the FBG in comparison to the standard OGTT could help to improve patient adherence to postpartum testing. In addition, evidence related to the sensitivity and specificity of screening tests for type 2 diabetes may inform the development of evidence-based guidelines by professional organizations, prevent provider confusion about the timing of testing, and facilitate provider adherence to recommendations for testing. We have therefore investigated the performance of currently used screening tests for type 2 diabetes of pregnancies for women with a history of gestational diabetes, assessing their sensitivity and specificity and summarizing the evidence with regard to reproducibility.
To improve the outcomes of pregnancies in women with gestational diabetes, several approaches should be considered, including: (1) novel approaches to maternal glycemic control; (2) modifications of the clinical assessment for timing and method of delivery; (3) identification of risk factors for subsequent development of type 2 diabetes; and (4) clarification of the performance characteristics of postpartum glucose screening tests. The use of oral diabetes agents and/or new insulin preparations, for example, might promote better glucose control, decrease maternal hypoglycemia, and reduce abnormal fetal growth. The adaptation of new guidelines for cesarean delivery and labor induction in women with diabetes might reduce the incidence of birth trauma or nerve damage (e.g., brachial plexus palsy). A better understanding of the efficiency of postpartum glucose screening tests and screening intervals might help to identify a greater number of reproductive-aged women who are at risk of type 2 diabetes and who could be targeted for primary prevention. For example, it is possible that screening women beyond the currently recommended time interval of 6 weeks after delivery might increase the sensitivity of diabetic screening protocols. A greater number of women could then receive counseling on lifestyle modifications (i.e., nutrition, exercise). Furthermore, among those women who screen negative for glucose intolerance after the index pregnancy, lifestyle modifications might reduce the risk of development of gestational diabetes in subsequent pregnancies.
As shown in our conceptual framework (see Figure 1
Maternal outcomes of interest included maternal hypoglycemia, glycemic control, pre-eclampsia, postpartum hemorrhage, maternal weight, and cesarean delivery, representing measures of maternal morbidity and quality of care. Neonatal outcomes of interest included macrosomia, neonatal hypoglycemia, birth trauma, birth weight, and respiratory distress syndrome, representing measures of neonatal morbidity and subsequent childhood wellbeing.
In Key Question 3, we examined multiple risk factors for development of type 2 diabetes. Assessment of the literature yielded two primary categories of risk factors: traditional epidemiological factors and physiological factors. As shown in the conceptual framework, these factors may be instrumental in the development of targeted interventions for this particular population of women. In Key Question 4, we assessed the performance of screening tests in detecting type 2 diabetes in women with gestational diabetes.
Our key questions were:
What is the evidence for the risks and benefits of oral diabetes agents (e.g., second-generation sulfonylureas and metformin), as compared to all types of insulin, for both the mother and neonate in the treatment of women with gestational diabetes?
How does maternal outcome vary based on the level of glucose at the initiation of a medication?
How does neonatal outcome vary based on the level of glucose at the initiation of a medication?
Maternal outcomes
| Neonatal outcomes
|
What is the evidence that elective cesarean delivery or the choice of timing of induction in women with gestational diabetes results in beneficial or harmful maternal and neonatal outcomes?
What is the evidence for elective cesarean delivery at term, as compared to an attempt at vaginal delivery (spontaneous or induced) at term, with regard to beneficial or harmful maternal and neonatal outcomes in gestational diabetes?
cesarean versus spontaneous labor and vaginal delivery
cesarean versus induced labor and vaginal delivery
cesarean versus any attempt at vaginal delivery at term
What is the evidence for labor induction at 40 weeks, as compared to labor induction at an earlier gestational age (less than 40 weeks) or spontaneous labor, with regard to beneficial or harmful maternal and neonatal outcomes in gestational diabetes?
labor induction at less than 40 weeks versus labor induction at 40 weeks
labor induction at 40 weeks versus spontaneous labor
labor induction at less than 40 weeks versus spontaneous labor
How is the EFW related to outcomes of management of gestational diabetes with elective cesarean delivery or the timing (i.e., gestational age range) of labor induction?
How is gestational age related to outcomes of management of gestational diabetes with elective cesarean delivery or the choice of timing (i.e., gestational age range) of labor induction?
Maternal outcomes
| Neonatal outcomes
|
What risk factors, including but not limited to family history, physical activity, pre-pregnancy weight, and gestational weight gain, are associated with short-term and long-term development of type 2 diabetes following a pregnancy with gestational diabetes?
What are the performance characteristics (sensitivity, specificity, and reproducibility) of tests for diagnosing type 2 diabetes after pregnancy in patients with a history of gestational diabetes? Are there differences in the performance characteristics of the test results based on subgroup analysis?
A systematic review of the evidence on labor and postpartum management of gestational diabetes can provide support for clinical guidelines, thereby arming clinicians with the knowledge necessary to provide evidenced-based, quality care to a growing population of women. For the current 200,000 pregnancies that are complicated by gestational diabetes annually in the United States, evidence-based clinical practice will be essential in promoting treatment effectiveness, evidenced-based labor management, effective assessment of risk factors for later development of type 2 diabetes in women with gestational diabetes, and efficient postpartum screening for type 2 diabetes.
The ACOG has requested an evidence report to review and synthesize published literature regarding the intrapartum management and postpartum followup of women with gestational diabetes. Our EPC established a team and a work plan to develop the evidence report. The project consisted of recruiting technical experts, formulating and refining the specific questions, performing a comprehensive literature search, summarizing the state of the literature, constructing evidence tables, synthesizing the evidence, grading the strength of the evidence, and submitting the report for peer review.
The topic for this report was nominated in a public process. At the beginning of the project, we recruited a panel of external technical experts to provide input at key steps, including the selection and refinement of the questions to be examined. The panel included external experts who have strong expertise in gestational diabetes (see Appendix A a).
We worked with the technical experts and representatives of the AHRQ and ACOG to develop the key questions that are presented in the Conceptual Framework and Key Questions section of Chapter 1 (Introduction). The key questions focused on: (1) the risks and benefits of using oral diabetes medications and any type of insulin to treat gestational diabetes affecting the mother and neonate, (2) the risks and benefits of medically indicated cesarean delivery and the choice of timing of induction for the mother and neonate, (3) the risk factors associated with the short-term and long-term development of type 2 diabetes following a pregnancy with gestational diabetes, and (4) the performance characteristics (i.e., sensitivity, specificity, and reproducibility) of tests for diagnosing type 2 diabetes when conducted in postpartum gestational diabetes patients.
Searching the literature involved identifying reference sources, formulating a search strategy for each source, and executing and documenting each search. We also searched for medical subject heading (MeSH) terms that were relevant to gestational diabetes. We used a systematic approach for searching the literature, with specific eligibility criteria, to minimize the risk of bias in selecting articles for inclusion in the review. The systematic approach was intended to help identify gaps in the published literature.
Our comprehensive search plan included electronic and hand searching. We ran searches of four databases, MEDLINE® (1950 through January 2007), EMBASE® (1974 through January 2007), The Cochrane Central Register of Controlled Trials (CENTRAL; Issue 1, 2007), and the Cumulative Index to Nursing & Allied Health Literature (CINAHL®; 1982 through January 2007), to identify primary literature on the association of intrapartum management and postpartum followup of women with gestational diabetes with various maternal and neonatal outcomes. Hand searching for possibly relevant citations took two forms. First, from our electronic search, we identified the 13 journals (see Appendix B a) that were most likely to publish articles on this topic (i.e., these journals had the highest number of abstracts and articles included in the review). We scanned the table of contents of each issue of these journals for relevant articles from August 2006 through January 2007. For the second form of hand searching, reviewers received eligible articles and flagged references of interest for the team to compare to the existing database.
Search strategies specific to each database were designed to enable the team to focus the available resources on articles that were the most likely to be relevant to the key questions. We initially developed a core strategy for MEDLINE®, accessed via PubMed®, based on an analysis of the MeSH terms and text words of key articles identified a priori. The PubMed® strategy formed the basis for the strategies developed for the other electronic databases (see Appendix C a).
The results of the searches were downloaded and imported into ProCite® version 5 (the Thompson Corporation, Stamford, CT). We used the duplication scan feature in ProCite® to delete citations already retrieved. From ProCite®, the articles were uploaded to SRS 4.0 (TrialStat! Corporation, Ottawa, Ontario, Canada), a Web-based software package developed for systematic review data management. This database was also used to store citations in portable document format (PDF) and to track the search results at the title review, abstract review, article inclusion/exclusion, and data abstraction levels. A list of excluded articles is presented in Appendix D a.
The study team scanned all titles. Two independent reviewers conducted title scans in a parallel fashion. For a title to be eliminated at this level, both reviewers had to indicate that it was obviously ineligible. If the two reviewers did not agree on the eligibility of an article, it was promoted to the next level (see Appendix E a, Title Review Form). The title review phase was designed to capture as many studies as possible that reported on the association of intrapartum management and postpartum followup of women with gestational diabetes with various maternal and neonatal outcomes. All titles that were identified as potentially addressing these issues were promoted to the abstract review phase.
| KQ1 | Maternal Outcomes
| Neonatal Outcomes
|
| KQ2 | Maternal Outcomes
| Neonatal Outcomes
|
| KQ3 |
| |
| KQ4 |
| |
dL = deciliter; FBS = fasting blood sugar; gm = gram; hr = hour; KQ = key question, LGA = large for gestational age; mg = milligrams; OGTT = oral glucose tolerance test; RDS = respiratory distress syndrome; SGA = small for gestational age; type 2 diabetes = type 2 diabetes mellitus
Because of the broad array of potentially eligible articles obtained at the abstract review phase, full articles initially selected for review underwent another independent parallel review by the investigators to determine whether the articles should be included in the full data abstraction. In addition to the exclusion criteria used for the abstract review, studies were excluded if less than 90 percent of the sample was diagnosed with gestational diabetes and there was no separate analysis for gestational diabetes patients. We limited the studies for Key Question 1 to all randomized controlled trials (RCTs) and observational studies that compared two types of treatment. For Key Question 1, we decided not to include observational studies that compared either an oral diabetes medication or insulin to diet, because most of these studies had a selection-by-indication bias (i.e., treatment was determined by the severity of the diabetes).
At this phase of the review, the investigators determined which of the key questions each article addressed (see Appendix E, Article Inclusion/Exclusion Form). If the articles were still deemed to have applicable information, they were included in the full data abstraction. Differences of opinion regarding article eligibility were resolved through consensus adjudication.
We used a systematic approach for extracting data to minimize the risk of bias in this process. By creating standardized forms for data extraction, we sought to maximize consistency in identifying all pertinent data available for synthesis.
Each article underwent double review by study investigators for full data abstraction and assessment of study quality. For all data abstracted from studies, we used a sequential review process. In this process, the primary reviewer completed all data abstraction forms. The second reviewer checked the first reviewer's data abstraction forms for completeness and accuracy. Reviewer pairs were formed to include personnel with both clinical and methodological expertise. A third reviewer re-reviewed a random sample of articles by the first two reviewers to ensure consistency in the classification of the articles. Reviewers were not masked to the articles' authors, institutions, or journal.21 In most instances, data were directly abstracted from the article. If possible, relevant data were also abstracted from figures. Differences of opinion were resolved through consensus adjudication. For assessments of study quality, each reviewer independently judged study quality and rated items on quality assessment forms (see Appendix E, Data Abstraction Review Forms).
For all included articles, reviewers abstracted information regarding the general study characteristics (e.g., exclusion criteria, study design, study period and followup, and country) and study participants (e.g., maternal age, race, weight/body mass index [BMI], parity/gravida, gestational age, method of gestational diabetes management, and the type, timing, and results of the OGTT). For articles that applied to Key Questions 1 and 2, we abstracted information on the type of intervention, the outcomes measures and the method of ascertainment, and the results of each outcome, including the measures of variability. For articles that applied to Key Question 3, we abstracted information on the diagnosis of type 2 diabetes, the length of followup, the covariates considered and included in the models, and the measure of association and variability. For articles that applied to Key Question 4, we abstracted information on the reference test, the comparison test, the length of followup, and the results of the tests.
All information from the article review process was entered into the SRS 4.0 database by the individual completing the review. Reviewers entered comments into the system whenever applicable. The SRS 4.0 database was used to maintain and clean the data, as well as to create detailed evidence tables and summary tables (see Appendix F and Summary Tables).
The study aspects considered in our quality assessment varied according to the question being addressed and the type of study design. As part of our dual, independent review of study quality, we judged articles on several aspects of each study type's internal validity. Quality assessment of trials for Key Questions 1 and 2 was based on the Jadad criteria22 and included: (1) appropriateness of the randomization scheme, (2) appropriateness of the blinding, and (3) description of withdrawals and drop-outs. For each trial, we awarded a score from 5 (high quality) to 0 (low quality). Quality assessment of observational studies for Key Questions 1, 2, and 3 involved selecting elements from the Standards for Reporting of Observational Studies (STROBE) checklist of the reporting of observational studies;23 it included items about reporting on the hypotheses, inclusion/exclusion criteria, study population, power and sample size calculations, definition of outcomes, loss to followup, and missing data. Quality assessment of the diagnostic test studies for Key Question 4 was designed by selecting elements from the Standards for Reporting of Diagnostic Accuracy (STARD) Initiative24 and included items about reporting of the sampling design, loss to followup, information about diagnostic accuracy, verification of positive and negative tests, independent interpretation of tests, reproducibility, and subgroup analyses.
For each key question, we created a set of detailed evidence tables containing all the information extracted from the eligible studies. The investigators reviewed the tables and eliminated items that were rarely reported.
We conducted meta-analyses when there were sufficient data (three or more studies) and the studies were homogeneous with respect to key variables (population characteristics, study duration, intervention/exposure/comparison tests, and length of followup). When the data were not sufficient to combine the studies in a meta-analysis, we prepared a qualitative summary of the results.
In the meta-analysis, we recorded the mean difference in infant birth weight between groups, along with its measure of dispersion. We calculated a pooled estimate (weighted mean difference) of infant birth weight from the eligible RCTs using a random effects model with the DerSimonian and Laird formula for calculating between-study variance.25 The random effects model was used because unmeasured heterogeneity was likely to exist among the trials.
We assessed heterogeneity among the trials considered for meta-analysis using a standard chi-squared test and a significance level of alpha ≤ 0.10. We also examined heterogeneity among studies with an I2 statistic, which describes the variability in effect estimates that is due to heterogeneity rather than random chance.26 A value greater than 50 percent may be considered to have substantial variability.
All statistical analyses were conducted using STATA (Intercooled, version 8.2, StataCorp, College Station, TX).
Initial data were abstracted by the investigators and entered directly into Web-based data collection forms using SRS® 4.0 (TrialStat! Corporation, Ottawa, Ontario, Canada). After a second reviewer reviewed the data, the adjudicated data were re-entered into the Web-based data collection forms by the second reviewer. Second reviewers were generally more experienced members of the research team, and one of their main priorities was to check the quality and consistency of the first reviewers' answers. In addition to the second reviewers checking the consistency and accuracy of the first reviewers, a lead investigator examined a random sample of the reviews to identify problems with the data abstraction. If problems were recognized in a reviewer's data abstraction, the problems were discussed at a meeting with the reviewers. In addition, research assistants used a system of random data checks to assure data abstraction accuracy.
At the completion of our review, we graded the quantity, quality, and consistency of the best available evidence addressing the key questions by adapting an evidence-grading scheme recommended by the GRADE Working Group.27 We assessed the strength of the study designs, with RCTs considered to be best, followed by non-randomized controlled trials and observational studies. To assess the quantity of evidence, we focused on the number of studies with the strongest design. We also assessed the quality and consistency of the best available evidence, including assessment of the limitations affecting individual study quality (using the individual study quality assessments), certainty regarding the directness of the observed effects in the studies, the precision and strength of the findings, and the availability (or lack) of data to answer the key question. We classified evidence bodies pertaining to the key questions into the following categories: (1) “high” grade, indicating confidence that further research is very unlikely to change our confidence in the estimated effect in the abstracted literature; (2) “moderate” grade, indicating that further research is likely to have an important impact on our confidence in the estimates of effects and may change the estimates in the abstracted literature; (3) “low” grade, indicating the further research is very likely to have an important impact on confidence in the estimates of effects and is likely to change the estimates in the abstracted literature; (4) “very low” grade, indicating any estimate of effect is very uncertain; and (5) “insufficient” grade, indicating the lack of enough evidence to make any estimate of effect.
A draft of the completed report was sent to the technical experts and peer reviewers, as well as to the representatives of AHRQ. In response to the comments of the technical experts, peer reviewers, and AHRQ, revisions were made to the evidence report, and a summary of the comments and their disposition was submitted to AHRQ.
We present the findings of our review using a standard format for each of the four key questions. First, we present the conceptual framework for each question, incorporating relevant background information and potential implications for clinical practice. Next, we summarize the population characteristics of each study. We then summarize the findings, emphasizing those results that are most relevant to our conceptual framework. We outline the methodological issues related to the heterogeneity of study design and outcome analyses and then summarize our assessment of the quality of each study using established quality criteria published in the literature. Finally, we assign a grade to the overall body of evidence on each question or sub-question.
A summary of the search results for the primary literature review is presented in Figure 2
What is the evidence for the risks and benefits of oral diabetes agents (e.g., second-generation sulfonylureas and metformin), as compared to all types of insulin, for both the mother and neonate in the treatment of women with gestational diabetes?
How does maternal outcome vary based on the level of glucose at the initiation of a medication?
How does neonatal outcome vary based on the level of glucose at the initiation of a medication?
Understanding the risks and benefits of the use of insulins or oral diabetes agents during pregnancy for both maternal and neonatal outcomes is essential to the care of women with gestational diabetes and their offspring.28 29 As shown in the conceptual framework (see Figure 3
The average maternal age ranged from 25 to 34 years and did not substantially differ across groups. Only three studies reported the racial distribution of the study participants:33 34 36 Anjalakshi et al.33 reported that 100 percent of the study participants were Indian. Most participants (95 percent) in the study by Jovanovic et al.36 were reported as Hispanic. All of the participants in the study by Mecacci et al.34 were reported as Caucasian.
In the studies that reported maternal weight, the weight measures were similar between groups. Five studies30 34–37 reported gravidity, and three studies30 32 36 reported the parity of study participants, which ranged from nulliparity to 2.5 prior births.
Consistent with our study selection criteria, each of the eight RCTs reported the test used to diagnose gestational diabetes. Three studies31 33 37 used the 75-gm OGTT World Health Organization (WHO) criterion. Two studies30 35 used the National Diabetes Data Group (NDDG) criterion, and three studies32 34 36 used the 3-hr, 100-gm OGTT with threshold values based on the Carpenter and Coustan criterion. Langer et al.32 used the FBG threshold of 95 mg/deciliter (dL) based on the 100-gm OGTT Carpenter and Coustan criteria to determine eligibility and as the threshold value for treatment with insulin or glyburide. Bertini et al. used a FBG greater than 90 mg/dL or a 2-hr PPG greater than 100 mg/dL as threshold values for initiation of treatment with glyburide or insulin. Anjalakshi et al.33 initiated medical therapy if the 2-hr PPG was 120 mg/dL or greater after two weeks of nutritional therapy. The average gestational age at screening and diagnosis of gestational diabetes varied across studies from 22 to 28 gestational weeks. Mecacci et al.34 reported a median gestational age at diagnosis of 28 weeks (range: 25 to 32). Polyhonen-Alho31 reported a gestational age range of 24 to 28 weeks.
| Insulin versus glyburide | Insulin versus insulin lispro | Insulin versus insulin | Diet versus insulin | |||||
|---|---|---|---|---|---|---|---|---|
| Maternal outcomes | Anjalakshi, 200633 | Bertini, 200537 | Langer, 200032 | Jovanovic, 199936 | Mecacci, 200334 | Nachum, 199935 | Poyhonen-Alho, 200231 | Thompson, 199030 |
| Cesarean delivery for CPD | • | |||||||
| Cesarean delivery, total | • | • | • | • | • | • | ||
| Glycemic control* | • | • | • | • | • | • | ||
| Hemorrhage | ||||||||
| Hypoglycemia | • | • | • | • | ||||
| Operative vaginal delivery | ||||||||
| Perineal tears | ||||||||
| Pre-eclampsia | • | |||||||
| Weight | • | • | • | |||||
| Neonatal outcomes | ||||||||
| Anoxia | ||||||||
| Birth trauma | • | • | ||||||
| Birth weight | • | • | • | • | • | • | ||
| Congenital malformations | • | • | ||||||
| Hyperbilirubinemia | • | • | • | • | ||||
| LGA | • | • | • | • | ||||
| Macrosomia | • | • | • | • | • | |||
| Mortality | • | • | • | |||||
| Hypoglycemia | • | • | • | • | • | |||
| NICU admission | • | |||||||
| RDS | • | |||||||
| SGA | • | • | • | |||||
| Shoulder dystocia | ||||||||
A dot (•)indicates that the outcome was evaluated in that study.
Includes FBG, 1-hr PPG, 2-hr PPG, HbA1c, combined glucose, preprandial glucose
2-hr PPG = 2 hour postprandial glucose; CPD = cephalopelvic disproportion; FBG = fasting blood glucose; HbA1c = hemoglobin A1c; LGA = large for gestational age; NICU = neonatal intensive care unit; RCTs = randomized controlled trials; RDS = respiratory distress syndrome; SGA = small for gestational age
Two RCTs32 33 evaluated maternal glycemic control. Langer et al.32 randomized 404 women to receive insulin (n=203) or glyburide (n=201). The insulin regimen was based on maternal weight, with two-thirds of the units administered as neutral protamine Hagedorn (NPH) and one-third of the units as regular insulin. In the study by Langer, glyburide was initiated at a dose of 5.0 milligrams (mg) or 2.5 mg and increased to a maximum dose of 20 mg/day. In the study by Anjalakshi,33 glyburide was initiated at a dose of 0.625 mg. No maximum or average dose was reported. Langer et al.32 reported no statistically significant differences in average final FBG or 2-hr PPG levels between those receiving insulin and those on glyburide. The average (mean ± standard deviation [SD]) FBG levels were 96 ± 16 for insulin and 98 ± 13 for glyburide (p = 0.17). The average 2-hr PPG levels were 112 ± 15 for insulin and 113 ± 22 for glyburide (p = 0.6).
A smaller randomized trial33 of 26 participants comparing glyburide to insulin also reported no statistically significant differences in mean 2-hr PPG levels during pregnancy in the insulin versus the glyburide group.
The two larger RCTs32 37 compared the percentage of women undergoing cesarean delivery in each group. Langer32 reported that 49 (24 percent) of the women on insulin underwent cesarean delivery, as compared to 46 (23 percent) of the women on glyburide (p > 0.05). Bertini et al.37 reported no significant differences in the rate of cesarean delivery among three groups of women receiving insulin (44 percent), glyburide (50 percent), or acarbose (52 percent).
Bertini37 and Langer32 both reported on maternal hypoglycemia. Bertini defined maternal hypoglycemia based on the need for hospitalization and reported no episodes of hospitalization in any of the three treatment groups. Langer did not define maternal hypoglycemia but reported a significantly higher percentage of women with a blood glucose level under 40 mg/dL in the insulin group than in the glyburide group (20 percent versus 4 percent; p = 0.03).
Bertini37 also compared the mean difference in maternal weight at delivery to the baseline value in each treatment group and found no significant differences.
| Author, year | N for analysis | Mean difference in birth weight,† grams (standard error) | Regression coefficient | 95% CI |
|---|---|---|---|---|
| Anjalakshi, 200633 | 23 | -120 (161) | -90 | -193, 12 |
| Bertini, 200537 | 51 | -244 (133) | -68 | -174, 37 |
| Langer, 200032 | 404 | -62 (57) | -194 | -395, 7.3 |
| Pooled estimates | Total participants = 478 | -93 | -191, 5 | |
95% CI = 95 percent confidence interval
Mean difference = the average difference in birth weight between infants in the insulin group and infants in the glyburide group.
Figure legend. The shaded boxes represent the mean difference in infant birth weight between the treatment groups in each study. The diamond represents the pooled mean difference in birth weight between infants born to mothers treated with insulin and infants born to mothers treated with glyburide.
Langer et al. reported no significant differences between treatment groups in the percentage of infants with hypoglycemia. Among the 201 women on glyburide, 9 percent of the infants experienced hypoglycemia, as compared to 6 percent of those with mothers on insulin (p = 0.25). Bertini et al. reported a higher percentage of infants with macrosomia (birth weight greater than 4,000 gm) and LGA among the women on glyburide than among those on insulin or acarbose. A significantly higher percentage of infants had hypoglycemia in the glyburide group than in the insulin or acarbose groups (33 percent compared to 4 percent and 5 percent, respectively; p = 0.006). However, Bertini reported no difference in SGA infants or in perinatal mortality between the group on insulin and the group on glyburide.
The mean decrease in glycosylated hemoglobin from the time of entry into the study until delivery was greater in the women on lispro (mean difference from baseline -0.35 percent) than in those on regular insulin (mean difference from baseline -0.07 percent; p = 0.002).36 Maternal hypoglycemia, reported as the mean (standard error [SE]) percentage of all blood determinations in the hypoglycemic range, was not significantly different in the insulin lispro group and the group receiving regular insulin (0.88 percent ± 0.25 percent versus 2.2 percent ± 0.86 percent; p > 0.05).36 Mecacci reported significantly higher maternal 1-hr PPG levels in the insulin lispro group than in the regular insulin group (108 mg/dL ± 11 versus 88 mg/dL ± 11, respectively; p < 0.001).34 However, both pre-prandial and 2-hr PPG levels were similar in the two groups (p > 0.05 for pre-prandial and 2-hr PPG). Both Jovanovic and Mecacci reported no significant differences in the proportion of women undergoing cesarean delivery; Jovanovic reported no differences across all cesarean deliveries,36 and Mecacci reported no differences between groups for cesarean delivery specifically for cephalopelvic disproportion.34
Metformin versus insulin. There is no currently published evidence on maternal and neonatal outcomes in women with gestational diabetes who have been treated with metformin versus insulin.39 Recently published data on metformin treatment in pregnancy are primarily based on small cohort studies in women with PCOS, in whom it has been used to treat infertility.40–43 Women with PCOS and women with type 2 diabetes who continue to receive metformin through the first trimester of pregnancy have demonstrated few adverse pregnancy events. An ongoing prospective RCT (the Metformin in Gestational Diabetes [MiG] trial) comparing metformin with insulin in women with gestational diabetes is currently underway in New Zealand and Australia.44 The goal of the trial is to recruit 750 women over a 2-year period, collecting data on multiple maternal and neonatal outcomes. The primary outcome is a composite of neonatal morbidity, including hypoglycemia, respiratory distress, phototherapy, birth trauma, low 5-minute Apgar score, and prematurity. The secondary outcomes include maternal glycemic control, neonatal body composition, and markers of neonatal insulin sensitivity. An interim report of 453 participants showed no adverse events.44 We anticipate that the results of this trial will provide meaningful insight into the potential risks and benefits of metformin therapy. The results of the MiG trial are likely to provide further evidence on the short-term (e.g., congenital anomalies) and as yet potentially unrecognized long-term effects of placental transfer and in utero fetal exposure to metformin.
| Author, year | Treatment, N | Hyperbilirubinemia | Congenital malformation | Perinatal mortality | Birth trauma | Other neonatal outcome | Maternal hypoglycemia |
|---|---|---|---|---|---|---|---|
| Insulin versus glyburide | |||||||
| Anjalakshi, 200633 | G1: Insulin, 13 | ||||||
| G2: Glibenclamide, 10 | |||||||
| Bertini, 200537 | G1: Insulin, 27 | 0 (0) | 0 (0) | ||||
| G2: Glyburide, 24 | 0 (0) | 0 (0) | |||||
| G3: Acarbose, 19 | 0 (0) | 0 (0) | |||||
| Langer, 200032 | G1: Insulin, 203 | 8 (4)‖ | 4 (2) | 2 (1) | NICU admission: 14 (7) | ||
| G2: Glyburide, 201 | 12 (6)‖ p = 0.36† | 5 (2) p = 0.74† | 2 (1) p = 0.99† | NICU admission: 12 (6) p = 0.68† | |||
| Insulin versus insulin lispro | |||||||
| Jovanovic, 1999 36 | G1: Regular human insulin, 23 | ||||||
| G2: Insulin lispro, 19 | |||||||
| Mecacci, 2003 34 | G1: Regular human insulin, 24 | (2.2) | |||||
| G2: Insulin lispro, 25 | (0.88) p > 0.05† | ||||||
| Insulin versus insulin | |||||||
| Nachum, 1999 35 | G1: Insulin twice daily, 136 | 29 (21)^ | 2 (2) | 1 (1) | 3 (2) | RDS: 0 (0.00) | 1 (0.72) |
| G2: Insulin four times daily, 138 | 15 (11)^ | 1 (1) | 0 (0.00) | 2 (1) | RDS: 1 (1) | 1 (0.72) | |
| Poyhonen-Alho, 2002 31 | G1: Short-acting insulin, 11 | 3 (27.27) | 0 (0.00) | ||||
| G2: Long-acting insulin, 12 | 3 (25.00) | 1 (8.33) | |||||
| Diet versus insulin | |||||||
| Thompson, 1990 30 | G1: Diet, 50 | 0 (0.00)Ω | |||||
| G2: Diet and insulin, 45 | 0 (0.00)Ω | ||||||
Comparing G1 to G2.
Serum bilirubin > 12 mg/dL.
Serum bilirubin > 205mmol/L at >= 34 weeks of gestation or > 137 mmol/L at < 34 weeks.
Serum bilirubin > 10 mg/dL.
RCT = randomized controlled trial; dL = deciliter; G = group; L = liter; mg = milligram; mmol = millimole; NICU = neonatal intensive care unit; RDS = respiratory distress syndrome
There were few reports of maternal hypoglycemia. Bertini37reported none; Langer32 reported a higher number of women with FBG less than 40 mg/dL in the glyburide than in the insulin group. Maternal hypoglycemia was not significantly different in the insulin lispro group and the group receiving regular insulin (0.88 percent ± 0.25 percent versus 2.2 percent ± 0.86 percent; p > 0.05).36 The twice-daily insulin and four-times-daily insulin groups each had one case of maternal hypoglycemia.35
Four studies45–47 49 used the 100-gm Carpenter and Coustan criterion (2003 ADA criterion), and one study48 used the NDDG criterion to diagnose gestational diabetes. Four studies45 47–49 reported the percentage of participants with prior gestational diabetes. All five studies reported the gestational age of pregnancies at the time of diagnosis of gestational diabetes; these ages ranged from 18 to 33 weeks of gestation.
The initial glyburide dose was 2.5 mg daily in three of the four studies. Two studies reported an initial dose between 2.5 mg and 5 mg per day. Dosages were escalated on the basis of glucose control to a maximum of 20 mg/day in each study.45–49 The initial insulin dose in three45–47of the four studies was 0.7 units/kg. One study49 reported a standard regimen consisting of a combination of NPH and regular insulin injected subcutaneously three times daily. One study48 did not report the initial insulin dose. Insulin levels were adjusted, with four studies reporting no maximum dose. Jacobson48 reported a mean dose of 34.4 units per day in 249 of the 268 women treated with insulin.
Observational studies. Because of the differences in study design, the use of non-comparable groups, and the differences in outcome measures, we chose not to conduct a meta-analysis of the five observational studies included in our review. We offer a summary of the relevant findings and study conclusions and discuss their potential relevance for future research. We include the data on 5 maternal and 11 neonatal outcomes from the observational studies. The maternal outcomes were: (1) operative vaginal delivery, (2) pre-eclampsia, (3) cesarean delivery, (4) glucose control, and (5) maternal hypoglycemia. The neonatal outcomes were: (1) hypoglycemia, (2) hyperbilirubinemia, (3) macrosomia, (4) LGA, (5) SGA, (6) perinatal mortality, (7) infant birth weight, (8) neonatal intensive care admissions, (9) birth trauma, (10) congenital malformations, and (11) shoulder dystocia.
Chmait45 conducted a prospective, cohort study of 69 women with gestational diabetes who failed diet alone and elected to proceed with glyburide therapy. Of the 69 participants, 13 participants were started on glyburide therapy but later required the addition of insulin or were transitioned from glyburide to insulin therapy because of inadequate glucose control. Fifty-six (81 percent) of the participants achieved adequate glucose control on glyburide.
Chmait concluded that women with gestational diabetes with FBG levels under 110 mg/dL and 1-hr PPG levels under 140 mg/dL were more likely to successfully continue glyburide therapy throughout pregnancy. However, these findings were based on a small sample size without any reported adjustment for confounders. Also, because the majority of participants were Hispanic, the findings may not apply to other populations.
Finally, Rochon et al. conducted a retrospective study of 101 participants recruited from a prenatal diabetes clinic in order to identify characteristics that might predict failure of glyburide therapy and to evaluate whether those women who had failed glyburide were more likely to undergo adverse pregnancy outcomes.49 These gestational diabetics, who had undergone a 1-week trial of diet but were not meeting glycemic goals (FBG between 60 and 90 mg/dL and 2-hr PPG of 120 mg/dL or less), were then started on glyburide. Those who were consistently 15 percent to 25 percent above the FBS or 2-hr PPG target values were switched to insulin therapy. Eighty (79 percent) of the 101 participants were identified as glyburide “successes” compared to 21 (21 percent) who were categorized as glyburide “failures.” Rochon and colleagues reported few statically significant differences in the maternal or neonatal outcomes for the success and failure groups. The rate of neonatal intensive care admissions was higher in the glyburide success group than in the glyburide failure group (33 percent versus 10 percent; p = 0.04). Infant birth weight was similar between the success and failure groups (3,415 gm ± 620 compared to 3,319 ± 559; p = 0.5). The absence of significant differences in birth weight may reflect, at least in part, the limited power of the study to detect a small difference between groups.
There was no difference in the percentage of cesarean deliveries (38 percent versus 43 percent) between the success and failure groups. The rate of shoulder dystocia (10 percent versus 11 percent) was also similar in both groups. Although congenital anomalies were not included as one of the outcomes, Rochon and colleagues reported two neonatal intensive care admissions in the glyburide success group that were due to a congenital anomaly. Also, most admissions to the neonatal intensive care unit were related to neonatal hypoglycemia (10 infants in the success group and 2 in the failure group). The authors concluded that there are few adverse maternal or neonatal outcomes in pregnancies in which glyburide therapy has failed and insulin is required. They also concluded that the rate of neonatal intensive care admissions was higher in the glyburide success group than in the glyburide failure group, primarily because of neonatal hypoglycemia.
Limitations. In addition to the quality assessment outlined above, two additional limitations deserve further comment: First, only one study48 adjusted for potential confounders. Jacobson adjusted for several relevant covariates, including race/ethnicity, FBG, BMI, and gestational age at diagnosis of gestational diabetes. Additional adjustment for relevant labor complications, such as maternal hypertension or intrapartum infection, might help to elucidate the association of insulin therapy with maternal and neonatal outcomes. Second, there was no discussion of potential selection bias in the conduct of the observational study or the potential influence of this bias on the associations reported. Because of the observational design and lack of adjustment for confounders, it is difficult to draw conclusions with confidence.
Given the limitations of the observational studies, we based our conclusions on the available RCTs. None of the observational studies was strong enough to justify a modification of the conclusions drawn from the RCTs.
Key Question 1a. How does maternal outcome vary based on the level of glucose at the initiation of a medication?
Key Question 1b. How does neonatal outcome vary based on the level of glucose at the initiation of a medication?
Maternal glycemia and maternal and neonatal outcomes. We found no evidence for variation in maternal or neonatal outcomes on the basis of the glucose level at the initiation of treatment with an oral agent or insulin. One ongoing study may provide evidence to address this important clinical question. We look forward to the publication of the findings from the Hyperglycemia and Adverse Pregnancy Outcomes (HAPO) Study.50 The HAPO Study is a 5-year, prospective, observational study designed to examine the association of various levels of maternal glycemia in the third trimester with adverse pregnancy outcomes in a multi-national, multicultural, ethnically diverse cohort of women. This investigator-initiated observational study has recruited 23,325 pregnant women from nine countries. All participants undergo glucose tolerance testing. Those participants with levels below the pre-defined threshold are provided with standard obstetrical care, and their providers are blinded to their glucose levels. Maternal blood is obtained for measurement of serum C-peptide and hemoglobin A1c (HbA1c) and cord blood for serum C-peptide and plasma glucose; a capillary specimen is obtained between 1 and 2 hr after delivery for measurement of neonatal plasma glucose. Neonatal anthropometrics are obtained, and followup data are collected at 4–6 weeks post-delivery. The primary outcomes are cesarean delivery, increased fetal size (macrosomia/LGA/obesity), neonatal morbidity (hypoglycemia), and fetal hyperinsulinemia. Preliminary findings, presented at the 67th Annual Scientific Session of the ADA,51 suggest a linear association between rising maternal glucose levels in the third trimester and the likelihood of cesarean delivery. Large babies (defined as being in the largest 10 percent of the newborn population) were born to only 5 percent of women with the lowest fasting plasma glucose levels (less than 75 mg/dL) but to 27 percent of those with the highest levels (greater than 100 mg/dL). Women with the highest glucose levels had a 6.6 times greater risk of delivering an infant with macrosomia than did women with the lowest glucose levels (OR = 6.6; 95 percent CI: 4.6 to 9.6). Rising glucose levels were also associated with a linearly higher likelihood that the newborn would be above the 90th percentile for total skinfold thickness (5.4 percent at the lowest glucose levels versus 28 percent at the highest, OR = 1.52, 95 percent CI: 1.40 to 1.59). These findings suggest that the likelihood of adverse outcomes increases linearly with rising maternal glucose levels even when the range of maternal glucose levels is considered normal. These findings should provide further information on the level of glycemia at which adverse events may occur, although the glucose levels may be below the threshold values for gestational diabetes. Also, these findings may provide insight into the level of glucose at which therapy with an oral agent or insulin should be added to diet therapy.
We found limited evidence on the risks and benefits of oral diabetes agents, insulin analogues, and insulin. The available evidence, to date, suggested little difference in maternal or neonatal outcomes for treatment with oral agents versus any type of insulin, but inconsistencies in clinical outcomes measures across studies and lack of data make it difficult to draw firm conclusions. No studies compared metformin to insulin or other oral agents. Our meta-analysis showed a small, non-significant lower infant birth weight in pregnant women treated with insulin as compared with those treated with glyburide. This small difference of 93 gm is unlikely to have significant clinical relevance. Further studies are needed to determine whether there is a consistent and clinically definable difference in infant birth weight. There appeared to be little difference in various reported measures of maternal glucose control in women treated with glyburide versus insulin (FBG and 2-hr PPG) or in women treated with insulin lispro versus regular insulin (glycosylated hemoglobin and 1-hr PPG). It is unclear whether differences in maternal hypoglycemia are associated with different treatment regimens: Only one study of glyburide and insulin37 defined threshold values for maternal hypoglycemia as part of the study protocol. In one study comparing insulin lispro to regular insulin, maternal hypoglycemia was based on the need for hospitalization rather than threshold glucose values. No available evidence met our inclusion criteria for variations in maternal or neonatal outcomes being based on glucose levels at the initiation of oral agents or insulin. However, as we have indicated above, ongoing investigations, such as the HAPO Study, may provide evidence to suggest threshold values at which clinicians should add oral diabetic agents, insulin analogues, or insulin to diet therapy. The results of the MiG trial should provide evidence regarding the relative benefits and harms of treatment with metformin versus insulin. Finally, additional data regarding congenital anomalies, the long-term consequences of glyburide use, and the effects of metformin transport across the placenta should inform clinical practice and clinical guidelines for the use of oral diabetic agents in pregnancy.
What is the evidence that elective cesarean delivery or the choice of timing of induction in women with gestational diabetes results in beneficial or harmful maternal and neonatal outcomes?
What is the evidence for elective cesarean delivery at term, as compared to an attempt at vaginal delivery (spontaneous or induced) at term, with regard to beneficial or harmful maternal and neonatal outcomes in gestational diabetes?
cesarean versus spontaneous labor and vaginal delivery
cesarean versus induced labor and vaginal delivery
cesarean versus any attempt at vaginal delivery at term
What is the evidence for labor induction at 40 weeks, as compared to labor induction at an earlier gestational age (less than 40 weeks) or spontaneous labor, with regard to beneficial or harmful maternal and neonatal outcomes in gestational diabetes?
labor induction at less than 40 weeks versus labor induction at 40 weeks
labor induction at 40 weeks versus spontaneous labor
labor induction at less than 40 weeks versus spontaneous labor
How is the EFW related to outcomes of management of gestational diabetes with elective cesarean delivery or the timing (i.e., gestational age range) of labor induction?
How is gestational age related to outcomes of management of gestational diabetes with elective cesarean delivery or the choice of timing (i.e., gestational age range) of labor induction?
Maternal outcomes
| Neonatal outcomes
|
Clinicians use a variety of clinical parameters in their clinical decisionmaking for intrapartum management. Estimates of fetal weight, gestational age, and maternal glucose control are measures of particular importance in pregnancies complicated by gestational diabetes. Clinical management can also be influenced by patient preference and provider perception. Management options include expectant management, labor induction, or “elective” cesarean delivery. In the context of diabetic pregnancies, we refer to “elective” cesarean delivery as a procedure performed following discussion between the patient and clinician, with the goal of avoiding adverse neonatal outcomes such as shoulder dystocia, nerve palsy, or fracture.
Medical institutions have traditionally developed protocols for labor management of women with gestational diabetes, incorporating a combination of anecdotal experience, published literature, and recommendations by national clinical organizations. Both the ACOG and the ADA7 13 have provided guidance with regard to labor management of pregnancies complicated by gestational diabetes. The current guidelines, however, are based primarily on retrospective studies that summarize individual hospitals' experiences with maternal and neonatal outcomes. Limitations in the available literature on the management of women with gestational diabetes may have contributed to delays in the development of broadly accepted guidelines for clinical management and to the current variation in practice patterns and clinical outcomes.
Our objective was to conduct a systematic review of the available literature on the effect of EFW and gestational age on maternal and neonatal outcomes in pregnancies involving gestational diabetes. We also focused on the effect of delivery options (i.e., expectant management, induction, and elective cesarean delivery). We developed a conceptual framework to guide the review of Key Question 2, incorporating the key steps in clinical decisionmaking for labor management (see Figure 5
| Author, year | Type of study | Control group intervention/protocol | Study group intervention/protocol | Population | Key limitations of study | Key conclusions |
|---|---|---|---|---|---|---|
| Kjos, 199355 | Randomized controlled trial | Expectant management | Induced at 38 weeks | GDMA2 | Randomization process not described | ↓ macrosomia, ↓ birth weight in study group |
| Pre-gestational diabetics (6.5%) | High rate of induction in control group | |||||
| Conway, 199853 | Prospective cohort study with HC; protocol-based | Expectant management | US at 37–38 weeks | GDMA1 | No adjustment for confounders | ↑ CD, ↓ macrosomia, ↓ shoulder dystocia in study group |
| CD if EFW>4,250gm | GDMA2 | No stratified analysis for GDM class | ||||
| Induced if LGA and EFW<4,250gm | Pre-gestational diabetics (8.6%) | No power calculation | ||||
| Lurie, 199658 | Prospective cohort study with HC; protocol-based | Induced if EFW>4000gm | Induced at 38 weeks, | GDMA1 | No adjustment for confounders | ↓ macrosomia, ↓ shoulder dystocia (only if compared to controls delivered after 40 weeks) |
| CD if EFW>4,500gm (sub-group analysis: delivered > 40 weeks) | CD if EFW>4,500gm | GDMA2 | No stratified analysis for GDM class | |||
| Small number of subjects | ||||||
| Lurie, 199257 | Retrospective cohort study; groups based on gestational age at delivery | Induced if EFW>4000gm | Induced if EFW>4,000g | GDMA1 | No adjustment for confounders | ↓birth weight in GDMA2 patients delivering before 40 weeks |
| CD if EFW>4,500gm, delivery > 40 weeks | CD if EFW>4,500gm, delivery < 40 weeks | GDMA2 | Outcomes not clearly defined | |||
| Stratified analysis | ||||||
| Peled, 200459 | Retrospective cohort study comparing four protocol periods | HC A: Induced at 42 wks | Period D: Induced at 38 weeks if LGA | GDMA1 | No adjustment for confounders | Decreasing rates of macrosomia and shoulder dystocia with level of intervention |
| CD if EFW>4,500gm | CD if EFW>4,000gm | GDMA2 | Limited information on baseline characteristics | |||
| HC B: Induced at 40 weeks if LGA | Exclusion criteria not reported | |||||
| CD if EFW>4,000gm | No stratified analysis for GDM class | |||||
| HC C: Induced at 40 weeks if LGA | Long study period (19 years) | |||||
| CD if EFW>4,000gm | ||||||
| Rayburn, 200556 | Retrospective cohort study; protocol-based | Expectant management of GDMA1 | Induction of GDMA2 at 38 weeks | GDMA2 versus GDMA1 | Significant differences in baseline characteristics between groups | No differences in maternal or neonatal outcomes |
| Outcomes not clearly defined | ||||||
| Marchiano, 200452 | Retrospective cohort study | Trial of labor | Elective repeat CD | GDMA1 with previous CD | Results only generalizable to patients with previous CD | ↑ macrosomia in elective CD group |
| Keller, 199154 | Retrospective cohort study | Trial of labor | GDMA1 | No adjustment for confounders | ↑ shoulder dystocia with ↑ birth weight | |
| GDMA2 | Lack of appropriate comparison group | |||||
| Limited information on baseline characteristics | ||||||
CC = concurrent control group; CD = cesarean delivery; EFW = estimated fetal weight; GDM = gestational diabetes mellitus; GDMA1 = diet-controlled gestational diabetes; GDMA2 = gestational diabetes requiring medical therapy; gm = gram; HC = historical control group; LGA=large for gestational age; US=ultrasound
Impact of gestational age on the timing of labor induction. We identified one RCT that addressed the impact of labor induction at term, as compared to expectant management, on maternal and neonatal outcomes in gestational diabetes.55 Kjos 1993 recruited 200 women from one tertiary care center. The study sample included 187 women with class A2 gestational diabetes and 13 women with pre-existing (class B non-insulin-requiring) diabetes. Inclusion criteria were clearly stated: good glucose control in at least 90 percent of measured levels, 38 completed weeks of gestation, good compliance with clinic appointments and home glucose monitoring, no antepartum testing abnormalities, singleton gestation with cephalic presentation, EFW less than 3800 gm at 38 weeks with no evidence of fetal growth restriction, no other medical or obstetrical complications, and no more than two previous cesarean deliveries. Women who met the inclusion criteria, agreed to randomization, and had an established diagnosis of diabetes were eligible to participate in the study. Women were randomized to either expectant management or induction of labor at 38 weeks. Of those with pre-existing diabetes, nine were in the active induction group and four were in the expectant management group. The two treatment groups did not differ significantly in terms of maternal age, gravidity, parity, maternal weight, or gestational age at entry into the study. The racial distribution of the study participants was not reported. Gestational age was calculated from the first day of the last menstrual period and adjusted if ultrasound estimation (before 22 weeks) differed from the menstrual age by 10 days or more. Amniocentesis and measurement of the lecithin-to-sphingomyelin (L/S ratio) was used if gestational age could not be accurately determined. Labor was induced with intravenous oxytocin at 38 weeks or in the presence of fetal lung maturity. Vaginal prostaglandin was used for cervical ripening if indicated (Bishop's score less than four) and if the patient had no contraindications to therapy.
In the final intention-to-treat analysis, there was no difference in cesarean delivery rates between the two groups (25 percent in the active induction group versus 31 percent in the expectant management group; p = 0.43). The average gestational age at delivery in the induction group was 1 week less than the gestational age in the expectant management group (39 weeks versus 40 weeks; p < 0.05).
The findings of this RCT suggested that infants born to women undergoing induction at 38 weeks have significantly lower average birth weights and perhaps a lower risk of macrosomia than do those born to women treated with expectant management. The absence of any difference in cesarean delivery rates suggested that maternal morbidity among women undergoing 38-week induction is similar to that of women undergoing expectant management. The similarity in demographics of the two groups suggested appropriate randomization. Adjustment for key covariates, including gestational at delivery, maternal weight, and age strengthened our confidence in the observed associations.
Impact of EFW on elective cesarean delivery and timing of labor induction. We identified one observational study on the effect of EFW on maternal and neonatal outcomes related to elective cesarean delivery and the timing of induction of labor.53 Conway et al. prospectively followed diabetic women (91.4 percent with gestational diabetes) who were delivered at a tertiary care institution between 1993 and 1995 according to an institutional protocol. Based on this protocol, women with diabetes underwent ultrasonographic estimates of fetal weight between 37 and 38 weeks of gestation. Women whose EFW was greater than or equal to 4,250 gm underwent cesarean delivery; those in whom the EFW was estimated at less than 4,250 gm but considered LGA (defined as 90th percentile or greater for the gestational age in their population) underwent labor induction. We will refer to this group who delivered between 1993 and 1995 as the study group. Outcomes for this study group were compared to those of a historical control group of diabetic women who delivered between 1990 and 1992, prior to the implementation of the new protocol. The study and control groups did not differ significantly in terms of their mean maternal age, racial composition, gestational age at delivery, or proportion of women with gestational diabetes or pre-gestational diabetes. Twenty-seven percent of the patients in the study group did not undergo ultrasound evaluation.
Based on this prospective, observational study, it appears that in women with gestational diabetes, a protocol involving elective cesarean delivery for macrosomia and induction at 38 weeks for LGA may reduce the number of macrosomic infants and the risk of shoulder dystocia, but it may also be associated with an increase in the number of cesarean deliveries. However, the lack of adjustment for the severity of the diabetes or other potentially confounding variables in this study may have resulted in an overestimate of the effect of the protocol on outcomes. Furthermore, temporal changes in the management of women with gestational diabetes may have also influenced the outcomes reported.
Relationship of gestational age and fetal weight to the timing of labor induction. We identified one cohort study58 that examined the relationship of gestational age and EFW to the timing of induction. Lurie et al58 prospectively followed a sample of women and compared outcomes with a historical control group in order to determine whether labor induction at 38 to 39 weeks of gestation might reduce the incidence of shoulder dystocia in women with gestational diabetes class A2. The study group (n = 96) was induced at 38 weeks or, if the EFW was greater than 4,500 gm, underwent elective cesarean delivery. The study group was compared to a historical cohort of women (n = 164) who delivered between 1983 and 1989 and in whom labor was induced only if the EFW was greater than 4,000 gm or, if the EFW was greater than 4,500 gm, underwent elective cesarean delivery. This historical cohort was the same study population described by Lurie et al. in an earlier paper,57 which will be discussed subsequently. Gestational age was based on the first day of the last menstrual period and serial crown rump measurements in the first trimester. Amniocentesis was performed to assess fetal lung maturity, using the L/S ratio prior to induction. Baseline participant characteristics, including maternal age and parity, were similar between the two groups. There were no reported data on maternal race, weight, or glucose control.
Additional analysis. The authors conducted a second analysis in which the outcomes in the study group were compared to those in a subset of the historical cohort of women who delivered after 40 weeks of gestation (n = 62). The proportion of deliveries complicated by shoulder dystocia was significantly reduced (from 10.2 percent to 1.4 percent; p < 0.05) in the study group when compared to this subset of the historical control group. Also, only nine percent of the infants in the study group had a weight greater than 4,000 gm, as compared to 24 percent in the historical control group (p < 0.05).
In summary, the authors of this paper found that the decrease in shoulder dystocia in the study group was only statistically significant if the study group was compared to the subgroup of control patients that delivered after 40 weeks. In addition to a lack of adjustment for severity of diabetes and a consideration of the temporal changes that had occurred in the management of patients with gestational diabetes, this study was further limited by its small population size.
Impact of gestational age at delivery. In their 1992 paper, Lurie et al57 conducted a retrospective chart review of all gestational diabetic women who delivered over a 5-year period, examining maternal and neonatal outcomes for women with gestational diabetes class A1 and A2 who delivered after 40 weeks of gestation or prior to 40 weeks. The groups were matched with regard to age, parity, and fetal presentation. Gestational age was based on the date of the last menstrual period and ultrasound measurements of crown rump lengths in the first trimester. Outcomes were reported separately for gestational diabetes classes A1 and A2.
Similar findings were obtained for the women with gestational diabetes class A2 (insulin-requiring gestational diabetes). The mean gestational age at delivery was 40.5 weeks in the group delivering after 40 weeks and 37.5 weeks in the group delivering before 40 weeks (p not reported). There were no differences in the number of vacuum-assisted deliveries (1/59 versus 4/59) or cesarean deliveries (15/59 versus 13/59; p = 0.6216).
In this retrospective cohort study, the only significant difference between patients delivering after 40 weeks and those delivering before 40 weeks was a higher mean birth weight in the subset of class A2 gestational diabetes patients, which was to be expected, given that gestational age is a strong predictor of birth weight. The authors concluded that the timing of delivery does not have a significant impact on clinically important maternal or neonatal outcomes. However, although the authors did perform a stratified analysis for class of gestational diabetes, the study did not adjust for other potential confounders or for delivery management in the groups.
Impact of gestational age and/or EFW on the timing of labor induction and/or elective cesarean delivery. Peled59 conducted a protocol-based chart review to evaluate the effect of gestational age and EFW on labor management. In this study, the charts of 2,060 patients with gestational diabetes treated over a 19-year period were abstracted for maternal and neonatal outcomes. The investigators compared four time periods, each with a distinct management protocol for the timing of labor induction or elective cesarean delivery, based on EFW and gestational age and target thresholds for maternal glycemia (Period A: 1980-1989; Period B: 1990-1992; Period C: 1993-1995; Period D: 1996-1999). Gestational age was calculated from the first day of the last menstrual period and confirmed by first trimester ultrasound when possible. EFW was estimated either clinically or by ultrasound. Outcomes among women in Period D (the study group) were compared with outcomes among women in the three prior periods (historical control groups). Women in the study group were induced at 38 weeks of gestation if the EFW was consistent with LGA (defined as greater than 90th percentile) or underwent elective cesarean if the EFW was greater than 4,000 gm. In Period A, patients underwent elective cesarean if the EFW was greater than 4,500 gm; otherwise, they were induced at 42 weeks. In both Periods B and C, patients underwent elective cesarean delivery if the EFW was greater than 4,000 gm, and they were induced at 40 weeks if LGA was diagnosed. It is noteworthy that the groups differed in terms of the level of glycemic control in the institution's protocol. For patients in Periods C and D, insulin was started at lower fasting glucose levels (> 5.3 mmol/L) and 2-hr postprandial levels (> 6.6 mmol/L) than in Periods A and B (> 5.8 mmol/L and > 7.8 mmol/L, respectively). Furthermore, patients had lower glycemic goals in Periods C and D (< 5.3 mmol/L) than in Period B (< 5.8 mmol/L) or Period A (no goal set). Thus, although glycemic control did not alter decisions regarding delivery, it is important to keep in mind that patients in the four periods differed in terms of their level of glucose control. Prostaglandin E2 gel or tablets was used for labor inductions over the 19-year period of the study. The authors also included both class A1 and A2 gestational diabetes patients but did not report outcomes separately for the two groups. The proportions of women treated with insulin during the four study periods were variable: 13 percent in Period A, 16.4 percent in Period B, 28 percent in Period C, and 32 percent in Period D. There was no other comparison of baseline characteristics (e.g., age, race, parity) in the four groups.
Neonatal outcomes. There was a reduction in the proportion of infants with birth weights greater than 4,000 gm (3.86 percent in the study group versus 20.6 percent in Period A, 16.3 percent in Period B, and 11.7 percent in Period C). The proportion of deliveries complicated by shoulder dystocia (none in Period D, versus 1.5 percent in Period A, 1.2 percent in Period B, and 0.6 percent in Period C) also decreased over the study period. Perinatal mortality rates also decreased from 8 percent in Period A to 3 percent in Period B, to 0 percent in Period C, and 0.77 percent in Period D. While p values were reported for comparisons between the gestational diabetes population and the non-gestational diabetes population, they were not reported for comparisons between time periods (the relevant comparison groups for this analysis).
Although this study provides data on a large population of patients with gestational diabetes, the lack of information on baseline characteristics (e.g., age, race, parity, severity of disease) in the four groups and the lack of adjustment for any differences between groups severely limited our ability to draw any substantial conclusions from this study. Also, the authors did not adjust for or discuss the influence of other potential obstetrical management patterns over the 19-year period. Clinical management of diabetic patients had changed substantially over the 19-year period of the study. Modifications in practice patterns have likely influenced the outcomes reported in these investigations. While examining trends in outcomes is useful, it is not possible to fully adjust for changes in practice patterns, leading to some level of bias in the reported associations.
Additional studies. We identified three additional studies that met our initial inclusion criteria but which focused on aspects of labor management that are outside our primary area of evidence review for Key Question 2. Nevertheless, given the paucity of data addressing labor management among women with gestational diabetes, we believe the findings of these studies and their relevance to delivery management deserve limited discussion.
Impact of gestational age on the timing of induction of labor in patients with different levels of disease severity. In a retrospective cohort study, Rayburn examined maternal and neonatal outcomes under an institutional protocol in which class A2 gestational diabetes patients were routinely induced at 38 weeks and class A1 gestational diabetes patients were managed expectantly.56 It is important to note that the control group, the gestational diabetes A1 patients who were managed expectantly (n = 137), underwent induction if there were any obstetrical indications for delivery, including pre-eclampsia, gestational hypertension, or poor glucose control; if the cervix was “favorable” at 40 weeks; or if the patient reached 42 weeks of gestation. The authors reported that only 53 percent of patients in the control group required induction, a rate that was significantly different from that in the study group (90 percent, p < 0.001). The gestational age at delivery was significantly different between groups (38.1 weeks in the study group as compared to 39 weeks in the control group, p < 0.001). The study found no differences in the rates of cesarean delivery or shoulder dystocia, macrosomia, respiratory difficulties in the neonate, or neonatal intensive care admissions.
The significant limitation of this investigation is that the study and control groups by definition had different severity levels of disease (class A1 versus class A2). There were also significant differences in the racial composition (the study group was 70 percent Hispanic, versus 60 percent in the control group; p < 0.01) and parity in each group (18 percent were nulliparous in the study group, versus 31 percent in the control group, p = 0.01).56
Impact of elective cesarean delivery versus a trial of labor in patients with previous cesarean delivery. Marchiano conducted a retrospective cohort study to examine outcomes related to elective repeat cesarean delivery versus a trial of labor in a population of women with gestational diabetes;52 423 women with class A1 gestational diabetes and singleton pregnancy who had undergone one previous cesarean delivery were included in the study.
The repeat cesarean delivery rate was 30 percent for those who attempted a trial of labor. The rate of macrosomia (defined as infant birth weight > 4,000 gm) for those who attempted a trial of labor was 18 percent, as compared to 33 percent for those who underwent elective cesarean (p < 0.0001) delivery. A sub-group analysis of women who attempted a trial of labor indicated a cesarean delivery rate of 43 percent for those whose infants weighed 4,000 gm or more, as compared to 28 percent for those with infants weighing less than 4,000 gm.
Although these results are relevant to the management of women with gestational diabetes, the results are only generalizable to those with prior cesarean delivery. Furthermore, the authors used actual infant birth weight rather than EFW in the analysis. Because EFW can vary from actual weight at delivery, it is difficult to draw useful conclusions from these results in terms of clinical decisionmaking for elective cesarean delivery versus an attempt at vaginal delivery.
Shoulder dystocia in patients with gestational diabetes. Keller 199154 performed a retrospective chart review of 210 patients with gestational diabetes from a tertiary care center in Chicago. Of the 210 patients, 173 underwent a trial of labor, 34 had elective repeat cesarean delivery, and 3 had an elective cesarean delivery for EFW greater than 4,000 gm (individual patient/provider decision). In those who underwent a trial of labor, the rate of cesarean delivery was 30.6 percent and the rate of forceps use was 4.6 percent. When birth weight categories were examined, the cesarean delivery rate was 33 percent in the greater than 4,500 gm group, 34 percent in the 4,000 to 4,499 gm group, and 29 percent in the 3,500 to 3,999 gm group.
The risk of shoulder dystocia in those patients who delivered vaginally was 12.5 percent overall and ranged from 9 percent in the lowest birth weight group to 14 percent in those weighing 4,000 to 4,499 gm and 38 percent in those infants weighing over 4,500 gm. Fractures and nerve injuries were rare (seven total) and were not related to birth weight category. The study also reported that the risk of shoulder dystocia in patients with class A1 gestational diabetes was not significantly different (OR = 0.78, 95 percent CI: 0.25 to 2.27) from that in patients with class A2 gestational diabetes.54
These findings by Keller offer a descriptive analysis of labor outcomes in women with gestational diabetes. Given the lack of a comparison group and any adjustment for confounders, as well as the limited sociodemographic and clinical information on the study sample, it is difficult to draw any reasonable conclusions from this study regarding labor management in women with gestational diabetes.
Limitations. Several limitations to these studies deserve further comment. First, there was heterogeneity in the severity of the gestational diabetes reported in the one RCT and four primary observational studies, making it difficult to assess the magnitude and direction of any association of the effect of gestational age or EFW with labor management. All four of the primary observational studies included women with gestational diabetes A1 and gestational diabetes A2, but only one reported outcomes stratified by insulin requirement.57 Furthermore, the RCT55 and one of the observational studies53 included pre-gestational diabetics, even though this condition represented only a small proportion of the sample (less than 10 percent). The results of these studies might have varied substantially if the study population had been limited to women with gestational diabetes class A1 or A2 or if the outcomes were stratified by severity.
Second, the four primary observational studies were conducted over a wide timeframe. It is difficult to account for the rise in the prevalence of gestational diabetes during this timeframe or the modifications in physician practice patterns and obstetrical technology that have certainly influenced maternal and neonatal outcomes. For example, while the intention of the study by Peled59 was to assess the impact of different management approaches over the 19-year period, it was impossible to discern the potential contribution of changes in glycemic target levels to delivery management over the four time periods.
Third, none of the four primary observational studies adjusted for potential confounders. Therefore, the magnitude of the associations between gestational age or EFW and outcomes may have been overestimated.
Fourth, the high rates of induction of labor in the expectant management group (49 percent) and of cesarean delivery in both groups in the RCT by Kjos et al55 illustrate the low threshold for intervention in current practice for patients with diabetes. They also highlight the potential role of medical liability in the design of studies of labor management. Physicians' concerns regarding medical liability, provider perception of risk, and maternal demand for cesarean delivery may limit the ability to conduct well-designed clinical trials of labor management.
One experimental study in this field suggested that active induction of labor at 38 weeks of gestation reduces birth weight, macrosomia, and LGA without increasing the rate of cesarean section. It was difficult to fully assess these outcomes, however, on the basis of a single clinical trial of only 200 patients. The current body of observational studies also suggested a potential reduction in macrosomia and shoulder dystocia with elective labor induction and elective cesarean delivery for macrosomia or LGA infants. We systematically searched the literature for evidence that the choice of timing of induction or elective cesarean delivery resulted in beneficial or harmful maternal or neonatal outcomes, as described in detail in the Key Question. Given the substantial heterogeneity in the studies reviewed and the serious limitations in study design and analysis of the existing literature, we were unable to draw any firm conclusions about the role of elective induction or cesarean delivery in the management of gestational diabetes.
What risk factors, including but not limited to family history, physical activity, pre-pregnancy weight, and gestational weight gain, are associated with short-term and long-term development of type 2 diabetes following a pregnancy with gestational diabetes?
We conducted our systematic review of this question according to the framework outlined in Figure 6
We identified a number of studies that examined the risk factors for type 2 diabetes among women with previous gestational diabetes. These studies varied widely in terms of their design, population, measurement of risk factors, and method of analysis. No single study included all the risk factors we enumerated. Although longitudinal studies and well-done case-control studies that use multiple regression methods provide the best evidence about the independent contribution of risk factors, we also included studies that used univariate analytic methods if they reported a relative measure of association.
Anthropometry
Pregnancy-related factors
Postpartum factors
Parity
Family history of type 2 diabetes
Maternal lifestyle factors
Sociodemographics
Oral contraceptive use
Physiologic factors
Age. Six studies60–63 65 66 assessed age as a risk factor; five of the six studies used multivariate analysis.60–63 66 Only one study reported the relative measure of association resulting from the multivariate analysis: Cho et al. reported that after adjustment for gestational age at the time of diagnosis, pre-pregnancy BMI, family history of type 2 diabetes, FBG at diagnosis, and homocysteine level, women greater than 30 years of age had a two-fold increased likelihood of developing type 2 diabetes (RR = 2.0; 95 percent CI: 0.68 to 6.0), but this association was not statistically significant, as evidenced by the 95 percent CI that included one.61 In one univariate analysis, Dacus et al. observed that older age did not appear to be associated with the risk of type 2 diabetes (RR = 0.68; 95 percent CI: 0.24 to 1.9).65
Hospital location. Cheung et al. were able to evaluate the hospital attended for antenatal clinic visits as a risk factor for the development of type 2 diabetes in women with previous gestational diabetes, since they had recruited women from two hospitals.60 Although they included age, parity, FBG at gestational diabetes diagnosis, BMI during pregnancy, 2-hr OGTT, number of prior gestational diabetes pregnancies, method of glucose control, and family history of type 2 diabetes, these investigators did not report a relative measure of association for the hospital attended and type of diabetes.
Work status. Cho et al. evaluated working status as a risk factor for the development of type 2 diabetes in eight multivariate models, including age, parity, family history of type 2 diabetes, working status, blood pressure, lipid profile, and one of eight measures of adiposity (postpartum BMI, waist circumference, weight, subscapular skin fold thickness, suprailiac skin fold thickness, tricep skin fold thickness, body fat weight, or waist-to-hip ratio).62 However, the relative measure of association was not reported for the association between working status and development of type 2 diabetes for any of the eight models.
Race. In a univariate analysis, Dacus et al. evaluated race as a risk factor for the development of type 2 diabetes in women with previous gestational diabetes. They reported that as compared to other race groups, blacks had a 50 percent increased risk of developing type 2 diabetes, but this association was not statistically significant (RR = 1.5; 95 percent CI: 0.45 to 5.0).65
We concluded that there are only limited data on which to base any meaningful conclusions regarding sociodemographic factors and the short- or long-term risk of type 2 diabetes among women with gestational diabetes.
Maternal lifestyle factors. We did not identify any studies that examined the relationship between lifestyle factors, such as physical activity and diet, and the development of type 2 diabetes in women with prior gestational diabetes. We therefore concluded that no evidence exists to determine whether maternal lifestyle affects the risk of developing type 2 diabetes after having gestational diabetes.
Gestational age at diagnosis of gestational diabetes. Five studies61 63 65 68 70 assessed gestational age at diagnosis of gestational diabetes as a risk factor, and four of the five studies used multivariate analysis.61 63 68 70 The studies varied in terms of their categorization of gestational age at gestational diabetes diagnosis: Two studies divided gestational age at gestational diabetes diagnosis into quartiles and used the first quartile as the reference:68 70 Both Kjos et al.68 and Schaefer-Graf et al.70 reported a protective effect of gestational age at gestational diabetes diagnosis in the fourth quartile as compared to gestational age at gestational diabetes diagnosis in the first quartile, with the effect ranging from a 52 percent to a 65 percent reduction in the likelihood of developing type 2 diabetes (RH = 0.48; 95 percent CI: 0.29 to 0.82; and OR = 0.35; 95 percent CI: 0.23 to 0.54) respectively. Both studies varied with respect to the covariates included in the multivariate model, and they did not share any common covariates. Schaefer-Graf et al.70 included FBG at gestational diabetes diagnosis, class A2 gestational diabetes, area under the glucose curve of pregnancy OGTT, previous gestational diabetes and 50-gm GCT, while Kjos et al.68 included postpartum OGTT glucose area under the curve, antepartum OGTT glucose area under the curve, and highest antepartum FBG. When third-quartile gestational age at gestational diabetes diagnosis was compared to the first quartile, a smaller protective effect was observed in both studies. Schaefer-Graf et al. reported a 55 percent reduction in the likelihood of developing type 2 diabetes (OR = 0.45; 95 percent CI: 0.27 to 0.76).70 Kjos et al. reported a 27 percent reduction in the likelihood of developing diabetes, but this association was not statistically significant (RH = 0.73; 95 percent CI: 0.45 to 1.2).68 For both studies, when second-quartile gestational age at gestational diabetes diagnosis was compared to the first quartile, no significant difference in the development of type 2 diabetes was found (Schaefer-Graf et al., OR = 1.1; 95 percent CI: 0.72 to 1.7; and Kjos et al., RH = 0.66; 95 percent CI: 0.39 – 1.1).
Cho et al. categorized gestational age at gestational diabetes diagnosis into two groups, women who were diagnosed with gestational diabetes at greater than or equal to 26 weeks and women who were diagnosed at less than 26 weeks. There was no significant difference in the development of type 2 diabetes between the two groups after adjusting for age, pre-pregnancy BMI, family history of type 2 diabetes, FBG at diagnosis, and homocysteine level (RR = 2.4; 95 percent CI: 0.88 to 6.6).61
Jang et al. assessed gestational age at gestational diabetes diagnosis as a continuous variable and found that for each week of increase in gestational age at gestational diabetes diagnosis, there was a 0.01 decrease in the log odds of developing type 2 diabetes (β = -0.01; SE = 0.05; p = 0.008).63
In a univariate analysis, Dacus et al. categorized gestational age at gestational diabetes diagnosis into two groups, comparing women who were diagnosed with gestational diabetes at less than 24 weeks and those diagnosed with gestational diabetes greater than or equal to 24 weeks. No significant difference was observed between the two groups in terms of the development of type 2 diabetes (RR = 2.5; 95 percent CI: 0.9 to 6.9).65
Method of glucose control. Five studies evaluated the method of glucose control during pregnancy as a risk factor for the development of type 2 diabetes.60 65 66 69 71 Three of these studies60 66 69 included a multivariate analysis, but only two of them60 66 reported a relative measure of association for this risk factor. These two studies varied considerably. Cheung et al. found that as compared to women who did not use insulin, those that did use insulin during pregnancy had a three-fold higher risk of developing type 2 diabetes after adjusting for age, parity, FBG at diagnosis, BMI at index pregnancy, 2-hr OGTT, number of prior pregnancies complicated by gestational diabetes, family history of type 2 diabetes, and hospital location (RR = 3.2; 95 percent CI: 1.6 to 7.0).60 Lobner et al. reported that as compared to women who were diet-controlled, women who received insulin during pregnancy had an almost five-fold increased risk of developing type 2 diabetes after adjustment for age, parity, GAD and IA-2 antibody status, BMI during pregnancy, and serum CRP (RH = 4.7; 95 percent CI: 3.2 to 7.1; p < 0.0001).66
Two studies included a univariate analysis, but only Steinhart et al. reported a relative measure of association for the method of glucose control. This study reported an almost three-fold increased likelihood of developing type 2 diabetes in women requiring insulin as compared to those not on insulin, but this association was not statistically significant (OR = 2.8; 95 percent CI: 0.8 to 11.2).71
One study by Cheung et al. examined the required dosage of bedtime intermediate-acting insulin as a risk factor. For each unit (unspecified) increase in dosage, there was a 9 percent increased likelihood of developing type 2 diabetes after adjustment for FBG (RR = 1.1; 95 percent CI: 1.0 to 1.2).60 The clinical relevance of this finding, however, is unclear, given that it is based on data from one study and is of borderline statistical significance.
50-gm GCT. The 50-gm GCT is routinely performed during pregnancy as the baseline screening test for gestational diabetes. Only one study evaluated the results of the 50-gm GCT performed during pregnancy as a risk factor for the development of type 2 diabetes.70 Schaefer-Graf et al. categorized the GCT results into quartiles, using the first quartile as the reference. They reported that as compared to women with 50-gm GCT results in the first quartile, women with results in the second, third, and fourth quartiles had an increasingly higher risk of developing type 2 diabetes (OR = 2.9; 95 percent CI: 1.2 to 6.6; OR = 3.8; 95 percent CI: 1.7 to 8.5; and OR = 3.5; 95 percent CI: 1.6 to 7.6 for the second, third, and fourth quartiles, respectively), after adjusting for FBG at diagnosis, class A2 gestational diabetes, area under the glucose challenge curve of pregnancy OGTT, gestational age at diagnosis of gestational diabetes, and previous pregnancy complicated by gestational diabetes.
Class A-2 (insulin-requiring gestational diabetes). One study evaluated class A2 gestational diabetes, defined as requiring insulin therapy because of FBG levels greater than or equal to 105 mg/dL, as a risk factor for the development of type 2 diabetes.70 Schaefer-Graf et al. reported that as compared to women with gestational diabetes class A1, women with gestational diabetes class A2 were 2.4 times more likely to develop type 2 diabetes, after adjusting for FBG at diagnosis, 50-gm GCT, area under the curve for a pregnancy OGTT, gestational age at diagnosis of gestational diabetes, and previous pregnancy complicated by gestational diabetes (OR = 2.4; 95 percent CI: 1.2 to 4.7).
Previous pregnancies complicated by gestational diabetes. Two studies evaluated previous pregnancies complicated by gestational diabetes as a risk factor for the development of type 2 diabetes.60 70 These studies included multivariate analysis, but only one study reported a relative measure of association for this risk factor.70 Schaefer-Graf et al. reported that as compared to women without a previous pregnancy complicated by gestational diabetes, those with a such a pregnancy were 60 percent more likely to develop type 2 diabetes, after adjusting for FBG at diagnosis, 50-gm GCT, area under the glucose challenge curve of pregnancy OGTT, gestational age at gestational diabetes diagnosis, and previous pregnancy complicated by gestational diabetes (OR = 1.6; 95 percent CI: 1.1 to 2.5).
Spontaneous abortion. One study that included a univariate analysis examined spontaneous abortion as a risk factor for the development of type 2 diabetes in women with previous gestational diabetes.71 Steinhart et al. reported that as compared to women without spontaneous abortions, those with spontaneous abortions were 36 percent more likely to develop type 2 diabetes, but this association was not statistically significant (OR = 1.4; 95 percent CI: 0.5 to 3.5).
We concluded that the overall grade of evidence for pregnancy-related factors was moderate.
Additional pregnancy. Two studies assessed additional pregnancy as a risk factor for the development of type 2 diabetes in women with previous gestational diabetes, and both used multivariate analysis.69 72 However, only one study reported a relative measure of association for this risk factor.72 After adjusting for postpartum weight change (per 10 pounds), OGTT glucose area, postpartum BMI, and breastfeeding, Peters et al. found that as compared to women with no additional pregnancy, those with an additional pregnancy had a three-fold increased risk of developing type 2 diabetes (RH = 3.3; 95 percent CI: 1.8 to 6.2).72
Breastfeeding. Two studies assessed breastfeeding as a risk factor for the development of type 2 diabetes, and both constructed multivariate models.64 72 However, neither of these studies reported relative measures of association for this risk factor.
Duration of followup. One study evaluated the duration of followup as a risk factor for the development of type 2 diabetes in women with previous gestational diabetes and constructed eight multivariate models, involving age, parity, family history of type 2 diabetes, working status, blood pressure, lipid profile, and one of eight measures of adiposity (postpartum BMI, waist circumference, weight, subscapular skin fold thickness, suprailiac skin fold thickness, tricep skin fold thickness, body fat weight, or waist-to-hip ratio).62 However, the relative measure of association was not reported for the association between duration of followup and development of type 2 diabetes for any of the eight models.
Recurrent gestational diabetes. One study evaluated recurrent gestational diabetes as a risk factor for the development of type 2 diabetes in women with previous gestational diabetes and conducted a univariate analysis.71 Steinhart et al. reported that as compared to women without recurrent gestational diabetes, those with recurrent gestational diabetes had a 24-fold increased risk of developing type 2 diabetes (OR = 24.8; 95 percent CI: 3.0 to 1132.2). The width of this confidence interval, however, suggests substantial variability in the point estimate and makes it impossible for us to draw any firm conclusions from these data.
We concluded that the overall grade of the evidence for postpartum factors was very low.
Measures of anthropometry. We identified 11 cohort studies that evaluated a total of 11 different anthropometric measures: weight, height, BMI, body fat weight, subscapular skin fold thickness, suprailiac skin fold thickness, tricep skin fold thickness, waist circumference, waist-to-hip ratio, percent ideal body weight, and weight change (see Appendix F, Evidence Table 23).60–67 69 72 73 The number of participants ranged from 17061 to 909.62 Followup of participants ranged from 6 weeks to 12 years. Of the 11 studies, 9 reported a relative measure of association.60–63 65–67 72 73 Eight studies 60–63 66 67 72 73 reported an adjusted relative measure of association. We have included these adjusted relative measures in Figure 7
Of the 11 studies, three used pre-pregnancy measures of obesity.61 63 73 Two of these studies reported a significant positive association between pre-pregnancy anthropometric measures and the development of type 2 diabetes.63 73 One study reported a protective effect of a higher anthropometric measure,61 and one study did not report the measure of association.63 Pallardo et al. found that as compared to women with a pre-pregnancy BMI less than or equal to 27 kg/m2, women with a BMI greater than 27 kg/m2 had an eight-fold increased risk of developing type 2 diabetes, after adjusting for the number of abnormal glucose results from the OGTT and C-peptide glucose score (OR = 8.7; 95 percent CI: 2.3 to 32.9).73 Jang et al. reported that for every 1-kg increase in pre-pregnancy weight, there was a 0.36 increase in the log odds of developing type 2 diabetes, although this relationship was not statistically significant (β = 0.36, SE = 0.10).63 One study61 reported a reduction in the likelihood of type 2 diabetes with higher BMI: Cho et al.61 reported that as compared to women with a pre-pregnancy BMI less than or equal to 23 kg/m2, women with a pre-pregnancy BMI greater than 23 kg/m2 were less likely (RR = 0.78; 95 percent CI: 0.27 to 2.2) to develop type 2 diabetes, after adjusting for age, gestational age at diagnosis of gestational diabetes, family history of type 2 diabetes, FBG at diagnosis, and homocysteine level. This reported association, however, was not statistically significant. We concluded that pre-pregnancy measures of obesity are associated with an increased likelihood of type 2 diabetes.
Three of the 11 studies used anthropometric measures during pregnancy. These studies reported a positive association between anthropometric measures and the development of type 2 diabetes.60 66 67 For example, Lobner et al. reported that women with a BMI greater than 30 kg/m2 were 50 percent more likely to develop type 2 diabetes than were women with a BMI less than 30 kg/m2, after adjusting for GAD and IA-2 antibody status, method of glucose control, parity, age, and serum CRP at 9 months (RH = 1.5; 95 percent CI: 1.0 to 2.2).66 In addition, Metzger et al. reported that as compared to women who were non-obese (<120 percent ideal body weight), women who were obese (≥ 120 percent ideal body weight) had an almost three-fold increased likelihood of developing type 2 diabetes, after adjusting for 3-hr integrated insulin level and parity.67 For each kg/m2 increase in BMI, Cheung et al. reported a 10 percent increase in the risk of developing type 2 diabetes (relative risk [RR] = 1.1; 95 percent CI: 1.0 to 1.2), after adjusting for age, parity, FBG at diagnosis, 2-hr OGTT, the number of prior pregnancies complicated by gestational diabetes, method of glucose control, family history of type 2 diabetes, and hospital location.60
Three studies evaluated anthropometric measures as time-dependent covariates, assessing the association of the change in these measures between delivery and followup with type 2 diabetes.64 69 72 Peters et al. showed that for every 10-pound change in weight, there was a 95 percent increase in the risk of developing type 2 diabetes, after adjusting for additional pregnancy, OGTT glucose area, postpartum BMI, and breastfeeding (RH = 2.0; 95 percent CI: 1.6 to 2.3).72 Although Kjos et al. and Xiang et al. included weight change in their multivariate analyses, the relative association of weight change with type 2 diabetes was not reported.64 69 Height was examined in one study, but the measure of association from the multivariate model was not reported.63 Because of multiple cohort studies and measures of association, we graded the overall evidence for anthropometric measures as moderate.
Xiang et al. did not find that progesterone-based contraceptives were consistently associated with an increased risk of type 2 diabetes. As compared to women who used combination oral contraceptives, those using depo-medroxyprogesterone acetate did not have an increased risk of type 2 diabetes in the entire cohort, after adjusting for postpartum BMI, breastfeeding, family history of type 2 diabetes, HDL cholesterol, triglycerides, and weight gain during followup (RH = 1.1; 95 percent CI: 0.6 to 1.9).64 This association did not differ by breastfeeding status.64 However, after adjusting for postpartum BMI, family history of type 2 diabetes, breastfeeding, HDL cholesterol, and weight gain during followup, use of depo-medroxyprogesterone acetate in women with triglycerides above the median of the population was associated with a two-fold greater risk of developing type 2 diabetes when compared to the use of a combination oral contraceptive and triglyceride levels below the population median (RH = 2.3; 95 percent CI: 1.1 to 4.8).64 We concluded that the limited number of studies available and the overall very low grade of evidence made it difficult to draw any firm conclusions regarding the relationship between progestin-only contraception and the development of type 2 diabetes among women with gestational diabetes.
FBG: antepartum. Five studies examined antepartum FBG at gestational diabetes diagnosis as a risk factor, and in all of these studies, FBG was a significant predictor of type 2 diabetes (see Figure 8
In unadjusted analyses, Steinhart et al. found that as compared to women with an FBG less than or equal to 5.83 mmol/L, those with an FBG greater than 5.83 mmol/L had an 11-fold increased risk of developing type 2 diabetes (OR = 11.1; 95 percent CI: 2.3 to 103.4).71 Cho et al. found that women with an FBG greater than 5.3 mmol/L had a four-fold increased risk of developing type 2 diabetes, after adjusting for age, gestational age at gestational diabetes diagnosis, pre-pregnancy BMI, family history of type 2 diabetes, and homocysteine level (RR = 4.0; 95 percent CI: 1.4 to 11.4).61 In another study, Schaefer-Graf et al. found an increased risk of type 2 diabetes with increasing quartiles of FBG, such that women in the highest quartile had a 21-fold increased risk of developing type 2 diabetes when compared to those in the lowest quartile (OR = 21.0; 95 percent CI: 4.6 to 96.3).70 Finally, Kjos et al. also found an increased risk of type 2 diabetes with increasing tertiles of FBG, with women in the highest tertile having a greater than two-fold increased risk when compared to those in the lowest tertile, after adjusting for postpartum OGTT glucose area under the curve, gestational age at gestational diabetes diagnosis, and antepartum OGTT glucose area under the curve (RH = 2.5; 95 percent CI: 1.3 to 4.9).68
Number of abnormal OGTT results. One study examined the number of abnormal OGTT results as a risk factor for subsequent development of type 2 diabetes.73 In this study, there was a three-fold increased risk of type 2 diabetes with each increase in the number of abnormal OGTT results, after adjusting for pre-pregnancy BMI and C-peptide glucose score (OR = 3.0; 95 percent CI: 1.4 to 6.4).73
Glucose tolerance test total. One study examined the OGTT total as a risk factor for type 2 diabetes. As compared to women with OGTT totals less than or equal to 41.63 mmol/L, those with a GTT total greater than 41.63 mmol/L had a 15-fold greater risk of developing type 2 diabetes (OR = 15.5; 95 percent CI: 2 to 678).71
1-hr glucose during the diagnostic OGTT. One study examined the 1-hr glucose level during the diagnostic OGTT as a risk factor for the development of type 2 diabetes.74 Buchanan et al. found that as compared to women with the lowest tertile of 1-hr plasma glucose during the diagnostic OGTT, women in the highest tertile had a 15-fold greater risk of developing type 2 diabetes, after adjusting for beta-cell compensation index and basal production rate (OR = 15.2; 95 percent CI: 1.4 to 166.3), and a 22-fold higher risk of developing type 2 diabetes, after adjusting for the OGTT 30-min incremental insulin:glucose ratio, basal glucose production rate, and insulin sensitivity index (OR = 22; 95 percent CI: 1.5 to 328.5).74
3-hr insulin level during the diagnostic OGTT. One study63 examined 3-hr insulin levels and found an inverse association between the insulin level and the risk of developing type 2 diabetes, after adjusting for pre-pregnancy weight, gestational age at gestational diabetes diagnosis, 2-hr glucose level, age, height, pre-pregnancy BMI, family history of type 2 diabetes, and weight at postpartum testing. A second study measured 3-hr integrated insulin levels and found no association with the development of type 2 diabetes.
30-minute incremental insulin:glucose ratio. Two studies examined the 30-min incremental insulin:glucose ratio from the antepartum OGTT.74 75 Both studies found it to be a predictor of type 2 diabetes. One study showed a non-significant 90 percent lower risk of type 2 diabetes in the highest versus the lowest tertile, after adjusting for incremental glucose area, diagnostic OGTT, frequently sampled intravenous glucose tolerance acute insulin response, basal glucose production rate, and insulin sensitivity index (OR = 0.1; 95 percent CI: 0.01 to 2.2), and a 92 percent lower risk after adjusting for 1-hr plasma glucose level during the diagnostic OGTT, basal glucose production rate, and insulin sensitivity index (OR = 0.08; 95 percent CI; 0.01 to 1.1).74
We concluded that increasing FBS or 2-hr glucose values on the diagnostic OGTT may indicate a higher likelihood of development of type 2 diabetes in women with gestational diabetes.
Area under the curve for postpartum OGTT. Two studies by the same author examined the postpartum OGTT area under the glucose curve and the risk of developing type 2 diabetes.68 69 In the one study in which measures of association were reported, the risk of type 2 diabetes increased with increasing quartiles of postpartum OGTT area under the glucose curve (p-value for trend < 0.0001), after adjusting for gestational age at gestational diabetes diagnosis, antepartum OGTT glucose area under the curve, and highest antepartum fasting glucose.68 As compared to those in the lowest quartile, those in the highest quartile had an 11-fold increased risk of type 2 diabetes (RH = 11.5; 95 percent CI: 4.5 to 29.1).68
We graded the overall body of evidence for metabolic risk factors as moderate. There was consistency in the association of 2-hr PPG and Antepartum OGTT glucose area under the curve.
Additional measures of glucose metabolism. One study by Buchanan et al. examined several additional measures of glucose metabolism as risk factors for type 2 diabetes, including basal glucose production rate, beta-cell compensation index, clamp insulin sensitivity, and frequently sampled intravenous glucose tolerance acute insulin response.74 A higher basal glucose production rate was associated with a non-significantly increased risk of type 2 diabetes in several multivariable models that included: (1) incremental glucose area on the diagnostic OGTT, frequently sampled intravenous glucose tolerance acute insulin response, OGTT 30-min incremental insulin:glucose ratio, and clamp insulin sensitivity (model 1); (2) 1-hr OGTT plasma glucose and beta-cell compensation index (model 2); and (3) 1-hr OGTT plasma glucose, OGTT 30-min incremental insulin:glucose ratio, and insulin sensitivity index (model 3).74
Greater beta-cell compensation index was associated with a 91 percent lower risk of developing type 2 diabetes, after adjusting for OGTT 1-hr plasma glucose level and basal glucose production rate. Greater clamp insulin sensitivity was associated with a non-significantly lower risk of developing type 2 diabetes, after adjusting for OGTT 1-hr glucose level, OGTT 30-min incremental insulin:glucose ratio, and basal glucose production rate in model 1 (OR = 0.2; 95 percent CI: 0.03 to 1.2) and after adjusting for diagnostic OGTT incremental glucose area, frequently sampled intravenous glucose tolerance acute insulin response, OGTT 30-min incremental insulin:glucose ratio, and basal glucose production rate in model 2 (OR = 0.15; 95 percent CI: 0.02 to 1.2).74 Finally, women in the highest tertile of frequently sampled intravenous glucose tolerance test acute insulin response had a 92 percent lower risk of developing type 2 diabetes than did those in the lowest tertile, after adjusting for diagnostic OGTT incremental glucose area, OGTT 30-min incremental insulin:glucose ratio, basal glucose production rate, and clamp insulin sensitivity (OR = 0.08; 95 percent CI: 0.01 to 1.0).74
One study examined C-peptide glucose score as a risk factor for type 2 diabetes.73 In this study, a higher C-peptide glucose score was associated with a 54 percent lower risk of developing type 2 diabetes, after adjusting for pre-pregnancy BMI and the number of abnormal OGTT results (OR = 0.46; 95 percent CI: 0.25 to 0.85).73 We included these additional measures of glucose metabolism in order to provide a comprehensive summary of potential risk factors for the development of type 2 diabetes. While we were unable to draw conclusions from this emerging area of investigation, this review provided insight into the physiologic pathways that are being studied to better define the risk of type 2 diabetes among women with gestational diabetes.
Blood pressure. Cho et al. found postpartum blood pressure to be a predictor of type 2 diabetes, although a relative measure for blood pressure was not reported in their multivariate models.62
Lipids. Two studies examined postpartum lipid parameters as predictors of type 2 diabetes,62 64 and in both of these studies, HDL cholesterol and triglycerides were risk factors for the development of type 2 diabetes; however, a relative measure for the lipid parameters was not reported in the multivariate models.62 64
Homocysteine. One study assessed homocysteine levels 6 weeks postpartum and found that women with homocysteine levels greater than 6.38 mmol had a greater than three-fold increased risk of developing type 2 diabetes when compared to those with homocysteine levels below this level, after adjusting for age, gestational age at gestational diabetes diagnosis, pre-pregnancy BMI, family history of type 2 diabetes, and FBG at diagnosis (RR = 3.6; 95 percent CI: 1.1 to 11.9).61
Autoantibodies. One study examined GAD and IA-2 antibodies as risk factors for type 2 diabetes and found that women with positive GAD or IA-2 antibodies had a four-fold increased risk of type 2 diabetes when compared to women who were antibody negative, after adjusting for the method of glucose control, BMI, parity, age, and serum CRP (RH = 4.1; 95 percent CI: 2.6 to 6.7).66 We were unable to draw meaningful conclusions based on the available evidence, but we have included summaries of these traditional (i.e., lipids, blood pressure) and novel measures to provide a comprehensive review of available risk factors for the development of type 2 diabetes.
Additional studies of risk factors for type 2 diabetes. We identified 11 studies that investigated factors associated with incident type 2 diabetes following a pregnancy complicated by gestational diabetes, but these studies did not include relative measures of risk or multivariate models.76–86 While these studies are important for qualitatively identifying risk factors, we consider them to provide the lowest level of evidence because there was no adjustment for potential confounders or relative estimates. The evidence is briefly discussed below by risk factor category.
Family history of type 2 diabetes: No additional studies.
Sociodemographics: Two studies investigated maternal age. Greenberg et al.83 compared maternal ages according to diabetic status at followup and did not find any statistical differences, while Dalfra et al.80 did find an association. Two studies, Kousta et al. and Ali et al.,77 87 examined the incidence of type 2 diabetes as stratified by race. Both studies found a higher incidence among black and Asian-Indian women than in European women or women of mixed ethnicity.
Maternal lifestyle factors: No additional studies.
Parity: Only one study, Linne et al.,79 compared parity in women with and without type 2 diabetes at followup. No association was observed.
Pregnancy-related factors: Younger gestational age at diagnosis was consistently associated with increased incidence of type 2 diabetes in three studies: Greenberg et al.,83 Bartha et al.,82 and Dalfra et al.80 Insulin use during pregnancy was consistently associated with increased type 2 diabetes in two studies: Greenberg et al.83 and Dalfra et al.80 Class A2 gestational diabetes was associated with increased type 2 diabetes in one study, that of Kjos et al.85 Greenberg et al.83 found that cesarean delivery, shoulder dystocia, and birthweight percentile did not differ between women who did and did not develop type 2 diabetes during followup.
Postpartum factors: Kjos et al.84 compared women who did and did not breastfeed following a pregnancy complicated by gestational diabetes and found that women who breastfed had a decreased incidence of type 2 diabetes.
Anthropometric measures: BMI was investigated in six studies: Bian et al.,81 Greenberg et al.,83 Pallardo et al.,78 Dalfraet et al.,80 Linne et al.,79 and Bartha.82 There was a significant relationship between higher BMI and increased type 2 diabetes in all but one study.83 Pallardo et al.78 found that women who had developed type 2 diabetes during followup had higher current weight but did not differ in pre-pregnancy weight, weight change, or body fat percentage from women without type 2 diabetes at followup. Waist circumference was found to be associated with type 2 diabetes by Pallardo et al.78 but was not found to be associated by Linne et al.79 Waist-to-hip ratio was also not associated with type 2 diabetes in the study by Linne et al.79
Oral contraceptive use: Kjos et al.85 found no difference in the incidence of type 2 diabetes in women using non-oral contraceptives, ethinyl estradiol-norethindrone, or ethinyl estradiol-levonorgestrel.
Metabolic risk factors: Increased fasting glucose was consistently higher in women developing type 2 diabetes during followup in four studies: Xiang et al.,76 Linne et al.,79 Dalfra et al.,80 and Greenberg et al.83 Higher HbA1c was consistently associated with increased type 2 diabetes in two studies: Linne et al.79 and Greenberg et al.83 Decreased beta-cell compensation was associated with higher risk of type 2 diabetes in one study, Xiang et al.76 Plasma glucose levels at 2- and 3-hr during the diagnostic OGTT were found to be associated with increased type 2 diabetes in one study, Dalfra et al.,80 but not associated in another, Greenberg et al.83 Greenberg et al.83 did find a difference in 1-hr OGTT between women developing type 2 diabetes during followup and those who remained normoglycemic. Dalfra et al.80 also found postprandial plasma glucose, plasma insulin at 30 min during the OGTT, and postpartum plasma glucose area under the curve to be associated with type 2 diabetes. While Linne et al.79 found blood pressure and lipids to be similar in women with and without type 2 diabetes at followup, Pallardo et al.78 found significant differences in triglycerides and diastolic blood pressure but not HDL cholesterol, total cholesterol, or systolic blood pressure in women with and without type 2 diabetes at followup.
Two studies60 62 with well-defined approaches to the development of the multivariate models deserve further comment. In their investigation of the relationship of eight different obesity indices with onset of type 2 diabetes, Cho et al.62 followed 909 Korean women for a mean of 2.13 ± 1.75 years. The authors first stratified the study population into three groups (normal glucose tolerance, impaired glucose tolerance, and type 2 diabetes) and performed a univariate analysis, examining the distribution of each of the seven obesity measures and relevant sociodemographic and clinical risk factors across the three groups of participants. Data were collected on risk factors that had been defined prior to the initiation of the study. Each obesity measure was then recategorized into quartiles (75th percentile compared to 25th percentile), and the association of each measure with type 2 diabetes was assessed using simple logistic regression. Correlations between obesity measures and other covariates were assessed. Only those factors that were statistically significantly associated with type 2 diabetes in the univariate analysis were included as covariates with the obesity measures in the final prediction model. These factors were blood pressure, lipid profile, age, duration of followup, parity, family history of type 2 diabetes, and working status. All eight of the obesity measures were associated with type 2 diabetes. Waist circumference was the strongest predictor (OR = 5.8; 95 percent CI: 2.0 to 11.8). After adjustment for covariates, the association of waist circumference with postpartum type 2 diabetes was moderately attenuated (OR = 3.4; 95 percent CI: 1.8 to 2.2) but remained statistically significant, as did the other six obesity measures. Although there was no R2 to assess the relative fit of the model, we conclude that the reported multivariate model was adjusted for covariates that are relevant both clinically and statistically to obesity and type 2 diabetes and were appropriately included in the model. Cheung et al.60 reported findings from Cox regression analyses. The authors chose to include factors that were clinically related to both type 2 diabetes and to underlying insulin resistance (as evidenced by fasting hyperglycemia in pregnancy): age, parity, BMI, number of episodes of prior gestational diabetes, family history of type 2 diabetes, and insulin use versus diet alone in pregnancy. We concluded that these authors appeared to have based the selection and adjustment of covariates on the a priori hypothesis of a relationship with hyperglycemia and the established association with type 2 diabetes in the development of the best predictive model. Both studies represented a systematic approach to the development of multivariate models for assessing the direction and magnitude of association of risk factors with type 2 diabetes.
What are the performance characteristics (sensitivity, specificity, and reproducibility) of tests for diagnosing type 2 diabetes after pregnancy in patients with a history of gestational diabetes? Are there differences in the performance characteristics of the test results based on subgroup analysis?
The prevalence of type 2 diabetes is increasing in the United States and globally.1 Early detection and treatment of diabetes has been associated with improved outcomes related to microvascular complications and may prevent macrovascular complications as well.88 Women with gestational diabetes are at an increased risk of developing type 2 diabetes. An estimated 16 to 63 percent of women with gestational diabetes will develop type 2 diabetes in the 5 to 10 years immediately following pregnancy.29 While postpartum screening for type 2 diabetes among women with gestational diabetes has been supported by the ADA17 and ACOG,7 there is debate about which screening test to use and at what interval to screen. These are important questions for both clinical providers and public health officials. The fact that only limited evidence is available with regard to screening test performance in women with a history of gestational diabetes has prolonged the debate and perhaps delayed a consensus on appropriate screening. To further define our efforts in addressing this topic, we developed a conceptual framework (see Figure 9
Despite the known risk of type 2 diabetes among women with gestational diabetes, only 75 percent of ACOG fellows reported that they routinely perform postpartum screening with the 75-gm OGTT. Followup varies widely, and many women do not receive the recommended screening for type 2 diabetes.19 89 The barriers to use of the OGTT include the cost and inconvenience for a new mother. However, there is insufficient evidence supporting the use of an alternative screening test, such as the FBG. A recent cost-effectiveness analysis examined models for screening and found the OGTT to be cost-effective if used at 3-year intervals. Screening with the FBG was cost-effective if used at 1-year intervals.90 More precise knowledge of the performance characteristics of these tests may help improve our estimates of the effectiveness and total costs associated with screening.
In this report, we summarize and critically appraise the literature on the performance of currently available screening tests for postpartum glucose screening in order to support the development of clinical guidelines for postpartum glucose surveillance.
| FBG | AND/OR | 2-hr PG after 75-gm OGTT | |
|---|---|---|---|
| NDDG 197991 | ≥ 7.8 mmol/L (140 mg/dL) | ≥ 11.1 mmol/L (200 mg/dL) | |
| WHO 198592 | ≥ 7.8 mmol/L (140 mg/dL) | ≥ 11.1 mmol/L (200 mg/dL) | |
| WHO 199993 | ≥ 7.0 mmol/L (126 mg/dL) | ≥ 11.1 mmol/L (200 mg/dL) | |
| ADA 199717 | ≥ 7.0 mmol/L (126 mg/dL) | NA | |
| ≥ 7.0 mmol/L (126 mg/dL) | ≥ 11.1 mmol/L (200 mg/dL) |
2-hr PG = 2-hr plasma glucose; ADA = American Diabetes Association; dL = deciliter; FBG = fasting blood glucose; gm = gram; L = liter; mg = milligram; mmol = millimole; NA = not applicable; NDDG = National Diabetes Data Group; OGTT = oral glucose tolerance test; WHO = World Health Organization
The majority of the studies screened for type 2 diabetes within 1 year of delivery.94–97 100 Two studies98 101 reported wide ranges of postpartum testing intervals, from 1 to 86 months and from 6 to 72 months, respectively. Only one study99 conducted late screening of all subjects (between 4 and 8 years postpartum).
Overview of studies evaluating comparison and reference tests for type 2 diabetes. Our review yielded three general comparisons: (1) two different diagnostic threshold values applied to the 75-gm OGTT (the WHO 1985 criterion versus the WHO 1999 criterion); (2) FBG level greater than 7.0 mmol/L (126 mg/dL) (ADA 1997) and the 75-gm OGTT (WHO 1999); and (3) FBG greater than 7.0 mmol/L (126 mg/dL) (ADA 1997) and the 75-gm OGTT (WHO 1985).
| REFERENCE TEST | |||
| Positive by OGTT | Negative by OGTT | ||
| COMPARISON TEST | Positive by FBG | TP = 2 | FP = 1 |
| Negative by FBG | FN = 0 | TN = 117 | |
| TP + FN = 2 | FP + TN = 118 | ||
Sensitivity: # TP/(# TP + # FN)= 2/2= 100 percent, 95 percent CI: 16–100 percent
Specificity: # TN/ (# TN + # FP)=117/118=99 percent, 95 percent CI: 95–100 percent
FBG = fasting blood glucose; FN = false negatives; FP = false positives; OGTT = oral glucose tolerance test; TN = true negatives; TP = true positives
Studies of different diagnostic threshold values applied to the 75-gm OGTT. Two studies97 101 compared different threshold values for the OGTT. They reported the same specificity of 98 percent for the OGTT using a threshold of FBG greater than 7.0 mmol/L (126 mg/dL) (WHO 1985) and using a threshold of FBG greater than 7.8 mmol/L (140 mg/dL) (see WHO 1999) (see Figure 10
Studies of FBG level greater than 7.0 mmol/L (126 mg/dL) (comparison test; ADA 1997) as compared to the 75-gm OGTT (reference test; WHO 1999). Three studies94 95 99 reported data in which a single FBG greater than 7.0 mmol/L (126 mg/dL) (ADA 1997) was compared to an FBG greater than 7.0 mmol/L (126 mg/dL) or 2-hr plasma glucose after 75-gm OGTT greater than 11.1 mmol/L (200 mg/dL) (WHO 1999). The sensitivity for the FBG greater than 7.0 mmol/L (126 mg/dL) alone compared with a complete OGTT using the same FBG threshold (FBG > 7.0 mmol/L [126 mg/dL]) or 2-hr plasma glucose after 75-gm OGTT greater than 11.1 mmol/L (200 mg/dL) varied across the three studies, ranging from 46 to 89 percent (see Figure 11
These three studies94 95 99 were heterogeneous because postpartum testing occurred less than 6 months after delivery in two studies94 95 but 4 to 8 years after delivery in the third study.99 In addition to this longer time period after delivery, the study population in the third study99 had a high prevalence of non-whites (previously reported by the Brazilian Gestational Diabetes Study Group)102, which may have affected the test performance.
Studies of FBG greater than 7.0 mmol/L (126 mg/dL) (comparison test; ADA 1997) as compared to the 75-gm OGTT (reference test; WHO 1985). Five studies95 96 98 100 101 compared an FBG greater than 7.8 mmol/l (140 mg/dL) or a 2-hr plasma glucose level after 75-gm OGTT of greater than 11.1 mmol/l (200 mg/dL) (WHO 1985) as the reference test to an FBG greater than 7.0 mmol/L (126 mg/dL) (ADA 1997) as the comparison test.
These studies consistently reported high specificity (range: 94 to 99 percent). However, the sensitivities ranged from 14 to 100 percent (see Figure 12
One study98 reported very low sensitivity for an FBG greater than 7.0 mmol/L (126 mg/dL) when compared to a reference OGTT with an FBG greater than 7.8 mmol/l (140 mg/dL) or 2-hr plasma glucose level after 75-gm OGTT greater than 11.1 mmol/l (200 mg/dL). This study population differed from the other studies' samples because 23 percent of the subjects were excluded from screening as a result of a new diagnosis of type 1 or 2 diabetes postpartum. Also, the study population was entirely Polish. These two study characteristics may have reduced the spectrum of risk for type 2 diabetes in the screened population as compared to other clinical populations, thereby lowering the test's sensitivity.
Subgroup analysis. Only one study94 included analyses of high-risk subgroups: In this study, the FBG greater than 7.0 mmol/L (126 mg/dL) (ADA 1997) alone was compared to a complete OGTT (FBG > 7.0 mmol/L (126 mg/dL) or 2-hr plasma glucose after 75-gm OGTT greater than 11.1 mmol/L (200 mg/dL) (WHO 1999)). In 168 subjects with a family history of type 2 diabetes, the sensitivity was 47 percent (95 percent CI: 24 to 71 percent). In another 168 subjects who required insulin during pregnancy, the sensitivity was 55 percent (95 percent CI: 32 to 76 percent). We concluded that the FBG may perform better in subgroups with a family history of type 2 diabetes or that required insulin during pregnancy than in the general population, as reported in a single study.94
Test reproducibility. Test reproducibility affects diagnostic test accuracy. Five studies95–97 100 101 reported the type of laboratory equipment used to test samples as an indicator of quality control. Three articles reported the kappa statistic as the measure of agreement between the results of the comparison and reference test, but not as a standard measure of single-test reproducibility.95 98 100
For quantitative assays such as measures of blood glucose, the STARD Initiative recommends calculating imprecision as the coefficient of variation by repeating the test over several days.24 One study96 reported the coefficient of variation: Holt et al. reported the coefficient of variation for plasma glucose testing using the specified laboratory equipment and assay. The coefficient of variation for this assay was 1.2 percent at 3.3 mmol/L and 1.49 percent at 16.5 mmol/L.96
One study did not meet our inclusion criteria because it did not report the method of diagnosing gestational diabetes, but it is notable because it focused on the question of reproducibility of the OGTT using FBG greater than 7.0 mmol/L (126 mg/dL) or 2-hr plasma glucose after 75-gm OGTT greater than 11.1 mmol/L (200 mg/dL) (WHO 1999).104 The study population consisted of 696 Caucasian women with previous gestational diabetes at a median of 6.2 years postpartum. Women were administered an OGTT, which was repeated within 3 months when it met the criteria for diabetes. Type 2 diabetes was confirmed in only 60 percent of the women.
Limitations. There were several key limitations of these studies. First, six studies95–98 100 101 used the 2-hr 75-gm OGTT with the FBG greater than 7.8 mmol/L (>140 mg/dL) (WHO 1985) threshold as a reference. This test may no longer be clinically useful, given current recommendations to use a threshold of FBG greater than 7.0 mmol/L (>126 mg/dL) as part of the OGTT (WHO 1999).
Overall, the study quality was poor. The studies were limited by their sampling methods, specifically the use of convenience samples that had high losses to followup. It is not clear whether the higher-risk patients are more or less likely to attend their postpartum followup visits to receive type 2 diabetes screening, and any such pattern may vary according to the country studied. In any case, the high loss to followup clearly limited the generalizability of the results.
Given the increase in obesity and sedentary lifestyles in the United States, the prevalence of gestational and type 2 diabetes among reproductive-aged women is expected to rise over the next decade. Both obstetrical and primary care physicians care for a growing number of women with gestational diabetes who are at increased risk of developing type 2 diabetes. For decades, obstetricians and primary care physicians have debated the optimal labor management and postpartum followup of women with gestational diabetes. Clinicians, public health advocates, and health policymakers have identified the need for evidenced-based practice guidelines for labor and postpartum management of women with gestational diabetes.
To identify the evidence on labor and postpartum management of gestational diabetes, the AHRQ, in conjunction with the ACOG, requested an evidence report on four distinct questions. We applied rigorous selection criteria and assessed the quality of each study, using a clinical and public health framework to guide our review. Our report is limited to gestational diabetes in which the diagnosis was confirmed by an OGTT, thereby ensuring that our review includes women with a definitive diagnosis of gestational diabetes. This evidence report outlines a comprehensive review of all the available research. In this final chapter, we first review the major findings pertaining to each question and the strength of the overall evidence; we then present our conclusions, make recommendations for future research, and offer clinical and public health perspectives.
What is the evidence for the risks and benefits of oral diabetes agents (e.g., second-generation sulfonylureas and metformin), as compared to all types of insulin, for both the mother and neonate in the treatment of women with gestational diabetes?
Relatively few studies have examined the effect of oral agents or insulin analogues, as compared to insulin, on a number of significant maternal and neonatal outcomes in women with gestational diabetes. Only three RCTs assessing the efficacy of glyburide and insulin met our inclusion criteria,32 33 37 and only two maternal outcomes were evaluated in more than one RCT: cesarean delivery and maternal glycemic control. There was little difference in maternal outcomes between those treated with glyburide and those receiving insulin. In the largest trial (n = 404) comparing glyburide and insulin, 49 percent of the women on insulin underwent cesarean delivery, as compared to 46 percent of those on glyburide.32 A second trial37 reported no difference in cesarean delivery rates for 51 women on glyburide, insulin, or acarbose. Three trials found no statistically significant differences in glucose control between women treated with insulin and those receiving glyburide.32 32 33 There was one study that considered pre-eclampsia, two studies that included maternal weight, and two studies that included information on maternal hypoglycemia. There were no available data with regard to perineal tears, operative vaginal delivery, or postpartum hemorrhage. Because of the small number of RCTs and the lack of consistency in the maternal outcomes measured across studies, we graded the overall strength of evidence as very low.
Only four neonatal outcomes were evaluated by more than one RCT: birth weight, LGA, macrosomia, and neonatal hypoglycemia. We conducted a meta-analysis of three RCTs with a total of 478 pregnancies. There was a lower average infant birth weight in the insulin group as compared to the glyburide group (-93 gm; 95 percent CI: -119 to 5). This difference was not statistically significant and is unlikely to have substantial clinical influence. We were unable to draw any definitive conclusions regarding neonatal hypoglycemia, given the limited data available. Langer et al.32 reported no significant difference between glyburide and insulin in the percentage of infants with hypoglycemia (9 percent versus 6 percent, p = 0.25), but Bertini et al.37 reported a higher percentage of infants with hypoglycemia in the glyburide group than in the insulin or acarbose groups (33 percent compared to 4 percent and 5 percent, respectively; p = 0.006). Several of our neonatal outcomes of interest were not included in the RCTs reviewed. Therefore, we were unable to draw conclusions about anoxia, birth trauma, respiratory distress syndrome, or shoulder dystocia.
We extended our review of the literature on insulin and glyburide to include four observational studies.45–48 None of the observational studies were strong enough to justify a modification of the conclusions drawn from the RCTs.
Two RCTs compared the effect of insulin lispro and regular insulin on maternal and neonatal outcomes in women with gestational diabetes.34 36 We concluded that there was little difference in maternal glucose control (glycosylated hemoglobin or 1-hr glucose levels) between the women treated with insulin lispro and those treated with regular insulin. Neither Jovanoic36 nor Mecacci34 reported significant differences in mean infant birth weight between the insulin lispro and regular insulin groups. We concluded that no evidence exists to suggest that neonatal outcomes differ between those treated with regular insulin and those receiving insulin lispro. The limited number of trials, limited sample size, and paucity of information on neonatal outcomes made it difficult to draw any firm conclusions.
There was insufficient evidence to draw meaningful conclusions about the effect of long-acting versus short-acting insulin, twice-daily versus four-times-daily use of regular insulin, or diet alone versus diet plus insulin. In one study comparing long-acting to short-acting insulin, there was a higher percentage of infants with macrosomia in the long-acting insulin group than in the short-acting insulin group.31 Limited data from one RCT35 suggested that twice-daily insulin may be associated with worse neonatal outcomes (neonatal hypoglycemia, macrosomia, LGA, and SGA) than four-times-daily use of insulin. We found no evidence to suggest a difference in maternal outcomes between twice-daily and four-times-daily use of regular insulin. In the study by Thompson,30 women were randomized to diet alone or diet plus a fixed insulin regimen that included 20 units of NPH insulin and 10 units of regular insulin. There was no reported difference in maternal glucose control or the proportion of women undergoing cesarean delivery in the two groups. In terms of neonatal outcomes, infant birth weight was higher in the diet-alone group than in the diet and insulin group. Similarly, there was a higher proportion of infants with macrosomia in the diet-alone group. These findings must be viewed with caution because the overall strength of the evidence for diet compared to insulin and dietary management was very low.
We did not identify any studies that compared metformin with other diabetes medications in women with gestational diabetes. Also, we found no evidence regarding maternal or neonatal outcomes as related to the level of glucose at the initiation of a medication.
What is the evidence that elective cesarean delivery or the choice of timing of induction in gestational diabetes results in beneficial or harmful maternal and neonatal outcomes?
There is little evidence on the effect of gestational age or EFW on the timing of labor induction or performance of elective cesarean delivery in women with gestational diabetes. The findings from one experimental study55 suggested that active induction of labor at 38 weeks of gestation reduced infant birth weight (3,672 gm versus 3,446 gm; p < 0.01) and rates of macrosomia (27 percent versus 15 percent; p = 0.05) when compared to expectant management, with no concomitant increase in the rate of cesarean delivery (25 percent in the active induction group versus 31 percent in the expectant management group; p = 0.43). While these results suggested that maternal outcomes might be better in women who undergo elective induction, we were unable to draw firm conclusions based on this one trial.
Observational studies52–54 56–59 provided some additional evidence of a reduction in macrosomia and shoulder dystocia with elective labor induction or cesarean delivery, when compared to expectant management. For example, in the study by Conway,53 women with diabetes underwent ultrasonographic estimates of fetal weight between 37 and 38 weeks of gestation. Women whose EFW was greater than or equal to 4,250 gm underwent cesarean delivery; those whose EFW was estimated at less than 4,250 gm but LGA (defined as ≥ 90th percentile for the gestational age in their population) underwent labor induction. Fewer infants were macrosomic (weighing 4,000 gm or more) in the group undergoing elective cesarean or labor induction than in the expectant management group (8.9 percent versus 11.6 percent; p = 0.04). In addition, the incidence of shoulder dystocia was higher in the expectant management group (OR = 1.9, 95 percent CI: 1.0 to 3.5) than in the group undergoing elective cesarean or labor induction. The overall strength of evidence on this comparison was graded as very low. Only one of the observational studies adjusted for potential confounders,56 so any measures of association may have been biased. Second, there may have been selection bias in the recruitment of women to participate in the studies. Third, there was substantial heterogeneity in terms of the comparison groups, length of followup, and outcome measures included in the analysis. Fourth, the four primary observational studies were conducted over a wide timeframe. It would be difficult to adequately adjust for changes in practice patterns and treatment modalities that occurred over the long time periods of the studies.
What risk factors, including but not limited to family history, physical activity, pre-pregnancy weight, and gestational weight gain, are associated with short-term and long-term development of type 2 diabetes following a pregnancy with gestational diabetes?
Several factors were associated with the development of type 2 diabetes in women with previous gestational diabetes. Anthropometric measures before, during, and after pregnancy were found to be positively associated with the development of type 2 diabetes in 10 of 11 cohort studies. Waist circumference and BMI were the strongest anthropometric measures associated with type 2 diabetes in gestational diabetic women. Early gestational age at diagnosis of gestational diabetes (primarily less than 24 weeks) and use of insulin versus diet for glucose control were key pregnancy-related clinical factors that were positively associated with type 2 diabetes. Physiologic measures, including FBG and 2-hr plasma glucose levels during the diagnostic OGTT, were also associated with development of type 2 diabetes. Higher blood glucose following a screening 50-gm GCT, prior gestational diabetes, and OGTT area under the curve during both the antepartum and postpartum periods were positively associated with development of type 2 diabetes, but the strength of the associations was not consistent across studies. There is conflicting data on progesterone-only contraceptive use and the risk for developing type 2 diabetes. Elevated postpartum homocysteine levels were positively associated with type 2 diabetes in one study. Surprisingly, there were no studies of lifestyle factors in women with gestational diabetes that met our review criteria.
After a review of the available evidence, we concluded that the strongest epidemiological risk factors were anthropometric measures prior to pregnancy and during both the antepartum and postpartum periods. Taking into consideration the quantity, quality, and consistency of the studies evaluating the association of risk factors for type 2 diabetes following a pregnancy with gestational diabetes, we graded the strength of the evidence as very low. While there was substantial consistency in the direction of association across studies for many of the risk factors, there was considerable variation in the covariates adjusted for in multivariate models across studies.
What are the performance characteristics (sensitivity, specificity, and reproducibility) of tests for diagnosing type 2 diabetes after pregnancy in patients with a history of gestational diabetes? Are there differences in the performance characteristics of the test results based on subgroup analysis?
Several studies have pointed to poor physician compliance with postpartum glucose screening for type 2 diabetes among women with a history of gestational diabetes.19 20 We reviewed the available studies of the diagnostic accuracy of screening for type 2 diabetes in this population. We identified 8 studies and 10 evaluations of screening tests, with three types of comparisons:
two different diagnostic fasting value thresholds applied to the 75-gm OGTT (the WHO 1985 criteria compared to the WHO 1999 criteria);
single FBG level greater than 7.0 mmol/L (126 mg/dL) (ADA 1997) compared to the 75-gm OGTT (WHO 1999); and
single FBG greater than 7.0 mmol/L (126 mg/dL) (ADA 1997) compared to the 75-gm OGTT (WHO 1985).
For the first comparison, we concluded that there was acceptable specificity (98 percent) for the OGTT using either a FBG value greater than 7.0 mmol/L (126 mg/dL) or greater than 7.8 mmol/L (140 mg/dL). For the second comparison, we were unable to draw meaningful conclusions. The sensitivities for a single FBG greater than 7.0 mmol/L (126 mg/dL), as compared to a complete OGTT using the same FBG threshold, ranged from 46 to 89 percent in the three studies. For the third comparison, there were five studies, which reported a high specificity of the FBG greater than 7.0 mmol/L (126 mg/dL). However, there was a wide range of sensitivity, from 14 to 100 percent.
The six studies95–98 100 101 that used an FBG threshold greater than 7.8 mmol/L (140 mg/dL) in the reference test may be obsolete, since current guidelines recommend an FBG greater than 7.0 mmol/L (126mg/dL)17 93 105 The wide variation in the reported sensitivities for studies that compared the OGTT as the reference test to a single FBG greater than 7.0 mmol/L (126 mg/dL) may reflect differences in the study samples' risk for type 2 diabetes, based on heterogeneity of study design and population. The overall strength of evidence was very low because of the high loss-to-followup rates (22 to 82 percent) for studies using clinic convenience samples.
Based on the available data outlined in Chapter 3, we have made the following conclusions:
Key Question 1: Little evidence exists to guide patients, health care providers, or policymakers in the choice of treatment for gestational diabetes. We were unable to draw firm conclusions from any of the five treatment comparisons in Key Question 1 because of the availability of only a limited number of studies within each category of comparison, a lack of consistency in the outcomes measured across studies, and heterogeneity in the definition of outcome measures. Limited evidence demonstrated no substantial clinical differences in maternal or neonatal outcomes with the use of glyburide or insulin lispro as compared to insulin in women with gestational diabetes. Our meta-analysis of three studies showed a small difference in infant birth weight. We expect little clinical relevance for the weighted mean difference of 93 gm. The results of our meta-analysis provide additional information for clinicians to incorporate into their discussions with patients about their choice of treatment but are unlikely to affect current clinical practice.
We did not identify any available evidence on variations in maternal or neonatal outcomes based on the level of glucose at the initiation of a medication. Each of the clinical trials and observational studies reviewed specified threshold glucose levels for the initiation of medical treatment as part of the study protocol. However, none of the studies compared outcomes based on glucose thresholds in their evaluation of maternal or neonatal outcomes. Findings from the HAPO study may provide further insights. Therefore, we were unable to provide evidence for this portion of Key Question 1. We were also unable to identify any published studies comparing metformin to diet, insulin, or insulin analogues in women with gestational diabetes. However, the ongoing MiG trial will likely provide evidence regarding the comparative effects of metformin and insulin on maternal and neonatal outcomes.
Key Question 2: Little evidence exists to guide health care providers in the use of EFW or gestational age in making decisions about the timing of labor induction or elective cesarean delivery. We identified only one relevant RCT. The findings from the few available observational studies were difficult to interpret because of variations in clinical practice over the time period of the studies. Furthermore, serious methodological limitations made it difficult to draw firm conclusions. While our review does provide physicians and other health care providers with a summary of the gaps in the available evidence, further study involving clinical trials or well-designed observational studies is necessary to effect modifications in clinical management and inform development of clinical pathways.
Key Question 3: There was consistent evidence that anthropometric measures (i.e., weight, BMI, and waist circumference) prior to pregnancy and during both the antepartum and postpartum periods were positively associated with development of type 2 diabetes. These findings suggested similar risk factors for type 2 diabetes in reproductive and middle-aged women. Moreover, it appeared that weight and the distribution of weight were strong predictors of type 2 diabetes in this special population of women. Metabolic risk factors, including higher FBG at diagnosis of gestational diabetes, high glucose levels in oral glucose tolerance testing, insulin-requiring gestational diabetes, and glucose AUC for antepartum glucose tolerance testing, were consistently associated with an increased likelihood of type 2 diabetes. The relationship between progesterone-only contraception use and the risk of type 2 diabetes in women with a prior history of gestational diabetes, however, remains unclear. There was no evidence available regarding the potential effect of lifestyle factors (e.g., physical activity) on the development of type 2 diabetes in women with a prior history of gestational diabetes. Further investigation, ideally involving RCTs, would provide evidence for the primary prevention of type 2 diabetes in this high-risk group. Such evidence could then be incorporated into preconception and prenatal care education.
Key Question 4: We were unable to draw meaningful conclusions from the limited evidence available for our review. As compared to the 75-gm OGTT, the FBG had high specificity, but the sensitivity was variable across studies. As a result of heterogeneity in the study design, recruited population, and interval of followup testing, we were unable to draw firm conclusions about the performance characteristics of the FBG in women with a history of gestational diabetes. There was also insufficient evidence regarding test reproducibility. Until the appropriate intervals for followup testing are realized, further investigations would benefit from an interdisciplinary clinical approach. While obstetricians may provide immediate postpartum screening, general practitioners, internists, and other health care providers will likely provide long-term followup. With the increasing prevalence of childbearing among older women, pregnant women more commonly receive care from an obstetrician-gynecologist and either an internist or other primary care provider. Thus, an interdisciplinary dialogue among providers will be necessary to influence future care.
While basic science research and investigations using animal models have helped us to better understand the underlying pathophysiology of gestational diabetes, there is a wide gap in our clinical knowledge with regard to how potential treatments and postpartum management can benefit both mothers and infants. Future research should be directed toward filling this gap by conducting studies that will lead to the development of evidenced-based guidelines for maternal glucose control and physician recommendations for labor induction, elective cesarean, and expectant management. In addition, future research should focus on risk factors for type 2 diabetes in this high-risk population and on developing effective screening modalities for identifying women who are at risk for developing type 2 diabetes.
Further RCTs are needed to better assess maternal and neonatal outcomes in women with gestational diabetes who are being treated with insulin, insulin analogues, metformin, or glyburide. Future trials should specify a priori hypotheses and conduct power analyses prior to recruitment to ensure the ability to detect small differences in maternal glucose levels that can affect fetal weight and the risk of macrosomia, as well as common outcomes such as cesarean delivery. Power analyses will aid researchers in detecting differences in less common but critically important outcomes, such as shoulder dystocia and birth trauma. Clinical trials designed to capture these differences can offer important information and help us to draw reasonable and firm conclusions. Finally, intention-to-treat analysis will be essential to the ability to draw firm conclusions from the reported data. Consistency in the collection of outcome measures across studies is essential to our ability to draw confident conclusions. Furthermore, it would help to have more consistent definitions of clinical outcomes, including maternal and neonatal hypoglycemia, so that clinicians and investigators can better compare results across multiple studies. Observational studies in this area should be prospective, with protocols developed to minimize loss to followup. Adjustment for covariates will be of paramount importance for determining true estimates of the association of treatment choice with maternal and neonatal outcomes.
Well-designed RCTs comparing elective induction and cesarean delivery to expectant management would provide relevant, critical data to practitioners. These trials should incorporate appropriate methods of randomization and an intention-to-treat analysis, as well as power calculations with estimated effect sizes for mothers and infants. We acknowledge the potential barriers to performing clinical trials in pregnant women. Clinical trials with regard to labor management may be particularly difficult in the current obstetrical environment, which is highly litigious and influenced by patient and provider preferences for care. Well-designed observational studies are a reasonable alternative and can provide the necessary data to guide the development of clinical practice guidelines for labor management. Observational studies should primarily focus on insulin-requiring gestational diabetics (i.e., class A2), since this population is at higher risk of macrosomia or cesarean delivery. Alternatively, observational studies of diet and insulin-controlled gestational diabetics might include stratified analyses, which would provide outcome data at different levels of severity. Finally, future studies should adjust for other potential confounders, including sociodemographics and clinical factors related to intrapartum management.
Our review of 16 cohort studies identified several risk factors that are amenable to targeted interventions. One limitation of the current body of literature, however, is the inconsistency in the specific risk factors that have been assessed. Future studies should first focus on specific categories of risk factors, such as anthropometric measures (e.g., weight, BMI) or reproductive-related factors (e.g., parity). Second, future studies should collect data on pertinent covariates and adjust for relevant confounders in multivariate analysis. Third, women should be recruited for longitudinal study at the time of diagnosis of gestational diabetes. Fourth, several studies included in this review were based on convenience sampling, which may have biased the results; random or purposeful sampling of participants would yield a more representative group of participants.
Early identification of women with type 2 diabetes is paramount to achieving high quality of care and the ability to avoid diabetic complications due to delays in diagnosis. Future studies should focus on comparisons of the FBG and the standard 75-gm OGTT in postpartum women. Such comparisons would provide relevant data on the ability to screen women with a simple, time-efficient test, as compared to the burdensome OGTT. Studies should be conducted in diverse populations so that there is confidence that the findings are generalizable to other populations. The conduct of these studies in certain sub-groups (e.g., women with a family history of type 2 diabetes or prior gestational diabetes) is also warranted. Finally, studies of the reproducibility of test results will be critical to the development of broadly acceptable clinical guidelines for testing.
The results of this systematic review have important implications for clinical practice and public health policy. Clinicians and policymakers should be aware that the available data, while limited, do not suggest that there are adverse maternal or neonatal outcomes associated with the use of oral diabetic agents (i.e., glyburide), insulin lispro, or various insulin regimens. The efficacy of insulin analogs or glyburide in achieving maternal glucose targets or preventing episodes of maternal or neonatal hypoglycemia remains unclear. Several measures of maternal and neonatal morbidity, such as perineal tears, operative vaginal delivery, have not been evaluated, and several measures have only been evaluated in one or two studies. Also, it is unclear what glucose thresholds should be used to initiate insulin, insulin analogues, or glyburide in patients on diet alone.
Clinicians should also be aware that there is currently insufficient evidence to develop clear guidelines for labor induction or elective cesarean delivery in women with gestational diabetes. The conduct of well-designed clinical trials or observational studies may provide insight into evidenced-based management.
For public health policymakers, our conclusion is that measures of obesity and antepartum glucose values are the most consistent and substantiated risk factors for type 2 diabetes in women with gestational diabetes. With findings from the Diabetes Prevention Trial106 highlighting the effect of lifestyle modifications on the primary prevention of type 2 diabetes in high-risk populations, our review suggests that the effectiveness of these interventions should be tested in women with a prior history of gestational diabetes.
Finally, we conclude that there are insufficient data to recommend alternative tests to the 75-gm OGTT for the detection of type 2 diabetes in women with gestational diabetes. Public health policymakers should work with health care researchers and national organizations (e.g., the ACOG and ADA) to further evaluate the effectiveness and timeliness of postpartum screening for type 2 diabetes in women with gestational diabetes. Further investigation can provide the data needed to develop broadly acceptable postpartum screening guidelines.
| ACOG | American College of Obstetricians and Gynecologists |
| ADA | American Diabetes Association |
| AHRQ | Agency for Healthcare Research and Quality |
| BMI | Body mass index |
| CC | Concurrent control group |
| CD | Cesarean delivery |
| CENTRAL | Central Register of Controlled Trials |
| CI | Confidence interval |
| CINAHL | Cumulative Index to Nursing and Allied Health Literature |
| CPD | Cephalopelvic disproportion |
| CRP | C-reactive protein |
| dL | Deciliter |
| EFW | Estimated fetal weight |
| EPC | Evidence-based Practice Center |
| FBG | Fasting blood glucose |
| FN | False negative |
| FP | False positive |
| gm | Grams |
| GAD | Glutamic acid decarboxylase |
| GCT | Glucose challenge test |
| Gestational diabetes | Gestational diabetes mellitus |
| HAPO | Hyperglycemia and Adverse Pregnancy Outcome |
| HDL | High-density lipoprotein |
| hr | Hour |
| IA-2 | Insulinoma antigen-2 |
| IGT | Impaired glucose tolerance |
| IU | International units |
| kg | Kilograms |
| KQ | Key Question |
| LDL | Low-density lipoprotein |
| LGA | Large for gestational age |
| L/S ratio | Lecithin-to-sphingomyelin |
| MeSH | Medical subject headings |
| mg | milligrams |
| MiG | Metformin in Gestational Diabetes |
| MOOSE | Meta-analysis of Observational Studies in Epidemiology |
| NDDG | National Diabetes Data Group |
| NICU | Neonatal intensive care unit admissions |
| NPH | Neutral Protamine Hagedom |
| OGTT | Oral glucose tolerance test |
| OR | Odds ratio |
| PCOS | Polycystic ovarian syndrome |
| Portable document format | |
| PPG | Postprandial glucose |
| RCT | Randomized controlled trials |
| RDS | Respiratory distress syndrome |
| RH | Relative hazard |
| RR | Relative risk |
| SD | Standard deviation |
| SE | Standard error |
| SGA | Small for gestational age |
| STARD | Standards for Reporting of Diagnostic Accuracy |
| STROBE | Standards for Reporting of Observational Studies |
| TN | True negative |
| TP | True positive |
| Type 2 diabetes | Type 2 diabetes mellitus |
| US | Ultrasound |
| WHO | World Health Organization |
Donald R. Coustan, M.D.
Chace/Joukowsky Professor and Chair
Department of Obstetrics and Gynecology
Warren Alpert Medical School of Brown University
Chair of Obstetrics and Gynecology
Women and Infants Hospital of Rhode Island
Providence, RI
Richard Hellman, M.D.
Clinical Professor of Medicine
University of Missouri - Kansas City School of Medicine
Kansas City, MO
Jean M. Lawrence, Sc.D., M.P.H.
Research Scientist II/Epidemiologist
Research and Evaluation
Kaiser Permanente Southern California
Pasadena, CA
Troy Flint Porter, M.D.
University of Utah School of Medicine and Intermountain Health Care
Salt Lake City, UT
Samuel F. Posner, Ph.D.
Centers for Disease Control and Prevention
Atlanta, GA
E. Albert Reece, M.D., Ph.D., M.B.A.
Dean and Vice President for Medical Affairs
University of Maryland School of Medicine
Baltimore, MD
Caroline Signore, M.D., M.P.H.
Medical Officer, Obstetrics
Program Scientist, PASS Network
Pregnancy and Perinatology Branch
National Institute of Child Health and Human Development
Bethesda, MD
All Journals Hand Searched
August 2006 – January 2007
Acta Obstetricia et Gynecologica Scandinavica
American Journal of Obstetrics and Gynecology
American Journal of Perinatology
The Australian and New Zealand Journal of Obstetrics and Gynaecology
BJOG: An international journal of obstetrics and gynecology
Diabetic Medicine
Diabetes
Diabetes Care
Diabetes Research and Clinical Practice
European Journal of Obstetrics & Gynecology and Reproductive Biology
International Journal of Gynecology & Obstetrics
Obstetrics & Gynecology
| Terms | Returns |
|---|---|
| (Diabetes, gestational[mh] OR gestational diabet*[tiab] OR diabetes in pregnancy[tiab] OR (diabet*[tiab] AND gestation*[tiab])) AND (((Insulin[mh] OR Insulin[tiab]) OR (sulfonylurea compounds[mh] OR hypoglycemics[tiab] OR hypoglycemic agents[tiab] OR Glyburide[tiab] OR Glipizide[tiab] OR glimepiride[tiab]) OR (Biguanides[mh] OR biguanide*[tiab] OR Metformin[tiab]) OR (Pregnancy[mh] OR Pregnan*[tiab] OR Pregnancy complications[mh] OR treatment outcome[mh] OR treatment outcome*[tiab]) OR (labor, induced[mh] OR Induced labor[tiab] OR Induction of labor[tiab] OR Obstetric Labor[mh] OR Cesarean section[mh] OR cesarean*[tiab] OR C-section[tiab] OR Abdominal deliver*[tiab]) OR (Diabetes Mellitus, Type 2[mh] OR (Diabet*[tiab] AND type 2[tiab]) OR (Diabet*[tiab] AND type II[tiab])))) AND eng[la] NOT (animals[mh]NOT humans[mh]) | 5628 |
| (((((((‘pregnancy diabetes mellitus’/exp) OR (‘gestational diabetes’)) OR ((‘pregnancy’/exp) AND (‘non insulin dependent diabetes mellitus’/exp))) OR ((‘type 2 diabetes’ OR ‘type ii diabetes’ OR ‘diabetes mellitus’) AND (pregnant OR pregnancy))) AND ((((‘antidiabetic agent’/exp) OR (hypoglycemic) OR (‘hypoglycemic agent’)) OR (insulin)) OR ((‘risk factor’) OR (‘treatment outcome’/exp) OR (‘treatment outcome’) OR (‘pregnancy outcome’/exp) OR (‘pregnancy outcome’) OR (benefit) OR (‘adverse event’) OR (comorbidity)) OR ((‘labor’/exp) OR (‘labor induction’/exp) OR (‘induced labor’) OR (‘cesarean section’)) OR (‘reproducibility’/exp)))) AND [english]/lim AND [humans]/lim) NOT [review]/lim | 5306 |
| #1 MeSH descriptor Diabetes, Gestational explode all trees | 225 |
| #2 (gestational diabetes) or (gestational diabetes):ti or (gestational diabetes):ab or (gestational diabetes):kw | |
| #3 (#1 OR #2) | |
| #4 MeSH descriptor Diabetes Mellitus, Type 2 explode all trees | |
| #5 (diabetes) or (diabetes):ti or (diabetes):ab or (diabetes):kw | |
| #6 (#4 OR #5) | |
| #7 MeSH descriptor Labor, Induced explode all trees | |
| #8 (labor) or (labor):ti or (labor):ab or (labor):kw | |
| #9 (Induc*) or (Induc*):ti or (Induc*):ab or (Induc*):kw | |
| #10 (#8 AND #9) | |
| #11 (#7 OR #10) | |
| #12 MeSH descriptor Cesarean Section explode all trees | |
| #13 (cesarean*) or (cesarean*):ti or (cesarean*):ab or (cesarean*):kw | |
| #14 (caesarean*) or (caesarean*):ti or (caesarean*):kw or (caesarean*):ab | |
| #15 (#14 AND NOT #13) | |
| #16 (#13 OR #14) | |
| #17 MeSH descriptor Insulin explode all trees | |
| #18 (insulin) or (insulin):ti or (insulin):kw or (insulin):ab | |
| #19 MeSH descriptor Sulfonylurea Compounds explode all trees | |
| #20 (glyburide) or (glyburide):ti or (glyburide):kw or (glyburide):ab | |
| #21 (glipizide) or (glipizide):ti or (glipizide):kw or (glipizide):ab | |
| #22 (glimepiride) or (glimepiride):ti or (glimepiride):kw or (glimepiride):ab | |
| #23 (#17 OR #18) | |
| #24 (#19 OR #20 OR #21 OR #22) | |
| #25 MeSH descriptor Metformin explode all trees | |
| #26 (Metformin) or (Metformin):ti or (Metformin):kw or (Metformin):ab | |
| #27 (#25 OR #26) | |
| #28 (#12 OR #16) | |
| #29 MeSH descriptor Pregnancy Complications explode all trees | |
| #30 MeSH descriptor Pregnancy explode all trees | |
| #31 (pregnan*) or (pregnan*):ti or (pregnan*):kw or (pregnan*):ab | |
| #32 MeSH descriptor Risk explode all trees | |
| #33 (risk*) or (risk*):ti or (risk*):kw or (risk*):ab | |
| #34 (#29 OR #30 OR #31) | |
| #35 (#32 OR #33) | |
| #36 (#23 OR #24 OR #27) | |
| #37 (#3 AND ( #11 OR #28 OR #36 OR #34 OR #35 )) |
| ((MH Diabetes Mellitus, Gestational) OR (MH Diabetes Mellitus, Non-Insulin-Dependent) OR (TX “gestational diabetes”) OR ((TX “type 2 diabetes” OR TX “type II diabetes” OR (TX diabetes and TX ( “type II” OR “type 2” ))) AND TX Pregnancy) OR (TX Pregnancy and TX diabetes) OR (TX “diabetes in pregnancy”)) AND (( MH “pregnancy outcomes” or MH “Pregnancy Complications” or MH comorbidity ) OR (TX ( Maternal OR neonatal OR pregnancy ) and TX ( “adverse event” OR benefit OR risk OR complication OR complications OR outcome OR outcomes ) )) OR ((MH insulin or MH hypoglycemic agents or MH sulfonylurea compounds ) OR (TX ( hypoglycemics OR “hypoglycemic agents” OR sulfonylurea OR metformin ) )) OR (TX ( “diagnostic test” OR “diagnostic tests” ) or MH ( “sensitivity and specificity” ) or MH “reproducibility of results” )) NOT ( review OR “meta-analysis” OR “meta analysis” OR metaanalysis ) and LA English | 2907 |
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]To see the Sample Data Abstraction Forms, please select the link below. This link will take you to a PDF version of the forms.
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]Appendixes cited in this report are provided electronically at: http://www.ahrq.gov/clinic/tp/gdmparttp.htm