NCBI » Bookshelf » Health Services/Technology Assessment Text (HSTAT) » AHRQ Evidence Reports » Management of Preterm Labor: Evidence Report/Technology Assessment Number 18
 
hserta
AHRQ Evidence Reports
public health

Chapter  18:  Management of Preterm Labor: Evidence Report/Technology Assessment Number 18

A26682

THIS EVIDENCE REPORT IS OUTDATED AND IS NO LONGER VIEWED AS GUIDANCE FOR CURRENT MEDICAL PRACTICE. IT IS MAINTAINED FOR ARCHIVAL PURPOSES ONLY.

Prepared for:
Agency for Healthcare Research and Quality

U.S. Department of Health and Human Services
2101 East Jefferson Street
Rockville, MD 20852
http://www.ahrq.gov

Contract No. 290-97-0011

Prepared by:
Research Triangle Institute, Research Triangle Park, NC
Nancy D. Berkman, Ph.D.
Principal Investigator
John M. Thorp, Jr., M.D.
Katherine E. Hartmann, M.D., Ph.D.
Kathleen N. Lohr, Ph.D.
Anjolie E. Idicula, B.A.
Melissa McPheeters, M.P.H.
Norma I. Gavin, Ph.D.
Timothy S. Carey, M.D.
Sue Tolleson-Rinehart, Ph.D.
Anne M. Jackman, M.S.W.
Victor Hasselblad, Ph.D.
Emily C. Puckett, B.A.
Investigators

AHRQ Publication No. 01-E021

December 2000
On December 6, 1999, under Public Law 106-129, the Agency for Health Care Policy and Research (AHCPR) was reauthorized and renamed the Agency for Healthcare Research and Quality (AHRQ). The law authorizes AHRQ to continue its research on the cost, quality, and outcomes of health care, and expands its role to improve patient safety and address medical errors.
This report may be used, in whole or in part, as the basis for development of clinical practice guidelines and other quality enhancement tools, or a basis for reimbursement and coverage policies. AHRQ or U.S. Department of Health and Human Services endorsement of such derivative products may not be stated or implied.

THIS EVIDENCE REPORT IS OUTDATED AND IS NO LONGER VIEWED AS GUIDANCE FOR CURRENT MEDICAL PRACTICE. IT IS MAINTAINED FOR ARCHIVAL PURPOSES ONLY.

Prepared for:
Agency for Healthcare Research and Quality

U.S. Department of Health and Human Services
2101 East Jefferson Street
Rockville, MD 20852
http://www.ahrq.gov

Contract No. 290-97-0011

Prepared by:
Research Triangle Institute, Research Triangle Park, NC
Nancy D. Berkman, Ph.D.
Principal Investigator
John M. Thorp, Jr., M.D.
Katherine E. Hartmann, M.D., Ph.D.
Kathleen N. Lohr, Ph.D.
Anjolie E. Idicula, B.A.
Melissa McPheeters, M.P.H.
Norma I. Gavin, Ph.D.
Timothy S. Carey, M.D.
Sue Tolleson-Rinehart, Ph.D.
Anne M. Jackman, M.S.W.
Victor Hasselblad, Ph.D.
Emily C. Puckett, B.A.
Investigators

AHRQ Publication No. 01-E021

December 2000
On December 6, 1999, under Public Law 106-129, the Agency for Health Care Policy and Research (AHCPR) was reauthorized and renamed the Agency for Healthcare Research and Quality (AHRQ). The law authorizes AHRQ to continue its research on the cost, quality, and outcomes of health care, and expands its role to improve patient safety and address medical errors.
This report may be used, in whole or in part, as the basis for development of clinical practice guidelines and other quality enhancement tools, or a basis for reimbursement and coverage policies. AHRQ or U.S. Department of Health and Human Services endorsement of such derivative products may not be stated or implied.

Preface

The Agency for Healthcare Research and Quality (AHRQ), formerly the Agency for Health Care Policy and Research, through its Evidence-based Practice Centers (EPCs), sponsors the development of evidence reports and technology assessments to assist public- and private-sector organizations in their efforts to improve the quality of health care in the United States. The reports and assessments provide organizations with comprehensive, science-based information on common, costly medical conditions and new health care technologies. The EPCs systematically review the relevant scientific literature on topics assigned to them by AHRQ and conduct additional analyses when appropriate prior to developing their reports and assessments.

To bring the broadest range of experts into the development of evidence reports and health technology assessments, AHRQ encourages the EPCs to form partnerships and enter into collaborations with other medical and research organizations. The EPCs work with these partner organizations to ensure that the evidence reports and technology assessments they produce will become building blocks for health care quality improvement projects throughout the Nation. The reports undergo peer review prior to their release.

AHRQ expects that the EPC evidence reports and technology assessments will inform individual health plans, providers, and purchasers as well as the health care system as a whole by providing important information to help improve health care quality.

We welcome written comments on this evidence report. They may be sent to: Director, Center for Practice and Technology Assessment, Agency for Healthcare Research and Quality, 6010 Executive Blvd., Suite 300, Rockville, MD 20852.

John M. Eisenberg, M.D.Douglas B. Kamerow, M.D.
Director Agency for Healthcare Research and QualityDirector, Center for Practice and Technology Assessment Agency for Healthcare Research and Quality


The authors of this report are responsible for its content. Statements in the report should not be construed as endorsement by the Agency for Healthcare Research and Quality or the U.S. Department of Health and Human Services of a particular drug, device, test, treatment, or other clinical service.

Structured Abstract

Objectives. Preterm labor is often a prelude to early births and the significant attendant burden of infant morbidity and mortality. To address the question of how best to manage women in preterm labor, the Research Triangle Institute-University of North Carolina Evidence-based Practice Center undertook a rigorous review of the scientific literature on: (1) appropriate criteria for diagnosing preterm labor and use of three biologic markers, fetal fibronectin (fFN), endovaginal ultrasound (EVUSD), and salivary estriol; (2) efficacy and effectiveness of tocolytics-pharmaceutical agents including beta-mimetics, calcium channel blockers, magnesium sulfate, nonsteroidal antiinflammatory disease drugs (NSAIDs), and ethanol that arrest preterm labor symptoms); (3) efficacy and effectiveness of antibiotics for treating covert infections that might have prompted preterm labor; and (4) efficacy of home uterine activity monitoring.

Search Strategy. We conducted a detailed search of the relevant literature using MEDLINE; EMBASE; the Cochrane Collaboration and its related York Database, International Pharmaceutical Abstracts; the Health Economic Evaluations Database; Genderwatch; and the Population Index. The Medical Subject Headings included premature labor, diagnosis, epidemiologic study characteristics, terms for specific therapies (biologic markers, antibiotics, and tocolytic agents) and costs, cost analysis, and cost-benefit analysis. We conducted an extensive search of the gray literature, chiefly government documents, with a focus on tocolytics and biologic markers.

Selection Criteria. Our inclusion criteria were randomized controlled trials (RCTs) and cohort or case-control studies on biologic markers, tocolytics, and antibiotics that met the following specifications: published from 1966 to 1999 in English, French, or German; including pregnant females of any age with signs and symptoms of preterm labor; involving inpatient and outpatient settings; and measuring delivery, maternal morbidities, and infant health outcomes. Home uterine activity monitoring was further restricted to RCTs published in English since 1980.

Data Collection and Analysis. We conducted dual and blinded reviews of abstracts of articles and pulled full articles. Those articles meeting inclusion criteria were given a detailed dual review and abstraction onto four (topic-specific) data extraction forms. We judged the quality of the individual articles (on internal validity, statistical considerations, clinical relevance of findings, an external validity) and the strength of the evidence overall in each topic area (considering consistency of findings over all studies, quality rating of each study, magnitude of important outcomes, and meta-analysis results). In addition to the systematic reviews of the literature, we conducted meta-analyses of treatment effects from first-line and maintenance tocolytics, antibiotics, and home uterine activity monitoring.

Main Results. The quality of this literature is questionable in many respects, including the definition of preterm labor, the size of the trials, confounding of results because of use of cointerventions, reliance on bivariate analyses without including stratified analyses such as would be available through survival analysis, and failure to separately analyze women who have conditions that culminate in medically indicated preterm births.

Both fFN and EVUSD present strong evidence of effectiveness from cohort studies as diagnostic tools for assessing the risk of preterm birth in women with preterm labor, mainly in terms of their negative predictive value. Both can identify women who are not likely to experience a preterm birth and thus help clinicians avoid unneeded therapy. The efficacy of these findings from RCTs is not yet available.

First-line treatment with certain tocolytics offers small improvements in pregnancy prolongation. Whether this prolongation is beneficial in improving neonatal outcomes has not been established. Tocolytics other than beta-mimetics offer greater efficacy at less risk; the benefits of beta-mimetics never exceeded other options, and their harms were potentially more severe than for other options. Ethanol is less efficacious than other tocolytic options and is an inappropriate treatment for women with preterm labor symptoms. In terms of gestational age at birth, prolongation of pregnancy, or birth weight, maintenance treatment with tocolytics conferred no benefits. Evidence of harms was high for beta-mimetics in relation to the probability of maternal risk (serious cardiovascular harms, minor cardiovascular harms, metabolic harms, and psychologic harms); harms for other tocolytics were regarded as low for maternal risk. All classes of tocolytics posed low short-term risk for fetal or neonatal outcomes. Long-term risk to the infant was not evaluated.

Therapy with antibiotics for occult in utero infections believed to be associated with preterm labor had mixed results, but meta-analysis indicated marginally significant improvements in two maternal outcomes (length of pregnancy and gestational age). Significant increases in birth weight were not found.

No RCT of home uterine activity monitoring that controlled for the cointervention of nursing support found a significant effect from use of this device. Meta-analysis confirmed this "no- effect" conclusion in relation to gestational age at birth and birth weight.

Conclusions. Based on the literature reviewed, two biologic markers (fFN and EVUSD) are found to be quite useful in identifying women in preterm labor who are at low risk of experiencing a preterm birth. Although the evidence remains mixed, certain tocolytics-beta- mimetics, calcium channel blockers, magnesium sulfate, and NSAIDs-appear effective in prolonging pregnancy when used as first-line agents in arresting preterm labor, but beta- mimetics in comparison with other tocolytics seem to present a higher risk of maternal harms. Ethanol is not found to be an appropriate tocolytic agent. Tocolytics are not useful as maintenance interventions. Antibiotics for suspected genital tract infections may be useful. Home uterine activity monitoring was found to confer no maternal or fetal/neonatal benefits.

The literature concerning the management of preterm labor is flawed in several ways relating largely to definitions of preterm labor, appropriate measures of outcomes, and use of survival analysis. Epidemiologic research as well as additional efficacy and effectiveness studies of the two biologic markers, selected tocolytics, and antibiotics are warranted.

This document is in the public domain and may be used and reprinted without permission except those copyrighted materials noted for which further reproduction is prohibited without the specific permission of copyright holders.

Suggested Citation:
Berkman, ND, Thorp, JM Jr, Hartmann, KE, et al. Management of preterm labor. Evidence Report/Technology Assessment No. 18 (Prepared by Research Triangle Institute under Contract No. 290-97-0011). AHRQ Publication No. 01-E021. Rockville (MD) Agency for Healthcare Research and Quality. December 2000.

Summary

Overview

Women's health is a major concern in the Nation today, particularly in the areas of maternal health and pregnancy. Within this framework, the American College of Obstetricians and Gynecologists nominated issues for consideration by the Agency for Healthcare Research and Quality (AHRQ) relating to the care of women with preterm labor. Of specific concern are effective strategies for identifying and treating women with symptoms of preterm labor so as to prevent preterm births (i.e., births before 37 full weeks of gestation) and subsequent infant morbidity and mortality.

Definitions of preterm labor vary, but the research criteria commonly hold it to be contractions occurring between 20 and 36 weeks' gestation at a rate of four in 20 minutes or eight in 1 hour with at least one of the following: cervical change over time or dilation greater than or equal to 2.0 cm. Preterm labor is the most common cause of antenatal hospitalization, but the number of pregnancies in which preterm labor symptoms occur is inadequately documented. Preterm labor is recognized as a prelude to early births, with their attendant burden of infant morbidity and mortality. Although the incidence and prevalence of preterm labor are not well understood, the incidence and burdens of preterm births are. Conservatively, 11 percent of all live births, or approximately 440,000 annually, occur before term in this country, and preterm births are responsible for three-quarters of neonatal mortality and one-half of long-term neurological impairments in children. Preterm births account for one-third of all health care spending on infants and one-tenth of spending for children.

In short, the medical, psychological, and economic burdens of preterm births-and by extension, preterm labor that ends in preterm births-are substantial. Uncertainty persists, however, about the best strategies for managing women in preterm labor. Thus, compelling reasons can be amassed for a systematic review of the literature in this clinical area.

Reporting the Evidence

To address these issues, the Research Triangle Institute-University of North Carolina Evidence-based Practice Center (RTI-UNC EPC) undertook a rigorous review of the scientific literature on detection and management of preterm labor. This report addresses four main issues: (1) the appropriate criteria for diagnosing preterm labor, specifically with respect to the use of three biologic markers and their positive and negative predictive value; (2) the efficacy and effectiveness of tocolytics (pharmaceutical agents that arrest uterine contractions); (3) the efficacy and effectiveness of antibiotics (with respect to covert infections that might prompt preterm labor); and (4) the efficacy of home uterine activity monitoring in decreasing adverse outcomes in women who are experiencing preterm labor.

The key questions are:

Biologic Markers: What are the appropriate criteria for a diagnosis of preterm labor? Relatedly, how much positive or negative predictive value does the use of biologic markers add to clinical opinion in diagnosing preterm labor?

Tocolytics: What is the efficacy and/or effectiveness of tocolytics in managing preterm labor? The analysis had two subtopics, one concerning first-line tocolytics (treatment for women having acute symptoms) and the other concerning maintenance therapy for women who have experienced an episode of preterm labor.

Antibiotics: What is the efficacy and/or effectiveness of antibiotics in managing preterm labor? The analysis was limited to women suspected of having occult in utero infections.

Home Uterine Activity Monitoring: What is the efficacy of home uterine activity monitoring in decreasing adverse maternal and neonatal outcomes in women who have experienced an episode of preterm labor in the current pregnancy?

Specifically, we investigated three biologic markers: fetal fibronectin (fFN), endovaginal ultrasound (EVUSD), and salivary estriol (E3); because of a lack of studies regarding E3 in laboring women, the remainder of this summary focuses only on fFN and EVUSD. We studied five classes of tocolytics: beta-mimetics, calcium channel blockers, magnesium sulfate, nonsteroidal anti-inflammatory disease drugs (NSAIDs), and ethanol. For these tocolytics, we considered whether they were used as first-line or maintenance regimens; the former applies to acute conditions when a woman's preterm labor symptoms are so significant that a tocolytic agent is used to prevent preterm birth, and the latter applies to use after an episode of acute tocolysis to maintain uterine quiescence. Our review included numerous antibiotics, which are reviewed as one group. Finally, we considered the literature on home uterine activity monitoring for women with preterm labor.

Methodology

Databases, Inclusion/Exclusion Criteria, and Search Terms

We conducted a detailed search of the relevant literatures using the following databases: MEDLINE; EMBASE; the Cochrane Collaboration and its related York Database, International Pharmaceutical Abstracts; the Health Economic Evaluations Database; Genderwatch; and the Population Index. We also conducted an extensive search of the gray literature, chiefly government documents, with a focus on tocolytics and biologic markers.

We examined literature meeting the following criteria:

  • Study populations: humans; pregnant females.

  • Condition: signs and symptoms of preterm labor, not including articles in which all research subjects experienced preterm premature rupture of membranes, medically indicated preterm delivery, or multiple gestation.

  • Interventions: biologic markers as noted, tocolytics, antibiotics, and home uterine activity monitoring.

  • Study settings: essentially all inpatient and outpatient settings including patients' homes.

  • Outcomes: three major categories of outcomes-birth, maternal morbidities, and infant health.

  • Time period: for pharmacotherapies and biologic markers, 1966 through 1999, depending on the date of approval; for home uterine activity monitoring, 1980 through 1999.

  • Languages and geographic sites: English, French, and German (English only for home uterine activity monitoring), and excluding locations based on language of publication other than these.

  • Admissible evidence: efficacy studies in which health care is delivered under ideal settings, identified as randomized controlled trials (RCTs) (double and single blinded); effectiveness studies (non-RCTs, prospective and retrospective cohort studies, and case control studies) in which health care is delivered under ordinary circumstances; only RCTs were used for home uterine activity monitoring; other information included meta-analyses, review articles for reference lists, and cost-effectiveness studies; finally, sample sizes of 40 or more subjects.

The Medical Subject Headings (MeSH) used for the searches were limited by the subject heading "premature labor." With respect to study designs, we included a variety of terms in "epidemiologic study characteristics" (exploded). In addition, we searched for specific therapies, including "biologic markers," "antibiotics," and "tocolytic agents." We searched the intersection of "premature labor" and "diagnosis," and finally, we searched under "costs and cost analysis and cost-benefit analysis."

Data Collection and Abstraction Process

Efficacy and Effectiveness Data

Upon completion of the initial literature searches, we subjected lists of titles and article abstracts to a dual, blinded review; we retained those articles that both a clinician and a methodologist reviewer indicated should be included and discarded those that both reviewers indicated should be dropped. In cases of disagreement, the Scientific Director for the project made the final selection. In addition, the Scientific Director reexamined a 20-percent sample of abstracts identified for inclusion and all abstracts identified for exclusion; generally, we erred on the side of inclusion.

We combined all citations into a ProCite database, removing duplicate records and identifying articles to be included in evidence tables and those to be excluded (with reason for exclusion). In all, we retrieved 506 studies of benefit from treatment; of these, we included 101 articles.

We developed, through several iterations, various data extraction forms, specific to each topic. These forms were designed to provide clear and easily accessible information for entering data into evidence tables. Included across all studies were the following variables:

  • Study designs.

  • Description of patient population, including maternal age and race, and clinical inclusion/exclusion criteria, such as condition of membranes (ruptured or intact), estimated gestational age, how gestational age was defined, and maternal and fetal indications for birth.

  • Definition of preterm labor.

  • Description of the test, treatment, or intervention.

  • Description of adjunct therapies.

  • Outcomes measured such as prolongation of pregnancy, gestational age at delivery, rate of preterm births, maternal morbidities, and infant birth weight.

  • Description of any secondary analyses performed.

The RTI-UNC EPC used two types of abstractors: individuals with content or clinical expertise and those with strong methodologic skills. The clinician abstractors included several obstetricians, one pediatrician with training in women's health, and a nurse midwife; all had prior research experience. The methods abstractors were all doctoral students in the UNC School of Public Health or the UNC Department of Economics with extensive training in quantitative methods. In addition, reviewers included a health services researcher with expertise in quantitative methods (the Project Director) and an obstetrician with expertise in treating women with preterm labor and conducting clinical research in the topic area (the Scientific Director). We instituted substantial quality control and reliability procedures.

Harms Data

We systematically reviewed evidence about side effects of tocolytics, comparing classes of these drugs and effects on the mother and the fetus or neonate, but we did not distinguish between first-line and maintenance therapies. Rather than developing full evidence tables, we created numerous charts depicting the absolute and relative frequency of various harms associated with tocolytic use.

Grading Article Quality and Overall Strength of Evidence

We first graded the quality of individual articles using a specific scale that considered various aspects of the internal and external validity of each study. It encompassed, among other issues, study design, measurement issues, statistical analyses, and the appropriateness of the conclusions being drawn. To assess the internal validity of a study (i.e., the likelihood that the design and conduct of the study minimize systematic error [bias]), we evaluated the following: recruitment strategy; masking of the treating clinician and patient; inclusion of baseline measurements (e.g., contraction frequency, cervical dilation, effacement, and estimated gestational age); and definition of outcome measures, attrition rates, and confounding variables. We evaluated the adequacy of the methods used and whether the investigators assessed the clinical relevancy of the statistical findings. The quality rating instrument assessed whether all three major categories of outcomes were included in a study: delivery, maternal, and infant. With respect to external validity (i.e., whether the findings of the study can be generalized to populations that did not participate in the study), we determined whether the clinical setting was clearly specified and whether conclusions apply to pregnant women in the United States. The quality scores from each abstractor are included in the evidence tables.

We also rated the quality or strength of the collective evidence on each topic according to a categorical, adjudicated rating developed by the Project Director, Scientific Director, and Clinical Methodologist. The quality of the collective evidence takes into account the design quality of the individual studies and the efficacy or effectiveness of reported outcomes. The ratings are as follows:

Good (A):The data or evidence are sufficient for evaluating the quality of the findings. The data are consistent and indicate that the key drug or intervention is superior to alternative treatment or placebo for treating women with preterm labor.
Fair (B):The data are sufficient for evaluating the quality of the findings. The data indicate that inconsistencies exist in the findings between the key drug or intervention and alternative or placebo for treating women with preterm labor.
Poor (C):The data are sufficient for evaluating the quality of the findings. The data do not show that the key drug or intervention is superior to alternative treatment or placebo for treating women with preterm labor.
Incomplete Evidence (I):The data are insufficient for assessing the quality of the key drug or intervention for treating women with preterm labor based on limited sample size or poor methodology.
Efficacy (1):Evidence obtained from well-designed RCTs.
Effectiveness (2):Evidence obtained from well-designed cohort or case-control studies.

We used two designations to grade the overall evidence about harms: "high" if the side effects were life-threatening and their frequency was substantially different from that for alternative treatments or no treatment, and "low" if the side effects were short-term and not life-threatening and their frequency was not substantially different from the frequency for an alternative tocolytic treatment or no treatment.

Development of Evidence Tables

We presented information from individual studies into pairs of evidence tables-one for study design and the other for outcomes. The content of the study design tables covers the following variables (when available from the publications): statement of the research objective, definition of preterm labor used by the authors, patient inclusion/exclusion criteria, patient demographic characteristics, description of the experimental intervention, total number of participants, and number of participants in each arm of the study at its conclusion. The outcomes tables generally include data on delivery, and maternal and infant outcomes. For the biologic marker outcome tables, we present delivery outcomes in terms of sensitivity, specificity, positive predictive value, and negative predictive value.

Supplemental Analyses

In addition to the systematic review of the literature, we conducted meta-analyses focusing on treatment effects of first-line tocolytics by class, maintenance tocolytics by class, antibiotics, and home uterine activity monitoring. The efficacy of treatment was measured in relation to the following outcomes: (1) prolongation of pregnancy, (2) estimated gestational age at delivery, and (3) birth weight. The literature used in the meta-analyses was restricted to RCT studies included in the evidence tables and data concerning women with intact membranes.

Findings

Quality of the Literature

The quality of this literature is questionable in many respects, thereby diminishing the confidence with which one can draw conclusions. Among the issues of concern are the definition of preterm labor, the size of the trials, confounding of results because of use of cointerventions, and failure to analyze separately women who have conditions associated with medically indicated preterm births. In addition, we note the general absence (except in a few cases) of the use of survival analytic techniques in preference to the use of dichotomous outcomes (e.g., preterm birth or not) or other categorical outcomes that do not adequately reflect how long a pregnancy "survived" without a preterm birth. Survival analysis allows for the stratification of outcomes by such factors as the gestational age at the onset of preterm labor.

Biologic Markers

Overall, fFN and EVUSD present strong evidence of effectiveness as a diagnostic tool for assessing the risk of preterm birth in women with symptoms of preterm labor. Both tests only moderately were successful in predicting which women with a positive test would deliver before term, but they consistently exhibited strong negative predictive values, thereby identifying women at low risk of preterm birth.

Tocolytics

First-Line Tocolytic Therapy

Literature comparing an intervention group of women receiving first-line treatment with tocolytics relative to a control group was available only for beta-mimetics and magnesium sulfate. Results across studies were mixed: for beta-mimetics, we saw evidence of efficacy in terms of estimated gestational age at birth, prolongation of pregnancy for 1 day or more and improved infant outcomes (birth weight greater than 2500 g). Significant differences were not found between magnesium sulfate and placebo.

With respect to comparisons among different classes of tocolytics (chiefly in relation to beta-mimetics), results again were mixed. Beta-mimetics had efficacy relative to ethanol treatment in relation to prolonging pregnancy in one (older) study, but other RCTs showed that all other classes of tocolytics had greater or not significantly different effects relative to beta-mimetics on delivery outcomes. For infant outcomes, results were also mixed, but no study reported that beta-mimetics were a superior treatment. Data from observational studies did not contradict the results of the RCTs. Meta-analyses suggest that all tocolytics, with the exception of ethanol, were effective in extending pregnancies at or beyond 36 to 38 weeks gestation, compared with a no-treatment group. Beta-mimetics, calcium channel blockers, and magnesium sulfate nearly doubled the odds of term births, relative to control, with potentially small differences in effect sizes between classes.

Overall, the evidence supports the notion that first-line treatment with beta-mimetics, calcium channel blockers, magnesium sulfate, or NSAIDs offers small improvements in prolonging pregnancy. Data concerning relative efficacy are mixed, but they clearly support the conclusion that ethanol is less efficacious than other tocolytic options and support the generally held clinical belief that ethanol is an inappropriate treatment for women with preterm labor symptoms. Concerns that other treatments offer greater efficacy with less risk seem to be warranted in regard to treatment with beta-mimetics as well: the benefits of beta-mimetics never were found to exceed other options, and the maternal harms (see below) were shown to be potentially more severe.

Maintenance Tocolytics

Except for one small study, the efficacy studies showed no difference between treatment and control arms in managing women who had recently experienced an episode of preterm labor, and meta-analysis confirmed these conclusions. In short, for gestational age at birth, prolongation of pregnancy, or birth weight, maintenance treatment conferred no benefits.

Harms of Tocolytics

We graded beta-mimetics as "high" in probability of maternal risk, including serious cardiovascular harms, minor cardiovascular harms, metabolic harms, and psychologic harms. All other classes of tocolytic treatment were graded as "low" in relation to maternal risk. We graded all classes of tocolytics as "low" risk in relation to fetal or neonatal harms: evidence of short-term harms was inconsistent, and evidence of longer term problems was insufficient.

Antibiotics

Results of our review concerning therapy with antibiotics for treating occult in utero infections associated with preterm labor were mixed. Two RCTs found improvements in all three delivery outcomes of interest-prolongation of pregnancy, mean gestational age at birth, and birth at a particular number of weeks- but all other RCT studies showed mixed results or no significant difference from placebo/control group assignment. Results from survival analyses included in three RCTs were also mixed. Meta-analysis showed the following: a marginally significant increase in length of pregnancy of about 6 days; a marginally significant increase of about 0.60 of a week in gestational age resulting from antibiotic treatment; and a small increase in birth weight that was not statistically significant. The array of agents, routes of administration, and durations of therapy preclude us from making a generalization about the optional antibiotic regimen to achieve these benefits.

Home Uterine Activity Monitoring

Of the four RCTs reviewed, none of the three that controlled for nursing support as an element of the monitoring approach found a significant effect from home uterine activity monitoring; meta-analysis confirmed this "no-effect" conclusion in relation to gestational age at birth and birth weight. A fourth RCT reported that home uterine activity monitoring significantly prolonged delivery to 37 weeks, but the approach in this study included nursing support; the separate effect of monitoring was not analyzed.

Future Research

We encountered numerous difficulties in reviewing this literature, arriving at concrete conclusions, or drawing appropriate inferences about the efficacy or effectiveness of these drugs and other interventions. Methodologic issues included explicit definitions of preterm labor, clarity about cointerventions, and no separate analysis of women who had medically indicated preterm births. We urge investigators to be more precise in their design and description of efficacy and effectiveness studies.

Moreover, researchers differed in their definitions of even standard outcomes, and we have noted already the infrequent use of survival analysis (or related techniques). Our recommendation is that investigators, to the extent possible, supplement and integrate the use of dichotomous outcomes measures (e.g., preterm or term birth) and variables involving specific cutoff points for gestational age and include concepts such as the length of time a pregnancy was extended (i.e., prolongation as a form of survival of the pregnancy). We specifically advocate survival analysis stratified by gestational age at enrollment as the analytic technique of choice in view of its clinical and biologic relevance.

The dearth of reliable epidemiologic information about preterm labor is significant. The areas that require special attention include biologic mechanisms that result in birth before term, incidence and prevalence of preterm labor and the proportion that result in preterm birth, modifiable risk factors for preterm birth, and a better understanding of these basic facts among ethnic minorities.

Further study on risk factors is important because clinicians and researchers must be able to assess accurately the risk of preterm birth in women with symptoms of preterm labor and to exclude women at low risk of early birth from intervention trials. Biologic markers (such as fFN and EVUSD), by virtue of being objective and reproducible, are superior to purely clinical assessment or prognostication of preterm delivery. Nonetheless, we advocate more direct experimental investigation of the predictive characteristics of these two biologic markers. Meanwhile, we recommend that investigators studying tocolytics and antibiotics (in their capacity as part of a medical regimen for women with preterm labor) use biologic marker negativity as an exclusion criterion.

Our results from both the systematic review and meta-analysis for the first-line tocolytics and for antibiotics suggested that further trials of these modalities are justified, although we note that investigators must now attend to the deficits of the efficacy and effectiveness studies to date. Future research should focus on the benefits (and harms) of beta-mimetics, calcium channel blockers, magnesium sulfate, and NSAIDs, in comparison with no treatment and in comparisons with one another.

Finally, we advise against further research on maintenance tocolytics or home uterine activity monitoring. The research to date has made adequately clear that the use of tocolytics in a maintenance capacity following an episode of acute tocolysis has no proven efficacy or effectiveness and that the use of home uterine activity monitoring confers no maternal, fetal, or neonatal benefits.

Chapter 1. Introduction

Primary Objectives and Scope

Table 1. Existing Information on Preterm Labor
AreaFacts Identified (Summary)
Incidence and PrevalenceAbout 11% of pregnant women in the U.S. will experience a spontaneous preterm delivery. 45 Of those, an estimated 40% will occur following preterm labor. About 7% of all live births will be low birth weight babies (about 6% among white and Hispanic babies, 13% among Black babies). Approximately 4 million babies are born in the U.S. each year. 149
Characteristics and Size of PopulationPreterm labor afflicts pregnant women of every socioeconomic class, and ethnic group, and age. Etiologic contributions include cigarette smoking, preeclampsia, incompetent cervix, prior preterm birth, abruptio placentae, occult in utero infections, and possibly stress, anxiety, and depression. 9
Practice SettingPreterm labor can be addressed in clinicians' offices, private and public inpatient and outpatient settings, free-standing birth centers, and patients' homes.
Morbidity and Developmental MilestonesFor mothers, the morbidity associated with preterm labor can be related to both physical functioning and emotional stress, as well as potential harms from the side effects of diagnostic testing or use of certain drugs. (Mistaking preterm labor for certain other conditions in which drugs used to treat preterm labor can be contraindicated can create its own set of potential adverse events.) The morbidity for babies born preterm can manifest in many ways. Short-term outcomes for newborns can include acute and chronic lung disease, central nervous system hemorrhage, and blindness. Long-term outcomes can involve cerebral palsy, mental retardation, and learning disability, and other forms of developmental delay. 1193
MortalityThe risk of neonatal mortality inversely is associated with gestational age at birth. Approximately, three-quarters of neonatal deaths can be attributed to preterm birth. 94
Quality of LifeMore than one-half of neurologic impairments found in children can be attributed to preterm delivery. The quality of life of women in preterm labor can be affected by stress associated with their condition, hospitalization, and need for bed rest. 1195
Loss of ProductivityEmployed pregnant women experiencing preterm labor will have reduced economic productivity to the extent that days of bed rest and/or hospitalization are required. Although difficult to measure, individuals with handicaps caused by having been preterm babies experience increased morbidity that may adversely affect their future level of productivity and earning power. 93
Costs to Diagnosis and TreatThe costs to treat women in preterm labor include the costs of drugs (such as tocolytics and possibly antibiotics) and hospitalizations. To those may be added the costs of biologic marker tests to make a more precise diagnosis, the costs of diagnosing possible overt or occult infections, and possibly even amniocentesis. In addition, preterm delivery accounts for approximately 35% of all health care spending on infants and 10% of all health care spending for children. 16
Other Costs or BurdensIn 1988, for children younger than 15 years of age, costs for health care, education, and child care associated with having been of low birth weight (a result of preterm birth) were estimated to be $5.5 to 6 billion. 16
Variations in PracticeNo consensus exists regarding the diagnosis or the treatment of preterm labor. Approaches range from essentially "watchful waiting" or traditional "conservative interventions" (e.g., bed rest) to complex diagnostic and therapeutic regimens involving biologic marker tests (diagnosis) and pharmacologic agents (antibiotics, tocolytic drugs and perhaps corticosteroids) and to system-wide (e.g., state or county) programs to detect and manage the condition. 94
Effective management strategies for identifying and treating preterm labor are required to prevent preterm birth, that is, birth before 37 full weeks of gestation. Preterm labor itself would be of little interest were it not the prelude to early births and their heavy burden of infant mortality and morbidity. Preterm birth is a factor in three-quarters of neonatal mortality and one-half of long-term neurologic impairment in children. 1 Table 1 summarizes the importance of preterm birth-and, by extension, management of preterm labor-as a national health concern.

In theory, early detection and management of preterm labor helps to prevent preterm birth and its potential neonatal sequelae, which include respiratory distress syndrome, sepsis, intraventricular hemorrhage, necrotizing enterocolitis, patent ductus arteriosus, and hyperbilirubinemia; 2 however, widespread treatment of women with signs and symptoms of preterm labor has failed to reduce the prevalence of preterm birth in the United States. Rates of preterm births has not changed meaningfully in 20 years, underscoring the need to evaluate the effectiveness of current interventions.

The reasons for the lack of progress remain undefined, although they are related in part to the rise in multiple gestations resulting from the success of assisted reproductive technologies. 3 Multifetal pregnancies are at higher risk for preterm birth. From 1981 to 1993, preterm births rose from 9.4 to 11 percent, 4 stabilizing at approximately 11 percent through 1997. 5 Keys to this failure to see a decline in the prevalence of preterm births may lie in how we identify, evaluate, and treat women with preterm labor.

Lack of uniform criteria for evaluating and diagnosing preterm labor presents special challenges for clinicians and perinatal researchers attempting to draw conclusions from the medical literature. This evidence report focuses in part on documenting the range, complexity, and lack of uniformity of definitions, and how this pattern influences interpretation of the scientific literature on key clinical questions. We revisit the challenge of definition at appropriate transitions throughout this report.

Care providers are anxious for tools that provide objective categorization of risk and assist diagnosis. Candidate biologic markers are numerous and offer the possibility of refining clinical estimates of the probability that a particular woman's uterine activity will continue, cause cervical change, and result in preterm birth. Nonetheless, such testing is costly and controversial. Little existing literature summarizes the utility of biologic markers in the population of women with preterm labor or compares the literature on the clinical diagnostic properties across tests.

Once a diagnosis is made, in contemporary practice, tocolytics (agents intended to stop contractions and maintain uterine quiescence) are the cornerstone of pharmacologic treatment of preterm labor and are often continued as a maintenance treatment to prevent further preterm labor episodes. Arguably, the effectiveness of these agents, as a class, to prolong gestation to term is limited. This report summarizes the literature on the effectiveness of tocolytics for preventing preterm birth. When possible, we explore the benefit expected from use of tocolytic agents with an emphasis not only on traditional dichotomous interpretations of treatment outcome (i.e., full-term versus preterm births) but also measures that take into account increments of time gained. Two factors drive this interest in additional measures: (1) the continuous improvement in infant outcomes found to be correlated with incremental increases in birth weight and gestational age, 6 and (2) the proven benefit of giving mothers at risk of a preterm birth corticosteroids to promote fetal lung maturation. 2 Therefore, understanding what are the effective, and preferable, means by which to delay preterm birth is also an objective of this work. Equally important is documenting potential harmful maternal and infant effects of treatment with tocolytics and estimating the incidence of serious adverse events.

As occult in utero infection and inflammation increasingly are thought to contribute to the risk of preterm birth, research on the use of antibiotics has proliferated. The literature is replete with studies of antibiotics of many types and doses, by several routes of administration, for differing durations, with and without the use of tocolytics or other cointerventions. This evidence report is an opportunity to consolidate data that may provide insight into the appropriate integration of antibiotics into management.

Preterm labor generally requires initial hospital-based management. The possibility of using technology, such as home uterine activity monitoring, to facilitate safe discharge from inpatient care is attractive for patients, providers, and payers. Home monitoring makes continued frequent surveillance of uterine activity feasible; its status as a management tool, however, remains unclear.

The broad scope, lack of clear clinical management imperatives, and importance of these concerns suggests that the audience for this report will be large and receptive to evidence provided to address the key clinical questions described in detail in the next subsection.

Key Clinical Questions

Our key clinical questions focus on three important components of managing preterm labor: diagnosis, pharmacotherapy for arresting the condition, and monitoring symptoms. (See Chapter 2 for a more detailed discussion of our key questions.) First, the key questions involve the use of particular diagnostic tests, so-called biologic markers, to enhance clinical accuracy in predicting which women presenting with signs and symptoms of preterm labor are most likely (or least likely) to result in a preterm birth. These biologic markers include fetal fibronectin (fFN), endovaginal ultrasound (EVUSD), and salivary estriol (E3).

Second, we investigate pharmacotherapies for arresting the condition, including tocolytics to stop uterine contractions and antibiotics for treating possible underlying infections. Tocolytic agents will be examined by class and by whether treatment focuses on arresting current symptoms or, following an episode of preterm labor, are used to maintain uterine quiescence.

The majority of the tocolytic literature concentrates on the class of drugs known as beta-mimetics; it also includes calcium channel blockers, magnesium sulfate, and nonsteroidal anti-inflammatory drugs (NSAIDs). In addition, we review the outcomes of treatment with ethanol, a therapy no longer in accepted practice in the United States. In addition to benefits obtained from tocolytic pharmacotherapy, we also explore the harms. In contrast, antibiotics will be discussed as one group, and potential harms will not be examined.

Third, our clinical questions concern the use of home uterine activity monitoring for detecting early symptoms of preterm labor, which is intended to allow the clinician more time to administer therapies intended to stop the progression to preterm delivery or administer pharmacotherapies to aid fetal development.

Technical Expert Advisory Group

Technical experts in the field of the management of preterm labor were identified to provide assistance throughout the project. The Technical Expert Advisory Group (TEAG) (Appendix A) was expected to contribute to advancing the Agency for Healthcare Research and Quality's (AHRQ's) broader goals of (1) creating and maintaining science partnerships as well as public-private partnerships and (2) meeting the needs of an array of potential customers and users of its products. Thus, it was both an additional resource and a sounding board during the project. The TEAG included eight members: six technical/clinical experts; one representative of organizations whose mission concerns the interests and perspectives of patients and consumers; and one potential user of the final evidence report, specifically, an American College of Obstetricians and Gynecologists (ACOG) representative, the organization originally nominating this topic.

To ensure robust, scientifically relevant work, the TEAG was called on to provide reactions to work in progress and advice on substantive issues or possibly overlooked areas of research. TEAG members participated in conference calls and discussions, periodically through e-mail, to:

  • Refine the key clinical questions at the beginning of the project.

  • Discuss the preliminary assessment of the literature and to provide input for the data extraction form during the project.

  • Select topics and guide approaches for meta-analyses toward the end of the project.

In addition, early in the process, senior staff and TEAG members discussed the challenges of designing a literature search concerning a clinical situation that is as difficult to conceptualize as preterm labor. A particular concern potentially resulting from defining the condition based on its symptoms is that the narrowness or breadth of the criteria for enrolling women with preterm labor in research protocols would influence the outcomes of the treatment under study. For instance, in trials with broad definitions of preterm labor, an unknown fraction of the women might have been inappropriate candidates for the therapies they received because their symptoms would have resolved on their own. We decided to proceed with the literature search and data extraction as outlined in the report on the grounds that, since the authors had defined study participants as experiencing preterm labor, we had no means either to exclude studies in an unbiased way or to "adjust" results post hoc for these problems. This concern is, however, what motivates the authors of this report to advocate for future research to include using biologic markers as a means of increasing clinical confidence that an appropriate diagnosis was made for women included in a study.

Because of their extensive knowledge of the literature on preterm labor and their active involvement in professional societies, TEAG members were also asked to participate in the external peer review of the draft report.

Background and Significance

Overview and Burden of Disease

Although preterm labor is the most common cause of antenatal hospitalization, the number of pregnancies in which preterm labor occurs is documented inadequately. Only two studies that specifically address the natural history of preterm labor were identified. In a study designed to determine how warning signs of preterm labor relate to eventual diagnosis of preterm labor and preterm birth, Copper, Goldenberg, Davis, et al. 7 followed 329 indigent, high-risk patients. In this small cohort, 27 percent of the participants met the research criteria for diagnosis of preterm labor: "contractions occurring between 20 and 36 weeks' gestation at a rate of four in 20 minutes or eight in 1 hour with at least one of the following: ruptured membranes, cervical change over time, dilatation > 2.0 cm, or cervical length < 1.0 cm." Twelve percent of the cohort had spontaneous preterm births. 7 Also, West, Yawn, Thorp, et al. 8 found that the cumulative incidence of physician diagnosis of preterm labor during the late 1980s and early 1990s was 3.6 to 6.4 percent in a population-based sample of live births in Olmstead County, Minnesota.

Such prospective enumeration of populations of pregnant women is required to accurately measure incidence, the rate of new diagnoses of preterm labor in pregnant women, and prevalence, the proportion of pregnant women receiving treatment for preterm labor, at a given time. Without a denominator from an identified population at risk for developing preterm labor, the majority of existing studies provide "numerator" data that poorly capture the impact of preterm labor on the health care system or on the lives of pregnant women. Each year at least 200,000 infants are born before term after spontaneous preterm labor, 5 9 but the number of other pregnancies in the United States affected by the occurrence of preterm labor that do not result in preterm birth has not been projected accurately.

The incidence and burden of preterm birth is comparatively well understood. Conservatively estimated, 11 percent of all live births, or approximately 440,000 births, occur before term each year in the United States. 5 9 Preterm birth is responsible for 70 to 85 percent of fetal, neonatal, and infant deaths 10 and is second only to fetal deformity in causing perinatal morbidity. Up to one-half of the total burden of neurologic disability in children, including cerebral palsy, is attributable to prematurity. 1 11 This risk of neonatal mortality and morbidity is related directly to gestational age and weight at birth: the more premature the infant, the greater the risk. Infants born before 32 weeks, 2 percent of births, experience the majority of neonatal deaths and congenital neurologic disorders. 12 Even late in the preterm period (35 or 36 weeks gestation), when the overall risk of mortality is modest, 13 an increased risk of infant mortality, 14 respiratory distress syndrome, 12 13 and impaired intellectual development 15 can still be discerned.

The economic costs of caring for preterm infants are substantial. Total care costs are, on average, tenfold higher for preterm infants than for normal-weight infants born at term. 16 Preterm birth accounts for 35 percent of all health care spending on infants and 10 percent of all spending for children. 16 The ongoing medical, educational, and special needs of children born at low birth weights, the majority after preterm birth, are estimated to exceed $5 billion each year. 16 These costs reflect only the care of infants born before term. Additional personal and health care costs accrue from the evaluation and care of women who experienced preterm labor.

To narrow the focus to spontaneous preterm labor potentially preceding spontaneous preterm birth, we first consider the major routes to preterm birth. Preterm births can be classified as spontaneous or medically indicated. Spontaneous preterm births are preceded by spontaneous preterm labor or preterm premature rupture of membranes, or both. 17 18 Medically indicated preterm births follow a decision to allow or cause delivery for maternal or fetal indications. Preeclampsia and presumed fetal growth restriction cause the majority of medically indicated preterm deliveries. 19 Spontaneous preterm labor precedes approximately 43 percent of preterm births; preterm premature rupture of membranes, approximately 32 percent; and medical indications for delivery, the remaining 25 percent. 17

The subtypes of spontaneous preterm birth, preterm labor, and preterm rupture of membranes share many risk factors. 20 21 In fact, 15 to 20 percent of women successfully treated for preterm labor subsequently experience preterm premature rupture of membranes. 21 22 Although this interrelationship is clear, the management of these conditions is divergent. This report focuses exclusively on the management of preterm labor.

The Epidemiology of Preterm Birth

The epidemiologic literature focuses on preterm birth, rather than preterm labor, as an outcome. Because the majority of women presenting for care with preterm labor signs or symptoms do not have the more serious outcome of giving birth prematurely, experts believe that examining the association of factors with spontaneous preterm birth itself (i.e., a "hard" outcome) reduces misclassification bias and yields more precise estimates of the importance of specific maternal characteristics and exposures. Research on determinants of preterm birth have three dominant foci:

  • Description of risk factors for individuals and populations to guide prenatal surveillance and intervention programs.

  • Identification of behaviors and conditions that are amenable to change or treatment.

  • Description of physiologic processes that precede preterm birth and may elucidate etiologic pathways or serve as biologic markers of impending birth.

The last category includes uterine activity, cervical change, and biologic markers such as fFN and E3, which will be reviewed later.

Risk Factors for Preterm Birth

The first category of risk factors consists predominantly of factors for which the mechanism of risk modification is unknown or difficult to separate definitively from socioeconomic, behavioral, or nutritional factors. Many of these "exposures" are relatively fixed (i.e., not amenable to modification after a woman becomes pregnant). The most robust and consistent factor is uniquely informative: women who have had a prior spontaneous preterm birth have a threefold increase in risk of preterm birth in a subsequent pregnancy compared with women who have had only term deliveries. 23 This risk increases with the number of previous preterm births, and the outcome of the most recent pregnancy has the greatest influence. 24 Depending on the population, women with a history of preterm birth deliver before term in 15 to 80 percent of subsequent pregnancies. 23 Presumably this reflects the constancy of unobserved, underlying factors.

In the United States, black race is a risk factor. Adjusting for known confounders, including income and educational status, being black confers a relative risk of 2.0 to 3.5 of having a preterm baby when compared with being white. 13 25 This divergence of risk is most pronounced at the lowest gestational ages (i.e., below 27 weeks). 26 The risk of other minorities in the United States is characterized too poorly to generalize.

Additional reported factors associated with increased risk include both very young and advanced maternal age, single marital status, low socioeconomic status, and some of its isolated correlates such as low income, poor access to and quality of care, and limited educational attainment. Poor nutritional status and low pre-pregnancy body weight also are related to increased risk. 27 28 These last two examples underscore the overlap between descriptive markers of risk and the possibility that deprivation plays a role in some cases, whereas knowledge and intentional behavior factors, such as nutritional habits, play a role in others.

Modifiable Risk Factors

Cigarette smoking is the most "costly" preventable risk from a population perspective. Smoking contributes independently to the risk of prematurity and to decreased birth weight. Adjusting for other factors, smokers are 40 to 70 percent more likely than nonsmokers to deliver before 37 weeks gestation. 29 30 31 Other potentially remediable risks include cocaine use, poor diet, and fasting. 32 33 Alcohol and marijuana use appear to be unrelated to spontaneous preterm birth risk. Data on the effects of stress, caffeine consumption, and physical exertion conflict.

Like behaviors, infections are risk factors that also can be modified before the window of risk for preterm labor. Alterations in vaginal flora are associated with increased risk of preterm birth. One of numerous possible infections is bacterial vaginosis (BV), a disruption of normal vaginal flora characterized by a superabundance of Gardnerella vaginalis , mixed anaerobes, and/or genital mycoplasmas. Its presence during pregnancy is associated with a 1.5- to 4.0-fold increased risk of preterm birth 34 35 36 37 38 39 and approximately twice the likelihood of preterm labor. 37 38 40 41 Black women are more likely than white women to have BV, 36 39 42 43 a finding that might help to explain the higher rate of preterm births in black women. Research regarding both pregnant and nonpregnant women has suggested that BV in the lower genital tract is associated with increased risk of infection in the upper genital tract; the latter may be one mechanism by which infection incites inflammation, uterine contractions, and subsequent preterm labor. Urinary tract infection also has been linked variably to increased risk. 20

Aggressive screening and treatment for genital and urinary tract infections have varied effectiveness for reducing preterm births; some studies demonstrate decreased rates and others fail to find benefit. 20 The physiologic mechanisms that may link infection, inflammation, and preterm labor are addressed in a subsequent discussion concerning the rationale for antibiotic use in the management of preterm labor.

Evaluation of Risk

Although the etiologic pathways to preterm birth are uncharted, substantial heterogeneity exists in the factors that contribute to the likelihood that any individual woman will experience preterm labor or deliver before term as a result of preterm labor. Conflicting results of studies assessing risk factors for preterm birth should not be viewed as irreconcilable. The distribution of fixed and modifiable risk factors in selected populations of pregnant women is sure to vary. As a result, synergy among risk factors-for instance, race and smoking and diet, or age and multiple gestation and socioeconomic status-also will vary. It is precisely the many permutations of risk factors and the complexity of their interactions that have thwarted efforts to use this knowledge in care settings to identify women at risk of preterm labor and spontaneous preterm birth.

With rare exceptions, risk scoring systems are empirically derived from what we know about the fixed and modifiable risks described above. Despite statistical refinements, they are neither sufficiently sensitive nor specific to guide care. The most successful tool, developed by Creasy, Golbud, Laros, et al. 44 had a sensitivity of 64 percent and specificity of 90 percent, with a positive predictive value of 30 percent when evaluated in a general obstetric population for prediction of preterm birth.

More than one-half of spontaneous preterm births occur in women with no apparent risk factors. 45 The limited utility of predictive tools, therefore, is less an indictment of the literature than it is a reflection of the lack of a gold standard, other than preterm birth, against which to measure the usefulness of prediction tools.

Clinical Management of Preterm Labor

Problems in the Diagnosis of Preterm Labor

Labor at term occurs when uterine contractions lead to progressive cervical dilation and effacement. By extension, the hallmarks of preterm labor are uterine activity and cervical change; however, uniformly accepted standards for defining preterm labor do not exist. Therefore, diagnosis of preterm labor invokes an essential circularity: this patient has the kind of uterine activity that may lead to preterm birth if not treated. Thus, intervention precludes definitive diagnosis. As a surrogate, operational definitions of preterm labor seek to quantify how many contractions over what period of time are occurring (e.g., more than four contractions in 20 minutes or five to eight contractions in 1 hour) they also often require documentation of change in dilation or effacement of the cervix.

In practice, clinicians weigh many highly individual factors not captured in formal definitions: Are the contractions painful? Does the mother recognize the contractions? Does she report other symptoms of labor such as back pain, pelvic pressure, vaginal bleeding, or vaginal discharge? What is her individual risk profile? Has she had prior preterm births? Are other processes at work that may be provoking uterine activity, such as a urinary tract infection, substance use, or placental compromise, including abruption (premature separation of the placenta)?

Once other explanations are excluded, two factors drive intervention for preterm labor: (1) estimates of the probability of progressive labor and (2) the degree of immaturity of the fetus. Higher probability and younger gestational age may lead to intervention based on less strict criteria than would otherwise be used. For example, allowing a mother at 28 weeks' gestation to continue to contract with the goal of documenting cervical change simply to meet diagnostic criteria is problematic at best. Because (in theory) achieving uterine quiescence is easier earlier in the cascade of labor events, early intervention is believed to be desirable.

Symptoms, including regular contractions before term, are common; hence, their presence is not highly predictive for discerning who will have progressive cervical change or go on to have a preterm delivery. In a study of 763 women who made unscheduled visits for symptoms suggestive of preterm labor, only 18 percent delivered before 37 weeks; moreover, only 3 percent delivered within the next 2 weeks. 46 Early detection of symptoms 47 48 49 50 51 and actual measurement of uterine contractions with home monitors 49 50 51 52 have failed to lower preterm delivery rates.

These findings suggest that women with contractions and symptoms that will lead to preterm birth are indistinguishable from the larger group who has incidental uterine activity, making it difficult to target surveillance and treatment efficiently. As the studies presented in the evidence tables show, the great majority of women diagnosed with preterm labor will not deliver before term.

Physiology of Preterm Labor

The cervical changes caused by labor (either term or preterm) reflect the culmination of a complex set of molecular changes in cervical connective tissue clinically known as "ripening" or "softening." 53 54 Although the sequence of events leading to cervical change, especially the triggers, is less clear for preterm labor than for term labor, the basic physiology is considered to be similar. The uterine wall consists primarily of smooth muscle or myometrium. Contractions result from shortening of the muscle fibers provoked by a stimulus, such as oxytocin or prostaglandins. At the cellular level, regulated release of intracellular calcium is required to set in motion the cascade that allows myosin and actin filaments within the muscle cell to generate force by shortening. To convert from the irregular, disorganized contractions common throughout pregnancy to coordinated contractions, the cells in the uterus create new channels called gap junctions to facilitate the flow of calcium and other intercellular signals. This process results in contractions in which the upper segment of the uterus provides downward expulsive force, pressing the presenting part of the infant against the cervix.

In addition to the mechanical force of uterine contractions, structural changes take place in the cervical tissue itself. White blood cells infiltrate the cervix and release various enzymes (proteases, collagenases, and elastases) that weaken the collagen and protein matrix of the cervix. This weakening allows the stretching and distortion of the cervix required for dilation and effacement.

Many types of stimuli may contribute to the initiation and perpetuation of contractions and to recruitment of inflammatory factors required for cervical change, which potentially provides multiple potential points for therapeutic intervention. In the discussion of management, we review how biologic markers can be used to detect important physiologic changes associated with preterm labor, the mechanism of action for each category of pharmacologic agents, potential harms associated with the use of tocolytics, and the rationale for use of home uterine activity monitoring.

Treatment Mechanisms

Clinicians frequently use a combination of treatments in caring for women with preterm labor. As noted earlier, the chief focus of this report centers on tocolytics and antibiotics. We do not review combinations of treatment that might be employed because the number of different combinations of pharmaceutical and nonpharmaceutical interventions is very large. In addition, frequent cointerventions that are not reviewed specifically in this report are bed rest and hydration.

Biologic Markers to Assess Risk in Women with Preterm Labor Symptoms

Over the past decade, two biologic markers-fFN in cervicovaginal secretions and EVUSD of the cervix have been investigated extensively. When used as tests to assess women with symptoms of preterm labor, these biologic markers have been shown to increase the precision by which clinicians can assess the likelihood of preterm birth. Our third marker, E3, has not been evaluated in a study restricted to women in preterm labor.

The value of these biologic markers lies in their ability to add predictive value (either positive or negative) to the clinician's diagnostic abilities. In other words, their purpose is to predict more accurately which patients presenting with signs and symptoms of preterm labor are likely to deliver prematurely. If biologic markers have strong positive predictive value, a positive test would indicate which women are more likely to deliver. The converse is equally valuable: if biologic markers have strong negative predictive value, a negative test predicts which women are likely not to progress to delivery and, thereby, enable women and their clinicians to avoid unnecessary treatment.

In clinical practice, when any of the three biologic markers is used in a group of women with preterm labor symptoms to predict the likelihood of spontaneous preterm birth, fFN, 46 51 EVUSD, 55 56 and E3 40 demonstrate moderate positive predictive value (20 to 40 percent) and good negative predictive value (85 to 95 percent). In logistic regression models, the presence or absence of these biologic markers explains significantly more of the variability in preterm birth in women with preterm labor symptoms than do traditional clinical assessment tools, such as cervical dilation by digital assessment or uterine contraction frequency.

Fetal Fibronectin

The fFN is a glycoprotein with high molecular weight that is expressed only during fetal life and oncogenesis. This molecule serves an adhesive function, helping to anchor the placenta to the uterine lining. With either labor or intrauterine infection, 57 fFN is released into the lower genital tract (cervix and vagina). Putative release mechanisms include placental shearing, cytokine-induced overproduction by the decidua, and occult leakage of amniotic fluid. The presence of fFN in the cervix or vagina precedes the onset of both term and preterm births. 46 The fetal isoform of fibronectin can be detected in lower genital secretions using a specific monoclonal antibody and enzyme-linked assay techniques.

Endovaginal Ultrasound

Before the development of ultrasound transducers that could be inserted vaginally, the cervix in women with preterm labor symptoms was assessed by either digital examination or transabdominal ultrasound. The former technique is less precise and subject to intraobserver variation; however, it is sufficient to clinically categorize degree of risk. 58 The latter technique is limited by the abdominal wall and the symphisis pubis. EVUSD provides reliable and reproducible views of the cervix that are easily obtained. The rationale for use rests on the ability to see and measure changes in cervical dimensions that may be associated with cervical ripening and/or the prior influence of contractile activity.

Prospective studies in women with preterm labor symptoms have demonstrated that EVUSD assessment of cervical length (distance between internal and external cervical openings) and dilation of the internal os (labeled dynamic change, beaking, funneling) is superior to digital assessment in assigning risk of preterm birth in women with preterm labor symptoms. 59

Salivary Estriol

The placenta in humans produces corticotropin-releasing hormone (CRH), and CRH levels gradually increase with gestational age. Unlike the pituitary-adrenal axis, in which cortisol has a negative feedback effect on CRH production, cortisol exerts positive feedback increasing placental CRH production. Placenta production of CRH increases dramatically just before both term and preterm labor.

CRH produced in the placenta enters both the maternal and fetal circulatory systems. 60 In the fetus, CRH induces the production of steroid precursors in the adrenal gland that are further synthesized by the fetal liver and placenta, culminating in the production of E3. The E3 freely enters the maternal circulatory system after it is synthesized by the fetus. Thus, maternal E3 is a byproduct of placental CRH production. A surge in maternal E3 blood levels occurs before labor.

Researchers have assessed the materno-placental-fetal CRH axis indirectly by measuring E3 in maternal saliva to predict preterm delivery risk in women at increased risk of preterm labor. 61 Relevant to this report, studies assessing its utility in symptomatic women were not identified.

Mechanisms of Tocolytic Drugs

Tocolytic drugs decrease the uterine muscle's ability to contract; the goal is to diminish the force and frequency of contractions and preferably bring them to a halt. Many agents have been used, including ethanol, magnesium sulfate, calcium channel blockers, oxytocin antagonists, NSAIDs, and beta-mimetic agonists. Tocolytics can be administered in two ways: (1) those used at initial presentation grouped under the rubric "first-line therapy", and (2) those given to maintain uterine quiescence after first-line tocolysis are labeled "maintenance therapy." The same classes of drugs are used in both therapeutic approaches, although parenteral administration predominates in first-line treatment and oral administration is the delivery route of choice for maintenance therapy.

Beta-Mimetic Agents

Two types of beta receptors are described in humans. 62 Beta-one receptors predominate in the heart and adipose tissue, whereas beta-two receptors are localized in the smooth muscle of the uterus, blood vessels, and bronchioles. Beta-mimetics, also called beta-adrenergic agonists, bind to beta receptors in these cells, activating the enzyme adenylate cyclase and causing an increase in intracellular cyclic adenosine monophosphate (cAMP). 63 Increases in cAMP initiates reduction in intracellular calcium and reduces the ability of the myosin-actin interactions to generate contractile force. 64 Thus, interactions between beta-mimetics and the beta-two receptor can inhibit uterine muscle activity.

Unfortunately, molecules with absolute selectivity for beta-two receptors have not been discovered, and even agonists with some degree of beta-two selectivity interact with beta-one receptors. This interaction explains the characteristic pattern of cardiovascular side effects (hypotension, tachycardia, arrhythmia, heart failure) and metabolic side effects (hyperglycemia, hypokalemia) seen in clinical attempts to block beta-two receptors and achieve tocolysis. 65 Beta-mimetic drugs include ritodrine, terbutaline, isoxuprine, salbutamol, fenoterol, hexoprenaline, and nylidrin.

Calcium Channel Blockers

Calcium channel blockers prevent the entry of calcium into smooth muscle cells, which occurs by two mechanisms: (1) blockage of calcium channels 66 and (2) suppression of calcium release from the intracellular stores. 67 The end result is smooth muscle relaxation, including the muscular activity of the uterus. This smooth muscle relaxation is nonspecific and can lead to peripheral vasodilation with hypotension. Calcium channel blockers used for tocolysis include nifedipine, nicardipine, and verapamil.

Magnesium Sulfate

Magnesium is a bivalent cation, as is calcium. Magnesium exerts its muscle-relaxing properties by competing with calcium rather than by interacting with receptors. The primary site for this competition is the intracellular stores within the sarcoplasmic reticulum. 68 At sufficiently high levels, magnesium displaces some of the calcium molecules that would otherwise have been held in store to drive actin and myosin interaction. Because magnesium cannot play the same role as calcium in facilitating contraction, a relative local shortage of calcium within intracellular stores lengthens the time required between contractions to repolarize the contractile units thereby decreasing the force of contractions. Serum magnesium concentrations of 4 to 8 mEq/L, which are roughly 2 to 4 times normal physiologic values, inhibit uterine activity in humans. 69

Nonsteroidal Anti-Inflammatory Drugs

NSAIDs inhibit production of prostaglandins, which are potent uterine muscle stimulants at and before term. 70 Because prostaglandins enhance uterine contractility, inhibition of prostaglandin formation depresses uterine activity. 71 72 73 Examples of NSAIDs used for tocolysis include aspirin, indomethacin, naproxen, ketorlac, and ibuprofen.

Oxytocin Antagonists

Oxytocin is a pituitary hormone thought to be critical in human parturition. 74 Its physiologic role is to stimulate uterine contractility. Uterine muscle has specific cell membrane receptors for the molecule. 75 Antagonists have been synthesized by altering the eight-amino-acid sequence of oxytocin.

Atosiban is a selective oxytocin antagonist molecule that binds to the uterine receptors in place of oxytocin. Because it does not initiate the cascade of events that lead to contractions and it blocks oxytocin from binding by occupying its place on the receptors, atosiban is capable of inhibiting uterine contractions in women with preterm labor. 76 77 78

Alcohol

Alcohol was the first tocolytic agent used for prevention of preterm birth in women with signs and symptoms of preterm labor. 79 It inhibits uterine contractility by diminishing secretion of two neurohypopheseal hormones: antidiuretic hormone and oxytocin. 80 Ethanol reduces the pulsatile excretion of oxytocin from the pituitary seen in normal labor, thereby causing a tocolytic effect. 81 Although alcohol is no longer used in the management of preterm labor, we mention it here primarily for historical comparison.

Harms of Tocolytics

The examination of the efficacy and effectiveness of treatments for any condition must be balanced against the harms caused by that treatment; in this case, the harms associated with tocolytic treatment. Such ill effects of treatments may be described by a number of terms, such as side effects, harms, untoward events, and adverse events. For ease of description, we will refer to them as "harms."

Every treatment has harms associated with its use. The harm may be associated directly with the therapeutic effect of the agent, as in myelosuppression related to the effect of chemotherapeutic agents. Harms may also relate to the effects of medications on organ systems distant from the target organ intended for the therapeutic effect, such as the gastric toxicity of NSAIDs. Finally, idiosyncratic reactions may occur, such as serious allergic reactions.

The level of harmful effect considered tolerable in the literature is variable. The tolerance level for harms may be somewhat greater if the condition under treatment is very severe or has a poor prognosis, if the harms are very transient and of little consequence other than patient inconvenience, or if the effect size of the efficacy of the treatment under consideration is very great.

The identification of harms in the medical literature is hindered by several issues, as follows:

  • The description of the efficacy of the treatment is often much more detailed and complete than the description of harms.

  • Attention is often not paid to the detailed issues of assessment of harms. Variables often are described poorly, making comparison of rates of harms among studies difficult. For example, articles may describe a side effect of beta-mimetic agents as "arrhythmia." No detail is given as to the definition of arrhythmia, which technically may include disturbances such as sinus tachycardia, supraventricular tachycardias, and more serious disturbances such as atrial fibrillation and the potentially lethal ventricular tachycardia. Combining these various cardiac disturbances under "arrhythmia" may lose a great deal of information.

  • Serious adverse events may be rare, but important. Most randomized controlled trials (RCTs) in the treatment of preterm labor have fewer than 100 subjects in the treatment group. Low frequency but serious adverse events with rates of 1 percent or less may not be detected at all in a trial of this size, or they may be ascribed to chance alone.

  • Followup of children born to women who have received therapy with tocolytic agents generally lasts only until the end of the child's hospitalization. No information is available on issues such as late cognitive development. Such issues could be resolved only by conducting very large RCTs with a long followup period, because children whose mothers have undergone premature labor may have been born prematurely with low birth weight, placing them at risk for adverse developmental outcomes. Developmental diseases may be subtle or appear long after delivery. In utero exposure to diethylstilbestrol is a classic example of such.

  • Controversy also exists concerning the potential benefits and harms that might affect the central nervous system of a neonate exposed in utero to magnesium sulfate. Observational studies suggest two important, yet contradictory, findings: (1) Magnesium has been found to have a neuroprotective effect in the development of cerebral palsy, 82 a potentially serious morbidity associated with preterm birth; (2) conversely, researchers have found that magnesium may increase the risk of perinatal death. 83 Given the disparity of findings and the dual roles of magnesium as a tocolytic and antiseizure medicine in preeclampsia, these differences can be resolved only through large RCTs.

Of the management tools for preterm labor, tocolytics carry the greatest potential for minor and serious side effects. The minor side effects are common; beta-mimetics are potent cardiovascular stimulants associated with disconcerting, but generally benign, signs and symptoms, including tachycardia, palpitations, or chest tightness. In rare instances, beta-mimetic cardiovascular stimulation can cause serious complications such as maternal myocardial ischemia or fetal cardiac effects. They also have metabolic effects including increasing glucose availability and decreasing potassium levels in the blood.

Magnesium when given per strict protocol to women with normal urine output has a wide margin of safety. Its side effect profile is medically minor, but it may cause significant maternal distress side effects, including lethargy, drowsiness, double vision, nausea, and vomiting. Beta-mimetics, magnesium, and calcium channel blockers are all associated with increased risk of pulmonary edema, a serious and potentially lethal maternal complication. NSAIDs have the fewest maternal risks; fetal effects include diminished production of urine with oligohydramnios and premature closure of the ductus arteriosis. These effects usually are transient, but cases of sustained diminished renal capacity and pulmonary hypertension have been reported. Calcium channel blockers as a single agent are thought to have a good safety profile for both mother and fetus. Concomitant use of calcium channel blockers with magnesium sulfate, which some describe as a "primitive" calcium channel blocker, is potentially harmful and has resulted in cardiovascular collapse.

The oxytocin antagonist atosiban, at least within the context of clinical trials, has minimal side effects. Clinicians could not distinguish atosiban from placebo. This cannot be said of other tocolytics because their characteristic side effect profiles are unmaskable for the knowledgeable clinician.

In addressing harms, the report seeks to describe and, when possible, estimate the types and frequency of major and minor harms associated with each class of tocolytics. Such information coupled with estimates of the efficacy of the agents will improve the ability to manage preterm labor in a way that maximizes benefit and minimizes risk.

Mechanisms of Antibiotics for Treatment of Preterm Labor

Women who present with preterm labor symptoms often have occult infections of their upper genital tract that can be detected by amniocentesis and subsequent culture of their amniotic fluid. 84 85 Women in the group who have occult infections are more likely to experience preterm birth than women without evidence of infection. 84 This association of infections of the upper genital tract and preterm birth in women with preterm labor symptoms provides the rationale for using antibiotics in the management of women with preterm labor. Antibiotic administration may cure occult infections, thereby lowering the risk of spontaneous preterm birth. 84

Organisms cultured from the amniotic fluid or placental-decidual interface of women who give birth before term include enteric gram negatives, gram-positive cocci, anaerobic cocci, and mycoplasma species-a mix of genital flora. Given the variety of potential pathogenic organisms, clinicians have empirically chosen broad spectrum antibiotics in an attempt to eradicate these organisms from the upper genital tract in women with preterm labor. Antibiotics used include second- and third-generation cephalosporins, beta-lactam antibiotics with beta-lactamase inhibitors, synthetic penicillins, metronidazole, and combinations of agents such as "triple coverage" (beta-lactam antibiotics, aminoglycoside, and clindamycin).

It is important to distinguish between antibiotics intended to treat occult in utero infection and those intended to prevent early-onset group B streptococcal sepsis in newborns. Preterm infants are particularly susceptible to acquiring group B streptococcal infections at birth from their mothers whose vaginas may be colonized with this organism and may be developing sepsis. The goal of early-onset sepsis prevention can be readily accomplished with single-agent beta-lactam antibiotic treatment during labor.

Home Uterine Activity Monitoring

The goal of home uterine activity monitoring is to detect early labor symptoms in women deemed to be at high risk of preterm delivery, with the assumption that early detection provides an opportunity for successful treatment. Thus, home monitoring proponents do not claim that monitoring in and of itself can prevent preterm birth, but rather that the technology helps clinicians target preventive therapies earlier for women with preterm labor symptoms who are at risk of preterm birth and who may not recognize their early symptoms. This report focuses exclusively on the use of this technology in the management of women who have been diagnosed with preterm labor and released home for monitoring, generally after some treatment.

Home uterine activity monitoring uses a tocodynamometer to record uterine contractions in conjunction with daily telephone calls from a health care provider, usually a nurse, to offer patient support and advice. 86 87 The biologic hypothesis underlying the use of the device is that women will have an identifiable increase in uterine contractions before preterm delivery (i.e., a "crescendo" effect in contractions) and that many women are unable to recognize this prodromal uterine activity. 88 89 The premise on which the usefulness of home uterine activity monitoring rests is that by detecting early contractions, health care providers can treat preterm labor and prevent or decrease the likelihood of preterm birth. 87

Patients attach the external tocodynamometer over the uterus twice each day for 30 to 60 minutes while they are recumbent. 90 Digitized tracings from the tocodynamometer are transmitted over telephone lines for immediate interpretation by a health care provider. The Food and Drug Administration (FDA) has approved a portable tocodynamometer for use in the homes of women with high-risk pregnancies, whose fetuses are of more than 24 weeks' gestation, and who have a history of preterm birth. 91 The device has been shown to record uterine contractions with precision in both hospitalized and ambulatory gravidas. 92

Organization of this Report

The remainder of this evidence report is organized in the following sections. Chapter 2 provides details about the literature search and review methodology. Specifically included are the causal pathways for our key clinical questions; our approaches to conducting the systematic review, abstracting data from articles, maintaining quality control, and applying a quality rating system for individual articles and similar details. In addition, we comment briefly on the external peer review process. Chapter 3 provides our results on the four main topics covered by this report: biologic markers, tocolytics, antibiotics, and home uterine activity monitoring. Results of our supplemental meta-analysis concerning tocolytics, antibiotics, and home uterine activity monitoring are found in Chapter 4. Chapter 5 provides conclusions, and Chapter 6 offers recommendations for a research agenda on management of preterm labor.

The references cited in the body of the report follow Chapter 6. The complete list of literature considered and used in developing the evidence report appears in the bibliography. Appendix A includes acknowledgments, information on the TEAG, and the peer reviewers for this report. The data extraction forms appear in Appendix B, and a list of acronyms and abbreviations is in Appendix C. The Evidence Tables are contained in their entirety in Volume 2 of this report.

Chapter 2. Methodology

This chapter documents the procedures that the Research Triangle Institute-University of North Carolina at Chapel Hill Evidence-based Practice Center (RTI-UNC EPC) used to develop a comprehensive evidence report for the AHRQ that describes and contrasts the approaches currently used in the management of preterm labor. To set the framework for the review, we present first the key questions and their underlying causal pathways followed by a detailed description of the literature search, which includes documenting the inclusion and exclusion criteria for literature acquisition, selecting relevant MeSH terms, listing databases searched, retrieving the gray literature, and specifying the eligibility criteria for study inclusion. Once we determined which studies met the inclusion/exclusion criteria and were eligible for inclusion, we abstracted data onto data extraction forms and then transferred critical information to evidence tables; these forms are also described in this chapter.

The methodology chapter also discusses quality issues (i.e., the RTI-UNC EPC's quality control procedures with regard to determining the eligibility for inclusion, carrying out data abstraction, and developing the quality rating for individual studies). An evidence report requires an extensive search for all types of literature. Because some published works are of higher methodologic quality than others, the RTI-UNC EPC developed quality rating forms specific to each body of literature studied in relation to managing preterm labor. This chapter describes the development of the rating system and its use in the analysis. Last, the RTI-UNC EPC's procedures for conducting a supplemental meta-analysis and the peer review process are discussed.

Key Questions and Causal Pathways

We address four key questions in this report. All questions were put into final form with input from the TEAG (Appendix A). After consultation with the TEAG, we determined that two additional questions concerning the enhancement of fetal lung development with corticosteroids and the cost-effectiveness of using antibiotic and tocolytic pharmacotherapies were inappropriate for further study through this report. The decision concerning corticosteroids was based on our conclusion that we would have little to add to the knowledge in the field. In relation to cost-effectiveness studies, we did not find a body of literature to review.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3153_f001.jpg.

   Figure 1. Causal pathway: Biologic markers

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3153_f002.jpg.

   Figure 2. Causual pathway: Tocolytics

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3153_f003.jpg.

   Figure 3. Causual pathway: Antibiotics

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3153_f004.jpg.

   Figure 4. Causual pathway: Home uterine activity monitoring

First, we discuss the key clinical questions and then we present the causal pathways (Figures 1-4) for each of the questions.

Final Key Questions

The first question concerns the effectiveness of using three biologic markers in assessing the risk of preterm delivery: fFN found in cervicovaginal secretions, EVUSD of the cervix, and E3. The second and third questions relate to pharmacotherapies for arresting the condition: tocolytics to stop uterine contractions, and antibiotics for treating possible underlying infections. The tocolytic review separates studies based on whether the aim was treatment during the episode of preterm labor and uterine activity (first-line use) or promotion of continued quiescence of the uterus (maintenance use). By contrast, although various antibiotics are used in treating preterm labor, they are discussed as one group. The fourth question concerns using home uterine activity monitoring for detecting early labor uterine contractions.

Our specific key questions are as follows:

  • 1

    What are the appropriate criteria for the diagnosis of preterm labor? Relatedly, how much positive or negative predictive value does the use of biologic markers add to clinical opinion in diagnosing preterm labor?

  • 2

    What is the efficacy and/or effectiveness of tocolytics in managing preterm labor?

  • 3

    What is the efficacy and/or effectiveness of antibiotics in managing preterm labor?

  • 4

    What is the efficacy of home uterine activity monitoring in decreasing adverse maternal and neonatal outcomes in women who have experienced an episode of preterm labor in the current pregnancy?

In the context of managing preterm labor, the efficacy or effectiveness of treatment refers to three major categories of outcomes: delivery, maternal morbidities, and infant health. From a clinical perspective, the outcomes of greatest concern are infant morbidities and mortality associated with preterm birth; however, because these are rare events and difficult to quantify except in large samples, and because prolonging the pregnancy is believed to be correlated with improved infant outcomes, studies also include birth outcomes as intermediate or proxy measures of efficacy.

Our search included both efficacy results from RCTs and effectiveness outcomes from observational research to ensure the greatest breadth of understanding of the research in this area. Efficacy results are obtained from studies where health care is delivered under relatively ideal conditions, such as controlling for confounding interventions. In contrast, effectiveness results are gathered under more normal conditions. The potential harms associated with treatment are presented in relation to tocolytic therapy. Our review of the literature concerning diagnostic testing includes only cohort studies because those were the only ones available. Our review of the use of home uterine activity monitoring includes only English language efficacy studies, because our search was limited based on time and resource considerations.

Corticosteroids

In 1994, researchers from the United States and other countries participated in a Consensus Development Conference sponsored by the National Institute of Child Health and Human Development (NICHHD) to discuss the perinatal outcomes associated with the use of corticosteroids. The panel assessed relevant studies; most of the outcomes, both maternal and neonatal, reflected data collected in RCTs. The resulting consensus statement and clinical recommendations provide the definitive scientific approach to antenatal corticosteroid therapy. 2

Similar to the AHRQ evidence reports, the NICHHD panel reviewed the literature and evaluated findings using a grading system. The ratings developed for the NICHHD project were based on those constructed by the Canadian Task Force on the Periodic Health Examination (now, Canadian Task Force on Preventive Health Services), as adapted by the U.S. Preventive Services Task Force. The ratings largely capture the nature of the study designs and the strength of the recommendations that can be based on the literature.

The expert panel concluded that antenatal corticosteroid therapy is recommended for most women at risk of preterm birth. The advantages of this therapy include substantial reductions in neonatal morbidity and mortality, with specific effects on respiratory distress syndrome and intraventricular hemorrhage in preterm infants. The benefits identified appear to be greatest 24 hours post-therapy, and the treatment window extends from 24 to 34 weeks' gestation. 2 In addition, several followup studies that tracked children up to 12 years of age failed to find adverse effects in children exposed prenatally to corticosteroids. 2 The expert panel also concluded that corticosteroid therapy provides a cost-effective solution that improves perinatal outcomes and reduces the need for more expensive neonatal interventions for premature infants.

If the biologic markers described in this evidence report can accurately assess the risk of preterm birth in women with preterm labor symptoms, then one would assume that more targeted use of steroids might follow. Targeting antenatal corticosteroid therapy would avoid under and over treatment and would achieve one of the consensus conference's goals.

After reviewing the NICHHD Consensus Development Conference findings to assess their methodologic soundness, we searched for subsequent peer-reviewed literature concerning the effect of treating pregnant women with preterm labor with corticosteroids to enhance fetal lung maturity. In our review of the literature published since the NICHHD findings were released, we did not uncover any more recent studies evaluating treatment efficacy, effectiveness, or cost-effectiveness; all subsequent publications were review articles and meta-analyses. Based on these results, we elected not to revisit this topic in further detail.

Cost-effectiveness of Treatment

One of the issues originally proposed for this review concerned the question of the cost-effectiveness of using biologic markers or antibiotic or tocolytic pharmacotherapies in treating women with preterm labor. Although, in principle, we agree that this is a potentially worthy topic, as shown in the literature review section of this chapter, we also determined that the body of literature specifically covering these questions was insufficient for an evidence-based study. Only one study relating to this topic was uncovered. Thus, with the concordance of our TEAG, we did not investigate this issue in this report.

Causal Pathways

As indicated in the illustrations of the causal pathways (Figures 1, 2, 3, and 4), the diagnosis and treatment of a woman in preterm labor requires several steps and possibly the combination of several management modalities. Owing to the complexity of the problem, the potential breadth of its etiology, and our focus on the management of preterm labor, we begin our causal pathways with presentation of the patient with signs and symptoms of preterm labor. For example, these might include contractions, dilation, and/or effacement.

At this point, the clinician faces two challenges. The first challenge is to decide how to diagnose the preterm labor itself, which may include assessing the likelihood of preterm birth using biologic markers. The second is to determine whether the preterm labor stems from a treatable cause (e.g., an infection) that might be addressed through managing the condition or whether it cannot be attributed to a likely treatable cause and so should be limited to managing the symptoms (e.g., through the use of tocolytics).

We have devised causal pathways for each of the diagnostic, treatment, and monitoring modalities we examined. The causal pathway for biologic markers (Figure 1) indicates that their purpose is to predict which patients presenting with signs and symptoms of preterm labor are likely to continue to preterm birth. In other words, their value lies in their ability to add predictive value (either positive or negative) to the clinician's diagnostic abilities. If the biologic markers have strong positive predictive value, they would indicate which women are likely to give birth preterm. The converse is equally valuable-if the biologic markers have strong negative predictive value, they may be able to predict which women are likely not to give birth preterm and, therefore, enable women and their clinicians to avoid unnecessary treatment.

Figures 2 and 3 present two pharmacotherapy treatment scenarios. Figure 2 presents the hypothesized causal pathway for using tocolytics in managing uterine contractions associated with an episode of preterm labor. Figure 3 depicts the pathway for using antibiotics for treating infections relating to the condition. In both of these causal pathways, adjunctive bed rest, hydration, and/or routine inpatient monitoring may be occurring. In addition, when use of antibiotics is the issue, then concurrent use of tocolytics may be occurring and vice versa (concurrent use of antibiotics when use of tocolytics is the issue). Biologic markers might be used for diagnostic purposes with any of these modalities or combination of modalities.

These causal pathways can become quite complicated as the possible options and outcomes multiply. In our presentation, the treatment pathways are somewhat simplified because they do not reflect the possibility that clinicians might use more than one type of tocolytic when the first agent that is tried fails or the possibility that different practitioners might invoke the pharmacologic agents at different times in the course of an initial or subsequent preterm labor episode. Similarly, the "diagnosis" part of the causal pathway does not reflect the fact that different practitioners use different clinical criteria for diagnosing preterm labor, including different tests or using the tests in a different order if biologic testing is selected.

The causal pathway for home uterine activity monitoring (Figure 4) presents a relationship showing early detection allowing for intervention and, hence, leading to a greater opportunity for successful treatment. The system is held out not as a method of preventing adverse outcomes, but rather as a way of indicating when, if at all, at-risk women are symptomatic and, therefore, in need of treatment.

Literature Search

This portion of Chapter 2 describes our literature search strategy and results. Included are inclusion/exclusion criteria with regard to population, medical condition, interventions, and setting, quality grading forms containing the criteria for evaluating the quality of the evidence from the studies reviewed, databases searched and search terms used, and additional sources contacted for gray literature and works in progress. Documented here are the steps used to refine the literature search to the final number of articles ultimately reviewed and included in each evidence table.

Inclusion and Exclusion Criteria for the Literature Search

Table 2. Management of preterm labor: inclusion/exclusion criteria
CategoryCriteria
Study populationHumans Pregnant females
ConditionSigns and symptoms of preterm labor Exclude if all research subjects experience preterm premature rupture of membranes, medically indicated preterm birth, or multiple gestation
InterventionsBiologic markers (restricted to E3, EVUSD, and fFN), antibiotics, tocolytics, and HUAM
Study settingsInpatient and outpatient settings, including patient homes
OutcomesPreterm birth must be measured
Time period1966 and later HUAM: 1980 and later Depends on date of approval for pharmaceutical products or biologics
Geographic site of studyExclude locations based on language of publication
Publication languagesEnglish, French, and German HUAM: English only
Admissible evidenceStudy design: RCT-double- and single-blinded; non-RCT-prospective and retrospective cohort studies, case control Study design HUAM: RCT only Other designs: meta-analyses, review articles for reference list searches, cost-effectiveness studies Sample size: >40 subjects
The criteria listed in Table 2 provide the basis for the systematic search strategy. They map to the key questions with regard to the population being studied, the treatment setting for patients with preterm labor, and the publication language. As seen in the table, for the management of preterm labor, we have identified the population of interest as humans and pregnant women of any age, race/ethnicity, demographic, or health status characteristic. Women must be exhibiting signs and symptoms of preterm labor. Studies are included based on therapies or interventions examined, including testing with biologic markers, antibiotic or tocolytic treatment, or monitoring with home uterine activity monitoring. All inpatient and outpatient settings are considered, including patients' homes. Other filters involve the time period of interest and study publication languages. In cooperation with the TEAG and the need to limit our analysis, studies focusing solely on women with preterm premature rupture of membranes, medically indicated preterm birth, or multiple gestations were excluded. As noted in the table, the inclusion/exclusion criteria used in the home uterine activity monitoring review is somewhat more restrictive than those used for the other analyses.

Search Terms

The literature review specialist in cooperation with the study director and the scientific director developed the MeSH search terms used in the analysis. Searches in the National Library of Medicine (NLM) databases (e.g., MEDLINE) were limited by the subject heading: explode premature labor. This phrase is defined in the NLM databases as the onset of labor before term but after the fetus has become viable, usually sometime during estimated gestational ages of 29 through 38 weeks. Both preterm labor and premature delivery default to this subject heading. Preterm premature rupture of membrane, which is excluded from this analysis, is not included under this heading.

To ensure inclusion of relevant study designs, we searched within the MeSH tree under epidemiologic study characteristics, which are defined as types and formulations of studies used in epidemiologic and clinical research. Relevant MeSH key words included under this general topic and searched concurrently through the "exploded" term are clinical trials (Phase I-IV, controlled clinical trials, multicenter studies, and randomized controlled trials), analytic studies, epidemiologic studies, retrospective studies, prospective studies, longitudinal studies, followup studies, feasibility studies, pilot studies, sampling studies, intervention studies, cross-over studies, cohort studies, case control studies, and cross-sectional studies.

In addition, we searched the intersection of the terms premature labor and diagnosis to capture review articles. Bibliographies in review articles were examined as a double check to our electronic literature search to ensure that all relevant articles were included in the study. We searched for economic and cost analyses using the search terms costs and cost analysis and cost-benefit analysis.

We also searched for specific therapies. First, we searched on the relevant diagnostic tests within the term biologic markers, where the scope is defined as measurable and quantifiable biologic parameters that serve as indexes for health- and physiology-related assessments. Specific biologic markers included in the study are salivary estriol (estriol, saliva), endovaginal ultrasound, and fetal fibronectin (fibronectins). Second, we searched exploded antibiotics, which include ampicillin, amoxicillin, metronidazole, erythromycin, ceftizoxime, ampicillin-salbactam. Finally, we exploded tocolytic agents, which include ritodrine, terbutaline, isoxsuprine, magnesium sulfate, nifedipine, nicardipine, indomethacin, fenoterol, ethanol, nylidrin, and salbutamol.

Our primary literature search was conducted through MEDLINE. The MEDLINE database is widely recognized as the premier source for bibliographic coverage of biomedical literature. MEDLINE encompasses information from Index Medicus, Index to Dental Literature, andInternational Nursing, as well as other sources of coverage in the area of communication disorders, population biology, and reproductive biology.

In addition, we searched EMBASE (the electronic version of Excerpta Medica) because it provides a more complete coverage of the literature published in Europe, Latin America, and Asia. Apart from being a database that AHRQ has specified as being of interest, the EMBASE search is important because, historically, drugs have been approved for clinical application earlier in Europe than in the United States. Clinical findings regarding a drug's efficacy may have been published in Australia or Canada before reaching the U.S. market. We used a similar search strategy and terms for EMBASE as were employed for MEDLINE.

Additional databases searched include: (1) the Cochrane Collaboration database to examine relevant meta-analyses and systematic reviews closely related to our topic; (2) the related York database of the National Health Science Center for Reviews and Dissemination; (3)International Pharmaceutical Abstracts, a database devoted exclusively to pharmaceuticals and produced by the American Health-System Pharmacist Association; (4) the Health Economic Evaluations Database for literature concerning cost and economic analyses; (5)Genderwatch, a search engine that covers international journals, magazines, newsletters, regional publications, special reports, and conference proceedings devoted to women's and gender issues; and (6) the Population Index, a reference tool that focuses on the world's population literature. Of all these databases, Genderwatch and the Population Index uncovered no matches with our topics.

Gray Literature Search

In its first task order (Pharmacotherapies of Alcohol Dependence), the RTI-UNC EPC outlined a search strategy for uncovering the so-called "gray literature" (i.e., potentially useful literature and documents that are to be found in outlets other than peer-reviewed journals). For this evidence report on the management of preterm labor, the RTI-UNC EPC has pursued a similar strategy.

Ideally, gray literature searches include reviews of government documents and monographs; industry reports and studies; unpublished studies and works in progress; proceedings and technical bulletins; and FDA Medical Officer's Reviews of original or extended (change of label) New Drug Approval applications, including advisory committee minutes for drugs, and similar reviews of Pre-Market Approval applications for devices. In reality, constraints on time and resources necessitate strategic reductions in the scope of the gray literature search.

We made two important decisions in restricting our gray literature work. Our earliest strategic decision, in consultation with the task order officer, was not to conduct gray literature searches for home uterine activity monitoring devices because of time and resource limitations. We did search for and review Pre-Market Approval documents archived at the FDA Web site (http://www.fda.gov) for home uterine activity monitoring devices, the E3 test, and the fFN test. We identified and reviewed research information offered by the manufacturer of the E3 test SalEst ® (http://www.biex.com), and fFN test (http://www.adeza.com). Both manufacturers provided information such as clinical monographs; however, with the exception of one brief four-person case study that appeared to have been written expressly for the manufacturer, all research to which they referred is available already as journal articles or abstracts.

Our second significant strategic decision was sharply to restrict our search for gray literature on antibiotics, for several reasons that reflect an initial investment of study resources. First, thousands of antibiotics are marketed in the United States alone. For example, just one of the earliest antibiotics, erythromycin, is marketed for human use by almost 20 different pharmaceutical firms; most of them market more than one version (oral and intravenous and/or intramuscular). Multiplying all possible antibiotics by all firms marketing them would create a search the size of which no EPC could undertake.

Second, and more important, in no case is an antibiotic specifically labeled for use in the treatment of preterm labor. Only two antibiotics are approved currently by the FDA for the treatment of bacterial vaginosis in pregnancy, metronidazole and clindamycin (Anti-Infective Drugs Advisory Committee Meeting, 64th Meeting, Issue: Guidance documents on developing antimicrobial drugs: General considerations and individual indications. Wednesday, July 29, 1998.) The RTI-UNC EPC's experience with the gray literature search for alcohol suggests that pharmaceutical firms do not engage in research into off-label uses unless they are preparing an application for a new or changed labeling approval, and they will not otherwise comment on off-label uses for their drugs. Thus, although antibiotics are prescribed routinely in the management of preterm labor, all such prescriptions are "off label."

Additionally, most antibiotics in use, except for the latest generation of cephalosporins, could not have been investigated in pregnant human subjects because the FDA explicitly prohibited the use of "women of childbearing potential" from clinical trials beginning in 1977, and an informal practice of exclusion had been the rule before then. Only in 1993, did the FDA issue a new guideline emphasizing the need for drug research using women as subjects, and the question of conducting drug research in pregnant women is still exceedingly controversial. Most antibiotics, like most drugs, are labeled Pregnancy Category C (harms to fetal development are shown in animal studies but no adequate, well-controlled human studies exist); an antibiotic example includes clarithromycin, which also must carry a Boxed Warning strongly advising against use during pregnancy. Other common antibiotics, such as the aminoglycosides, including streptomycin, fall under Pregnancy Category D (there is evidence of harm to the fetus from human studies, and patients should be advised of the potential hazards). (See, for example, Public Hearing, Content and Format of Labeling for Human Prescription Drugs, Part 15 - Pregnancy Labeling Categories, Friday, September 12, 1997, Bethesda, MD, which we have reviewed as a part of the gray literature search.)

In other words, no pharmaceutical company could have tested its new antibiotic in a sample of pregnant women, even had it wished to do so. Thus, it is neither reasonable nor efficient to devote scarce resources to such a search. The one exception may be in the case of the latest generation of cephalosporins, which are classed in Pregnancy Category B (no evidence of harm from animal studies, but no adequate, well-controlled human studies have been conducted, and the drug should be used only if clearly indicated). Anecdotally, manufacturers have sometimes expressed off-the-record interest in research into cephalosporin use during pregnancy, but we have found no evidence that any manufacturer has petitioned the FDA recently for a change in labeling for use in pregnant women.

Additional strategic decisions concerned tocolytics. The gray literature search on tocolytics appears to reveal that discrepancies arise not from differences between published and unpublished literature but from differing emphases on and interpretations of outcomes and, perhaps, differing ideological approaches to the treatment of women in preterm labor.

This task order includes an evaluation of the tocolytic drug atosiban, an oxytocin-receptor antagonist, approval for which was denied by the FDA's Advisory Committee for Reproductive Health Drugs on April 20, 1998. Despite the failure of the R.W. Johnson Pharmaceutical Research Institute (Johnson & Johnson) to gain approval for atosiban, we have nonetheless located an unpublished manuscript describing the multicenter RCT of atosiban (forthcoming in the American Journal of Obstetrics and Gynecology). We have also obtained and reviewed the minutes of the 1998 Reproductive Health Drugs Advisory Committee meeting. From these minutes, it seems clear that the committee's failure to approve atosiban resulted not from its doubts about the drug's efficacy to halt labor as the endpoint, but rather from its concern as to whether the endpoint of interest ought to be improved perinatal outcome. The Advisory Committee was not satisfied that the manufacturer had collected data addressing such concerns, but it was not in agreement about the kind of perinatal outcomes trial it wanted to see. The manufacturer, Johnson & Johnson, maintains that it proceeded correctly in the initial application. Thus, for the present in the United States, there will be no further efforts to seek approval for atosiban, although it is expected to be approved very soon in 15 European Nations, and the scientific basis for its approval will be published shortly on the European Agency for the Evaluation of Medicinal Products Web site (personal communication, between Sue Tolleson-Rinehart and Johnson & Johnson official, September 23, 1999).

Ritodrine and terbutaline are both beta-mimetic agents. Terbutaline is approved only for treatment of asthma, but it has become the most frequently used tocolytic in the United States. Ritodrine, under the brand name Yutopar ® , was the only drug to receive FDA approval as a tocolytic. The drug, approved in 1980, was available in both oral and intravenous forms until a 1992 meeting of the Fertility and Maternal Drug Advisory Committee, resulting in an FDA request for more testing of high doses of oral ritodrine for tocolytic maintenance. In response, the manufacturer removed oral ritodrine from the market. Controversy still revolves around the clinical information presented to the Drug Advisory Committee at that time from the Canadian Ritodrine Trial; the results of the trial and criticisms of it are in the published literature. 96

Ritodrine in intravenous form remained available as a tocolytic treatment until its two manufacturers, Abbott and AstraZeneca, stopped manufacture. Abbott discontinued the intravenous form before December 1997; AstraZeneca stopped manufacturing it on September 1, 1998, but the last expiration date of previously shipped product was August 1999, meaning that ritodrine could have remained in use until that time. Terbutaline has been employed in an off-label use for tocolysis for the same period and in fact, has surpassed ritodrine in use. In a Fertility and Maternal Drug Advisory Committee meeting in 1993, committee members recommended that the FDA solicit a specific application for change of label to include tocolysis for intravenous terbutaline; also discussed was the possibility of a literature-only application because the literature on terbutaline as a tocolytic was already voluminous.

At roughly the same time, the use of terbutaline in a subcutaneous infusion pump delivery system was increasing. Its advocates (including pregnant women) have generally argued that the terbutaline infusion pump effectively maintains tocolysis at much lower doses than is possible with other delivery systems and that it permits pregnant women to continue tocolysis maintenance at home, rather than in the hospital.

In 1996, the National Women's Health Network presented a Citizen's Petition to the FDA, requesting prohibitory action on the terbutaline subcutaneous infusion pump allegedly because of two maternal deaths that could be attributed to terbutaline use (Docket No. 96P-0258/CP1). On November 13, 1997, Associate Commissioner for Health Affairs Stuart Nightingale issued a "Dear Colleague" letter that said, in part:

Based on information available to the Agency, as well as a review of the published literature, it is clear that the demonstrated value of tocolytics in general is limited to an initial, brief period of treatment, probably no more than 48-72 hours. No benefit from prolonged treatment has been documented. In addition, the safety of long-term subcutaneous administration of terbutaline sulfate, especially on an outpatient basis, has not been adequately addressed.

Published reports on the safety of this use are seriously hampered by methodologic inadequacies. It appears that women receiving continuous subcutaneous terbutaline sulfate infusions experience side-effects and complications similar to those experienced by women receiving terbutaline and other beta-sympathomimetics intravenously. Complications include chest pain, tachycardia, dyspnea, and pulmonary edema. At least one maternal death occurred during outpatient use of a continuous infusion of terbutaline sulfate by subcutaneous pump. The impact of long-term use on maternal glucose metabolism and the risks of prolonged exposure of the fetus are largely unknown.

The "Dear Colleague" letter provoked considerable response from clinicians, researchers, and patient advocates; it apparently induced Matria, Inc., the marketer of the terbutaline pump system, to bring suit against the National Women's Health Network. Advocacy also included a petition from obstetricians to the FDA (Docket No. 98P-0218/CP) to approve subcutaneous terbutaline as a tocolytic treatment. The RTI-UNC EPC has no evidence to date that the FDA has responded to the substance of the petition.

This review for tocolytics suggests that its gray literature actually refers to or summarizes the published literature for policy purposes, rather than presenting unpublished literature. The remaining tocolytic in this evidence report, magnesium sulfate, is not subject to FDA approval as a drug.

As we discussed at the beginning of this section on gray literature, we also examined government documents and manufacturers' information associated with approval and use of assays for E3 and fFN. The first fFN assay was approved on September 21, 1995; a subsequent "Rapid System" by the same manufacturer was approved on August 14, 1998. An E3 assay was approved on April 29, 1998. The available information suggests that most (if not all) of the data relied on by the manufacturers now can be found in published articles and abstracts. Similarly, Pre-Market Approval information for ultrasound devices, including a portable device designed by General Electric expressly for use in obstetrics and gynecology, is straightforward and presents no unusual data. It may be worth noting, however, that the FDA suggests in its guidelines for labeling home uterine activity monitoring devices that they include the following precaution: "No widely accepted controlled studies have been conducted that show that this device is effective at the early detection of preterm labor other than in patients with a previous preterm delivery."

Finally, our efforts to identify unpublished work or other work in progress produced only the atosiban manuscript cited earlier in this chapter.

Literature Retrieval Process

Basic Approach to Article Selection

Once articles were identified through the electronic database search or review article bibliographies, we examined abstracts of the articles to determine whether studies did, in fact, meet our criteria. Two reviewers (one clinician and one methodologist) reviewed abstracts independently for inclusion or exclusion. If both reviewers agreed, an article was retained (or excluded). If the two reviewers disagreed (one indicating eligible and the other judging ineligible), the scientific director made the final decision.

To ensure that all appropriate articles were included and no significant ones were missed, the scientific director rereviewed a 20 percent sample of abstracts determined to be eligible for inclusion as well as all those determined to be ineligible. Generally speaking, for all abstracts, we erred on the side of inclusion rather than exclusion. When abstracts were not available from the electronic search or when there was a question of relevance, the project director or scientific director subjected the full articles to additional scrutiny. In the end, we excluded a study once the full article was examined if, upon this closer scrutiny, we found that it did not conform to the inclusion/exclusion criteria.

For the article review stage, each abstraction form used by a reviewer included an eligibility checklist, or screen, so that if specific criteria were not met, the reviewer was directed not to proceed with the article review. Items that reviewers looked for included whether preterm birth was a study outcome, whether our minimum sample size was met, and whether subjects had signs and symptoms of preterm labor. The eligibility checklist used to review the biologic marker, tocolytic, and antibiotic literatures is presented in Figure 5. The comparable screen used in review of the home uterine activity monitoring literature contained an additional item prompting the reviewer to discontinue the abstraction if the article was not reporting on an RCT.

Figure 5. Article eligibility screen

Screen (If any response is "No," then stop. If any response is "Can't Tell," bring it to the attention of the research coordinator.)

Preterm birth an outcome? graphic element Yes graphic element No graphic element Can't Tell
40 or more subjects at completion? graphic element Yes graphic element No graphic element Can't Tell
Intervention used as part of treating subjects with signs/symptoms of preterm labor (not a preventive intervention used among asymptomatic patients)? graphic element Yes graphic element No graphic element Can't Tell

Productivity and Results of the Literature Search and Retrieval Process

Table 3. Search results by topic
TopicRetrievedRetained
Biologic markers10617
Tocolytics Benefits Harms 256 290 65 85
Antibiotics10715
Home uterine activity monitoring374
Costs and cost analyses10
Totala79795
a

Totals reflect a double count of some articles retrieved (retained) for both the tocolytics benefits and harms reviews.

We compiled the results from the electronic literature searches into the ProCite bibliographic database and removed duplicate records. The bibliographic software allowed us to record articles that were to be included in the evidence tables and those that were to be excluded with the reason for their exclusion. Table 3 summarizes our search results, showing that 797 studies were retrieved and reviewed for inclusion in the report. Of these, only 195 were included in the analysis after inappropriate articles were excluded.

Table 4. Biologic markers search strategy
Biologic MarkersResults
MEDLINE (Index Medicus)
1. Explode labor, premature, and explode epidemiologic studies, and limit to humans 1,039
2. Explode biologic markers; or estriol; or fibronectins; or saliva; or ultrasonography, or endovaginal ultrasound165,902
3. 1 and 235
EMBASE (Excerpta Medica)
1. Premature labor; or premature delivery3,764
2. Biologic with marker12,621
3. 1 and 299
International Pharmaceutical Abstracts
1. Premature labor; or premature delivery162
2. Biologic markers; or estriol; or fibronectin 85
3. 1 and 23
Total unduplicated count from bibliographic databases99
1. Articles found from bibliographies of review articles7
Total possible articles106
Reductions Based On Abstract and Article Reviews
1. Opinion or editorial5
2. Study with <40 subjects6
3. Review article40
4. Wrong language2
5. Not preterm labor18
6. Wrong biologic marker16
7. Twin study3
8. Inclusion status not yet obtained0
Total articles excluded90
Total articles reviewed16
Table 5. Tocolytic search strategy
TocolyticsResults
MEDLINE (Index Medicus)
1. Explode labor, premature, and explode epidemiologic studies, and limit to humans 1,039
2. Explode tocolytic agents
3. 1 and 2145
4. Explode labor, premature and, explode nitroglycerin11
EMBASE (Excerpta Medica)
1. Premature labor; or premature delivery3,764
2. Tocolytic438
3. 1 and 2245
4. Nitroglycerin2
International Pharmaceutical Abstracts
1. Premature labor; or premature delivery162
2. Tocolytic 138
3. 1 and 232
Total unduplicated count from bibliographic databases251
1. Articles found from bibliographies of review articles5
Total possible articles256
Reductions Based On Abstract and Article Reviews
1. Opinion or editorial7
2. Study with <40 subjects45
3. Review articles39
4. Wrong language6
5. Not preterm labor71
6. Not human subjects1
7. Twins5
8. Wrong drug5
9. Unavailable a12
Total articles excluded191
Total articles reviewed65
a

Unavailable: 1 German, 2 French, and 9 English articles were not received in sufficient time for review.

Table 6. Harms related to tocolytic use search strategy
Harms Related to Tocolytic UseResults
MEDLINE (Index Medicus)
1. Explode labor, premature7,001
2. Limit 1 to human and English language4,299
3. Explode tocolytic agents43,746
4. Atosiban24
5. Explode nitroglycerin; or explode nitrous oxide16,112
6. 3 or 4 or 559,383
7. 2 and 6880
8. Explode morbidity; or explode mortality; or explode heart failure, congestive217,717
9. Explode arrhythmia; or explode liver diseases; or explode anxiety310,250
10. Explode glucose intolerance; or explode pregnancy complications; or explode cerebral hemorrhage196,944
11. Explode enterocolitis4,553
12. Explode heart hypertrophy; or explode bradycardia; or explode cerebral palsy33,993
13. Adverse effects21,287
14. 8; or 9; or 10; or 11; or 12; or 13719,408
15. 7 and 14880
16. Limit 15 to clinical trial149
17. Limit 15 to randomized controlled trial90
18. Limit 15 to review; or review literature; or review of reported cases; or review, academic; or review, multicase; or review, tutorial87
19. Limit 15 to meta-analysis1
20. 16; or 17; or 18; or 19232
21. 15 and 20200
22. Case studies2,799
23. Explode cohort studies335,464
24. 22 or 23338,143
25. 15 and 2488
26. 21 or 25270
EMBASE (Exerpta Medica)
1. Premature labor; or premature delivery3,764
2. Tocolytic438
3. Complications; or adverse effects; or harms; or mortality182,906
4. 1 and 2 and 35
Total unduplicated count from bibliographic databases290
Total articles excluded205
Total articles abstracted85
Table 7. Antibiotic search strategy
AntibioticsResults
MEDLINE (Index Medicus)
1. Explode labor, premature, and explode epidemiologic study characteristics, and limit to humans 1,039
2. Explode antibiotics306,659
3. 1 and 220
EMBASE (Excerpta Medica)
1. Premature labor; or premature delivery3,764
2. Antibiotics79,377
3. 1 and 2105
International Pharmaceutical Abstracts
1. Premature labor; or premature delivery162
2. Antibiotics10,411
3. 1 and 23
Total unduplicated count from bibliographic databases105
1. Articles found from bibliographies of review articles2
Total possible articles107
Reductions Based On Abstract and Article Reviews
1. Opinion or editorial4
2. Study with <40 subjects6
3. Review articles28
4. Wrong language3
5. Not preterm labor33
6. Unavailable a2
7. Antibiotics not primary focus16
Total articles excluded92
Total articles reviewed15
a

Unavailable: 2 German articles were not received in sufficient time for review.

Table 8. Home uterine activity monitoring search strategy
Home Uterine Activity MonitoringResults
MEDLINE (Index Medicus)
1. Explode labor, premature; or explode uterine contractions; or explode pregnancy399,320
2. Explode uterine monitoring; or explode monitoring, physiologic 25,688
3. Explode home care services17,002
4. Fetal monitoring; or tocodynamometer4,728
5. 2 or 430,315
6. 1 and 3 and 554
7. Home uterine activity monitoring51
8. 6 or 784
9. Limit 8 to human, and English language, and year = 1980 to 199975
10. Limit 9 to RCTs30
Total unduplicated count from bibliographic databases30
1. Articles found through TEAG and scientific director recommendation7
Total possible articles37
Reductions Based on Abstract and Article Reviews
1. Not an RCT2
2. Review article6
3. Wrong study population15
4. Cost analysis1
5. Meta-analysis1
6. Letter4
7. Not home uterine activity monitoring1
8. Twin study1
9. Study <40 subjects1
10. Not preterm labor1
Total articles excluded33
Total articles abstracted4
Table 9. Costs and cost analyses search strategy
Costs and Cost Analyses (1966-present)Results
MEDLINE (Index Medicus)
1. Explode costs and cost analysis55,836
2. Explode labor, premature, and explode epidemiologic studies, and limit to humans 1,039
3. Explode antibiotics306,659
4. 1 and 2 and 30
5. Explode tocolytic agents42,787
6. 1 and 2 and 40
7. Explode biologic markers; or estriol; or fibronectins; or saliva31,610
8. 1 and 2 and 61
Tables 4 through 9 provide details of the literature search in each topic area. Presented are the results of the bibliographic databases searched and bibliographies in review articles examined followed by the number of resulting articles excluded with the reason for exclusion. Articles were excluded for primarily three reasons: (1) subjects did not have signs and symptoms of preterm labor, (2) the article was a literature review, or (3) the sample size was too small (fewer than 40).

As can be seen in these tables, we had sufficient evidence to warrant proceeding with a review of studies concerning all topics except the cost-effectiveness of using biologic markers or pharmacotherapies in preterm labor (a total of one article concerning the three interventions) (Table 9). Also, we did not find any literature on the biologic marker E3 that met our inclusion criteria; therefore, we did not proceed further with these topics during this review.

Data Abstraction Process

Data Extraction Form

The project director and scientific director worked with core project staff to develop data extraction forms to use for entry of relevant information from the eligible publications. The data extraction form was developed in an iterative fashion with extensive communication between the methodologists and researchers. We began by outlining common variables across types of studies, including study design features and outcomes. We then created more specialized forms to serve as the most efficient and effective vehicles to extract the necessary information from each of the five bodies of literature included in this review: (1) biologic markers, (2) tocolytic benefits, (3) tocolytic harms, (4) antibiotics, and (5) home uterine activity monitoring.

Included across all studies were the following:

  • Description of study designs.

  • Description of patient population, including maternal age and race and clinical. inclusion/exclusion criteria such as condition of membranes (ruptured or intact), estimated gestational age, and maternal and fetal indications for birth.

  • Definition of preterm labor.

  • Description of the test, treatment, or intervention.

  • Description of adjunct therapies.

  • Description of outcomes measured such as prolongation of pregnancy, gestational age at delivery, maternal morbidities, and infant birth weight.

  • Description of secondary analyses performed.

These variables were incorporated into the data extraction forms whose basic structure taken from forms developed for the evidence report on pharmacotherapies for alcohol dependence. The draft forms underwent extensive reviews, including pretesting on several randomly selected articles. After the pretest, we revised the forms to increase their utility and efficiency. The final forms can be found in Appendix B.

The tocolytic and antibiotic data extraction forms were quite similar in design. The tocolytic form had one additional question to capture the definition of tocolysis used in the analysis; therefore, only the tocolytic form is included in Appendix B. We made minor modifications to the form design to accommodate the home uterine activity monitoring literature and made more extensive modifications for the biologic marker form. Rather than mostly capturing the efficacy of treatment based on outcomes of RCTs, the biologic marker literature focuses on the effectiveness of the intervention with outcomes presented generally in terms of predictive values (positive and negative) and sensitivity and specificity. Both the home uterine activity monitoring and biologic marker extraction forms also are found in Appendix B.

Abstractors and Training

The RTI-UNC EPC used abstractors with two types of backgrounds for the data extraction process: individuals with content or clinical expertise and those with strong methodologic skills. Several obstetricians, one pediatrician with training in women's health, and a nurse midwife served as clinican abstractors; all had prior research experience. The methods abstractors were doctoral students in the UNC School of Public Health or the UNC Department of Economics who have extensive training in quantitative methods. In addition, reviewers included a health services researcher with expertise in quantitative methods (the project director) and an obstetrician with expertise in treating women with preterm labor and in conducting clinical research in the topic area (the scientific director).

To collect high-quality data, we put necessary instructions directly on the data extraction forms. Based on prior work of the RTI-UNC EPC, we found that any separately distributed written instructions were less reliable sources of help for abstractors than the extraction form itself. In addition, all abstractors attended two formal training sessions. At the first session, we explained the process and goals of the abstraction. Following the training, the abstractors were sent home with an article to review. We then reconvened the group and, through a review of the test article, ensured that the reviewers understood what was expected from their work. For example, we instructed abstractors that when inconsistencies arose between results stated in the text of an article and those presented in tabular form, they were to take data from the text. They also were told to extract precisely what was contained in the article and to reserve opinions concerning the contents to the comments section at the end of the form.

At the completion of abstractor training, the data abstraction process began. The project director, scientific director, and UNC research coordinator monitored progress. Problems or questions that abstractors encountered were routed through the research coordinator to the appropriate senior staff member.

Tocolytic Harms Abstraction

Extraction Form

The literature search on side effects or harms of tocolytic use, confined to literature in the English language was conducted using MEDLINE and EMBASE. In addition, we included all abstracted efficacy and effectiveness articles that had been flagged as containing harms information. To ensure a comprehensive review of potential harms and because study results were to be analyzed in aggregate, we did not limit this search by the stringent criteria used for the benefits studies. For example, we retained articles concerning women with ruptured membranes in the analysis because the side effects of tocolytic treatment would be the same as for women with intact membranes.

Because of time and resource constraints, we extracted data using only a simple, rather than dual, abstractive. A certified nurse midwife entered data into a spreadsheet and transferred them into an analytic data set for analysis.

Our harms extraction form (Appendix B) covered the following types of information:

  • Types of maternal and neonatal side effects.

  • Sample size of each arm of the study and total sample size.

  • Frequency of side effects in each group.

  • Methodology for harms data collection.

We did not abstract the harms of antibiotic therapy because they are generally well known to clinicians. Pregnancy commonly does not alter the potential complications of antibiotic therapy for mother or baby. One minor exception is tetracycline and its propensity to stain fetal teeth and bone. A potentially serious harm that has not been assessed adequately is the development of resistance to antimicrobials from widespread exposure of pregnant women to antibiotics, which could make treatment of neonatal sepsis more difficult.

Analysis Methodology

Harms results are presented through graphic representation rather than as evidence table entries. This methodology provides a means of comparing the number and percentage of participants experiencing adverse reactions to treatment or placebo. For clarity in the analysis, we grouped the harms into two categories: maternal harms and fetal/newborn harms. We aggregated individual descriptors into clinically sensible categories within each of these broad categories. The aggregation process may lead to some "double counting" of harms. For example, if an article described 15 percent of the subjects as having chest pain and 10 percent as having arrhythmia, then 25 percent of the patients would be assessed as having serious cardiovascular harms. From the studies, one cannot determine whether the patients with the various types of harms are the same or different patients. We did not allow greater than 100 percent of subjects to have harms within a category. Double counting of harms would result when one patient experienced both chest pain and arrhythmia; therefore, the graphs comparing studies should be viewed as somewhat more valuable for demonstrating the relative rates of harms among the therapeutic categories, rather than the absolute rates. (See results in Chapter 3.)

We performed a sensitivity analysis to examine whether such possible double counting would affect the patterns of harms demonstrated in the graphs. A low estimate of harm rates could be attained by selecting the side effect with the highest percentage within each category rather than by adding the harms within categories. In this low-estimate sensitivity analysis, if 25percent of patients had chest pain and 10 percent arrhythmia, then serious maternal cardiac harms would be reported as 25 percent. When we performed this analysis, we found the same overall pattern of harms.

The analysis of fetal/newborn harms and the evaluation of efficacy in other sections of this report overlap to some extent. Rates of fetal demise, sepsis, or intraventricular hemorrhage (IVH) could be considered as either a therapeutic effect or a harm associated with use of the pharmacologic treatment. This information, then, may also be found in the evidence tables.

Table 10. Categories of maternal and fetal/newborn harms related to tocolytictreatment
Maternal harms
  • Serious cardiovascular harms:

    • Respiratory arrest

    • Adult respiratory distress syndrome

    • Ischemic changes on ECG

    • Arrhythmia, including SVT or symptomatic PVCs

    • Chest pain

    • Heart failure or pulmonary edema

    • Abnormal ECG, including nonspecific ST-T wave changes

  • Minor cardiovascular harms

  • Hypertension:

    • Systemic BP >140

    • Diastolic BP <60

    • Transient hypertension

    • Palpitations

  • Tachycardia

  • Gastrointestinal harms:

    • Constipation

    • Diarrhea

    • Epigastric pain

    • Nausea

    • Vomiting

    • Dysgeusia

    • Rectal intolerance

    • Heartburn

    • Soft stool

    • Ileus

  • Metabolic harms:

    • Hypokalemia

    • Hypocalcemia

    • Hypermagnesemia

    • Hyperglycemia

    • Abnormal glucose tolerance test

  • Psychologic harms:

    • Itching

    • Restlessness

    • Agitation

    • Anxiety

    • Other emotional reactions

    • Shakiness

    • Tremor

    • Insomnia

Fetal/newborn harms
  • Fetal tachycardia

    • Increased fetal heart rate above baseline

    • Fetal tachycardia >120

    • Tachycardia

  • Ductal constriction/tricuspid regurgitation

    • Ductal constriction

    • Tricuspid regurgitation

  • IVH

    • IVH

    • IVH i-ii

    • IVH iii-iv

  • Necrotizing enterocolitis

  • Sepsis

    • Amnionitis

    • Sepsis

    • Infection

    • Meningitis

    • Probable sepsis

    • Pneumonia

Notes: ECG = electrocardiogram; SVT = suproventricular tachycardia; PVC = premature ventricular contraction; BP = blood pressure; IVH = intraventricular hemorrhage.

The aggregation of individual harm variables into categories is presented in Table 10. This categorization, of course, is somewhat arbitrary, but it does reflect a significant advance over the level of categorization in the source articles, which often provided no categorization at all. The placebo category refers to placebo pharmacologic therapy within the context of an RCT. Almost all placebo groups also received standard supportive care such as hospitalization, bed rest, and intravenous hydration.

Quality Control, Adjudication, and Reliability

Quality control for determining the eligibility for abstraction was described earlier in the section "Basic Approach to Article Selection." In short, all titles and abstracts were subject to dual review, and the scientific director adjudicated discrepancies. Each article was abstracted by two independent abstractors-one a clinician (content reviewer) and the other a methodologist. Abstractors were blinded to the authors' names, the institution that produced the work, and the journal title; however, in some instances, given the distinct formatting of certain journals, reviewers may have recognized journal titles.

Through prior work of the RTI-UNC EPC, we learned that using a third party for adjudicating differences between reviewers in a system of dual abstraction can be very time consuming. We decided for this project that the more efficient and effective process was for the two abstractors to discuss and reconcile differences between themselves and record their review on one reconciled extraction form. Information contained on the reconciled forms were spot checked by the project manager and the RTI research coordinator. In contrast, abstractors did not reconcile their opinion of the quality of the article as expressed through their responses on the quality rating form. We recorded a grade from each reviewer in the evidence tables.

Because the abstractors adjudicated their own work and completed a joint extraction form, we decided that a statistical check on the interrater reliability of their original data collection was not appropriate. In the past, the RTI-UNC EPC has used kappa statistics to evaluate agreement between reviews. Because our method of reconciliation produced 100percent agreement between reviewers (i.e., essentially perfect interrelater reliability), kappa statistics were neither useful nor appropriate.

Quality Rating Development and Analysis

Quality of the evidence can be judged on two levels: (1) at the level of the individual article and (2) in summary over the spectrum of articles addressing each of the key therapies or interventions. This portion of Chapter 2 describes our approaches to the development of quality ratings on both levels.

Quality of Individual Articles

The RTI-UNC EPC approach to assessing the quality of the individual articles considers the various concerns important to the scientific method of conducting research, including the internal and external validity of each study. It encompasses, among other factors, study design, measurement issues, statistical analyses, and the appropriateness of the conclusions being drawn.

The forms used to collect information on the quality of the literature were the same for the antibiotics and tocolytic literature (Figure 6). We made a few changes in the forms to accommodate differences in the biologic markers literature (Figure 7) and the home uterine activity monitoring studies (Figure 8). For example, the biologic marker form asks whether the authors provide calculations of sensitivity and specificity, likelihood ratios, or odds ratios. This information would not be relevant for the pharmacotherapy literature.

Figure 6. Quality of articles form: Categories and points for grading literature Preterm labor: Antibiotics and tocolytics literature
Category 1: Problem or question studied
Category 2: Sampling
Category 3: Measurement
Category 4: Internal validity
Category 5: External validity
Category 6: Construct validity
Category 7: Statistical conclusions
Category 8: Justification for conclusions
NoYes
1.Is the defnition used for the diagnosis of preterm labor clearly stated?02
NoYes
2.Is the recruitment strategy clearly described?01
NoYes
3.Is there random allocation of treatment?02
NoYes
4.Are baseline levels of cervical dilation and effacement assessed?01
NoYes
5.Are patients with ruptured membranes excluded or assessed in a stratified analysis?01
NoYes
6.Are medically indicated or iatrogenic pregnancies excluded or assessed in a stratified analysis (e.g., preeclampsia, severe fetal defects, placenta previa)?01
Unknown or >20%<20%
7.Is the dropout rate of patients invited into the study less than 20%?01
8.What outcomes are measured?NoYes
  • 1

    Proportion of patients with preterm delivery at a defined period (e.g., 32, 34, or 37 weeks).

01
  • 2

    Maternal outcomes (e.g., length of hospital stay, chorioamnionitis, endometritis).

01
  • 3

    Neonatal outcomes (e.g., APGAR, need for intensive care unit, length of stay, respiratory distress syndrome, birth weight, intraventricular hemorrhage).

01
NoYes
9.Are the outcome measurements clinically relevant?010.5
NoYesNA
10.Are potential confounding variables taken into account in the outcome analysis (e.g., age, race, type of preterm labor, parity)?01 graphic element
NoYesNA
11.Are cointerventions (e.g., bed rest, hydration, or use of antibiotics when tocolytics are the study question, or vice versa) measured or reported on?01 graphic element
NoYesNA
12.Are the treating clinicians masked to treatment administered?01 graphic element
NoYesNA
13.Are the patients masked to treatment received?01 graphic element
NoYes
14.Do the study conclusions apply to pregnant women in the United States or to important subgroups of the U.S. population?01
NoYes
15.Is the clinical setting specified clearly?01
NoYes
16.Are the criteria used to assign gestational age clearly stated?01
NoYesNA
17.If the study is negative, is a power analysis performed?01 graphic element
NoYesSome
18.Are statistically significant (or insignificant) findings clinically significant?010.5
NoYesSome
19.Are statistical tests used appropriate to the data?010.5
NoYes
20.Are the conclusions warranted from the data and/or design of the study?02
Figure 7. Quality of articles form: Categories and points for grading biologic marker literature
Category 1: Problem or question studied
Category 2: Sampling
Category 3: Measurement
Category 4: Internal Validity
Category 5: External validity
Category 6: Construct validity
Category 7: Statistical conclusions
Category 8: Justification for conclusions
NoYes
1.Is the description of the test procedure provided?01
NoYes
2.Are the relevant outcomes clearly defined?01
NoYes
3.Is a reference standard for the test clearly referenced?01
NoYes
4.Is the recruitment strategy clearly described?01
5.Are inclusion/exclusion criteria clearly stated?01
6.Are baseline levels of cervical dilation and/or effacement assessed?01
7.Are patients with ruptured membranes excluded or assessed in a stratified analysis?01
8.Are medically indicated or iatrogenic deliveries excluded or assessed in a stratified analysis (e.g., preeclampsia, severe fetal defects, placenta previa)?01
Unknown or >20%<20%
9.Is the dropout rate of patients invited into the study less than 20%?01
10.What outcomes are measured?NoYes
  • 1

    Proportion of patients with preterm delivery at a defined period (e.g., 32, 34, or 37 weeks).

01
  • 2

    Maternal outcomes (e.g., length of hospital stay, chorioamnionitis, endometritis).

01
  • 3

    Neonatal outcomes (e.g., APGAR, need for intensive care unit, length of stay, respiratory distress syndrome, birth weight, intraventricular hemorrhage).

01
NoYesSome
11.Are the outcome measurements clinically relevant?010.5
NoYesNA
12.Are potential confounding variables taken into account in the outcome analysis (e.g., age, race, type of preterm labor, parity)?01 graphic element
NoYesNA
13.Are cointerventions (e.g., bed rest, hydration, tocolytic use) measured or reported on?01 graphic element
NoYesNA
14.Are the treating clinicians masked to test result?01 graphic element
NoYesNA
15.Are the patients masked to test result?01 graphic element
NoYes
16.Is the clinical setting specified clearly?01
NoYes
17.Are the criteria used to assign gestational age clearly stated?01
NoYesNA
18.Is a power analysis performed?01 graphic element
NoYesSome
19.Are statistically significant (or insignificant) findings clinically significant?010.5
NoYesSome
20.Are statistical tests used appropriate to the data?010.5
NoYes
21.Do the authors provide either sensitivity/specificity, likelihood ratios or odds ratios?01
NoYes
22.Are the conclusions warranted from the data and/or design of the study?02
Figure 8. Quality of articles form: Categories and points for grading literature home uterine activity monitoring
Category 1: Problem or question studied
Category 2: Sampling
Category 3: Measurement
Category 4: Internal validity
Category 5: Construct validity
Category 6: Statistical conclusions
Category 7: Justification for conclusions
NoYes
1.Is the definition used for the diagnosis of preterm labor clearly stated?01
2.Is the intervention being tested clearly described?01
NoYes
3.Is the recruitment strategy described?01
NoYes
4.Are the inclusion criteria for the study stated?01
NoYes
5.Is the randomization scheme described?01
NoYes
6.Are baseline levels of cervical dilation and effacement presented?01
Unknown or >20%<20%
7.Is the dropout rate of patients invited into the study less than 20%?01
NoYes
8.Are all patients who dropped out of the study accounted for?01
NoYes
9.Are the outcomes clearly described?01
NoYesSome
10.Are the outcome measurements clinically relevant?010.5
NoYesNA
11.Are potential confounding variables taken into account in the outcome analysis (e.g., age, race, labor, parity)?01 graphic element
NoYesNA
12.Are any cointerventions reported on?01 graphic element
NoYesNA
13.Are the treating clinicians masked to intervention group?01 graphic element
NoYesNA
14.Are the patients masked to treatment received?01 graphic element
NoYes
15.Are the criteria used to assign gestational age stated?01
NoYesNA
16.Was a power analysis performed?01 graphic element
NoYesSome
17.Are statistical tests used appropriate to the data?010.5
NoYesSome
18.Are statistical tests clearly described?010.5
NoYes
19.Are levels of significance and or confidence intervals given?01
NoYes
20.Was an intent-to-treat analysis performed?01
NoYes
21.Are the conclusions warranted from the data?01
22.Are the conclusions warranted from the study design?01

To assess the internal validity of a study (i.e., the likelihood that the design and conduct of the study minimize systematic error or bias) the following factors were evaluated from each study across all study types:

  • Description of the recruitment strategy.

  • Masking of the treating clinician and patient.

  • Inclusion of baseline measurements including cervical dilation, effacement, and estimated gestational age.

  • Definition of outcome measures, attrition rates, and confounding variables.

Statistical considerations included assessing the adequacy of the methods used and whether the authors did a power calculation. The rating scale also addressed the clinical relevancy of the statistical findings.

With regard to diagnostic and outcome measures, the abstractors evaluated whether the standard for a test was clearly referenced. In addition, the quality rating instrument assessed whether studies included the three major categories of outcomes (i.e., those for delivery, mothers, and infants).

External validity (i.e., whether the findings of the study can be generalized to populations that did not participate in the study) also was considered for quality rating. We included in our evaluation whether the clinical setting was specified clearly and whether conclusions apply to pregnant women in the United States.

The quality rating instrument was based on a point scale that was transformed into a percentage score, with the higher percentage score denoting a better study. Each abstractor completed the quality rating for each article reviewed. The individual scores provided in the evidence tables for each article were derived by calculating the following ratio: the points from the responses circled by the abstractor (numerator) divided by the total number of points that could have been awarded (denominator). When the items were determined to not be applicable, "NA," then the total number of points for that item was removed from both the numerator and denominator. Quality rating scores for each article were not averaged across the two abstractors (i.e., clinician and methodologist), so that patterns across studies could be observed.

Grading of the Evidence

According to a report developed by Lohr and Carey,97 several systems have been proposed in the past 10 to 15 years for "grading the quality or strength" on a given clinical topic or pathway; however, little or no consensus exists about which specific system is best, and thus little guidance was given about which approach to use for our review.

The RTI-UNC EPC developed a grading scheme that considers the strength and the quality of the evidence based on the work of the RTI-UNC EPC in the Evidence Report on the Pharmacotherapy for Alcohol Dependence98 and a report by the U.S. Preventive Services Task Force (1996).99 Our determination of the strength of the evidence includes an evaluation of the consistency of study findings over all studies, the quality rating scores for individual pieces of research, the magnitude of important outcomes, and, when available, the results of our meta-analyses. Because we include both randomized and cohort studies in the review, we also record whether the evidence is appropriate for evaluating the efficacy or the effectiveness of the findings. The letter grade given to a body of literature is not necessarily higher because it contains a greater number of RCTs; rather, it rests on the quality of the studies themselves. We did not assign scores for each of the variables considered for the ratings; instead, we based our quality assessment on an adjudicated rating that was initially provided by the project director, scientific director, and clinical methodologist as follows:

Grading of the Evidence
Good (A):The data are sufficient for evaluating the quality of the findings. The data are consistent and indicate that the key drug or intervention is superior to alternative treatment or placebo for treating women in preterm labor.
Fair (B):The data are sufficient for evaluating the quality of the findings. The data indicate inconsistencies in the findings between the key drug or intervention and alternative treatment or placebo for treating women in preterm labor.
Poor (C):The data are sufficient for evaluating the quality of the findings. The data do not show that the key drug or intervention is superior to alternative treatment or placebo for treating women in preterm labor.
Incomplete evidence (I):The data are insufficient for assessing the quality of the key drug or intervention for treating women in preterm labor based on limited sample size or poor methodology.
Efficacy (1):The evidence was obtained from well-designed RCTs.
Effectiveness (2):The evidence was obtained from well-designed cohort or case control analytic studies.

Development of the Evidence Tables

Table 11. Content of study design and outcomes evidence tables
Study Design
Outcomes
Citation
Research objective
Study design and masking
Definition of preterm labor
Patient inclusion/exclusion criteria
Sample size initial/end
Demographic characteristics
Intervention
Cointerventions (drug and nondrug)
Country and time period
Quality score
Definition of successful tocolysis (tocolytics only)
Compliance assessment (antibiotics, tocolytics, and home uterine activity monitoring only)
Estimated gestational age
Prolongation of pregnancy
Gestational weeks
Maternal morbidities
Birth weight
APGAR score (antibiotics and tocolytics only)
Other infant outcomes
Other analyses
Conclusions
Comments and limitations (home uterine activity monitoring only)
We created our evidence tables from the format used in the RTI-UNC EPC's Evidence Report on Pharmacotherapy for Alcohol Dependence. Based on this example, we separated the presentation of study information into two tables-one for study design and the other for outcomes. The content of these two tables are presented in Table 11. All the evidence tables are contained in Volume 2.

The information contained in the tables is self-explanatory; however, numerous abbreviations, which can be found in Appendix C, were used to conserve space.

The content of the study design table is consistent across study topics. All tables include a statement of the research objective, the definition of preterm labor used by the authors, patient inclusion/exclusion criteria, and a description of the experimental intervention. When available, we include the total number of participants at the beginning and at the end of the study as well as the number of participants in each arm of the study at its conclusion.

Information unique to particular studies includes a definition of tocolysis in the tocolytics table and the exclusion of compliance assessment information in the biologic markers tables (as this would be irrelevant in relation to a diagnostic test). The home uterine activity monitoring study design table (Evidence Table 13) includes a list of the outcome measures that are described in greater detail in the outcomes table.

The outcomes tables across types of studies include data on delivery, maternal, and infant outcomes. The presentation is the same in the tables describing tocolytic and antibiotics outcomes (Evidence Tables 3 through 12). The home uterine activity monitoring outcome table provides information on recurrent preterm labor and includes comments and limitations of the study. In the pharmacotherapy studies table, information about the treatment for recurrent labor is included under intervention. We did not include comments and limitations for other study types because authors typically did not provide such information.

The presentation of outcomes in the biologic marker outcome tables is somewhat different from the other study types (Evidence Tables 1 and 2) because they describe the results of diagnostic tests and delivery outcomes and are presented in terms of sensitivity, specificity, positive predictive value and negative predictive value.

Peer Review Process

Among the more important activities involved in producing a credible evidence report is conducting an unbiased and broadly based review of the draft report. Such a review, here termed "peer review," should provide an array of scientists, methodologists, users, and laypersons adequate opportunity to comment on the report and to identify problems of fact, interpretation, or presentation. Appendix A describes the selection process for peer review and lists the names of the peer reviewers.

Supplemental Analysis

In addition to the systematic review of the literature discussed above, we conducted meta-analyses focusing on treatment effects of each class of first-line tocolytics and maintenance tocolytics, antibiotics, and home uterine activity monitoring. The efficacy of treatment was measured in relation to the following outcomes: (1) prolongation of pregnancy, (2) estimated gestational age at delivery, and (3) birth weight. We restricted the literature for the meta-analyses to studies included in our evidence tables, RCTs, and to data concerning women with intact membranes. Chapter 4 presents a thorough discussion of the analyses, including selection criteria, methodology, and study findings.

Chapter 3. Results

Introduction

Based on the data presented in the evidence tables, this chapter presents the major findings from our review of the literature regarding the management of preterm labor. The chapter begins with a discussion of issues and limitations relevant to the collective body of literature. Subsequent sections focus on each assessment technique or therapy and provide a detailed discussion of efficacy and/or effectiveness results (benefits). We address, in turn, biologic markers (fFN and EVUSD), tocolytics (first-line and maintenance), antibiotics, and home uterine activity monitoring. Side effects (harms) data are presented only in relation to tocolytic treatment.

Issues and Limitations in the Literature

Through our review, we found a number of issues affecting the quality of the studies reported in the literature that cut across categories of assessments and interventions in the management of preterm labor. Our ability to draw conclusions with confidence is diminished by the following five issues concerning research design and study execution: (1) the definition of preterm labor, (2) the size of the trials, (3) the confounding of results because of use of other interventions, (4) the sole reliance on bivariate analysis, and (5) the failure to analyze separately women with medically indicated preterm births and compare results with those of the larger group of women with preterm labor. We discuss other concerns specific to the literature about a particular assessment technique or therapy in the section of this chapter covering that specific topic.

Definitions used to delineate an episode of preterm labor vary widely across studies. Some investigators are quite precise, requiring a specified level of contraction duration, contraction frequency, and cervical change; however, within this group, the number or frequency of symptoms varies greatly. Other authors specify, as an entry criterion, a lack of response to nondrug interventions, such as bed rest and hydration. Still other authors do not define preterm labor or specify the symptoms used as an inclusion criteria for the study. Thus, the reviewer is presented with possible heterogeneity in study populations, a factor that limits the generalizability and aggregation of study results.

Because adverse perinatal outcomes (morbidity and mortality) are so rare, studies generally have inadequate numbers to provide the power to assess whether any observed differences were statistically significant. Hence, nonstatistically significant results may stem from a lack of power to detect positive outcomes rather than a true lack of benefit. Instead, investigators rely on surrogate outcomes such as pregnancy prolongation, mean gestational age at delivery, and rate of delivery before a specific cutoff point (most commonly 35 to 37 weeks). Reviewers are asked implicitly to make the assumption that improvements in these surrogate outcomes would translate into improvements in perinatal outcomes if applied to enough women with preterm labor symptoms.

Interpreting results is further complicated by the use of cointerventions. Most studies of one type of therapy allow the use of another type of intervention; for instance, antibiotic studies allow for tocolytic use and vice versa. Moreover, studies also allow for nondrug interventions, such as intravenous hydration and bed rest. To avoid confounding from cointerventions, analyses would need to impose study designs or include statistical techniques to control for use of the cointerventions. Very few of the studies reviewed for this evidence report include these controls.

Given the rarity of perinatal death and serious morbidity, assessing these outcomes would require enrolling thousands of women. As this is unlikely to be a practical solution, survival analysis is an analytic technique that can enhance outcome presentations by providing stratified analyses by gestational age at enrollment for the period from treatment to birth. (See Rothman and Greenland 100 for a general description of survival analysis.)

To date, however, studies typically rely on bivariate analyses of categorical and continuous outcomes. Although categorical measures can capture such outcomes as "preterm birth, yes or no," these results may not be clinically informative in and of themselves because women at a wide range of gestational ages may be included in the analysis. Comparisons of continuous outcomes, such as mean prolongation of pregnancy, may also be misleading. For example, a similar lengthening of gestation may be of greater value for a woman at 26 weeks' gestation relative to 34 weeks' gestation and, because a wide range of gestational ages are typically included in the studies, this distinction is not commonly available.

The etiologic pathways leading to preterm birth are uncertain. Given the lack of epidemiologic evidence, it remains unclear as to whether indicated preterm births are etiologically heterogeneous or homogeneous with spontaneous preterm births. The inclusion of dual analyses in the review of efficacy and effectiveness studies could provide useful insights into this controversy; results for women with indicated preterm births and those experiencing spontaneous preterm births could be analyzed separately and in combination. A body of literature of this kind could provide insights into this controversy and support conclusions concerning whether indicated preterm births should be included or excluded from the analyses of women with preterm labor.

Biologic Markers

Chapter 1 discussed the clinician's limited ability to ascertain which symptomatic woman is at risk for imminent delivery and which is not. The inability to assign risk accurately, coupled with the ubiquity of preterm labor symptoms, exposes numerous women to unnecessary and potentially harmful treatment. Moreover, this phenomenon dilutes researchers' ability to detect a significant intervention effect in those with uterine activity that will result in cervical change and potential preterm birth. Thus, a rapid and reproducible biochemical or biophysical "diagnostic" test for women with preterm labor symptoms that could accurately assess the risk of preterm delivery would be of great value. In Chapter 1, we labeled such tests "biologic markers."

We proposed to study three biologic markers in this evidence report: fFN, EVUSD, and E3. Restricting our review to investigations that include women with symptoms of preterm labor, as we have done throughout this report, resulted in our finding no studies of E3 in symptomatic women. As a result, we cannot comment on estriol's efficacy to assess preterm delivery risk in women with preterm labor symptoms and will not address in this chapter this biologic marker. Also, the body of literature about the use of EVUSD as a screening tool is quite extensive; however, we have excluded screening studies in asymptomatic women, which substantially restricts the articles available for review in this evidence report.

Of the 17 reviewed studies about biologic markers, 12 examined fFN; 46 101 102 103 104 105 106 107 108 109 110 their findings can be found in Evidence Table 1. Three additional studies concern EVUSD alone; 55 111 112 the details can be found in Evidence Table 2. Finally, findings of two other studies that examined both fFN and EVUSD 113 114 can be found in both Evidence Tables 1 and 2 for completeness.

Description of the Biologic Marker Literature

General Limitations of this Literature

As discussed previously, all the literature relating to preterm labor has limitations including the definition of an episode of preterm labor, the trial size, the confounding of results because of the use of other interventions, the reporting of preterm birth outcomes in a categorical fashion, and the failure to analyze separately women experiencing medically indicated births.

With respect to biologic markers specifically, we note first that all biologic marker studies conducted to date have been observational. Although observational studies can give insights regarding the sensitivity and specificity of a biologic marker to predict preterm delivery within a certain period of time, they cannot assess whether clinicians and patients can use the additional information provided by biologic markers to improve perinatal outcomes. That hypotheses is best tested in an RCT in which clinicians are given biologic marker results for one set of patients and not for another. To our knowledge, no such study has been conducted. Moreover, masking and generalizability are further issues in the biologic marker research arena.

Study-Specific Characteristics of the Biologic Marker Literature

Overall, the study populations in these investigations were not large. Of the 12 fFN articles, 8 had fewer than 100 subjects relevant for our analysis. 102 104 106 107 109 110 114 115 Of the two studies that combined fFN and EVUSD, one had a population of 108 113 and the other 76. 114 Two studies on EVUSD alone enrolled fewer than 100 women, 55 111 another had 162 participants. 112

Definitions of preterm labor symptoms were highly variable. If one deems an adequate definition of preterm labor symptoms to include uterine contraction frequency and duration along with cervical dilation status, then only six studies used definitions that can be labeled adequate. 55 108 110 111 112 113 Five studies did not definite preterm labor symptoms. 101 102 106 107 110 In the largest study, a preterm labor symptom was defined as one that prompts an unscheduled visit to a clinician. 46

All but one report failed to control for confounding interventions that may have altered the likelihood of preterm birth in women with preterm labor symptoms. Peaceman, Andrews, Thorp, et al. 46 controlled for tocolytic use in two ways: (1) by calculating test performance characteristics in women who did and did not receive a tocolytic and (2) by including a study site that did not use tocolytics as a treatment to prevent preterm delivery.

Assessing the outcomes of interventions in women with preterm labor symptoms categorically (preterm birth: yes or no) as opposed to continuously (mean gestational age at delivery or mean prolongation of pregnancy in days) can make clinically important differences difficult to detect. This blurring phenomenon has been described in detail in other sections.

To avoid such problems, we have advocated survival analysis stratified by gestational age at enrollment as the ideal analytic technique. The largest fFN study, 46 a small French report on fFN, 106 and one of the combined reports 113 used this method of analysis. Three other fFN studies approximated survival analysis by using the categorical outcome of preterm delivery within 7, 14, and 21 days after sample collection, 105 108 110 and another evaluated 7- and 14-day intervals. 46

Preterm birth is thought by some to be etiologically heterogeneous; that is, medically indicated preterm births in nonlaboring women have different causal pathways than do births arising from preterm labor or premature rupture of membranes. If this concept of etiologic heterogeneity is accurate, then a corollary is that diagnostic tests designed to measure changes indicative of imminent preterm birth in women with preterm labor symptoms should have no predictive value for medically indicated preterm births. However, we do not know that this is true, since, to our knowledge, only Peaceman, Andrews, Thorp, et al. 46 have excluded medically indicated births before term.

In an observational study of a diagnostic test, an essential study design element is to mask clinicians (care providers, outcomes assessors) to the test result. Likewise, the ideal design is to ensure that the patients themselves are unaware of their fFN or ultrasound status. Failure to mask clinicians can alter clinician practices based on their biases and, thus, change test performance. We reviewed each study with respect to masking. If the articles states that clinicians caring for study subjects were unaware of the study results, we considered masking to be adequate; conversely, if the articles made no such statement, we judged masking to be inadequate. Three fFN studies, 105 107 115 one combined fFN-EVUSD study, 114 and one ultrasound study 55 had inadequate masking of clinicians.

Information about the number of patients eligible for the study who actually enrolled and completed the study is important because, in absence of such data, the assessment of the quality of the study and its generalizability to the universe of women with symptoms of preterm labor is difficult, if not impossible. Peaceman, Andrews, Thorp, et al. 46 and Iams, Casal, McGregor, et al. 103 were the only authors that provided information about the number of symptomatic women who were screened for study participation and the number ultimately enrolled. In these two studies, respectively, 52 percent and 46 percent of women screened for participation actually were enrolled and had fFN performed.

Each of the studies except that by Watson, Kim, and Humphrey 110 provided test characteristics such as sensitivity, specificity, and positive and negative predictive values of preterm delivery. Although influenced by the prevalence of preterm birth, predictive values, namely positive predictive value (PPV) and negative predictive value (NPV), are the most clinically relevant data. Predictive values answer the critical question: Given a positive or negative test result, what is the likelihood that a woman with symptoms of preterm labor will have a preterm delivery? All but two fFN studies 102 110 provided the reader with PPV and NPV. Inglis, Jeremias, Kuno, et al. 102 provided sufficient detail to calculate predictive values.

Results of the Biologic Marker Literature Review

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3153_f009.jpg.

   Figure 9. Use of fFN to adjust clinical estimates of risk for preterm birth

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3153_f010.jpg.

   Figure 10. Use of EVUSD to adjust clinical estimates of risk for preterm birth

In this section, we summarize test characteristics. Because these biologic markers appear to have clinical utility, at the end of this section we present aggregate information leading to a discussion of how biologic marker information can enhance clinical judgment and decisionmaking. We present graphs (Figures 9 and 10) that relate prior clinical assessment of the probability of preterm birth to probability estimates informed by biologic marker values.

fFN Studies

The predictive value of a positive fFN, that is, the proportion of women with a positive test who deliver before term, is moderate at best. (The cutoff point for fFN positivity was identical, i.e., 50 ng/ml in the fFN studies.) In the articles cited in Evidence Table 1, the PPV of predicting delivery within 7 days ranged from 13 percent to 44 percent 46 103 104 105 108 (singleton pregnancies only). When the outcome of interest was delivery at less than 37 weeks, the PPV ranged from 31 percent 107 to 83 percent. 109 In the largest fFN study, Peaceman, Andrews, Thorp, et al. 46 reported a PPV of 43 percent for delivery at less than 37 weeks.

In contrast to PPV, the NPV of fFN for identifying those at low risk of preterm birth is good, especially to assess risk of imminent delivery. The NPVs for birth before 37 weeks reported in these studies ranged between 69 and 92 percent 102 105 and for birth within 7 days between 98 and 100 percent. 104 105

Two studies of fFN 46 103 compared the diagnostic accuracy of this biologic marker with that of traditional clinical risk assessment based on cervical dilation, contraction frequency, and vaginal bleeding. In both instances, fFN was superior to clinical assessment.

EVUSD Studies

We elected to aggregate the test performance data from EVUSD studies that evaluated comparable outcomes with composite estimates of sensitivity, specificity, PPV, NPV, and likelihood ratios. In the five EVUSD papers, we developed cutoff points for considering ultrasound findings abnormal (in other words, a positive test) from the study data. Timor-Trisch, Boozarjomehri, Masakowski, et al. 111 focused on dynamic cervical changes (e.g., wedging or funneling); the other studies relied on cervical length. Performance of sonography, evaluating cervical length using cutoff points of 2.6 and 3.0 cm, 55 112 was similar to that of fFN: moderate PPVs of 50 to 55 percent and high NPVs of 76 to 100 percent. 55

fFN and EVUSD Studies

The two papers that compared fFN with EVUSD used cervical length as a criterion to assign risk. Rizzo, Capponi, Arduini, et al. 113 used 20 mm as a cutoff point; Rozenberg, Goffinet, Malagrida, et al. 114 used 26 mm. In one combined study, Rizzo, Capponi, Arduini, et al. 113 found that fFN testing was superior to sonography in terms of diagnostic performance; Rozenberg, Goffinet, Malagrida, et al. 114 found no difference between the two testing modalities. Both studies found that the combined use of fFN and EVUSD improved test performance and that each made independent contributions to the prediction of risk for preterm delivery.

Aggregate Performance of fFN and EVUSD
Overview

This section presents estimates of sensitivity, specificity, PPV, NPV, and likelihood ratios as a means of clarifying the potential value of these biologic markers in clinical practice. Of the 14 articles evaluating the use of fFN, 12 provided sufficient detail to construct contingency tables describing the relationship between test results and birth before 37 completed weeks' estimated gestational age (see Evidence Table 1). 46 101 102 103 104 105 106 107 108 109 113 114 Each study used a cutoff point of greater than or equal to 50 ng/ml for a positive test.

Aggregating across these studies, 1,786 samples from women being evaluated or treated for preterm labor were available for this analysis. In this combined cohort, 488 preterm births occurred, for an overall prevalence of 27 percent. Of the 267 women who delivered before term, 267 had positive test results at their initial evaluation and 221 did not. Of those who delivered at term, 1,111 women had true-negative fFN results and 187 women had false-positive fFN values.

Results

Combined, the sensitivity of fFN (i.e., the ability to identify correctly those who delivered before 37 weeks) was 54.7 percent; specificity (i.e., the ability to identify those who delivered after 37 weeks) was 85.6 percent. The PPV (i.e., the probability that a woman with a positive test result would deliver before term) was 58.8 percent. The NPV (i.e., the probability that a woman with a negative test would deliver at term) was 83.4 percent. The likelihood ratio for a positive fFN is 3.8 (95 percent confidence interval [CI]: 3.3, 4.4) and 0.53 (95 percent CI: 0.48, 0.58) for a negative test result. The clinical application of likelihood ratios is reviewed briefly below.

The majority of these studies enrolled women in a 10-week range of gestational ages, meaning some subjects have longer time at risk for preterm birth than others, typically between 24 and 34 weeks. This factor has the effect of reducing sensitivity of testing at a single point in time for predicting birth before 37 weeks. Presumably, some women with an initial negative fFN test will subsequently develop positive tests. In a sense, these women actually do not reflect false-negative values on their initial test; rather, they reflect the time-dependent nature of using biologic markers and their relationship to outcomes.

Limiting the effect of weeks elapsing between fFN is informative about the utility of fFN as a reliable biologic marker. The five publications that examine the more immediate outcome of delivery within a week are instructive on this point. 46 103 104 105 108 The aggregate sensitivity of fFN (from these studies) to identify women who delivered in 7 days after testing was 89.4 percent; the specificity was 83.3 percent; the PPV was 22.9 percent; and the NPV was 99.3 percent. For the outcome of birth within 7 days, the likelihood ratio for a positive fFN is 5.3 (95 percent CI: 4.6, 6.2) and 0.13 (95 percent) CI: 0.06, 0.26) for a negative fFN. The high NPV, supported by this combination of results, suggests that fFN conveys an important benefit in terms of identifying women who are not an immediate risk of preterm birth and who may not require intervention.

The EVUSD literature presents special challenges to combining results in this fashion. Continuous predictors, such as cervical measurements, cannot be captured adequately from the level of detail permitted in publications. Appropriately, therefore, authors of the articles we reviewed examined their data and established categories of cervical change or dichotomous cutoff points for examining test characteristics or calculating likelihood ratios. As a result of their statistically driven analyses, the studies show greater inconsistency in the cutoff points used than was true for the fFN articles.

Of the five EVUSD studies in women with signs or symptoms of preterm labor, three evaluated the use of wedging or funneling to determine risk of preterm birth. 111 112 113 Combined, these investigators evaluated 314 women, of whom 98 delivered before 37 weeks' gestation. Of those who delivered preterm, 59 had funneling or wedging noted on ultrasound examination; 39 did not have visible funneling. In addition, 23 women had funneling and delivered at term (a total of 23 false-positives), and 171 women had normal ultrasounds and term deliveries. These figures are equivalent to the following rates: sensitivity, 60.2 percent; specificity, 79.2 percent; PPV, 56.7 percent; and NPV, 81.4 percent. The positive likelihood ratio for funneling is 2.8 (95 percent CI: 2.1, 3.9) and 0.50 (0.39, 0.65) for the absence of funneling.

Two studies 55 112 used the cervical length of 3 cm or less as a diagnostic cutoff point to assign higher risk of preterm birth. This threshold was associated with the following rates: sensitivity, 88.5 percent; specificity, 59.3 percent; PPV, 49.5 percent; NPV, 92.0 percent; and positive likelihood ratio of 2.2 (95 percent CI: 1.7, 2.7); negative likelihood ratio of 0.19 (0.10, 0.39).

Table 12. Predictive value of biologic markers in women with symptoms of preterm labor
SensitivitySpecificityTest Characteristics by Incidence of Preterm Birth In Women Presenting for Preterm Labor
Included StudiesLow Risk Example Within 7 Days = 3% Before 37 Weeks = 10%Medium Risk Example Within 7 Days = 10% Before 37 Weeks = 25%High Risk Example Within 7 Days = 15% Before 37 Weeks = 45%
Observed IncidencePPVNPVPPVNPVPPVNPVPPVNPV
Birth within 7 days
fFN89.483.35.322.999.314.399.637.298.648.697.8
Birth before 37 weeks
fFN54.785.627.358.883.429.794.555.985.075.769.8
EVUSD: Funneling/wedging60.279.231.256.781.424.394.749.085.670.470.9
EVUSD: Cervical length < 3 cm88.559.331.149.592.019.497.842.093.964.086.2

fFv = fetal fibronectin; EVUSD = endovaginal ultrasound; PPV = positive predictive value; NPV = negative predictive value

The test characteristics reviewed above are summarized in Table 12. The right portion of the table provides examples of the predictive values of these biologic markers in the studies included in this summary and for a range of hypothetical rates of preterm birth in women presenting with signs and symptoms of preterm labor. As shown in the table, as the proportion of women who gave birth preterm increases, the NPV falls, and the PPV improves. The aggregate predictive values reflect expected performance of the biologic markers in populations similar to that of the study participants with the observed incidence of preterm birth comparable to that summarized in the table.

Conclusions

Biologic markers supplement clinical judgment. Based on a patient's obstetric history, her description of current symptoms, and findings from physical examination and monitoring of contractions, care providers estimate the probability of preterm birth. An estimate of high probability results in further testing and/or treatment; an estimate of lower probability leads to continued observation. Thresholds for deciding whether to treat women in preterm labor or to observe them further are highly individualized and often not explicit. For example, physicians with aggressive treatment policies may favor intervention at any time that they believe that a woman has a greater than 10 percent chance that her preterm labor might result in preterm birth and will likely further adjust that judgment based on gestational age. A conservative philosophy might reserve treatment for cases in which the probability estimate of a preterm birth is greater than 30 percent, accepting that time and observation will further clarify the need to treat.

Regardless of where this threshold to treat lies for individual providers, there exists a range of probabilities between an estimated probability at which they "always" treat and an equivalent estimated probability at which they are comfortable sending a patient home with no further evaluation. When this range is relatively wide, the additional diagnostic or prognostic information provided by fFN or EVUSD testing may help the clinician to make a more sound decision.

Clinical applications of likelihood ratios

Imagine a provider whose threshold to treat is 20 percent and who would accept a 2 percent risk that the pregnancy would be delivered before term. Faced with a patient who is being evaluated for her second episode of preterm labor at 33 weeks' gestation and who has contractions every 8 minutes, no cervical change, and a history of a prior preterm birth at 36 weeks, the clinician may believe that the woman's current episode has a 10 percent chance of leading to progressive cervical change and delivery. This is within the clinician's range of clinical uncertainty; additional information, such as fFN or EVUSD data, can have direct decisionmaking utility.

The question is how to apply such information easily across a range of patient populations with varied prevalence of preterm birth. Likelihood ratios for categories of test results accomplish this; they can be used to modify the odds of a patient's having a specific outcome such as preterm birth or birth within the next week. Posttest odds are equal to the pretest odds multiplied by the likelihood ratio.

In this hypothetical example, the provider's pretest probability, based on clinical judgment, was 10 percent. This value can be converted to odds using this formula:
graphic element
Pretest odds are 0.11:1 or 1:9.

From the combined analysis of fFN discussed above (with a likelihood ratio of 3.8 percent for birth before 37 weeks), if the woman's fFN test was positive, then her posttest odds would be given by the following equation:
Posttest odds = Pretest odds x Likelihood ratio = 0.11 x 3.8 = 0.42

To convert this back to a probability, we use this formula:
graphic element

In short, with a positive fFN test, a woman's posttest odds are 0.42, which is equivalent to a probability of 30 percent. This is a value within the range of probabilities for which the hypothetical clinician would start treatment. Pocket nomograms are available to ease use of these conversions in clinical settings. 116

Figures 9 and 10 remove these intermediate calculations and allow direct inspection of the relationship between a clinician's initial assessment of risk of preterm birth (the X-axis) and the probability after adjustment for evaluation of biologic markers (the Y-axis). If the clinician is certain that a woman will deliver, then both probabilities are near 1.0 (in the upper right-hand corner of the figures), and the biologic marker contributes little. The lower range of clinical assessment of probability (toward the lower left-hand corner) is of greater interest. In this range, biologic markers offer additional information that can aid decisionmaking, prompting treatment or confirming the choice of expectant management.

An alternative approach to using these graphs is to acknowledge that clinical estimates of risk of spontaneous preterm birth for an individual are imprecise, whereas the prevalence of preterm birth from preterm labor in a specified population can be determined with certainty. This prevalence can serve as the prior probability. In this case, the prevalence is located on the X-axis, and the Y-axis reflects the posterior probability, in that population, adjusted for the biologic marker result. Whether interpreted as adjustment of individual risk or risk for a population, these biologic markers offer additional information that refines assessment of the probability that preterm birth will result from an episode of preterm labor.

Tocolytics

We grouped the tocolytic strategies using two schemes. First, tocolytics were designated as either first-line or maintenance regimens. The former are used to treat acute preterm labor symptoms after the clinician decides that these symptoms are so significant that administration of a tocolytic drug to prevent preterm birth is indicated. Maintenance regimens are used after an episode of acute tocolysis with a first-line regimen. The goal of maintenance tocolysis is prolonged prevention of a preterm birth in a woman who had experienced symptoms of preterm labor.

Second, we divided the tocolytic strategies using two schemes: (1) beta-mimetics, (2) calcium channel blockers, (3) magnesium sulfate, (4) NSAIDs, and (5) a category labeled "other" that includes ethanol and atosiban. Clinicians use drugs within all five of these categories for both first-line and maintenance tocolysis. The evidence tables in correspond to these categories. Evidence Tables 3 through 7 contain first-line tocolytic studies, and Evidence Tables 8 through 11 provide information on maintenance tocolytics. Based on the available research that met our inclusion criteria, the maintenance review includes only beta-mimetics, calcium channel blockers, magnesium sulfate, and NSAIDs. Studies comparing classes of tocolytics are repeated in the evidence tables and the discussion in this chapter within each tocolytic class.

A review of the potential side effects or harms from treatment follows the discussion of the benefits of tocolytic therapy. Results are divided by the drug's mechanisms of action. Owing to time and resource limitations, we were unable to divide the studies further on the basis of whether the drugs were being used for first-line or maintenance treatment.

Description of the Tocolytic Literature

The reports reviewed about the use of tocolytic drugs in women with preterm labor are a mixture of RCTs, nonrandomized controlled trials, and observational studies, including prospective and retrospective cohort and case series. (See the inclusion/exclusion criteria in Chapter 2, Table 2.) As is common in this literature, RCTs are susceptible to bias when clinicians, patients, or other outcome assessors are not masked to the interventions being studied. They also may be conducted for relatively short periods of time, in narrowly defined patient populations, and in exemplary settings. On the other hand, the observational studies included in this review may lack subjects randomly assigned to control groups and are susceptible to bias from sources other than the intervention being studied. Often, however, observational studies enroll a broader set of patients or are conducted in a wide variety of settings. We believe that in our review of this literature, including both RCTs (efficacy studies) and observational studies (effectiveness studies) of tocolytics provides the most comprehensive and informative picture of the efficacy and effectiveness of these substances in controlling or stopping the symptoms of preterm labor.

First-Line Tocolytics: Beta-Mimetics

Description of the Beta-Mimetic Literature

For first-line tocolysis, beta-mimetics have been the most frequent class of drugs studied (Evidence Table 3). We reviewed 47 studies including 35 RCTs, 4 nonrandomized controlled trials, 3 retrospective cohort studies, and 2 prospective cohort studies. Of these, 15 studies focused on tocolytic treatment with only a beta-mimetic, and 1 study focused on treatment with magnesium sulfate; some studies also included a control group. Other studies compared treatment with a beta-mimetic and a calcium channel blocker (15 studies), magnesium sulfate (10 studies), or an NSAID (5 studies). One final study compared treatment with magnesium sulfate and an NSAID.

More than one-half of the RCTs had definitions of preterm labor that we considered adequate. They included the number of contractions, typically recorded during a specific period of time, and another measure such as level of effacement or dilation. 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 Three RCTs had no definition. 136 137 138 One of the three nonrandomized trials 128 and approximately one-half of the nine observational studies 139 140 141 142 143 had an adequate definition. In the studies that included a count of contractions, the number used as an entry criterion varied greatly.

Authors included a definition of successful tocolysis in more than 60 percent of studies, 117 120 122 125 126 127 128 129 130 131 133 134 137 138 139 140 142 144 145 146 147 148 149 150 151 152 153 154 155 including approximately one-half of the RCTs. As with the definition of preterm labor, definitions of successful tocolysis varied greatly. The most common definitions were delay of labor for 48 hours, 7 days, and until 34, 35, or 36 weeks' gestation.

The number of subjects included in the studies varied widely; 23 studies enrolled 100 or more women. 96 117 121 122 123 124 131 132 133 135 136 137 141 143 144 145 149 151 156 157 158 159 Studies tended to have little or no demographic data. The mother's age was most commonly reported; nine studies also reported the mother's race. 117 123 124 128 133 138 147 150 154 Education levels were not reported in any of the studies.

Bias is a concern in this literature. Only six of the RCTs state that the treating physician was masked to which patients received which treatments. 96 121 129 137 160 None of the observational studies used regression techniques to control for confounding from factors outside the treatment.

A discussion of patient compliance with the treatment protocol is virtually nonexistent in the studies. If the patient is receiving only inpatient treatments, then compliance is not necessarily a serious concern, as it can be monitored by hospital staff; however, some studies continue treatment after discharge but do not provide information concerning compliance with these medications.

Another potential concern relating to the population of women reflected in the analyses is generally controlled in this literature; only nine studies do not either exclude women with ruptured membranes or use stratified analyses for women with intact membranes relative to ruptured membranes. 96 125 135 141 146 154 160 Patient exclusions based on the need for a medically indicated birth were not easily discernible from the literature.

Results of the Beta-Mimetic Literature Review
Beta-mimetics versus control groups

Merkatz, Peter, and Barden 136 reported on the only RCT comparing first-line beta-mimetic treatment with that given a control group that found the estimated gestational age at delivery to be significantly older with treatment. Some patients included in this control group, however, received ethanol treatment. Four other RCTs that compared a beta-mimetic treatment with that given a control group found no significant difference in this outcome. 121 131 133 151

Both Merkatz, Peter, and Barden 136 and Larsen, Hansen, Hesseldahl, et al. 121 found a significant increase in mean prolongation of pregnancy in patients receiving beta-mimetic treatment. Two other authors found greater prolongation in the beta-mimetic group relative to no treatment controls: 1 day or less by Leveno, Klein, Guzick, et al., 151 and less than 24 hours and less than 48 hours by the Canadian Preterm Labor Investigators Group. 96 Holleboom, Merkus, van Elferen, et al. 131 and Guinn, Goepfert, Owen, et al., 133 two relatively large studies with, respectively, 201 and 179 women, did not find efficacious results in prolonging pregnancy. Although Merkatz, Peter, and Barden 136 also found gestation to be significantly increased to greater than or equal to 36 weeks, gestational weeks were not significantly different in five other RCTs. 96 121 131 133 151

Only Merkatz, Peter, and Barden 136 reported efficacy of tocolytics for infant outcomes, calculated in relation to a significantly higher birth weight. Other studies found no difference in measures of birth weight; Activity, Pulse, Grimace, Appearance, Respiration (APGAR) score; or neonatal intensive care unit admissions. 121 131 133 151

Beta-mimetics versus other tocolytics

Other RCTs included in the review compared the efficacy of beta-mimetic treatment with treatment with a different class of tocolytic. With respect to older mean gestational age at birth, Jannet, Abankwa, Guyard, et al. 134 and Papatsonis, Van Geijn, Ader, et al. 159 found calcium channel blocker treatment, and Eronen, Pesonen, Kurki, et al. 161 found NSAID treatment, to be related significantly to older estimated mean gestational age at birth compared with beta-mimetic treatment. For the same outcome measure, Koks, Brolmann, de Kliene, et al. 135 and Bracero, Leikin, Kirshenbaum, et al. 127 found no significant difference between calcium channel blockers and beta-mimetics, and Kurki, Eronen, Lumme, et al. 129 found no difference between NSAIDs and beta-mimetics.

Similarly mixed results were found in RCTs that measured prolongation of pregnancy. Only in comparison to ethanol treatment was a beta-mimetic found to result in a significantly longer prolongation of pregnancy. 148 In several studies, beta-mimetics alone were found to result in a significantly shorter prolongation of pregnancy when compared with another class of tocolytic including calcium channel blockers, 159 NSAIDs, 129 and magnesium sulfate and beta-mimetic combination. 138

Other researchers found no significant differences in trials comparing beta-mimetic treatment and other classes of tocolytics using a variety of measures of prolongation, which includes five studies of calcium channel blocker treatment, 126 127 130 135 153 two of NSAID treatment, 119 123 and two of magnesium sulfate treatment. 122 150

Similarly mixed results were shown in observational studies. In no instance did beta-mimetic treatment significantly prolong pregnancy more than an alternative tocolytic. 128 140 141 142 146

The most common instant-related outcome in the first-line beta-mimetic RCTs was birth weight. Bracero, Leikin, Kirshenbaum, et al. 127 found calcium channel blocker treatment to be related to higher birth weight, but nonstatistically significant differences were not found by Meyer, Randall, Graves, et al., 126 Jennet, Abankwa, Guyard, et al., 134 Papatsonis, Van Geijn, Ader, et al., 159 Koks, Brolmann, de Kleine, et al., 135 and Garcia-Valasco, and Gonzalez. 153 Two studies concluded that the differences between an NSAID and a beta-mimetic were insignificant; 123 161 another found the same insignificant differences between magnesium sulfate and a beta-mimetic. 150

First-Line Tocolytics: Calcium Channel Blockers

Description of the Calcium Channel Blocker Literature

All of the 14 calcium channel blocker studies, except one case study, compared a calcium channel blocker with a different class of tocolytic. Of the 11 RCTs included in the review, 9 compared treatment with a calcium channel blocker and a beta-mimetic, which were discussed in the preceding section on beta-mimetics. The remaining two RCTs compared treatment with a calcium channel blocker and magnesium sulfate. 162 163 (See Evidence Table 4.)

All calcium channel blocker studies, except for one retrospective cohort analysis, 155 included a definition of preterm labor. Nine of the 11 RCTs included a definition that we would consider adequate. 125 126 127 130 134 135 154 162 163 In addition, 8 of the 11 RCTs included a definition of successful tocolysis. Definitions vary with length of cessation of symptoms, ranging from 2 hours in the research by Smith and Woodland 154 to 36 completed weeks' gestation in Ferguson, Dyson, Schutz, et al. 125 and Jannet, Abankwa, Guyard, et al. 134

Generally, studies either excluded women at entry who had ruptured membranes or stratified outcome analyses. Exceptions include two RCTs 135 154 and two of the three observational studies. 155 163 The estimated gestational age at entry was rather broad in the study population. In seven RCTs, it is 20 to 22 weeks at the lower boundary and 34 to 36 weeks at the upper boundary. In the remaining four RCTs, it is 24 to 26 weeks at the lower boundary and 34 to 36 weeks at the upper boundary.

Results of the Calcium Channel Blocker Literature Review

Few statistically significant differences were found in outcomes between study arms of the calcium channel blocker studies; all were in relation to beta-mimetic treatment. As discussed in the section of this chapter concerning beta-mimetic study results, Jannet, Abankwa, Guyard, et al. 134 in a study with an enrollment of 90 women, an adequate definition of preterm labor, and a definition of successful tocolysis found calcium channel blockers to have greater efficacy than beta-mimetics in terms of mean estimated gestational age at delivery and delivery at term. These researchers did not find significant differences in infant outcomes. Papatsonis, Van Geijn, Ader, et al. 159 also reported significant findings in favor of calcium channel blockers compared with beta-mimetic treatment in terms of mean estimated gestational age at birth, mean prolongation of pregnancy, prolongation of pregnancy for specific lengths of time ranging between 24 hours and 2 weeks, and delivery at less than 34 weeks. This study had 185 women at enrollment; it included a definition of preterm labor but no definition of successful tocolysis.

Although lacking statistically significant differences in delivery outcomes, several studies found calcium channel blocker treatment relative to beta-mimetic treatment to be associated with improved infant outcomes. Bracero, Leikin, Kirshbaum, et al. 127 found birth weights to be higher and neonatal intensive care unit admissions to be lower; Koks, Brolmann, DeKleine, et al. 135 found 1-minute APGAR scores to be higher. The retrospective cohort study comparing these two classes of tocolytics also found birth weight to be significantly higher for the calcium channel blocker group.

Four additional RCTs found no significant differences between any outcome measures comparing calcium channel blocker and beta-mimetic treatment. 126 130 153 154 This finding also was true for the two RCTs comparing calcium channel blockers and magnesium sulfate treatment. 162 163

First-Line Tocolytics: Magnesium Sulfate

Description of the Magnesium Sulfate Literature

The first-line magnesium sulfate literature (Evidence Table 5) includes nine RCTs and nine observational analyses (one nonrandomized control trial, two prospective studies, and six retrospective studies). As we found in studies of other classes of tocolytic pharmacotherapies, preterm labor typically was defined; one RCT was the exception. 138 Successful tocolysis was also defined in most studies in our review with the most common goal being delay of delivery for greater than 48 hours. 120 122 128 142 150 162 164 165

Based on the information included in the articles, none of the physicians providing treatment were masked. None of the observational studies included multivariate analyses to control for confounding.

Results of the Magnesium Sulfate Literature Review

Three studies compared magnesium treatment to a control group; 120 124 166 the later study also included a terbutaline treatment arm. Fox, Albert, McCaul, et al. 166 found magnesium treatment to show efficacy in terms of delivery at less than or equal to 48 hours. On the other hand, although both measures of mean estimated gestational age at birth were greater than 36 weeks, the control group showed significantly better outcomes than the magnesium group. No statistically significant differences were shown in any infant outcome measures. Cotton, Strassner, Hill, et al. 120 and Cox, Sherman, and Leveno 124 found no significant differences in delivery or infant outcomes.

The remaining six RCTs compared magnesium treatment with beta-mimetic treatment, 122 138 150 calcium channel blocker therapy, 162 and treatment with an NSAID. 167 Of the three studies comparing magnesium to a beta-mimetic, one found statistically significant results: Hatjis, Swain, Nelson, et al. 138 compared treatment with ritodrine alone with treatment with ritodrine and magnesium sulfate and found the combination therapy to be significantly more likely to prolong pregnancy by 1 week or more. No significant findings were uncovered in relation to an NSAID. 167

A nonrandomized control rial conducted by Chau, Gabert, and Miller 128 supported the finding of the efficacy of magnesium treatment in relation to beta-mimetic pharmacotherapy. Significant differences were shown in mean prolongation weeks, delivery at term, and mean birth weight. A limitation of this study, however, was that patients who failed in their initial treatment were moved to the other treatment group, thus confounding the results. A retrospective cohort study of 61 women 152 combining terbutaline, ritodrine, and magnesium sulfate treatment found that women who were able to persist with treatment relative to those who were forced to stop because of complications were significantly less likely to give birth at less than 36 weeks and their babies had shorter hospital stays. Another retrospective study 142 comparing a treatment combining magnesium and a beta-mimetic with only beta-mimetic treatment found no differences in effectiveness.

One small retrospective study of 44 women conducted by Lewis, Grimshaw, Brooks, et al. 165 compared the combination of magnesium sulfate and indomethacin, an NSAID, with magnesium alone. They found the combination therapy to be associated with greater mean prolongation days.

First-Line Tocolytics: Nonsteroidal Anti-Inflammatory Disease Drugs

Description of the NSAID Literature

As shown in Evidence Table 6, the literature meeting the inclusion criteria for NSAIDs was small: five RCTs, one nonrandomized controlled trial, one retrospective study, and two case series analyses. The only RCT that did not compare NSAID treatment with beta-mimetic treatment was conducted by Morales and Madhav 167 and compared NSAID and magnesium regimens. The nonrandomized trial 146 compared ethanol treatment with treatment with a combination of ethanol and an NSAID or treatment with a beta-mimetic. The retrospective cohort study 165 compared pharmacotherapy with magnesium, an NSAID, and a beta-mimetic with treatment with magnesium and a beta-mimetic. The two case series reviews 168 169 did not include comparison groups. These were almost all small studies; all the investigations reporting statistical comparisons had sample sizes of fewer than 100 subjects.

All three RCTs defined preterm labor, but only Kurki, Eronen, Lumme, et al. 129 defined successful tocolysis. Women with ruptured membranes were excluded from these studies. In two of the three RCTs, the physician was masked as to which patients received the intervention treatment. 118 129

Results of the NSAID Literature Review

Gamissans, Canas, Cararach, et al. 118 found treatment with an NSAID and a beta-mimetic relative to a beta-mimetic and a placebo to be related significantly to women being more likely to deliver at 37 or more weeks. Infants born to mothers receiving combination therapy had greater birth weights and higher APGAR scores at 1 minute. Kurki, Eronen, Lumme, et al. 129 also found that an NSAID showed efficacy over a beta-mimetic in relation to delaying labor 24 to 48 hours and promoting delivery at greater than or equal to 37 weeks and had fewer maternal side effects. Eronen, Pesonen, Kurki, et al. 161 also found the NSAID to provide a superior treatment. In contrast, Morales, Smith, Angel, et al. 123 found no difference between indomethacin and ritodrine in relation to prolongation of pregnancy, birth weight, APGAR scores, or neonatal death. It should be noted, however, that patients experiencing serious side effects from indomethacin were given magnesium sulfate.

The retrospective cohort study 165 found the combination of magnesium sulfate, an NSAID, and a beta-mimetic to be superior to a combination of magnesium sulfate and a beta-mimetic in mean prolongation days.

First-Line Tocolytics: Other Agents

Description of the Literature on Other Tocolytics

In Evidence Table 7, three RCTs compared therapy with ethanol with treatment with other classes of tocolytics. The two older studies provided insufficient statistical analyses for efficacy comparisons. 117 144 The third RCT compared ethanol with a beta-mimetic. 148 Two small observational studies explored tocolytic effects of ethanol 170 and atosiban. 171 Neither study enrolled more than 100 women, excluded preterm births arising from conditions other than preterm labor, or included statistical comparison allowing for conclusions about tocolytic effectiveness.

Results of the Other Tocolytic Literature Review

In the only analyzable efficacy study containing statistical analyses, Caritis, Carson, Greebon, et al. 148 found beta-mimetic treatment to be superior to ethanol in terms of mean prolongation days and for delaying birth sufficiently to allow for a full course of betamethasone for fetal lung maturation.

Maintenance Tocolytics: Beta-Mimetics

Thirteen studies explored the benefit of maintenance therapy with a beta-mimetic tocolytic after treatment with a first-line therapy had ended (Evidence Table 8). Eleven of the studies were RCTs. All of the studies included a definition of preterm labor, and all but Brown and Tejani 172 had what we consider an adequate definition. None of the studies provided a definition of successful tocolysis. All but two RCTs 173 174 excluded women with ruptured membranes at study entry.

Seven of the RCTs compared a treatment group to a control group. 44 172 175 175 176 177 178 179 All except three studies masked the treating clinician to the protocol. 173 175 176 Most of the RCTs 44 174 175 176 177 179 180 181 found no efficacy differences in treatment. In contrast, Brown and Tejani 172 in a small study found beta-mimetic treatment to show efficacy in terms of mean prolongation days and infant birth weight. In a more recent and larger study of more than 200 women, Rust, Bofill, Arriola, et al. 178 found no differences between groups in delivery outcomes; however, the length of hospital stay for the infant was shorter in the control group than in the group treated with magnesium or terbutaline. This was the only study to include a pill count to evaluate compliance with treatment.

A small observational study 182 contained no statistical analysis. Another larger cohort study compared women with and without cervical change 172 and found delivery and infant outcomes to be generally superior in women without cervical change. One RCT compared two beta-mimetics, ritodrine and terbutaline and found no difference. 173

Maintenance Tocolytics: Calcium Channel Blockers

Only one study met the criteria for inclusion within the category of maintenance calcium channel blocker therapy (Evidence Table 9). In an RCT, El-Sayed, Holbrook, Gibson, et al. 183 compared treatment with two calcium channel blockers, nifedipine and diltiazem. There was no masking in this study and no definition of successful tocolysis, but it did include an adequate definition of preterm labor. They found nifedipine to show efficacy in terms of mean estimated gestational age at delivery and delivery at greater than or equal to 37 weeks, but no significant differences in infant outcomes.

Maintenance Tocolytics: Magnesium Sulfate

All four maintenance magnesium sulfate studies included in the review (Evidence Table 10) compared magnesium and beta-mimetic treatments. The comparisons were discussed earlier in the maintenance beta-mimetic section.

Maintenance Tocolytics: Non-Steroidal Anti Inflammatory Disease Drugs

The only study in this category (Evidence Table 11) is an RCT conducted by Bivins, Newman, Fyfe, et al. 181 that enrolled fewer than 100 women in a comparison of indomethacin with terbutaline treatment. The research group found no significant differences in efficacy between the two therapies.

Tocolytic Harms

Owing to the large numbers of categories and the multiple classes of agents assessed in this evidence report, we have chosen to summarize the harms data graphically in the form of "bubble charts." Tables specifying the individual rates of the harms in each study would be unwieldy, so we believe that these figures will convey the information more readily. The articles included in this analysis appear in the tocolytic evidence tables, but we also included additional articles that had not met our inclusion criteria for the benefits analysis.

The graphs give the rates of each of the categories of harm within each class of tocolytic medication: beta-mimetics, calcium channel blockers, NSAIDs, magnesium sulfate, ethanol, and atosiban. Each circle within the graph represents a single study. The size of the circle is proportional to the number of subjects receiving the medication (or placebo) in the study. A study with 100 subjects would have a circle twice as large as a study with 50 subjects, assisting the reader in giving more "weight" to larger studies.

We listed as "harms" clinical markers and events that the authors of the individual studies considered as adverse events or "side effects." In general, these adverse events represent process measures of somewhat unclear significance for the long-term health of the mother and infant. For example, the hypokalemia that is sometimes found after treatment with beta-mimetics may have no clinical consequences but may be associated with increased risk of cardiac arrhythmia. This level of detail generally is not available in this literature. Confidence intervals are generally not provided in the articles, and statistical interpretation is often difficult. Our summary of the literature, by definition, cannot be more detailed than the detail in the studies performed.

We were unable to form synthetic estimates across studies of the rates of treatment harms. The definitions of the clinical categories often were vague or not provided at all in the source articles, which would lead to problems if we assumed that the clinical categories actually were equivalent. Not all categories of medications were represented on every graph. For example, data on "metabolic harms" (e.g., elevated blood glucose, hypokalemia) were collected only for beta-mimetics.

Maternal Harms
Serious cardiovascular harms

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3153_f011.jpg.

   Figure 11. Maternal harms - serious cardiovascular

Cardiac side effects, including arrhythmias, heart failure, and chest pain, are well known to occur with the beta-mimetics, and these findings were confirmed by the graph in Figure 11. The estimates varied widely among the studies, and the range in the severity of the "serious harms" was substantial. Some of the variability is likely attributable to the assiduousness with which investigators assessed the cardiac effects of these agents.

Minor cardiovascular harms

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3153_f012.jpg.

   Figure 12. Maternal harms - minor cardiovascular

The pattern for minor cardiovascular harms was similar to that for the major harms, with higher rates in patients taking the beta-mimetic class of medications (Figure 12). Again, a great deal of variability occurred within categories such as the beta-mimetics. Although these types of events are uncomfortable, they were generally not considered to be of major health consequence.

Maternal metabolic harms

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3153_f013.jpg.

   Figure 13. Maternal harms - metabolic

As shown in Figure 13, a definite pattern of increased metabolic abnormality was seen for women who received beta-mimetic agents. The detail in the articles does not allow us to assess the clinical importance of the hyperglycemia, hypokalemia, or other adverse effects; but such abnormalities clearly were very common.

Gastrointestinal harms

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3153_f014.jpg.

   Figure 14. Maternal harms - gastrointestinal

Many agents, including placebo, are associated with gastrointestinal harms (Figure 14). In the case of placebo, the "harm" is the association of symptoms with the medication when, in fact, they are not caused by the medication. Moderate rates of gastrointestinal harms were found for many of the agents. Interestingly, the few studies examining NSAIDs appeared to show relatively low rates of maternal gastrointestinal effects in this literature when compared with the other agents. This result may be because of the young age of the subjects and the relatively short duration of the NSAID exposure. Overall, there appeared to be little clear relationship among class of medication and gastrointestinal side effects.

Maternal psychologic harms

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3153_f015.jpg.

   Figure 15. Maternal harms - psychologic

The category of "psychologic" harms encompasses symptoms that are relatively nonspecific (Figure 15). The beta-mimetic studies assessed this category of harms most commonly and found them more frequently when compared with studies of other agents in which it was assessed.

Overall maternal results

The overall picture for maternal harms is one of more frequent occurrence of harms in users of beta-mimetic medications, especially for the cardiac, metabolic, and psychologic categories. We note, however, that these harms also were searched for more frequently in studies of these agents.

The articles were often unclear as to whether a single patient could appear more than once within a single category of harm. For example, if a single patient had tachycardia and palpitations, she could be counted in each category. Because we saw both of these effects as belonging to "minor maternal cardiovascular harms," the summary graphs may be subject to a degree of double counting that could bias the study outcome by raising somewhat the estimate of harms. Conversely, the reviewers for this report were impressed that many of the studies did not search adequately for maternal harms associated with tocolytic treatment, which would tend to bias the study outcome by lowering the estimate of the rate of harms.

Fetal/Neonatal Harms
Ductal constriction/tricuspid regurgitation

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3153_f016.jpg.

   Figure 16. Fetal/newborn harms - ductal constriction/tricuspid regurgitation

As shown in Figure 16, premature constriction of the ductus arteriosis would be expected to be of concern with the anti-inflammatory nonsteroidal class of medications. Two studies addressed this issue, and they did appear to find somewhat increased rates of ductal constriction. The study with the highest rates used routine prepartum echocardiograms specifically to estimate ductal constriction. The study did not address the clinical significance of this ductal constriction.

Intraventricular hemorrhage

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3153_f017.jpg.

   Figure 17. Fetal/newborn harms - IVH

Intraventricular hemorrhage (IVH) is an uncommon but potentially serious neonatal complication, with increased risk among premature infants (Figure 17). No pattern of increased risk appeared for any pharmacologic intervention. The studies with the highest rate of IVH were case series of premature, low-birth-weight infants, in whom one would expect higher rates of IVH in both intervention and placebo groups.

Necrotizing enterocolitis

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3153_f018.jpg.

   Figure 18. Fetal/newborn harms - NEC

We found little evidence of a difference among the treatment groups in the risk of necrotizing enterocolitis (NEC) (Figure 18). Prior research has shown that indomethacin may be associated with an increased risk of certain neonatal adverse outcomes, including intraventricular hemorrhage and NEC. 184 185 Moreover, indomethacin therapy in utero may cause constriction of the ductus arteriosis that may culminate in closure. 186 187 The drug is used in neonatology as a medical treatment to close persistently patent ductuses in newborns. Given that all of these complications also are associated with preterm birth, the magnitude of additional risk posed by indomethacin is unclear.

Sepsis

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3153_f019.jpg.

   Figure 19. Fetal/newborn harms - sepsis

Estimates of sepsis in the literature were hindered by definitional problems of "sepsis." Some studies required a positive blood culture, others defined sepsis clinically, still others had categories of "possible sepsis." The study with a category of "possible sepsis" did exhibit the highest rates of this harm. Overall, we found no pattern of differential neonatal harm among these agents (Figure 19).

Tachycardia

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3153_f020.jpg.

   Figure 20. Fetal harms - tachycardia

The estimation of fetal/neonatal tachycardia is hampered by lack of a consistent definition of tachycardia, which likely contributed to the wide range of estimates. Some studies gave no specific heart rate but rather simply reported a percentage for tachycardia. As shown in Figure 20, not surprisingly the beta-mimetics appear to contribute to higher rates of rapid heart rate.

Overall fetal/neonatal results

The fetal/neonatal harms literature provided little consistent evidence of short-term harms to the infants of women receiving tocolytic agents. Insufficient evidence was available from which to draw conclusions regarding longer term effects of tocolytic medications on the health of children.

The analysis did not address the potential harms associated with the use of multiple classes of tocolytic medications during a single pregnancy. A few studies used such regimens, generating the challenge of differentiating the effects of single agents compared with those of multiple agents. One would expect that the harms associated with the use of multiple agents would be at least as great as those of both agents used alone.

Antibiotics

Observational studies consistently support the theory that genital tract infections are associated with preterm birth. 188 Histologic evidence of acute inflammation is found in up to 50 percent of placentas after delivery before term. 189 Up to two-thirds of women who present with preterm labor have positive amniotic fluid cultures, and their fluid harbors an array of organisms. 190 These empirical observations have prompted researchers to hypothesize that treating women who present with preterm labor symptoms may prevent preterm birth.

RCTs have been designed and completed to test this hypothesis. The 15 trials and 1 observational study that met the entry criteria are summarized in Evidence Table 12.

Results of the Antibiotic Review

The articles reviewed consistently report three outcomes: (1) prolongation of pregnancy after enrollment, (2) mean gestational age at delivery, and (3) preterm delivery at a specific cutoff point, usually less than or equal to 37 weeks. Of the 15 RCTs reviewed, two consistently found improvement in all three of these outcome measures, 191 186 but five found no improvement in any of the three measures. 193 194 195 196 197 Five studies found a significant increase in pregnancy prolongation, 191 192 198 199 200 and three studies found a diminution in the number of preterm deliveries. 192 198 201

Thus, the results of this research are mixed and definitive statements are hard to make about antibiotic efficacy in gravidas with preterm labor symptoms. This mixture of positive and negative results emphasizes the need for meta-analysis; Chapter 4 discusses the results of the meta-analysis on antibiotics conducted as part of this review.

Issues and Limitations in the Antibiotic Literature

Of the 15 RCTs reviewed, none used the same definition of symptoms to label a woman as having preterm labor. The array of criteria used included frequency of contractions, intensity of contractions, duration of contraction, and a wide range of measures of cervical dilation. Four trials used particularly broad definitions such as "painful" contractions, 191 dilation greater than or equal to 4 cm with no lower limit on contraction frequency, 201 contractions less than 10 minutes apart for more than 60 minutes, 192 and "preterm labor." 202 Lack of a standard, reproducible definition for preterm labor detracts from our ability to compare studies.

Study enrollment was quite varied in these 15 trials. Only one trial had more than 200 subjects (n=277), 194 and five trials enrolled fewer than 100 women. 196 197 200 202 203 Smaller samples force a reader to look at surrogate outcomes (e.g., pregnancy prolongation or gestational age at delivery) because they lack the power to show statistically that antibiotic therapy improves or worsens perinatal outcome. Moreover, clinically significant findings may not be discernable when statistically insignificant results are reported from studies with small sample sizes. Romero, Mazor, Oyarzun, et al. 188 calculated that their multicenter study of antibiotic efficacy, the largest study we reviewed, with 277 participants, had insufficient power to detect a difference in perinatal outcome. These investigators estimated that a sample size of 2,270 subjects would be required to detect a 33 percent reduction in the event rate between the placebo group and the antibiotic group.

Complicating the interpretation of this literature further is the array of cointerventions used in the management of women with symptoms of preterm labor. All but one study used tocolytics, 197 and the majority of studies allowed clinicians to prescribe more than one tocolytic. Likewise, the use of corticosteroids and nonpharmacologic interventions (e.g., hydration or bed rest) was ubiquitous throughout these studies. In no instance were regression techniques included to control for the potential confounding that could be caused by the use of these cointerventions and to support the finding that the treatment effect could be attributed to antibiotic therapy.

In lieu of adequate numbers to assess perinatal outcome (which, given the rarity of perinatal death and serious morbidity, would require enrolling thousands of women), survival analysis would be an additional analytic technique for assessment of interventions in women with preterm labor. Despite the advantages of survival analysis, only three trials 193 194 200 employed the technique. One of these trials showed a treatment benefit; 200 the other two did not. 193 194

A certain number of preterm births subsequent to assignment of the diagnosis of preterm labor in women with symptoms of labor will be medically indicated. Thus, treatment trials in women with preterm labor symptoms should exclude medically indicated preterm deliveries from their analysis or report on them separately. To our knowledge, none of these studies took this approach, and their failure to account for the etiologic heterogeneity of preterm birth may result in erroneous conclusions.

No two trials used the same antibiotic regimen. Beta-lactam, cephalosporin, macrolide, and metronidzole antibiotics were given to women with symptoms of preterm labor. Routes of administration and duration of use varied widely from study to study. Thus, there is insufficient consistency to determine whether any conclusions reached in relation to the results shown in this literature would apply to antibiotics in general or to the particularly convincing regimen used in an individual study.

In any study of a pharmaceutical intervention, bias is controlled by recording compliance with treatment in the study population and searching for potential confounding by recording differences in compliance. Our review of these 15 RCTs discloses only two studies that measured compliance. 194 203

A limitation of all but two trials is gestational age at enrollment. The prevalence of positive amniotic fluid cultures-a direct measure of in utero infection-and histologic chorioamniotis-an indirect measure-vary inversely with gestational age. 194 204 Thus, gravidas at earlier gestational ages are more likely to have infection as an etiology and, in theory, should be more likely to benefit from antibiotic treatment. The converse should be true about women who present with preterm labor symptoms at later gestational ages.

Thus, a relationship between gestational age and prevalence of in utero infection is particularly pertinent in this literature. Evidence Table 12 shows that the studies included women at a wide range of gestational ages, but only two studies had a mean gestational age at enrollment of less than 30 weeks. 191 201 The large trial from Romero, Sibai, Caritis, et al. 194 enrolled one-half of its subjects at more than 32 weeks; the positive culture rate from amniotic fluid, around 6 percent, was much lower than anticipated. When McGregor, French, and Seo 199 stratified their results by gestational age, they saw a benefit of treatment only in women enrolled before 32 weeks.

The final limitation is generalizability of these findings to the universe of women with symptoms of preterm labor. Three studies have dropout rates of greater than 20 percent, 191 198 203 and two studies have insufficient data to assess dropout rates. 194 201 In addition, only five studies reported the number of women with preterm labor symptoms who were screened to find the enrolled population. 192 193 194 195 The rates of study participation range from 13 percent 194 to 50 percent. 199 Such low enrollment figures may result in participants' being unrepresentative of the population of women with symptoms of preterm labor.

Home Uterine Activity Monitoring

Four studies of home uterine activity monitoring met the criteria for inclusion in this study. 205 206 207 208 All four studies measured deliveries at specific dates as their primary outcomes of interest. Of these four studies, one concluded that home uterine activity monitoring with daily nursing support was effective in reducing preterm deliveries, defined in that study as those before 37 weeks. 206 The authors of this study reported that among women experiencing recurrent PTL, 47 percent of patients enrolled in the home uterine activity monitoring group delivered before 37 weeks, compared with 84 percent in the "standard-care" group (p=0.025). The mean gestational age for the group receiving home uterine activity monitoring plus nursing support was 36.1 weeks compared with 33.9 weeks, but that difference was not reported to be significant. Standard care did not include nursing support, so it is unclear whether the home uterine activity monitoring device or the nursing support may have been the mitigating factor in reducing preterm births. Among those patients whose preterm labor recurred, 32 percent of the group receiving standard care experienced failed tocolysis at first recurrence compared with none of the group receiving home uterine activity monitoring plus nursing care (p=0.02). The authors attributed this difference to earlier detection of preterm labor in the home uterine activity monitoring group. Neonatal outcome results are not provided in this study, so it is impossible to review the effects that either of these results had on infant health.

The other three studies we evaluated attempted to equalize non-home uterine activity monitoring interventions (e.g., nursing contact or education) in the monitored and unmonitored groups to isolate the effects of the monitoring device. None of these studies found that monitoring improved outcomes or reduced preterm births. One study compared the use of monitoring (without added nursing contact) to standard care. 207 This research team gave the same instructions regarding signs and symptoms of preterm labor and the circumstances under which women should contact a physician, to both study groups. All women were taking terbutaline and had weekly cervical exams. Subjects in the monitoring group monitored themselves for 1 hour per day and transmitted the results to a Tokos Monitor Center perinatal nurse. Any woman experiencing four or more contractions per hour was remonitored immediately and referred to the hospital unless the contractions subsided.

None of the outcomes collected by Nagey, Bailey-Jones, and Herman 207 showed any statistically significant difference between the monitored and unmonitored group, including delivery after week 37, delivery after week 34, or delivery after week 32. The risk ratio for preterm delivery for the monitored patients was 1.08 (p=0.79). Eleven women in the monitored group and five in the unmonitored groups experienced a nondelivery readmission, this difference also was insignificant. Differences in cervical examination upon admission (cervical dilation, cervical effacement, and station) were insignificant.

Iams, Johnson, and O'Shaughnessy 205 compared self-palpation with home uterine activity monitoring in the presence of equal education and nursing contact. Subjects in both groups called the monitoring center if they noticed symptoms of preterm labor and were asked to palpate or record uterine activity whenever they noted these symptoms or an increase in perceived contraction frequency. Again, no significant differences between the two groups were noted in outcomes (gestational age, delivery less than 36 weeks, delivery less than 36 weeks with recurrent labor, term delivery, and birth weight). The authors suggested that, although the home uterine activity monitoring device per se did not have any effect on outcomes, the relevant part of the home uterine activity monitoring system as a whole may be the nursing contact, which, in this study, was provided to both the treatment and control groups, although the sample size in this study is too small to draw this conclusion with certainty.

Finally, Brown, Britton, et al. 208 also compared home uterine activity monitoring to no monitoring in groups of women receiving identical care other than monitoring, including daily perinatal telephone contact by a nurse. All women in this study also received oral terbutaline. Both groups received education that included recognizing the signs and symptoms of preterm labor and instructions on prenatal followup. The monitored patients transmitted a uterine activity monitoring strip to the Tokos Monitor Center twice daily until 37 weeks' gestation or until instructed to discontinue monitoring. This research team found no significant differences in gestational age at delivery, in delivery at specific endpoints (less than 35 weeks, 35 to 37 weeks, greater than 37 weeks), or in neonatal outcomes (birth weight, admissions or days in neonatal intensive care units, or mechanical ventilation). In a logistical regression model predicting delivery at less than 35 weeks, the authors found that only cervical dilation at study entry was significantly predictive. Women with cervical dilation (2 cm at enrollment) had an odds ratio of 4.35 for delivery at less than 35 weeks' gestation compared with women with cervical dilation of less than 2 cm. Readmissions necessitating additional magnesium sulfate treatment did not differ between the groups, nor did the numbers of women with one unscheduled hospital observation lasting less than 24 hours.

Harms of home uterine activity monitoring were not studied in this analysis; however, because home uterine activity monitoring is not invasive, risks of the system are limited to interventions that might be used with it, or that might be used upon identifying an increase in contractions (i.e., tocolysis, corticosteroids). Nagey, Bailey-Jones, and Herman 207 studied hospital readmission, which would presumably include additional treatment, and found no significant difference between the two groups. Watson, Welch, Mariona, et al. 206 also found no significant difference in the numbers of patients readmitted for recurrent preterm labor.

An important limitation in much of the home uterine activity monitoring literature in general is the mixing of the effects of the device and the accompanying nursing support. As noted earlier, Watson, Welch, Mariona, et al. 206 did not separate out these effects, making it difficult to compare their study with others. Iams, Johnson, and O'Shaughnessy 205 did not compare home uterine activity monitoring to standard care; rather they specifically were interested in comparing the device to self-palpation. This design would not provide information about whether the system might provide benefit compared with general practice.

Each study used relatively small sample size, although the authors addressed this limitation adequately in their texts. We have attempted to address this issue as well by performing a meta-analysis of the three more comparable studies.

Only Nagey, Bailey-Jones, and Herman 207 described how they developed their randomization schedule, so we are unable to comment on the appropriateness or lack thereof of the other studies. None of the studies presented an intent-to-treat analysis, which would have been useful to evaluate fully the likely impact of home uterine activity monitoring in a clinical setting; however, because these results were generally negative, such analysis would simply have magnified that effect.

Although each study reported data for preterm births, data were provided for few maternal and neonatal outcomes. Only Brown, Britton, Brizendine, et al. 208 gave information beyond birth weight for neonatal outcomes: They found no significant differences in care provided to infants whose mothers had or had not received home uterine activity monitoring.

Chapter 4. Meta-Analysesa

Introduction

Although a few trials of both first-line and maintenance tocolytic treatment for women in preterm labor had significant, positive effects, as discussed in Chapter 3 and presented in the evidence tables, most found the different tocolytics to be equally effective and to have little benefit, if any, over placebo or no tocolytic therapy. Similarly, a few trials of adjunctive antibiotic therapy also found significant effects of antibiotic treatment over placebo or no adjunctive antibiotic therapy, but again most of these trials found no statistically significant effects. Virtually no benefits were found in the trials of home uterine activity monitoring compared with trials involving no monitoring of women in preterm labor; however, many of the trials for these interventions had so few patients enrolled that only very large differences would have been statistically significant.

To determine whether the lack of significance in these various studies resulted from the small sample sizes or negligible impact of the interventions, we conducted a series of meta-analyses, statistically combining results from similar trials into summary measures. These meta- analyses were focused on the efficacy of the interventions for preterm labor as measured by one or more of the following outcomes: (1) prolongation of pregnancy, (2) estimated gestational age at delivery, and (3) birth weight. These outcomes were most frequently reported in the abstracted studies. In particular, for each group of interventions first-line tocolytics, maintenance tocolytics, adjunctive antibiotics, and home uterine activity monitoring we sought answers to the following questions:

  • Does the intervention result in significantly prolonged pregnancies compared to placebo or no intervention for women in preterm labor? This outcome is measured in days of increased pregnancy.

  • Does the intervention result in deliveries at significantly later gestational age compared with placebo or no intervention for women in preterm labor? This outcome is measured in weeks of gestation.

  • Does the intervention result in higher birth weight babies compared with placebo or no intervention for women in preterm labor? This outcome is measured in grams or kilograms.

Methods

Selection Criteria for Inclusion of Studies

We first reviewed the completed evidence tables for the various interventions to identify studies for inclusion in the meta-analyses. We included only RCTs. Results from prospective cohort studies, case series, retrospective analyses, and other nonrandomized trials were excluded from the analyses. For inclusion in the meta-analyses, the trials also had either to be restricted to women with intact membranes or to have reported the outcomes separately for this subgroup.

Furthermore, only RCTs making comparisons of interest were considered for inclusion. For adjunctive antibiotic and home uterine activity monitoring trials, the comparison of interest was between standard treatment with the intervention and standard treatment with either placebo or no intervention. All antibiotic and home uterine activity monitoring RCTs had at least one intervention arm and one arm without the intervention and, therefore, were potential candidates for the meta-analyses.

By contrast, the tocolytic trials frequently compared one tocolytic agent with another; only a small number of trials included a placebo or an arm with no tocolytic treatment. Because the various tocolytics work through different mechanisms, we were interested in comparisons among different classes of tocolytics; therefore, tocolytic trials were considered for the meta-analyses if they compared one class of tocolytics with another and/or to placebo or no tocolytic treatment. The tocolytic classes included were beta-mimetics, calcium channel blockers, magnesium sulfate, NSAIDs, and ethanol. Because trials comparing one beta-mimetic with another one calcium channel blocker with another added no additional information for the comparisons of interest, they were excluded from the meta-analyses unless they also had an arm with another class of tocolytic or a placebo or no tocolytic arm.

Of all the trials considered for these analyses, 23 trials of first-line tocolytics, 10 trials of maintenance tocolytics, 13 trials of adjunctive antibiotic treatment, and 3 trials of home uterine activity monitoring met the inclusion criteria. We then extracted the information on the outcomes of interest for each trial.

Although the three outcomes of interest were the most frequently reported, they were by no means universally or uniformly reported. For prolongation of pregnancy, some trials reported the mean number of days that pregnancy was prolonged as well as either the standard deviation, the standard error, or both. Others reported only the means. Others reported the number or percentage of pregnancies prolonged for various durations (e.g., <2 days, >7 days, <14 days). Similarly, some trials reported mean gestational age at delivery with or without standard deviations or standard errors; others reported the number or percentage of deliveries at or beyond different gestational age cutoffs (e.g.,>32 weeks, >36 weeks, >37 weeks). Still others reported mean birth weight with or without standard deviations or standard errors and a few reported the percentage of births greater than or equal to 2500 g. Some trials neglected to report any data on one or more of these outcomes altogether. Thus, the actual number of trial results that could be combined for any particular analysis was smaller than the number of trials meeting the inclusion criteria.

Methodology

Where possible, we used the continuous measures of the outcomes of interest and combined the mean differences, d, across studies:
d = Mt - Mc
where M t and M c are the sample means of the treated and control arms, respectively. Assuming a normal distribution for the individual observations with equal variances in each arm of the trial, the variance of the difference, S 2, is computed:
graphic element
where SEM t 2 is the standard error of the mean of the treated arm and SEM c 2 is the standard error of the mean of the control arm. Thus, to use this measure, the sample size, means, and standard errors (or standard deviations) must be available from the reported information. For analyses in which too few studies reported means with standard deviations or standard errors, we computed the odds ratio, ω, for selected cutoff values of the outcome measures. We could then use this value as the effect measure for combining results across studies:
w = Pt(1 - Pc)/Pc(1 - Pt)
where p t is the proportion of the treated group that successfully reached the cutoff value and p c is the proportion of the control group that successfully reached the cutoff value.

To manage the multiple treatment problem of first-line tocolytic treatments in which we had five classes of tocolytics to compare with placebo/no tocolytics, we used the generalization of the two-arm methods developed by Hasselblad.209 For all interventions and outcomes, we combined the results using a Bayes random effects model as described by Hedges and Olkin,210 allowing the true treatment effect to vary across studies. For dichotomous outcomes, we estimated the model parameters with the EGRET software package; for continuous measures, we used a customized program developed by one of the authors, Victor Hasselblad, Ph.D.

In addition to the meta-analysis results on the three main questions noted earlier (e.g., estimates of prolongation of gestation), we report the results of tests of homogeneity for the four main sets of analyses (both types of tocolytics, antibiotics, and home uterine activity monitoring). These tests indicate the extent to which the trials are sufficiently alike to permit them to be combined with confidence, and the possibility that various unknown factors are present that have not been adequately controlled for.

Results

First-Line Tocolytics

Table 13. Trial arms used in the meta-analysis of first-line tocolytics: Odds ratio for delivery at term (at or beyond 36 to 38 weeks gestation)
StudyPlaceboBeta- MimeticsCalcium Channel BlockersMagnesium SulfateNSAIDsEthanol
Merkatz, et al.,136 1980YesYesNoNoNoNo
Larsen, et al.,121 1986YesYesNoNoNoNo
Leveno, et al.,151 1986YesYesNoNoNoNo
Guinn, et al.,133 1997YesYesNoNoNoNo
Meyer, et al.,126 1990NoYesYesNoNoNo
Kupferminc, et al.,130 1993NoYesYesNoNoNo
Jannet, et al.,134 1997NoYesYesNoNoNo
Papatsonis, et al.,159 1997NoYesYesNoNoNo
Garcia-Velasco and Gonzalez,153 1998NoYesYesNoNoNo
Wilkins, et al.,122 1988NoYesNoYesNoNo
Kurki, et al.,129 1991NoYesNoNoYesNo
Castren, et al.,144 1975YesYesNoNoNoYes
Lauersen, et al.,117 1977NoYesNoNoNoYes
Caritis, et al.,148 1982NoYesNoNoNoYes
Glock and Morales,162 1993NoNoYesYesNoNo
Floyd, et al.,163 1995NoNoYesYesNoNo
Table 14. Effect of first-line tocolytics: Odds of delivery at term (at or beyond 36 to 38 weeks gestation)
TreatmentEstimated Odds RatioStandard Error of Log95% Confidence Interval
Beta-mimetics1.6220.1471.216, 2.164
Calcium channel blockers2.4850.2371.562, 3.954
Magnesium sulfate1.8670.2311.187, 2.936
NSAIDs4.9480.5621.645, 14.89
Ethanol0.9490.2630.567, 1.589
The odds ratio of delivery "at term" was the only measure reported in an adequate number of first-line tocolytic trials to conduct a meta-analysis of all five classes of tocolytics against a placebo/no tocolytic arm. We included studies as defining birth at term if they reported the fraction of women giving birth at or beyond 36, 37, or 38 weeks. Preliminary analyses of the odds ratios from the trials using the different cutoffs were similar, suggesting that the use of the three cutoffs as equivalent would not unduly affect the results. Thus, we treated these cutoffs as equivalent for purposes of our analysis. The 17 studies and the treatment arms used in the meta- analysis are listed in Table 13 (ordered by beta-mimetics included or not).

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3153_f021.jpg.

   Figure 21. Meta-analysis results: Effect of first-line tocolytics on the odds of delivery at term (at or beyond 36 to 38 weeks gestation), by tocolytic class

The results of the meta-analysis are presented in Table 14 and are shown graphically in Figure 21. The data on estimated odds ratios suggest that all tocolytics, except ethanol, are effective in extending deliveries to term compared with placebo/no tocolytic treatment (odds ratios ranging from 1.622 [beta-mimetics] to 2.485 [calcium channel blockers]). The evidence for NSAIDs (an odds ratio approaching 5.0) is based on only one study and thus should be viewed with extreme caution.

We conducted an overall test for goodness of fit, which gave a chi-square of 11.476 for 13 degrees of freedom, p=0.571. Thus, there was no evidence of lack of fit. We also conducted a test to see if beta-mimetics, calcium channel blockers, and magnesium sulfate had the same effects. This calculation gave an overall chi-square test of 8.762 for 2 degrees of freedom, p=0.0125. Thus, there is evidence that these three tocolytics may have a differential effect from each other relative to placebo in terms of increasing the number of births at or beyond 36 to 38 weeks gestation.

Maintenance Tocolytics

Table 15. Studies of the effects of tocolytics as maintenance therapies
StudyPlaceboBeta- MimeticsMagnesium Sulfate
Creasy, et al.,44 1980YesYesNo
Brown and Tejani,172 1981YesYesNo
Ricci, et al.,180 1991YesYesYes
Parilla, et al.,175 1993YesYesNo
How, et al.,176 1994YesYesNo
Lewis, et al.,177 1996YesYesNo
Rust, et al.,178 1996YesYesYes
Guinn, et al.,179 1998YesYesNo
We found fewer studies that investigated the efficacy of maintenance tocolytics. Most of these studies compared beta-mimetics and/or magnesium sulfate with placebo/no maintenance tocolytics. Therefore, we were unable to compare all classes of tocolytics in a single meta- analysis, as we had for first-line tocolytics. Instead, we conducted separate meta-analyses for beta-mimetics versus placebo/no maintenance tocolytics and magnesium sulfate versus placebo/no maintenance tocolytics. The eight studies, six with beta-mimetic arms and two with magnesium sulfate arms, in the analyses are listed in Table 15.

Table 16. Effect of maintenance tocolytics on pregnancy outcomes
TreatmentEndpoint (units)Estimated Difference95% Confidence Interval
Beta-mimeticsProlongation (days) Gestational age (weeks) Birth weight (grams)-0.68 -0.46 -16.4-4.68, 3,33 -1.03, 0.11 -189.0, 155.3
Magnesium sulfateProlongation (days) Gestational age (weeks) Birth weight (grams)-3.31 -0.55 -202.8-9.67, 3.06 -1.38, 0.28 -405.7, 0.1
An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3153_f022.jpg.

   Figure 22. Meta-analysis results: Effect of maintenance beta-mimetics on mean gestational age relative to placebo

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3153_f023.jpg.

   Figure 23. Meta-analysis results: Effect of maintenance magnesium sulfate on mean gestational age relative to placebo

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3153_f024.jpg.

   Figure 24. Meta-analysis: Effect of maintenance beta-mimetics on prolongation of pregnancy relative to placebo

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3153_f025.jpg.

   Figure 25. Meta-analysis results: Effect of maintenance magnesium sulfate on prolongation of pregnancy relative to placebo

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3153_f026.jpg.

   Figure 26. Meta-analysis results: Effect of maintenance beta-mimetics on birth weight relative to placebo

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3153_f027.jpg.

   Figure 27. Meta-analysis results: Effect of maintenance magnesium sulfate on birth weight relative to placebo

The results of these meta-analyses are presented in Table 16; Figures 21 through 26 provide the plots for mean gestational age, prolongation of pregnancy, and birth weight, respectively, for beta-mimetics and magnesium sulfate. These analyses show that tocolytics are not effective when used as a maintenance therapy. No evidence of benefit was demonstrated for any of the outcomes investigated. Furthermore, all of the point estimates go in the direction of harm.

Table 17. Homogeneity of effects of maintenance tocolytics on pregnancy outcomes
TreatmentEndpoint (Units)Chi-square for HomogeneityDegrees of Freedomp-Value
Beta-mimeticsProlongation (days) Gestational age (weeks) Birth weight (grams) 7.57 0.67 46.276 5 7 0.272 0.985 <0.001
Magnesium sulfateProlongation (days) Gestational age (weeks) Birth weight (grams)0.43 0.58 0.071 1 10.512 0.445 0.790
Table 17 gives the results of tests for homogeneity for the above results. They show that the hypothesis that the studies are measuring the same effect can be rejected only for the birth weight outcome for beta-mimetics, thus providing support for the conclusion that the correct statistical methodology was used in the analysis.

Adjunctive Antibiotics

Table 18. Effect of antibiotics on prolongation of pregnancy (measured in days)
StudyEstimated Prolongation (Days)Standard Error95% Confidence Interval
Morales, et al.,198 198813.553.396.898, 20.20
Nadiasauskiene, et al.,191 199610.506.91-3.067, 24.07
McGregor, et al.,199 19919.904.441.174, 18.63
McCaul, et al.,203 19920.508.53-16.34, 17.34
Newton, et al.,195 19890.104.73-9.200, 9.400
Gordon, et al.,193 1995-0.104.29-8.527, 8.327
COMBINED6.422.631.266, 11.56
Table 20. Effect of antibiotics on birth weight (measured in kilograms)
StudyIncrease in Birth Weight (kg)Standard Error95% Confidence Interval
Svare, et al.,192 19970.2920.170-0.04241, 0.6264
Morales, et al.,198 19880.2880.1280.0375, 0.5385
Norman, et al.,200 19940.2250.145-0.060, 0.5104
Cox, et al.,197 19960.1690.176-0.1739, 0.5119
McGregor, et al.,19919910.1450.135-0.1195, 0.4099
Newton, et al.,19619910.1350.181-0.2174, 0.4874
Nadisauskiene and Bergstrom201 19960.0560.104-0.1476, 0.2596
Gordon, et al.,19319950.0260.124-0.2177, 0.2697
Newton, et al.,19519890.0080.134-0.2547, 0.2707
McCaul, et al.,2031992-0.0590.247-0.5481, 0.4295
Oyarzun, et al.,2111998-0.0630.108-0.2748, 0.1488
Romero, et al.,1941993-0.1480.092-0.3287, 0.0327
COMBINED0.0680.045-0.01938, 0.1558
An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3153_f028.jpg.

   Figure 28. Meta-analysis: Effect of adjunctive antibiotics on prolongation of pregnancy relative to no antibiotics

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3153_f029.jpg.

   Figure 29. Meta-analysis results: Effect of adjunctive antibiotics on estimated gestation age at birth relative to no antibiotics

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3153_f030.jpg.

   Figure 30. Meta-analysis results: Effect of adjunctive antibiotics on birth weight relative to no antibiotics

For the 13 trials of adjunctive antibiotics in women with preterm labor and unknown infection, we were able to combine results on prolongation of pregnancy from 6 trials, gestational age at delivery from 9 trials, and birth weight from 12 trials. The individual and combined results for each outcome are shown in Tables 18 through 20 and Figures 28 through 30, respectively.

Table 18 shows that adjunctive antibiotic treatment in women with preterm labor produces a marginally significant increase in length of pregnancy of about 6 days (see the last "combined" row); however, the individual studies are somewhat inconsistent. The estimated prolongation ranges from virtually no time195 to nearly 14 days;198 hence the final result should be viewed with caution. Also, the test for homogeneity gave a chi-square of 9.99 for 5 degrees of freedom, p=0.076, implying that the studies included in the analysis may not have come from the same underlying distribution.

Table 19. Effect of antibiotics on estimated gestational age at birth (measured in weeks)
StudyIncrease in Gestation Age (Weeks)Standard Error95% Confidence Interval
Nadiasaukiene, et al.,191 19962.801.020.7942, 4.806
Norman, et al.,200 19941.300.82-0.3110, 2.911
Newton, et al.,196 19911.000.90-0.7581, 2.758
Morales, et al.,198 19880.950.410.1380, 1.762
McGregor, et al.,199 19910.500.66-0.8040, 1.804
Gordon, et al.,193 19950.100.55-0.9718, 1.172
Cox, et al.,197 19960.100.95-1.765, 1.965
Newton, et al.,195 19890.000.60-1.174, 1.174
McCaul, et al.,203 1992-1.101.01-3.093, 0.8931
COMBINED0.590.240.1198, 1.057
We obtain a similar result using gestational age at birth (Table 19). The combined estimate of effect shows a marginally significant increase in gestational age of about 0.60 of a week from the use of adjunctive antibiotics (see the last, "combined" row, for which the 95 percent confidence interval includes 1.0). Although the individual studies do not reflect as much difference as for gestational prolongation (above), the effect is nonetheless relatively small and should be viewed with caution. The test for homogeneity gave a chi-square of 11.272 for 8 degrees of freedom, p=0.019.

Finally, the meta-analysis of birth weight (Table 20) shows an increase in birth weight of about 0.07 kg (70 g) (see "combined" row); however, the effect is not only small but not quite statistically significant at the 0.05 level. The test for homogeneity gave a chi-square of 14.39 for 11 degrees of freedom, p=0.212.

Home Uterine Activity Monitoring

Table 21. Effect of home uterine activity monitoring on pregnancy outcomes
Endpoint (Units)Estimated Difference (treated - control)95% Confidence Interval
Gestational age (weeks)0.19-0.37, 0.76
Survival to 37 weeks (odds ratio)1.220.73, 2.05
Birth weight (grams)-47.0-201.1, 107.1
An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3153_f031.jpg.

   Figure 31. Meta-analysis results: Effect of home uterine activity monitoring on the odds on the odds of delivery at 37 or more weeks gestation relative to no monitoring

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3153_f032.jpg.

   Figure 32. Meta-analysis results: Effect of home uterine activity monitoring on gestational age relative to no monitoring

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3153_f033.jpg.

   Figure 33. Meta-analysis results: Effect of home uterine activity monitoring on birth weight relative to monitoring

Three studies meeting the selection criteria compared the effect of home uterine activity monitoring on gestational age, 205 207 208 and two compared the effect of home uterine activity monitoring on birth weight. 205 208 For gestational age, we combined both the mean differences in gestational age and the odds of delivery at or beyond 37 weeks' gestation. For birth weight, we combined only the mean differences. The results of these meta-analyses are shown in Table 21 and Figures 31 through 33.

Table 22. Homogeneity of effects of home uterine activity monitoring on pregnancy outcomes
Endpoint (units)Chi-square for HomogeneityDegrees of Freedomp-Value
Gestational age (weeks)0.8320.660
Survival to 37 weeks (odds ratio)0.4410.508
Birth weight (grams)1.8820.391
The meta-analyses show that home uterine activity monitoring is not effective in increasing the gestational age or birth weight of infants born to women with suspected preterm labor and, at best, might prolong survival of the pregnancy to 37 weeks only very slightly; 95 percent confidence levels all include 1.0. Once again, the results should be viewed with caution because of the small number of studies and study subjects. Table 22 gives the tests for homogeneity for these analyses; none is significant.

Summary and Conclusions

We conducted a series of meta-analyses on the efficacy of various interventions for women with suspected preterm labor. These interventions included first-line tocolytics, maintenance tocolytics, adjunctive antibiotics for occult in utero, and home uterine activity monitoring. We restricted the analysis to evidence from RCTs for women with intact membranes and focused on three outcome measures: (1) prolongation of pregnancy in days, (2) gestational age at delivery, and (3) birth weight.

We found 23 first-line tocolytic studies, 10 maintenance tocolytic studies, 13 antibiotic studies, and 3 home uterine activity monitoring studies meeting our inclusion criteria. Because of the lack of complete and consistent reporting of outcomes, the meta-analyses for each intervention were conducted on fewer studies.

For first-line tocolytics, we first combined studies within tocolytic class and compared the results with placebo/no tocolytics. The only outcome measure with evidence from an adequate number of trials was the fraction of births occurring at term, defined as at or beyond 36 to 38 weeks' gestation. We found all tocolytic classes, except ethanol, were effective in increasing gestational age at birth. Beta-mimetics, calcium channel blockers, and magnesium sulfate nearly doubled the odds of birth at term, relative to control, with small but significant differences in effect size found between classes of drug.

We also found adjunctive antibiotic treatment to have small, beneficial effects for women in preterm labor with suspected occult in utero infection. We estimated increases of 6 days in the length of pregnancy, 0.6 of a week (i.e., slightly more than 4 days) in gestational age at delivery, and 70 g of weight at birth. The increase in birth weight was not found to be statistically significant.

Finally, we found no statistically significant effects from maintenance tocolytic treatment or of home uterine activity monitoring. For home uterine activity monitoring, however, no more than three trials were combined for estimates of these effects, and all trials entered few patients; therefore, the results should be interpreted with caution.

Chapter 5. Conclusions

Introduction

We discuss below the overall conclusions and implications of our analysis combining information from the evidence tables and the results in Chapter 3 and the meta-analyses discussed in Chapter 4. We begin by presenting our system for grading the strength of the information on the efficacy or effectiveness of treatment or interventions and harms data. Subsequent sections present our findings on individual topic areas, corresponding to our key clinical questions concerning the management of preterm labor (in order by biologic markers, tocolytics, antibiotics, and home uterine activity monitoring). Within each key therapy section, we present our evaluative grade for that body of literature. Finally, we discuss concerns relating to the literature as a whole.

Grading the Evidence

Chapter 2 introduced the discussion of our grading system for the preterm labor literature. Our overall approach for assigning categorical grades or ratings takes into account the quality of individual articles as scored according to the scale described in Chapter 2 as well as the strength and consistency or homogeneity of the findings across studies regarding a specific topic. The grades are defined as indicated below; the first set concerns benefits and the second set, harms.

Benefits Data Grades

We assign four basic grades for benefits. The first three grades indicate that the quantity of data is sufficient to make a judgment and that the overall data are good, fair, or poor in terms of supporting the conclusion that the intervention (e.g., pharmaceutical agent or biologic marker) is better than an alternative intervention or placebo. The fourth grade indicates that the quantity and quality of data are insufficient to draw any conclusions, chiefly because of small sample sizes or poor methods in the studies reviewed. The specific category definitions are as follows:

Benefits Data Grades
Good (A):The data are sufficient for evaluating the quality and strength of the findings. The data are consistent and indicate that the key drug or intervention is superior to alternative treatment or placebo for treating women with preterm labor.
Fair (B):The data are sufficient for evaluating the quality and strength of the findings. The data indicate that inconsistencies exist in the findings about the key drug or intervention and alternative treatment or placebo for treating women with preterm labor.
Poor (C):The data are sufficient for evaluating the quality and strength of the findings. The data do not show that the key drug or intervention is superior to alternative treatment or placebo for treating women with preterm labor.
Incomplete evidence (I):The data are insufficient for assessing the quality of the key drug or intervention for treating women with preterm labor based on limited sample size or poor methodology.
The sources of information leading to these categories are the following:
Efficacy (1):Evidence obtained from at least one well-designed RCT.
Effectiveness (2):Evidence obtained from more than one well-designed cohort or case-control study.

Harms Data Grades

We assigned two categorical ratings to the overall harms data reported for tocolytics in Chapter 3, "low" or "high." The rating reflects our judgment of the probability of risk from the intervention in question. The specific category definitions are as follows:

Low probability of risk:

- The side effects of treatment with the specific tocolytic that we noted from our review of articles were short term and not life-threatening.

- Considering the body of evidence, the frequency of occurrence was not substantially different from the frequency for an alternative tocolytic treatment or no treatment.

"High" probability of risk:

- The side effects of treatment with the specific tocolytic that we noted from our review of articles were life-threatening.

- Considering the body of evidence, the frequency of occurrence was substantially different from the frequency for an alternative tocolytic treatment or no treatment.

Biologic Markers

The key question for this topic was: What are the appropriate criteria for diagnosis of preterm labor? Relatedly, how much positive or negative predictive value does the use of biologic markers add to clinical opinion in diagnosing preterm labor?

Overall, the evidence-based review demonstrated that the two biologic markers studied- fFN and EVUSD-present strong evidence of effectiveness as diagnostic tools for assessing the risk of preterm birth in women with symptoms of preterm labor. Because of the lack of RCTs for biologic markers, we could not evaluate the efficacy of this intervention. Based on the relatively high quality and consistency of the data obtained from well-designed cohort studies, we assigned a quality grade for biologic markers evidence of Good (A-2).

Both tests were only moderately successful in predicting which women with a positive test would deliver before term, but they consistently exhibited strong NPVs, identifying women at low risk of preterm birth. Our conclusions are supported by the construction of contingency tables, allowing us to aggregate test performance information taken from individual articles. The outcome of this exercise (on fFN, as reported in Figure 9 and on EVUSD, as reported in Figure 10, Chapter 3) bolsters our qualitative finding that fFN exhibits a strong NPV.

Overall, our conclusion is that these biologic markers are untested as a means of assisting clinicians in improving outcomes in patients presenting with preterm labor. The evidence does indicate, however, that they offer valuable information that could allow women to avoid unnecessary treatments. In response to our key question, the literature supports the notion that these tests can usefully supplement clinical judgment, especially in terms of identifying women who are not likely to experience a preterm birth.

Tocolytics

The key question for this topic was: What is the efficacy and/or effectiveness of tocolytics in managing preterm labor?

We divided our analysis into two subtopics, one concerning first-line tocolytics and the other concerning maintenance therapy for a woman who has experienced an episode of preterm labor.

First-Line Tocolytic Therapy

The largest body of literature examined for this review concerned treatment with tocolytic agents for acute episodes of preterm labor: The tocolytics discussed included beta-mimetics, calcium channel blockers, magnesium sulfate, NSAIDs, and ethanol. We included both efficacy and effectiveness studies; research compared treatment with a particular class of tocolytic with a control group as well as with a different class of tocolytic drug. Overall, we assigned a quality grade to the first-line tocolytic literature of Fair (B-1), having determined that the data were sufficient for evaluating the quality of the findings and that the evidence comes from at least one well-designed RCT.

Literature comparing an intervention group of women receiving first-line treatment with tocolytics relative to a control group were available only for beta-mimetics, magnesium sulfate, and ethanol. Results were mixed: Some evidence of efficacy was shown in terms of estimated gestational age at delivery and prolongation of pregnancy with beta-mimetic or magnesium sulfate treatment. Only one beta-mimetic study 136 found a relationship between treatment and improved infant outcomes, showing significantly higher birth weight in the treatment arm. Data from the ethanol study 144 were insufficient to evaluate efficacy.

The largest body of literature reviewed compared the outcomes of treating women with beta-mimetics with the outcomes of treatment with other classes of tocolytics. Results again were mixed: Beta-mimetics were shown to have efficacy only relative to ethanol treatment in relation to prolonging pregnancy. 148 Other RCTs showed that all other classes of tocolytics had greater or not significantly different effects relative to beta-mimetics on birth outcomes. We also found mixed results for infant outcomes, but no study reported that beta-mimetics were a superior treatment. No other differences in efficacy were found between classes of tocolytics, and data from observational studies did not contradict the results of the RCTs.

Supplementing our qualitative review, some of the RCT data available from the individual studies were combined using meta-analytic techniques (Chapter 4). This methodology allowed us to compare all classes of tocolytics (i.e., beta-mimetics, calcium channel blockers, magnesium sulfate, NSAIDs) with a no-treatment group. The data suggest that all tocolytics, with the exception of ethanol, were effective in extending births to term (at or beyond 36 to 38 weeks gestation) compared with results in a no-treatment group. Evidence shows that there may be effect differences among beta-mimetics, calcium channel blockers, and magnesium sulfate; but more studies are needed for conclusive findings.

In sum, although inconsistencies exist in these data, overall the evidence supports the notion that treatment offers small improvements in pregnancy prolongation. If necessary, this may provide adequate time for administering corticosteroids for fetal lung development. Sufficient data were not available to determine directly whether this treatment has a directly beneficial effect on neonatal morbidity and mortality. Data concerning relative efficacy between drug classes are generally inconclusive, but they clearly support the conclusion that ethanol is less efficacious than other tocolytic options.

Maintenance Tocolytic Therapy

With the exception of one small study, 172 the efficacy studies of maintenance tocolytic literature showed no difference between treatment and control arms in managing women who had recently experienced an episode of preterm labor. These results were confirmed through our meta-analytic synthesis of results (Chapter 4 ). In terms of gestational age at birth, prolongation of pregnancy, or birth weight, no benefits from maintenance treatment were uncovered.

We graded this literature as Poor (C-1) in terms of showing efficacy or effectiveness of maintenance tocolytics. The data were sufficient for evaluating the quality of the literature, and no evidence supported the use of this therapeutic modality relative to no treatment.

Potential Harms From Tocolytic Therapy

The harms analysis examined tocolytic treatments for different classes of pharmaceuticals, but we did not distinguish between first-line and maintenance therapies. We assessed adverse events for both the mother and the fetus or neonate.

We graded beta-mimetics as "high" in probability of maternal risk. These drugs were shown to pose a risk to the mother of serious cardiovascular harms, minor cardiovascular harms, metabolic harms, and psychologic harms. In comparison, all other classes of tocolytic treatments were graded as "low" in relation to maternal risk. Significant levels of serious maternal complications were not shown.

We graded all classes of tocolytics as "low" risk in relation to fetal or neonatal harms. Evidence of short-term harms was inconsistent, and evidence to evaluate longer term problems was insufficient.

Additional Comments Concerning Tocolytic Therapy

The evidence from our analysis supports the generally held clinical belief that ethanol is an inappropriate treatment for women with preterm labor symptoms. Other treatments offer greater efficacy at less risk. Similar concerns would seem to be warranted with regard to treatment with beta-mimetics. Although this judgment may be an artifact of the relative wealth of data on this class of tocolytics relative to others, the benefits of beta-mimetics were never found to exceed other options, and the harms were shown to be potentially more severe.

Antibiotics

The key question for this topic was: What is the efficacy and/or effectiveness of antibiotics in managing preterm labor?

Results of our review concerning the efficacy of antibiotics for treating women with preterm labor were mixed. We graded this literature as Fair (B-1) on the basis of judgments that the data were sufficient for evaluating quality and that results from individual studies were mixed, although the meta-analysis found small, yet significant effects.

Two RCTs found improvements in all three delivery outcomes of interest: prolongation of pregnancy, mean gestational age at birth, and birth at a particular number of weeks. 191 , 192 In contrast, all other RCTs showed mixed results or no significant difference from placebo treatment. Results from survival analyses included in three RCTs were also mixed.

Our meta-analysis on six of these studies (Chapter 4) found a marginally significant increase in length of pregnancy of about 6 days. The data from nine studies combined to evaluate change in gestational age with antibiotic treatment also found a marginally significant increase of about 0.60 of a week. In contrast, no statistically significant increase in birth weight was found in the treatment relative to the control group; however, this may be an artifact of lack of power because of the small number of studies available for analysis.

Home Uterine Activity Monitoring

The key question for this topic was: What is the efficacy of home uterine activity monitoring in decreasing adverse maternal and neonatal outcomes in women who have experienced an episode of preterm labor in the current pregnancy?

The efficacy literature on home uterine activity monitoring in the treatment of women with preterm labor comprised only four studies. We give this literature a grade of Poor (C-1) because we judged that the information was sufficient for evaluating the quality of the findings. Also, we found that the evidence does not support the use of this therapeutic modality relative to using no treatment.

Three RCTs found no significant effect from home uterine activity monitoring. Our meta-analysis (Chapter 4) confirmed these conclusions in relation to gestational age at delivery and birth weight; however, meta-analysis results should be viewed with caution as they are based on data from only three studies. The fourth RCT in our analysis reported that home uterine activity monitoring significantly prolonged delivery to 37 weeks; 206 however, this research used an expanded definition of home uterine activity monitoring that included nursing support. The authors did not disentangle the two interventions, and so it is unclear whether the outcome improvement was related to the device or the nursing support. Overall, the preponderance of the evidence points to a lack of efficacy from home uterine activity monitoring in women with preterm labor. We concluded that additional research comparing use of the intervention with a control group and confirming these results in relation to this particular patient group is not needed.

Chapter 6. Recommendations for Future Research

Our evidence report has identified two domains that demand attention from future investigators and funding agencies. One domain involves numerous issues regarding definitions and methods that were discussed in the opening sections of Chapter 3 on results; the other domain concerns various substantive issues that warrant specific efficacy or effectiveness and health services research. The main portion of this chapter focuses on the second domain, but first we reiterate here the main problem with the current literature and the research on which it is based. We also note areas in which we believe that further research is not likely to be productive.

Definitional and Methods Issues

Definitional Issues

Depending on the topic and the release date of the pharmaceuticals, we reviewed a literature base that could date back more than three decades. In that time, the standards for research, including the conduct of RCTs as well as case control, longitudinal cohort, and similar effectiveness studies, have risen appreciably.

Nonetheless, in our view, a good deal of work needs to be done to shore up critical building blocks of research in this area. In particular, researchers need to attend to three issues:

  • 1

    Define preterm labor explicitly and carefully.

  • 2

    Be clear about which cointerventions were used and when were they administered during the study.

  • 3

    Separately analyze outcomes for those women with medically indicated preterm births and compare the results with those for the larger group of women with preterm labor.

Outcome Measures

Perhaps the most important methodologic issue is that of measuring outcomes in trials or other studies of interventions for women in preterm labor. Our work has documented the extent to which researchers differ in their definitions of even standard outcomes. This matter is especially important if regulators (e.g., the FDA) or health plan administrators insist that an intervention must improve perinatal outcomes before it can be approved or adopted (as the FDA did with the oxytocin-agonist atosiban). However, any trial using perinatal endpoints would have to be quite large and expensive because progress in the care of tiny newborns has resulted in a relative rarity of poor perinatal outcomes; indeed, the numbers and expense of such trials are likely to be prohibitive.

Insistence on improvements in perinatal outcome is epidemiologically sound. Therefore, investigators and policymakers are likely to continue to rely on surrogates relating to delivery such as pregnancy prolongation or the number of preterm births before an arbitrary cutoff.

One way to address this problem is to improve the analysis of available data. As an answer to this problem, we have already noted the utility of survival analysis, which could provide continuous delivery outcomes stratified by gestational age at enrollment in the study. Such an approach is clinically and biologically relevant. It allows both clinicians and the public to decide whether an intervention's effects were relevant to an individual's particular situation, meaning that researchers can begin to clarify whether (and if so, to what extent) even a small delay in delivery at a particular gestational age has the potential to be significant.

Our recommendation, therefore, is that investigators, to the extent possible, supplement the use of categorical outcomes measures (e.g., preterm or term birth) and variables involving specific cutoff points for gestational age by including concepts such as the length of time a pregnancy was extended-prolongation-as a form of survival of the pregnancy. That is, we advocate for the assessment of the impact of an intervention on pregnancy prolongation through the use of survival analysis stratified by gestational age at the onset of preterm labor.

Implicit in this recommendation is an assumption that prolongation will improve perinatal outcome. This assumption has intuitive appeal given that the correlation between gestational age at birth and improved perinatal mortality and morbidity is strongly positive; however, no one knows the advantages or disadvantages of delaying delivery for an individual once preterm labor is established. Perhaps the in utero environment of the mother who is experiencing preterm labor is hostile and prolongation does more harm than good. Thus, the ultimate test for any therapeutic intervention will always be improvements in perinatal outcome.

Given the rarity of poor perinatal outcomes in an age of effective neonatal care, the need exists for composite perinatal outcomes that reflect the morbidity and mortality of premature birth. Potential measures that could be combined include perinatal death, neurologic deficit, and oxygen requirement at hospital discharge. Perinatologists could advance the field in important ways by constructing composite markers that accurately and consistently reflect the burden of being born before term. Such an approach has been useful in other fields such as cardiology. A valid composite measure could then become the outcome of choice in intervention trials.

Moreover, long-term followup of neonates is needed. Certain conditions that could be a direct harm from an intervention or a byproduct of pregnancy prolongation in a hostile environment may not become obvious until an infant matures. Followup of particular importance regarding developmental and neurologic outcomes as the fetal and neonatal brain has yet to complete development at the time of birth and hospital discharge.

A Future Research Agenda

This report on the evidence concerning current care of the woman who presents with labor symptoms before term provides insights and directions to investigators in this field along several dimensions. These topics, covered in the remainder of this section, include basic epidemiology, use of biologic markers and better understanding of the test characteristics of the biologic markers most likely to be used in further efficacy or effectiveness studies, and use of pharmaceuticals (both tocolytics and antibiotics). We also comment here on questions for which we believe further research is not warranted.

Epidemiology

First, the reader of this report will appreciate the need for basic and epidemiologic research in preterm labor. In our judgment, far more work needs to be done in this area just to lay a better framework for future intervention-specific efficacy and effectiveness research. Said another way: a field of knowledge so primitive that the patient's complaints (i.e., preterm labor symptoms) are used as the diagnostic criterion for the condition possessing the same name must invest in basic and epidemiologic research.

We lack basic knowledge regarding the biologic mechanisms resulting in birth before term. Moreover, there is a dearth of knowledge regarding the incidence and prevalence of preterm symptoms and modifiable risk factors for preterm birth. The issue of etiologic heterogeneity versus homogeneity for preterm birth is especially poorly understood.

We also draw attention to the need for better understanding of these basic facts in ethnic minorities, for at least two reasons. First, the demographics of this nation are changing rapidly, including among women of child-bearing ages. Second, clinicians should have better information at hand to address the needs of women who may have low English literacy skills, and who may thus be at a disadvantage in terms of communicating their symptoms or understanding the medical management regimens being recommended to them.

Finally, African American women and their offspring face profound disadvantages in terms of preterm birth. The discrepancy between perinatal outcomes in black and white women and the mechanisms contributing to such disparities must be investigated, so that clinicians and the public gain a better understanding of the epidemiologic, social, and other causal and contributing factors.

Also, the best type of clinician and optimal setting for care of the woman with preterm labor remains to be answered. Given the ubiquitous nature of these symptoms and our inability to assess risks, the minimum level of expertise and capacity needed to triage these patients effectively is not known.

Preterm labor with subsequent preterm birth has remained an enigma to clinicians and investigators. The condition has proven remarkably resistant to the inquiry of modern science. No satisfactory "meta-narrative" or paradigm has been constructed that reconciles all the observed phenomena. From the perspective of the authors of the evidence report, the failure to solve the muddled picture is because of faulty science and inadequate resources for investigation. By implication, better science and more resources to invest in efficacy and effectiveness studies will help to end the enigma.

Conversely, preterm labor and subsequent preterm birth may be the culmination of a complex, biopsychosocial process such as poverty or obesity. A narrow scientific focus on the clinical endpoint of preterm labor and relentless pursuit of a meta-narrative threaten to direct attention and resources from the changes needed in society that might ultimately result in substantial reductions in preterm birth rates.

Risk Factors and Biologic Markers

Additional study regarding risk factors is important, and this issue bridges the epidemiologic and clinical realms of research. Clinicians and researchers need to be able to assess accurately the risk of preterm birth in women with symptoms of preterm labor and to exclude women at low risk of early birth from intervention trials. Failure to assess risk will not only dilute the effects of the intervention in question, but also expose numerous women who are at low risk to unnecessary and potentially harmful intervention. Some investigators have tried to assess risk by constructing strict definitions of preterm labor (e.g., by incorporating a measure of contraction frequency and duration with measures obtained from digital examination of the cervix).

Biologic markers may enhance this goal because they are objective and reproducible and are superior to purely clinical assessment or prognostication of preterm birth. At least two of the biologic markers we reviewed-fFN and EVUSD-can serve in this role. Thus, we recommend that investigators studying tocolytics and antibiotics (in their capacity as part of a medical regimen for women in or suspected of preterm labor) use biologic marker negativity as an exclusion criterion.

Moreover, we advise that further clinical trials be conducted to demonstrate whether the enhanced knowledge and precision provided by biologic markers with respect to preterm birth risk actually can improve perinatal outcomes. Trials should be conducted in which clinicians are provided with biologic marker information on a subset of participants and not on others, so as to clarify the level of efficacy and usefulness of this information in "ideal" settings. This step is fundamental before the use of biologic markers is incorporated into routine care in "average" settings, so as to avoid dilemmas in which better measurement precision does not improve outcomes (as has been seen with electronic fetal monitoring).

Pharmaceutical Agents

Our meta-analytic results for the first-line tocolytics and for antibiotics (reported in Chapter 4) suggest that further trials of these modalities are needed. Use of antibiotics is almost universally recommended in women with preterm labor symptoms as a form of prophylaxis to prevent early-onset group B streptococcal sepsis. By contrast, the use of acute tocolysis has been disparaged by many. Well-designed trials of adequate size with appropriate outcome measures of acute tocolysis to prevent preterm birth in women with symptoms of preterm labor would be helpful and reasonable. We would call attention, in particular, to the likely benefits of a focus on beta-mimetics, calcium channel blockers, and magnesium sulfate.

Review of our evidence tables highlights not only the numerous agents used for tocolysis, but also the wide range of dosages prescribed. Clinicians and patients could potentially benefit from pharmacologic studies to assess optimal dosing and duration of therapy. Overdosing could lead to excessive harms from tocolysis, whereas underdosing could result in an inability to optimize outcomes. The same arguments and limitations apply to the duration of tocolytic dosing. Although we did not find evidence to support the use of so-called maintenance tocolysis once the acute preterm labor phase is over, the optimal duration for acute tocolysis is unknown.

Our report emphasizes the small, potential benefits of acute tocolysis-in terms of pregnancy prolongation-in the face of substantial potential maternal harms. This marginal benefits-to-risk ratio argues for continued development and testing of new tocolytic drugs. Clinicians and patients would benefit from agents that are selective for uterine smooth muscle quiescence. The oxytocin-agonist atosiban is an example of a novel tocolytic drug that met utero-selective criteria. Its failure to improve perinatal outcome kept the FDA from approving it as a tocolytic, but many authorities predict approval within the European community because of its ability to prolong pregnancy and lack of side effects. Should the European experience with atosiban be favorable, perhaps approval of this agent as a tocolytic should be reconsidered in the United States. Moreover, other oxytocin antagonists and novel classes of compounds with the ability to quiet the uterus should be studied and tested.

The studies we reviewed provided virtually no information on the quality of the mother's life while receiving treatment-other than in relation to physical harm. Data on lost wages or lost days of work would support cost-benefit analyses of treatment effectiveness. In addition, information on the psychological impacts of treatment and the treatment process could also provide valuable insights.

The best type of clinician and the optimal care setting of a woman with preterm labor remains to be answered. Given the ubiquitous nature of these symptoms and the inability to adequately assess risks, clinicians and institutions caring for these women need to be able to triage these patients. However, whether this needs to be at a tertiary perinatal center or a lower level of care is a topic for future study.

If the optimal class of drug, dose, and duration of tocolytic therapy is obfuscated, the evidence to guide clinicians' use of antibiotics is even more obscure. Given the array of microbicidal and static agents in the full evidence tables, we cannot delineate a rational approach to the selection and duration of such therapies. At present, it seems prudent in light of the marginal benefits of antibiotics to follow protocols for prophylaxis against early-onset Group B streptococcal sepsis with the expectation that such interventions will not only accomplish that goal, but also prolong pregnancy. Investigators should focus their research on determining the optimal agent, dose, duration, and possible combinations for antibiotics in women presenting with preterm labor.

Areas Not Warranting Further Research

Our review casts doubts on the wisdom of future research in three areas. The first is maintenance tocolysis, where we saw from both the synthesis of the literature and the meta-analyses that tocolytics used in a maintenance modality (as contrasted with first-line interventions) offered no demonstrable benefit in preventing preterm birth. Second, it seems clear that no further work need ever be done on ethanol as either a first-line or maintenance tocolytic. The third area is home uterine activity monitoring, where we were unable to find even indirect evidence that using this device alone is useful in preventing preterm birth.

Appendices

Appendix A. Acknowledgments

This study was supported by Contract No. 290-97-0011 from the Agency for Healthcare Research and Quality (AHRQ). We acknowledge the assistance of Jacqueline Besteman, J.D., M.A., the AHRQ Task Order Officer for the Evidence-based Practice Center Program, and Ernestine Murray, R.N., M.A.S., the AHRQ Task Order Officer for this project. The investigators appreciate the considerable help of the data abstractors including clinicians: Alexander Allaire, M.D., Jennifer Bailit M.D., Muge Calikoglu M.D., Tara Gustillo-Ahby, M.D., Martin Fielder, M.D., Gil Reid, M.D., and Paul Whiteacre, M.D. and methodologists: Tracy Bouchard-Cyr, M.P.H., Kathryn Anderson Clark, M.S., Eric Finch, B.A., and Erdal Tekin, B.A. We also thank Patricia Ann Payne, B.S.N., C.N.M., M.P.H., and Rukmini Bagchee, M.S. for their work in developing the analysis of tocolytic harms. We thank the following individuals from the University of North Carolina at Chapel Hill: Gordon DeFriese, Ph.D., Co-Director of the RTI-UNC Evidence-based Practice Center and Lynn Whitener, Dr.P.H., M.S.L.S. We also thank the following individuals from the Research Triangle Institute: Suzanne L. West, Ph.D., M.P.H., and Linda J. Lux, M.P.A. for their valuable insights into the methods and substance of this evidence report, Linda Fonville for her outstanding secretarial support, and Nicole Walker and Richard Strowd, J.D. for exceptional contracting support.

Technical Expert Advisory Group

We gratefully acknowledge the substantial involvement of and assistance from the Technical Expert Advisory Group (TEAG). TEAG members are listed at the end of this section. The TEAG was meant in part to contribute to (a) advancing AHRQ's broader goals of creating and maintaining "science partnerships" and "public-private partnerships" and (b) meeting the needs of a broad array of potential customers and users of its products. Thus, it was both a substantive resource and a sounding board throughout the study, and it is the body from whom "expert inputs" were formally sought at several points through the project.

We constituted our TEAG from three types of technical experts and other partners. These types are (1) technical/clinical experts; (2) patients or representatives of organizations whose mission concerns the interests and perspectives of patients and consumers; and (3) potential users of the final evidence report or other materials, including explicitly a representative of the organization that nominated the topic (in this case, the American College of Obstetrics and Gynecology. All in all, we had six clinical/technical experts, one individual representing patient populations, and one individual representing potential user groups, for a total of eight on the TEAG.

The final decision about TEAG membership was based on candidates' availability for scheduled conference calls and other input, willingness to review materials and provide advice and assistance within a short turnaround time, and approval by the AHRQ Task Order Officer.

The RTI-UNC Center team solicited the views of TEAG members from the start of the project. Among other issues, TEAG members provided insights and reactions to key clinical questions, input to the literature review process by ensuring that we included all known published research meeting our inclusion criteria, review of our data extraction forms and view of our approach to meta-analysis. TEAG members have also provided valuable input concerning problems of focusing a literature search on treatments, and specifying appropriate outcomes, for a clinical topic as difficult to conceptualize as preterm labor.

In keeping with AHRQ's standards for employing a multidisciplinary approach to the development of evidence reports, we called on our TEAG for inputs at two key points during this task. First, the group was asked to comment on the literature synthesis and to give us feedback on our overall plans at that stage of the analysis, which included approaches to developing evidence tables and to summarizing information about patient outcomes associated with the treatment options being studied for the management of preterm labor.

Technical Expert Advisory Group
TEAG Members
Haywood Brown, MD Department of Obstetrics and Gynecology St. Vincents Hospital (FLC)Carol Sakala, PhD, MSPH Center for Applied Ethics and Professional Practice Education Development Center, Inc.
Jan French, CNM Denver Health Medical Center Susan Smarr, MD Department of Obstetrics and Gynecology Permanente Medical Group, Inc.
Jay Iams, MD Ohio State University Dept of OB/GYNBarbara Yawn, MD MS MSPH Department of Research and Education Olmstead Medical Center
Dwight Rouse, MD University of Alabama, Birmingham Dept OB/GYNStanley Zinberg, MD, MS American College of Obstetricians and Gynecologists

Peer Reviewers

An important first step in the identification of potential peer reviewers was to determine the appropriate constituencies from which our reviewers should be approached. The categories we finally settled on, and the number of reviewers asked to participate in this effort, are shown in Table A. Although the categories are fairly self-explanatory, we note the following details for clarification. Individual experts engaged in related research (as contrasted with medical practice per se) were included in Category I (clinical practice and health care delivery) because they are based in health care delivery organizations and, we judged, likely to be involved to some extent in patient care. We assigned organizations that appear to be engaged chiefly in information dissemination (clearinghouses and the like) to Category IV (quality-of-care and consumer groups) on the grounds that much quality of care activity today assumes some effort to provide consumer information. We believe that these four categories represent the full range of health care experts, users, and patient groups that should be involved in reviewing this particular evidence report on the management of preterm labor; the specific peer reviewers are listed at the end of this appendix.

We selected 19 organizations or independent peer reviewers from the four categories noted in Table A. The individuals included the eight members of the TEAG because they played a major role throughout the project in conceptualizing the work and reviewing materials; moreover, as active professionals in the field, the RTI/UNC EPC believed that their comments at this stage would be very valuable. The remainder of the peer reviewer group were identified by issuing an invitation to the organization's executive officer/director (e.g., President, CEO) or to a public sector agency head asking them to nominate a peer reviewer or by soliciting nominations from the TEAG. A preliminary (and longer) list of organizations, agencies, or individuals was submitted to the AHRQ Task Order Officer for this project for review, comment, and approval. We then contacted all potential peer reviewers to determine their willingness to serve as peer reviewers, alerting them to the fact that this service would require them to prepare formal written reviews according to the checklist developed for this evidence report. Their comments and suggestions formed the basis of our revisions to the evidence report.

The peer reviewers who were selected for this Evidence Report are predominantly involved in clinical practice or research, professional associations, and public sector quality of care organizations.

Table A. Categories of Peer Reviewers
CategoryNumber of Peer Reviewers
I. Clinical Practice Experts and Health Care Delivery Organizations3
II. Professional Associations/Guidelines Developers/Other Users of Evidence Report5
III. Public Sector/ Quasi Public and Regulatory Agencies1
IV. Quality of Care and Consumer Groups2
Peer Reviewer List
Gina Burns President Group B Strep AssociationJan M. Kriebs, CNM MSN FACNM American College of Nurse Midwives
Nancy Cuddihy Health Care ConsultantBrian Mercer, MD Society for Maternal Fetal Medicine University of Tennessee at Memphis
Calvin Hobel, MD Cedars Sinai Medical Center Deptarment of OB/GYNMichael O'Shea, MD Wake Forest University School of Medicine Department of Pediatrics
William Heuston, MD American Association of Family Practice Kathleen Rice Simpson, PhD, RNC, FAAN Association of Women's Health, Obstetric and Neonatal Nurses
Ann Koontz, DrPH, CNM Maternal and Child Health Bureau Health Resources and Services AdministrationPat Venus Center for Health Care Policy and Evaluation United Health Group

Appendix B. Extraction Forms

Article Identification, Eligibility Screen & Abstract:BIOMARKERS FOR PRETERM LABOR

Initials of Reviewer __ __ __ Date__ __/__ __ (Mo/Day) Unique Article Identifier __ __ __

Screen

(If any response is "No" then STOP. If any response is "Can't Tell," bring to the attention of the Research Coordinator.

Preterm labor, preterm birth or time to delivery an outcome? graphic element Yes graphic element No graphic element Can't Tell
40 or more subjects at completion? graphic element Yes graphic element No graphic element Can't Tell
Population of women with signs/symptoms of PTL? graphic element Yes graphic element No graphic element Can't Tell

Abstract

(Note that numeric values in many instances need to be preceded by <, > , < or >.)

1. Main Objectives (described by author, typically in the introduction).
_____________________________________________________________________________
_____________________________________________________________________________

2. Study Design. (Record as stated by the author. Typically found in the abstract, end of the introduction or beginning of the methods section. If you think the author has misstated the study design, record the discrepancy in the comments section.)

graphic element Diagnostic;
graphic element Randomized controlled trial
graphic element Non-randomized controlled trial
graphic element Prospective cohort study
graphic element Retrospective cohort study
graphic element Case-control study
graphic element Cross-over study
graphic element Case series
graphic element Cross-sectional survey
graphic element Cost-benefit/cost-effectiveness study
graphic element Other (describe in space to right) _________________________________

3. Subanalysis. Are there any subgroup or stratified analyses in this article?

graphic element Yes graphic element No

4. Country in which study was conducted: ______________________________
STUDY POPULATION

5. Inclusion Criteria

graphic element Gestational age: __________ to __________ completed weeks
graphic element Intact membranes
graphic element Fetal weight estimate: __________
graphic element Maternal age: __________ (lower limit) __________ (upper limit)
graphic element Singleton gestation
graphic element Multiple gestation
graphic element Signs and symptoms of active labor
graphic element High risk of preterm labor/delivery
If yes, how was "high risk" defined? _____________________________________________________
_________________________________________________________________________________
graphic element Other: ______________________________

6. Exclusion Criteria

graphic element Abnormal fetus graphic element Maternal age __________ (upper limit)
graphic element Maternal age __________ (lower limit)
graphic element Antibiotic therapy, current or recent graphic element Maternal fever/Pyrexia: T _____
graphic element Cerclage/Need for cerclage graphic element Maternal White Blood Cells (WBC) _____
graphic element Chorioamnionitis graphic element Multiple gestation
graphic element Chronic disease graphic elementNo prior prenatal care
graphic element Current bleeding/hemorrhage graphic elementPlacenta previa
graphic element Current illegal substance abuse graphic element Rupture of membranes
graphic element Dilation __________cm graphic element Singleton gestation
graphic element Fetal distress graphic element Uterine anomaly
graphic element Failure to give informed consent graphic element Other maternal/fetal indication for delivery
graphic element Group B Strep (GBS) Positive graphic element Other Positive Cultures ( ______________ )
graphic element Hypertensive disorders (incl. Pre-eclampsia) graphic element Signs/Symptoms of labor
graphic element High risk of preterm delivery
graphic element If yes, how was "high risk" defined? ________________________________________________
_____________________________________________________________________________
graphic element Other: ________________________________________________________________________
graphic element Other: ________________________________________________________________________
graphic element Other: ________________________________________________________________________
graphic element Other: ________________________________________________________________________

7. Gestational age assignment. (Circle either and/or as appropriate.)

graphic element Not determined/not measured/can't tell
graphic element Last Menstrual Period (LMP)
graphic element And/or clinical sizing, including serial fundal heights
graphic element And/or ultrasound When was ultrasound conducted? _________________________________

8. Population Size:

Screened __________ graphic element Cannot Determine
Eligible __________ graphic element Cannot Determine
Enrolled __________ graphic element Cannot Determine
Completed __________ graphic element Cannot Determine
Included in analysis __________ graphic element Cannot Determine

8a. Reasons for exclusion from analysis:

graphic element Loss to follow-up (n= )
graphic element Other ____________________ (n= )
graphic element Other ____________________ (n= )
graphic element Other ____________________ (n= )
graphic element Other ____________________ (n= )

9. MASKING LEVEL

Were treating physicians masked to the results of the diagnostic tests?YesNoCan't Tell
Were patients masked to the results of the diagnostic tests?YesNoCan't Tell

Definition of Diagnostic Test

10. What screening test or tests was/were used?

10a. How are the test result groups compared in the analysis defined (e.g., positive fFN test; positive fFN test + some cervical length)?

Group A: _____________________________________________________________________
Group B: _____________________________________________________________________
Group C: _____________________________________________________________________
Group D: _____________________________________________________________________

11. Diagnostic testing

11a. Diagnostic Test 1
When initiated:
Frequency Performed:
Did the result of the test influence the frequency with which it was subsequently performed?
Yes No Can't Tell

Diagnostic Test 2
When initiated:
Frequency Performed:
Did the result of the test influence the frequency with which it was subsequently performed?
Yes No Can't Tell

12. If cervical length was measured, how was it measured?

13. Was wedging or funneling or breaking considered?

14. How were they defined?
Wedging: ________________________________________________
Funneling: _______________________________________________
Breaking: ________________________________________________

15. Non-Pharmaceutical Co-Interventions.

15a. Under what circumstances were non-pharmaceutical co-interventions administered?

15b. Which ones were used?

graphic element Bedrest graphic element Routine Observation
graphic element Fetal Monitoring graphic element Uterine Monitoring
graphic element IV Fluids
graphic element Other:______________________________________________________________________________

15c. Were non-pharmaceutical co-interventions administered in both the test positive and test negative group?
Yes No
If not, in which group were they administered? ___________________________________

16. Pharmaceutical Co-Interventions.

16a. Under what circumstances were co-interventions administered?

16b. What pharmaceutical co-interventions were administered?
tocolysis
specify drug _______________
antibiotics
specify drug _______________
other _______________________

16c. Were pharmaceutical co-interventions administered in the same fashion in the test positive and test negative groups?
Yes No

17. What evaluations other than the test being studied (i.e., fFN or E3) were performed?
a. Periodic pelvic examination _____ frequency
specify by speculum or digital cervical evaluation __________________
b. Periodic ultrasound _____ frequency
c. Cultures for infection
d. HUAM _____ frequency
e. Others _____________ _____ frequency
_____________ _____ frequency

18. Participant Characteristics. (Record as number and/or percent as presented in article. Follow percent with %.)

Group AGroup BGroup CGroup DTotal
Age (±SD)
Race:
White
Black
Hispanic
Other
High School Grads
Married
Gravidity
Parity

19. OUTCOME MEASURES.

19a. Definition of Preterm Labor. (Record each definition separately. Provide the letters corresponding to each symptom. Fill in any necessary data describing a particular symptom.)
graphic element Can't tell
Def. 1: ____________________
Def. 2: ____________________
Def. 3: ____________________
a. Contractions: __________/__________
b. History of contractions
c. Dilation: __________ cm
d. Change in dilation from baseline
e. Effacement: __________%
f. Cervical change
g. Length of cervix: __________ cm
h. Contractions unresolved with: __________
i. Other: ______________________________
j. Other: ______________________________

19b. Rates of outcomes
Specify mean or median. Numeric values may be preceded by <, >, <, or >. Record as number and/or percent as presented in article. Follow percent with "%". In the last column, if a difference between the groups is statistically significant, record the relevant p value. If it is not significant, state, "n.s." If statistical adjustments were performed (i.e., stratified analyses or multivariate modeling) report the adjusted results, the strata, and any variable used for adjustment in #.

OutcomeGroup A (N = )Group B (N = )Group C (N = )Group D (N= )Sig. Level
DELIVERY
Mean/Median Est. Gestational Age (EGA) at Delivery (SD)
Time to delivery
Occurrence of Contractions
PTL Only Group
Delivered at: __________ weeks
Delivered at: __________ weeks
Delivered at: __________ weeks
Delivered at: __________ weeks
Combined PPROM + PTL
Delivered at: __________ weeks
Delivered at: __________ weeks
Delivered at: __________ weeks
Delivered at: __________ weeks
Delivery at Term
MATERNAL OUTCOMES
Maternal Length of Stay (SD)
Chorioamnionitis
Endometritis/Post-partum febrile morbidity
INFANT OUTCOMES
Mean/Median Birthweight (SD)
_____ gms
_____ gms
_____ gms
Apgar ______ at ______ min
Apgar ______ at ______ min
Mean/Neonatal Hospital Days
Mean/Median Neonatal Deaths
Fetal Mortality
Neonatal Respiratory Distress Syndrome (NRDS), Hyaline Membrane Disease (HMD)
Bronchio Pulmonary Dysplasia (BPD)
Intraventricular Hemorrhage
Neonatal Sepsis
Necrotizing Enterocolitis
Neonatal Morbidity
Pneumonia
Neonatal Infection
OTHER

20. Sensitvity/Specificity Analysis for Test ____________ for Outcome: _________________________
NOTE: You may use as many of these pages as is necessary to capture the information provided by test/outcome. The table below is designed to capture rates associated with a given test and outcome by the groups into which participants were divided for analysis (i.e., those defined in question 11). If results are provided for different outcomes, use a different page for each outcome.

OverallGroup A: ( ) Numeric Value CIGroup B: ( ) Numeric Value CIGroup C: ( ) Numeric Value CIGroup D: ( ) Numeric Value CI
Sensitivity
Specificity
Positive Predictive Value
Negative Predictive Value
Likelihood Ratio (positive/negative)
Odds Ratio or Relative Risk

20b. Were these results based on one-time testing or repeated testing?

20c. Numbers provided for sensitivity/specificity calculations:
Overall:

Had OutcomeDid not have outcome
Test positive
Test negative

Group A:

Had OutcomeDid not have outcome
Test positive
Test negative

Group B:

Had OutcomeDid not have outcome
Test positive
Test negative

Group C:

Had OutcomeDid not have outcome
Test positive
Test negative

Group D:

Had OutcomeDid not have outcome
Test positive
Test negative

21. Were ROC curves presented? graphic element Yes graphic element No If so, on what page? _____

22. Describe any multivariate analysis performed. Include statistical technique, model description, and result highlights.

23. Was cost data collected?
graphic element Yes graphic element No

24. Was information on harms collected?
graphic element Yes graphic element No

25. Describe any additional analyses and/or subgroup analyses:

26. Limitations. (Distinguish between limitations stated by the author and those found by the reviewer.)
a) State limitations noted by the author.
b) State any other limitations you found in the article.

27. Conclusions:
graphic element Test effective at predicting the following outcome: _________________________________________
_________________________________________________________________________________
graphic element Test effective at predicting relevant outcomes in the following subgroup(s)
_________________________________________________________________________________
_________________________________________________________________________________

28. Comments. (Provide any additional information that you believe needs to be captured concerning this study.)

Article Identification, Eligibility Screen & Abstract: TOCOLYTIC USE FOR PRETERM LABOR

Initials of Reviewer __ __ __ Date __ __/__ __ Unique Article Identifier __ __ __

SCREEN

(If any response is "No" then STOP. If any response is "Can't Tell," bring to the attention of the Research Coordinator.

Preterm birth an outcome graphic element Yes graphic element No graphic element Can't Tell
40 or more subjects at completion? graphic element Yes graphic element No graphic element Can't Tell
Intervention used as part of treating subjects with signs/symptoms of preterm labor (not a preventive intervention used among asymptomatic patients) graphic element Yes graphic element No graphic element Can't Tell

ABSTRACT

(Note that numeric values in many instances need to be preceded by <, >, < or >.)

1. Main Objectives (described by author, typically in the introduction).
_____________________________________________________________________________
_____________________________________________________________________________
_____________________________________________________________________________

2. Study Design. (Record as stated by the author. Typically found in the abstract, end of the introduction or beginning of the methods section. If you think the author has misstated the study design, record the discrepancy in the comments section.)

graphic element Randomized controlled trial
graphic element Non-randomized controlled trial
graphic element Prospective cohort study
graphic element Retrospective cohort study
graphic element Case-control study
graphic element Cross-over study
graphic element Case series
graphic element Cross-sectional survey
graphic element Cost-benefit/cost-effectiveness study
graphic element Other (describe in space to right) _________________________________

3. PPROM. Are women with preterm premature rupture of membrane included in this analysis as an entry criteria?
Yes No

4. Subanalysis. Are there any subgroup analyses in this article?
Yes No

5. Country in which study was conducted: ______________________________

6. Definition of Preterm Labor. (Record each definition separately. Provide the letters corresponding to each symptom. Fill in any necessary data describing a particular symptom.)
Can't tell
Def. 1: ____________________
Def. 2: ____________________
Def. 3: ____________________
c. Bio-marker(s) used (list: ______________________________)
d. Contractions: __________/__________
e. History of contractions
f. Dilation: __________ cm
g. Change in dilation from baseline
h. Effacement: __________%
i. Cervical change
j. Length of cervix: __________ cm
k. Contractions unresolved with: __________
l. Other: ______________________________
m. Other: ______________________________

7. Inclusion Criteria (Record criteria other than was stated within definition of preterm labor)
Gestational age: __________ to __________ completed weeks
Intact membranes
Fetal weight estimate: __________
Maternal age: __________ (lower limit)
__________ (upper limit)
Singleton gestation
Other: ______________________________
Other: ______________________________

8. Exclusion Criteria

graphic element Abnormal fetus graphic element Maternal age __________ (upper limit)
graphic element Allergy to protocol drug graphic element Maternal age __________ (lower limit)
graphic element Antibiotic therapy, current or recent graphic element Maternal fever/Pyrexia: T _____
graphic element Cerclage/Need for cerclage graphic element Maternal White Blood Cells (WBC) _____
graphic element Chorioamnionitis graphic element Multiple gestation
graphic element Chronic disease graphic element No prior prenatal care
graphic element Current bleeding/hemorrhage graphic element Placenta previa
graphic element Current illegal substance abuse graphic element Rupture of membranes
graphic element Dilation __________cm graphic element Singleton gestation
graphic element Fetal distress graphic element Uterine anomaly
graphic element Failure to give informed consent graphic element Other maternal/fetal indication for delivery
graphic element Group B Strep (GBS) Positive graphic element Other Positive Cultures ( ______________ )
graphic element Hypertensive disorders (incl. Pre-eclampsia)
graphic element Other: ________________________________________________________________________
graphic element Other: ________________________________________________________________________
graphic element Other: ________________________________________________________________________
graphic element Other: ________________________________________________________________________

9. Gestational age assignment. (Circle either and/or as appropriate.)
Not determined/not measured/can't tell
Last Menstrual Period (LMP)
And/or clinical sizing, including serial fundal heights
And/or ultrasound When was ultrasound conducted? _________________________________

10. Population Size:
Screened __________ Cannot Determine
Eligible __________ Cannot Determine
Enrolled __________ Cannot Determine
Completed __________ Cannot Determine

11. MASKING LEVEL

Subject masked to treatment assignment graphic element Yes graphic element No graphic element Can't Tell
Treating physician(s) masked to treatment assignment graphic element Yes graphic element No graphic element Can't Tell
Other direct care providers masked to treatment assignment graphic element Yes graphic element No graphic element Can't Tell graphic element N/A
Pharmacist masked to treatment assignment graphic element Yes graphic element No graphic element Can't Tell graphic element N/A
Pathologist masked to treatment assignment graphic element Yes graphic element No graphic element Can't Tell graphic element N/A

12. Costs and cost effectiveness. Is any cost information included in this study?
Yes No

13. Participant Characteristics. (Record as number and/or percent as presented in article. Follow percent with %.)

Treatment Group ATreatment Group BComparison GroupTotal
Age (±SD)
Race:
White
Black
Hispanic
Other
High School Grads
Married

14. Dominant Mechanisms(s) of Loss. (Provide the reason for discontinuation and if available, the number of patients).
graphic element None lost
graphic element Can't tell
graphic element Loss to follow-up __________
graphic element Side effects of treatment __________
graphic element Other: _________________________________________
graphic element Other: _________________________________________
graphic element Other: _________________________________________

15. Timing of intervention:
graphic element Within _____ hrs of evaluation/admission
graphic element Other: ____________________________
graphic element Cannot tell

16. Intervention Compliance Assessment. (State what action(s) was taken to ensure that subject completed treatment.)
graphic element None reported graphic element Mechanism: ______________________________

17. Intention to treat analysis provided?
graphic element Yes, list exclusions below and if available, number of subjects.
graphic element No
graphic element Developed allergy or did not tolerate intervention drug: __________
graphic element Positive cultures - urine: __________
graphic element Positive cultures - group B strep: __________
graphic element Positive cultures - amniotic fluid: __________
graphic element Positive cultures - other cervico-vaginal: __________
graphic element Did not comply with protocol for use of intervention drug: __________
graphic element Did not comply with follow-up: __________
graphic element Other: _____________________________________________
graphic element Other: _____________________________________________
graphic element Other: _____________________________________________
Were exclusions from analysis after Treatment Assignment? graphic element Yes graphic element No graphic element Can't Tell

18. Care Setting(s). List all settings where treatment was provided.
graphic element Hospital
graphic element Perinatal center
graphic element Home
graphic element Other: _______________________________
graphic element Other: _______________________________

19. Were harms or side effects of treatment included in the study? graphic element Yes, graphic element _____ page # (s) graphic element No

20. Is successful tocolysis defined? graphic element Yes graphic element No
If yes, define: __________________________________________________________________________
21. Definition of Treatment and Comparison Groups. How are the groups defined?
Treatment Group A: _____________________________________________________________________
Treatment Group B: _____________________________________________________________________
Comparison Group: _____________________________________________________________________

22. TOCOLYTIC DRUG REGIMEN BY GROUPS (If treatment includes a separate loading and maintenance dose of the same drug, i.e., the route and/or duration of treatment changes, list these as different drugs.)

Treatment Group ATreatment Group BComparison Group
Number of Subjects per groupN =N =N =
DRUG 1
Dose
Route
Commencement of Treatment
Duration of Treatment
DRUG 2
Dose
Route
Commencement of Treatment
Duration of Treatment
DRUG 3
Dose
Route
Commencement of Treatment
Duration of Treatment
DRUG 4
Dose
Route
Commencement of Treatment
Duration of Treatment

23. Non-Pharmaceutical Co-Interventions. Did comparison group receive co-interventions? graphic element Yes graphic element No

graphic element Bedrest graphic element Routine Observation
graphic element Fetal Monitoring graphic element Uterine Monitoring
graphic element IV Fluids
graphic element Other: ______________________________________________________________________________

24. CO-INTERVENTION DRUG REGIMEN BY GROUPS

Treatment Group ATreatment Group BComparison Group
Number of Subjects per groupN =N =N =
DRUG 1
Dose
Route
Aim of Treatment
Commencement of Treatment
Duration of Treatment
DRUG 2
Dose
Route
Aim of Treatment
Commencement of Treatment
Duration of Treatment
DRUG 3
Dose
Route
Aim of Treatment
Commencement of Treatment
Duration of Treatment
DRUG 4
Dose
Route
Aim of Treatment
Commencement of Treatment
Duration of Treatment

25a). OUTCOME MEASURES. Specify mean or median. Numeric values may be preceded by <, >, <, or >. Record as number and/or percent as presented in article. Follow percent with "%". In the last column, if a difference between the groups is statistically significant, record the relevant p value. If it is not significant, state, "n.s." If statistical adjustments were performed (i.e., stratified analyses or multivariate modeling) report the adjusted results, the strata, and any variable used for adjustment in #24.

OutcomeIntervention Group A (N = )Intervention Group B (N = )Comparison Group (N = )Sig. Level
DELIVERY
Mean/Median Est. Gestation Age (EGA) at Delivery (SD)
Mean/Median Prolongation (SD) Days
PTL Only Group
Delivered at: __________ weeks
Delivered at: __________ weeks
Delivered at: __________ weeks
Delivered at: __________ weeks
Combined PPROM + PTL
Delivered at: __________ weeks
Delivered at: __________ weeks
Delivered at: __________ weeks
Delivered at: __________ weeks
Delivery at Term
PROM OR PPROM Outcome
MATERNAL OUTCOMES
Maternal Length of Stay (SD)
Chorioamnionitis
Endometritis/Post-partum
Adverse Drug Reactions
INFANT OUTCOMES
Mean/Median Birthweight (SD)
_____ gms
_____ gms
_____ gms
Apgar ______ at ______ min
Apgar ______ at ______ min
Mean/Neonatal Hospital Days
Mean/MedianNeonatal Deaths
Fetal Mortality
Neonatal Respiratory Distress Syndrome (NRDS), Hyaline Membrane Disease (HMD)
Bronchio Pulmonary Dysplasia (BPD)
Intraventricular Hemorrhage
Neonatal Sepsis
Necrotizing Enterocolitis
Neonatal Morbidity
Pneumonia
Neonatal Infection
OTHER

26. Describe any additional analyses and/or subgroup analyses:

27. Limitations. (Distinguish between limitations stated by the author and those found by the reviewer.)

28. Conclusions:
graphic element Intervention ineffective in altering outcomes detailed above.
graphic element Intervention effectively alters the following outcomes: ______________________________________
________________________________________________________________________________
graphic element Intervention effectively alters the following outcomes in the following subgroup(s)
________________________________________________________________________________
________________________________________________________________________________

29. Comments. (Provide any additional information that you believe needs to be captured concerning this study.)
Unique Article Identifier:
Study Type:
Sample Size:

graphic elementMaternal HarmsTreatment Grp1Treatment Grp2Control Grp
Death
Heart Failure
Arrhythmia
Abnormal glucose tolerance
Elevated liver functions/hepatitis
Renal failure
Headache
Tremor
Hypotension
Skin rash
Stress, emotional distress or other mental health issues
Discontinuations due to side effects
Other:
Other:
Other
Death
Intraventricular hemorrhage
Necrotizing enterocolitis
Ductal constriction
Bardycardia
Cerebral Palsy
Other neurologic effects
Other:
Other:
Other:
Other:
* Statistical significance: if nonsignificant note NS, otherwise provide p number.
Method of Ascertaining Harms:
Interviewer-administered side-effect checklist
Self-administered side-effect checklist
Open-ended question asked at each visit
Investigator observed at each visit
Spontaneous reporting by patient
Not otherwise specified

Article Identification, Eligibility Screen & Abstract:HOME UTERINE ACTIVITY MOINTORING

Initials of Reviewer __ __ __ Date __ __/__ __ (Mo/Day) Unique Article Identifier __ __ __

SCREEN

(If any response is "No" then STOP. If any response is "Can't Tell," bring to the attention of the Research Coordinator.

Preterm birth an outcome graphic element Yes graphic elementNo graphic element Can't Tell
40 or more subjects at completion? graphic element Yes graphic element No graphic element Can't Tell
Intervention used as part of managing subjects with signs/symptoms of preterm labor or who have been treated for preterm labor (not a preventive intervention used among asymptomatic patients)? graphic element Yes graphic element No graphic element Can't Tell
Study design RCT? graphic element Yes graphic element No graphic element Can't Tell

ABSTRACT

(Note that numeric values in many instances need to be preceded by <, > , < or >.)
25. Main Objectives (described by author, typically in the introduction).
_____________________________________________________________________________
_____________________________________________________________________________
_____________________________________________________________________________

2. Study Design. (Record as stated by the author. Typically found in the abstract, end of the introduction or beginning of the methods section. If you think the author has misstated the study design, record the discrepancy in the comments section.)
Randomized controlled trial
Non-randomized controlled trial
Prospective cohort study
Retrospective cohort study
Case-control study
Cross-over study
Case series
Cross-sectional survey
Cost-benefit/cost-effectiveness study
Other (describe in space to right) _________________________________

3. Country in which study was conducted: ______________________________

4. Definition of Preterm Labor. (Record each definition separately. Provide the letters corresponding to each symptom. Fill in any necessary data describing a particular symptom.)
Can't tell
Def. 1: ____________________
Def. 2: ____________________
Def. 3: ____________________
n. Bio-marker(s) used (list: ______________________________)
o. Contractions: __________/__________
p. History of contractions
q. Dilation: __________ cm
r. Change in dilation from baseline
s. Effacement: __________%
t. Cervical change
u. Length of cervix: __________ cm
v. Contractions unresolved with: __________
w. Other: ______________________________
x. Other: ______________________________

5. Inclusion Criteria (Record criteria other than was stated within definition of preterm labor)
Gestational age: __________ to __________ completed weeks
Intact membranes
Fetal weight estimate: __________
Maternal age: __________ (lower limit)
__________ (upper limit)
Singleton gestation
Signs and symptoms of preterm labor in the current pregnancy
Treatment for preterm labor in the current pregnancy
Other: ______________________________
Other: ______________________________

6. Exclusion Criteria

graphic element Abnormal fetus graphic element Maternal age __________ (upper limit)
graphic element Allergy to protocol drug graphic element Maternal age __________ (lower limit)
graphic element Antibiotic therapy, current or recent graphic element Maternal fever/Pyrexia: T _____
graphic element Cerclage/Need for cerclage graphic element Maternal White Blood Cells (WBC) _____
graphic element Chorioamnionitis graphic element Multiple gestation
graphic element Chronic disease graphic element No prior prenatal care
graphic element Current bleeding/hemorrhage Placenta previa graphic element Current illegal substance abuse Rupture of membranes
graphic element Dilation __________cm graphic element Singleton gestation
graphic element Fetal distress graphic element Uterine anomaly
graphic element Failure to give informed consent graphic element Other maternal/fetal indication for delivery
graphic element Group B Strep (GBS) graphic elementPositive Other Positive Cultures ( ______________ )
graphic element Hypertensive disorders (incl. Pre-eclampsia)
graphic element Other: ________________________________________________________________________
graphic element Other: ________________________________________________________________________
graphic element Other: ________________________________________________________________________
graphic element Other: ________________________________________________________________________

7. Gestational age assignment. (Circle either and/or as appropriate.)
Not determined/not measured/can't tell
Last Menstrual Period (LMP)
And/or clinical sizing, including serial fundal heights
And/or ultrasound When was ultrasound conducted? _________________________________

8. Population Size:
Screened__________ Cannot Determine
Eligible__________ Cannot Determine
Enrolled__________ Cannot Determine
Completed__________ Cannot Determine

9. MASKING LEVEL

Subject masked to intervention assignment graphic element Yes graphic element No graphic element Can't Tell
Treating physician(s) masked to intervention assignment graphic element Yes graphic element No graphic element Can't Tell
Other direct care providers masked to intervention assignment graphic element Yes graphic element No graphic element Can't Tell graphic element N/A
Pharmacist masked to intervention assignment graphic element Yes graphic element No graphic element Can't Tell graphic element N/A
Pathologist masked to intervention assignment graphic element Yes graphic element No graphic element Can't Tell graphic element N/A

10. Definition of Treatment and Comparison Groups. How are the groups defined?
Group A: _____________________________________________________________________
Group B: _____________________________________________________________________
Group C: _____________________________________________________________________

11. Participant Characteristics. (Record as number and/or percent as presented in article. Follow percent with %.)

Group A (N= )Group B (N= )Group C (N= )Total
Age (±SD)
Race:
White
Black
Hispanic
Other
PTL in a previous pregnancy
Previous PTD
Gravidity
Parity
High School Grads
Married

12. Dominant Mechanisms(s) of Loss. (Provide the reason for discontinuation and if available, the number of patients).
graphic element None lost
graphic element Can't tell
graphic element Loss to follow-up __________
graphic element Side effects of treatment __________
graphic element Other: _________________________________________
graphic element Other: _________________________________________
graphic element Other: _________________________________________

13. Intervention Compliance Assessment. (State what action(s) was taken to ensure that subject completed treatment.)
graphic element None reported graphic element Mechanism: ______________________________

14. Intention to treat analysis provided?
graphic element Yes, list exclusions below and if available, number of subjects.
graphic element No
graphic element Positive cultures - urine: __________
graphic element Positive cultures - group B strep: __________
graphic element Positive cultures - amniotic fluid: __________
graphic element Positive cultures - other cervico-vaginal: __________
graphic element Did not comply with protocol for use of intervention: __________
graphic element Did not comply with follow-up: __________
graphic element Other: _____________________________________________
graphic element Other: _____________________________________________
graphic element Other: _____________________________________________
Were exclusions from analysis after Treatment Assignment? graphic element Yes graphic element No graphic element Can't Tell

15. Non-Pharmaceutical Co-Interventions.

15a. Under what circumstances were non-pharmaceutical co-interventions (other than HUAM) administered?

15b. Which ones were used?
graphic element Bedrest graphic element Routine Observation
graphic element Fetal Monitoring
graphic element IV Fluids
graphic element Other:______________________________________________________________________________

15c. Were non-pharmaceutical co-interventions administered in both/all groups? graphic element Yes graphic element No

If not, in which group(s) were they administered? ___________________________________

Under what circumstances were they administered?_____________________________________

16. Pharmaceutical Co-Interventions.

16a. Under what circumstances were co-interventions administered?

16b. What pharmaceutical co-interventions were administered?
graphic element tocolysis
specify drug _______________
graphic element antibiotics
specify drug________________
graphic element other ________________________

16c. Were pharmaceutical co-interventions administered in the same fashion in both/all groups?
graphic element Yes graphic element No
If not, describe the differences in administration.________________________________________________

18a). OUTCOME MEASURES. Specify mean or median. Numeric values may be preceded by <, >, <, or >. Record as number and/or percent as presented in article. Follow percent with "%". In the last column, if a difference between the groups is statistically significant, record the relevant p value. If it is not significant, state, "n.s." If statistical adjustments were performed (i.e., stratified analyses or multivariate modeling) report the adjusted results, the strata, and any variable used for adjustment)

OutcomeGroup A (N = )Group B (N = )Group C (N = )Sig. Level
DELIVERY
Mean/Median Est. Gestation Age (EGA) at Delivery (SD)
Mean/Median Prolongation (SD) Days
PTL Only Group
Delivered at: __________ weeks
Delivered at: __________ weeks
Delivered at: __________ weeks
Delivered at: __________ weeks
MATERNAL OUTCOMES
Hospital admissions
Any non-delivery admissions
Chorioamnionitis
Endometritis/Post-partum
Adverse Drug Reactions
INFANT OUTCOMES
Mean/Median Birthweight (SD)
_____ gms
_____ gms
_____ gms
Apgar ______ at ______ min
Apgar ______ at ______ min
Mean/Neonatal Hospital Days
Mean/MedianNeonatal Deaths
Fetal Mortality
Neonatal Respiratory Distress Syndrome (NRDS), Hyaline Membrane Disease (HMD)
Bronchio Pulmonary Dysplasia (BPD)
Intraventricular Hemorrhage
Neonatal Sepsis
Necrotizing Enterocolitis
Neonatal Morbidity
Pneumonia
Neonatal Infection
OTHER

19. Describe any multivariate analysis performed. Include statistical technique, model description, and result highlights.
:

20. Describe any additional analyses and/or subgroup analyses:

21. Was cost data collected?
graphic element Yes graphic element No

22. Limitations. (Distinguish between limitations stated by the author and those found by the reviewer.)
a) State limitations noted by the author.
b) State any other limitations you found in the article.

23. Conclusions:
graphic element Intervention ineffective in altering outcomes detailed above.
graphic element Intervention effectively alters the following outcomes: ______________________________________
________________________________________________________________________________
graphic element Intervention effectively alters the following outcomes in the following subgroup(s)
________________________________________________________________________________
________________________________________________________________________________

24. Comments. (Provide any additional information that you believe needs to be captured concerning this study.)

Appendix C. Acronyms and Abbreviations

ACOG: American College of Obstetricians and Gynecologists

add.: additional

AHRQ: Agency for Healthcare Research and Quality

AoT: aim of treatment

APGAR: Activity, Pulse, Grimace, Appearance, Respiration

assoc: association

BP: blood pressure

bpm: beats per minute

BV: bacterial vaginosis

C: clinician reviewer

cAMP: cyclic adenosine monophosphate

CI: confidence interval

corr: correlation, correlated with

CoT: commencement of treatment

CRH: corticotropin-releasing hormone

Cx: control group

dk: don't know, not stated in article

DoT: duration of treatment

E3: salivary estriol

ECG: electrocardiogram

EGA: estimated gestational age

EVUSD: endovaginal ultrasound

FDA: Food and Drug Administration

fFN: fetal fibronectin

HEED: Health Economic Evaluations Database

HMD: hyaline memebrane disease

HUAM: home uterine activity monitoring

IUGR: intrauterine growth retardation

IV: intravenous

IVH: intraventricular hemorrhage

M: methods reviewer

mcg, µg: microgram

MeSH: Medical Subject Headings

NEC: necrotising enterocoitis

NICHHD: National Institute of Child Health and Human Development

NICU: neonatal intensive care unit

NLM: National Library of Medicine

NPV: negative predictive value

NRDS: neonatal respiratory distress syndrome

NS: not significant

NSAID: nonsteroidal anti-inflammatory disease drugs

OR: odds ratio

path: pathologist

pd: time period

PDA: patent ductus arteriosus

pharm: pharmacist

PPV: positive predictive value

PROM: premature rupture of membrane

prov: providers

PTL: preterm labor

PVC: premature vehicular contraction

Q: quartile

RCT: randomized controlled trial

Rh: blood group type

ROC: receiver-operator characteristic curve

RR: risk ratio

RTI-UNC EPC: Research Triangle Institute-University of North Carolina Evidence-based Practice Center

SD: standard deviation

SEM: standard error of the mean

sens: sensitivity

SGA: small for gestational age

sig: significant

spec: s1pecificity

SVT: suproventricular tachycardia

TEAG: Technical Expert Advisory Group

TVUSD: transvaginal ultrasound

tx: treatment group

UTI: urinary tract infection

wk(s): week(s)

wkly: weekly

Evidence Tables

References
1.
Paneth NS. The problem of low birth weight. Future Child. 1995; 5(1): 1934. [PubMed]
2.
National Institute of Child Health and Human Development. Report of the Consensus Development Conference on the Effect of Corticosteroids for Fetal Maturation on Perinatal Outcomes. Bethesda, MD: The Institute; 1994 Feb 28-Mar 2.
3.
Wilcox LS, Kiely JL, Melvin CL, et al. Assisted reproductive technologies: Estimates of their contribution to multiple births and newborn hospital days in the United States. Fertil Steril. 1996; 65(2): 3616. [PubMed]
4.
Ventura SJ, Martin JA, Curtin SC, et al. Report of final natality statistics, 1996. Monthly Vital Statistics Report, Vol. 46, No. 11, supp. Hyattsville, MD: National Center for Health Statistics; 1998.
5.
Ventura SJ, Martin JA, Curtin SC, et al. Final data for 1997. National Vital Statistics reports, Vol. 47, No. 18. Hyattsville, MD: National Center for Health Statistics; 1998.
6.
ACOG Technical Bulletin. Preterm labor. Int J Gynecol Obstet. 1995; 50: 30313.
7.
Copper RL, Goldenberg RL, Davis RO, et al. Warning symptoms, uterine contractions, and cervical examination findings in women at risk of preterm delivery. Am J Obstet Gynecol. 1990; 162: 74854. [PubMed]
8.
West SL, Yawn BP, Thorp JM, et al. The preventability of preterm delivery after preterm labor: A population-based assessment. 14th International Conference on Pharmacoepidemiology; 1998 Aug 16; Berlin.
9.
Kramer MS. Preventing preterm birth: Are we making progress? Prenat Neonat Med. 1998; 3: 102.
10.
Rush RW, Keirse MJ, Howat P, et al. Contribution of preterm delivery to perinatal mortality. Br Med J. 1976; 2(6042): 9658. [PubMed]
11.
McCormick MC. The contribution of low birth weight to infant mortality and childhood morbidity. N Engl J Med. 1985; 312: 829. [PubMed]
12.
Robertson PA, Sniderman SH, Laros RK, et al. Neonatal morbidity according to gestational age and birth weight from five tertiary care centers in the United States, 1983 through 1986. Am J Obstet Gynecol. 1992; 166: 162941. [PubMed]
13.
Creasy RK. Preterm birth prevention: Where are we? Am J Obstet Gynecol. 1993; 168: 122330. [PubMed]
14.
Copper RL, Goldenberg RL, Creasy RK, et al. A multicenter study of preterm birth weight and gestational age-specific neonatal mortality. Am J Obstet Gynecol. 1993; 168: 7884. [PubMed]
15.
Seidman DS, Laor A, Gale R, et al. Birth weight and intellectual performance in late adolescence. Obstet Gynecol. 1992; 79: 5436. [PubMed]
16.
Lewit E, Baker L, Corman H, et al. The direct cost of low birth weight. Future Child. 1995; 5(1): 3556. [PubMed]
17.
Savitz DA, Blackmore CA, Thorp JM. Epidemiologic characteristics of preterm delivery: Etiologic heterogeneity. Am J Obstet Gynecol. 1991; 164: 4617.
18.
Berkowitz GS, Blackmore-Prince C, Lapinski RH, et al. Risk factors for preterm birth subtypes. Epidemiology. 1998; 9: 27985. [PubMed]
19.
Meis PJ, Goldenberg RL, Mercer BM, et al. The preterm prediction study: Risk factors for indicated preterm births. Am J Obstet Gynecol. 1998; 178: 5627. [PubMed]
20.
Goldenberg RL, Rouse DJ. Prevention of premature birth. N Engl J Med. 1998; 339: 31320. [PubMed]
21.
Klebanoff MA, Regan JA, Rao AV, et al. Outcome of the vaginal Infections and Prematurity Study: Results of a clinical trial of erythromycin among pregnant women colonized with group B streptococci. Am J Obstet Gynecol. 1995; 172: 15405. [PubMed]
22.
Guinn DA, Goldenberg RL, Hauth JC, et al. Risk factors for the development of preterm premature rupture of the membranes after arrest of preterm labor. Am J Obstet Gynecol. 1995; 173: 13105. [PubMed]
23.
Berkowitz GS, Papiernik E. Epidemiology of preterm birth. Epidemiol Rev. 1993; 15: 41443. [PubMed]
24.
Bakketeig LS, Hoffman HJ, Harley EE. The tendency to repeat gestational age and birth weight in successive births. Am J Obstet Gynecol. 1979; 135: 1086103. [PubMed]
25.
Kleinman JC, Kessel SS. Racial differences in low birth weight: Trends and risk factors. N Engl J Med. 1987; 317: 74953. [PubMed]
26.
Blackmore CA, Savitz DA, Edwards LJ, et al. Racial differences in the patterns of preterm delivery in central North Carolina, USA. Paediatr Perinat Epidemiol. 1995; 9: 28195. [PubMed]
27.
Kramer MS, Coates AL, Michoud MC, et al. Maternal anthropometry and idiopathic preterm labor. Obstet Gynecol. 1995; 86: 7448. [PubMed]
28.
Siega-Riz AM, Adair LS, Hobel CJ. Maternal underweight status and inadequate rate of weight gain during the third trimester of pregnancy increases the risk of preterm delivery. J Nutr. 1996; 126: 14653. [PubMed]
29.
Meyer MB, Tonascia JA. Maternal smoking, pregnancy complications, and perinatal mortality. Am J Obstet Gynecol. 1977; 128: 494502. [PubMed]
30.
Floyd RL, Rimer BK, Giovino GA, et al. A review of smoking in pregnancy: Effects on pregnancy outcomes and cessation efforts. Ann Rev Public Health. 1993; 14: 379411. [PubMed]
31.
Wisborg K, Henriksen TB, Hedegaard M, et al. Smoking during pregnancy and preterm birth. Br J Obstet Gynaecol. 1996; 103: 8005. [PubMed]
32.
Metzger BE, Ravnikar V, Vileisis RA, et al. Accelerated starvation and the skipped breakfast in late normal pregnancy. Laucet. 1982 Mar 13; 1(8272): 58892.
33.
Binienda Z, Massmann A, Mitchell MD, et al. Effect of food withdrawal on arterial blood glucose and plasma 13, 14-dihydro-15-keto-prostaglandin F2 alpha concentrations and nocturnal myometrial electromyographic activity in the pregnant rhesus monkey in the last third of gestation: A model for preterm labor? Am J Obstet Gynecol. 1989 Mar; 160(3): 74650.
34.
Martius J, Krohn MA, Hillier SL, et al. Relationships of vaginal lactobacillus species, cervical chlamydia trachomatis, and bacterial vaginosis to preterm birth. Obstet Gynecol. 1988; 71: 8995. [PubMed]
35.
Riduan JM, Hillier SL, Utomo B, et al. Bacterial vaginosis and prematurity in Indonesia: Association in early and late pregnancy. Am J Obstet Gynecol. 1993; 169: 1758. [PubMed]
36.
Hay PE, Lamont RF, Taylor-Robinson D, et al. Abnormal bacterial colonisation of the genital tract and subsequent preterm delivery and late miscarriage. BMJ. 1994; 308(6924): 2958. [Free Full Text in PMC icon.Free Full text in PMC] [PubMed]
37.
McGregor JA, French JI, Jones W, et al. Bacterial vaginosis is associated with prematurity and vaginal fluid mucinase and sialidase: Results of a controlled trial of topical clindamycin cream. Am J Obstet Gynecol. 1994; 170: 104859. [PubMed]
38.
Holst E, Goffeng AR, Andersch B. Bacterial vaginosis and vaginal microorganisms in idiopathic premature labor and association with pregnancy outcome. J Clin Microbiol. 1994; 32: 17686. [Free Full Text in PMC icon.Free Full text in PMC] [PubMed]
39.
Meis PJ, Goldenberg RL, Mercer B, et al. The preterm prediction study: Significance of vaginal infections. Am J Obstet Gynecol. 1995; 173: 12315. [PubMed]
40.
Gravett MG, Nelson HP, DeRouen T, et al. Independent associations of bacterial vaginosis and Chlamydia trachomatis infection with adverse pregnancy outcome. JAMA. 1986; 256: 1899903. [PubMed]
41.
McGregor JA, French JI, Parker R, et al. Prevention of premature birth by screening and treatment for common genital tract infections: Results of a prospective controlled evaluation. Am J Obstet Gynecol. 1995; 173: 15767. [PubMed]
42.
Hillier SL, Krohn MA, Cassen E, et al. The role of bacterial vaginosis and vaginal bacteria in amniotic fluid infection in women in preterm labor with intact fetal membranes. Clin Infect Dis. 1995; 20(Suppl 2): S276S278. [PubMed]
43.
Goldenberg RL, Klebanoff MA, Nugent R, et al. Bacterial colonization of the vagina during pregnancy in four ethnic groups. Am J Obstet Gynecol. 1996; 174: 161821. [PubMed]
44.
Creasy RK, Golbus MS, Laros RKJ, et al. Oral ritodrine maintenance in the treatment of preterm labor. Am J Obstet Gynecol. 1980; 137: 2129. [PubMed]
45.
Mercer BM, Lewis R. Preterm labor and preterm premature rupture of the membranes. Diagnosis and management. Infect Dis Clin North Am. 1997; 11: 177201. [PubMed]
46.
Peaceman AM, Andrews WW, Thorp JM, et al. Fetal fibronectin as a predictor of preterm birth in patients with symptoms: A multicenter trial. Am J Obstet Gynecol. 1997; 177: 138. [PubMed]
47.
Main DM, Gabbe SG, Richardson D, et al. Can preterm deliveries be prevented? Am J Obstet Gynecol. 1985; 151: 8928. [PubMed]
48.
Iams JD, Johnson FF, Creasy RK. Prevention of preterm birth. Clin Obstet Gynecol. 1988; 31: 599615. [PubMed]
49.
Dyson DC, Crites YM, Ray DA, et al. Prevention of preterm birth in high-risk patients: The role of education and provider contact versus home uterine monitoring. Am J Obstet Gynecol. 1991; 164: 75662. [PubMed]
50.
Collaborative Home Uterine Monitoring Study (CHUMS) Group. A multicenter randomized controlled trial of home uterine monitoring: Active versus sham device. Am J Obstet Gynecol. 1995; 173: 11207. [PubMed]
51.
Dyson DC, Danbe KH, Bamber JA, et al. Monitoring women at risk for preterm labor. N Engl J Med. 1998; 338: 159. [PubMed]
52.
Grimes DA, Schulz KF. Randomized controlled trials of home uterine activity monitoring: A review and critique. Obstet Gynecol. 1992; 79: 1372. [PubMed]
53.
Uldberg N, Ekman G, Malmstrom A, et al. Ripening of the human uterine cervix related to changes in collagen, glycosaminoglycans, and collagenolytic activity. Am J Obstet Gynecol. 1983; 147: 6626. [PubMed]
54.
Oxorn H. Human labor and birth. 5th ed. Norwalk, CT: Appleton Century Crofts; 1986. p. 113-51.
55.
Iams JD, Paraskos J, Landon MB, et al. Cervical sonography in preterm labor. Obstet Gynecol. 1994; 84: 406. [PubMed]
56.
Iams JD, Goldenberg RL, Mercer BM, et al. The preterm prediction study: Recurrence risk of spontaneous preterm birth. Am J Obstet Gynecol. 1998; 178: 103540. [PubMed]
57.
Goldenberg RL, Mercer BM, Meis PJ, et al. The preterm prediction study: Fetal fibronectin testing and spontaneous preterm birth. Obstet Gynecol. 1996; 87: 6438. [PubMed]
58.
Hartmann K, Thorp JM, McDonald TL, et al. Cervical kimensions and risk of preterm birth: A prospective cohort study. Obstetrics and Gynecology. 1999; 93: 5049. [PubMed]
59.
Iams JD. Cervical ultrasonography. Ultrasound Obstet Gynecol. 1997; 10: 15660. [PubMed]
60.
Goland RS, Wardlaw SL, Blum M, et al. Biologically active corticotropin-releasing hormone in maternal and fetal plasma during pregnancy. Am J Obstet Gynecol. 1988; 159: 88490. [PubMed]
61.
McGregor JA, Jackson GM, Lachelin GC, et al. Salivary estriol as risk assessment for preterm labor: A prospective trial. Am J Obstet Gynecol. 1995; 173: 133742. [PubMed]
62.
Lands AM, Arnold A, McAuliff JP, et al. Differentiation of receptor systems by sympathomimetic amines. Nature. 1967; 214: .
63.
Scheid CR, Honeyman TW, Fay FS. Mechanism of beta-adrenergic relaxation of smooth muscle Nature 1979. 277:326.View this and related citations using .
64.
Roberts JM. Current understanding of pharmacologic mechanisms in the prevention of preterm birth. Clin Obstet Gynecol. 1984; 27: 592605. [PubMed]
65.
Besinger RE, Niebyl JR. The safety and efficacy of tocolytic agents for the treatment of preterm labor. Obstet Gynecol Surv. 1990; 45: 41540. [PubMed]
66.
Forman A, Andersson KE, Ulmsten U. Inhibition of myometrial activity by calcium antagonists. Semin Perinatol. 1981; 5: 28894. [PubMed]
67.
Calvin C, Loutzenhizer R, Van Breeman C. Mechanisms of calcium antagonist-induced vasodilation. Ann Rev Pharmacol Toxicol. 1983; 23: 37396. [PubMed]
68.
Petrie RH. Preterm parturition. Tocolysis using magnesium sulfate. Semin Perinatol. 1981; 5: 26673.
69.
Harbert GM, Cornell GW, Thorton WN. Effect of toxemia therapy on uterine dynamics. Am J Obstet Gynecol. 1969; 105: .
70.
Karim SM. On the use of blockers of prostaglandin synthesis in the control of labor. Adv Prostaglandin Thromboxane Res. 1978; 4: 3016. [PubMed]
71.
Johnson WC, Habert GM, Martin CB. Pharmacologic control of uterine contractility. Am J Obstet Gynecol. 1975; 123: 36475. [PubMed]
72.
Wigvist N, Lundstrom V, Green K. Premature labor and indomethacin. Prostaglandins. 1975; 10: 51526. [PubMed]
73.
Niebyl JR, Blake DA, White RD, et al. The inhibition of premature labor with indomethacin. Am J Obstet Gynecol. 1980; 136: 10149. [PubMed]
74.
Phaneuf S, Asboth G, MacKenzie JZ, et al. Effect of oxytocin antagonists on the activation of human myometrium in vitro: Atosiban prevents oxytocin induced desensitization. Am J Obstet Gynecol. 1994; 171: 162734. [PubMed]
75.
Zingg H, Lefebvre D, Giaid A. Uterine oxytocin gene expression: A novel framework for oxytocin action. Reg Peptides. 1993; 45: 436.
76.
Hahn DW, Demarest KT, Ericson E, et al. Evaluation of 1-deamino-[D-Tyr(Oethyl)2, Thr4, Orn8] vasotocin, an oxytocin antagonist, in animal models of uterine contractility and preterm labor; a new tocolytic agent. Am J Obstet Gynecol. 1987; 157: 97782. [PubMed]
77.
Bossmar T, Akerlund M, Fantoni G, et al. Receptors for and myometrial responses to oxytocin and vasopressin in preterm and term human pregnancy: Effects of the oxytocin antagonist atosiban. Am J Obstet Gynecol. 1994; 171: 163442. [PubMed]
78.
Giussani DA, Jenkins SL, Mecenas CA, et al. The oxytocin antagonist atosiban prevents androstenedione-induced myometrial contractions in the chronically instrumented, pregnant Rhesus monkey. Endocrinology. 1996; 137: 33027. [PubMed]
79.
Fuchs F, Fuchs AR, Poblete VF, et al. Effect of alcohol on threatened premature labor. Am J Obstet Gynecol. 1967; 99: 62737. [PubMed]
80.