The Agency for Healthcare Research and Quality (AHRQ), formerly the Agency for Health Care Policy and Research, through its Evidence-based Practice Centers (EPCs), sponsors the development of evidence reports and technology assessments to assist public- and private-sector organizations in their efforts to improve the quality of health care in the United States. The reports and assessments provide organizations with comprehensive, science-based information on common, costly medical conditions and new health care technologies. The EPCs systematically review the relevant scientific literature on topics assigned to them by AHRQ and conduct additional analyses when appropriate prior to developing their reports and assessments.
To bring the broadest range of experts into the development of evidence reports and health technology assessments, AHRQ encourages the EPCs to form partnerships and enter into collaborations with other medical and research organizations. The EPCs work with these partner organizations to ensure that the evidence reports and technology assessments they produce will become building blocks for health care quality improvement projects throughout the Nation. The reports undergo peer review prior to their release.
AHRQ expects that the EPC evidence reports and technology assessments will inform individual health plans, providers, and purchasers as well as the health care system as a whole by providing important information to help improve health care quality.
We welcome written comments on this evidence report. They may be sent to: Director, Center for Practice and Technology Assessment, Agency for Healthcare Research and Quality, 6010 Executive Blvd., Suite 300, Rockville, MD 20852.
| John M. Eisenberg, M.D. | Douglas B. Kamerow, M.D. |
| Director Agency for Healthcare Research and Quality | Director, Center for Practice and Technology Assessment Agency for Healthcare Research and Quality |
| The authors of this report are responsible for its content. Statements in the report should not be construed as endorsement by the Agency for Healthcare Research and Quality or the U.S. Department of Health and Human Services of a particular drug, device, test, treatment, or other clinical service. |
Objectives. Preterm labor is often a prelude to early births and the significant attendant burden of infant morbidity and mortality. To address the question of how best to manage women in preterm labor, the Research Triangle Institute-University of North Carolina Evidence-based Practice Center undertook a rigorous review of the scientific literature on: (1) appropriate criteria for diagnosing preterm labor and use of three biologic markers, fetal fibronectin (fFN), endovaginal ultrasound (EVUSD), and salivary estriol; (2) efficacy and effectiveness of tocolytics-pharmaceutical agents including beta-mimetics, calcium channel blockers, magnesium sulfate, nonsteroidal antiinflammatory disease drugs (NSAIDs), and ethanol that arrest preterm labor symptoms); (3) efficacy and effectiveness of antibiotics for treating covert infections that might have prompted preterm labor; and (4) efficacy of home uterine activity monitoring.
Search Strategy. We conducted a detailed search of the relevant literature using MEDLINE; EMBASE; the Cochrane Collaboration and its related York Database, International Pharmaceutical Abstracts; the Health Economic Evaluations Database; Genderwatch; and the Population Index. The Medical Subject Headings included premature labor, diagnosis, epidemiologic study characteristics, terms for specific therapies (biologic markers, antibiotics, and tocolytic agents) and costs, cost analysis, and cost-benefit analysis. We conducted an extensive search of the gray literature, chiefly government documents, with a focus on tocolytics and biologic markers.
Selection Criteria. Our inclusion criteria were randomized controlled trials (RCTs) and cohort or case-control studies on biologic markers, tocolytics, and antibiotics that met the following specifications: published from 1966 to 1999 in English, French, or German; including pregnant females of any age with signs and symptoms of preterm labor; involving inpatient and outpatient settings; and measuring delivery, maternal morbidities, and infant health outcomes. Home uterine activity monitoring was further restricted to RCTs published in English since 1980.
Data Collection and Analysis. We conducted dual and blinded reviews of abstracts of articles and pulled full articles. Those articles meeting inclusion criteria were given a detailed dual review and abstraction onto four (topic-specific) data extraction forms. We judged the quality of the individual articles (on internal validity, statistical considerations, clinical relevance of findings, an external validity) and the strength of the evidence overall in each topic area (considering consistency of findings over all studies, quality rating of each study, magnitude of important outcomes, and meta-analysis results). In addition to the systematic reviews of the literature, we conducted meta-analyses of treatment effects from first-line and maintenance tocolytics, antibiotics, and home uterine activity monitoring.
Main Results. The quality of this literature is questionable in many respects, including the definition of preterm labor, the size of the trials, confounding of results because of use of cointerventions, reliance on bivariate analyses without including stratified analyses such as would be available through survival analysis, and failure to separately analyze women who have conditions that culminate in medically indicated preterm births.
Both fFN and EVUSD present strong evidence of effectiveness from cohort studies as diagnostic tools for assessing the risk of preterm birth in women with preterm labor, mainly in terms of their negative predictive value. Both can identify women who are not likely to experience a preterm birth and thus help clinicians avoid unneeded therapy. The efficacy of these findings from RCTs is not yet available.
First-line treatment with certain tocolytics offers small improvements in pregnancy prolongation. Whether this prolongation is beneficial in improving neonatal outcomes has not been established. Tocolytics other than beta-mimetics offer greater efficacy at less risk; the benefits of beta-mimetics never exceeded other options, and their harms were potentially more severe than for other options. Ethanol is less efficacious than other tocolytic options and is an inappropriate treatment for women with preterm labor symptoms. In terms of gestational age at birth, prolongation of pregnancy, or birth weight, maintenance treatment with tocolytics conferred no benefits. Evidence of harms was high for beta-mimetics in relation to the probability of maternal risk (serious cardiovascular harms, minor cardiovascular harms, metabolic harms, and psychologic harms); harms for other tocolytics were regarded as low for maternal risk. All classes of tocolytics posed low short-term risk for fetal or neonatal outcomes. Long-term risk to the infant was not evaluated.
Therapy with antibiotics for occult in utero infections believed to be associated with preterm labor had mixed results, but meta-analysis indicated marginally significant improvements in two maternal outcomes (length of pregnancy and gestational age). Significant increases in birth weight were not found.
No RCT of home uterine activity monitoring that controlled for the cointervention of nursing support found a significant effect from use of this device. Meta-analysis confirmed this "no- effect" conclusion in relation to gestational age at birth and birth weight.
Conclusions. Based on the literature reviewed, two biologic markers (fFN and EVUSD) are found to be quite useful in identifying women in preterm labor who are at low risk of experiencing a preterm birth. Although the evidence remains mixed, certain tocolytics-beta- mimetics, calcium channel blockers, magnesium sulfate, and NSAIDs-appear effective in prolonging pregnancy when used as first-line agents in arresting preterm labor, but beta- mimetics in comparison with other tocolytics seem to present a higher risk of maternal harms. Ethanol is not found to be an appropriate tocolytic agent. Tocolytics are not useful as maintenance interventions. Antibiotics for suspected genital tract infections may be useful. Home uterine activity monitoring was found to confer no maternal or fetal/neonatal benefits.
The literature concerning the management of preterm labor is flawed in several ways relating largely to definitions of preterm labor, appropriate measures of outcomes, and use of survival analysis. Epidemiologic research as well as additional efficacy and effectiveness studies of the two biologic markers, selected tocolytics, and antibiotics are warranted.
This document is in the public domain and may be used and reprinted without permission except those copyrighted materials noted for which further reproduction is prohibited without the specific permission of copyright holders.
Suggested Citation:
Berkman, ND, Thorp, JM Jr, Hartmann,
KE, et al. Management of preterm labor. Evidence Report/Technology Assessment
No. 18 (Prepared by Research Triangle Institute under Contract No. 290-97-0011).
AHRQ Publication No. 01-E021. Rockville (MD) Agency for Healthcare Research and
Quality. December 2000.
Women's health is a major concern in the Nation today, particularly in the areas of maternal health and pregnancy. Within this framework, the American College of Obstetricians and Gynecologists nominated issues for consideration by the Agency for Healthcare Research and Quality (AHRQ) relating to the care of women with preterm labor. Of specific concern are effective strategies for identifying and treating women with symptoms of preterm labor so as to prevent preterm births (i.e., births before 37 full weeks of gestation) and subsequent infant morbidity and mortality.
Definitions of preterm labor vary, but the research criteria commonly hold it to be contractions occurring between 20 and 36 weeks' gestation at a rate of four in 20 minutes or eight in 1 hour with at least one of the following: cervical change over time or dilation greater than or equal to 2.0 cm. Preterm labor is the most common cause of antenatal hospitalization, but the number of pregnancies in which preterm labor symptoms occur is inadequately documented. Preterm labor is recognized as a prelude to early births, with their attendant burden of infant morbidity and mortality. Although the incidence and prevalence of preterm labor are not well understood, the incidence and burdens of preterm births are. Conservatively, 11 percent of all live births, or approximately 440,000 annually, occur before term in this country, and preterm births are responsible for three-quarters of neonatal mortality and one-half of long-term neurological impairments in children. Preterm births account for one-third of all health care spending on infants and one-tenth of spending for children.
In short, the medical, psychological, and economic burdens of preterm births-and by extension, preterm labor that ends in preterm births-are substantial. Uncertainty persists, however, about the best strategies for managing women in preterm labor. Thus, compelling reasons can be amassed for a systematic review of the literature in this clinical area.
To address these issues, the Research Triangle Institute-University of North Carolina Evidence-based Practice Center (RTI-UNC EPC) undertook a rigorous review of the scientific literature on detection and management of preterm labor. This report addresses four main issues: (1) the appropriate criteria for diagnosing preterm labor, specifically with respect to the use of three biologic markers and their positive and negative predictive value; (2) the efficacy and effectiveness of tocolytics (pharmaceutical agents that arrest uterine contractions); (3) the efficacy and effectiveness of antibiotics (with respect to covert infections that might prompt preterm labor); and (4) the efficacy of home uterine activity monitoring in decreasing adverse outcomes in women who are experiencing preterm labor.
The key questions are:
Biologic Markers: What are the appropriate criteria for a diagnosis of preterm labor? Relatedly, how much positive or negative predictive value does the use of biologic markers add to clinical opinion in diagnosing preterm labor?
Tocolytics: What is the efficacy and/or effectiveness of tocolytics in managing preterm labor? The analysis had two subtopics, one concerning first-line tocolytics (treatment for women having acute symptoms) and the other concerning maintenance therapy for women who have experienced an episode of preterm labor.
Antibiotics: What is the efficacy and/or effectiveness of antibiotics in managing preterm labor? The analysis was limited to women suspected of having occult in utero infections.
Home Uterine Activity Monitoring: What is the efficacy of home uterine activity monitoring in decreasing adverse maternal and neonatal outcomes in women who have experienced an episode of preterm labor in the current pregnancy?
Specifically, we investigated three biologic markers: fetal fibronectin (fFN), endovaginal ultrasound (EVUSD), and salivary estriol (E3); because of a lack of studies regarding E3 in laboring women, the remainder of this summary focuses only on fFN and EVUSD. We studied five classes of tocolytics: beta-mimetics, calcium channel blockers, magnesium sulfate, nonsteroidal anti-inflammatory disease drugs (NSAIDs), and ethanol. For these tocolytics, we considered whether they were used as first-line or maintenance regimens; the former applies to acute conditions when a woman's preterm labor symptoms are so significant that a tocolytic agent is used to prevent preterm birth, and the latter applies to use after an episode of acute tocolysis to maintain uterine quiescence. Our review included numerous antibiotics, which are reviewed as one group. Finally, we considered the literature on home uterine activity monitoring for women with preterm labor.
We conducted a detailed search of the relevant literatures using the following databases: MEDLINE; EMBASE; the Cochrane Collaboration and its related York Database, International Pharmaceutical Abstracts; the Health Economic Evaluations Database; Genderwatch; and the Population Index. We also conducted an extensive search of the gray literature, chiefly government documents, with a focus on tocolytics and biologic markers.
We examined literature meeting the following criteria:
Study populations: humans; pregnant females.
Condition: signs and symptoms of preterm labor, not including articles in which all research subjects experienced preterm premature rupture of membranes, medically indicated preterm delivery, or multiple gestation.
Interventions: biologic markers as noted, tocolytics, antibiotics, and home uterine activity monitoring.
Study settings: essentially all inpatient and outpatient settings including patients' homes.
Outcomes: three major categories of outcomes-birth, maternal morbidities, and infant health.
Time period: for pharmacotherapies and biologic markers, 1966 through 1999, depending on the date of approval; for home uterine activity monitoring, 1980 through 1999.
Languages and geographic sites: English, French, and German (English only for home uterine activity monitoring), and excluding locations based on language of publication other than these.
Admissible evidence: efficacy studies in which health care is delivered under ideal settings, identified as randomized controlled trials (RCTs) (double and single blinded); effectiveness studies (non-RCTs, prospective and retrospective cohort studies, and case control studies) in which health care is delivered under ordinary circumstances; only RCTs were used for home uterine activity monitoring; other information included meta-analyses, review articles for reference lists, and cost-effectiveness studies; finally, sample sizes of 40 or more subjects.
The Medical Subject Headings (MeSH) used for the searches were limited by the subject heading "premature labor." With respect to study designs, we included a variety of terms in "epidemiologic study characteristics" (exploded). In addition, we searched for specific therapies, including "biologic markers," "antibiotics," and "tocolytic agents." We searched the intersection of "premature labor" and "diagnosis," and finally, we searched under "costs and cost analysis and cost-benefit analysis."
Upon completion of the initial literature searches, we subjected lists of titles and article abstracts to a dual, blinded review; we retained those articles that both a clinician and a methodologist reviewer indicated should be included and discarded those that both reviewers indicated should be dropped. In cases of disagreement, the Scientific Director for the project made the final selection. In addition, the Scientific Director reexamined a 20-percent sample of abstracts identified for inclusion and all abstracts identified for exclusion; generally, we erred on the side of inclusion.
We combined all citations into a ProCite database, removing duplicate records and identifying articles to be included in evidence tables and those to be excluded (with reason for exclusion). In all, we retrieved 506 studies of benefit from treatment; of these, we included 101 articles.
We developed, through several iterations, various data extraction
forms, specific to each topic. These forms were designed to provide
clear and easily accessible information for entering data into
evidence tables. Included across all studies were the following
variables:
Study designs.
Description of patient population, including maternal age and race, and clinical inclusion/exclusion criteria, such as condition of membranes (ruptured or intact), estimated gestational age, how gestational age was defined, and maternal and fetal indications for birth.
Definition of preterm labor.
Description of the test, treatment, or intervention.
Description of adjunct therapies.
Outcomes measured such as prolongation of pregnancy, gestational age at delivery, rate of preterm births, maternal morbidities, and infant birth weight.
Description of any secondary analyses performed.
The RTI-UNC EPC used two types of abstractors: individuals with content or clinical expertise and those with strong methodologic skills. The clinician abstractors included several obstetricians, one pediatrician with training in women's health, and a nurse midwife; all had prior research experience. The methods abstractors were all doctoral students in the UNC School of Public Health or the UNC Department of Economics with extensive training in quantitative methods. In addition, reviewers included a health services researcher with expertise in quantitative methods (the Project Director) and an obstetrician with expertise in treating women with preterm labor and conducting clinical research in the topic area (the Scientific Director). We instituted substantial quality control and reliability procedures.
We systematically reviewed evidence about side effects of tocolytics, comparing classes of these drugs and effects on the mother and the fetus or neonate, but we did not distinguish between first-line and maintenance therapies. Rather than developing full evidence tables, we created numerous charts depicting the absolute and relative frequency of various harms associated with tocolytic use.
We first graded the quality of individual articles using a specific scale that considered various aspects of the internal and external validity of each study. It encompassed, among other issues, study design, measurement issues, statistical analyses, and the appropriateness of the conclusions being drawn. To assess the internal validity of a study (i.e., the likelihood that the design and conduct of the study minimize systematic error [bias]), we evaluated the following: recruitment strategy; masking of the treating clinician and patient; inclusion of baseline measurements (e.g., contraction frequency, cervical dilation, effacement, and estimated gestational age); and definition of outcome measures, attrition rates, and confounding variables. We evaluated the adequacy of the methods used and whether the investigators assessed the clinical relevancy of the statistical findings. The quality rating instrument assessed whether all three major categories of outcomes were included in a study: delivery, maternal, and infant. With respect to external validity (i.e., whether the findings of the study can be generalized to populations that did not participate in the study), we determined whether the clinical setting was clearly specified and whether conclusions apply to pregnant women in the United States. The quality scores from each abstractor are included in the evidence tables.
We also rated the quality or strength of the collective evidence on each topic according to a categorical, adjudicated rating developed by the Project Director, Scientific Director, and Clinical Methodologist. The quality of the collective evidence takes into account the design quality of the individual studies and the efficacy or effectiveness of reported outcomes. The ratings are as follows:
| Good (A): | The data or evidence are sufficient for evaluating the quality of the findings. The data are consistent and indicate that the key drug or intervention is superior to alternative treatment or placebo for treating women with preterm labor. |
| Fair (B): | The data are sufficient for evaluating the quality of the findings. The data indicate that inconsistencies exist in the findings between the key drug or intervention and alternative or placebo for treating women with preterm labor. |
| Poor (C): | The data are sufficient for evaluating the quality of the findings. The data do not show that the key drug or intervention is superior to alternative treatment or placebo for treating women with preterm labor. |
| Incomplete Evidence (I): | The data are insufficient for assessing the quality of the key drug or intervention for treating women with preterm labor based on limited sample size or poor methodology. |
| Efficacy (1): | Evidence obtained from well-designed RCTs. |
| Effectiveness (2): | Evidence obtained from well-designed cohort or case-control studies. |
We used two designations to grade the overall evidence about harms: "high" if the side effects were life-threatening and their frequency was substantially different from that for alternative treatments or no treatment, and "low" if the side effects were short-term and not life-threatening and their frequency was not substantially different from the frequency for an alternative tocolytic treatment or no treatment.
We presented information from individual studies into pairs of evidence tables-one for study design and the other for outcomes. The content of the study design tables covers the following variables (when available from the publications): statement of the research objective, definition of preterm labor used by the authors, patient inclusion/exclusion criteria, patient demographic characteristics, description of the experimental intervention, total number of participants, and number of participants in each arm of the study at its conclusion. The outcomes tables generally include data on delivery, and maternal and infant outcomes. For the biologic marker outcome tables, we present delivery outcomes in terms of sensitivity, specificity, positive predictive value, and negative predictive value.
In addition to the systematic review of the literature, we conducted meta-analyses focusing on treatment effects of first-line tocolytics by class, maintenance tocolytics by class, antibiotics, and home uterine activity monitoring. The efficacy of treatment was measured in relation to the following outcomes: (1) prolongation of pregnancy, (2) estimated gestational age at delivery, and (3) birth weight. The literature used in the meta-analyses was restricted to RCT studies included in the evidence tables and data concerning women with intact membranes.
The quality of this literature is questionable in many respects, thereby diminishing the confidence with which one can draw conclusions. Among the issues of concern are the definition of preterm labor, the size of the trials, confounding of results because of use of cointerventions, and failure to analyze separately women who have conditions associated with medically indicated preterm births. In addition, we note the general absence (except in a few cases) of the use of survival analytic techniques in preference to the use of dichotomous outcomes (e.g., preterm birth or not) or other categorical outcomes that do not adequately reflect how long a pregnancy "survived" without a preterm birth. Survival analysis allows for the stratification of outcomes by such factors as the gestational age at the onset of preterm labor.
Overall, fFN and EVUSD present strong evidence of effectiveness as a diagnostic tool for assessing the risk of preterm birth in women with symptoms of preterm labor. Both tests only moderately were successful in predicting which women with a positive test would deliver before term, but they consistently exhibited strong negative predictive values, thereby identifying women at low risk of preterm birth.
Literature comparing an intervention group of women receiving first-line treatment with tocolytics relative to a control group was available only for beta-mimetics and magnesium sulfate. Results across studies were mixed: for beta-mimetics, we saw evidence of efficacy in terms of estimated gestational age at birth, prolongation of pregnancy for 1 day or more and improved infant outcomes (birth weight greater than 2500 g). Significant differences were not found between magnesium sulfate and placebo.
With respect to comparisons among different classes of tocolytics (chiefly in relation to beta-mimetics), results again were mixed. Beta-mimetics had efficacy relative to ethanol treatment in relation to prolonging pregnancy in one (older) study, but other RCTs showed that all other classes of tocolytics had greater or not significantly different effects relative to beta-mimetics on delivery outcomes. For infant outcomes, results were also mixed, but no study reported that beta-mimetics were a superior treatment. Data from observational studies did not contradict the results of the RCTs. Meta-analyses suggest that all tocolytics, with the exception of ethanol, were effective in extending pregnancies at or beyond 36 to 38 weeks gestation, compared with a no-treatment group. Beta-mimetics, calcium channel blockers, and magnesium sulfate nearly doubled the odds of term births, relative to control, with potentially small differences in effect sizes between classes.
Overall, the evidence supports the notion that first-line treatment with beta-mimetics, calcium channel blockers, magnesium sulfate, or NSAIDs offers small improvements in prolonging pregnancy. Data concerning relative efficacy are mixed, but they clearly support the conclusion that ethanol is less efficacious than other tocolytic options and support the generally held clinical belief that ethanol is an inappropriate treatment for women with preterm labor symptoms. Concerns that other treatments offer greater efficacy with less risk seem to be warranted in regard to treatment with beta-mimetics as well: the benefits of beta-mimetics never were found to exceed other options, and the maternal harms (see below) were shown to be potentially more severe.
Except for one small study, the efficacy studies showed no difference between treatment and control arms in managing women who had recently experienced an episode of preterm labor, and meta-analysis confirmed these conclusions. In short, for gestational age at birth, prolongation of pregnancy, or birth weight, maintenance treatment conferred no benefits.
We graded beta-mimetics as "high" in probability of maternal risk, including serious cardiovascular harms, minor cardiovascular harms, metabolic harms, and psychologic harms. All other classes of tocolytic treatment were graded as "low" in relation to maternal risk. We graded all classes of tocolytics as "low" risk in relation to fetal or neonatal harms: evidence of short-term harms was inconsistent, and evidence of longer term problems was insufficient.
Results of our review concerning therapy with antibiotics for treating occult in utero infections associated with preterm labor were mixed. Two RCTs found improvements in all three delivery outcomes of interest-prolongation of pregnancy, mean gestational age at birth, and birth at a particular number of weeks- but all other RCT studies showed mixed results or no significant difference from placebo/control group assignment. Results from survival analyses included in three RCTs were also mixed. Meta-analysis showed the following: a marginally significant increase in length of pregnancy of about 6 days; a marginally significant increase of about 0.60 of a week in gestational age resulting from antibiotic treatment; and a small increase in birth weight that was not statistically significant. The array of agents, routes of administration, and durations of therapy preclude us from making a generalization about the optional antibiotic regimen to achieve these benefits.
Of the four RCTs reviewed, none of the three that controlled for nursing support as an element of the monitoring approach found a significant effect from home uterine activity monitoring; meta-analysis confirmed this "no-effect" conclusion in relation to gestational age at birth and birth weight. A fourth RCT reported that home uterine activity monitoring significantly prolonged delivery to 37 weeks, but the approach in this study included nursing support; the separate effect of monitoring was not analyzed.
We encountered numerous difficulties in reviewing this literature, arriving at concrete conclusions, or drawing appropriate inferences about the efficacy or effectiveness of these drugs and other interventions. Methodologic issues included explicit definitions of preterm labor, clarity about cointerventions, and no separate analysis of women who had medically indicated preterm births. We urge investigators to be more precise in their design and description of efficacy and effectiveness studies.
Moreover, researchers differed in their definitions of even standard outcomes, and we have noted already the infrequent use of survival analysis (or related techniques). Our recommendation is that investigators, to the extent possible, supplement and integrate the use of dichotomous outcomes measures (e.g., preterm or term birth) and variables involving specific cutoff points for gestational age and include concepts such as the length of time a pregnancy was extended (i.e., prolongation as a form of survival of the pregnancy). We specifically advocate survival analysis stratified by gestational age at enrollment as the analytic technique of choice in view of its clinical and biologic relevance.
The dearth of reliable epidemiologic information about preterm labor is significant. The areas that require special attention include biologic mechanisms that result in birth before term, incidence and prevalence of preterm labor and the proportion that result in preterm birth, modifiable risk factors for preterm birth, and a better understanding of these basic facts among ethnic minorities.
Further study on risk factors is important because clinicians and researchers must be able to assess accurately the risk of preterm birth in women with symptoms of preterm labor and to exclude women at low risk of early birth from intervention trials. Biologic markers (such as fFN and EVUSD), by virtue of being objective and reproducible, are superior to purely clinical assessment or prognostication of preterm delivery. Nonetheless, we advocate more direct experimental investigation of the predictive characteristics of these two biologic markers. Meanwhile, we recommend that investigators studying tocolytics and antibiotics (in their capacity as part of a medical regimen for women with preterm labor) use biologic marker negativity as an exclusion criterion.
Our results from both the systematic review and meta-analysis for the first-line tocolytics and for antibiotics suggested that further trials of these modalities are justified, although we note that investigators must now attend to the deficits of the efficacy and effectiveness studies to date. Future research should focus on the benefits (and harms) of beta-mimetics, calcium channel blockers, magnesium sulfate, and NSAIDs, in comparison with no treatment and in comparisons with one another.
Finally, we advise against further research on maintenance tocolytics or home uterine activity monitoring. The research to date has made adequately clear that the use of tocolytics in a maintenance capacity following an episode of acute tocolysis has no proven efficacy or effectiveness and that the use of home uterine activity monitoring confers no maternal, fetal, or neonatal benefits.
| Area | Facts Identified (Summary) |
|---|---|
| Incidence and Prevalence | About 11% of pregnant women in the U.S. will experience a spontaneous preterm delivery. 45 Of those, an estimated 40% will occur following preterm labor. About 7% of all live births will be low birth weight babies (about 6% among white and Hispanic babies, 13% among Black babies). Approximately 4 million babies are born in the U.S. each year. 149 |
| Characteristics and Size of Population | Preterm labor afflicts pregnant women of every socioeconomic class, and ethnic group, and age. Etiologic contributions include cigarette smoking, preeclampsia, incompetent cervix, prior preterm birth, abruptio placentae, occult in utero infections, and possibly stress, anxiety, and depression. 9 |
| Practice Setting | Preterm labor can be addressed in clinicians' offices, private and public inpatient and outpatient settings, free-standing birth centers, and patients' homes. |
| Morbidity and Developmental Milestones | For mothers, the morbidity associated with preterm labor can be related to both physical functioning and emotional stress, as well as potential harms from the side effects of diagnostic testing or use of certain drugs. (Mistaking preterm labor for certain other conditions in which drugs used to treat preterm labor can be contraindicated can create its own set of potential adverse events.) The morbidity for babies born preterm can manifest in many ways. Short-term outcomes for newborns can include acute and chronic lung disease, central nervous system hemorrhage, and blindness. Long-term outcomes can involve cerebral palsy, mental retardation, and learning disability, and other forms of developmental delay. 1193 |
| Mortality | The risk of neonatal mortality inversely is associated with gestational age at birth. Approximately, three-quarters of neonatal deaths can be attributed to preterm birth. 94 |
| Quality of Life | More than one-half of neurologic impairments found in children can be attributed to preterm delivery. The quality of life of women in preterm labor can be affected by stress associated with their condition, hospitalization, and need for bed rest. 1195 |
| Loss of Productivity | Employed pregnant women experiencing preterm labor will have reduced economic productivity to the extent that days of bed rest and/or hospitalization are required. Although difficult to measure, individuals with handicaps caused by having been preterm babies experience increased morbidity that may adversely affect their future level of productivity and earning power. 93 |
| Costs to Diagnosis and Treat | The costs to treat women in preterm labor include the costs of drugs (such as tocolytics and possibly antibiotics) and hospitalizations. To those may be added the costs of biologic marker tests to make a more precise diagnosis, the costs of diagnosing possible overt or occult infections, and possibly even amniocentesis. In addition, preterm delivery accounts for approximately 35% of all health care spending on infants and 10% of all health care spending for children. 16 |
| Other Costs or Burdens | In 1988, for children younger than 15 years of age, costs for health care, education, and child care associated with having been of low birth weight (a result of preterm birth) were estimated to be $5.5 to 6 billion. 16 |
| Variations in Practice | No consensus exists regarding the diagnosis or the treatment of preterm labor. Approaches range from essentially "watchful waiting" or traditional "conservative interventions" (e.g., bed rest) to complex diagnostic and therapeutic regimens involving biologic marker tests (diagnosis) and pharmacologic agents (antibiotics, tocolytic drugs and perhaps corticosteroids) and to system-wide (e.g., state or county) programs to detect and manage the condition. 94 |
In theory, early detection and management of preterm labor helps to prevent preterm birth and its potential neonatal sequelae, which include respiratory distress syndrome, sepsis, intraventricular hemorrhage, necrotizing enterocolitis, patent ductus arteriosus, and hyperbilirubinemia; 2 however, widespread treatment of women with signs and symptoms of preterm labor has failed to reduce the prevalence of preterm birth in the United States. Rates of preterm births has not changed meaningfully in 20 years, underscoring the need to evaluate the effectiveness of current interventions.
The reasons for the lack of progress remain undefined, although they are related in part to the rise in multiple gestations resulting from the success of assisted reproductive technologies. 3 Multifetal pregnancies are at higher risk for preterm birth. From 1981 to 1993, preterm births rose from 9.4 to 11 percent, 4 stabilizing at approximately 11 percent through 1997. 5 Keys to this failure to see a decline in the prevalence of preterm births may lie in how we identify, evaluate, and treat women with preterm labor.
Lack of uniform criteria for evaluating and diagnosing preterm labor presents special challenges for clinicians and perinatal researchers attempting to draw conclusions from the medical literature. This evidence report focuses in part on documenting the range, complexity, and lack of uniformity of definitions, and how this pattern influences interpretation of the scientific literature on key clinical questions. We revisit the challenge of definition at appropriate transitions throughout this report.
Care providers are anxious for tools that provide objective categorization of risk and assist diagnosis. Candidate biologic markers are numerous and offer the possibility of refining clinical estimates of the probability that a particular woman's uterine activity will continue, cause cervical change, and result in preterm birth. Nonetheless, such testing is costly and controversial. Little existing literature summarizes the utility of biologic markers in the population of women with preterm labor or compares the literature on the clinical diagnostic properties across tests.
Once a diagnosis is made, in contemporary practice, tocolytics (agents intended to stop contractions and maintain uterine quiescence) are the cornerstone of pharmacologic treatment of preterm labor and are often continued as a maintenance treatment to prevent further preterm labor episodes. Arguably, the effectiveness of these agents, as a class, to prolong gestation to term is limited. This report summarizes the literature on the effectiveness of tocolytics for preventing preterm birth. When possible, we explore the benefit expected from use of tocolytic agents with an emphasis not only on traditional dichotomous interpretations of treatment outcome (i.e., full-term versus preterm births) but also measures that take into account increments of time gained. Two factors drive this interest in additional measures: (1) the continuous improvement in infant outcomes found to be correlated with incremental increases in birth weight and gestational age, 6 and (2) the proven benefit of giving mothers at risk of a preterm birth corticosteroids to promote fetal lung maturation. 2 Therefore, understanding what are the effective, and preferable, means by which to delay preterm birth is also an objective of this work. Equally important is documenting potential harmful maternal and infant effects of treatment with tocolytics and estimating the incidence of serious adverse events.
As occult in utero infection and inflammation increasingly are thought to contribute to the risk of preterm birth, research on the use of antibiotics has proliferated. The literature is replete with studies of antibiotics of many types and doses, by several routes of administration, for differing durations, with and without the use of tocolytics or other cointerventions. This evidence report is an opportunity to consolidate data that may provide insight into the appropriate integration of antibiotics into management.
Preterm labor generally requires initial hospital-based management. The possibility of using technology, such as home uterine activity monitoring, to facilitate safe discharge from inpatient care is attractive for patients, providers, and payers. Home monitoring makes continued frequent surveillance of uterine activity feasible; its status as a management tool, however, remains unclear.
The broad scope, lack of clear clinical management imperatives, and importance of these concerns suggests that the audience for this report will be large and receptive to evidence provided to address the key clinical questions described in detail in the next subsection.
Our key clinical questions focus on three important components of managing preterm labor: diagnosis, pharmacotherapy for arresting the condition, and monitoring symptoms. (See Chapter 2 for a more detailed discussion of our key questions.) First, the key questions involve the use of particular diagnostic tests, so-called biologic markers, to enhance clinical accuracy in predicting which women presenting with signs and symptoms of preterm labor are most likely (or least likely) to result in a preterm birth. These biologic markers include fetal fibronectin (fFN), endovaginal ultrasound (EVUSD), and salivary estriol (E3).
Second, we investigate pharmacotherapies for arresting the condition, including tocolytics to stop uterine contractions and antibiotics for treating possible underlying infections. Tocolytic agents will be examined by class and by whether treatment focuses on arresting current symptoms or, following an episode of preterm labor, are used to maintain uterine quiescence.
The majority of the tocolytic literature concentrates on the class of drugs known as beta-mimetics; it also includes calcium channel blockers, magnesium sulfate, and nonsteroidal anti-inflammatory drugs (NSAIDs). In addition, we review the outcomes of treatment with ethanol, a therapy no longer in accepted practice in the United States. In addition to benefits obtained from tocolytic pharmacotherapy, we also explore the harms. In contrast, antibiotics will be discussed as one group, and potential harms will not be examined.
Third, our clinical questions concern the use of home uterine activity monitoring for detecting early symptoms of preterm labor, which is intended to allow the clinician more time to administer therapies intended to stop the progression to preterm delivery or administer pharmacotherapies to aid fetal development.
Technical experts in the field of the management of preterm labor were identified to provide assistance throughout the project. The Technical Expert Advisory Group (TEAG) (Appendix A) was expected to contribute to advancing the Agency for Healthcare Research and Quality's (AHRQ's) broader goals of (1) creating and maintaining science partnerships as well as public-private partnerships and (2) meeting the needs of an array of potential customers and users of its products. Thus, it was both an additional resource and a sounding board during the project. The TEAG included eight members: six technical/clinical experts; one representative of organizations whose mission concerns the interests and perspectives of patients and consumers; and one potential user of the final evidence report, specifically, an American College of Obstetricians and Gynecologists (ACOG) representative, the organization originally nominating this topic.
To ensure robust, scientifically relevant work, the TEAG was called on to provide
reactions to work in progress and advice on substantive issues or possibly
overlooked areas of research. TEAG members participated in conference calls and
discussions, periodically through e-mail, to:
Refine the key clinical questions at the beginning of the project.
Discuss the preliminary assessment of the literature and to provide input for the data extraction form during the project.
Select topics and guide approaches for meta-analyses toward the end of the project.
In addition, early in the process, senior staff and TEAG members discussed the challenges of designing a literature search concerning a clinical situation that is as difficult to conceptualize as preterm labor. A particular concern potentially resulting from defining the condition based on its symptoms is that the narrowness or breadth of the criteria for enrolling women with preterm labor in research protocols would influence the outcomes of the treatment under study. For instance, in trials with broad definitions of preterm labor, an unknown fraction of the women might have been inappropriate candidates for the therapies they received because their symptoms would have resolved on their own. We decided to proceed with the literature search and data extraction as outlined in the report on the grounds that, since the authors had defined study participants as experiencing preterm labor, we had no means either to exclude studies in an unbiased way or to "adjust" results post hoc for these problems. This concern is, however, what motivates the authors of this report to advocate for future research to include using biologic markers as a means of increasing clinical confidence that an appropriate diagnosis was made for women included in a study.
Because of their extensive knowledge of the literature on preterm labor and their active involvement in professional societies, TEAG members were also asked to participate in the external peer review of the draft report.
Although preterm labor is the most common cause of antenatal hospitalization, the number of pregnancies in which preterm labor occurs is documented inadequately. Only two studies that specifically address the natural history of preterm labor were identified. In a study designed to determine how warning signs of preterm labor relate to eventual diagnosis of preterm labor and preterm birth, Copper, Goldenberg, Davis, et al. 7 followed 329 indigent, high-risk patients. In this small cohort, 27 percent of the participants met the research criteria for diagnosis of preterm labor: "contractions occurring between 20 and 36 weeks' gestation at a rate of four in 20 minutes or eight in 1 hour with at least one of the following: ruptured membranes, cervical change over time, dilatation > 2.0 cm, or cervical length < 1.0 cm." Twelve percent of the cohort had spontaneous preterm births. 7 Also, West, Yawn, Thorp, et al. 8 found that the cumulative incidence of physician diagnosis of preterm labor during the late 1980s and early 1990s was 3.6 to 6.4 percent in a population-based sample of live births in Olmstead County, Minnesota.
Such prospective enumeration of populations of pregnant women is required to accurately measure incidence, the rate of new diagnoses of preterm labor in pregnant women, and prevalence, the proportion of pregnant women receiving treatment for preterm labor, at a given time. Without a denominator from an identified population at risk for developing preterm labor, the majority of existing studies provide "numerator" data that poorly capture the impact of preterm labor on the health care system or on the lives of pregnant women. Each year at least 200,000 infants are born before term after spontaneous preterm labor, 5 9 but the number of other pregnancies in the United States affected by the occurrence of preterm labor that do not result in preterm birth has not been projected accurately.
The incidence and burden of preterm birth is comparatively well understood. Conservatively estimated, 11 percent of all live births, or approximately 440,000 births, occur before term each year in the United States. 5 9 Preterm birth is responsible for 70 to 85 percent of fetal, neonatal, and infant deaths 10 and is second only to fetal deformity in causing perinatal morbidity. Up to one-half of the total burden of neurologic disability in children, including cerebral palsy, is attributable to prematurity. 1 11 This risk of neonatal mortality and morbidity is related directly to gestational age and weight at birth: the more premature the infant, the greater the risk. Infants born before 32 weeks, 2 percent of births, experience the majority of neonatal deaths and congenital neurologic disorders. 12 Even late in the preterm period (35 or 36 weeks gestation), when the overall risk of mortality is modest, 13 an increased risk of infant mortality, 14 respiratory distress syndrome, 12 13 and impaired intellectual development 15 can still be discerned.
The economic costs of caring for preterm infants are substantial. Total care costs are, on average, tenfold higher for preterm infants than for normal-weight infants born at term. 16 Preterm birth accounts for 35 percent of all health care spending on infants and 10 percent of all spending for children. 16 The ongoing medical, educational, and special needs of children born at low birth weights, the majority after preterm birth, are estimated to exceed $5 billion each year. 16 These costs reflect only the care of infants born before term. Additional personal and health care costs accrue from the evaluation and care of women who experienced preterm labor.
To narrow the focus to spontaneous preterm labor potentially preceding spontaneous preterm birth, we first consider the major routes to preterm birth. Preterm births can be classified as spontaneous or medically indicated. Spontaneous preterm births are preceded by spontaneous preterm labor or preterm premature rupture of membranes, or both. 17 18 Medically indicated preterm births follow a decision to allow or cause delivery for maternal or fetal indications. Preeclampsia and presumed fetal growth restriction cause the majority of medically indicated preterm deliveries. 19 Spontaneous preterm labor precedes approximately 43 percent of preterm births; preterm premature rupture of membranes, approximately 32 percent; and medical indications for delivery, the remaining 25 percent. 17
The subtypes of spontaneous preterm birth, preterm labor, and preterm rupture of membranes share many risk factors. 20 21 In fact, 15 to 20 percent of women successfully treated for preterm labor subsequently experience preterm premature rupture of membranes. 21 22 Although this interrelationship is clear, the management of these conditions is divergent. This report focuses exclusively on the management of preterm labor.
The epidemiologic literature focuses on preterm birth, rather than preterm
labor, as an outcome. Because the majority of women presenting for care with
preterm labor signs or symptoms do not have the more serious outcome of
giving birth prematurely, experts believe that examining the association of
factors with spontaneous preterm birth itself (i.e., a "hard" outcome)
reduces misclassification bias and yields more precise estimates of the
importance of specific maternal characteristics and exposures. Research on
determinants of preterm birth have three dominant foci:
Description of risk factors for individuals and populations to guide prenatal surveillance and intervention programs.
Identification of behaviors and conditions that are amenable to change or treatment.
Description of physiologic processes that precede preterm birth and may elucidate etiologic pathways or serve as biologic markers of impending birth.
The last category includes uterine activity, cervical change, and biologic markers such as fFN and E3, which will be reviewed later.
The first category of risk factors consists predominantly of factors for which the mechanism of risk modification is unknown or difficult to separate definitively from socioeconomic, behavioral, or nutritional factors. Many of these "exposures" are relatively fixed (i.e., not amenable to modification after a woman becomes pregnant). The most robust and consistent factor is uniquely informative: women who have had a prior spontaneous preterm birth have a threefold increase in risk of preterm birth in a subsequent pregnancy compared with women who have had only term deliveries. 23 This risk increases with the number of previous preterm births, and the outcome of the most recent pregnancy has the greatest influence. 24 Depending on the population, women with a history of preterm birth deliver before term in 15 to 80 percent of subsequent pregnancies. 23 Presumably this reflects the constancy of unobserved, underlying factors.
In the United States, black race is a risk factor. Adjusting for known confounders, including income and educational status, being black confers a relative risk of 2.0 to 3.5 of having a preterm baby when compared with being white. 13 25 This divergence of risk is most pronounced at the lowest gestational ages (i.e., below 27 weeks). 26 The risk of other minorities in the United States is characterized too poorly to generalize.
Additional reported factors associated with increased risk include both very young and advanced maternal age, single marital status, low socioeconomic status, and some of its isolated correlates such as low income, poor access to and quality of care, and limited educational attainment. Poor nutritional status and low pre-pregnancy body weight also are related to increased risk. 27 28 These last two examples underscore the overlap between descriptive markers of risk and the possibility that deprivation plays a role in some cases, whereas knowledge and intentional behavior factors, such as nutritional habits, play a role in others.
Cigarette smoking is the most "costly" preventable risk from a population perspective. Smoking contributes independently to the risk of prematurity and to decreased birth weight. Adjusting for other factors, smokers are 40 to 70 percent more likely than nonsmokers to deliver before 37 weeks gestation. 29 30 31 Other potentially remediable risks include cocaine use, poor diet, and fasting. 32 33 Alcohol and marijuana use appear to be unrelated to spontaneous preterm birth risk. Data on the effects of stress, caffeine consumption, and physical exertion conflict.
Like behaviors, infections are risk factors that also can be modified before the window of risk for preterm labor. Alterations in vaginal flora are associated with increased risk of preterm birth. One of numerous possible infections is bacterial vaginosis (BV), a disruption of normal vaginal flora characterized by a superabundance of Gardnerella vaginalis , mixed anaerobes, and/or genital mycoplasmas. Its presence during pregnancy is associated with a 1.5- to 4.0-fold increased risk of preterm birth 34 35 36 37 38 39 and approximately twice the likelihood of preterm labor. 37 38 40 41 Black women are more likely than white women to have BV, 36 39 42 43 a finding that might help to explain the higher rate of preterm births in black women. Research regarding both pregnant and nonpregnant women has suggested that BV in the lower genital tract is associated with increased risk of infection in the upper genital tract; the latter may be one mechanism by which infection incites inflammation, uterine contractions, and subsequent preterm labor. Urinary tract infection also has been linked variably to increased risk. 20
Aggressive screening and treatment for genital and urinary tract infections have varied effectiveness for reducing preterm births; some studies demonstrate decreased rates and others fail to find benefit. 20 The physiologic mechanisms that may link infection, inflammation, and preterm labor are addressed in a subsequent discussion concerning the rationale for antibiotic use in the management of preterm labor.
Although the etiologic pathways to preterm birth are uncharted, substantial heterogeneity exists in the factors that contribute to the likelihood that any individual woman will experience preterm labor or deliver before term as a result of preterm labor. Conflicting results of studies assessing risk factors for preterm birth should not be viewed as irreconcilable. The distribution of fixed and modifiable risk factors in selected populations of pregnant women is sure to vary. As a result, synergy among risk factors-for instance, race and smoking and diet, or age and multiple gestation and socioeconomic status-also will vary. It is precisely the many permutations of risk factors and the complexity of their interactions that have thwarted efforts to use this knowledge in care settings to identify women at risk of preterm labor and spontaneous preterm birth.
With rare exceptions, risk scoring systems are empirically derived from what we know about the fixed and modifiable risks described above. Despite statistical refinements, they are neither sufficiently sensitive nor specific to guide care. The most successful tool, developed by Creasy, Golbud, Laros, et al. 44 had a sensitivity of 64 percent and specificity of 90 percent, with a positive predictive value of 30 percent when evaluated in a general obstetric population for prediction of preterm birth.
More than one-half of spontaneous preterm births occur in women with no apparent risk factors. 45 The limited utility of predictive tools, therefore, is less an indictment of the literature than it is a reflection of the lack of a gold standard, other than preterm birth, against which to measure the usefulness of prediction tools.
Labor at term occurs when uterine contractions lead to progressive cervical dilation and effacement. By extension, the hallmarks of preterm labor are uterine activity and cervical change; however, uniformly accepted standards for defining preterm labor do not exist. Therefore, diagnosis of preterm labor invokes an essential circularity: this patient has the kind of uterine activity that may lead to preterm birth if not treated. Thus, intervention precludes definitive diagnosis. As a surrogate, operational definitions of preterm labor seek to quantify how many contractions over what period of time are occurring (e.g., more than four contractions in 20 minutes or five to eight contractions in 1 hour) they also often require documentation of change in dilation or effacement of the cervix.
In practice, clinicians weigh many highly individual factors not captured in formal definitions: Are the contractions painful? Does the mother recognize the contractions? Does she report other symptoms of labor such as back pain, pelvic pressure, vaginal bleeding, or vaginal discharge? What is her individual risk profile? Has she had prior preterm births? Are other processes at work that may be provoking uterine activity, such as a urinary tract infection, substance use, or placental compromise, including abruption (premature separation of the placenta)?
Once other explanations are excluded, two factors drive intervention for preterm labor: (1) estimates of the probability of progressive labor and (2) the degree of immaturity of the fetus. Higher probability and younger gestational age may lead to intervention based on less strict criteria than would otherwise be used. For example, allowing a mother at 28 weeks' gestation to continue to contract with the goal of documenting cervical change simply to meet diagnostic criteria is problematic at best. Because (in theory) achieving uterine quiescence is easier earlier in the cascade of labor events, early intervention is believed to be desirable.
Symptoms, including regular contractions before term, are common; hence, their presence is not highly predictive for discerning who will have progressive cervical change or go on to have a preterm delivery. In a study of 763 women who made unscheduled visits for symptoms suggestive of preterm labor, only 18 percent delivered before 37 weeks; moreover, only 3 percent delivered within the next 2 weeks. 46 Early detection of symptoms 47 48 49 50 51 and actual measurement of uterine contractions with home monitors 49 50 51 52 have failed to lower preterm delivery rates.
These findings suggest that women with contractions and symptoms that will lead to preterm birth are indistinguishable from the larger group who has incidental uterine activity, making it difficult to target surveillance and treatment efficiently. As the studies presented in the evidence tables show, the great majority of women diagnosed with preterm labor will not deliver before term.
The cervical changes caused by labor (either term or preterm) reflect the culmination of a complex set of molecular changes in cervical connective tissue clinically known as "ripening" or "softening." 53 54 Although the sequence of events leading to cervical change, especially the triggers, is less clear for preterm labor than for term labor, the basic physiology is considered to be similar. The uterine wall consists primarily of smooth muscle or myometrium. Contractions result from shortening of the muscle fibers provoked by a stimulus, such as oxytocin or prostaglandins. At the cellular level, regulated release of intracellular calcium is required to set in motion the cascade that allows myosin and actin filaments within the muscle cell to generate force by shortening. To convert from the irregular, disorganized contractions common throughout pregnancy to coordinated contractions, the cells in the uterus create new channels called gap junctions to facilitate the flow of calcium and other intercellular signals. This process results in contractions in which the upper segment of the uterus provides downward expulsive force, pressing the presenting part of the infant against the cervix.
In addition to the mechanical force of uterine contractions, structural changes take place in the cervical tissue itself. White blood cells infiltrate the cervix and release various enzymes (proteases, collagenases, and elastases) that weaken the collagen and protein matrix of the cervix. This weakening allows the stretching and distortion of the cervix required for dilation and effacement.
Many types of stimuli may contribute to the initiation and perpetuation of contractions and to recruitment of inflammatory factors required for cervical change, which potentially provides multiple potential points for therapeutic intervention. In the discussion of management, we review how biologic markers can be used to detect important physiologic changes associated with preterm labor, the mechanism of action for each category of pharmacologic agents, potential harms associated with the use of tocolytics, and the rationale for use of home uterine activity monitoring.
Clinicians frequently use a combination of treatments in caring for women with preterm labor. As noted earlier, the chief focus of this report centers on tocolytics and antibiotics. We do not review combinations of treatment that might be employed because the number of different combinations of pharmaceutical and nonpharmaceutical interventions is very large. In addition, frequent cointerventions that are not reviewed specifically in this report are bed rest and hydration.
Over the past decade, two biologic markers-fFN in cervicovaginal secretions and EVUSD of the cervix have been investigated extensively. When used as tests to assess women with symptoms of preterm labor, these biologic markers have been shown to increase the precision by which clinicians can assess the likelihood of preterm birth. Our third marker, E3, has not been evaluated in a study restricted to women in preterm labor.
The value of these biologic markers lies in their ability to add predictive value (either positive or negative) to the clinician's diagnostic abilities. In other words, their purpose is to predict more accurately which patients presenting with signs and symptoms of preterm labor are likely to deliver prematurely. If biologic markers have strong positive predictive value, a positive test would indicate which women are more likely to deliver. The converse is equally valuable: if biologic markers have strong negative predictive value, a negative test predicts which women are likely not to progress to delivery and, thereby, enable women and their clinicians to avoid unnecessary treatment.
In clinical practice, when any of the three biologic markers is used in a group of women with preterm labor symptoms to predict the likelihood of spontaneous preterm birth, fFN, 46 51 EVUSD, 55 56 and E3 40 demonstrate moderate positive predictive value (20 to 40 percent) and good negative predictive value (85 to 95 percent). In logistic regression models, the presence or absence of these biologic markers explains significantly more of the variability in preterm birth in women with preterm labor symptoms than do traditional clinical assessment tools, such as cervical dilation by digital assessment or uterine contraction frequency.
The fFN is a glycoprotein with high molecular weight that is expressed only during fetal life and oncogenesis. This molecule serves an adhesive function, helping to anchor the placenta to the uterine lining. With either labor or intrauterine infection, 57 fFN is released into the lower genital tract (cervix and vagina). Putative release mechanisms include placental shearing, cytokine-induced overproduction by the decidua, and occult leakage of amniotic fluid. The presence of fFN in the cervix or vagina precedes the onset of both term and preterm births. 46 The fetal isoform of fibronectin can be detected in lower genital secretions using a specific monoclonal antibody and enzyme-linked assay techniques.
Before the development of ultrasound transducers that could be inserted vaginally, the cervix in women with preterm labor symptoms was assessed by either digital examination or transabdominal ultrasound. The former technique is less precise and subject to intraobserver variation; however, it is sufficient to clinically categorize degree of risk. 58 The latter technique is limited by the abdominal wall and the symphisis pubis. EVUSD provides reliable and reproducible views of the cervix that are easily obtained. The rationale for use rests on the ability to see and measure changes in cervical dimensions that may be associated with cervical ripening and/or the prior influence of contractile activity.
Prospective studies in women with preterm labor symptoms have demonstrated that EVUSD assessment of cervical length (distance between internal and external cervical openings) and dilation of the internal os (labeled dynamic change, beaking, funneling) is superior to digital assessment in assigning risk of preterm birth in women with preterm labor symptoms. 59
The placenta in humans produces corticotropin-releasing hormone (CRH), and CRH levels gradually increase with gestational age. Unlike the pituitary-adrenal axis, in which cortisol has a negative feedback effect on CRH production, cortisol exerts positive feedback increasing placental CRH production. Placenta production of CRH increases dramatically just before both term and preterm labor.
CRH produced in the placenta enters both the maternal and fetal circulatory systems. 60 In the fetus, CRH induces the production of steroid precursors in the adrenal gland that are further synthesized by the fetal liver and placenta, culminating in the production of E3. The E3 freely enters the maternal circulatory system after it is synthesized by the fetus. Thus, maternal E3 is a byproduct of placental CRH production. A surge in maternal E3 blood levels occurs before labor.
Researchers have assessed the materno-placental-fetal CRH axis indirectly by measuring E3 in maternal saliva to predict preterm delivery risk in women at increased risk of preterm labor. 61 Relevant to this report, studies assessing its utility in symptomatic women were not identified.
Tocolytic drugs decrease the uterine muscle's ability to contract; the goal is to diminish the force and frequency of contractions and preferably bring them to a halt. Many agents have been used, including ethanol, magnesium sulfate, calcium channel blockers, oxytocin antagonists, NSAIDs, and beta-mimetic agonists. Tocolytics can be administered in two ways: (1) those used at initial presentation grouped under the rubric "first-line therapy", and (2) those given to maintain uterine quiescence after first-line tocolysis are labeled "maintenance therapy." The same classes of drugs are used in both therapeutic approaches, although parenteral administration predominates in first-line treatment and oral administration is the delivery route of choice for maintenance therapy.
Two types of beta receptors are described in humans. 62 Beta-one receptors predominate in the heart and adipose tissue, whereas beta-two receptors are localized in the smooth muscle of the uterus, blood vessels, and bronchioles. Beta-mimetics, also called beta-adrenergic agonists, bind to beta receptors in these cells, activating the enzyme adenylate cyclase and causing an increase in intracellular cyclic adenosine monophosphate (cAMP). 63 Increases in cAMP initiates reduction in intracellular calcium and reduces the ability of the myosin-actin interactions to generate contractile force. 64 Thus, interactions between beta-mimetics and the beta-two receptor can inhibit uterine muscle activity.
Unfortunately, molecules with absolute selectivity for beta-two receptors have not been discovered, and even agonists with some degree of beta-two selectivity interact with beta-one receptors. This interaction explains the characteristic pattern of cardiovascular side effects (hypotension, tachycardia, arrhythmia, heart failure) and metabolic side effects (hyperglycemia, hypokalemia) seen in clinical attempts to block beta-two receptors and achieve tocolysis. 65 Beta-mimetic drugs include ritodrine, terbutaline, isoxuprine, salbutamol, fenoterol, hexoprenaline, and nylidrin.
Calcium channel blockers prevent the entry of calcium into smooth muscle cells, which occurs by two mechanisms: (1) blockage of calcium channels 66 and (2) suppression of calcium release from the intracellular stores. 67 The end result is smooth muscle relaxation, including the muscular activity of the uterus. This smooth muscle relaxation is nonspecific and can lead to peripheral vasodilation with hypotension. Calcium channel blockers used for tocolysis include nifedipine, nicardipine, and verapamil.
Magnesium is a bivalent cation, as is calcium. Magnesium exerts its muscle-relaxing properties by competing with calcium rather than by interacting with receptors. The primary site for this competition is the intracellular stores within the sarcoplasmic reticulum. 68 At sufficiently high levels, magnesium displaces some of the calcium molecules that would otherwise have been held in store to drive actin and myosin interaction. Because magnesium cannot play the same role as calcium in facilitating contraction, a relative local shortage of calcium within intracellular stores lengthens the time required between contractions to repolarize the contractile units thereby decreasing the force of contractions. Serum magnesium concentrations of 4 to 8 mEq/L, which are roughly 2 to 4 times normal physiologic values, inhibit uterine activity in humans. 69
NSAIDs inhibit production of prostaglandins, which are potent uterine muscle stimulants at and before term. 70 Because prostaglandins enhance uterine contractility, inhibition of prostaglandin formation depresses uterine activity. 71 72 73 Examples of NSAIDs used for tocolysis include aspirin, indomethacin, naproxen, ketorlac, and ibuprofen.
Oxytocin is a pituitary hormone thought to be critical in human parturition. 74 Its physiologic role is to stimulate uterine contractility. Uterine muscle has specific cell membrane receptors for the molecule. 75 Antagonists have been synthesized by altering the eight-amino-acid sequence of oxytocin.
Atosiban is a selective oxytocin antagonist molecule that binds to the uterine receptors in place of oxytocin. Because it does not initiate the cascade of events that lead to contractions and it blocks oxytocin from binding by occupying its place on the receptors, atosiban is capable of inhibiting uterine contractions in women with preterm labor. 76 77 78
Alcohol was the first tocolytic agent used for prevention of preterm birth in women with signs and symptoms of preterm labor. 79 It inhibits uterine contractility by diminishing secretion of two neurohypopheseal hormones: antidiuretic hormone and oxytocin. 80 Ethanol reduces the pulsatile excretion of oxytocin from the pituitary seen in normal labor, thereby causing a tocolytic effect. 81 Although alcohol is no longer used in the management of preterm labor, we mention it here primarily for historical comparison.
The examination of the efficacy and effectiveness of treatments for any condition must be balanced against the harms caused by that treatment; in this case, the harms associated with tocolytic treatment. Such ill effects of treatments may be described by a number of terms, such as side effects, harms, untoward events, and adverse events. For ease of description, we will refer to them as "harms."
Every treatment has harms associated with its use. The harm may be associated directly with the therapeutic effect of the agent, as in myelosuppression related to the effect of chemotherapeutic agents. Harms may also relate to the effects of medications on organ systems distant from the target organ intended for the therapeutic effect, such as the gastric toxicity of NSAIDs. Finally, idiosyncratic reactions may occur, such as serious allergic reactions.
The level of harmful effect considered tolerable in the literature is variable. The tolerance level for harms may be somewhat greater if the condition under treatment is very severe or has a poor prognosis, if the harms are very transient and of little consequence other than patient inconvenience, or if the effect size of the efficacy of the treatment under consideration is very great.
The identification of harms in the medical literature is hindered by
several issues, as follows:
The description of the efficacy of the treatment is often much more detailed and complete than the description of harms.
Attention is often not paid to the detailed issues of assessment of harms. Variables often are described poorly, making comparison of rates of harms among studies difficult. For example, articles may describe a side effect of beta-mimetic agents as "arrhythmia." No detail is given as to the definition of arrhythmia, which technically may include disturbances such as sinus tachycardia, supraventricular tachycardias, and more serious disturbances such as atrial fibrillation and the potentially lethal ventricular tachycardia. Combining these various cardiac disturbances under "arrhythmia" may lose a great deal of information.
Serious adverse events may be rare, but important. Most randomized controlled trials (RCTs) in the treatment of preterm labor have fewer than 100 subjects in the treatment group. Low frequency but serious adverse events with rates of 1 percent or less may not be detected at all in a trial of this size, or they may be ascribed to chance alone.
Followup of children born to women who have received therapy with tocolytic agents generally lasts only until the end of the child's hospitalization. No information is available on issues such as late cognitive development. Such issues could be resolved only by conducting very large RCTs with a long followup period, because children whose mothers have undergone premature labor may have been born prematurely with low birth weight, placing them at risk for adverse developmental outcomes. Developmental diseases may be subtle or appear long after delivery. In utero exposure to diethylstilbestrol is a classic example of such.
Controversy also exists concerning the potential benefits and harms that might affect the central nervous system of a neonate exposed in utero to magnesium sulfate. Observational studies suggest two important, yet contradictory, findings: (1) Magnesium has been found to have a neuroprotective effect in the development of cerebral palsy, 82 a potentially serious morbidity associated with preterm birth; (2) conversely, researchers have found that magnesium may increase the risk of perinatal death. 83 Given the disparity of findings and the dual roles of magnesium as a tocolytic and antiseizure medicine in preeclampsia, these differences can be resolved only through large RCTs.
Of the management tools for preterm labor, tocolytics carry the greatest potential for minor and serious side effects. The minor side effects are common; beta-mimetics are potent cardiovascular stimulants associated with disconcerting, but generally benign, signs and symptoms, including tachycardia, palpitations, or chest tightness. In rare instances, beta-mimetic cardiovascular stimulation can cause serious complications such as maternal myocardial ischemia or fetal cardiac effects. They also have metabolic effects including increasing glucose availability and decreasing potassium levels in the blood.
Magnesium when given per strict protocol to women with normal urine output has a wide margin of safety. Its side effect profile is medically minor, but it may cause significant maternal distress side effects, including lethargy, drowsiness, double vision, nausea, and vomiting. Beta-mimetics, magnesium, and calcium channel blockers are all associated with increased risk of pulmonary edema, a serious and potentially lethal maternal complication. NSAIDs have the fewest maternal risks; fetal effects include diminished production of urine with oligohydramnios and premature closure of the ductus arteriosis. These effects usually are transient, but cases of sustained diminished renal capacity and pulmonary hypertension have been reported. Calcium channel blockers as a single agent are thought to have a good safety profile for both mother and fetus. Concomitant use of calcium channel blockers with magnesium sulfate, which some describe as a "primitive" calcium channel blocker, is potentially harmful and has resulted in cardiovascular collapse.
The oxytocin antagonist atosiban, at least within the context of clinical trials, has minimal side effects. Clinicians could not distinguish atosiban from placebo. This cannot be said of other tocolytics because their characteristic side effect profiles are unmaskable for the knowledgeable clinician.
In addressing harms, the report seeks to describe and, when possible, estimate the types and frequency of major and minor harms associated with each class of tocolytics. Such information coupled with estimates of the efficacy of the agents will improve the ability to manage preterm labor in a way that maximizes benefit and minimizes risk.
Women who present with preterm labor symptoms often have occult infections of their upper genital tract that can be detected by amniocentesis and subsequent culture of their amniotic fluid. 84 85 Women in the group who have occult infections are more likely to experience preterm birth than women without evidence of infection. 84 This association of infections of the upper genital tract and preterm birth in women with preterm labor symptoms provides the rationale for using antibiotics in the management of women with preterm labor. Antibiotic administration may cure occult infections, thereby lowering the risk of spontaneous preterm birth. 84
Organisms cultured from the amniotic fluid or placental-decidual interface of women who give birth before term include enteric gram negatives, gram-positive cocci, anaerobic cocci, and mycoplasma species-a mix of genital flora. Given the variety of potential pathogenic organisms, clinicians have empirically chosen broad spectrum antibiotics in an attempt to eradicate these organisms from the upper genital tract in women with preterm labor. Antibiotics used include second- and third-generation cephalosporins, beta-lactam antibiotics with beta-lactamase inhibitors, synthetic penicillins, metronidazole, and combinations of agents such as "triple coverage" (beta-lactam antibiotics, aminoglycoside, and clindamycin).
It is important to distinguish between antibiotics intended to treat occult in utero infection and those intended to prevent early-onset group B streptococcal sepsis in newborns. Preterm infants are particularly susceptible to acquiring group B streptococcal infections at birth from their mothers whose vaginas may be colonized with this organism and may be developing sepsis. The goal of early-onset sepsis prevention can be readily accomplished with single-agent beta-lactam antibiotic treatment during labor.
The goal of home uterine activity monitoring is to detect early labor symptoms in women deemed to be at high risk of preterm delivery, with the assumption that early detection provides an opportunity for successful treatment. Thus, home monitoring proponents do not claim that monitoring in and of itself can prevent preterm birth, but rather that the technology helps clinicians target preventive therapies earlier for women with preterm labor symptoms who are at risk of preterm birth and who may not recognize their early symptoms. This report focuses exclusively on the use of this technology in the management of women who have been diagnosed with preterm labor and released home for monitoring, generally after some treatment.
Home uterine activity monitoring uses a tocodynamometer to record uterine contractions in conjunction with daily telephone calls from a health care provider, usually a nurse, to offer patient support and advice. 86 87 The biologic hypothesis underlying the use of the device is that women will have an identifiable increase in uterine contractions before preterm delivery (i.e., a "crescendo" effect in contractions) and that many women are unable to recognize this prodromal uterine activity. 88 89 The premise on which the usefulness of home uterine activity monitoring rests is that by detecting early contractions, health care providers can treat preterm labor and prevent or decrease the likelihood of preterm birth. 87
Patients attach the external tocodynamometer over the uterus twice each day for 30 to 60 minutes while they are recumbent. 90 Digitized tracings from the tocodynamometer are transmitted over telephone lines for immediate interpretation by a health care provider. The Food and Drug Administration (FDA) has approved a portable tocodynamometer for use in the homes of women with high-risk pregnancies, whose fetuses are of more than 24 weeks' gestation, and who have a history of preterm birth. 91 The device has been shown to record uterine contractions with precision in both hospitalized and ambulatory gravidas. 92
The remainder of this evidence report is organized in the following sections. Chapter 2 provides details about the literature search and review methodology. Specifically included are the causal pathways for our key clinical questions; our approaches to conducting the systematic review, abstracting data from articles, maintaining quality control, and applying a quality rating system for individual articles and similar details. In addition, we comment briefly on the external peer review process. Chapter 3 provides our results on the four main topics covered by this report: biologic markers, tocolytics, antibiotics, and home uterine activity monitoring. Results of our supplemental meta-analysis concerning tocolytics, antibiotics, and home uterine activity monitoring are found in Chapter 4. Chapter 5 provides conclusions, and Chapter 6 offers recommendations for a research agenda on management of preterm labor.
The references cited in the body of the report follow Chapter 6. The complete list of literature considered and used in developing the evidence report appears in the bibliography. Appendix A includes acknowledgments, information on the TEAG, and the peer reviewers for this report. The data extraction forms appear in Appendix B, and a list of acronyms and abbreviations is in Appendix C. The Evidence Tables are contained in their entirety in Volume 2 of this report.
This chapter documents the procedures that the Research Triangle Institute-University of North Carolina at Chapel Hill Evidence-based Practice Center (RTI-UNC EPC) used to develop a comprehensive evidence report for the AHRQ that describes and contrasts the approaches currently used in the management of preterm labor. To set the framework for the review, we present first the key questions and their underlying causal pathways followed by a detailed description of the literature search, which includes documenting the inclusion and exclusion criteria for literature acquisition, selecting relevant MeSH terms, listing databases searched, retrieving the gray literature, and specifying the eligibility criteria for study inclusion. Once we determined which studies met the inclusion/exclusion criteria and were eligible for inclusion, we abstracted data onto data extraction forms and then transferred critical information to evidence tables; these forms are also described in this chapter.
The methodology chapter also discusses quality issues (i.e., the RTI-UNC EPC's quality control procedures with regard to determining the eligibility for inclusion, carrying out data abstraction, and developing the quality rating for individual studies). An evidence report requires an extensive search for all types of literature. Because some published works are of higher methodologic quality than others, the RTI-UNC EPC developed quality rating forms specific to each body of literature studied in relation to managing preterm labor. This chapter describes the development of the rating system and its use in the analysis. Last, the RTI-UNC EPC's procedures for conducting a supplemental meta-analysis and the peer review process are discussed.
We address four key questions in this report. All questions were put into final form with input from the TEAG (Appendix A). After consultation with the TEAG, we determined that two additional questions concerning the enhancement of fetal lung development with corticosteroids and the cost-effectiveness of using antibiotic and tocolytic pharmacotherapies were inappropriate for further study through this report. The decision concerning corticosteroids was based on our conclusion that we would have little to add to the knowledge in the field. In relation to cost-effectiveness studies, we did not find a body of literature to review.
The first question concerns the effectiveness of using three biologic markers in assessing the risk of preterm delivery: fFN found in cervicovaginal secretions, EVUSD of the cervix, and E3. The second and third questions relate to pharmacotherapies for arresting the condition: tocolytics to stop uterine contractions, and antibiotics for treating possible underlying infections. The tocolytic review separates studies based on whether the aim was treatment during the episode of preterm labor and uterine activity (first-line use) or promotion of continued quiescence of the uterus (maintenance use). By contrast, although various antibiotics are used in treating preterm labor, they are discussed as one group. The fourth question concerns using home uterine activity monitoring for detecting early labor uterine contractions.
Our specific key questions are as follows:
What are the appropriate criteria for the diagnosis of preterm labor? Relatedly, how much positive or negative predictive value does the use of biologic markers add to clinical opinion in diagnosing preterm labor?
What is the efficacy and/or effectiveness of tocolytics in managing preterm labor?
What is the efficacy and/or effectiveness of antibiotics in managing preterm labor?
What is the efficacy of home uterine activity monitoring in decreasing adverse maternal and neonatal outcomes in women who have experienced an episode of preterm labor in the current pregnancy?
In the context of managing preterm labor, the efficacy or effectiveness of treatment refers to three major categories of outcomes: delivery, maternal morbidities, and infant health. From a clinical perspective, the outcomes of greatest concern are infant morbidities and mortality associated with preterm birth; however, because these are rare events and difficult to quantify except in large samples, and because prolonging the pregnancy is believed to be correlated with improved infant outcomes, studies also include birth outcomes as intermediate or proxy measures of efficacy.
Our search included both efficacy results from RCTs and effectiveness outcomes from observational research to ensure the greatest breadth of understanding of the research in this area. Efficacy results are obtained from studies where health care is delivered under relatively ideal conditions, such as controlling for confounding interventions. In contrast, effectiveness results are gathered under more normal conditions. The potential harms associated with treatment are presented in relation to tocolytic therapy. Our review of the literature concerning diagnostic testing includes only cohort studies because those were the only ones available. Our review of the use of home uterine activity monitoring includes only English language efficacy studies, because our search was limited based on time and resource considerations.
In 1994, researchers from the United States and other countries participated in a Consensus Development Conference sponsored by the National Institute of Child Health and Human Development (NICHHD) to discuss the perinatal outcomes associated with the use of corticosteroids. The panel assessed relevant studies; most of the outcomes, both maternal and neonatal, reflected data collected in RCTs. The resulting consensus statement and clinical recommendations provide the definitive scientific approach to antenatal corticosteroid therapy. 2
Similar to the AHRQ evidence reports, the NICHHD panel reviewed the literature and evaluated findings using a grading system. The ratings developed for the NICHHD project were based on those constructed by the Canadian Task Force on the Periodic Health Examination (now, Canadian Task Force on Preventive Health Services), as adapted by the U.S. Preventive Services Task Force. The ratings largely capture the nature of the study designs and the strength of the recommendations that can be based on the literature.
The expert panel concluded that antenatal corticosteroid therapy is recommended for most women at risk of preterm birth. The advantages of this therapy include substantial reductions in neonatal morbidity and mortality, with specific effects on respiratory distress syndrome and intraventricular hemorrhage in preterm infants. The benefits identified appear to be greatest 24 hours post-therapy, and the treatment window extends from 24 to 34 weeks' gestation. 2 In addition, several followup studies that tracked children up to 12 years of age failed to find adverse effects in children exposed prenatally to corticosteroids. 2 The expert panel also concluded that corticosteroid therapy provides a cost-effective solution that improves perinatal outcomes and reduces the need for more expensive neonatal interventions for premature infants.
If the biologic markers described in this evidence report can accurately assess the risk of preterm birth in women with preterm labor symptoms, then one would assume that more targeted use of steroids might follow. Targeting antenatal corticosteroid therapy would avoid under and over treatment and would achieve one of the consensus conference's goals.
After reviewing the NICHHD Consensus Development Conference findings to assess their methodologic soundness, we searched for subsequent peer-reviewed literature concerning the effect of treating pregnant women with preterm labor with corticosteroids to enhance fetal lung maturity. In our review of the literature published since the NICHHD findings were released, we did not uncover any more recent studies evaluating treatment efficacy, effectiveness, or cost-effectiveness; all subsequent publications were review articles and meta-analyses. Based on these results, we elected not to revisit this topic in further detail.
One of the issues originally proposed for this review concerned the question of the cost-effectiveness of using biologic markers or antibiotic or tocolytic pharmacotherapies in treating women with preterm labor. Although, in principle, we agree that this is a potentially worthy topic, as shown in the literature review section of this chapter, we also determined that the body of literature specifically covering these questions was insufficient for an evidence-based study. Only one study relating to this topic was uncovered. Thus, with the concordance of our TEAG, we did not investigate this issue in this report.
At this point, the clinician faces two challenges. The first challenge is to decide how to diagnose the preterm labor itself, which may include assessing the likelihood of preterm birth using biologic markers. The second is to determine whether the preterm labor stems from a treatable cause (e.g., an infection) that might be addressed through managing the condition or whether it cannot be attributed to a likely treatable cause and so should be limited to managing the symptoms (e.g., through the use of tocolytics).
These causal pathways can become quite complicated as the possible options and outcomes multiply. In our presentation, the treatment pathways are somewhat simplified because they do not reflect the possibility that clinicians might use more than one type of tocolytic when the first agent that is tried fails or the possibility that different practitioners might invoke the pharmacologic agents at different times in the course of an initial or subsequent preterm labor episode. Similarly, the "diagnosis" part of the causal pathway does not reflect the fact that different practitioners use different clinical criteria for diagnosing preterm labor, including different tests or using the tests in a different order if biologic testing is selected.
This portion of Chapter 2 describes our literature search strategy and results. Included are inclusion/exclusion criteria with regard to population, medical condition, interventions, and setting, quality grading forms containing the criteria for evaluating the quality of the evidence from the studies reviewed, databases searched and search terms used, and additional sources contacted for gray literature and works in progress. Documented here are the steps used to refine the literature search to the final number of articles ultimately reviewed and included in each evidence table.
| Category | Criteria |
|---|---|
| Study population | Humans Pregnant females |
| Condition | Signs and symptoms of preterm labor Exclude if all research subjects experience preterm premature rupture of membranes, medically indicated preterm birth, or multiple gestation |
| Interventions | Biologic markers (restricted to E3, EVUSD, and fFN), antibiotics, tocolytics, and HUAM |
| Study settings | Inpatient and outpatient settings, including patient homes |
| Outcomes | Preterm birth must be measured |
| Time period | 1966 and later HUAM: 1980 and later Depends on date of approval for pharmaceutical products or biologics |
| Geographic site of study | Exclude locations based on language of publication |
| Publication languages | English, French, and German HUAM: English only |
| Admissible evidence | Study design: RCT-double- and single-blinded; non-RCT-prospective and retrospective cohort studies, case control Study design HUAM: RCT only Other designs: meta-analyses, review articles for reference list searches, cost-effectiveness studies Sample size: >40 subjects |
The literature review specialist in cooperation with the study director and the scientific director developed the MeSH search terms used in the analysis. Searches in the National Library of Medicine (NLM) databases (e.g., MEDLINE) were limited by the subject heading: explode premature labor. This phrase is defined in the NLM databases as the onset of labor before term but after the fetus has become viable, usually sometime during estimated gestational ages of 29 through 38 weeks. Both preterm labor and premature delivery default to this subject heading. Preterm premature rupture of membrane, which is excluded from this analysis, is not included under this heading.
To ensure inclusion of relevant study designs, we searched within the MeSH tree under epidemiologic study characteristics, which are defined as types and formulations of studies used in epidemiologic and clinical research. Relevant MeSH key words included under this general topic and searched concurrently through the "exploded" term are clinical trials (Phase I-IV, controlled clinical trials, multicenter studies, and randomized controlled trials), analytic studies, epidemiologic studies, retrospective studies, prospective studies, longitudinal studies, followup studies, feasibility studies, pilot studies, sampling studies, intervention studies, cross-over studies, cohort studies, case control studies, and cross-sectional studies.
In addition, we searched the intersection of the terms premature labor and diagnosis to capture review articles. Bibliographies in review articles were examined as a double check to our electronic literature search to ensure that all relevant articles were included in the study. We searched for economic and cost analyses using the search terms costs and cost analysis and cost-benefit analysis.
We also searched for specific therapies. First, we searched on the relevant diagnostic tests within the term biologic markers, where the scope is defined as measurable and quantifiable biologic parameters that serve as indexes for health- and physiology-related assessments. Specific biologic markers included in the study are salivary estriol (estriol, saliva), endovaginal ultrasound, and fetal fibronectin (fibronectins). Second, we searched exploded antibiotics, which include ampicillin, amoxicillin, metronidazole, erythromycin, ceftizoxime, ampicillin-salbactam. Finally, we exploded tocolytic agents, which include ritodrine, terbutaline, isoxsuprine, magnesium sulfate, nifedipine, nicardipine, indomethacin, fenoterol, ethanol, nylidrin, and salbutamol.
Our primary literature search was conducted through MEDLINE. The MEDLINE database is widely recognized as the premier source for bibliographic coverage of biomedical literature. MEDLINE encompasses information from Index Medicus, Index to Dental Literature, andInternational Nursing, as well as other sources of coverage in the area of communication disorders, population biology, and reproductive biology.
In addition, we searched EMBASE (the electronic version of Excerpta Medica) because it provides a more complete coverage of the literature published in Europe, Latin America, and Asia. Apart from being a database that AHRQ has specified as being of interest, the EMBASE search is important because, historically, drugs have been approved for clinical application earlier in Europe than in the United States. Clinical findings regarding a drug's efficacy may have been published in Australia or Canada before reaching the U.S. market. We used a similar search strategy and terms for EMBASE as were employed for MEDLINE.
Additional databases searched include: (1) the Cochrane Collaboration database to examine relevant meta-analyses and systematic reviews closely related to our topic; (2) the related York database of the National Health Science Center for Reviews and Dissemination; (3)International Pharmaceutical Abstracts, a database devoted exclusively to pharmaceuticals and produced by the American Health-System Pharmacist Association; (4) the Health Economic Evaluations Database for literature concerning cost and economic analyses; (5)Genderwatch, a search engine that covers international journals, magazines, newsletters, regional publications, special reports, and conference proceedings devoted to women's and gender issues; and (6) the Population Index, a reference tool that focuses on the world's population literature. Of all these databases, Genderwatch and the Population Index uncovered no matches with our topics.
In its first task order (Pharmacotherapies of Alcohol Dependence), the RTI-UNC EPC outlined a search strategy for uncovering the so-called "gray literature" (i.e., potentially useful literature and documents that are to be found in outlets other than peer-reviewed journals). For this evidence report on the management of preterm labor, the RTI-UNC EPC has pursued a similar strategy.
Ideally, gray literature searches include reviews of government documents and monographs; industry reports and studies; unpublished studies and works in progress; proceedings and technical bulletins; and FDA Medical Officer's Reviews of original or extended (change of label) New Drug Approval applications, including advisory committee minutes for drugs, and similar reviews of Pre-Market Approval applications for devices. In reality, constraints on time and resources necessitate strategic reductions in the scope of the gray literature search.
We made two important decisions in restricting our gray literature work. Our earliest strategic decision, in consultation with the task order officer, was not to conduct gray literature searches for home uterine activity monitoring devices because of time and resource limitations. We did search for and review Pre-Market Approval documents archived at the FDA Web site (http://www.fda.gov) for home uterine activity monitoring devices, the E3 test, and the fFN test. We identified and reviewed research information offered by the manufacturer of the E3 test SalEst ® (http://www.biex.com), and fFN test (http://www.adeza.com). Both manufacturers provided information such as clinical monographs; however, with the exception of one brief four-person case study that appeared to have been written expressly for the manufacturer, all research to which they referred is available already as journal articles or abstracts.
Our second significant strategic decision was sharply to restrict our search for gray literature on antibiotics, for several reasons that reflect an initial investment of study resources. First, thousands of antibiotics are marketed in the United States alone. For example, just one of the earliest antibiotics, erythromycin, is marketed for human use by almost 20 different pharmaceutical firms; most of them market more than one version (oral and intravenous and/or intramuscular). Multiplying all possible antibiotics by all firms marketing them would create a search the size of which no EPC could undertake.
Second, and more important, in no case is an antibiotic specifically labeled for use in the treatment of preterm labor. Only two antibiotics are approved currently by the FDA for the treatment of bacterial vaginosis in pregnancy, metronidazole and clindamycin (Anti-Infective Drugs Advisory Committee Meeting, 64th Meeting, Issue: Guidance documents on developing antimicrobial drugs: General considerations and individual indications. Wednesday, July 29, 1998.) The RTI-UNC EPC's experience with the gray literature search for alcohol suggests that pharmaceutical firms do not engage in research into off-label uses unless they are preparing an application for a new or changed labeling approval, and they will not otherwise comment on off-label uses for their drugs. Thus, although antibiotics are prescribed routinely in the management of preterm labor, all such prescriptions are "off label."
Additionally, most antibiotics in use, except for the latest generation of cephalosporins, could not have been investigated in pregnant human subjects because the FDA explicitly prohibited the use of "women of childbearing potential" from clinical trials beginning in 1977, and an informal practice of exclusion had been the rule before then. Only in 1993, did the FDA issue a new guideline emphasizing the need for drug research using women as subjects, and the question of conducting drug research in pregnant women is still exceedingly controversial. Most antibiotics, like most drugs, are labeled Pregnancy Category C (harms to fetal development are shown in animal studies but no adequate, well-controlled human studies exist); an antibiotic example includes clarithromycin, which also must carry a Boxed Warning strongly advising against use during pregnancy. Other common antibiotics, such as the aminoglycosides, including streptomycin, fall under Pregnancy Category D (there is evidence of harm to the fetus from human studies, and patients should be advised of the potential hazards). (See, for example, Public Hearing, Content and Format of Labeling for Human Prescription Drugs, Part 15 - Pregnancy Labeling Categories, Friday, September 12, 1997, Bethesda, MD, which we have reviewed as a part of the gray literature search.)
In other words, no pharmaceutical company could have tested its new antibiotic in a sample of pregnant women, even had it wished to do so. Thus, it is neither reasonable nor efficient to devote scarce resources to such a search. The one exception may be in the case of the latest generation of cephalosporins, which are classed in Pregnancy Category B (no evidence of harm from animal studies, but no adequate, well-controlled human studies have been conducted, and the drug should be used only if clearly indicated). Anecdotally, manufacturers have sometimes expressed off-the-record interest in research into cephalosporin use during pregnancy, but we have found no evidence that any manufacturer has petitioned the FDA recently for a change in labeling for use in pregnant women.
Additional strategic decisions concerned tocolytics. The gray literature search on tocolytics appears to reveal that discrepancies arise not from differences between published and unpublished literature but from differing emphases on and interpretations of outcomes and, perhaps, differing ideological approaches to the treatment of women in preterm labor.
This task order includes an evaluation of the tocolytic drug atosiban, an oxytocin-receptor antagonist, approval for which was denied by the FDA's Advisory Committee for Reproductive Health Drugs on April 20, 1998. Despite the failure of the R.W. Johnson Pharmaceutical Research Institute (Johnson & Johnson) to gain approval for atosiban, we have nonetheless located an unpublished manuscript describing the multicenter RCT of atosiban (forthcoming in the American Journal of Obstetrics and Gynecology). We have also obtained and reviewed the minutes of the 1998 Reproductive Health Drugs Advisory Committee meeting. From these minutes, it seems clear that the committee's failure to approve atosiban resulted not from its doubts about the drug's efficacy to halt labor as the endpoint, but rather from its concern as to whether the endpoint of interest ought to be improved perinatal outcome. The Advisory Committee was not satisfied that the manufacturer had collected data addressing such concerns, but it was not in agreement about the kind of perinatal outcomes trial it wanted to see. The manufacturer, Johnson & Johnson, maintains that it proceeded correctly in the initial application. Thus, for the present in the United States, there will be no further efforts to seek approval for atosiban, although it is expected to be approved very soon in 15 European Nations, and the scientific basis for its approval will be published shortly on the European Agency for the Evaluation of Medicinal Products Web site (personal communication, between Sue Tolleson-Rinehart and Johnson & Johnson official, September 23, 1999).
Ritodrine and terbutaline are both beta-mimetic agents. Terbutaline is approved only for treatment of asthma, but it has become the most frequently used tocolytic in the United States. Ritodrine, under the brand name Yutopar ® , was the only drug to receive FDA approval as a tocolytic. The drug, approved in 1980, was available in both oral and intravenous forms until a 1992 meeting of the Fertility and Maternal Drug Advisory Committee, resulting in an FDA request for more testing of high doses of oral ritodrine for tocolytic maintenance. In response, the manufacturer removed oral ritodrine from the market. Controversy still revolves around the clinical information presented to the Drug Advisory Committee at that time from the Canadian Ritodrine Trial; the results of the trial and criticisms of it are in the published literature. 96
Ritodrine in intravenous form remained available as a tocolytic treatment until its two manufacturers, Abbott and AstraZeneca, stopped manufacture. Abbott discontinued the intravenous form before December 1997; AstraZeneca stopped manufacturing it on September 1, 1998, but the last expiration date of previously shipped product was August 1999, meaning that ritodrine could have remained in use until that time. Terbutaline has been employed in an off-label use for tocolysis for the same period and in fact, has surpassed ritodrine in use. In a Fertility and Maternal Drug Advisory Committee meeting in 1993, committee members recommended that the FDA solicit a specific application for change of label to include tocolysis for intravenous terbutaline; also discussed was the possibility of a literature-only application because the literature on terbutaline as a tocolytic was already voluminous.
At roughly the same time, the use of terbutaline in a subcutaneous infusion pump delivery system was increasing. Its advocates (including pregnant women) have generally argued that the terbutaline infusion pump effectively maintains tocolysis at much lower doses than is possible with other delivery systems and that it permits pregnant women to continue tocolysis maintenance at home, rather than in the hospital.
In 1996, the National Women's Health Network presented a Citizen's Petition to the FDA, requesting prohibitory action on the terbutaline subcutaneous infusion pump allegedly because of two maternal deaths that could be attributed to terbutaline use (Docket No. 96P-0258/CP1). On November 13, 1997, Associate Commissioner for Health Affairs Stuart Nightingale issued a "Dear Colleague" letter that said, in part:
Based on information available to the Agency, as well as a review of the published literature, it is clear that the demonstrated value of tocolytics in general is limited to an initial, brief period of treatment, probably no more than 48-72 hours. No benefit from prolonged treatment has been documented. In addition, the safety of long-term subcutaneous administration of terbutaline sulfate, especially on an outpatient basis, has not been adequately addressed.
Published reports on the safety of this use are seriously hampered by methodologic inadequacies. It appears that women receiving continuous subcutaneous terbutaline sulfate infusions experience side-effects and complications similar to those experienced by women receiving terbutaline and other beta-sympathomimetics intravenously. Complications include chest pain, tachycardia, dyspnea, and pulmonary edema. At least one maternal death occurred during outpatient use of a continuous infusion of terbutaline sulfate by subcutaneous pump. The impact of long-term use on maternal glucose metabolism and the risks of prolonged exposure of the fetus are largely unknown.
The "Dear Colleague" letter provoked considerable response from clinicians, researchers, and patient advocates; it apparently induced Matria, Inc., the marketer of the terbutaline pump system, to bring suit against the National Women's Health Network. Advocacy also included a petition from obstetricians to the FDA (Docket No. 98P-0218/CP) to approve subcutaneous terbutaline as a tocolytic treatment. The RTI-UNC EPC has no evidence to date that the FDA has responded to the substance of the petition.
This review for tocolytics suggests that its gray literature actually refers to or summarizes the published literature for policy purposes, rather than presenting unpublished literature. The remaining tocolytic in this evidence report, magnesium sulfate, is not subject to FDA approval as a drug.
As we discussed at the beginning of this section on gray literature, we also examined government documents and manufacturers' information associated with approval and use of assays for E3 and fFN. The first fFN assay was approved on September 21, 1995; a subsequent "Rapid System" by the same manufacturer was approved on August 14, 1998. An E3 assay was approved on April 29, 1998. The available information suggests that most (if not all) of the data relied on by the manufacturers now can be found in published articles and abstracts. Similarly, Pre-Market Approval information for ultrasound devices, including a portable device designed by General Electric expressly for use in obstetrics and gynecology, is straightforward and presents no unusual data. It may be worth noting, however, that the FDA suggests in its guidelines for labeling home uterine activity monitoring devices that they include the following precaution: "No widely accepted controlled studies have been conducted that show that this device is effective at the early detection of preterm labor other than in patients with a previous preterm delivery."
Finally, our efforts to identify unpublished work or other work in progress produced only the atosiban manuscript cited earlier in this chapter.
Once articles were identified through the electronic database search or review article bibliographies, we examined abstracts of the articles to determine whether studies did, in fact, meet our criteria. Two reviewers (one clinician and one methodologist) reviewed abstracts independently for inclusion or exclusion. If both reviewers agreed, an article was retained (or excluded). If the two reviewers disagreed (one indicating eligible and the other judging ineligible), the scientific director made the final decision.
To ensure that all appropriate articles were included and no significant ones were missed, the scientific director rereviewed a 20 percent sample of abstracts determined to be eligible for inclusion as well as all those determined to be ineligible. Generally speaking, for all abstracts, we erred on the side of inclusion rather than exclusion. When abstracts were not available from the electronic search or when there was a question of relevance, the project director or scientific director subjected the full articles to additional scrutiny. In the end, we excluded a study once the full article was examined if, upon this closer scrutiny, we found that it did not conform to the inclusion/exclusion criteria.
| Preterm birth an outcome? |
Yes |
No |
Can't
Tell |
| 40 or more subjects at completion? |
Yes |
No |
Can't
Tell |
| Intervention used as part of treating subjects with signs/symptoms of preterm labor (not a preventive intervention used among asymptomatic patients)? |
Yes |
No |
Can't
Tell |
| Topic | Retrieved | Retained |
|---|---|---|
| Biologic markers | 106 | 17 |
| Tocolytics Benefits Harms | 256 290 | 65 85 |
| Antibiotics | 107 | 15 |
| Home uterine activity monitoring | 37 | 4 |
| Costs and cost analyses | 1 | 0 |
| Totala | 797 | 95 |
Totals reflect a double count of some articles retrieved (retained) for both the tocolytics benefits and harms reviews.
| Biologic Markers | Results |
|---|---|
| MEDLINE (Index Medicus) | |
| 1. Explode labor, premature, and explode epidemiologic studies, and limit to humans | 1,039 |
| 2. Explode biologic markers; or estriol; or fibronectins; or saliva; or ultrasonography, or endovaginal ultrasound | 165,902 |
| 3. 1 and 2 | 35 |
| EMBASE (Excerpta Medica) | |
| 1. Premature labor; or premature delivery | 3,764 |
| 2. Biologic with marker | 12,621 |
| 3. 1 and 2 | 99 |
| International Pharmaceutical Abstracts | |
| 1. Premature labor; or premature delivery | 162 |
| 2. Biologic markers; or estriol; or fibronectin | 85 |
| 3. 1 and 2 | 3 |
| Total unduplicated count from bibliographic databases | 99 |
| 1. Articles found from bibliographies of review articles | 7 |
| Total possible articles | 106 |
| Reductions Based On Abstract and Article Reviews | |
| 1. Opinion or editorial | 5 |
| 2. Study with <40 subjects | 6 |
| 3. Review article | 40 |
| 4. Wrong language | 2 |
| 5. Not preterm labor | 18 |
| 6. Wrong biologic marker | 16 |
| 7. Twin study | 3 |
| 8. Inclusion status not yet obtained | 0 |
| Total articles excluded | 90 |
| Total articles reviewed | 16 |
| Tocolytics | Results |
|---|---|
| MEDLINE (Index Medicus) | |
| 1. Explode labor, premature, and explode epidemiologic studies, and limit to humans | 1,039 |
| 2. Explode tocolytic agents | |
| 3. 1 and 2 | 145 |
| 4. Explode labor, premature and, explode nitroglycerin | 11 |
| EMBASE (Excerpta Medica) | |
| 1. Premature labor; or premature delivery | 3,764 |
| 2. Tocolytic | 438 |
| 3. 1 and 2 | 245 |
| 4. Nitroglycerin | 2 |
| International Pharmaceutical Abstracts | |
| 1. Premature labor; or premature delivery | 162 |
| 2. Tocolytic | 138 |
| 3. 1 and 2 | 32 |
| Total unduplicated count from bibliographic databases | 251 |
| 1. Articles found from bibliographies of review articles | 5 |
| Total possible articles | 256 |
| Reductions Based On Abstract and Article Reviews | |
| 1. Opinion or editorial | 7 |
| 2. Study with <40 subjects | 45 |
| 3. Review articles | 39 |
| 4. Wrong language | 6 |
| 5. Not preterm labor | 71 |
| 6. Not human subjects | 1 |
| 7. Twins | 5 |
| 8. Wrong drug | 5 |
| 9. Unavailable a | 12 |
| Total articles excluded | 191 |
| Total articles reviewed | 65 |
Unavailable: 1 German, 2 French, and 9 English articles were not received in sufficient time for review.
| Harms Related to Tocolytic Use | Results |
|---|---|
| MEDLINE (Index Medicus) | |
| 1. Explode labor, premature | 7,001 |
| 2. Limit 1 to human and English language | 4,299 |
| 3. Explode tocolytic agents | 43,746 |
| 4. Atosiban | 24 |
| 5. Explode nitroglycerin; or explode nitrous oxide | 16,112 |
| 6. 3 or 4 or 5 | 59,383 |
| 7. 2 and 6 | 880 |
| 8. Explode morbidity; or explode mortality; or explode heart failure, congestive | 217,717 |
| 9. Explode arrhythmia; or explode liver diseases; or explode anxiety | 310,250 |
| 10. Explode glucose intolerance; or explode pregnancy complications; or explode cerebral hemorrhage | 196,944 |
| 11. Explode enterocolitis | 4,553 |
| 12. Explode heart hypertrophy; or explode bradycardia; or explode cerebral palsy | 33,993 |
| 13. Adverse effects | 21,287 |
| 14. 8; or 9; or 10; or 11; or 12; or 13 | 719,408 |
| 15. 7 and 14 | 880 |
| 16. Limit 15 to clinical trial | 149 |
| 17. Limit 15 to randomized controlled trial | 90 |
| 18. Limit 15 to review; or review literature; or review of reported cases; or review, academic; or review, multicase; or review, tutorial | 87 |
| 19. Limit 15 to meta-analysis | 1 |
| 20. 16; or 17; or 18; or 19 | 232 |
| 21. 15 and 20 | 200 |
| 22. Case studies | 2,799 |
| 23. Explode cohort studies | 335,464 |
| 24. 22 or 23 | 338,143 |
| 25. 15 and 24 | 88 |
| 26. 21 or 25 | 270 |
| EMBASE (Exerpta Medica) | |
| 1. Premature labor; or premature delivery | 3,764 |
| 2. Tocolytic | 438 |
| 3. Complications; or adverse effects; or harms; or mortality | 182,906 |
| 4. 1 and 2 and 3 | 5 |
| Total unduplicated count from bibliographic databases | 290 |
| Total articles excluded | 205 |
| Total articles abstracted | 85 |
| Antibiotics | Results |
|---|---|
| MEDLINE (Index Medicus) | |
| 1. Explode labor, premature, and explode epidemiologic study characteristics, and limit to humans | 1,039 |
| 2. Explode antibiotics | 306,659 |
| 3. 1 and 2 | 20 |
| EMBASE (Excerpta Medica) | |
| 1. Premature labor; or premature delivery | 3,764 |
| 2. Antibiotics | 79,377 |
| 3. 1 and 2 | 105 |
| International Pharmaceutical Abstracts | |
| 1. Premature labor; or premature delivery | 162 |
| 2. Antibiotics | 10,411 |
| 3. 1 and 2 | 3 |
| Total unduplicated count from bibliographic databases | 105 |
| 1. Articles found from bibliographies of review articles | 2 |
| Total possible articles | 107 |
| Reductions Based On Abstract and Article Reviews | |
| 1. Opinion or editorial | 4 |
| 2. Study with <40 subjects | 6 |
| 3. Review articles | 28 |
| 4. Wrong language | 3 |
| 5. Not preterm labor | 33 |
| 6. Unavailable a | 2 |
| 7. Antibiotics not primary focus | 16 |
| Total articles excluded | 92 |
| Total articles reviewed | 15 |
Unavailable: 2 German articles were not received in sufficient time for review.
| Home Uterine Activity Monitoring | Results |
|---|---|
| MEDLINE (Index Medicus) | |
| 1. Explode labor, premature; or explode uterine contractions; or explode pregnancy | 399,320 |
| 2. Explode uterine monitoring; or explode monitoring, physiologic | 25,688 |
| 3. Explode home care services | 17,002 |
| 4. Fetal monitoring; or tocodynamometer | 4,728 |
| 5. 2 or 4 | 30,315 |
| 6. 1 and 3 and 5 | 54 |
| 7. Home uterine activity monitoring | 51 |
| 8. 6 or 7 | 84 |
| 9. Limit 8 to human, and English language, and year = 1980 to 1999 | 75 |
| 10. Limit 9 to RCTs | 30 |
| Total unduplicated count from bibliographic databases | 30 |
| 1. Articles found through TEAG and scientific director recommendation | 7 |
| Total possible articles | 37 |
| Reductions Based on Abstract and Article Reviews | |
| 1. Not an RCT | 2 |
| 2. Review article | 6 |
| 3. Wrong study population | 15 |
| 4. Cost analysis | 1 |
| 5. Meta-analysis | 1 |
| 6. Letter | 4 |
| 7. Not home uterine activity monitoring | 1 |
| 8. Twin study | 1 |
| 9. Study <40 subjects | 1 |
| 10. Not preterm labor | 1 |
| Total articles excluded | 33 |
| Total articles abstracted | 4 |
| Costs and Cost Analyses (1966-present) | Results |
|---|---|
| MEDLINE (Index Medicus) | |
| 1. Explode costs and cost analysis | 55,836 |
| 2. Explode labor, premature, and explode epidemiologic studies, and limit to humans | 1,039 |
| 3. Explode antibiotics | 306,659 |
| 4. 1 and 2 and 3 | 0 |
| 5. Explode tocolytic agents | 42,787 |
| 6. 1 and 2 and 4 | 0 |
| 7. Explode biologic markers; or estriol; or fibronectins; or saliva | 31,610 |
| 8. 1 and 2 and 6 | 1 |
The project director and scientific director worked with core project staff to develop data extraction forms to use for entry of relevant information from the eligible publications. The data extraction form was developed in an iterative fashion with extensive communication between the methodologists and researchers. We began by outlining common variables across types of studies, including study design features and outcomes. We then created more specialized forms to serve as the most efficient and effective vehicles to extract the necessary information from each of the five bodies of literature included in this review: (1) biologic markers, (2) tocolytic benefits, (3) tocolytic harms, (4) antibiotics, and (5) home uterine activity monitoring.
Included across all studies were the following:
Description of study designs.
Description of patient population, including maternal age and race and clinical. inclusion/exclusion criteria such as condition of membranes (ruptured or intact), estimated gestational age, and maternal and fetal indications for birth.
Definition of preterm labor.
Description of the test, treatment, or intervention.
Description of adjunct therapies.
Description of outcomes measured such as prolongation of pregnancy, gestational age at delivery, maternal morbidities, and infant birth weight.
Description of secondary analyses performed.
These variables were incorporated into the data extraction forms whose basic structure taken from forms developed for the evidence report on pharmacotherapies for alcohol dependence. The draft forms underwent extensive reviews, including pretesting on several randomly selected articles. After the pretest, we revised the forms to increase their utility and efficiency. The final forms can be found in Appendix B.
The tocolytic and antibiotic data extraction forms were quite similar in design. The tocolytic form had one additional question to capture the definition of tocolysis used in the analysis; therefore, only the tocolytic form is included in Appendix B. We made minor modifications to the form design to accommodate the home uterine activity monitoring literature and made more extensive modifications for the biologic marker form. Rather than mostly capturing the efficacy of treatment based on outcomes of RCTs, the biologic marker literature focuses on the effectiveness of the intervention with outcomes presented generally in terms of predictive values (positive and negative) and sensitivity and specificity. Both the home uterine activity monitoring and biologic marker extraction forms also are found in Appendix B.
The RTI-UNC EPC used abstractors with two types of backgrounds for the data extraction process: individuals with content or clinical expertise and those with strong methodologic skills. Several obstetricians, one pediatrician with training in women's health, and a nurse midwife served as clinican abstractors; all had prior research experience. The methods abstractors were doctoral students in the UNC School of Public Health or the UNC Department of Economics who have extensive training in quantitative methods. In addition, reviewers included a health services researcher with expertise in quantitative methods (the project director) and an obstetrician with expertise in treating women with preterm labor and in conducting clinical research in the topic area (the scientific director).
To collect high-quality data, we put necessary instructions directly on the data extraction forms. Based on prior work of the RTI-UNC EPC, we found that any separately distributed written instructions were less reliable sources of help for abstractors than the extraction form itself. In addition, all abstractors attended two formal training sessions. At the first session, we explained the process and goals of the abstraction. Following the training, the abstractors were sent home with an article to review. We then reconvened the group and, through a review of the test article, ensured that the reviewers understood what was expected from their work. For example, we instructed abstractors that when inconsistencies arose between results stated in the text of an article and those presented in tabular form, they were to take data from the text. They also were told to extract precisely what was contained in the article and to reserve opinions concerning the contents to the comments section at the end of the form.
At the completion of abstractor training, the data abstraction process began. The project director, scientific director, and UNC research coordinator monitored progress. Problems or questions that abstractors encountered were routed through the research coordinator to the appropriate senior staff member.
The literature search on side effects or harms of tocolytic use, confined to literature in the English language was conducted using MEDLINE and EMBASE. In addition, we included all abstracted efficacy and effectiveness articles that had been flagged as containing harms information. To ensure a comprehensive review of potential harms and because study results were to be analyzed in aggregate, we did not limit this search by the stringent criteria used for the benefits studies. For example, we retained articles concerning women with ruptured membranes in the analysis because the side effects of tocolytic treatment would be the same as for women with intact membranes.
Because of time and resource constraints, we extracted data using only a simple, rather than dual, abstractive. A certified nurse midwife entered data into a spreadsheet and transferred them into an analytic data set for analysis.
Our harms extraction form (Appendix
B) covered the following types of information:
Types of maternal and neonatal side effects.
Sample size of each arm of the study and total sample size.
Frequency of side effects in each group.
Methodology for harms data collection.
We did not abstract the harms of antibiotic therapy because they are generally well known to clinicians. Pregnancy commonly does not alter the potential complications of antibiotic therapy for mother or baby. One minor exception is tetracycline and its propensity to stain fetal teeth and bone. A potentially serious harm that has not been assessed adequately is the development of resistance to antimicrobials from widespread exposure of pregnant women to antibiotics, which could make treatment of neonatal sepsis more difficult.
Harms results are presented through graphic representation rather than as evidence table entries. This methodology provides a means of comparing the number and percentage of participants experiencing adverse reactions to treatment or placebo. For clarity in the analysis, we grouped the harms into two categories: maternal harms and fetal/newborn harms. We aggregated individual descriptors into clinically sensible categories within each of these broad categories. The aggregation process may lead to some "double counting" of harms. For example, if an article described 15 percent of the subjects as having chest pain and 10 percent as having arrhythmia, then 25 percent of the patients would be assessed as having serious cardiovascular harms. From the studies, one cannot determine whether the patients with the various types of harms are the same or different patients. We did not allow greater than 100 percent of subjects to have harms within a category. Double counting of harms would result when one patient experienced both chest pain and arrhythmia; therefore, the graphs comparing studies should be viewed as somewhat more valuable for demonstrating the relative rates of harms among the therapeutic categories, rather than the absolute rates. (See results in Chapter 3.)
We performed a sensitivity analysis to examine whether such possible double counting would affect the patterns of harms demonstrated in the graphs. A low estimate of harm rates could be attained by selecting the side effect with the highest percentage within each category rather than by adding the harms within categories. In this low-estimate sensitivity analysis, if 25percent of patients had chest pain and 10 percent arrhythmia, then serious maternal cardiac harms would be reported as 25 percent. When we performed this analysis, we found the same overall pattern of harms.
The analysis of fetal/newborn harms and the evaluation of efficacy in other sections of this report overlap to some extent. Rates of fetal demise, sepsis, or intraventricular hemorrhage (IVH) could be considered as either a therapeutic effect or a harm associated with use of the pharmacologic treatment. This information, then, may also be found in the evidence tables.
| Maternal harms |
|
| Fetal/newborn harms |
|
Notes: ECG = electrocardiogram; SVT = suproventricular tachycardia; PVC = premature ventricular contraction; BP = blood pressure; IVH = intraventricular hemorrhage.
Quality control for determining the eligibility for abstraction was described earlier in the section "Basic Approach to Article Selection." In short, all titles and abstracts were subject to dual review, and the scientific director adjudicated discrepancies. Each article was abstracted by two independent abstractors-one a clinician (content reviewer) and the other a methodologist. Abstractors were blinded to the authors' names, the institution that produced the work, and the journal title; however, in some instances, given the distinct formatting of certain journals, reviewers may have recognized journal titles.
Through prior work of the RTI-UNC EPC, we learned that using a third party for adjudicating differences between reviewers in a system of dual abstraction can be very time consuming. We decided for this project that the more efficient and effective process was for the two abstractors to discuss and reconcile differences between themselves and record their review on one reconciled extraction form. Information contained on the reconciled forms were spot checked by the project manager and the RTI research coordinator. In contrast, abstractors did not reconcile their opinion of the quality of the article as expressed through their responses on the quality rating form. We recorded a grade from each reviewer in the evidence tables.
Because the abstractors adjudicated their own work and completed a joint extraction form, we decided that a statistical check on the interrater reliability of their original data collection was not appropriate. In the past, the RTI-UNC EPC has used kappa statistics to evaluate agreement between reviews. Because our method of reconciliation produced 100percent agreement between reviewers (i.e., essentially perfect interrelater reliability), kappa statistics were neither useful nor appropriate.
Quality of the evidence can be judged on two levels: (1) at the level of the individual article and (2) in summary over the spectrum of articles addressing each of the key therapies or interventions. This portion of Chapter 2 describes our approaches to the development of quality ratings on both levels.
The RTI-UNC EPC approach to assessing the quality of the individual articles considers the various concerns important to the scientific method of conducting research, including the internal and external validity of each study. It encompasses, among other factors, study design, measurement issues, statistical analyses, and the appropriateness of the conclusions being drawn.
| Category 1: Problem or question studied | ||||
|---|---|---|---|---|
| Category 2: Sampling | ||||
| Category 3: Measurement | ||||
| Category 4: Internal validity | ||||
| Category 5: External validity | ||||
| Category 6: Construct validity | ||||
| Category 7: Statistical conclusions | ||||
| Category 8: Justification for conclusions | ||||
| No | Yes | |||
| 1. | Is the defnition used for the diagnosis of preterm labor clearly stated? | 0 | 2 | |
| No | Yes | |||
| 2. | Is the recruitment strategy clearly described? | 0 | 1 | |
| No | Yes | |||
| 3. | Is there random allocation of treatment? | 0 | 2 | |
| No | Yes | |||
| 4. | Are baseline levels of cervical dilation and effacement assessed? | 0 | 1 | |
| No | Yes | |||
| 5. | Are patients with ruptured membranes excluded or assessed in a stratified analysis? | 0 | 1 | |
| No | Yes | |||
| 6. | Are medically indicated or iatrogenic pregnancies excluded or assessed in a stratified analysis (e.g., preeclampsia, severe fetal defects, placenta previa)? | 0 | 1 | |
| Unknown or >20% | <20% | |||
| 7. | Is the dropout rate of patients invited into the study less than 20%? | 0 | 1 | |
| 8. | What outcomes are measured? | No | Yes | |
| 0 | 1 | ||
| 0 | 1 | ||
| 0 | 1 | ||
| No | Yes | |||
| 9. | Are the outcome measurements clinically relevant? | 0 | 1 | 0.5 |
| No | Yes | NA | ||
| 10. | Are potential confounding variables taken into account in the outcome analysis (e.g., age, race, type of preterm labor, parity)? | 0 | 1 |
![]() |
| No | Yes | NA | ||
| 11. | Are cointerventions (e.g., bed rest, hydration, or use of antibiotics when tocolytics are the study question, or vice versa) measured or reported on? | 0 | 1 |
![]() |
| No | Yes | NA | ||
| 12. | Are the treating clinicians masked to treatment administered? | 0 | 1 |
![]() |
| No | Yes | NA | ||
| 13. | Are the patients masked to treatment received? | 0 | 1 |
![]() |
| No | Yes | |||
| 14. | Do the study conclusions apply to pregnant women in the United States or to important subgroups of the U.S. population? | 0 | 1 | |
| No | Yes | |||
| 15. | Is the clinical setting specified clearly? | 0 | 1 | |
| No | Yes | |||
| 16. | Are the criteria used to assign gestational age clearly stated? | 0 | 1 | |
| No | Yes | NA | ||
| 17. | If the study is negative, is a power analysis performed? | 0 | 1 |
![]() |
| No | Yes | Some | ||
| 18. | Are statistically significant (or insignificant) findings clinically significant? | 0 | 1 | 0.5 |
| No | Yes | Some | ||
| 19. | Are statistical tests used appropriate to the data? | 0 | 1 | 0.5 |
| No | Yes | |||
| 20. | Are the conclusions warranted from the data and/or design of the study? | 0 | 2 | |
| Category 1: Problem or question studied | ||||
|---|---|---|---|---|
| Category 2: Sampling | ||||
| Category 3: Measurement | ||||
| Category 4: Internal Validity | ||||
| Category 5: External validity | ||||
| Category 6: Construct validity | ||||
| Category 7: Statistical conclusions | ||||
| Category 8: Justification for conclusions | ||||
| No | Yes | |||
| 1. | Is the description of the test procedure provided? | 0 | 1 | |
| No | Yes | |||
| 2. | Are the relevant outcomes clearly defined? | 0 | 1 | |
| No | Yes | |||
| 3. | Is a reference standard for the test clearly referenced? | 0 | 1 | |
| No | Yes | |||
| 4. | Is the recruitment strategy clearly described? | 0 | 1 | |
| 5. | Are inclusion/exclusion criteria clearly stated? | 0 | 1 | |
| 6. | Are baseline levels of cervical dilation and/or effacement assessed? | 0 | 1 | |
| 7. | Are patients with ruptured membranes excluded or assessed in a stratified analysis? | 0 | 1 | |
| 8. | Are medically indicated or iatrogenic deliveries excluded or assessed in a stratified analysis (e.g., preeclampsia, severe fetal defects, placenta previa)? | 0 | 1 | |
| Unknown or >20% | <20% | |||
| 9. | Is the dropout rate of patients invited into the study less than 20%? | 0 | 1 | |
| 10. | What outcomes are measured? | No | Yes | |
| 0 | 1 | ||
| 0 | 1 | ||
| 0 | 1 | ||
| No | Yes | Some | ||
| 11. | Are the outcome measurements clinically relevant? | 0 | 1 | 0.5 |
| No | Yes | NA | ||
| 12. | Are potential confounding variables taken into account in the outcome analysis (e.g., age, race, type of preterm labor, parity)? | 0 | 1 |
![]() |
| No | Yes | NA | ||
| 13. | Are cointerventions (e.g., bed rest, hydration, tocolytic use) measured or reported on? | 0 | 1 |
![]() |
| No | Yes | NA | ||
| 14. | Are the treating clinicians masked to test result? | 0 | 1 |
![]() |
| No | Yes | NA | ||
| 15. | Are the patients masked to test result? | 0 | 1 |
![]() |
| No | Yes | |||
| 16. | Is the clinical setting specified clearly? | 0 | 1 | |
| No | Yes | |||
| 17. | Are the criteria used to assign gestational age clearly stated? | 0 | 1 | |
| No | Yes | NA | ||
| 18. | Is a power analysis performed? | 0 | 1 |
![]() |
| No | Yes | Some | ||
| 19. | Are statistically significant (or insignificant) findings clinically significant? | 0 | 1 | 0.5 |
| No | Yes | Some | ||
| 20. | Are statistical tests used appropriate to the data? | 0 | 1 | 0.5 |
| No | Yes | |||
| 21. | Do the authors provide either sensitivity/specificity, likelihood ratios or odds ratios? | 0 | 1 | |
| No | Yes | |||
| 22. | Are the conclusions warranted from the data and/or design of the study? | 0 | 2 | |
| Category 1: Problem or question studied | ||||
|---|---|---|---|---|
| Category 2: Sampling | ||||
| Category 3: Measurement | ||||
| Category 4: Internal validity | ||||
| Category 5: Construct validity | ||||
| Category 6: Statistical conclusions | ||||
| Category 7: Justification for conclusions | ||||
| No | Yes | |||
| 1. | Is the definition used for the diagnosis of preterm labor clearly stated? | 0 | 1 | |
| 2. | Is the intervention being tested clearly described? | 0 | 1 | |
| No | Yes | |||
| 3. | Is the recruitment strategy described? | 0 | 1 | |
| No | Yes | |||
| 4. | Are the inclusion criteria for the study stated? | 0 | 1 | |
| No | Yes | |||
| 5. | Is the randomization scheme described? | 0 | 1 | |
| No | Yes | |||
| 6. | Are baseline levels of cervical dilation and effacement presented? | 0 | 1 | |
| Unknown or >20% | <20% | |||
| 7. | Is the dropout rate of patients invited into the study less than 20%? | 0 | 1 | |
| No | Yes | |||
| 8. | Are all patients who dropped out of the study accounted for? | 0 | 1 | |
| No | Yes | |||
| 9. | Are the outcomes clearly described? | 0 | 1 | |
| No | Yes | Some | ||
| 10. | Are the outcome measurements clinically relevant? | 0 | 1 | 0.5 |
| No | Yes | NA | ||
| 11. | Are potential confounding variables taken into account in the outcome analysis (e.g., age, race, labor, parity)? | 0 | 1 |
![]() |
| No | Yes | NA | ||
| 12. | Are any cointerventions reported on? | 0 | 1 |
![]() |
| No | Yes | NA | ||
| 13. | Are the treating clinicians masked to intervention group? | 0 | 1 |
![]() |
| No | Yes | NA | ||
| 14. | Are the patients masked to treatment received? | 0 | 1 |
![]() |
| No | Yes | |||
| 15. | Are the criteria used to assign gestational age stated? | 0 | 1 | |
| No | Yes | NA | ||
| 16. | Was a power analysis performed? | 0 | 1 |
![]() |
| No | Yes | Some | ||
| 17. | Are statistical tests used appropriate to the data? | 0 | 1 | 0.5 |
| No | Yes | Some | ||
| 18. | Are statistical tests clearly described? | 0 | 1 | 0.5 |
| No | Yes | |||
| 19. | Are levels of significance and or confidence intervals given? | 0 | 1 | |
| No | Yes | |||
| 20. | Was an intent-to-treat analysis performed? | 0 | 1 | |
| No | Yes | |||
| 21. | Are the conclusions warranted from the data? | 0 | 1 | |
| 22. | Are the conclusions warranted from the study design? | 0 | 1 | |
To assess the internal validity of a study (i.e., the likelihood that the
design and conduct of the study minimize systematic error or bias) the
following factors were evaluated from each study across all study types:
Description of the recruitment strategy.
Masking of the treating clinician and patient.
Inclusion of baseline measurements including cervical dilation, effacement, and estimated gestational age.
Definition of outcome measures, attrition rates, and confounding variables.
Statistical considerations included assessing the adequacy of the methods used and whether the authors did a power calculation. The rating scale also addressed the clinical relevancy of the statistical findings.
With regard to diagnostic and outcome measures, the abstractors evaluated whether the standard for a test was clearly referenced. In addition, the quality rating instrument assessed whether studies included the three major categories of outcomes (i.e., those for delivery, mothers, and infants).
External validity (i.e., whether the findings of the study can be generalized to populations that did not participate in the study) also was considered for quality rating. We included in our evaluation whether the clinical setting was specified clearly and whether conclusions apply to pregnant women in the United States.
The quality rating instrument was based on a point scale that was transformed into a percentage score, with the higher percentage score denoting a better study. Each abstractor completed the quality rating for each article reviewed. The individual scores provided in the evidence tables for each article were derived by calculating the following ratio: the points from the responses circled by the abstractor (numerator) divided by the total number of points that could have been awarded (denominator). When the items were determined to not be applicable, "NA," then the total number of points for that item was removed from both the numerator and denominator. Quality rating scores for each article were not averaged across the two abstractors (i.e., clinician and methodologist), so that patterns across studies could be observed.
According to a report developed by Lohr and Carey,97 several systems have been proposed in the past 10 to 15 years for "grading the quality or strength" on a given clinical topic or pathway; however, little or no consensus exists about which specific system is best, and thus little guidance was given about which approach to use for our review.
The RTI-UNC EPC developed a grading scheme that considers the strength and the quality of the evidence based on the work of the RTI-UNC EPC in the Evidence Report on the Pharmacotherapy for Alcohol Dependence98 and a report by the U.S. Preventive Services Task Force (1996).99 Our determination of the strength of the evidence includes an evaluation of the consistency of study findings over all studies, the quality rating scores for individual pieces of research, the magnitude of important outcomes, and, when available, the results of our meta-analyses. Because we include both randomized and cohort studies in the review, we also record whether the evidence is appropriate for evaluating the efficacy or the effectiveness of the findings. The letter grade given to a body of literature is not necessarily higher because it contains a greater number of RCTs; rather, it rests on the quality of the studies themselves. We did not assign scores for each of the variables considered for the ratings; instead, we based our quality assessment on an adjudicated rating that was initially provided by the project director, scientific director, and clinical methodologist as follows:
| Good (A): | The data are sufficient for evaluating the quality of the findings. The data are consistent and indicate that the key drug or intervention is superior to alternative treatment or placebo for treating women in preterm labor. |
| Fair (B): | The data are sufficient for evaluating the quality of the findings. The data indicate inconsistencies in the findings between the key drug or intervention and alternative treatment or placebo for treating women in preterm labor. |
| Poor (C): | The data are sufficient for evaluating the quality of the findings. The data do not show that the key drug or intervention is superior to alternative treatment or placebo for treating women in preterm labor. |
| Incomplete evidence (I): | The data are insufficient for assessing the quality of the key drug or intervention for treating women in preterm labor based on limited sample size or poor methodology. |
| Efficacy (1): | The evidence was obtained from well-designed RCTs. |
| Effectiveness (2): | The evidence was obtained from well-designed cohort or case control analytic studies. |
| Study Design |
|---|
| Outcomes |
| Citation |
| Research objective |
| Study design and masking |
| Definition of preterm labor |
| Patient inclusion/exclusion criteria |
| Sample size initial/end |
| Demographic characteristics |
| Intervention |
| Cointerventions (drug and nondrug) |
| Country and time period |
| Quality score |
| Definition of successful tocolysis (tocolytics only) |
| Compliance assessment (antibiotics, tocolytics, and home uterine activity monitoring only) |
| Estimated gestational age |
| Prolongation of pregnancy |
| Gestational weeks |
| Maternal morbidities |
| Birth weight |
| APGAR score (antibiotics and tocolytics only) |
| Other infant outcomes |
| Other analyses |
| Conclusions |
| Comments and limitations (home uterine activity monitoring only) |
The information contained in the tables is self-explanatory; however, numerous abbreviations, which can be found in Appendix C, were used to conserve space.
The content of the study design table is consistent across study topics. All tables include a statement of the research objective, the definition of preterm labor used by the authors, patient inclusion/exclusion criteria, and a description of the experimental intervention. When available, we include the total number of participants at the beginning and at the end of the study as well as the number of participants in each arm of the study at its conclusion.
Information unique to particular studies includes a definition of tocolysis in the tocolytics table and the exclusion of compliance assessment information in the biologic markers tables (as this would be irrelevant in relation to a diagnostic test). The home uterine activity monitoring study design table (Evidence Table 13) includes a list of the outcome measures that are described in greater detail in the outcomes table.
The outcomes tables across types of studies include data on delivery, maternal, and infant outcomes. The presentation is the same in the tables describing tocolytic and antibiotics outcomes (Evidence Tables 3 through 12). The home uterine activity monitoring outcome table provides information on recurrent preterm labor and includes comments and limitations of the study. In the pharmacotherapy studies table, information about the treatment for recurrent labor is included under intervention. We did not include comments and limitations for other study types because authors typically did not provide such information.
The presentation of outcomes in the biologic marker outcome tables is somewhat different from the other study types (Evidence Tables 1 and 2) because they describe the results of diagnostic tests and delivery outcomes and are presented in terms of sensitivity, specificity, positive predictive value and negative predictive value.
Among the more important activities involved in producing a credible evidence report is conducting an unbiased and broadly based review of the draft report. Such a review, here termed "peer review," should provide an array of scientists, methodologists, users, and laypersons adequate opportunity to comment on the report and to identify problems of fact, interpretation, or presentation. Appendix A describes the selection process for peer review and lists the names of the peer reviewers.
In addition to the systematic review of the literature discussed above, we conducted meta-analyses focusing on treatment effects of each class of first-line tocolytics and maintenance tocolytics, antibiotics, and home uterine activity monitoring. The efficacy of treatment was measured in relation to the following outcomes: (1) prolongation of pregnancy, (2) estimated gestational age at delivery, and (3) birth weight. We restricted the literature for the meta-analyses to studies included in our evidence tables, RCTs, and to data concerning women with intact membranes. Chapter 4 presents a thorough discussion of the analyses, including selection criteria, methodology, and study findings.
Based on the data presented in the evidence tables, this chapter presents the major findings from our review of the literature regarding the management of preterm labor. The chapter begins with a discussion of issues and limitations relevant to the collective body of literature. Subsequent sections focus on each assessment technique or therapy and provide a detailed discussion of efficacy and/or effectiveness results (benefits). We address, in turn, biologic markers (fFN and EVUSD), tocolytics (first-line and maintenance), antibiotics, and home uterine activity monitoring. Side effects (harms) data are presented only in relation to tocolytic treatment.
Through our review, we found a number of issues affecting the quality of the studies reported in the literature that cut across categories of assessments and interventions in the management of preterm labor. Our ability to draw conclusions with confidence is diminished by the following five issues concerning research design and study execution: (1) the definition of preterm labor, (2) the size of the trials, (3) the confounding of results because of use of other interventions, (4) the sole reliance on bivariate analysis, and (5) the failure to analyze separately women with medically indicated preterm births and compare results with those of the larger group of women with preterm labor. We discuss other concerns specific to the literature about a particular assessment technique or therapy in the section of this chapter covering that specific topic.
Definitions used to delineate an episode of preterm labor vary widely across studies. Some investigators are quite precise, requiring a specified level of contraction duration, contraction frequency, and cervical change; however, within this group, the number or frequency of symptoms varies greatly. Other authors specify, as an entry criterion, a lack of response to nondrug interventions, such as bed rest and hydration. Still other authors do not define preterm labor or specify the symptoms used as an inclusion criteria for the study. Thus, the reviewer is presented with possible heterogeneity in study populations, a factor that limits the generalizability and aggregation of study results.
Because adverse perinatal outcomes (morbidity and mortality) are so rare, studies generally have inadequate numbers to provide the power to assess whether any observed differences were statistically significant. Hence, nonstatistically significant results may stem from a lack of power to detect positive outcomes rather than a true lack of benefit. Instead, investigators rely on surrogate outcomes such as pregnancy prolongation, mean gestational age at delivery, and rate of delivery before a specific cutoff point (most commonly 35 to 37 weeks). Reviewers are asked implicitly to make the assumption that improvements in these surrogate outcomes would translate into improvements in perinatal outcomes if applied to enough women with preterm labor symptoms.
Interpreting results is further complicated by the use of cointerventions. Most studies of one type of therapy allow the use of another type of intervention; for instance, antibiotic studies allow for tocolytic use and vice versa. Moreover, studies also allow for nondrug interventions, such as intravenous hydration and bed rest. To avoid confounding from cointerventions, analyses would need to impose study designs or include statistical techniques to control for use of the cointerventions. Very few of the studies reviewed for this evidence report include these controls.
Given the rarity of perinatal death and serious morbidity, assessing these outcomes would require enrolling thousands of women. As this is unlikely to be a practical solution, survival analysis is an analytic technique that can enhance outcome presentations by providing stratified analyses by gestational age at enrollment for the period from treatment to birth. (See Rothman and Greenland 100 for a general description of survival analysis.)
To date, however, studies typically rely on bivariate analyses of categorical and continuous outcomes. Although categorical measures can capture such outcomes as "preterm birth, yes or no," these results may not be clinically informative in and of themselves because women at a wide range of gestational ages may be included in the analysis. Comparisons of continuous outcomes, such as mean prolongation of pregnancy, may also be misleading. For example, a similar lengthening of gestation may be of greater value for a woman at 26 weeks' gestation relative to 34 weeks' gestation and, because a wide range of gestational ages are typically included in the studies, this distinction is not commonly available.
The etiologic pathways leading to preterm birth are uncertain. Given the lack of epidemiologic evidence, it remains unclear as to whether indicated preterm births are etiologically heterogeneous or homogeneous with spontaneous preterm births. The inclusion of dual analyses in the review of efficacy and effectiveness studies could provide useful insights into this controversy; results for women with indicated preterm births and those experiencing spontaneous preterm births could be analyzed separately and in combination. A body of literature of this kind could provide insights into this controversy and support conclusions concerning whether indicated preterm births should be included or excluded from the analyses of women with preterm labor.
Chapter 1 discussed the clinician's limited ability to ascertain which symptomatic woman is at risk for imminent delivery and which is not. The inability to assign risk accurately, coupled with the ubiquity of preterm labor symptoms, exposes numerous women to unnecessary and potentially harmful treatment. Moreover, this phenomenon dilutes researchers' ability to detect a significant intervention effect in those with uterine activity that will result in cervical change and potential preterm birth. Thus, a rapid and reproducible biochemical or biophysical "diagnostic" test for women with preterm labor symptoms that could accurately assess the risk of preterm delivery would be of great value. In Chapter 1, we labeled such tests "biologic markers."
We proposed to study three biologic markers in this evidence report: fFN, EVUSD, and E3. Restricting our review to investigations that include women with symptoms of preterm labor, as we have done throughout this report, resulted in our finding no studies of E3 in symptomatic women. As a result, we cannot comment on estriol's efficacy to assess preterm delivery risk in women with preterm labor symptoms and will not address in this chapter this biologic marker. Also, the body of literature about the use of EVUSD as a screening tool is quite extensive; however, we have excluded screening studies in asymptomatic women, which substantially restricts the articles available for review in this evidence report.
Of the 17 reviewed studies about biologic markers, 12 examined fFN; 46 101 102 103 104 105 106 107 108 109 110 their findings can be found in Evidence Table 1. Three additional studies concern EVUSD alone; 55 111 112 the details can be found in Evidence Table 2. Finally, findings of two other studies that examined both fFN and EVUSD 113 114 can be found in both Evidence Tables 1 and 2 for completeness.
As discussed previously, all the literature relating to preterm labor has limitations including the definition of an episode of preterm labor, the trial size, the confounding of results because of the use of other interventions, the reporting of preterm birth outcomes in a categorical fashion, and the failure to analyze separately women experiencing medically indicated births.
With respect to biologic markers specifically, we note first that all biologic marker studies conducted to date have been observational. Although observational studies can give insights regarding the sensitivity and specificity of a biologic marker to predict preterm delivery within a certain period of time, they cannot assess whether clinicians and patients can use the additional information provided by biologic markers to improve perinatal outcomes. That hypotheses is best tested in an RCT in which clinicians are given biologic marker results for one set of patients and not for another. To our knowledge, no such study has been conducted. Moreover, masking and generalizability are further issues in the biologic marker research arena.
Overall, the study populations in these investigations were not large. Of the 12 fFN articles, 8 had fewer than 100 subjects relevant for our analysis. 102 104 106 107 109 110 114 115 Of the two studies that combined fFN and EVUSD, one had a population of 108 113 and the other 76. 114 Two studies on EVUSD alone enrolled fewer than 100 women, 55 111 another had 162 participants. 112
Definitions of preterm labor symptoms were highly variable. If one deems an adequate definition of preterm labor symptoms to include uterine contraction frequency and duration along with cervical dilation status, then only six studies used definitions that can be labeled adequate. 55 108 110 111 112 113 Five studies did not definite preterm labor symptoms. 101 102 106 107 110 In the largest study, a preterm labor symptom was defined as one that prompts an unscheduled visit to a clinician. 46
All but one report failed to control for confounding interventions that may have altered the likelihood of preterm birth in women with preterm labor symptoms. Peaceman, Andrews, Thorp, et al. 46 controlled for tocolytic use in two ways: (1) by calculating test performance characteristics in women who did and did not receive a tocolytic and (2) by including a study site that did not use tocolytics as a treatment to prevent preterm delivery.
Assessing the outcomes of interventions in women with preterm labor symptoms categorically (preterm birth: yes or no) as opposed to continuously (mean gestational age at delivery or mean prolongation of pregnancy in days) can make clinically important differences difficult to detect. This blurring phenomenon has been described in detail in other sections.
To avoid such problems, we have advocated survival analysis stratified by gestational age at enrollment as the ideal analytic technique. The largest fFN study, 46 a small French report on fFN, 106 and one of the combined reports 113 used this method of analysis. Three other fFN studies approximated survival analysis by using the categorical outcome of preterm delivery within 7, 14, and 21 days after sample collection, 105 108 110 and another evaluated 7- and 14-day intervals. 46
Preterm birth is thought by some to be etiologically heterogeneous; that is, medically indicated preterm births in nonlaboring women have different causal pathways than do births arising from preterm labor or premature rupture of membranes. If this concept of etiologic heterogeneity is accurate, then a corollary is that diagnostic tests designed to measure changes indicative of imminent preterm birth in women with preterm labor symptoms should have no predictive value for medically indicated preterm births. However, we do not know that this is true, since, to our knowledge, only Peaceman, Andrews, Thorp, et al. 46 have excluded medically indicated births before term.
In an observational study of a diagnostic test, an essential study design element is to mask clinicians (care providers, outcomes assessors) to the test result. Likewise, the ideal design is to ensure that the patients themselves are unaware of their fFN or ultrasound status. Failure to mask clinicians can alter clinician practices based on their biases and, thus, change test performance. We reviewed each study with respect to masking. If the articles states that clinicians caring for study subjects were unaware of the study results, we considered masking to be adequate; conversely, if the articles made no such statement, we judged masking to be inadequate. Three fFN studies, 105 107 115 one combined fFN-EVUSD study, 114 and one ultrasound study 55 had inadequate masking of clinicians.
Information about the number of patients eligible for the study who actually enrolled and completed the study is important because, in absence of such data, the assessment of the quality of the study and its generalizability to the universe of women with symptoms of preterm labor is difficult, if not impossible. Peaceman, Andrews, Thorp, et al. 46 and Iams, Casal, McGregor, et al. 103 were the only authors that provided information about the number of symptomatic women who were screened for study participation and the number ultimately enrolled. In these two studies, respectively, 52 percent and 46 percent of women screened for participation actually were enrolled and had fFN performed.
Each of the studies except that by Watson, Kim, and Humphrey 110 provided test characteristics such as sensitivity, specificity, and positive and negative predictive values of preterm delivery. Although influenced by the prevalence of preterm birth, predictive values, namely positive predictive value (PPV) and negative predictive value (NPV), are the most clinically relevant data. Predictive values answer the critical question: Given a positive or negative test result, what is the likelihood that a woman with symptoms of preterm labor will have a preterm delivery? All but two fFN studies 102 110 provided the reader with PPV and NPV. Inglis, Jeremias, Kuno, et al. 102 provided sufficient detail to calculate predictive values.
The predictive value of a positive fFN, that is, the proportion of women with a positive test who deliver before term, is moderate at best. (The cutoff point for fFN positivity was identical, i.e., 50 ng/ml in the fFN studies.) In the articles cited in Evidence Table 1, the PPV of predicting delivery within 7 days ranged from 13 percent to 44 percent 46 103 104 105 108 (singleton pregnancies only). When the outcome of interest was delivery at less than 37 weeks, the PPV ranged from 31 percent 107 to 83 percent. 109 In the largest fFN study, Peaceman, Andrews, Thorp, et al. 46 reported a PPV of 43 percent for delivery at less than 37 weeks.
In contrast to PPV, the NPV of fFN for identifying those at low risk of preterm birth is good, especially to assess risk of imminent delivery. The NPVs for birth before 37 weeks reported in these studies ranged between 69 and 92 percent 102 105 and for birth within 7 days between 98 and 100 percent. 104 105
Two studies of fFN 46 103 compared the diagnostic accuracy of this biologic marker with that of traditional clinical risk assessment based on cervical dilation, contraction frequency, and vaginal bleeding. In both instances, fFN was superior to clinical assessment.
We elected to aggregate the test performance data from EVUSD studies that evaluated comparable outcomes with composite estimates of sensitivity, specificity, PPV, NPV, and likelihood ratios. In the five EVUSD papers, we developed cutoff points for considering ultrasound findings abnormal (in other words, a positive test) from the study data. Timor-Trisch, Boozarjomehri, Masakowski, et al. 111 focused on dynamic cervical changes (e.g., wedging or funneling); the other studies relied on cervical length. Performance of sonography, evaluating cervical length using cutoff points of 2.6 and 3.0 cm, 55 112 was similar to that of fFN: moderate PPVs of 50 to 55 percent and high NPVs of 76 to 100 percent. 55
The two papers that compared fFN with EVUSD used cervical length as a criterion to assign risk. Rizzo, Capponi, Arduini, et al. 113 used 20 mm as a cutoff point; Rozenberg, Goffinet, Malagrida, et al. 114 used 26 mm. In one combined study, Rizzo, Capponi, Arduini, et al. 113 found that fFN testing was superior to sonography in terms of diagnostic performance; Rozenberg, Goffinet, Malagrida, et al. 114 found no difference between the two testing modalities. Both studies found that the combined use of fFN and EVUSD improved test performance and that each made independent contributions to the prediction of risk for preterm delivery.
This section presents estimates of sensitivity, specificity, PPV, NPV, and likelihood ratios as a means of clarifying the potential value of these biologic markers in clinical practice. Of the 14 articles evaluating the use of fFN, 12 provided sufficient detail to construct contingency tables describing the relationship between test results and birth before 37 completed weeks' estimated gestational age (see Evidence Table 1). 46 101 102 103 104 105 106 107 108 109 113 114 Each study used a cutoff point of greater than or equal to 50 ng/ml for a positive test.
Aggregating across these studies, 1,786 samples from women being evaluated or treated for preterm labor were available for this analysis. In this combined cohort, 488 preterm births occurred, for an overall prevalence of 27 percent. Of the 267 women who delivered before term, 267 had positive test results at their initial evaluation and 221 did not. Of those who delivered at term, 1,111 women had true-negative fFN results and 187 women had false-positive fFN values.
Combined, the sensitivity of fFN (i.e., the ability to identify correctly those who delivered before 37 weeks) was 54.7 percent; specificity (i.e., the ability to identify those who delivered after 37 weeks) was 85.6 percent. The PPV (i.e., the probability that a woman with a positive test result would deliver before term) was 58.8 percent. The NPV (i.e., the probability that a woman with a negative test would deliver at term) was 83.4 percent. The likelihood ratio for a positive fFN is 3.8 (95 percent confidence interval [CI]: 3.3, 4.4) and 0.53 (95 percent CI: 0.48, 0.58) for a negative test result. The clinical application of likelihood ratios is reviewed briefly below.
The majority of these studies enrolled women in a 10-week range of gestational ages, meaning some subjects have longer time at risk for preterm birth than others, typically between 24 and 34 weeks. This factor has the effect of reducing sensitivity of testing at a single point in time for predicting birth before 37 weeks. Presumably, some women with an initial negative fFN test will subsequently develop positive tests. In a sense, these women actually do not reflect false-negative values on their initial test; rather, they reflect the time-dependent nature of using biologic markers and their relationship to outcomes.
Limiting the effect of weeks elapsing between fFN is informative about the utility of fFN as a reliable biologic marker. The five publications that examine the more immediate outcome of delivery within a week are instructive on this point. 46 103 104 105 108 The aggregate sensitivity of fFN (from these studies) to identify women who delivered in 7 days after testing was 89.4 percent; the specificity was 83.3 percent; the PPV was 22.9 percent; and the NPV was 99.3 percent. For the outcome of birth within 7 days, the likelihood ratio for a positive fFN is 5.3 (95 percent CI: 4.6, 6.2) and 0.13 (95 percent) CI: 0.06, 0.26) for a negative fFN. The high NPV, supported by this combination of results, suggests that fFN conveys an important benefit in terms of identifying women who are not an immediate risk of preterm birth and who may not require intervention.
The EVUSD literature presents special challenges to combining results in this fashion. Continuous predictors, such as cervical measurements, cannot be captured adequately from the level of detail permitted in publications. Appropriately, therefore, authors of the articles we reviewed examined their data and established categories of cervical change or dichotomous cutoff points for examining test characteristics or calculating likelihood ratios. As a result of their statistically driven analyses, the studies show greater inconsistency in the cutoff points used than was true for the fFN articles.
Of the five EVUSD studies in women with signs or symptoms of preterm labor, three evaluated the use of wedging or funneling to determine risk of preterm birth. 111 112 113 Combined, these investigators evaluated 314 women, of whom 98 delivered before 37 weeks' gestation. Of those who delivered preterm, 59 had funneling or wedging noted on ultrasound examination; 39 did not have visible funneling. In addition, 23 women had funneling and delivered at term (a total of 23 false-positives), and 171 women had normal ultrasounds and term deliveries. These figures are equivalent to the following rates: sensitivity, 60.2 percent; specificity, 79.2 percent; PPV, 56.7 percent; and NPV, 81.4 percent. The positive likelihood ratio for funneling is 2.8 (95 percent CI: 2.1, 3.9) and 0.50 (0.39, 0.65) for the absence of funneling.
Two studies 55 112 used the cervical length of 3 cm or less as a diagnostic cutoff point to assign higher risk of preterm birth. This threshold was associated with the following rates: sensitivity, 88.5 percent; specificity, 59.3 percent; PPV, 49.5 percent; NPV, 92.0 percent; and positive likelihood ratio of 2.2 (95 percent CI: 1.7, 2.7); negative likelihood ratio of 0.19 (0.10, 0.39).
| Sensitivity | Specificity | Test Characteristics by Incidence of Preterm Birth In Women Presenting for Preterm Labor | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Included Studies | Low Risk Example Within 7 Days = 3% Before 37 Weeks = 10% | Medium Risk Example Within 7 Days = 10% Before 37 Weeks = 25% | High Risk Example Within 7 Days = 15% Before 37 Weeks = 45% | ||||||||
| Observed Incidence | PPV | NPV | PPV | NPV | PPV | NPV | PPV | NPV | |||
| Birth within 7 days | |||||||||||
| fFN | 89.4 | 83.3 | 5.3 | 22.9 | 99.3 | 14.3 | 99.6 | 37.2 | 98.6 | 48.6 | 97.8 |
| Birth before 37 weeks | |||||||||||
| fFN | 54.7 | 85.6 | 27.3 | 58.8 | 83.4 | 29.7 | 94.5 | 55.9 | 85.0 | 75.7 | 69.8 |
| EVUSD: Funneling/wedging | 60.2 | 79.2 | 31.2 | 56.7 | 81.4 | 24.3 | 94.7 | 49.0 | 85.6 | 70.4 | 70.9 |
| EVUSD: Cervical length < 3 cm | 88.5 | 59.3 | 31.1 | 49.5 | 92.0 | 19.4 | 97.8 | 42.0 | 93.9 | 64.0 | 86.2 |
fFv = fetal fibronectin; EVUSD = endovaginal ultrasound; PPV = positive predictive value; NPV = negative predictive value
Biologic markers supplement clinical judgment. Based on a patient's obstetric history, her description of current symptoms, and findings from physical examination and monitoring of contractions, care providers estimate the probability of preterm birth. An estimate of high probability results in further testing and/or treatment; an estimate of lower probability leads to continued observation. Thresholds for deciding whether to treat women in preterm labor or to observe them further are highly individualized and often not explicit. For example, physicians with aggressive treatment policies may favor intervention at any time that they believe that a woman has a greater than 10 percent chance that her preterm labor might result in preterm birth and will likely further adjust that judgment based on gestational age. A conservative philosophy might reserve treatment for cases in which the probability estimate of a preterm birth is greater than 30 percent, accepting that time and observation will further clarify the need to treat.
Regardless of where this threshold to treat lies for individual providers, there exists a range of probabilities between an estimated probability at which they "always" treat and an equivalent estimated probability at which they are comfortable sending a patient home with no further evaluation. When this range is relatively wide, the additional diagnostic or prognostic information provided by fFN or EVUSD testing may help the clinician to make a more sound decision.
Imagine a provider whose threshold to treat is 20 percent and who would accept a 2 percent risk that the pregnancy would be delivered before term. Faced with a patient who is being evaluated for her second episode of preterm labor at 33 weeks' gestation and who has contractions every 8 minutes, no cervical change, and a history of a prior preterm birth at 36 weeks, the clinician may believe that the woman's current episode has a 10 percent chance of leading to progressive cervical change and delivery. This is within the clinician's range of clinical uncertainty; additional information, such as fFN or EVUSD data, can have direct decisionmaking utility.
The question is how to apply such information easily across a range of patient populations with varied prevalence of preterm birth. Likelihood ratios for categories of test results accomplish this; they can be used to modify the odds of a patient's having a specific outcome such as preterm birth or birth within the next week. Posttest odds are equal to the pretest odds multiplied by the likelihood ratio.
In this hypothetical example, the provider's pretest probability,
based on clinical judgment, was 10 percent. This value can be
converted to odds using this formula:

Pretest
odds are 0.11:1 or 1:9.
From the combined analysis of fFN discussed above (with a likelihood
ratio of 3.8 percent for birth before 37 weeks), if the woman's fFN
test was positive, then her posttest odds would be given by the
following equation:
Posttest odds = Pretest odds x
Likelihood ratio = 0.11 x 3.8 = 0.42
To convert this back to a probability, we use this
formula:
In short, with a positive fFN test, a woman's posttest odds are 0.42, which is equivalent to a probability of 30 percent. This is a value within the range of probabilities for which the hypothetical clinician would start treatment. Pocket nomograms are available to ease use of these conversions in clinical settings. 116
An alternative approach to using these graphs is to acknowledge that clinical estimates of risk of spontaneous preterm birth for an individual are imprecise, whereas the prevalence of preterm birth from preterm labor in a specified population can be determined with certainty. This prevalence can serve as the prior probability. In this case, the prevalence is located on the X-axis, and the Y-axis reflects the posterior probability, in that population, adjusted for the biologic marker result. Whether interpreted as adjustment of individual risk or risk for a population, these biologic markers offer additional information that refines assessment of the probability that preterm birth will result from an episode of preterm labor.
We grouped the tocolytic strategies using two schemes. First, tocolytics were designated as either first-line or maintenance regimens. The former are used to treat acute preterm labor symptoms after the clinician decides that these symptoms are so significant that administration of a tocolytic drug to prevent preterm birth is indicated. Maintenance regimens are used after an episode of acute tocolysis with a first-line regimen. The goal of maintenance tocolysis is prolonged prevention of a preterm birth in a woman who had experienced symptoms of preterm labor.
Second, we divided the tocolytic strategies using two schemes: (1) beta-mimetics, (2) calcium channel blockers, (3) magnesium sulfate, (4) NSAIDs, and (5) a category labeled "other" that includes ethanol and atosiban. Clinicians use drugs within all five of these categories for both first-line and maintenance tocolysis. The evidence tables in correspond to these categories. Evidence Tables 3 through 7 contain first-line tocolytic studies, and Evidence Tables 8 through 11 provide information on maintenance tocolytics. Based on the available research that met our inclusion criteria, the maintenance review includes only beta-mimetics, calcium channel blockers, magnesium sulfate, and NSAIDs. Studies comparing classes of tocolytics are repeated in the evidence tables and the discussion in this chapter within each tocolytic class.
A review of the potential side effects or harms from treatment follows the discussion of the benefits of tocolytic therapy. Results are divided by the drug's mechanisms of action. Owing to time and resource limitations, we were unable to divide the studies further on the basis of whether the drugs were being used for first-line or maintenance treatment.
More than one-half of the RCTs had definitions of preterm labor that we considered adequate. They included the number of contractions, typically recorded during a specific period of time, and another measure such as level of effacement or dilation. 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 Three RCTs had no definition. 136 137 138 One of the three nonrandomized trials 128 and approximately one-half of the nine observational studies 139 140 141 142 143 had an adequate definition. In the studies that included a count of contractions, the number used as an entry criterion varied greatly.
Authors included a definition of successful tocolysis in more than 60 percent of studies, 117 120 122 125 126 127 128 129 130 131 133 134 137 138 139 140 142 144 145 146 147 148 149 150 151 152 153 154 155 including approximately one-half of the RCTs. As with the definition of preterm labor, definitions of successful tocolysis varied greatly. The most common definitions were delay of labor for 48 hours, 7 days, and until 34, 35, or 36 weeks' gestation.
The number of subjects included in the studies varied widely; 23 studies enrolled 100 or more women. 96 117 121 122 123 124 131 132 133 135 136 137 141 143 144 145 149 151 156 157 158 159 Studies tended to have little or no demographic data. The mother's age was most commonly reported; nine studies also reported the mother's race. 117 123 124 128 133 138 147 150 154 Education levels were not reported in any of the studies.
Bias is a concern in this literature. Only six of the RCTs state that the treating physician was masked to which patients received which treatments. 96 121 129 137 160 None of the observational studies used regression techniques to control for confounding from factors outside the treatment.
A discussion of patient compliance with the treatment protocol is virtually nonexistent in the studies. If the patient is receiving only inpatient treatments, then compliance is not necessarily a serious concern, as it can be monitored by hospital staff; however, some studies continue treatment after discharge but do not provide information concerning compliance with these medications.
Another potential concern relating to the population of women reflected in the analyses is generally controlled in this literature; only nine studies do not either exclude women with ruptured membranes or use stratified analyses for women with intact membranes relative to ruptured membranes. 96 125 135 141 146 154 160 Patient exclusions based on the need for a medically indicated birth were not easily discernible from the literature.
Merkatz, Peter, and Barden 136 reported on the only RCT comparing first-line beta-mimetic treatment with that given a control group that found the estimated gestational age at delivery to be significantly older with treatment. Some patients included in this control group, however, received ethanol treatment. Four other RCTs that compared a beta-mimetic treatment with that given a control group found no significant difference in this outcome. 121 131 133 151
Both Merkatz, Peter, and Barden 136 and Larsen, Hansen, Hesseldahl, et al. 121 found a significant increase in mean prolongation of pregnancy in patients receiving beta-mimetic treatment. Two other authors found greater prolongation in the beta-mimetic group relative to no treatment controls: 1 day or less by Leveno, Klein, Guzick, et al., 151 and less than 24 hours and less than 48 hours by the Canadian Preterm Labor Investigators Group. 96 Holleboom, Merkus, van Elferen, et al. 131 and Guinn, Goepfert, Owen, et al., 133 two relatively large studies with, respectively, 201 and 179 women, did not find efficacious results in prolonging pregnancy. Although Merkatz, Peter, and Barden 136 also found gestation to be significantly increased to greater than or equal to 36 weeks, gestational weeks were not significantly different in five other RCTs. 96 121 131 133 151
Only Merkatz, Peter, and Barden 136 reported efficacy of tocolytics for infant outcomes, calculated in relation to a significantly higher birth weight. Other studies found no difference in measures of birth weight; Activity, Pulse, Grimace, Appearance, Respiration (APGAR) score; or neonatal intensive care unit admissions. 121 131 133 151
Other RCTs included in the review compared the efficacy of beta-mimetic treatment with treatment with a different class of tocolytic. With respect to older mean gestational age at birth, Jannet, Abankwa, Guyard, et al. 134 and Papatsonis, Van Geijn, Ader, et al. 159 found calcium channel blocker treatment, and Eronen, Pesonen, Kurki, et al. 161 found NSAID treatment, to be related significantly to older estimated mean gestational age at birth compared with beta-mimetic treatment. For the same outcome measure, Koks, Brolmann, de Kliene, et al. 135 and Bracero, Leikin, Kirshenbaum, et al. 127 found no significant difference between calcium channel blockers and beta-mimetics, and Kurki, Eronen, Lumme, et al. 129 found no difference between NSAIDs and beta-mimetics.
Similarly mixed results were found in RCTs that measured prolongation of pregnancy. Only in comparison to ethanol treatment was a beta-mimetic found to result in a significantly longer prolongation of pregnancy. 148 In several studies, beta-mimetics alone were found to result in a significantly shorter prolongation of pregnancy when compared with another class of tocolytic including calcium channel blockers, 159 NSAIDs, 129 and magnesium sulfate and beta-mimetic combination. 138
Other researchers found no significant differences in trials comparing beta-mimetic treatment and other classes of tocolytics using a variety of measures of prolongation, which includes five studies of calcium channel blocker treatment, 126 127 130 135 153 two of NSAID treatment, 119 123 and two of magnesium sulfate treatment. 122 150
Similarly mixed results were shown in observational studies. In no instance did beta-mimetic treatment significantly prolong pregnancy more than an alternative tocolytic. 128 140 141 142 146
The most common instant-related outcome in the first-line beta-mimetic RCTs was birth weight. Bracero, Leikin, Kirshenbaum, et al. 127 found calcium channel blocker treatment to be related to higher birth weight, but nonstatistically significant differences were not found by Meyer, Randall, Graves, et al., 126 Jennet, Abankwa, Guyard, et al., 134 Papatsonis, Van Geijn, Ader, et al., 159 Koks, Brolmann, de Kleine, et al., 135 and Garcia-Valasco, and Gonzalez. 153 Two studies concluded that the differences between an NSAID and a beta-mimetic were insignificant; 123 161 another found the same insignificant differences between magnesium sulfate and a beta-mimetic. 150
All of the 14 calcium channel blocker studies, except one case study, compared a calcium channel blocker with a different class of tocolytic. Of the 11 RCTs included in the review, 9 compared treatment with a calcium channel blocker and a beta-mimetic, which were discussed in the preceding section on beta-mimetics. The remaining two RCTs compared treatment with a calcium channel blocker and magnesium sulfate. 162 163 (See Evidence Table 4.)
All calcium channel blocker studies, except for one retrospective cohort analysis, 155 included a definition of preterm labor. Nine of the 11 RCTs included a definition that we would consider adequate. 125 126 127 130 134 135 154 162 163 In addition, 8 of the 11 RCTs included a definition of successful tocolysis. Definitions vary with length of cessation of symptoms, ranging from 2 hours in the research by Smith and Woodland 154 to 36 completed weeks' gestation in Ferguson, Dyson, Schutz, et al. 125 and Jannet, Abankwa, Guyard, et al. 134
Generally, studies either excluded women at entry who had ruptured membranes or stratified outcome analyses. Exceptions include two RCTs 135 154 and two of the three observational studies. 155 163 The estimated gestational age at entry was rather broad in the study population. In seven RCTs, it is 20 to 22 weeks at the lower boundary and 34 to 36 weeks at the upper boundary. In the remaining four RCTs, it is 24 to 26 weeks at the lower boundary and 34 to 36 weeks at the upper boundary.
Few statistically significant differences were found in outcomes between study arms of the calcium channel blocker studies; all were in relation to beta-mimetic treatment. As discussed in the section of this chapter concerning beta-mimetic study results, Jannet, Abankwa, Guyard, et al. 134 in a study with an enrollment of 90 women, an adequate definition of preterm labor, and a definition of successful tocolysis found calcium channel blockers to have greater efficacy than beta-mimetics in terms of mean estimated gestational age at delivery and delivery at term. These researchers did not find significant differences in infant outcomes. Papatsonis, Van Geijn, Ader, et al. 159 also reported significant findings in favor of calcium channel blockers compared with beta-mimetic treatment in terms of mean estimated gestational age at birth, mean prolongation of pregnancy, prolongation of pregnancy for specific lengths of time ranging between 24 hours and 2 weeks, and delivery at less than 34 weeks. This study had 185 women at enrollment; it included a definition of preterm labor but no definition of successful tocolysis.
Although lacking statistically significant differences in delivery outcomes, several studies found calcium channel blocker treatment relative to beta-mimetic treatment to be associated with improved infant outcomes. Bracero, Leikin, Kirshbaum, et al. 127 found birth weights to be higher and neonatal intensive care unit admissions to be lower; Koks, Brolmann, DeKleine, et al. 135 found 1-minute APGAR scores to be higher. The retrospective cohort study comparing these two classes of tocolytics also found birth weight to be significantly higher for the calcium channel blocker group.
Four additional RCTs found no significant differences between any outcome measures comparing calcium channel blocker and beta-mimetic treatment. 126 130 153 154 This finding also was true for the two RCTs comparing calcium channel blockers and magnesium sulfate treatment. 162 163
Based on the information included in the articles, none of the physicians providing treatment were masked. None of the observational studies included multivariate analyses to control for confounding.
Three studies compared magnesium treatment to a control group; 120 124 166 the later study also included a terbutaline treatment arm. Fox, Albert, McCaul, et al. 166 found magnesium treatment to show efficacy in terms of delivery at less than or equal to 48 hours. On the other hand, although both measures of mean estimated gestational age at birth were greater than 36 weeks, the control group showed significantly better outcomes than the magnesium group. No statistically significant differences were shown in any infant outcome measures. Cotton, Strassner, Hill, et al. 120 and Cox, Sherman, and Leveno 124 found no significant differences in delivery or infant outcomes.
The remaining six RCTs compared magnesium treatment with beta-mimetic treatment, 122 138 150 calcium channel blocker therapy, 162 and treatment with an NSAID. 167 Of the three studies comparing magnesium to a beta-mimetic, one found statistically significant results: Hatjis, Swain, Nelson, et al. 138 compared treatment with ritodrine alone with treatment with ritodrine and magnesium sulfate and found the combination therapy to be significantly more likely to prolong pregnancy by 1 week or more. No significant findings were uncovered in relation to an NSAID. 167
A nonrandomized control rial conducted by Chau, Gabert, and Miller 128 supported the finding of the efficacy of magnesium treatment in relation to beta-mimetic pharmacotherapy. Significant differences were shown in mean prolongation weeks, delivery at term, and mean birth weight. A limitation of this study, however, was that patients who failed in their initial treatment were moved to the other treatment group, thus confounding the results. A retrospective cohort study of 61 women 152 combining terbutaline, ritodrine, and magnesium sulfate treatment found that women who were able to persist with treatment relative to those who were forced to stop because of complications were significantly less likely to give birth at less than 36 weeks and their babies had shorter hospital stays. Another retrospective study 142 comparing a treatment combining magnesium and a beta-mimetic with only beta-mimetic treatment found no differences in effectiveness.
One small retrospective study of 44 women conducted by Lewis, Grimshaw, Brooks, et al. 165 compared the combination of magnesium sulfate and indomethacin, an NSAID, with magnesium alone. They found the combination therapy to be associated with greater mean prolongation days.
As shown in Evidence Table 6, the literature meeting the inclusion criteria for NSAIDs was small: five RCTs, one nonrandomized controlled trial, one retrospective study, and two case series analyses. The only RCT that did not compare NSAID treatment with beta-mimetic treatment was conducted by Morales and Madhav 167 and compared NSAID and magnesium regimens. The nonrandomized trial 146 compared ethanol treatment with treatment with a combination of ethanol and an NSAID or treatment with a beta-mimetic. The retrospective cohort study 165 compared pharmacotherapy with magnesium, an NSAID, and a beta-mimetic with treatment with magnesium and a beta-mimetic. The two case series reviews 168 169 did not include comparison groups. These were almost all small studies; all the investigations reporting statistical comparisons had sample sizes of fewer than 100 subjects.
All three RCTs defined preterm labor, but only Kurki, Eronen, Lumme, et al. 129 defined successful tocolysis. Women with ruptured membranes were excluded from these studies. In two of the three RCTs, the physician was masked as to which patients received the intervention treatment. 118 129
Gamissans, Canas, Cararach, et al. 118 found treatment with an NSAID and a beta-mimetic relative to a beta-mimetic and a placebo to be related significantly to women being more likely to deliver at 37 or more weeks. Infants born to mothers receiving combination therapy had greater birth weights and higher APGAR scores at 1 minute. Kurki, Eronen, Lumme, et al. 129 also found that an NSAID showed efficacy over a beta-mimetic in relation to delaying labor 24 to 48 hours and promoting delivery at greater than or equal to 37 weeks and had fewer maternal side effects. Eronen, Pesonen, Kurki, et al. 161 also found the NSAID to provide a superior treatment. In contrast, Morales, Smith, Angel, et al. 123 found no difference between indomethacin and ritodrine in relation to prolongation of pregnancy, birth weight, APGAR scores, or neonatal death. It should be noted, however, that patients experiencing serious side effects from indomethacin were given magnesium sulfate.
The retrospective cohort study 165 found the combination of magnesium sulfate, an NSAID, and a beta-mimetic to be superior to a combination of magnesium sulfate and a beta-mimetic in mean prolongation days.
In Evidence Table 7, three RCTs compared therapy with ethanol with treatment with other classes of tocolytics. The two older studies provided insufficient statistical analyses for efficacy comparisons. 117 144 The third RCT compared ethanol with a beta-mimetic. 148 Two small observational studies explored tocolytic effects of ethanol 170 and atosiban. 171 Neither study enrolled more than 100 women, excluded preterm births arising from conditions other than preterm labor, or included statistical comparison allowing for conclusions about tocolytic effectiveness.
In the only analyzable efficacy study containing statistical analyses, Caritis, Carson, Greebon, et al. 148 found beta-mimetic treatment to be superior to ethanol in terms of mean prolongation days and for delaying birth sufficiently to allow for a full course of betamethasone for fetal lung maturation.
Seven of the RCTs compared a treatment group to a control group. 44 172 175 175 176 177 178 179 All except three studies masked the treating clinician to the protocol. 173 175 176 Most of the RCTs 44 174 175 176 177 179 180 181 found no efficacy differences in treatment. In contrast, Brown and Tejani 172 in a small study found beta-mimetic treatment to show efficacy in terms of mean prolongation days and infant birth weight. In a more recent and larger study of more than 200 women, Rust, Bofill, Arriola, et al. 178 found no differences between groups in delivery outcomes; however, the length of hospital stay for the infant was shorter in the control group than in the group treated with magnesium or terbutaline. This was the only study to include a pill count to evaluate compliance with treatment.
A small observational study 182 contained no statistical analysis. Another larger cohort study compared women with and without cervical change 172 and found delivery and infant outcomes to be generally superior in women without cervical change. One RCT compared two beta-mimetics, ritodrine and terbutaline and found no difference. 173
Owing to the large numbers of categories and the multiple classes of agents assessed in this evidence report, we have chosen to summarize the harms data graphically in the form of "bubble charts." Tables specifying the individual rates of the harms in each study would be unwieldy, so we believe that these figures will convey the information more readily. The articles included in this analysis appear in the tocolytic evidence tables, but we also included additional articles that had not met our inclusion criteria for the benefits analysis.
The graphs give the rates of each of the categories of harm within each class of tocolytic medication: beta-mimetics, calcium channel blockers, NSAIDs, magnesium sulfate, ethanol, and atosiban. Each circle within the graph represents a single study. The size of the circle is proportional to the number of subjects receiving the medication (or placebo) in the study. A study with 100 subjects would have a circle twice as large as a study with 50 subjects, assisting the reader in giving more "weight" to larger studies.
We listed as "harms" clinical markers and events that the authors of the individual studies considered as adverse events or "side effects." In general, these adverse events represent process measures of somewhat unclear significance for the long-term health of the mother and infant. For example, the hypokalemia that is sometimes found after treatment with beta-mimetics may have no clinical consequences but may be associated with increased risk of cardiac arrhythmia. This level of detail generally is not available in this literature. Confidence intervals are generally not provided in the articles, and statistical interpretation is often difficult. Our summary of the literature, by definition, cannot be more detailed than the detail in the studies performed.
We were unable to form synthetic estimates across studies of the rates of treatment harms. The definitions of the clinical categories often were vague or not provided at all in the source articles, which would lead to problems if we assumed that the clinical categories actually were equivalent. Not all categories of medications were represented on every graph. For example, data on "metabolic harms" (e.g., elevated blood glucose, hypokalemia) were collected only for beta-mimetics.
Cardiac side effects, including arrhythmias, heart failure, and chest pain, are well known to occur with the beta-mimetics, and these findings were confirmed by the graph in Figure 11
The pattern for minor cardiovascular harms was similar to that for the major harms, with higher rates in patients taking the beta-mimetic class of medications (Figure 12
As shown in Figure 13
Many agents, including placebo, are associated with gastrointestinal harms (Figure 14
The category of "psychologic" harms encompasses symptoms that are relatively nonspecific (Figure 15
The overall picture for maternal harms is one of more frequent occurrence of harms in users of beta-mimetic medications, especially for the cardiac, metabolic, and psychologic categories. We note, however, that these harms also were searched for more frequently in studies of these agents.
The articles were often unclear as to whether a single patient could appear more than once within a single category of harm. For example, if a single patient had tachycardia and palpitations, she could be counted in each category. Because we saw both of these effects as belonging to "minor maternal cardiovascular harms," the summary graphs may be subject to a degree of double counting that could bias the study outcome by raising somewhat the estimate of harms. Conversely, the reviewers for this report were impressed that many of the studies did not search adequately for maternal harms associated with tocolytic treatment, which would tend to bias the study outcome by lowering the estimate of the rate of harms.
As shown in Figure 16
Intraventricular hemorrhage (IVH) is an uncommon but potentially serious neonatal complication, with increased risk among premature infants (Figure 17
We found little evidence of a difference among the treatment groups in the risk of necrotizing enterocolitis (NEC) (Figure 18
Estimates of sepsis in the literature were hindered by definitional problems of "sepsis." Some studies required a positive blood culture, others defined sepsis clinically, still others had categories of "possible sepsis." The study with a category of "possible sepsis" did exhibit the highest rates of this harm. Overall, we found no pattern of differential neonatal harm among these agents (Figure 19
The estimation of fetal/neonatal tachycardia is hampered by lack of a consistent definition of tachycardia, which likely contributed to the wide range of estimates. Some studies gave no specific heart rate but rather simply reported a percentage for tachycardia. As shown in Figure 20
The fetal/neonatal harms literature provided little consistent evidence of short-term harms to the infants of women receiving tocolytic agents. Insufficient evidence was available from which to draw conclusions regarding longer term effects of tocolytic medications on the health of children.
The analysis did not address the potential harms associated with the use of multiple classes of tocolytic medications during a single pregnancy. A few studies used such regimens, generating the challenge of differentiating the effects of single agents compared with those of multiple agents. One would expect that the harms associated with the use of multiple agents would be at least as great as those of both agents used alone.
Observational studies consistently support the theory that genital tract infections are associated with preterm birth. 188 Histologic evidence of acute inflammation is found in up to 50 percent of placentas after delivery before term. 189 Up to two-thirds of women who present with preterm labor have positive amniotic fluid cultures, and their fluid harbors an array of organisms. 190 These empirical observations have prompted researchers to hypothesize that treating women who present with preterm labor symptoms may prevent preterm birth.
RCTs have been designed and completed to test this hypothesis. The 15 trials and 1 observational study that met the entry criteria are summarized in Evidence Table 12.
The articles reviewed consistently report three outcomes: (1) prolongation of pregnancy after enrollment, (2) mean gestational age at delivery, and (3) preterm delivery at a specific cutoff point, usually less than or equal to 37 weeks. Of the 15 RCTs reviewed, two consistently found improvement in all three of these outcome measures, 191 186 but five found no improvement in any of the three measures. 193 194 195 196 197 Five studies found a significant increase in pregnancy prolongation, 191 192 198 199 200 and three studies found a diminution in the number of preterm deliveries. 192 198 201
Thus, the results of this research are mixed and definitive statements are hard to make about antibiotic efficacy in gravidas with preterm labor symptoms. This mixture of positive and negative results emphasizes the need for meta-analysis; Chapter 4 discusses the results of the meta-analysis on antibiotics conducted as part of this review.
Of the 15 RCTs reviewed, none used the same definition of symptoms to label a woman as having preterm labor. The array of criteria used included frequency of contractions, intensity of contractions, duration of contraction, and a wide range of measures of cervical dilation. Four trials used particularly broad definitions such as "painful" contractions, 191 dilation greater than or equal to 4 cm with no lower limit on contraction frequency, 201 contractions less than 10 minutes apart for more than 60 minutes, 192 and "preterm labor." 202 Lack of a standard, reproducible definition for preterm labor detracts from our ability to compare studies.
Study enrollment was quite varied in these 15 trials. Only one trial had more than 200 subjects (n=277), 194 and five trials enrolled fewer than 100 women. 196 197 200 202 203 Smaller samples force a reader to look at surrogate outcomes (e.g., pregnancy prolongation or gestational age at delivery) because they lack the power to show statistically that antibiotic therapy improves or worsens perinatal outcome. Moreover, clinically significant findings may not be discernable when statistically insignificant results are reported from studies with small sample sizes. Romero, Mazor, Oyarzun, et al. 188 calculated that their multicenter study of antibiotic efficacy, the largest study we reviewed, with 277 participants, had insufficient power to detect a difference in perinatal outcome. These investigators estimated that a sample size of 2,270 subjects would be required to detect a 33 percent reduction in the event rate between the placebo group and the antibiotic group.
Complicating the interpretation of this literature further is the array of cointerventions used in the management of women with symptoms of preterm labor. All but one study used tocolytics, 197 and the majority of studies allowed clinicians to prescribe more than one tocolytic. Likewise, the use of corticosteroids and nonpharmacologic interventions (e.g., hydration or bed rest) was ubiquitous throughout these studies. In no instance were regression techniques included to control for the potential confounding that could be caused by the use of these cointerventions and to support the finding that the treatment effect could be attributed to antibiotic therapy.
In lieu of adequate numbers to assess perinatal outcome (which, given the rarity of perinatal death and serious morbidity, would require enrolling thousands of women), survival analysis would be an additional analytic technique for assessment of interventions in women with preterm labor. Despite the advantages of survival analysis, only three trials 193 194 200 employed the technique. One of these trials showed a treatment benefit; 200 the other two did not. 193 194
A certain number of preterm births subsequent to assignment of the diagnosis of preterm labor in women with symptoms of labor will be medically indicated. Thus, treatment trials in women with preterm labor symptoms should exclude medically indicated preterm deliveries from their analysis or report on them separately. To our knowledge, none of these studies took this approach, and their failure to account for the etiologic heterogeneity of preterm birth may result in erroneous conclusions.
No two trials used the same antibiotic regimen. Beta-lactam, cephalosporin, macrolide, and metronidzole antibiotics were given to women with symptoms of preterm labor. Routes of administration and duration of use varied widely from study to study. Thus, there is insufficient consistency to determine whether any conclusions reached in relation to the results shown in this literature would apply to antibiotics in general or to the particularly convincing regimen used in an individual study.
In any study of a pharmaceutical intervention, bias is controlled by recording compliance with treatment in the study population and searching for potential confounding by recording differences in compliance. Our review of these 15 RCTs discloses only two studies that measured compliance. 194 203
A limitation of all but two trials is gestational age at enrollment. The prevalence of positive amniotic fluid cultures-a direct measure of in utero infection-and histologic chorioamniotis-an indirect measure-vary inversely with gestational age. 194 204 Thus, gravidas at earlier gestational ages are more likely to have infection as an etiology and, in theory, should be more likely to benefit from antibiotic treatment. The converse should be true about women who present with preterm labor symptoms at later gestational ages.
The final limitation is generalizability of these findings to the universe of women with symptoms of preterm labor. Three studies have dropout rates of greater than 20 percent, 191 198 203 and two studies have insufficient data to assess dropout rates. 194 201 In addition, only five studies reported the number of women with preterm labor symptoms who were screened to find the enrolled population. 192 193 194 195 The rates of study participation range from 13 percent 194 to 50 percent. 199 Such low enrollment figures may result in participants' being unrepresentative of the population of women with symptoms of preterm labor.
Four studies of home uterine activity monitoring met the criteria for inclusion in this study. 205 206 207 208 All four studies measured deliveries at specific dates as their primary outcomes of interest. Of these four studies, one concluded that home uterine activity monitoring with daily nursing support was effective in reducing preterm deliveries, defined in that study as those before 37 weeks. 206 The authors of this study reported that among women experiencing recurrent PTL, 47 percent of patients enrolled in the home uterine activity monitoring group delivered before 37 weeks, compared with 84 percent in the "standard-care" group (p=0.025). The mean gestational age for the group receiving home uterine activity monitoring plus nursing support was 36.1 weeks compared with 33.9 weeks, but that difference was not reported to be significant. Standard care did not include nursing support, so it is unclear whether the home uterine activity monitoring device or the nursing support may have been the mitigating factor in reducing preterm births. Among those patients whose preterm labor recurred, 32 percent of the group receiving standard care experienced failed tocolysis at first recurrence compared with none of the group receiving home uterine activity monitoring plus nursing care (p=0.02). The authors attributed this difference to earlier detection of preterm labor in the home uterine activity monitoring group. Neonatal outcome results are not provided in this study, so it is impossible to review the effects that either of these results had on infant health.
The other three studies we evaluated attempted to equalize non-home uterine activity monitoring interventions (e.g., nursing contact or education) in the monitored and unmonitored groups to isolate the effects of the monitoring device. None of these studies found that monitoring improved outcomes or reduced preterm births. One study compared the use of monitoring (without added nursing contact) to standard care. 207 This research team gave the same instructions regarding signs and symptoms of preterm labor and the circumstances under which women should contact a physician, to both study groups. All women were taking terbutaline and had weekly cervical exams. Subjects in the monitoring group monitored themselves for 1 hour per day and transmitted the results to a Tokos Monitor Center perinatal nurse. Any woman experiencing four or more contractions per hour was remonitored immediately and referred to the hospital unless the contractions subsided.
None of the outcomes collected by Nagey, Bailey-Jones, and Herman 207 showed any statistically significant difference between the monitored and unmonitored group, including delivery after week 37, delivery after week 34, or delivery after week 32. The risk ratio for preterm delivery for the monitored patients was 1.08 (p=0.79). Eleven women in the monitored group and five in the unmonitored groups experienced a nondelivery readmission, this difference also was insignificant. Differences in cervical examination upon admission (cervical dilation, cervical effacement, and station) were insignificant.
Iams, Johnson, and O'Shaughnessy 205 compared self-palpation with home uterine activity monitoring in the presence of equal education and nursing contact. Subjects in both groups called the monitoring center if they noticed symptoms of preterm labor and were asked to palpate or record uterine activity whenever they noted these symptoms or an increase in perceived contraction frequency. Again, no significant differences between the two groups were noted in outcomes (gestational age, delivery less than 36 weeks, delivery less than 36 weeks with recurrent labor, term delivery, and birth weight). The authors suggested that, although the home uterine activity monitoring device per se did not have any effect on outcomes, the relevant part of the home uterine activity monitoring system as a whole may be the nursing contact, which, in this study, was provided to both the treatment and control groups, although the sample size in this study is too small to draw this conclusion with certainty.
Finally, Brown, Britton, et al. 208 also compared home uterine activity monitoring to no monitoring in groups of women receiving identical care other than monitoring, including daily perinatal telephone contact by a nurse. All women in this study also received oral terbutaline. Both groups received education that included recognizing the signs and symptoms of preterm labor and instructions on prenatal followup. The monitored patients transmitted a uterine activity monitoring strip to the Tokos Monitor Center twice daily until 37 weeks' gestation or until instructed to discontinue monitoring. This research team found no significant differences in gestational age at delivery, in delivery at specific endpoints (less than 35 weeks, 35 to 37 weeks, greater than 37 weeks), or in neonatal outcomes (birth weight, admissions or days in neonatal intensive care units, or mechanical ventilation). In a logistical regression model predicting delivery at less than 35 weeks, the authors found that only cervical dilation at study entry was significantly predictive. Women with cervical dilation (2 cm at enrollment) had an odds ratio of 4.35 for delivery at less than 35 weeks' gestation compared with women with cervical dilation of less than 2 cm. Readmissions necessitating additional magnesium sulfate treatment did not differ between the groups, nor did the numbers of women with one unscheduled hospital observation lasting less than 24 hours.
Harms of home uterine activity monitoring were not studied in this analysis; however, because home uterine activity monitoring is not invasive, risks of the system are limited to interventions that might be used with it, or that might be used upon identifying an increase in contractions (i.e., tocolysis, corticosteroids). Nagey, Bailey-Jones, and Herman 207 studied hospital readmission, which would presumably include additional treatment, and found no significant difference between the two groups. Watson, Welch, Mariona, et al. 206 also found no significant difference in the numbers of patients readmitted for recurrent preterm labor.
An important limitation in much of the home uterine activity monitoring literature in general is the mixing of the effects of the device and the accompanying nursing support. As noted earlier, Watson, Welch, Mariona, et al. 206 did not separate out these effects, making it difficult to compare their study with others. Iams, Johnson, and O'Shaughnessy 205 did not compare home uterine activity monitoring to standard care; rather they specifically were interested in comparing the device to self-palpation. This design would not provide information about whether the system might provide benefit compared with general practice.
Each study used relatively small sample size, although the authors addressed this limitation adequately in their texts. We have attempted to address this issue as well by performing a meta-analysis of the three more comparable studies.
Only Nagey, Bailey-Jones, and Herman 207 described how they developed their randomization schedule, so we are unable to comment on the appropriateness or lack thereof of the other studies. None of the studies presented an intent-to-treat analysis, which would have been useful to evaluate fully the likely impact of home uterine activity monitoring in a clinical setting; however, because these results were generally negative, such analysis would simply have magnified that effect.
Although each study reported data for preterm births, data were provided for few maternal and neonatal outcomes. Only Brown, Britton, Brizendine, et al. 208 gave information beyond birth weight for neonatal outcomes: They found no significant differences in care provided to infants whose mothers had or had not received home uterine activity monitoring.
Although a few trials of both first-line and maintenance tocolytic treatment for women in preterm labor had significant, positive effects, as discussed in Chapter 3 and presented in the evidence tables, most found the different tocolytics to be equally effective and to have little benefit, if any, over placebo or no tocolytic therapy. Similarly, a few trials of adjunctive antibiotic therapy also found significant effects of antibiotic treatment over placebo or no adjunctive antibiotic therapy, but again most of these trials found no statistically significant effects. Virtually no benefits were found in the trials of home uterine activity monitoring compared with trials involving no monitoring of women in preterm labor; however, many of the trials for these interventions had so few patients enrolled that only very large differences would have been statistically significant.
To determine whether the lack of significance in these various studies resulted
from the small sample sizes or negligible impact of the interventions, we
conducted a series of meta-analyses, statistically combining results from
similar trials into summary measures. These meta- analyses were focused on the
efficacy of the interventions for preterm labor as measured by one or more of
the following outcomes: (1) prolongation of pregnancy, (2) estimated gestational
age at delivery, and (3) birth weight. These outcomes were most frequently
reported in the abstracted studies. In particular, for each group of
interventions first-line tocolytics, maintenance tocolytics, adjunctive
antibiotics, and home uterine activity monitoring we sought answers to the
following questions:
Does the intervention result in significantly prolonged pregnancies compared to placebo or no intervention for women in preterm labor? This outcome is measured in days of increased pregnancy.
Does the intervention result in deliveries at significantly later gestational age compared with placebo or no intervention for women in preterm labor? This outcome is measured in weeks of gestation.
Does the intervention result in higher birth weight babies compared with placebo or no intervention for women in preterm labor? This outcome is measured in grams or kilograms.
We first reviewed the completed evidence tables for the various interventions to identify studies for inclusion in the meta-analyses. We included only RCTs. Results from prospective cohort studies, case series, retrospective analyses, and other nonrandomized trials were excluded from the analyses. For inclusion in the meta-analyses, the trials also had either to be restricted to women with intact membranes or to have reported the outcomes separately for this subgroup.
Furthermore, only RCTs making comparisons of interest were considered for inclusion. For adjunctive antibiotic and home uterine activity monitoring trials, the comparison of interest was between standard treatment with the intervention and standard treatment with either placebo or no intervention. All antibiotic and home uterine activity monitoring RCTs had at least one intervention arm and one arm without the intervention and, therefore, were potential candidates for the meta-analyses.
By contrast, the tocolytic trials frequently compared one tocolytic agent with another; only a small number of trials included a placebo or an arm with no tocolytic treatment. Because the various tocolytics work through different mechanisms, we were interested in comparisons among different classes of tocolytics; therefore, tocolytic trials were considered for the meta-analyses if they compared one class of tocolytics with another and/or to placebo or no tocolytic treatment. The tocolytic classes included were beta-mimetics, calcium channel blockers, magnesium sulfate, NSAIDs, and ethanol. Because trials comparing one beta-mimetic with another one calcium channel blocker with another added no additional information for the comparisons of interest, they were excluded from the meta-analyses unless they also had an arm with another class of tocolytic or a placebo or no tocolytic arm.
Of all the trials considered for these analyses, 23 trials of first-line tocolytics, 10 trials of maintenance tocolytics, 13 trials of adjunctive antibiotic treatment, and 3 trials of home uterine activity monitoring met the inclusion criteria. We then extracted the information on the outcomes of interest for each trial.
Although the three outcomes of interest were the most frequently reported, they were by no means universally or uniformly reported. For prolongation of pregnancy, some trials reported the mean number of days that pregnancy was prolonged as well as either the standard deviation, the standard error, or both. Others reported only the means. Others reported the number or percentage of pregnancies prolonged for various durations (e.g., <2 days, >7 days, <14 days). Similarly, some trials reported mean gestational age at delivery with or without standard deviations or standard errors; others reported the number or percentage of deliveries at or beyond different gestational age cutoffs (e.g.,>32 weeks, >36 weeks, >37 weeks). Still others reported mean birth weight with or without standard deviations or standard errors and a few reported the percentage of births greater than or equal to 2500 g. Some trials neglected to report any data on one or more of these outcomes altogether. Thus, the actual number of trial results that could be combined for any particular analysis was smaller than the number of trials meeting the inclusion criteria.
Where possible, we used the continuous measures of the outcomes of interest
and combined the mean differences, d, across studies:
d = Mt -
Mc
where M
t and M
c are the sample means of the treated and control arms,
respectively. Assuming a normal distribution for the individual observations
with equal variances in each arm of the trial, the variance of the
difference, S
2, is computed:

where SEM
t
2 is the standard error of the mean of the treated arm and
SEM
c
2 is the standard error of the mean of the control arm. Thus, to
use this measure, the sample size, means, and standard errors (or standard
deviations) must be available from the reported information. For analyses in
which too few studies reported means with standard deviations or standard
errors, we computed the odds ratio, ω, for selected cutoff values of the
outcome measures. We could then use this value as the effect measure for
combining results across studies:
w =
Pt(1 -
Pc)/Pc(1 -
Pt)
where p
t is the proportion of the treated group that successfully
reached the cutoff value and p
c is the proportion of the control group that successfully
reached the cutoff value.
To manage the multiple treatment problem of first-line tocolytic treatments in which we had five classes of tocolytics to compare with placebo/no tocolytics, we used the generalization of the two-arm methods developed by Hasselblad.209 For all interventions and outcomes, we combined the results using a Bayes random effects model as described by Hedges and Olkin,210 allowing the true treatment effect to vary across studies. For dichotomous outcomes, we estimated the model parameters with the EGRET software package; for continuous measures, we used a customized program developed by one of the authors, Victor Hasselblad, Ph.D.
In addition to the meta-analysis results on the three main questions noted earlier (e.g., estimates of prolongation of gestation), we report the results of tests of homogeneity for the four main sets of analyses (both types of tocolytics, antibiotics, and home uterine activity monitoring). These tests indicate the extent to which the trials are sufficiently alike to permit them to be combined with confidence, and the possibility that various unknown factors are present that have not been adequately controlled for.
| Study | Placebo | Beta- Mimetics | Calcium Channel Blockers | Magnesium Sulfate | NSAIDs | Ethanol |
|---|---|---|---|---|---|---|
| Merkatz, et al.,136 1980 | Yes | Yes | No | No | No | No |
| Larsen, et al.,121 1986 | Yes | Yes | No | No | No | No |
| Leveno, et al.,151 1986 | Yes | Yes | No | No | No | No |
| Guinn, et al.,133 1997 | Yes | Yes | No | No | No | No |
| Meyer, et al.,126 1990 | No | Yes | Yes | No | No | No |
| Kupferminc, et al.,130 1993 | No | Yes | Yes | No | No | No |
| Jannet, et al.,134 1997 | No | Yes | Yes | No | No | No |
| Papatsonis, et al.,159 1997 | No | Yes | Yes | No | No | No |
| Garcia-Velasco and Gonzalez,153 1998 | No | Yes | Yes | No | No | No |
| Wilkins, et al.,122 1988 | No | Yes | No | Yes | No | No |
| Kurki, et al.,129 1991 | No | Yes | No | No | Yes | No |
| Castren, et al.,144 1975 | Yes | Yes | No | No | No | Yes |
| Lauersen, et al.,117 1977 | No | Yes | No | No | No | Yes |
| Caritis, et al.,148 1982 | No | Yes | No | No | No | Yes |
| Glock and Morales,162 1993 | No | No | Yes | Yes | No | No |
| Floyd, et al.,163 1995 | No | No | Yes | Yes | No | No |
| Treatment | Estimated Odds Ratio | Standard Error of Log | 95% Confidence Interval |
|---|---|---|---|
| Beta-mimetics | 1.622 | 0.147 | 1.216, 2.164 |
| Calcium channel blockers | 2.485 | 0.237 | 1.562, 3.954 |
| Magnesium sulfate | 1.867 | 0.231 | 1.187, 2.936 |
| NSAIDs | 4.948 | 0.562 | 1.645, 14.89 |
| Ethanol | 0.949 | 0.263 | 0.567, 1.589 |
The results of the meta-analysis are presented in Table 14 and are shown graphically in Figure 21
We conducted an overall test for goodness of fit, which gave a chi-square of 11.476 for 13 degrees of freedom, p=0.571. Thus, there was no evidence of lack of fit. We also conducted a test to see if beta-mimetics, calcium channel blockers, and magnesium sulfate had the same effects. This calculation gave an overall chi-square test of 8.762 for 2 degrees of freedom, p=0.0125. Thus, there is evidence that these three tocolytics may have a differential effect from each other relative to placebo in terms of increasing the number of births at or beyond 36 to 38 weeks gestation.
| Treatment | Endpoint (units) | Estimated Difference | 95% Confidence Interval |
|---|---|---|---|
| Beta-mimetics | Prolongation (days) Gestational age (weeks) Birth weight (grams) | -0.68 -0.46 -16.4 | -4.68, 3,33 -1.03, 0.11 -189.0, 155.3 |
| Magnesium sulfate | Prolongation (days) Gestational age (weeks) Birth weight (grams) | -3.31 -0.55 -202.8 | -9.67, 3.06 -1.38, 0.28 -405.7, 0.1 |
| Treatment | Endpoint (Units) | Chi-square for Homogeneity | Degrees of Freedom | p-Value |
|---|---|---|---|---|
| Beta-mimetics | Prolongation (days) Gestational age (weeks) Birth weight (grams) | 7.57 0.67 46.27 | 6 5 7 | 0.272 0.985 <0.001 |
| Magnesium sulfate | Prolongation (days) Gestational age (weeks) Birth weight (grams) | 0.43 0.58 0.07 | 1 1 1 | 0.512 0.445 0.790 |
| Study | Estimated Prolongation (Days) | Standard Error | 95% Confidence Interval |
|---|---|---|---|
| Morales, et al.,198 1988 | 13.55 | 3.39 | 6.898, 20.20 |
| Nadiasauskiene, et al.,191 1996 | 10.50 | 6.91 | -3.067, 24.07 |
| McGregor, et al.,199 1991 | 9.90 | 4.44 | 1.174, 18.63 |
| McCaul, et al.,203 1992 | 0.50 | 8.53 | -16.34, 17.34 |
| Newton, et al.,195 1989 | 0.10 | 4.73 | -9.200, 9.400 |
| Gordon, et al.,193 1995 | -0.10 | 4.29 | -8.527, 8.327 |
| COMBINED | 6.42 | 2.63 | 1.266, 11.56 |
| Study | Increase in Birth Weight (kg) | Standard Error | 95% Confidence Interval |
|---|---|---|---|
| Svare, et al.,192 1997 | 0.292 | 0.170 | -0.04241, 0.6264 |
| Morales, et al.,198 1988 | 0.288 | 0.128 | 0.0375, 0.5385 |
| Norman, et al.,200 1994 | 0.225 | 0.145 | -0.060, 0.5104 |
| Cox, et al.,197 1996 | 0.169 | 0.176 | -0.1739, 0.5119 |
| McGregor, et al.,1991991 | 0.145 | 0.135 | -0.1195, 0.4099 |
| Newton, et al.,1961991 | 0.135 | 0.181 | -0.2174, 0.4874 |
| Nadisauskiene and Bergstrom201 1996 | 0.056 | 0.104 | -0.1476, 0.2596 |
| Gordon, et al.,1931995 | 0.026 | 0.124 | -0.2177, 0.2697 |
| Newton, et al.,1951989 | 0.008 | 0.134 | -0.2547, 0.2707 |
| McCaul, et al.,2031992 | -0.059 | 0.247 | -0.5481, 0.4295 |
| Oyarzun, et al.,2111998 | -0.063 | 0.108 | -0.2748, 0.1488 |
| Romero, et al.,1941993 | -0.148 | 0.092 | -0.3287, 0.0327 |
| COMBINED | 0.068 | 0.045 | -0.01938, 0.1558 |
| Study | Increase in Gestation Age (Weeks) | Standard Error | 95% Confidence Interval |
|---|---|---|---|
| Nadiasaukiene, et al.,191 1996 | 2.80 | 1.02 | 0.7942, 4.806 |
| Norman, et al.,200 1994 | 1.30 | 0.82 | -0.3110, 2.911 |
| Newton, et al.,196 1991 | 1.00 | 0.90 | -0.7581, 2.758 |
| Morales, et al.,198 1988 | 0.95 | 0.41 | 0.1380, 1.762 |
| McGregor, et al.,199 1991 | 0.50 | 0.66 | -0.8040, 1.804 |
| Gordon, et al.,193 1995 | 0.10 | 0.55 | -0.9718, 1.172 |
| Cox, et al.,197 1996 | 0.10 | 0.95 | -1.765, 1.965 |
| Newton, et al.,195 1989 | 0.00 | 0.60 | -1.174, 1.174 |
| McCaul, et al.,203 1992 | -1.10 | 1.01 | -3.093, 0.8931 |
| COMBINED | 0.59 | 0.24 | 0.1198, 1.057 |
| Endpoint (Units) | Estimated Difference (treated - control) | 95% Confidence Interval |
|---|---|---|
| Gestational age (weeks) | 0.19 | -0.37, 0.76 |
| Survival to 37 weeks (odds ratio) | 1.22 | 0.73, 2.05 |
| Birth weight (grams) | -47.0 | -201.1, 107.1 |
| Endpoint (units) | Chi-square for Homogeneity | Degrees of Freedom | p-Value |
|---|---|---|---|
| Gestational age (weeks) | 0.83 | 2 | 0.660 |
| Survival to 37 weeks (odds ratio) | 0.44 | 1 | 0.508 |
| Birth weight (grams) | 1.88 | 2 | 0.391 |
We conducted a series of meta-analyses on the efficacy of various interventions for women with suspected preterm labor. These interventions included first-line tocolytics, maintenance tocolytics, adjunctive antibiotics for occult in utero, and home uterine activity monitoring. We restricted the analysis to evidence from RCTs for women with intact membranes and focused on three outcome measures: (1) prolongation of pregnancy in days, (2) gestational age at delivery, and (3) birth weight.
We found 23 first-line tocolytic studies, 10 maintenance tocolytic studies, 13 antibiotic studies, and 3 home uterine activity monitoring studies meeting our inclusion criteria. Because of the lack of complete and consistent reporting of outcomes, the meta-analyses for each intervention were conducted on fewer studies.
For first-line tocolytics, we first combined studies within tocolytic class and compared the results with placebo/no tocolytics. The only outcome measure with evidence from an adequate number of trials was the fraction of births occurring at term, defined as at or beyond 36 to 38 weeks' gestation. We found all tocolytic classes, except ethanol, were effective in increasing gestational age at birth. Beta-mimetics, calcium channel blockers, and magnesium sulfate nearly doubled the odds of birth at term, relative to control, with small but significant differences in effect size found between classes of drug.
We also found adjunctive antibiotic treatment to have small, beneficial effects for women in preterm labor with suspected occult in utero infection. We estimated increases of 6 days in the length of pregnancy, 0.6 of a week (i.e., slightly more than 4 days) in gestational age at delivery, and 70 g of weight at birth. The increase in birth weight was not found to be statistically significant.
Finally, we found no statistically significant effects from maintenance tocolytic treatment or of home uterine activity monitoring. For home uterine activity monitoring, however, no more than three trials were combined for estimates of these effects, and all trials entered few patients; therefore, the results should be interpreted with caution.
We discuss below the overall conclusions and implications of our analysis combining information from the evidence tables and the results in Chapter 3 and the meta-analyses discussed in Chapter 4. We begin by presenting our system for grading the strength of the information on the efficacy or effectiveness of treatment or interventions and harms data. Subsequent sections present our findings on individual topic areas, corresponding to our key clinical questions concerning the management of preterm labor (in order by biologic markers, tocolytics, antibiotics, and home uterine activity monitoring). Within each key therapy section, we present our evaluative grade for that body of literature. Finally, we discuss concerns relating to the literature as a whole.
Chapter 2 introduced the discussion of our grading system for the preterm labor literature. Our overall approach for assigning categorical grades or ratings takes into account the quality of individual articles as scored according to the scale described in Chapter 2 as well as the strength and consistency or homogeneity of the findings across studies regarding a specific topic. The grades are defined as indicated below; the first set concerns benefits and the second set, harms.
We assign four basic grades for benefits. The first three grades indicate that the quantity of data is sufficient to make a judgment and that the overall data are good, fair, or poor in terms of supporting the conclusion that the intervention (e.g., pharmaceutical agent or biologic marker) is better than an alternative intervention or placebo. The fourth grade indicates that the quantity and quality of data are insufficient to draw any conclusions, chiefly because of small sample sizes or poor methods in the studies reviewed. The specific category definitions are as follows:
| Good (A): | The data are sufficient for evaluating the quality and strength of the findings. The data are consistent and indicate that the key drug or intervention is superior to alternative treatment or placebo for treating women with preterm labor. |
| Fair (B): | The data are sufficient for evaluating the quality and strength of the findings. The data indicate that inconsistencies exist in the findings about the key drug or intervention and alternative treatment or placebo for treating women with preterm labor. |
| Poor (C): | The data are sufficient for evaluating the quality and strength of the findings. The data do not show that the key drug or intervention is superior to alternative treatment or placebo for treating women with preterm labor. |
| Incomplete evidence (I): | The data are insufficient for assessing the quality of the key drug or intervention for treating women with preterm labor based on limited sample size or poor methodology. |
| The sources of information leading to these categories are the following: | |
| Efficacy (1): | Evidence obtained from at least one well-designed RCT. |
| Effectiveness (2): | Evidence obtained from more than one well-designed cohort or case-control study. |
We assigned two categorical ratings to the overall harms data reported for tocolytics in Chapter 3, "low" or "high." The rating reflects our judgment of the probability of risk from the intervention in question. The specific category definitions are as follows:
Low probability of risk:
- The side effects of treatment with the specific tocolytic that we noted from our review of articles were short term and not life-threatening.
- Considering the body of evidence, the frequency of occurrence was not substantially different from the frequency for an alternative tocolytic treatment or no treatment.
"High" probability of risk:
- The side effects of treatment with the specific tocolytic that we noted from our review of articles were life-threatening.
- Considering the body of evidence, the frequency of occurrence was substantially different from the frequency for an alternative tocolytic treatment or no treatment.
The key question for this topic was: What are the appropriate criteria for diagnosis of preterm labor? Relatedly, how much positive or negative predictive value does the use of biologic markers add to clinical opinion in diagnosing preterm labor?
Overall, the evidence-based review demonstrated that the two biologic markers studied- fFN and EVUSD-present strong evidence of effectiveness as diagnostic tools for assessing the risk of preterm birth in women with symptoms of preterm labor. Because of the lack of RCTs for biologic markers, we could not evaluate the efficacy of this intervention. Based on the relatively high quality and consistency of the data obtained from well-designed cohort studies, we assigned a quality grade for biologic markers evidence of Good (A-2).
Overall, our conclusion is that these biologic markers are untested as a means of assisting clinicians in improving outcomes in patients presenting with preterm labor. The evidence does indicate, however, that they offer valuable information that could allow women to avoid unnecessary treatments. In response to our key question, the literature supports the notion that these tests can usefully supplement clinical judgment, especially in terms of identifying women who are not likely to experience a preterm birth.
The key question for this topic was: What is the efficacy and/or effectiveness of tocolytics in managing preterm labor?
We divided our analysis into two subtopics, one concerning first-line tocolytics and the other concerning maintenance therapy for a woman who has experienced an episode of preterm labor.
The largest body of literature examined for this review concerned treatment with tocolytic agents for acute episodes of preterm labor: The tocolytics discussed included beta-mimetics, calcium channel blockers, magnesium sulfate, NSAIDs, and ethanol. We included both efficacy and effectiveness studies; research compared treatment with a particular class of tocolytic with a control group as well as with a different class of tocolytic drug. Overall, we assigned a quality grade to the first-line tocolytic literature of Fair (B-1), having determined that the data were sufficient for evaluating the quality of the findings and that the evidence comes from at least one well-designed RCT.
Literature comparing an intervention group of women receiving first-line treatment with tocolytics relative to a control group were available only for beta-mimetics, magnesium sulfate, and ethanol. Results were mixed: Some evidence of efficacy was shown in terms of estimated gestational age at delivery and prolongation of pregnancy with beta-mimetic or magnesium sulfate treatment. Only one beta-mimetic study 136 found a relationship between treatment and improved infant outcomes, showing significantly higher birth weight in the treatment arm. Data from the ethanol study 144 were insufficient to evaluate efficacy.
The largest body of literature reviewed compared the outcomes of treating women with beta-mimetics with the outcomes of treatment with other classes of tocolytics. Results again were mixed: Beta-mimetics were shown to have efficacy only relative to ethanol treatment in relation to prolonging pregnancy. 148 Other RCTs showed that all other classes of tocolytics had greater or not significantly different effects relative to beta-mimetics on birth outcomes. We also found mixed results for infant outcomes, but no study reported that beta-mimetics were a superior treatment. No other differences in efficacy were found between classes of tocolytics, and data from observational studies did not contradict the results of the RCTs.
Supplementing our qualitative review, some of the RCT data available from the individual studies were combined using meta-analytic techniques (Chapter 4). This methodology allowed us to compare all classes of tocolytics (i.e., beta-mimetics, calcium channel blockers, magnesium sulfate, NSAIDs) with a no-treatment group. The data suggest that all tocolytics, with the exception of ethanol, were effective in extending births to term (at or beyond 36 to 38 weeks gestation) compared with results in a no-treatment group. Evidence shows that there may be effect differences among beta-mimetics, calcium channel blockers, and magnesium sulfate; but more studies are needed for conclusive findings.
In sum, although inconsistencies exist in these data, overall the evidence supports the notion that treatment offers small improvements in pregnancy prolongation. If necessary, this may provide adequate time for administering corticosteroids for fetal lung development. Sufficient data were not available to determine directly whether this treatment has a directly beneficial effect on neonatal morbidity and mortality. Data concerning relative efficacy between drug classes are generally inconclusive, but they clearly support the conclusion that ethanol is less efficacious than other tocolytic options.
With the exception of one small study, 172 the efficacy studies of maintenance tocolytic literature showed no difference between treatment and control arms in managing women who had recently experienced an episode of preterm labor. These results were confirmed through our meta-analytic synthesis of results (Chapter 4 ). In terms of gestational age at birth, prolongation of pregnancy, or birth weight, no benefits from maintenance treatment were uncovered.
We graded this literature as Poor (C-1) in terms of showing efficacy or effectiveness of maintenance tocolytics. The data were sufficient for evaluating the quality of the literature, and no evidence supported the use of this therapeutic modality relative to no treatment.
The harms analysis examined tocolytic treatments for different classes of pharmaceuticals, but we did not distinguish between first-line and maintenance therapies. We assessed adverse events for both the mother and the fetus or neonate.
We graded beta-mimetics as "high" in probability of maternal risk. These drugs were shown to pose a risk to the mother of serious cardiovascular harms, minor cardiovascular harms, metabolic harms, and psychologic harms. In comparison, all other classes of tocolytic treatments were graded as "low" in relation to maternal risk. Significant levels of serious maternal complications were not shown.
We graded all classes of tocolytics as "low" risk in relation to fetal or neonatal harms. Evidence of short-term harms was inconsistent, and evidence to evaluate longer term problems was insufficient.
The evidence from our analysis supports the generally held clinical belief that ethanol is an inappropriate treatment for women with preterm labor symptoms. Other treatments offer greater efficacy at less risk. Similar concerns would seem to be warranted with regard to treatment with beta-mimetics. Although this judgment may be an artifact of the relative wealth of data on this class of tocolytics relative to others, the benefits of beta-mimetics were never found to exceed other options, and the harms were shown to be potentially more severe.
The key question for this topic was: What is the efficacy and/or effectiveness of antibiotics in managing preterm labor?
Results of our review concerning the efficacy of antibiotics for treating women with preterm labor were mixed. We graded this literature as Fair (B-1) on the basis of judgments that the data were sufficient for evaluating quality and that results from individual studies were mixed, although the meta-analysis found small, yet significant effects.
Two RCTs found improvements in all three delivery outcomes of interest: prolongation of pregnancy, mean gestational age at birth, and birth at a particular number of weeks. 191 , 192 In contrast, all other RCTs showed mixed results or no significant difference from placebo treatment. Results from survival analyses included in three RCTs were also mixed.
Our meta-analysis on six of these studies (Chapter 4) found a marginally significant increase in length of pregnancy of about 6 days. The data from nine studies combined to evaluate change in gestational age with antibiotic treatment also found a marginally significant increase of about 0.60 of a week. In contrast, no statistically significant increase in birth weight was found in the treatment relative to the control group; however, this may be an artifact of lack of power because of the small number of studies available for analysis.
The key question for this topic was: What is the efficacy of home uterine activity monitoring in decreasing adverse maternal and neonatal outcomes in women who have experienced an episode of preterm labor in the current pregnancy?
The efficacy literature on home uterine activity monitoring in the treatment of women with preterm labor comprised only four studies. We give this literature a grade of Poor (C-1) because we judged that the information was sufficient for evaluating the quality of the findings. Also, we found that the evidence does not support the use of this therapeutic modality relative to using no treatment.
Three RCTs found no significant effect from home uterine activity monitoring. Our meta-analysis (Chapter 4) confirmed these conclusions in relation to gestational age at delivery and birth weight; however, meta-analysis results should be viewed with caution as they are based on data from only three studies. The fourth RCT in our analysis reported that home uterine activity monitoring significantly prolonged delivery to 37 weeks; 206 however, this research used an expanded definition of home uterine activity monitoring that included nursing support. The authors did not disentangle the two interventions, and so it is unclear whether the outcome improvement was related to the device or the nursing support. Overall, the preponderance of the evidence points to a lack of efficacy from home uterine activity monitoring in women with preterm labor. We concluded that additional research comparing use of the intervention with a control group and confirming these results in relation to this particular patient group is not needed.
Our evidence report has identified two domains that demand attention from future investigators and funding agencies. One domain involves numerous issues regarding definitions and methods that were discussed in the opening sections of Chapter 3 on results; the other domain concerns various substantive issues that warrant specific efficacy or effectiveness and health services research. The main portion of this chapter focuses on the second domain, but first we reiterate here the main problem with the current literature and the research on which it is based. We also note areas in which we believe that further research is not likely to be productive.
Depending on the topic and the release date of the pharmaceuticals, we reviewed a literature base that could date back more than three decades. In that time, the standards for research, including the conduct of RCTs as well as case control, longitudinal cohort, and similar effectiveness studies, have risen appreciably.
Nonetheless, in our view, a good deal of work needs to be done to shore up
critical building blocks of research in this area. In particular,
researchers need to attend to three issues:
Define preterm labor explicitly and carefully.
Be clear about which cointerventions were used and when were they administered during the study.
Separately analyze outcomes for those women with medically indicated preterm births and compare the results with those for the larger group of women with preterm labor.
Perhaps the most important methodologic issue is that of measuring outcomes in trials or other studies of interventions for women in preterm labor. Our work has documented the extent to which researchers differ in their definitions of even standard outcomes. This matter is especially important if regulators (e.g., the FDA) or health plan administrators insist that an intervention must improve perinatal outcomes before it can be approved or adopted (as the FDA did with the oxytocin-agonist atosiban). However, any trial using perinatal endpoints would have to be quite large and expensive because progress in the care of tiny newborns has resulted in a relative rarity of poor perinatal outcomes; indeed, the numbers and expense of such trials are likely to be prohibitive.
Insistence on improvements in perinatal outcome is epidemiologically sound. Therefore, investigators and policymakers are likely to continue to rely on surrogates relating to delivery such as pregnancy prolongation or the number of preterm births before an arbitrary cutoff.
One way to address this problem is to improve the analysis of available data. As an answer to this problem, we have already noted the utility of survival analysis, which could provide continuous delivery outcomes stratified by gestational age at enrollment in the study. Such an approach is clinically and biologically relevant. It allows both clinicians and the public to decide whether an intervention's effects were relevant to an individual's particular situation, meaning that researchers can begin to clarify whether (and if so, to what extent) even a small delay in delivery at a particular gestational age has the potential to be significant.
Our recommendation, therefore, is that investigators, to the extent possible, supplement the use of categorical outcomes measures (e.g., preterm or term birth) and variables involving specific cutoff points for gestational age by including concepts such as the length of time a pregnancy was extended-prolongation-as a form of survival of the pregnancy. That is, we advocate for the assessment of the impact of an intervention on pregnancy prolongation through the use of survival analysis stratified by gestational age at the onset of preterm labor.
Implicit in this recommendation is an assumption that prolongation will improve perinatal outcome. This assumption has intuitive appeal given that the correlation between gestational age at birth and improved perinatal mortality and morbidity is strongly positive; however, no one knows the advantages or disadvantages of delaying delivery for an individual once preterm labor is established. Perhaps the in utero environment of the mother who is experiencing preterm labor is hostile and prolongation does more harm than good. Thus, the ultimate test for any therapeutic intervention will always be improvements in perinatal outcome.
Given the rarity of poor perinatal outcomes in an age of effective neonatal care, the need exists for composite perinatal outcomes that reflect the morbidity and mortality of premature birth. Potential measures that could be combined include perinatal death, neurologic deficit, and oxygen requirement at hospital discharge. Perinatologists could advance the field in important ways by constructing composite markers that accurately and consistently reflect the burden of being born before term. Such an approach has been useful in other fields such as cardiology. A valid composite measure could then become the outcome of choice in intervention trials.
Moreover, long-term followup of neonates is needed. Certain conditions that could be a direct harm from an intervention or a byproduct of pregnancy prolongation in a hostile environment may not become obvious until an infant matures. Followup of particular importance regarding developmental and neurologic outcomes as the fetal and neonatal brain has yet to complete development at the time of birth and hospital discharge.
This report on the evidence concerning current care of the woman who presents with labor symptoms before term provides insights and directions to investigators in this field along several dimensions. These topics, covered in the remainder of this section, include basic epidemiology, use of biologic markers and better understanding of the test characteristics of the biologic markers most likely to be used in further efficacy or effectiveness studies, and use of pharmaceuticals (both tocolytics and antibiotics). We also comment here on questions for which we believe further research is not warranted.
First, the reader of this report will appreciate the need for basic and epidemiologic research in preterm labor. In our judgment, far more work needs to be done in this area just to lay a better framework for future intervention-specific efficacy and effectiveness research. Said another way: a field of knowledge so primitive that the patient's complaints (i.e., preterm labor symptoms) are used as the diagnostic criterion for the condition possessing the same name must invest in basic and epidemiologic research.
We lack basic knowledge regarding the biologic mechanisms resulting in birth before term. Moreover, there is a dearth of knowledge regarding the incidence and prevalence of preterm symptoms and modifiable risk factors for preterm birth. The issue of etiologic heterogeneity versus homogeneity for preterm birth is especially poorly understood.
We also draw attention to the need for better understanding of these basic facts in ethnic minorities, for at least two reasons. First, the demographics of this nation are changing rapidly, including among women of child-bearing ages. Second, clinicians should have better information at hand to address the needs of women who may have low English literacy skills, and who may thus be at a disadvantage in terms of communicating their symptoms or understanding the medical management regimens being recommended to them.
Finally, African American women and their offspring face profound disadvantages in terms of preterm birth. The discrepancy between perinatal outcomes in black and white women and the mechanisms contributing to such disparities must be investigated, so that clinicians and the public gain a better understanding of the epidemiologic, social, and other causal and contributing factors.
Also, the best type of clinician and optimal setting for care of the woman with preterm labor remains to be answered. Given the ubiquitous nature of these symptoms and our inability to assess risks, the minimum level of expertise and capacity needed to triage these patients effectively is not known.
Preterm labor with subsequent preterm birth has remained an enigma to clinicians and investigators. The condition has proven remarkably resistant to the inquiry of modern science. No satisfactory "meta-narrative" or paradigm has been constructed that reconciles all the observed phenomena. From the perspective of the authors of the evidence report, the failure to solve the muddled picture is because of faulty science and inadequate resources for investigation. By implication, better science and more resources to invest in efficacy and effectiveness studies will help to end the enigma.
Conversely, preterm labor and subsequent preterm birth may be the culmination of a complex, biopsychosocial process such as poverty or obesity. A narrow scientific focus on the clinical endpoint of preterm labor and relentless pursuit of a meta-narrative threaten to direct attention and resources from the changes needed in society that might ultimately result in substantial reductions in preterm birth rates.
Additional study regarding risk factors is important, and this issue bridges the epidemiologic and clinical realms of research. Clinicians and researchers need to be able to assess accurately the risk of preterm birth in women with symptoms of preterm labor and to exclude women at low risk of early birth from intervention trials. Failure to assess risk will not only dilute the effects of the intervention in question, but also expose numerous women who are at low risk to unnecessary and potentially harmful intervention. Some investigators have tried to assess risk by constructing strict definitions of preterm labor (e.g., by incorporating a measure of contraction frequency and duration with measures obtained from digital examination of the cervix).
Biologic markers may enhance this goal because they are objective and reproducible and are superior to purely clinical assessment or prognostication of preterm birth. At least two of the biologic markers we reviewed-fFN and EVUSD-can serve in this role. Thus, we recommend that investigators studying tocolytics and antibiotics (in their capacity as part of a medical regimen for women in or suspected of preterm labor) use biologic marker negativity as an exclusion criterion.
Moreover, we advise that further clinical trials be conducted to demonstrate whether the enhanced knowledge and precision provided by biologic markers with respect to preterm birth risk actually can improve perinatal outcomes. Trials should be conducted in which clinicians are provided with biologic marker information on a subset of participants and not on others, so as to clarify the level of efficacy and usefulness of this information in "ideal" settings. This step is fundamental before the use of biologic markers is incorporated into routine care in "average" settings, so as to avoid dilemmas in which better measurement precision does not improve outcomes (as has been seen with electronic fetal monitoring).
Our meta-analytic results for the first-line tocolytics and for antibiotics (reported in Chapter 4) suggest that further trials of these modalities are needed. Use of antibiotics is almost universally recommended in women with preterm labor symptoms as a form of prophylaxis to prevent early-onset group B streptococcal sepsis. By contrast, the use of acute tocolysis has been disparaged by many. Well-designed trials of adequate size with appropriate outcome measures of acute tocolysis to prevent preterm birth in women with symptoms of preterm labor would be helpful and reasonable. We would call attention, in particular, to the likely benefits of a focus on beta-mimetics, calcium channel blockers, and magnesium sulfate.
Review of our evidence tables highlights not only the numerous agents used for tocolysis, but also the wide range of dosages prescribed. Clinicians and patients could potentially benefit from pharmacologic studies to assess optimal dosing and duration of therapy. Overdosing could lead to excessive harms from tocolysis, whereas underdosing could result in an inability to optimize outcomes. The same arguments and limitations apply to the duration of tocolytic dosing. Although we did not find evidence to support the use of so-called maintenance tocolysis once the acute preterm labor phase is over, the optimal duration for acute tocolysis is unknown.
Our report emphasizes the small, potential benefits of acute tocolysis-in terms of pregnancy prolongation-in the face of substantial potential maternal harms. This marginal benefits-to-risk ratio argues for continued development and testing of new tocolytic drugs. Clinicians and patients would benefit from agents that are selective for uterine smooth muscle quiescence. The oxytocin-agonist atosiban is an example of a novel tocolytic drug that met utero-selective criteria. Its failure to improve perinatal outcome kept the FDA from approving it as a tocolytic, but many authorities predict approval within the European community because of its ability to prolong pregnancy and lack of side effects. Should the European experience with atosiban be favorable, perhaps approval of this agent as a tocolytic should be reconsidered in the United States. Moreover, other oxytocin antagonists and novel classes of compounds with the ability to quiet the uterus should be studied and tested.
The studies we reviewed provided virtually no information on the quality of the mother's life while receiving treatment-other than in relation to physical harm. Data on lost wages or lost days of work would support cost-benefit analyses of treatment effectiveness. In addition, information on the psychological impacts of treatment and the treatment process could also provide valuable insights.
The best type of clinician and the optimal care setting of a woman with preterm labor remains to be answered. Given the ubiquitous nature of these symptoms and the inability to adequately assess risks, clinicians and institutions caring for these women need to be able to triage these patients. However, whether this needs to be at a tertiary perinatal center or a lower level of care is a topic for future study.
If the optimal class of drug, dose, and duration of tocolytic therapy is obfuscated, the evidence to guide clinicians' use of antibiotics is even more obscure. Given the array of microbicidal and static agents in the full evidence tables, we cannot delineate a rational approach to the selection and duration of such therapies. At present, it seems prudent in light of the marginal benefits of antibiotics to follow protocols for prophylaxis against early-onset Group B streptococcal sepsis with the expectation that such interventions will not only accomplish that goal, but also prolong pregnancy. Investigators should focus their research on determining the optimal agent, dose, duration, and possible combinations for antibiotics in women presenting with preterm labor.
Our review casts doubts on the wisdom of future research in three areas. The first is maintenance tocolysis, where we saw from both the synthesis of the literature and the meta-analyses that tocolytics used in a maintenance modality (as contrasted with first-line interventions) offered no demonstrable benefit in preventing preterm birth. Second, it seems clear that no further work need ever be done on ethanol as either a first-line or maintenance tocolytic. The third area is home uterine activity monitoring, where we were unable to find even indirect evidence that using this device alone is useful in preventing preterm birth.
This study was supported by Contract No. 290-97-0011 from the Agency for Healthcare Research and Quality (AHRQ). We acknowledge the assistance of Jacqueline Besteman, J.D., M.A., the AHRQ Task Order Officer for the Evidence-based Practice Center Program, and Ernestine Murray, R.N., M.A.S., the AHRQ Task Order Officer for this project. The investigators appreciate the considerable help of the data abstractors including clinicians: Alexander Allaire, M.D., Jennifer Bailit M.D., Muge Calikoglu M.D., Tara Gustillo-Ahby, M.D., Martin Fielder, M.D., Gil Reid, M.D., and Paul Whiteacre, M.D. and methodologists: Tracy Bouchard-Cyr, M.P.H., Kathryn Anderson Clark, M.S., Eric Finch, B.A., and Erdal Tekin, B.A. We also thank Patricia Ann Payne, B.S.N., C.N.M., M.P.H., and Rukmini Bagchee, M.S. for their work in developing the analysis of tocolytic harms. We thank the following individuals from the University of North Carolina at Chapel Hill: Gordon DeFriese, Ph.D., Co-Director of the RTI-UNC Evidence-based Practice Center and Lynn Whitener, Dr.P.H., M.S.L.S. We also thank the following individuals from the Research Triangle Institute: Suzanne L. West, Ph.D., M.P.H., and Linda J. Lux, M.P.A. for their valuable insights into the methods and substance of this evidence report, Linda Fonville for her outstanding secretarial support, and Nicole Walker and Richard Strowd, J.D. for exceptional contracting support.
We gratefully acknowledge the substantial involvement of and assistance from the Technical Expert Advisory Group (TEAG). TEAG members are listed at the end of this section. The TEAG was meant in part to contribute to (a) advancing AHRQ's broader goals of creating and maintaining "science partnerships" and "public-private partnerships" and (b) meeting the needs of a broad array of potential customers and users of its products. Thus, it was both a substantive resource and a sounding board throughout the study, and it is the body from whom "expert inputs" were formally sought at several points through the project.
We constituted our TEAG from three types of technical experts and other partners. These types are (1) technical/clinical experts; (2) patients or representatives of organizations whose mission concerns the interests and perspectives of patients and consumers; and (3) potential users of the final evidence report or other materials, including explicitly a representative of the organization that nominated the topic (in this case, the American College of Obstetrics and Gynecology. All in all, we had six clinical/technical experts, one individual representing patient populations, and one individual representing potential user groups, for a total of eight on the TEAG.
The final decision about TEAG membership was based on candidates' availability for scheduled conference calls and other input, willingness to review materials and provide advice and assistance within a short turnaround time, and approval by the AHRQ Task Order Officer.
The RTI-UNC Center team solicited the views of TEAG members from the start of the project. Among other issues, TEAG members provided insights and reactions to key clinical questions, input to the literature review process by ensuring that we included all known published research meeting our inclusion criteria, review of our data extraction forms and view of our approach to meta-analysis. TEAG members have also provided valuable input concerning problems of focusing a literature search on treatments, and specifying appropriate outcomes, for a clinical topic as difficult to conceptualize as preterm labor.
In keeping with AHRQ's standards for employing a multidisciplinary approach to the development of evidence reports, we called on our TEAG for inputs at two key points during this task. First, the group was asked to comment on the literature synthesis and to give us feedback on our overall plans at that stage of the analysis, which included approaches to developing evidence tables and to summarizing information about patient outcomes associated with the treatment options being studied for the management of preterm labor.
| TEAG Members | |
|---|---|
| Haywood Brown, MD Department of Obstetrics and Gynecology St. Vincents Hospital (FLC) | Carol Sakala, PhD, MSPH Center for Applied Ethics and Professional Practice Education Development Center, Inc. |
| Jan French, CNM Denver Health Medical Center | Susan Smarr, MD Department of Obstetrics and Gynecology Permanente Medical Group, Inc. |
| Jay Iams, MD Ohio State University Dept of OB/GYN | Barbara Yawn, MD MS MSPH Department of Research and Education Olmstead Medical Center |
| Dwight Rouse, MD University of Alabama, Birmingham Dept OB/GYN | Stanley Zinberg, MD, MS American College of Obstetricians and Gynecologists |
An important first step in the identification of potential peer reviewers was to determine the appropriate constituencies from which our reviewers should be approached. The categories we finally settled on, and the number of reviewers asked to participate in this effort, are shown in Table A. Although the categories are fairly self-explanatory, we note the following details for clarification. Individual experts engaged in related research (as contrasted with medical practice per se) were included in Category I (clinical practice and health care delivery) because they are based in health care delivery organizations and, we judged, likely to be involved to some extent in patient care. We assigned organizations that appear to be engaged chiefly in information dissemination (clearinghouses and the like) to Category IV (quality-of-care and consumer groups) on the grounds that much quality of care activity today assumes some effort to provide consumer information. We believe that these four categories represent the full range of health care experts, users, and patient groups that should be involved in reviewing this particular evidence report on the management of preterm labor; the specific peer reviewers are listed at the end of this appendix.
We selected 19 organizations or independent peer reviewers from the four categories noted in Table A. The individuals included the eight members of the TEAG because they played a major role throughout the project in conceptualizing the work and reviewing materials; moreover, as active professionals in the field, the RTI/UNC EPC believed that their comments at this stage would be very valuable. The remainder of the peer reviewer group were identified by issuing an invitation to the organization's executive officer/director (e.g., President, CEO) or to a public sector agency head asking them to nominate a peer reviewer or by soliciting nominations from the TEAG. A preliminary (and longer) list of organizations, agencies, or individuals was submitted to the AHRQ Task Order Officer for this project for review, comment, and approval. We then contacted all potential peer reviewers to determine their willingness to serve as peer reviewers, alerting them to the fact that this service would require them to prepare formal written reviews according to the checklist developed for this evidence report. Their comments and suggestions formed the basis of our revisions to the evidence report.
The peer reviewers who were selected for this Evidence Report are predominantly involved in clinical practice or research, professional associations, and public sector quality of care organizations.
| Category | Number of Peer Reviewers |
|---|---|
| I. Clinical Practice Experts and Health Care Delivery Organizations | 3 |
| II. Professional Associations/Guidelines Developers/Other Users of Evidence Report | 5 |
| III. Public Sector/ Quasi Public and Regulatory Agencies | 1 |
| IV. Quality of Care and Consumer Groups | 2 |
| Gina Burns President Group B Strep Association | Jan M. Kriebs, CNM MSN FACNM American College of Nurse Midwives |
| Nancy Cuddihy Health Care Consultant | Brian Mercer, MD Society for Maternal Fetal Medicine University of Tennessee at Memphis |
| Calvin Hobel, MD Cedars Sinai Medical Center Deptarment of OB/GYN | Michael O'Shea, MD Wake Forest University School of Medicine Department of Pediatrics |
| William Heuston, MD American Association of Family Practice | Kathleen Rice Simpson, PhD, RNC, FAAN Association of Women's Health, Obstetric and Neonatal Nurses |
| Ann Koontz, DrPH, CNM Maternal and Child Health Bureau Health Resources and Services Administration | Pat Venus Center for Health Care Policy and Evaluation United Health Group |
Initials of Reviewer __ __ __ Date__ __/__ __ (Mo/Day) Unique Article Identifier __ __ __
(If any response is "No" then STOP. If any response is "Can't Tell," bring to the attention of the Research Coordinator.
| Preterm labor, preterm birth or time to delivery an outcome? |
Yes |
No |
Can't Tell |
| 40 or more subjects at completion? |
Yes |
No |
Can't Tell |
| Population of women with signs/symptoms of PTL? |
Yes |
No |
Can't Tell |
(Note that numeric values in many instances need to be preceded by <, > , < or >.)
1. Main Objectives (described by author, typically in the
introduction).
_____________________________________________________________________________
_____________________________________________________________________________
2. Study Design. (Record as stated by the author. Typically found in the abstract, end of the introduction or beginning of the methods section. If you think the author has misstated the study design, record the discrepancy in the comments section.)
Diagnostic; |
Randomized controlled trial |
Non-randomized controlled trial |
Prospective cohort study |
Retrospective cohort study |
Case-control study |
Cross-over study |
Case series |
Cross-sectional survey |
Cost-benefit/cost-effectiveness study |
Other (describe in space to right)
_________________________________ |
3. Subanalysis. Are there any subgroup or stratified analyses in this article?
Yes
No |
4. Country in which study was conducted:
______________________________
STUDY
POPULATION
5. Inclusion Criteria
Gestational age: __________ to __________ completed
weeks |
Intact membranes |
Fetal weight estimate: __________ |
Maternal age: __________ (lower limit)
__________ (upper limit) |
Singleton gestation |
Multiple gestation |
Signs and symptoms of active labor |
High risk of preterm labor/delivery |
| If yes, how was "high risk" defined? _____________________________________________________ |
| _________________________________________________________________________________ |
Other: ______________________________ |
6. Exclusion Criteria
Abnormal fetus |
Maternal age __________ (upper limit) |
Maternal age __________ (lower limit) | |
Antibiotic therapy, current or recent |
Maternal fever/Pyrexia: T _____ |
Cerclage/Need for cerclage |
Maternal White Blood Cells (WBC) _____ |
Chorioamnionitis |
Multiple gestation |
Chronic disease |
No
prior prenatal care |
Current bleeding/hemorrhage |
Placenta previa |
Current illegal substance abuse |
Rupture of membranes |
Dilation __________cm |
Singleton gestation |
Fetal distress |
Uterine anomaly |
Failure to give informed consent |
Other maternal/fetal indication for delivery |
Group B Strep (GBS) Positive |
Other Positive Cultures ( ______________ ) |
Hypertensive disorders (incl. Pre-eclampsia) |
Signs/Symptoms of labor |
High risk of preterm delivery | |
If
yes, how was "high risk" defined?
________________________________________________ | |
| _____________________________________________________________________________ | |
Other:
________________________________________________________________________ | |
Other:
________________________________________________________________________ | |
Other:
________________________________________________________________________ | |
Other:
________________________________________________________________________ | |
7. Gestational age assignment. (Circle either and/or as appropriate.)
Not determined/not measured/can't tell |
Last Menstrual Period (LMP) |
And/or clinical sizing, including serial fundal
heights |
And/or ultrasound When was ultrasound conducted?
_________________________________ |
8. Population Size:
Screened __________
Cannot Determine |
Eligible __________
Cannot Determine |
Enrolled __________
Cannot Determine |
Completed __________
Cannot Determine |
Included in analysis __________
Cannot Determine |
8a. Reasons for exclusion from analysis:
Loss to follow-up (n= ) |
Other ____________________ (n= ) |
Other ____________________ (n= ) |
Other ____________________ (n= ) |
Other ____________________ (n= ) |
9. MASKING LEVEL
| Were treating physicians masked to the results of the diagnostic tests? | Yes | No | Can't Tell | |
| Were patients masked to the results of the diagnostic tests? | Yes | No | Can't Tell |
10. What screening test or tests was/were used?
10a. How are the test result groups compared in the analysis defined (e.g., positive fFN test; positive fFN test + some cervical length)?
| Group A: _____________________________________________________________________ |
| Group B: _____________________________________________________________________ |
| Group C: _____________________________________________________________________ |
| Group D: _____________________________________________________________________ |
11. Diagnostic testing
11a. Diagnostic Test 1
When initiated:
Frequency Performed:
Did the result of
the test influence the frequency with which it was subsequently performed?
Yes
No
Can't Tell
Diagnostic Test 2
When initiated:
Frequency Performed:
Did the result of
the test influence the frequency with which it was subsequently performed?
Yes
No
Can't Tell
12. If cervical length was measured, how was it measured?
13. Was wedging or funneling or breaking considered?
14. How were they defined?
Wedging:
________________________________________________
Funneling: _______________________________________________
Breaking:
________________________________________________
15. Non-Pharmaceutical Co-Interventions.
15a. Under what circumstances were non-pharmaceutical co-interventions administered?
15b. Which ones were used?
Bedrest |
Routine Observation |
Fetal
Monitoring |
Uterine Monitoring |
IV
Fluids | |
Other:______________________________________________________________________________ | |
15c. Were non-pharmaceutical co-interventions administered in both the test
positive and test negative group?
Yes
No
If
not, in which group were they administered?
___________________________________
16. Pharmaceutical Co-Interventions.
16a. Under what circumstances were co-interventions administered?
16b. What pharmaceutical co-interventions were administered?
tocolysis
specify drug _______________
antibiotics
specify drug _______________
other
_______________________
16c. Were pharmaceutical co-interventions administered in the same fashion in
the test positive and test negative groups?
Yes
No
17. What evaluations other than the test being studied (i.e., fFN or E3) were
performed?
a. Periodic pelvic examination _____ frequency
specify by speculum or digital cervical evaluation
__________________
b. Periodic ultrasound _____ frequency
c. Cultures for infection
d. HUAM
_____ frequency
e. Others _____________ _____ frequency
_____________ _____ frequency
18. Participant Characteristics. (Record as number and/or percent as presented in article. Follow percent with %.)
| Group A | Group B | Group C | Group D | Total | |
|---|---|---|---|---|---|
| Age (±SD) | |||||
| Race: | |||||
| White | |||||
| Black | |||||
| Hispanic | |||||
| Other | |||||
| High School Grads | |||||
| Married | |||||
| Gravidity | |||||
| Parity |
19. OUTCOME MEASURES.
19a. Definition of Preterm Labor. (Record each definition
separately. Provide the letters corresponding to each symptom. Fill in any
necessary data describing a particular symptom.)
Can't tell
Def. 1:
____________________
Def. 2: ____________________
Def. 3: ____________________
a.
Contractions: __________/__________
b. History of
contractions
c. Dilation: __________ cm
d. Change in dilation from baseline
e.
Effacement: __________%
f. Cervical change
g. Length of cervix: __________ cm
h.
Contractions unresolved with: __________
i. Other:
______________________________
j. Other:
______________________________
19b. Rates of outcomes
Specify mean or median.
Numeric values may be preceded by <, >, <, or >.
Record as number and/or percent as presented in article. Follow percent with
"%". In the last column, if a difference between the groups is statistically
significant, record the relevant p value. If it is not significant, state,
"n.s." If statistical adjustments were performed (i.e., stratified analyses
or multivariate modeling) report the adjusted results, the strata, and any
variable used for adjustment in #.
| Outcome | Group A (N = ) | Group B (N = ) | Group C (N = ) | Group D (N= ) | Sig. Level |
|---|---|---|---|---|---|
| DELIVERY | |||||
| Mean/Median Est. Gestational Age (EGA) at Delivery (SD) | |||||
| Time to delivery | |||||
| Occurrence of Contractions | |||||
| PTL Only Group | |||||
| Delivered at: __________ weeks | |||||
| Delivered at: __________ weeks | |||||
| Delivered at: __________ weeks | |||||
| Delivered at: __________ weeks | |||||
| Combined PPROM + PTL | |||||
| Delivered at: __________ weeks | |||||
| Delivered at: __________ weeks | |||||
| Delivered at: __________ weeks | |||||
| Delivered at: __________ weeks | |||||
| Delivery at Term | |||||
| MATERNAL OUTCOMES | |||||
| Maternal Length of Stay (SD) | |||||
| Chorioamnionitis | |||||
| Endometritis/Post-partum febrile morbidity | |||||
| INFANT OUTCOMES | |||||
| Mean/Median Birthweight (SD) | |||||
| _____ gms | |||||
| _____ gms | |||||
| _____ gms | |||||
| Apgar ______ at ______ min | |||||
| Apgar ______ at ______ min | |||||
| Mean/Neonatal Hospital Days | |||||
| Mean/Median Neonatal Deaths | |||||
| Fetal Mortality | |||||
| Neonatal Respiratory Distress Syndrome (NRDS), Hyaline Membrane Disease (HMD) | |||||
| Bronchio Pulmonary Dysplasia (BPD) | |||||
| Intraventricular Hemorrhage | |||||
| Neonatal Sepsis | |||||
| Necrotizing Enterocolitis | |||||
| Neonatal Morbidity | |||||
| Pneumonia | |||||
| Neonatal Infection | |||||
| OTHER | |||||
20. Sensitvity/Specificity Analysis for Test ____________ for
Outcome: _________________________
NOTE: You may use as
many of these pages as is necessary to capture the information provided by
test/outcome. The table below is designed to capture rates associated with a
given test and outcome by the groups into which participants were divided
for analysis (i.e., those defined in question 11). If results are provided
for different outcomes, use a different page for each outcome.
| Overall | Group A: ( ) Numeric Value CI | Group B: ( ) Numeric Value CI | Group C: ( ) Numeric Value CI | Group D: ( ) Numeric Value CI | |
|---|---|---|---|---|---|
| Sensitivity | |||||
| Specificity | |||||
| Positive Predictive Value | |||||
| Negative Predictive Value | |||||
| Likelihood Ratio (positive/negative) | |||||
| Odds Ratio or Relative Risk |
20b. Were these results based on one-time testing or repeated testing?
20c. Numbers provided for sensitivity/specificity calculations:
Overall:
| Had Outcome | Did not have outcome | |
| Test positive | ||
| Test negative |
Group A:
| Had Outcome | Did not have outcome | |
| Test positive | ||
| Test negative |
Group B:
| Had Outcome | Did not have outcome | |
| Test positive | ||
| Test negative |
Group C:
| Had Outcome | Did not have outcome | |
| Test positive | ||
| Test negative |
Group D:
| Had Outcome | Did not have outcome | |
| Test positive | ||
| Test negative |
21. Were ROC curves presented?
Yes
No If so, on what page? _____
22. Describe any multivariate analysis performed. Include statistical technique, model description, and result highlights.
23. Was cost data collected?
Yes
No
24. Was information on harms collected?
Yes
No
25. Describe any additional analyses and/or subgroup analyses:
26. Limitations. (Distinguish between limitations stated by the
author and those found by the reviewer.)
a) State
limitations noted by the author.
b) State any other
limitations you found in the article.
27. Conclusions:
Test effective at predicting the following outcome:
_________________________________________
_________________________________________________________________________________
Test effective at predicting relevant outcomes in
the following subgroup(s)
_________________________________________________________________________________
_________________________________________________________________________________
28. Comments. (Provide any additional information that you believe needs to be captured concerning this study.)
Initials of Reviewer __ __ __ Date __ __/__ __ Unique Article Identifier __ __ __
(If any response is "No" then STOP. If any response is "Can't Tell,"
bring to the attention of the Research Coordinator.
| Preterm birth an outcome |
Yes |
No |
Can't Tell |
| 40 or more subjects at completion? |
Yes |
No |
Can't Tell |
| Intervention used as part of treating subjects with signs/symptoms of preterm labor (not a preventive intervention used among asymptomatic patients) |
Yes |
No |
Can't Tell |
(Note that numeric values in many instances need to be preceded by <, >, < or >.)
1. Main Objectives (described by author, typically in the
introduction).
_____________________________________________________________________________
_____________________________________________________________________________
_____________________________________________________________________________
2. Study Design. (Record as stated by the author. Typically found in the abstract, end of the introduction or beginning of the methods section. If you think the author has misstated the study design, record the discrepancy in the comments section.)
Randomized controlled trial |
Non-randomized controlled trial |
Prospective cohort study |
Retrospective cohort study |
Case-control study |
Cross-over study |
Case series |
Cross-sectional survey |
Cost-benefit/cost-effectiveness study |
Other (describe in space to right)
_________________________________ |
3. PPROM. Are women with preterm premature rupture of
membrane included in this analysis as an entry criteria?
Yes No
4. Subanalysis. Are there any subgroup analyses in this
article?
Yes No
5. Country in which study was conducted: ______________________________
6. Definition of Preterm Labor. (Record each definition
separately. Provide the letters corresponding to each symptom. Fill in
any necessary data describing a particular symptom.)
Can't tell
Def. 1: ____________________
Def. 2: ____________________
Def.
3: ____________________
c. Bio-marker(s) used (list:
______________________________)
d. Contractions:
__________/__________
e. History of contractions
f. Dilation: __________ cm
g. Change
in dilation from baseline
h. Effacement: __________%
i. Cervical change
j. Length of
cervix: __________ cm
k. Contractions unresolved with:
__________
l. Other: ______________________________
m. Other: ______________________________
7. Inclusion Criteria (Record criteria other than was stated
within definition of preterm labor)
Gestational age: __________ to __________ completed weeks
Intact membranes
Fetal weight estimate: __________
Maternal age: __________ (lower limit)
__________ (upper limit)
Singleton gestation
Other: ______________________________
Other: ______________________________
8. Exclusion Criteria
Abnormal fetus |
Maternal age __________ (upper limit) |
Allergy to protocol drug |
Maternal age __________ (lower limit) |
Antibiotic therapy, current or recent |
Maternal fever/Pyrexia: T _____ |
Cerclage/Need for cerclage |
Maternal White Blood Cells (WBC) _____ |
Chorioamnionitis |
Multiple gestation |
Chronic disease |
No
prior prenatal care |
Current bleeding/hemorrhage |
Placenta previa |
Current illegal substance abuse |
Rupture of membranes |
Dilation __________cm |
Singleton gestation |
Fetal distress |
Uterine anomaly |
Failure to give informed consent |
Other maternal/fetal indication for delivery |
Group B Strep (GBS) Positive |
Other Positive Cultures ( ______________ ) |
Hypertensive disorders (incl. Pre-eclampsia) | |
Other:
________________________________________________________________________ | |
Other:
________________________________________________________________________ | |
Other:
________________________________________________________________________ | |
Other:
________________________________________________________________________ | |
9. Gestational age assignment. (Circle either and/or as
appropriate.)
Not determined/not measured/can't tell
Last Menstrual Period (LMP)
And/or clinical sizing, including serial fundal heights
And/or ultrasound When was ultrasound conducted?
_________________________________
10. Population Size:
Screened __________
Cannot Determine
Eligible __________
Cannot Determine
Enrolled __________
Cannot Determine
Completed __________
Cannot Determine
11. MASKING LEVEL
| Subject masked to treatment assignment |
Yes |
No |
Can't Tell | |
| Treating physician(s) masked to treatment assignment |
Yes |
No |
Can't Tell | |
| Other direct care providers masked to treatment assignment |
Yes |
No |
Can't Tell |
N/A |
| Pharmacist masked to treatment assignment |
Yes |
No |
Can't Tell |
N/A |
| Pathologist masked to treatment assignment |
Yes |
No |
Can't Tell |
N/A |
12. Costs and cost effectiveness. Is any cost information
included in this study?
Yes
No
13. Participant Characteristics. (Record as number and/or
percent as presented in article. Follow percent with %.)
| Treatment Group A | Treatment Group B | Comparison Group | Total | |
|---|---|---|---|---|
| Age (±SD) | ||||
| Race: | ||||
| White | ||||
| Black | ||||
| Hispanic | ||||
| Other | ||||
| High School Grads | ||||
| Married |
14. Dominant Mechanisms(s) of Loss. (Provide the reason for
discontinuation and if available, the number of patients).
None lost
Can't tell
Loss to follow-up __________
Side effects of treatment __________
Other:
_________________________________________
Other:
_________________________________________
Other:
_________________________________________
15. Timing of intervention:
Within _____ hrs of evaluation/admission
Other: ____________________________
Cannot tell
16. Intervention Compliance Assessment. (State what
action(s) was taken to ensure that subject completed treatment.)
None reported
Mechanism: ______________________________
17. Intention to treat analysis
provided?
Yes, list exclusions below and if available,
number of subjects.
No
Developed allergy or did not tolerate
intervention drug: __________
Positive cultures - urine: __________
Positive cultures - group B strep: __________
Positive cultures - amniotic fluid: __________
Positive cultures - other cervico-vaginal:
__________
Did not comply with protocol for use of
intervention drug: __________
Did not comply with follow-up: __________
Other:
_____________________________________________
Other:
_____________________________________________
Other:
_____________________________________________
Were exclusions from analysis after
Treatment Assignment?
Yes
No
Can't Tell
18. Care Setting(s). List all settings where treatment was
provided.
Hospital
Perinatal center
Home
Other: _______________________________
Other: _______________________________
19. Were harms or side effects of treatment included in the
study?
Yes,
_____ page # (s)
No
20. Is successful tocolysis defined?
Yes
No
If yes, define:
__________________________________________________________________________
21. Definition of Treatment and Comparison
Groups. How are the groups defined?
Treatment Group A:
_____________________________________________________________________
Treatment Group B:
_____________________________________________________________________
Comparison Group:
_____________________________________________________________________
22. TOCOLYTIC DRUG REGIMEN BY GROUPS (If treatment includes
a separate loading and maintenance dose of the same drug, i.e., the
route and/or duration of treatment changes, list these as different
drugs.)
| Treatment Group A | Treatment Group B | Comparison Group | |
|---|---|---|---|
| Number of Subjects per group | N = | N = | N = |
| DRUG 1 | |||
| Dose | |||
| Route | |||
| Commencement of Treatment | |||
| Duration of Treatment | |||
| DRUG 2 | |||
| Dose | |||
| Route | |||
| Commencement of Treatment | |||
| Duration of Treatment | |||
| DRUG 3 | |||
| Dose | |||
| Route | |||
| Commencement of Treatment | |||
| Duration of Treatment | |||
| DRUG 4 | |||
| Dose | |||
| Route | |||
| Commencement of Treatment | |||
| Duration of Treatment |
23. Non-Pharmaceutical Co-Interventions. Did comparison
group receive co-interventions?
Yes
No
Bedrest |
Routine Observation |
Fetal Monitoring |
Uterine Monitoring |
IV
Fluids | |
Other:
______________________________________________________________________________ | |
24. CO-INTERVENTION DRUG REGIMEN BY GROUPS
| Treatment Group A | Treatment Group B | Comparison Group | |
|---|---|---|---|
| Number of Subjects per group | N = | N = | N = |
| DRUG 1 | |||
| Dose | |||
| Route | |||
| Aim of Treatment | |||
| Commencement of Treatment | |||
| Duration of Treatment | |||
| DRUG 2 | |||
| Dose | |||
| Route | |||
| Aim of Treatment | |||
| Commencement of Treatment | |||
| Duration of Treatment | |||
| DRUG 3 | |||
| Dose | |||
| Route | |||
| Aim of Treatment | |||
| Commencement of Treatment | |||
| Duration of Treatment | |||
| DRUG 4 | |||
| Dose | |||
| Route | |||
| Aim of Treatment | |||
| Commencement of Treatment | |||
| Duration of Treatment |
25a). OUTCOME MEASURES. Specify mean or median. Numeric
values may be preceded by <, >, <, or >.
Record as number and/or percent as presented in article. Follow percent
with "%". In the last column, if a difference between the groups is
statistically significant, record the relevant p value. If it is not
significant, state, "n.s." If statistical adjustments were performed
(i.e., stratified analyses or multivariate modeling) report the adjusted
results, the strata, and any variable used for adjustment in #24.
| Outcome | Intervention Group A (N = ) | Intervention Group B (N = ) | Comparison Group (N = ) | Sig. Level |
|---|---|---|---|---|
| DELIVERY | ||||
| Mean/Median Est. Gestation Age (EGA) at Delivery (SD) | ||||
| Mean/Median Prolongation (SD) Days | ||||
| PTL Only Group | ||||
| Delivered at: __________ weeks | ||||
| Delivered at: __________ weeks | ||||
| Delivered at: __________ weeks | ||||
| Delivered at: __________ weeks | ||||
| Combined PPROM + PTL | ||||
| Delivered at: __________ weeks | ||||
| Delivered at: __________ weeks | ||||
| Delivered at: __________ weeks | ||||
| Delivered at: __________ weeks | ||||
| Delivery at Term | ||||
| PROM OR PPROM Outcome | ||||
| MATERNAL OUTCOMES | ||||
| Maternal Length of Stay (SD) | ||||
| Chorioamnionitis | ||||
| Endometritis/Post-partum | ||||
| Adverse Drug Reactions | ||||
| INFANT OUTCOMES | ||||
| Mean/Median Birthweight (SD) | ||||
| _____ gms | ||||
| _____ gms | ||||
| _____ gms | ||||
| Apgar ______ at ______ min | ||||
| Apgar ______ at ______ min | ||||
| Mean/Neonatal Hospital Days | ||||
| Mean/MedianNeonatal Deaths | ||||
| Fetal Mortality | ||||
| Neonatal Respiratory Distress Syndrome (NRDS), Hyaline Membrane Disease (HMD) | ||||
| Bronchio Pulmonary Dysplasia (BPD) | ||||
| Intraventricular Hemorrhage | ||||
| Neonatal Sepsis | ||||
| Necrotizing Enterocolitis | ||||
| Neonatal Morbidity | ||||
| Pneumonia | ||||
| Neonatal Infection | ||||
| OTHER | ||||
26. Describe any additional analyses and/or subgroup analyses:
27. Limitations. (Distinguish between limitations stated by the author and those found by the reviewer.)
28. Conclusions:
Intervention ineffective in altering outcomes
detailed above.
Intervention effectively alters the following
outcomes: ______________________________________
________________________________________________________________________________
Intervention effectively alters the following
outcomes in the following subgroup(s)
________________________________________________________________________________
________________________________________________________________________________
29. Comments. (Provide any additional information that you
believe needs to be captured concerning this study.)
Unique Article
Identifier:
Study
Type:
Sample
Size:
![]() | Maternal Harms | Treatment Grp1 | Treatment Grp2 | Control Grp |
|---|---|---|---|---|
| Death | ||||
| Heart Failure | ||||
| Arrhythmia | ||||
| Abnormal glucose tolerance | ||||
| Elevated liver functions/hepatitis | ||||
| Renal failure | ||||
| Headache | ||||
| Tremor | ||||
| Hypotension | ||||
| Skin rash | ||||
| Stress, emotional distress or other mental health issues | ||||
| Discontinuations due to side effects | ||||
| Other: | ||||
| Other: | ||||
| Other | ||||
| Death | ||||
| Intraventricular hemorrhage | ||||
| Necrotizing enterocolitis | ||||
| Ductal constriction | ||||
| Bardycardia | ||||
| Cerebral Palsy | ||||
| Other neurologic effects | ||||
| Other: | ||||
| Other: | ||||
| Other: | ||||
| Other: | ||||
| * Statistical significance: if nonsignificant note NS, otherwise provide p number. | ||||
Initials of Reviewer __ __ __ Date __ __/__ __ (Mo/Day) Unique Article Identifier __ __ __
(If any response is "No" then STOP. If any response is "Can't Tell," bring to the attention of the Research Coordinator.
| Preterm birth an outcome |
Yes |
No |
Can't Tell |
| 40 or more subjects at completion? |
Yes |
No |
Can't Tell |
| Intervention used as part of managing subjects with signs/symptoms of preterm labor or who have been treated for preterm labor (not a preventive intervention used among asymptomatic patients)? |
Yes |
No |
Can't Tell |
| Study design RCT? |
Yes |
No |
Can't Tell |
(Note that numeric values in many instances need to be preceded by
<, > , < or >.)
25.
Main Objectives (described by author, typically in the
introduction).
_____________________________________________________________________________
_____________________________________________________________________________
_____________________________________________________________________________
2. Study Design. (Record as stated by the author. Typically
found in the abstract, end of the introduction or beginning of the
methods section. If you think the author has misstated the study design,
record the discrepancy in the comments section.)
Randomized controlled trial
Non-randomized controlled trial
Prospective cohort study
Retrospective cohort study
Case-control study
Cross-over study
Case series
Cross-sectional survey
Cost-benefit/cost-effectiveness study
Other (describe in space to right) _________________________________
3. Country in which study was conducted: ______________________________
4. Definition of Preterm Labor. (Record each definition
separately. Provide the letters corresponding to each symptom. Fill in
any necessary data describing a particular symptom.)
Can't tell
Def. 1: ____________________
Def. 2: ____________________
Def. 3:
____________________
n. Bio-marker(s) used (list:
______________________________)
o. Contractions:
__________/__________
p. History of contractions
q. Dilation: __________ cm
r. Change
in dilation from baseline
s. Effacement: __________%
t. Cervical change
u. Length of
cervix: __________ cm
v. Contractions unresolved with:
__________
w. Other: ______________________________
x. Other: ______________________________
5. Inclusion Criteria (Record criteria other than was stated
within definition of preterm labor)
Gestational age: __________ to __________ completed weeks
Intact membranes
Fetal weight estimate: __________
Maternal age: __________ (lower limit)
__________ (upper limit)
Singleton gestation
Signs and symptoms of preterm labor in the current pregnancy
Treatment for preterm labor in the current pregnancy
Other: ______________________________
Other: ______________________________
6. Exclusion Criteria
Abnormal fetus |
Maternal age __________ (upper limit) |
Allergy to protocol drug |
Maternal age __________ (lower limit) |
Antibiotic therapy, current or recent |
Maternal fever/Pyrexia: T _____ |
Cerclage/Need for cerclage |
Maternal White Blood Cells (WBC) _____ |
Chorioamnionitis |
Multiple gestation |
Chronic disease |
No
prior prenatal care |
Current bleeding/hemorrhage Placenta previa |
Current illegal substance abuse Rupture of
membranes |
Dilation __________cm |
Singleton gestation |
Fetal distress |
Uterine anomaly |
Failure to give informed consent |
Other maternal/fetal indication for delivery |
Group B Strep (GBS) |
Positive Other Positive Cultures ( ______________
) |
Hypertensive disorders (incl. Pre-eclampsia) | |
Other:
________________________________________________________________________ | |
Other:
________________________________________________________________________ | |
Other:
________________________________________________________________________ | |
Other:
________________________________________________________________________ | |
7. Gestational age assignment. (Circle either and/or as
appropriate.)
Not determined/not measured/can't tell
Last Menstrual Period (LMP)
And/or clinical sizing, including serial fundal heights
And/or ultrasound When was ultrasound conducted?
_________________________________
8. Population Size:
Screened__________
Cannot Determine
Eligible__________
Cannot Determine
Enrolled__________
Cannot Determine
Completed__________
Cannot Determine
9. MASKING LEVEL
| Subject masked to intervention assignment |
Yes |
No |
Can't Tell | |
| Treating physician(s) masked to intervention assignment |
Yes |
No |
Can't Tell | |
| Other direct care providers masked to intervention assignment |
Yes |
No |
Can't Tell |
N/A |
| Pharmacist masked to intervention assignment |
Yes |
No |
Can't Tell |
N/A |
| Pathologist masked to intervention assignment |
Yes |
No |
Can't Tell |
N/A |
10. Definition of Treatment and Comparison Groups. How are
the groups defined?
Group A:
_____________________________________________________________________
Group B:
_____________________________________________________________________
Group C:
_____________________________________________________________________
11. Participant Characteristics. (Record as number and/or
percent as presented in article. Follow percent with %.)
| Group A (N= ) | Group B (N= ) | Group C (N= ) | Total | |
|---|---|---|---|---|
| Age (±SD) | ||||
| Race: | ||||
| White | ||||
| Black | ||||
| Hispanic | ||||
| Other | ||||
| PTL in a previous pregnancy | ||||
| Previous PTD | ||||
| Gravidity | ||||
| Parity | ||||
| High School Grads | ||||
| Married |
12. Dominant Mechanisms(s) of Loss. (Provide the reason for
discontinuation and if available, the number of patients).
None lost
Can't tell
Loss to follow-up __________
Side effects of treatment __________
Other:
_________________________________________
Other:
_________________________________________
Other:
_________________________________________
13. Intervention Compliance Assessment. (State what
action(s) was taken to ensure that subject completed treatment.)
None reported
Mechanism: ______________________________
14. Intention to treat analysis
provided?
Yes, list exclusions below and if available,
number of subjects.
No
Positive cultures - urine: __________
Positive cultures - group B strep: __________
Positive cultures - amniotic fluid: __________
Positive cultures - other cervico-vaginal:
__________
Did not comply with protocol for use of
intervention: __________
Did not comply with follow-up: __________
Other:
_____________________________________________
Other:
_____________________________________________
Other:
_____________________________________________
Were exclusions from analysis after
Treatment Assignment?
Yes
No
Can't Tell
15. Non-Pharmaceutical Co-Interventions.
15a. Under what circumstances were non-pharmaceutical co-interventions (other than HUAM) administered?
15b. Which ones were used?
Bedrest
Routine Observation
Fetal Monitoring
IV Fluids
Other:______________________________________________________________________________
15c. Were non-pharmaceutical co-interventions administered in both/all
groups?
Yes
No
If not, in which group(s) were they administered? ___________________________________
Under what circumstances were they administered?_____________________________________
16. Pharmaceutical Co-Interventions.
16a. Under what circumstances were co-interventions administered?
16b. What pharmaceutical co-interventions were administered?
tocolysis
specify drug
_______________
antibiotics
specify
drug________________
other ________________________
16c. Were pharmaceutical co-interventions administered in the same
fashion in both/all groups?
Yes
No
If not, describe the
differences in
administration.________________________________________________
18a). OUTCOME MEASURES. Specify mean or median. Numeric
values may be preceded by <, >, <, or >.
Record as number and/or percent as presented in article. Follow percent
with "%". In the last column, if a difference between the groups is
statistically significant, record the relevant p value. If it is not
significant, state, "n.s." If statistical adjustments were performed
(i.e., stratified analyses or multivariate modeling) report the adjusted
results, the strata, and any variable used for adjustment)
| Outcome | Group A (N = ) | Group B (N = ) | Group C (N = ) | Sig. Level |
|---|---|---|---|---|
| DELIVERY | ||||
| Mean/Median Est. Gestation Age (EGA) at Delivery (SD) | ||||
| Mean/Median Prolongation (SD) Days | ||||
| PTL Only Group | ||||
| Delivered at: __________ weeks | ||||
| Delivered at: __________ weeks | ||||
| Delivered at: __________ weeks | ||||
| Delivered at: __________ weeks | ||||
| MATERNAL OUTCOMES | ||||
| Hospital admissions | ||||
| Any non-delivery admissions | ||||
| Chorioamnionitis | ||||
| Endometritis/Post-partum | ||||
| Adverse Drug Reactions | ||||
| INFANT OUTCOMES | ||||
| Mean/Median Birthweight (SD) | ||||
| _____ gms | ||||
| _____ gms | ||||
| _____ gms | ||||
| Apgar ______ at ______ min | ||||
| Apgar ______ at ______ min | ||||
| Mean/Neonatal Hospital Days | ||||
| Mean/MedianNeonatal Deaths | ||||
| Fetal Mortality | ||||
| Neonatal Respiratory Distress Syndrome (NRDS), Hyaline Membrane Disease (HMD) | ||||
| Bronchio Pulmonary Dysplasia (BPD) | ||||
| Intraventricular Hemorrhage | ||||
| Neonatal Sepsis | ||||
| Necrotizing Enterocolitis | ||||
| Neonatal Morbidity | ||||
| Pneumonia | ||||
| Neonatal Infection | ||||
| OTHER | ||||
19. Describe any multivariate analysis performed. Include statistical
technique, model description, and result highlights.
:
20. Describe any additional analyses and/or subgroup analyses:
21. Was cost data collected?
Yes
No
22. Limitations. (Distinguish between limitations stated by
the author and those found by the reviewer.)
a) State
limitations noted by the author.
b) State any other
limitations you found in the article.
23. Conclusions:
Intervention ineffective in altering outcomes
detailed above.
Intervention effectively alters the following
outcomes: ______________________________________
________________________________________________________________________________
Intervention effectively alters the following
outcomes in the following subgroup(s)
________________________________________________________________________________
________________________________________________________________________________
24. Comments. (Provide any additional information that you believe needs to be captured concerning this study.)
ACOG: American College of Obstetricians and Gynecologists
add.: additional
AHRQ: Agency for Healthcare Research and Quality
AoT: aim of treatment
APGAR: Activity, Pulse, Grimace, Appearance, Respiration
assoc: association
BP: blood pressure
bpm: beats per minute
BV: bacterial vaginosis
C: clinician reviewer
cAMP: cyclic adenosine monophosphate
CI: confidence interval
corr: correlation, correlated with
CoT: commencement of treatment
CRH: corticotropin-releasing hormone
Cx: control group
dk: don't know, not stated in article
DoT: duration of treatment
E3: salivary estriol
ECG: electrocardiogram
EGA: estimated gestational age
EVUSD: endovaginal ultrasound
FDA: Food and Drug Administration
fFN: fetal fibronectin
HEED: Health Economic Evaluations Database
HMD: hyaline memebrane disease
HUAM: home uterine activity monitoring
IUGR: intrauterine growth retardation
IV: intravenous
IVH: intraventricular hemorrhage
M: methods reviewer
mcg, µg: microgram
MeSH: Medical Subject Headings
NEC: necrotising enterocoitis
NICHHD: National Institute of Child Health and Human Development
NICU: neonatal intensive care unit
NLM: National Library of Medicine
NPV: negative predictive value
NRDS: neonatal respiratory distress syndrome
NS: not significant
NSAID: nonsteroidal anti-inflammatory disease drugs
OR: odds ratio
path: pathologist
pd: time period
PDA: patent ductus arteriosus
pharm: pharmacist
PPV: positive predictive value
PROM: premature rupture of membrane
prov: providers
PTL: preterm labor
PVC: premature vehicular contraction
Q: quartile
RCT: randomized controlled trial
Rh: blood group type
ROC: receiver-operator characteristic curve
RR: risk ratio
RTI-UNC EPC: Research Triangle Institute-University of North Carolina Evidence-based Practice Center
SD: standard deviation
SEM: standard error of the mean
sens: sensitivity
SGA: small for gestational age
sig: significant
spec: s1pecificity
SVT: suproventricular tachycardia
TEAG: Technical Expert Advisory Group
TVUSD: transvaginal ultrasound
tx: treatment group
UTI: urinary tract infection
wk(s): week(s)
wkly: weekly
Free Full text in PMC] [PubMed]
Free Full text in PMC]
[PubMed]