Figure 1. Conceptual framework for routine use of episiotomy in obstetric care
The Agency for Healthcare Research and Quality (AHRQ), through its Evidence-Based Practice Centers (EPCs), sponsors the development of evidence reports and technology assessments to assist public- and private-sector organizations in their efforts to improve the quality of health care in the United States. The reports and assessments provide organizations with comprehensive, science-based information on common, costly medical conditions and new health care technologies. The EPCs systematically review the relevant scientific literature on topics assigned to them by AHRQ and conduct additional analyses when appropriate prior to developing their reports and assessments.
To bring the broadest range of experts into the development of evidence reports and health technology assessments, AHRQ encourages the EPCs to form partnerships and enter into collaborations with other medical and research organizations. The EPCs work with these partner organizations to ensure that the evidence reports and technology assessments they produce will become building blocks for health care quality improvement projects throughout the Nation. The reports undergo peer review prior to their release.
AHRQ expects that the EPC evidence reports and technology assessments will inform individual health plans, providers, and purchasers as well as the health care system as a whole by providing important information to help improve health care quality.
We welcome comments on this evidence report. They may be sent by mail to the Task Order Officer named below at: Agency for Healthcare Research and Quality, 540 Gaither Road, Rockville, MD 20850, or by email to epc@ahrq.gov.
Carolyn M. Clancy, M.D.
Director
Agency for Healthcare Research and Quality
Jean Slutsky, P.A., M.S.P.H.
Director, Center for Outcomes and Evidence
Agency for Healthcare Research and Quality
Kenneth S. Fink, M.D., M.G.A., M.P.H.
Director, EPC Program
Agency for Healthcare Research and Quality
Marian D. James, M.A., Ph.D.
EPC Program Task Order Officer
Agency for Healthcare Research and Quality
The authors of this report are responsible for its content. Statements in the report should not be construed as endorsement by the Agency for Healthcare Research and Quality or the U.S. Department of Health and Human Services of a particular drug, device, test, treatment, or other clinical service.
Context: In the United States, use of episiotomy varies from less than 10 percent to more than 75 percent of vaginal births. Overall, 30 to 35 percent of vaginal births include episiotomy. Routine episiotomy may not yield maternal benefits traditionally ascribed to it.
Objectives: We addressed five key questions (KQs):
Does the practice of liberal or routine episiotomy, compared to more selective use of episiotomy, influence maternal postpartum outcomes?
Does episiotomy incision type (i.e., midline or mediolateral), influence maternal postpartum outcomes?
Does the repair of the perineal defect (i.e., suture type and repair approach) influence maternal postpartum outcomes?
Does episiotomy have a long-term influence on urinary incontinence, fecal incontinence, or pelvic floor defects?
Does episiotomy or incision type, or both, influence future sexual function?
Data Sources: We searched MEDLINE®, Cochrane Library, and CINAHL® and did hand-searches, and consulted with experts.
Study Selection: We excluded studies (1) not about outcomes of vaginal birth; (2) in languages other than English; (3) not pertinent to the key questions; (4) with < 40 subjects; and (5) not representing original research. KQs1, 2, and 3 were limited to randomized controlled trials. KQs4 and 5 included nonrandomized prospective cohorts.
Data Extraction: We entered data into pretested abstraction forms; did a second review for accuracy, completeness, and consistency; and graded quality of studies.
Data Synthesis: Literature searches yielded 986 articles; 659 were excluded after abstract review. Of the remaining 327, we included 45 articles.
Conclusions: Fair to good evidence suggests immediate maternal outcomes from routine episiotomy are not better than those from restrictive use; instead, outcomes are worse because some proportion of women who would have had lesser injury instead had a surgical incision. Evidence is insufficient to provide guidance on choice of midline or mediolateral episiotomy when indicated. For perineal injury requiring suturing, fair to good evidence suggests leaving superficial vaginal and perineal skin unsutured is potentially preferable. If used for skin approximation, a continuous, subcuticular repair is superior to an interrupted, transcutaneous method. Evidence is consistent and clear that absorbable suture is preferred and that polyglycolic acid suture is associated with less morbidity than gut and chromic gut suture. Evidence is insufficient to determine whether novel materials, such as tissue adhesive, offer benefits. Evidence regarding long-term sequelae is fair to poor; assessment of pelvic floor dysfunction was not conducted in the age groups of greatest relevance. Limited data show that episiotomy does not prevent fecal and urinary incontinence, pelvic floor relaxation, or impaired sexual function, within months to years from childbirth.
Episiotomy, incision of the perineum at the time of vaginal childbirth, is a common surgical procedure experienced by women in the United States.1 Based on national hospital discharge data for 1999, just over 35 percent of women who gave birth vaginally had an episiotomy performed; the figure was approximately 33 percent in 2000.2, 3 National rates reflect a steady decline over the prior two decades,1 with 2001 data suggesting that approximately 30 percent of vaginal births include episiotomy.4
Actual rates are likely to be higher because administrative data sources are prone to capture fewer events than occur. A study of the validity of birth data for Washington state in 1989 found that hospital discharge data underestimated episiotomy incidence by 44 percent overall when compared with medical records; accuracy of discharge record reporting for individual facilities ranged from recording none of the episiotomies performed at worst, to 86.4 percent at best.5 In their nationally representative survey of women's childbearing experiences between 2000 and 2002, the Maternity Center Association documented that 35 percent of women who had a vaginal birth reported having an episiotomy.6
Likelihood of episiotomy is known to vary based on whether a woman is having a first vaginal birth or a subsequent birth and whether the birth is assisted by use of vacuum or forceps. Both a first birth and assisted vaginal delivery are associated with greater use of episiotomy.1, 7 Likelihood of episiotomy also varies across obstetric care settings. A study of 49,692 vaginal births in 18 hospitals in Philadelphia between 1994 and 1998 examined use of episiotomy among women giving birth for the first time to infants who weighed 2500 to 4000 grams and whose records did not note a difficult labor or assisted delivery. Forty-two percent of women in the study had an episiotomy, with a range of hospital averages from 20 percent to 73 percent.8
The precise origins of episiotomy are lost. Descriptions appear in European texts by the 1740s.9 Taliaferro first described its use in the U.S. medical literature in 1852.10 While caring for a moribund primiparous woman with eclampsia, he describes “immense distension of the vulva” and proceeding to make “an incision at the vulva, believing that preferable to permitting it [the fetal head] to force its way through [the anus] below.” He further noted: “…surely a smooth incised wound would be less injurious and heal more readily than one by rough violence.”10
These observations foreshadow early uses of episiotomy that became ingrained in hospital obstetric practice beginning in the 1920s: to hasten delivery for maternal or fetal indications; to resolve the “unyielding vulva”; and in cases thought to portend imminent severe laceration, to forestall an extensive spontaneous laceration and substitute a more readily repaired surgical incision. In this decade, Joseph DeLee, an opinion leader in the drive to establish obstetrics as a medical specialty, began to promote the concept that episiotomy should be “used routinely” for the maternal indications above as well as to prevent brain damage, epilepsy, and cerebral palsy that might result from the “battering” of the fetal head against a rigid perinuem.11, 12
Most obstetric textbooks endorsed episiotomy by the 1930s: “This is a prophylactic procedure, its purpose being: (a) to prevent extensive damage of the posterior vaginal wall and pelvic floor; (b) to save from gross injury the sphincter ani muscle and wall of the anal canal; (c) to curtail long-drawn-out overdistension of the vaginal wall, and the damage resulting therefore…,” providing the advantages of preventing extensive laceration, preserving sphincter integrity, providing a clean-cut wound and making scar tissue less likely to form, and ultimately achieving a result that is more satisfactory from “anatomical, functional, and cosmetic standpoints.”13 (Chapter 22, p. 666) Authors of texts frequently note that the procedure is especially warranted for primiparous patients, observing, “inasmuch as some degree of laceration occurs in the majority of cases episiotomy is a conservative rather than a radical procedure”14 (Vol 2, Section 10, p. 330).
In the 1940s and 1950s, routine episiotomy was little debated and increasingly used. During subsequent decades, the proposed benefits of episiotomy continued to take on broader scope. These benefits included goals of reducing postpartum perineal pain when compared to spontaneous lacerations, preventing future pelvic organ prolapse and urinary and rectal incontinence, and preserving sexual function both by reducing slackness of the vaginal introitus and by reducing the likelihood of pain with intercourse.15, 16 By the 1980s, episiotomy accompanied 64 percent of vaginal births in this country.17
Episiotomy became a routine practice of physicians long before emphasis on using outcomes research to inform practice. In seeking to establish an evidence base to support or refute the use of episiotomy, randomized clinical trials in the mid and late 1980s revealed two key findings: (1) routine mediolateral episiotomy use compared to restricted use was associated with higher risk of anal sphincter and rectal injuries, and (2) such surgery precluded a woman's possibility of giving birth with an intact or minimally damaged perineum.18 Larger trials in more varied populations of women and providers followed in the 1990s, with similar results. Investigators also sought to assess longer-term effects of perineal management at the time of birth on outcomes such as persistent pain, pelvic floor defects, urinary and rectal continence, and sexual function and satisfaction. The latter topics entered the spotlight as these outcomes became more dominant among the prevention-oriented goals proposed to be achieved by episiotomy.
Despite several decades of research, which many interpret as definitive evidence against routine use of episiotomy, little professional consensus has developed about the appropriateness of routine use. Lack of consensus is illustrated by variation in rates of use. From 1987 to 1992, Kane Low and her colleagues documented provider-level variation from 13.3 percent to 84.6 percent, with an average of 51 percent among spontaneous term births in a prospectively enrolled low-risk population.19 Episiotomy use varied widely in the midwives and physicians studied. Variation has been reported by time of day20 and by facility type, size, and location.21
Although restricted-use arms of trials have achieved episiotomy rates as low as 8 percent to 10 percent,22, 23 use remains common in many locations. Current obstetric care providers who continue to view episiotomy favorably most strongly agree with survey items that indicate they employ episiotomy to “prevent perineal trauma and to prevent pelvic floor relaxation and the consequences of pelvic floor relaxation, such as bladder prolapse and urinary incontinence.” Furthermore, providers endorse the statement that they “prefer to employ episiotomy frequently, because it is easier to repair than the laceration that results when episiotomy is not used.”24
Five points summarize the long history of episiotomy:
1. routine use of episiotomy evolved from more limited indications;
2. a goal of preventing future problems is eclipsing goals for labor “management”;
3. provider type is associated with acceptance or avoidance of its use;
4. among providers of the same type, use varies widely; and
5. rates of use vary distinctively by institution and region.
The last three of these characteristics—wide practice variation—suggest to health services researchers that episiotomy use is heavily driven by local professional norms, experiences in training, and individual provider preference. Variation in biology, in this case the physiology of vaginal birth, rarely explains discrepancies in practice as large as those seen for episiotomy use. When practice variation is prominent, accrual of evidence of benefits and risks should take on a key role in informing care. In this context, episiotomy has the hallmarks of a procedure that warrants repeated synthesis of the evidence of proposed benefits and potential risks. A 1968 Lancet editorial aptly captures the issues: “Despite the apparent simplicity of episiotomy, argument continues about how often the operation should be undertaken, the choice of incision, and the method of repair. Moreover, little information is available about the incidence of later complications such as dyspareunia.”25
This systematic evidence review revisits randomized trials of routine versus restricted use, identifies the sole trial of midline versus median episiotomy, presents evidence for choosing among options for repair methods, and extends prior reviews to encompass longer-term outcomes. Specifically, we have systematically assessed the evidence from trials and prospective cohorts related to the influence of episiotomy on measures of pelvic floor relaxation, continence, and sexual function and satisfaction. The goal of this synthesis is to inform care providers, professional organizations, advocates, and individual women about the current state of the evidence about the routine use of episiotomy.
The original Scope of Work for this review was developed by the American College of Obstetricians and Gynecologists (ACOG) and forwarded by the Agency for Healthcare Research and Quality (AHRQ) to the RTI International-University of North Carolina Evidence-based Practice Center (RTI-UNC EPC). The work assignment proposed four provisional questions for review. Those questions were the basis for a brief review completed by the EPC Coordinating Center (The Lewin Group). Brief reviews help prioritize the topics AHRQ assigns to the 10 “generalist” EPCs.
The RTI-UNC EPC further revised the proposed questions after discussions with internal technical staff, AHRQ staff, and our Technical Expert Advisory Group (TEAG). The final key questions (KQ s) are listed below.
KQ 1. Does the practice of liberal or routine episiotomy compared to more selective use of episiotomy influence maternal postpartum outcomes?
KQ 2. Does episiotomy incision type (i.e., midline or mediolateral) influence maternal postpartum outcomes?
KQ 3. Does the repair of the perineal defect (i.e., suture type and repair approach) influence maternal postpartum outcomes?
KQ 4. Does episiotomy have a long-term influence on urinary incontinence, fecal incontinence, or pelvic-floor defects?
KQ 5. Does episiotomy or incision type, or both, influence future sexual function?
The conceptual framework in Figure 1
Another issue is how to define “routine episiotomy.” Defining the term is a challenge because the category is described in studies by negatives such as “not for fetal distress” and “not for dystocia.” We captured the operational definitions provided by authors of included publications and attempted to isolate data that reflect use of episiotomy at the time of uncomplicated spontaneous vaginal births. The text of this review and the evidence tables specify how authors define the terms “indicated” and “routine” so that our readers may use this information as a filter through which to view study findings.
A third concern is how to distinguish immediate versus long-term outcomes. To ensure a broad review of the available literature, we included all studies that report relevant outcomes without regard to the specific followup interval. We abstracted the intervals at which followup data are collected. Studies were later classified into those that report on postpartum versus those that include longer-term followup. If a study provides both types of information—immediate and long-term followup—study results appear in more than one portion of the review.
Chapter 2 describes our methods, including our search strategies and inclusion/exclusion criteria; we also document our approach to grading the quality of articles and rating the strength of evidence. In Chapter 3, we present the results of our literature search and synthesis of retained articles by key question. Chapter 4 further discusses the findings, presents our conclusions, and offers recommendations for future research. Our references and included studies and a listing of excluded studies follow Chapter 4. Appendixes include a detailed description of our search strings (Appendix A), abstraction and quality-rating forms (Appendix B), detailed evidence tables (Appendix C), and acknowledgments (Appendix D). Appendixes and evidence tables cited in this report are provided electronically at http://www.ahrq.gov/clinic/ tp/epistp.htm.
We identified technical experts in the field of episiotomy to provide assistance throughout the project. The TEAG (see Appendix D) was expected to contribute to AHRQ's broader goals of (1) creating and maintaining science partnerships as well as public-private partnerships and (2) meeting the needs of an array of potential customers and users of its products. Thus, the TEAG was both an additional resource and a sounding board during the project. The TEAG included eight members: seven technical/clinical experts and one potential user of the final evidence report, an ACOG representative.
To ensure robust, scientifically relevant work, we called on the TEAG to provide reactions to work in progress and advice on substantive issues or possibly overlooked areas of research. TEAG members participated in conference calls and discussions through e-mail to
refine the analytic framework and key questions at the beginning of the project;
discuss the preliminary assessment of the literature, including inclusion/exclusion criteria; and
provide input on the information and categories included in evidence tables.
Because of their extensive knowledge of the literature, including numerous articles authored by TEAG members themselves, and their active involvement in professional societies and as practitioners in the field, we also asked TEAG members to participate in the external peer review of the draft report.
This evidence report addresses the key questions outlined in Chapter 2 through systematic review of published literature. We anticipate that the report will be of value to ACOG and other professional societies for their various efforts to inform and educate obstetricians, family physicians, nurses, midwives, childbirth educators, doulas, and women in their reproductive years. This report can bring practitioners up to date about the current state of evidence, and it provides an assessment of the quality of studies that aim to determine the outcomes of the practice of episiotomy. Researchers can obtain a concise analysis of the current state of knowledge in this field and will be poised to pursue further investigations that are needed to improve health for obstetric populations.
In this chapter, we document the procedures that the RTI International-University of North Carolina Evidence-based Practice Center (RTI-UNC EPC) used to develop this comprehensive evidence report on episiotomy. We first describe our strategy for identifying articles relevant to our key questions, our inclusion/exclusion criteria, and the process we used to abstract relevant information from the eligible articles and generate our evidence tables. We also discuss our criteria for grading the quality of individual articles and the strength of the evidence as a whole. Finally, we explain the peer-review process.
| Category | Criteria |
|---|---|
| Study population | Humans |
| Study settings and geography | Inpatient, outpatient, home; all geographical locations subject to publication language and study design criteria |
| Time period | 1950 through 2004 |
| Publication languages | English only |
| Sample size | N greater than or equal to 40 |
| Admissible evidence (study design and other criteria) | Original research studies that provide sufficient detail regarding methods and results to enable use and adjustment of the data and results |
| For studies on KQ 1 and KQ 3 | |
| RCTs: double-blinded and single-blinded designs | |
| For studies on KQ 2, KQ 4 and KQ 5 | |
| RCTs: double-blinded and single-blinded designs | |
| Non-RCTs: prospective cohort studies | |
| Relevant outcomes must be able to be abstracted from data presented in the papers |
We excluded studies that (1) did not report on women of reproductive age; (2) were published in languages other than English; (3) did not report information pertinent to the key clinical questions; (4) had fewer than 40 subjects; and (5) were not original studies (although we did include systematic reviews and meta-analysis in our discussion).
Databases. We used multifaceted search strategies to include all the current valid research on the key questions. We used standard electronic databases: MEDLINE®, Cochrane Collaboration resources, and the Cumulative Index to Nursing and Allied Health Literature (CINAHL®). We also undertook hand-searches of the reference lists of relevant articles to make sure that we were not missing any relevant studies. We consulted with the Technical Expert Advisory Group (TEAG) about any studies or trials that are currently under way that may not be published yet.
| Search Terms | Results |
|---|---|
| “Episiotomy” [MeSH] Field: All Fields, Limits: English, Randomized Controlled Trial, Human | 75 |
| “Episiotomy” [MeSH], English, Review, Human | 68 |
| Labor Stage, Second [mh], English, Review, Human | 40 |
| Labor Stage, Second [mh], English, Randomized Controlled Trial, Human | 58 |
| Search Number | Search Terms | Results |
| #1 | “Episiotomy”[MeSH:NoExp] Field: All Fields, Limits: English, Human | 676 |
| #2 | “Episiotomy” English, Editorial, Human | 14 |
| #3 | “Episiotomy” English, Letter, Human | 58 |
| #4 | “Episiotomy” English, Review, Human | 68 |
| #5 | “Episiotomy” English, Meta-Analysis, Human | 3 |
| #6 | “Episiotomy” English, Practice Guideline, Human | 0 |
| #7 | #2 OR #3 OR #4 OR #5 OR #6 | 140 |
| #8 | #1 NOT #7 | 536 |
| #9 | Repair | 138,222 |
| #10 | #1 AND #9 | 86 |
| #11 | labor stage, second [mh] | 638 |
| #12 | #9 AND #11 | 6 |
| #13 | ((“Episiotomy” OR “pregnancy”) AND (“midline” AND “mediolateral”)) [MeSH:NoExp] Field: All Fields, Limits: English, Human | 11 |
| #14 | ((“Episiotomy” OR “pregnancy”) AND (“sphincter”)) [MeSH:NoExp] Field: All Fields, Limits: English, Human | 3 |
Figure 2
Article Selection Process. Once we had identified articles through the electronic database search, review articles, and bibliographies, we examined abstracts of articles to determine whether studies did, in fact, meet our criteria. Two reviewers separately evaluated the abstracts for inclusion or exclusion, using an Abstract Review Form (see Appendix B). If one abstractor concluded that the article should be included in the review, we retained it. Abstracts initially excluded from the study by one reviewer received a second review. The group included three physician health-services researchers—Katherine Hartmann, MD, PhD (Scientific Director); John Thorp, Jr., MD (Co-Investigator); and Gerald Gartlehner, MD, MPH (Study Coordinator); one health-services researcher—Meera Viswanathan, PhD (Study Director); and one junior epidemiologist—Rachel Palmieri, B.S.
Approximately 325 articles required review of the full article because of missing or uninformative abstracts. For the full article review, one reviewer read each article and decided whether it met our inclusion criteria, using a Full Text Inclusion/Exclusion Form (see Appendix B *). A list of articles excluded at the full-article review stage is provided at the end of this report, along with the reasons for their exclusion.
The five staff members who conducted this systematic review jointly developed the data abstraction tables (see Appendix B) and evidence tables (Appendix C). These tables were designed to provide sufficient information to enable readers to understand the studies and to determine their quality; we gave particular emphasis to essential information related to our key questions. The format of the evidence tables, which was based on successful designs used for prior systematic reviews, varies somewhat by key questions.
The abstractors trained themselves on entering data into the tables by abstracting several articles and then reconvening as a group to discuss the utility of the table design. The abstractors repeated this process through several iterations until they decided that the tables included the appropriate categories for gathering the information contained in the articles. The design was then reviewed by the TEAG through a teleconference.
All team members shared the task of initially entering information into the data abstraction forms. Another member of the team also reviewed the articles and edited all initial table entries for accuracy, completeness, and consistency. The two abstractors reconciled all disagreements concerning the information reported in the abstraction forms. The full research team met regularly during the article abstraction period and discussed global issues related to the data abstraction process.
We then entered the data from the abstraction forms into evidence tables and once again checked for consistency and accuracy.
The final evidence tables are presented in their entirety in Appendix C. Entries in the tables are listed by publication date. A list of abbreviations used in the tables appears at the beginning of that appendix.
Rating the Quality of Individual Articles. The RTI-UNC EPC's approach to assessing the quality of individual articles was developed based on the domains and elements recommended in the evidence report by West and colleagues, Systems to Rate the Strength of Scientific Evidence. 26 We developed different rating schemes for RCTs and prospective cohort studies.
For RCTs, we rated studies on the following criteria (see Appendix B for the RCT Quality Rating Form).
1. Randomization Approach and Implementation: This item judged whether the approach described a valid method of randomization, whether allocation concealment was achieved, and whether balance was documented across study groups.
Approach: Articles that assigned the groups in a manner inconsistent with true randomization methods automatically received a poor rating for this category and overall. Articles that merely stated that they “randomly assigned” the groups and had either no balance or did not report on balance received a poor rating. Articles with no documentation of concealment were rated poor. Those with potentially inadequate concealment methods were rated poor if the study had poor balance of allocation or if balance was not documented in the paper. Those with potentially poor concealment were rated fair if they documented good balance.
2. Masking: This item was relevant only to Key Question (KQ) 3. For KQ 1, masking of the birth attendant is not feasible and for KQ 2, masking the kind of episiotomy the women received was not possible.
Approach: If the outcome assessors and participants were adequately masked within the possibilities of the study design, we rated the category as good. If there was a mix of masking among the outcomes, we rated the category as fair. If masking was not done at all and not attempted, we rated the category as poor. In the event that the article did not report on masking, we noted this point in the quality assessment table and counted it against the trial in the overall quality rating.
3. Operational Definitions and Measurements: This item judged the quality of the operational definitions of the outcomes (i.e., were they adequately described) and whether they were adequately collected (i.e., was the method sufficient and appropriate).
Approach: If a primary outcome was identified, we gave it more weight in its contribution to this category's score. Otherwise, we rated this category on the basis of an average across all outcomes and the ability to define and measure them. Good definitions and measurement include the following: visual analog scale, detailed Likert scale, detailed time points in question, details about what was asked of the patient, medical chart abstractions, and clinical examination or assessment. If an article simply stated an outcome, such as “perineal pain,” and gave no further explanations about it, we rated the category in the fair-to-poor range, depending on how the study collected the information.
4. Post-Randomization Exclusions: This item captured how many post-randomization exclusions were explicitly stated.
Approach: In typical randomized trials, intention to treat analysis is expected. Some investigators represented in this literature enrolled women during prenatal care rather than on labor and delivery in an effort to get a representative sample of prenatal patients. We note exclusions as appropriate when individual gave birth at a hospital not participating in the study, or when participants had outcomes that made them ineligible to participate in the trial such as preterm birth or cesarean birth. Any other exclusions after randomization were considered inappropriate.
5. Loss to Followup: This item collected percentages of followup at every time point in the study at which data were collected; we used it to determine if followup was adequate.
Approach: An average of followup percentages for short-term and long-term followup contributed to this category. In general, we considered followup greater than or equal to 90 percent in the short term and 80 percent in the long term to be good.
6. Statistical analysis: This factor included whether the investigators conducted the study in an appropriate manner and took the effect of multiple comparisons into account. This item also reviewed the study's use of multivariate statistical techniques and/or participant restriction or stratification to control for confounding.
Approach: This category is not included on the quality assessment form because of the nature of reporting in journals dating back to 1974. P-values were sufficient for reporting in the past, whereas point estimates, tests for homogeneity, stratification, and confidence intervals are more widely reported now. Although this category did not explicitly contribute to the overall quality rating, we used it for articles that were on the border between categories.
For RCTs, the two article abstractors independently rated each article on each of the first five categories as indicated by the quality assessment form (Appendix B *). A third reviewer flagged studies with differences in scoring on individual components. We reconciled these differences by consensus. We then created a composite rating. If a study had poor randomization approach or implementation with a fatal flaw (e.g., lottery cards), we rated it as poor. For all other scores, we gave each item equal weight. Specifically, studies that received good ratings on all categories were rated as good studies overall. If a study received one or two fair or poor ratings, or the equivalent of a deficiency, it was rated as an overall fair-quality study. Studies with three or more fair ratings or a poor randomization design or implementation with a fatal flaw were rated poor-quality studies.
For classifying the quality of prospective cohort studies included for KQs4 and 5, we assessed the following factors:
1. Study population: We sought documentation in the publication of the degree to which the study population was representative of women with uncomplicated spontaneous vaginal births in the study facilities or broader population sampled.
Approach: To receive a rating of good for this component of study design and conduct, we required a study to describe clearly (1) the base population from which cohort participants were sought, (2) the number of women in that base population (a denominator), (3) clear inclusion and exclusion criteria, and (4) the proportion of eligible women who were ultimately enrolled in the cohort.
Studies lacking only items (2) and (4) were classified as fair, and studies lacking items (1) or (3) with any combination of other missing documentation were rated as poor with respect to documentation of the study population.
2. Measures: We sought documentation in the publication of four components of quality of measurements. The first was specification of whether the measure was a primary or secondary measure for the study as noted in any portion of the paper. The second was a clear description of the measures used that is sufficient to allow replication of the measure (e.g., visual analog scale, McGill Pain Score). We accepted references to methods described more fully in other publications as documentation if the reference in fact provided details. The third component was a clear description of how the measure was obtained and by whom if, applicable (e.g., telephone interview, face-to-face interview, mailed questionnaire). The fourth was a clear specification of the time interval in which the data were collected with respect to the index birth.
Approach: We classified studies that achieved all four document requirements for the measurement of relevance to the key question as having good implementation of the measures component. We classified studies as fair for this component if item (1) was unclear or not noted and if this was the only limitation. If any other item was missing, we considered the quality of documentation of measures to be poor. Of note, a given study could be classified as good for one key question (e.g., the key question about sexual function), while getting a fair rating for another measure that related to a different key question.
3. Loss to followup: If data from more than one time interval were reported, we sought documentation of the these followup measures: (1) the number of participants in the sample at the time of followup, (2) analysis of how respondents differed from nonrespondents if loss exceeded 20 percent, and (3) absolute loss to followup by time interval.
Approach: We rated a study as good quality if the research team accomplished each of the above measures and had ≤ 20 percent loss to followup at 3 months and beyond. A study was rated fair if the investigators accomplished items (1) and (2), had no apparent response bias as investigated by comparison of baseline characteristics, and had up to 30 percent loss to followup; or if they had between 20 percent and 25 percent loss to followup without documentation of comparability. We rated a study as poor for this component if it had more than 30 percent loss to followup or more than 25 percent loss without comparison for response bias.
4. Analysis: We sought four tiers of documentation: (1) thorough enumeration of the number of cohort participants, the characteristics of their birth experience and perineal status, and general descriptive characteristics such as parity and number of prior vaginal births in cohorts that included multiparous women; (2) assessment of confounding and modifying factors by bivariate analysis, stratified analysis, or multivariable modeling; (3) reporting of adjusted estimates for main effects that took into account identified confounding or modifying factors (stratified or separate analyses were acceptable for simple constructs); and (4) presentation of adjusted results with a measure of statistical precision such as a confidence interval or P-value.
Approach: We rated a study as having a good analysis implementation if all of these elements were present. Missing or limited detail for item (1) resulted in a fair rating if this was the only deficit; similarly, we rated a study fair if it was missing or providing only limited detail for item (2) if subsequent multivariable modeling implied that the step had been completed and all other items were present. Missing items (3) or (4) or any other two or more items in combination resulted in a poor rating.
Grading the Strength of Available Evidence. Our scheme follows the criteria applied by Berkman et al.27 That system included three domains: quality of the research, quantity of studies (including number of studies and adequacy of the sample size), and consistency of findings. The four senior staff members assigned grades by consensus.
We graded the body of literature applicable to each of the four components of the two key questions separately and present our findings in Chapter 4. The possible grades in our scheme are as follows:
I. The evidence is from studies of strong design; results are both clinically important and consistent with minor exceptions at most; results are free from serious doubts about generalizability, bias, or flaws in research design. Studies with negative results have sufficiently large samples to have adequate statistical power.
II. The evidence is from studies of strong design, but some uncertainty remains because of inconsistencies or concern about generalizability, bias, research design flaws, or adequate sample size. Alternatively, the evidence is consistent but derives from studies of weaker design.
III. The evidence is from a limited number of studies of weaker design. Studies with strong design either have not been done or are inconclusive.
IV. No published literature.
As is customary for all evidence reports and systematic reviews done for the Agency for Healthcare Research and Quality (AHRQ), the RTI-UNC EPC requested review of this report from a wide array of outside experts in the field and from relevant professional societies and public organizations. AHRQ also requested review from its own staff and appropriate federal agencies. We received 18 responses; the 17 individuals listed in Appendix D * gave us permission to acknowledge them. We compiled all comments and addressed each one individually, revising the text as appropriate.
This chapter presents results of our literature search and findings for each key question (KQ) introduced in Chapter 1. KQ 1 examines postpartum maternal outcomes related to liberal or restrictive use of episiotomy. KQ 2 compares postpartum outcomes of midline and mediolateral episiotomy. KQ 3 examines outcomes of methods for repair of perineal defects. KQ 4 summarizes longer term outcomes of episiotomy related to fecal and urinary incontinence and pelvic floor integrity and function, and KQ 5 examines longer-term sexual function.
We report results in five main sections of this chapter corresponding to the core issues that this systematic review addressed. In each section, we report first on specific details about the yields of the literature searches, population, outcomes, and quality of the studies, and then on the findings for each key question. Summary tables present selected information on each study. Detailed evidence tables are in Appendix C *.
Each trial that we identified, compared two study arms or groups: (1) one in which the intention was to restrict routine use of episiotomy and (2) one group in which a liberal--use policy endorsed routine use. Defining the distinctions between such groups in a manner that can be uniformly achieved is the central challenge for these trials.
The strictest definition of “restrictive” was to avoid episiotomy unless indicated for fetal well-being.23, 32 Other definitions pivoted on instructions to “avoid episiotomy,” use only when “medically necessary,” or not perform episiotomy for the sole purpose of avoiding a laceration.22, 28, 29, 31 The largest trial defined restrictive use as only for fetal indications and/or to avoid severe lacerations.30
Liberal-use arms were defined in terms such as “routinely conducted,” “routine,” “usual care,” and “elective.”22, 28–31 Two studies describeed the liberal-use policy as encouraging routine use of episiotomy when “a tear is imminent”32 and “to prevent a tear.”23
Regardless of how the randomization groups were defined for study implementation, neither definition may align with the usual practice of individual clinicians, especially with respect to how they would describe the goal they are trying to achieve when performing an episiotomy. Variation in norms and usual practice patterns are demonstrable in the variation of episiotomy use observed in these trials. Use of episiotomy in the restrictive groups ranged from lows of 7.6 percent22 and 10.2 percent23 to highs of 44 percent29 and 53 percent.31 We observed up to a seven-fold difference in the use of episiotomy within the restrictive groups across these trials. The routine--use/liberal-use arms had episiotomy use rates from a low of 44.9 percent22 to a high of 83 percent.30, 31 Wide variation in patterns of use across trials introduces substantial heterogeneity in the “exposures” under study. A large degree of built-in “cross-over” occurred in the setting of higher rates of use in the restrictive groups. In synthesizing these data, we emphasize that these trials compare policies of episiotomy use, not episiotomy versus no episiotomy.
An additional factor that influences generalizability of findings for practice in the United States is that six of the seven studies used mediolateral episiotomy. The only North American study, conducted in Canada, was also the only study in which median (midline) episiotomy was used. Because midline episiotomy is the most common technique used in the United States,9, 35 this means that the majority of the literature reflects outcomes that would be expected with a distinctively different episiotomy approach with respect to anatomic location of the defect and potential complications. Key Question 2, focused on incision type, identified only one poor- quality RCT directly comparing median and mediolateral episiotomy.36 We review this study- in detail in the next section; briefly, the study -suggests increased risk of rectal injury and complicated or extended incision with midline episiotomy. This finding is comparable to those of observational studies that report that midline incisions are more likely to result in extensions that injure the anal sphincter and/or rectal mucosa.37, 38
Study Populations. Six studies restricted participation to term births;22, 23, 28–31 the seventh, to births at longer than 34 weeks' gestation.32 Five studies specified that they enrolled only singleton gestations. The two studies that did not specify singleton gestations were conducted in the 1980s before routine use of ultrasound; in that period, -providers were unlikely to miss a multiple gestation clinically, -excluding twin gestations before enrollment- in a trial might have been difficult to do with complete confidence.22, 23 Two studies required vertex presentations.28, 31 Regardless of stated inclusion criteria, multiple gestations and breech presentations do not seem to be represented in these trials. To this extent the studies do represent episiotomy use in uncomplicated vaginal deliveries.
Three studies enrolled only women having their first births.22, 31, 32 This approach eliminates any influence of prior perineal trauma and healing on the trial outcomes. The combined study populations of these three studies was 409 participants; this is smaller than the entire cohort of each of the other studies. Four studies enrolled multiparous women. In these studies, the proportion of women who were primiparous ranged from 40 percent to 68 percent, generally with good balance across study groups.23, 28–30 The exception occurred in the publication by House and colleagues in which it appears that multiparous women were more likely to have been randomized to the restricted-use group.28 Within the studies that achieved balanced allocation by parity, analyses that stratify outcomes by parity help inform how outcomes may differ based on prior childbirth experiences. No trial data were identified that allow consideration of whether outcomes vary by race and ethnicity.
Each study focused on normal spontaneous vaginal births. To reduce the number of women who subsequently had operative vaginal deliveries or cesarean births, the majority of studies allocated women to study groups as close to the anticipated time of birth as feasible. The proportion of assisted vaginal births (both forceps and vacuum) in these trials was 2 percent,30 3 percent,29 4 percent to 5 percent,31 11 percent to 14 percent,28 or not specifically reported; . The absence of reporting on instrumental and cesarean births raises the potential of unreported post-randomization exclusions. In two cases, authors noted the number of cesarean births and that those cases are described as part of the trial population and excluded from analyses.29, 32 Both of these studies enrolled women during prenatal care, an approach that improved representativeness of the study population and explains the increased numbers of women with cesarean births who were logically excluded later.
Outcomes. The most common primary outcome was perineal status after the birth. All seven studies reported incidence of episiotomy and of third- or fourth-degree lacerations or extensions. Five studies reported prevalence of intact perineum;22, 23, 29, 31, 32 one reported prevalence of intact perineum combined with first-degree lacerations.28 One trial did not report data about intact perineum or minor lacerations, but the proportion can be inferred because first-degree or intact should be the converse of the data they do report about “any perineal suturing” required.30
A single study incorporated masked assessment of perineal trauma; Sleep and colleagues arranged to have a clinician who did not assist the delivery and was masked to the study group asesss perineal status and perform the repair.23 Although this design approach does not mask the obvious appearance of an episiotomy compared to a spontaneous tear or intact perineum, it prevents any bias in uniformly recording the extent and location of any perineal trauma. The other studies are at risk of this type of bias.
The most common secondary outcome was pain in the days immediately after the birth. Five of the seven trials assessed pain. Two groups used visual analog scores and classified responses into categories of mild, moderate, or severe.28, 32 One study used an unspecified “standardized questionnaire” and also reported pain severity as mild, moderate, or severe. 23 Another used the McGill Pain Questionnaire and reported the composite score of the 10-item scale.29 The largest study, conducted in Argentina, did not define how they collected data about “perineal pain.”30 Additional measures include use of pain medications and reports of pain with specific physical activities.
Three publications did not report masking of the assessors to study group.22, 31, 32 The remainder of the studies reported that the individual conducting the pain assessment was unaware of allocation of the participants. Two studies clearly noted that women were not aware of the group to which they were allocated.23, 28
Additional outcomes assessed include resumption of sexual activity, estimated blood loss, and wound- healing appearance and complications, including infection, healing by secondary intention, and persistent granulation tissue. Few of these outcomes were measured uniformly in more than one study.
| Citation | Inclusion Parity | Groups | Outcome(s) | Outcome among Liberal-Use Group | Outcome among Restrictive-Use Group | Authors' Conclusions |
|---|---|---|---|---|---|---|
| Country | Episiotomy Use | |||||
| Episiotomy Type | ||||||
| Number | ||||||
| Sleep et al., 198423 | Term, singleton pregnancy | G1: Liberal = “try to prevent a tear” | Intact | 24.30% | 33.90% | Restricting use of episiotomy neither increased nor decreased problems experienced by mothers. |
| United Kingdom | Anticipated NSVD | G2: Restrictive = “try to avoid episiotomy and restrict to fetal indications” | Third or fourth degree | n = 1 | n = 4 | |
| Mediolateral | 40%–46% primiparous | G1: 51.4% | Any suturing | |||
| N = 1,000 | G2: 10.2% | Primip: 89% | Primip: 74% | |||
| Multip: 69% | Multip: 66% | |||||
| Harrison et al., 198422 | Term, primigravid, anticipated vaginal birth | G1: Mediolateral episiotomy routinely conducted | Intact | Not reported | 21% | Primigravid patients allocated to not undergo episiotomy generally fared better than they would have done with normal hospital practices. Forty-six percent had no or only first-degree tears. |
| Ireland | G2: No episiotomy unless “medically necessary” | Third or fourth degree | 6% | None | ||
| Mediolateral | G1: 44.9% | |||||
| N = 181 | G2: 7.6% | |||||
| House et al., 198628 | Term, vertex, anticipated NSVD | G1: Standard current management | Intact or first degree | Primip: 4% | Primip: 32% | Restrictive policy resulted in a significant increase in the incidence of patients with intact perineum or only a first-degree tear. |
| United Kingdom | 53%–68% primiparous | G2: Episiotomy not performed to prevent laceration | Second degree = | Multip: 26% | Multip: 54% | |
| Mediolateral | G1: 69% | Episiotomy or second degree | Primip: 96% | Primip: 68% | ||
| N = 165 | G2: 18% | Third degree | Multip: 70% | Multip: 45% | ||
| Primip: 4% | Primip: None | |||||
| Multip: 4% | Multip: None | |||||
| Klein et al., 199229 | Term, singleton; anticipated NSVD | G1: Liberal = avoid tear | Intact (no suturing) | Primip: 6.6% | Primip: 7.5% | No evidence that liberal use prevents perineal trauma; restriction of episiotomy use among multiparous women results in significantly more intact perineums and less suturing. |
| Canada | 50%–52% primiparous | G2: Restrictive = attempt to avoid episiotomy | Episiotomy alone | Multip: 19.3% | Multip: 30.7% | |
| Midline | G1: | Third or fourth degree | Primip: 67.2% | Primip: 42.2% | ||
| N = 730 | Primip: 81% | Multip: 45.2% | Multip: 29.0% | |||
| Multip: 52% | Primip: 7.9% | Primip: 13.9% | ||||
| G2: | Multip: 0% | Multip: 0% | ||||
| Primip: 47% | ||||||
| Multip: 31% | ||||||
| Argentine Episiotomy Trial Collaborative Group, 199330 | Term, singleton first or second vaginal birth; no prior cesarean or severe perineal trauma | G1: Routine | Perineal suturing | 88.1% | 63.1% | No evidence that routine use of episiotomy reduces risk of severe perineal trauma. |
| Argentina | 40%–41% primiparous | G2: Selective | Third or fourth degree | Primip: 1.8% | Primip: 1.4% | |
| Mediolateral | G1: 82.6% | Multip: 0.9% | Multip: 0.8% | |||
| N = 2,606 | G2: 30.1% | |||||
| Eltorkey and Nuaim, 199431 | Term, singleton, vertex, primiparous; anticipated NSVD | G1: Elective | Intact | 7% | 28% | Selective group more likely to have an intact perineum. No indication that episiotomy offers clear benefit in terms of decreased numbers of lacerations. |
| Saudi Arabia | G2: Selective = essential | Second degree or episiotomy without extension | 71%* | 49%* | ||
| (British staff) | G1: 83%* | Third degree or episiotomy with extension | None | None | ||
| Mediolateral | G2: 53%* | |||||
| N = 200 | ||||||
| Dannecker et al., 200432 | >34 weeks, singleton, primiparous; anticipated NSVD | G1: Liberal = if tear imminent and/or fetal indications | Intact | 10% | 29% | Restrictive use resulted in three-fold increase in the rates of intact perinea. No difference with regard to third-degree tears. |
| Germany | G2: Restrictive = fetal indications only | Third degree | 8% | 4% | ||
| Mediolateral | G1: 77% | |||||
| N = 109 | G2: 41% | |||||
G, group; primip, primiparous; multip, multiparous; NSVD, normal spontaneous vaginal delivery.
Text and tables in this publication are not concordant; overall incidence from text; second degree and episiotomy totals from table.
The largest trial conducted was a multisite study in Argentina that enrolled 2,606 women and was of fair quality (see evidence tables in Appendix C for details).30 This study found a 2.4-fold increase in risk of anterior tears among women in the restrictive- use arm (95% confidence interval [CI], 1.9–2.9), and decreased risk of posterior perineal surgical repair (relative risk [RR] = 0.72; 95% CI, 0.68–0.75) when comparing restrictive to liberal use. Eighty-eight percent of women in the liberal group had a surgical repair, as did 63 percent in the restrictive group. Pain, healing complications, and dehiscence were all less frequent in the restrictive-use group. The authors concluded that episiotomy confers no apparent benefit and that high rates of use cannot be justified.
The findings of these studies are fully compatible. None of the authors concluded that episiotomy provides any benefits with respect to perineal trauma. The majority concluded that intact or minimal perineal trauma is more common when episiotomy use is restricted. This synthesis of the data is compatible with the findings of prior systematic evidence reviews that are updated with this report. Although the authors of these research publication appear loath to ascribe harm to the use of episiotomy, in our judgement-concluding from their data that routine use is harmful is accurate, at least to the extent that it creates a surgical incision of greater extent than a woman might have experienced had the episiotomy not been performed.
| Citation | Inclusion Parity | Groups | Outcome(s) | Outcome among Liberal-Use Group | Outcome among Restrictive-Use Group | Authors' Conclusions |
|---|---|---|---|---|---|---|
| Country | Episiotomy Use | How Measured? | ||||
| Episiotomy Type | When? | |||||
| Number | ||||||
| Sleep, 198423 | Term, singleton pregnancy, anticipated NSVD* | G1: Liberal = “try to prevent a tear” | Pain severity in prior 24 hours; questionnaire administered by midwife; 10 days postpartum | 10 days | 10 days | No significant differences between the two groups in maternal pain at 10 days and 3 months postpartum. |
| United Kingdom | 40%–46% primiparous | G2: Restrictive = “try to avoid episiotomy and restrict to fetal indications” | Worst pain in past week; postal questionnaire; 3 months postpartum | Mild: 14.6% | Mild: 14.1% | |
| Mediolateral | G1: 51.4% | Mod: 7.8% | Mod: 7.5% | |||
| N =1000 | G2: 10.2% | Severe: 0.2% | Severe: 0.9% | |||
| 3 months | 3 months | |||||
| Mild: 5.7% | Mild: 4.6% | |||||
| Mod: 1.8% | Mod: 2.5% | |||||
| Severe: 0.2% | Severe: 0.5% | |||||
| House et al., 198628 | Term, vertex, anticipated NSVD* | G1: Standard current management | Pain severity; interview by one of authors using VAS scale 1 to 10 with 1–3 grouped as minimal; 4–6 moderate; 7–10 severe; | 3 days | 3 days | Pain symptoms on the third day postpartum were on average reduced in the patients in whom the use of episiotomy was restricted and equivalent thereafter. |
| United Kingdom | 53%–68% primiparous. | G2: Episiotomy not performed to prevent laceration | 3 days; 6 weeks; 3 months | Mild: 55% | Mild: 68% | |
| Mediolateral | G1: 69% | Mod: 34% | Mod: 22% | |||
| N = 165 | G2: 18% | Severe: 11% | Severe: 10% | |||
| No differences at 6 weeks and 3 months | No women in either group with more than minimal pain at 3 months | |||||
| Klein et al., 199229 | Term, singleton; anticipated NSVD | G1: Liberal = avoid tear | Perineal pain measured by 10 individually scored items using the McGill Pain Questionnaire at 1, 2, and 10 days postpartum | First day | First day | No significant differences in perineal pain and pain with urination at 1, 2, and 10 days postpartum for individual pain scale items or composite score |
| Canada | 50%–52% primiparous | G2: Restrictive = attempt to avoid episiotomy | Primip: 1.8± 0.8 | Primip: 1.7± 0.8 | ||
| Midline | G1: | Multip:1.3± 0.8 | Multip:1.3± 0.9 | |||
| N = 730 | Primip: 81% | Second day | Second day | |||
| Multip: 52% | Primip: 1.3± 0.7 | Primip: 1.4±0.8 | ||||
| G2: | Multip:0.9± 0.7 | Multip: 0.9± 0.8 | ||||
| Primip: 47% | Tenth day | Tenth day | ||||
| Multip: 31% | Primip:0.5±0.5 | Primip: 0.5± 0.5 | ||||
| Multip: 0.3±0.4 | Multip: 0.3± 0.5 | |||||
| Argentine Episiotomy Trial Collaborative Group, 199330 | Term, singleton first or second vaginal birth; no prior cesarean or severe perineal trauma | G1: Routine = do according to hospital's policy before trial | Perineal pain (not clearly defined), assessment method not clearly delineated, physician masked to allocation evaluated on day of discharge | 42.5% | 30.7% | Perineal pain was less common in the restrictive use group. |
| Argentina | 40%–41% primiparous | G2: Selective = try to avoid, do only for fetal indications or if severe tear is imminent | ||||
| Mediolateral | G1: 82.6% | |||||
| N = 2,606 | G2: 30.1% | |||||
| Dannecker et al., 200432 | >34 weeks, singleton, primiparous; anticipated NSVD | G1: Liberal = if tear imminent and/or fetal indications | Perineal pain in postpartum period (days 1 to 5) on 100 mm visual analog scale anchored at “not at all” and “very much” for a range of activities; approach to measurement not clearly specified | Bedrest: 39± 28 | Bedrest: 22±21 | Women in the restrictive group had considerably lower perineal pain scores in all activities assessed during the first 5 days postpartum. |
| Germany | G2: Restrictive = fetal indications only | Sitting: 69±23 | Sitting: 51±25 | |||
| Mediolateral | G1: 77% | Walking: 56±24 | Walking: 37±24 | |||
| N = 109 | G2: 41% | Defecation: 36±30 | Defecation: 21±21 | |||
G, group; primip, primiparous; mod, moderate; multip, multiparous; NSVD, normal spontaneous vaginal delivery
The most recent study, although small, provided the most nuanced approach to pain assessment. The investigators used a visual analog scale to assess pain with four activities: bedrest, sitting, walking, and defecation. Scores were reported in millimeters of the 100 mm-pain scale for each activity. Those in the liberal-use group had the following mean scores (± standard deviation): 39 ± 28 mm with bedrest; 69 ± 23 mm with sitting; 56 ± 24 mm with walking; and 36 ± 30 mm with defecation. The comparable scores in the restrictive group were 22 ± 21 mm; 51 ± 25 mm; 37 ± 24 mm; and 21 ± 21 mm. These differences indicate that the restrictive-use group experienced significantly lower perineal pain during all activities, at levels that are likely clinically significant. These findings could indicate a real difference in pain outcomes or some bias in assessment. The publication does not report masking of the assessors or how the assessments were conducted.
House and colleagues report that pain was more severe on postpartum day 3 among those in the liberal-use group. They also assessed pain outcomes by visual analog scale during an interview conducted by an author; masking of the assessors is not specifically noted. On day 3 in the liberal-use group, 55 percent of women had mild pain' 34 percent, moderate pain; and 11 percent, severe; the comparable categories for those in the restrictive-use group were 68 percent, 22 percent, and 10 percent. They also report tenderness at the time of examination on postpartum day 3. The restricted use group had less tenderness on examination: 79 percent had mild or minimal pain; 18 percent, moderate; 3 percent, severe; compared to 51 percent, 39 percent, and 10 percent in the liberal-use group. . These differences were statistically significant and likely to be clinically relevant.28 These differences in pain by group had resolved by 6 weeks and 3 months respectively.
Klein and colleagues, who conducted the only North American trial and the only trial using midline episiotomy, found no difference in McGill pain scores on days 1, 2, and 10 after the birth for either perineal pain or pain with urination.29 They reported that they conducted analyses both with individual pain-scale items and composite scores. Across each of the five studies, no study found a pain measure that was improved by routine liberal use of episiotomy.
Healing Outcomes. Two studies assessed healing outcomes by physical examination. The Argentine trial reported no difference in rates of the following adverse outcomes that were adequately defined: hematoma prior to discharge; - and infection, healing complications, and dehiscence as assessed on day 7 postpartum. At discharge, they assessed 92 percent and 93 percent of participants in the restricted use and liberal-use groups respectively; this dropped to 43 percent of both groups by the evaluation on day 7 postpartum. 30 House and colleagues examined participants on day 3 postpartum and at 6 weeks. Risk of infection was assessed for all participants on day 3; poor wound apposition and granulation tissue indicating secondary healing were assessed at the later visit at which 53 percent of the trial participants were assessed. Each adverse outcome was equivalent across groups.28
Other Outcomes. Two trials, both reflecting use of mediolateral episiotomy, reported on timing of resumption of intercourse. One study documented that women in the restricted episiotomy-use group resumed intercourse an average of a week earlier (5.5 ± 3.0 weeks compared to 6.5 ± 3.0 weeks).28 The other study found that 37 percent of the women in the liberal-use group (P < 0.01) .23 Longer-term influences on sexual function - those assessed at 3 months or later - are reviewed in the section on KQ 5.
Two studies, also of mediolateral episiotomy use, assessed estimated maternal blood loss. One found no difference in the amount that maternal hemoglobin measures fell.32 The other found that estimated blood loss (method not defined) was 58 cc greater in the liberal-use group, a statistically significant but likely not clinically relevant mean difference.23
Study Participants. The study included primigravidas who were admitted to the delivery unit of a university hospital in London. The mean age at delivery was 26 years; the mean gestational age was 40 weeks.
Episiotomy Type. Midline episiotomy—“incisions divided 2 to 3 cm of the perineal tissue in the midline.” Mediolateral incisions “were made from the midline and were carried to the right of the anal sphincter for about 3 to 4 cm.”36 Method of repair was identical for both study groups. Standard repair technique for both types of incisions included subcuticular skin closure with polygycolic acid suture.
Outcomes. Outcome measures included the proportion of extended or complicated incisions, recommencement of sexual intercourse, pain, pain during intercourse, satisfaction from intercourse, and the cosmetic appearance of the scar. Investigators did not report methods of outcomes assessment or the use of objective scales for pain and sexual satisfaction. An exam was conducted and pain was queried before hospital discharge and again at a 3-month followup visit that included the sexual function data collection. The study did not assess differences in fecal incontinence.
Quality. This study had serious methodological flaws; we gave it a poor rating for internal validity. In particular, we viewed an inadequate randomization method, lack of allocation concealment, and failure to blind the outcome assessors as potential sources of severe biases and as a rationale for a poor-quality rating. Consequently, the evidence is insufficient to draw any firm conclusions on differences in adverse outcomes of midline compared to mediolateral episiotomy.
The trial included 407 primigravidas who were randomly assigned to midline or mediolateral episiotomy. Results revealed a significantly higher rate of complicated or extended incisions for the midline group than for the mediolateral group (P < 0.001). A total of 23.9 percent of women in the midline group experienced an extension of the episiotomy into or through the sphincter, compared to 9 percent of women in the mediolateral group. The midline group had significantly less bruising of the perineum than the mediolateral group (P < 0.001). The investigators did not note any differences in pain in the postpartum timeframe. Of all enrolled women, 76 percent attended a 3-month followup. Women in the midline group began sexual intercourse significantly earlier (P < 0.01) and had a significantly better cosmetic appearance of the scar (P < 0.02) than women in the mediolateral group. No significant differences in pain or satisfaction from sexual intercourse were detected.
On the question of midline versus mediolateral episiotomy, the only study in our review that met our inclusion criteria found that women receiving midline episiotomy had a significantly greater probability of anal sphincter injuries than women in the mediolateral episiotomy group.36 This study did not assess fecal incontinence as a long-term health outcome. Because of considerable methodologic flaws, any conclusions must be made cautiously.
Differences in sphincter injury rates are clinically important. As an RCT, this study's internal validity is poor. Nevertheless, considering that this is the only trial pertaining to this key question, the findings regarding sphincter injuries could be viewed as relevant observational evidence. Multiple retrospective cohort studies that did not meet eligibility criteria for this key question support findings regarding high sphincter injury rates and midline episiotomy.39–45 These studies provide consistent evidence that midline episiotomy leads to significantly higher rates of third- and fourth-degree tears than mediolateral episiotomy. In another study, women with midline episiotomy had a significantly higher rate of fecal incontinence than did women with spontaneous second-degree tears.46
| Trial | Trial Groups | Setting | Trial Size | Percentage Primiparous | Percentage Instrumental Delivery | Overall Quality Rating |
|---|---|---|---|---|---|---|
| Type of Repair | ||||||
| Oboro et al., 200366 | 2 layer vs. 3 layer | Nigeria | N =1,077 | 53% | 23% | Fair |
| Ipswich Childbirth Study, Gordon et al., 1998,60 Grant et al., 200163 | 2 layer vs. 3 layer | United Kingdom | N =1,780* | 61% | 17% | Good |
| Kettle, 200267 | Continuous vs. interrupted | United Kingdom | N =1,542 | 56% | 0% | Good |
| Mahomed et al., 198958 | Continuous vs. interrupted | United Kingdom | N =1,574† | 51% | 23% | Good |
| Materials for Repair | ||||||
| Bowen and Selinger, 200264 | Absorbable vs. adhesive | United Kingdom | N = 62 | 100% | NR | Poor |
| Adoni and Anteby, 199147 | Absorbable vs. adhesive | Israel | N = 60 | NR | NR | Poor |
| Kettle, 200267 | Absorbable vs. rapidly absorbable | United Kingdom | N = 1,542 | 56% | 0% | Good |
| McElhinney et al., 200062 | Absorbable vs. rapidly absorbable | Ireland | N = 153 | 55% | NR | Poor |
| Spencer et al., 1986,56 Grant et al., 198957 | Untreated vs. treated CC | United Kingdom | N = 737 | 47% | 0% | Fair |
| Buchan and Nicholls, 198054 | Nonabsorbable vs. absorbable | United Kingdom | N = 140 | 100% | 0% | Fair |
| Mahomed et al., 198958 | a. Absorbable vs. absorbable vs. nonabsorbable b. PGA vs. CC | United Kingdom | N = 1,574 | 52% | 23% | Good |
| Upton et al., 200265 | PGA vs. CC | Australia | N = 391 | 47% | 0% | Fair |
| Ipswich Childbirth Study, Mackrodt et al., 1998,61 Grant et al., 200163 | PGA vs. CC | United Kingdom | N = 1,780* | 61% | 17% | Good |
| Olah, 199059 | PGA vs. CC | United Kingdom | N = 120 | 46% | 100% | Fair |
| Ping and Kee, 197553 | PGA vs. CC | Malaysia | N = 122 | 61% | 38% | Fair |
| Rogers, 197452 | PGA vs. CC | United States | N = 600 | NR | NR | Poor |
| Livingstone et al., 197451 | PGA vs. CC | Scotland | N = 100 | 100% | 62% | Poor |
| Beard et al., 197450 | PGA vs. CC | United Kingdom | N = 200 | 51% | NR | Fair |
| Repair Techniques and Materials | ||||||
| Doyle et al., 199349 | Absorbable sutures (plain catgut, PGA) and combination of methods | United Kingdom | N = 199 | 72% | NR | Poor |
| Isager-Sally et al., 198655 | Combination of absorbable and nonabsorbable sutures and combination of methods | Denmark | N = 900‡ | 61% | NR | Fair |
Note: CC, chromic catgut; NR, not reported; PGA, polyglycolic acid.
* The Ipswich Childbirth Study61, 63 reported a 1-year followup of results63 that included a subset (n= 793) of the original trial's population. Percentages shown reflect baseline population.
† The trial used a 2×3×2 factorial design to investigate both methods and materials for repair. The methods for the repair arm of the trial investigated continuous and interrupted methods for absorbable sutures, a subset (N= 1,057) of the entire population (N = 1,574). Percentages of primiparous and instrumental deliveries were calculated with a denominator of 1,057.
‡ 900 women were randomized but 98 were excluded because they transferred to another hospital or left the hospital before the fifth day after delivery. Three groups did not differ in age, parity, or frequency of previous episiotomy.
Of the four trials investigating techniques of repair, two compared a two-layer approach (leaving the perineal skin unsutured) with a three-layer approach (suturing the perineal skin) and two trials compared a continuous (subcutaneous) technique with an interrupted (transcutaneous) technique.
Of the 14 trials investigating repair materials, eight compared polyglycolic-acid sutures with chromic-catgut sutures, both absorbable. Two trials compared absorbable sutures (one polyglycolic acid and one chromic catgut) with an enbucrilate tissue adhesive (Histoacryl®). Two trials compared standard absorbable suture material with its rapidly absorbed counterpart, and one trial compared untreated chromic-catgut with a glycerol-treated “softgut” chromic catgut. In addition, two trials compared nonabsorbable and absorbable sutures: one compared silk sutures with polyglycolic-acid sutures and one compared silk sutures with both polyglycolic-acid and chromic-catgut sutures.
Most of these trials randomly allocated participants to one of two groups. Three trials, however, incorporated a factorial design of randomization. Using a 2×2 design, both the Ipswich Childbirth Study60, 61, 63 and the Kettle et al. trial67 randomized to methods of repair and type of sutures. The Mahomed et al. perineal suture study used a 2×3×2 design and randomized to suture type for deep tissue repair (two groups), suture type for the perineal skin (three groups), and method of repair (two groups).58 These studies contributed to more than one section of the results below.
Country. Approximately 65 percent of the RCTs in this report were conducted in the United Kingdom, including Ireland and Scotland. Australia, Denmark, Israel, Malaysia, Nigeria, and the United States each contributed one trial to this report.
Episiotomy Type. These trials included women who had an episiotomy at the time of vaginal childbirth, regardless of the number of previous pregnancies or births. The vast majority of episiotomies repaired in these studies were mediolateral. This reflects the fact that most of the studies were conducted in European countries, the majority contributed by the United Kingdom. Practitioners in North America generally perform midline episiotomy, whereas mediolateral is the rule elsewhere.
Outcomes. Perineal pain and need for analgesia were assessed during the short-term postpartum period in a majority of the trials and during the long-term postpartum period for some of the trials. Investigators used self-report through interviews administered by midwives or study staff and questionnaires to measure subjective levels of pain and use of analgesia. Trials also reported on specific aspects of healing and wound breakdown including inflammation, bruising, infection, wound gaping, need for removal of sutures, and need for resuturing. In each case, a clinician involved in the trial assessed these outcomes, often without masking to study allocation. Longer-term outcomes related to sexual function, such as dyspareunia, recommencement of sexual intercourse, and timing of resumption of intercourse, were typically measured at 3 months postpartum and up to 3 years by interview or questionnaire. Incontinence and other pelvic-floor-related outcomes were investigated by one trial.55 Few trials collected data on comfort with daily activities or satisfaction with repair.
Quality. Of the 17 trials, we rated three as good quality, eight as fair quality, and six as poor quality; for the last group, four trials were rated poor because of inadequate randomization techniques and were most likely not truly randomized. We rated trials as fair or poor quality on the basis of inadequate randomization approach and implementation, failure to mask the outcome assessors, and high loss to followup. Specific limitations of the trials are discussed elsewhere in this report.
Pain and analgesia use. Consistent evidence from these two trials suggests that the two-layer suturing technique decreases perineal pain in both the short- and long-term postpartum periods and requires less analgesia use. When the evidence is limited to the trial of good quality, however, these differences were not statistically significant.
Although both trials reported less perineal pain in the two-layer groups, only one trial66 found significant differences in the short-term postpartum period (0 to 3 months) and the long-term postpartum period (≥ 3 months). At 48 hours postpartum, fewer participants in the two-layer approach reported perineal pain than those in the three-layer approach (RR = 0.87; 95% CI, 0.78–0.97). Over time, the differences persisted: two-layer was superior at 14 days (RR = 0.77; 95% CI, 0.61–0.98), at 6 weeks (RR = 0.64; 95% CI, 0.44–0.93), and at 3 months postpartum (RR = 0.19; 95% CI, 0.06–0.54). The Ipswich Childbirth Study60, 63 did not note differences between the two groups in self-report of any, mild, moderate, or severe perineal pain at 24 to 48 hours, 10 days, 3 months, or 1 year. At each followup period, however, fewer women in the two-layer group reported perineal pain.
Similar findings extended to analgesia use in the two groups but only in the short-term postpartum period. Participants in the two-layer group of the Oboro et al. trial reported significantly less use of analgesics at 48 hours (RR = 0.71; 95% CI, 0.60–0.83) and at 14 days (RR = 0.54; 95% CI, 0.32–0.90) but not at 6 weeks (RR = 0.56; 95% CI, 0.16–0.1.89) or at 3 months postpartum (RR = 0.16; 95% CI, 0.02–1.34). The Ipswich trial found no differences in analgesia use at followup.60, 63
Healing and wound breakdown. Despite inconsistent definition and measurement of healing outcomes, the evidence suggesting that the two-layer approach decreases healing complications and that wound gaping associated with leaving the perineal skin unsutured resolves shortly after repair.
Significantly fewer participants in the two-layer group needed sutures removed for pain or infection in both trials at any time. Healing outcomes were assessed at 14 days, 6 weeks, and 3 months in one trial66 and were generally significant ; these outcomes were also significant at 1 year in the other trial (RR = 0.61, 95% CI, 0.45–0.83, P = 0.002).60, 63
Oboro et al. found that fewer women in the two-layer group reported “tight stitches” than in the three-layer group.66 Neither trial found a difference in the need for resuturing.
In the Ipswich trials, more women in the two-layer group than the three-layer group had wound gaping, defined in one of the trials as having edges greater than 0.5 cm apart, in the 24- to 48-hour postpartum period.60 Observing such a separation may be an artifact of the technique used to examine the incision, because only incisions without suture approximation of the skin can appear to “gape” when tension is applied to the posterior perineum to allow visual inspection. Only one trial looked at outcomes at 1 year postpartum; the authors reported that fewer women in the two-layer group reported that the area that had been cut or torn felt different (RR = 0.75; 95% CI, 0.61–0.91; P < 0.01).63
Incontinence and pelvic floor function. Neither trial investigated outcomes related to incontinence and pelvic floor function in terms of the difference in suturing methods.
Sexual function. These two trials did not investigate the same sexual functioning outcomes. The trend in healing outcomes suggests that the two-layer approach to suturing is associated with less morbidity.
Women in the two-layer repair group were significantly less likely in the Oboro et al. trial to have superficial dyspareunia than women in the three-layer repair group at both 6 weeks and 3 months (RR = 0.60; 95% CI, 0.42–0.85 and RR = 0.52; 95% CI, 0.33–0.81, respectively).66 These results did not extend to deep dyspareunia, which was comparable between the two groups. At 6 weeks, more women in the two-layer group had resumed pain-free intercourse (26 percent vs. 10 percent, RR = 2.54; 95% CI, 1.82–3.55);66 in the Ipswich trial,60, 63 this difference was noted but not statistically significant.
Pain and analgesia use. The evidence is inconsistent as to whether the continuous subcutaneous method of suturing decreases perineal pain and need for oral analgesia following repair. In the Mahomed et al. trial,58 the groups did not differ across the categories of any, mild, moderate, or severe perineal pain at day 2, day 10, and 3 months postpartum. By contrast, the trial by Kettle and colleagues documented significant differences.48 In this latter trial, at the same time points, women in the continuous-suture group reported less pain than women in the interrupted-suture group. On day 2, 69 percent of the continuous group and 79 percent of the interrupted group reported pain (P < 0.0001). The difference in pain outcomes continued at day 10 (26.5 percent vs. 44 percent; P < 0.0001), 3 months (9 percent vs. 13 percent; P = 0.03) and 1 year (4 percent vs. 7 percent; P = 0.05). At day 10, significant differences were also seen in perineal pain reported during walking, sitting, urination, and defecation; the difference in favor of less pain among those with subcutaneous repair ranged from 7 percent to 16 percent.
The former study also did not find significant differences between the two groups in their need for oral analgesia at day 2 (52% vs. 48%) or day 10 (7% vs. 9%). By contrast, the latter trial found that women in the continuous group needed less analgesia at 10 days postpartum (8.5% vs. 13.5%; P = 0.002).
Healing and wound breakdown. Evidence was inconsistent with respect to healing and wound breakdown. The Mahomed et al. trial found no significant differences in edema, bruising, or inflammation between the groups on day 2. The trial by Kettle and colleagues reported less morbidity in the continuous suture group on day 2 for uncomfortable stitches (OR = 0.78; 95% CI, 0.64–0.96) and tight stitches (OR = 0.40; 95% CI, 0.22–0.74), but they did not find a difference in wound gaping. They reported significant differences on day 10 for wound gaping (OR = 0.46; 95% CI, 0.29–0.74), uncomfortable stitches (OR = 0.58; 95% CI, 0.46–0.74), and tight stitches (OR = 0.43; 95% CI, 0.27–0.69). The need to remove sutures was significantly lower in the continuous method group on day 10 (OR = 0.17; 95% CI, 0.10–0.28). The Mahomed trial measured this variable only at 3 months; they reported a significant difference in favor of continuous subcutaneous closure (26% vs. 37%; P < 0.001).
Incontinence and pelvic floor function. Neither trial investigated the difference in suturing methods regarding outcomes related to incontinence and pelvic floor function.
Sexual function. Neither trial found significant differences between the groups regarding dyspareunia at 3 months postpartum. The trial that measured dyspareunia at 1 year also found equivalent outcomes across the groups.
Other outcomes. Kettle et al. collected information from the women about their overall satisfaction with the repair.48 Significantly more women in the continuous method group reported being satisfied with the repair at both 3 months and 12 months postpartum (OR = 1.64; 95% CI, 1.28–2.11 and OR = 1.68; 95% CI, 1.27–2.21, respectively).
Perineal pain and analgesia use. Both trials report less pain during several activities and at rest in women whose episiotomies were repaired with tissue adhesive. Bowen and colleagues64 used a 10-point visual analog scale (VAS) and Adoni and colleagues47 used a Likert pain scale of 1 (minimum) to 5 (maximum). Because of the differences in the pain scales, the results are not directly comparable, but the overall differences do contribute to the consistent evidence in these two trials that adhesive may lead to less perineal pain in the immediate postpartum. Both trials reported less pain while walking (1.6 vs. 2.6, P < 0.00147 and 2.7 vs. 4.0, P = 0.0015)64 on day 2. One trial47 reported significantly less pain on day 3 (2.0 vs. 2.9, P = 0.029). Both trials reported less pain during micturition: in the Adoni and Anteby trial47 on days 1 and 3 (4.5 vs. 6.3, P = 0.025 and 3.0 vs. 4.0, P = 0.025, respectively) and, in the Bowen and Selinger trial,64 on day 2 (1.0 vs. 1.7, P < 0.031). Bowen and Selinger64 reported less pain in the adhesive group on day 2 (1.95 vs. 3.3, P < 0.001), both while sitting (1.75 vs. 3.6, P < 0.0001) and while lying down (1.0 vs. 2.35, P < 0.001). Adoni and colleagues reported less pain in the adhesive group during defecation on days 3 and 4 (2.2 vs. 4.3, P = 0.003 and 2.1 vs. 3.7, P = 0.015, respectively). Nonsignificant differences were reported on other days. Neither trial compared need for analgesia between the two repair groups.
Healing and wound breakdown. Neither trial investigated differences in these outcomes by type of repair materials. One trial64 did report that they identified no cases of wound infection or dehiscence.
Sexual function. Only one trial reported on sexual functioning postpartum.64 The group repaired with adhesive, in this case Enbucrilate® tissue adhesive, had a 35 percent reduction in the onset of pain-free sexual intercourse (P = 0.0009). Neither trial reported any other outcomes related to sexual functioning.
Perineal pain and analgesia use. The poor-quality trial, although it used good measurements of pain (VAS and Likert scale), found no significant differences in perineal pain between the two groups at 24 hours and 3 days. Analgesic use before discharge also did not differ by group. The good-quality Kettle trial reported mixed results for perineal pain during specific activities and the need for analgesia at 10 days postpartum. Women in the rapidly absorbed suture groups reported less pain while walking, sitting, passing urine, and defecating. Only the differences in pain with walking were statistically significant (OR = 0.74; 95% CI, 0.56–0.97; P = 0.004). Women in the rapidly absorbed group also reported less need for analgesia (8% versus 14%, P = 0.0002). This trial randomized women to suture method (continuous versus interrupted), so the investigator was able to complete stratified analyses; they showed nonsignificant results by both method of suturing and degree of trauma. In other words, improvement in pain was independent of method of closure or size of defect.
Healing and wound breakdown. The poor-quality trial combined all healing outcomes, such as infection, gaping wound, or residual material requiring removal, into one group and found that at 6 weeks, 30 percent of women whose episiotomies were repaired with standard material and 1.7 percent of women whose episiotomies were repaired with the rapidly absorbed material reported problems. The good-quality trial examined different healing outcomes separately and did not find significant differences between the two groups with respect to wound gaping, uncomfortable sutures, or tight stitches at 2 or 10 days postpartum. However, groups had meaningful differences in need for removal of sutures between 10 days and 3 months. Women whose episiotomies were repaired with the rapidly absorbed material required removal less often than women who received the standard suture at 10 days, between 10 days and 3 months, and at any point before 3 months postpartum (OR = 0.38, 95% CI, 0.23–0.64; OR = 0.19, 95% CI, 0.13–0.30; and OR = 0.26, 95% CI, 0.18–0.37, respectively).
Sexual function. Both trials measured dyspareunia, although at different time points in the short- and long-term periods of followup. The good-quality trial found no statistically significant differences at 3 months or 12 months postpartum. The poor-quality trial found that women with repairs using rapidly absorbed material had significantly lower dyspareunia scores than women who received standard sutures. This finding extended to 3 months (mean scores 0.05 versus 0.27 in women who had dyspareunia, P < 0.05) but the authors noted that the scores were very low in both groups.
Perineal pain and analgesia use. Women in the softgut group reported significantly greater perineal pain (P = 0.015) at 10 days postpartum. Women in the softgut group were also significantly more likely to have used a perineal salt bath to relieve pain (42% versus 34%, P = 0.03) and to use more doses of oral analgesia (P = 0.18), though this difference was not statistically significant. At 3 months postpartum, self-report of perineal pain did not differ by type of catgut used.
Healing and wound breakdown. Sutures were removed more often in the chromic-catgut group both by 10 days (2.4% versus 11.5%, P < 0.001) and 3 months (6.9% versus 16.4%, P < 0.0001). Removals were described as being for maternal discomfort. Based on assessment by a midwife at 10 days, risk of perineal breakdown and healing by secondary intention did not differ by group.
Sexual function. Sexual function was assessed at 3 months postpartum and 3 years. More women in the chromic-catgut group reported pain-free sexual intercourse than women in the softgut group (50.7% versus 38.0%, P < 0.025). This difference was significant for both transient and persistent pain. At 3 years, more women in the softgut group still reported painful intercourse (OR = 1.17; 95% CI, 1.1–2.6; P < 0.02); a majority with pain described the pain as “soreness.”
Perineal pain and analgesia use. The Mahomed et al. study reported no statistically significant differences among the material groups with respect to perineal pain or use of analgesia at 2 and 10 days postpartum or at 3 months. The Buchan and Nicholls trial did report differences between the groups, but the results were inconsistent by the day 6 postpartum. The investigators used the mean number of analgesic tablets used by the women to make inferences about the level of perineal pain experienced by the women. Women with repairs made with silk sutures used more analgesia than women whose episiotomies were repaired with polyglycolic-acid suture, but the results were only significant for days 3 through 5 (P < 0.001).
Healing and wound breakdown. The Mahomed et al. study reported no significant differences among the groups with regard to bruising, edema, or healing, outcomes that were clinically assessed at 2 days postpartum. In the long-term postpartum period, the silk suture group needed the absorbable suture materials removed from perineal tissue significantly less than the polyglycolic-acid or chromic-catgut groups (7 percent versus 39 percent versus 23 percent, P < 0.001).58 No other results were reported and the fair-quality trial did not contribute to this outcome assessment.
Sexual function. In the good-quality trial, more women in the silk-suture group had not resumed intercourse by 3 months postpartum (15 percent versus 9 percent and 11 percent, P < 0.05). Among women who had resumed intercourse, dyspareunia risk was comparable. Conflicting evidence was reported by the fair-quality trial: at 4 months postpartum, more women in the silk-suture group reported no pain at all during intercourse (21 percent versus 11 percent, P < 0.001).
| Trial Information | Description of Pain Outcome | Superior Material for Pain | Description of Healing Outcome | Superior Material for Healing | Author's Overall Conclusions | ||||
|---|---|---|---|---|---|---|---|---|---|
| PGA | CC | ND | PGA | CC | ND | ||||
| Upton et al., 200265* | Short-term perineal pain (any, moderate to severe) | ![]() | Short-term problems with sutures | ![]() | No statistically significant differences between groups but leaned in favor of polyglycolic acid | ||||
| Australia | |||||||||
| N = 391 | |||||||||
| Quality: Fair | |||||||||
| Ipswich Childbirth Study, Mackrodt et al., 1998 and Grant et al., 200161,63* | Short-term perineal pain (any, mild, moderate) |
| Short-term healing problems (tight stitches, uncomfortable stitches, gaping perineum) |
| Clear advantages of polyglycolic acid | ||||
| United Kingdom | Long-term perineal pain (mild, moderate, or severe) | ![]() | Long-term need for resuturing | ![]() | |||||
| N = 1,780 | |||||||||
| Quality: Good | |||||||||
| Olah, 199059 | Short-term perineal pain (10 cm VAS) | ![]() | Short-term edema and bruising | ![]() | Does not substantiate previous trials that show a benefit to polyglycolic acid | ||||
| United Kingdom | |||||||||
| N=120 | |||||||||
| Quality: Fair | |||||||||
| Mahomed et al., 198958* | Short- and long-term perineal pain (none, mild, mod, severe) |
| Short- and long-term edema, bruising and healing |
| Not much evidence to support polyglycolic acid but the little they have is consistent with other trials | ||||
| United Kingdom | Short-term use of analgesics |
| Long-term need for removal of sutures |
| |||||
| N = 1,574 | Long-term use of analgesics | ![]() | Long-term need for resuturing | ![]() | |||||
| Quality: Good | |||||||||
| Ping and Kee, 197553 | Short-term perineal pain (No pain, mild, moderate, severe) | ![]() | Not measured | --- | --- | --- | Polyglycolic-acid sutures have considerable advantage over chromic-catgut sutures in episiotomy repair | ||
| Malaysia | |||||||||
| N = 122 | |||||||||
| Quality: Fair | |||||||||
| Beard et al., 197450 | Short-term perineal pain (none, mild, moderate, severe) | ![]() | Short-term wound breakdown and inflammation | ![]() | Polyglycolic-acid sutures should be used | ||||
| United Kingdom | |||||||||
| N = 200 | |||||||||
| Quality: Fair | |||||||||
| Livingstone et al., 197451 | Short-term perineal pain (none, uncomfortable, painful, very painful, unbearably painful) | ![]() | Short-term edema | ![]() | Significant reduction in pain and edema with polyglycolic acid, no evident disadvantage in the use of polyglycolic acid | ||||
| Scotland | |||||||||
| N = 100 | |||||||||
| Quality: Poor | |||||||||
| Rogers, 197452 | Short-term perineal pain (none, degree of pain) | ![]() | Not measured | --- | --- | --- | Polyglycolic acid decreased the pain by half | ||
| United States | |||||||||
| N = 600 | |||||||||
| Quality: Poor | |||||||||
Note: PGA, polyglycolic acid; CC, chromic catgut; ND, no difference.
* Three trials also investigated long-term sexual function outcomes with regards to polyglycolic-acid and chromic-catgut sutures. Two trials58, 65 found no differences between the sutures and one trial61, 63 found polyglycolic-acid sutures to be superior at 1 year postpartum regarding resumption of pain-free intercourse and dyspareunia.
Both the Mahomed et al.58 and the Mackrodt et al.61, 63 trials were of good quality; they contributed to other sections of this key question because of their factorial design. Four trials were of fair quality.50, 53, 59, 65 Finally, two trials were of poor quality;51, 52 both used methods of randomization that are not considered to be truly randomized; thus, we considered them to have a fatal flaw.
Perineal pain and analgesia use. All eight trials investigated differences in perineal pain outcomes between the two groups. All but two provided consistent evidence that polyglycolic-acid sutures have an advantage over chromic catgut with regards to perineal pain. Two of the fair-quality trials59, 65 found no significant differences between the groups; in one trial, the estimate of effect favored polyglycolic-acid suture for their pain measures on postpartum day 3 (OR = 0.70; 95% CI, 0.46–1.08). The same group of women who were in the polyglycolic-acid suture group, however, were more likely to have perineal pain at 6 months (OR = 1.77; 95% CI, 0.57–5.47), although the precision of that estimate is much less than that others in the study.
Two fair50, 53 and two poor trials51, 52 also reported less perineal pain and less need for analgesics in women who received polyglycolic-acid sutures in the short-term postpartum period. All four trials used Likert scales (mild, moderate, severe, or the equivalent) to measure pain; one trial50 counted the proportion of women requiring analgesia (tablets or injections). Although these trials offer only fair- or poor-quality evidence, they do contribute consistent evidence that polyglycolic-acid sutures may be associated with less short-term perineal pain.
The best evidence comes from the two good-quality trials.58, 61, 63 In the short-term postpartum period, both trials reported pain outcomes at 24 to 48 hours and 10 days. At 24 to 48 hours, more women in the chromic-catgut group in the Mahomed et al. trial required analgesia (54 percent versus 48 percent, P < 0.05;58 47 percent versus 42 percent, P = 0.0361, 63). This requirement for analgesia continued at 10 days postpartum in both trials and remained statistically significant in the Ipswich trial (10 percent versus 6 percent, P = 0.01). In the Ipswich trial,61, 63 women in the chromic-catgut group reported more perineal pain and greater severity of pain at 24 to 48 hours (P for trend = 0.002) and at 10 days postpartum (P for trend = 0.05). The earlier trial had not identified significant differences for perineal pain at these two time points.58 Neither trial found significant differences between the groups regarding perineal pain at 3 months. In addition, the Ipswich trial found no differences at 1 year.
Healing and wound breakdown. Six of the eight trials examined healing and wound-breakdown outcomes between the two suture groups. Three fair trials50, 59, 65 and the Mahomed et al. study58 did not report statistically significant differences between the groups with respect to removal of sutures, resuturing, wound breakdown, inflammation, edema, bruising, or infection, most of which were measured before 3 months. One of the poor trials51 reported significantly more edema (P < 0.05 at the perineotomy site on day 3 postpartum) in the chromic-catgut group. The Ipswich trial results favored polyglycolic-acid suture at various followup time points and on various measures. At 24 to 48 hours and 10 days postpartum, more women in the chromic-catgut group reported uncomfortable stitches (40 percent versus 33 percent, P = 0.003, and 26 percent versus 19 percent, P = 0.001, respectively). The midwives reported more “wound gaping” in the chromic-catgut women at 10 days postpartum (26 percent versus 16 percent, P < 0.00001) but not at 24 to 48 hours. By 10 days, neither group was more likely to have sutures removed, but fewer women in the chromic-catgut group reported ever having any sutures removed by 3 months (7 percent versus 12 percent, P = 0.002). This finding is the only outcome representing a disadvantage among those who received polyglycolic-acid sutures in this trial.
Sexual function. Three trials, two of good quality58, 61, 63 and one of fair quality,65 investigated the effect of suture type on outcomes of sexual function. The fair-quality trial found no significant differences between the groups at 6 weeks, 3 months, or 6 months postpartum. The adjusted odds ratios (adjusted for parity) indicated a possible association between polyglycolic-acid sutures and less resumption of intercourse and more dyspareunia among women who had resumed intercourse. In one good-quality trial, women whose repairs were made with polyglactin 910, a polyglycolic acid-based suture, were less likely to suffer from dyspareunia at 1 year (RR = 0.59; 95% CI, 0.39–0.91; P = 0.002) and less likely to fail to resume pain-free intercourse (RR = 0.57; 95% CI, 0.38–0.87; P < 0.01).61, 63 In both of the good-quality trials, all other comparisons yielded nonsignificant differences between the two suture groups at 3 months and 1 year.
One trial randomized women to one of two groups, both of which used the standard chromic-catgut approach to repair the deeper vaginal defect. In one group, the perineal skin was repaired with chromic catgut using an interrupted method of suturing; in the other, perineal skin was repaired with PROLENE™, a nonabsorbable suture material, using a subcuticular approach.49 A similar set of groups was seen in the Buchan and Nichols trial (included above in the “Nonabsorbable versus Absorbable” section).54 However, the Doyle trial, unlike the Buchan and Nicholls trial, was clear in its intent to investigate an entire approach to repair rather than a particular method or material.
The Doyle trial is of poor quality because it used a fair-quality randomization and implementation approach, had a small number of post-randomization exclusions, and had poor retention of subjects at followup, even in the short-term, immediate postpartum period.
As assessed by the midwife at 2 and 10 days, the groups did not differ with respect to perineal pain, need for analgesia, or bruising. The groups did not differ in pain or pain during sexual intercourse at 3 months.
Another trial randomly allocated women to one of three groups.55 The first group had the episiotomy repaired using chromic catgut for the deep tissues and perineal muscles and an interrupted method using nylon for the perineal skin. The second group received polyglycolic-acid sutures for the deep tissues and perineal muscles and an interrupted method using polyglycolic-acid sutures for the perineal skin. The third group is described more ambiguously and had a repair done with polyglycolic-acid sutures for the deep tissues and perineal muscles and a subcuticular method using polyglycolic-acid sutures for the perineal skin.
This trial is of fair quality because of fair definitions and measurements of the outcomes and post-randomization exclusion of 98 women whom the authors were unable to follow because of relocation. These exclusions were nondifferential across the three randomized groups.
More women who had a repair with subcuticular polyglycolic suturing had no discomfort at 5 days than did women who had repairs with interrupted nylon or interrupted polyglycolic-acid sutures (40%, 12%, and 18%, respectively, P < 0.001). The groups that had repairs with the interrupted method did not differ. The authors reported significant differences between women who had repairs with subcuticular polyglycolic-acid sutures and the interrupted method group including pain during sitting, walking, and bowel movements at 5 days. As assessed by the midwife at 5 days, the group repaired with subcuticular polyglycolic-acid sutures experienced less edema (11 percent versus 30 percent versus 23 percent, P < 0.005). No significant differences were found with respect to infection or hematoma.
At 3 months, the polyglycolic-acid suture groups differed significantly. Women whose episiotomies were repaired with the subcuticular approach were less likely to suffer from dyspareunia, discomfort with defecation, incontinence of flatus or discomfort when sitting (P < 0.025) than women whose episiotomies were repaired with the interrupted approach.
Overview. We identified 16 publications that prospectively collected data about some aspect of continence or pelvic floor muscle function with good documentation of perineal status and episiotomy use at the time of the index birth. Outcomes of interest included physiologic measures of muscle strength, clinical urodynamic testing, or self report by interview or questionnaire. No studies directly compared type of incision and future pelvic floor function. The 16 publications include four reports from two randomized clinical trials (RCTs) of liberal versus restrictive use of episiotomy, 11 prospective studies of representative cohorts of women delivering at particular facilities or with a particular practice group (including two publications from a cohort of women who participated in an RCT of perineal massage versus none in the third trimester), and one cohort composed of all women in a region who had third-degree lacerations at the time of the index birth. The last study followed the cohort to assess risk of fecal incontinence at 3 months.
Two publications came from the same population in the United Kingdom: a primary analysis of the RCT outcomes at 3 months,23 and a secondary analysis after 3 years of followup.33 Two Canadian reports also present analyses of the same study population.29, 34 In this case, both publications report 3-month followup data: one analysis by the initial trial groups of liberal versus restrictive use of episiotomy and the other an analysis based on classification of perineal trauma at the index birth. Of the remaining 12 prospective studies, three were conducted in the United Kingdom, two in Denmark (separate populations), two in Canada (separate populations), two in Sweden (separate populations), one in Italy, one in Turkey, and one in the United States. In total, the 16 publications represent 12 unduplicated study populations from seven countries.
Study Participants. All but the study of third-degree lacerations restricted study participants to those with term, singleton gestations at the time of the index birth. Four of 16 studies restricted their study populations to primiparous women;68–71 four studies restricted to spontaneous vaginal deliveries.23, 33, 71, 72 To assess what component of change in pelvic floor muscle function could be attributed to pregnancy and what component was most influenced by vaginal births, two studies included a nonpregnant comparison group and a group of women who had cesarean birth in their physiologic measures.68, 73 In each case the comparison groups were recruited contemporaneously to the women who had vaginal deliveries generally from the same practice.
Episiotomy Type. We did not identify any studies that compared the influence of mediolateral versus midline (also called median) episiotomy on pelvic floor function or continence. The remainder of the studies reflects the dominant practice patterns in the countries in which the studies were conducted. Mediolateral episiotomy was the rule (with very rare exceptions) in European and Turkish cohorts; midline episiotomy predominated in the United States and Canada. This implies that to some degree European and North American studies are investigating fundamentally different exposures. The anatomic location, involved tissue planes, extent of perineal body disruption, and risk for extension associated with mediolateral compared to midline episiotomy would be expected to be distinctly different.
A single randomized trial comparing the two methods has documented that the risk of extension into and/or through the rectal sphincter is more than 2.5 times more likely with midline episiotomy; sphincter involvement occurred in 24 percent of deliveries with midline episiotomy in their trial.36 They also noted that local extension, not involving the sphincter, was 1.8-fold more likely with midline episiotomy. These differential outcomes are believed to represent differences in the tissue planes involved. They may also represent differences in familiarity with midline episiotomy technique since reported risk of extension is lower in prospective cohorts in countries where midline episiotomy is routine. Because no trials or prospective studies directly compare the pelvic floor muscle function across type of episiotomy, the long-term differences in continence and pelvic floor muscle outcomes that would be anticipated secondary to differences in the type is unknown. These differences must be taken into consideration when synthesizing the findings relevant to this key question.
Outcomes Measured. This question was aimed at identifying research publications that undertook long-term followup, measured in years. However, we identified only five publications from four study populations with followup of a year or longer.33, 68–70 74 In response, we have included the entire literature that assesses continence and pelvic floor function at any time after the arbitrary 8-week window that can be used to define the postpartum period. Shorter-term continence and pelvic floor outcomes in the days and weeks around birth are described in the section on the outcomes of routine use of episiotomy.
To summarize outcomes, we grouped measures into four categories: those that assess urinary incontinence by self-report, those that assess continence of stool and flatus by self-report, those based on physical examination findings to describe anatomy, and those measures intended to document physiologic function, such as perioneometry. Seven studies assessed urinary incontinence by self-report; timeframes for self-report included 3 months,23, 29, 34, 70, 71, 75, 76 12 months,70 3 years,33 4 years,74 and 5 years.69 Three studies assessed continence of stool and/or flatus by self-report at 3 months,71, 75, 77 and one study collected self-report data at an average of 10 months postpartum. Two studies described physical examination findings related to prolapse71and anal sphincter function and anorectal anatomy.77 Five studies used perineometry measures to document characteristics of muscle function such as maximum strength of contraction and maximum sustained contraction over 10 seconds.29, 34, 68, 71, 73 A single study reports findings from urodynamic testing that included observation of stress incontinence with strain and timing of interval required to stop urine flow;71 one study used weighted vaginal cones recording the heaviest weight that could be retained while standing or walking antepartum and 2 months postpartum. The highest-quality studies combined types of measures and reported data in standardized fashion that concurs with definitions of the International Continence Society.
| Author, Year | Study Design | Timing of Outcome | Outcome(s) Assessed | Outcome among Those with Episiotomy | Outcome among Those without Episiotomy | Results of Multivariable Models |
|---|---|---|---|---|---|---|
| Country | N | Assessment after Birth Approach | Definitions Provided | Authors' Conclusions | ||
| Sleep et al., 198423 | RCT | 3 months | Urinary incontinence | Incontinence: | Incontinence: | Incontinence was more common among multiparas than primiparas but did not differ significantly between the two trial groups when stratified by parity. |
| United Kingdom | N = 1,000 | Mailed questionnaire | “involuntary loss of urine” | 19% | 19% | There is no evidence that episiotomy prevents urinary incontinence. |
| “Need to wear a pad” for loss of urine | Pad: 6% | Pad: 6% | ||||
| Gordon and Logue, 198568 | Prospective cohort | 12 months | Perineometry pressure readings | Maximum pressure epis: | Maximum pressure intact: | Not reported |
| United Kingdom | N = 70 | Physiologic testing in women with all perineal outcomes and cesarean | Methods summarized in text; average of five measures used | 11.7 mm water | 11.1 | No significant difference between the groups. Differences between postnatal exercise levels were highly significant with more exercise associated with greater perineal muscle strength. |
| Maximum pressure forceps and epis: | Maximum pressure | |||||
| 9.4 mm water | second degree: 10.8 | |||||
| Maximum pressure cesarean: 12.5 | ||||||
| Sleep et al., 198733 | RCT | 3 years | Urinary incontinence | Incontinence | Incontinence | Not reported |
| United Kingdom | N = 674 | Mailed questionnaire | “Lost urine when they did not mean to” | < once past wk: 22% | < once past wk: 25% | No difference in prevalence of urinary incontinence, even when severity and nature of the incontinence, and subsequent deliveries, were taken into account. |
| “Severe enough to wear pad” | 1–2× past wk: 12% | 1–2× past wk: 11% | ||||
| “Loss when coughing, laughing, sneezing” | ≥ 3× past wk: 2% | ≥ 3× past wk: 2% | ||||
| “Loss with urgent desire to pass urine but no toilet nearby" | Pad sometimes: 8% | Pad sometimes: 7% | ||||
| Pad daily: 2% | Pad daily: 1% | |||||
| SUI: 33% | SUI: 31% | |||||
| Urge incont.: 13% | Urge incont: 13% | |||||
| Rockner, 199074 | Prospective Cohort | 4 years | Urinary incontinence | Urinary incontinence: | Urinary incontinence: | Not reported. |
| Sweden | N = 185 | Mailed questionnaire | Frequency | Occas.: 37 (26%) | Occas.: 12 (28%) | Episiotomy and spontaneous tear groups had the same frequency of urinary incontinence symptoms, giving no support to the suggestion that episiotomy prevents long-term damage of the pelvic floor. |
| Severity | 1×/week: 10 (7%) | 1×/week: 1 (2%) | ||||
| (data corresponds to definitions) | 2–3×/wk: 2 (1%) | 2–3×/wk: 1 (2%) | ||||
| >3×/wk: 1 (1%) | >3×/wk: 1 (2%) | |||||
| With cough/laugh/sneeze: | With cough/laugh/ sneeze | |||||
| 48 (34%) | 13 (30%) | |||||
| Sufficiently severe to wear pad | Sufficiently severe to wear pad | |||||
| Sometimes: 13 (9%) | Sometimes: 6 (14%) | |||||
| Always: 1 (1%) | Always: 0 (0%) | |||||
| Rockner et al, 199178 | Prospective cohort | 2 months | Pelvic floor muscle function measured using weighted vaginal cones at 36 wks gestation and postpartum | Mean decrease in muscle function (gms): 30.0 ± 11.8 | Mean decrease in muscle function (gms) | Not reported |
| Sweden | N = 92 | Physiologic measure | Details provided in text | Intact: 19.2 ± 10.2 | Pelvic floor muscle function was most decreased in the episiotomy group. The results do not support the concept that episiotomy reduces damage to the pelvic floor muscles. | |
| Spontaneous tear: 18.9 ± 9.1 | ||||||
| (P < 0.001) | ||||||
| Klein et al., 199229 | RCT | 3 months | Urinary incontinence | Incontinence | Incontinence | Not reported |
| Canada | N = 703 | In-person interview | Not defined - used 4-point scale, dichotomized as present/ absent | Primip: 14.5% | Primip: 21.1% | None of the differences in urinary incontinence were statistically significant after controlling for antepartum history of urinary incontinence. |
| Physiologic measure: Antepartum and 3 months postpartum | Subjective sense of “perineal bulging”; 4-point scale dichotomized as present/ absent | Multip: 21.5% | Multip: 12.9% | |||
| Perineometry | Bulging | Bulging | ||||
| Primip: 7.9% | Primip: 9.1% | |||||
| Multip: 9.5% | Multip: 5.4% | |||||
| EMG | EMG | |||||
| Primip ante: 2.1 (1.8) | Primip ante: 2.0 (1.6) | |||||
| Primip post: 2.3 (1.8) | Primip post: 2.3 (1.6) | |||||
| Multip ante: 1.7 (1.5) | Multip ante: 1.9 (1.6) | |||||
| Multip post: 2.1 (1.5) | Multip post: 2.1 (1.5) | |||||
| Viktrup et al., 199270 | Prospective cohort | 3 months | Telephone interview | Data not provided | Data not provided | Not reported |
| Denmark | N = 305 | 12 months | Questionnaire using International Continence Society definitions | Women who had an episiotomy developed stress incontinence significantly (P < 0.05) more frequently after delivery. However, episiotomy was performed more often in women with an increased length of second stage (P < 0.01). Differences in stress incontinence associated with episiotomy had resolved by 1 year. | ||
| Urinary incontinence provoked by physical exertion; daily incontinence; incontinence as hygienic or social problem | ||||||
| Klein et al., 199434 | Prospective cohort assembled from participants in liberal vs. restrictive episiotomy trial | 3 months | Self-reported urinary incontinence (4 point scale) | No difference | No difference | Not reported |
| Canada | N = 697 | In-person interview | Perineometry scores | (data not shown) | (data not shown) | Episiotomy fails to prevent the trauma or pelvic floor relaxation that it was designed to prevent. |
| Physiologic measures antepartum and postpartum | (electronic vaginal myography) | Epis, no exten. | In tact | |||
| Methods described in text | Net change: Primip: 0.19 | Net change: Primip: 0.47 | ||||
| Multip: 0.05 | Multip: 0.57 | |||||
| Third/fourth degree | Spontaneous tear | |||||
| Net change: Primip: 0.08 | Net change: | |||||
| Multip: -0.07 | Primip: 0.29 | |||||
| Multip: 0.39 | ||||||
| Walsh et al., 199677 | Prospective cohort of women with Third-degree tears | 3 months | Physical examination by colorectal surgeon | 100% of women with abnormal exam and fecal incontinence had episiotomy | No cases of fecal incontinence among women without episiotomy | Not reported |
| United Kingdom | N = 81 | 60% of women with abnormal exam and no incontinence had episiotomy | 40% of women with abnormal exam and no incontinence did not have episiotomy | Obstetric trauma causes significant anorectal dysfunction and patients with third-degree tears require assessment. | ||
| MacArthur et al., 199779 | Prospective cohort | 10 months | Fecal incontinence | Primp.: 4.6% | Intact | In multivariable models: episiotomy not an independent predictor of fecal incontinence |
| United Kingdom | N = 906 | N = 906 | “Loss of bowel control with no warning needed to go”; | Multip.:8.8% | Primp.: 5.2% | |
| In-person | “soiling or staining”; “felt need to go but couldn't hold on” | Multip.:2.9% | ||||
| Interview | One or more considered incontinence | Second degree | ||||
| Primip: 5.2% | ||||||
| Multip: 4.2% | ||||||
| Viktrup and Lose, 200169 | Prospective cohort | 5 years | Telephone interview | Not provided by episiotomy status | In multivariable modes, episiotomy at the first delivery was significantly associated with stress incontinence 5 years after delivery, even after adjustment for the few with coexistence of anal sphincter rupture. | |
| Denmark | N = 305 | Questionnaire using International Continence Society definitions; | Episiotomy contributed to prediction of risk of incontinence at 5 years when comparing women who had incontinence during their pregnancy to those without any incontinence associated with pregnancy or postpartum. | |||
| urinary incontinence provoked by physical exertion; daily incontinence; incontinence as hygienic or social problem | Episiotomy not risk factor among women with only postpartum symptoms. | |||||
| Eason et al., 200275 | Prospective cohort assembled from participants in perineal massage RCT | 3 months | Incontinence of stool | Loss of stool: | Loss of stool: | Loss of stool/flatus: |
| Canada | N = 949 | Mailed questionnaire | Incontinence of flatus | RR: 5.4% | RR: 2.5% | No perineal injury: RR 1.0 |
| “Involuntary loss of stool or flatus” | Loss of flatus: | Loss of flatus: | First degree: 1.2 (0.8, 1.7) | |||
| Frequency (never, less than 1 a week, 1 to 6 times a week, daily, or more than once a day) | RR: 30.2% | RR: 24.4% | Episiotomy without extension: 1.3 (0.9, 1.8) | |||
| Third/fourth degree: 2.1 (1.4, 3.1) | ||||||
| Anal incontinence is associated with sphincter laceration, which was more common among those with episiotomy. | ||||||
| Fleming et al., 200373 | Prospective cohort | 6 months | Perineometry scores | Mean score (SD) | Mean score (SD) | Not reported |
| United States | N = 102 | Baseline perineometry during pregnancy; and at 6 wks | (electronic vaginal myography) | Peak: -1.7 (2.1) | Intact | No significant differences in absolute postpartum perineal muscle strength or endurance between episiotomy and laceration groups. |
| Physiologic testing in women with all perineal outcomes and cesarean | Methods detailed in text; average of three measures of each type of contraction used for analysis | Hold: -1.7 (2.1) | Peak: 2.7 (2.8) | Women who had episiotomy were only group with net loss of perineal muscle function after delivery. | ||
| Difference in antepartum and postpartum scores | Hold: 2.8 (3.5) | |||||
| Second- or third-degree laceration | ||||||
| Peak: 0.8 (2.6) | ||||||
| Hold: 0.8 (2.3) | ||||||
| Karacam and Eroglu, 200372 | Prospective cohort | 3 months | Stress incontinence | 12/50 (24%) | 15/50 (30%) | No significant differences in stress incontinence before labor, or if after delivery of first child, or if after delivery of second child that was related to episiotomy. |
| Turkey | N = 100 | Telephone questionnaire | Not defined | |||
| Eason et al., 200476 | Prospective cohort participants in perineal massage RCT | 3 months | Frequency of involuntary loss of urine when coughing, sneezing, laughing, running | Any stress urinary incontinence: 29% | Any stress urinary incontinence: 35% | OR: 0.68 (0.47, 1.01) |
| Canada | N = 949 | Mailed questionnaire | No significant association between episiotomy and urinary incontinence. | |||
| Sartore et al., 200471 | Prospective cohort | 3 months | Perineometry with highest/best single recording used for analysis | SUI: 12.9% | SUI: 12.1% | OR: 1.01 (0.61, 1.7) |
| Italy | N = 519 | Physical exam | Baden and Walker classification of urogenital prolapse | Anal incont: 2.8% | Anal incont: 1.9% | OR: 1.47 (0.46, 4.7) |
| Physiologic measures: | Urine stream interruption test | Ante prolapse: 41p.5% | Ante prolapse: 42.1 | OR: 0.97 (0.69, 1.4) | ||
| Perineometry | SVI - visible involuntary loss of urine by ICS standards | Post prolapse: 15.8% | Post prolapse: 14.6% | OR: 1.1 (0.68, 1.8) | ||
| Uroflowmeter | Self-reported urge and anal incontinence of stool or flatus, classified by frequency | Vaginal manometry: | Vaginal manometry: | P < 0.001 | ||
| In-person interview | 12.2 (5.1) | 13.8 (4.7) | P = 0.85 | |||
| Urine stream interrupt: 3.9 (3.5) | Urine stream interrupt: 3.8 (2.9) | OR: 1.79 (1.2, 2.6) | ||||
| Vaginal manometry percent abnormal: | Vaginal manometry: percent abnormal: | Mediolateral episiotomy does not protect against urinary and anal incontinence. Episiotomy is associated with lower pelvic floor muscle strength than spontaneous tears. | ||||
| 40.60% | 27.7% | |||||
Randomized Clinical Trials. Both randomized clinical trials, Sleep and colleagues in the United Kingdom23 and Klein and colleagues in Canada,29 conducted trials that required providers to alter their use of episiotomy. These trials randomized women to receive “liberal use” versus “restricted use” of episiotomy, with the latter category intended to restrict use to circumstances such as fetal distress or maternal exhaustion with an “unyielding perineum.” Both trials enrolled singleton, vertex presentation pregnancies at term and randomized in the delivery suite close to the time of birth.
The United Kingdom trial had a 10.2 percent use of episiotomy in the restrictive group (2.6 percent for maternal indications; 6.6 percent for fetal distress), compared to 51.4 percent use of episiotomy in the liberal group. The Canadian trial had greater difficulty modifying provider behavior as background rates of episiotomy exceeded 80 percent. Restrictive use resulted in 57.2 percent of the women having an episiotomy compared to 81.4 percent in the liberal-use arm. Each of these research groups published an analysis as randomized for 3-month postpartum data. The sole violation of intention to treat was Klein's elimination of five women with cesarean section from analysis of pelvic floor outcomes. Both trials achieved good balance of baseline characteristics through randomization. Both trials assessed urinary incontinence outcomes; neither assessed continence of stool or flatus.
The Canadian trial also assessed self-reported sensation of perineal “bulging” and conducted perineometry. The Canadian team has published two analyses: an analysis as randomized and an analysis by perineal trauma sustained. The randomized analyses produced no meaningful differences in self-reported urinary incontinence, subjective sensation of perineal bulging, or perineometry readings, including when baseline antepartum readings and parity were incorporated. Likewise, the analysis by perineal status (intact, episiotomy, spontaneous tear, third- and fourth-degree tear) revealed no differences in self-reported incontinence or in perineometry scores. These analyses were stratified by parity and suggest some effect modification; however, the authors did not provide adjusted models or note any statistically significant results.
The research team from the United Kingdom has published two analyses as randomized: a 3-month and 3-year followup, both conducted by mailed questionnaire followup.23, 33 The 3-month followup found no difference by group in risk of involuntary loss of urine (19 percent in both arms) or in need to wear a pad because of urine loss (6 percent in both arms). Their 3-year followup was more detailed and included involuntary loss of urine; use of a pad; loss of urine with coughing, sneezing, laughing; and loss with urgent need to void. No aspect of these symptoms or their severity varied by restrictive versus liberal episiotomy group. For the 3-year followup, this lack of difference across groups persisted when taking into account subsequent obstetric history. The 3-year followup was also marred by loss, although the authors were able to use 3-month data to demonstrate little evidence of response bias. Adjustment using multivariable models is alluded to but numeric data are not provided.
Neither trial collected data about continence of flatus or stool, descriptive data from physical examination, or urodynamic studies. Both research teams concluded that they did not observe any benefits associated with episiotomy. Klein and colleagues, based on perineometry measures, also concluded that episiotomy fails to prevent pelvic floor relaxation.
Prospective Studies. The Italian study by Sartore and colleagues provided the most global assessment of continence and pelvic floor function; they addressed each of our four categories of outcomes.71 They enrolled 519 primiparous women who had singleton, spontaneous vaginal births in lithotomy position. Women with pre-existing incontinence were excluded. Measures of outcomes at 3 months included in-person interviews, physical examination, perineometry, a test to provoke stress urinary incontinence, and a urine-stream-interruption test. The study team clearly described methods, used a standard scheme for classifying prolapse, and collected data about urinary and anal incontinence. Overall measures and implementation were good. For the entire panel of outcomes (stress urinary incontinence, anal incontinence, anterior prolapse, posterior prolapse, vaginal manometry, and urine-stream interruption), there was only one statistically significant difference in perineometry findings. Women who did not have an episiotomy (all mediolateral) had higher contraction strength on perineometry (13.8 compared to 12.2; P < 0.001); moreover, the proportion of women with abnormal manometry was higher among women with episiotomy (40.6 percent compared to 27.7 percent without episiotomy). The adjusted relative risk for abnormal manometry was 1.8 (95% CI, 1.2–2.6). The study team concluded that episiotomy is associated with lower pelvic floor muscle strength than spontaneous tears. All self-reported symptoms of urinary and anal incontinence and degree of prolapse on physical examination were equivalent across groups so the clinical significance of this finding is unclear. Overall interpretation must be that episiotomy does not protect against incontinence, prolapse, or decrements in pelvic floor muscle function by 3 months postpartum.
Studies focused on self-reported urinary continence. Excluding the clinical trial populations and the study by Sartore et al. described above, five studies (in four study populations) evaluated self-report of urinary continence.69, 70, 72, 74, 76 Two used a telephone interview,70, 72 the other three mailed questionnaires.
Karacam and Eroglu provided the least-detailed information:72 no details about how stress incontinence was queried or defined for data analysis and no report of adjusting for factors that might influence outcomes by using stratified analyses or multivariable models. They reported, from bivariate data at 3 months (N = 100), that 24 percent of women with episiotomy and 30 percent of women without episiotomy had stress incontinence.
Eason and colleagues asked about occurrence and frequency of “involuntary loss of urine when coughing, sneezing, laughing, or running” in a cohort of 949 women also at 3 months.76 For analysis, they reported that any stress incontinence occurred in 29 percent of those with episiotomy and in 35 percent of those without. Multivariable models for stress urinary incontinence comparing episiotomy to no episiotomy yielded an odds ratio of 0.68 (95% CI, 0.47–1.01).
Viktrup and colleagues reported using a questionnaire to obtain all the facets of the International Continence Society definitions of incontinence from 305 women.70 They reported no differences but do not provide numeric data. In summary, they stated that although women with episiotomy had more incontinence postpartum, differences had resolved by 3 months and remained equivalent at 1 year. Their followup survey at 5 years revealed in multivariable models that episiotomy in a first birth was significantly associated with stress incontinence.69 An adjusted point estimate is not provided.
Rockner's followup of a cohort of 185 women with either episiotomy or spontaneous tear at 4 years after the index pregnancy asked women about symptoms before, during, and after all pregnancies; information about the index pregnancy focused on frequency and severity of urinary incontinence; need for pad; and loss of urine provoked by cough, laugh, or sneeze.74 Symptom profiles were very similar across groups; for instance, 34 percent of women who had an episiotomy and 30 percent without reported incontinence provoked by cough, laugh, or sneeze. No stratified or adjusted models are provided. The author concluded that episiotomy and spontaneous-tear groups had the same frequency of incontinence symptoms. Overall, each research team investigating self-reported urinary incontinence concluded that no evidence supported the view that episiotomy prevents pelvic floor defects.
Studies focused on self-reported incontinence of stool or flatus. Three cohort studies asked women about rectal incontinence symptoms.75, 77, 79 One study also conducted physical examinations.77
The earliest of these studies was conducted on a cohort constructed of 81 women who had third-or fourth-degree lacerations.77 The prevalence of episiotomy among women without third- or fourth-degree lacerations was known as well as their episiotomy history in the index pregnancy. They were followed up at 3 months when their symptoms were evaluated and they received a physical examination. All women who had fecal incontinence and abnormal rectal examination at three months had had an episiotomy. Of those with an abnormal exam and no incontinence 60 percent had had an episiotomy, meaning relative risk of abnormal exam was 50 percent greater among those with history of episiotomy. These authors focused on the high prevalence of anorectal dysfunction at 3 months with episiotomy as a key risk factor. None of the research teams that focused on incontinence of flatus or stool found episiotomy to be significantly associated with reduced risk.
MacArthur and colleagues sent questionnaires at 6 to 7 months and then followed up all women who had a variety of symptoms at 10-months with an in-person interview. 79 Their questionnaire and interview classified several types of fecal incontinence and staining (not including simple flatus); any one or more of the symptoms was considered evidence of incontinence. At 10 months, episiotomy was not an independent predictor of fecal incontinence in multivariable models. Because episiotomy was not a key focus of their analysis, they do not provide a point estimate.
Eason and colleagues inquired about involuntary loss of stool or flatus and the frequency at 3 months.75 Women with episiotomy reported higher prevalence of loss of stool (5.4 percent) and loss of flatus (30.2 percent) than did women without episiotomy (2.5 percent and 24.4 percent, respectively). Adjusted models revealed that episiotomy without extension was associated with a relative risk of 1.3 (95% CI, 0.9–1.8), and third- to fourth-degree lacerations (virtually all after episiotomy) were associated with 2.1-fold increased risk of anal incontinence (95% CI, 1.4–3.1). The study team concluded that anal incontinence was associated with severe lacerations that are most likely to result from episiotomy.
Studies focused on physiologic measures of pelvic floor function. In 1985, Gordon and Logue published the first use of perineometry to evaluate prospectively a group of 70 women.68 They included a nonpregnant and a cesarean comparison group to take into account changes associated with pregnancy and labor, respectively. They did not compare women with their own measures in pregnancy. No difference in maximum contraction strength was seen across groups. However, the researchers did note that postnatal exercise level was highly associated with perinatal muscle strength.
Fleming and colleagues refined the Gordon and Logue study design.73 They conducted perineometry antepartum, at 6 weeks, and again at 6 months among 102 women with singleton spontaneous vaginal births. Detailed measurement protocols are provided and their analysis focused on mean difference between antepartum scores and 6-month scores. No differences in perineal muscle strength or endurance were identified between laceration and episiotomy groups.
In another approach to measuring muscle strength, Rockner and colleagues conducted studies with weighted vaginal cones at 36 weeks gestation and again at 2 months postpartum. They calculated decrements in weight that could be retained while standing or walking: women with episiotomy had the greatest decrement in function (30 gm decrease in maximum weight held), compared to 19.2 gram decrease with intact perineum, and 18.9 gm decrease with spontaneous tears (P < 0.001).78
Overall, none of these research teams concluded that episiotomy had advantages, and one identified a decrease in functional muscle strength. These intermediate findings concur with the self-report and clinical examination findings of other studies that detected no evidence of benefit from episiotomy with respect to preserving continence or pelvic floor muscle function.
Study Participants. In current practice in the United States, women who are giving birth for the first time are most likely to have a routine episiotomy. Several studies that evaluated sexual function restricted the study population to primiparous women. This approach assures that the influence of episiotomy, spontaneous laceration, or intact perineum reflects only the potential influences of the index birth, rather than both the index birth and any prior history of perineal trauma among women who have had prior births. Those studies that did not restrict their study of sexual function to primiparous patients adjusted for prior episiotomy in data analysis as a method to account for the influence of prior birth experiences. All studies restricted participation to singleton births; and some specifically included only women who had a spontaneous vaginal birth.
Episiotomy Type. Only one study directly compared mediolateral to median (midline) episiotomy.36 The remainder of the studies reflects the dominant practice patterns of the countries in which the studies were conducted. Mediolateral episiotomy is routine in the countries represented, with the exception of the United States and Canada, where median episiotomy is routine. Overall, when episiotomy was performed in the U.S. and Canadian studies, it was a median incision;29, 34, 38 this phenomenon stands in contrast to 98.9 percent mediolateral episiotomies in the European and Turkish studies.72, 33, 71, 80, 80, 81
This factor introduces a fundamental difference in the “exposure” across studies. The anatomic location, involved tissues plains, extent of perineal disruption, and risk for extension associated with mediolateral as compared to median episiotomy are distinctly different. Once healed, the scar from each type of episiotomy and from spontaneous lacerations will be subject to different amounts and types of contact, pressure, and stretch depending on position of partners during sexual intercourse. Thus, the literature reflects two distinct types of procedures, the effects of which need to be addressed separately.
Outcome Measures. Of the 10 studies included for this key question, eight were not designed to address sexual function as the primary outcome. Only the Signorello et al. and Karacam and Eroglu studies reported that a primary objective of the study was to assess the relationship between perineal trauma (spontaneous versus episiotomy) and postpartum sexual function.38, 72
The most consistently reported outcome was “dyspareunia.” In three of these 10 publications, the researchers provide no detail to document how they phrased a question or questions about pain with intercourse or how they recorded participant responses; no reports distinguished between pain on insertion, deep dyspareunia with thrusting, or residual pain (deep or perineal) after intercourse. Four studies used a written questionnaire to collect information about sexual function.23, 33, 38, 80 One study conducted telephone interviews,72 and four conducted in-person interviews.29, 34, 71, 81 In-person interview methods for assessing sexual function outcome tended to be more detailed than those obtained from written questionnaires. However, none of the publications distinguish between pain on insertion, deep dyspareunia with thrusting, or residual pain (deep or perineal) after intercourse.
Across all 10 studies, investigators used three approaches for summarizing when women experienced dyspareunia. The most common was to inquire about any dyspareunia since resuming intercourse. Other authors inquired about dyspareunia with episodes of intercourse near the time of the followup. To differentiate this approach from measures of any experience of dyspareunia, we have called inquiry about recent status “current dyspareunia” in this report. Less often, authors reported about pain at the time of the first episode of intercourse after the index birth.
In a related measure, four research teams also asked women to recall when they resumed having intercourse. This question allows the investigators to report both continuous and categorical data about the proportion of women who had resumed intercourse by particular points in time, for example, by 2 months postpartum.23, 29, 34, 80 Few authors clearly explained if the prevalence of dyspareunia reported is appropriately calculated as a proportion (number of women with pain with intercourse divided by number of women who have resumed intercourse). The most common timeframe for assessment of outcomes was 3 months. One group assessed dyspareunia at postpartum exams between 2 and 3 months;81 one used a mailed questionnaire at 6 months;38 and the longest followup was conducted by questionnaire mailed at 3 years.33
The two publications by Klein and colleagues had the most elaborate approach to collecting several types of information. These authors reported greater detail about how participant responses were collected and analyzed. In interviews at 3 months postpartum, they asked women when they resumed intercourse and assessed recalled pain at the first postpartum episode of intercourse using the McGill Pain Scale. They inquired about sexual satisfaction using an unspecified number of items measured on a 4-point scale and reported a summary measure of “sexual satisfaction” in their tables.29, 34 Two other groups classified degree of pain with intercourse using an approach that assigned levels: none, mild, moderate, and severe.71, 34
Quality. None of the identified studies was designed exclusively to examine sexual function. We classified primary and secondary outcomes based on objectives provided in the introduction of the publication or used stated research questions to classify primary and secondary objectives.
None of the 10 studies met criteria that we consider necessary to be a good study of sexual function after episiotomy. Our criteria included (1) documentation of a representative sample of women who had spontaneous vaginal births, (2) use of outcomes that provide a well-rounded picture of sexual function, (3) clear specification of outcome measurement approach (including specification of items asked of participants on surveys or in interviews), (4) use of measures with documented validity and reliability, (5) use of adjusted models in prospective data to control for potential confounding factors, (6) minimal to modest loss to followup, and (7) use of intention-to-treat analysis in randomized clinical trials.
| Citation | Study Design | Timing of Outcome Assessment after Birth; | Outcome Assessed | Outcome among Those with Episiotomy | Outcome among Those without Episiotomy | Results of Multivariable Models |
|---|---|---|---|---|---|---|
| Epis. Type | N | Approach | Definitions Provided | Authors' Conclusions | ||
| Country | ||||||
| Sleep et al., 198423 | RCT | 3 months | Resumption of intercourse by 3 months (not defined) | 90% | 90% | Not reported |
| Mediolateral | N = 1000 | Mailed questionnaire | Current dyspareunia: “pain during sexual intercourse” | 22% | 18% | Only difference was tendency for women allocated to restrictive episiotomy to resume intercourse sooner. |
| UK | Any dyspareunia: “pain during sexual intercourse, at some time” in prior 3 months | 52% | 51% | |||
| Sleep and Grant, 198733 | Prospective cohort that included RCT participants | 3 years | Any dyspareunia: | 16% | 13% | RR 1.21 (0.84, 1.75); |
| Mediolateral | N = 326 | Mailed questionnaire | “ever suffering painful sexual intercourse” | No significant difference | ||
| UK | ||||||
| Rockner et al., 198880 | Prospective cohort | 3 months | Resumption of intercourse (Y/N) | 92% | 92% | Not reported |
| Mediolateral (88%) | N = 205 | Questionnaire (setting not specified) | Current dyspareunia | 20% | 20% | No significant difference |
| Sweden | (not defined) | 44% | 43% | |||
| Any dypareunia in prior 3 months | ||||||
| (not defined) | ||||||
| Larsson et al., 199181 | Prospective cohort | 2 to 3 months | Dyspareunia (not defined) | 16% | 11% | Not reported |
| Mediolateral | N = 1889 | In-person interview with midwife | None made regarding sexual function | |||
| Sweden | ||||||
| Klein et al., 199229 | RCT: | 3 months | Resumption of intercourse (“weeks between birth and first intercourse”) | Primip: 5.8 (2.1) | Primip: 5.9 (2.5) | Time to resumption of intercourse similar; those with intact perineum began intercourse 1 week earlier than others. Pain with resumption, 3-month sexual satisfaction and proportion not resuming by 3 months similar across groups. |
| Midline | Liberal vs restrictive | In-person interview | Dyspareunia: “Pain at first postpartum intercourse” assessed using McGill Pain Scale | Multip: 5.8 (2.6) | Multip: 5.4 (2.3) | |
| Canada | N = 703 | Sexual satisfaction at 3 months × items using “4 point scale” - actual items not provided | Primip: 2.2 (1.3) | Primip: 2.2 (1.3) | ||
| Multip: 1.3 (1.1) | Multip: 1.2 (1.0) | |||||
| Primip: 3.1 (0.7) | Primip: 3.0 (0.8) | |||||
| Multip: 3.3 (0.7) | Multip: 3.3 (0.6) | |||||
| Klein et al., 199434 | Prospective cohort derived from RCT | 3 months | Resumption of intercourse by week 6 | Epis alone: 61.7% | Intact: 76.5% | Women with spontaneous perineal tears had less pain on first intercourse than those with episiotomy alone. Those with third- to fourth-degree episiotomy extensions had the most pain on resumption of intercourse. |
| Midline | N = 697 | In-person interview | Dyspareunia: “Pain at first postpartum intercourse: none, mild, discomforting, distressing-horrible” | Third-/fourth-degree: 55.4% | Spont. tear: 62.5% | |
| Canada | Sexual satisfaction at 3 months; items using “4-point scale” - actual items not provided | Epis alone: | Intact: | |||
| Mild: 22.7% | Mild: 37.6% | |||||
| Discomf: 34.1% | Discomf: 22.8% | |||||
| Distress: 28.8% | Distress: 6.9% | |||||
| Third-/fourth-degree: | Spont. tear: | |||||
| Mild: 23.0% | Mild: 27.3% | |||||
| Discomf: 39.3% | Discomf: 27.3% | |||||
| Distress: 29.5% | Distress: 24.6% | |||||
| Epis alone: | Intact: | |||||
| Not satisfied: 16.3% | Not satisfied: 5% | |||||
| Third/fourth degree: | Spont: | |||||
| Not satisfied: 21.3% | Not satisfied: 15.8% | |||||
| Signorello et al., 200138 | Cohort with a single prospective window | 6 months | Current dyspareunia: “pain on sexual intercourse” at 6 months | Multivariate models for type of perineal trauma: | Degree of perineal trauma, not episiotomy per se associated with dyspareunia. | |
| Midline | N = 921 | Mailed questionnaire | None: Referent | |||
| United States | Second degree:1.3 (0.8, 2.2) | |||||
| Third/fourth degree: 1.5 (0.7, 3.5) | ||||||
| Karacam and Eroglu, 200372 | Prospective cohort | 3 months | Any dyspareunia (not defined) | 64.58% | 54.17% | Not reported |
| Mediolateral | N = 100 | Telephone interview | No significant differences between groups in rate of mothers' dyspareunia. | |||
| Turkey | ||||||
| Sartore et al., 200471 | Prospective cohort | 3 months | Current dyspareunia (not defined); classified as “absent, mild, moderate, severe”; reported Y/N | 7.9% | 3.4% | Summary measure: |
| Mediolateral | N = 519 | In-person interview | RR: 2.43 (1.05, 5.45) | |||
| Italy | ||||||
| Citation | Study Design | Timing of Outcome Assessment after Birth | Outcome Assessed | Outcome among Those with Episiotomy* | Outcome among Those without Episiotomy* | Authors Conclusions |
|---|---|---|---|---|---|---|
| Country | Episiotomy Type | Approach | Definitions Provided | |||
| Dyspareunia at 3 Months | ||||||
| Rockner et al., 198880 | Prospective cohort | 3 months | Current dyspareunia | 31/154 (20%) | 9/46 (20%) | No significant difference |
| Sweden | Mediolateral: 88% | Questionnaire (method not specified) | (not defined) | |||
| Larsson et al., 199181 | Prospective cohort | 2 to 3 months | Dyspareunia | 66/410 (16%) | 69/627 (11%) | None made regarding sexual function |
| Sweden | Mediolateral: 98% | In-person interview with midwife | (not defined) | |||
| Sartore et al., 200471 | Prospective cohort | 3 months | Current dyspareunia (not defined); classified as “absent, mild, moderate, severe”; reported Y/N | 20/254 (7.9%) | 9/265 (3.4%) | RR: 2.43 (1.08, 5.45) |
| Italy | Mediolateral: 100% | In-person interview | ||||
| Dyspareunia within 3 Months | ||||||
| Rockner et al., 198880 | Prospective cohort | 3 months | Any dyspareunia | 68/154 (44%) | 20/46 (43%) | No significant difference |
| Sweden | Mediolateral: | Questionnaire (method not specified) | (not defined) | |||
| 88% | ||||||
| Karacam and Eroglu, 200372 | Prospective cohort | 3 months | Any dyspareunia (not defined) | 31/48 (64.58%) | 26/48 (54.17%) | No significant differences between groups in rate of mothers' dyspareunia |
| Turkey | Mediolateral: | Telephone interview | ||||
| 100% | ||||||
Note: RR, relative risk; Y, yes; N, no.
Randomized Controlled Trials. Two publications present results from RCTs of restrictive compared to liberal use of episiotomy; the investigators used an intention-to-treat analysis. These trials provide evidence about the long-term effects of a particular type of policy about episiotomy use on the sexual outcomes of populations of women. The earlier of the two trials was conducted in the United Kingdom in 1982.23 Perineal outcomes in the West Berkshire Perineal Management Trial differed clinically and statistically by group. Among women in the liberal-use group, 51.4 percent had had an episiotomy (all mediolateral), 6.0 percent had an episiotomy with extension to third- or fourth-degree laceration, 24.5 percent had a spontaneous perineal tear only, and 24.3 percent had no perineal trauma. In the restrictive-use group, 10.2 percent had an episiotomy, 1.2 percent had an episiotomy with extension, 55.8 percent had a spontaneous perineal tear only, and 33.9 percent had no trauma. By 1 month after delivery, 37 percent of the restrictive group and 27 percent in the liberal group had resumed sexual intercourse (P < 0.01). The proportion of women with resumption of intercourse by 3 months, current dyspareunia at 3 months, or any dyspareunia within the 3 months of followup did not differ significantly by group.23 By the third year of followup, the likelihood of “ever suffering painful intercourse” remained comparable across groups.33
The trial conducted by Klein and colleagues in Canada also found less episiotomy use in the restrictive group with higher rates of spontaneous lacerations.29 Among women in the liberal-use group, 67.2 percent had an episiotomy (midline), 14.2 percent had an episiotomy with extension to third- or fourth-degree laceration or sulcal tear high in the vaginal vault, 12 percent had a spontaneous perineal tear only, and 6.6 percent had no perineal trauma. In the restrictive-use group, 42 percent had an episiotomy, 15 percent had an episiotomy with extension, 35 percent had a spontaneous perineal tear only, and 7.5 percent had no perineal trauma. Women in the restrictive group resumed intercourse an average of 1 week earlier that those in the liberal group; however, all other measures of sexual function were equivalent by 3 months.29
These trials were designed primarily to assess rates of episiotomy and perineal trauma under different strategies to guide use of episiotomy. Restrictive use, as addressed in KQ 1, was hypothesized to result in less severe trauma among women with lacerations and in a higher proportion of women without perineal lacerations. If women experienced less perineal trauma, this improvement would be expected to be associated with less pain with future intercourse. Therefore, the Canadian trial team also undertook a separate analysis of the relationship between the degree of perineal trauma and sexual function. Using data from the 3-month interviews, they regrouped participants by perineal status that was systematically documented at the time of the index birth, creating a prospective cohort. In this cohort analysis, women with an intact perineum were most likely to have resumed intercourse by 6 weeks (76.5 percent), followed by those with spontaneous tears (62.5 percent), episiotomy alone (61.7 percent) and third- and fourth-degree lacerations (55.4 percent). Women with episiotomy had the slowest return to intercourse. Pain with the first intercourse followed a similar pattern.34
Prospective Cohorts. Signorello and colleagues were the sole research team from the United States to assess sexual function.38 They documented trauma at the time of childbirth by chart review and followed up women at 6 months. They reported that the degree of trauma, rather than whether it resulted from episiotomy or spontaneous tear, was the primary determinant of pain with intercourse at 6 months. In prospective 6-month data, the risk of pain with intercourse was higher among those with second-degree trauma compared to no trauma (RR 1.3, 95% CI, 0.8–2.2), and highest with third- and fourth-degree trauma (RR 1.5; 95% CI, 0.7–3.5), although not statistically significant.
These cohort studies do not find large or statistically significant differences in sexual function. Only one study identified lasting differences in dyspareunia at 3 months. Sartore and colleagues reported that women with episiotomy were more than twice as likely to have pain than those without episiotomy.71 An aggregate estimate for current dyspareunia at 3 months can be estimated from three of the cohort studies using 818 women with episiotomy and 938 women without episiotomy.71, 80, 81 We used meta-analysis techniques to calculate an aggregate risk ratio for the combined population of the prospective cohorts. In these studies, women with episiotomy were 54 percent more likely to have pain with intercourse 3 months after delivery (RR: 1.54, 95% CI: (I: 1.19, 2.00), with an absolute increase in risk of dyspareunia of 5 percent among women who had episiotomy: 14.3 percent versus 9.3 percent. Similar estimates for the two studies that assessed any dyspareunia during the 3 months after childbirth reveal no difference in the overall probability of having had painful intercourse. Among 50 women with episiotomy, 65 percent have had pain with intercourse and 50 women without episiotomy, 54 percent had had pain with intercourse but this was not statistically significant.72
The RTI International-University of North Carolina Evidence-based Practice Center (RTI-UNC EPC) identified a modest body of literature addressing the relationship between episiotomy and maternal outcomes. This chapter presents the conclusions from each of our key questions and discusses these conclusions in the context of our ratings of the strength of the body of evidence that we reviewed in detail in Chapter 3. Additionally, we discuss limitations of the review and this literature in general. Finally, we summarize needs for future research.
The focus of this systematic review is on maternal outcomes of “routine” episiotomy, with specific emphasis on five key questions:
KQ 1. Does the practice of liberal or routine episiotomy, compared to more selective use of episiotomy, influence maternal postpartum outcomes?
KQ 2. Does episiotomy incision type (i.e., midline or mediolateral) influence maternal postpartum outcomes?
KQ 3. Does the repair of the perineal defect (suture type and repair approach) influence maternal postpartum outcomes?
KQ 4. Does episiotomy have a long-term influence on urinary incontinence, fecal incontinence, or pelvic floor defects?
KQ 5. Does episiotomy or incision type, or both, influence future sexual function?
We have not assessed literature on maternal or fetal outcomes at the time of use in response to a maternal or fetal emergency or concurrent with use of vacuum or forceps. Moreover, much of the literature we did review to answer these key questions did not examine a full range of maternal outcomes.
Several elements of our review and approach to documentation warrant emphasis. First, conceptualizing “routine use” is a challenge because many studies describe the category by negatives such as “not for fetal distress” and “not for dystocia.” Thus, we provide the operational definitions of “routine” (sometimes denoted as “liberal”) and “restricted” use that authors of included publications used; in this way, readers may apply this information as a filter through which to view study findings. Second, readers need to appreciate that the majority of the included studies reflect outcomes of mediolateral episiotomy, rather than midline; the latter is the predominant approach used in the United States. For that reason, we specifically note the type of episiotomy used in individual studies in the text and tables throughout this report. Finally, we have developed detailed evidence tables (Appendix C *) that include these details as well as numerous other specifics of study design, measurement methods, and outcomes.
This systematic evidence review assessed 7 randomized controlled trials of routine versus restricted use of episiotomy and identifies the sole trial of midline versus median episiotomy. We present evidence from 17 randomized controlled trials (RCTs) that is relevant to choosing among options for repair methods. We have also extended prior reviews to encompass longer-term maternal outcomes. Specifically, we have systematically assessed the evidence from 3 trials and 12 prospective cohorts related to the influence of episiotomy on measures of pelvic floor relaxation and urinary and fecal continence and the evidence from 4 trials and 6 prospective cohorts that provide information about sexual function and satisfaction.
As described in Chapter 3 and documented in our evidence tables, we gave close attention to grading the quality of individual studies. To complete the picture of the strength of evidence, we used that information and the collective picture of relevant work on each key question to arrive at a systematic rating of the overall strength of the evidence. To accomplish this, we created four ratings, based largely on past methods for this step from previous evidence reports of the RTI-UNC EPC, including systematic reviews performed for the U.S. Preventive Services Task Force. This approach employs four categories to describe the strength of evidence, as defined below:
I. The evidence is from studies of strong design; results are both clinically important and consistent with minor exceptions at most; results are free from serious doubts about generalizability, bias, or flaws in research design. Studies with negative results have sufficiently large samples to have adequate statistical power.
II. The evidence is from studies of strong design, but some uncertainty remains because of inconsistencies or concern about generalizability, bias, research design flaws, or adequate sample size. Alternatively, the evidence is consistent but derives from studies of weaker design.
III. The evidence is from a limited number of studies of weaker design. Studies with strong design either have not been done or are inconclusive. No published literature.
| Key Question | Grade (I–IV Scale)* |
|---|---|
| 1. Episiotomy and maternal postpartum outcomes | II |
| 2. Episiotomy incision type and maternal morbidity | III |
| 3. Repair of perineal defect and maternal morbidity | |
Methods: 2-layer vs. 3-layer repair | III |
Methods: Continuous vs. interrupted sutures | III |
Materials: Absorbable vs. tissue adhesive | III |
Materials: Absorbable sutures — standard vs. rapidly absorbed | III |
Materials: Untreated catgut vs. treated catgut | III |
Materials: Nonabsorbable vs. absorbable | III |
Materials: Polyglycolic acid vs. chromic catgut | II |
Combined methods and materials | III |
| 4. Episiotomy and urinary incontinence, fecal incontinence, and pelvic floor defects | II |
| 5. Episiotomy and future sexual function | II |
This literature, spanning two decades from the mid-1980s to the present, has high internal consistency with respect to the postpartum effects of routine (or liberal) versus restrictive strategies for episiotomy use.22, 23, 28–32 We found few if any meaningful discrepancies in findings, and overall we regarded this body of evidence as Grade II.
Across studies, women in the restrictive-use groups had less severe posterior perineal trauma, more frequent but not severe anterior vaginal trauma, less overall need for suturing, and higher probability of having an intact perineum when compared to routine- or liberal-use policies. These differences in perineal trauma were associated with less pain in the short term among those in restrictive-use groups, with fairly prompt resolution of pain regardless of trial arm. Women in restrictive use arms had no greater or lesser risk of wound healing complications and were more likely to resume intercourse earlier. Overall loss to followup was pronounced during the weeks after birth; remarkably little is known about recovery trajectory or complications thereafter.
Although these trials are of fair to poor quality overall, we base our conclusion that this evidence does not support routine use of episiotomy on the notable consistency of findings. Routine episiotomy does not achieve any of the short-term goals it has been hypothesized to achieve. Indeed, routine use is harmful to the extent that it creates a surgical incision of greater extent than many women might have experienced had episiotomy not been performed.
On the question of midline vs. mediolateral episiotomy, only a single study found that women who had a midline episiotomy had a significantly greater rate of anal sphincter injuries than women in the mediolateral episiotomy group.36 Treatment groups did not report differences in pain or satisfaction with intercourse at 3 months after the intervention.
Because of considerable methodological flaws in this trial (poor internal validity), any conclusions must be drawn cautiously, and we rate this “body” of evidence as Grade III. However, because differences in sphincter injury rates are clinically important, we consider the finding of increased risk of severe injury with midline episiotomy compared to mediolateral to be relevant observational evidence.
Methods. Because of the heterogeneity of methods used for repair of episiotomies and perineal lacerations, overall conclusions are applicable only to the repair method under study. Generally, we rate the strength of evidence for these issues as Grade III except in one instance, mentioned below.
Two trials60, 63, 66 studied a two-layer approach, in which the perineal skin is left unsutured, against a three-layer approach, in which the skin is sutured closed. Two trials48, 58 investigated the differences between a continuous (or subcuticular) method and an interrupted (or transcutaneous) method. Specific limitations of the trials and conclusions can be found below, grouped by the particular method comparison under study. Overall limitations in this body of evidence and suggestions for future research are presented later in this chapter.
Two-layer vs. three-layer repair. Both trials provided consistent evidence that favored the two-layer approach, although statistically significant differences between the two approaches were not always found. The Ipswich Childbirth study had an overall good quality with adequate definitions and measurement of outcomes, reported masking of the outcome assessors and patients, and less than 15 percent loss to followup at 1 year.60, 63 Definitions and measurement of the outcomes were only fair in the other trial,66 making it harder to compare to other trials. The latter trial also reported outcomes only on women who had completed all followup assessments; it did not report on possible differences between completers and the women who missed one or more followup assessments.
Because the two-layer approach involves less suturing, it also means less inflammation and bruising; this in turn could result in less pain and perineal morbidity. Another explanation for the differences may lie with suture type. In the first trial, which produced fewer significant differences between the approaches, the investigators balanced polyglycolic acid and chromic catgut sutures.60, 63In the other trial, a portion of the significant differences between the groups may be explained by an imbalance in suture type used. Chromic catgut was predominantly used and is hypothesized to be associated with increased pain, edema, and inflammation.66
Even with the limitations discussed above, the pool of evidence from both trials suggests that less overall perineal morbidity is associated with the two-layer repair approach, which leaves the perineal skin unsutured, than with the three-layer approach. The reduction in pain, need for analgesia, wound healing problems, and sexual morbidity, as well as a decrease in the time and cost required for initial suturing of the perineal skin, removal, and possible resuturing, may make the two-layer approach more beneficial than the traditional three-layer approach.
Continuous vs. interrupted sutures. Although the evidence is unclear, it suggests that a continuous method of repair, though it may be technically more difficult, may be superior to the interrupted method. Two good-quality trials produced inconsistent evidence that the continuous method of repair has less perineal morbidity and more patient satisfaction associated with it than the interrupted method of repair.48, 58
Both trials, through a factorial design of randomization, also randomized women to different suture material groups. Both trials achieved valance with respect to suture type; authors represented results for methods of repair regardless of suture type. Both trials defined and measured outcomes well and achieved good followup. In both trials, the authors describe greater familiarity with the interrupted method of repair, which is said to be technically easier to perform than the continuous method. One clinical group (for the trials conducted in Southmead, United Kingdom) even suggests that their inconsistencies with other trials might have be attributable to the lack of practice with the method and subsequent unpopularity with the operators that performed the repair.58 Whether such differences in outcome arise for clinicians and women outside the United Kingdom, where methods of repair and training of those performing the repair could be different compared to other countries, remains to be seen.
Materials. Because this review includes trials dating back to 1974, the materials used differ over time. Two trials47, 64 compared absorbable sutures with tissue adhesive; two trials48, 62 compared absorbable sutures with their rapidly absorbed versions; one trial56 compared untreated with treated catgut; two trials54, 58 compared nonabsorbable sutures with absorbable sutures; and eight trials50–53, 58, 59, 61, 63, 65 compared polyglycolic acid with chromic catgut. Because of this heterogeneity, specific limitations of the trials and conclusions can be found below, grouped by the particular material comparison under study. Overall limitations in this body of evidence and suggestions for future research can be found later in this chapter.
Absorbable vs. tissue adhesive. Both trials47, 64 were of poor quality because the method of randomization was inadequate or broken. However, even though sample size was small (n < 65 in both trials), both groups did define and measure perineal pain well and achieved good followup. These trials contribute possible evidence that repair with tissue adhesive may decrease perineal pain experienced in different situations in the immediate postpartum. This conclusion must be weighed in light of the inadequate randomization of these studies. Our review suggests that this question merits further study in a well-randomized trial.
Absorbable sutures: standard vs. rapidly absorbed. The mixed results from the good trial48 and lack of significant differences between groups in the poor trial62 suggest that evidence is insufficient about any difference in perineal pain between standard and rapidly absorbed sutures. We saw stronger evidence that women who had rapidly absorbed sutures required less removal of the material, presumably because it had been absorbed more quickly in the postpartum period. We had difficulty assessing the effect of type of absorbable suture on other healing outcomes because the poor trial grouped all outcomes together. Although the two trials evaluated sexual functioning at different time points, evidence suggests that rapidly absorbed sutures may decrease the amount of dyspareunia and the severity thereof in the puerperium. One item to note in the good-quality trial was the masking of the suture material. The trial acquired undyed sutures direct from the manufacturer, thereby achieving a very high level of internal validity and decreasing the amount of bias in assessment of the outcomes.
Untreated catgut vs. treated catgut. Only one trial addressed treated and untreated catgut.56 This trial achieved fair randomization and was able to blind the assessors and the patients. Loss to followup at 10 days and 3 months was minimal, but at 3 years, loss to followup was only fair (70 percent). A small amount of crossover to the other suture material group occurred, but the investigations did perform an intent-to-treat analysis.
This trial produced no evidence that “softgut” (i.e., treated catgut) is superior to untreated catgut with regard to perineal morbidity. In fact, the trial may indicate that softgut may be associated with higher morbidity, as there appeared to be more perineal pain in the immediate postpartum period and more painful sexual intercourse in the longer-term period. Though untreated catgut sutures needed to be removed more often, the authors attributed the difference to the tendency for the sutures to dry out. However, they speculated that such drying out could not completely explain the differences in perineal pain.
The women were repaired using different techniques, but the randomized groups were balanced in that respect. Investigators used the interrupted method approximately 60 percent of the time in both groups. Stratifying by technique of repair showed more marked dyspareunia and need for suture removal in women who were repaired using interrupted sutures, a finding that is consistent with other trials investigating method of repair
Nonabsorbable vs. absorbable. Because of the study design of the fair-quality trial54 and lack of control for possible confounding by method of repair, we cannot draw conclusions about the role of silk sutures in perineal morbidity from this trial. The authors present data by suture material and then do not mention differences in the methods until the conclusions section of their article. They concluded that the subcuticular method lent itself to short-term advantages, but they did not present the data to support their conclusion. Thus, although this trial may contribute to a body of evidence that looks at combinations of materials and methods, it does not contribute to the overall understanding of the role of suture materials in perineal morbidity, separate from methods of repair. The Mahomed et al. trial58 found no differences between the two groups in the short-term postpartum period, but did find differences at 3 months, indicating a possible delayed effect of the suture material.
Polyglycolic acid vs. chromic catgut. In 2004, the Cochrane Library published a systematic review and meta-analysis of information on polyglycolic acid versus catgut suture material for repair of perineal trauma.82 In it, they report that polyglycolic acid sutures were associated with less pain in the short-term postpartum period (Odds ratio [OR] = 0.62; 95% confidence interval [CI], 0.54–0.71) and with less need for analgesia (OR = 0.63; 95% CI, 0.52–0.77). No differences were found in long-term pain outcomes or in reports of dyspareunia.
Our systematic evidence review includes six of the eight trials that were included in the Cochrane review and an additional two trials. Overall, the evidence is from a combination of poor, fair, and good trials, but we considered the strength of evidence as Grade II; moreover, it is consistent with the previous Cochrane review. Basically, evidence indicates that polyglycolic acid sutures are associated with less perineal pain, a lesser need for analgesia use, and fewer healing problems in the short-term postpartum. For long-term outcomes, the evidence is consistent that outcomes of the use of polyglycolic acid sutures and chromic catgut do not differ substantially. One trial not in the Cochrane review65 did report more perineal pain and dyspareunia in the polyglycolic-acid group at 6 months, an outcome the authors attributed to the slower absorption rate of polyglycolic-acid sutures; however, these results were neither statistically significant nor precise. Overall, the body of evidence for the comparison of polyglycolic-acid sutures versus chromic-catgut sutures suggests that using polyglycolic-acid sutures for perineal repair offers many short-term advantages.
Combined Methods and Materials. Instead of investigating methods and materials separately, two trials49, 55 compared entire approaches, combining both materials and methods in a single randomization design. The poor trial49 found no differences between the groups; the fair-quality trial55 found that women repaired with polyglycolic-acid sutures using a continuous, subcuticular approach suffered less perineal morbidity. This result is consistent with other trials that investigated subcuticular suturing and polyglycolic-acid sutures separately, perhaps reinforcing the notion that this method and suture type are superior to other options available to obstetric clinicians. Overall limitations in this body of evidence and suggestions for future research are provided later in this chapter.
Summary. The heterogeneity of methods and materials used for repair of episiotomy and perineal laceration arises in part from the passage of time and differences in practice across continents. Another set of issues to consider in studying the repair of episiotomy are the economic and geographic differences among clinical practices across the world. The choice of suture material might be restricted in resource-poor settings. If a clinic cannot afford or does not have access to polyglycolic acid, which, as reported in one of the studies,66 is more expensive than other absorbable sutures, then the clinic may have to make do with available sutures but perhaps supplement them with a method of repair that can decrease perineal morbidity.
During the time period that this review encompasses, investigators studied three major classes of suture material (nonabsorbable, absorbable, and tissue adhesive) and two subtypes of sutures (treated versus untreated and standard versus rapidly absorbed). These materials were all studied in the presence of different approaches to the method of suturing; therefore, individual effects of the materials themselves cannot be examined. Likewise, the methods of repair were examined in the context of different materials among the studies and within them for different stages of repair. For these reasons, truly determining the effects of a certain method of repair is impossible, because we are unable to tell whether the outcomes are confounded or modified by suture material.
The literature that provides evidence about routine episiotomy, continence, and pelvic floor defects and function is limited in several domains. All but two of the cohorts34, 75 report outcomes for use of mediolateral episiotomy rather than midline episiotomy. The length and incompleteness of followup limits the usefulness of the data. We aimed to identify publications with followup ranging from years to decades after births to women with known episotomy histories. However, at completion, only five of 16 publications provide data with followup at 1 year or longer;33, 68–70, 74the longest interval was 5 years.69 Followup conducted within 6 months of birth, as in 10 percent of the 16 studies, does not reflect full recovery of the pelvic floor from vaginal birth. For that reason, we considered 6 months followup as an intermediate point of comparison rather than an evaluation of final pelvic floor and continence status.
All measures for followup at or beyond a year were self-reported by interview or questionnaire except a single study with perineometry conducted at 12 months.68 Neither self-report for many urinary and rectal continence symptoms nor perineometry and physiologic measures have been adequately validated. These measures do not relate directly to physical examination findings or individual functional status. Indeed, urodynamic testing and physical examination have very limited documentation of their ability to predict or classify continence status. These limitations must be kept in mind. The greatest clinical relevance would be to assess continence and pelvic floor deficits among women with known episiotomy histories beginning in their 40s and proceeding through their lifetimes. None of the identified studies provided such data.
In summary, these prospective studies are limited because they do not follow women long enough to detect disease occurrence. At present, the assumption that intermediate variables such as pelvic muscle strength measured by perineometry, urodynamic test results, or early reports of symptoms can predict later disease has not been validated. Prospective evaluation only during the months after birth when the pelvic floor is still in a recovery and stabilization period may be misleading. Conclusions about whether episiotomy prevents or increases risk for incontinence and prolapse later in adult life cannot be reached from currently available randomized and cohort studies.
The studies addressing this question need to be considered in three groups: the six studies that virtually exclusively evaluate effects of mediolateral incision,23, 33, 71, 72, 80, 81 three that evaluate midline (median) incision,29, 34, 38 and one that compared the two incision types.36 From just the clinical trials of episiotomy strategy—liberal versus restrictive—one trial addresses each type of incision, and one directly compares the two incision types. None finds substantive differences in sexual function. Overall, this body of literature supports a conclusion that perineal trauma is associated with probability of pain with intercourse in a dose-response fashion such that greater perineal injury is associated with greater probability of pain.
The quality of the evidence (to which we assigned Grade III) to assess this question is limited for reflecting on the consequences of episiotomy. Both the clinical trials and the prospective cohorts assess overly simplified measures of sexual function. Definitions and specification of approaches to measurement are insufficient in many studies to assure accurate interpretation of findings. Validated instruments intended to assess nuances of sexual function such as type of pain, location of pain, severity of pain, orgasm, lubrication, and libido have not been deployed in the published research to assess prospectively the influence of perineal trauma in childbirth on future sexual function. More complex measures would need to be used to understand properly relationships between perineal trauma and future sexual function. Specific factors such as prior sexual function and current libido, in addition to factors such as duration of second-stage labor, size of infant, and lactation status need to be incorporated into multivariable models to generate more-informative and less-biased estimates of the long-term effects of episiotomy in this area. With these caveats, the evidence does not suggest that episotomy results in improved sexual function outcomes.
Our systematic review should be interpreted in the context of several limitations. First, as with all systematic reviews, its findings depend on the predefined approach to searching the literature and on the quality of the published literature identified. The limitations of the available studies (see Chapter 3) include the following:
age of the data (trials from the 1980s and early 1990s were conducted in an era when background rate of episiotomy was higher);
insufficient size of most trials for assessing clinically relevant endpoints (third- and fourth-degree lacerations, long-term incontinence);
inadequate specification of a priori primary and secondary outcomes with few references to power calculations to determine required study size;
infrequent use of multivariate modeling to account for shortcomings of randomization or need for stratification within RCTs or potential confounders in prospective cohort studies;
infrequent use of masking of the assessor for outcomes;
use of a wide variety of measures and timepoints for maternal postpartum outcomes, making comparisons among studies difficult;
rare use of validated outcome measures;
limited reporting of precise definitions of self-reported outcomes, particularly for pain, sexual function, and incontinence; and
inconsistent reporting of appropriate statistical measures (i.e., use of P values without measures of magnitude or confidence intervals), making it difficult to determine if null findings represent lack of effect or limitations in power.
An additional limitation of the literature on KQ 1 results from the nature of the intervention. In essence, KQ 1 reviews studies of clinician behavior when asked to implement different policies for episitomy use. Differences in episiotomy rates observed in groups assigned to routine (i.e., liberal) use vary from a low of 23 percent29 to a high of 52 percent,30 suggesting that clinician behavior is not easily modified, even in the context of an RCT. Inconsistencies in the way that clinicians define and interpret routine use and restricted use of episiotomy within and across trials may temper differences between protocol groups. Additionally, violations of protocol can invalidate initial power calculations and lead to results that cannot be interpreted.
Our review process also had some limitations. Because of time and resource constraints, we did not conduct dual, independent, blinded review of articles for inclusion or abstraction of information into evidence tables. Instead, one reviewer performed the initial review, and a second reviewer examined that input and recommended changes. Differences were reconciled between the two reviewers. We used dual review for grading the quality of individual articles, allowing us to evaluate rigorously systematic bias in these assessments.
Studies comparing restrictive to liberal use of episiotomy report that, even under a restrictive approach, episiotomy rates vary between 8 percent and 52 percent.22, 23, 28–32 This disparity suggests that episiotomy is considered to be clinically indicated for a substantial number of women even at the lowest levels recorded by our review. Currently, the evidence suggests that the putative benefits of episiotomy do not outweigh the harms in the general population. Instead, outcomes from episiotomy are worse because some proportion of women who would have had lesser injury instead had a surgical incision.
The majority of these studies to assess outcomes of routine episiotomy used mediolateral episiotomy. We do not, however, conclude that additional study of outcomes of routine use versus liberal use of midline episiotomy is warranted. Observational studies other than RCTs clearly and consistently relate midline episiotomy to higher rates of anal sphincter and rectal injury than those observed with mediolateral episiotomy.9, 40, 42, 83 Thus, we would expect trials of routine use of midline episiotomy to have more numerous unfavorable outcomes than those observed in these studies, an effort not worth replicating given the lack of benefit of episiotomy supported by existing evidence.
If episiotomy were restricted to indicated use, an important question remains for women and their care providers: Which, if any, of the prevailing indications for episiotomy are supported by an adequate research base? A two-stage research agenda could address this need. First, a systematic review may clarify current knowledge about outcomes of episiotomy for the leading presumed indications. Second, primary data collection may be needed to fill in research gaps identified by such a review and to improve understanding of whether these are indeed indications for episiotomy. Work relating to the latter element of such a research agenda is under way on several topics including recent publication of a retrospective cohort study that suggests that use of episiotomy conferred no benefit in averting neonatal injury at the time of births complicated by shoulder dystocia.84 Additional evidence will be required to fully investigate what circumstances should be considered indications for episiotomy.
Establishing an evidence base for indications would lead to a health services research agenda focused on variations in rates and outcomes. Several issues are paramount: What safe and conservative rates of episiotomy are attainable? Should measures of quality of childbirth care include episiotomy rates? What approaches are most successful in reducing unnecessary use of episiotomy?
Furthermore, if the professional community accepts that routine episiotomy is not an effective means to reduce perineal injury, that attitude should enable them to redouble efforts to understand fully various (other) approaches to attending the second stage of labor that can promote maternal and infant safety, minimize perineal trauma, and maximize maternal comfort. The failure of one intervention-oriented method such as episiotomy to deliver such results does not reduce the likelihood that other approaches, or combinations of approaches, may be useful. These approaches include giving attention to maternal position, avoiding fundal pressure, reducing coached pushing, providing perineal support, and employing “hands poised” versus hands on techniques to support the perineum, and the role for lubrication and types of lubrication for use during crowning of the infant head. Any or all of these techniques may help women and their care providers reach desired outcomes more frequently and deserve to be subjected to rigorous study.
Researchers must also continue to investigate the relationship between self-care practices such as Kegel pelvic floor exercises, general physical fitness, and nutrition, and the risk for pelvic floor defects including incontinence and prolaspes. To the degree that pelvic floor recovery can be facilitated or “rehabilitation” achieved by nonsurgical means, numerous women would benefit from such research. To understand pelvic floor defects and childbirth experiences properly, including history of episiotomy, studies need to be designed to identify populations of women who have a known episiotomy history to evaluate their continence and pelvic organ prolapse status in the age groups between 40 and 70 years.
Understanding the relationship of pelvic floor morbidity to childbirth experiences will require increasingly sophisticated analysis methods and study designs. Evaluation and incorporation of confounders and modifiers of the effect of exposure must become the norm for prospective data analysis. Factors such as maternal race and ethnicity, body mass index, infant birth weight, duration of second-stage labor, duration of strenuous pushing, and elements of reproductive history such as outcomes of prior births require attention. Cohorts of women who participated in perinatal research in the 1980s will soon enter the timeframe in which meaningful followup of pelvic floor status can be obtained.
Future research on sexual function and sexuality after childbirth is needed. Very limited data are available even to describe what women should expect as normal. Research will need to take into account breastfeeding status, episiotomy and laceration history, repair methods, and contraceptive type. Greater attention is needed to distinguish dyspareunia and characteristics that help describe dypareunia (an anatomic symptom), from “satisfaction” with its components of relationship quality, sexual aptitude, and cerebral contribution, and from ability to achieve and consistency of achieving orgasm. Sexual function outcomes need to be regarded as appropriate primary research aims so that these concerns do not remain secondary measures with insufficient attention to reach meaningful answers.
Our review of the literature on the repair of perineal defects points to another avenue for further research. Clinical judgment suggests that the perineal outcome of repair is a function of both materials and methods. Consistencies in the evidence from the studies of repair (e.g., polyglycolic-acid sutures are better than chromic catgut; continuous suturing is better than interrupted suturing) can be used to inform future studies in more creative randomization designs or multivariate analyses. More sophisticated designs might allow a trial to compare complete approaches to repair rather than individual components, such as the studies performed by Doyle et al.49 and Isager-Sally et al.55
One clinically relevant study might compare combinations of the materials and methods that seem to decrease morbidity (e.g., subcuticular polyglycolic-acid sutures) with new materials such as tissue adhesive that enter the market with (unproven) claims of reduced perineal morbidity. Unless multivariate models are used to tease out mixed-effects of methods and materials or future research begins randomizing groups to entire approaches to repair, results can be informative and applicable to a population only to a certain point. The gap in information may mean that, in the future, women who are receiving appropriate episiotomies may still not receive a thorough repair. Some observers, however, may regard mounting such a trial as questionable. Thus, a useful first step might be to develop sample-size estimates based on a range of important outcomes and, in this way, to determine whether such a trial is even feasible to attempt.
Our systematic review finds no health benefits from episiotomy. We found fair to good evidence suggesting that the immediate outcomes for routine (liberal-use policies) episiotomy are no better than those for indicated use of episiotomy under more restrictive-use policies. Indeed, routine use is harmful to the extent that it creates a surgical incision of greater extent than many women might have experienced had episiotomy not been performed. Weak trial evidence, consistent with observational data, suggests that the harms of midline episiotomy are greater than the harms of mediolateral episiotomy.
For outcomes of repairing an episiotomy, fair to good evidence, albeit across different comparisons of methods and materials, suggests that leaving the perineal skin unsutured may confer some benefit; if suturing is indicated, then a continuous, subcuticular method is better than an interrupted, transcutaneous method. Regarding suture material, the evidence is consistent and clear that absorbable sutures are preferred and that polyglycolic-acid sutures have significantly less perineal morbidity associated with them. Newer materials, such as tissue adhesive, may offer further benefits, but the data are at present wholly inadequate to inform care practices.
The level of evidence for long-term sequelae, specifically fecal and urinary incontinence, pelvic floor function, and future sexual function is fair to poor. Nonetheless, it is consistent in demonstrating the lack of benefit of the procedure in a comparatively early timeframe. For women in later adult life, when morbidity is most likely to occur in the form of severe and persistent incontinence or pelvic organ prolapse, the expected results of routine episiotomy are unknown.
| Field in ProCite | Code | Meaning of Code | Description & Comments |
| Field Number: 11 | I | Abstract included | Article was pulled for review |
| Original field name: Title | E | Abstract excluded | Article was NOT pulled for review |
| Labeled as: Abstract Inclusion/Exclusion | B | Background | Article was excluded from the review but pulled for background |
| (From the Abstract Review Form) | |||
| Field in ProCite | Code* | Meaning of Code | Description & Comments |
| Field number: 12 | I | Full text included | “Full text INCLUDED” is checked at the end of the form |
| Original field name: Reprint Status | B | Background | “Full text EXCLUDED but used for BACKGROUND CITATION” is checked at the end of the form; these articles, because they are excluded, will also have an exclusion code(s) in this field |
| Labeled as: Full Text Inclusion/Exclusion | U | Unavailable | The article was to be pulled for review but was not retrievable by the libraries |
| (From the Full Text Inclusion/Exclusion form) | E | Full text excluded-Unclassified | These articles are obvious excludes but the reviewers did not agree on the reason…this may be settled in the future |
| *Please note: An article could be excluded for more than one reason if the reasons are E3-E6 | SETTLED 11/12/04 | ||
| E1 | Full text excluded - Not original research | #1 is “No” | |
| E2 | Full text excluded - Wrong population | #2 is “No” | |
| E3 | Full text excluded - N<40 | #3 is ”No” | |
| E4 | Full text excluded - Wrong outcome and/or study design | #4 is “No” | |
| E5 | Full text excluded - Foreign language | #5 is “No” | |
| E6 | Full text excluded - Wrong time period | #6 is “No” | |
More than one code can be entered into this field. Separate codes with a comma
| Field in ProCite | Code* | Meaning of Code | Description & Comments |
| Field number: 13 | 1 | Addresses KQ1 | Enter if box is checked under “Outcomes and Key Questions” |
| Original field name: Place of Meeting | 2 | Addresses KQ2 | Enter if box is checked under “Outcomes and Key Questions” |
| Labeled as: Key Questions Addressed | 3 | Addresses KQ3 | Enter if box is checked under “Outcomes and Key Questions” |
| (From the Full Text Inclusion/Exclusion form) | 4 | Addresses KQ4 | Enter if box is checked under “Outcomes and Key Questions” |
| 5 | Addresses KQ5 | Enter if box is checked under “Outcomes and Key Questions” |
More than one code can be entered into this field. Separate codes with a comma
| Field in ProCite | Code | Meaning of Code | Description & Comments* |
| Field Number: 14 | Y | Yes | The reference list of this background article was hand-searched for additional references |
| Original field name: Medium Designator | N | No | The reference list of this background article was not hand-searched for additional references |
| Labeled as: Background-Hand Searched |
These articles have a “B” in field number 11 or field number 12. This field will be blank for all other articles.
| Field in ProCite | Code | Meaning of Code | Description & Comments |
| Field Number: 15 | RCT | RCT | This study was a randomized-controlled trial |
| Original field name: Edition | |||
| Labeled as: Study Design | COHORT | Prospective Cohort | This study was a prospective cohort |
| Search Terms | Results |
|---|---|
| “Episiotomy” [MeSH] Field: All Fields, Limits: English, Randomized Controlled Trial, Human | 75 |
| “Episiotomy”[MeSH], English, Review, Human | 68 |
| Labor Stage, Second [mh], English, Review, Human | 40 |
| Labor Stage, Second [mh], English, Randomized Controlled Trial, Human | 58 |
| Search Number | Search Terms | Results |
|---|---|---|
| #1 | “Episiotomy”[MeSH:NoExp] Field: All Fields, Limits: English, Human | 676 |
| #2 | “Episiotomy” English, Editorial, Human | 14 |
| #3 | “Episiotomy” English, Letter, Human | 58 |
| #4 | “Episiotomy” English, Review, Human | 68 |
| #5 | “Episiotomy” English, Meta-Analysis, Human | 3 |
| #6 | “Episiotomy” English, Practice Guideline, Human | 0 |
| #7 | #2 OR #3 OR #4 OR #5 OR #6 | 140 |
| #8 | #1 NOT #7 | 536 |
| #9 | Repair | 138222 |
| #10 | #1 AND #9 | 86 |
| #11 | labor stage, second [mh] | 638 |
| #12 | #9 AND #11 | 6 |
| Search Number | Search Terms | Results |
|---|---|---|
| #1 | “Episiotomy”[MeSH:NoExp] Field: All Fields, Limits: English, Human | 306 |
| Search Number | Search Terms | Results |
|---|---|---|
| #1 | “Episiotomy”[MeSH:NoExp] Field: All Fields, Limits: English, Human | 49 |
First Author: _____________________________________________________________
Journal: _________________________________________________________________
Year of Article: ________ Abstractor Initials: ___ ___ ___
| 1. Original research | Yes | No | Cannot Determine |
| (Exclude editorials, commentaries, letters to editor, reviews, etc.) | |||
| 2. Includes females of reproductive age | Yes | No | Cannot Determine |
| 3. Addresses one or more of the following. (Check all that apply.) | Yes | No | Cannot Determine |
| ____ Outcomes of routine episiotomy (KQ1) | |||
| ____ Outcomes of episiotomy incision type (KQ2) | |||
| ____ Approach/outcomes of repair of perineal defects (KQ3) | |||
| ____ Urinary/fecal incontinence/pelvic floor defects/prolapse (KQ4) | |||
| ____ Sexual function (KQ5) | |||
| If any KQ is checked, circle “Yes” in box. | |||
| 4. Study N is greater than or equal to 40 subjects. | Yes | No | Cannot Determine |
| 5. Study published between 1950 and 2004. | Yes | No | Cannot Determine |
| 6. If KQ1, 2 or 3 (outcomes of routine episiotomy / episiotomy incision type / repair of the perineal defect on maternal postpartum outcomes): | Yes | No | Cannot Determine |
| RCT study design used. | |||
| 7. If KQ4 or KQ5 (outcomes of episiotomy on urinary and fecal incontinence/pelvic floor/prolaspe/sexual function): | Yes | No | Cannot Determine |
| ____ RCT study design used | |||
| ____ Prospective cohort | |||
| If either is checked, circle “Yes” in box. | |||
| 8. Published in English. | Yes | No | Cannot Determine |
| If non-English, specify language: _______________________ | |||
| Applies to key question # _____________________________ | |||
_____ CHECK HERE IF ARTICLE TO BE PULLED FOR BACKGROUND CITATION.
IF ANY ITEMS IN GRAY BOX, THE ARTICLE IS EXCLUDED.
IF ITEMS 7, 8 OR 9 MARKED “NO,” ARTICLE MAY BE EXCLUDED IN FUTURE.
IF CANNOT DETERMINE, ARTICLE WILL BE PULLED FOR REVIEW.
| Article #: __________ | First Author: ______________ | Reviewer's Initials: ___ ___ |
| Criteria for Inclusion/Exclusion | Meets criteria | |
|---|---|---|
| 1. Original research that includes routine episiotomy (i.e., not for distress or assisted vaginal deliveries) | Yes | No (STOP!) |
| (Excludes editorials, commentaries, letters to editor, reviews, etc.) | ||
| 2. Addresses sequelae of vaginal child birth | Yes | No (STOP!) |
| 3. N is greater than or equal to 40 subjects | Yes | No |
| 4. Addresses at least one of the key questions with the appropriate study design | Yes | No |
| (See below to determine. Ifat least one line has both columns of boxes checked, circle Yes. Otherwise, circle No.) | ||
| 5. Published in English | Yes | No |
| If no, what language: _________________________ | ||
| 6. Published between 1950 and 2004 | Yes | No |
Please check all outcomes that apply. If outcome is checked, please check if the listed design applies to the study:
| Outcomes and Key Questions | Study Design | |
|---|---|---|
| KQ1 |
Outcomes of routine episiotomy |
RCT |
| KQ2 |
Outcomes of episiotomy incision type |
RCT |
| KQ3 |
Outcomes of repair and/or repair method of perineal defects |
RCT |
| KQ4 |
Urinary/fecal incontinence/pelvic floor defects/prolapse |
RCT |
Prospective cohort | ||
| KQ5 |
Sexual function |
RCT |
Prospective cohort | ||
| If “Yes” is circled for ALL criteria… |
Full text INCLUDED |
| If ANY “No” is circled… |
Full text EXCLUDEDOR… |
Full text EXCLUDED but used for BACKGROUND CITATION | |
Article number: _______________________________
Abstractor Name: _______________________________
Date of abstraction: __ / __ / ___
Name of second abstractor/reviewer: _______________________________
Date of review: __ / __ / ___
Year Published: _______________________________
Surname of first author: _______________________________
| 8. Healthcare Setting | ||
| a. Labor & Delivery/Maternity Unit |
Yes |
No |
| b. ED |
Yes |
No |
| c. ICU |
Yes |
No |
| d. Other (Please specify :_________________________) |
Yes |
No |
| e. Not specified |
Yes |
No |
Where was the study conducted? (List all countries)
_____________________________________________
_____________________________________________
_____________________________________________
_____________________________________________
| 10. What is the funding source of the study? (Mark all that apply) | |||
| a. Industry |
Yes |
No | |
| b. Government |
Yes |
No | |
| c. Professional Society |
Yes |
No | |
| d. Hospital/Managed Care Organization |
Yes |
No | |
| e. Foundation |
Yes |
No | |
| f. Consumer/Patient Organization |
Yes |
No | |
| g. Not reported |
Yes |
No | |
| h. Unclear |
Yes |
No | |
| i. Other (specify): ___________________________________ |
Yes |
No | |
_______________________________________________________________________________
__________________________________________________________________________________
__________________________________________________________________________________
__________________________________________________________________________________
__________________________________________________________________________________
__________________________________________________________________________________
Inclusion Criteria
__________________________________________________
__________________________________________________
__________________________________________________
__________________________________________________
Exclusion Criteria
__________________________________________________
__________________________________________________
__________________________________________________
__________________________________________________
| 14. Is the study an RCT? |
Yes (Continue) |
No (Skip to Question #18) |
Total number of randomization arms in this study ___________________
Description of each randomization arm
Group 1: _______________________________________________________________________
Group 2: _______________________________________________________________________
Group 3. _______________________________________________________________________
Describe randomization methods
_____________________________________________________________________________________
_____________________________________________________________________________________
_____________________________________________________________________________________
_____________________________________________________________________________________
Describe blinding/masking methods and how maintained
_____________________________________________________________________________________
_____________________________________________________________________________________
_____________________________________________________________________________________
_____________________________________________________________________________________
Comments: _____________________________________________________________________________________
______________________________________________________________________________________
______________________________________________________________________________________
| Group #1 | Group #2 | Group #3 or Totals for Whole Study | Overall (t tests/p values were given) | ||
|---|---|---|---|---|---|
| 19. Defining characteristic of each group/ trial arm(Please label all subsequent table columns with characteristic) | |||||
| 20. Age at enrollment/randomization | |||||
| a. Minimum age | |||||
| b. Maximum age | |||||
| c. Mean age | |||||
| d. Standard deviation | |||||
| 21. Number of participants at enrollment/randomization(see section #10 on page 9 for numbers of participants available at followup) | |||||
| 22. Race (Record as presented in article, as N or %. Follow percent with %. If race categories do not match the below list, please describe each category as explained in the article)* | |||||
| a. White | |||||
| b. Hispanic | |||||
| c. African-American | |||||
| d. Asian | |||||
| e. Native American | |||||
| f. Other (describe) | |||||
| 23. Parity (Record as presented in article, as N or %. Follow percent with %)* | |||||
| a. Nulliparous | |||||
| b. Primiparous | |||||
| c. Multiparous | |||||
| 24. Education (please describe categories) | |||||
| a. | |||||
| b. | |||||
| c. | |||||
| d. | |||||
| e. | |||||
| f. | |||||
| g. | |||||
Don't forget to label your columns!
![]() | Group #1 _____________ _____________ _____________ _____________ _____________ | Group #2 _____________ _____________ _____________ _____________ _____________ | Group #3 or Totals for Whole Study _____________ _____________ _____________ | Overall (t tests/p values were given) | |
| 25. Other demographic variable #1 (please describe) | |||||
| 26. Other demographic variable #2 (please describe) | |||||
| 27. Other demographic variable #3 (please describe) | |||||
If percent is calculated from the numbers abstracted from the table, please describe how calculated.
Comments: _______________________________________________________________________________
__________________________________________________________________________________________
__________________________________________________________________________________________
Don't forget to label your columns!
![]() | Group #1 _____________ _____________ _____________ _____________ _____________ | Group #2 _____________ _____________ _____________ _____________ _____________ | Group #3 or Totals for Whole Study _____________ _____________ _____________ | Overall (t tests/p values were given) | |
|---|---|---|---|---|---|
| 28. Duration of second stage labor(Describe definition from article and list quantitative values if possible) | |||||
| 29. Delivery attendant (Record as presented in article, as N or %. Follow percent with %. If categories below do no match those presented in article, please adjust the categories) | |||||
| a. Unspecified | |||||
| b. Student | |||||
| c. Midwife | |||||
| d. Obstetrician | |||||
| e. Other (e.g., family physician…please specify) | |||||
| 30. Anesthesia at time of delivery (Record as presented in article, as N or %. Follow percent with %) | |||||
| a. None | |||||
| b. Local | |||||
| c. Epidural | |||||
| d. Spinal | |||||
| e. Sacral block | |||||
| 31. Delivery Position(Describe-will be classified at a later time. Possible positions include: dorsal/supine, horizontal, lateral, semisitting, squatting, kneeling) | |||||
| 32. Episiotomy use (Record as presented in article, as N or %. Follow percent with %) | |||||
| a. None | |||||
| b. Midline | |||||
| c. Mediolateral | |||||
| d. Done, not specified | |||||
| 33. Mode of Delivery (Record as presented in article, as N or %. Follow percent with %) | |||||
| a. Spontaneous | |||||
| b. Vacuum | |||||
| c. Forceps | |||||
| d. Other (specify) | |||||
| 34. Birthweight (Specify units of weight) | |||||
| a. Minimum weight | |||||
| b. Maximum weight | |||||
| c. Mean weight | |||||
| d. Standard deviation | |||||
| e. Other (please specify) | |||||
| 35. Estimated gestational age (specify units) | |||||
| a. Minimum age | |||||
| b. Maximum age | |||||
| c. Mean age | |||||
| d. Standard deviation | |||||
| e. Other (please specify) | |||||
Comments: _______________________________________________________________________________
_________________________________________________________________________________________
_________________________________________________________________________________________
Don't forget to label your columns!
![]() | Group #1 _____________ _____________ _____________ _____________ _____________ | Group #2 _____________ _____________ _____________ _____________ _____________ | Group #3 or Totals for Whole Study _____________ _____________ _____________ | Overall (t tests/p values were given) | |
| 36. Description of repair approach | |||||
| 37. Repair done by (Record as presented in article, as N or %. Follow percent with %. If categories below do no match those presented in article, please adjust the categories) | |||||
| a. Unspecified | |||||
| b. Student | |||||
| c. Midwife | |||||
| d. Obstetrician | |||||
| e. Other (e.g., family physician… please specify) | |||||
| 38. Suture Type (Record as presented in article, as N or %. Follow percent with %) | |||||
| a. Chromic catgut | |||||
| b. Polyglycolic acid | |||||
| c. Other (specify) | |||||
Comments: ______________________________________________________________________________
_________________________________________________________________________________________
_________________________________________________________________________________________
Please describe the number of participants that contributed data to each timepoint of followup during the study and reasons for missing data if applicable. Please refer to section 7, question #20 on page 4 of this form for number of participants in each group at the time of randomization or enrollment.
Don't forget to label your columns!
![]() | Group #1 _____________ _____________ _____________ _____________ _____________ | Group #2 _____________ _____________ _____________ _____________ _____________ | Group #3 or Totals for Whole Study _____________ _____________ _____________ | If data is missing, please describe why |
| 39. Timepoint #1: | ||||
| 40. Timepoint #2: | ||||
| 41. Timepoint #3: | ||||
| 42. Timepoint #4: | ||||
| 43. Timepoint #5: |
Comments: ______________________________________________________________________________
_________________________________________________________________________________________
_________________________________________________________________________________________
_________________________________________________________________________________________
_________________________________________________________________________________________
*Additional space for outcomes can be found in the section 11 addendum on page 16.
| Definition of Outcome(Please describe how the authors defined/operationalized the outcome in the space below the outcome name) | How is outcome measured? (e.g., visual inspection, interview) | Length of time since delivery (Specify units) | Group #1 _____________ _____________ _____________ _____________ _____________ | Group #2 _____________ _____________ _____________ _____________ _____________ | Group #3 or Totals for Whole Study _____________ _____________ _____________ | Overall (RR/OR with CI & p values where given) | ||
|---|---|---|---|---|---|---|---|---|
| 44.Posterior Lacerations/Defects | ||||||||
| a. Intact perineum _______________________________________________________________________________________ | ||||||||
| b. First degree tear _______________________________________________________________________________________ | ||||||||
| c. Second degree tear _______________________________________________________________________________________ | ||||||||
| d. Third degree tear _______________________________________________________________________________________ | ||||||||
| e. Fourth degree tear _______________________________________________________________________________________ | ||||||||
| f. Third/fourth degree tear combined _______________________________________________________________________________________ | ||||||||
| 45. Other Lacerations/Defects | ||||||||
| a. Anterior _______________________________________________________________________________________ | ||||||||
| b. Other vaginal _______________________________________________________________________________________ | ||||||||
| 46. Perineal Pain(see page 17 for additional room) ___________________________________________________________________________________________________ | ||||||||
| 47. Analgesia Requirements ___________________________________________________________________________________________________ | ||||||||
| 48. Suturing Required ___________________________________________________________________________________________________ | ||||||||
| 49. Infection ___________________________________________________________________________________________________ | ||||||||
| 50. Wound breakdown ___________________________________________________________________________________________________ | ||||||||
| 51. Texture/appearance of scar ___________________________________________________________________________________________________ | ||||||||
| 52. Satisfaction with birth experience ___________________________________________________________________________________________________ | ||||||||
| 53. Pelvic floor defects ___________________________________________________________________________________________________ | ||||||||
| 54. Urinary Incontinence | ||||||||
| a. Stress incontinence _______________________________________________________________________________________ | ||||||||
| b. Urgency incontinence _______________________________________________________________________________________ | ||||||||
| 55. Fecal Incontinence | ||||||||
| a. Incontinence of flatus _______________________________________________________________________________________ | ||||||||
| 56. Fecal Incontinence (continued) | ||||||||
| b. Incontinence of liquid stool _______________________________________________________________________________________ | ||||||||
| c. Incontinence of formed stool _______________________________________________________________________________________ | ||||||||
| 57. Dyspareunia ___________________________________________________________________________________________________ | ||||||||
| 58. Recommencement of sexual intercourse ___________________________________________________________________________________________________ | ||||||||
| 59. Satisfaction from sexual intercourse ___________________________________________________________________________________________________ | ||||||||
| 60. Additional outcome #1 ___________________________________________________________________________________________________ | ||||||||
| 61. Additional outcome #2 ___________________________________________________________________________________________________ | ||||||||
| 62. Additional outcome #3 ___________________________________________________________________________________________________ | ||||||||
Please use this table for additional information on outcomes from above
| Definition of Outcome(Please describe how the authors defined/operationalized the outcome in the space below the outcome name) | How is outcome measured? (e.g., visual inspection, interview) | Length of time since delivery (Specify units) | Group #1 _____________ _____________ _____________ _____________ _____________ | Group #2 _____________ _____________ _____________ _____________ _____________ | Group #3 or Totals for Whole Study _____________ _____________ _____________ | Overall (RR/OR with CI & p values where given) |
|---|---|---|---|---|---|---|
| Additional information for outcome # _____ in section 11 above.(Enter a number between 42 and 60) | ||||||
| Additional information for outcome # _____ in section 11 above.(Enter a number between 42 and 60) | ||||||
| Additional information for outcome # _____ in section 11 above.(Enter a number between 42 and 60) |
Please use this table for additional information on perineal pain
| Definition of perineal pain(Please describe how the authors defined/operationalized the outcome in the space below the outcome name) | How is outcome measured? (e.g., visual inspection, interview) | Length of time since delivery (Specify units) | Group #1 _____________ _____________ _____________ _____________ _____________ | Group #2 _____________ _____________ _____________ _____________ _____________ | Group #3 or Totals for Whole Study _____________ _____________ _____________ | Overall (RR/OR with CI & p values where given) |
|---|---|---|---|---|---|---|
Abstractor's Initials ___ ___
Article #: ________________
| 63. Was the randomization plan adequate? |
Yes |
No |
N/A (prospective cohort) | |
| 64. Was the randomization plan carried out adequately? |
Yes |
No |
N/A (prospective cohort) | |
| 65. Was similarity of groups at baseline reported? |
Yes |
No | ||
| a. Were statistics reported? |
Yes |
No | ||
| b. Were there statistically significant differences between the groups at baseline? |
Yes |
No |
Not reported | |
| 66. Were eligibility criteria specified? |
Yes |
No | ||
| 67. Were the outcome assessors masked? |
Yes |
No |
Not reported | |
| 68. Was crossover (from one group to another) reported? |
Yes |
No | ||
| 69. Was loss-to-followup reported? |
Yes |
No | ||
| 70. Please give the number of participants remaining in the analyses for each group in the study for the primary outcome(see section 10). | Group 1 | Group 2 | Group 3 | |
| i.e.: # remaining | ||||
| # at randomization | ||||
| 71. Were there post-randomization exclusions? |
Yes |
No |
Not reported | |
| 72. Was intention-to-treat (ITT) analysis reported? |
Yes |
No |
Not reported | |
| 73. Overall quality rating |
Good |
Fair |
Poor | |
Assessment of Quality of Individual Articles for RCT's
| Randomization Approach | Randomization Implementation | Masking of Outcome Assessors and/or Participants | Operational Definitions and Measurements | ||||||
| Is there description of the approach to randomization? | Is there proven good balance with statistical significance? | (Please circle one) | (Please circle one) | ||||||
| Yes | No | Yes | No | Good | Fair | Poor | Good | Fair | Poor |
| Is there a fatal flaw in the approach (such as lottery cards)? | Is there good balance achieved as shown in table? | NR | Notes: | ||||||
| Yes1 | No | Yes | No | Notes: | |||||
| Explain: | |||||||||
| Overall Randomization Approach and Implementation | |||||||||
| (Please circle one) | |||||||||
| Good2FairPoor | |||||||||
| Post-Randomization Exclusions | Loss to Follow-up: Short-term | Loss to Follow-up: Long-term | Overall Quality3 | ||||||
| (Please circle one) | (Please list numbers and percentages for each follow-up time point) | (Please list numbers and percentages for each follow-up time point) | (Please circle one) | ||||||
| Yes | No | T1 (describe): | T1 (describe): | GOOD | |||||
| Please describe: | T2 (describe): | T2 (describe): | FAIR | ||||||
| T3 (describe): | T3 (describe): | POOR | |||||||
| T4 (describe): | T4 (describe): | ||||||||
If fatal flaw in randomization approach exists, overall randomization approach and implementation is poor and overall quality of the article/trial is also poor
Approach must be described and there must be good balance in order to achieve an overall randomization and implementation score of good
All component ratings must be good with minimal loss to follow-up for the article/trial to receive an overall quality rating of good. If an article has one or two fair or poor ratings, an overall quality score of fair should be assigned. If an article/trial has three or more fair or poor ratings and/or large loss to follow-up, the overall quality should be poor.
| AP | antepartum |
| BMI | body mass index |
| CC | chromic catgut |
| cm | centimeter |
| cont | continuous |
| deg | degree |
| diff | difference |
| ext | extension |
| G | group |
| g | grams |
| GA | gestational age |
| GP | General Practitioner |
| hrs | hours |
| instr | instrumental |
| interr | interrupted |
| int | interview |
| L&D/MU | Labor and Delivery Maternity Unit |
| LSCS | lower segment cesarean section |
| mL | millileter |
| mm | millimeter |
| mod | moderate |
| mos | months |
| N | number |
| NA | not applicable |
| NR | not reported |
| NS | not significant |
| OR | odds ratio |
| PFMS | pelvic floor muscles |
| PGA | polyglycolic acid |
| PNC | prenatal care |
| pt | point |
| quest | postal questionnaire, self-report questionnaire |
| RCT | randomized controlled trial |
| RR | relative risk/risk ratio |
| SD | standard deviation |
| sec | second (adjective) |
| SHO | senior house officer |
| sig diff | significant differences |
| spont | spontaneous |
| subcut | subcuticular |
| transcut | transcutaneous |
| UK | United Kingdom |
| VAS | Visual Analog Scale |
| wks | weeks |
| yr | year |
This study was supported by Contract 290-02-0016 from the Agency for Healthcare Research and Quality (AHRQ), Task No. 4. We acknowledge the continuing support of Kenneth Fink, MD, MGA, MPH, Director of the AHRQ Evidence-Based Practice Center (EPC) Program, and Marian James, PhD, the AHRQ Task Order Officer for this project.
The investigators deeply appreciate the considerable support, commitment, and contributions of the EPC team staff at RTI International and the University of North Carolina (UNC). From UNC, we thank EPC Co-Director, Timothy S. Carey, MD, MPH; EPC Literature Search Specialist, B. Lynn Whitener, PhD and Leah Randolph, M.A. and Laura Morgan, M.A Research Assistants. We also express our gratitude to Loraine Monroe, EPC word processing specialist, and Debra Bost, editor at RTI International.
We also extend our appreciation to the members of our Technical Expert Advisory Group (TEAG), who provided advice and input during our research process. The RTI-UNC EPC team solicited the views of TEAG members from the beginning of the project. TEAG members also provided insights into and reactions to work in progress and advice on substantive issues or possibly overlooked areas of research. TEAG members participated in refining the analytic framework and key questions and discussing the preliminary assessment of the literature, including inclusion/exclusion criteria, and also provided input on the information and categories, including evidence tables. The TEAG was both a substantive resource and a “sounding board” throughout the study. It was also the body from which expertise was formally sought at several junctions. TEAG members are listed below:
| Leah Albers, CNM, DrPH | John O.L. DeLancey, MD Professor Division of Gynecology |
| Professor, College of Nursing and | Department of Obstetrics & Gynecology |
| Dept. OB-GYN, School of Medicine University of New Mexico Health Sciences Center | University of Michigan |
| Jose Belizan, MD, PhD | William Droegemueller, MD |
| Director Latin American Center for Perinatology and Human Development Pan American Health Organization, World Health Organization | Director of Evaluation, |
| American Board of OB/GYN | |
| David A. Grimes, MD | |
| Vice President Biomedical Affairs, Family Health International | |
| Linda Brubaker, MD | Dwight J. Rouse, M.D., M.S.P.H. Professor |
| Professor of Obstetrics/Gynecology and Urology | Obstetrics and Gynecology |
| Loyola University | School of Medicine |
| University of Alabama at Birmingham | |
| Pierre Buekens, MD, PhD, MPH | |
| Dean, School of Public Health | |
| Tulane University |
We gratefully acknowledge the following individuals who reviewed the initial draft of this report and provided us with constructive feedback. External reviewers comprised clinicians, researchers, representatives of professional societies, and potential users of the report. We would also like to extend our appreciation to David Atkins, MD, and Susan Meikle, MD MSPH from AHRQ for contributing peer review comments. Our peer review panel also includes all members of the TEAG. Peer review was a separate duty for these individuals and not part of their commitment as TEAG members. All are active professionals in the field. The peer reviewers were asked to provide comments on the content, structure, and format of the evidence report and to complete a checklist. The peer reviewers’ comments and suggestions formed the basis of our revisions to the evidence report. Acknowledgments are made with the explicit statement that this does not constitute endorsement of the report.
| Individuals | Organizations |
|---|---|
| William Hueston, MD | Robin Bell, MB BS PhD MPH FAFPHM Cert Health Econ |
| Chair, Department of Family Medicine; Professor | Deputy Director of Research |
| Medical University of South Carolina | Jean Hailes Foundation |
| Clayton Victoria, Australia | |
| Michael Klein, MD, CCFP, FAAP, FCFP, ABFP UBC | Leslie Cragin, CNM,PhD |
| BC Children's Hospital | Associate Clinical Professor |
| Emeritus Professor of Family Practice and Pediatrics | UCSF Dept. Ob/Gyn at SFGH |
| BC, Canada | |
| Anne M. Weber, MD, MS | Linda Herrick, RNC, BSN, CCE, CD |
| Director, Female Pelvic Med & Reconstructive Surgery Department of Ob/Gyn University of Pittsburgh | Director |
| Magee-Womens Hospital | Academy of Certified Birth Educators and Labor Support Professionals |
| 2001 East Prairie Circle | |
| Suite I | |
| Patricia McLaughlin, RNC, MPA, MSN | |
| Manager, Professional Development and Clinical Programs | |
| Association of Women's Health, Obstetric and Neonatal Nurses | |
| 2000 L Street, N.W. | |
| Suite 740 | |
| Washington, DC 20036 |
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]Appendixes are provided electronically at http://www.ahrq.gov/clinic/tp/epistp.htm
Appendixes are provided electronically at http://www.ahrq.gov/clinic/tp/epistp.htm
Appendixes are provided electronically at http://www.ahrq.gov/clinic/tp/epistp.htm
Appendixes are provided electronically at http://www.ahrq.gov/clinic/tp/epistp.htm
Appendixes are provided electronically at http://www.ahrq.gov/clinic/tp/epistp.htm
Appendixes are provided electronically at http://www.ahrq.gov/clinic/tp/epistp.htm