Figure 1. Shrinkage Plot for Rate Difference in Conductive Hearing Loss at 6-10 Years Presence of early life OM versus Absence of early life OM
The Agency for Healthcare Research and Quality (AHRQ), through its Evidence-Based Practice Centers (EPCs), sponsors the development of evidence reports and technology assessments to assist public- and private-sector organizations in their efforts to improve the quality of health care in the United States. The reports and assessments provide organizations with comprehensive, science-based information on common, costly medical conditions and new health care technologies. The EPCs systematically review the relevant scientific literature on topics assigned to them by AHRQ and conduct additional analyses when appropriate prior to developing their reports and assessments.
To bring the broadest range of experts into the development of evidence reports and health technology assessments, AHRQ encourages the EPCs to form partnerships and enter into collaborations with other medical and research organizations. The EPCs work with these partner organizations to ensure that the evidence reports and technology assessments they produce will become building blocks for health care quality improvement projects throughout the Nation. The reports undergo peer review prior to their release.
AHRQ expects that the EPC evidence reports and technology assessments will inform individual health plans, providers, and purchasers as well as the health care system as a whole by providing important information to help improve health care quality.
We welcome written comments on this evidence report. They may be sent to: Acting Director, Center for Practice and Technology Assessment, Agency for Healthcare Research and Quality, 6010 Executive Blvd., Suite 300, Rockville, MD 20852.
Carolyn Clancy, M.D.
Director
Agency for Healthcare Research and Quality
Jean Slutsky, P.A., M.S.P.H.
Acting Director, Center for Practice and Technology Assessment
Agency for Healthcare Research and Quality
The authors of this report are responsible for its content. Statements in the report should not be construed as endorsement by the Agency for Healthcare Research and Quality or the U.S. Department of Health and Human Services of a particular drug, device, test, treatment, or other clinical service.
Objectives. The purpose of this evidence-based report is to review the evidence on the natural history of otitis media with effusion (OME), the late effects of early life otitis media on hearing and speech and language development, and the operating characteristics of various methods of diagnosing OME. OME is defined as “fluid in the middle ear without signs or symptoms of ear infection.” The evidence compiled in this report is intended to aid clinicians, health care provider organizations, and others to develop clinical practice guidelines or medical review criteria for OME. The report also identified areas for future research.
Search Strategy. The MEDLINE search strategy used both controlled vocabulary MeSH (Medical Subject Headings) terms and keywords to ensure that all relevant citations were retrieved. Search terms for otitis media with effusion were combined with search terms for natural history, speech and language development, hearing, and diagnosis. The otitis media module included otitis media, otitis media with effusion, suppurative otitis media, allergic otitis media, fluid ear, glue ear, middle ear effusion, mucoid otitis media, nonsuppurative otitis media, secretory otitis media, and serous otitis media. The natural history terms included natural course, natural history, placebo, placebos, resolution, self-limited, self-limiting, untreated, and a variety of terms for spontaneous resolution. The speech and language module included speech and language, speech and language disorders, child language, communication, communication disorders, language development and tests, voice, and voice disorders. The hearing module included hearing and hearing disorders and hearing aids and tests, as well as the text word hearing. The diagnosis module used diagnosis and diagnostic techniques and procedures, as well as the text words audiometry, diagnosis, diagnostic, otoscopy, tympanometry history, speech and language development, hearing, and diagnosis.
Selection Criteria. Excluded were studies on patients with immunodeficiencies, craniofacial anomalies (including cleft palate), primary mucosal disorders, or genetic conditions. Prospective cohort studies were included for questions that addressed natural history, speech, language, and hearing. Prospective diagnostic studies were used to evaluate the operating characteristics of diagnostic methods.
Data Collection and Analysis. Two physicians or one physician and one health services researcher independently screened all titles and/or abstracts for potential inclusion, evaluated the quality of the articles, and abstracted data from full-length articles onto pre-designed forms. The selection criteria included human studies that addressed a key question about OME in children. Excluded were case reports, editorials, letters, reviews, practice guidelines, and non-English language publications.
Main Results. We found that 22.5 to 42.7 percent of OME in children older than 3 years of age cumulatively resolves over a period of three months, depending on the definition of OME resolution. Based on a limited number of cohort studies, we found no evidence to support an impact of early life otits media, defined as a history of otitis media at less than 3 years of age, on expressive language, receptive language, or cognitive verbal intelligence at age older than 3 years. However, this evidence is insufficient to exclude the possibility that a clinically important effect does exist, therefore strong conclusions cannot be drawn about the effect of otitis media at an early age on subsequent speech and language development. The generalizability of this finding on speech and language is suspect because the populations represented by the six cohorts utilized in the meta-analyses were primarily those of particular ethnic/racial origin. Moreover, the findings cannot be generalized to children with craniofacial defects, primary mucosal disorders, immunodeficiencies, genetic conditions, or pre-existing developmental disorders, and may not necessarily be generalized to children with persistent bilateral otitis media. Children with early life otitis media have a higher risk of conductive hearing loss, defined using a threshold greater than or equal to 20 dB at any frequency with or without treatment, at age 6 to 10 years than children without early life otitis media. The pooled relative risk of conductive hearing loss was 2.6 (95% CI: 1.6 to 4.2). We found insufficient data to assess early-life OM on permanent (or sensorineural) hearing loss. Among nine diagnostic methods, pneumatic otoscopy had the best apparent performance with a sensitivity of 93.8 percent (95% CI: 91.4%, 96.3%) and a specificity of 80.5 percent (95% CI: 75.1%, 86.0%). However, tester qualifications were reported inconsistently, and training was not specified.
Conclusions. Although these estimates must be viewed with great caution due to heterogeneity that arose from study design and documentation issues for which we could not adjust in our analysis, about 22.5 to 42.7 percent cumulatively resolved over a period of three months, depending on the definition of OME resolution. Our findings on the possible effects of early life otitis media on speech and language development are in general agreement with the 1994 Agency for Healthcare Research and Quality OME guideline conclusion that rigorous, methodologically sound research does not adequately support or refute the possible effect of otitis media on speech and language. We found that children with early life otitis media have a greater risk of conductive hearing loss at age 6 to10 years. In addition, we found that pneumatic otoscopy had the best operating characteristics among the nine alternatives examined, for diagnosing the presence of middle-ear effusion in OME at single points in time.
Considering the abundance of literature addressing otitis media, these findings concerning natural history, speech and language development, and hearing are very limited. Future research on the natural history of otitis media with effusion must focus on improving study quality. In particular, control of therapeutic intervention during the study and the distinction between OME persistence and recurrence needs to be addressed. For evaluation of long-term effects of early life otitis media on speech, language, or hearing, a coordinated approach that uses uniform definitions and considers the interactions of multiple risk factors, interventions, and outcome measures is recommended. Such an integrated approach is also important for the evaluation of diagnostic methods. Further, a systematic review of diagnostic studies that employ algorithms or aggregated scores may be useful.
The purpose of this evidence-based report is to review the evidence on the natural history of otitis media with effusion (OME), the impact of otitis media on long-term speech and language development and on hearing, and the operating characteristics of various methods of diagnosing otitis media with effusion. OME is defined as “fluid in the middle ear without signs or symptoms of ear infection.” The evidence compiled in this report is intended to aid clinicians, health care provider organizations, and others to develop clinical practice guidelines or medical review criteria for OME. The report will also identify areas for future research.
Based on degree of importance (including level of controversy) and feasibility of answering the question, the Technical Expert Panel limited the scope of this evidence report to four key questions. These questions address (1) the natural history of otitis media with effusion (OME), (2) the long-term effects of early-life otitis media, defined as positive otitis media history at less than three years of age, on speech and language development, (3) the long-term effects of early-life otitis media on hearing, and (4) the operating characteristics of various methods of diagnosing otitis media with effusion.
A 12-member Technical Expert Panel that consisted of clinical experts, a consumer, and a representative of a managed care organization convened to provide the following assistance:
advise the project in the ranking of proposed key questions and influencing factors
guide the development of the scope and definition of OME
advise in development of the search strategy, and
review and comment on the plan of analysis.
The Technical Expert Panel and project staff developed a literature search strategy. Project staff searched MEDLINE (1966-January 2000), the Cochrane Library (through January 2000), and EMBASE (1980-January 2000). Additional articles were identified by review of reference lists in proceedings, published articles, reports, and guidelines.
The MEDLINE search strategy used both controlled vocabulary MeSH (Medical Subject Headings) terms and keywords to ensure that all relevant citations were retrieved. The strategy included search terms for otitis media with effusion combined with search terms for natural history, speech and language development, hearing, and diagnosis.
The otitis media module included otitis media, otitis media with effusion, suppurative otitis media, allergic otitis media, fluid ear, glue ear, middle-ear effusion, mucoid otitis media, nonsuppurative otitis media, secretory otitis media, and serous otitis media.
The natural history terms included natural course, natural history, placebo, placebos, resolution, self-limited, self limiting, and untreated, as well as a variety of terms for spontaneous resolution.
The speech and language module included speech and language, speech and language disorders, child language, communication, communication disorders, language development and tests, voice, and voice disorders.
The hearing module included hearing and hearing disorders, hearing aids and tests, and the text word hearing.
The diagnosis module used diagnosis and diagnostic techniques and procedures, as well as the text words audiometry, diagnosis, diagnostic, otoscopy, and tympanometry.
Two physicians or one physician and one health services researcher independently screened all titles and/or abstracts for potential inclusion, evaluated the quality of the articles, and abstracted data from full-length articles onto pre-designed forms. The selection criteria included human studies that addressed a key question about OME in children. Excluded were case reports, editorials, letters, reviews, practice guidelines, non-English language publications, and studies on patients with immunodeficiency disorders or craniofacial anomalies, including cleft palate.
For the natural history question, we used only prospective cohort(s) studies on untreated subjects from which outcome data were abstractable for children up through age 12 years. For the speech and language and hearing questions, we used only prospective cohort studies that fulfilled the following criteria: the degree of OME was determined during the first three years of life, upper age limit was 22 years, the degree of OM was graded in some way, and the outcome was measured when the child was older than age three years. For the diagnostic methods question, we used only prospective studies on children up through 12 years of age that fulfilled four criteria: the diagnostic procedure of interest was performed within 24 hours of the reference standard, was not an algorithm or combination of multiple diagnostic procedures, used one of the acceptable reference standards specified in the scope, and produced abstractable data.
The first step of all analyses was to obtain a distribution of studies stratified by the population characteristics, type of outcome measures, and non-treatment factors. This step provided us with an overview of the emphasis of past research in this area and an opportunity to identify gaps and areas for future research.
In strata with more than three studies, we performed a meta-analysis for a pooled random effects estimate of an outcome with 95% confidence intervals. In addition to deriving the pooled estimate, we evaluated the heterogeneity of the study outcomes. For the evaluation of diagnostic methods, we estimated the sensitivity, specificity, and positive and negative predictive values for each diagnostic procedure compared to a particular reference standard with three or more studies.
This evidence report was reviewed by the Technical Expert Panel as well as an 18-member peer review panel that consisted of content experts, consumers, representatives of managed care organizations, an expert in pediatric pharmacology, and methodologists. All comments received from these individuals were reviewed and acted upon appropriately.
No meta-analyses for children under three years of age were possible, because we could identify only two studies each for the <six months and the three-months-to-three-years age groups. For the over-three-years age group, two sets of meta-analyses showed that 22.5 to 42.7 percent of ears with OME cumulatively resolved over a period of three months, depending on the definition of OME resolution. These estimates must be viewed with caution due to the clinical heterogeneity evident in the data synthesized and due to the weaknesses of design or documentation of the study cohorts. In particular, in most cases investigators did not document whether subjects had received medical or surgical treatment during the course of the study that could affect OME outcome or how compliance with non-treatment was established. Of those investigators who reported how many children received treatment, the majority did not stratify their findings by treatment status.
A few of the studies analyzed OME resolution by influencing factors such as gender, care at home versus daycare, season, side of affected ear, race or ethnicity, or diagnostic instrument. Because of the paucity of such studies, quantitative synthesis was not possible, and we refrain from making any conclusions regarding the effect of these influencing factors on resolution.
As measured by scoring of documentation in the published articles, the quality of the twenty-eight cohort studies on natural history was generally poor.
Half of the studies that attempted to study the natural history of OME did not control for or document control of interventions, either medical or surgical, that might affect OME outcome during the study period. The majority of these studies did not stratify findings by intervention status.
The interval between examinations for OME in these studies ranged from one day to three years. For studies with long follow-up intervals, it was not possible to determine whether the presence of OME was due to persistence or recurrence. The criteria for follow-up varied among studies. Most studies continued follow-up for the duration of the study period regardless of the OME status at a particular exam, but four cohorts discontinued follow-up of individuals who had type A or normal tympanograms at any exam.
Studies that addressed the effects of early-life otitis media on long-term speech and language development among children differed considerably with respect to risk factors studied, type of outcome measured, method of measurement, unit of measurement, age at outcome determination, and study design.
The meta-analyses that could be conducted on long-term expressive language, receptive language, and cognitive verbal intelligence showed no effect of early otitis media as measured during the first three years of life. These findings may not be generalizable, since five of the six cohorts that were included in these three meta-analyses focused primarily on children from specific ethnic/racial groups or from particular socioeconomic groups. Furthermore, the results of these studies cannot be applied to children with craniofacial defects, primary mucosal disorders, immunodeficiency disorders, genetic conditions, or pre-existing developmental disorders, because children with these conditions were excluded from this analysis. In addition, only one of the studies included in these meta-analyses focused solely on persistent bilateral otitis media as opposed to unspecified unilateral or bilateral otitis media.
Few studies on the effects of early-life otitis media on long-term hearing used a prospective cohort study design.
Of the eight cohort studies analyzed, one set of four studies reported percentage of conductive hearing loss at six to ten years of age. For this analysis, the threshold for conductive hearing loss was defined as greater than or equal to 20 dB at any frequency, with or without treatment of otitis media.
The pooled risk of conductive hearing loss at six to ten years among 346 children who had a positive history of early-life OM was 22 percent (95% CI: 7% to 36%). In contrast, the pooled risk ofconductive hearing loss at six to ten years of age among 237 children with no history of early-life OM was 6 percent (95% CI: 1% to 12%). The pooled rate difference of conductive hearing loss at six to ten years of age between children with a positive OM history and those with a negative OM history was 11 percent (95% CI: 3% to 19%). Neither the studies pooled for the rate difference nor the studies pooled for the risk ratio showed significant heterogeneity in the outcomes.
The findings were based on four homogeneous, though very different populations, one from Finland, another from Sweden, one primarily of American Indian children, and another primarily of Eskimo children. The four studies also differed on the definition and collection of OM history and on exclusion factors.
We found insufficient data to assess the impact of early-life OM on permanent (sensorineural) hearing loss.
Based on our evaluation of 52 diagnostic studies, we were able to assess the ability of the following methods to diagnose middle-ear effusion in OME at a single point in time: acoustic reflectometry at ≤5 or >5 reflective units (RU); pneumatic otoscopy; portable tympanometry; professional tympanometry using acoustic reflex at 500 or 1000 Hz; professional tympanometry using static compensated acoustic admittance at 0.1, 0.2, and 0.3; professional tympanometry using B curve as abnormal; and professional tympanometry using B or C2 curves as abnormal. All comparisons used myringotomy as the reference standard.
Among the eight diagnostic methods, the receiver-operator characteristic points (plotting sensitivity against 1 minus specificity) showed that pneumatic otoscopy was closest to the optimal operating point where both sensitivity and specificity would be 100%. However, tester qualifications were reported inconsistently, and training was not specified. The pooled sensitivity was 94 percent (95% CI: 91%, 96%) and the pooled specificity was 80 percent (95% CI: 75%, 86%). These findings were based on 2,694 children from seven studies that reported a pooled prevalence of OME of 63 percent (95% CI: 58%, 67%). The prevalence rate ranged from 56 percent to 71 percent, which indicated significant heterogeneity among outcomes (p<0.001).
Natural History of OME: Literature on the natural history of otitis media with effusion was difficult to interpret because of its generally poor quality, the lack of control for therapeutic interventions, the inability to distinguish persistent from recurrent OME due to the length of follow-up intervals, and the varied criteria for continued follow-up from exam to exam. Differing definitions of OME resolution and diagnostic methods made comparison difficult. Few studies considered the child or the episode as the unit of analysis, included younger children, or assessed types of OME other than newly diagnosed OME of unknown duration. In addition, few studies addressed the possible effects of influencing factors on OME resolution.
Early-Life OM and Long-Term Speech and Language Development: The literature on the long-term effects of early-life otitis media on speech and language development diverged considerably with respect to methodology. As a result, findings could not be combined easily.
Early-Life OM and Long-Term Hearing: Although the literature on the long-term effects of early-life otitis media on hearing was abundant, few studies used a prospective cohort study design. Because of the limited nature of this evidence and because the rate of intervention is highly dependent on the threshold hearing level adopted, the findings of this analysis should be applied with caution.
Diagnostic Methods for OME: Nine comparisons of diagnostic methods enabled derivations of pooled estimates of diagnostic accuracy. However, more comparisons could not be made, including those that would have evaluated clinical signs and/or symptoms, air and/or bone threshold audiometry, binaural micro-tympanoscopy, and non-pneumatic otoscopy. Diagnostic methods that use algorithms or aggregated scorings are important but are not included in this evidence assessment.
Future research on the natural history of otitis media with effusion must focus on improvement of study quality and establishing the effect of OME on long-term outcomes such as speech, language, and hearing. In particular, control of therapeutic interventions during the study and the distinction between OME persistence and recurrence need to be addressed. Adopting a less restrictive definition of non-intervention might simplify the analysis of studies of the natural history of OME. In addition, researchers, in conjunction with clinicians, should agree upon standard procedures for follow-up, including intervals of follow-up, definition of OME resolution, and diagnostic methods, so that resolution rates are indeed comparable. Future research must consider the child as the unit of analysis, since the outcomes of ultimate interest, such as speech, language, and hearing, are functional requirements of a child, not an ear. More research is needed on the role of influencing factors on the natural history of OME, so that the clinician on a particular day in a particular setting can make a better decision when assessing a particular child with particular characteristics.
Evaluation of long-term effects of early-life otitis media on speech, language, or hearing requires a coordinated systematic approach that uses a rational conceptual framework. Such an approach should address the risk factors, interventions, and outcome measures in an integrated fashion. The definition, classification, and type and unit of measure should be developed by a team of experts with the goal of standardizing definitions and approaches. Literature on findings should report both univariate and multivariate findings to enhance understanding of the patient and study characteristics and to allow pooling of data. An integrated approach is also important for the evaluation of diagnostic methods. Such an approach will provide guidance for future studies. Future studies of diagnostic assessments of OME also should consider cost-effectiveness analysis, which can take into account the variable proficiency of clinicians in performing pneumatic otoscopy as well as the consequences of testing and patient preferences. Cost-effectiveness analysis will lead to a more informed decision on the best diagnostic method for OME.
The purpose of this evidence-based report is to review the evidence on the diagnosis, natural history, and late effects of otitis media with effusion (OME) on long-term speech, language, and hearing. The evidence compiled in this report is intended to aid clinicians, health care provider organizations, and others to develop clinical practice guidelines or medical review criteria for OME. The report will also identify areas for future research. Despite the relatively recent publication of the 1994 Otitis Media with Effusion in Children guideline (Stool, Berg, Berman, et al., 1994), the technical experts believed that OME remained a topic worthy of evidence-based inquiry due to the continued controversy over the care of children with OME and the potential availability of new information.
The technical experts initially proposed 20 questions that addressed 10 broad areas related to the diagnosis and treatment of otitis media with effusion: (1) allergens, (2) natural history, (3) speech and language, (4) diagnostic methods, (5) surgical interventions, (6) hearing, (7) antibiotics, (8) steroids, (9) antihistamines and decongestants, and (10) alternative or complementary therapies. These twenty questions were ranked based on the following criteria: (1) degree of potential impact on OME outcomes and on future guideline development and (2) the feasibility of answering the question within the one year time frame and the availability of new information in the literature. The scope of this report covers the four highest ranked questions: (1) the natural history of otitis media with effusion (OME), (2) the long-term impact of early-life otitis media on speech and language, (3) the long-term impact of early-life otitis media on hearing, and (4) the accuracy of methods of diagnosis of otitis media with effusion.
The definition of otitis media has been a complicated issue. Ben H. Senturia, quoted in Bluestone (1999), stated that “In the past, there has been a confusion of terms, in part because of a failure to distinguish conceptually between the disease process, otitis media, and one of the manifestations of that disease process, namely otitis media with effusion. Otitis media is dynamic and at any one time should be considered a single point in a continuum of the disease process.”
Recent comments on the definition of OME point to some of the complex issues involved:
The OME guideline (Stool, Berg, Berman et al., 1994) defined OME as “fluid in the middle ear without signs or symptoms of ear infection.” The guideline listed the following synonyms for OME: serous otitis media, secretory otitis media, allergic otitis media, catarrhal otitis media, nonsuppurative otitis media, mucoid otitis media, secondary otitis media, hydrotubotympanum, exudative catarrh, tubotympanitis, tympanic hydrops, glue ear, fluid ear, middle ear effusion, and tubotympanic catarrh.
Bluestone (1999) defined otitis media with effusion (OME) as “an inflammation of the middle ear with liquid collected in the middle-ear space. The signs and symptoms of acute infection are absent and there is no perforation of the tympanic membrane.” He stated that middle-ear effusions can be acute (< 3 weeks), subacute (3 weeks to 3 months), or chronic (> 3 months). He also stated that researchers should precisely define OME.
Patterson and Paparella (1999) agreed that the different forms of otitis media (OM) are “interrelated and occur in a continuum.” They recognized OME as one of the three major forms of OM, the other two being chronic OM (active or inactive) and silent OM. They classified OME as serous OM, purulent OM, and chronic OM. They divided serous OM into acute serous OM, chronic serous OM, and mucoid OM.
Jung and Hanson (1999) agreed that OM consists of various stages. Though they viewed purulent otitis media, serous otitis media, and mucoid otitis media as different stages, they considered OME to encompass all three, except the early stages of acute otitis media (AOM).
Paradise (1995) also agreed that “AOM and OME constitute elements in an otitis media disease spectrum, that there often is a transition zone between them and that the two conditions sometimes may be indistinguishable from each other diagnostically.”
For this evidence report, the Technical Expert Panel decided to use the definition used in the OME Guideline (Stool, Berg, Berman et al., 1994): “fluid in the middle ear without signs or symptoms of ear infection.”
Various methods have been proposed for the diagnosis of OME. The OME guideline panel drew several conclusions regarding diagnosis of OME (Stool, Berg, Berman et al., 1994). They recommended the use of pneumatic otoscopy as the primary diagnostic method with tympanometry as a confirmatory diagnostic method. These recommendations were based on limited scientific evidence and strong panel consensus and on limited scientific evidence and expert opinion, respectively. The OME guideline panel found no evidence linking the outcome of algorithms that combine the results of pneumatic otoscopy and tympanometry to the presence of middle-ear effusion. In addition, the panel believed that the evidence was insufficient to make any recommendation regarding the use of acoustic reflectometry in the diagnosis of OME. Finally, the panel decided not to make a recommendation on the use of tuning fork tests in the diagnosis of OME due to the lack of adequate studies. The OME guideline panel did not present any meta-analyses on diagnostic methods.
Pneumatic otoscopy is performed with a handheld unit that consists of a light source, a magnifying lens, and a speculum. The otoscope allows visual inspection of the tympanic membrane as well as the external ear canal. With the speculum securely in place, the degree of movement of the tympanic membrane in response to pneumatic pressure may be observed. Decreased tympanic membrane mobility in response to pneumatic pressure is believed to be related to the presence of middle-ear effusion as found in OME. (Carlson and Stool, 1999)
Tympanometry is performed by inserting into the ear a probe that emits a tone and measures the amount of sound energy reflected from the tympanic membrane as a function of ear canal air pressure. The instrument may or may not be handheld. The output of tympanometric measurement may be qualitative, that is, tympanogram patterns, or quantitative, for example static admittance, equivalent ear volume, tympanometric width, tympanometric peak pressure, or acoustic reflex. The flat or type B tympanogram is believed to be associated with the presence of middle-ear effusion. The type A tympanogram is believed to indicate normal middle-ear status. The relationship of the type C tympanogram to middle-ear status is less clear. (Carlson and Stool, 1999; Nozza, 1996)
The acoustic middle-ear muscle reflex, either ipsilateral or contralateral, may also be measured by acoustic emittance instruments and represents the contraction of the stapedius and tensor tympani in response to sound stimulation. Its absence may be related to the presence of middle-ear effusion depending on the clinical situation (Nozza, 1996).
Acoustic reflectometry is performed using a handheld instrument that measures the response of the tympanic membrane to a frequency-sweep sound spectrum. The spectral gradient angle, which is a function of the frequency and amplitude, may be related to middle-ear effusion presence. (Carlson and Stool, 1999; Nozza, 1996)
Evoked otoacoustic emissions are a measure of ear canal sounds that are generated in the cochlea. These sounds have the potential for clinical application (Nozza, 1996).
Audiometry measures hearing acuity, using behavioral or non-behavioral methods, at various sound frequencies. It is known that children may have decreased hearing in the presence of middle-ear effusion (Carlson and Stool, 1999). Although no “universal agreement” appears to exist regarding the definition of hearing loss, a hearing threshold no worse than 15 decibels (dB) is considered normal in children, and 20 dB may be considered normal in older children (Madell, 1999)
Accurate estimates of the prevalence or incidence of OME were not found, since published population-based estimates are not available on the specific diagnosis of OME. Data on office visits reported from the National Ambulatory Medical Care Survey (NAMCS) provide the best indication of prevalence and incidence of the disease, although nonsuppurative, suppurative, and unspecified otitis media were all grouped into the term otitis media, and OME was not separated from AOM in the analyses (Schappert, 1992;Schappert, 1996; Woodwell, 1997a; Woodwell, 1997b; Woodwell and Schappert, 1995). Gates (1996), commenting on the NAMCS data, stated, “for children it is probably safe to presume that AOME [AOM with effusion, i.e. AOM] is the principal disorder noted in these surveys.” The OME Guideline panel of the Agency for Health Care Policy and Research [presently the Agency for Healthcare Research and Quality (AHRQ)] estimated that 25 percent to 35 percent of the NAMCS visits for otitis media were for OME (Stool, Berg, Berman et al., 1994a).
Schappert (1992) reported on the 1975 to 1990 NAMCS data. Visits by patients younger than 15 years of age constituted 70.6 percent of all office visits with the principal diagnosis of otitis media in 1975, 78.9 percent of all office visits with the principal diagnosis of otitis media in 1980, 81.9 percent of all office visits with the principal diagnosis of otitis media in 1985, and 80.5 percent of all office visits with the principal diagnosis of otitis media in 1990. From 1975 to 1990, the percent of office visits with otitis media as the principal diagnosis increased among those less than 15 years of age; from 7.3 percent to 17.4 percent for children under 2 years old, from 10.4 percent to 18.1 percent for the 2-5 year olds, from 6.9 percent to 10.5 percent for the 6-10 year olds, and from 2.6 percent to 5.2 percent for the 11-14 year olds. The number of visits with a principal diagnosis of otitis media per 100 persons per year for the same time period (1975 to 1990) increased from 31.5 to 102.1 for children less than 2 years of age, 20.8 to 47.8 for those 2-5 years of age, 10.2 to 18.2 for those 6-10 years of age, and 3.3 to 8.0 for those 11-14 years of age.
Rosenfeld (1994) noted that about a quarter of OME cases are discovered incidentally during well-child examinations. About 60 percent of children would have OME by 2 years old and 80 percent before school entry. The Agency for Healthcare Research and Quality 1994 OME guideline reported that in one study of children 2 to 6 years old in group child care, 53 percent had at least one episode of OME during the first year of the study, 61 percent had at least one episode during the second year of the study, and 30 percent had recurrent OME (Stool, Berg, Berman et al., 1994).
NAMCS also stratified data by specific physician type. From 1975 to 1990, the percent of office visits with a principal diagnosis of otitis media increased for pediatricians from 8.1 percent to 14.3 percent, for general practitioners and family physicians from 1.3 percent to 3.5 percent, and for otolaryngologists from 12.8 percent to 20.2 percent.
Data for 1975 to 1990 were also stratified by age. In 1990, the number of visits with a principal diagnosis of otitis media per 100 persons per year among children younger than 2 years was 62.9 for pediatricians, 24.0 for general practitioners and family physicians, and 9.1 for otolaryngologists. In 1990, the number of visits with a principal diagnosis of otitis media per 100 persons per year among children 2 to 5 years old was 29.0 for pediatricians, 11.4 for general practitioners and family physicians, and 6.6 for otolaryngologists (Schappert, 1992).
The reports on the NAMCS data for 1993 to 1996 did not stratify by age (Schappert, 1996; Woodwell, 1997a; Woodwell, 1997b; Woodwell and Schappert, 1995). However, if the 1993 to 1996 data were similar to that in 1975 to 1990, it would be reasonable to conclude that the majority of these patients were younger than 15 years of age.
NAMCS office visit data for 1993 to 1996 generally support earlier data. Suppurative and unspecified otitis media was the third most frequently listed principal diagnosis in 1993, the sixth most frequent in 1994, the fourth most frequent in 1995, and the seventh most frequent in 1996 for ambulatory care visits to physician offices, hospital outpatient departments, and emergency departments (Schappert, 1996; Woodwell, 1997a; Woodwell, 1997b; Woodwell and Schappert, 1995). In 1996, visits for a principal diagnosis of otitis media and eustachian tube disorders occurred 82.8 percent of the time in physician offices, 5.3 percent in hospital outpatient departments, and 11.9 percent in emergency departments (Woodwell, 1997a).
The NAMCS also provided data on the duration of office visits for otitis media. The percent of visits for otitis media of duration 6-15 minutes increased between 1975 to 1990 from 64 percent to 78 percent and was associated with a decrease in visits less than six minutes from 24 percent to 13 percent (Schappert, 1992). In terms of surgical procedures, the rate of ambulatory surgery visits per 10,000 population for those younger than 15 years of age for otitis media and eustachian tube disorders was 86.9 in 1994 and 83.9 in 1995, based on 498,000 and 484,000 visits respectively ( Hall and Lawrence, 1997; Kozak, Hall, Pokras et al., 1997). In 1995, the number of myringotomy with tympanostomy tube placements reported by NAMCS was 521,000 for a rate of 90.2 such procedures per 10,000 children less than 15 years of age (Hall and Lawrence, 1997).
In general, the NAMCS data demonstrated the significance of otitis media—and by implication OME—based on the prevalence and incidence of the disease and the frequency and duration of visits and surgical interventions.
The treatment, complications and sequelae, and adverse effects of otitis media, including OME, are a substantial financial burden to the nation. Three estimates of the cost of otitis media (OM) are found in the literature (Gates, 1996b; Stool and Field, 1989; Stool, Berg, Berman et al., 1994). A fourth study reported the cost per episode of “persistent middle ear infection” (Berman, Roark, and Luckey, 1994). We have assessed the strengths and weaknesses of these four studies and presented our own estimate of otitis media costs in a previous evidence-based analysis (Takata, Chan, Shekelle, et al., in press).
Gates (1996b) placed the annual national total cost of acute otitis media at $3.15 billion for the 0- to 4-year old age group in an unspecified date in the early or mid-1990's and placed the cost of treatment of chronic otitis media with effusion at $1.854 billion dollars per year (Gates, 1996). Stool and Field (1989) put the national cost of otitis media at $2.4-3.4 billion in the 0- to 6-year old age group in an unspecified year presumably in the middle or late 1980's. Both Gates (1996b) and Stool and Field (1989) assumed prevalences of otitis media that were at variance with the only available national data on the utilization of care visits for otitis media. Stool, Berg, Berman, et al. (1994) presented the only estimate of the national cost of otitis media using a data-based estimate of costs per case. Using claims data from more than 100 health insurers, they estimated the overall average cost of treating a 2-year old child with OME in 1991 to be $1,330 and the national total cost of OME in 1991 to be $1.09 billion. Berman, Roark, and Luckey (1994) estimated the cost of treating persistent middle ear effusion in a 13-month old boy at $720-$1,372 under Colorado Medicaid reimbursement levels in 1992 and $1,265-$2,588 under private practice reimbursement rates. Not all these studies included the indirect cost of family caregiving services required due to a child having otitis media.
An attempt to provide an updated estimate on the cost of otitis media included insights into the cost of OME and chronic middle ear infection (Takata, Chan, Shekelle, et al., in press). This estimate was derived for the year 1995, referred to children under 18 years of age, and was based on reports of national, rather than regional, utilization for otitis media such as the National Ambulatory Medical Care Survey and the National Health Interview Survey. Using these national data sets, it was estimated that 2.22 million episodes of OME or chronic middle ear infection occurred in 1995. It was assumed that eighty percent of these episodes were unrelated to acute otitis media. Data from three sources (Stool, Berg, Berman et al., 1994; Berman, Rourke, and Lucky, 1994; and the U.S. Bureau of the Census, 1992,1996) were used to estimate direct, indirect, and total costs of $1,321, $490.25, and $1,811 for treatment of an episode of OME or chronic middle ear infection. For 1995, the total national cost of treating OME or chronic middle ear infection would stand at $4.02 billion.
Whether based on the four prior estimates of otitis media or OME cost or the more recent estimate, the economic burden of OME on the nation is large. Any effort to improve cost-effective care of OME will result in significant savings in national medical expenditures as well as improved quality of care provided to children with OME.
Eleven organizations were contacted for technical expert and peer reviewer nominations. They included the American Academy of Family Physicians (AAFP), the American Academy of Pediatrics (AAP), the American Academy of Otolaryngology-Head and Neck Surgery Foundation (AAO-HNS), the Ambulatory Pediatric Association (APA), the American Academy of Audiology (AAA), the American Speech-Language-Hearing Association (ASHA), the Society for Ear Nose and Throat Advances in Children (SENTAC), the National Association of Pediatric Nurse Associates and Practitioners (NAPNAP), the American Association of Health Plans (AAHP), the Foundation for Accountability (FACCT), and Family Voices.
| Technical Expert | Area of Expertise | Affiliation/Location |
|---|---|---|
| Larry Culpepper, MD, MPH | Family Medicine | Boston Medical Center, MA |
| Douglas G. Long, MD | Family Medicine | Manchester Community Health Center, NH |
| Richard M. Rosenfeld, MD, MPH | Otolaryngology | SUNY Health Science Center Brooklyn, NY |
| Norman Wendell Todd, Jr., MD | Otolaryngology | Emory University, GA |
| Allan Lieberthal, MD | Pediatrics | Southern California Kaiser Permanente Medical Group, CA |
| Anthony Magit, MD | Pediatric Otolaryngology | Children's Hospital San Diego, CA |
| Jack Paradise, MD | Pediatrics | Children's Hospital, Pittsburgh, PA |
| Ross Miller, MD | Quality Management | CIGNA Health Care, CA |
| Joanne Roberts, PhD | Speech and Hearing | University of North Carolina, NC |
| Lisa L. Hunter, PhD | Audiology | University of Minnesota, MN |
| Linda Carlson, MS, RN, CPNP | Nurse Practitioner | Statesboro, GA |
| Fran Goldfarb, MA | Consumer | Family Voices, Los Angeles, CA |
A draft work plan for the topic assessment and refinement phase was mailed to the technical experts and representatives of our partners (the American Academy of Pediatrics, the American Academy of Family Practice, and the American Academy of Otolaryngology-Head and Neck Surgery) for review and comments together with a preliminary review that provided a summary of:
Incidence and prevalence of otitis media with effusion, treatment and management alternatives, the characteristics and size of the affected populations, and the most affected practice settings and providers;
The burden of illness associated with otitis media with effusion, including morbidity and mortality.
Extent to which variation exists in practices associated with the prevention, diagnosis, treatment, or diagnosis and treatment of otitis media with effusion.
We reviewed and compiled four previously conducted evidence-based analyses on otitis media with effusion. Particular attention was given to the report of The Otitis Media Guideline Panel on “Managing Otitis Media with Effusion in Young Children” ( Stool, Berg,Berman et al., 1994) We distributed these reports to the panel of technical experts for review and for preparation of the initial telephone conference call during which assessment and refinement of the topics wereassessed and refined.
| 1. | What is the relative risk of developing OME in the child who has food or inhalant allergies compared to the child without food or inhalant allergies |
| 2. | What is the natural history for various types of OME? |
| 3. | What is the long-term level of speech and language development in infants and preschool children with untreated OME? What are the high risk groups? |
| 4. | What is the accuracy of various diagnostic methods? |
| 5. | When should conservative treatment (non-surgical) be considered a failure? |
| 6. | What is the evidence on effectiveness of various diagnostic instruments in deciding on intervention for OME? |
| 7. | What is the evidence regarding level of hearing decrease and whether unilateral or bilateral hearing decrease is an indication for intervention? |
| 8. | What is the effectiveness of the use of hearing levels to decide on intervention for OME? |
| 9. | Are antibiotics more effective than placebo in treating OME? |
| 10. | Are steroids more effective than placebo in treating OME? |
| 11. | Do antibiotics add an incremental benefit to steroids in treating OME? |
| 12. | Are interventions for allergies (food or inhalant) more effective than placebo in treating OME? |
| 13. | Are antihistamines and/or decongestants more effective than placebo in treating OME? |
| 14. | Are tympanostomy tubes more effective than other interventions in treating OME? |
| 15. | Is adenoidectomy more effective than other interventions in treating OME of greater than 3 months duration? |
| 16. | Is tonsillectomy more effective than other interventions in treating OME of greater than 3 months duration? |
| 17. | Is myringotomy more effective than other interventions in treating OME of greater than 3 months duration? |
| 18. | Are alternative or complementary therapies more effective than other interventions in treating OME of greater than 3 months duration? |
| 19. | Are prophylactic antibiotics more effective than other interventions in treating OME? |
| 20. | What is the effectiveness of monitoring by pneumatic otoscopy, tympanometry, acoustic reflectometry with spectral gradient, and otoacoustic emissions to decide on intervention for OME? |
importance, which included
potential impact on OME outcomes and
potential impact on development of future OME guidelines by the partner organizations; and
feasibility, which included
possibility of conducting a literature search, review, and data synthesis in 6 months,
availability of sufficient information (data) in the literature to answer the question, and
if applicable, sufficient new information (data) available to affect the results of the last systematic review of the question significantly.
| Topic of question | Rank Total | E1 | E2 | E3 | E4 | E5 | E6 | E7 | E8 | E9 | E10 | E11 | E12 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1. | Food or inhalant allergies | 8 | 5 | 3 | ||||||||||
| 2. | Natural history | 78 | 10 | 9 | 10 | 9 | 7 | 9 | 4 | 2 | 6 | 7 | 5 | |
| 3. | Speech and language development | 97 | 8 | 10 | 4 | 8 | 9 | 5 | 8 | 5 | 10 | 10 | 10 | 10 |
| 4. | Accuracy of diagnostic methods | 57 | 5 | 10 | 6 | 10 | 6 | 9 | 8 | 1 | 2 | |||
| 5. | Conservative treatment | 39 | 9 | 5 | 6 | 4 | 1 | 3 | 1 | 9 | 1 | |||
| 6. | Diagnostic instruments vs intervention | 38 | 7 | 8 | 6 | 9 | 8 | |||||||
| 7. | Level of hearing decrease | 67.5 | 7 | 7 | 8 | 4 | 9 | 6.5 | 9 | 8 | 9 | |||
| 8. | Hearing levels and intervention | 16.5 | 6 | 1 | 3 | 6.5 | ||||||||
| 9. | Antibiotics versus placebo | 51 | 6 | 4 | 9 | 6 | 5 | 1 | 7 | 5 | 2 | 6 | ||
| 10. | Steroids versus placebo | 23 | 4 | 7 | 1 | 6 | 5 | |||||||
| 11. | Antibiotics and steroids | 22 | 5 | 2 | 3 | 3 | 5 | 4 | ||||||
| 12. | Treatment for allergies vs placebo | 2 | 1 | 1 | ||||||||||
| 13. | Antihistamines/decongestants vs placebo | 10 | 10 | |||||||||||
| 14. | Tympanostomy tubes vs other interventions | 51 | 3 | 1 | 8 | 10 | 2 | 2 | 4 | 4 | 3 | 6 | 8 | |
| 15. | Adenoidectomy vs other interventions | 32 | 7 | 2 | 1 | 5 | 3 | 2 | 2 | 3 | 7 | |||
| 16. | Tonsillectomy vs other interventions | 2 | 2 | |||||||||||
| 17. | Myringotomy vs other interventions | 5 | 2 | 3 | ||||||||||
| 18. | Alternative/complementary therapies vs other interventions | 10 | 2 | 7 | 1 | |||||||||
| 19. | Prophylactic antibiotics vs other interventions | 15 | 4 | 3 | 4 | 4 | ||||||||
| 20. | Effectiveness of diagnostic methods for monitoring | 36 | 8 | 9 | 8 | 8 | 3 | |||||||
Note: Kendall Coefficient of Concordance = 0.36, p=0.0001.
| Potential Key Questions | Comments/Notes | |
|---|---|---|
| 1. | What is the relative risk of developing OME in the child who has food or inhalant allergies compared to the child without food or inhalant allergies? |
|
| 2. | What is the natural history for various types of OME? | |
| 3. | What is the long-term level of speech and language development in infants and preschool children with untreated OME? What are the high risk groups? |
|
| 4. | What are the accuracy of various diagnostic methods? |
|
| 5. | When should conservative treatment (non-surgical) be considered a failure? |
|
| 6. | What is the evidence on effectiveness of various diagnostic instruments in deciding on intervention for OME? |
|
| 7. | What is the evidence regarding level of hearing decrease and whether unilateral or bilateral hearing decrease is an indication for intervention? |
|
| 8. | What is the effectiveness of the use of hearing levels to decide on intervention for OME? |
|
| 9. | Are antibiotics more effective than placebo in treating OME? |
|
| 10. | Are steroids more effective than placebo in treating OME? |
|
| 11. | Do antibiotics add an incremental benefit to steroids in treating OME? |
|
| 12. | Are interventions for allergies (food or inhalant) more effective than placebo in treating OME? |
|
| 13. | Are antihistamines and/or decongestants more effective than placebo in treating OME? |
|
| 14. | Are tympanostomy tubes more effective than other interventions in treating OME? |
|
| 15. | Is adenoidectomy more effective than other interventions in treating OME of greater-than 3 months duration? |
|
| 16. | Is tonsillectomy more effective than other interventions in treating OME of greater-than 3 months duration? |
|
| 17. | Is myringotomy more effective than other interventions in treating OME of greater-than 3 months duration? |
|
| 18. | Are alternative or complementary therapies more effective than other interventions in treating OME of greater-than 3 months duration? |
|
| 19. | Are prophylactic antibiotics more effective than other interventions in treating OME? |
|
| 20. | What is the effectiveness of monitoring by pneumatic otoscopy, tympanometry, acoustic reflectometry with spectral gradient, and otoacoustic emissions to decide on intervention for OME? |
|
| Epidemiology |
| |
| Definition of AOM |
| |
The four top ranking key questions were selected for consideration in the evidence report. After several revisions at the suggestions of the technical experts, the wordings of the four top ranking (key) questions are as follows:
What is the natural history (spontaneous resolution rate over time without treatment) for:
OME persisting after a discrete episode of acute otitis media,
newly diagnosed OME of unknown duration (unilateral or bilateral),
OME persisting for weeks or months (unilateral or bilateral),
unilateral OME lasting 3 months or longer,
bilateral OME lasting 3 months or longer?
Do infants and preschool children with longer duration early-life OME have greater delays in speech and language development (receptive or expressive) later in life as compared to those with shorter duration OME? One specific formulation of this question is: Is OME-associated conductive hearing loss in the first 3 years of life a risk factor for speech and language developmental delays?
What are the risk factors that modulate the effect of OME on speech and language development in infants and preschool children?
Do infants and preschool children with longer duration early-life OME as compared to those with shorter duration OME have permanent (or sensorineural) hearing loss later in life? One specific formulation of this question is: Is OME-associated conductive hearing loss in the first 3 years of life a risk factor for permanent (or sensorineural) hearing loss later in life?
What are the risk factors that interact with the effect of OME on hearing later in life (unilateral or bilateral) in infants and preschool children?
What are the sensitivity, specificity, and predictive values for alternative methods of diagnosing OME compared with one of the reference standards?
These methods include, but are not limited to:
signs/symptoms
non-pneumatic otoscopy
pneumatic otoscopy, validated or un-validated examiner
binocular micro-tympanoscopy
portable tympanometer
professional tympanometer
quantitative tympanometry
acoustic reflectometry (specify model and year)
otoacoustic emissions
audiometry, air or. bone conduction thresholds.
The reference standards to be used in evaluating these diagnostic tests will include tympanocentesis, sedated or non-sedated; MRI; myringotomy, sedated or non-sedated; validated pneumatic otoscopy; and CT Scan.
The project staff developed a causal pathway and a scope for each of the key questions. We distributed these documents to the panel of technical experts for review and for preparation for the initial telephone conference call. During this call, we assessed and refined the topics and discussed the proposed key questions, target condition, patient populations, clinical context, interventions, and outcomes of interest. The following characteristics of outcomes were proposed: 1) outcomes would be divided into short term and long term; 2) long term outcomes would consist of percent time with effusion, frequency of acute otitis media, hearing loss, speech and language performance, cognition, academic achievement, and other developmental outcomes; 3) duration of short term outcomes would be defined as four weeks or less or eight weeks or less; and 4) duration of long term outcomes would be defined as greater than one year.
During this first conference call, the Technical Expert Panel decided to use the OME Guideline definition of OME: “fluid in the middle ear without signs or symptoms of ear infection”. (Stool, Berg, Berman et al., 1994) Though the technical experts agreed on the definition of OME they could not agree on which signs or symptoms should be absent, i.e. what signs or symptoms differentiated OME from acute otitis media. During the first conference call, the technical experts advised us to avoid the use of the terms ‘acute’, ‘subacute’, or ‘chronic’ as descriptors of OME. Instead we should use the duration of OME, such as “under 3 months” versus “greater than or equal to 3 months” as descriptors.
![]() |
| What is the natural history (spontaneous resolution rate over time without treatment) for: |
| •OME persisting after a discrete episode of acute otitis media, |
| •newly diagnosed OME of unknown duration (unilateral or bilateral) |
| •OME persisting for weeks or months (unilateral or bilateral) |
| •unilateral OME lasting 3 months or longer, |
| •bilateral OME lasting 3 months or longer? |
![]() |
![]() |
![]() |
![]() |
Based on the causal pathways and the discussions during the two conference calls, we developed the scope for each of the four key questions. The scope specifically defined the disease entity, study population, practice settings (including provider type), time period in practice setting, exclusion factors, interventions, influencing factors, outcome measures, literature sources, language, and study design for each key question to be included in the evidence report. We conducted a second poll of the technical experts on each of these domains in which we sought their approval, disapproval, or recommendations for revision on each domain.Appendix B presents the version of the scope distributed to the technical experts for polling of their comments and approval. Appendix C presents the results of the polling of the experts' comments on the scope. In response to these comments, we further revised the key questions, causal pathways, and the various domains of the scope. We incorporated comments from technical experts to the extent possible, except those related to other domains, those that were obvious misunderstandings or misinterpretations, or those suggestions for deletions or additions that could be handled during the analysis phase of the project. Specifically, the project team retained several influencing factors recommended for deletion by some experts. The project team took note of the deletions and would later stratify the analysis, if possible, by the group of factors unanimously recommended by the experts versus those that were not. Such stratified analysis would depend on the number of studies that specifically address these factors. We revised the key questions according to experts' suggestions. We further revised the scope and reworded the key questions according to the final round of comments: the final version of the scope is included in Appendix D.
In preparation for supplemental analysis, we took polls to solicit the technical experts' opinion on the importance of the risk factors identified in the analytical framework. Specifically, we asked the experts the following questions for each factor:
Regarding Key Question 1: “Does this factor influence the natural history of OME?”
Regarding Key Question 2: “Does this factor have an independent effect on speech and language development separate from its effects on OME or unspecified OM?”
Regarding Key Question 3: “Does this factor have an independent effect on long-term hearing separate from its effects on OME or unspecified OM?”
Regarding Key Question 4: “Does this factor have an independent effect on the accuracy of a diagnostic method separate from its effects on OME or unspecified OM?”
The experts had a choice of responding “yes”, “no”, or “don't know.” For each such opinion, the experts were asked to indicate the basis of their opinion by choosing one or more of the following: “Judgment/Experience,” “Theoretical Construct,” or “Literature”.
| Key Question 1: Natural History | |
|---|---|
| Factors | Total “Yes” Based on Responses of 11 Technical Expertsa |
| total duration of OME (≥3 mos) | 10 |
| otitis prone (AOM) | 9 |
| number of previous OMEs | 9 |
| number of hours attending child care center | 9 |
| tobacco smoke exposure | 9 |
| season of the year | 9 |
| age at first OM | 9 |
| not breast-fed | 9 |
| Allergies | 8 |
| age of child | 8 |
| family history of OME | 7 |
| number of children in household | 7 |
| prior tubes | 7 |
| ethnicity/race | 6 |
| barotrauma challenges | 6 |
| prior adenoidectomy | 5 |
| socioeconomic status | 5 |
| laterality, unilateral versus bilateral | 4 |
| Gender | 3 |
| skill to diagnose (validated) | 3 |
| age of onset of previous OME | 3 |
| tympanometry | 3 |
| monitoring frequency | 2 |
| monitoring time | 2 |
| MRI | 2 |
| type of examiner | 2 |
| setting of care | 1 |
| parent/caregiver education | 1 |
| hearing level, conductive versus sensorineural | 1 |
| primary provider | 1 |
| acoustic reflectometry | 1 |
| Otoscopy | 1 |
| pneumatic otoscopy | 1 |
| parent/caregiver preference for treatment | 1 |
| developmental delay | 0 |
11 technical expterts responded; 1 astained
| Key Question 2: Speech and Language | |
|---|---|
| Factors | Total “Yes” Based on Responses of 12 Technical Expertsa |
| developmental delay | 11 |
| quality of child care | 10 |
| hearing level, conductive versus sensorineural | 10 |
| parent/caregiver education | 10 |
| quality of parent-child interaction | 10 |
| socioeconomic status | 8 |
| laterality, unilateral versus bilateral | 8 |
| early intervention program | 7 |
| total duration of OME (≥3 mos) | 7 |
| Gender | 6 |
| number of children in household | 6 |
| duration of middle ear effusion | 6 |
| chronic illness of any type | 6 |
| number of hours attending child care center | 5 |
| number of previous OMEs | 4 |
| presence of active ear disease | 4 |
| ethnicity/race | 3 |
| tobacco smoke exposure | 3 |
| OM complications | 3 |
| child temperament | 3 |
| Allergies | 2 |
| ambient noise | 2 |
| age at first OM | 1 |
| not breast-fed | 1 |
| skill to diagnose (validated) | 1 |
| type of examiner | 0 |
| setting of care | 0 |
| recheck times | 0 |
| frequency of recheck | 0 |
| primary provider | 0 |
| tympanometry | 0 |
| acoustic reflectometry | 0 |
| pneumatic otoscopy | 0 |
| MRI | 0 |
| equipment | 0 |
| audiometry | 0 |
Note:Items in bold were added in the second poll after the first poll.
| Key Question 3: Hearing | |
|---|---|
| Factors | Total “Yes” Based on Responses of 12 Technical Experts |
| OM complications | 10 |
| laterality, unilateral versus bilateral | 7 |
| hearing level, conductive versus sensorineural | 7 |
| developmental delay | 6 |
| presence of active ear disease | 6 |
| total duration of OME (≥3 mos) | 4 |
| number of previous OMEs | 3 |
| duration of middle ear effusion | 3 |
| chronic illness of any type | 3 |
| ambient noise | 3 |
| Allergies | 2 |
| age at first OM | 1 |
| ethnicity/race | 1 |
| socioeconomic status | 1 |
| quality of child care | 1 |
| early intervention program | 1 |
| tobacco smoke exposure | 1 |
| number of children in household | 1 |
| child temperament | 1 |
| Equipment | 1 |
| Audiometry | 1 |
| Gender | 0 |
| number of hours attending child care center | 0 |
| not breast-fed | 0 |
| parent/caregiver education | 0 |
| quality of parent-child interaction | 0 |
| skill to diagnose (validated) | 0 |
| type of examiner | 0 |
| setting of care | 0 |
| recheck times | 0 |
| frequency of recheck | 0 |
| primary provider | 0 |
| Tympanometry | 0 |
| acoustic reflectometry | 0 |
| pneumatic otoscopy | 0 |
| MRI | 0 |
Note:Items in bold were added in the second poll after the first poll
| Key Question 4: Diagnostic Tests | |
|---|---|
| Factors | Total “Yes” Based on Responses of 11 Technical Expertsa |
| age of child | 11 |
| Otolaryngologist | 6 |
| nurse practitioner | 5 |
| Pediatrician | 5 |
| family physician | 5 |
| laterality, unilateral versus bilateral | 5 |
| Anesthetic | 5 |
| age at first OM | 5 |
| developmental delay | 4 |
| physician assistant | 4 |
| Others | 3 |
11 technical experts responded; 1 abstained.
The Technical Expert Panel and project staff developed a literature search strategy. The literature search included the search of three databases: MEDLINE (1966-January2000), the Cochrane Library (through January 2000), and EMBASE (1980 -January 2000). We identified additional articles by review of reference lists in proceedings, published articles, reports, and guidelines.
The project librarian developed an overall search strategy for MEDLINE (Appendix G) that incorporated the input from the technical experts and followed the scope of the project. The MEDLINE database is produced by the U.S. National Library of Medicine and is widely recognized as the premier source for bibliographic coverage of biomedical literature. It encompasses information from Index Medicus, Index to Dental Literature, and International Nursing, as well as other sources of coverage in the areas of allied health, biological and physical sciences, humanities, and information science as they relate to medicine and health care, communication disorders, population biology, and reproductive biology. We searched the MEDLINE database for publications dating back to 1966. Further, we included articles in the English language only for the following reasons. First, our experience with our evidence assessment of the management of acute otitis media (Takata, Chan, Shekelle et al., in press; Chan, Takata, Shekelle et al., in press) demonstrated a low yield from non-English language publications (only two studies accepted out of 97 reviewed and both of theses were also published in English). Second, we needed to balance limited resources between reviewing non-English language literature and answering additional key questions. Since empiric evidence of the need to include non-English language literature in meta-analyses was mixed (Moher, Pham, Klassen et al., 2000) and reviewing non-English literature would be resource intensive, we chose to limit our scope to English language literature only.
The MEDLINE search strategy used both controlled vocabulary terms and keywords. The strategy was organized into modules or clusters of search statements. The main groupings included: otitis media with effusion (OME), mastoid, otitis media; natural history; speech and language; hearing; and diagnosis. These groupings corresponded to the key questions.
For the “otitis media with effusion” concept, both the controlled vocabulary term otitis media with effusion and text word were used. A variety of additional terms were used; such as allergic otitis media, fluid ear, glue ear, middle ear effusion, mucoid otitis media, nonsuppurative otitis media, secretory otitis media, and serous otitis media. For the “mastoid” concept, both the controlled vocabulary and the text word were used. The otitis media module included what is referred to as an “explode” of otitis media, which included the controlled vocabulary headings “otitis media”, “otitis media with effusion”, and “otitis media, suppurative.”
The “natural history” module combined “OME” or “mastoid” with a combination of text words and controlled vocabulary terms for natural history including “natural course”, “natural history”, “placebo”, “placebos”, “resolution”, “self limited”, “self limiting”, “untreated”, and a variety of terms for spontaneous resolution.
Both the “speech and language” module and the “hearing” module combined OME, or mastoid, or an explode of “otitis media” with the speech, language, and hearing concepts. The speech and language component used the controlled vocabulary terms for speech and language, speech and language disorders, child language, communication, communication disorders, language development and tests, voice, and voice disorders. In addition, the text words “speech” and “language” were added. The hearing module used the controlled vocabulary terms for hearing and hearing disorders and hearing aids and tests, as well as the text word “hearing”.
The “diagnosis” module combined OME or mastoid with a combination of text words and controlled vocabulary terms for diagnosis. In addition to the controlled vocabulary terms for diagnosis and diagnostic techniques and procedures, a number of text words were added for audiometry, diagnosis, diagnostic, otoscopy, and tympanometry.
We customized the search strategy initially developed for MEDLINE for EMBASE. EMBASE, the Excerpta Medica database, produced by Elsevier Science, is a major biomedical and pharmaceutical database that indexes over 3,800 international journals. EMBASE is one of the most widely used biomedical and pharmaceutical databases. The database currently contains over 6 million records, with more than 400,000 citations and abstracts added yearly. We searched the EMBASE database for citations dating back to 1980. For the search in the Cochrane Library, we used “otitis media” as the search term. The Cochrane Library contains several databases: (1) The Cochrane Database of Systematic Reviews, which contains Cochrane reviews published by the Cochrane Collaboration, an international organization dedicated to applying evidence-based-medicine principles to the review of important clinical topics; (2) The Cochrane Controlled Trials Register which is a bibliographic database of controlled trials; (3) The Database of Abstracts of Reviews of Effectiveness (DARE), which includes structured abstracts of systematic reviews that have been critically appraised by reviewers at the NHS Centre for Reviews and Dissemination in York and by other experts, for example from the American College of Physicians'; Journal Club and the journal Evidence-Based Medicine; and (4) The Cochrane Review Methodology Database which is a bibliography of articles on the science of research synthesis.
The Cochrane Library search yielded 666 titles/abstracts. The MEDLINE search resulted in 2,379 titles/abstracts. After eliminating duplicates, we retained 2,207 titles/abstracts for screening. The EMBASE search retrieved 1980 citations. After eliminating duplicates, we retained 327 for screening.
We conducted all searches in January 2000 and subjected a total of 3,200 titles/abstracts to screening by two physician reviewers. By merging and eliminating duplicates from the titles/abstracts from the three databases, we created a database of titles and abstracts.
EndNote software (EndNote Windows Version 3.0, 1st Edition. Niles Software Inc., Berkeley, CA) was used to keep a complete record of all titles/abstracts and identify duplications. This software stores, organizes, and tracks references by source (e.g. identified in MEDLINE), search strategy (date of search, index code specifying search criteria used), and a unique identification (UI) code for each article (assigned by the source used to find article). Electronic removal of duplicate citations was supplemented by manual cross-checking. In the event an article was identified by an expert panel member or through reference checking, the title and author of the reference were entered into MEDLINE through the Ovid search system (Ovid Technologies, Inc. 1998, Version: 7.8 Millennium source ID 1.3932.1.156.1.7, Revision: 1.303.2.8) to determine the UI. If a UI could not be found for the article, an alternate identification code was assigned.
EndNote assigned a record number to each new reference added to the master file. This number would not change once an article was added to the list and was used, in addition to the UI, to sort references for article retrieval and review.
Upon completing the literature search and duplicate checking, we exported the master list generated from EndNote to a Microsoft Excel spreadsheet for data export and analysis. We added codes including status of article retrieval, reviewer, and the results of the review.
After retrieving of titles and abstracts from the literature search, two physician reviewers reviewed the abstracts against the inclusion/exclusion criteria to determine eligibility for inclusion in the evidence synthesis as defined in the scope and key questions. Titles/abstracts were not masked prior to review. A pre-designed screening form was used to record the reviews. A meeting was held to review the instructions for screening (Appendix H), including the use of the computerized data forms. The reviewers entered the screening results directly into the computer and forwarded the results electronically to our data analyst for processing. The screening results for each title/abstract were matched between the two reviewers, and discrepancies on inclusion or exclusion were resolved in conference calls among the two reviewers and the task order coordinator. The data analyst generated summary reports indicating those abstracts that passed the screening criteria and those that failed and the reasons for failure.
| Reason Code | Reason for Rejection | Number (Percent) of Citations |
|---|---|---|
| R0 | Written in non-English language | 170 ( 5.3%) |
| R1 | Case report/editorial/letter/clinical/overview/ practice guidelines/consensus statements | 459 (14.3%) |
| R2 | Non-human subjects | 19 ( 0.6%) |
| R3 | Study condition not OM | 804 (25.1%) |
| R4 | Age of study population >12 yearsa | 57 ( 1.8%) |
| R5 | Study population exclusively on any one of the following: Craniofacial defects, primary mucosal disorders, Immunodeficiencies, or Down or other genetic syndromes | 15 ( 0.5%) |
| R7 | Any key questions not addressed | 697 (21.8%) |
| R8 | Duplicate citation | 9 ( 0.3%) |
The age limit was later extended to 22 years of age for Questions 2 and 3.
The third source of reference material was the six proceedings of the International Symposium on Recent Advances in Otitis Media with Effusion from which we identified 159 additional citations for full article review.
The fourth source consisted of references in books and articles from which we identified 31 full articles for further review.
The titles/abstracts identified for further review were forwarded to the library for full article retrieval. Libraries at both the Los Angeles County - University of Southern California Medical Center and the University of Southern California Health Sciences Campus were the primary sources of the articles. Those not found were retrieved through the Inter-Library Loan Program.
Because a large number of titles/abstracts had inadequate information for full evaluation, a secondary screening of full articles was conducted. Two physicians or a physician and a health services researcher reviewed each article. Articles were not masked prior to review. Discrepancies on inclusion/exclusion were resolved between the reviewers.
| Reason Code | Reason for Rejection | Cochrane and Medline N=614 | AAP Data Files N=29 | Symposia on OME N=107 | Technical Experts N=17 | Articles and books N=31 | Total N=798a |
|---|---|---|---|---|---|---|---|
| R0 | Non-English language | 1 | 0 | 0 | 0 | 0 | 1 (0.1%) |
| R1 | Case report/editorial/ letter/clinical/ overview/ practice guidelines/ consensus statements | 142 | 24 | 22 | 4 | 17 | 209 (26.2%) |
| R2 | Non-human subjects | 4 | 0 | 0 | 0 | 2 | 6 (0.8%) |
| R3 | Study condition not OM | 33 | 0 | 0 | 2 | 2 | 37 (4.6%) |
| R4 | Age of study population >22 years | 0 | 0 | 2 | 0 | 1 | 3 (0.4%) |
| R5 | Study population exclusively one of the following: craniofacial defects, primary mucosal disorders, immunodeficiencies, or Down or other genetic syndromes | 3 | 0 | 1 | 2 | 0 | 6 (0.8%) |
| R7 | Key questions not addressed | 354 | 5 | 74 | 9 | 2 | 444 (55.6%) |
| R8 | Duplicate citation | 39 | 0 | 3 | 0 | 7 | 49 (6.1%) |
| R9 | Data not abstractable from article | 38 | 0 | 5 | 0 | 0 | 43 (5.4%) |
3 incorrect citations not included.
| Cochrane and Medline | AAP Data Files | SymposiaProceedings on OME | Technical Experts | Articles and books | All Sources | |
|---|---|---|---|---|---|---|
| Total Citations | 3200 | 1441 | 159a | 39 | 40 | 4879 |
| Number of articles reviewed at secondary screening | 975b | 32 | 159 | 36 | 40 | 1242 |
| Number accepted after secondary screening | 372 | 3 | 52 | 19 | 5 | 451 |
| Question 1 | 127 | 0 | 14 | 0 | 0 | 141 |
| Question 2 | 74 | 1 | 20 | 17 | 0 | 112 |
| Question 3 | 157 | 2 | 22 | 2 | 3 | 186 |
| Question 4 | 69 | 0 | 4 | 0 | 2 | 75 |
| Total of above c | 427 | 3 | 60 | 19 | 5 | 514 |
Exact number of citations not determined.
Four cases previously rejected because of age limit >12 were added to the original 971 accepted citations.
The ‘total of above’ number can exceed the number accepted because an article can address more than one question.
After establishing the analytical plan and before data abstraction, a physician and one health services researcher carried out a tertiary review of the 449 articles according to a set of established criteria for each key question. During this tertiary review, study design and quality were also evaluated.
For Question 1, we used three criteria for tertiary screening: 1) was the study a prospective cohort(s) study or a randomized control trial (RCT), 2) whether the control group in the RCT used the other ear as the ‘unit of control’ or not, and 3) were the outcome data abstractable?
For Questions 2 and 3, we established 5 criteria for tertiary screening:
Was the degree of OME determined for the first 3 years of life, and could the OM degree for the period before 3 years of age be linked to a specific outcome? If a study began at age 3, the study was not considered to fulfill this criterion.
Was the upper age limit 22 years of age? If a study included subjects older than 22 years, this criterion could be fulfilled only if outcomes for the 22 years of age and under was reported.
Was the degree of OM graded in some way, such as total time with OM, some measure of OM persistence, OM recurrence, or some measure of OM severity, and could the OM degree grade be linked to a specific outcome?
Was the study prospective? A study is considered prospective if the outcomes were measured prospectively. Cross-sectional and case-control studies were specifically excluded. A study that followed subjects prospectively for both OM history and outcome measures was considered a prospective cohort study. Studies that followed subjects prospectively for outcome measures (i.e. over a period of time) but retrospectively for OM history were considered to be retrospective-prospective studies and were accepted for inclusion. Studies that collected data on the outcome measures at one point in time, were considered retrospective cohort studies regardless of whether OM history was collected prospectively or retrospectively and were excluded. Studies that presented a cross-sectional analysis of prospectively collected data were not considered prospective and were excluded. Randomized controlled studies of an intervention with longitudinally measured outcomes were considered prospective and were included.
Was the outcome measured when the child was older than 3 years of age?
For Question 4, we established 3 criteria for inclusion during tertiary review: 1) Were the diagnostic procedure of interest and the reference standard performed within 24 hours of each other? 2) Is the diagnostic procedure not an algorithm or combination of multiple diagnostic procedures, 3) Are the reference standards one of those specified in the scope (tympanocentesis, MRI, myringotomy, validated pneumatic otoscopy, or CT scan), and 4) Are the data abstractable.
After tertiary review and data abstraction, a total of 114 articles were included in this evidence report, five of which addressed more than one key question:
38 chort studies for question 1, the natural history question;
21 studies for question 2, the speech and language question;
8 studies for question 3, the hearing question; and
52 studies for question 4, the diagnostic method question.
| Accepted at Secondary Screening | Abstracted in Evidence Tables | |
|---|---|---|
| 449 | 114 (25%) | |
| Question 1 | 141 | 38 (27%) |
| Question 2 | 112 | 21 (19%) |
| Question 3 | 186 | 8 ( 4%) |
| Question 4 | 75 | 52 (69%) |
| Total of above a | 514 | 119 (23%) |
An article can address more than one question.
We established criteria used for the assessment of study quality prior to the review of articles. Only prospective cohort studies were reviewed for Questions 1, 2 and 3 because of concerns about the validity of case-control, cross-sectional, and retrospective cohort studies. Diagnostic studies were reviewed for Question 4. The criteria used to evaluate the quality of both types of studies were modified from the work by the McMaster University Group (Jaeschke, Guyatt, and Sackett, 1994; Sackett, 1981; Trout, 1981; Tugwell, 1981).
The quality of natural history studies was evaluated against the following criteria:
Was the study a prospective cohort study?
Was the outcome(s) of the study clearly defined?
Was the out come(s) measured at a clearlly defined timepoint(s)?
Was the cohort of subjects followed without any intervention?
Was there blinded assessment of the outcome(s) of the study?
Were point estimates and measures of variability provided for the main adverse outcomes measured?
The quality of prospective cohort studies was evaluated against eight components:
Was the study cohort(s) clearly defined, with clearly spelled out inclusion and exclusion criteria?
Was the study cohort(s) assembled at a uniform point in the course of the child's illness?
Were the pathways by which patients entered the study clearly described?
Was complete follow-up achieved?
Were withdrawals and drop-outs described?
Were objective outcome criteria developed and used?
Was the outcome assessment “blind”?
Was adjustment for extraneous factors carried out?
The quality of diagnostic studies was evaluated against six components:
Was the reference standard appropriate?
Were the test results and the reference standard assessed independently of each other?
Were the readers of the results of the diagnostic test or the reference standard blinded?
Did the patient sample include an appropriate spectrum of mild and severe, treated and untreated patients to whom the diagnostic tests were applied in clinical practice?
Were the reproducibility of the test result (precision) and its interpretation (observer variation) determined?
Were the methods for performing the test described in sufficient detail to permit replication?
Articles were not masked prior to review. The Task Order Coordinator resolved minor discrepancies between the two reviews of each article. Conferences were held to resolve discrepancies whenever needed.
For the articles deemed eligible for inclusion in the Evidence Report, data abstraction was carried out by a two-member team that consisted of a physician reviewer and a health services researcher. One of the two members abstracted the data onto the evidence table, and the other member checked the data for accuracy. Data abstracted included parameters necessary to define study groups, inclusion/exclusion criteria, influencing factors, and outcome measures to be used in analysis.
Specific instructions for data abstraction were recorded. For Question 1, the outcome indicators for abstraction included partial OME resolution (resolution in one ear for bilateral OME only), complete OME resolution, relapse/recurrence (fluctuation/dynamic course), AOM after OME. For Question 2, the outcome indicators for abstraction were expressive or receptive language, expressive or receptive speech, and cognitive verbal intelligence. For Question 3, the outcome indicators for abstraction included conductive or sensorineural hearing loss. For Question 4, the outcome indicators for abstraction were sensitivity, specificity, positive and negative predictive values, and prevalence rate. For all questions, the time or age at which each outcome was measured was recorded. The outcome measures included both continuous and categorical measures. Continuous measurements included mean time or a median time and the categorical measures included proportion with resolution at specified times where both numerators and denominators were recorded. A key issue was that individual children, not populations, must be tracked. The latter would be acceptable only if we knew for certain that the same children were checked at both times.
To reduce selection bias, we assigned two physician reviewers to screen and review titles/abstracts and full articles at every stage of the selection process. We assigned one physician and one health services researcher who was familiar with experimental design and biostatistics to abstract data. We assessed completeness of our collection of retrieved articles by cross-checking with studies included in other meta-analyses and references listed in review articles. The software program EndNote was used to check batches of articles added to the master list for duplicate references by comparing author, year title, and reference type. Following the importation of the first literature search, we used the software program EndNote to check subsequent references for duplication prior to their addition to the master list. After the master list was completed, we performed a second, manual, check to ensure no duplication.
To assess the extent of publication bias, we searched multiple sources and unpublished material identified by the Technical Expert Panel and internal content experts. We also studied funnel plots—scatter plots of sample size versus the estimated effect size from each study. When publication bias existed, a portion of points would be missing from the funnel plot, typically at the null effect level. Because graphical evaluation can be subjective, we also conducted an adjusted rank correlation test (Begg, 1999) and a regression asymmetry test (Egger, Smith, Schneider et al., 1997) as formal statistical tests for publication bias. We conducted these tests using the statistical package Stata (StataCorp. 1999).
The mechanisms used to enhance consistency in screening and data abstraction include the use of pre-designed forms with explicit instructions and continuous and prompt resolution of discrepancies. Data were entered into a Microsoft Excel spreadsheet directly by the screeners or data abstractors. A third project staff cross-checked data for individual studies abstracted by each data collector. We resolved discrepancies by rechecking the article or by consensus via conference calls.
An evidence table was prepared for each key question. Each evidence table provides a comprehensive tabular display of data abstracted from the literature in response to the question. It contains the name of the first author, year of publication, study design and quality score, how and by whom OME diagnosis was done, when and where the study took place, inclusion and exclusion criteria, important influencing factors, sample size, outcome measures and their definitions, and study findings. A total of four evidence tables was prepared; they are included in the section called Evidence Tables in this report.
Based on the discussions of technical experts during the conference calls and the designated time frame for the evidence assessment, a supplemental analysis plan was developed for each key question to synthesize the data.
Question 1. What is the natural history (spontaneous resolution rate over time without treatment) for the following diagnostic groups: a) OME persisting after a discrete episode of acute otitis media, b) newly diagnosed OME of unknown duration (unilateral or bilateral), c) OME persisting for weeks or months (unilateral or bilateral), d) unilateral OME lasting 3 months or longer, and e) bilateral OME lasting 3 months or longer?
The outcome measures for this questions included complete and/or partial resolution rates, relapse/recurrence rates, and incidence of AOM after OME. The scope listed 31 non-treatment factors that might affect the course of the illness and confound the outcomes. They included: age, gender, ethnicity/race, socioeconomic status, number of hours in child care center, tobacco smoke exposure, season, number of children in household, breast-fed status, barotrauma challenges, OME laterality, hearing level, total duration of OME, age at first OM onset age of previous OME, number of previous OMEs, family history of OME, otitis prone (AOM), allergies, prior tubes, prior adenoidectomy, developmental delay, caregiver preference for treatment, caregiver education, examiner skill, examiner type, health care setting, monitoring time, monitoring frequency, monitoring personnel, and monitoring method.
Furthermore, the type of study is an important consideration for the assessment of natural history. A stratified random sample of a broad, well-defined population provides evidence of good generalizability, but may be restricted in the amount of clinical information on participants. A single (untreated) arm from a clinical trial will usually provide much more clinical evidence about OME, but this is usually assessed on a very selected group of children, making generalizing the results to the general population more difficult. For this evidence assessment we used only prospective cohort studies as these came closest to the ideal of enrolling a sample from a broad poulation.
The first step of the analysis was to obtain a distribution of studies stratified by the 5 diagnostic groups (namely, OME persisting after a discrete episode of acute otitis media, newly diagnosed OME of unknown duration (unilateral or bilateral), OME persisting for weeks or months (unilateral or bilateral), unilateral OME lasting 3 months or longer, bilateral OME lasting 3 months or longer), by type of outcome measures, and by non-treatment factors. This stratification provided us with an overview of the emphasis of past research in this area and an opportunity to identify gaps and areas for future research.
Using the DerSimonian and Laird random effects model (DerSimonian and Laird 1986) to pool rates across studies, we performed a meta-analysis on strata with more than 3 studies for a pooled estimate of an outcome with 95% confidence intervals. This method produces a summary measure by weighting each study's measure by the inverse of the sum of the within-study variance and the between-study variance. This approach allowed both sampling variation and between-study heterogeneity to affect the pooled estimate.
In addition to the pooled estimate, we reported the Q statistic and p-value for the Chi-squared test of heterogeneity, which tests the null hypothesis that the individual study results are homogeneous (Laird and Mosteller 1990).
Question 2. Do infants and preschool children with longer duration of early-life OME as compared to those with shorter duration OME have greater delays in speech and language development (receptive or expressive) later in life? One specific formulation of this question is: Is OME-associated conductive hearing loss in the first 3 years of life a risk factor for speech and language developmental delays? What are the risk factors that interact with the effect of OME on speech and language development in infants and preschool children?
For the first part of the question, the outcome of interest was speech and language developmental delay and the risk factor of interest was OME-associated conductive hearing loss and/or long versus short duration of early-life OME. For this question we included only comparative studies. Further, since prospective comparative cohort studies provide better evidence than retrospective comparative cohort studies, we conducted our assessment using only prospective comparative cohort studies.
The risk factor of interest was whether a child had or did not have OME-associated conductive hearing loss in the first 3 years of life, or whether duration of OME during the first three years of life was long or short. For this risk factor, we collected data on five related variables: hearing level, total duration of OME ≥3 months, number of previous OMEs, duration of middle ear effusion (MEE), and repeated or persistent versus infrequent early-life OME. The hearing level was used to determine whether a child had OME-associated conductive hearing loss in the first 3 years of life. The total duration of OME greater or equal to 3 months was used to define length of duration. We used the repeated or persistent versus infrequent early-life OME to define the risk. If a study did not classify the study subjects this way and (instead) reported data by number of previous OMEs and/or duration of MEE's, we sought the advice of the technical experts to stratify the samples based on these variables.
The influencing factors of outcome included both treatment and non-treatment factors. Here we are using “influencing factors” as a general term including risk factors for OM and/or confounding factors for the dependent variables of interest. The non-treatment factors included: age at first OM, gender, ethnicity/race, socioeconomic status, number of hours at a child care center, quality of child care, early intervention program, tobacco smoke exposure, number of children in household, breast-feeding status, OME laterality, allergies, developmental delay, OM complications, e.g. perforated TM, cholesteatoma, chronic illness of any type, caregiver education, quality of parent-child interaction, examiner skill, examiner type, health care setting, age at rechecks, frequency of rechecks, primary care provider, and type of equipment to measure hearing. Treatment factors included any combination of the following: tympanostomy tubes, adenoidectomy, myringotomy, antibiotics, systemic steroids, decongestant, antihistamine, N-acetyl-cysteine or others.
The outcome measures for this question related to speech and language developmental delay. These outcomes were measured by different instruments at different times, by different professionals, in different settings. In preparation for information synthesis, with the assistance of our speech and language technical expert, we classified the tests used in our final set of studies into the five outcome categories: expressive language, receptive language, expressive speech, receptive speech, and cognitive verbal intelligence. For analysis, we first stratified studies by the type of outcome measures, risk factor measures, and treatment or non-treatment risk factors. For any comparison among 3 or more studies, we conducted a meta-analysis. In each meta-analysis, we derived a pooled effect size defined as the proportion of standardized difference between the positive and negative otitis media groups. We pooled across studies the standardized mean differences between the groups and divided by a pooled standard deviation. We used a random effects model (DerSimonian and Laird, 1986) and the Hedges estimate of the pooled standard deviation (Hedges and Olkin, 1985). We used Stata (StatCorp. Stata Statistical Software: Release 6.0. College Station, TX: Stata Corporation. 1999) for the analyses.
To answer the second part of the question: “What are the risk factors that interact with the effect of OME on speech and language development in infants and preschool children?,” we planned to conduct meta-regression analysis to identify the risk factors that contribute significantly to speech and language delays. For this analysis we would include both comparative and single cohort studies. Many technical issues must be addressed to set up data appropriately for meta-regression analysis. Due to restriction of the time frame, this part of the question was not included in this assessment but should be an area of future research.
Question 3. Do infants and preschool children with longer duration early-life OME as compared to those with shorter-duration OME have permanent (or sensorineural) hearing loss later in life? One specific formulation of this question is the following: Is OME-associated conductive hearing loss in the first 3 years of life a risk factor for permanent (or sensorineural) hearing loss later in life? What risk factors interact with the effect of OME on hearing loss later in life (unilateral or bilateral) in infants and preschool children?
The analysis plan for Question 3 followed that for Question 2. In preparation for information synthesis, we sought the assistance of our audiology technical expert, to group the tests used in our final set of studies into homogeneous categories.
Our audiology expert advised that (1) an acoustic reflex at 500 or 1000 Hz were both acceptable for study, (2) the criterion for abnormal reflex threshold would depend on the study, whether ipsilateral or contralateral, and on the frequency used for testing, (3) the abnormal reflex criteria should be based on normative data, and (4) quantitative tympanometry should be classified as:
Static Compensated Acoustic Admittance including: peak admittance, peak compensated admittance, peak compliance, static compliance, static admittance, and peak compliance.
Tympanometric Gradient including: gradient, pressure gradient and tympanometric gradient (Madsen compliance was excluded because Madsen compliance units were arbitrary units and the this instrument was from an era in which the units were not on a calibrated scale.)
Tympanometric Width referring to terms containing the words width, referring to tympanometry.
Question 4. What are the sensitivity, specificity, and predictive values for alternative methods of diagnosing OME compared with one of the reference standards?
The diagnostic methods to be assessed included: a) signs/symptoms, b) non-pneumatic otoscopy, c) pneumatic otoscopy, validated or unvalidated examiner, d) binaural (or bilateral) micro-tympanoscopy, e) portable tympanometer, f) professional tympanometer, g) quantitative tympanometry, h) acoustic reflectometry, i) otoacoustic emissions, and j) audiometry, air or. bone conduction thresholds. The reference standards used to evaluate the accuracy of the diagnostic methods included: a) tympanocentesis, sedated or non-sedated, b) MRI, c) myringotomy, sedated or non-sedated, d) validated pneumatic otoscopy, and e) CT Scan.
Diagnostic methods based on algorithms, combinations of methods, or combination of scores, were not within the scope of this report because the sources of variation of such combinational methods would be difficult to detect in published articles and the analysis of them would not be feasible within our timeframe. Also excluded were studies where the experimental diagnostic test and the reference standard test were performed more than 24 hours apart.
Our strategy for evaluating the diagnostic value of a procedure was to derive pooled estimates for sensitivity, specificity, and prevalence rate for each diagnostic procedure and reference standard with 3 or more comparison studies. We used the DerSimonian and Laird random effects model (DerSimonian and Laird 1986) to derive random effects estimates and 95% confidence intervals. We also pooled the prevalence rates to determine the heterogeneity of the study populations. Using the pooled estimates, we plotted the performance of each diagnostic test in terms of sensitivity and (1-specificity) and identified the best performer among the tests included in the comparison. We then derived the positive and negative predictive values for the best diagnostic test for various prevalence levels.
To prepare for a meta-analysis for each comparison, we abstracted data from the evidence table; one meta-analysis was performed for sensitivity and specificity,. The following data elements were entered into the SAS program to be converted into a SAS data set: study ID number, author and year of publication, number of adverse outcomes in the experimental group, total number of patients in the experimental group, number of adverse outcomes in the control group, and total number of patients in the control group. We used a SAS macro software program developed by RAND statistical staff to perform all meta-analyses and used the beta-test version of the software package “MetaGraphs” (1998, Belmont Research, Inc. 84 Sherman Street, Cambridge, MA 02140) for graphing.
The following statistics were generated from the SAS macro program: (a) study-level statistics (incidence rate, relative risk, risk difference, number needed to treat (NNT), odds ratio, and their 95 percent confidence intervals); (b) crude estimates and their 95 percent confidence intervals for all studies combined; (c) fixed effects estimates and their 95 percent confidence intervals for all studies combined; (d) random effects estimates and their 95 percent confidence intervals based on the DerSimonian and Laird method for pooling study results, and Chi-squared test of homogeneity; (e) weight for each study for both the fixed effects model and random effects model used to calculate of risk difference and relative risk.
To use MetaGraph for graphing, we entered the data into ASCII files using the UltraEdit-32 software. Funnel plots were produced for the purpose of screening possible publication bias, and the shrinkage plots were generated to display the effect size of each study and compare it against the overall model estimate, together with the 95 percent confidence limits. We evaluated the funnel plots graphically for asymmetry that resulted from the non-publication of small, negative studies. Because graphical evaluation can be subjective, we also conducted an adjusted rank correlation test (Begg, 1999) and a regression asymmetry test (Egger, Smith, Schneider et al., 1997) as formal statistical tests for publication bias. We conducted these tests in the statistical package Stata (StataCorp. 1999).
| Peer Reviewer | Area of Expertise | Affiliation/Location |
|---|---|---|
| Howard Bauchner, M.D. | General Pediatrics | Child and Adolescent Health Scholar in Residence Agency for Healthcare Research and Quality |
| Hanan S. Bell, Ph.D. | Methodology reviewer | Seattle, WA |
| Alfred O. Berg, MD, MPH | Family Medicine | University of Washington, Seattle, WA |
| Patricia A. Fall, MS, CRNP | Nurse Practitioner | Wexford, PA |
| George A. Gates, MD | Otolaryngology | University of Washington, Seattle, WA |
| Janice Goertz, RN, CPNP | Nurse Practitioner | Portage, MI |
| Judith Gravel, PhD | Hearing and Speech | Albert Einstein College of Medicine, Bronx, NY |
| Mark P.Haggard, Ph.D. | Hearing/ Psychoacoustics | Institute for Hearing Research, Nottingham, UK |
| Vic Hasselblad, Ph.D. | Meta-analysis reviewer | Duke University, Durham, NC |
| Tracy Lieu, MD | Pediatrician/Health Plan | Harvard Pilgrim Health Care, Boston, MA |
| Martin C. Mahoney, MD, Ph.D. | Family Medicine | DeGraff Family Medicine, North Tonawanda, NY |
| A. Richard Maw MS FRCS | Otolaryngology | Bristol Royal Infirmary, Bristol UK |
| Robert Ruben, MD | Otolaryngology | Montefiore Medical Center, Bronx, NY |
| Anne GM Schilder, MD, Ph.D. | Otolaryngology | University Medical Center Utrecht, The Netherlands |
| Steve Shelov, MD | Pediatrics | Scarsdale, N.Y |
| Sylvan Stool, MD | Otolaryngology | The Children's Hospital, Denver, CO |
| Robin Yurk, MD, MPH | Consumer/Health Plan | Community Clinic, Inc. Rockville, MD |
| Dr. J.O.M. Zaat (Joost Zaat) | Methodology reviewer | Purmerend, The Netherlands |
| Enclosed is a draft evidence report on the diagnosis and treatment of otitis media with effusion. You may make your comments either directly on the draft evidence report, or on a separate sheet of paper. If you choose to record your comments on a separate piece of paper, please use the page and paragraph number to identify to which part of the report your comments pertain. |
| We ask that you consider the following questions while you read this report. We realized that some of the questions may not pertain to your area of expertise. Please feel free to comment only on those that you feel most suited to answer. |
|
OME persisting after a discrete episode of acute otitis media,
newly diagnosed OME of unknown duration (unilateral or bilateral),
OME persisting for weeks or months (unilateral or bilateral),
unilateral OME lasting 3 months or longer,
bilateral OME lasting 3 months or longer?
| Cohort Identifier | Relevant Articles (ID, authors, year) | Comments |
|---|---|---|
| Birch | 862 Birch and Elbrønd (1984) | 0.75- to 7-year old children followed from 1/1982 to 4/1982. |
| Casselbrant I | 1000 Casselbrant, Brostoff, Ashoff, and Bluestone (1985) | 2- to 5-year old children followed from 9/1981 to 8/1983. |
| 2929 Casselbrant, Brostoff, Ashoff, and Bluestone (1990) | ||
| Casselbrant II | 2929 Casselbrant, Brostoff, Ashoff, and Bluestone (1990) | 5- to 12-year old children followed from 9/1984 to 5/1985. |
| Ernston | 1202 Erston and Sundberg (1984) | Children 1- to 11-years old embedded in a controlled trial. |
| Fiellau-Nikolajsen I | 1237 Fiellau-Nikolajsen and Lous (1979) | Children in Hjoerring, Denmark first examined in 1976. |
| 1235 Fiellau-Nikolajsen (1979) | ||
| 1242 Fiellau-Nikolajsen (1981) | ||
| 3051 Lous and Fiellau-Nikolajsen (1988)a | ||
| Fiellau-Nikolajsen II | 1235 Fiellau-Nikolajsen (1979) | Children in Hjoerring, Denmark first examined in 1978. |
| 1245 Fiellau-Nikolajsen (1983) | ||
| Fiellau-Nikolajsen III | 1777 Lous and Fiellau-Nikolajsen (1981) | Children in Hirtshals and Sindal, Denmark first examined in 1978. |
| Holmquist | 1494 Holmquist, Fadala, and Qattan (1987) | 7- to 9.5-year old children followed 2/1983 to 4/1983. |
| Lamothe | 1714 Lamothe, Boudreault, Blancette, Tetreault, and Poliquin (1981) | First graders followed over a 6 week period in 1979. |
| Leiberman | 1735 Leiberman and Bartal (1986) | 2- to 12-year old children who had a follow-up exam after a 2.5 year delay in ventilating tube placement. |
| Marchisio | 9 Marchisio, Principi, Salpietro, Boschi, Chetri, Caramia, Longhi, Reali, Meloni, DeSantis, Sacher, and Cupido (1998) | Primary school children followed for 12 weeks after the initial exam, and then a subset were randomized to a placebo group for another 8 weeks of follow-up. |
| Mills | 1927 Mills and Vaughan-Jone (1992) | Prospective single cohort embedded in a comparative cohort of children 1- to 14-years old who had a follow-up exam about 2 months after the initial visit. |
| Portoian-Shuhaiber | 2184 Portoian-Shuhaiber and Cullinan (1984) | 5- to 6-year old children followed for 10 weeks in 1979. |
| Renvall I | 2240 Renvall, Lidén, Jungert, and Nilsson (1978) | 10- to 11- year old children examined after a 3-year interval. |
| Renvall II | 2242 Renvall, Anniansson, and Lidén (1982) | 4-year old children followed over a 12 week period in 1980. |
| Reves | 2243 Reves, Budgett, Miller, Wadsworth, and Haines (1985) | 3- to 6-year old children followed 11/1983 to 2/1984. |
| Roberts | 2262 Roberts, Johnson, Carlin, Turczyk, Karnutta, and Yaffee (1995) | Newborns followed for 2 months after birth. |
| Robinson | 2270 Robinson, Allen, and Root (1988) | Infants followed for 6 weeks. |
| Sly I | 2457 Sly, Zambie, Fernandes, and Frazer (1980) | 4- to 5-year old children recruited in 2/1977. |
| Sly II | 2457 Sly, Zambie, Fernandes, and Frazer (1980) | 4- to 5-year old children recruited in 9/1977. |
| Tos I | 1486 Holm-Jensen, Sørenson, and Tos (1981) | Children born in 1975 and followed from 5/1979 to 2/1985. |
| 2636 Tos (1981)a | ||
| 543 Tos, Holm-Jensen, Sørenson, and Mogensen (1982) | ||
| 2639 Tos (1983)a | ||
| 2642 Tos (1984)a | ||
| 4834 Tos (1984)a | ||
| 4835 Tos (1988)a | ||
| Tos II | 2629 Tos (1979) | Children born in 1976 and followed from 11/1977 to 2/1985. |
| 2190 Poulsen and Tos (1980) | ||
| 2631 Tos (1980)a | ||
| 2634 Tos (1980) | ||
| 2593 Thomsen and Tos (1981) | ||
| 2639 Tos (1983)a | ||
| 2642 Tos (1984)a | ||
| 4834 Tos (1984)a | ||
| 4835 Tos (1988)a | ||
| Tos III | 2189 Poulsen and Tos (1978) | Children born in 1977 and followed from 1–2/1977 to 2/1985. |
| 2627 Tos (1979) | ||
| 2631 Tos (1980)a | ||
| 2634 Tos (1980) | ||
| 2639 Tos (1983)a | ||
| 2642 Tos (1984)a | ||
| 4834 Tos (1984)a | ||
| 4835 Tos (1988)a | ||
| Tos IV | 1946 Moller and Tos (1990) | Children checked daily for 30 days. |
| van Balen | 91 van Balen, De Melker, Touw-Otten (1996) | 6-month to 6-year old children followed for 3 months in the early 1990's. |
| Williamson | 2791 Williamson (1994) | 5- to 8-year old children followed from 1988-1989 to 1991. |
| Wilmot | 2795 Wilmot (1988) | 6-month to 10-year old children were followed for 12 months after developing OME after AOM. |
| Zielhuis | 2863 Zielhuis, Rach, and van den Broek (1990) | 2- to 4-years old children followed from 1982-1983. |
The article did not include abstractable data relevant to the specific Question 1 outcome measures.
| ID Number | Author | Year | Included | Excluded | Reason for Exclusion From Assessment |
|---|---|---|---|---|---|
| 1000 | Casselbrant | 1985 | X | ||
| 1238, 1240 | Fiellau-Nikolajsen | 1980 | X | 1238 is a study of tympanometry as a diagnostic tool. 1240 is the same cohort as in 1245 and does not present sufficient data tracking individual cases or episodes of OME. | |
| 2242 | Renvall | 1982 | X | ||
| 2593 | Thomsen | 1981 | X | 2593 uses same cohort as 2190, 2629, 2631, 2639, 2642, 4834, and 4835. | |
| 2627 | Tos | 1979 | X | ||
| 2634 | Tos | 1980 | X | ||
| 543 | Tos | 1982 | X | 543 uses same cohort as 1486, 2636, 2639, 2642, 4834, and 4835. | |
| 2791 | Williamson | 1994 | X | ||
| 2863 | Zielhuis | 1990 | X | ||
| 2857 | Zeisel | 1995 | X | 2857 included 13% with purulent OME. Antibiotic administration and other interventions for either purulent or non-purulent OME. Insufficient data track individual cases or episodes of OME. | |
| 2243 | Reves | 1985 | X | ||
| 91 | van Balen | 1996 | X | Randomized controlled trial with an initial 3-month watchful waiting period of children with OME. | |
| 1777 | Lous | 1981 | X | ||
| ID# | Author | Year | Reason Not Included |
|---|---|---|---|
| 705 | Aniansson | 1985 | Screening study over 2 years. Some retest of same subjects, but no control over treatment. |
| 3051 | Lous | 1988 | Data not abstractable as presented. Available counts are in essential agreement with articles 1235, 1237, and 1242 apart from minor differences in number of tympanogram types, e.g. type A in Jan 1975 (629 in articles 1235, 1237, and 3051 and 631 in article 1242) and types B, C1, and C2 in Jun-July 1976 (32, 31,, and 42 respectively in articles 1235, 1242, and 3051 and 37, 28, and 40 respectively in article 1237). |
| 2578 | Teele | 1980 | Data not abstractable; data on persistent OME after AOM are not presented and cannot be derived. |
| 2631 | Tos | 1980 | Article does not present any new information other than the actual initial counts of tympanogram types for the 1976 and 1977 Tos cohorts utilized in the evidence tables for article 2629 and 2627. |
| 2636 | Tos | 1981 | Data not abstractable as presented relative to the data on 3- and 6-month follow-up presented in article 1486. Individual cases of type B, C1, or C2 tympanograms cannot be tracked from 2/1979 to 11/1979 or 2/1980. |
| 2639 | Tos | 1983 | The only relevant new information is that the authors mention that in the 1976 cohort, 50% of tympanogram type B changed to types A or C over the first 3 months of the study. However, the number with type B who presented for the 3-month follow-up exam is not given so per cent change cannot be calculated. |
| 2642 | Tos | 1984 | Article does not present any abstractable data relevant to Q1. |
| 4834 | Tos | 1984 | Article does not present any new data relevant to Q1. |
| 4835 | Tos | 1988 | Article does not present any new data relevant to Q1. |
| 2795 | Wilmot | 1988 | Article presented data on OME following AOM which was eliminated as a condition of interest because it had been studied in a recent evidence analysis. |
| ID | Author | Year | Study Quality Scorea |
|---|---|---|---|
| 862 | Birch | 1984 | 3 (1,1,1,0,0,0) |
| 1000 | Casselbrant | 1985 | 3 (1,1,1,0,0,0) |
| 2929 | Casselbrant | 1990 | 3 (1,1,1,0,0,0) |
| 1202 | Ernstson | 1984 | 2 (1,0,0,1,0,0) |
| 1235 | Fiellau-Nikolajsen | 1979 | 4 (1,1,1,0,0,1) |
| 1237 | Fiellau-Nikolajsen | 1979 | 4 (1,1,1,0,0,1) |
| 1242 | Fiellau-Nikolajsen | 1981 | 4 (1,1,1,0,0,1) |
| 1245 | Fiellau-Nikolajsen | 1983 | 3 (1,1,1,0,0,0) |
| 1486 | Holm-Jensen | 1981 | 3 (1,1,1,0,0,0) |
| 1494 | Holmquist | 1987 | 3 (1,1,1,0,0,0) |
| 1714 | Lamothe | 1981 | 4 (1,1,1,1,0,0) |
| 1735 | Leiberman | 1986 | 2 (1,1,0,0,0,0) |
| 1777 | Lous | 1981 | 3 (1,1,1,0,0,0) |
| 9 | Marchisio | 1998 | 3 (1,1,1,0,0,0) |
| 1927 | Mills | 1992 | 1 (1,0,0,0,0,0) |
| 1946 | Moller | 1990 | 3 (1,1,1,0,0,0) |
| 2184 | Portoian-Shuhaiber | 1984 | 3 (1,1,1,0,0,0) |
| 2189 | Poulsen | 1978 | 3 (1,1,1,0,0,0) |
| 2190 | Poulsen | 1980 | 3 (1,1,1,0,0,0) |
| 2240 | Renvall | 1978 | 3 (1,1,1,0,0,0) |
| 2242 | Renvall | 1952 | 4 (1,1,1,1,0,0) |
| 2243 | Reves | 1985 | 3 (1,1,1,0,0,0) |
| 2262 | Roberts | 1995 | 4 (1,1,1,0,1,0) |
| 2270 | Robinson | 1988 | 1 (1,1,0,0,0,0) |
| 2457 | Sly | 1980 | 3 (1,1,1,0,0,0) |
| 2593 | Thomsen | 1981 | 3 (1,1,1,0,0,0) |
| 2627 | Tos | 1979 | 3 (1,1,1,0,0,0) |
| 2629 | Tos | 1979 | 3 (1,1,1,0,0,0) |
| 2634 | Tos | 1980 | 3 (1,1,1,0,0,0) |
| 543 | Tos | 1982 | 3 (1,1,1,0,0,0) |
| 91 | van Balen | 1996 | 3 (1,1,1,0,0,0) |
| 2791 | Williamson | 1994 | 2 (1,1,0,0,0,0) |
| 2863 | Zielhuis | 1990 | 3 (1,1,1,0,0,0) |
The six components of study quality are: a prospective cohort study; outcome clearly defined; time point at which outcome measured clearly defined; subjects followed without any intervention; blinded assessment of outcome; and point and variability estimates provided for main outcome measures. 1 indicates presence and 0 indicates absence
| Tympanogram Type | A | As | B | ||||
|---|---|---|---|---|---|---|---|
| Cohorta | Pressure (mmH2O) | immitance | Pressure (mmH2O) | immitance | Pressure (mmH2O) | immitance | Comments |
| Birch | > -100 | < 0.25mlc, 1 | 1stapedial reflex absent and max compliance unreadable; Madsen Electronics tympanoscope, model ZS 330, 226 Hz probe tone | ||||
| Fiellau-Nikolajsen I | > -100 | > 0.1 d | 200 to -4001 | ≤ 0.1d | 1or indeterminable | ||
| Fiellau-Nikolajsen II | middle-ear effusion=flat curve or <= -100 with absent middle ear reflexes | ||||||
| Fiellau-Nikolajsen III | > -100 | multiple criteria1 | 1type B=otoadmittance < 0.20millimhos, absolute gradient < 0.04millimhos and absence of ipsilateral acoustic reflex; Grason-Stadler Middle Ear Analyzer 1722 | ||||
| Holmquist | 50 to -99 | flat curve | Madsen ZA 330, 226 Hz probe tone | ||||
| Portoian-Shuhaiber | abnormal defined as an abnormal tympanometric curve and/or absent acoustic reflex; Grason-Stadler Middle-ear Analyser (Model 1722) | ||||||
| Tos I | 0 to -99 | flat curve | Madsen ZO 70 tympanometer, 220 Hz probe tone | ||||
| Tos II | 0 to -99 | flat curve | Madsen ZO 70 tympanometer, 220 Hz probe tone | ||||
| Tos III | >-100 | flat curve1 | 1<= 0.1e; Madsen ZO 70 impedance meter | ||||
| Tos IV AZ7 | 99 to -99 | flat curve without impedance minimum1 | 1or with a measurable impedance minimum and relative gradient < 0.1; Impedance audiometer AZ 7 (Interacoustics) | ||||
| ZS 331 | 99 to -99 | flat training1 | 1or compliance below 0.25mlc and absent ipsilateral stapedial reflex; Impedance tympanoscope ZS 331 (Madsen Electronics) | ||||
| Reves | -100 to 50 | >0.3b | < -100 | lowb | tympanometer 85 AR 11 (American Electro Medics) | ||
| Robinson | -149 to +50 | >0.2mlc | types As, B, C, and Cs are failures; Maico MA 610 portable impedance screener, 226 Hz probe tone | ||||
| Sly I and II | flat curve1 | 1or compliance < 0.3mlc or peak compliance occurred at or below -100 mmH20; Teledyne Avionics acoustic impedance meter model TA-1D | |||||
| van Balen | -99 to 200 | ≥ 0.2mmhoc | ≤ -400 | < 0.2mmhoc | |||
| Williamson | 200 to -99 | flat curve | Grayson-Stadler [sic] Earscan impedance audiometer | ||||
| Zielhuis | ≥ -99 | ≥ 0.2mlc | ≤ -400 | < 0.2mlc | Grason-Stadler-model 27 | ||
impedance
admittance
relative gradient
impedance slope
| Tympanogram Type | C | C1 | C2 | ||||
|---|---|---|---|---|---|---|---|
| Cohorta | Pressure (mmH2O) | immitance | Pressure (mmH2O) | immitance | Pressure (mmH2O) | immitance | Comments |
| Birch | ≤ -100 | ||||||
| Fiellau-Nikolajsen I | -100 to -199 | >0.1d | -200 to -400 | >0.1d | |||
| Fiellau-Nikolajsen II | middle-ear effusion=flat curve or <= -100 with absent middle ear reflexes | ||||||
| Fiellau-Nikolajsen III | ≤ -100 | ||||||
| Holmquist | 100 to -300 | ||||||
| Portoian-Shuhaiber | abnormal defined as an abnormal tympanometric curve and/or absent acoustic reflex | ||||||
| Tos I | -100 to -199 | -200 to -350 | |||||
| Tos II | -100 to -199 | -200 to -350 | |||||
| Tos III | -100 to -300 | >0.1e | -100 to -199 | -200 to -350 | |||
| Tos IV AZ 7 | -100 to -199 | >-200 | |||||
| ZS 331 | -100 to -199 | >-200 | |||||
| Reves | < -100 | >0.3b | |||||
| Robinson | types As, B, C, and Cs are failures | ||||||
| Sly I and II | |||||||
| van Balen | -199 to -100 | ≥ 0.2mmhoc | -399 to -200 | ≥ 0.2mmhoc | |||
| Williamson | -100 to -199 | -200 to -400 | |||||
| Zielhuis | -100 to -199 | ≥ 0.2mlc | -200 to -399 | ≥ 0.2mlc | |||
impedance
admittance
relative gradient
impedance slope
| Ears Resolved Intervala | ||||||||
|---|---|---|---|---|---|---|---|---|
| Cohort ID | Diagnostic Methodb | antibioticc | surgeryc | <2wk | <2m | <3m | <6m | Article(s) |
| <6-month old cohorts | ||||||||
| Robertse | oto | unknown | unknown | 22/24 (92%) | 24/24 (100%) | 2262 Roberts 1995 | ||
| Tos III | Tymp | unknown | unknown | 1/4d (25%) | 1/4 d (25%) | 2627 Tos 1979 | ||
interval calculated from cohort inception and not cumulative, unless otherwise noted
oto=otoscopy, tymp=tympanometry (type B to A transition)
Did any of the patients receive antibiotic or surgery?
interval started at 6-month follow-up
cumulative resolution rate
| Ears Resolved Intervala | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Cohort ID | Diagnostic Methodb | antibioticc | surgeryc | <6wk | <3m | <6m | <9m | <24m | Article(s) |
| 6-month to 3-year old cohorts | |||||||||
| Robinson | tymp | unknown | unknown | 10 of 25 d (40.0%) | 2270 Robinson 1988 | ||||
| 1 2634 Tos 1980; | |||||||||
| 2 2190 Poulson 1980; | |||||||||
| 6/51 1 | 15/59 2 | 19/51 3 | 9/48 4 | 3 2629 Tos 1979; | |||||
| Tos II | tymp | unknown | yes | (12%) | (25%) | (37%) | (19%) | 4 2593 Thomsen 1981 | |
| Tos IIee | tymp | unknown | yes | 6/51 1 (12%) | 16/511 (31%) | 24/511 (47%) | 1 2634 Tos 1980 | ||
interval calculated from cohort inception and not cumulative, unless otherwise noted
oto=otoscopy, tymp=tympanometry (type B to A transition)
Did any of the patients receive antibiotic or surgery?
interval is minimum of 6wk so may be greater
cumulative resolution rate
| Ears Resolved Intervala | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Cohort ID | Diagnostic Methodb | antibioticc | surgeryc | <2wk | <3wk | <1m | <6wk | <3m | <4m | <6m | <8m | <1y | <3y | Article(s) |
| >3-year old cohorts | ||||||||||||||
| Fiellau-Nikolajsen Id | tymp | unknown | yes | 14/941 (15%) | 22/911 (24%) | 32/691 (46%) | 33/652 (51%) | 11237 Fiellau-Nikolajsen 1979; 21242 Fiellau-Nikolajsen 1981 | ||||||
| Fiellau-Nikolajsen IId | tymp | unknown | unknown | 7/64 (11%) | 16/62 (26%) | 1235 Fiellau-Nikolajsen 1979 | ||||||||
| Holmquist | tymp | unknown | unknown | 251/511 (49%) | 1494 Holmquist 1987 | |||||||||
| Lamothe | pneum oto | no | no | 24/64 (38%) | 25/53 (47%) | 1714 Lamothe 1981 | ||||||||
| Lamothed | pneum oto | no | no | 24/64 (38%) | 38/53 (72%) | 1714 Lamothe 1981 | ||||||||
| Renvall I | tymp | unknown | unknown | 282/335 (84%) | 2240 Renvall 1978 | |||||||||
| Renvall IId | tymp | no | no | 10/40 (25%) | 16/40 (40%) | 2242 Renvall 1982 | ||||||||
| Sly | tymp | no | no | 1/9 (11%) | 4/9 (44%) | 6/9 (67%) | 2457 Sly 1980 | |||||||
| Sly II | tymp | no | no | 0/5 (0%) | 0/5 (0%) | 0/5 (0%) | 2457 Sly 1980 | |||||||
| Tos I | tymp | unknown | unknown | 3/921 (3%); 3/872 (4%) | 14/931 (15%); 14/872 (16%) | 11486 Holm-Jensen 1981; 2543 Tos 1982 | ||||||||
| Tos Id | tymp | unknown | unknown | 3/871 (3%) | 17/871 (20%) | 1543 Tos 1982 | ||||||||
| Williamson | tymp | unknown | yes | 35/67 (52%) | 52/67 (78%) | 61/67 (91%) | 2791 Williamson 1994 | |||||||
interval calculated from cohort inception and not cumulative, unless otherwise noted
oto=otoscopy, pneum oto=pneumatic otoscopy, tymp=tympanometry (type B to A transition)
Did any of the patients receive antibiotic or surgery?
cumulative resolution rates
| Ears Resolved Intervala | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Cohort ID | Diagnostic Methodb | antibioticc | surgeryc | <1m | <2m | <3m | <4m | <5m | <6m | <3y | Article(s) |
| Age not stratifiable | |||||||||||
| Casselbrant Id | algorithm | yes | yes | 92/137 (67%) | 109/137 (80%) | 130/137 (95%) | 134/137 (98%) | 136/137 (99%) | 137/137 (100%) | 1000 Casselbrant 1985 | |
| Renvall I | tymp | unknown | unknown | 282/335 (84%) | 2240 Renvall 1978 | ||||||
interval calculated from cohort inception and not cumulative, unless otherwise noted
algorithm=algorithm based on pneumatic otoscopy, tympanometry, and acoustic reflex, tymp=tympanometry (type B to A transition)
Did any of the patients receive antibiotic or surgery?
cumulative resolution rate
| Resolved intervala | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Cohort ID | Diagnostic methodb | antibioticc | surgeryc | <2wk | <1m | <6wk | <10wk | <3m | <4m | <6m | <8m | <1y | Article(s) |
| >3-year old cohorts | |||||||||||||
| Fiellau-Nikolajsen II | tymp | unknown | unknown | 28/81 (35%) | 46/80 (58%) | 53/78 (68%) | 1245 Fiellau-Nikolajsen 1983 | ||||||
| Marchisio | pneum oto, tymp | unknown | no | 325/451 (72%) | 9 Marchisio 1998 | ||||||||
| Portoian-Shuhaiber | tymp | unknown | unknown | 65/130 (50%) | 2184 Portoian-Shuhaiber 1984 | ||||||||
| Sly I | tymp | no | no | 1/7 (14%) | 3/7 (43%) | 5/7 (71%) | 2457 Sly 1980 | ||||||
| Sly II | tymp | no | no | 0/3 (0%) | 0/3 (0%) | 0/3 (0%) | 2457 Sly 1980 | ||||||
| Williamson | tymp | unknown | yes | 22/50 (44%) | 38/50 (76%) | 45/50 90%) | |||||||
interval calculated from cohort inception unless otherwise noted
oto=otoscopy, tymp=tympanometry (type B to A transition)
Did any of the patients receive antibiotic or surgery?
| Resolved intervala | ||||||
|---|---|---|---|---|---|---|
| Cohort ID | Diagnostic Methodb | antibioticc | surgeryc | <2m | <3m | Article(s) |
| Age not stratifiable | ||||||
| Mills | pnem oto, tymp | unknown | unknown | 57/192 (30%) | 1927 Mills 1992 | |
| Reves | tymp | unknown | unknown | 40/68 (59%) | 2243 Reves 1985 | |
| van Balen | tymp | unknown | unknown | 223/443 (50%) | 91 van Balen 1996 | |
interval calculated from cohort inception unless otherwise noted
oto=otoscopy, tymp=tympanometry (type B to A transition)
Did any of the patients receive antibiotic or surgery?
Data from a number of studies were reported in a format that made them unusable for quantitative syntheses. Ten studies were eliminated from quantitative syntheses, because they failed to stratify by age (i.e. less than and greater than 3 years old). Day-to-day variability in tympanogram types was described in a cohort of kindergarten children examined on each weekday for 30 days (Moller and Tos,1990). Ernston and Sundberg (1984) described a group of children who participated in a controlled trial of children with OME that persisted for at least 3 months; they found that 15.3 percent (11 of 72) of such children had OME resolution by five weeks followup. A study of children with OME that persisted for 3 months or more, showed 45 percent (49 of 109) OME resolution at 2.5 year followup (Leiberman and Bartal,1986). However, the investigators acknowledged that middle-ear effusion noted after such a long interval could be either persistent or recurrent. Birch and Elbrønd (1984) and Zielhuis, Rach, and van den Broek (1990) derived equations to describe the OME resolution rates they observed in their cohorts, but we were unable to abstract actual counts from these articles.
We performed two sets of meta-analyses that matched age groups, unit of analysis, outcome type, and time to resolution in three or more cohorts. All meta-analyses presented here used the ‘ear’ as the unit of analysis. Few studies considered the child or the episode as the unit of analysis and no meta-analyses were possible.
| ArticleID | Author | Criterion | Age at diagnosis | Antibiotic used? | Surgery performed? | Followup interval | Number ears resolved | Total number ears | Resoluation rate in % | Random Effects Pooled Estimate (95% CI) | Test of Heterogeity Q statistic (P-value) |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 2457 | Sly-1980 | B or C to A | 5yr | no | unknown | <6wk | 18 | 32 | 56.3 | ||
| 2457 | Sly-1980 | B or C to A | 5yr | no | unknown | <6wk | 11 | 22 | 50.0 | ||
| 1714 | Lamothe-1981 | Otoscopy | 6yr | no | unknown | <6wk | 25 | 53 | 47.2 | ||
| Total | 54 | 107 | 44.9 | 42.3 (24.1, 60.6) | 7.85 (p=0.02) | ||||||
| ArticleID | Author | Criterion | Age at diagnosis | Antibiotic used? | Surgery performed? | Followup interval | Number ears resolved | Total number ears | Resoluation rate in % | Random Effects Pooled Estimate (95% CI) | Test of Heterogeity Q statistic (P-value) |
| 2457 | Sly-1980 | B to A | 5yr | no | unknown | <6wk | 6 | 9 | 66.7 | ||
| 2457 | Sly-1980 | B to A | 5yr | no | unknown | <6wk | 0 | 5 | 0.0 | ||
| 1714 | Lamothe-1981 | Otoscopy | 6yr | no | unknown | <6wk | 25 | 53 | 47.2 | ||
| Total | 31 | 67 | 46.3 | 37.2 (1.8, 72.5) | 16.4 (p<0.001) | ||||||
Note: Lamothe's study used otoscopy and is included in all meta-analyses
| ArticleID | Author | Criterion | Age at diagnosis | Antibiotic used? | Surgery performed? | Followup interval | Number ears resolved | Total number ears | Resoluation rate in % | Random Effects Pooled Estimate (95% CI) | Test of Heterogeity Q statistic (P-value) |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1237 | Fiellau-Nikolajsen-1979 | B or C to A | 3–4yr | unknown | unknown | <3mo | 154 | 348 | 44.3 | ||
| 1235 | Fiellau-Nikolajsen-1979 | B or C to A | 3–4yr | unknown | unknown | <3mo | 83 | 200 | 41.5 | ||
| 543 | Tos-1982 | B or C to A | 4yr | unknown | unknown | <3mo | 103 | 393 | 26.2 | ||
| 2242 | Renvall-1982 | otoscopy | 4yr | no | unknown | <12wk | 86 | 144 | 59.7 | ||
| Total | 426 | 1085 | 39.3 | 42.7 (29.3,56.1) | 63.01 (p<0.001) | ||||||
| ArticleID | Author | Criterion | Age at diagnosis | Antibiotic used? | Surgery performed? | Followup interval | Number ears resolved | Total number ears | Resoluation rate in % | Random Effects Pooled Estimate (95% CI) | Test of Heterogeity Q statistic (P-value) |
| 1237 | Fiellau-Nikolajsen-1979 | B to A | 3–4yr | unknown | unknown | <3mo | 22 | 91 | 24.2 | ||
| 1235 | Fiellau-Nikolajsen-1979 | B to A | 3–4yr | unknown | unknown | <3mo | 16 | 62 | 25.8 | ||
| 543 | Tos-1982 | B to A | 4yr | unknown | unknown | <3mo | 3 | 87 | 3.4 | ||
| 2242 | Renvall-1982 | otoscopy | 4yr | no | unknown | <12wk | 16 | 40 | 40.0 | ||
| Total | 57 | 280 | 20.4 | 22.5 (5.9,39.0) | 44.28 (p<0.001) | ||||||
The Role of Influencing Factors in Resolution Similarly, because very few studies assessed the role of factors that might influence resolution, no meta-analyses were performed. The results of individual studies are summarized here:
Lamothe, Boudreault, Blanchette, and colleagues (1981) and Fiellau-Nikolajsen and Lous (1979) assessed the effects of gender and found quicker resolution among females than males.
Fiellau-Nikolajsen (1979) noted that children who received at-home care had quicker resolution of OME than did children in daycare.
Sly, Zambie, Fernandes, and colleagues (1980) compared OME resolution in small cohorts first studied in February with those first studied in September and found resolution to be more rapid in the February cohort.
Lamothe, Boudreault, Blanchette, and colleagues (1981) also assessed the effect of the side of the affected ear and noted more rapid resolution in affected right ears than affected left ears.
Portoain-Shuhaiber and Cullinan (1984) stratified by racial/ethnic origin and noted quicker resolution of OME in African children than in Indian or Caucasian children.
Moller and Tos (1990) found that different tympanometry instruments, which they described as impedance tympanoscopy and impedance audiometry, gave different rates of OME among the same group of children.
Zielhuis, Rach, and van den Broek (1990) found that season and age at the end of the episode had a statistically significant effect on OME resolution rates, while gender, upper respiratory tract infection, and history of AOM did not, although they presented only percentages without denominators.
We found sufficient data only to analyze the rates of resolution of OME among children older than 3 years old. No data existed on the role of such factors as duration of OME, initial occurrence of AOM, or presence of OME in one vs. both ears. Data on the roles of gender, at-home vs. day care, race/ethnicity, time of year, age of onset, or side of ear were too limited to draw any conclusions.
Do infants and preschool children with longer-duration early-life OME as compared to those with shorter duration OME have greater delays in speech and language development (receptive or expressive) later in life?
Is OME-associated conductive hearing loss in the first 3 years of life a risk factor for speech and language developmental delays?
What are the risk factors that interact with the effect of OME on speech and language development in infants and preschool children?
| ID# | Author | Year | Cohort |
|---|---|---|---|
| 1623 | Kaplan | 1973 | Eskimo villages in the Yukon and Kuskokwim River Delta areas of Southwestern Alaska |
| 1255 | Fischler | 1985 | Four Indian reservations in Arizona |
| 4657 | Roberts | 1986 | Frank Porter Graham Child Development Center, Chapel Hill, NC |
| 3118 | Roberts | 1988 | Frank Porter Graham Child Development Center, Chapel Hill, NC |
| 4806 | Roberts | 1988 | Frank Porter Graham Child Development Center, Chapel Hill, NC |
| 4656 | Roberts | 1989 | Frank Porter Graham Child Development Center, Chapel Hill, NC |
| 3117 | Roberts | 1991 | Frank Porter Graham Child Development Center, Chapel Hill, NC |
| 4319 | Roberts | 1995 | Frank Porter Graham Child Development Center, Chapel Hill, NC |
| 1373 | Gravel | 1992 | LIFE (Longitudinal Infant Follow-up and Evaluation) Program of the Rose F. Kennedy Center, Albert Einstein College of Medicine, Bronx, NY |
| 4728 | Gravel | 1996 | LIFE (Longitudinal Infant Follow-up and Evaluation) Program of the Rose F. Kennedy Center, Albert Einstein College of Medicine, Bronx, NY |
| 1941 | Mody | 1999 | LIFE (Longitudinal Infant Follow-up and Evaluation) Program of the Rose F. Kennedy Center, Albert Einstein College of Medicine, Bronx, NY |
| 2295 | Ruben | 1997 | LIFE (Longitudinal Infant Follow-up and Evaluation) Program of the Rose F. Kennedy Center, Albert Einstein College of Medicine, Bronx, NY |
| 1219 | Feagans | 1987 | Medical and Day Care Intervention Project in Pennsylvania |
| 2135 | Paul | 1993 | Portland Language Development Project (PLDP), Oregon |
| 4651 | Klein | 1988 | The Greater Boston Otitis Media Study Group, MA |
| 2583 | Teele | 1990 | The Greater Boston Otitis Media Study Group, MA |
| 1435 | Harsten | 1993 | University Hospital of Lund, Sweden |
| 877 | Black | 1993 | University of Maryland Medical System, Baltimore, MD |
| 4675 | Owen | 1996 | University of Texas Medical Branch, TX |
| 1277 | Freeark | 1992 | University-based pediatric clinic in Michigan |
1) OM was diagnosed/assessed before the age of 3 years, 2) speech or language outcome was measured at or before the age of 22 years, 3) a prospective cohort study design was used, 4) OM was graded, and 5) speech or language outcome was measured beyond 3 years of age. Of the 20 studies, 17 were prospective cohort studies and three (Freeark, 1992; Fischler, 1985; Paul, 1993) were retrospective-prospective studies in which OM history was retrospectively obtained, but outcome measures were prospectively obtained. We did not exclude any studies based on their quality.
When we reviewed the studies included in the 1994 OME Guideline for inclusion in our analysis (Stool, Berg, Berman et al., 1994), we excluded six of the studies. The Friel-Patti and Finitzo (1990) study was excluded, because it was not a prospective cohort study and did not report outcomes for children over 3 years of age. The Friel-Patti (1982) study was excluded because no outcomes were measured in children over 3 years of age. The studies of Lous and Fiellau-Nikolajsen (1988) and Rach, Ziehlhuis, van Baarle, and colleagues (1991 were excluded, because the investigators did not measure OM severity before the age of 3 years. The Rach, Zielhuis, and van den Broek (1988) study was excluded, because it was not a prospective cohort study. The study by Wright, Sell, McConnell, and colleagues (1988) was excluded, because no outcomes were measured in children over 3 years of age. Finally, unlike the OME Guideline (Stool, Berg, Berman et al., 1994), we included the studies by Klein (1988) and Freeark (1992), because study quality was not one of our rejection criteria.
| ID# | Author | Year | Cohort |
|---|---|---|---|
| 3119 | Roberts | 1995 | Frank Porter Graham Child Development Center, Chapel Hill, NC |
| 4841 | Wallace | 1988 | LIFE (Longitudinal Infant Follow-up and Evaluation) Program of the Rose F. Kennedy Center, Albert Einstein College of Medicine, Bronx, NY |
| 2739 | Wallace | 1988 | LIFE (Longitudinal Infant Follow-up and Evaluation) Program of the Rose F. Kennedy Center, Albert Einstein College of Medicine, Bronx, NY |
| 2740 | Wallace | 1988 | LIFE (Longitudinal Infant Follow-up and Evaluation) Program of the Rose F. Kennedy Center, Albert Einstein College of Medicine, Bronx, NY |
| 4842 | Wallace | 1992 | LIFE (Longitudinal Infant Follow-up and Evaluation) Program of the Rose F. Kennedy Center, Albert Einstein College of Medicine, Bronx, NY |
| 667 | Abraham | 1996 | LIFE (Longitudinal Infant Follow-up and Evaluation) Program of the Rose F. Kennedy Center, Albert Einstein College of Medicine, Bronx, NY |
| 4796 | Petinou | 1996 | LIFE (Longitudinal Infant Follow-up and Evaluation) Program of the Rose F. Kennedy Center, Albert Einstein College of Medicine, Bronx, NY |
| 2742 | Wallace | 1996 | LIFE (Longitudinal Infant Follow-up and Evaluation) Program of the Rose F. Kennedy Center, Albert Einstein College of Medicine, Bronx, NY |
| 3096 | Petinou | 1999 | LIFE (Longitudinal Infant Follow-up and Evaluation) Program of the Rose F. Kennedy Center, Albert Einstein College of Medicine, Bronx, NY |
| 4671 | Luloff | 1993 | Longitudinal study investigating the efficacy of drug prophylaxis on otitis media in greater Boston area, MA |
| 4673 | Tsushima | 1993 | Longitudinal study investigating the efficacy of drug prophylaxis on otitis media in greater Boston area, MA |
| 4674 | Wendler-Shaw | 1993 | Longitudinal study investigating the efficacy of drug prophylaxis on otitis media in greater Boston area, MA |
| 875 | Black | 1988 | Maryland Otitis Media Study Group, Baltimore |
| 4708 | Downs | 1988 | Not specified |
| 2719 | Vernon-Feagans | 1996 | Ongoing study of health and day-care in a semi-rural area of northeastern United States |
| 1288 | Friel-Patti | 1982 | Parkland Memorial Hospital, Dallas, TX |
| 4713 | Feldman | 1996 | Pittsburgh-area Child Development/Otitis Media Study Group, PA |
| 4642 | Paradise | 2000 | Pittsburgh-area Child Development/Otitis Media Study Group, PA |
| 2819 | Wright | 1988 | Pneumococcal vaccine study in Nashville, TN |
| 1677 | Knishkowy | 1991 | PROD (Promotion of Growth and Development) Program, Western Jerusalem |
| 2579 | Teele | 1984 | The Greater Boston Otitis Media Study Group, MA |
| 4664 | Feagans | 1994 | Three day-care facilities in central Pennsylvania, PA |
| ID# | Author Year | Definition of Positive/Negative OM History | Definition of OM | OM Diagnosis Method |
|---|---|---|---|---|
| 877 | Black 1993 | OM History defined by number of episodes of OM within the first year of life documented by otologic examination.
| Not provided |
|
| 1219 | Feagans 1987 | OM History described by frequency and duration of OM from 0 to 3 years. They were treated as continuous variables for correlation with outcome. No grouping of children by positive or negative history was done.
| Not provided. |
|
| 1255 | Fischler 1985 | OM History defined by number of OM episodes by age 2 years and over age 2.
| Any mention of one of the following:
|
|
| 1277 | Freeark 1992 | OM history severity defined by a) number of separate episodes of OM and b) total number of days of effusion over the first 3 years of life.
| Not specified |
|
| 1373 | Gravel 1992 | OM groups were defined by otoscopic histories during the first year of life.
| Not specified |
|
| 4728 | Gravel 1996 | Same as 1373 | Same as 1373 | Same as 1373 |
| 1435 | Harsten 1993 | OM groups were defined by the number of AOM episodes during the first 3 years of life.
|
|
|
| 1623 | Kaplan 1973 | OM groups were based on age of onset of first episode of otorrhea.
|
|
|
| 4651 | Klein 1988 | OM history was measured by time spent with effusion during the first 2 years of life and used a ‘window’ of 23 days to each observation of effusion, whether accompanied by signs of illness or not. It could be shortened or extended by multiple examinations.
|
|
|
| 1941 | Mody 1999 | OM history defined by pneumatic otoscopy findings during first year of life
|
|
|
| 4675 | Owen 1996 | OME history was measured by days or duration with OME durng the first 3 years of life. Middle ear status was monitored by home visits every 2 to 4 weeks, irrespective of symptoms for the first 3 years of life.
|
|
|
| 2135 | Paul 1993 | OM history was measured by middle ear involvement defined as either the placement of myringotomy tubes or the presence of six or more ear infections treated by a physician before the second birthday by parent report.
|
|
|
| 4657 | Roberts 1986 | OM history was based on total OME duration in days during first 3 years of life. Duration of each episode of unilateral and bilateral was calculated by subtracting the data of onset of OME from the resolution date. Days of total OME was analyzed both as a continuous and categorical variable.
|
|
|
| 4656 | Roberts 1989 | Same as 4657 | Same as 4657 | Same as 4657 |
| 3118, 4806 | Roberts 1988 | Same as 4657 | Same as 4657 | Same as 4657 |
| 3117 | Roberts 1991 | Same as 4657 | Same as 4657 | Same as 4657 |
| 4318 | Roberts 1995 | Same as 4657 | Same as 4657 | Same as 4657 |
| 2295 | Ruben 1997 | OM history was based on the findings of pneumatic otoscopy at every scheduled and sick visit during the first year of life
| Not specified |
|
| 2583 | Teele 1990 | OM history was measured by the number of days with middle ear effusion (MEE) by age 3 years. Unless documented to be shorter, each episode of MEE lasted 29 days. |
|
|
The age during which the outcome was measured also varied. This age ranged from one to 3 years. The diagnostic method and the examiner also varied. Several studies based diagnosis on chart review or parents' record. A few studies used pneumatic otoscopy performed by pediatricians, otolaryngologists, or trained professionals.
| ID | Author(s) Year | Study Quality Score a |
|---|---|---|
| 877 | Black 1993 | 3 (0,0,0,0,1,1,0,1) |
| 1219 | Feagans 1987 | 5 (1,1,1,0,0,1,0,1) |
| 1255 | Fischler 1985 | 5 (1,1,0,0,1,1,1,0) |
| 1277 | Freeark 1992 | 4 (1,0,0,0,0,1,1,1) |
| 1373 | Gravel 1992 | 5 (1,1,0,0,1,1,1,0) |
| 4728 | Gravel 1996 | 3 (0,1,0,0,0,1,1,0) |
| 1435 | Harsten 1993 | 6 (1,1,1,0,1,1,1,0) |
| 1623 | Kaplan 1973 | 6 (1,1,0,0,1,1,1,1) |
| 4651 | Klein 1988 | 4 (1,0,1,0,0,1,0,1) |
| 1941 | Mody 1999 | 4 (1,1,0,0,0,1,0,1) |
| 4675 | Owen 1996 | 4 (1,1,0,0,0,1,0,1) |
| 2135 | Paul 1993 | 3 (0,0,1,0,0,1,0,1) |
| 4657 | Roberts 1986 | 6 (1,0,1,0,1,1,1,1) |
| 4656 | Roberts 1989 | 6 (1,0,1,0,1,1,1,1) |
| 4806 | Roberts 1988 | 4 (1,0,1,0,0,1,0,1) |
| 3118 | Roberts 1988 | 4 (1,0,1,0,0,1,0,1) |
| 3117 | Roberts 1991 | 2 (0,0,0,0,0,1,0,1) |
| 4319 | Roberts 1995 | 3 (0,0,0,0,0,1,1,1) |
| 2295 | Ruben 1997 | 5 (1,1,0,0,1,1,1,0) |
| 2583 | Teele 1990 | 6 (1,0,1,0,1,1,1,1) |
The eight components of study quality score are: (1) study cohort clearly defined; (2) subjects assembled at a uniform time point; (3) pathway of subject entry clearly described; (4) complete follow-up achieved; (5) withdrawals/drop-outs described; (6) objective outcomes used; (7) outcome assessment blinded; and (8) extraneous factors adjusted. ‘1’ indicates presence and ‘0’ indicates absence.
| ID# | Author Year | Cohort | Age of OM History | Age at Outcome Measure | Major Outcome statistic | Outcome Measure | Test | Notes |
|---|---|---|---|---|---|---|---|---|
| 4675 | Owen 1996 | Texas | 0–3 years | 5 years | Correlation | Cognition | Stanford Binet | |
| 4319 | Roberts 1995 | North Carolina | 2 months- 3 years | 12 years | Correlation | Cognition | WISC-R | |
| 1623 | Kaplan 1973 | Alaska | 0–1 year | 10 years | Mean (range) | Cognition | WISC | Stratified by concurrent hearing status |
| 877 | Black 1993 | Maryland | 8–22 months | 4–6 years | Mean (SD) | Cognition | McCarthy | |
| 1373 | Gravel 1992 | New York | 0–1 year | 4 years | Mean (SD) | Cognition | Stanford-Binet | |
| 4657 | Roberts 1986 | North Carolina | 2 months- 3 years | 3.5–6 years | Mean (SD) | Cognition | McCarthy | |
| 4656 | Roberts 1989 | North Carolina | 2 months- 3 years | 8 years | Mean (SD) | Cognition | WISC-R | |
| 4651 | Klein 1988 | Massachusetts | 0–2 years | 7 years | Multivariate | Cognition | WISC-R | |
| 2583 | Teele 1990 | Massachusetts | 0–2 years | 7 years | Multivariate | Cognition | WISC-R | Adjusted for SES and gender |
| 3118/4806 | Roberts 1988 | North Carolina | 2 months- 3 years | 2.5–8 years | Correlation | Expressive language | Elicited language play | |
| 3117 | Roberts 1991 | North Carolina | 2 months- 3 years | 4.5–6 years | Correlation | Expressive language | CELF | Stratified by socioeconomic status |
| 4675 | Owen 1996 | Texas | 0–3 years | 5 years | Correlation | Expressive language | Goldman-Fristoe test | |
| 1255 | Fischler 1985 | Arizona | 0–2 years | 6–8 years | Mean (SD) | Expressive language | TOLD | |
| 1373 | Gravel 1992 | New York | 0–1 year | 4 years | Mean (SD) | Expressive language | SICD-R in months | |
| 2135 | Paul 1993 | Oregon | 0–2 years | 4 years | Mean (SD) | Expressive language | MLU | Stratified by normal/late talkers |
| 1219 | Feagans 1987 | Pennsylvania | 0–3 years | 5–7 years | Multivariate | Expressive language | MLU, Paraphrase | Reported for total group only |
| 2583 | Teele 1990 | Massachusetts | 0–2 years | 7 years | Multivariate | Expressive language | WUG test | Mean adjusted for SES and gender |
| 2295 | Ruben 1997 | New York | 0–1 year | 2–9 years, yearly | Percent difference | Expressive language | Unknown | |
| 1277 | Freeark 1992 | Michigan | 0–3 years | 3–4 years | Proportion | Expressive language | Verbal Scale Index | Stratified by Parent Verbal Stimulation (PVS) |
| 4728 | Gravel 1996 | New York | 0–1 year | 9 years | Raw data not reported | Expressive language | WRAML | Only statistical testing was reported |
| 3117 | Roberts 1991 | North Carolina | 2 months- 3 years | 4.5–6 years | Correlation | Receptive language | CELF | Stratified by socioeconomic status (SES) |
| 3117 | Roberts 1991 | North Carolina | 2 months- 3 years | 4.5–6 years | Correlation | Receptive language | PPVT-R | Stratified by SES |
| 4675 | Owen 1996 | Texas | 0–3 years | 5 years | Correlation | Receptive language | CAVAT | |
| 1255 | Fischler 1985 | Arizona | 0–2 years | 6–8 years | Mean (SD) | Receptive language | TOLD | |
| 877 | Black 1993 | Maryland | 8–22 months | 4–6 years | Mean (SD) | Receptive language | PPVT-R | |
| 1373 | Gravel 1992 | New York | 0–1 year | 4 years | Mean (SD) | Receptive language | SICD-R in months | |
| 2583 | Teele 1990 | Massachusetts | 0–2 years | 7 years | Multivariate | Receptive language | WUG test | Adjusted for SES and gender |
| 2295 | Ruben 1997 | New York | 0–1 year | 2–9 years, yearly | Percent difference | Receptive language | Unknown | |
| 4728 | Gravel 1996 | New York | 0–1 year | 9 years | No raw data | Receptive language | CELF-R | Statistical significance only |
| 3118/4806 | Roberts 1988 | North Carolina | 2 months- 3 years | 2.5–8 years | Correlation | Expressive speech | Goldman-Fristoe | |
| 2135 | Paul 1993 | Oregon | 0–2 years | 4 years | Mean (SD) | Expressive speech | Goldman-Fristoe | Stratified by normal/late talkers |
| 2583 | Teele 1990 | Massachusetts | 0–2 years | 7 years | Multivariate | Expressive speech | Goldman-Fristoe | Adjusted for SES and gender |
| 2295 | Ruben 1997 | New York | 0–1 year | 2–9 years, yearly | Percent difference | Expressive speech | Unknown | |
| 3118/4806 | Roberts 1988 | North Carolina | 2 months- 3 years | 2.5–8 years | Correlation | Receptive speech | Articulation tests | |
| 1941 | Mody 1999 | New York | 0–1 year | 9 years | Mean (SD) | Receptive speech | Synthetic speech syllables | |
| 1435 | Harsten 1993 | Sweden | 0–3 years | 4 years | Proportion | Receptive speech | Linguistic analysis | |
| 4728 | Gravel 1996 | New York | 0–1 year | 9 years | No raw data | Receptive speech | Pediatric Speech Intelligibility | Only statistical testing result was reported |
CAVAT= Carrow Elicited Language Inventory
CELF= Clinical Evaluation of Language Functions
MLU= Mean Length of Utterance
PPVT-R= Peabody Picture Vocabulary Test-Revised
SD= Standard Deviation
SICD-R= Sequenced Inventory of Communication Development-Revised
TOLD= Test of Language Development
WISC-R= Wechsler Intelligence Scale for Children-Revised
WRAML= Wide Range Assessment of Memory and Learning
WUG= “WUG” test (Berko-Gleason)
The factor we used to determine which studies to examine further by meta-analysis was the major outcome measure, which indicates the type of statistical measured used. Whenever three or more studies reported the same outcome measure, we considered pooling the data. Based on this strategy, we conducted three possible meta-analyses to derive the pooled difference between positive and negative early-life otitis media for: expressive language development, receptive language development, and development of cognitive verbal intelligence.
| ID Number | Author-Year | Cohort | Age of OM history | Age of outcome measure | Name of Test | Positive OM History | Negative OM History | Standardized Mean Difference (95% CI) | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| N | Mean | SD | N | Mean | SD | |||||||
| 1255 | Fischler-1985a | Arizona | 0–2yr | 6–8 yrs | TOLD | 33 | 60.0 | 20.4 | 71 | 64.8 | 28.8 | - 0.18 (-0.59, 0.23) |
| 1373 | Gravel-1992 | New York | 0–1yr | 4yrs | SICD-R | 8 | 36.0 | 5.2 | 12 | 39.0 | 6.2 | - 0.49 (-1.40, 0.42) |
| 2135 | Paul-1993a | Oregon | 0–2yr | 4yrs | MLU | 8 | 57.8 | 3.8 | 13 | 54.6 | 10.7 | 0.35 (-0.54, 1.24) |
| Random Effects estimate | -0.14 (-0.49, 0.20) | |||||||||||
| Test of standardized mean difference equals 0: z=0.82; p=0.413. | ||||||||||||
| Test of heterogeneity: Chi-squared=1.77 (degrees of freedom=2); p-value=0.412. | ||||||||||||
Retrospective-prospective studies.
OM=Otitis media
TOLD=Test of Language Development
SICD-R=Sequenced Inventory of Communication Development-Revised
MLU=Mean Length of Utterance
N=Number of subjects
SD=Standard deviation
CI=Confidence interval
| ID Number | Author-Year | Cohort | Age of OM history | Age of outcome measure | Name of Test | Positive OM History | Negative OM History | Standardized Mean Difference (95% CI) | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| N | Mean | SD | N | Mean | SD | |||||||
| 877 | Black-1993 | Maryland | 8–22 mos | 4–6 yrs | PPVT-R | 21 | 83 | 17 | 10 | 72 | 18 | 0.62 (-0.15, 1.39) |
| 1255 | Fischler a -1985 | Arizona | 0–2yr | 6–8 yrs | TOLD | 33 | 67 | 28 | 71 | 73 | 32 | -0.19 (-0.61, 0.22) |
| 1373 | Gravel-1992 | New York | 0–1yr | 4yr | SICD-R | 8 | 36 | 5 | 13 | 38 | 5 | -0.38 (-1.27, 0.51) |
| 2579 | Teele-1990 | Boston | 0–2yr | 3yr | PPVT-R | 52 | 101 | 17 | 80 | 96 | 15 | 0.31 (-0.04, 0.67) |
| Random effects estimate | 0.10 (-0.29, 0.49) | |||||||||||
| Test of standardized mean difference equals 0: z=0.52; p=0.606. | ||||||||||||
| Test of heterogeneity: Chi-squared=6.22 (degrees of freedom=3); p=0.102. | ||||||||||||
Retrospective-prospective study.
PPVT-R=Peabody Picture Vocabulary Test-Revised
TOLD=Test of Language Development
SICD-R=Sequenced Inventory of Communication Development-Revised
N=Number of subjects
SD=Standard deviation
CI=Confidence interval
| ID Number | Author-Year | Cohort | Age of OM history | Age of outcome measure | Name of Test | Positive OM History | Negative OM History | Standardized Mean Difference (95% CI) | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| N | Mean | SD | N | Mean | SD | |||||||
| 877 | Black-1993 | MD | 8–22 mos | 4–6 yrs | McCarthy | 21 | 46.7 | 11.5 | 10 | 41.0 | 10.7 | 0.49 (-0.27, 1.26) |
| 1373 | Gravel-1992 | NY | 0–1yr | 4yrs | Stanford-Binet | 9 | 88.3 | 15.9 | 13 | 84.3 | 9.4 | 0.31 (-0.55, 1.17) |
| 4657 | Roberts-1986 | NC | 2mos-3yrs | 3.5–6yrs | McCarthy | 19 | 52.0 | 8.0 | 19 | 52.0 | 9.0 | 0.00 (-0.64, 0.64) |
| Random effects estimate | 0.23 (-0.20, 0.65) | |||||||||||
| Test of standardized mean difference equals 0: z=1.05; p=0.292. | ||||||||||||
| Test of heterogeneity: Chi-squared=0.99 (degrees of freedom=2); p=0.609. | ||||||||||||
OM=Otitis media
N=Number of subjects
SD=Standard deviation
CI=Confidence interval
The data do not support an effect of early-life OME on language development or cognitive verbal intelligence. However, differences among the cohorts and study conditions and the wide 95% confidence intervals make it difficult to conclude that there is no effect. We found insufficient data to assess early-life OME on speech development.
Do infants and preschool children with longer duration early life OME as compared to those with shorter duration OME have permanent (or sensorineural) hearing loss later in life? One specific formulation of this question is: Is OME-associated conductive hearing loss in the first 3 years of life a risk factor for permanent (or sensorineural) hearing loss later in life?
What are the risk factors that interact with the effect of OME on hearing later in life (unilateral or bilateral) in infants and preschool children?
| ID# | Author-Year | Rejection Reason |
|---|---|---|
| 2221 | Rahko-1995 | Otitis media not measured at less than 3 years of age |
| 2762 | Webster-1989 | Case control study |
| 4728 | Gravel-1996 | No significant difference observed but no hearing data reported |
| 4846 | Wright-1984 | Hearing data at 3–4 years not abstractable. |
| ID# | Author-Year | Cohort |
|---|---|---|
| 147 | Sorri-1995 | Birth cohort from Northern Finland |
| 1255 | Fischler-1985 | Four Indian reservations in Arizona |
| 1373 | Gravel-1992 | LIFE (Longitudinal Infant Follow-up and Evaluation) Program of the Rose F. Kennedy Center, Albert Einstein College of Medicine, Bronx, NY |
| 1435 | Harsten-1993 | University Hospital of Lund, Sweden |
| 1623 | Kaplan-1973 | Eskimo villages in the Yukon and Kuskokwim River Delta areas of Southwestern Alaska |
| 2233 | Reed-1967 | Eskimo villages in the Yukon and Kuskokwim River Delta areas of Southwestern Alaska |
| 2309 | Ryding-1997 | University Hospital of Lund, Sweden |
| 2854 | Zargi-1992 | University of Ljubljana, Ljubljana, Slovenia |
| ID# | Author-Year | Cohort |
|---|---|---|
| 2264 | Roberts-1998 | Frank Porter Graham Child Development Center, Chapel Hill, NC |
| 4808 | Roberts-1988 | Frank Porter Graham Child Development Center, Chapel Hill, NC |
| 667 | Abraham-1996 | LIFE (Longitudinal Infant Follow-up and Evaluation) Program of the Rose F. Kennedy Center, Albert Einstein College of Medicine, Bronx, NY |
| 2740 | Wallace-1988 | LIFE (Longitudinal Infant Follow-up and Evaluation) Program of the Rose F. Kennedy Center, Albert Einstein College of Medicine, Bronx, NY |
| 4680 | Gravel-2000 | LIFE (Longitudinal Infant Follow-up and Evaluation) Program of the Rose F. Kennedy Center, Albert Einstein College of Medicine, Bronx, NY |
| 1288 | Friel-Patti-1982 | Parkland Memorial Hospital, Dallas, TX |
| 2819 | Wright-1988 | Pneumococcal vaccine study in Nashville, TN |
| 1677 | Knishkowy-1991 | PROD (Promotion of Growth and Development) Program, Western Jerusalem |
| 1544 | Hutchings-1992 | Six general practices in Oxford, England |
| 4838 | Vernon-Feagans-1996 | Three day-care facilities in central Pennsylvania, PA |
| ID# | Author Year | Definition of Positive/Negative OM History | Definition of OM | OM Diagnosis Method |
|---|---|---|---|---|
| 147 | Sorri 1995 | OM history was obtained from all possible sources (health care centers, hospitals, and private surgeries). Only children with clear-cut differences in their history were considered.
| Not specified | Not specified |
| 1255 | Fischler 1985 |
| Any mention of one of the following:
|
|
| 1373 | Gravel 1992 | OM groups were defined by otoscopic histories during the first year of life.
| Not specified |
|
| 1435 | Harsten 1993 | OM groups were defined by the number of AOM episodes during the first 3 years of life.
|
|
|
| 1623 | Kaplan 1973 | OM groups were based on age of onset of first episode of otorrhea.
|
|
|
| 2233 | Reed 1967 | Same study population as 1623. This article reported findings at 3–5 years. Article 1623 reported findings at 10 years of age. | Same as 1623 | Same as 1623 |
| 2309 | Ryding 1997 | OM history groups were defined by the number of recurrent AOM during the first 3 years of life.
|
|
|
| 2854 | Zargi 1992 | OM history based on parental interviews and by review of hospital charts and other medical documentation.
| Not specified | Not specified |
| ID | Author Year | Study Quality Scorea |
|---|---|---|
| 147 | Sorri 1995 | 4 (0,1,0,0,1,1,0,1) |
| 1255 | Fischler 1985 | 5 (1,1,0,0,1,1,1,0) |
| 1373 | Gravel 1992 | 5 (1,1,0,0,1,1,0,1) |
| 1435 | Harsten 1993 | 6 (1,1,1,0,1,1,1,0) |
| 1623 | Kaplan 1973 | 5 (1,1,0,0,1,1,1,0) |
| 2233 | Reed 1967 | 4 (1,1,0,0,0,1,0,1) |
| 2309 | Ryding 1997 | 5 (0,1,1,0,0,1,1,1) |
| 2854 | Zargi 1992 | 2 (1,0,0,0,0,1,0,0) |
The eight components of study quality score are: (1) study cohort clearly defined; (2) subjects assembled at a uniform time point; (3) pathway of subject entry clearly described; (4) complete follow-up achieved; (5) withdrawals/drop-outs described; (6) objective outcomes used; (7) outcome assessment blinded; and (8) extraneous factors adjusted. ‘1’ indicates presence and ‘0’ indicates absence.
| ID Number | Author Year | Age at OM history | Age at outcome measure | Outcome measured in % hearing loss | Outcome measured in mean pure tone | Other outcome measure |
|---|---|---|---|---|---|---|
| 147 | Sorri 1995 | 0–2yrs | 7yrs | >20 dB pure tone averages, type not specified | Mean air-conduction (AC) threshold, right/left ear | |
| 1255 | Fischler 1985 | 0–2yrs | 6–8yrs | >25 dB at 500Hz; | ||
| >20 dB at 1000 Hz; | ||||||
| >20 dB at 2000 Hz; | ||||||
| >25 dB at 4000 Hz; | ||||||
| >25 dB at 6000 Hz. | ||||||
| Pure tone type not specified. | ||||||
| 1373 | Gravel 1992 | 0–1yrs | 4 yrs | Mean pure tone averages obtained at octave frequencies from 500 through 4000 Hz at a minimum, right/left ear | Pediatric Speech Intelligibility sentence (PSI S) to competing messages (CM) ratio | |
| 1435 | Harsten 1993 | 0–3yrs | 4yrs | >=25 dB tone-audiometry at any frequency, type not specified. | ||
| 1435 | Harsten 1993 | 0–3yrs | 7yrs | >=25 dB tone-audiometry at any frequency, type not specified. | ||
| 1623 | Kaplan 1973 | 0–1yrs | 10yrs | >=25 dB air and bone conduction | ||
| 2233 | Reed 1967 | 0–2yrs | 3–5yrs | >25 dB pure tone air averages | ||
| 2309 | Ryding 1997 | 0–3yrs | 10yrs | Median level of air conduction hearing, right/left ear | ||
| 2854 | Zargi 1992 | 0–2yrs | 6–8yrs | >10 dB for air-conduction hearing loss | Sensorineural hearing loss | |
dB=decibel
PSI=Pediatric Speech Intelligibility
S=Primary sentence
CM=Competing messages
| Author Year | Age of OM history | Age at Hearing Testing | OM+ Sample Size | OM-Sample Size | OM+ Percent Hearing Loss | OM-Percent Hearing Loss | Rate Difference in % | 95% CI of Rate Difference in % | Risk Ratio | 95% CI of Risk Ratio |
|---|---|---|---|---|---|---|---|---|---|---|
| Sorri 1995 | 0–2yrs | 7yrs | 64 | 35 | 51.6 | 20.0 | 31.6 | (13.5, 49.6) | 2.6 | (1.3, 5.2) |
| Fischler 1985 | 0–2yrs | 6–8yrs | 96 | 70 | 9.4 | 1.4 | 7.9 | ( 1.5, 14.4) | 6.6 | (0.8, 50.6) |
| Harsten 1993 | 0–3yrs | 7yrs | 24 | 56 | 8.3 | 5.4 | 3.0 | (- 9.6, 15.5) | 1.6 | (0.3, 8.7) |
| Kaplan 1973 | 0–1yrs | 10yrs | 162 | 76 | 19.8 | 7.9 | 11.9 | ( 3.2, 20.5) | 2.5 | (1.1, 5.7) |
| Random effects estimates | 346 | 237 | 21.7 | 6.4 | 11.3 | ( 3.3, 19.3) | 2.6 | (1.6, 4.2) | ||
| Test of heterogeneity Chi-square test value | 40.3 | 10.5 | 7.3 | 1.1 | ||||||
| Test of heterogeneity Chi-square test p-value | <0.001 | 0.015 | 0.064 | 0.768 | ||||||
| The following analysis excluded article by Sorri. | ||||||||||
| Fischler 1985 | 0–2yrs | 6–8yrs | 96 | 70 | 9.4 | 1.4 | 7.9 | (1.5,14.4) | 6.6 | (0.8,50.6) |
| Harsten 1993 | 0–3yrs | 7yrs | 24 | 56 | 8.3 | 5.4 | 3.0 | (-9.6,15.5) | 1.6 | (0.3,8.7) |
| Kaplan 1973 | 0–1yrs | 10yrs | 162 | 76 | 19.8 | 7.9 | 11.9 | (3.2,20.5) | 2.5 | (1.1,5.7) |
| Random effects estimates | 282 | 202 | 13.0 | 4.2 | 8.4 | (3.6,13.2) | 2.6 | (1.3,5.2) | ||
| Test of heterogeneity Chi-square test value | 6.8 | 4.4 | 1.4 | 1.1 | ||||||
| Test of heterogeneity Chi-square test p-value | 0.034 | 0.114 | 0.508 | 0.566 | ||||||
Hearing Loss was at >20–25 dB threshold at any frequency with or without treatment measured at 6–10 years of age.
Sorri, Fischler and Harsten did not specify type of pure-tone test used in defining hearing loss. Kaplan used air and bone conduction.
Abbreviations: OM+: positive otitis media history; OM-: negative otitis media history; CI: confidence interval.
The results support that history of early-life OME is associated with increased risk for conductive hearing loss. However, the number of studies with similar outcome measures is small. We found insufficient data to assess the early-life OM on permanent hearing loss.
What are the sensitivity, specificity, and predictive values for alternative methods of diagnosing OME compared with one of the reference standards?
These methods include, but are not limited to:
signs/symptoms
non-pneumatic otoscopy
pneumatic otoscopy, validated or non-validated examiner
binaural micro-tympanoscopy
portable tympanometry
professional tympanometry
quantitative tympanometry
acoustic reflectometry (specify model and year)
otoacoustic emissions
audiometry, air or bone conduction thresholds.
After secondary and tertiary screening of the 449 articles we retrieved for review, we identified 75 articles that fell within the scope of this question. When we compared our list with the 1994 OME Guideline (Stool, Berg, Berman et al., 1994), we found five studies that were included in the 1994 OME Guideline but not in our assessment. We excluded the Kaleida (1992) and the Shurin, Pelton and Finkelstein (1977) studies, because data were not abstractable. The McDermott and Giebink, Le, and colleagues (1983) and the Teele and Teele (1984) studies were excluded, because they did not address the scope of this question. The Lampe, Weir, Spier, and colleagues (1985) study was excluded, because it was a duplicate of another study. We included three studies that were rejected by the developers of the 1994 OME Guideline, because we did not reject any studies based on study quality. These were the studies by Haughton (1977), Karma (1989), and Marchart (1986).
| ID Number | Author, Year |
|---|---|
| 759 | Avery 1986 |
| 766 | Babonis 1991 |
| 784 | Barnett 1998 |
| 810 | Beery 1975 |
| 817 | Ben-David 1981 |
| 886 | Block 1998 |
| 888 | Bluestone 1973 |
| 889 | Bluestone 1979 |
| 989 | Cantekin 1977 |
| 990 | Cantekin 1980 |
| 1238 | Fiellau-Nikolajsen 1980 |
| 1241 | Fiellau-Nikolajsen 1980 |
| 1245 | Fiellau-Nikolajsen 1983 |
| 1250 | Finitzo 1992 |
| 1280 | Freyss 1980 |
| 1282 | Fria 1980 |
| 4879 | Fried 1985 |
| 4878 | Gersdorff 1986 |
| 1384 | Grimaldi 1976 |
| 1397 | Haapaniemi 1997 |
| 1446 | Haughton 1977 |
| 3022 | Johnson 1980 |
| 1600 | Jonathan 1989 |
| 1632 | Karma 1989 |
| 1646 | Kemaloglu 1999 |
| 1650 | Kennedy 1982 |
| 1685 | Koivunen 1997 |
| 1785 | Lovette 1976 |
| 1804 | Macknin 1987 |
| 1817 | Mains 1989 |
| 1837 | Marchant 1986 |
| 1936 | Mitchell 1990 |
| 2012 | Nozza 1992 |
| 2013 | Nozza 1994 |
| 2048 | Orchik 1978 |
| 2049 | Orchik 1978 |
| 2050 | Orchik 1980 |
| 2055 | Ovesen 1993 |
| 2058 | Oyiborhoro 1987 |
| 2118 | Paradise 1976 |
| 4790 | Paradise 1996 |
| 4793 | Park 1988 |
| 2236 | Rees 1992 |
| 4804 | Renvall 1996 |
| 2344 | Sassen 1994 |
| 2412 | Shaw 1978 |
| 2545 | Szucs 1995 |
| 2601 | Tom 1994 |
| 2607 | Toner 1990 |
| 2675 | van Balen 1994 |
| 2713 | Vaughan-Jones 1992 |
| 2758 | Watters 1997 |
| ID Number | Author(s) Year | Rejection Reason |
|---|---|---|
| 2877 | Alho 1998 | Data not abstractable |
| 694 | Amedee 1995 | Diagnostic procedure and gold standard greater than 24 hours apart. |
| 887 | Block 1999 | Not addressing OME |
| 912 | Boswell 1993 | No gold standard |
| 968 | Buhrer 1985 | No gold standard |
| 1015 | Chang 1998 | Not a diagnostic study |
| 1149 | Douniadakis 1993 | No gold standard |
| 1167 | Duncan 1982 | Diagnostic procedure and gold standard greater than 24 hours apart. |
| 1233 | Fields 1993 | Diagnostic procedure and gold standard greater than 24 hours apart. |
| 1236 | Fiellau-Nikolajsen 1979 | Diagnostic procedure and gold standard greater than 24 hours apart. |
| 1239 | Fiellau-Nikolajsen 1980 | Diagnostic procedure and gold standard greater than 24 hours apart. |
| 1281 | Fria 1980 | Procedure not in scope |
| 4749 | Kaleida 1992 | Data not abstractable |
| 2014 | Nozza 1997 | Not all referrals had gold standard |
| 2070 | Palmu 1999 | Only 42/242 ears had myringotomy |
| 2145 | Pellett 1997 | No gold standard |
| 2385 | Schwartz 1987 | No gold standard |
| 2434 | Silman 1992 | No gold standard |
| 2435 | Silman 1994 | No gold standard |
| 2438 | Silva 1997 | Data not abstractable |
| 2442 | Silverman 1995 | No gold standard |
| 2556 | Takahashi 1999 | Data not abstractable |
| 2786 | Williams 1977 | Diagnostic procedure and gold standard greater than 24 hours apart. |
Of the 52 studies, 33 (63 percent) scored three points or fewer on our six-point quality scale. Of the 19 studies that scored more than three points, 15 studies scored four points, three studies scored five points, and one study scored six points. Most of the studies in this group are of poor quality, a finding that replicates those of Lijmer, Mol, Heisterkamp and colleagues (Lijmer, Mol, Heisterkamp et al., 1999).
| Diagnostic Method | Reference Standard | |||
|---|---|---|---|---|
| Myringotomy | Tympanocentesis | Validated pneumatic otoscopy | Total | |
| Acoustic reflectometry | 6 | 1 | 3 | 10 |
| Audiometry-air and bone conduction thresholds | 2 | 2 | ||
| Audiometry-air conduction threshold | 4 | 1 | 5 | |
| Binaural micro-tympanoscopy | 2 | 2 | ||
| Non-pneumatic otoscopy | 4 | 4 | ||
| Pneumatic otoscopy-examiner validation not specified | 3 | 3 | ||
| Pneumatic otoscopy-unvalidated examiner | 9 | 9 | ||
| Pneumatic otoscopy-validated examiner | 1 | 1 | ||
| Portable tympanometer | 8 | 8 | ||
| Professional tympanometry | 35 | 6 | 6 | 47 |
| Quantitative tympanometry | 4 | 4 | ||
| Signs/symptoms | 2 | 2 | ||
| Total | 80 | 8 | 9 | 97 |
Note: A record can be counted more than once within a cell (i.e. controlling for intervention group.)
| Random Effect Estimate | Test of Heterogeneity | |||||||
|---|---|---|---|---|---|---|---|---|
| Measure | Author-Year | X | N | % | % | 95% CI | Q | P-Value |
| Sensitivity | Fried-1985 | 44 | 62 | 71.0 | ||||
| Macknin-1987 | 84 | 128 | 65.6 | |||||
| Babonis-1991 | 68 | 118 | 57.6 | |||||
| Total | 196 | 308 | 63.6 | 64.2 | (57.0, 71.5) | 3.6 | 0.168 | |
| Specificity | Fried-1985 | 36 | 40 | 90.0 | ||||
| Macknin-1987 | 43 | 70 | 61.4 | |||||
| Babonis-1991 | 90 | 102 | 88.2 | |||||
| Total | 169 | 212 | 79.7 | 80.4 | (65.0, 95.9) | 18.4 | <0.001 | |
| Prevalence | Fried-1985 | 62 | 102 | 60.8 | ||||
| Macknin-1987 | 128 | 198 | 64.6 | |||||
| Babonis-1991 | 118 | 220 | 53.6 | |||||
| Total | 308 | 520 | 59.2 | 59.6 | (52.5, 66.7) | 5.4 | 0.067 | |
| Random Effect Estimate | Test of Heterogeneity | |||||||
|---|---|---|---|---|---|---|---|---|
| Measure | Author-Year | X | N | % | % | 95% CI | Q | P-Value |
| Sensitivity | Paradise-1976 | 136 | 138 | 98.6 | ||||
| Cantekin-1977 | 213 | 230 | 92.6 | |||||
| Bluestone-1979 | 242 | 256 | 94.5 | |||||
| Karma-1989 | 726 | 753 | 96.4 | |||||
| Mains-1989 | 102 | 116 | 87.9 | |||||
| Toner-1990 | 108 | 124 | 87.1 | |||||
| Finitzo-1992 | 107 | 115 | 93.0 | |||||
| Total | 1634 | 1732 | 94.3 | 93.8 | (91.4, 96.3) | 28.8 | <0.001 | |
| Specificity | Paradise-1976 | 56 | 75 | 74.7 | ||||
| Cantekin-1977 | 113 | 140 | 80.7 | |||||
| Bluestone-1979 | 131 | 169 | 77.5 | |||||
| Karma-1989 | 277 | 339 | 81.7 | |||||
| Mains-1989 | 84 | 93 | 90.3 | |||||
| Toner-1990 | 87 | 98 | 88.8 | |||||
| Finitzo-1992 | 28 | 48 | 58.3 | |||||
| Total | 776 | 962 | 80.7 | 80.5 | (75.1, 86.0) | 27.2 | <0.001 | |
| Prevalence | Paradise-1976 | 138 | 213 | 64.8 | ||||
| Cantekin-1977 | 230 | 370 | 62.2 | |||||
| Bluestone-1979 | 256 | 425 | 60.2 | |||||
| Karma-1989 | 753 | 1092 | 69.0 | |||||
| Mains-1989 | 116 | 209 | 55.5 | |||||
| Toner-1990 | 124 | 222 | 55.9 | |||||
| Finitzo-1992 | 115 | 163 | 70.6 | |||||
| Total | 1732 | 2694 | 64.3 | 62.8 | (58.3, 67.2) | 30.7 | <0.001 | |
| Random Effect Estimate | Test of Heterogeneity | |||||||
|---|---|---|---|---|---|---|---|---|
| Measure | Author-Year | X | N | % | % | 95% CI | Q | P-Value |
| Sensitivity | Orchik-1978 | 21 | 39 | 53.8 | ||||
| Babonis-1991 | 92 | 118 | 78.0 | |||||
| Rees-1992 | 260 | 260 | 100.0 | |||||
| Vaughan-Jones-1992 | 120 | 135 | 88.9 | |||||
| van Balen-1994 | 147 | 156 | 94.2 | |||||
| Koivunen-1997 | 52 | 66 | 78.8 | |||||
| Total | 692 | 774 | 89.4 | 84.5 | (76.0, 93.1) | 39.6 | <0.001 | |
| Specificity | Orchik-1978 | 35 | 37 | 94.6 | ||||
| Babonis-1991 | 84 | 102 | 82.4 | |||||
| Rees-1992 | 9 | 50 | 18.0 | |||||
| Vaughan-Jones-1992 | 41 | 65 | 63.1 | |||||
| van Balen-1994 | 37 | 77 | 48.1 | |||||
| Koivunen-1997 | 137 | 175 | 78.3 | |||||
| Total | 343 | 506 | 67.8 | 64.4 | (44.3, 84.4) | 167.1 | <0.001 | |
| Prevalence | Orchik-1978 | 39 | 76 | 51.3 | ||||
| Babonis-1991 | 118 | 220 | 53.6 | |||||
| Rees-1992 | 260 | 310 | 83.9 | |||||
| Vaughan-Jones-1992 | 135 | 200 | 67.5 | |||||
| van Balen-1994 | 156 | 233 | 67.0 | |||||
| Koivunen-1997 | 66 | 241 | 27.4 | |||||
| Total | 774 | 1280 | 60.5 | 58.5 | (40.3, 76.7) | 268.9 | <0.001 | |
| Random Effect Estimate | Test of Heterogeneity | |||||||
|---|---|---|---|---|---|---|---|---|
| Measure | Author-Year | X | N | % | % | 95% CI | Q | P-Value |
| Sensitivity | Orchik-1978 | 35 | 39 | 89.7 | ||||
| Orchik-1980 | 34 | 39 | 87.2 | |||||
| Nozza-1992 | 71 | 81 | 87.7 | |||||
| Nozza-1994 | 106 | 124 | 85.5 | |||||
| Total | 246 | 283 | 86.9 | 87.1 | (83.2, 91.0) | 0.6 | 0.901 | |
| Specificity | Orchik-1978 | 29 | 37 | 78.4 | ||||
| Orchik-1980 | 26 | 37 | 70.3 | |||||
| Nozza-1992 | 26 | 30 | 86.7 | |||||
| Nozza-1994 | 61 | 94 | 64.9 | |||||
| Total | 142 | 198 | 71.7 | 74.8 | (64.6, 85.0) | 8.2 | 0.041 | |
| Prevalence | Orchik-1978 | 39 | 76 | 51.3 | ||||
| Orchik-1980 | 39 | 76 | 51.3 | |||||
| Nozza-1992 | 81 | 111 | 73.0 | |||||
| Nozza-1994 | 124 | 218 | 56.9 | |||||
| Total | 283 | 481 | 58.8 | 58.6 | (48.5, 68.6) | 14.9 | 0.002 | |
| Random Effect Estimate | Test of Heterogeneity | |||||||
|---|---|---|---|---|---|---|---|---|
| Measure | Author-Year | X | N | % | % | 95% CI | Q | P-Value |
| (A) Including both Nozza articles | ||||||||
| Sensitivity | Fiellau-Nikolajsen-1980 | 9 | 46 | 19.6 | ||||
| Nozza-1992 | 25 | 81 | 30.9 | |||||
| Nozza-1994 | 37 | 137 | 27.0 | |||||
| Barnett-1998 | 95 | 175 | 54.3 | |||||
| Total | 166 | 439 | 37.8 | 33.2 | (17.5, 48.9) | 38.1 | <0.001 | |
| Specificity | Fiellau-Nikolajsen-1980 | 42 | 42 | 100.0 | ||||
| Nozza-1992 | 29 | 30 | 96.7 | |||||
| Nozza-1994 | 109 | 112 | 97.3 | |||||
| Barnett-1998 | 104 | 124 | 83.9 | |||||
| Total | 284 | 308 | 92.2 | 95.0 | (88.5, 100) | 13.9 | 0.003 | |
| Prevalence | Fiellau-Nikolajsen-1980 | 46 | 88 | 52.3 | ||||
| Nozza-1992 | 81 | 111 | 73.0 | |||||
| Nozza-1994 | 137 | 249 | 55.0 | |||||
| Barnett-1998 | 175 | 299 | 58.5 | |||||
| Total | 439 | 747 | 58.8 | 59.7 | (51.8, 67.7) | 14.2 | 0.003 | |
| (B) Excluding Nozza-1992 study | ||||||||
| Sensitivity | Fiellau-Nikolajsen-1980 | 9 | 46 | 19.6 | ||||
| Nozza-1994 | 37 | 137 | 27.0 | |||||
| Barnett-1998 | 95 | 175 | 54.3 | |||||
| Total | 141 | 358 | 39.4 | 33.9 | (12.7, 55.0) | 36.9 | <0.001 | |
| Specificity | Fiellau-Nikolajsen-1980 | 42 | 42 | 100.0 | ||||
| Nozza-1994 | 109 | 112 | 97.3 | |||||
| Barnett-1998 | 104 | 124 | 83.9 | |||||
| Total | 255 | 278 | 91.7 | 94.1 | (83.9, 100) | 13.7 | 0.001 | |
| Prevalence | Fiellau-Nikolajsen-1980 | 46 | 88 | 52.3 | ||||
| Nozza-1994 | 137 | 249 | 55.0 | |||||
| Barnett-1998 | 175 | 299 | 58.5 | |||||
| Total | 358 | 636 | 56.3 | 56.3 | (52.5, 60.2) | 1.4 | 0.510 | |
| Random Effect Estimate | Test of Heterogeneity | |||||||
|---|---|---|---|---|---|---|---|---|
| Measure | Author-Year | X | N | % | % | 95% CI | Q | P-Value |
| (A) Including Both Nozza Studies | ||||||||
| Sensitivity | Fiellau-Nikolajsen-1980 | 21 | 46 | 45.7 | ||||
| Nozza-1992 | 74 | 81 | 91.4 | |||||
| Nozza-1994 | 63 | 137 | 46.0 | |||||
| Barnett-1998 | 110 | 175 | 62.9 | |||||
| Total | 268 | 439 | 61.0 | 61.8 | (39.0, 84.7) | 93.4 | <0.001 | |
| Specificity | Fiellau-Nikolajsen-1980 | 40 | 42 | 95.2 | ||||
| Nozza-1992 | 21 | 30 | 70.0 | |||||
| Nozza-1994 | 103 | 112 | 92.0 | |||||
| Barnett-1998 | 93 | 124 | 75.0 | |||||
| Total | 257 | 308 | 83.4 | 84.5 | (74.0. 95.0) | 23.0 | <0.001 | |
| Prevalence | Fiellau-Nikolajsen-1980 | 46 | 88 | 52.3 | ||||
| Nozza-1992 | 81 | 111 | 73.0 | |||||
| Nozza-1994 | 137 | 249 | 55.0 | |||||
| Barnett-1998 | 175 | 299 | 58.5 | |||||
| Total | 439 | 747 | 58.8 | 59.7 | (51.8, 67.7) | 14.2 | 0.003 | |
| (B) Excluding Nozza-1992 Study | ||||||||
| Sensitivity | Fiellau-Nikolajsen-1980 | 21 | 46 | 45.7 | ||||
| Nozza-1994 | 63 | 137 | 46.0 | |||||
| Barnett-1998 | 110 | 175 | 62.9 | |||||
| Total | 194 | 358 | 54.2 | 52.2 | (39.5, 64.8) | 10.7 | 0.005 | |
| Specificity | Fiellau-Nikolajsen-1980 | 40 | 42 | 95.2 | ||||
| Nozza-1994 | 103 | 112 | 92.0 | |||||
| Barnett-1998 | 93 | 124 | 75.0 | |||||
| Total | 236 | 278 | 84.9 | 87.7 | (76.8. 98.5) | 17.9 | <0.001 | |
| Prevalence | Fiellau-Nikolajsen-1980 | 46 | 88 | 52.3 | ||||
| Nozza-1994 | 137 | 249 | 55.0 | |||||
| Barnett-1998 | 175 | 299 | 58.5 | |||||
| Total | 358 | 636 | 56.3 | 56.3 | (52.5, 60.2) | 1.4 | 0.510 | |
| Random Effect Estimate | Test of Heterogenneity | |||||||
|---|---|---|---|---|---|---|---|---|
| Measure | Author-Year | X | N | % | % | 95% CI | Q | P-Value |
| (A) Including both Nozza articles | ||||||||
| Sensitivity | Orchik-1978 | 35 | 39 | 89.7 | ||||
| Fiellau-Nikolajsen-1980 | 16 | 46 | 34.8 | |||||
| Nozza-1992 | 59 | 81 | 72.8 | |||||
| Nozza-1994 | 96 | 137 | 70.1 | |||||
| Total | 206 | 303 | 68.0 | 67.4 | (49.2, 85.7) | 41.6 | <0.001 | |
| Specificity | Orchik-1978 | 15 | 37 | 40.5 | ||||
| Fiellau-Nikolajsen-1980 | 10 | 42 | 23.8 | |||||
| Nozza-1992 | 24 | 30 | 80.0 | |||||
| Nozza-1994 | 90 | 112 | 80.4 | |||||
| Total | 139 | 221 | 62.9 | 56.4 | (27.5, 85.3) | 69.4 | <0.001 | |
| Prevalence | Orchik-1978 | 39 | 76 | 51.3 | ||||
| Fiellau-Nikolajsen-1980 | 46 | 88 | 52.3 | |||||
| Nozza-1992 | 81 | 111 | 73.0 | |||||
| Nozza-1994 | 137 | 249 | 55.0 | |||||
| Total | 303 | 524 | 57.8 | 58.2 | (48.1, 68.3) | 15.9 | 0.001 | |
| (B) Excluding Nozza-1992 study | ||||||||
| Sensitivity | Orchik-1978 | 35 | 39 | 89.7 | ||||
| Fiellau-Nikolajsen-1980 | 16 | 46 | 34.8 | |||||
| Nozza-1994 | 96 | 137 | 70.1 | |||||
| Total | 147 | 222 | 66.2 | 65.4 | (39.1, 91.7) | 41.5 | <0.001 | |
| Specificity | Orchik-1978 | 15 | 37 | 40.5 | ||||
| Fiellau-Nikolajsen-1980 | 10 | 42 | 23.8 | |||||
| Nozza-1994 | 90 | 112 | 80.4 | |||||
| Total | 115 | 191 | 60.2 | 48.6 | (10.2, 87.0) | 64.7 | <0.001 | |
| Prevalence | Orchik-1978 | 39 | 76 | 51.3 | ||||
| Fiellau-Nikolajsen-1980 | 46 | 88 | 52.3 | |||||
| Nozza-1994 | 137 | 249 | 55.0 | |||||
| Total | 222 | 413 | 53.8 | 53.8 | (49.0, 58.6) | 0.4 | 0.811 | |
| Random Effect Estimate | Test of Heterogeneity | |||||||
|---|---|---|---|---|---|---|---|---|
| Measure | Author-Year | X | N | % | % | 95% CI | Q | P-Value |
| Sensitivity | Orchik-1978 | 46 | 84 | 54.8 | ||||
| Shaw-1978 | 48 | 49 | 98.0 | |||||
| Johnson-1980 | 55 | 74 | 74.3 | |||||
| Ben-David-1981 | 195 | 250 | 78.0 | |||||
| Kennedy-1982 | 44 | 51 | 86.3 | |||||
| Gersdorff-1986 | 61 | 81 | 75.3 | |||||
| Park-1988 | 182 | 248 | 73.4 | |||||
| Mitchell-1990 | 57 | 65 | 87.7 | |||||
| Toner-1990 | 107 | 124 | 86.3 | |||||
| Finitzo-1992 | 65 | 72 | 90.3 | |||||
| Vaughan-Jones-1992 | 91 | 135 | 67.4 | |||||
| Ovessen-1993 | 310 | 342 | 90.6 | |||||
| Sassen-1994 | 225 | 279 | 80.6 | |||||
| Tom-1994 | 100 | 153 | 65.4 | |||||
| Renvall-1996 | 87 | 101 | 86.1 | |||||
| Watters-1997 | 679 | 745 | 91.1 | |||||
| Total | 2352 | 2853 | 82.4 | 80.9 | (76.1, 85.7) | 196.6 | <0.001 | |
| Specificity | Orchik-1978 | 52 | 58 | 89.7 | ||||
| Shaw-1978 | 3 | 10 | 30.0 | |||||
| Johnson-1980 | 31 | 41 | 75.6 | |||||
| Ben-David-1981 | 39 | 61 | 63.9 | |||||
| Kennedy-1982 | 21 | 24 | 87.5 | |||||
| Gersdorff-1986 | 27 | 47 | 57.4 | |||||
| Park-1988 | 15 | 38 | 39.5 | |||||
| Mitchell-1990 | 10 | 19 | 52.6 | |||||
| Toner-1990 | 91 | 98 | 92.9 | |||||
| Finitzo-1992 | 19 | 22 | 86.4 | |||||
| Vaughan-Jones-1992 | 61 | 65 | 93.8 | |||||
| Ovessen-1993 | 37 | 51 | 72.5 | |||||
| Sassen-1994 | 59 | 101 | 58.4 | |||||
| Tom-1994 | 47 | 60 | 78.3 | |||||
| Renvall-1996 | 25 | 26 | 96.2 | |||||
| Watters-1997 | 166 | 210 | 79.0 | |||||
| Total | 703 | 931 | 75.5 | 74.5 | (66.9, 82.0) | 147.8 | <0.001 | |
| Prevalence | Orchik-1978 | 84 | 142 | 59.2 | ||||
| Shaw-1978 | 49 | 59 | 83.1 | |||||
| Johnson-1980 | 74 | 115 | 64.3 | |||||
| Ben-David-1981 | 250 | 311 | 80.4 | |||||
| Kennedy-1982 | 51 | 75 | 68.0 | |||||
| Gersdorff-1986 | 81 | 128 | 63.3 | |||||
| Park-1988 | 248 | 286 | 86.7 | |||||
| Mitchell-1990 | 65 | 84 | 77.4 | |||||
| Toner-1990 | 124 | 222 | 55.9 | |||||
| Finitzo-1992 | 72 | 94 | 76.6 | |||||
| Vaughan-Jones-1992 | 135 | 200 | 67.5 | |||||
| Ovessen-1993 | 342 | 393 | 87.0 | |||||
| Sassen-1994 | 279 | 380 | 73.4 | |||||
| Tom-1994 | 153 | 213 | 71.8 | |||||
| Renvall-1996 | 101 | 127 | 79.5 | |||||
| Watters-1997 | 745 | 955 | 78.0 | |||||
| Total | 2853 | 3784 | 75.4 | 73.6 | (69.1, 78.1) | 156.6 | <0.001 | |
| Random Effect Estimate | Test of Heterogeneity | |||||||
|---|---|---|---|---|---|---|---|---|
| Measure | Author-Year | X | N | % | % | 95% CI | Q | P-Value |
| Sensitivity | Orchik-1978 | 31 | 39 | 79.5 | ||||
| Fiellau-Nikolajsen-1980 | 42 | 46 | 91.3 | |||||
| Kennedy-1982 | 51 | 51 | 100.0 | |||||
| Vaughan-Jones-1992 | 120 | 135 | 88.9 | |||||
| Ovessen-1993 | 323 | 342 | 94.4 | |||||
| Sassen-1994 | 253 | 279 | 90.7 | |||||
| Total | 820 | 892 | 91.9 | 93.8 | (91.1, 96.4) | 9.44 | 0.093 | |
| Specificity | Orchik-1978 | 32 | 37 | 86.5 | ||||
| Fiellau-Nikolajsen-1980 | 37 | 42 | 88.1 | |||||
| Kennedy-1982 | 11 | 24 | 45.8 | |||||
| Vaughan-Jones-1992 | 41 | 65 | 63.1 | |||||
| Ovessen-1993 | 27 | 51 | 52.9 | |||||
| Sassen-1994 | 33 | 101 | 32.7 | |||||
| Total | 181 | 320 | 56.6 | 61.8 | (41.5, 82.1) | 89.8 | <0.001 | |
| Prevalence | Orchik-1978 | 39 | 76 | 51.3 | ||||
| Fiellau-Nikolajsen-1980 | 46 | 88 | 52.3 | |||||
| Kennedy-1982 | 51 | 75 | 68.0 | |||||
| Vaughan-Jones-1992 | 135 | 200 | 67.5 | |||||
| Ovessen-1993 | 342 | 393 | 87.0 | |||||
| Sassen-1994 | 279 | 380 | 73.4 | |||||
| Total | 892 | 1212 | 73.6 | 67.3 | (56.3, 78.2) | 90.1 | <0.001 | |
| Random Effect Estimate | Test of Heterogeneity | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Measure | ID | Diagnostic Comparison versus Myringotomy | Number Articles | Number Cases | Measure % | % | 95% CI | Q | P-Value |
| Sensitivity | 1 | Acoustic reflectometry (>=5 vs <5) | 3 | 308 | 63.6 | 64.2 | (57.0, 71.5) | 3.6 | 0.168 |
| 2 | Pneumatic otoscopy | 7 | 1732 | 94.3 | 93.8 | (91.4, 96.3) | 28.8 | <0.001 | |
| 3 | Portable tympanometry | 6 | 774 | 89.4 | 84.5 | (76.0, 93.1) | 39.6 | <0.001 | |
| 5 | Professional tympanometry (using static compensated acoustic admittance at 0.1) | 3 | 358 | 39.4 | 33.9 | (12.7, 55.0) | 36.9 | <0.001 | |
| 6 | Professional tympanometry (using static compensated acoustic admittance at 0.2) | 3 | 359 | 54.2 | 52.2 | (39.5, 64.8) | 10.7 | 0.005 | |
| 7 | Professional tympanometry (using static compensated acoustic admittance at 0.3) | 3 | 222 | 66.2 | 65.4 | (39.1, 91.7) | 41.5 | <0.001 | |
| 8 | Professional tympanometry (using flat or B curve as abnormal) | 16 | 2853 | 82.4 | 80.9 | (76.1, 85.7) | 196.6 | <0.001 | |
| 9 | Professional tympanometry (using flat or B or C2 curve as abnormal) | 6 | 892 | 91.9 | 93.8 | (91.1, 96.4) | 9.4 | 0.093 | |
| Specificity | 1 | Acoustic reflectometry (>=5 vs <5) | 3 | 212 | 79.7 | 80.4 | (65.0, 95.9) | 18.4 | <0.001 |
| 2 | Pneumatic otoscopy | 7 | 962 | 80.7 | 80.5 | (75.1, 86.0) | 27.2 | <0.001 | |
| 3 | Portable tympanometry | 6 | 506 | 67.8 | 64.4 | (44.3, 84.4) | 167.1 | <0.001 | |
| 5 | Professional tympanometry (using static compensated acoustic admittance at 0.1) | 3 | 278 | 91.7 | 94.1 | (83.9, 100) | 13.7 | 0.001 | |
| 6 | Professional tympanometry (using static compensated acoustic admittance at 0.2) | 3 | 278 | 84.9 | 87.7 | (76.8, 98.5) | 17.9 | <0.001 | |
| 7 | Professional tympanometry (using static compensated acoustic admittance at 0.3) | 3 | 191 | 60.2 | 48.6 | (10.2, 87.0) | 64.7 | <0.001 | |
| 8 | Professional tympanometry (using flat or B curve as abnormal) | 16 | 931 | 75.5 | 74.5 | (66.9, 82.0) | 147.8 | <0.001 | |
| 9 | Professional tympanometry (using flat or B or C2 curve as abnormal) | 6 | 320 | 56.6 | 61.8 | (41.5, 82.1) | 89.8 | <0.001 | |
| Prevalence | 1 | Acoustic reflectometry (>=5 vs <5) | 3 | 520 | 59.2 | 59.6 | (52.5, 66.7) | 5.4 | 0.067 |
| 2 | Pneumatic otoscopy | 7 | 2694 | 64.3 | 62.8 | (58.3, 67.2) | 30.7 | <0.001 | |
| 3 | Portable tympanometry | 6 | 1280 | 60.5 | 58.5 | (40.3, 76.7) | 268.9 | <0.001 | |
| 5 | Professional tympanometry (using static compensated acoustic admittance at 0.1) | 3 | 636 | 56.3 | 56.3 | (52.5, 60.2) | 1.4 | 0.510 | |
| 6 | Professional tympanometry (using static compensated acoustic admittance at 0.2) | 3 | 636 | 56.3 | 56.3 | (52.5, 60.2) | 1.4 | 0.510 | |
| 7 | Professional tympanometry (using static compensated acoustic admittance at 0.3) | 3 | 413 | 53.8 | 53.8 | (49.0, 58.6) | 0.4 | 0.811 | |
| 8 | Professional tympanometry (using flat or B curve as abnormal) | 16 | 3784 | 75.4 | 73.6 | (69.1, 78.1) | 156.6 | <0.001 | |
| 9 | Professional tympanometry (using flat or B or C2 curve as abnormal) | 6 | 1212 | 73.6 | 67.3 | (56.3, 78.2) | 90.1 | <0.001 | |
Note: Comparison #4 had only two studies and thus not included in the summary
Among the nine diagnostic methods, pneumatic otoscopy and professional tympanometry (using flat or B or C2 curve as abnormal) had the highest sensitivity at 93.8 percent (95% CI: 91.4, 96.3) and 93.8 percent (95% CI: 91.1, 96.4) compared with myringotomy, respectively. The diagnostic test with the highest specificity was professional tympanometry (using static compensated acoustic admittance at 0.1) at 95.0 percent (95% CI: 88.5, 100).
If we consider both sensitivity and (1 minus specificity) in the receiver-operator characteristic display in Figure 5
| Diagnostic Comparison | Author-Year | Study Qualitya | Test Performer |
|---|---|---|---|
| 1. Acoustic reflectometry (>=5 vs <5 RU) with myringotomy | Fried-1985 | 1(100000) | Not specified |
| Macknin-1987 | 4(111001) | Pediatrician | |
| Babonis-1991 | 5(111011) | One of the authors, specialty not specified | |
| 2. Pneumatic otoscopy with myringotomy | Paradise-1976 | 4(111001) | Pediatrician |
| Cantekin-1977 | 1(100000) | Not specified | |
| Bluestone-1979 | 2(100001) | Not specified | |
| Karma-1989 | 3(100101) | Otolaryngologist/pediatrician | |
| Mains-1989 | 1(100000) | Senior registrar and senior house officer | |
| Toner-1990 | 2(100001) | One of the authors, specialty not specified | |
| Finitzo-1992 | 4(111001) | Pediatric otolaryngologist | |
| 3. Portable tympanometry (mixed criteria) with myringotomy | Orchik-1978 | 4(111001) | Not specified |
| Babonis-1991 | 5(111011) | One of the authors, specialty not specified | |
| Rees-1992 | 1(100000) | Not specified | |
| Vaughan-Jones-1992 | 2(100001) | Not specified | |
| van Balen-1994 | 4(111010) | General practitioner with special training from ENT | |
| Koivunen-1997 | 3(110001) | department Trained nurse | |
| 4. Professional tympanometer (using acoustic reflex at 500 or 1000 Hz) with myringotomy | Orchik-1978 | 4(111001) | Not specified |
| Orchik-1980 | 4(111001) | Not specified | |
| Nozza-1992 | 4(111001) | Audiologist/nurses | |
| Nozza-1994 | 4(111001) | Clinically certified and licensed audiologist | |
| 5. Professional tympanometer (using static compensated acoustic admittance at 0.1) with myringotomy | Fiellau-Nikolajsen-1980 | 4(111001) | Author |
| Nozza-1992 | 4(111001) | Audiologist/nurses | |
| Nozza-1994 | 4(111001) | Clinically certified and licensed audiologist | |
| Barnett-1998 | 4(111001) | Research assistant | |
| 6. Professional tympanometer (using static compensated acoustic admittance at 0.2) with myringotomy | Fiellau-Nikolajsen-1980 | 4(111001) | Author |
| Nozza-1992 | 4(111001) | Audiologist/nurses | |
| Nozza-1994 | 4(111001) | Clinically certified and licensed audiologist | |
| Barnett-1998 | 4(111001) | Research assistant | |
| 7. Professional tympanometer (using static compensated acoustic admittance at 0.3) with myringotomy | Orchik-1978 | 4(111001) | Not specified |
| Fiellau-Nikolajsen-1980 | 4(111001) | Author | |
| Nozza-1992 | 4(111001) | Audiologist/nurses | |
| Nozza-1994 | 4(111001) | Clinically certified and licensed audiologist | |
| 8. Professional tympanometer (using flat or B curve as abnormal) with myringotomy | Orchik-1978 | 4(111001) | Not specified |
| Shaw-1978 | 1(100000) | Not specified | |
| Johnson-1980 | 2(110000) | Audiologist | |
| Ben-David-1981 | 1(100000) | Not specified | |
| Kennedy-1982 | 1(100000) | Audiologist | |
| Gersdorff-1986 | 1(100000) | Not specified | |
| Park-1988 | 1(100000) | Not specified | |
| Mitchell-1990 | 1(100000) | Not specified | |
| Toner-1990 | 2(100001) | Not specified | |
| Finitzo-1992 | 4(111001) | Certified audiologist | |
| Vaughan-Jones-1992 | 2(100001) | Not specified | |
| Ovessen-1993 | 4(111001) | Otolaryngologist | |
| Sassen-1994 | 3(111000) | Not specified | |
| Tom-1994 | 4(111001) | Certified audiologists | |
| Renvall-1996 | 1(100000) | Not specified | |
| Watters-1997 | 2(110000) | Paediatric audiologist | |
| 9. Professional tympanometer (using flat or B or C2 curve as abnormal) with myringotomy | Orchik-1978 | 4(111001) | Not specified |
| Fiellau-Nikolajsen-1980 | 4(111001) | Author | |
| Kennedy-1982 | 1(100000) | Audiologist | |
| Vaughan-Jones-1992 | 2(100001) | Not specified | |
| Ovessen-1993 | 4(111001) | Otolaryngologist | |
| Sassen-1994 | 3(111000) | Not specified | |
The first number is the total score of the six components. The six components of study quality score are: (1) appropriate reference standard; (2) test and reference standard assessed independently of each other; (3) blinded reading of results; (4) patient sample included an appropriate spectrum as in clinical practice; (5) reproducibility and interpretation of test results determined; and (6) description of test method sufficient to permit replication. ‘1’ indicates presence and ‘0’ indicates absence of the criterion.]
The meta-analyses revealed that pneumatic otoscopy and professional tympanometry had the highest sensitivity compared with myringotomy. While the diagnostic test with the highest specificity was professional tympanometry (using static compensated acoustic admittance at 0.1), pneumatic otoscopy optimized both sensitivity and specificity. However, the poor quality of many of the studies included in the analysis must be considered. Moreover, most studies failed to provide enough information to assess the qualifications of testers.
Despite the recent publication of the 1994 Guidelines on Otitis Media with Effusion in Children (Stool, Berg, Berman, et al., 1994), controversy persists over the management and treatment of otitis media with effusion (OME). The present evidence report systematically reviews the recent literature and provides an update to the 1994 guideline on the diagnosis and the late effects of OME on speech, language, and hearing. In addition, while the 1994 OME guideline (Stool, Berg, Berman et al., 1994) did not formally assess OME resolution rates, the present report reviews the natural history of OME. Although the technical expert panel proposed 20 potential key questions for this report, our time constraints allowed us to address only the four key questions judged by the panel to be most pressing and to have the most significant body of recent literature. We did not address any key questions on treatment or management. Of a total of 449 articles originally identified for this report, only a small percentage qualified for inclusion in the synthesis. Thus, the number of meta-analyses performed was a fraction of those we had hoped would inform this report. Our conclusions should be read with this perspective in mind.
Based on the available evidence, this report assessed the natural history of OME in terms of resolution rates. The availability of relevant studies limited our analysis to an assessment of natural history at two periods of followup and only for children 3 years of age and older; we were unable to perform meta-analysis for the under-3 age group. For children older than 3 years of age, we were able to conduct two sets of meta-analyses on the resolution of otitis media with effusion at two followup intervals. These sets were matched by unit of analysis, age group, and OME type and, when possible, diagnostic method.
The first set of meta-analyses assessed resolution at 6 weeks followup. In children older than 3 years of age with OME of unknown duration at the onset of the study, 37.2 or 42.3 percent of ears with OME were free of OME at 6 weeks followup, without regard to their interval OME status, depending on the tympanometric diagnostic criteria for OME resolution. Spontaneous resolution rates were significantly different among the cohorts included in the first set of meta-analyses.
The second set of meta-analyses assessed resolution at 3 months followup. Over a period of 3 months, in studies with cumulative resolution rates, and in children older than 3 years of age with OME of unknown duration, 22.5 or 42.7 percent of ears with OME resolve, depending on the tympanometric diagnostic criteria for OME resolution. A disadvantage of using ears as the unit of analysis is the clinical interpretation of the resolution rate: the distribution of OME resolution using ears cannot be assumed to be the same when children are the unit of analysis. Spontaneous resolution rates were not significantly different among the cohorts included in the second set of meta-analyses.
Apart from age, tympanometric criteria for OME diagnosis, and criteria for resolution on followup, which were reported in the studies, we did not have sufficient information from the studies to consider other influential factors. It was also not possible to know with certainty if the children in the various cohorts presented at identical stages in the course of OME. It is unclear why the non-cumulative, 6-week resolution rate estimate should overlap the cumulative, 3-month resolution rate estimate to such an extent. The non-cumulative resolution rate would be expected to include children whose OME had resolved and then re-accumulated middle-ear effusions (which would make the non-cumulative rate greater than the cumulative rate). Therefore, we view these estimates of OME resolution with great caution.
In the Results section, we also described isolated studies of daily followup, resolution of OME of at least 3 months duration at study onset, and a study that derived OME resolution rate equations. The findings of these studies were too limited to draw any broad conclusions. Similarly, a few studies assessed the effects of influencing factors on OME resolution, but, again, we hesitate to draw any generalizations based on such limited evidence.
Literature on the natural history of OME was difficult to interpret for several reasons. These reasons include its generally poor quality of the research, the lack of control for therapeutic interventions, the inability to distinguish persistent as opposed to recurrent OME due to the length of followup, and the varied criteria for continued followup from examination to examination. We recognize that the meaning of quality summary scores, as measured in this analysis by scoring of documentation in the published articles, can be ambiguous (Jüni, Witschi, Bloch et al., 1999). Nevertheless, whether as a summary score or considering individual quality domains, the quality of the twenty-eight cohort studies on natural history was poor. Twenty-four of twenty-seven of these studies had a quality score of three or less on a scale of one (lowest) to six (highest), even though our preselection process guaranteed all cohort studies a minimum score of one. Half of the studies that attempted to assess the natural history of OME did not control for or document control of interventions, either medical or surgical, that might affect OME outcome during the study period. The majority of these investigations did not stratify findings by intervention status. In addition, the intervals between examinations for OME in these studies varied from one day to 3 years; and, it was impossible to assess whether the “continuing” presence of OME, especially after a long interval, represented persistent or recurrent OME. Furthermore, the criteria for followup varied among studies. Most studies continued followup for the whole study period regardless of OME status at a particular exam, but four cohorts terminated followup of individuals who had type A or normal tympanograms at any exam.
Differing definitions of OME resolution and diagnostic methods also make comparison difficult. As noted in the Results chapter, a difference in the definition of OME resolution between tympanogram type B or C transition to A and tympanogram type B transition to A resulted in a three-fold difference in the estimate of OME resolution in one case. As can be seen from our assessment of the operating characteristics of various OME diagnostic methods in the response to Key Question 4 in the Results, the apparent OME resolution rate could be greatly influenced by the sensitivity and specificity of a test.
A paucity of research studies made it impossible to address several other issues. These included the use of the child or the episode as the unit of analysis. Furthermore, we could not assess OME resolution in younger children or the resolution of OME that was not newly diagnosed and of unknown duration.
The literature was lacking in evidence addressing potential factors that might influence OME resolution. A few studies analyzed the potential influence on OME resolution of factors such as gender, at home care versus daycare, season of onset, side of affected ear, race/ethnicity, or diagnostic instrument. Because of the paucity of such studies, quantitative synthesis was not possible. Such information is also necessary to allow generalizations to specific clinical situations.
Similar to the results of Rosenfeld (1999a), our pooled OME resolution rates could be stratified by age and by tympanometric criteria for OME. Although Rosenfeld did not specify his tympanometric criteria, his analysis found a one-month OME resolution rate of 52 percent (95% CI: 47%, 58%) for OME of unspecified duration in cohort studies with ears as the unit of analysis, regardless of age. This finding is similar to our observation of 42.3 percent resolution (95% CI: 24.1%, 60.6%) for children older than 3 years of age. This finding was not based on cumulative resolution rates but on the disappearance of OME at various points in time in children who had OME at the start of the observation period. Further, Rosenfeld (1999a) found a two-to 3-month OME resolution rate of 63 percent (95% CI: 60%, 63%) for OME of unspecified duration in cohort studies with ears as the unit of analysis, regardless of age. This rate is somewhat higher than our estimate of 42.7 percent (95% CI: 29.3%, 56.1%), which was based on studies that measured cumulative resolution.
We restricted our analysis to prospective cohort studies to provide better assessments of the natural history of OME in the general population of children. Nevertheless, we recognize that information on the natural history of OME may also be gleaned from analysis of the results of randomized, controlled trials of treatment of OME, as Rosenfeld (1999a) subsequently reported. Such estimates would be more applicable than would cohort studies on the general population of children to those who, for whatever reason, are being followed closely for their OME by their health provider. Children being screened for OME in the general population might not necessarily have presented to the health care system for evaluation. As a result, these children might represent a less severe class of OME or at least a different population from those children who were already identified with and being followed for OME and who were deemed potential candidates for therapeutic intervention. Rosenfeld (1999a) reported OME resolution rates of 12 percent (95% CI: 8%, 16%), 23 percent (95% CI: 21%, 26%), and 24 percent (95% CI: 17%, 32%), at two weeks, one month, and one to 3 months followup, respectively, in children with OME that had lasted weeks or months and were randomized to the placebo or no-drug arms of randomized controlled trials. In addition, he assessed OME resolution rates of children with OME that lasted 3 months or longer who were surgical candidates for tympanostomy tubes. He found the following OME resolution rates at 6 months, 1 year, 2 years, 3 years, and 4 years for children assigned to the no intervention groups: 27 percent (95% CI: 20%, 35%), 32 percent (95% CI: 24%, 40%), 31 percent (95% CI: 25%, 37%), 49 percent (95% CI: 37%, 62%), and 59 percent (95% CI: 46%, 72%). A strength of these studies is that the diagnosis of OME tended to rely on pneumatic otoscopy as well as tympanometry, rather than on tympanometry alone. A major difficulty of these studies, as with the cohort studies, is the inability to account for interventions received from other providers and beyond the control of the investigators. In fact, one of these investigators mentioned that “Antibiotic treatment was not controlled for in this study” (Maw and Bawden, 1994). Another difficulty, as with the cohort studies, is the inability to distinguish persistent from recurrent OME, due to the long intervals between followup assessments.
Finally, the publication of multiple articles based on the study of a single cohort raises a complex issue. If multiple studies from one cohort were included once in a meta-analysis, the results would be unbiased. However, if findings from a single cohort were included more than once in a meta-analysis, then bias would exist.
Our intent was to examine the influence of long and short-duration OME that results in hearing loss on delays in speech and language development and the ability of other factors to modify the effect of OME. However, we found few studies that addressed these questions. Only 20 of 112 studies (19%) (on 12 cohorts) met our criteria for consideration for this question. These studies further suffered from lack of uniformity with respect to risk factors studied, type of outcome measured, method of measurement, unit of measurement, and age at outcome determination.
The generalizability of our findings also is questionable, since nine of the twelve cohorts primarily included children from specific ethnic/racial groups or particular socioeconomic groups. Five of the six studies included in the three meta-analyses were based on such cohorts. However, we excluded studies on children with specific medical conditions such as craniofacial defects, primary mucosal disorders, immunodeficiencies, and genetic disorders. The available literature did not address the effect of pre-existing speech, language, or other developmental disorders, and only one study focussed on the effect of bilateral persistent middle-ear effusion.
Based on our criteria, only nine of the twenty studies (45 percent) were of acceptable quality. These studies scored five points or more on an eight-point scale.
Our analysis was limited to assessing whether early-onset OME resulted in delays in expressive and receptive language development and cognitive verbal intelligence. For children older than 3 years of age who had a positive history of otitis media during the first 3 years of life, our meta-analyses on long-term expressive language (three studies), receptive language (four studies), and cognitive verbal intelligence (three studies) showed no effect of early otitis media. The 1994 OME guideline (Stool, Berg, Berman et al., 1994) had determined that meta-analysis of the 14 “adequate” studies was not possible, because of the wide variety of measurement tools used for outcome assessment as well as a lack of standardization of data reporting. We were able to identify articles that had appeared since the release of the 1994 OME guidelines.
Nevertheless, our findings on the possible effects of early-life OM on speech and language development are in general agreement with the conclusion of the 1994 OME guideline (Stool, Berg, Berman et al., 1994). They concluded that “rigorous, methodologically sound research does not adequately support or refute the theory that untreated otitis media with effusion results in speech/language delays or deficits.” Our findings were also in agreement with those noted in the 1994 OME guideline that “Conflicting findings among studies can be accounted for in several ways: limitations in the research designs, lack of uniformity of test instrument selection, lack of definition of hearing status, and interactions between otitis media with effusion and other risk factors,” as well as differences in populations studied. Thus, we caution clinicians not to generalize these findings to children with the underlying chronic medical conditions that were excluded from this study or to those with pre-existing developmental disorders. In addition, generalization to children with persistent bilateral OM may not be valid, since only one of the studies specifically assessed bilateral as opposed to unilateral or bilateral otitis media.
Several ongoing, prospective studies are assessing the effect of early otitis media on long-term speech and language development of children older than 3 years. However, the results of these studies have not yet been published (Paradise, Dollaghan, Campbell et al., 2000;Feldman, Dollaghan, Campbell et al., 1999;Paradise, Rockette, Colborn et al., 1997;Roberts, Burchinal, Zeisel et al., 1998;Roberts, Burchinal, Jackson et al., 2000). These studies will provide further data on which to base an assessment of the effect of OM on speech and language development.
Although the immediate effect of OM on the conductive aspects of hearing is well recognized, the long-term effect of early OM on sensorineural hearing has not been well established (Madell, 1999). Our findings indicate that early OM may have an effect on long-term hearing.
Of the eight cohort studies we analyzed, four reported percent of hearing loss at 6 to 10 years of age. Children with early-life OM have a 2.6 times higher risk of hearing loss at 6 to 10 years of age than do children with no early history of OM with an estimated rate difference of 11 percent. The rate difference and the risk ratio were not significantly different among the cohorts. If early OM does indeed lead to long-term hearing deficits, the clinical implications are significant. Depending on the degree of hearing loss, the hypothesized effect of OM on speech and language development as mediated through hearing loss may be of greater duration than would be explained by the intermittent and transient episodes of conductive hearing loss associated with OM. In addition, the question of the effect of OME treatment on long-term hearing gains greater relevance.
The literature available for assessment of the long-term effects of early-life OM on hearing is both limited and of poor quality. Only four percent (eight of the 186) of the studies that addressed OM and hearing qualified for inclusion in our analysis, and only half of these were of acceptable quality. The evidence on the effects of early OM on long-term hearing also suffered from the same methodological issues as the evidence on the effects of early-life OM on long-term speech and language development. Because of the limited nature of this evidence, and because the rate of intervention depends greatly on the threshold hearing level adopted, the findings of our analysis should be used with caution.
Previous assessments of diagnostic techniques for OME have recommended several different techniques. The 1994 OME Guidelines recommended pneumatic otoscopy (Stool, Berg, Berman, et al., 1994).
Using 52 diagnostic studies, we were able to evaluate the diagnostic accuracy of the following eight methods: acoustic reflectometry at ≤5 or >5 RU; pneumatic otoscopy; portable tympanometry; professional tympanometry using static compensated acoustic admittance at 0.1, 0.2, and 0.3; professional tympanometry using B curve as abnormal; and professional tympanometry using B or C2 curves as abnormal. All comparisons used myringotomy as the reference standard.
Among the eight diagnostic methods, the receiver-operator characteristic points (plotting sensitivity against [1 minus specificity]) for pneumatic otoscopy were closest to the optimal point of 100% sensitivity and 100% specificity. The pooled sensitivity for pneumatic otoscopy was 94 percent (95% CI: 91%, 96%); the pooled specificity was 80 percent (95% CI: 75%, 86%); the positive predictive value was 89 percent (95% CI: 87%, 92%); the negative predictive value was 89 percent (95% CI: 86%, 93%), and the accuracy was 89 percent (95% CI: 87%, 91%). These findings were based on 2,694 children from seven studies that reported a pooled prevalence rate for OME of 63 percent (95% CI: 58%, 67%). The prevalence rates among the studies ranged from 56 percent to 71 percent, which was a significant variation (p<0.001). The diagnostic test with the highest specificity was professional tympanometry (using static compensated acoustic admittance at 0.1) at 95.0 percent (95% CI: 88.5, 100).
Our findings are in general agreement with the recommendations and findings of the 1994 OME guideline (Stool, Berg, Berman et al., 1994). The 1994 OME guideline stated that “The diagnostic evaluation of suspected otitis media with effusion should include pneumatic otoscopy. Otoscopy alone (without the use of the pneumatic otoscope to test tympanic membrane mobility) is not recommended.” This recommendation was deemed a strong one based on limited scientific evidence and strong Panel consensus. Based on limited scientific evidence and expert opinion, the 1994 OME guideline also allowed the following option: “Tympanometry may be used as a confirmatory test for otitis media with effusion.” The OME Guidelines did not include quantitative syntheses of the evidence.
Our analysis considered several references not cited by the 1994 OME guideline; some were accepted, and some were rejected. The results of our meta-analyses confirm that pneumatic otoscopy had the best operating characteristics among the nine alternatives examined. Our findings also confirm that certain, but not all, categories of tympanometry also perform well in identifying middle-ear effusion in OME as well as in distinguishing it from other entities. While the 1994 OME guideline did not make a recommendation regarding acoustic reflectometry, our findings suggest that acoustic reflectometry does not perform as well as pneumatic otoscopy and certain types of tympanometry. The use of the spectral gradient angle, as the unit of measurement, may improve the sensitivity of acoustic reflectometry compared to the use of reflectivity, but this observation is based on a single study that found a sensitivity of 95.4 percent using a threshold of 95 degrees (Barnett, 1998) (compared to the pooled sensitivity of 64.2 percent for a threshold reflectivity of 5 found in this study. However, the specificity was 31.5 percent when a spectral gradient angle of 95 degrees was used as the threshold, compared to the pooled specificity of 80.4 percent with a threshold reflectivity of 5 in this study. Unlike the 1994 OME guideline, which commented on the combination of tympanometry and pneumatic otoscopy, we did not assess combinational diagnostic methods or algorithms. OME guideline, which commented on the combination of tympanometry and pneumatic otoscopy, we did not assess combinational diagnostic methods or algorithms.
The finding that pneumatic otoscopy can do as well as or better than tympanometry and acoustic reflectometry has significant practical implications. For the typical clinician, pneumatic otoscopy should be easier to employ than other diagnostic methods. The important question may be what degree of training will be needed for the clinician to be as effective with pneumatic otoscopy as were the examiners in the studies reviewed in this report. Also, while we did not do a cost-effectiveness analysis, the cost of pneumatic otoscopy, in terms of direct and indirect costs, would appear to be less than that for tympanometry or acoustic reflectometry.
Because of inadequate evidence, we could not conduct evaluations of clinical signs and/or symptoms, air and/or bone threshold audiometry, binocular micro-tympanoscopy, and non-pneumatic otoscopy in the diagnosis of OME. In addition, diagnostic methods that use algorithms or aggregated scorings are important but were not included in the scope of this evidence assessment. The sources of variation of such combinational methods are difficult to detect in published articles. In addition, we must emphasize that we assessed the diagnosis of OME middle-ear effusion at single points in time rather than the diagnosis of persistent or recurrent OME over time. The meta-analyses of diagnostic tools raised several methodological concerns. One concern centered on pooling data from studies of diverse populations. The differences in OME prevalence among the studies point to one aspect of this diversity. In addition, we were concerned about the different models of instruments within the broad categories of diagnostic methods. We assume instrumentation has improved or at least changed over time in the areas of tympanometry and acoustic reflectometry as well as in the other diagnostic methods that could not be assessed in depth.
In addition, as alluded to above, a definitive assessment of diagnostic methods for the diagnosis of middle-ear effusion in OME would require an assessment of the cost-effectiveness of the different diagnostic procedures. Such an assessment could not be incorporated into our analysis due to a limited time frame. Cost-effectiveness analysis would take into account the specific impact of test results, including false positives and false negatives, which will depend on examiner proficiency and the patient preferences for specific outcomes. Cost-effectiveness analysis would also establish optimal operating points or thresholds for diagnostic methods measured on both ordinal and continuous scales (Sox, Stern, Owen et al, 1989).
Several limitations of the study applied to the evidence analysis as a whole. These included the issues of the selection of key questions, publication language as an exclusion factor, study quality, and the analysis of influencing factors. This section addresses these issues. Limitations that pertained to individual questions are discussed in the Results section.
Several peer reviewers commented on the absence of OME treatment as a topic of inquiry of this study. The key questions were selected using a standard consensus methodology, as described in the Methods section. The technical experts were asked to consider various factors when selecting questions. One of these factors was the importance of the question (as assessed by the potential impact on OME outcomes and on development of future OME guidelines). Another consideration was feasibility (as assessed by the ability to complete the study in 6 months and the availability of sufficient information or new information, if a systematic review had already been done in the past). Therefore, treatment questions were not arbitrarily included or avoided; instead, adherence to our key question selection method led to the questions that were included. Whether the methodology for selecting key questions needs to be reconsidered in future systematic reviews is a valid question. In addition, whether the composition of a particular technical expert panel might influence the selection of key questions is also a valid question for future systematic reviews.
| Country of Origin | All Questions (n=112) | Question 1: Natural History (n=33) | Question 2: Speech and Language (n=20) | Question 3: Hearing (n=7) | Question 4: Diagnostic Methods (n=52) |
|---|---|---|---|---|---|
| Belgium | 2 | 2 | |||
| Canada | 1 | 1 | |||
| Denmark | 18 | 14 | 4 | ||
| England | 9 | 3 | 6 | ||
| Finland | 4 | 1 | 3 | ||
| France | 1 | 1 | |||
| Israel | 2 | 1 | 1 | ||
| Italy | 1 | 1 | |||
| Korea | 1 | 1 | |||
| Kuwait | 1 | 1 | |||
| Northern Ireland | 2 | 2 | |||
| Scotland | 2 | 1 | 1 | ||
| Sweden | 6 | 3 | 1 | 2 | |
| The Netherlands | 4 | 2 | 2 | ||
| Turkey | 1 | 1 | |||
| USA | 53 | 6 | 19 | 4 | 24 |
| not specified | 4 | 4 | |||
| Non-USA | 55 (51%) | 27 (82%) | 1 (5%) | 3 (43%) | 24 (50%) |
| USA | 53 (49%) | 6 (18%) | 19 (95%) | 4 (57%) | 24 (50%) |
For two reasons, no prospective cohort studies were excluded based on study quality. First, we decided that if sufficient numbers of studies were available, we would study the variability of findings by either stratified or sensitivity analyses. Secondly, we recognized the potential problems with summary quality scores (Jüni, Witschi, Bloch et al., 1999). For example, we recognized that a study may have adhered to a high quality of study design but not have documented that design in the article. The abstracted data may be viewed along with our quality score, based on the design described in the article, in the evidence tables.
As we mentioned in the Results section, we were unable to conduct in-depth analysis by influencing factors for any of our key questions, due to the constraints of the available evidence. In particular, our assessment of the effect of early-life otitis media on long-term speech, language, and hearing was limited to an assessment of the effect of OME duration. The available evidence was not sufficient to allow us to address the second part of the question, namely, the influence of other risk factors, using standard analytic techniques within the resources and timeframe of this evidence analysis. The technical expert panel had listed many demographic (including SES), environmental, and clinical factors that might either act independently or interact with OME to affect speech and language. To address this question, a meta-regression approach would be required to identify the risk factors that contribute significantly to speech, language, and hearing delays in the context of otitis media and could include both comparative and single cohort studies. However, many issues must be addressed in order to set up data for meta-regression analysis appropriately. Such issues would require a great deal more input from technical experts than was possible with the resources available in this study.
The need for future research on otitis media with effusion (OME) and otitis media (OM) can be substantiated only by the demonstration of a negative effect of OM on important outcomes such as speech, language, and hearing. The available data do not provide such evidence in the case of speech and language and, suggest, at most, a possible effect of early-life OM on long-term hearing, based on evidence that may not be generalizable. Thus, future research must still establish the effect of early-life OM on speech, language, and hearing. Such future research will benefit by addressing the following general issues, which affect study quality and outcomes assessment. The definitions of OME and OM and of relevant interventions, influencing factors, and outcomes should be standardized.
A common, testable framework, with flexibility for competing hypotheses, that links predisposing factors to OME and OM and then OME or OM to outcomes such as long-term speech and language development and hearing, should be adopted. This framework should include hypotheses on the role of child characteristics, environmental and social influences such as socioeconomic status, and medical factors such as interventions, on outcomes.
Agreement on appropriate follow-up intervals to provide valid estimates of duration or frequency of OME and OM is needed to help in comparing results from different studies.
Additional areas where future research should focus include potential gaps in practice and newer outcome measures. Such outcome measures include general health status and quality issues such as satisfaction with treatment.
Future research on the natural history of OME must focus on improvement of study quality and establishing the effect of OME on long-term outcomes such as speech, language, and hearing. In particular, control of therapeutic intervention during the study and the distinction between OME persistence and recurrence need to be addressed. Considering the difficulties of conducting a natural history study on OME, a less restrictive definition of nonintervention might be considered. Even with a less restrictive definition of nonintervention, studies should consider presenting data that has been stratified by the level of intervention each child receives during the study period. For example, if a child is allowed to have antibiotics, the exact circumstances when antibiotics may be given should be determined a priori, the number of episodes of such antibiotic administration should be noted, and the outcome measures reported should be stratified by intervention level. However, with a less restrictive definition of nonintervention, the researcher runs the risk of conducting a study with little meaning or applicability to natural history.
Researchers and clinicians should agree upon standard procedures for follow-up, including intervals of follow-up, so that resolution rates are indeed comparable. As the study by Moller and Tos (1990) demonstrated, even daily exams did not necessarily lead to a greater distinction between persistence of OME and recurrence; thus, we do not expect this to be an easy issue to resolve. The issue of assessment of OME duration or recurrence is as important as the issue of diagnosis of OME at a single point in time.
Researchers need to agree upon a definition of OME resolution and the diagnostic methods with which to make that determination. The latter will require further research into the operating characteristics of OME diagnostic methods, as we have reported in this evidence report. Further enhancements in diagnostic methods must also be achieved. Although perhaps more relevant to studies on outcomes other than resolution, more research is needed that views the child as the unit of analysis, since the actual outcomes of concern, such as speech, language, and hearing, are functional requirements of a child, not an ear.
Further research on the role of influencing factors, such as socioeconomic status, on the natural history of OME may help the clinician in a particular setting make a better decision when assessing a child with particular characteristics. Among the influencing factors the technical expert panel thought were potentially important, the only factors addressed were age, gender, daycare setting, season, and racial/ethnic origin, and those were addressed by only a few studies. Study findings will be useful when they are generalizable, either because of their similarity to the population served by a particular clinician or because they address children with characteristics similar to a specific child the clinician is assessing. However, the issue of OME resolution is relevant only if OME has an impact on outcomes of relevance, such as long-term speech, language, or hearing.
For evaluation of long-term effects of early-life otitis media on speech and language development, a coordinated uniform approach that uses a rational conceptual framework is recommended. Such an approach should address the risk factors, such as socioeconomic status, interventions, and outcome measures in an integrated fashion. Conceptual frameworks include the Global Language Model, the Interactive Language and Attention Model, and a transactional model (Vernon-Feagans, 1999; Roberts and Wallace, 1997). The Global Language Model hypothesizes that mild to moderate hearing loss that results from otitis media is the actual causal factor that leads to speech and language deficits. The Interactive Language and Attention Model also hypothesizes an important role for hearing loss. However, this model also distinguishes between early and later developmental effects and the timing of hearing loss in the life of the child and places greater emphasis on risk and protective factors that may interact with hearing loss as well as directly affecting speech and language development. The transactional model ascribes an important role to differences in parent or caregiver response to children with and without chronic otitis media on long-term effects.
Generalizability of study findings will be enhanced in future research if details of risk or influencing factors and interventions are well planned and documented. Included in this assessment of risk factors should be the issue of hearing loss (both conductive and sensorineural) associated with otitis media as a possible cause of long-term speech and language deficits.
For future systematic reviews, we propose the consideration of an “individual-level-data meta-analysis” method (Mathew and Nordstrom, 1999; Stewart and Clarke, 1995; Stewart and Parmar, 1993) to study the long-term effects on outcomes such as speech, language, or hearing, with many of the suggestions for improvement of study quality noted above. This approach would call for the collaboration of investigators from various institutions who have been following cohorts of children prospectively to contribute data on individual members of their cohorts. Eligible cohorts are identified based on a priori criteria. Risk factors, interventions, and outcomes of interest are also defined a priori. The unique feature of “individual-level-data meta-analysis” is the ability it confers to retrieve a uniform set of data directly on risk factor, intervention, and outcome data, case by case. This case-specific data set could then be analyzed using meta-regression or other multivariate techniques. A meta-analysis of updated individual patient data has been found to provide the least biased and most reliable means of addressing questions that have not been satisfactorily resolved by individual studies (Stewart and Parmar, 1993). However, the quality of data and the ability for cohort investigators to collect and share relevant data are important factors in the success of this approach (Steward and Clarke, 1995).
In all aspects of analysis, definitions, classifications, and types and units of measure should be developed by a team of experts prior to the start of a study. A consensus on the definitions and classification of otitis media and on relevant outcome measures will allow for comparisons among cohorts. We realize the lack of knowledge as to what specific aspects of speech and language development might be affected by early otitis media. Nevertheless, we would encourage experts to develop a uniform panel of tests that would measure the broad array of possible aspects of speech and language development hypothesized to be affected and could be consistently applied in research studies by all investigators. Literature on findings should report univariate as well as multivariate analysis findings to allow pooling of data. Many studies reported correlation coefficients or regression coefficients, which are difficult to interpret and to use in quantitative synthesis.
Several prospective studies on the effect of early otitis media on long-term speech and language development are ongoing (Paradise, Dollaghan, Campbell et al., 2000; Feldman, Dollaghan, Campbell et al., 1999; Paradise, Rockette, Colborn et al., 1997; Roberts, Burchinal, Zeisel et al., 1998; Roberts, Burchinal, Jackson et al., 2000). Whether these studies answer more definitively the questions regarding the effect of otitis media on long-term speech and language and delineate areas, apart from the general research issues noted above, for further prospective studies on speech and language will be better assessed when the results of these studies are reported in the peer-reviewed literature.
Future research should attempt to confirm whether early-life otitis media leads to more permanent hearing loss than intermittent and transient conductive hearing loss. The importance of hearing loss, whether intermittent and transient or permanent and long-standing, associated with early-life otitis media, should be addressed as noted in the section above on the effect of early otitis media on speech and language development. Similar methodological recommendations, including the “individual-level-data meta-analysis” approach, apply to research on long-term hearing effects and speech and language effects of early otitis media. If OM does affect long-term hearing, the effect of OM treatment on long-term hearing and its cost-effectiveness are of great importance and must be addressed in future prospective studies.
Future research on the diagnosis of OME will need to start with the definition of OME. The difficulty in reaching a consensus on the definition of OME was seen in our discussion of this issue with our technical expert panel. The technical experts agreed that OME was defined as “fluid in the middle ear without signs or symptoms of ear infection,” as proposed by the 1994 OME guideline (Stool, Berg, Berman et al., 1994). However, they could not agree on which signs or symptoms should be absent, i.e. what signs or symptoms differentiated OME from acute otitis media. Without such agreement, we believe little progress can be made in improving the diagnosis of OME.
Limiting the assessment of OME diagnostic methods to those that address middle-ear effusion specifically, as we did, will require further expert consensus on important conceptual issues. One issue that was brought to our attention by one of our technical experts and that was discussed in depth by our technical expert panel was whether diagnosis of middle-ear effusion in the child with OME was different than in the child with acute otitis media. For example, since the child with acute otitis media is in discomfort, whatever the symptoms that are ascribed to acute otitis media, that child will be more difficult to examine for the presence of middle-ear effusion than a child with OME, who by definition is asymptomatic. After much discussion, our panel decided that the diagnosis of middle-ear effusion was different in the context of these two clinical conditions; however, we are aware that other experts may not agree with this opinion. Future systematic reviews will require studies of much higher quality than are currently available. In addition, future studies must provide details on the characteristics of the children studied and the study setting so that the generalizability of the findings can be assessed. Studies confined to children with known middle-ear effusions in tertiary care settings may be easier to conduct, but the clinician in general practice is faced with children whose middle-ear status is unknown at the time of presentation. Future research must provide information that is applicable to the child with unknown middle-ear status in the primary care setting.
Pneumatic otoscopy might appear to be less costly and more easily employed by the typical clinician than other diagnostic options such as tympanometry and acoustic reflectometry. Nevertheless, future studies on the diagnostic assessment of OME should consider cost-effectiveness analysis, which can take into account the variable proficiency of clinicians in performing pneumatic otoscopy as well as the consequences of testing and patient preferences (Sox, Stern, Owen et al., 1989). Cost-effectiveness analysis will enable more informed decisions on the best diagnostic method for OME. The assessment of more complex diagnostic methods such as combination tests or algorithms would also benefit from cost-effectiveness analysis. Such analysis should be undertaken in the future.
[The eight components of study quality score are: study cohort clearly defined; subjects assembled at a uniform time point; pathway of subject entry clearly described; complete follow-up achieved; withdrawals/drop-outs described; objective outcomes used; outcome assessment blinded; and extraneous factors adjusted. 1 indicates presence and 0 indicates absence.]
[The eight components of study quality score are: study cohort clearly defined; subjects assembled at a uniform time point; pathway of subject entry clearly described; complete follow-up achieved; withdrawals/drop-outs described; objective outcomes used; outcome assessment blinded; and extraneous factors adjusted. 1 indicates presence and 0 indicates absence.]
“The six components of Study Quality Score are: appropriate reference standard; test and reference standard assessed independently of each other; blinded reading of results; patient sample included an appropriate spectrum as in clinical practice; reproducibility and interpretation of test results determined; and description of test method sufficient to permit replication. 1 indicates presence and 0 indicates absence.”
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]