Figure 3.1 Geographic distribution of kernicterus cases
The Agency for Healthcare Research and Quality (AHRQ), through its Evidence-based Practice Centers (EPCs), sponsors the development of evidence reports and technology assessments to assist public- and private-sector organizations in their efforts to improve the quality of health care in the United States. The reports and assessments provide organizations with comprehensive, science-based information on common, costly medical conditions and new health care technologies. The EPCs systematically review the relevant scientific literature on topics assigned to them by AHRQ and conduct additional analyses when appropriate prior to developing their reports and assessments.
To bring the broadest range of experts into the development of evidence reports and health technology assessments, AHRQ encourages the EPCs to form partnerships and enter into collaborations with other medical and research organizations. The EPCs work with these partner organizations to ensure that the evidence reports and technology assessments they produce will become building blocks for health care quality improvement projects throughout the Nation. The reports undergo peer review prior to their release.
AHRQ expects that the EPC evidence reports and technology assessments will inform individual health plans, providers, and purchasers as well as the health care system as a whole by providing important information to help improve health care quality.
We welcome written comments on this evidence report. They may be sent to: Acting Director, Center for Practice and Technology Assessment, Agency for Healthcare Research and Quality, 6010 Executive Blvd., Suite 300, Rockville, MD 20852.
Carolyn M. Clancy, M.D.
Acting Director
Agency for Healthcare Research and Quality
Jean Slutsky, Acting Director
Center for Practice and Technology Assessment
Agency for Healthcare Research and Quality
The authors of this report are responsible for its content. Statements in the report should not be construed as endorsement by the Agency for Healthcare Research and Quality or the U.S. Department of Health and Human Services of a particular drug, device, test, treatment, or other clinical service.
Objectives. This report summarizes the evidence on the effect of bilirubin on neurodevelopmental outcomes. It also examines the role of various effect modifiers on neurodevelopment, the efficacy of phototherapy, the accuracy of transcutaneous bilirubin (TcB) measurements, and the various strategies for predicting hyperbilirubinemia.
Search strategy. Primary research articles evaluated for this report were identified through a MEDLINE® search of English language literature published between 1966 and September 2001.
Selection criteria. Healthy infants ≥ 34 weeks gestation or ≥ 2,500 grams with hyperbilirubinemia comprised the target population. To be included, studies reported on bilirubin level and neurodevelopmental or behavioural outcomes. For assessment of treatment efficacy, the Evidence-based Practice Center (EPC) included studies that evaluated any form of treatment for neonatal hyperbilirubinemia and had at least 10 subjects per arm. For the diagnosis review section, only studies with a minimum of 10 subjects per arm and which used laboratory assay of serum bilirubin were included.
Data collection and analysis. The EPC reviewed 4,560 abstracts, retrieved 241 articles for close examination, and included 138 articles in this report. There were 28 articles on cases of kernicterus, 35 articles reported on correlations, 21 articles reported on treatments, and 54 articles were included in the diagnosis review section. Evidence tables of study features and results were produced. Summary tables reported an appraisal of the methodological quality of the studies and summarized results.
The EPC calculated the number needed to treat (NNT) to quantify the efficacy of treatment for neonatal hyperbilirubinemia. The EPC combined the sensitivity and specificity of the test independently and used the summary receiver operating characteristics curve to evaluate diagnostic test performance. A meta-analysis of correlation coefficients was conducted to correlate performance of TcB measurements with serum bilirubin.
Main results/conclusions. A summary of 28 reports, which spanned over 30 years, on 123 cases of kernicterus in term or near-term infants affirms the role of elevated bilirubin level in kernicterus. The disease, although infrequent, has significant mortality (at least 10 percent) and long-term morbidity (at least 70 percent).
Except in cases of kernicterus with sequelae, use of a single total serum bilirubin (TSB) level (within the range described in the studies) to predict long-term behavioral or neurodevelopmental outcomes for infants ≥ 34 weeks gestation is inadequate and will lead to conflicting results.
Six to ten jaundiced, otherwise healthy neonates with TSB ≥ 15 mg/dl, would need to be treated with phototherapy to prevent TSB from rising above 20 mg/dl in one infant. Phototherapy combined with cessation of breastfeeding and substitution with formula was found to be the most efficient treatment protocol.
Based on the evidence from the systematic review, TcB measurements by each of the three devices described in the literature—the Minolta AirShields bilirubinometer, the Ingram Icterometer, and the SpectRx BiliCheck™—have a linear correlation to total serum bilirubin and may be useful as screening devices to detect clinically significant jaundice and decrease the need of serum bilirubin determinations.
The report presents a comprehensive literature review of the effect of bilirubin on neurodevelopmental outcomes. This report also examines the role of various effect modifiers such as sepsis and hemolysis on neurodevelopment, the efficacy of phototherapy, the accuracy of transcutaneous measurement of bilirubin and the various strategies in predicting hyperbilirubinemia. As background, in 1994, the American Academy of Pediatrics published guidelines on the management of neonatal hyperbilirubinemia developed by the Provisional Committee for Quality Improvement and Subcommittee on Hyperbilirubinemia. Although there was no definitive evidence showing a specific level of bilirubin and subsequent serious adverse neurodevelopmental outcome, the task force, relying on retrospective epidemiologic data primarily derived from North American and European research, offered recommendations for the management of neonatal hyperbilirubinemia. These were based on evidence, when appropriate data existed, and based on expert consensus when data were lacking.
These recommendations were specifically directed at evaluation and treatment of hyperbilirubinemia in healthy term newborns; i.e. only infants without signs of illness or apparent hemolytic disease. Highlights in the recommendations included visual inspection of the skin to determine jaundice, use of total serum bilirubin (TSB) level as the relevant variable in determining treatment, and recommendation for exchange transfusion only if intensive phototherapy fails to lower the TSB to less than 20 mg/dl. Criticisms regarding these recommendations include the fact that visual inspection of jaundice may not be reliable, criteria based on age measured by number of days rather than hours is too coarse for precise interpretation of bilirubin level, and evidence about whether there is any adverse effect of phototherapy treatment on healthy term newborns is absent. Furthermore, some term infants without evidence of hemolysis may develop hyperbilirubinemia and kernicterus and there have not been any randomized controlled trials to assess the relation between bilirubin levels and adverse neurodevelopmental effects. Through review of evidence for five key questions, the report aims to supply data for an update of these recommendations.
The Evidence-based Practice Center (EPC) formed an evidence review team consisting of pediatricians and EPC methodological staff to review the literature and perform data abstraction and analysis. The evidence review team held meetings and teleconferences with external technical experts representing the American Academy of Pediatrics (AAP), the American Academy of Family Physicians, the National Association of Pediatric Nurse Practitioners, the Center for Quality of Care Research and Education, Harvard School of Public Health, and the organization, Parents of Infants and Children with Kernicterus. The EPC and its panel of external technical experts refined key questions proposed by the AAP and identified issues central to this report. A comprehensive search of the medical literature was conducted to identify the evidence available to address the following questions:
1. What is the relationship between peak bilirubin levels and/or duration of hyperbilirubinemia and neurodevelopmental outcome?
2. What is the evidence for effect modification of the results in Question 1, by gestational age, hemolysis, serum albumin, and other factors?
3. What are the quantitative estimates of efficacy of treatment at (1) reducing peak bilirubin levels (e.g., number needed to treat at 20 mg/dl to keep TSB from rising); (2) reducing the duration of hyperbilirubinemia (e.g., average number of hours by which time TSB greater than 20 mg/dl may be shortened by treatment); and (3) improving neurodevelopmental outcomes?
4. What is the efficacy of various strategies for predicting hyperbilrubinemia, including hour-specific bilirubin percentiles?
5. What is the accuracy of transcutaneous bilirubin (TcB) measurements?
The target population included infants of at least 34 weeks gestational age. Based on the findings of an earlier National Institute of Child Health and Human Development (NICHD) study, in which none of the 1,339 infants greater than or equal to 2,500 grams were less than 34 weeks, the EPC felt justified in grouping infants weighing greater than or equal to 2,500 grams with those greater than or equal to 34 weeks gestation.
MEDLINE® and PreMEDLINE® databases were searched for this evidence report. In September 2001, the MEDLINE® database was searched for publications from 1966 to the present using relevant MeSH terms (“hyperbilirubinemia”, “hyperbilirubinemia, hereditary”, “bilirubin”, “jaundice, neonatal”, “kernicterus”) and textwords (“bilirubin”, “hyperbilirubin$”, “jaundice”, “kernicterus”, “neonat$”). The abstracts were limited to human and English studies focusing on newborns between birth and one month of age. In addition, the same textwords used for the MEDLINE® search were used to search the PreMEDLINE® database. The strategy yielded 4,280 MEDLINE® and 45 PreMEDLINE® abstracts. The EPC consulted domain experts and examined relevant review articles for additional studies.
In our preliminary screening of abstracts for each question, we identified over 600 potentially relevant articles for Questions 1, 2, and 3. For Questions 1 and 2, we included only studies that reported neurodevelopmental outcomes. Except for part of Question 3, studies concerning effects of different variables on bilirubin without neurodevelopmental outcome were not included in this review. For the specific question of quantitative estimates of treatment efficacy, all studies concerning therapies for bilirubin ≥ 20 mg/dl were included in the review. The inclusion and exclusion criteria for the systematic review were discussed in several teleconferences of the EPC evidence review team and technical experts. The criteria underwent several revisions before final acceptance by the panel members. The final screening criteria for inclusion and exclusion of articles are described below.
For the two questions on the association of neonatal hyperbilirubinemia with neurodevelopment outcomes, the inclusion criteria were: Infants ≥ 34 weeks of gestation or ≥ 2,500 grams and a sample size of more than five subjects per arm (except for case reports of kernicterus). The predictors were jaundice or hyperbilirubinemia and at least one of the neurodevelopmental outcomes was reported in the article. The study designs included prospective cohorts (more than two arms), prospective cross-sectional study, prospective longitudinal study, prospective single-arm study, or retrospective cohorts (more than two arms).
For key Question 3 on the treatments for neonatal hyperbilirubinemia, studies which focused on the number needed to treat had the following inclusion criteria: Infants ≥ 34 weeks of gestation or ≥ 2,500 grams and a study size of more than 10 subjects per arm. Treatments included any treatment for neonatal hyperbilirubinemia. Outcomes included serum bilirubin level ≥ 20 mg/dl or frequency of exchange transfusion specifically for bilirubin level ≥ 20 mg/dl. The study design was randomized or non-randomized controlled trials.
For all other studies reviewed for Question 3, the selection criteria were: Infants ≥ 34 weeks of gestation or ≥ 2,500 grams and sample size of more than 10 subjects per arm for phototherapy; any sample size for other treatments. Any treatment for neonatal hyperbilirubinemia was included and at least one neurodevelopmental outcome was reported in the article.
For Key Questions 4 or 5 on the diagnosis of hyperbilirubinemia, the inclusion criteria were: Infants ≥ 34 weeks of gestation or birthweight ≥ 2,500 grams and a sample size of more than 10 subjects. The reference standard was laboratory-based serum bilirubin.
Six hundred and sixty-three of a total 4,560 abstracts were identified as potentially relevant articles after preliminary screening. There were 158, 174, 99, 153, and 79 abstracts for Questions 1, 2, 3, 4 and 5 respectively.
After full-text screening (according to the inclusion and exclusion criteria described above), 138 of 253 retrieved articles were included in this report. Twenty-eight articles reported on cases of kernicterus, 35 articles reported correlations, 21 articles reported on treatments, and 54 articles were included in the review of diagnosis. There was some inevitable overlap because treatment effects and neurodevelopmental outcomes were inherent in the study designs.
Methodological quality or internal validity addresses the design, conduct, and reporting of the study. Some of the items belonging to this domain are widely used in various “quality” scales and for randomized controlled trials, and usually include items such as concealment of random allocation, treatment blinding, and handling of dropouts. Because different types of study designs are used to address different questions and for consistency in the interpretation across different designs, we defined a three-category scale to report the methodological quality of the studies in the evidence report: A (least bias), B (susceptible to some bias), or C (likely to have large bias).
Prospective. Complete methods and results (including inclusion/exclusion criteria). Proper control/comparison group, correct analyses performed.
Prospective or retrospective. Not all criteria of A. Some deficiencies; however, unlikely to cause major bias.
Prospective or retrospective. Significant design or reporting errors, large amount of missing information or bias.
Randomized controlled trial. Complete methods and results (including inclusion/exclusion criteria) described. Proper randomization and/or blinding, and correct analyses performed.
Non-randomized controlled trial or other prospective design (prospective cohort or case-control study). Proper selection of control group. Not all criteria of A. Some deficiencies; however, unlikely to cause major bias.
Retrospective or no control group. Significant design or reporting errors, large amount of missing information or bias.
Prospective. Complete methods and results (including inclusion/exclusion criteria) described. Proper reference standard used and correct analyses performed.
Prospective or retrospective. Not all criteria of A. Some deficiencies; however, unlikely to cause major bias.
Prospective or retrospective. Significant design or reporting errors, large amount of missing information or bias.
The number needed to treat (NNT), expressing the benefit of an active treatment over a control, was calculated to quantify the efficacy of treatment for neonatal hyperbilirubinemia. For Question 3 in this report, NNT can be interpreted as the number of newborns needed to be treated at 20 mg/dl to keep the TSB in one newborn from rising.
A meta-analysis of Question 4 was conducted using the summary ROC method to combine studies, which evaluated diagnostic test performance. A meta-analysis of correlation coefficients was conducted to correlate performance of transcutaneous bilirubin measurements with serum bilirubin.
A summary of 28 reports that spanned over thirty years of 123 cases of kernicterus in term/near-term infants affirms the role of elevated bilirubin level in kernicterus. The disease, although infrequent, has significant mortality (at least 10 percent) and long-term morbidity (at least 70 percent). It is important to note that a significant amount of demographic information was missing in these case reports. Thirty-five term/near-term (≥ 34 weeks gestation) infants with idiopathic hyperbilirubinemia developed kernicterus with a total serum bilirubin level (TSB) ranging from 22.5 mg/dl to 54 mg/dl. Eighty-eight infants with hyperbilirubinemia and other co-morbid factors (like sepsis and hemolysis) developed kernicterus with TSB ranging from 4 mg/dl to 51 mg/dl.
Excluding the Collaborative Perinatal Project (CPP), and the studies looking at IQ, of the nine studies primarily looking at behavioral and neurodevelopmental outcomes in patients, only three studies were of high methodological quality. One showed a correlation between bilirubin level and decreased scores on newborn behavioral measurements. One found no difference in prevalence of central nervous system abnormalities at age 4 years when bilirubin was below 20 mg/dl, but infants with bilirubin above 20 mg/dl had a higher prevalence of central nervous system abnormalities. Another study that followed infants with bilirubin greater than 16 mg/dl found no relationship between bilirubin and neuro-visual-motor testing at 61 to 82 months of age.
Six high quality studies (not counting the CPP) showed significant relationship between abnormalities in brainstem auditory evoked potentials and high bilirubin levels. The majority reported resolution with treatment. Three studies reported hearing impairment associated with elevated bilirubin (>16 mg/dl to > 20 mg/dl).
Again excluding CPP, of the eight studies reporting intelligence outcomes in subjects with hyperbilirubinemia, four were considered high quality. These four studies reported no association between IQ and bilirubin level with follow-up ranging from 6.5 years to 17 years.
The Collaborative Perinatal Project, with 54,795 live births between 1959 and 1966 from 12 centers in the United States has, by far, the largest database for the study of hyperbilirubinemia. Newman and Klebanoff focused only on black and white infants with birthweight ≥ 2,500 grams and performed a comprehensive analysis on the 7-year outcomes of 33,272 subjects. All causes of jaundice were included in the analysis. The authors found no consistent association between peak bilirubin level and IQ. Rate of sensorineural hearing loss was not related to bilirubin level. Only the frequency of abnormal or suspicious neurologic examination result was associated with bilirubin.
Short-term studies tend to be of high methodological quality compared to long-term studies but they use tools that have unknown predictive abilities. Long-term studies suffer from high attrition rates of study population and a non-uniform approach to defining “normal neurodevelopmental outcomes.”
Given the overall diverse conclusions, except in cases of kernicterus with sequelae, we conclude that the use of a single total serum bilirubin level (within the range described in the studies) to predict long-term behavioral or neurodevelopmental outcomes is inadequate and will lead to conflicting results.
The only study that directly addressed the above question used the CPP population and reported that at age 4 years, the frequency of low IQ with increasing bilirubin levels increased more rapidly in infants with infected amniotic fluid. At age 7 years, neurologic abnormalities were also more prevalent in that subgroup of infants.
In our summary of case reports of kernicterus, all four infants with multiple comorbid factors had sequelae.
Reserve albumin concentration and duration of hyperbilirubinemia will need further studies to understand the nature of neurodevelopment in relation to bilirubin physiology.
Regardless of different protocols of phototherapy, the Number-Needed-to-Treat (NNT) for prevention of serum bilirubin level exceeding 20 mg/dl ranged from 6 to 10 in healthy term or near-term infants. This implies that one needs to treat six to ten otherwise healthy jaundiced neonates with TSB ≥ 15 mg/dl by phototherapy in order to prevent the TSB in one infant from rising above 20 mg/dl. Phototherapy combined with cessation of breastfeeding and substitution with formula was found to be the most efficient treatment protocol for healthy term or near-term infants with jaundice.
For accuracy of various strategies for prediction of neonatal hyperbilirubinemia, 153 articles were included after title and abstract screening. However, only 17 articles were included after full text screening and 10 articles remained after data abstraction, seven percent of the original number. This was the lowest yield of articles among the five questions addressed, suggesting that relatively few prospective or retrospective studies addressed this question without significant design or reporting errors, missing data, or apparent bias.
A conclusion is difficult to make from these studies. The first challenge is the lack of consistency in defining clinically significant neonatal hyperbilirubinemia. Not only did multiple studies use different levels of total serum bilirubin (TSB) to define neonatal hyperbilirubinemia, but the levels of TSB defined as significant also varied by age; age at TSB determination varied by study as well. The second challenge is the lack of consistency in study populations. These studies were conducted among multiple racial groups in multiple countries, including China, Denmark, India, Israel, Japan, Spain and the United States. Although infants were defined as healthy term and near term newborns, these studies included neonates with potential for hemolysis from ABO incompatible pregnancies, as well as breastfed and bottle-fed infants. This information was often not specified.
Based on the evidence from the systematic review, transcutaneous measurements of bilirubin by each of the three devices described in the literature, the Minolta Airshields Jaundice Meter™, the Ingram Icterometer, and the SpectRx BiliCheck™ have a linear correlation to total serum bilirubin and may be useful as screening devices to detect clinically significant jaundice and decrease the need of serum bilirubin determinations.
The Minolta Airshields Jaundice Meter™ appears to perform less well in black infants as compared to white infants, performs best when measurements are made at the sternum, and performs less well when infants have been exposed to phototherapy. This instrument requires daily calibrations and each institution must develop its own correlation curves of TcB to TSB. As a screening test it does not perform consistently across studies as evidenced by the summary ROC curves. The Ingram Icterometer has the added limitation of lacking objectivity of the other methods as it depends on observer visualization of depth of yellow color of the skin.
The recently introduced BiliCheck™ and Colormate III devices that utilize reflectance data from multiple wavelengths, appear to be a significant improvement over the older devices, the Ingram Icterometer and the Minolta AirShields bilirubinometer, because of its ability to determine correction factors for the effect of melanin and hemoglobin. In one study, the BiliCheck™ was shown to be as accurate as standard laboratory methods in predicting TSB determined by the reference standard of high performance liquid chromatography (HPLC).
Future research in kernicterus would benefit from a uniform definition of the disease. Making kernicterus a reportable condition coupled with multi-center cooperation will help to elucidate its epidemiology and other factors in the development of the disease.
Duration of hyperbilirubinemia, bilirubin binding, and the role of hour-specific bilirubin measurement all merit further investigation.
Validation of an age-specific (by hour) nomogram for TSB in healthy full term infants, with evaluation of potential differences by gender, race and ethnicity, as well as prenatal, natal and postnatal factors would be beneficial. Once established, use of the 95th percentile to define clinically significant jaundice would provide uniformity across studies. This would be analogous to the use of age-specific systolic and diastolic blood pressure to define hypertension and age-specific body mass index to define overweight and obesity in children. Validation of a standardized TSB nomogram would either incorporate these potential differences or result in the development of population-specific nomograms if differences were significant. This would be analogous to the use of population-specific growth charts for weight and height percentiles in children.
Given the interlaboratory variability of measurements of serum bilirubin, future studies should use HPLC as the reference standard along with the routine laboratory methods of TSB in use when evaluating noninvasive measures of bilirubin.
Validation is needed of new technological advances in the transcutaneous measurement of bilirubin, such as the Bilicheck™ and Colormate III (that have the ability to correct for skin color effects and hemoglobin) in diverse clinical populations. This would address issues that might affect performance such as race, gestational age, age at measurement, phototherapy, sunlight exposure, feeding, and accuracy as screening instruments and for ongoing monitoring of jaundice. Additionally, studies should address cost effectiveness and reproducibility in actual clinical practice.
The American Academy of Pediatrics (AAP) asked the Agency for Healthcare Research and Quality (AHRQ) to commission a report which would critically examine the available evidence regarding the effect of high levels of bilirubin on behavioral and neurodevelopmental outcomes, the role of various effect modifiers (e.g., sepsis and hemolysis) on neurodevelopment, the efficacy of phototherapy, the reliability of various strategies in predicting hyperbilirubinemia, and the accuracy of transcutaneous measurement of bilirubin. The report will be used by AAP in updating the 1994 AAP guidelines for the management of neonatal hyperbilirubinemia. This review focuses on otherwise healthy term or near-term (at least 34 weeks gestational age or at least 2,500 grams birth weight) infants with hyperbilirubinemia.
High levels of serum bilirubin in the neonatal period have long been associated with the development of kernicterus, a rare condition characterized by hypertonicity, poor feeding, and high-pitched cry. Kernicterus was a frequent complication of hyperbilirubinemia caused by Rh incompatibility, a condition that also led to severe hemolytic anemia, and, in some cases, fetal demise due to hydrops fetalis. Over the past decades, improved obstetric and neonatal care has reduced the incidence of high bilirubin levels and kernicterus. Hydrops fetalis and severe hemolytic anemia caused by maternal-fetal Rh incompatibility have been dramatically reduced by the use of RhoGam™. Post-natal phototherapy has further reduced the incidence of hyperbilirubinemia.
Because of these advances, concerns were raised that the standards of therapy for hyperbilirubinemia in otherwise healthy term newborns were too aggressive leading to over-treatment of many newborns unnecessarily. In 1994, the American Academy of Pediatrics published a set of clinical practice guidelines for the management of hyperbilirubinemia in the healthy term newborn (Practice Parameter: Management of hyperbilirubinemia in the healthy term newborn. Provisional Committee for Quality Improvement and Subcommittee on Hyperbilirubinemia, American Academy of Pediatrics, 1994). The Subcommittee Task Force, relying on retrospective epidemiologic data primarily derived from North America and Europe and a 1990 report by Newman and Maisels (1990), issued a set of recommendations. These recommendations were based on evidence when appropriate data existed, and, when data was lacking, were based on expert consensus. There were no specific data concerning the relationship between specific levels of bilirubin and adverse neurodevelopmental outcomes. The AAP recommendations advocated an approach to the jaundiced infant that was less aggressive than previously recommended.
The AAP recommendations were specifically directed at evaluation and treatment of hyperbilirubinemia in healthy term newborns, i.e., only infants without signs of illness or apparent hemolytic disease. Some of the highlights in the recommendations included visual inspection of the skin to determine jaundice, use of total serum bilirubin (TSB) level as the relevant variable in determining treatment, and use of exchange transfusion only if intensive phototherapy fails to lower the TSB to less than 20 mg/dl between 24 to 48 hours of life, and below 25 mg/dl after 48 hours.
Criticisms regarding these recommendations included the fact that visual inspection of jaundice is not reliable. Criteria based on number of days since birth (instead of number of hours) is too coarse of a time interval for precise interpretation of the bilirubin level (a 25-hour-old baby with a bilirubin of 15 mg/dl is at a different percentile of bilirubin than a 48-hour-old baby with the same level of bilirubin, even though both of them would be classified as 2-day-old infants and would have fallen into the same category in the treatment guidelines). The adverse effects of phototherapy treatment on healthy term newborns are unknown, thus making the assessment of risks and benefits of the treatment difficult. Some authors have argued that only in the framework of randomized controlled trials will it be possible to adequately assess the relationship between bilirubin levels and development of adverse neurological sequelae (Seidman and Stevenson, 1995). More recently, continued reporting of new cases of kernicterus have raised the concern that standards for the treatment of hyperbilirubinemia have not been followed (Johnson, Bhutani, and Brown, 2002).
Bilirubin has an affinity for neuronal membrane phospholipids and can interfere with ion exchange and nerve conduction, and is therefore potentially neurotoxic. Unconjugated free bilirubin can enter the central nervous system, although conjugated or albumin-bound bilirubin does not. Autopsy studies of infants with kernicterus demonstrated bilirubin staining of the basal ganglia. Thus, there is a plausible association of high bilirubin levels with clinical kernicterus.
Neither hyperbilirubinemia nor kernicterus are reportable diseases, and there are no reliable sources of information providing national incidence estimates. Since the advent of effective prevention of Rh incompatibility and treatment of elevated bilirubin levels with phototherapy, both kernicterus and hyperbilirubinemia have become uncommon. When laboratory records of a 1995-1996 birth cohort of over 50,000 California infants was examined, Newman, Escobar, Gonzales, et al. (1999) reported that 2 percent had total serum bilirubin levels over 20 mg/dl; 0.15 percent had levels over 25 mg/dl, and only 0.01 percent had levels over 30 mg/dl. (These data were from infants with identified hyperbilirubinemia, and as such, it represents a minimum estimate of the true incidence of extreme hyperbilirubinemia.) This is undoubtedly the result of both successful prevention of hemolytic anemia, and of the application of effective treatment of elevated serum bilirubin levels in accordance with currently accepted medical practice. Projecting the California estimates to the national birthrate of four million per year (National Vital Statistics Reports, 2002), we can expect 80,000, 6,000, and 400 newborns with bilirubin levels of greater than 20 mg/dl, 25 mg/dl, and 30 mg/dl, respectively.
However, there is continued concern that the rise in early hospital discharges, coupled with a rise in breastfeeding rates, has led to a rise in the rate of preventable kernicterus resulting from “unattended to” hyperbilirubinemia (Sentinel Event Alert, 2001). A report published in 2002 (Johnson, Bhutani, and Brown, 2002), based on a national registry established since 1992, reported only 90 cases, although the efficiency of case ascertainment is not clear. Thus, there are no data to reliably establish incidence trends for either hyperbilirubinemia or kernicterus.
Despite these constraints, there has been substantial research on the neurodevelopmental outcomes of hyperbilirubinemia and its prediction and treatment. Subsequent sections of this review describe in more detail the precise study questions and the existing published work in this area.
This evidence report is based on a systematic review of the literature. Our EPC formed an evidence review team consisting of pediatricians and EPC methodological staff to review the literature and perform data abstraction and analysis. The evidence review team held several meetings and teleconferences with external technical experts representing the American Academy of Pediatrics, the American Academy of Family Physicians, the National Association of Pediatric Nurse Practitioners, the Center for Quality of Care Research and Education, Harvard School of Public Health, and the organization, Parents of Infants and Children with Kernicterus. The EPC and its panel of external technical experts refined key questions proposed by the AAP and identified issues central to this report. A comprehensive search of the medical literature was conducted to identify the evidence available to address the questions. For this evidence report, we compiled evidence tables of study features and results, appraised the methodological quality of the studies, assessed the correlations of the predictors and outcomes, summarized the results, and performed meta-analyses when there were sufficient data.
This section documents the methods and procedures that were used to develop this evidence report. It begins with a description of the scope of the research questions and proceeds to a detailed description of the techniques and approaches that were used in the literature review.
Question 1: What is the relationship between peak bilirubin levels and/or duration of hyperbilirubinemia and neurodevelopmental outcome?
Question 2: What is the evidence for effect modification of the results in question 1, by gestational age, hemolysis, serum albumin, and other factors?
Question 3: What are the quantitative estimates of efficacy of treatment for: 1) reducing peak bilirubin levels (e.g., number-needed-to-treat (NNT) at 20 mg/dl to keep TSB from rising); 2) reducing the duration of hyperbilirubinemia (e.g., average number of hours by which time TSB greater than 20 mg/dl may be shortened by treatment); and 3) improving neurodevelopmental outcomes?
Question 4: What is the efficacy of various strategies for predicting hyperbilirubinemia, including hour-specific bilirubin percentiles?
Question 5: What is the accuracy of transcutaneous bilirubin measurements?
This section describes the search terms, strategies, and databases that were used in the literature retrieval; the article screening and selection process; methods that were used for developing the data extraction forms, abstracting data, and reviewing and analyzing the literature; and the results of the literature review.
We searched the MEDLINE® database on September 25, 2001 for publications from 1966 to the present using relevant MeSH terms (“hyperbilirubinemia”, “hyperbilirubinemia, hereditary”, “bilirubin”, “jaundice, neonatal”, “kernicterus”) and text words (“bilirubin”, “hyperbilirubinemia”, “jaundice”, “kernicterus”, “neonatal”). The abstracts were limited to human and English studies focusing on newborns between birth and one month of age. In addition, the same textwords used for the MEDLINE® search were used to search the Pre- MEDLINE® database. The strategy yielded 4,280 MEDLINE® and 45 Pre-MEDLINE® abstracts. We consulted domain experts and examined relevant review articles for additional studies. A supplemental search for case reports of kernicterus in reference lists of relevant articles and reviews was also performed.
In our preliminary screening of abstracts, we identified over 600 potentially relevant articles for questions 1, 2, and 3. To handle this large number of articles, we devised the following scheme to address the key questions and to ensure that the report was completed within the time and resources constraints. We included only studies that measured neurodevelopment or behavioral outcomes (except for question 3 part-1, for which we evaluated all studies addressing the number-needed-to-treat (NNT) issue regardless of whether the study reported these outcomes). For the specific question on quantitative estimates of treatment efficacy, all studies concerning therapies designed to prevent hyperbilirubinemia (generally defined as bilirubin ≥ 20 mg/dl) were included in the review. The inclusion and exclusion criteria for the systematic review were discussed in several teleconferences of the EPC evidence review team and technical experts. The criteria underwent several revisions before their final acceptance by the panel members. The final screening criteria for inclusion and exclusion of articles are described below.
The target population of this review was healthy, full-term infants. For the purpose of this review, we included articles concerning infants who were at least 34 weeks estimated gestational age (EGA) at the time of birth. From studies that reported birthweight rather than age, infants whose birthweight was greater or equal to 2,500 grams were included. This cut-off was derived from findings of the National Institute of Child Health and Human Development hyperbilirubinemia study (Bryla, 1985), in which none of the 1,339 infants greater than or equal to 2,500 grams was less than 34 weeks EGA. Articles were selected for inclusion in the systematic review based on the following additional criteria:
Key Question 1 or 2 (risks association)
Population: Infants ≥ 34 weeks EGA or ≥ 2,500 grams
Sample Size: More than 5 subjects per arm
Predictors: Jaundice or hyperbilirubinemia
Outcomes: At least one behavioral/neurodevelopmental outcome reported in the article
Study Design: Prospective cohorts (more than 2 arms), prospective cross-sectional study, prospective longitudinal study, prospective single-arm study, or retrospective cohorts (more than 2 arms).
Case reports of kernicterus:
Population: Kernicterus case
Study Design: Case reports with kernicterus as a predictor or an outcome
Definitions of kernicterus: Acute phase of kernicterus (poor feeding, lethargy, high-pitched cry, increased tone, opisthotonus, seizures), kernicterus sequelae (motor delay, sensorineural hearing loss, gaze palsy, dental dysplasia, cerebral palsy, mental retardation), necropsy finding of yellow-staining in the brain nuclei.
Key Question 3 (treatments)
Number-Needed-to-Treat (NNT) question:
Population: Infants ≥ 34 weeks EGA or ≥ 2,500 grams
Sample Size: More than 10 subjects per arm
Treatments: Any treatment for neonatal hyperbilirubinemia
Outcomes: Serum bilirubin level ≥ 20 mg/dl or frequency of exchange transfusion specifically for bilirubin level ≥ 20 mg/dl
Study Design: Randomized or non-randomized controlled trials
All other issues:
Population: Infants ≥ 34 EGA or ≥ 2,500 grams
Sample Size: More than 10 subjects per arm for phototherapy; any sample size for other treatments
Treatments: Any treatment for neonatal hyperbilirubinemia
Outcomes: At least one neurodevelopmental outcome was reported in the article
Key Question 4 or 5 (diagnosis)
Population: Infants ≥ 34 EGA or birthweight ≥ 2,500 grams
Sample Size: More than 10 subjects
Reference Standard: Laboratory-based serum bilirubin
Case reports of kernicterus were excluded if they did not report serum bilirubin level, or gestational age and birthweight.
Preliminary screening identified 663 out of a total of 4,560 abstracts located through the literature search described above. There were 158, 174, 99, 153, and 79 abstracts for question 1, 2, 3, 4 and 5 respectively.
After full-text screening (according to the inclusion and exclusion criteria described above), 138 of total 253 retrieved articles were included in this report. There were 35 articles in the correlation section (questions 1 and 2), 28 articles of kernicterus case reports, 21 articles in the treatment section (question 3), and 54 articles in the diagnosis section (questions 4 and 5). There were inevitable overlaps because treatment effects and neurodevelopmental outcomes were inherent in the study designs. Below is a summary of the four-step selection process.
| Step 1 | Step 2 | Step 3 | Step 4 | |
|---|---|---|---|---|
| Number of Articles Identified by Literature Searches | Number of Articles Included After Title & Abstract Screening | Number of Articles Included After Full-Text Screening (% Included, Step 2 → Step 3) | Final Number of Articles Included (% Included, Step 3 → Step 4) | Percent of Articles Included, Step 2 → Step 4 |
| Correlation | Q1: 158 | Q1/Q2/Q3: 134 (31%) | Q1/Q2: 37 | 13% |
| Q2: 174 | Kernicterus: 37 | Kernicterus: 28 | 76% | |
| Treatment | Q3: 99 | Q3: 21 a | 21% | |
| Diagnosis | Q4: 153 | Q4: 17 (11%) | Q4: 10 b (59%) | 7% |
| Q5: 79 | Q5: 58(73%) | Q5: 46 b (79%) | 58% | |
| Total: 4560 | 663 | 253c | 138 | |
Eight studies were relevant to Q1, Q2 and Q3. In this table, they were summarized in Q3 section. Two articles were relevant to Q1/Q2 and Q3, 9 were definite Q3 articles, and 4 were NNT articles.
Two studies were relevant to Q4 and Q5
Some articles were not retrieved until the end of the review
Articles that passed the full-text screening were grouped according to topic and were carefully analyzed in their entirety. Data were abstracted onto the data extraction form that had been specially designed for each topic (see Appendix B for example of data extraction forms). Extracted data were synthesized into evidence tables.
The evidence found for the key questions is summarized in three complementary forms. The evidence tables provide detailed information about key features of the study design and results of all the studies reviewed. In addition, narrative description and tabular summary of the strength and quality of the evidence of each study are provided for each question. For question 5, meta-analyses were performed to provide quantitative estimates of test performance.
A total of six evidence tables are included in this evidence report. Two evidence tables were created for question 3 because of different types of outcomes analyzed. One table each was created for question 4 and question 5. For case reports of kernicterus, one evidence table was created for recording relevant data for each kernicterus case.
Grading of the evidence can be useful for indicating the overall methodological quality of a study. While a simple evidence grading system using a single scale may be desirable, the “quality” of evidence is multi-dimensional, and a single metric cannot fully capture information needed to interpret a clinical study. We believe that information on individual components of a study contribute more to the evaluation of evidence by deliberating bodies than a single summary score. The evidence-grading scheme used here assesses four dimensions that are important for the proper interpretation of the evidence:
study size
applicability
summary of results
methodological quality
Applicability, also known as generalizability or external validity, addresses the issue of whether the study population is sufficiently broad to be generalizable to the population at large. Individual studies are often unable to achieve broad applicability due to restricted study population characteristics and a small number of study subjects (Lau, Ioannidis, and Schmid, 1997). In this evidence report, because of the relative homogeneity (primary focus being healthy newborns) of the study populations, applicability is not explicitly graded for each study. Instead, where applicable, studies are grouped together to form more similar subgroups for analyses or discussion.
The study sample size is used as a measure of the weight of the evidence. A large study provides a more precise estimation of the treatment effect but does not automatically confer broad applicability unless the study included a broad spectrum of patients. Very small studies, taken individually, cannot achieve broad applicability. But several small studies that enrolled diverse populations, taken together, may have broad applicability. The study size is included as a separate dimension used to assist the assessment of applicability. For summarizing all studies, this would be the number of studies and the total number of patients in these studies.
Methodological quality or internal validity addresses the design, conduct, and reporting of the study. Some of the items belonging to this entity are widely used in various “quality” scales and for randomized controlled trials. They usually include items such as concealment of random allocation, treatment blinding, and handling of dropouts. Because different types of study designs are used to address different questions and for consistency in the interpretation across different designs, we defined a three category scale to report the methodological quality of the studies in the evidence report: A (least bias), B (susceptible to some bias), or C (likely to have large bias). These criteria are described below.
Criteria for evaluating the methodological quality of studies are that assess association (questions 1 and 2):
Prospective. Complete methods and results (including inclusion/exclusion criteria). Proper control/comparison group, correct analyses performed.
Prospective or retrospective. Not all criteria of A. Some deficiencies; however, unlikely to cause major bias.
Prospective or retrospective. Significant design or reporting errors, large amount of missing information or potential bias.
Criteria for evaluating the methodological quality of studies that assess effects of treatments (question 3):
Randomized controlled trial. Complete methods and results (including inclusion/exclusion criteria) described. Proper randomization and/or blinding, and correct analyses performed.
Non-randomized controlled trial or other prospective design (prospective cohort or case-control study). Proper selection of control group. Not all criteria of A. Some deficiencies; however, unlikely to cause major bias.
Retrospective or no control group. Significant design or reporting errors, large amount of missing information or significant potential bias.
Criteria for evaluating the methodological quality of studies that assess diagnostic test performance (questions 4 and 5):
Prospective. Complete methods and results (including inclusion/exclusion criteria) described. Proper reference standard used and correct analyses performed.
Prospective or retrospective. Not all criteria of A. Some deficiencies; however, unlikely to cause major bias.
Prospective or retrospective. Significant design or reporting errors, large amount of missing information or bias.
Confounders (for Key Question 1 only): (1) An ideal study design to answer Question 1 would be to follow two groups, jaundiced and normal infants, without treating any infant for a current or consequent jaundice condition, and observe their neurodevelopmental outcomes. Therefore, any treatment received by the subjects in the study was defined as a confounder. (2) If subjects had known risk factors of jaundice, such as prematurity or low birth weight, the risk factors were defined as confounders. (3) Any disease condition other than jaundice was defined as a confounder. (4) Since bilirubin level is the essential predictor, if the study did not report or measure bilirubin levels for the subjects, lack of bilirubin measurements was defined as a confounder.
Acute phase of kernicterus: poor feeding, lethargy, high-pitched cry, increased tone, opisthotonus, seizures.
Chronic kernicterus sequelae: motor delay, sensorineural hearing loss, gaze palsy, dental dysplasia, cerebral palsy, mental retardation.
Brainstem Auditory Evoked Potential (BAEP) or Brainstem Auditory Evoked Response (BAER)
This test is generally used for screening of newborn hearing and the report uses the following definition:
“It is recorded as five to seven waves. Waves I, III and V can be obtained consistently in all age groups. Waves II and IV appear less consistently. The latency of each wave (time of occurrence of the wave peak after stimulus onset) increases, and the amplitude decreases with reductions in stimulus intensity or loudness. Developmental change occurs in the latency of the various waves; latency decreases with increasing age, with the earliest waves reaching mature latency values earlier in life than the later waves. It is used as an audiometric test, providing information about the ability of the peripheral auditory system to transmit information to the auditory nerve and beyond, and it is used in the monitoring of central nervous system pathology. It is not accurate in predicting neurologic recovery and outcome. (Behrman, Kliegman, Jenson, 2000).”
In this report, two statistical analyses were performed when there was sufficient data: the number needed to treat (NNT) and receiver operating characteristics (ROC) curve.
NNT, expressing the benefit of an active treatment over a control, was calculated to quantify the efficacy of treatment for neonatal hyperbilirubinemia. NNT can be used either for summarizing the results of a therapeutic trial or for individualized medical decision-making. For key Question 3 in this report, NNT can be interpreted as the number of newborns needed to be treated at 20 mg/dl to keep the TSB in one newborn from rising.
The absolute risk reduction (ARR) is the difference between the event rate in the treatment group and the event rate in the control group. It is the denominator in the NNT calculation. We report 95% confidence intervals along with all estimates.
ROC curves were developed for individual studies of key question 4 if multiple thresholds of a diagnostic technology were reported. The areas under the ROC curves (AUC) were calculated to provide an assessment of the overall accuracy of the test and allow indirect comparisons with other tests.
Meta-analyses were performed to quantify the transcutaneous bilirubin measurements where the data were sufficient. We used three complementary methods for assessing diagnostic test performance: summary receiver operating characteristics (SROC) analysis, independently combined sensitivity and specificity values, and meta-analysis of correlation coefficients. All meta-analysis schematic and statistics were reported in the Meta-Analyses chapter.
The SROC method assumes that the variability in the reported sensitivity and specificity values from different studies is due to different cut-off values applied (Moses, Shapiro, and Littenberg, 1993). Each study provides a pair of sensitivity and specificity values to the analysis. It uses a regression method to fit a curve that best describes the data in the ROC space. We used the unweighted SROC method because it is probably less biased than the weighted regression method (Irwig, Macaskill, Glasziou et al. 1995).
The areas under different SROC curves can also be calculated and compared across technologies. However, the range of sensitivity and specificity values from studies in a meta-analysis of diagnostic tests is often limited, and extrapolation of the SROC analysis beyond the values of actual data is not reliable. Most of the technologies we examined have narrow reported ranges of sensitivity or specificity values. Therefore, we did not calculate the area under the SROC curve for any of the technologies.
When there is little variability in the test results—studies appeared to be operating at similar thresholds and reported similar results—SROC analysis provided little additional information. In this case, separately averaged sensitivity and specificity values across studies will give similarly useful summary information.
We combined the sensitivity and specificity values of the tests across studies using a random-effects model to estimate the average values. A random-effects model incorporates both the within-study variation (sampling error) and between-study variation (true treatment-effect differences) into the overall treatment estimate. It gives a wider confidence interval than the fixed-effects model (which considers only within-study variability) when estimates are based on heterogeneous results.
When each is combined separately, sensitivity and specificity tend to underestimate the true test sensitivity and specificity. They are nonetheless useful estimates of the average test performance and provide an indication of the approximate test operating point for most of the studies. The appropriateness of this method can be verified by inspecting the location of the combined estimates and noting the distance of the estimates from the SROC curve. In our experience, the random effects-averaged sensitivity and specificity results are close to the unweighted SROC curve and well within the confidence intervals of each other. Average sensitivity and specificity results also serve as useful baseline test performance values for the decision and cost-effectiveness analysis.
Correlation coefficients measure the correlation of one diagnostic test to another, but do not provide any information about the clinical utility of the diagnostic test. Also, they are inadequate for measuring the accuracy of a test in estimating serum bilirubin levels for two reasons. Although correlation coefficients (r) measure the association between transcutaneous bilirubin and “standard” serum bilirubin measurements, the correlation coefficient is highly dependent on the distribution of serum bilirubin in the study population selected. Second, correlation measures ignore bias and measure relative rather than absolute agreement (Bland and Altman, 1986). Nonetheless, a large number of studies continued using the correlation coefficient but only some of them did sensitivity and specificity analyses for examining the test accuracy.
Meta-analyses were performed to compare transcutaneous bilirubin (TcB) against laboratory essay of total serum bilirubin (TSB) measurements, where the data were sufficient. These studies provided the correlation coefficient for TcB vs. TSB, but several analytic issues arose regarding potential duplication of information. To address these issues, the following rules were developed and applied:
Rule 1. Whenever a study provided not only data for the whole population but also data splitting the population into subgroups of patients, only the one entry about the whole population was used to avoid duplicating the data. However, if the subgroups of patients did not meet our inclusion criteria described before, the whole population data was dropped and only the qualifying subgroup data was entered.
Rule 2. When a study gave data for separate conditions on the same subjects and then also an aggregate of all conditions, only aggregate data was used to avoid duplication.
Rule 3. Studies were indexed by different measurement sites or measurement protocols (metrics). When several different metrics were provided for a specific comparison, all were retained and entered in the respective metric-specific subgroup analyses. However, only one metric was retained for the overall synthesis to avoid duplication. The order of preference was: forehead, sternum, other sites (preferring the most often used site).
We combined the correlation coefficient across studies using a random-effects model. The random-effects model incorporates both within and between study variations. Such models yield more “conservative” (i.e. wider) confidence intervals. Subgroup analyses on different factors that might affect the correlation coefficient of the test and gold standard were also performed.
Since using the repeated measurements in one subject would overestimate the correlation coefficient, the number of measurements was replaced by the number of patients whenever it was available. We report 95 percent confidence intervals for all estimates.
Statistical analyses using the SROC curve method and combining sensitivity and specificity using the random-effects model was performed using “Meta-Test” version 0.6. The computer program was developed by the EPC director (Dr. Lau) and is available to the public. Meta-analysis of correlation coefficient using a random-effects model was performed using Comprehensive Meta-Analysis™ version 1.0.23 (Biostat™ Inc.). We report 95 percent confidence intervals along with all estimates.
Where data was available, we extracted serum bilirubin measurements on admission or at the time of diagnose of kernicterus. We also extracted the peak serum bilirubin levels. From these data, we created histograms on the distribution of serum bilirubin levels. We performed subgroup analyses of these cases to see if there are factors related to the patterns of serum bilirubin distribution.
Results of this review are presented in the following order. First, we present findings for possible association between neonatal hyperbilirubinemia and neurodevelopmental outcomes. Second, efficacy data and effects of treatment for neonatal hyperbilirubinemia are reported. Third, accuracy data for diagnosis and prediction of neonatal hyperbilirubinemia are tabulated and discussed. Fourth, limitations and summary results are discussed.
The first part of the results for this question deals with kernicterus; the second part deals with otherwise healthy term or near-term infants who had hyperbilirubinemia.
| Sex | Feeding | Total | |||
|---|---|---|---|---|---|
| Breast | Formula | Unknown | |||
| Newborns with idiopathic jaundice with kernicterus | M | 7 | 1 | 6 | 14 |
| F | 2 | 0 | 4 | 6 | |
| ? | 11 | 0 | 4 | 15 | |
| Subtotal | 20 | 1 | 14 | 35 | |
| Newborns with comorbid factors with kernicterus | M | 8 | 0 | 17 | 25 |
| F | 1 | 1 | 19 | 21 | |
| ? | 6 | 0 | 36 | 42 | |
| Subtotal | 15 | 1 | 72 | 88 | |
| Total | 35 | 2 | 86 | 123 | |
| Race/Ethnicity | ||||||||
|---|---|---|---|---|---|---|---|---|
| AA | AF | Chinese | Malay | White/NE | Pal | Unknown | Total | |
| Newborns with idiopathic jaundice with kernicterus | 0 | 0 | 6 | 2 | 9 | 1 | 17 | 35 |
| Newborns with comorbid factors with kernicterus | 4 | 2 | 14 | 1 | 6 | 61 | 88 | |
| Total | 4 | 2 | 23 | 3 | 15 | 1 | 78 | 123 |
AA = African American; AF = African; NE = Northern European; Pal = Palestinian
The 28 case reports with a total of 123 cases were from 14 different countries. The number of kernicterus cases in each study ranged from one to 12. The distribution of kernicterus cases geographically is shown in Figure 3.1
Kernicterus has been defined by pathological findings, acute clinical findings and chronic sequelae (like deafness or athetoid cerebral palsy). Because of the small number of subjects, all definitions of kernicterus have been tabulated together in the analysis. Exceptions will be noted in the following discussion.
Among infants ≥ 34 weeks or who had birthweight ≥ 2,500 grams and had no known explanation for kernicterus, there were 35 infants with peak bilirubin ranging from 22.5 mg/dl to 54 mg/dl. Fifteen had no information on their gender; 14 were males and six were females. Fourteen provided no information on feeding; 20 were breast-fed and one was formula-fed. The majority of the infants with kernicterus had bilirubin greater than 25 mg/dl: 25 percent of the kernicterus cases had peak total serum bilirubin (TSB) level up to 29.9 mg/d; 50 percent of the kernicterus cases had peak TSB up to 34.9 mg/dl (Figure 3.2
| Definitions of KI | N | Mean Peak TSB±SD, mg/dl (range) | Mean BW±SD†, g (range) | Sex | |
|---|---|---|---|---|---|
| Term infants w/ idiopathic Jaundice | Acute phase of KI w/o long term follow-up | 9 | 33.4±11.4 (22.5–54.0) | 3496±535 (2700–4500) | 3 Females; 2 Males; 4 ? |
| Acute phase of KI but normal long-term follow-up | 5 | 32.3±3.6 (27.0–36.0) | 3088±575 (2400–3742) | 5? | |
| Chronic KI sequelae | 17 | 37.0±5.9 (29.5–49.7) | 3271±442 (2700–4280) | 1 Females; 11 Males; 6 ? | |
| Death | 4‡ | 36.9±11.0 (23.0–48.0) | 2902±503 (2324–3357) | 2 Female; 1 Male; 1 ? | |
| Total | 35 | 35.4±8.0 (22.5–54.0) | 3251±512 (2324–4500) | 6 Females; 14 Males; 15 ? | |
KI = Kernicterus
Contain some missing data
? = Unknown
One infant had acute phase of KI and chronic KI sequelae; then died at age of 5 months. Necropsy was not done. One infant had acute phase of KI and died on the 5th day of life; necropsy found yellow-staining in the brain nuclei.
| Diagnosis of KI | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Clinical | Total (N) | Mean Peak TSB±SD, mg/dl (range) | Mean BW±SD†, g (range) | Sex | |||||
| Comorbidity | Acute phase w/o follow-up | Acute phase but normal follow-up | Chronic sequelae | Autopsy | |||||
| Term infants with cormobid factors | ABO incompatibility | 6 | 0 | 12 | 1‡ | 19 | 31.6±8.2 (19.0–51.0) | 3118±680 (2270–4313) | 6 Females; 11 Males; 2 ? |
| Rhesus incompatibility | 3 | 2 | 27 | 1 | 33 | 32.1±7.1 (17.7–46.0) | 3063±387 (2300–3969) | 5 Females; 4 Male; 24 ? | |
| G6PD deficiency | 10 | 0 | 1 | 2 | 13 | 31.8±8.5 (23.0–50.0) | 3353±437 (2700–4100) | 2 Males; 11 ? | |
| Sepsis or Infections | 3 | 1 | 4 | 5 | 13 | 31.8±9.9 (14.5–47.8) | 3368±586 (2580–4360) | 7 Females; 4 Males; 2 ? | |
| Multiple conditions | 3 | 0 | 4 | 3 | 10 | 29.1±16.1 (4.0–49.2) | 2913±750 (1780–3686) | 3 Female; 4 Males; 3 ? | |
| Total | 25 | 3 | 48 | 12 | 88 | 31.6±9.0 (4.0–51.0) | 3155±534 (1780–4360) | 20 Females; 25 Males; 42 ? | |
KI = Kernicterus
? = Unknown
Contain some missing data
This infant had acute phase of KI and chronic KI sequelae; then died at age 19 months.
In the 88 term and near-term infants diagnosed with kernicterus and who had hemolysis, sepsis and other neonatal complications, the bilirubin level ranged from 4.0 mg/dl to 51.0 mg/dl. Forty-two provided no information on their gender; 25 were males and 21 were females. Seventy-two had no information on feeding; 15 were breast-fed and 1 was formula-fed. The majority of those infants with kernicterus had bilirubin greater than 20 mg/dl: 25 percent of the kernicterus cases had peak total serum bilirubin (TSB) level up to 24.9 mg/dl; 50 percent of the kernicterus cases had peak TSB up to 29.9 mg/dl (Figure 3.4
Five infants without clinical signs of kernicterus were diagnosed with kernicterus by autopsy. Eight infants died of kernicterus. One infant was found to have a normal neurological exam at 4 months of age (Hanko, Lindemann, Hansen, 2001). Another one with galactosemia and a bilirubin of 43.6 mg/dl who had acute kernicterus was normal at 5 months of age (Ebbesen, 2000). Forty-nine patients had chronic sequelae ranging from hearing loss to athetoid cerebral palsy; the follow-up age reported ranged from 4 months to 14 years. Twenty-one patients were diagnosed with kernicterus with no follow-up information. To summarize, not including the autopsy diagnosed kernicterus, 10 percent of these infants died (8/82); two percent of infants were found to be normal at 4 to 5 months of age and at least 60 percent had chronic sequelae (Table 3.3). The distribution of peak TSB levels was slightly higher when only infants who died or had chronic sequelae were included (Figure 3.5
This section examines the evidence associating bilirubin exposures with neurodevelopmental outcomes primarily in subjects without kernicterus. Studies that were designed specifically to address the behavioral and neurodevelopmental outcomes in healthy infants greater or equal to 34 weeks of gestation will be discussed first, as that group of infants is the subject of interest in this review. With the exception of the results from the Collaborative Perinatal Project (CPP, see below), the rest of the studies comprising mixed subjects (preterm and term, diseased and non-diseased) will be categorized and discussed by outcome measures which include behavioral and neurologic outcomes, hearing impairment including sensorineural hearing loss, and intelligence measurements. CPP, with the largest number of subjects (54,795) in this review, has generated many follow-up studies with smaller number of subjects and those studies will be discussed together in a separate section.
| Author, Year, UI # | Subjects N (Control) | Peak bilirubin Level (range) mg/dl | Outcomes | Confounders | Quality |
|---|---|---|---|---|---|
| Rapisardi, 1989 89213178 | 29 | 8.8 (2.3–18.5) | @50–71 hrs of life: CV of cry frequencies, as an indicator of “neurophysiological functioning” | PhotoRx, Age | B |
| The correlation between the CV of F1 during phonation (PF1) and level of bilirubin was significant (r=0.37, p<0.05), suggesting that more variability in F1 was related to higher concentrations of bilirubin. | |||||
| Escher-Gräub, 1986 87136731 | 76 ( 401) | 11.7–14.6 | @day 4–5 of life: BNBAS | None | B |
| Jaundiced infants not requiring PhotoRx showed a less mature behavioral organization than the infants of the control group. | |||||
| Soorani-Lunsing, 2001 11726727 | 20 (20) b | 18.3 (>12.9) | @3–8 days, 3 and 12 months of life: Neurological exams | PhotoRx; No TSB level available for control group | C |
| Infants with TSB > 12.9 mg/dl had significantly higher prevalence of mild neurologic dysfunction.Controlling for other factors, the odds of developing mild neurologic dysfunction were 6.62 (95%CI 1.59–27.51), 3.75 (95%CI 1.03–14.53), and 9.47 (95%CI 1.67–53.65) at 3–8 days, 3 and 12 months of life respectively. | |||||
| @12 months of life: Behavioral Outcomes | |||||
| It didn't differ between study and control subjects according to the parental questionnaire. | |||||
| Grimmer, 1999 99345621 | 16 (18) | 23.0 (20.1–28.69) | @61–187 months: Neurological Outcome (Touwen sub-scales) | PhotoRx | C |
| None of the children studied had medical problems or overtly abnormal neurological findings. | |||||
| Cluster profiles did not differ significantly between formerly jaundiced children and controls, except for worse scores of formerly jaundiced children on the choreiform dyskinesia scale (P=0.028). | |||||
| Paludetto, 1986 86296856 | 17 (17) a | 10.6 (8.4–14.3 ) | @3rd, 4th days of life, and 1 month of age: BNBAS | Not all controls had TSB measured | C |
| There was no difference in any behavioral item of the BNBAS score at every observation. | |||||
Only 4 controls had bilirubin measurement.
No TSB level available for control group
BET = exchange transfusion; PhotoRx = phototherapy; BNBAS = Brazelton Neonatal Behavioral Assessment Scale; CV = coefficient of variation; F1 = formant frequency 1
Rapisardi, Vohr, Cashore, et al. (1989) studied and divided 29 healthy full-term infants into three different groups according to bilirubin level (less than or equal to 6 mg/dl, 6 to 12 mg/dl, greater than 12 mg/dl). Some of these infants received phototherapy. No further details were provided. There were no differences in mean values of acoustic measures (cry fundamental frequency, resonant frequency, energy and duration). There was a statistically significant difference between the three groups in the coefficient of variation (CV) of formant frequency-1 (frequency of resonance as determined by the filtering of vocal tract that is affected by the cross-sectional area and motor innervation of the supraglottal airway). Infants with the highest levels of bilirubin showed a greater CV of formant frequency-1 during phonation than infants with lower or medium levels of bilirubin (P<0.05). The authors suggested that “an increase in the variability of the first formant could be due to an increase in instability of the neural mechanisms controlling the cross-sectional area of the supraglottal airways.”
Escher-Graub and Fricker (1986) compared 76 term infants with maximum bilirubin of 11.7 to 14.6 mg/dl to 401 control term infants with bilirubin of less than 5.8 mg/dl using the Brazelton Neonatal Behavioral Assessment Scale (BNBAS). The BNBAS was done either on the fourth or fifth day of life. These term infants had no evidence of hemolysis and did not require phototherapy. The results showed that the two groups differed significantly (p<0.01 to 0.05) on 12 of the 28 scale items. Five of the six dimensions of newborn behavior were affected: habituation, orientation, motor performance, and regulation of state and autonomic stability. “Infants of the jaundice group showed a less mature behavioral organization than the infants of the control group.”
Soorani-Lunsing, Woltil, and Hadders-Algra (2001) assessed the neurologic condition of 20 healthy non-hemolytic term newborns with peak bilirubin of 13.6 to 26 mg/dl and 20 clinically non-jaundiced control infants. Ten of the jaundiced infants received phototherapy. A standardized neurologic examination similar to Prechtl's was carried out at the age of 12 months. “The examiner occasionally knew group membership of the children, but was never aware of the degree of hyperbilirubinemia of the study children.” Moderate hyperbilirubinemia was associated with a significant increase in minor neurologic dysfunction throughout the first year of life: newborn, 3 months of age and 12 months of age. Examples of minor neurologic dysfunction were mild abnormalities in postural behavior, the presence of high frequency tremors, mild deviancies in muscle tone regulation and mild asymmetries in infantile reactions and tendon reflexes. The odds of developing minor neurologic dysfunction at 12 months were 9.47 (95% CI 1.67 – 53.65). Neurologic outcome at 12 months was strongly related to the peak bilirubin level. The authors suggested that “total serum bilirubin levels of 335 micromole/liter (19.6 mg/dl) should be avoided.” The authors also noted that the behavior of the study and control children at age 12 months did not differ according to the parental questionnaires
Grimmer, Berger-Jones, Buhrer, et al. (1999) identified 29 non-hemolytic healthy term newborns with a total bilirubin between 20 and 30 mg/dl based on ten-year records of a referral center: Sixteen were available for a standardized, age-adjusted neurological examination (according to Touwen) at 61 to 187 months of age. All of them received phototherapy. Compared to 18 case controls with bilirubin less than 12 mg/dl, jaundiced children scored significantly (P=0.028) worse only on the choreiform dyskinesia scale. There was no difference in the other neurological subscales (fine and gross motor, sensorimotor apparatus and others).
Paludetto, Mansi, Rinaldi, et al. (1986), also using the Brazelton Neonatal Behavioral Scale, studied 17 jaundiced term 3-day-old infants (bilirubin of 8.4 to 14.3 with a mean of 10.6 mg/dl) not treated with phototherapy and 17 clinically non-jaundiced matched subjects; 14 four-day-old infants (8.4 to 12.9 with a mean of 10.4 mg/dl) and 14 clinically non-jaundiced matched subjects; 10 “ex-jaundiced” 1-month-old infants (9.1 to 15.9 with a mean of 11.5 mg/dl) and 10 clinically non-jaundiced matched subjects. No differences were found. The authors concluded “that moderate levels of hyperbilirubinemia not treated with phototherapy do not influence neonatal behavior.” Bilirubin levels were not measured in the control group.
The following group of studies consists of subjects who, in addition to healthy term newborns, might include newborns younger than 34-week gestation, neonatal complications such as sepsis, respiratory distress, hemolytic disorders and other factors. Nevertheless, some of the conclusions drawn might be applicable to a healthy term population. In these studies, greater emphasis will be placed on the reported results for the group of infants who were greater or equal to 34-week gestation or have birthweight greater or equal 2,500 grams.
| Author, Year, UI # | Subjects N (Control) | Peak bilirubin Level (range) mg/dl | Outcomes | Confounders | Quality |
|---|---|---|---|---|---|
| Vohr, 1989 89328704 | 23 a (27) | 14.3±2.8 (10–20) | On day 1 to 2: BNBAS | PhotoRx | A |
| Vohr, 1990 90339225 | Jaundiced infants had significantly lower BNBAS scores in every behavioral item except autonomic stability. | ||||
| Significant correlations were found in increased levels of TSB with decreased scores on the individual BNBAS items. After controlling for PhotoRx by partial correlations, most correlations remained significant except for state regulation and autonomic stability. | |||||
| Valaes, 1980 81031849 | 44 c (445) | 47% 16–20 | @61–81 months of age: Visual-Motor integration test (VMI) | Prematurity; BET; Phenobarbital | B |
| 29% 20–25 | Degree of jaundice was found not to be associated with neurologic scores. | ||||
| 24% >25 | |||||
| Hyman, 1969 69132491 | 405 b | ≥ 15 | @4 years of age: CNS abnormalities | BET; streptomycin Rx; BW; Selection bias | B |
| No significant difference in incidence of CNS abnormalities among infants exposed to different levels of bilirubin below 20 mg/dl. However, with TSB> 20 mg/dl, incidence of CNS abnormalities increased sharply. | |||||
| Yilmaz, 2001 21125314 | 87 g | 11% 23-20 | @10–72 months: Neurological exam | PhotoRx; BET; Duration of jaundice | C |
| 49% 20–23.9 | Subjects with maximum TSB levels < 20 mg/dl didn't show any neurological abnormalities, whereas 9.3% of the subjects with TSB 20–23.9 mg/dl (N=43) had only dysconjugate gaze and 17.6% of the subjects with TSB ≥ 24 mg/dl had neurological manifestations. | ||||
| 39% ≥ 24 | |||||
| Agrawal, 1998 99232291 | 30 f (25) | 22.4±2.7 | @1 year of age: DDST | PhotoRx; BET; lost to follow-up | C |
| Neurological development was normal in all infants with TSB 15–20 mg/dl, in 89% of infants with TSB 21–25 mg/dl, and in 67% infants with TSB > 25 mg/dl. | |||||
| Wolf, 1997 98030356 | 45 e @4 months | 28.5±6.3 | @4 months of age: Infant Motor Screening (IMS) | PhotoRx; BET; Prematurity; Illness | C |
| Wolf, 1999 99156393 | 35 @1 year | Linear correlation showed an association of maximum TSB and test rating at 4 months (r=0.32, p<0.05) and test scores at 4 months (r=0.44, p<0.03) | |||
| Mean TSB in the normal IMS group was 27.3±5.3 mg/dl; in the suspect and abnormal IMS group the mean TSB was 28±4 mg/dl and 33.7±10.3 mg/dl, respectively. These differences were not significant (p=0.06) | |||||
| @1 year of age: Bayley (BSID - PDI) | |||||
| 8 (23%) term infants, with mean TSB 33.4 mg/dl, had abnormal and suspect BSID and clinical diagnosis. | |||||
| 27 (77%) term infants, with mean TSB 26.5 mg/dl, scored normal on the BSID. | |||||
| “Correlation between Bayley motor score and TSB was 0.59 (p<0.01)” | |||||
| Chen, 1995 96017937 | 72 (22) | 39% 10–15 | @1 year of age: DDST and Neurological exams | PhotoRx; BET | C |
| 32% 15–20 | None (0%) of the infants w/ TSB 10–20 mg/dl showed any abnormality in the DDST and neurological examination. | ||||
| 29% >20 | Among infants w/ TSB > 20 mg/dl, 4 (22%) were abnormal in gross motor and fine motor skills of DDST. | ||||
| Grunebaum, 1991 91231060 | 46 | 12.06±2.6 | @age 31.1±16.6 months: Growth, Neurological exam, DDST | PhotoRx; Lost to follow-up | C |
| Growth and neurological examination were normal in all infants | |||||
| 2 (4%) had abnormal DDST, but all had normal DDST repeated 1 month later | |||||
| Holmes, 1968 68310019 | 63 (17) d | 6.4–24 (46% 6.4–12.5; 54% 15–24) | @4 years 7 months - 7 years 8 months: Neurologic examination;s Motor development (Oseretsky test); Audiometric exams | Age; BET; Streptomycin | C |
| In normal FT babies, mild to moderate hyperbilirubinemia is not associated with findings on Oseretsky motor development test. | |||||
| All had normal hearing, including subset with streptomycin Rx. | |||||
60% ABO incompatibility [56% of controls has ABO incompatibility as well]
Including 10% preemies and some hemolytic diseases
29% ABO incompatibility; 2% G6PD deficiency; 69% unknown cause of jaundice
All causes of jaundice were included in the study
43% term+ 57% preterm (mean GA 36.6±3.5 wk.). 44% LBW; 16% ABO incompatibility; 16% sepsis; 6% congenital syphilis
26.7% had ABO incompatibility and 63.3% had idiopathic hyperbilirubinemia
44% ABO incompatibility, 16% Rh incompatibility; 40% non-hemolytic jaundice
Vohr, Lester, Rapisardi, et al. (1989) or Vohr, Karp, O'Dea, et al. (1990) divided 50 term infants into either a group with low bilirubin of <8mg/dl or a group with moderate bilirubin of 10 mg/dl to 20 mg/dl. The Brazelton Neonatal Behavioral Assessment Scale (BNBAS) was administered between the second day and fifth day of life (mean ± S.D.; 65± 22 hours in moderate bilirubin group; 57 ± 13 hours in low bilirubin group, differences not significant). Some of these infants had positive Coombs test. None of the patients in the low bilirubin group received phototherapy. Seventeen of 23 patients in the moderate group received phototherapy. Partial correlation analysis controlling for phototherapy revealed that increased bilirubin concentration correlated negatively with BNBAS in state range (r= -0.28), orientation clusters (r= -0.42), and individual Brazelton items that involve auditory processing (r = -0.39 to -0.58).
Valaes, Kipouros, Petmezaki, et al. (1980), in a selective analysis of his maternal phenobarbital study, studied 44 term and preterm infants with peak bilirubin greater than 16 mg/dl. This group consisted of infants with ABO incompatibility, G6PD deficiency and nonspecific hyperbilirubinemia. Nine of them received exchange transfusion. These subjects had neurologic examination, Visual-Motor Integration test, Draw-a-Woman test and Georgas Vocabulary test at 61 to 82 months of age. No relationship between the level of bilirubin (from 16 mg/dl to >25 mg/dl) and test results was found.
Hyman, Keaster, Hanson, et al. (1969), from an original sample of 742 infants with hemolytic disease or indirect hyperbilirubinemia of ≥ 15 mg/dl (49 of whom died in the neonatal period), followed and studied 405 infants until 4 or more years of age. Analysis of the original group of 742 neonates as compared with the study group of 405 subjects showed that 70 percent of those with bilirubin ≥ 20 mg/dl as compared with 55 percent of those with bilirubin < 20 mg/dl were followed. Three hundred sixty-eight were term. Three hundred forty-eight had Rh and 48 had ABO incompatibility. Three hundred fifty-one received streptomycin and exchange transfusion. There was no significant difference in incidence of CNS abnormalities (sensorineural hearing loss, athetosis, strabismus, seizures, minimal cerebral dysfunction) among infants exposed to different levels of bilirubin below 20 mg/dl. Infants exposed to bilirubin above 20 mg/dl had a 20 percent incidence of CNS abnormalities vs. 7 percent incidence in the group with exposure below 20 mg/dl.
Yilmaz, Karadeniz, Yildiz, et al. (2001) studied 87 (out of a possible 755) children who were term newborns with jaundice. There were no control subjects in this study. Fourteen subjects had Rh and 38 had ABO incompatibility and 35 had non-hemolytic jaundice. 11.4 percent of subjects were treated with phototherapy; 11.5 percent were treated with phototherapy and exchange transfusion. Follow-up examinations were done from 10–72 months. Neurological dysfunction was observed in 11.5 percent (10/87) of cases. These patients had significantly higher peak bilirubin levels (26.1 ± 4.9 mg/dl vs. 23.2 ± 3.4 mg/dl) and longer duration of jaundice (6.1 ± 1.9 days vs. 4.7 ± 2.4 days) than those who were neurologically normal (p<0.05). There was no statistically significant relationship between the etiology of hyperbilirubinemia and the neurological prognosis.
Agarwal, Narayan, Kumari, et al. (1998) studied 30 term newborns with bilirubin > 15 mg/dl. Ten percent had Rh incompatibility, 26.7 percent had ABO incompatibility and 63.3 percent had idiopathic hyperbilirubinemia. Fourteen received exchange transfusion; the rest had phototherapy. Two subjects had kernicterus. At one year of age, screening by DDST showed that development was normal in 100 percent of infants with bilirubin of 15–20 mg/dl, 88.9 percent of infants with bilirubin of 21–25 mg/dl, and 66.7 percent of infants with bilirubin >25 mg/dl. No further statistical details were provided.
Wolf, Beunen, Casaer, et al. (1997) and Wolf, Wolf, Beunen, et al. (1999) studied the effect of maximum serum bilirubin in excess of 23.4 mg/dl on the neurodevelopmental outcome of 50 Zimbabwean neonates at four months and at one year of age. All received phototherapy, twenty-six were preterm infants, seven had sepsis without meningitis and seven received exchange transfusion. Out of 50 infants with serum bilirubin greater than 23.4 mg/dl, forty-five were evaluated at 4 months using the infant motor screen; six infants scored abnormal, six scored suspect. Forty-three were followed until one year of age, two died and five were lost to follow-up. Eleven scored within abnormal range on the Bayley Scales of Infant Development at 1 year of age and five developed the choreo-athetoid type of cerebral palsy. All of those infants were either preterm babies or had hemolytic diseases. All term infants without hemolysis and with bilirubin levels between 23.4 mg/dl and 29.2 mg/dl were normal at one year of age.
Chen and Kang (1995) studied Visual Evoked Potentials (VEP) in 72 term infants with hyperbilirubinemia and 22 control infants in four sessions for 8 weeks after birth. Reasons for jaundice were not stated. Infants with bilirubin >15 mg/dl were treated with phototherapy. Those with bilirubin ≥ 20 mg/dl also had exchange transfusion. The levels of maximum bilirubin found positively related to the wave latencies of first VEP. Within 8 weeks after birth, the wave latencies were significantly prolonged in infants in the severe (20 to 29.8 mg/dl) and moderate (15 to 18.6 mg/dl) groups than in the controls (4.8 to 8 mg/dl). Neurological examination and the Denver Developmental Screen Test (DDST) were performed on 62 of the term infants with neonatal hyperbilirubinemia at 1 year of age. Of the sixty-two, 24 infants belonged to the low (11.6 to 14.4 mg/dl), twenty to the moderate and 18 to the severe groups. None of the infants in the low and moderate groups showed any abnormality in the neurological examination and DDST. In the severe group, four of the 18 infants were abnormal in gross motor and fine motor skills of the DDST. One of these four infants showed general hypotonia, and his VEP's showed markedly low amplitudes and prolonged peak latencies for 8 weeks. Statistical analyses of the neurodevelopmental findings were not presented.
Grunebaum, Amir, Merlob, et al. (1991) enrolled 60 term and pre-term infants at 3 weeks of age into a breast milk jaundice study. Most infants were seen at out-patient clinic during the end of the second week of life (25 percent) or during the third week of life (28.3 percent). 13.3 percent were seen at an age of 5 weeks after birth or later. Infants, in whom the jaundice resolved within 3 weeks, were excluded from the study. Range of bilirubin at presentation was 4.2 mg/dl to 16.8 mg/dl. Fifteen infants received phototherapy. In 14 infants there was a brief interruption of nursing (in 13 cases for 24 hours and in one case for 48 hours). Forty-six subjects had neurodevelopmental examination between 10 and 60 months (mean age 31.1 ± 16.6 months). Their neurological examination was normal. All children had normal DDST except for two children. One had a delay in language ability and the other a delay in fine motor ability. A repeat DDST one month later was normal in both children.
Holmes, Miller, and Smith (1968) studied 63 children with neonatal bilirubin of 6.4 mg/dl to 24 mg/dl and 17 control children without neonatal clinical jaundice. These 80 children were selected out of a total of 195 available subjects. Only babies ≥ 2.5 Kg and who received more than one bilirubin determination were included in the experimental group. All causes of jaundice were included in the study. One case was excluded because he and his father displayed bizarre behavior. Another was excluded because he was mentally retarded; his bilirubin was in “low” range of 6–12 mg/dl. Eleven of 63 received exchange transfusion and 13 of 63 received streptomycin. A modified Oseretsky motor test and audiometry were performed in these children from 4 years 7 months to 7 years 8 months. The authors found no relationship between hyperbilirubinemia and the Oseretsky motor development test. Authors further noted that none of the subjects developed hearing impairment.
| Author, Year, UI # | Subjects N (Control) | Peak Bilirubin Level (range) mg/dl | Outcomes | Confounders | Quality |
|---|---|---|---|---|---|
| Tan, 1992 92139200 | 30 (31) | 16.7 | @0–3 days of Rx: BAER | PhotoRx | A |
| In PhotoRx group, there was significant difference between the preexposure and postexposure values of latency V and IPL III–V and I–V. | |||||
| The correlation between the bilirubin levels and the BAER values during PhotoRx was significant r=0.967 and p<0.001 for latency V; r=0.999 and p< 0.001 for IPL I–V; and r=0.963 and p< 0.001 for IPL III–V. | |||||
| Valaes, 1980 81031849 | 44 d (445) | 47% 16–20 | @61–81 months of age: Sensorineural hearing defect | Prematurity; BET; PB exposure | B |
| 29% 20–25 | Sensorineural hearing defect was significantly more common in children with moderate (bilirubin >12 mg/dl) or marked (bilirubin > 16 mg/dl) neonatal jaundice than in the children with slight or no jaundice (9.2% vs. 1.1%). | ||||
| 24% >25 | |||||
| Hyman, 1969 69132491 | 351 b | ≥ 15 | @4 years of age: Hearing loss | BW | B |
| Infants with TSB ≥ 20 mg/dl (N=108) had significantly higher prevalence of sensorineural hearing loss (10%), than infants with TSB < 20 mg/dl (N=243) (2%). | |||||
| Johnston, 1967 67081206 | 129 c (95) | >20 | @5–6 years of age: Audiological assessment | BET; Prematurity | B |
| Infants with TSB > 20 mg/dl had significantly more sensorineural hearing loss (7%), compared to infants with “low” bilirubin levels (0%). | |||||
| Suresh, 1997 98374505 | 42 g | Neonatal: 26.6±5.8 | @2 months to 21 years old: Hearing loss | Rx for hyperbiliru-binemia; Prematurity | C |
| Postnatal: 20.5±5.5 | Hearing was normal in 94% patients. No sensorineural hearing loss (reported in 36 patients). | ||||
| Boo, 1994 95018454 | 28 e (100) | 21.6±7.5 [18.4±7.4] | @11.6±3.9 days: Hearing loss | PhotoRx | C |
| 16% (13/83) of subjects with bilirubin < 19.8 mg/dl vs. 33% (15/45) of subjects with bilirubin > 19.8 mg/dl had hearing loss. The difference was not significant. | |||||
| Logistic regression analysis showed that the significant risk factors associated with hearing loss in the term jaundiced neonates were exchange transfusion and age of onset of hyperbilirubinemia. | |||||
| Esbjorner, 1991 91281442 | 9 a | 20.3 (16.4–25.6) | @4 (3.5–6) days, post Rx: ABR | PhotoRx | C |
| Significant correlation between the latencies to wave I, III, V, IPL I–V and the reserve albumin concentration (r=-0.89, p<0.01), but no significant correlation between the latency to wave V and the total bilirubin concentration was found (r=0.02, p=0.95) | |||||
| Rosta, 1971 72185033 | 84 f | ≥ 15 | @8 years of age: Hearing loss | BET, Streptomycin Rx | C |
| Severe sensorineural hearing loss were detected in 5 (6%), a mild to moderate one in 9 (11%) patients. “The above incidence was surpassing the frequency of similar alterations in the normal population.” | |||||
One infant had ABO incompatibility
Including 10% preemies and some hemolytic diseases
71% iso-immunization; 20% ABO incompatibility
29% ABO incompatibility; 2% G6PD deficiency
Non-physiologic causes of jaundice were known in 39 subjects but details were not provided
“Nearly equal distribution Rh and ABO immune-hemolysis, and hyperbilirubinemia of unknown origin”
All patients, age 2 months to 21 years old, were Crigler-Najjar syndrome type 1, resulting in life long jaundice.
| Author, Year, UI # | Sample N (Control) | Peak Bilirubin Level (range) mg/dl | Mean Latency (msec) | Mean Interpeak (msec) | Confounders | Quality | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| I | III | (IV),V | I–III | III–V | I–V | |||||
| Tan, 1992 92139200 | 30 (31) | 16.7 | 1.75 [1.77] | 4.56 [4.54] | 7.00 [6.75] | 2.80 [2.77] | 2.44 [2.25] | 5.25 [5.00] | None | A |
| Vohr, 1989 89328704 | 25 a (25) | 13.5±2.6 (10.2–19) | 1.8 [1.8] | 4.5 [4.3] | 7.5 [7.3] | 2.7 [2.6] | 3.0 [3.0] | 5.8 [5.5] | PhotoRx | A |
| Vohr, 1990 90339225 | ||||||||||
| Gupta, 1998 98355379 | 15 (15) | 21.98 (18.4–26.4) | - | - | - | 2.94 [2.48] | 3.65 [3.20] | 6.00 [5.12] | sedation; hemolytic disease; (Unknown Rx effect) | B |
| 30 (15) | 16.6 (15.8–18.0) | - | - | - | 2.65 [2.48] | 3.34 [3.20] | 5.83 [5.12] | |||
| 45 c (15) | 18.4 (15.8–26.4) | - | - | - | 2.88 [2.48] | 3.50 [3.20] | 5.92 [5.12] | |||
| Sabatino, 1996 97171477 | 48 (42) | 4–25.8 | 1.9 [1.8] | 4.8 [4.0] | 8.1 [7.0] | 3.0 [2.3] | 3.1 [2.9] | 6.1 [5.2] | None | B |
| Perlman, 1983 84041257 | 24 b (19) | 18.7±2.7 | - | - | - | 2.77 [2.67] | 3.20 [3.09] | 5.97 [5.75] | PhotoRx; BET | B |
| Agrawal, 1998 99232291 | 30 e (25) | 22.4±2.7 | 1.8 [1.7] | 4.8 [4.5] | 7.3 [6.7] | 3.0 [2.8] | 2.4 [2.2] | 5.4 [5.1] | None | C |
| Gupta, 1990 91244386 | 25 d (20) | 20–35 | 1.98 [1.96] | 5.22 [5.01] | 7.34 [7.10] | - | - | 5.36 [5.14] | BET; | C |
Bold = significantly different (p < .05)
n.s. = not significantly different
* = significantly different between pre- and post- treatment
60% ABO incompatibility [56% of controls has ABO incompatibility as well]
40% Rh incompatibility; 25% ABO incompatibility
38% G6PD deficiency; 20% ABO incompatibility
28% ABO incompatibility, 40% Rh incompatibility; 5% Subaponeurotic bleed
10% had Rh incompatibility, 26.7% had ABO incompatibility and 63.3% had idiopathic hyperbilirubinemia
[ ] Controls' value; Abn = abnormal = IPL I–V values > 4.34±0.15 msec; VEP =Visual Evoked Potential
Tan, Skurr, and Yip (1992) presented similar findings in a group of 30 healthy non-hemolytic term infants with mean bilirubin of 15 mg/dl and who had undergone phototherapy. When compared to control (bilirubin <8.2 mg/dl), this group of infants who had prolonged BAEP latency V (p<0.02) and interpeak latencies III–V (p<0.05). Concomitant improvement occurred in these values as the bilirubin levels declined in response to phototherapy.
Vohr, Lester, Rapisardi, et al. (1989), in a separate study of 50 term infants (some with ABO incompatibility, none with Rh incompatibility), found that moderate bilirubin group had significantly prolonged brain-stem conduction time. Correlation analysis showed that serum bilirubin values correlated positively with brain-stem conduction time (r = 0.36, p<0.01).
Vohr, Karp, O'Dea, et al. (1990), in the same study mentioned above, found increased bilirubin concentration correlated with an increased latency of brain-stem auditory evoked response in wave 4, 5 (latency of 7.50 ± 0.29 msec in the moderate bilirubin group vs. 7.32 ± 0.26 msec in the low bilirubin group, p<0.05).
In a similar study, Gupta and Mann (1998) performed BAEP's on 60 neonates (37–40 week gestation). Forty-five patients with bilirubin levels ranging from 15.8 mg/dl to 26.4 mg/dl were compared to 15 control patients who were either clinically non-jaundiced or had bilirubin < 12 mg/dl. Causes of hyperbilirubinemia included G6PD deficiency, ABO incompatibility and unknown reasons. Treatment information was not provided. A positive correlation was found between increasing levels of serum bilirubin and brain stem conduction time (P<0.01). Retest at one month of age showed 33.33 percent of cases with a mean serum bilirubin of 19.46 mg/dl and 80 percent with a mean bilirubin of 15.97 mg/dl showed total recovery. After 6 months, three cases with a mean bilirubin of 26.3 mg/dl and one case with a mean bilirubin of 17.7 mg/dl failed to show any improvement. The authors also concluded that neonates with distortion of normal wave patterns on BAEP's were found to have a poorer prognosis compared with those with delayed inter-peak (I-P) latencies.
Sabatino, Verrotti, Ramenghi, et al. (1996) studied brain stem auditory evoked potentials (BAEP) in 90 full-term newborns with jaundice. After enrolling only the subjects that showed the three major BAEP components (waves I, III and V), 48 full-term subjects with bilirubin from 13.9 to 25.8 mg/dl and without evidence of hemolysis were followed over a 3-year period. BAEP's were performed on day 3 and repeated 5–7 days post phototherapy or exchange transfusion until the bilirubin had decreased below 8 mg/dl. On day 3, latencies of waves III and V were significantly increased compared to control subjects. Recordings performed 5 to 7 days post therapy showed no significant differences during subsequent session. Serial neuropsychological evaluations at 6, 18 and 30 months using the Brunett-Lezine test showed no neurodevelopmental abnormality. Statistical analyses of the neuropsychological evaluations were not given.
Perlman, Fainmesser, Sohmer, et al. (1983) studied BAEP's in 24 neonates (37–41 week gestation) with bilirubin levels of 15 mg/dl to 25 mg/dl. Six had ABO incompatibility, one had Rh incompatibility, two had hemorrhage and the remainder had unknown reasons for jaundice. All 24 received phototherapy; three infants also received exchange transfusion. Nineteen neonates who were either clinically non-jaundiced or had bilirubin levels <12 mg/dl served as control. Hyperbilirubinemic infants had prolonged brainstem transmission time (P<0.01). Wave complex IV–V was present in all 19 control infants; it was absent in at least one recording of 10 of 24 jaundiced infants (P<0.001). Nine of 13 infants who had sequential studies over several days showed apparent improvement manifested by the subsequent appearance of waves that were initially absent or by shortening of initially prolonged latency.
Agrawal, Shukla, Misra, et al. (1998) in the previously mentioned study of 30 term newborns with bilirubin > 15 mg/dl, recorded BAEP's in these infants before and after phototherapy or exchange transfusion and at the age of 2 to 4 months. Seventeen of 30 infants showed abnormalities on initial BAEP's; these abnormalities correlated significantly with bilirubin level. After therapy, abnormalities reverted back to normal in 10 cases but persisted in seven out of 17. Follow-up BAEP's were done in 21of 30 cases at 14.4 ± 2.2 weeks. Of the 15 patients who had an initial abnormal BAEP, in only three were there persistent abnormalities. Two of them had kernicterus.
Gupta, Raj, and Anand (1990) studied 25 term infants with bilirubin > 20 mg/dl and who received exchange transfusion. Seven had Rh and 10 had ABO incompatibility; one had a subaponeurotic bleed; seven had undetermined cause for jaundice. Twenty normal term infants without jaundice served as control. BAEP's were performed at 10 ± 3 days, just before hospital discharge. Mean latencies (wave III, V) and I–V interpeak latency (brainstem conduction time) were significantly prolonged in jaundiced neonates as compared with controls (P<0.01). BAEP abnormalities were found with greater frequency in jaundiced neonates requiring a repeat exchange transfusion (P <0.001). However, all infants had normal BAEP's on follow-up retesting at 3 months.
Valaes, Kipouros, Petmezaki, et al. (1980) followed up 415 children (233 exposed to prenatal phenobarbital, 182 were not) at 61 to 82 months of age in a prenatal phenobarbital study. None of them received phototherapy. An unspecified number had exchange transfusion. Ten children were diagnosed with sensorineural hearing defect. It was significantly more common in children with moderate (bilirubin >12 mg/dl) or marked (bilirubin > 16 mg/dl) neonatal jaundice than in the children with slight or no jaundice (9.2 percent vs. 1.1 percent; P<0.05).
Hyman, Keaster, Hanson, et al. (1969), in the previously mentioned study of patients with hemolytic disease or hyperbilirubinemia, reported that in the 351 patients who received streptomycin, there was a significant association of sensorineural hearing loss and bilirubin ≥ 20 mg/dl as tested by audiometry after age 4 years. Eleven out of 108 patients (10.2 percent) with bilirubin ≥20 mg/dl vs. Five out of 243 patients (2.1 percent) with bilirubin <20 mg/dl developed sensorineural hearing loss (P<0.001).
Johnston, Angara, Baumal, et al. (1967) studied 129 children at 5 to 6 years of age with neonatal bilirubin >20 mg/dl. Ninety-two babies had Rh incompatibility, twenty-six had ABO incompatibility and 11 had other reasons for their hyperbilirubinemia. There were 24 premature babies in this group. Ninety-five percent of the cases received exchange transfusion. Seven children in this group (five of them are full term) had sensorineural hearing loss. In the control group of 95 children with low bilirubin (11 were premature babies), there was not a single case of sensorineural hearing loss (p=0.00128). Both groups of children received streptomycin when they were neonates.
Suresh and Lucey (1997) obtained information via returned questionnaires on 42 patients (2 months to 21-years-old) with Crigler-Najjar syndrome type-1 via returned questionnaires. Mean bilirubin was 19.8 ± 4.5 mg/dl (typical) in the neonatal period. The mainstay of therapy in these patients with Crigler-Najjar was home phototherapy for 10 to 16 hours a day. Hearing was reported to be normal in 34/36 patients. The two children with hearing loss had conductive loss from otitis media.
Boo, Oakes, Lye, et al. (1994) studied brain stem evoked response in 128 jaundiced term infants at a mean age of 11.6 days (S.D.=3.9). One hundred thirty-six were originally admitted to the nursery. Eight were excluded because of aminoglycoside treatment, cyanotic heart disease and congenital infection. Non-physiologic causes of jaundice were identified in 39 subjects but details were not provided. All subjects had phototherapy. Forty-three also had exchange transfusion. Sixteen percent (13/83) of subjects with bilirubin < 19.8 mg/dl vs. 33 percent (15/45) of subjects with bilirubin > 19.8 mg/dl had hearing loss. The difference was not significant. Logistic regression analysis showed that the significant risk factors associated with hearing loss in the term jaundiced neonates were exchange transfusion and age of onset of hyperbilirubinemia.
Esbjorner, Larsson, Leissner, et al. (1991) studied 9 healthy neonates (35 to 42 week gestation) with bilirubin from 16.4 mg/dl to 25.6 mg/dl. One case of jaundice was caused by ABO incompatibility; no underlying disease was found in the rest. All received phototherapy. No significant correlation between the BAEP latency to wave V and the total bilirubin concentration was found at 4 days of age. The authors did find a statistically significant correlation between the reserve albumin concentration (as measured by monoacetyldiaminodiphenyl sulphone analysis) and the latency to wave V (r = -0.89, p<0.001).
Rosta, Makoi, Bekefi, et al. (1971) studied 84 children with neonatal bilirubin of >15 mg/dl at 7 to 9 years of age. Etiologies of hyperbilirubinemia included ABO or Rh incompatibilities, and hyperbilirubinemia of unknown origin. All subjects had birthweight ≥2,500 grams. The authors did not specify exactly how many subjects received exchange transfusion and/or streptomycin. Five out of 84 had a severe sensorineural hearing loss and nine had a mild to moderate sensorineural hearing loss. There was no control group in this study.
| Study, Year, UI # | Subjects N (Control) | Bilirubin Level (range) mg/dl | Outcomes | Confounders | Quality |
|---|---|---|---|---|---|
| Bengtsson, 1974 74170112 | 111 g (115) | 20.0–51.0 | @6.5–13 years old: IQ; Sensorineural hearing loss | BET | B |
| No statistically significant correlation was found between I.Q. and maximal bilirubin values | |||||
| 3 (7%) children with ABO-incompatible hyperbilirubinemia had neurogenic hearing loss, while none was found in control (P>0.05). | |||||
| Rosta, 1971 72185033 | 84 I | ≥ 15 | @8 years of age: IQ | BET, Streptomycin Rx | C |
| Distribution of IQ was the same as in the normal population. 4 (5%) children had IQ below 0.70, 11 (13%) had an IQ between 0.70 and 0.80. | |||||
| Hyman, 1969 69132491 | 29 h | ≥ 15 | @4 years of age: IQ; Auditory rote memory difficulties | BET; streptomycin Rx; BW; Selection bias | C |
| No association was found between high bilirubin exposure and low IQ (N=38). | |||||
| No significant difference in the prevalence of auditory rote memory difficulties between infants with TSB ≥ 20 mg/dl (N=17) and infants with TSB < 20 mg/dl (N=21) | |||||
All had negative Coombs' tests
from original 405 subjects, Including 10% preemies and some hemolytic diseases
“Nearly equal distribution of Rh and ABO immune-hemolysis, and hyperbilirubinemia of unknown origin”
| Author, Year, UI # | Sample N (Controls) | Bilirubin Level (range) mg/dl | Mean IQ | Confounders | Quality | ||
|---|---|---|---|---|---|---|---|
| Full | Verbal | Performance | |||||
| Seidman, 1991 91375839 | 308 c (1,496) | Var1 | 103 | - | - | PhotoRx; BET | B |
| 144 d (1,496) | Var2 | 103 | |||||
| Bengtsson, 1974 74170112 | 66 | 20–35 | 107 | - | - | BET | B |
| 45 a | 20–51 | 106 | - | - | |||
| Culley, 1970 70266548 | 73 b (48) | 12–16 | 106 | - | - | Estimated TSB levels | B |
| > 16 | 107 | - | - | ||||
| Ozmert, 1996 97154941 | 13 (27) | 17–20 | 102 | 102 | 102 | Possible selection bias (Unknown Rx effect) | C |
| 26 (27) | 20–22 | 103 | 106 | 101 | |||
| 27 (27) | 22–25 | 99 | 101 | 95 | |||
| 19 (27) | > 25 | 102 | 103 | 101 | |||
| 17 g (27) | > 20 | 87 | 95 | 93 | |||
| Odell, 1970 70075541 | 8 (6) f | > 19 | 94 | - | - | BET | C |
| 25% < 80 | |||||||
| Johnston, 1967 67081206 | 129 e (82) | > 20 | 105 | - | - | BET; Prematurity | C |
100% ABO incompatibility
Healthy infants (no hemolytic disease; no congenital malformations; no perinatal asphyxia; no neonatal illness or infections)
Including some infants with Coombs positive tests
Including infants with hemolytic disease (2% subjects had Coombs positive tests)
71% iso-immunization; 20% ABO incompatibility
43% Rh incompatibility; 14% ABO incompatibility; 14% Other blood incompatibility; 21% unknown cause of jaundice
65% Rh and 35% ABO incompatibility
Var1 = 5–8 mg/dl on 1 day of life, 10–15 mg/dl on the 2nd day of life, or 13–20 mg/dl thereafter
Var2 = >8 mg/dl on the 1 day of life, >15 mg/dl on the 2nd day of life, and >20 mg/dl thereafter
Bengtsson and Verneholt (1974) studied 111 children with neonatal bilirubin of 20 mg/dl to 51 mg/dl and 115 controls without visible jaundice. All were healthy full-term infants with negative Coombs' test. Forty-four of the hyperbilirubinemic group received exchange transfusion. Follow-up examinations were done between 6½ to 13 years of age. No statistically significant correlation was found between IQ and maximal bilirubin values. Out of the 111 children with hyperbilirubinemia, two had athetotic coordination disturbance and four had sensorineural hearing loss, while none of the 115 controls had such disorders. The differences were not statistically significant.
Rosta, Makoi, Bekefi, et al. (1971), in the previously mentioned uncontrolled study of 84 children with bilirubin >15 mg/dl and in whom follow-up examinations were done 7 to 10 years later, found that “the distribution of IQ was the same as in the normal population. Four children had IQ below 0.70; 11 had an IQ in the range between 0.70 –0.80.” Actual comparative data to the normal population was not provided.
Hyman, Keaster, Hanson, et al. (1969), in the previously mentioned study of infants with hemolytic disease and hyperbilirubinemia of other etiology, selected 29 available patients with either sensorineural hearing loss or Minimal Cerebral Dysfunction and nine control patients without CNS abnormality for further testing after age 4 years by one psychologist. These tests included Stanford-Binet, Wechsler Intelligence Scale for Children and others. “Based on the tests used, 29 patients (22 term and seven premature) were found to have normal mentality, five (four term and one premature) had subnormal mentality, and four (three term and one premature) were of superior mentality. Patients exposed to high bilirubin [≥ 20 mg/dl] and low bilirubin [< 20 mg/dl] during the neonatal period were almost equally represented in each mentality rating. Further analyses of the five patients with subnormal mentality showed that three had very high bilirubin exposure, 30 mg/dl or greater. No other correlation between bilirubin and mentality could be made.”
Seidman, Paz, Stevenson, et al. (1991) analyzed 1,948 17-year-olds (out of a total of 2,044 from a single hospital) who had neonatal records. Bilirubin values were available in 585 (30 percent of the 1,948) infants; subjects who had no clinical jaundice and for whom bilirubin levels were not obtained were included in the mild bilirubinemia group. Subjects' transformed intelligence test scores (from verbal Otis test and nonverbal matrices test results to values that correlated with Wechsler Adult Intelligence Scale) and medical examination findings performed at the military draft board at 17 years of age were stratified according to serum bilirubin levels. No direct linear association was found between neonatal bilirubin levels and intelligence test scores. However, the risk for IQ score less than 85 was found to be significantly higher (p=0.014) among full-term Coombs-negative male subjects with serum bilirubin greater than 20 mg/dl (odds ratio, 2.96; 95% CI 1.29–6.79).
Culley, Powell, Waterhouse, et al. (1970) studied Stanford-Binet intelligence quotient at 6 years of age in 169 neonates with non-hemolytic jaundice and 159 control neonates. These neonates' birthweight ranged from <1500 g to >3000 g. Treatment information was not provided. After adjusting for either birth weight or gestation in each jaundiced group, authors concluded that “in non-hemolytic jaundice there is no evidence of reduction in intelligence quotients with increasing jaundice up to the permitted level of around 20 mg/dl.”
Ozmert, Erdem, Topcu, et al. (1996) studied 102 (out of 416) children between 8 and 13 years of age. All these children had neonatal bilirubin >17 mg/dl (17–48 mg/dl), ≥ 37 week gestation and birthweight ≥ 3000 g. Seventeen children had positive direct Coombs' test (11 Rh, six ABO incompatibility). Treatment information was not provided. The control group consisted of 27 children of the same age but without neonatal hyperbilirubinemia. The Wechsler Intelligence Scale Revised for Turkish Children and other tests were performed in these children. Children who had positive direct Coombs tests and bilirubin >20 mg/dl had lower IQ scores and more prominent neurological abnormalities than the control group (P<0.05). IQ scores and prominent neurological abnormalities did not differ among the other groups of children with negative Coombs test and hyperbilirubinemia. The authors also noted an important risk factor for prominent neurological abnormalities was the duration that the infant's serum indirect bilirubin remained > 20 mg/dl.
Odell, Storey, and Rosenberg (1970) located 32 children between the ages of 4 and 7 years of age who had a history of neonatal jaundice. These 32 children represented only a third of the total population on whom bilirubin studies were performed during neonatal life. Twenty-four had exchange transfusion. Sixteen had either ABO or Rh incompatibility. Birthweight ranged from 1400 grams to 3827 grams. Fifteen patients had birth weight >2,500 grams. Maximum bilirubin level ranged from 11.4 mg/dl to 31.6 mg/dl with a median of 19 mg/dl. Stanford-Binet or Wechsler Intelligence Scale, and other visual-motor and short-term memory tests were given to this group of children. Six chi-square analyses were performed using IQ <80, ≥ 80 vs. maximum bilirubin of 20, 21, 22, 23, 24 mg/dl and values of bilirubin <19 mg/dl and ≥ 19 mg/dl. No statistical significant relationship was found. The authors further reported that there was a significant correlation (P <0.01) between the presence or absence of brain damage (measured by psychometric testing) and the degree to which the patient's sera had been saturated with bilirubin (as determined by the salicylate displacement technique).
Johnston, Angara, Baumal, et al (1967), in the previously mentioned study of 129 children with neonatal bilirubin >20 mg/dl, found the average IQ in this group was 104.8. The control group of 82 children had an average IQ of 102.8. No further statistical analysis was provided.
| Author, Year, UI # | Subjects (N) | Outcomes | Confounders | Quality |
|---|---|---|---|---|
| Newman, 1993 94021217 | CPP subjects, born from 1959 to 1966, BW>2500 g (N=41,324; 33,272 had 7-year outcome data) | @7 yr. of age: Adjusted IQ, Neurologic exams; Hearing loss | BET | A |
| No significant difference in adjusted IQ between children with TSB < 20 mg/dl and ≥ 20 mg/dl in black and white subjects. | ||||
| There was no significant association between peak TSB and IQ (r=0.13 in whites; r=0.34 in blacks, p>0.05). | ||||
| Risk of abnormal or suspicious neurologic exam findings increased in stepwise manner from TSB <10 mg/dl to TSB ≥ 20 mg/dl. (P<0.001) Neurologic exam items most associated with bilirubin levels were mild and nonspecific motor abnormalities. | ||||
| 2.2% of CPP subjects had hearing loss. Adjusted OR of hearing loss was 1.08 (95% CI 0.3–3.5) | ||||
| Rubin, 1979 79153498 | Subjects (N=241) both in CPP and Educational Follow-up Study( EFS); plus a control group from EFS population (N=125) | @8 months of age: Bayley MDI & PDI | Lost to follow-up; BET | B |
| Infants with TSB 16–23 mg/dl (N=153) had significantly lower Bayley MDI and PDI scores at 8 months of age, compared to infants with TSB 11–15 mg/dl (N=66) and those with TSB ≤ 10 mg/dl (N=120). Controlling for GA, there was no longer a difference on MDI, but PDI remained significant. | ||||
| @1 year of age: Neurological exams | ||||
| Infants with TSB 16–23 mg/dl (N=153) had significantly more neurologic abnormalities at 1 year of age, compared to infants with TSB 11–15 mg/dl (N=69) and those with SB ≤ 10 mg/dl (N=120). Controlling for GA, the difference still remained significant. | ||||
| @5 & 6 years of age: MRT & ITPA | ||||
| No significant differences were found between groups. | ||||
| @7 years of age: IQ; Neurological exams | ||||
| No significant difference was found between groups. | ||||
| Scheidt, 1977 77209363 | CPP subjects, born from 1959 to 1964, only term and FT infants (GA≥36 wk and BW>2500 g) were included for current review (original sample size=24,524) | @8 months of age: Bayley MDI & PDI (White infants): | None was found (Unknown Rx effect) | B |
| PDI score was significantly decreased with elevated TSB levels (≥10 mg/dl). However, significant decrease in Bayley MDI score only occurred in white term infants with TSB≥15 mg/dl. | ||||
| @8 months of age: Bayley MDI & PDI (Black infants): | ||||
| For black term (GA ≥ 36 wk and BW > 2500 g) infants, both Bayley MDI and PDI scores were significantly decreased with elevated TSB levels (≥10 mg/dl). | ||||
| No association between TSB levels and Bayley MDI and PDI scores was found in black FT subjects (GA ≥ 37 wk and BW > 2500 g). | ||||
| Boggs, 1967 67261091 | CCP subjects, born from 1959 to 1966, BW > 2500 g (N=18,484) | @8 months of age: Bayley MDI & PDI | Hemolytic disease; BET; Congenital malformation | B |
| For infants with BW 2501–3000 g (N=4,916), the estimated increase in percent of infants with a low PDI score was 1.2% for each 1 mg/dl rise in TSB level. The estimated increase in percent of infants with a low MDI score was 0.8% for each 1 mg/dl rise in TSB level. | ||||
| For infants with BW ≥ 3000 g (N=13,568), the estimated increase in percent of infants with a low PDI score was 0.5% for each 1 mg/dl rise in total SB level. The estimated increase in percent of infants with a low MDI score was 0.3% for each 1 mg/dl rise in TSB level. | ||||
| Naeye, 1978 79054333 | CPP subjects, born from 1959 to 1966, mixed term and preterm infants (N=41,444) | @4 yr. of age: IQ | Prematurity, Placental infection (Unknown Rx effect) | C |
| Increased frequency of mental impairment in subjects with peak bilirubin > 7 mg/dl | ||||
| The frequency of low IQ with increasing bilirubin levels increased more rapidly in infected infants. | ||||
| @7 yr. of age: Neurologic exam | ||||
| An increase in the prevalence of abnormalities started at peak neonatal bilirubin levels of 12 to 13 mg/dl. | ||||
| Neurologic abnormalities were more prevalent in subjects who had amniotic fluid infections. | ||||
MRT = metropolitan readiness tests
ITPA = Illinois test of psycholinguistic ability
| Author, Year, UI # | Sample N (Controls) | Peak Bilirubin Level (range) mg/dl | Mean IQ | Confounders | Quality |
|---|---|---|---|---|---|
| Newman, 1993 94021217 | 41,324 a | < 10 | White:104; Black: 93 | BET | A |
| 10–15 | White: 103; Black: 93 | ||||
| White: 21,375 | 15–20 | White: 104; Black: 93 | |||
| Black: 19,949 | 20–25 | White: 105; Black: 91 | |||
| ≥ 25 | White: 106; Black: 90 | ||||
| Rubin, 1979 79153498 | 203 b (114) | 11–15 | 102 | Lost to follow-up; BET | B |
| 16–23 | 102 | ||||
| Upadhyay, 1971 71238892 | 20 c (20) | > 20 | 94 (10% < 80) | BET | C |
Bold = below normal range
Including infants with hemolytic disease (2% subjects had positive Coombs' tests)
Subjects were from CPP, likely to include hemolytic jaundice and jaundice due to infections (not explicitly stated)
50% ABO incompatibility; 35% Rh incompatibility; 5% Septicemia with a gram-negative organism
From the CPP sample of 54,795 live-born infants, Newman and Klebanoff (1993) excluded infants who were not singletons, with birthweight < 2,500 grams or unknown, with race other than black or white, in whom no bilirubin measurement was recorded and the ones who died before their first birthday. This resulted in a study sample of 41,324 (or 75.4 percent of the original CPP sample) singleton black or white infants with birthweight ≥ 2,500 grams, who survived for at least 1 year and had at least one bilirubin level recorded. Outcome data at 7 years were available on approximately 80 percent (33,272 out of 41,324) of these infants. Neonatal bilirubin levels were not associated with loss to follow-up. IQ was measured using the Wechsler Intelligence Scale for children at age 7.1 ± 0.4 years (mean ± SD) in whites and 7.2 ± 0.8 years in blacks. Neurologic examinations were done at 1 and 7 years of age. Only results at age 7 years were presented; results at age 1 year were similar. The result was classified into normal, suspicious and abnormal. Hearing evaluations were done at 3 years ± 2 months and 8 years ± 3 months using pure-tone audiometry. Hearing examination at 8 years was “discontinued for administrative reasons” at six of the 12 centers. Thus, subjects for whom hearing data were available were not representative of the cohort as a whole. There was no consistent association between peak bilirubin level and IQ. To compare results with those of Naeye (see below) and Seidman et al (see above), IQ scores were dichotomized to <85 or ≥85. There was no association between bilirubin levels and dichotomized IQ in either race. The risk of abnormal or suspicious neurologic examination results increased in a stepwise manner from 14.9 percent (4,346 out of 29,258) in the group whose maximum bilirubin level was <10 mg/dl to 22.4 percent (60 out of 268) in the group whose maximum bilirubin was ≥ 20 mg/dl (P<0.001). Athetosis was not associated with peak bilirubin levels in this study. The proportion of children with sensorineural hearing loss at age 8 years was about 2 percent regardless of bilirubin level. However, authors noted that “because a comparatively small subset [16,886 out of 33,272 subjects at 8 years or 51 percent] of the CPP received hearing examinations and sensorineural hearing loss was uncommon, the power to detect an effect of very high bilirubin levels was limited."
Rubin, Balow, and Fisch (1979) enrolled all 241 subjects with maximum bilirubin of 11 mg/dl and above from the 1,613 participants both in the Educational Follow-up study (EFS) and the Minnesota section of the Collaborative Perinatal Project into a study trying to determine the relationship of neonatal hyperbilirubinemia to cognitive development at ages 4 through 7. One hundred and sixty-four subjects with bilirubin levels from 11 to 15 mg/dl were classified as the moderately elevated group; 77 subjects with bilirubin from 16 to 23 mg/dl were classified as the high bilirubin group. A control group of 125 subjects with bilirubin levels of 0–10 mg/dl was randomly drawn from the remainder EFS population and designated as the low bilirubin group. Nine subjects from the high and moderate bilirubin groups and one from the low bilirubin group had respiratory distress. Fourteen (18.2 percent) of the high bilirubin group had exchange transfusions. Bayley scales of Mental and Motor Development were administered at 8 months. IQ measures were obtained at age 4 using the Stanford-Binet, Short form L-M, and at age 7 using the Wechsler Intelligence Scale for Children. Neurologic examinations were administered at 1 year and 7 years. At age 8 months, high bilirubin group had a lower mean Bayley Motor Score the low bilirubin group (p<0.05). At age one year, high bilirubin group also has more neurologic abnormalities than the low bilirubin group (p=0.007). There were no significant differences found amongst bilirubin groups on the Stanford-Binet at age 4, Wechsler Intelligence Scale or the neurologic examinations at age 7.
Scheidt, Mellits, Hardy, et al. (1977), using the CPP population, analyzed the relationships between maximum bilirubin levels and neurodevelopment at 8 months and at 1 year of age. This study included white and black infants. Infants from multiple births, with Down syndrome, meningomyelocoele, hydrocephalus and congenital heart disease were excluded. Only those infants examined within a narrow time frame were included. 24,524 infants were examined at age 8 months ± 2 weeks and 27,720 were examined at age 12 months ± 4 weeks. The effects of bilirubin were evaluated within five birthweight/gestational age categories suggested by Yerushalmy. The two groups that are of particular interest to the present report are group 4 (>2,500 grams and ≥ 36 weeks) and group 5 (>2,500 grams and ≥ 37 weeks). A significant difference in Bayley motor score occurred with bilirubin ≤ 9 mg/dl vs. 15 to 19 mg/dl for white groups 3, 4 and 5 and for black groups 1, 2 and 4 (P≤0.05). A significant difference in motor scores also occurred with bilirubin ≤ 9 mg/dl vs. 10 to 14 mg/dl for white group 3, 4, 5 and black groups 1, 2, 3, 4 (P ≤ 0.02). Infants in group 2 (1,501 – 2,500 g, ≤ 36 weeks) had most variables in the 8 months Bayley and 1-year neurologic examination associated with increased level of bilirubin.
Boggs, Jr., Hardy, and Frazier (1967) compiled a preliminary report of approximately 23,000 subjects from the CPP and their 8-months standardized Bayley scores. The subjects were singletons; some had Coombs positive hemolysis, some had exchange transfusions and some had congenital malformations (analysis excluding infants with congenital malformations was essentially unchanged). Authors found that there was a significant (p<0.005) linear association between the maximum bilirubin levels and low motor score (<27) at 8 months. This trend was observed in all birthweight groups. The slope of the regression line was steeper in the low birthweight group. Similar findings were reported for the mental score.
From the CPP population, Naeye (1978) excluded subjects with multiple births, major congenital malformation, chromosomal disorders, lead intoxication, metabolic problems, CNS diseases, congenital syphilis and rubella. Subjects with mothers who had severe hypotension during labor or delivery or who were drug or alcohol addicted were also excluded. 45,347 infants were available for analysis regarding the relationship of hyperbilirubinemia, amniotic fluid infections, and neurodevelopmental outcomes at 1 year and 7 years. Neutrophil infiltration of the subchorionic plate of the placenta was taken as evidence of amniotic fluid infection. With or without amniotic infection, authors reported an increased frequency of IQ<90 at age 4 years with peak bilirubin ≥ 7 mg/dl. The incidence of IQ ≤ 90 with rising bilirubin levels increased more rapidly in infants with amniotic infection than in infants without. Authors also reported an increase in abnormal neurologic tests at 7 years of age, starting at peak bilirubin levels of 12 to 15 mg/dl.
In another study relating neonatal bilirubin to neurodevelopmental outcome, Upadhyay (1971), using the CPP subgroup from Johns Hopkins, identified 34 subjects (out of 4,195) with birthweight > 2,500 grams and maximum bilirubin > 20 mg/dl. Twenty subjects were evaluated. Fourteen out of 34 were excluded (five were evaluated prior to standardization of the bilirubin method, six were infants of high risk mothers, two were lost to follow-up and one child died at 14 months of kernicterus). Etiologies of hyperbilirubinemia included ABO and Rh incompatibility, sepsis and unknown reasons. Eleven had exchange transfusions. There were 20 matched control subjects with bilirubin less than 10 mg/dl. There was no difference in mean IQ at age 4 years between the two groups. However, in the psychological evaluation, eight out of 20 in the high bilirubin group vs. three out of 20 in the control were rated as non-normal (P<0.01).
In the previously mentioned studies by Wolf, Beunen, Casaer, et al (1997) and Wold, Wolf, Beunen, et al. (1999) on 50 Zimbabwean infants with bilirubin > 23.4 mg/dl, the researchers found 11 infants with abnormal scores on the Bayley Scales at 1 year of age and five who developed choreo-athetoid cerebral palsy. All those infants were either preterm babies or had hemolytic diseases. All term infants without hemolysis and with bilirubin between 23.4 and 29.2 mg/dl were normal at one year of age.
When comparing the group of term/near-term infants with comorbid factors who had kernicterus to the group of infants with idiopathic hyperbilirubinemia and kernicterus, the overall mean bilirubin was 31.6 ± 9 mg/dl in the former vs. 35.4 ± 8 mg/dl in the latter (difference nonsignificant). Infants with G6PD deficiency, sepsis, ABO or Rh incompatibility had similar mean bilirubin levels. Infants with more than one comorbid factor had slightly lower mean bilirubin of 29.1 ± 16.1 mg/dl.
Eighteen of twenty-three (78 percent) term infants with idiopathic hyperbilirubinemia and who developed acute kernicterus survived the neonatal period with chronic sequelae. Thirty-nine of forty-one (95 percent) term infants with kernicterus and ABO or Rh incompatibility had chronic sequelae. Four of five (80 percent) infants with sepsis and kernicterus had chronic sequelae. And all four infants with multiple comorbid factors had sequelae.
There was no direct study concerning serum albumin level as an effect modifier of neurodevelopmental outcome in infants with hyperbilirubinemia. There was one report that found a significant association between reserve albumin concentration and latency to wave V in BAEP studies (Esbjorner, Larsson, Leissner, et al, 1991).
In addition, Ozmert, Erdem, Topcu et al. (1996) noted important risk factors for prominent neurological abnormalities were exchange transfusion and the duration that the infant's serum indirect bilirubin remained above 20 mg/dl.
All the above factors require further studies to understand the nature of neurodevelopment in relation to bilirubin physiology.
| Study, Year, UI# | Country | Subject Details | Phototherapy Protocol | Outcome | Study Design | Quality |
|---|---|---|---|---|---|---|
| Brown, 1985 85112402 | US | Racially diverse jaundiced infants with BW > 2500 g TSB > 13 mg/dl | Continuously PhotoRx began at the time the SB level reached 13 mg/dl in the first 96 hrs. Duration of PhotoRx was 96 hrs. Infants' eyes were covered. | ET: carried out when SB > 20 mg/dl in both groups | RCT | A |
| Maurer, 1985 85112404 | ||||||
| Martinez, 1993 93141315 | US | FT breast-fed infants with SB ≥ 17 mg/dl | (I) Continue breast-feeding, administer PhotoRx | SB ≥ 20 mg/dl | RCT | A |
| (II) Discontinue breast-feeding, substitute formula, and administer PhotoRx. | ||||||
| Infants' eyes were covered. | ||||||
| John, 1975 75183114 | Australia | Mature infants who developed unexplained jaundice with SB > 15 mg/dl | Each course consisted of a continuously PhotoRx for 18 hrs, followed by a 6 hour break. 3 or 4 courses were given. Infants' eyes were covered. | ET: carried out when SB > 20 mg/dl in both groups | Non-RCT | C |
Healthy = no hemolytic disease; no congenital malformations; no perinatal asphyxia; no neonatal illness or infections
Non-RCT = nonrandomized control trial
ET = exchange transfusion
| Study, Year, UI# | Groups | No. of Infants whose SB>20 mg/dl (%) | Total number of Infants | Mean SB level at entry (range, mg/dl) | Follow-up | ARR | NNT (95%CI) |
|---|---|---|---|---|---|---|---|
| Brown, 1985 85112402 | Ctrl grp I b | 10 (17) | 60 | ≥ 13 | 144 hrs | +0.5% | - |
| Rx grp I b | 12 (17) | 70 | |||||
| *sample from NICHD trial | Ctrl grp II | 13 (17) | 76 | ≥ 13 | 144 hrs | 14.3% | 7 |
| Rx grp II | 2 (3) | 70 | (6–8) | ||||
| Maurer, 1985 85112404 | Ctrl grp I c | 6 (20) | 30 | 15.8 (13–23) | 144 hrs | +0.6% | - |
| Rx grp I c | 7 (21) | 34 | 16.5 (10–23) | ||||
| *sample from NICHD trial | Ctrl grp II d | 17 (16) | 106 | 15.6 (10–24) | 144 hrs | 9.4% | 11 |
| Rx grp II d | 7 (7) | 106 | 15.5 (10–22) | (10–12) | |||
| Martinez, 1993 93141315 | Ctrl grp I | 6 (24) | 25 | 17.8 | 48 hrs | 10.1% | 10 |
| Rx grp I | 5 (14) | 36 | 18.0 | (8–11) | |||
| Ctrl grp II | 5 (19) | 26 | 17.8 | 48 hrs | 16.6% | 6 | |
| Rx grp II | 1 (3) | 38 | 17.9 | (5–7) | |||
| John, 1975 75183114 | Ctrl grp | 70 (18) | 381 | 15–18 | ND | 11.2% | 9 |
| Rx grp | 8 (7) | 111 | (8–10) | ||||
Healthy = no hemolytic disease; no congenital malformations; no perinatal asphyxia; no neonatal illness or infections
Hemolytic disease group (infants with hemolysis, high risk group). BET were carried when infants' SB >18 mg/dl in the high risk group.
Coombs' test positive (91% ABO incompatibility; 9% Rh incompatibility)
Including some infants in the high-risk group. ET were carried when infants' SB >18 mg/dl in the high risk group.
ARR = absolute risk reduction rate; NNT = number needed to treat
Two studies (Brown, Kim, Wu, et al., 1985; Maurer, Kirkpatrick, McWilliams, et al., 1985) evaluated the same sample of infants. Both reports were derived from a RCT of phototherapy for neonatal hyperbilirubinemia commissioned by the National Institute of Child Health and Human Development (NICHD) between 1974 and 1976.
Because the phototherapy protocols differed significantly in the remaining studies, their results could not be statistically combined, and are reported separately here. A total of 893 term or near-term jaundiced infants (325 in the treatment group and 568 in the control group) were evaluated in the current review.
The development, design, and sample composition of NICHD phototherapy trial was reported in detail elsewhere (Bryla, 1985). NICHD's controlled trial of phototherapy for neonatal hyperbilirubinemia consisted of 672 infants who received phototherapy and 667 control infants. Of the 672 subjects in the treatment group, 140 had a birthweight ≥ 2,500 grams; all had gestational age (GA) ≥ 34 weeks. Of the 667 control infants, 136 had a birthweight ≥ 2,500 grams; all had a GA ≥ 34 weeks. The racial composition of the total 276 term/near-term subjects was 26 percent black, 38 percent Mexican-American, and 36 percent non-black/non-Mexican-American. Males represented 59 percent of the total sample.
The same group of infants, 140 subjects in the treatment group and 136 controls with BW ≥ 2,500 grams and GA ≥ 34 weeks, were evaluated for the effect of phototherapy on the hyperbilirubinemia of Coombs' positive hemolytic disease in the study of Maurer, Kirkpatrick, McWilliams, et al. (1985). Of the 276 infants whose birthweight was 2,500 grams or more, 64 (23.1 percent) had positive Coombs' tests, 58 secondary to ABO and 6 secondary to Rh incompatibility. Thirty-four out of 64 in this group received phototherapy. The other 30 were placed in the control group. Of the 212 subjects who had negative Coombs' tests, 106 were in the treatment group and the same number was in the control group. No therapeutic effect on reducing the exchange transfusion rate was observed in infants with Coombs' positive hemolytic disease, but there was a 9.4 percent absolute risk reduction in babies who had negative Coombs' tests. In this group of infants, the NNT for prevention of the needs for exchange transfusion, or a TSB > 20 mg/dl, was 11 (95% CI, 10 to 12) (Maurer, Kirkpatrick, McWilliams, et al., 1985).
A more recent RCT compared the effect of four different interventions on hyperbilirubinemia (serum bilirubin concentration ≥ 291 μmole/l or 17 mg/dl) in 125 full-term breast-fed infants. Infants with any congenital anomalies, neonatal complications, hematocrit >65 percent, significant bruising or large cephalhematomas, or hemolytic disease were excluded. The four interventions in the study were (1) continue breast-feeding and observe (N=25); (2) discontinue breast-feeding, substitute formula (N=26); (3) discontinue breast-feeding, substitute formula and administer phototherapy (N=38); (4) continue breast-feeding, administer phototherapy (N=36). The interventions were considered failures if serum bilirubin levels reached 324 μmole/l or 20 mg/dl (Martinez, Maisels, Otheguy, et al., 1993). For the purpose of the current review, we re-grouped the subjects into treatment group or phototherapy group, and, control group or no phototherapy group. Therefore, the original group 4 and 3 became the treatment group I and II, and the original group 1 and 2 were the corresponding control group I and II. It was found that treatment I, phototherapy with continuing breast-feeding, had 10.1 percent ARR and the NNT for prevention of a serum bilirubin exceeding 20 mg/dl was 10 (95% CI 9 to 12). Compared to treatment I, treatment II (phototherapy with discontinuation of breast-feeding) was significantly more efficacious. It had 16.6 percent ARR and the NNT for prevention of a serum bilirubin exceeding 20 mg/dl was 6 (95% CI, 5 to 7).
John (1975) reported the effect of phototherapy in 492 full-term neonates born during 1971 and 1972 and who developed unexplained jaundice with bilirubin levels > 15 mg/dl. One hundred and eleven subjects received phototherapy. Three hundred and eighty-one infants did not. The author stated, “The choice of therapy was, in effect, random since two pediatricians approved of the treatment and two did not.” The results showed that phototherapy had a 11.2 percent risk reduction of exchange transfusion, performed in both treatment and control groups when serum bilirubin levels exceeded 20 mg/dl. Therefore, the NNT for prevention of a serum bilirubin exceeding 20 mg/dl was 9 (95% CI 8 to 10).
| Study, Year, UI # | Sample N (Control) | Mean Bili. Levels before Rx → After Rx (mg/dl) | Mean latency (msec) Before Rx → After Rx | Mean Interpeak (msec) Before Rx → After Rx | Follow-up (follow-up rate) | Quality | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| I | (II), III | (IV),V8.1→7.0 [7.0] | I–III | III–V | I–V | |||||
| Phototherapy and/or Exchange Transfusion | ||||||||||
| Sabatino, 1996 97171477 | 48 (42) | 14–25.8→ < 8 | 1.9→1.6 [1.8]→[1.6] | 4.8→4.0 [4.0]→[3.9] | 8.1 →7.0[7.0] [6.9] | 3.0→2.3 [2.3]→[2.2] | 3.1→2.9 [2.9]→[2.8] | 6.1→5.2 [5.2]→[5.0] | All normal (100%) | B |
| Deorari, 1994 95189280 | 7a (20) | 24.6±1.6→14.8±2.1 | 2.1→1.8[1.9] | 5.5→5.4*[5.1] | 8.1→7.4*[7.0] | - | - | 6.0→5.6[5.1] | All normal (100%) | C |
| Agrawal, 1998 99232291 | 30b (25) | 22.4±2.7→ ND | 1.8→1.7*[1.7] | 4.8→4.6*[4.5] | 7.3→6.9*[6.7] | 3.0→2.8*[2.8] | 2.4→2.3*[2.2] | 5.4→5.1*[5.1] | 10% ab-normal (100%) | C |
| Exchange Transfusion | ||||||||||
| Hung, 1989 90054139 | 6 c | 20.4→ 11.8 | 2.5→2.2* | 5.2→5.1* | 7.4→7.3* | - | - | - | n.d. | B |
| Kuriyama, 1986 89246798 | 6d (8) | 21.5±4.6→11.6±3.5 | 1.9→1.7[1.8] | 5.0→7.8[4.6] | 7.4→7.0[6.9] | 3.1→3.0[2.8] | 2.4→2.3[2.3] | 5.5→5.3[5.1] | All normal (83%) | B |
| Funato, 1996 97104685 | 8e (16) | 25.3±2.6→11.0±2.4 | 2.0→1.8*[1.8] | - | - | 2.9→2.9[2.7] | - | 5.3→5.2*[5.0] | All normal (100%) | C |
| Nwaesi, 1984 85037769 | 9 f | 22.3±1.4→15.3±3.2 | 1.9→1.9 | - | - | 3.0→2.8* | 3.4→3.1* | 6.4→5.8* | n.d. | C |
| Treatments (not specified) | ||||||||||
| Bhandari, 1993 94075016 | 30 g | 16.95±2.8→ ND | 1.8→1.7 | 4.6→4.4 | 6.7→6.7 | 2.7→2.7 | 2.2→2.1 | 4.8→5.0 | n.d. | B |
Bold = significantly different than controls
* = significantly different between pre- and post- Rx
[ ] controls' value
n.d. = not done
All underwent PhotoRx. ET was carried out when ABR was considered abnormal. Two ABO incompatibility, 1 Rh incompatibility, 1 G6PD deficiency; 3 Idiopathic.
10% had Rh incompatibility, 26.7% had ABO incompatibility and 63.3% had idiopathic hyperbilirubinemia
Causes of jaundice was “mostly ABO, G6PD, and unknown factors”
83% Hemolytic disease; 17% Hemoperitoneum
One ABO incompatibility, 3 Polycythemia, 1 Cephalhematoma, 5 Idiopathic jaundice
33% Rh incompatibility; 67% ABO incompatibility
20% ABO incompatibility, 7% Rh isoimmunization; 30% G6PD deficiency
| Study, Year, UI # | Follow-up Outcomes | Confounders | Quality |
|---|---|---|---|
| Phototherapy and/or Exchange Transfusion | |||
| Sabatino, 1996 97171477 | @1, 2, and 3 years of age: Brunett-Lezine test | PhtoRx; ET | B |
| Neuropsychological evaluations performed in subjects showed scores similar to those of age-matched controls. | |||
| Deorari, 1994 95189280 | @1 year of age: DQ; Neurological sequelae | PhotoRx; ET | C |
| Mean development motor quotient and mean development mental quotient in study and control groups were comparable and normal at 1 year of age | |||
| None of the study or control group babies developed seizures, motor deficits or features of cerebral palsy | |||
| Agrawal, 1998 99232291 | @1 year of age: DDST | PhotoRx; ET; Lost to follow-up | C |
| Neurological development was normal in all infants with SB 15–20 mg/dl, in 89% infants with SB 21–25 mg/dl, and in 67% infants with SB > 25 mg/dl | |||
| Exchange Transfusion | |||
| Kuriyama, 1986 89246798 | @25 days to 6 months: neurological development; hearing | ET | C |
| All are normal development and not hearing abnormalities. | |||
| Funato, 1996 97104685 | @1.5–6 years of age: DQ/IQ | ET | C |
| All had normal DQ/IQ > 85 except for 1 patient who developed borderline intelligence with IQ=77 at 6 years old | |||
Sabatino, Verrotti, Ramenghi, et al. (1996) studied brain stem auditory evoked potentials (BAEP) in 90 full-term newborns with jaundice. After enrolling only the subjects who showed the three major BAEP components (waves I, III and V), 48 full-term subjects with bilirubin from 13.9 to 25.8 mg/dl and without evidence of hemolysis were followed over a 3-year period. BAEP's were performed on day 3 and repeated five to seven days post phototherapy or exchange transfusion until the bilirubin had decreased below 8 mg/dl. On day 3, latencies of waves III and V; and, interpeak latency III–V and I–V were significantly prolonged compared to control subjects. Recordings performed 5 to 7 days post therapy and during subsequent session showed no significant differences. Serial neuropsychological evaluations at 6, 18 and 30 months using the Brunett-Lezine test showed no neurodevelopmental abnormality. Statistical analyses of the neuropsychological evaluations were not given (Sabatino, Verrotti, Ramenghi, et al., 1996).
Infants with ABO incompatibility, Rh incompatibility, G6PD deficiency or idiopathic jaundice were treated with phototherapy and/or exchange transfusion in 7 studies (Agrawal, Shukla, Misra, et al., 1998; Bhandari, Narang, Mann, et al., 1993; Deorari, Singh, Ahuja, et al., 1994; Funato, Teraoka, Tamai, et al., 1996; Hung, 1989; Kuriyama, Tomiwa, Konishi, et al., 1986; Nwaesei, Van Aerde, Boyden, et al., 1984). The results consistently reported that treatments for jaundice significantly reduced the prolonged auditory brainstem response (ABR) latencies and interpeak latencies back to normal. The mean SB levels of the 96 infants also decreased from 20–25 mg/dl before treatments to 11–15 mg/dl after treatments. Of the 7 studies, four studies had follow-up results of neurological and intelligence outcomes. Three studies (Deorari, Singh, Ahuja, et al., 1994; Funato, Teraoka, Tamai, et al., 1996; Kuriyama, Tomiwa, Konishi, et al., 1986) with 14 jaundiced infants in the treatment group and 28 non-jaundiced infants in the control group, showed all normal developments at 25 days to 6 years of age. Moreover, Agarwal, Narayan, Kumari, et al. (1998) studied 30 term newborns with bilirubin > 15 mg/dl, recorded BAEP's in these infants before and after phototherapy or exchange transfusion and at the age of 2 to 4 months. Of the 30 infants, 10 percent had Rh incompatibility, 26.7 percent had ABO incompatibility and 63.3 percent had idiopathic hyperbilirubinemia. Seventeen out of the 30 infants showed abnormalities on initial BAEP's; these abnormalities correlated significantly with bilirubin level. After therapy, abnormalities reverted back to normal in 10 cases but persisted in 7 out of 17. Two subjects had kernicterus with bilirubin > 15 mg/dl. Follow-up BAEP's were done in 21/30 cases at 14.4 ± 2.2 weeks. Of the 15 patients who had an initial abnormal BAEP, in only 3 were there persistent abnormalities. At one year of age, screening by DDST showed that neurological development was normal in 100 percent of infants with bilirubin of 15–20 mg/dl; neurological development was normal in 88.9 percent of infants with bilirubin of 21 to 25 mg/dl; and neurological development was normal in 66.7 percent of infants with bilirubin >25 mg/dl. No further statistical details were provided (Agrawal, Shukla, Misra, et al., 1998).
| Study, Year, UI # | Subject N (Controls) | Mean Bilirubin levels (mg/dl) | Outcomes | Bias/ Confounders | Quality | ||||
|---|---|---|---|---|---|---|---|---|---|
| Behavioral Outcomes | |||||||||
| Abrol, 1998 20235719 | 15 (15) | 12.1–15.0 (8.0–12.0) | Orientation cluster items | 24–48 h after initiation of Phototherapy | 24–48 h after cessation of Phototherapy | ND | A | ||
| Visual animate | P<0.01 | P<0.05 | |||||||
| Visual inanimate | P<0.01 | P<0.05 | |||||||
| Animate visual and auditory | P<0.01 | P<0.05 | |||||||
| Ju SH, 1991 91289763 | 29 (14) | 12–15 (<12) | Orientation cluster | Before Rx | After Rx | 24 h after Rx | 2wks age | Incomplete binding since had mild or no clinically observable jaundice | B |
| Mean | NS | P<0.01 | P<0.05 | NS | Large number of drop-outs at 2 week follow-up in the control group Small sample size | ||||
| Animate visual | NS | P<0.05 | P<0.05 | NS | |||||
| Animate visual & auditory | NS | P<0.05 | P<0.05 | NS | B | ||||
| Inanimate visual & auditory | NS | P<0.05 | P<0.05 | NS | |||||
| Alertness | NS | P<0.05 | P<0.05 | NS | |||||
| Paludetto, 1983 84057358 | 30 (30) | 13.3 | Orientation cluster | 3rd day | 4th day | 1month | Loss of subject and matched control at each observation 30->14->12 at one month of age | B | |
| 8.4–17.5 (9.6) | Animate visualP<0.01 | P<0.05 | NS | ||||||
| (3.5–14.5) | Inanimate visual P<0.05 | P<0.01 | P<0.05 | ||||||
| Inanimate visual P<0.005 and auditory | P<0.01 | P<0.05 | |||||||
| Alertness P<0.005 | P<0.05 | NS | |||||||
| Motor performance Pull to sit P<0.001 | NS | NS | |||||||
| Regulation of state Cuddliness P<0.001 | NS | NS | |||||||
| Telzrow, 1980 80225471 | 10 (10) | 17.07 | Mean cluster scores on Brazelton's scale | No bilirubin assessment made in control group | B | ||||
| Day 3 | Day 6 | Day 10 | |||||||
| Orientation | P< 0.05 | P<0.05 | P=0.05 | ||||||
| Motor | NS | P<0.01 | NS | ||||||
| Valkeakari 1981 81156354 | 41 (42) | - | Vineland social maturity scale | Phototherapy | Control | No information about the bilirubin levels of study or control group. Multiple analyses from small N. | B | ||
| Self help general | 12.56 ± 0.50 | 12.64 ± 0.41 | |||||||
| Self help eating | 8.61 ± 0.70 | 8.67 ± 0.65 | |||||||
| Self help dressing | 4.95 ± 1.75 | 4.44 ± 1.52 | |||||||
| Locomotion | 5.76 ± 0.92 | 5.66 ± 1.07 | |||||||
| Occupation | 7.51 ± 0.84 | 7.31 ± 0.79 | |||||||
| Communication | 5.98 ± 0.11 | 5.94 ± 0.31 | |||||||
| Socialization | 5.85 ± 0.79 | 5.82 ± 0.82 | |||||||
| Social age | 4.20 ± 0.51 | 4.07 ± 0.48 | |||||||
| Bowel control | 1.81 ± 0.54 | 1.65 ± 0.46 | |||||||
| Bladder control | 1.93 ± 0.56 | 1.74 ± 0.47 | |||||||
| Speech score | 1.41 ± 0.38 | 1.45 ± 0.34 | |||||||
| Neurological or Motor Outcomes | |||||||||
| Seidman, 1994 94223372 | 42 (42) | 20.0 ± 0.4 (18.8 ± 0.3) | The phototherapy group showed an average IQ score of 108 ± 2 at 17 years which was not significantly different from the average IQ score of 107 ± 2 in the control group. | Retrospective study method used | B | ||||
| Valkeakari 1981 81156354 | 41 (42) | ND | There were 2 infants with febrile convulsions in the PT group as compared to one in control group. There was only one infant with breath holding spells in PT group and none in control group. | No information about the bilirubin levels of study or control group | B | ||||
| Misra, 1980 80203306 | 81 (60) | 10–30 (8.5) | Ten of the 81 subjects manifested some degree of mental retardation at some or the other time during follow-up while all the 60 controls were normal. But on long term follow-up 3 of the 31 subjects remained retarded with AQ<90 at their last examination and remaining 7 had become normal. | Large number of patients lost to follow-up 81→ 12 | C | ||||
| Unable to sort out effect of phototherapy from higher bilirubin as control group had lower bilirubin | |||||||||
| Study, Year, UI # | Subjects N (Controls) | Mean Bilirubin levels (Range) mg/dl | Outcomes | Bias/ Confounders | Quality | ||
|---|---|---|---|---|---|---|---|
| Granati, 1984 84283738 | 110 (110) | 19.2 ± 2.3 (14.2 –24.6) | Findings | PT group | Control group | Basic testing of vision and hearing may not have detected subtle measurements Bilirubin ranged from 14.0 –24.6 in subjects (mean 19.5) | B |
| [9.7 ± 1.7] [7.2–12.2] | Normal visual function | 77 ( 95.1%) | 84(94.5%) | ||||
| Reduced visual acuity | 3(3.7%) | 4(4.4%) | |||||
| Strabismus | 1(1.2%) | 1(1.1%) | |||||
| Valkeakari, 1981 81156354 | 41 (42) | - | Findings | PT | Controls | No information about the bilirubin levels of study or control group | B |
| Normal vision | 33 | 36 | |||||
| Vision determination unsuccessful | 7 | 6 | |||||
| Anisometry | 1 | - | |||||
| Phoria/Trophia | 10 | 12 | |||||
| Changes in fundus (non specific) | 1 | 1 | |||||
| changes in lens (physiological) | - | 2 | |||||
| Nystagmus | - | - | |||||
| Khanna, 1984 85103758 | 52 (52) | - | Retinoscopic finding | PT Group | Controls | Unclear how the total cohort of 112 was selected. | C |
| Emmetropia | 7(13.4%) | 4(7.8%) | Wide age range at follow-up( 6 months to 7 years) | ||||
| Hypermetropia | 36(69.2%) | 39(76.4%) | Wide bilirubin range (15–69.2 mg/dl) | ||||
| Myopia | 6(11.5%) | 4(17.8%) | |||||
| Astigmatism | 3(5.7%) | 4(7.8%) | |||||
| Dyslexia test | - | - | |||||
| Fundus examination | 5 | 6 | |||||
| Myopic discs | 3 | 2 | |||||
| Pale discs | 2 | - | |||||
| Variations in size | 1 | - | |||||
| Dull macular reflex | - | 2 | |||||
| Mottling/ greying around macula | - | 2 | |||||
( ) Controls' value
A total of seven studies looked at the effect of phototherapy on neurodevelopmental outcomes. Five of these studies looked at the effect of hyperbilirubinemia and phototherapy on behavior. Of the five studies, four used the Brazelton Neonatal Behavioral Assessment Scale (BNBAS) and one (Valkeakari, Anttolainen, Aurekoski, et al., 1981) used the Vineland Social maturity scale. No study evaluating neurodevelopmental outcomes of exchange transfusion was identified.
Abrol and Sankarasubramanian (1998) evaluated the effect of phototherapy on neonatal behavior using the BNBAS in 30 full term infants. Fifteen infants received phototherapy for TSB between 12 and 15 mg/dl. Phototherapy was discontinued when serum bilirubin levels were <12.0 mg/dl. Their outcomes were compared to a control group consisting of 15 infants who had TSB between 8–12 mg/dl at 48 hours. Only control infants were breast-fed during both assessments. Behavioral assessment done 24–48 hours after initiation of phototherapy showed that the study group performed significantly poorer in the orientation cluster as compared to the control group. A repeat assessment done 24–48 hours after cessation of phototherapy demonstrated no improvement in the study group. However there was no significant difference between the 2 groups in other clusters of the BNBAS scale.
Ju and Lin (1991) studied the effects of moderate non-hemolytic jaundice and phototherapy on newborn behavior in a randomized controlled trial. Initially 29 full term infants with TSB 12–15 mg/dl on 3rd –5th day of life were randomly assigned to phototherapy (N=14) and non-phototherapy (N=15) group. A third group of 14 infants with TSB < 12 mg/dl were recruited as control subjects. Phototherapy was given until TSB was < 12 mg/dl. The mean duration of phototherapy was 38 hours and no infant needed Exchange Transfusion. One infant in phototherapy group was excluded due to G6PD deficiency and evidence of hemolysis and four infants were excluded from control group when their serum bilirubin exceeded 12mg/dl and were put on phototherapy. Two infants were subsequently added to the non-phototherapy group to make a total of 13. Infant behavior was assessed by BNBAS 4 times: just before phototherapy, immediately after termination of phototherapy, 24 hours after phototherapy and at 2 weeks of age in phototherapy treatment group and at comparable age in other two groups. For analysis, 28 behavioral items were sub-categorized into six behavioral clusters. Twenty-one neurologic reflex items were used to generate a 7th reflex cluster. The results of seven cluster scores did not differ among the three groups at baseline examination. However, the treatment subjects performed poorly in several of the orientation cluster items as compared to the untreated and control groups (P<0.05). However, the untreated and control groups did not differ significantly. At 2 weeks of age no significant differences were found in the BNBAS scores among the three groups. The authors concluded that moderate non-hemolytic jaundice does not affect the behavior of full term infants and that phototherapy has short-term adverse effects on visual, auditory orientation and alertness, which may impair infant-caregiver interaction.
In 1983, Paludetto, Mansi, Rinaldi, et al. (1983) evaluated possible behavioral changes in healthy jaundiced infants treated with phototherapy. Thirty full term infants with TSB between 8.4–17.5 mg/dl and undergoing phototherapy for 6 hours or more were included the treatment group and 30 infants with TSB 3.5–14.5 mg/dl served as controls. The two groups were examined using the BNBAS at the age of 3days, 4 days and 1 month. On the 3rd and 4th day, the control group scored better than the study group, particularly in the orientation cluster. The visual cluster score appeared to be more compromised in the study group. Only 12 treatment and 12 control infants were assessed at the 1-month and the treatment group continued to perform poorly in some items of the orientation cluster.
Telzrow, Snyder, Tronick, et al. (1980) evaluated behavior of 10 healthy full term jaundiced infants during and after phototherapy. Ten non-jaundiced infants served as controls. Three evaluations were made using BNBAS on 3, 6 and 10 days of age. The treatment group had significantly lower scores on all three days for the orientation cluster and on day 6 for motor cluster scores. No significant correlation was found between level of jaundice and infant's behavior.
Seidman, Paz, Stevenson, et al. (1994) evaluated IQ at the age of 17 years, in 42 full term infants with severe hyperbilirubinemia who were treated with phototherapy, most of whom (31) also were treated with exchange transfusion. Forty-two infants who did not receive phototherapy were selected as controls. No significant difference in IQ between the two groups was found.
Valkeakari, Anttolainen, Aurekoski, et al. (1981) examined 41 subjects at age 3 years. All were full-term, had negative Coombs' tests and received prophylactic phototherapy for more than 100 hours during first week of life. A comparable control group of 42 infants who had not received phototherapy was also evaluated. Psychological assessment was done using Vineland social maturity scale. No significant difference could be demonstrated between the two groups in the physical, neurological, ophthalmologic and psychological examination conducted. The authors concluded that phototherapy is unlikely to cause harmful long-term developmental effects.
Misra, Srivastava, Bajpai, et al. (1980) followed 81 infants for a period of 36 months. All of them had phototherapy. Sixty non-exposed infants served as controls. All 81 subjects had immediate and 6-month follow-up up studies while only 31 subjects had follow-up studies from 6 to 36 months. Both groups showed a normal growth pattern comparable to Indian Council Medical Research standards. Ten of 81 subjects showed mental retardation at some point (either during the follow-up period or at other times) but all controls were normal. At the end of 36 months, three out of 31 study subjects had mental retardation with an Attainment Quotient (AQ< 90). The cause of jaundice in all three was unknown. Two had IQ between 70 to 89 and one had Attainment quotient between 50 to 69.
Granati, Largajolli, Rubaltelli, et al. (1984) conducted a follow-up study of 110 full term infants treated with phototherapy. One hundred and ten matched non-icteric infants served as control. No significant difference was found in the visual function between the two groups. Three of 81 infants in phototherapy group and four of 89 infants in the control group had reduced visual acuity and one in each group had strabismus. In addition, an evaluation of growth, hearing, neurological and psychological function did not reveal significant difference between the two groups.
Khanna, Mehta, Matthews, et al. (1984) studied 56 full term children who had received phototherapy for indirect hyperbilirubinemia in the early neonatal period. They were compared with 56 matched controls with no jaundice or mild physiological jaundice not requiring phototherapy. A detailed ophthamological examination including tests for visual acuity, dyslexia tests, retinoscopy and fundoscopy revealed no significant differences between the two groups.
| Study,Year UI # | Country | Race (N) | Definition of Hyperbilirubinemia | Reference Standard: Laboratory-based assay of TSB | Quality |
|---|---|---|---|---|---|
| Prediction Method: Cord Bilirubin | |||||
| Carbonell, 2001 21130776 | Spain | ND (585) | SB ≥ 17 mg/dl in the first 4 days of life | Bilirubinometer Bil-Red by direct reading at 461 mcm | A |
| Knudsen, 1992 92306741 | Denmark | ND (138) | SB > 11.7 mg/dl at day 3 | Standard direct spectroscopic method at 3rd postnatal day | A |
| Knudsen, 1989 89189829 | Denmark | ND (291) | “Clinical jaundice” following by SB > 10 mg/dl | Standard method of Blom and Doumas | B |
| Risemberg, 1977 77156212 | US | ND (91) | SB > 10 mg/dl at 36 hrs of age | Laboratory-based assay | B |
| Prediction Method: Early Serum Bilirubin (mg/dl) | |||||
| Awasthi, 1998 20234451 | India | ND (274) a | Peak SB > 15 mg/dl | Spectrophotometry on Toyo Bilirubin Analyzer | A |
| Carbonell, 2001 21130776 | Spain | ND (574+865) b | SB ≥ 17 mg/dl in the first 4 days of life | Bilirubinometer Bil-Red by direct reading at 461 mcm | A |
| Seidman, 1999 20110115 | Israel | Multiple race (1177) | TSB > 10 mg/dl at day 2, > 14 mg/dl at day 3; and, > 17 mg/dl at day 4 or 5 | Unitstat Bilirubinometer | A |
| Risemberg, 1977 77156212 | US | ND (91) | SB > 16 mg/dl at 36 hrs of age | Laboratory-based assay | B |
| Prediction Method: End-tidal Carbon-Monoxide Concentration (ETCOc, p.p.m) | |||||
| Stevenson, 2001 21326594 | US + International | Multiple race (1370) | TSB ≥ 95th %tile | Laboratory-based assay. Each sites used its own laboratory and method | A |
| Okuyama, 2001 21366215 | Japan | ND (51) | Peak SB ≥ 15 mg/dl | Laboratory-based assay | B |
| Prediction Method: Predischarge Risk Index | |||||
| Newman, 2000 20529193 | US | (496) | TSB within the first 30 days of life ≥ 25 mg/dl | Data from the KPMCP integrated laboratory information system | B |
| Prediction Method: Predischage Risk Zone, Determined by Hour-Specific Bilirubin Percentile | |||||
| Bhutani, 1999 99117600 | US | Multiple race (2840) | Subsequent TSB risk zone ≥ 95th %tile | 2,5-DPH diazo mathod | A |
SB = serum bilirubin; TSB = total serum bilirubin
All infants were healthy, term (GA≥34) infants, unless noted:
28% “Preterm”
574 infants in the phase I study; 865 infants in the phase II study. 90% infants were breast-fed exclusively.
| Study, Year, UI # | TP (n) | FN (n) | TN (n) | FP (n) | Sens (%) | Spec (%) | Thresholds | |
|---|---|---|---|---|---|---|---|---|
| Method | SB (mg/dl) | |||||||
| Prediction Method: Cord Bilirubin (μmol/l) | ||||||||
| Carbonell, 2001 21130776 | 4 | 17 | 534 | 30 | 22 | 95 | 37 | 17 |
| Knudsen, 1992 92306741 | 28 | 0 | 6 | 104 | 100 | 5 | 20 | 11.7 |
| 27 | 1 | 23 | 87 | 96 | 21 | 25 | 11.7 | |
| 25 | 3 | 46 | 64 | 89 | 42 | 30 | 11.7 | |
| 20 | 8 | 75 | 35 | 71 | 68 | 35 | 11.7 | |
| 11 | 17 | 94 | 16 | 39 | 85 | 40 | 11.7 | |
| Knudsen, 1989 89189829 | 4 | 7 | 70 | 3 | 36 | 96 | 40 | 10 j |
| Risemberg, 1977 77156212 | 12 | 16 | 63 | 0 | 43 | 100 | 68 | 10.0 |
| Prediction Method: Early Serum Bilirubin (mg/dl) | ||||||||
| Awasthi, 1998 20234451 | 24 | 11 | 157 | 82 | 69 | 66 | 3.99 f | 15 |
| Carbonell, 2001 21130776 | 7 | 0 | 74 | 88 | 100 | 46 | 6 @24 hrs h | 17 |
| 11 | 0 | 102 | 56 | 100 | 64 | 9 @48 hrs h | 17 | |
| 25 | 0 | 239 | 159 | 100 | 60 | 6 @24 hrs i | 17 | |
| 45 | 1 | 348 | 426 | 98 | 45 | 9 @ 48 hrs i | 17 | |
| Seidman, 1999 20110115 | 49 | 11 | - | - | 82 | - | 5 @day 1 | HB g |
| Risemberg, 1977 77156212 | 13 | 0 | 78 | 0 | 100 | 100 | 10 @12 hrs | 16 |
| 13 | 0 | 78 | 0 | 100 | 100 | 15 @24 hrs | 16 | |
| Prediction Method: End-tidal Carbon-Monoxide Concentration (ETCOc, p.p.m) | ||||||||
| Stevenson, 2001 21326594 | 92 | 28 | 635 | 615 | 77 | 51 | 1.5 @36 hrs | ≥ 95th %tile |
| - | - | - | - | 90 | 65 | Comb | ≥ 95th %tile | |
| Okuyama, 2001 21366215 | 5 | 2 | 27 | 17 | 71 | 61 | 1.6 @36 hrs | 15 |
| 6 | 1 | 35 | 9 | 86 | 80 | 1.8 @42 hrs | 15 | |
| 6 | 1 | 32 | 12 | 86 | 73 | 1.8 @48 hrs | 15 | |
| 6 | 1 | 29 | 15 | 86 | 66 | 1.8 @60 hrs | 15 | |
| Prediction Method: Predischage Risk Index | ||||||||
| Newman, 2000 20529193 | 58 | 1 | 129 | 273 | 98 | 32 | RI > 7 | 25 |
| 52 | 7 | 244 | 158 | 88 | 61 | RI >10 | 25 | |
| 25 | 34 | 364 | 38 | 42 | 91 | RI > 15 | 25 | |
| 8 | 51 | 399 | 3 | 14 | 99 | RI > 20 | 25 | |
| Prediction Method: Predischage Risk Zone, Determined by Hour-Specific Bilirubin Percentile | ||||||||
| Bhutani, 1999 99117600 | 126 | 0 | 1756 | 958 | 100 | 65 | ≥ 40th %tile | ≥ 95th %tile |
| 114 | 12 | 2300 | 414 | 91 | 85 | ≥ 75th %tile | ≥ 95th %tile | |
| 68 | 58 | 2610 | 104 | 54 | 96 | ≥ 95th %tile | ≥ 95th %tile | |
TP = true positive; FN = false negative; TN = true negative; FP = false positive; N = number of subjects; n = number of measurements; Sens = sensitivity; Spec = specificity; SB = serum bilirubin; r = regression coefficient between prediction method and SB; RI = risk index; HB = hyperbilirubinemia
Comb = combined ETCOc and TSB method (either an TSB between 75th and 95th %tile or an ETCOc ≥ population mean or both @ 36±6 hrs). Infants with TSB ≥ 95th %tile @ 30±6 hrs were excluded.
mean SB at 18–24 hours of age
TSB > 10 mg/dl at day 2, > 14 mg/dl at day 3; and, > 17 mg/dl at day 4 or 5
Phase I study
Phase II study
“Clinical jaundice” following by SB > 10 mg/dl
Carbonell, Botet, Figueras, et al. (2001) studied the predictive value of cord bilirubin in 585 healthy term newborns. A cord bilirubin ≥ 2.2 mg/dl predicted significant hyperbilrubinemia, defined as TSB ≥ 17 mg/dl between the 3rd and 4th days of life, with a sensitivity of only 22 percent and a specificity of 95 percent.
Data Source: Knudsen, 1992 [#92306741]
Risemberg, Mazzi, MacDonald, et al. (1977) studied cord bilirubin in 91 neonates from ABO incompatible pregnancies and 30 control infants to determine predictability for severe hyperbilirubinemia, defined as TSB > 16 mg/dl at 12 to 36 hours of age. A cord bilirubin level of greater than 4 mg/dl was found to indicate high risk, requiring frequent re-evaluation and therapy.
Awasthi and Rehman (1998) studied 274 term (and preterm) infants to determine the usefulness of a serum bilirubin (SB) measurement at 18 to 24 hours of age in predicting peak SB > 15 mg/dl. The prevalence of peak SB >15 mg/dl in this population was 12.8 percent and a SB of > 3.99 mg/dl at 18 to 24 hours was predictive with a sensitivity of 69 percent. Further stratified analysis by feeding type within 12 hours of birth revealed a crude odds ratio for prediction test positive of 29 if breastfeeding and two if not breastfeeding.
In a study of 2,004 healthy term newborns, Carbonell, Botet, Figueras, et al. (2001) concluded that a TSB ≥ 6 mg/dl at 24 hours or ≥ 9 mg/dl at 48 hours is predictive of hyperbilirubinemia defined as >17 mg/dl with a sensitivity of 98 to 100 percent and a specificity of 45 to 64 percent.
Seidman, Ergaz, Paz, et al. (1999) defined severe hyperbilirubinemia in healthy term newborns as a TSB > 10.0 mg/dl at day 2, >14.0 mg/dl at day 3, and > 17.0 mg/dl at age 4 and 5 days. They studied 1177 infants and found odds ratios (OR) of 36.5 for TSB > 5 mg/dl on day 1, 3.1 for TSB on day 1 (per 1 mg/dl), and 2.4 for change in TSB from day 1 to day 2 (per 1 mg/dl). A multiple regression model, which also included selected maternal factors, was predictive of hyperbilirubinemia with sensitivity of 82 percent and a specificity of 83 percent.
As part of a study of the predictive value of cord bilirubin in 91 neonates from ABO incompatible pregnancies, Risemberg, Mazzi, MacDonald, et al. (1977) obtained TSB at 12, 24, 36 and 48 hours of life. A TSB of greater than 10 mg/dl at age 12 hours correlated perfectly with TSB greater than 16 mg/dl at 36 hours of life.
Stevenson, Fanaroff, Martisels, et al. (2001) studied 1370 neonates from nine multinational sites to determine whether end tidal carbon monoxide (ETCOc), alone or in combination with TSB measurements, can predict hyperbilirubinemia, defined as an age-specific TSB ≥ 95 percentile, in the first week of life. Findings supported the measurement of TSB before discharge to predict hyperbilirubinemia, but measurement of ETCOc did not improve the predictability of age-specific TSB levels.
Okuyama, Yonetani, Uetani, et al. (2001) measured ETCOc every 6 hours during the first 72 hours of life among 51 healthy, full term newborns. Hyperbilirubinemia, defined as peak TSB ≥ 15 mg/dl, developed in seven of the 51 neonates. EDCOc at age 42 hours was most predictive of hyperbilirubinemia by ROC analysis, with sensitivity 86 percent and specificity 80 percent respectively.
Data Source: Newman, 2000 [#20529193]
Data Source: Bhutani and Johnson, 1999 [#99117600]
Based upon analysis of the receiver operating characteristic (ROC) curves developed for each of the predictive strategies described above, one can conclude that hour-specific bilirubin percentiles yield the greatest accuracy, with an area-under-the-curve (AUC) of 0.93, compared with 0.74 for cord bilirubin levels, and 0.80 for pre-discharge risk index.
| Author, Year, UI # | Correlation Coefficient | Sensitivity and/or Specificity | Subgroup Analyses | Subgroups |
|---|---|---|---|---|
| AirShields Minolta bilirubinometer | ||||
| Bhat, 1987 88085341 | Y | N | Y | GA/BW |
| Bhutta, 1991 92015688 | Y | Y | N | - |
| Bilgen, 1998 99123665 | Y | Y | N | - |
| Boo, 1984 85085660 | Y | N | Y | Race/Skin Color |
| Bourchier, 1987 88247126 | Y | N | Y | Race/Skin Color |
| Carbonell, 2001 21130776 | Y | Y | Y | Other: time |
| Christo, 1988 89253855 | Y | Y | Y | Phototherapy |
| Dai, 1996 97093944 | N | Y | N | - |
| Fok, 1986 86268743 | Y | Y | Y | Measurement site Phototherapy |
| Goldman, 1982 82241425 | Y | N | Y | Race/Skin Color |
| Hanneman, 1982 82105249 | Y | N | Y | GA/BW Race/Skin Color Phototherapy |
| Harish, 1998 98373176 | Y | Y | Y | GA/BW Phototherapy |
| Hegyi, 1981 81144434 | Y | N | Y | Measurement site |
| Karrar, 1989 89271698 | Y | Y | N | - |
| Kenny, 1984 89271698 | Y | N | Y | Measurement site Other: Feeding |
| Kivlaha, 1984 84296964 | Y | N | N | - |
| Knudsen, 1989 89189829 | Y | N | Y | Measurement sites |
| Knudsen, 1990 90242782 | ||||
| Knudsen, 1990 91196560 | Y | Y | N | - |
| Knudsen, 1992 92306741 | Y | Y | N | - |
| Knudsen, 1993 93333142 | Y | Y | N | - |
| Knudsen, 1995 93333142 | N | Y | N | - |
| Kumar, 1992 93084314 | Y | N | N | - |
| Laeeq, 1993 93267865 | Y | Y | N | - |
| Lin, 1993 93383693 | Y | Y | Y | Measurement site |
| Linder, 1994 94325008 | Y | Y | N | - |
| Maisels, 1982 82273864 | Y | Y | Y | Measurement site |
| Maisels, 1997 97247177 | Y | N | N | - |
| Schumacher, 1985 85241873 | Y | Y | N | - |
| Serrao, 1989 85241873 | Y | N | N | - |
| Sharma, 1988 89122266 | Y | N | Y | GA/BW |
| Sheridan-Pereira, 1982 83021365 | Y | Y | Y | Phototherapy |
| Smith, 1985 85112373 | Y | Y | N | - |
| Suckling, 1995 95327901 | Y | N | Y | GA/BW |
| Taha, 1984 85070990 | Y | Y | N | - |
| Tan, 1985 86101839 | Y | N | Y | Measure site Race/Skin Color |
| Tan, 1982 83044572 | ||||
| Tan, 1996 97017283 | Y | N | Y | Measurement site Race/Skin Color |
| Tsai, 1988 90177720 | Y | Y | Y | Measurement site |
| Yamauchi, 1989 89348834 | Y | N | Y | Measurement site |
| Yamauchi, 1990 90273849 | N | Y | N | - |
| Yamauchi, 1991 92188736 | Y | N | Y | Other: Time; Sunlight |
| Yamauchi, 1991 92188737 | Y | N | Y | Measurement site Other: Time |
| Yamauchi, 1988 89086035 | ||||
| BiliCheck™ | ||||
| Bhutani, 2000 20381429 | Y | Y | Y | Race/Skin Color |
| Lodha, 2000 20368382 | Y | Y | Y | Other: TSB > 13 mg/dl |
| Rubaltelli, 2001 21283339 | Y | Y | Y | Measurement site |
| Ingram Icterometer | ||||
| Bilgen, 1998 99123665 | Y | Y | N | - |
| Chaibva, 1974 75018763 | Y | N | N | - |
| Gupta, 1991 92091059 | Y | Y | Y | GA/BW |
| Schumacher, 1985 85241873 | Y | Y | N | - |
| Colormate III | ||||
| Tayaba 1998 98393764 | Y | N | Y | Race/Skin Color Phototherapy |
Bold = Study reported more than one instrument
The Minolta AirShields bilirubinometer (known in 2002 as the Minolta Airshields Jaundice Meter™) is a hand held instrument that employs fiberoptic techniques to illuminate the skin and subcutaneous tissue and then spectrophotometrically analyzes the intensity of the yellow color. The manufacturer recommends calibrating the instrument on a daily basis and each institution must generate its own correlation curves for TcB and TSB. The Minolta Airshields Jaundice Meter™ frequently described in the literature has changed manufacturer and its new manufacturer, listed as Minolta/Hill-Rom Air-Shield, Hatboro, Pennsylvania, does not catalog the model JM-102 in the product list on its company website.
The BiliCheck™ (SpectRX Inc., Norcross, GA) is a handheld noninvasive device that utilizes an in vivo multiwavelength spectral reflectance technique that allows for the device to determine the individual optical densities (ODs) attributed to bilirubin, hemoglobin, and melanin in the infant skin. The FDA cleared the BiliCheck™ for sale in 1999. The device is 12.4 × 7.6 × 12.1 cm dimensions (W × H × D), consisting of a light source, a microspectrophotometer, a fiberoptic probe, and a microprocessor control circuit that is used for analysis and interpretation of bilirubin measurement. The fiberoptic probe is placed on the infants' forehead skin and the light source is triggered after appropriate skin contact has occurred. Bilirubin related OD values are used to calculate the TcB algorithms. The BiliCheck™ theoretically improves upon previous transcutaneous measurement devices by accounting for skin thickness, blood content and flow, maturity and pigmentation (Bhutani, Gourley, Adler, et al., 2000).
The icterometer consists of a strip of transparent Plexiglas on which are painted five yellow transverse stripes of precise and graded hue. It has been used since 1925 (Schumacher, Thornbery, and Gutcher, 1985). The Plexiglas is pressed against the baby's nose until the skin blanches and then the color compared with the yellow stripes and a jaundice score applied. Advantages are the low cost compared to other transcutaneous methods.
The Colormate III manufactured by Chromatics Color Sciences International, Inc., New York, NY is a hand held colorimeter containing a Xenon flash tube and light sensors connected to a portable computer. The instrument measures over a band of wavelengths from 400 to 700 nm with specific filters used to assess the reflectance of light for specific wavelengths. The data are processed by the computer and reported as standard color coordinates (luminosity, redness and yellowness). The algorithm incorporated into the device examines the luminosity or lightness of the skin as well as the underlying color of normal skin and accounts for that in the baseline evaluation. The bilirubin estimate evaluates changes in the yellow component of the spectrum with a mathematical adjustment for the underlying lightness of the skin. The stability of the luminosity is included in the computer analysis. Measurements must be made on three of four sites (right or left cheek, back, chest, forehead). An early baseline measurement is made of the infants' skin prior to the onset of jaundice.
| Study, Year UI# | Country | Race (N) | Reference Standard: Laboratory-based Assay of TSB | Quality |
|---|---|---|---|---|
| Measurement site: At Forehead | ||||
| Bhutta, 1991 92015688 | Pakistan | ND (63) | Autoanalyser (Astra, Beckman Instruments Inc.) using the modified Jendraasik-Grof method | A |
| Bilgen, 1998 99123665 | Turkey | ND (96) | Bilitron 444 direct spectrophotometric method | A |
| Knudsen, 1990 91196560 | Denmark | ND (207) | Standard diazo method | A |
| Knudsen, 1992 92306741 | Denmark | ND (138) | Standard direct spectroscopic method | A |
| Lin, 1993 93383693 | Taiwan | Chinese (305) | Spectro-photometric method with BB-meter-Model III | A |
| Schumacher, 1985 85241873 | US | White (106) | Dupont ACA III analyzer, a direct spectrophotometry method | A |
| Tsai, 1988 90177720 | Taiwan | Chinese (98) | BB-meter-Model III (O'hara Co. Ltd) | A |
| Fok, 1986 86268743 | Hong Kong | Chinese (202)a | AO Unistat Bilirubinometer (American Optical, USA) | B |
| Harish, 1998 98373176 | India | ND (60) | Malloy and Evelyn method | B |
| Karrar, 1989 89271698 | Saudi Arabia | ND (155) | DuPont automatic clinical analyzer | B |
| Knudsen, 1993 93333142 | Denmark | Danish (73) | Standard diazo method | B |
| Knudsen, 1995 93333142 | Denmark | Danish (110) | Standard spectrophotometric method | B |
| Maisels, 1982 82273864 | US | White (157) | Automated modified diazo method | B |
| Smith, 1985 85112373 | US | Racial diverse (85)b | Method was not report | B |
| Taha, 1984 85070990 | Saudi Arabia | ND (68) | AO Unistat Bilirubinometer (American Optical, USA) | B |
| Dai, 1996 97093944 | Canada | Racial diverse (38) | Jendrassik-Grof method on a Hitachi 717 analyzer (Boehringer Mannheim, Laral, Quebec) | C |
| Sheridan-Pereira, 1982 83021365 | Ireland | White (60)c | Method of Jendrassik Grof. | C |
| Measurement site: At Mid-Sternum | ||||
| Lin, 1993 93383693 | Taiwan | Chinese (305) | Spectro-photometric method with BB-meter-Model III | A |
| Fok, 1986 86268743 | Hong Kong | Chinese (202)a | AO Unistat Bilirubinometer (American Optical, USA) | B |
| Linder, 1994 94325008 | Israel | Middle-eastern (123) | AO Unistat Bilirubinometer (American Optical, USA) | B |
| Maisels, 1982 82273864 | US | White (135) | Automated modified diazo method | B |
| Measurement site: Mixed Sites | ||||
| Carbonell, 2001 21130776 | Spain | ND (574+865) | Bilirubinometer Bil-Red by direct reading at 461 mcm | A |
| Yamauchi, 1990 90273849 | Japan | Japanese (78) | AO Unistat Bilirubinometer (American Optical, USA) | A |
| Christo, 1988 89253855 | South India | ND (91) | Technicon RA1000 within 0.5 and 1 hour from serum blood sample drawn | B |
| Laeeq, 1993 93267865 | Pakistan | ND (105) | Standard diazo method | B |
SB = serum bilirubin; TSB = total serum bilirubin
All infants were healthy, term (GA≥34), and not on phototherapy or exchange transfusion, unless noted:
1.5% G-6-PD deficiency; 11% ABO incompatible, but none of them show a positive Coomb's test
5% ABO incompatibility
“Most were white and Mexican-American extractions”
| Study, Year UI# | r | TP (n) | FN (n) | TN (n) | FP (n) | Sens (%) | Spec (%) | Thresholds | |
|---|---|---|---|---|---|---|---|---|---|
| TcB | SB (mg/dl) | ||||||||
| Measurement site: At Forehead | |||||||||
| Bhutta, 1991 92015688 | 0.66 | 35 | 5 | 32 | 28 | 88 | 53 | 17 | 12.5 |
| Bilgen, 1998 99123665 | 0.83 | 17 | 0 | 44 | 35 | 100 | 56 | 13 | 12.9 |
| Knudsen, 1990 91196560 | 0.84 | 28 | 0 | 61 | 118 | 100 | 34 | 15 | 13 |
| 28 | 0 | 93 | 86 | 100 | 52 | 16 | 13 | ||
| 24 | 4 | 113 | 66 | 86 | 63 | 17 | 13 | ||
| 22 | 6 | 134 | 45 | 79 | 75 | 18 | 13 | ||
| 15 | 13 | 158 | 21 | 54 | 88 | 19 | 13 | ||
| 0.84 | 91 | 4 | 56 | 56 | 96 | 50 | 15 | 10.2 | |
| 83 | 12 | 81 | 31 | 87 | 72 | 16 | 10.2 | ||
| 68 | 27 | 91 | 21 | 72 | 81 | 17 | 10.2 | ||
| 56 | 39 | 101 | 11 | 59 | 90 | 18 | 10.2 | ||
| 31 | 64 | 108 | 4 | 33 | 96 | 19 | 10.2 | ||
| Knudsen, 1992 92306741 | 0.63 | 28 | 0 | 13 | 97 | 100 | 12 | 7 | 11.7 |
| 27 | 1 | 29 | 81 | 96 | 26 | 8 | 11.7 | ||
| 21 | 7 | 72 | 38 | 75 | 65 | 9 | 11.7 | ||
| 16 | 12 | 48 | 62 | 57 | 81 | 10 | 11.7 | ||
| 9 | 19 | 102 | 8 | 32 | 93 | 11 | 11.7 | ||
| Lin, 1993 93383693 | 0.82 | 12 | 6 | 230 | 57 | 68 | 80 | Var | 12.9 |
| Schumacher, 1985 85241873 | 0.74 | 16 | 1 | 69 | 20 | 94 | 78 | 20 | 12.9 |
| Tsai, 1988 90177720 | 0.87 | 19 | 2 | 141 | 16 | 90 | 90 | 16 | 13 |
| Fok, 1986 86268743 | 0.86 | 16 | 0 | 122 | 64 | 100 | 66 | 22 | 15 |
| Harish, 1998 98373176 | 0.83 | 24 | 1 | 21 | 14 | 96 | 59 | 18 | 13 |
| Karrar, 1989 89271698 | 0.82 | 36 | 13 | 95 | 11 | 74 | 90 | 21 | 12.5 |
| Knudsen, 1993 93333142 | 0.85 | 30 | 0 | 5 | 38 | 100 | 11 | 13 | 13 |
| 30 | 0 | 8 | 35 | 100 | 19 | 14 | 13 | ||
| 30 | 0 | 15 | 28 | 100 | 35 | 15 | 13 | ||
| 29 | 1 | 22 | 21 | 97 | 51 | 16 | 13 | ||
| 28 | 2 | 32 | 11 | 93 | 74 | 17 | 13 | ||
| Knudsen, 1995g93333142 | ND | 9 | 0 | 57 | 44 | 100 | 56 | 9 | 18 |
| 8 | 1 | 67 | 34 | 89 | 66 | 9.5 | 18 | ||
| 7 | 2 | 79 | 22 | 78 | 78 | 10 | 18 | ||
| 3 | 6 | 93 | 8 | 33 | 92 | 11 | 18 | ||
| Maisels, 1982 82273864 | 0.93 | 7 | 0 | 145 | 5 | 100 | 97 | 24 | 13 |
| 20 | 2 | 121 | 14 | 91 | 90 | 20 | 10 | ||
| Smith, 1985 85112373 | 0.90 | 6 | 1 | 64 | 14 | 86 | 82 | Comb | 12.9 |
| Taha, 1984 85070990 | 0.88 | 11 | 5 | 96 | 8 | 69 | 92 | 22.5 | 12.9 |
| Dai, 1996 97093944 | ND | 8 | 0 | 21 | 11 | 100 | 67 | 17 | 15.2 |
| Sheridan-Pereira, 1982 83021365 | 0.70 | 4 | 0 | 40 | 13 | 100 | 75 | 20 | 14.5 |
| Measurement site: Sternum | |||||||||
| Lin, 1993 93383693 | 0.86 | 15 | 3 | 241 | 46 | 84 | 84 | Var | 12.9 |
| Fok, 1986 86268743 | 0.91 | 10 | 6 | 158 | 28 | 63 | 85 | 22 | 15 |
| Linder, 1994 94325008 | 0.96 | 19 | 4 | 136 | 3 | 86 | 96 | ND | 12.9 |
| Maisels, 1982 82273864 | 0.93 | 4 | 0 | 126 | 5 | 100 | 96 | 23 | 13 |
| 11 | 0 | 105 | 19 | 100 | 85 | 19 | 10 | ||
| Measurement site: Mixed sites | |||||||||
| Carbonell, 2001 21130776 | 0.92 | 15 | 3 | 368 | 188 | 83 | 66 | 11 h | 17 |
| 17 | 1 | 287 | 269 | 94 | 52 | 13 j | 17 | ||
| 45 | 1 | 262 | 557 | 98 | 32 | 13 k | 17 | ||
| Yamauchi, 1990 90273849 | ND | 17 | 5 | 35 | 21 | 77 | 63 | 14 h | 15 |
| 20 | 2 | 47 | 9 | 91 | 84 | 21 i | 15 | ||
| Christo, 1988 89253855 | 0.90 | 16 | 0 | 65 | 10 | 100 | 87 | 17 | 13 |
| 9 | 0 | 24 | 2 | 100 | 92 | 23 | 17 | ||
| Laeeq, 1993 93267865 | 0.77 | - | - | - | - | 97 | 41 | ND | 6 |
| - | - | - | - | 91 | 78 | ND | 9.92 | ||
| - | - | - | - | 90 | 99 | ND | 15 | ||
TP = true positive; FN = false negative; TN = true negative; FP = false positive; N = number of subjects; n = number of measurements; Sens = sensitivity; Spec = specificity; TcB = transcutaneous bilirubin; SB = serum bilirubin; r = regression coefficient between TcB and SB
r = Pearson's correlation coefficient
Comb = Elevated TcB, Elevated ETco, or Elevated both TcB and Etco. “Elevated” was defined as more than 1 SD above the mean
Var = TcB ≥ 11, ≥ 16, or ≥ 20 on the 1st, 2nd, and following days of life
Corrected TcB was use: TcB reading taken at median 21 hours of life was “corrected” by the difference in TcB readings obtained 6 hours later.
TcB reading at 24 hours of life
TcB reading on day 3 of life
TcB reading at 48 hours of life in the phase I study
TcB reading at 48 hours of life in the phase II study
An additional challenge to comparison across studies is that each instrument must be calibrated on a daily basis as recommended by the manufacturer and each institution must establish the relationship between the jaundice meter index, TcB, and the TSB, in part due to interlaboratory variations of TSB measurements (Schreiner and Glick, 1982). As can be seen in the table, the TcB index that correlates with a given level of TSB varies from center to center, and as discussed above, the manufacturer recommends that each site develop its own correlation curves.
In 38 of the 41 studies, linear regression analysis was performed generating Pearson's correlation coefficient (r) for TcB versus TSB. Overall, it does appear that a linear relationship between TcB as measured by Minolta AirShields bilirubinometer and the TSB in predominately term or near-term infants not on phototherapy. The pooled correlation coefficient was 0.84 (95% CI 0.81–0.87) (see Figure S3 in Meta-Analyses).
| Study, Year UI # | Country | Race | Sample N | Correlation Coefficient (r) | Quality | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Forehead | Sternum | Abdomen | Upper back | Lower back | Palm | Sole | |||||
| Lin, 1993 93383693 | Taiwan | Chinese | 202 | 0.82 | 0.86 | - | - | - | - | - | A |
| Tsai, 1988 90177720 | Taiwan | Chinese | 98 | 0.87 | 0.78 | 0.76 | 0.69 | 0.69 | 0.49 | 0.47 | A |
| Yamauchi, 1991 92188737 | Japan | Japanese | 336 | 0.91 | 0.92 | 0.89 | 0.89 | 0.88 | - | 0.77 | A |
| Yamauchi, 1988 89086035 | |||||||||||
| Fok, 1986 86268743 | Hong Kong | Chinese | 202 | 0.86 | 0.91 | - | - | - | - | - | B |
| Hegyi, 1981 81144434 | US | White | 43 | 0.77 | 0.90 | 0.88 | 0.87 | 0.86 | 0.72 | 0.72 | B |
| Kenny, 1984 89271698 | US | White | 18 | 0.85 | 0.93 | - | 0.83 | - | - | - | B |
| Knudsen, 1989 89189829 | Denmark | ND | 76 | 0.86 | - | 0.89 | - | - | - | - | B |
| Knudsen, 1990 90242782 | |||||||||||
| Maisels, 1982 82273864 | US | White | 135 | 0.93 | 0.93 | - | - | - | - | - | B |
| Tan, 1985 86101839 | Singapore | Chinese | 224 | 0.93 | 0.94 | - | 0.86 | - | - | - | B |
| Tan, 1982 83044572 | Malay | 303 | 0.87 | 0.90 | - | 0.87 | - | - | - | ||
| Tan, 1996 97017283 | Singapore | Racial diverse a | 542 | 0.80 | 0.75 | - | - | - | - | - | B |
| Yamauchi, 1989 89348834 | Japan | Japanese | 114 | 0.90 | 0.92 | - | - | - | - | - | B |
Chinese + Malay + Indian infants
| Study, Year UI # | Country | Race (N) | Subgroups | Pairs (N) | Coefficient (r) | Significance | Quality |
|---|---|---|---|---|---|---|---|
| Bhat, 1987 88085341 | India | ND (100) | GA ≥ 37 wk | 63 | 0.72 | P=.123 | B |
| GA < 37 wk | 37 | 0.52 | |||||
| BW > 2500 g | 37 | 1.03 ? | n.d | ||||
| BW ≤ 2500 g | 63 | 0.94 | |||||
| Hanneman, 1982 82105249 | US | White (57) | Infants GA≥38 wks | 35 | 0.90 | P=.739 | B |
| Infants GA 34–37 wks | 22 | 0.88 | |||||
| Harish, 1998 98373176 | India | ND (33) | Term infants | 60 | 0.83 | P=.363 | B |
| SGA term + Preterm infants | 40 | 0.76 | |||||
| Sharma, 1988 89122266 | India | ND (200) | Term infants | 120 | 0.74 | P=.485 | B |
| Preterm infants | 80 | 0.69 | |||||
| BW ≥ 2.5 Kg | 110 | 0.85 | P=.371 | ||||
| BW < 2.5 Kg | 90 | 0.81 | |||||
| Suckling, 1995 95327901 | Scotland | White (55) | Term AGA | 37 | 0.802 | P=.33 | C |
| Preterm or SGA | 39 | 0.702 | |||||
n.d. = not done, because the correlation 1.03 is not valid
Bold = significantly different between subgroups (p < .10)
? = Unknown
| Study, Year UI # | Country | Race (N) | Subgroups | Pairs (N) | Coefficient (r) | Significance | Quality |
|---|---|---|---|---|---|---|---|
| Boo, 1984 85085660 | Malaysia | Mixed (105) | Malay infants | 38 | 0.83 | M vs. C: P=.101 | B |
| Chinese infants | 37 | 0.66 | M vs. I: P=.081 | ||||
| Indian infants | 30 | 0.63 | C vs. I: P=.842 | ||||
| Bourchier, 1987 88247126 | New Zealand | Mixed (729) | Caucasian infants | 412 | 0.58 | P=.842 | B |
| Maori infants | 317 | 0.57 | |||||
| Black infants | 108 | 0.52 | |||||
| Hanneman, 1982 82105249 | US | Mixed (99) | White Infants GA≥34 wks | 57 | 0.88–0.90 | P=.025 | B |
| Black infants GA≥34 wks | 42 | 0.74 | |||||
| Tan, 1985 86101839 | Singapore | Mixed (125) | Chinese infants | 102 | 0.93 | P=.018 | B |
| Tan, 1982 83044572 | Malay infants | 103 | 0.87 | ||||
| Tan, 1996 97017283 | Singapore | Mixed (542) | Chinese infants | 253 | 0.73 | M vs. C: P=.003 | B |
| Malay infants | 169 | 0.84 | M vs. I: P=.435 | ||||
| Indian infants | 120 | 0.81 | C vs. I: P=.077 | ||||
| Goldman, 1982 82241425 | US | Mixed (84) | White infants | 95 | 0.71 | P=.035 | C |
| Black infants | 108 | 0.52 | |||||
Bold = significantly different between subgroups (p < .10)
| Study, Year UI # | Country | Race (N) | Subgroups | Pairs (N) | Coefficient (r) | Significance | Quality |
|---|---|---|---|---|---|---|---|
| Bourchier, 1987 88247126 | New Zealand | Mixed (729) | Caucasian infants | 412 | 0.67 | P=.253 | B |
| Maori infants | 317 | 0.62 | |||||
| Tan, 1985 86101839 | Singapore | Mixed (125) | Chinese infants | 69 | 0.93 | P=.115 | B |
| Tan, 1982 83044572 | Malay infants | 103 | 0.90 | ||||
| Tan, 1996 97017283 | Singapore | Mixed (542) | Chinese infants | 253 | 0.78 | M vs. C: P=.004 | B |
| Malay infants | 169 | 0.87 | M vs. I: P=.230 | ||||
| Indian infants | 120 | 0.83 | C vs. I: P=.202 | ||||
Bold = significantly different between subgroups (p < .10)
Meta-analysis of correlation coefficient (r) was performed for different races at forehead. A total of 564, 258, 392, 150, and 310 pairs of measurements were available for white, black, Chinese, Indian and Malay infants. At forehead, the pooled r for white infants was 0.65 (95%CI 0.59–0.69), for black infants was 0.56 (95%CI 0.47–0.64), for Chinese infants was 0.80 (95%CI 0.77–0.84), for Indian infants was 0.78 (95%CI 0.71–0.84), and for Malay infants was 0.85 (95%CI 0.82–0.88) (see Figure S6 in Meta-Analyses).
In two of the four studies that analyzed test performance separately for those who had received phototherapy, there were significantly lower correlation coefficients for those who had received phototherapy with a similar trend seen in the other two studies. Fok, Lau, Hui, et al. (1986) found that the correlation coefficient between TcB indices and TSB levels dropped from 0.91 to 0.26 at the sternum with exposure to phototherapy and from 0.86 to 0.79 for those in the shaded area of the forehead.
Meta-analysis of correlation coefficient (r) was performed for infants not on phototherapy and those on phototherapy. The pooled r for infants on phototherapy was 0.78 (95% CI 0.72–0.83) and that was 0.85 (95% CI 0.79–0.89) for infants not on phototherapy (see Figure S7 in Meta-Analyses).
| Study, year UI#; | Country | Race (N) | Subgroups | Pairs (N) | Coefficient (r) | Significance | Quality |
|---|---|---|---|---|---|---|---|
| Carbonell, 2001 21130776 | Spain | ND (574) | < 24 hrs | 120 | 0.77 | <24 vs. 24–48 hrs: P=.194 | A |
| 24–48 | 126 | 0.83 | <24 vs. >48 hrs: P=.106 | ||||
| < 48 hrs | 412 | 0.83 | 24–48 vs. >48 hrs: P=1.000 | ||||
| Yamauchi, 1991 92188736 | Japan | Japanese 107 | Infants days 1–5, Sunlight | 28 | 0.812 | Sunlight vs. no sunlight 1–5 day: P=.159 | A |
| Infants days 1–5, No Sunlight | 53 | 0.901 | Sunlight 1–5 vs. 6–7 day: P=.018 | ||||
| Infants days 6–7, Sunlight | 53 | 0.937 | No sunlight 1–5 vs. 6–7 day: P=.936 | ||||
| Infants days 6–7, No sunlight | 52 | 0.903 | |||||
| Yamauchi, 1991 92188737 | Japan | Japanese (336) | Infants, days 0–3 | 68 | 0.922 | Days 0–3 vs. 4–5: P=.002 | A |
| Yamauchi, 1988 89086035 | Infants, days 4–5 | 159 | 0.814 | Days 0–3 vs. 6–12: P=.678 | |||
| Infants, days 6–12 | 349 | 0.930 | Days 4–5 vs. 6–12: P<0.0001 | ||||
| Kenny, 1984 89271698 | US | Caucasian (53) | Mixed-fed infants | 41 | 0.846 | M vs. B: P=.073 | B |
| Breast-fed infants | 24 | 0.939 | M vs. F: P=.921 | ||||
| Formula-fed infants | 17 | 0.837 | B vs. F: P=.133 | ||||
Bold p < .10
| Study, Year, UI# | Country | Race (N) | Subgroups | Pairs (N) | Coefficient (r) | Significance | Quality |
|---|---|---|---|---|---|---|---|
| Carbonell, 2001 21130776 | Spain | ND (574) | < 24 hrs | 120 | 0.81 | <24 vs. 24–48 hrs: P=.1022 | A |
| 24–28 hrs | 126 | 0.89 | <24 vs. >48 hrs: P<.00001 | ||||
| >48 hrs | 412 | 0.94 | 24–48 vs. >48 hrs: P=.002 | ||||
| Yamauchi, 1991 92188736 | Japan | Japanese (107) | Infants days 1–5, Sunlight | 28 | 0.892 | Sunlight vs. no sunlight 1–5 day: P=.937 | A |
| Infants days 1–5, No sunlight | 53 | 0.888 | Sunlight 1–5 vs. 6–7 day: P=.800 | ||||
| Infants days 6–7, Sunlight | 53 | 0.904 | No sunlight 1–5 vs. 6–7 day: P=.280 | ||||
| Infants days 6–7, No sunlight | 52 | 0.926 | |||||
| Yamauchi, 1991 92188737 | Japan | Japanese (336) | Infants, days 0–3 | 68 | 0.905 | Days 0–3 vs. 4–5: P=.134 | A |
| Yamauchi, 1988 89086035 | Infants, days 4–5 | 159 | 0.856 | Days 0–3 vs. 6–12: P=.067 | |||
| Infants, days 6–12 | 349 | 0.941 | Days 4–5 vs. 6–12: P<0.0001 | ||||
Bold = p < .10
| Study, Year UI # | Country | Race (N) | Reference Standard: Laboratory-based Assay of TSB | Quality |
|---|---|---|---|---|
| Bhutani, 2000 20381429 | US | Mixed race (517) | High-performance liquid chromatography (HPLC) | A |
| Lodha, 2000 20368382 | India | Indian (109) | Twin beam Microbilimeter (Ginevri Technologie Biomediche) | A |
| Rubaltelli, 2001 21283339 | Europe | Mixed race (210)a | High-performance liquid chromatography (HPLC) | A |
All infants were healthy, term (GA≥34), and not on phototherapy or exchange transfusion, unless noted:
20 percent infants ≤ 36 weeks
MMSR = multi-wavelength spectral reflectance
| Study, Year, UI # | r | TP (n) | FN (n) | TN (n) | FP (n) | Sens (%) | Spec (%) | Thresholds | |
|---|---|---|---|---|---|---|---|---|---|
| BiliC | SB (mg/dl) | ||||||||
| Bhutani, 2000 20381429 | 0.91 | 23 | 0 | 349 | 147 | 100 | 88 | ≥ 75th %tile | ≥ 95th %tile |
| Lodha, 2000 20368382 | 0.83 | - | - | - | - | 69 | 89 | 13 | 13 |
| - | - | - | - | 47 | 99 | 15 | 15 | ||
| - | - | - | - | 20 | 100 | 18 | 18 | ||
| Rubaltelli, 2001 21283339 | 0.87~0.89 | - | - | - | - | 97 | 64 | 10 | 13 |
| - | - | - | - | 93 | 73 | 11 | 13 | ||
| - | - | - | - | 86 | 85 | 12 | 13 | ||
| - | - | - | - | 66 | 89 | 13 | 13 | ||
| - | - | - | - | 92 | 71 | 12 | 15 | ||
| - | - | - | - | 81 | 82 | 13 | 15 | ||
| - | - | - | - | 63 | 92 | 14 | 15 | ||
| - | - | - | - | 52 | 95 | 15 | 15 | ||
| - | - | - | - | 90 | 87 | 14 | 17 | ||
| - | - | - | - | 77 | 91 | 15 | 17 | ||
| - | - | - | - | 63 | 96 | 16 | 17 | ||
| - | - | - | - | 50 | 99 | 17 | 17 | ||
TP = true positive; FN = false negative; TN = true negative; FP = false positive; n / N = number of subjects; Sens = sensitivity; Spec = specificity; BiliC = BiliCheck measurement; SB = serum bilirubin; r = regression coefficient between BiliC and SB
All infants were healthy, term (GA≥34), and not on phototherapy or exchange transfusion, unless noted
| Study | Country | Race (N) | Subgroups | Pairs (N) | Coefficient (r) | Significance | Quality |
|---|---|---|---|---|---|---|---|
| Christo, 1988 89253855 | India | ND (117) | Term infants w/o PhotoRx | 91 | 0.89 | P=.479 | B |
| Infants on PhotoRx | 26 | 0.85 | |||||
| Fok, 1986 862687 43 | Hong Kong | Chinese (259) | Infants w/o PhotoRx | 605 | 0.86 | P=.043 | B |
| Infants on PhotoRx | 100 | 0.79 | |||||
| Hanneman, 1982 82105249 | US | White (87) | Infants GA≥38 wks w/o PhotoRx | 35 | 0.90 | P=.736 | B |
| Infants GA≥38 wks on PhotoRx | 7 | 0.86 | |||||
| Infants GA 34–37 wks w/o PhotoRx | 22 | 0.88 | P=.387 | ||||
| Infants GA 34–37 wks on PhotoRx | 23 | 0.80 | |||||
| Harish, 1998 98373176 | India | ND (33) | Term infants w/o PhotoRx | 60 | 0.83 | P=.067 | B |
| Term infants on PhotoRx | 33 | 0.65 | |||||
| Sheridan-Pereira, 1982 83021365 | Ireland | White (60) | Term w/o PhotoRx | 57 | 0.65 | P=.39 | C |
| Term on PhotoRx | 24 | 0.76 | |||||
PhotoRx = phototherapy
Bold = significantly different between subgroups (p < .10)
The final study (Rubaltelli, Gourley, Loskamp, et al., 2001) was a multicenter study that attempted to evaluate the pooled performance of multiple BiliCheck™ devices in a diverse international population with multiple users and multiple laboratory methods of bilirubin determination compared to the “gold standard” of HPLC determination of bilirubin. Thirty-five infants undergoing TSB as part of their normal care were recruited from six different European hospitals (N=210) who underwent BiliCheck™ measurements at both forehead and sternum, TSB determinations by each hospital's usual laboratory method and HPLC TSB at a single reference laboratory. All infants were greater than 30 weeks GA, and 20 percent of the sample was less than 36 weeks GA. The correlation coefficient (r) for the combined BiliCheck™ TcB data was 0.89 at the forehead and 0.88 at the sternum compared with the gold standard TSB measured by HPLC. The correlation for TcB and laboratory TSB was similar, r=0.87. The intra-device coefficient of variation for BiliCheck™ forehead was 6.69 percent. HPLC TSB and standard laboratory measurements had high correlation (r=0.927). The BiliCheck™ TcB at the forehead and the laboratory TSB underestimated TSB measured by HPLC slightly. With a threshold TSB by HPLC of 13 and BiliCheck™ TcB of 11 the sensitivity was 93 percent and specificity 73 percent (similar to the performance of the laboratory TSB of 95 percent and 76 percent respectively). At a threshold TSB of 17 mg/dl, the TcB had 90 percent sensitivity and 87 percent specificity, slightly better than standard laboratory TSB of 87 percent and 83 percent. The authors conclude that the accuracy and precision of the TcB by BiliCheck™ was comparable to the standard laboratory TSB when compared to the gold standard TSB measurement by HPLC. Analysis of covariance found no differences in test performance by postnatal age, gestational age, birth weight or race; however, the majority of infants were white (66.7 percent) and only 4.3 percent were of African descent.
| Study, Year, UI # | Country | Race (N) | Reference Standard: Laboratory-based assay of TSB | Quality |
|---|---|---|---|---|
| Bilgen, 1998 99123665 | Turkey | ND (96) | Bilitron 444 direct spectrophotometric method | A |
| Schumacher, 1985 85241873 | US | White (106) | Dupont ACA III analyzer, a direct spectrophotometry method | A |
| Gupta, 1991 92091059 | India | ND (88)a | Type BM2-TOYO bilirubin analyser | B |
All infants were healthy, term (GA≥34), and not on Phototherapy or exchange transfusion, unless noted:
77 term (GA≥37) infants; 11 preterm (GA 35–36) infants
| Study, Year, UI # | r | TP (n) | FN (n) | TN (n) | FP (n) | Sens (%) | Spec (%) | Thresholds | |
|---|---|---|---|---|---|---|---|---|---|
| Ict | SB (mg/dl) | ||||||||
| Bilgen, 1998 99123665 | 0.78 | 17 | 0 | 38 | 41 | 100 | 48 | 3 | 12.9 |
| Schumacher, 1985 85241873 | 0.63 | 14 | 3 | 66 | 23 | 82 | 74 | 3 | 12.9 |
| Gupta, 1991 92091059 | 0.97 | Term infants (N=77) | 97 | 71 | 3 | 10 | |||
| Preterm (GA 35–36) infants (N=11) | 50 | 86 | 3 | 10 | |||||
TP = true positive; FN = false negative; TN = true negative; FP = false positive; n / N = number of subjects; Sens = sensitivity; Spec = specificity; Ict = Ictermeter Index; SB = serum bilirubin; r = regression coefficient between Ict and SB
All infants were healthy, term (GA≥34), and not on Phototherapy or exchange transfusion, unless noted
One recent study, (Robertson, Kazmierczak, and Vos, 2002) compared the Jaundice Meter™ JM 102 (Minolta/Hill-ROM Air-Shields) to the SpectRx BiliCheck™. Jaundiced term infants not receiving phototherapy (n=101) had transcutaneous bilirubinometry (TcB) measurements taken at the forehead with both instruments and a serum bilirubin measurement (TSB) by the standard laboratory method (colorimetric diazonium salt method). In addition, skin pigmentation of the forehead was graded. The BiliCheck™ performed superiorly (r=.937) to the Jaundice Meter™ (r=0.704) in predicting TSB. With the BiliCheck™, skin color was not statistically significant in predicting TSB (p=0.890); however, skin color was significant with the Jaundice Meter™ (p=0.002).
Two studies compared the performance or the Minolta AirShields bilirubinometer to the Ingram Icterometer in predicting serum bilirubin in healthy, jaundiced term infants (Bilgen, Ince, Ozek, et al., 1998; Schumacher, Thornbery, and Gutcher, 1985). In both studies, there was a linear relationship between the TcB and TSB with the Minolta AirShields bilirubinometer having higher correlation (r=0.74 and 0.83) than the icterometer (r=0.63 and 0.78). However, both devices performed similarly as screening devices with 100 percent sensitivity to predict TSB > 12.9 in the Bilgen study and no statistically significant difference between the two methods to predict TSB >12.9 in the Schumacher study (94 percent sensitivity, 77.5 percent specificity for the Minolta AirShields device and 82 percent sensitivity, 74 percent specificity for the Ingram icterometer).
Summarizing case reports of kernicterus from different investigators in different countries from different time periods is problematic. First, definitions used in these reports varied greatly. They included gross yellow staining of the brain, microscopic neuronal degeneration, acute clinical neuromotor impairment, neuro-auditory impairment and chronic neuromotor impairment. In some cases, the diagnoses were not established until months or years after birth. Second, case reports without controls makes interpretation difficult, especially in infants with comorbid factors, and could very well lead to misinterpretation of the role of bilirubin in neurodevelopmental outcomes. Third, different reports used different outcome measures. “Normal at follow-up” may be based on parental reporting, physician assessment, or formal neuropsychological testing. Fourth, time of reported follow up ranged from days to years. Fifth, cases were reported from different countries at different time periods and with different milieu (some have high prevalence of G6PD and some use naphthalene moth balls routinely in their dressers) and varying standards of practice concerning hyperbilirubinemia. The effect of the differences on outcomes cannot be known for certain. Finally, it is difficult to infer from case reports the true incidence of this uncommon disorder. Nevertheless, one can conclude from these data that high levels of serum bilirubin are strongly associated with kernicterus. The distribution of bilirubin levels in reported cases of kernicterus is substantially skewed compared to the population-based distribution of neonatal bilirubin levels described by Newman, Escobar, Gonzales et al. (1999). (See Figure 4.1
Based on our summary of multiple case reports that spanned more than 30 years, we conclude that kernicterus, although infrequent, has significant mortality (at least 10 percent) and long-term morbidity (at least 70 percent). It is evident that the preponderance of kernicterus cases occurred in infants with high bilirubin (>20mg/dl). This summary affirms the role of elevated bilirubin level in kernicterus but does not provide incontrovertible proof of causality in this association.
Five of 26 (19 percent) term or near-term infants with kernicterus and reported follow-up data survived without sequelae, while only three of 63 (5 percent) infants with kernicterus and comorbid factors were reported to be normal at follow up. This suggests the importance of comorbid factors in determining long-term outcome in infants initially diagnosed with kernicterus.
Only a few prospective controlled studies looked specifically at behavioral and neurodevelopmental outcomes in healthy term infants with hyperbilirubinemia. Most of these studies have a small number of subjects. Two very short-term studies with well-defined measurement of newborn behavioral organization and physiologic measurement of cry are of high quality methodology (Escher-Graub and Fricker, 1986; Rapisardi, Vohr, Cashore et al., 1989). But the significance of long term abnormalities in newborn behavior scales and variations in cry formant frequencies are unknown. There remains very little information on the long-term effects of hyperbilirubinemia in healthy term infants.
Among the mixed studies (combined term and preterm; non-hemolytic and hemolytic; non-diseased and diseased) and studies with subjects from racial backgrounds other than black and white infants, the following observations can be made:
Excluding the Collaborative Perinatal Project (CPP) study and the studies looking at IQ, out of nine studies looking primarily at behavioral and neurodevelopmental outcomes, only three studies were of high methodologic quality. One very short-term study showed a correlation between bilirubin level and decreased scores on newborn behavioral measurements (Vohr, Lester, Rapisardi et al., 1989). One found no difference in prevalence of CNS abnormalities at age 4 years when bilirubin was below 20 mg/dl, but infants with bilirubin above 20 mg/dl had a higher prevalence of CNS abnormalities (Hyman, Keaster, Hanson et al, 1969). Another study that followed infants with bilirubin greater than 16 mg/dl found no relationship between bilirubin and neuro-visual-motor testing at 61 to 82 months of age (Valaes, Kipouros, Petmezaki et al., 1980). Although data reported in the rest of the studies are not of the highest caliber, there is a suggestion of abnormalities in neurodevelopmental screening tests in infants with bilirubin > 20 mg/dl, at least by the Denver Developmental Screening Test (DDST), when followed up at one year of age. It appears that bilirubin > 20 mg/dl may have short-term (up to one year of age) adverse effects at least by DDST, but there is no strong evidence to suggest neurologic abnormalities in children with neonatal bilirubin > 20 mg/dl when followed up to 7 years of age.
Nine of 15 studies (excluding the CPP) addressing neuro-auditory development and bilirubin level were of high quality. Six of them showed BAER abnormalities correlate with high bilirubin levels. The majority reported resolution with treatment. Three studies reported hearing impairment associated with elevated bilirubin (> 16 mg/dl to > 20 mg/dl). We conclude that high bilirubin level does have an adverse effect on neuro-auditory development, but the adverse effect on BAER is reversible.
Again, excluding the CPP, of the eight studies reporting intelligence outcomes in subjects with hyperbilirubinemia, four studies were considered high quality. These four studies reported no association between IQ and bilirubin level with follow up ranging from 6.5 years to 17 years. Seidman, Paz, Stevenson, et al. (1991) also reported no direct linear association between bilirubin levels and IQ scores. However, the risk for IQ score < 85 at age 17 years was found to be significantly higher among full-term Coombs-negative male subjects with serum bilirubin > 20 mg/dl. We conclude that there is no evidence to suggest a linear association of bilirubin level and IQ.
The Collaborative Perinatal Project (CPP), with 54,795 live births between 1959 and 1966 from 12 centers in the U.S. has, by far, the largest database for the study of hyperbilirubinemia. Newman and Klebanoff (1993), focusing only on black and white infants with birthweight ≥ 2500 grams, did a comprehensive analysis of 7-year outcome in 33,272 subjects. All causes of jaundice were included in the analysis. The study found no consistent association between peak bilirubin level and IQ. Sensorineural hearing loss was not related to bilirubin level. Only the frequency of abnormal or suspicious neurologic examinations was associated with bilirubin level.
A large prospective study comprised of healthy infants ≥ 34-week gestation with hyperbilirubinemia, specifically looking at long-term neurodevelopmental outcomes, has yet to be done. The report of Newman and Klebanoff (1993) came closest to that ideal because of the large number of subjects and the study analytic approach. However, a population born from 1959 to 1966 may no longer be relevant to present day newborns. There are more infants born to non-black and non-white mothers. More infants are breastfed. Phototherapy for hyperbilirubinemia has become standard therapy. Hospital stays are shorter. All these and other factors may have unknown long-term developmental effects on present day newborns with hyperbilirubinemia.
Short-term studies reviewed in this section, while in general having good methodologic quality, use tools that have unknown long-term predictive abilities. Long-term studies suffer from high attrition rates of the study population and a non-uniform approach to defining “normal neurodevelopmental outcomes.” The total bilirubin levels reported in all the above studies were measured anywhere from first day of life to more than two weeks of life. Definitions of significant hyperbilirubinemia ranged from 12 mg/dl to 20 mg/dl.
Given the diversity of conclusions reported, except in cases of kernicterus with sequelae, it is evident that the use of a single total serum bilirubin level (within the range described in the reviewed studies) to predict long-term behavioral or neurodevelopmental outcomes is inadequate and will lead to conflicting results.
Regardless of different protocols of phototherapy, the NNT for prevention of serum bilirubin level exceeding 20 mg/dl ranged from six to 10 in healthy term or near-term infants. This implies that one needs to treat six to 10 healthy, but jaundiced, neonates with TSB ≥15 mg/dl by phototherapy in order to prevent one event (TSB >20 mg/dl). Evidence for the efficacy of treatments for neonatal hyperbilirubinemia was limited. Overall, the four qualifying studies showed that phototherapy had an absolute risk reduction rate of 10 percent to 17 percent for prevention of serum bilirubin exceeding 20 mg/dl in healthy and jaundiced infants (TSB ≥13 mg/dl) with a gestational age of 34 weeks or more. Phototherapy combined with cessation of breastfeeding and substitution with formula was found to be the most efficient treatment protocol for healthy term or near-term infants with jaundice.
Eight studies examined the effect of bilirubin reduction on brainstem auditory response. All consistently showed treatments for neonatal hyperbilirubinemia significantly improved abnormal BAER's in both healthy jaundiced infants and jaundiced infants with hemolytic disease.
There is no evidence to suggest that phototherapy for neonatal hyperbilirubinemia has any long-term neurodevelopmental adverse effect in either healthy jaundiced infants or infants with hemolytic disease.
Nine studies evaluated the effect of phototherapy on neurodevelopment. Of the five studies that looked at infant behavior, three reported lower scores in the orientation cluster of the Brazelton Neonatal Behavioral Assessment Scale in the phototherapy-treated infants. The other two studies did not find behavioral changes in the phototherapy group.
Three studies evaluated the effect of phototherapy on visual outcomes. All showed no short or long-term (up to 36 months) effect on vision as a result of phototherapy while infants' eyes are properly protected during treatment.
For accuracy of various strategies for prediction of neonatal hyperbilirubinemia, 153 articles were included after title and abstract screening. Only 17 articles were included after full text screening and 10 articles remained after data abstraction, seven percent of the original number. This was the lowest yield of articles among the five questions addressed, suggesting that relatively few prospective or retrospective studies addressed this question without significant design or reporting errors, missing data or apparent bias.
A conclusion is difficult to draw from these studies. The first challenge is the lack of consistency in defining clinically significant neonatal hyperbilirubinemia. Not only did multiple studies use different levels of total serum bilirubin (TSB) to define neonatal hyperbilirubinemia, but the levels of TSB defined as significant also varied by age, yet age at TSB determination varied by study as well. For example, significant levels of TSB were defined as >11.7 mg/dl (Knudsen, 1992), ≥15 mg/dl (Okuyama, Yonetani, Uetani, et al., 2001), >15 mg/dl (Awasthi and Rehman, 1998), >16 mg/dl (Risemberg, Mazzi, MacDonald, et al., 1977), >17 mg/dl (Carbonell, Botet, Figueras, et al., 2001) and ≥25 mg/dl (Newman, Xiong, Gonzales, et al., 2000).
The second challenge is the lack of consistency in study populations. These studies were conducted among multiple racial groups in multiple countries, including China, Denmark, India, Israel, Japan, Spain and the United States. Although infants were defined as healthy term and near-term newborns, these studies included neonates with potential for hemolysis from ABO incompatible pregnancies (Risemberg, Mazzi, MacDonald, et al., 1977), as well as breastfed and bottle-fed infants (often not specified).
Based on the evidence from the systematic review, transcutaneous measurements of bilirubin by each of the three devices described in the literature, the Minolta AirShields bilirubinometer, the Ingram Icterometer, and the SpectRx BiliCheck™ have a linear correlation to total serum bilirubin and may be useful as screening devices to detect clinically significant jaundice and decrease the need of serum bilirubin determinations.
The Minolta AirShields bilirubinometer appears to perform less well in black infants as compared to white infants, performs best when measurements are made at the sternum, and performs less well when infants have been exposed to phototherapy. This instrument requires daily calibrations and each institution must develop its own correlation curves of TcB to TSB. As a screening test it does not perform consistently across studies as evidenced by the summary ROC curves. The Ingram Icterometer has the added limitation of lacking objectivity of the other methods as it depends on observer visualization of depth of yellow color of the skin.
The recently introduced BiliCheck™ device that utilizes reflectance data from multiple wavelengths appears to be a significant improvement over the older devices, the Ingram Icterometer and the Minolta AirShields bilirubinometer, because of its ability to determine correction factors for the effect of melanin and hemoglobin. In one study the BiliCheck™ was shown to be as accurate as the reference standard method of HPLC in predicting TSB (Rubaltelli, Gourley, Loskamp, et al., 2001). Future research should confirm these findings in larger samples of diverse populations and include effects of phototherapy.
As mentioned in the Conclusions chapter, kernicterus has many different definitions. Reaching a national consensus in defining this entity, as in the model suggested by Johnson, Bhutani and Brown (2002), will also help in formulating a valid comparison of different databases.
Kernicterus, in general, is associated with high bilirubin level (greater than 20 mg/dl). Without good prevalence and incidence data on hyperbilirubinemia and kernicterus, one would not be able to estimate the risk of kernicterus at a given bilirubin level. Making extreme hyperbilirubinemia and kernicterus a reportable condition would be a first step in that direction.
As kernicterus is infrequent, doing a case control study with kernicterus may help to delineate the role of bilirubin in the development of kernicterus. The necessity of multicenter cooperation is evident. One may have to rely on animal experiments to seek proof that bilirubin is the etiologic agent in kernicterus. It is important to seek other factors that may act in concert with bilirubin to cause kernicterus. Fundamental research on the blood brain barrier will likely yield useful additional data.
It is also clear that an absolute bilirubin level is insufficient by itself to predict long-term neurodevelopmental outcome. The data summarized in this report suggests that using conventional measures of bilirubin to study the effect on neurodevelopment will continue to yield conflicting results. Duration of hyperbilirubinemia and measures of bilirubin binding may provide a better approximation of the effect of bilirubin exposure on the central nervous system. Using hour-specific bilirubin, documenting length of exposure and employing laboratory-proven methods to measure bilirubin-albumin binding may therefore yield a more useful profile of bilirubin. These factors merit further investigation.
One area for further research would be the validation of an age-specific (by hour) nomogram for TSB in healthy full term infants, with evaluation of potential differences by gender, race and ethnicity, as well as prenatal, natal and postnatal factors. Once established, use of the 95th percentile to define clinically significant jaundice would provide uniformity across studies. This would be analogous to the use of age-specific systolic and diastolic blood pressure to define hypertension and age-specific body mass index to define overweight and obesity in children. Validation of a standardized TSB nomogram as discussed above would either incorporate these potential differences or result in the development of population-specific nomograms if differences were significant. This would be analogous to the use of population-specific growth charts for weight and height percentiles in children.
New technological advances in the transcutaneous measurement of bilirubin, such as the Bilicheck™ and Colormate III (that have the ability to correct for skin color effects and hemoglobin) have been introduced and appear promising. Future research should validate these initial findings in diverse clinical populations and address issues that might affect performance such as race, gestational age, age at measurement, phototherapy, sunlight exposure, feeding, and accuracy as screening instruments and for ongoing monitoring of jaundice. Additionally, studies should address cost effectiveness and reproducibility in actual clinical practice. Given the inter-laboratory variability of measurements of TSB (Gourley, 1999; Schreiner and Glick, 1982; Vreman, Verter, Oh, et al., 1996), future studies using HPLC should be used as the reference standard along with the routine laboratory methods of TSB in use when evaluating noninvasive measures of bilirubin.
| Study Characteristics and Pooled Results | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| # | Study | TP/FN | FP/TN | Sens | 95% CI | Spec | 95% CI | 1/Var. | Threshold |
| 1 | Lin | 12/6 | 57/230 | 0.67 | 0.41–0.86 | 0.80 | 0.75–0.84 | 3.9427 | Var |
| 2 | Bilgen | 17/0 | 35/44 | 1.00 | 0.80–1.00 | 0.56 | 0.44–0.67 | 0.8560 | 13 |
| 3 | Knudsen, 1993 | 30/0 | 28/15 | 1.00 | 0.88–1.00 | 0.35 | 0.21–0.51 | 0.8577 | 15 |
| 4 | Tsai | 19/2 | 16/141 | 0.90 | 0.68–0.98 | 0.90 | 0.84–0.94 | 2.1373 | 16 |
| 5 | Bhutta | 35/5 | 28/35 | 0.88 | 0.72–0.95 | 0.56 | 0.43–0.68 | 3.8087 | 17 |
| 6 | Knudsen, 1990 | 24/4 | 66/113 | 0.86 | 0.67–0.95 | 0.63 | 0.56–0.70 | 3.6711 | 17 |
| 7 | Harish | 24/1 | 14/21 | 0.96 | 0.78–1.00 | 0.60 | 0.42–0.76 | 1.4773 | 18 |
| 8 | Schumacher | 16/1 | 20/69 | 0.94 | 0.69–1.00 | 0.78 | 0.67–0.85 | 1.5284 | 20 |
| 9 | Karrar | 36/13 | 11/95 | 0.73 | 0.59–0.85 | 0.90 | 0.82–0.94 | 5.1256 | 21 |
| 10 | Taha | 11/15 | 8/96 | 0.69 | 0.42–0.88 | 0.92 | 0.85–0.96 | 2.5759 | 22.5 |
| 11 | Maisels | 7/0 | 5/145 | 1.00 | 0.59–1.00 | 0.97 | 0.92–0.99 | 0.6847 | 24 |
| Total (Range) | 268 | 1292 | 0.67–1.00 | 0.35–0.97 | |||||
| REM Pooled | 0.85 | 0.77–0.91 | 0.77 | 0.66–0.85 | |||||
Var = TcB ≥ 11, ≥ 16, or ≥ 20 on the 1st, 2nd, and following days of life
| Study Characteristics and Pooled Results | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| # | Study | TP/FN | FP/TN | Sens | 95% CI | Spec | 95% CI | 1/Var. | Threshold |
| 1 | Dai | 8/0 | 11/21 | 1.00 | 0.63–1.00 | 0.66 | 0.47–0.81 | 0.7232 | 17 |
| 2 | Sheridan-Per | 4/0 | 13/40 | 1.00 | 0.40–1.00 | 0.75 | 0.61–0.86 | 0.6259 | 20 |
| 3 | Fok | 16/0 | 64/122 | 1.00 | 0.79–1.00 | 0.66 | 0.58–0.72 | 0.8706 | 22 |
| Total(Range) | 28 | 271 | 1.00–1.00 | 0.66–0.75 | |||||
| REM Pooled | 0.95 | 0.77–0.99 | 0.67 | 0.61–0.73 | |||||
| Study Characteristics and Pooled Results | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| # | Study | TP/FN | FP/TN | Sens | 95% CI | Spec | 95% CI | 1/Var. | Threshold |
| 1 | Knudsen 1992 | 21/7 | 38/72 | 0.75 | 0.55–0.89 | 0.65 | 0.56–0.74 | 4.6245 | 9 |
| 2 | Knudsen 1990 | 68/27 | 21/91 | 0.72 | 0.61–0.80 | 0.81 | 0.73–0.88 | 9.2969 | 10.2 |
| 3 | Maisels | 20/2 | 14/121 | 0.91 | 0.70–0.98 | 0.90 | 0.83–1.94 | 2.1111 | 20 |
| Total (Range) | 145 | 357 | 0.72–0.91 | 0.65–0.90 | |||||
| REM Pooled | 0.76 | 0.64–0.85 | 0.80 | 0.63–0.91 | |||||
The results of the meta-analysis of correlation coefficients should be interpreted carefully due to several limitations:
The correlation coefficient does not provide any information about the clinical utility of the diagnostic test.
Although correlation coefficients (r) measure the association between transcutaneous bilirubin and “standard” serum bilirubin measurements, the correlation coefficient is highly dependent on the distribution of serum bilirubin in the study population selected.
Correlation measures ignore bias and measure relative rather than absolute agreement.
The analytic issues, which arose regarding potential duplication of information, were described in Chapter 2, Methodology.
The meta-analysis of correlation coefficients figures on the following pages indicate:
Abbreviated name of the measurement site/metric being analyzed (e.g. forehead)
Article citation (author and year)
Number of subjects used in the comparison
Graphic description of the individual effects and 95% confidence intervals
Combined random effects model estimate and its confidence bounds
[Titles Display] [Main Search Page]
------------------------------------------------------------------------
MEDLINE <1966 to September Week 3 2001>
------------------------------------------------------------------------
| # | Search History | Results |
|---|---|---|
| 1 | exp Hyperbilirubinemia/ | 15774 |
| 2 | exp Hyperbilirubinemia, Hereditary/ | 1234 |
| 3 | exp Bilirubin/ | 14754 |
| 4 | exp Jaundice, Neonatal/ | 4492 |
| 5 | exp Kernicterus/ | 547 |
| 6 | (bilirubin or hyperbilirubin$).tw | 16111 |
| 7 | kernicterus.tw. | 347 |
| 8 | jaundice.tw. | 13434 |
| 9 | neonat$.tw. | 98990 |
| 10 | 1 or 2 or 3 or 6 or 8 | 41610 |
| 11 | limit 10 to newborn infant | 7645 |
| 12 | (6 or 8) and 9 | 2900 |
| 13 | 4 or 5 or 11 or 12 | 8588 |
| 14 | limit 13 to human | 8431 |
| 15 | limit 14 to english language | 5746 |
| 16 | Case Report/ | 1022282 |
| 17 | 15 not 16 | 5063 |
| limit 17 to (addresses or bibliography or biography or comment | ||
| 18 | or dictionary or directory or editorial or festschrift or interview or lectures or legal cases or letter or news or overall or periodical index) | 304 |
| 19 | 17 not 18 | 4759 |
| limit 19 to (guideline or meta analysis or practice guideline | ||
| 20 | or review or review literature or review, academic or review multicase or review, tutorial) | 479 |
| 21 | 19 not 20 | 4280 |
[Titles Display] [Main Search Page]
------------------------------------------------------------------------
PREMEDLINE <September 24, 2001>
-------------------------------------
| # | Search History | Results |
|---|---|---|
| 1 | (bilirubin or hyperbilirubin$).tw. | 199 |
| 2 | kernicterus.tw. | 6 |
| 3 | jaundice.tw. | 156 |
| 4 | neonat$.tw. | 1541 |
| 5 | (1 or 3) and 4 | 49 |
| 6 | [Case Report/] | 0 |
| 7 | 2 or 5 | 51 |
| 8 | limit 7 to english language | 45 |
| 9 | limit 8 to review articles | 0 |
| 10 | from 8 keep 1–45 | 45 |
















| Description | |
|---|---|
| AAP | American Academy of Pediatrics |
| ABR | auditory brainstem response |
| AGA | birthweight was appropriate for gestational age |
| AHRQ | Agency for Healthcare Research and Quality |
| ARR | Absolute risk reduction |
| AUC | Areas under the ROC curve |
| BAER/BAEP | brainstem auditory evoked response / brainstem auditory evoked potential |
| BET | blood exchange transfusion |
| BMJ | Breast-milk jaundice |
| BNBAS | Brazelton neonatal behavioral assessment scale |
| BW | Birthweight |
| BSID | Bayley Scales of Infant Development |
| CB | Cord-blood bilirubin |
| CNS | Central nervous system |
| CPP | Collaborative Perinatal Project |
| C/S | cesarean section |
| DDST | Denver Developmental Screening Test |
| DQ | Developmental Quotient |
| ETCOc | End-tidal carbon-monoxide concentration |
| EEG | electroencephalograph |
| EFS | Educational Follow-up Study |
| EGA | Estimated gestational age |
| ET | Exchange Transfusion |
| GA | Gestational age |
| G6PD | glucose-6-phosphate dehydrogenase deficiency |
| HC | Head circumference |
| HPLC | High performance liquid chromatography |
| IMS | infant motor screening |
| IPL | interpeak latency |
| IQ | Intelligence Quotient |
| ITPA | Illinois test of Psycholinguistic Ability |
| Mo | Months |
| MR | mental retardation |
| MRT | Metropolitan Readiness Tests |
| NICHD | National Institute of Child Health and Human Development |
| ND | No data |
| NNT | Number Needed to Treat |
| OD | Optical densities |
| PB | Phenobarbital |
| PPV | Positive predictive value |
| PhotoRx | Phototherapy |
| Rh | Rhesus |
| ROC | Receiver operating characteristics |
| SB | Serum bilirubin |
| SROC | Summary receiver operating characteristics |
| TcB | Transcutaneous bilirubin |
| TSB | total serum bilirubin |
| VEP | visual evoked potentials |
| Wks | weeks |
| Yrs | years |
The Evidence-based Practice Center staff acknowledges the collaboration of technical experts and Peer Reviewers in the preparation of this evidence report.
Technical Experts
American Academy of Family Physicians
William Hueston, MD
Department of Family Medicine
Medical University of South Carolina
Charleston, South Carolina
American Academy of Pediatrics
M. Jeffrey Maisels MD
Department of Pediatrics
William Beaumont Hospital
Royal Oak, Michigan
Thomas B. Newman, MD, MPH
Department of Epidemiology
University of California, San Francisco
San Francisco, California
David K. Stevenson, MD
Department of Pediatrics
Stanford University Medical School
Stanford, California
National Association of Pediatric Nurse Practitioners
Cheryl Aldridge MN, RN
Argyle, Texas
Center for Quality of Care Research and Education
R. Heather Palmer, M.B.B.Ch., S.M
Professor of Health Policy and Management,
Director, Center for Quality of Care Research and Education
Boston, Massachusetts
Parents of Infants and Children with Kernicterus
Ms. Sue Sheridan
Boise, Idaho
Peer Reviewers
American Academy of Family Physicians
Diane J. Madlon-Kay, MD
Minneapolis, Minnesota
Martin C. Mahoney, MD, PhD, FAAFP
Clarence, New York
American Academy of Pediatrics
Charlie Homer, MD, MPH, President and
National Initiative for Children's Healthcare Quality
Institute for Healthcare Improvement
Boston, Massachusetts
National Association of Pediatric Nurse Practitioners
Julie Novak, DNSc, RN, MA, CPNP
Lafayette, Indiana
Deborah Parks, DSN, RN, CPNP
Houston, Texas
Centers for Disease Control and Prevention
National Center for Birth Defects and Developmental Disabilities
Rachel Avchen, MS, PhD
Research Scientist
Atlanta, Georgia
National Institute for Child Health and Human Development
Pregnancy and Perinatology Branch
Tonse N.K. Raju, MD
Medical Officer, Neonatal Research
Rockville, Maryland
Free Full text in PMC]
[PubMed]
Free Full text in PMC]
[PubMed]
Free Full text in PMC]
[PubMed]
Free Full text in PMC]
[PubMed]
Free Full text in PMC]
[PubMed]