Chapter 1. Introduction
The Department of Health and Human Services (DHHS) Office on Women's Health has
requested an evidence report from the Agency for Healthcare Research and Quality
(AHRQ) through the Evidence-based Practice Center program (EPC) that would
critically examine the literature concerning the relationship of breastfeeding and
various infant and maternal health outcomes. EPC evidence reports summarize evidence
addressing specific key questions; these reports do not make clinical practice or
health policy recommendations.
Breast milk is the natural nutrition for all infants. According to the American
Academy of Pediatrics (AAP), it is the preferred choice of feeding for all
infants.1 The goals of Healthy People 2010 for breastfeeding are an
initiation rate of 75 percent and continuation of breastfeeding of 50 percent at 6
months and 25 percent at 12 months postpartum.2 National Immunization Survey of U.S. children in 2005 (NIS 2005) indicated
that 73 percent had ever been breastfed. The percentage of infants who continued to
breastfeed to some extent is 39 percent at 6 months and 20 percent at 12 months
(www.cdc.gov/breastfeeding/data/NIS_data/ data_2005.htm).
In addition to providing essential nutrients to infants, benefits of breastfeeding
for both children and their mothers have been reported. Reports of the benefits for
children include decreases in incidence of otitis media and gastroenteritis,3 lower risk of obesity,4,
5 and lower risk of asthma.6 Other benefits reported include decreased rates of sudden infant death
syndrome, reduction in the incidence of type 1 and type 2 diabetes mellitus, certain
types of cancer, and improved performance on certain tests of cognitive
development.7
Reported benefits for mothers who breastfed their infants include increased
postpartum uterine activity (inferentially this would lead to reduced postpartum
blood loss),8 greater weight loss postpartum compared with mothers who bottle-fed their
infants,9 decreased incidence of premenopausal breast cancer10 and decreased incidence of ovarian cancer.11
In 2000, the DHHS Office on Women's Health, in cooperation with the Surgeon General
of the United States and several governmental and non-governmental agencies,
published the first departmental policy on breastfeeding, the HHS Blueprint
for Action on Breastfeeding (www.womenshealth.gov). The
DHHS Office on Women's Health endorses the recommendation from the American Academy
of Pediatrics (AAP), American Academy of Family Physicians (AAFP), American College
of Obstetricians and Gynecologists (ACOG), Association of Women's Health,
Obstetrical, and Neonatal Nurses (AWHONN), Le Leche League International, National
Medical Association (NMA), and many other health organizations, that mothers
exclusively breastfeed for 6 months. The DHHS Office on Women's Health has
commissioned a review to systematically examine the evidence for the effects of
breastfeeding. The DHHS Office on Women's Health has also requested that the focus
of the review be on studies from developed countries (i.e., “high
income” classification by World Bank)* as the findings from those studies are deemed more directly applicable to
population in this country.
As it is unethical to randomize subjects into breastfeeding versus non-breastfeeding
groups (although there were some randomized controlled trials (RCTs) in the 1980s on
preterm infants whose mothers desired not to breastfeed, their infants were
randomized into those who received donor breast milk versus those who received
preterm formula12,
13), much of the evidence on the benefits of breastfeeding came from
observational studies. Observational studies are subject to confounding. One of the
well-known confounders in breastfeeding research is demographic difference between
mothers who breastfeed and those who chose not to breastfeed due to self-selection.
Consistent with previously reported data,14 NIS 2005 showed that mothers who breastfeed tend to be white (versus
non-Hispanic black or African American), older, more educated, and in a higher
socioeconomic stratum (www.cdc.gov/breastfeeding/data/NIS_data/ data_2005.htm). While it is
possible to control for some of these demographic factors, it is not possible to
control for behavioral or attitudinal factors intrinsic in the desire to breastfeed.
Some authors have proposed strict standards in evaluating the quality of
observational studies. These standards should include the quality of the feeding
data, a clear definition of the outcome, the elimination of systematic differences
in outcome assessment between comparison groups (detection bias), and the control of
potential and well-known confounders. The feeding data should clearly define whether
it was prospectively or retrospectively collected, whether there was a precise
definition of exclusive breastfeeding, and whether the duration of breastfeeding was
reported.15,
16
Large number of infant and maternal outcomes has been examined in relation to a
history of breastfeeding. It was not feasible to review all possible outcomes in
mothers and children for this report; we sought guidance from our panel of technical
experts in the field of breastfeeding research in deciding on the specific outcomes
to review. After taking into consideration the following factors: relevance and
importance of outcome in a developed country, date (recent or old) of the last
systematic review on the outcome, availability or non-availability of data from a
developed country, consistency or inconsistency of outcomes in previously reported
studies, and consideration of the possibility that breastfeeding may have potential
harms as well as benefits, the following outcomes from developed countries have been
designated for review: for term infants, infectious diseases (including otitis
media, diarrhea, and lower respiratory tract infections), sudden infant death
syndrome, infant mortality, cognitive development, childhood cancer (including
leukemia), type 1 and 2 diabetes, asthma, atopic dermatitis, cardiovascular disease
(including hypertension), hyperlipidemia, and obesity; for preterm infants,
necrotizing enterocolitis (NEC) and cognitive development; for mothers, post-partum
depression, return to pre-pregnancy weight, breast cancer, ovarian cancer, diabetes
and osteoporosis.
It is also outside the scope of this report to examine the biological mechanisms
underpinning the effects of breast milk and therefore, studies on individual
components of breast milk will not be part of this report. Lastly, studies on the
effectiveness of interventions to promote and support breastfeeding are not
systematically covered in this review, as this topic will be reviewed for a
subsequent report. However, there is good quality evidence to support that some of
these interventions do lead to an increase in breastfeeding rates and also an
improvement of certain health outcomes in the study populations. Details of one
landmark study17 and its implications for future research in the study of the effects of
breastfeeding will be discussed in some details in this report.
Chapter 2. Methods
Overview
This evidence report on breastfeeding and health outcomes in infants and mothers
is based on a systematic review of the literature. To identify the specific
issues central to this report, the Tufts-New England Medical Center (Tufts-NEMC)
Evidence-based Practice Center (EPC) held teleconferences with a panel of
technical experts (TEP) and various stakeholders. A comprehensive search of the
medical literature was conducted to identify studies addressing the key
questions. Evidence tables of study characteristics and results were compiled,
and the methodological quality of the studies was appraised. Study results were
summarized with qualitative reviews of the evidence, summary tables, and
quantitative summary data, when appropriate.
A number of individuals and groups supported the Tufts-NEMC EPC in preparing this
report. The TEP served as our science partner. Technical experts and
representatives from the Agency for Healthcare Research and Quality (AHRQ), DHHS
Office on Women's Health, The National Institute of Child Health and Human
Development (NICHD), Centers for Disease Control and Prevention (CDC), American
Academy of Pediatrics (AAP), American College of Obstetrics and Gynecology
(ACOG), Association of Women's Health, Obstetrics and Neonatal Nurses (AWHONN),
and La Leche League worked with the EPC staff to refine key questions, identify
important issues, and define parameters for literature review in this report.
In the early phase of exploring the literature available for this report, it was
soon discovered that there was a large number of primary studies and systematic
reviews/meta-analyses on the various outcomes of interest. As it was not
feasible to review all the primary studies addressing the outcomes of interest,
therefore, in consultation with the Office on Women's Health and the TEP, we
developed an approach that capitalized on the existing systematic
reviews/meta-analyses. For outcomes of interest that had previously been
reviewed systematically, we assessed the quality of those reviews and summarized
their findings. For selected infant (necrotizing enterocolitis, cognitive
development, acute otitis media, asthma, type 1 and 2 diabetes, SIDS) and
maternal (weight changes, type 2 diabetes, breast cancer) outcomes, in addition
to reporting on the existing systematic reviews, we also updated them by
summarizing the relevant primary studies that were published after those
reviews. For outcomes of interest (osteoporosis, ovarian cancer, postpartum
depression, infant mortality) that had not been reviewed systematically, we
reviewed all the relevant primary studies that met our inclusion criteria.
Key Questions Addressed in This Report
Two key questions are addressed in this report. Question 1 pertains to infant
outcomes and question 2 pertains to maternal outcomes. The key questions
are:
- 1
What are the benefits and harms for infants and children in terms of
short-term outcomes, such as infectious diseases (including otitis
media, diarrhea, and lower respiratory tract infections), sudden
infant death syndrome and infant mortality, and longer-term outcomes
such as cognitive development, childhood cancer (including
leukemia), type 1 and 2 diabetes, asthma, atopic dermatitis,
cardiovascular disease (including hypertension), hyperlipidemia, and
obesity, compared among those who mostly breastfeed, mostly formula
feed, and mixed feed; and how are these outcomes associated with
duration of the type of feeding? Do the harms and benefits differ
for any specific subpopulations based on socio-demographic
factors?
- 2
What are the benefits and harms on maternal health short-term
outcomes, such as postpartum depression and return to pre-pregnancy
weight, and long-term outcomes, such as breast cancer, ovarian
cancer, type 2 diabetes mellitus and osteoporosis, compared among
breastfeeding, formula feeding, and mixed feeding, and how are these
associated with duration of the type of feeding? Do the harms and
benefits differ for any specific subpopulations based on
socio-demographic factors?
It should be emphasized that the focus of this review is on the effects of breast
milk feeding, not formula feeding. However, many studies did not distinguish
between exclusive and partially breastfed infants; presumably, some of the
effects reported from observational studies were from infants who received both
breast milk and formula milk feedings. Studies that examined only formula fed
infants were not included in this report. Lastly, studies on infant and maternal
health outcomes of interventions to promote and support breastfeeding were not
systematically covered in this review as that subject will be covered in a
separate report. However, our panel of technical experts felt that the study of
breastfeeding promotion in infants from Belarus17 was a landmark study and offered new directions into research on effects
from breast milk that it warrants discussion in this report. The details of that
study are described in the section Other Research in the results chapter.
Definitions of Breastfeeding in This Report
None of the studies in this review explicitly examined the difference between
“breastfeeding” an infant (infant suckling at her/his
mother's nipple) and “feeding of expressed breast milk” to
an infant. To distinguish between the two forms of feedings, we elected to use
the term “breastfeeding” when the studies concerned
primarily full-term infants (presumably they were, indeed, breastfed) and the
term “breast milk feeding” when the studies concerned
primarily preterm infants (as most of them received breast milk initially either
by gavage- or by bottle-feeding). For term infants,
“bottle-feeding” is used synonymously with
“formula feeding.”
Definitions of “exclusive breastfeeding” varied widely in the
literature. They ranged from “no supplement of any kind including
water while breastfeeding” to “occasional formula is
permissible while breastfeeding.” We elected to accept all definitions
of “exclusive breastfeeding” as provided by the different
study authors, but we qualified our findings by the details regarding those
definitions.
Literature Search Strategy
We conducted a comprehensive literature search to address the two key questions.
The EPC used the Ovid search engine to conduct searches on the
MEDLINE® database, CINAHL database, and the Cochrane
Database of Systemic Reviews. A wide variety of search terms were used to
capture the many potential sources of information related to the myriad of
different outcomes (see Appendix
A).* But the different outcomes were always searched in conjunction with the
following: “breastfeeding,” “breast milk
feeding,” “breast milk,” “human
milk,” “nursing”, and
“lactation”. Literature search of the outcomes alone without
references to breast milk feeding was not conducted. The search included
citations from 1966 to November of 2005. Updated searches on selected outcomes
took place in April and May of 2006. We also supplemented our computer search by
examining the bibliographies of the review articles. We also included articles
suggested by reviewers, provided that the articles met the inclusion criteria
for this review. For outcomes that were not slated for updates, additional
articles suggested by reviewers were also included as an addendum if they
provided useful information. We did not make efforts to identify unpublished
studies.
Study Selection
Selection of Outcomes for This Review
The TEP offered advice on selection of outcomes for review. Final selection
of the list of outcomes for review took into account the following factors:
the importance of the outcome, whether a systematic review of the outcome
has previously been reported from a developed country, whether the existing
systematic review of the outcome is outdated, whether the relationship
between breastfeeding and the outcome is thought to be equivocal, whether a
large number of primary studies has been published recently on the outcome,
and the total number of outcomes that could be adequately reviewed for this
report given the time constraint.
We included the following outcomes in this report:
Term infant outcomes: acute otitis media, hospitalization
for lower respiratory tract infection, gastrointestinal infection,
hypertension, cardiovascular diseases, hyperlipidemia, asthma, atopic
dermatitis, type 1 and 2 diabetes, obesity, sudden infant death syndrome
(SIDS), infant mortality, cognitive development, and childhood cancer
(including leukemia)
Preterm infant outcomes: necrotizing enterocolitis (NEC)
and cognitive development
Maternal outcomes: maternal weight changes, breast cancer,
ovarian cancer, post-partum depression, osteoporosis, and type 2 diabetes
Abstract Screening
All abstracts identified through the literature search were screened. At this
stage, eligible studies included all English language primary experimental
or observational studies that reported any health outcomes in human subjects
in relation to a history of breast milk feeding. As the inclusion criteria
were broad at this stage, the abstracts rejected at this stage did not
undergo a second rescreening process. Abstracts that were accepted at this
stage were examined a second time by different reviewers and categorized
into the different outcomes of interest.
Full Article Inclusion/Exclusion Criteria
Articles that passed the abstract screening process were retrieved and the
full articles were reviewed for eligibility. Full articles were examined
only once unless the articles were equivocal for inclusion or exclusion. In
that event, the article in question was screened again by a different
reviewer and a consensus was reached after discussion with the first
reviewer.
Because the outcomes selected ranged from very broad topic with common
occurrence (e.g., non-specific gastrointestinal infection) to a narrowly
focused topic with relatively few occurrences (e.g., SIDS), the types of
studies available for each outcome varied widely in the distribution of
study designs, sample sizes, and quality of breastfeeding data, it was not
possible (nor desirable) to design a strict set of inclusion and exclusion
criteria that would be applicable to all outcomes. Therefore, additional
inclusion/exclusion criteria germane to the specific outcome were also
described in the Results section under each outcome.
General inclusion criteria for the studies are as follow:
Study Design. Systematic reviews, experimental (randomized
controlled trials) and observational studies (prospective cohort and
case-control studies only)
Population. Healthy term infants in developed countries;
preterm infants in developed countries (for NEC and cognitive development);
healthy mothers in developed countries
Intervention/Exposure. Breastfeeding, breast milk feeding
(maternal term and preterm milk, banked term and preterm milk, fortified or
unfortified), exclusive or mixed feeding
Comparator. Formula feeding (preterm or term formula,
fortified or unfortified)
Data Extraction and Analysis
For those outcomes that have been subjected to a systematic
review/meta-analysis, we summarized their results into our report. In
addition to the results from the systematic reviews, we have also extracted
and summarized the relevant data from primary studies that were published
after the latest search dates of the reports for the following infant and
maternal outcomes: acute otitis media, childhood asthma, cognitive
development, SIDS, infant mortality, NEC, and maternal breast cancer.
For systematic reviews/meta-analyses that reported data from both developed
and developing countries, we reported only those results pertaining to
developed countries, if the reported data permitted us to do so. In those
instances where that were not possible, we noted that fact as a limitation
of our findings.
For outcomes that have multiple systematic reviews, we noted the overlapping
studies and examined whether their findings were interpreted similarly or
differently across reviews and reported our analyses.
For NEC, maternal weight changes, and acute otitis media, in order to better
clarify the overall findings, we also extracted relevant data from the
primary studies cited in the systematic reviews and combined them with data
from the primary studies that were published after the latest search dates
of the reports and reanalyzed the data.
For the remaining included outcomes, we extracted and summarized the relevant
data from the primary studies.
Data forms were developed separately for extraction of systematic
reviews/meta-analysis and primary studies. For systematic
reviews/meta-analysis, items extracted were: databases searched, study
design, population characteristics, descriptions of intervention/exposure,
models used for meta-analysis, results, and authors' conclusions. We
reported the estimates in the meta-analyses. We also reported any attempt by
the authors of the meta-analyses to explore heterogeneity using sub-group
analyses or meta-regression. For primary studies, items extracted were:
study design, population characteristics, eligibility criteria, descriptions
of intervention/exposure, any adjustments for confounders, and results.
Meta-Analysis
We used meta-analysis to expand on the individual studies' findings, if it was
appropriate and feasible to do so. Minimal criteria for meta-analysis are
comparable groupings, similar study designs, and quantifiable outcome data.
Secondary criteria for consideration of meta-analysis are similar study quality,
similar statistical adjustment of outcomes, and other factors. Before combining
the reported odds ratios or risk ratios reported in the individual studies from
the previous meta-analysis with the estimates from the updated primary studies
into a new summary odds ratio or risk ratio, we verified the previous reported
odds ratios or risk ratios by examining the data from the original studies.
We used the DerSimonian and Laird's random effects model for all
meta-analyses.18 The random effects model assigns a weight to each study based both on
the individual study variance and the between-study heterogeneity. Compared with
the fixed effect model, the random effects model is more conservative in that it
generally results in broader confidence intervals when between-study
heterogeneity is present. We tested for heterogeneity using Cochran's Q and
assessed its extent with I2, which evaluates the proportion of
between study variability that is attributed to heterogeneity rather than
chance.19,
20 Intercooled Stata 8.2 was used for the calculations and graphics.
Grading of Studies Analyzed in This Evidence Report
Studies accepted in evidence reports have been designed, conducted, analyzed, and
reported with various degrees of methodological rigor and completeness.
Deficiencies in any of these processes may lead to biased reporting or
interpretation of the results. While it is desirable to have a simple evidence
grading system using a single quantity, the quality of evidence is
multi-dimensional. A single metric cannot adequately capture information needed
to interpret a clinical study. However, grading of information can help the
reader to interpret the studies properly.
Grading of Systematic Reviews/Meta-Analyses
We assessed the methodological quality of studies based on predefined criteria.
For the assessment of systematic reviews, the criteria for methodological
quality was based on the QUOROM guidelines for meta-analyses and systematic
reviews of RCTs (a checklist organized into 21 headings and subheadings for the
preferred way to present the abstract, introduction, methods, results, and
discussion sections of a report of a meta-analysis),21 and reporting guidelines for meta-analysis in observational studies in
epidemiology (MOOSE) (a checklist for the specifications for presenting
background, search strategy, methods, results, discussion, conclusion, and
assessment of quality of individual studies and bias (e.g., publication
bias)a).22
b As the QUOROM and the MOOSE statements were primarily concerned with the
reporting standards of the reviews, we have also supplemented those criteria
with our own checklist of items designed to evaluate the quality of the
systematic review of observational studies (see Appendix B
b for details). Items in this checklist consisted of questions on
appropriate search strategy; justification for inclusion/exclusion criteria; how
well-defined were the population, intervention, comparator, outcomes and study
design; effort to minimize errors in data extraction; assessment of individual
study quality; consideration on the effect of confounders; combinability of the
data for meta-analysis; assessment of statistical and clinical heterogeneity;
reporting accuracies; and appropriateness of the conclusions based on the
reported data.
We applied a three category summary grading system (A, B, C) to each systematic
review/meta-analysis:
A (good)
Category A studies have the least bias and results are considered valid. A
study that adheres mostly to the commonly held concepts of high quality
including the following: a rigorously conducted systematic review or
meta-analysis; clear description of the population, setting, interventions
and comparison groups; clear description of the content of the comparison
groups; appropriate measurement of outcomes; appropriate statistical
assumptions and analytic methods and reporting; appropriate consideration
and adjustment for potential confounders; rigorous assessment of individual
study quality; no reporting errors; and well-reasoned conclusions based on
the data reported.
B (fair/moderate)
Category B studies are susceptible to some bias, but not sufficient to
invalidate the results. They do not meet all the criteria in category A
because they have some deficiencies, but none of which are likely to cause
major biases. The study may have suboptimal adjustment for potential
confounders. The study may also be missing information, making it difficult
to assess limitations and potential problems.
C (poor)
Category C studies have significant biases that may invalidate the results.
The study either did not consider potential confounders or did not adjust
for them appropriately. These studies have serious errors in design,
analysis or reporting; have large amounts of missing information, or
discrepancies in reporting.
It should be noted that while we assessed the methodological quality of the
systematic reviews or meta-analyses, it was not possible to evaluate the quality
of the primary studies included in those reviews/analyses, as we did not examine
those studies first hand. For systematic reviews/meta-analyses that had
equivocal grading between moderate and poor, the results and the reasons for the
initial grade assignments were presented to the entire group of project
investigators and the final grades were adjudicated.
Grading of Individual Primary Studies in Updates and New Reviews
A well-performed RCT with proper randomization, allocation concealment, clear
definitions of breastfeeding exposure compared with non-breastfeeding, and
blinded assessment of outcomes will yield the best evidence in supporting the
causality of breast milk in affecting health outcomes. But with the recognized
benefits of breast milk, this approach is ethically not feasible. Other types of
studies involve following the health outcomes from randomization of intervention
to promote and support breastfeeding (e.g., Belarus study17), this type of study will yield indirect evidence for the relationship
between breastfeeding and health outcomes provided that there is a differential
effect from the intervention on breastfeeding rates between the comparison
groups. Prospective observational cohort studies with proper adjustment of
potential confounders provide the bulk of data in this field. However, the
possibility of residual confounding that could explain the observed association
between breastfeeding and the specific health outcomes can never be completely
ruled out. Case-controlled design is an even less attractive option because of
the concern for case selection bias and suboptimal matching to control subjects.
For the assessment of RCTs, the criteria were based on the CONSORT statement for
reporting RCTs (a checklist with specifications for reporting all aspects of a
trial).23,
24 We mainly considered the methods used for randomization, allocation
concealment, and blinding as well as the use of intention-to-treat analysis, the
report of well-described valid primary outcomes, and the dropout rate. For
non-randomized trials, we used the report of eligibility criteria and assessed
the adequacy of controlling for differences between comparative groups in terms
of baseline characteristics and prognostic factors. We also considered the
report of intention-to-treat analysis, and the crossovers when so designed, as
well as important differential loss to followup between the comparative groups
or overall high loss to followup. The validity and the adequate description of
outcomes and results were also assessed. For the assessment of prospective
cohorts and case-control studies (cross-over design and retrospective cohort
studies were excluded from this review), we used a rating checklist largely
based on the Newcastle-Ottawa Quality Assessment scales for cohort and
case-control studies (www.ohri.ca/programs/clinical_epidemiology/oxford.htm). Items
assessed included selection of cases and controls or cohorts, comparability,
information concerning exposure/intervention, consideration for potential
confounders, and percentage of withdrawals or dropouts. In particular, we paid
close attention to the quality of the breastfeeding data, whether they were
obtained prospectively or by retrospective recall, whether a distinction was
made between exclusive and partial breastfeeding, and whether the duration of
breastfeeding was reported. We also paid close attention to consideration of and
appropriate adjustment for potential confounders.
We applied a three category summary grading system (A, B, C) to each study. This
system defines a generic grading system that is applicable to each type of study
design including randomized controlled trials, cohort, and case-control studies:
A (good)
Category A studies have the least bias and results are considered valid. A
study that adheres mostly to the commonly held concepts of high quality
including the following: clear description of the population, setting,
interventions and comparison groups; clear description of the comparison
groups; appropriate measurement of outcomes; appropriate statistical and
analytic methods and reporting; no reporting errors; less than 20 percent
dropout; clear reporting of dropouts; and appropriate consideration and
adjustment for potential confounders.
B (fair/moderate)
Category B studies are susceptible to some bias, but not sufficient to
invalidate the results. They do not meet all the criteria in category A
because they have some deficiencies, but none of which are likely to cause
major biases. The study may have suboptimal adjustment for potential
confounders. The study may also be missing information, making it difficult
to assess limitations and potential problems.
C (poor)
Category C studies have significant biases that may invalidate the results.
The study either did not consider potential confounders or did not adjust
for them appropriately. These studies have serious errors in design,
analysis or reporting; have large amounts of missing information, or
discrepancies in reporting.
For primary studies that had equivocal grading between moderate and poor, those
studies were reviewed and graded again by different reviewers and consensus was
reached after discussion among the reviewers. Lastly, it should be noted that
the summary quality grading system evaluates and grades the studies within their
own design strata. It does not attempt to assess the comparative validity of
studies across different design strata. Thus, one should be cognizant of the
study design when interpreting the methodological quality grade of a study.
Reporting of the Evidence
We reported each outcome separately in its own section in the results chapter. A
brief explanation of the importance of the outcome is followed by a description
of inclusion/exclusion criteria of studies examined that are specific to that
outcome. A description of the relevant systematic review is followed by a
description of the primary studies that were published after the latest search
date of the systematic review. A summary table highlighted important findings. A
summary conclusion regarding breast milk and that particular outcome is made in
the last section. Conclusions were drawn only from studies of high or moderate
(grade A or B) methodological quality. For dichotomous outcome, either the
summary risk ratio or odds ratio is reported. For continuous outcome, the
comparative difference in the actual measurement for that outcome is reported
(e.g., IQ points, mm Hg of blood pressure). When only studies of C quality are
available, we summarize the findings from those studies and explain the reasons
for the "C' rating, but we do not draw conclusions from them.
Extracted data are compiled in evidence tables. The tables offer a detailed
description of the studies that addressed each of the key questions. The tables
(see Appendix C)a provide detailed information about the study design, the sample size, the
patient characteristics, the intervention and comparison group feeding methods,
the followup, the major outcomes, and the methodological quality. In addition,
for systematic reviews and meta-analyses, we reported the databases searched and
for which time period, the number and the type of primary studies included, and
the type of comparison addressed.
Summary tables succinctly report summary measures of the main outcomes evaluated.
They include information regarding study design, intervention and comparison
group, feeding methods, study duration or followup, sample size (subjects
enrolled and analyzed in each arm), potential confounders, results of major
outcomes, and methodological quality. These tables were developed by condensing
information from the evidence tables. They are designed to facilitate
comparisons and synthesis across studies. A methodological quality was assigned
to each study as described previously.
Chapter 4. Discussion
Twenty-three outcomes were analyzed in this report. Approximately 400 articles would
needed to be reviewed if only articles with primary data were included; this is a
much larger volume of literature than can be feasibly reviewed within the time
period of this report. With the availability of many published systematic reviews on
breastfeeding, we used this literature as the evidence for a large number of
outcomes, supplemented by updates of these systematic reviews with new primary
studies. We performed several new systematic reviews on outcomes not previously
reported. The existing systematic reviews were conducted over a wide span of time
and by diverse groups of investigators; there were large variations in the approach
and quality of these reviews.
Even though we have assessed the reporting quality of these systematic reviews (using
standards of reporting of systematic reviews of observational studies - MOOSE
statement22 and additional parameters that we devised), we cannot reliably know the
validity of the reported summary data without knowing the details of the primary
studies. A number of systematic reviews reported that inclusion and exclusion of
some primary studies were reached by consensus between at least two investigators.
Without knowing the details of how those consensuses were reached, it would be
difficult to replicate the findings in those reviews as it is quite plausible that
someone who is not familiar with the details of the consensus might have come up
with a different set of studies for inclusion in the review. It should also be
stressed that a well-performed systematic review does not necessarily imply that the
body of evidence for a particular outcome of interest is of high quality. While some
systematic reviews assessed the quality of the individual studies, the methods used
varied. Any systematic review is limited by the quality of the primary studies
included in the review. Unless the method used to assess the quality of the primary
studies is transparent and the details made available for examination, it would be
difficult to reliably determine the validity of the conclusions.
In most circumstances, it would be unethical to randomize mother-infant pair into
breastfeeding (or breast milk feeding) or not breastfeeding arm in a trial.
Therefore, the breastfeeding literature is primarily comprised of observational
studies, either cohort or case-control studies. There are a number of potential
deficiencies related to the study designs that could limit the internal validity and
the generalizability of the findings. Some of these potential deficiencies include
(1) misclassification of exposure; (2) confounding from the process of
self-selection; (3) residual confounding; and (4) insufficient statistical power.
Misclassification of exposure (breastfeeding status/duration) is likely in the
studies reviewed in this report. Most studies relied on mothers' recall for the data
on breastfeeding. Recall is prone to error. One study from South Africa reported
that at 6 to 9 months post-delivery, 13 percent of mothers could not remember the
specific timing when they gave something other than breast milk to their infant. In
those mothers who could remember, 57 percent of them overestimated the duration of
exclusive breastfeeding by about 8 weeks, and 15 percent underestimated the duration
by about 3 weeks.215 Misclassification, may bias the effect estimate; particularly if the recall
error is nonrandom, such as in studies where cases are more likely to underestimate
the amount of breastfeeding than controls (for an example, see Norris and Scott
199690).
In studies where subjects were self-selected, there could be confounding from the
process of self-selection (e.g., if subjects who perceive their diseases are due to
a lack of breastfeeding were more likely to participate in the study).
Residual confounding is a possibility for all observational studies, because it is
difficult (if not impossible) to control for all potential confounding variables in
these studies. Although it is possible to control for differences in demographic
factors, it may not be possible to control for behavioral factors intrinsic in the
desire to breastfeed.
Large sample size is often needed to examine the relationship between breastfeeding
and various diseases and health conditions because of the need to adjust for
numerous confounders to minimize all the potential biases described earlier. It is
impossible to predict how these different limitations may interact to increase or
decrease the effect estimate.
Compounding the issue of less-than-ideal study design are the heterogeneity of the
breast milk itself and differences in how the feedings of breast milk were defined
across different studies. The composition of breast milk varies both within and
between individuals.216,
217 The composition could vary depending on preterm versus term delivery, the
maternal diet, maternal body weight, time of day, beginning versus near the end of
feed, first few months of lactation versus later lactation, milk volume, and
numerous other factors. On the other hand, the composition of formulas has also
changed significantly over the last twenty years. For example, contemporary formulas
have added ingredients like nucleotides and long-chain polyunsaturated fatty acids
that were absent from older formulations. How the heterogeneity both within and
between comparators would affect the effect estimate is unclear. Also, studies
defined breastfeeding differently. Many studies did not have a category of
“exclusive breastfeeding”. In the ones that did have this
category, “exclusive” could mean no supplement of any kind
including water or it could mean occasional formula supplement is permissible. This
mixing in of formula in the “exclusive” breastfeeding group may
potentially dilute the true effect of breast milk and bias the results toward the
null finding. In addition, no study in this review examined the differences between
actually breastfeeding an infant and bottle or gavage feeding an infant with breast
milk. How the act of breastfeeding itself plays a role in the different effects
measured is unknown.
We have summarized the effects of breastfeeding (or breast milk feeding) on a large
number of infant and maternal outcomes. Some of the outcomes are well defined and
specific (e.g., childhood acute lymphocytic leukemia, breast cancer); and some are
not so well defined and non-specific (e.g., asthma, gastrointestinal infections).
When the reported outcome is well defined and specific, it lends confidence that the
effect reported is valid for that outcome. When the reported outcome is not well
defined, one might have some reservation regarding the validity of the measured
effect for that outcome.
For all the above reasons, we find that there is a wide range of quality of evidence
for the different outcomes examined in this review.
For severe lower respiratory tract diseases, good quality studies did find a
relationship between breastfeeding and a reduction in the risk of hospitalization
secondary to lower respiratory tract diseases.
For acute otitis media, the results from our meta-analyses of cohort studies of good
and moderate methodological quality showed that breastfeeding was associated with a
significant reduction in the risk of acute otitis media. Comparing ever
breastfeeding with exclusive bottle-feeding, the pooled adjusted odds ratio of acute
otitis media was 0.77 (95%CI 0.64 – 0.91). When comparing exclusive
breastfeeding with exclusive bottle-feeding, either for more than 3 or 6 months
duration, the pooled odds ratio was 0.50 (95%CI 0.36 – 0.70).
For non-specific gastroenteritis, one systematic review identified three primary
studies that controlled for potential confounders. These studies reported that there
was a reduction in the risk of non-specific gastrointestinal infections during the
first year of life in breastfed infants from developed countries, although the
observed range of risk reduction was wide. However, one recent case-control study of
304 infants (167 cases and 137 controls) from England showed that the infants who
were breastfeeding had a reduced risk of diarrhea compared to infants who were not
breastfeeding (adjusted OR 0.36, 95% CI 0.18 to 0.74, P=0.005). The result was
adjusted for age, sex, social class, contact with person in and outside household,
and other factors. Also, analysis of nested observational cohorts from the Belarus
trial showed that the group of infants who were exclusively breastfed for at least 6
months compared to the group of infants who were breastfed for 3 to 6 months had a
statistically significant reduced risk of one or more episodes of gastrointestinal
infection in the first 12 months of life (adjusted OR = 0.61; 95%CI
0.41–0.93).214 The result was adjusted for geographic origin, urban versus
rural location, maternal education, and number of siblings in the household.
For necrotizing enterocolitis (NEC) in preterm infants, our meta-analysis of four
RCTs found a marginally statistically significant reduction (5% risk difference) of
the NEC risk with breast milk feeding. Taking into account the high case-fatality
rate of NEC, we consider this estimate is of meaningful clinical difference.
However, One must be cognizant of the clinical heterogeneity underlying these RCTs
in interpreting the findings of the meta-analysis. Three of the four RCTs were
published in the 1980's. Whether infants born in the early 1980's should be combined
with infants born in the late 1990's into a meta-analysis is debatable. Neonatal
care has made tremendous strides in the last 20 years. Present day preterm formula
milk is vastly different from preterm formula milk 20 years ago. All the studies had
patient populations that were quite heterogeneous, gestational age ranged from 23
weeks to more than 33 weeks and birth weight ranged from less than 1000 g to more
than 1,600 g. One study included only “healthy” infants, another
included both “healthy” and “ill” infants.
In addition, the types of breast milk, the methods of feedings, and the times of
enrollments into the trials were all different. How the heterogeneity in the studies
affected the findings is not clear. In addition, studies examining the issue of NEC
were frequently also examining the issue of neonatal sepsis, as it is not possible
to have NEC without concomitant sepsis. In future studies, it would be worthwhile to
examine the relationship of breast milk exposure and sepsis in preterm infants.
For asthma, our subgroup analysis showed that breastfeeding was associated with a
reduced risk in children under 10 years of age with a positive family history.
However, this association does not hold true for older children as one publication
reported a very large adjusted odds ratio (OR 8.7, 95%CI 3.4 – 22.2) for
developing asthma in children 6 to 13 years of age who were exclusively breastfed
for at least 4 months and had a positive history of maternal asthma.56 The relationship of breastfeeding, maternal history, and long-term outcome
of asthma bears further investigation.
For atopic dermatitis, available evidence from one well-performed systematic review
on full term infants in developed countries suggest that exclusive breastfeeding for
at least 3 months confer a protective advantage in the development of atopic
dermatitis in those subjects with a family history of atopy. The systematic review
did not make a distinction between atopic dermatitis of infancy (under 2 years of
age) and persistent or new atopic dermatitis at older ages. This is important
because the diagnosis of atopic dermatitis in patients younger than 2 years of age
are sometimes attributed to symptoms of infectious origin and breastfeeding may have
a protective effect against infections. But a stratified analysis by different
durations of followup showed that the risk reduction was similar in those with less
than 2 years compared with more than 2 years of followup.
For cognitive outcome in term infants from developed countries, sibling analysis and
prospective studies that controlled specifically for maternal intelligence found
little or no evidence to support an association between breastfeeding and cognitive
performance in children. Most of the published studies adjusted their analyses for
socioeconomic status and maternal education but not specifically for maternal
intelligence. For those studies that still reported a significant effect after
specific adjustment for maternal intelligence, residual confounding from other
factors like different home environments cannot be ruled out.
No definitive conclusion regarding the relationship of breast milk exposure and
cognitive development in preterm infants can be drawn at this time. Studies that
controlled for maternal intelligence reported conflicting results. In addition to
maternal intelligence, comorbidities (e.g., neurological impairment, extremely low
birth weight, other neonatal illnesses), early intervention, environmental, and
socioeconomic factors should also be controlled for in future investigation of this
relationship.
For adult blood pressure, evidence suggests that there is an association between a
history of breastfeeding during pregnancy and a small reduction in adult blood
pressure, but the clinical or public health implication of this finding is unclear.
Furthermore, The association weakened after stratification by study size, suggesting
the possibility of bias.
For adult cholesterol, a lack of explicit analysis of potential confounders in the
meta-analysis hampered the conclusion drawn from the study. Therefore, the
relationship between breastfeeding and adult cholesterol levels cannot be adequately
addressed at this time.
For cardiovascular mortality in adults, the meta-analysis was limited by the
statistical heterogeneity across studies, apparent outcome modification by
differences in gender (and therefore, calls into question the appropriateness of
combining outcomes from men and women into a single analysis), and more than 30% of
the subjects dropped out in the studies. Because of these reasons, no definitive
conclusions can be drawn regarding the relationship between breastfeeding and
cardiovascular mortality. Further investigation is warranted.
For Sudden Infant Death Syndrome (SIDS), our meta-analysis included only studies that
reported clear definitions of exposure, outcomes, and results adjusted for
well-known confounders or risk factors for SIDS. The summary estimate found a
statistically significant adjusted odds ratio for an association between
breastfeeding and a reduced risk of SIDS (adjusted OR 0.64, 95%CI 0.51 –
0.81). We conclude that there is a relationship between breastfeeding and a reduced
risk of SIDS. One must be cautious in interpreting this relationship, however. As
this finding stems from analysis of observational studies, this finding cannot prove
causality. It is plausible that infants who breastfed and breastfed well are less
prone to SIDS because of some yet unclarified neurophysiological reasons, and not
because breastfeeding itself directly confers a protective effect. Further
investigation is warranted.
For post-neonatal mortality (excluding SIDS), there are insufficient data to
characterize the relationship between breastfeeding and post-neonatal infant
mortality adequately. Further investigation is warranted.
For childhood leukemia, available evidence suggests that there is an association
between breastfeeding and a reduced risk of acute lymphocytic leukemia and acute
myelogenous leukemia. Our findings from the meta-analyses of the three
case-controlled studies that were graded good or fair quality by one systematic
review were consistent with the results from the other meta-analysis, but with
smaller effect size and smaller statistical significance. Further evaluation of the
biological mechanisms underpinning this relationship while taking into consideration
potential biases can be achieved with more large-scale case-controlled studies
utilizing population-based and socioeconomic status-matched controls.
For obesity, evidence from three systematic reviews and meta-analyses suggests that a
history of breastfeeding is associated with a reduction in the risk of obesity in
later life. However, one must be aware of the possibility of residual confounding in
interpreting this association. The pooled adjusted odds ratio of obesity comparing
ever breastfed to never breastfed was 0.76 (95%CI 0.67–0.86) in one
meta-analysis and 0.93 (95%CI: 0.88–0.99) in the other. The magnitude of
effects was reduced when more confounders were adjusted in these analyses.
For type 2 diabetes, based on findings from a high-quality systematic review and
meta-analyses of seven studies, early breastfeeding was associated with a lower risk
of type 2 diabetes in later life compared with those initially formula-fed. However,
only three studies appropriately adjusted for all the important confounders,
including birth weight, parental diabetes, socioeconomic status, and individual or
maternal body size. Even though these three studies found that adjustment did not
alter the crude estimate, we cannot be completely confident that potential
confounding by birth weight and maternal factors has been ruled out for the overall
pooled estimate. This potentially could exaggerate the magnitude of the association.
For type 1 diabetes, even though there are some data to support that breastfeeding
for more than 3 months is associated with a reduced risk of type 1 diabetes, this
finding must be interpreted with caution because of the likelihood of recall biases
and suboptimal adjustments for potential confounders in the primary studies.
For postpartum depression, studies of moderate quality reported an association
between not breastfeeding or short duration of breastfeeding and postpartum
depression. It is plausible that postpartum depression led to early cessation of
breastfeeding, as opposed to breastfeeding altering the risk of depression. Both
effects might occur concurrently. Additional factors that may have a bearing on both
postpartum depression and the decision to initiate or terminate breastfeeding should
be sought. Documentation of baseline mental health status before the initiation of
breastfeeding and detailed recording of breastfeeding data will improve the quality
of the studies and help understand the nature of the association.
There is no evidence of an association between lifetime breastfeeding duration and
maternal osteoporosis. Lactation does not appear to have an effect on long-term
changes in bone mineral densities. However, this conclusion is limited by the fact
that the feeding history in the studies was obtained by maternal recall and no data
on exclusivity of breastfeeding were available. Further investigation with accurate
breastfeeding data is warranted.
For breast cancer, there is good evidence to support the observation that
breastfeeding is associated with a reduction in the risk of breast cancer. This
association is more likely in those women with increased lifetime months of
breastfeeding their infants.
For ovarian cancer, there is some evidence to suggest an association between
breastfeeding and a reduction in the risk of maternal ovarian cancer. However, one
must be cautious in interpreting this association because it was largely based on
estimations of the odds ratios from retrospective studies.
For postpartum weight change, we found that the overall effect of breastfeeding on
return-to-pre-pregnancy weight (weight change from pre-pregnancy or first trimester
to 1 to 2 year postpartum) was negligible (less than 1 kg), and the effect of
breastfeeding on postpartum weight change was unclear. Results from the studies also
suggest that many other factors have larger effects on weight retention or
postpartum weight loss than breastfeeding. Methodological challenges in these
studies include the accurate measurement of energy balance, adequate control for
numerous covariables, and quantifying accurately the exclusivity and the duration of
breastfeeding. None of the included studies tackled all of these challenges.
Concerning the risk of maternal type 2 diabetes, a longer duration of lifetime
breastfeeding is associated with a reduced risk of developing type 2 diabetes among
parous women who did not have a history of gestational diabetes (GDM). There was a
difference in the risk of developing type 2 diabetes between women with and without
GDM in relation to lactation. Compared with women who did not have a history of GDM,
women with a history of GDM had a markedly increased risk of type 2 diabetes; and
lactation showed no significant relationship with diabetes risk among this group of
women. One must be cautious in interpreting these findings, as they are only
generalizable to population with characteristics similar to that of the Nurses'
Health cohort.
An important area of research that is not systematically reviewed in this report is
the use of breastfeeding promotion intervention trial to measure health effects
(this topic is not part of the scope of this report and it will be covered in a
separate report). The best known of these types of studies is the previously
described Promotion of Breastfeeding Intervention Trial (PROBIT) conducted in the
Republic of Belarus.17 Data from this study provided good evidence that breastfeeding is associated
with a reduction in the risk of gastrointestinal infection and atopic dermatitis.
Whether results from studies conducted in other countries are applicable to the
United States is unclear. In Belarus, mothers often stay in the hospital close to
one week post delivery, infant formulas can cost as much as 20% of an average
salary, and there is an obligatory prolonged maternity leave (approximately 3 years
in most cases). In contrast, in the United States, mothers are often discharged
within 48 hours post delivery and formula manufacturers provide rebates to the
Special Supplemental Nutrition Program for Women, Infants, and Children (www.wicprogram.org). The
factors in Belarus could work in conjunction with the intervention to help promote
the increase in the rate of exclusive breastfeeding. On the other hand, one may
argue that the results reported in the Belarus study could serve as a best-case
scenario in terms of the potential benefits of breastfeeding when optimal promotion
and support of breastfeeding are in place. More research in this country along the
line of the Belarus study should be considered.
Of note, there were a few individual primary studies on asthma, cardiovascular
mortality, and type 1 diabetes that reported increase in risk of those diseases in
subjects who had been breastfed. Even though those studies were few in numbers,
those findings should not be ignored and further investigation should be done.
Lastly, the outcomes analyzed in this review represent only a portion of all possible
health outcomes related to breastfeeding reported by investigators worldwide. To
work within the constraints of resources, we relied on the advice from our panel of
technical experts in finalizing the list of outcomes included in this review. Thus,
some important outcomes (e.g., growth and nutrition) have, by necessity, not been
included in this review. Additional systematic reviews germane to those important
outcomes would be of value.
Future Research
Assessment of the association between breastfeeding and health
outcomes
Observational studies will remain the major source of information in this field.
Clear subject selection criteria, adopting a common definition of
“exclusive breastfeeding”, reliable collection of feeding
data, specific and properly quantifiable outcomes of interest, controlling for
important potential confounders including child-specific factors, and blinded
assessment of the outcome measures will help immeasurably to improve the quality
of these studies. Traditional retrospective case-control studies, usually used
when the disease is rare, are less desirable because of the many caveats noted
earlier. Prospective nested case-control studies with blinded assessment of the
outcome measures would provide more reliable results.
As have been mentioned previously, it is not possible to eliminate self-selection
bias in observational studies because of behavioral or attitudinal factors
intrinsic in the desire to breastfeed. Thus, it is worthwhile to study these
factors to further understand the reasons for the decision to breastfeed.
Sibling analysis provides a method to control for hereditary and household
factors that are important in certain outcomes, provided that those factors are
similar for the siblings of interest. Although such analysis may be less
susceptible to confounders and effect modifiers that are shared by siblings, one
must remember that it is not immune to biases. This method should be used when
the appropriate data are available.
There is a large degree of heterogeneity across studies among many of the
outcomes. The heterogeneity persisted after adjusting for potential confounders.
It might be helpful to study breast milk composition (e.g., oligosaccharides,
nucleotides, and others) with respect to the residual heterogeneity. In
addition, maternal genetic variations in the production of those factors of
interest from breast milk can be studied (for an example, see the discussion by
Newburg 2005218 concerning the variability of antidiarrheal effect of breastfeeding
according to the prevalence of the secretor gene (fucosyltransferase
2) and the Lewis gene (fucosyltransferase 3) in the
study population).
Assessment of the efficacy/effectiveness of breastfeeding promotion
interventions
Cluster randomized controlled studies similar to the Belarus trial will provide
understanding of the effectiveness of various breastfeeding promotion
interventions. Any substantial differences in the degree of breastfeeding
between the two groups as a result of the intervention will provide further
opportunity to investigate any disparity in health outcomes between the two
groups.