The Agency for Healthcare Research and Quality (AHRQ), through its Evidence-Based Practice Centers (EPCs), sponsors the development of evidence reports and technology assessments to assist public- and private-sector organizations in their efforts to improve the quality of health care in the United States. The report topic was nominated by the American Psychiatric Association (APA) and the Laureate Psychiatric Clinic and Hospital. Funding for this report was provided by the Office of Research on Women's Health at the National Institutes of Health (NIH) and the Health Resources and Services Administration. The reports and assessments provide organizations with comprehensive, science-based information on common, costly medical conditions and new health care technologies. The EPCs systematically review the relevant scientific literature on topics assigned to them by AHRQ and conduct additional analyses when appropriate prior to developing their reports and assessments.
To bring the broadest range of experts into the development of evidence reports and health technology assessments, AHRQ encourages the EPCs to form partnerships and enter into collaborations with other medical and research organizations. The EPCs work with these partner organizations to ensure that the evidence reports and technology assessments they produce will become building blocks for health care quality improvement projects throughout the Nation. The reports undergo peer review prior to their release.
AHRQ expects that the EPC evidence reports and technology assessments will inform individual health plans, providers, and purchasers as well as the health care system as a whole by providing important information to help improve health care quality.
We welcome comments on this evidence report. They may be sent by mail to the Task Order Officer named below at: Agency for Healthcare Research and Quality, 540 Gaither Road, Rockville, MD 20850, or by e-mail to epc@ahrq.gov.
Carolyn M. Clancy, M.D.
Director
Agency for Healthcare Research and Quality
Vivian W. Pinn, M.D.
Director
Office of Research on Women's Health
National Institutes of Health
Sabrina A. Matoff-Stepp, M.A.
Director
Office of Women's Health
Health Resources and Services Administration
Jean Slutsky, P.A., M.S.P.H.
Director, Center for Outcomes and Evidence
Agency for Healthcare Research and Quality
Beth A. Collins Sharp, R.N., Ph.D.
Acting Director, EPC Program
Agency for Healthcare Research and Quality
Marian James, Ph.D.
EPC Program Task Order Officer
Agency for Healthcare Research and Quality
Objectives. The RTI International—University of North Carolina at Chapel Hill Evidence-based Practice Center (RTI-UNC EPC) systematically reviewed evidence on efficacy of treatment for anorexia nervosa (AN), bulimia nervosa (BN), and binge eating disorder (BED), harms associated with treatments, factors associated with the treatment efficacy and with outcomes of these conditions, and whether treatment and outcomes for these conditions differ by sociodemographic characteristics.
Data Sources. We searched MEDLINE®, the Cumulative Index to Nursing and Applied Health (CINAHL), PSYCHINFO, the Educational Resources Information Center (ERIC), the National Agricultural Library (AGRICOLA), and Cochrane Collaboration libraries.
Review Methods. We reviewed each study against a priori inclusion/exclusion criteria. For included articles, a primary reviewer abstracted data directly into evidence tables; a second senior reviewer confirmed accuracy. We included studies published from 1980 to September, 2005, in all languages. Studies had to involve populations diagnosed primarily with AN, BN, or BED and report on eating, psychiatric or psychological, or biomarker outcomes.
Results. We report on 30 treatment studies for AN, 47 for BN, 25 for BED, and 34 outcome studies for AN, 13 for BN, 7 addressing both AN and BN, and 3 for BED.
The AN literature on medications was sparse and inconclusive. Some forms of family therapy are efficacious in treating adolescents. Cognitive behavioral therapy (CBT) may reduce relapse risk for adults after weight restoration.
For BN, fluoxetine (60 mg/day) reduces core bulimic symptoms (binge eating and purging) and associated psychological features in the short term. Individual or group CBT decreases core behavioral symptoms and psychological features in both the short and long term. How best to treat individuals who do not respond to CBT or fluoxetine remains unknown.
In BED, individual or group CBT reduces binge eating and improves abstinence rates for up to 4 months after treatment; however, CBT is not associated with weight loss. Medications may play a role in treating BED patients. Further research addressing how best to achieve both abstinence from binge eating and weight loss in overweight patients is needed.
Higher levels of depression and compulsivity were associated with poorer outcomes in AN; higher mortality was associated with concurrent alcohol and substance use disorders. Only depression was consistently associated with poorer outcomes in BN; BN was not associated with an increased risk of death. Because of sparse data, we could reach no conclusions concerning BED outcomes.
No or only weak evidence addresses treatment or outcomes difference for these disorders.
Conclusions. The literature regarding treatment efficacy and outcomes for AN, BN, and BED is of highly variable quality. In future studies, researchers must attend to issues of statistical power, research design, standardized outcome measures, and sophistication and appropriateness of statistical methodology.
The RTI International-University of North Carolina at Chapel Hill Evidence-based Practice Center (RTI-UNC EPC) conducted a systematic review of the literature on key questions concerning anorexia nervosa (AN), bulimia nervosa (BN), and eating disorders not otherwise specified (EDNOS) (focusing on binge eating disorder [BED]) to address questions posed by the American Psychiatric Association and Laureate Psychiatric Hospital through the Agency for Healthcare Research and Quality (AHRQ). Funding was provided by AHRQ, the Office of Research on Women's Health at the National Institutes of Health, and the Health Resources and Services Administration. We received guidance and input from a Technical Expert Panel (TEP).
We systematically reviewed the evidence on two categories of issues—treatment and outcomes for AN, BN, and BED—in six key questions (KQs): (1) efficacy of treatment, (2) harms associated with treatment, (3) factors associated with the efficacy of treatment, (4) whether efficacy of treatment differs by sex, gender, age, race, ethnicity, or cultural group, (5) factors associated with outcomes, and (6) whether outcomes differ by sex, gender, age, race, ethnicity, or cultural group.
AN is marked by low body weight, fear of weight gain, disturbance in the way in which one's body size is perceived, denial of illness, or undue influence of weight on self-evaluation. Although amenorrhea is a diagnostic criterion, it is of questionable relevance.
BN is characterized by recurrent episodes of binge eating in combination with some form of compensatory behavior. Binge eating is the consumption of an uncharacteristically large amount of food by social comparison coupled with a feeling of being out of control. Compensatory behaviors include self-induced vomiting; misuse of laxatives, diuretics, or other agents; fasting; and excessive exercise.
BED is marked by binge eating in the absence of compensatory behaviors, a series of associated features of binge eating, and marked distress regarding binge eating. Overweight and obesity are commonly seen in individuals with BED.
Although rigorous epidemiologic data are lacking in the United States, the mean prevalence of AN is 0.3 percent, of subthreshold AN 0.37 percent to 1.3 percent, of BN 1.0 percent, and of BED 0.7 percent to 3.0 percent. Mortality from AN is about 5 percent per decade of followup. Treatment for severe AN can involve inpatient or partial hospitalization in costly specialized settings. Inadequate insurance coverage often truncates the recommended duration of treatment. Treatment costs for AN are higher than those for obsessive-compulsive disorder and comparable to those for schizophrenia. In contrast, treatment for BN in the United States is typically on an outpatient basis.
We searched MEDLINE®, the Cumulative Index to Nursing and Applied Health (CINAHL), PSYCHINFO, the Educational Resources Information Center (ERIC), the National Agricultural Library (AGRICOLA), and Cochrane Collaboration libraries. Based on key questions and discussion with our TEP, we generated a list of article inclusion and exclusion criteria. We reviewed studies of humans, ages 10 years and older, of both sexes, published in all languages and from all nations, from 1980 to September 2005. Studies had to include populations diagnosed primarily with AN, BN, or BED and to report on at least one of our outcomes categories of interest: eating-related behaviors, psychiatric and psychological outcomes, and biomarker measures. We reviewed each abstract and article systematically against a priori criteria to determine whether to include it in the review. One reviewer initially evaluated abstracts for inclusion or exclusion. If that reviewer concluded that the article should be included in the review, it was retained. Articles that the reviewer determined did not meet our criteria were re-reviewed by a senior reviewer who could include the article if she disagreed with the initial determination. We assigned each excluded article a reason for exclusion.
The RTI-UNC EPC team abstracted data from included articles directly into evidence tables. For both the treatment and the outcomes literatures, a primary reviewer abstracted data directly into evidence tables; a second (senior) reviewer confirmed accuracy, completeness, and consistency. The two staff reconciled all disagreements about information in evidence tables.
Each abstractor independently evaluated study quality. Because of differences in the treatment and outcomes literature, we evaluated the two bodies of literature using separate criteria. For the treatment literature, our evaluation used 25 items in 11 categories: (1) research aim/study question, (2) study population, (3) randomization, (4) blinding, (5) interventions, (6) outcomes, (7) statistical analysis, (8) results, (9) discussion, (10) external validity, and (11) funding/sponsorship. For the outcomes literature, we evaluated the evidence against 17 items in 8 categories: (1) research aim/study question, (2) study population, (3) eating disorder diagnosis method, (4) study design, (5) statistical analysis, (6) results/outcome measurement, (7) external validity, and (8) discussion.
We focused our analysis on studies that received fair or good quality ratings. This included 19 studies discussed in 22 articles concerning treatment for AN: 38 studies discussed in 48 articles concerning treatment for BN: 20 studies discussed in 21 articles concerning treatment for BED: 26 studies discussed in 32 articles concerning outcomes for AN: 9 studies discussed in 13 articles concerning outcomes for BN: 7 studies discussed in 7 articles concerning outcomes for both AN and BN: and 3 studies discussed in 3 articles concerning outcomes for BED.
Anorexia Nervosa. We divided the treatment literature into medication-only (generally in the context of clinical management or hospitalization), medication plus behavioral intervention, and behavioral intervention only for either adults or adolescents. The literature regarding medication treatments for AN is sparse and inconclusive. The vast majority of studies had small sample sizes and rarely had adequate statistical power to allow for definitive conclusions. Although studies did include medication administered during or after inpatient intervention, no AN studies that systematically combined medication with behavioral interventions met our inclusion criteria, revealing a substantial gap in the literature.
In the behavioral intervention literature, preliminary evidence suggests that cognitive behavioral therapy (CBT) may reduce relapse risk for adults with AN after weight restoration. Sufficient evidence does not exist to determine whether CBT has any effect during the acute phase of the illness, and one study, also requiring replication, showed that a manual-based treatment combining elements of sound clinical management and supportive psychotherapy by a specialist was more effective than CBT during the acute phase. Family therapy as currently conceptualized does not appear to be effective with adults with AN with longer duration of illness. Specific forms of family therapy initially focusing on parental control of renutrition is efficacious in treating AN in adolescents and leads to clinically meaningful weight gain and psychological change. The lack of follow-up data compromises our ability to determine the extent to which treatment gains are maintained.
Bulimia Nervosa. In medication trials, fluoxetine (60 mg/day) administered for 6 weeks to 18 weeks reduced the core bulimia symptoms of binge eating and purging and associated psychological features in the short term. The 60 mg dose performs better than lower doses and is associated with prevention of relapse at 1 year. Evidence for the long-term effectiveness of relatively brief medication treatment does not exist. The optimal duration of treatment and the optimal strategy for maintenance of treatment gains are unknown.
Studies that combine drugs and behavioral interventions provide only preliminary evidence regarding the optimal combination of medication and psychotherapy or self-help. How best to treat individuals who do not respond to CBT or fluoxetine remains a major shortcoming of the literature. For behavioral interventions for BN, CBT administered individually or in group format is effective in reducing the core behavioral symptoms of binge eating and purging and psychological features in both the short and long term. Further evidence is required to establish the role for self-help in reducing bulimic behaviors.
Binge Eating Disorder. For BED, we addressed two critical outcomes—decrease in binge eating and decrease in weight in overweight individuals. Various medications were studied, including selective serotonin reuptake inhibitors (SSRIs); a combined serotonin, dopamine, and norepinephrine uptake inhibitor; tricyclic antidepressants; an anticonvulsant; and one appetite suppressant. In short-term trials, SSRIs led to greater rates of reduction in target eating, psychiatric and weight symptoms, and severity of illness than placebo controls. However, in the absence of clear endpoint data, and in the absence of data regarding abstinence from binge eating, we cannot judge the magnitude of the clinical impact of these interventions. Moreover, in the absence of follow-up data after drug discontinuation, we do not know whether observed changes in binge eating, depression, and weight persist.
The combination of CBT plus medication may improve both binge eating and weight loss, although sufficient trials have not been done to determine definitively which medications are best at producing and maintaining weight loss. Moreover, the optimal duration of medication treatment for sustained weight loss has not yet been addressed empirically.
Collectively, clinical trials incorporating CBT for BED indicated that CBT decreases either the number of binge days or the actual number of reported binge episodes. CBT leads to greater rates of abstinence than does a waiting list control approach when administered either individually or in group format, and this abstinence persists for up to 4 months posttreatment. CBT also improves the psychological aspects of BED, such as ratings of restraint, hunger, and disinhibition. Results are mixed as to whether CBT improves self-rated depression in this population. Finally, CBT does not appear to produce decreases in weight.
Various forms of self-help were efficacious in decreasing binge days, binge eating episodes, and psychological features associated with BED. Self-help also led to greater abstinence from binge eating than waiting list; short-term abstinence rates approximate those seen in face-to-face psychotherapy trials.
Strength of Evidence in Treatment Literature. We graded the strength of the body of evidence for each question separately. For efficacy of treatment (KQ 1), we graded evidence for AN treatment as weak, that for BN medication and behavioral interventions as strong, and that for BED therapies as moderate. For harms associated with treatment (KQ 2), we graded medication interventions for BN and BED as consistently strong; the literatures for all AN interventions and all other BN and BED interventions were graded as weak to nonexistent because many studies failed to address harms associated with treatment. For factors associated with efficacy of treatment (KQ 3), with the exception of behavioral interventions for BN, which we graded as moderate, we graded the literature uniformly as weak. No published literature provided evidence on whether the efficacy of treatment for these conditions differs by sociodemographic factors (KQ 4). Overall, the literature on the treatment of AN in particular was deficient.
Outcomes of Eating Disorders. One prospective cohort study, conducted in Sweden, followed individuals with AN in the community. Over a 10-year period, approximately half of the group had fully recovered; a small percentage continued to suffer from AN, and the remainder still had other eating disorders. Members of the AN group no longer differed from those in the comparison group in terms of weight, but they continued to be more depressed and to suffer from a variety of personality disorders, obsessive-compulsive disorder, Asperger syndrome, and autism spectrum disorders.
The remaining AN studies followed patient populations. Typically, at least one-half of the patients no longer suffered from AN at followup. However, many continued to have other eating disorders such as BN or EDNOS, and mortality was significantly higher than would be expected in the population matched by sex and age. Factors associated with recovery or good outcomes included lower levels of depression and compulsivity. Factors associated with increased mortality included concurrent alcohol and substance use disorders.
All of the BN outcomes studies followed patient populations. This literature emphasizes comparisons of various definitions of disease outcomes and diagnostic subtypes. Generally, more than one-half of the patients followed no longer had a BN diagnosis at the end of the study. A substantial percentage continued to suffer from other eating disorders, but BN was not associated with an increased mortality risk. A limited number of analyses uncovered factors significantly associated with outcomes of this disease, but only depression was consistently associated with worse outcomes.
Only sparse evidence addresses factors associated with BED outcomes. The three included studies have vastly different designs and research questions; more importantly, they do not converge on any systematic findings. Recalling that no studies of EDNOS outcomes exist, we conclude that the literature regarding outcomes of both EDNOS in general and BED in particular is seriously lacking; we believe that no conclusions can be drawn about factors influencing outcomes of these disorders.
Age of AN disease onset was examined in several AN outcomes studies. However, the relation between this variable and outcomes was mixed. No additional differences by participant sex, gender, age, race, ethnicity, or cultural group emerged from the AN, BN, or BED outcomes literature.
Strength of Evidence in Outcomes Literature. The strength of the evidence addressing factors associated with outcomes among individuals with AN and BN is moderate. In contrast, given the limited information about factors related to outcomes among individuals with BED (KQ 5), we rated BED evidence as weak. We used the body of literature concerning KQ 5 to examine differences in outcomes by sociodemographic factors (KQ 6). We graded the AN literature as weak and the BN and BED literature as nonexistent.
In conclusion, the literature regarding treatment efficacy and outcome for AN, BN, and BED is of highly variable quality. In the treatment literature, the largest deficiency rests with treatment efficacy for AN where the literature was weakest. Future studies require large numbers of participants, multiple sites, appropriate biomarker outcomes, and clear delineation of the age of participants. For BN, future studies should address novel treatments for the disorder, optimal duration of intervention, and optimal approaches for those who do not respond to medication or CBT. For BED, future studies should identify interventions that are effective for both elimination of binge eating and reduction of weight (in overweight individuals), optimal duration of intervention, and effective strategies for prevention of relapse. For all three disorders, exploration of additional treatment approaches is warranted. In addition, for all three disorders, greater attention must be paid to factors influencing outcomes, harms associated with treatment, and differential efficacy by sex, gender, age, race, ethnicity, or cultural group.
For all three disorders, consensus definitions of remission, recovery, and relapse are essential. Greater attention to disease presentations currently grouped under the heading of EDNOS is required for both treatment and outcome literature. For outcome studies, especially for BN and BED, population-based cohort studies with comparison groups and adequate durations of followup are required. For both future treatment and outcome studies, researchers must carefully attend to issues of statistical power, research design including the use of similar outcome measures across studies, and sophistication and appropriateness of statistical analyses.
The eating disorders discussed in this report include anorexia nervosa (AN), bulimia nervosa (BN), and eating disorders not otherwise specified (EDNOS). Although rigorous epidemiologic data specific only to the United States are lacking, the mean prevalence of AN in young females in Western Europe and the United States is 0.3 percent and the mean prevalence of BN is 1.0 percent. Clinically concerning subthreshold conditions are more prevalent.1 These eating disorders are associated with substantial morbidity and mortality.2, 3 The financial and social impact of these potentially fatal disorders on disability, productivity, and quality of life remains unknown.
AN is a serious psychiatric illness marked by an inability to maintain a normal healthy body weight, often dropping well below 85 percent of ideal body weight. Patients who are still growing fail to make expected increases in weight (and often height) and bone density. Despite increasing weight loss, individuals with AN continue to obsess about weight, remain dissatisfied with the perceived size of their bodies, and engage in an array of unhealthy behaviors to perpetuate weight loss (e.g., purging, dieting, excessive exercise, fasting). Individuals with AN place central importance on their shape and weight as a marker of self-worth and self-esteem. Although amenorrhea is a diagnostic criterion, it is of questionable relevance. There do not appear to be meaningful differences between individuals with AN who do and do not menstruate.4, 5 Typical personality features of individuals with AN include perfectionism, obsessionality, anxiety, harm avoidance, and low self-esteem.6
The most common comorbid psychiatric conditions include major depression7, 8 and anxiety disorders.9, 10 Anxiety disorders often predate the onset of the eating disorder,9, 10 and depression often persists post-recovery.11
| Diagnostic Criteria | |
|---|---|
| Russell's Criteria for Anorexia Nervosa |
|
| Feighner's Criteria for Anorexia Nervosa |
|
| DSM III Criteria for Anorexia Nervosa (307.10) |
|
| DSM III-R Criteria for Anorexia Nervosa (307.10) |
|
| DSM IV Criteria for Anorexia Nervosa (307.10) |
|
| Specify type: | |
| |
| ICD-9 Criteria for Anorexia Nervosa (307.1) | A disorder in which the main features are persistent active refusal to eat and marked loss of weight |
| The level of activity and alertness is characteristically high in relation to the degree of emaciation | |
| Typically the disorder begins in teenage girls but it may sometimes begin before puberty and rarely occurs in males | |
| Amenorrhoea is usual and there may be a variety of other changes including slow pulse and respiration and low body temperature and dependent oedema | |
| Unusual eating habits and attitudes toward food are typical and sometimes starvation follows or alternates with periods of overeating | |
| The accompanying psychiatric symptoms are diverse | |
| ICD-10 Criteria for Anorexia Nervosa (F50.0) |
|
| ICD-10 Criteria for Atypical Anorexia Nervosa (F50.1) | Disorder that fulfills some of the features of anorexia nervosa but in which the overall clinical picture does not justify that diagnosis. For instance, one of the key symptoms, such as amenorrhoea or marked dread of being fat, may be absent in the presence of marked weight loss or weight-reducing behavior. This diagnosis should not be made in the presence of known physical disorders associated with weight loss |
DSM, Diagnostic and Statistical Manual; ICD, International Classification of Diseases. For citations, see text.
The mean prevalence of AN in young females in Western Europe and the United States is 0.3 percent.1 The prevalence of subthreshold AN, defined as one criterion short of threshold, is greater—ranging from 0.37 percent to 1.3 percent.18, 19
Although awareness of the disorder has increased, the data on changing incidence are conflicting. Some studies suggest that the incidence is increasing,20–26 and others report stable rates.27–31 Epidemiological studies indicate that the peak age of onset is between 15 and 19 years.32 Anecdotal reports suggest increasing presentations in prepubertal children33 and new onset cases in mid- and late-life.34, 35 The gender ratio for AN is approximately 9:1, women to men.16
The etiology of AN remains incompletely understood. Although numerous psychological, social, and biological factors have been implicated as potentially causal, few specific risk factors have been consistently replicated in studies of the etiology of the disorder.36, 37 Although not disorder-specific, common risk factors across eating disorders include sex, race or ethnicity, childhood eating and gastrointestinal problems, elevated shape and weight concerns, negative self-evaluation, sexual abuse and other adverse events, and general psychiatric comorbidity.36 In addition, prematurity, smallness for gestational age, and cephalohematoma have been identified as risk factors for AN.38
The preponderance of reports from western cultures fueled early conceptualizations of AN as a culturally determined disorder, but the past decade of biological and genetic research has revealed that AN is familial39 and that the observed familial aggregation is attributable primarily to genetic factors.40–42 Moreover, molecular genetic studies have identified areas of the human genome that may harbor susceptibility loci for AN43, 44 and specific genes that may influence risk.45, 46
In addition, an array of pharmacologic, genetic, and neuroimaging studies have identified fundamental disturbances in serotonergic function in individuals with AN even after recovery.47 Although serotonin has received considerable research attention, given the interrelatedness of neurotransmitter function, other neurotransmitter systems, most notably dopamine, are also implicated in these disorders.48 The ultimate understanding of AN etiology will likely include main effects of both biological and environmental factors as well as their interactions and correlations.
AN has serious medical and psychological consequences that can persist even after recovery. Features associated with the eating disorder including depression, anxiety, social withdrawal, heightened self-consciousness, fatigue, and multiple medical complications.7, 49–51 The social toll of AN interferes with normal adolescent development.52 Across psychiatric disorders, the highest risks of premature death, from both natural and unnatural causes, are from substance abuse and eating disorders.53
A history of AN is associated with greater problems with reproduction,54 osteoporosis,55–57 continued low body mass index (BMI, a commonly used measure of normal weight, overweight, or obesity calculated as weight in kilograms divided by height in meters squared [kg/m2]), and major depression.11 Chapter 6 reviews eating-related, psychological, and biomarker-measured outcomes of AN in detail.
Given the high morbidity and mortality associated with AN, developing effective treatments for AN is critical. Because of the frequent medical complications and nutritional compromise, clinical practice typically includes a comprehensive medical evaluation and nutritional counseling. Typically, less medically compromised cases of AN are treated on an outpatient basis by psychiatrists, psychologists, and other therapists with primary care providers managing medical care. Professional organizations have developed several English-language treatment guidelines or position papers for the treatment of AN; these include the American Psychiatric Association,58 the National Institute for Clinical Excellence,59 the Society for Adolescent Medicine,60 the American Academy of Pediatrics,61 and the Royal Australian and New Zealand College of Psychiatrists.62
Psychotherapeutic approaches include individual psychotherapy (cognitive-behavioral, interpersonal, behavioral, and psychodynamic), family therapy (especially for younger patients), and group therapy. The American Psychiatric Association Working Group on Eating Disorders concluded that hospitalization is appropriate for individuals below 75 percent of ideal body weight.58 Weight is not the only parameter to be considered in level of care decisions. Other considerations include medical complications, suicide attempt or plan, failure of outpatient or partial hospitalization treatment, psychiatric comorbidity, role impairment, poor psychosocial support, compromised pregnancy, and lack of availability of less intensive treatment options.58 Such treatment commonly involves highly specialized multidisciplinary teams including psychologists, psychiatrists, internists or pediatricians, nutritionists, social workers, and nurse specialists.
Striegel-Moore et al. reported the average length of stay to be 26 days using an insurance database of approximately 4 million individuals in the United States;63 this is substantially shorter than the lengths of stay in other countries, including New Zealand (72 days)64 and Europe, which ranges from 40.6 days (Finland) to 135.8 days (Switzerland).65 They found that, per patient, AN treatment costs in the United States were higher than those for obsessive-compulsive disorder and comparable to those for schizophrenia, both of which have prevalences similar to those of AN.63
A workshop sponsored by the National Institute of Mental Health (NIMH) examined problems in conducting research on AN treatment.66 It highlighted obstacles such as relatively low incidence and prevalence, lack of consensus on best treatments, variable presentation within the patient population based on age and illness factors, high costs of providing treatment, and the complex interaction of medical and psychiatric problems associated with the illness. This report also highlighted the importance of improving and expanding the workforce in the eating disorders research field.
BN is characterized by recurrent episodes of binge eating in combination with some form of inappropriate compensatory behavior. Binge eating is the consumption of an abnormally large amount of food coupled with a feeling of being out of control. Compensatory behaviors (aimed at preventing weight gain) include self-induced vomiting; the misuse of laxatives, diuretics, or other agents; fasting; and excessive exercise.
The onset of BN usually occurs in adolescence or early adulthood and is most frequently seen in women who are of normal body weight.16 Although the gender ratio is approximately 9:1, women to men, the diagnostic criteria themselves are gender-biased. In contrast to women, men tend to present with a greater reliance on nonpurging forms of compensatory behavior such as excessive exercise.67, 68 Considerations of differences in the clinical presentation of BN in men may lead to revised estimates.67, 69
Approximately 80 percent of patients with BN are diagnosed with another psychiatric disorder at some time in their life.70 Commonly comorbid psychiatric conditions include anxiety disorders, major depression, dysthymia, substance use, and personality disorders.9, 71–77 Personality features of individuals with BN include some features shared with AN such as high harm avoidance, perfectionism, and low self-esteem. Features more specific to BN include higher novelty seeking, higher impulsivity, lower self-directedness, and lower cooperativeness.78–80
| Diagnostic Criteria | |
|---|---|
| DSM III Criteria for Bulimia Nervosa (307.51) |
|
| DSM III-R Criteria for Bulimia Nervosa (307.51) |
|
| DSM IV Criteria for Bulimia Nervosa (307.51) |
|
| Specify type: | |
| Purging type: During the current episode of bulimia nervosa, the person has regularly engaged in self-induced vomiting or the misuse of laxatives, diuretics, or enemas | |
| Nonpurging type: During the current episode of bulimia nervosa, the person has used inappropriate compensatory behaviors, such as fasting or excessive exercise, but has not regularly engaged in self-induced vomiting or the misuse of laxatives, diuretics, or enemas | |
| ICD-10 Criteria for Bulimia Nervosa (F50.2) |
|
| ICD-10 Criteria for Atypical Bulimia Nervosa (F50.3) | Disorder that fulfills some of the features of bulimia nervosa, but in which the overall clinical picture does not justify that diagnosis. For instance, there may be recurrent bouts of overeating or overuse of purgatives without significant weight change, or the typical overconcern about body shape and weight may be absent |
DSM, Diagnostic and Statistical Manual; ICD, International Classification of Diseases. For citations, see text.
A recent review estimated the prevalence of BN to be 1 percent for women and 0.1 percent for men across Western Europe and the United States.1 The prevalence of subthreshold BN was considerably higher: 1.5 percent for full syndrome and 5.4 percent for partial syndrome. Because of the late introduction of BN into psychiatric nomenclature, few studies have explored temporal changes in the incidence of the disorder. Moreover, few studies have estimated the prevalence of BN among children and adolescents.
Historically, like AN, BN has been conceptualized as having sociocultural origins. Substantial familial aggregation of BN has been reported.39 Twin studies reveal a moderate to substantial contribution of additive genetic factors (between 54 percent and 83 percent) and unique environmental factors to BN.81, 82 Linkage analyses have identified areas on chromosome 10p that may be implicated in BN.83 Numerous candidate genes have been studied for their role in risk for the disorder.46
Ongoing biological studies suggest fundamental disturbances in serotonergic function in individuals with BN.80, 84 The ultimate understanding of the etiology of BN and of other disturbances that contribute to the development of inappropriate responses to satiety clues85 will most likely include main effects of both biological and environmental factors as well as their interactions and correlations.
Although BN is not typically associated with the serious physical complications normally associated with AN, patients commonly report physical symptoms such as fatigue, lethargy, bloating, and gastrointestinal problems. Individuals with BN who engage in frequent vomiting may experience electrolyte abnormalities, metabolic alkalosis, erosion of dental enamel, swelling of the parotid glands, and scars and calluses on the backs of their hands.86 Those who frequently misuse laxatives can have edema, fluid loss and subsequent dehydration, electrolyte abnormalities, metabolic acidosis, and potentially permanent loss of normal bowel function.86 Chapter 6 reviews eating-related, psychological, and biomarker-measured outcomes of BN in detail.
In the United States, most treatment for BN is conducted on an outpatient basis. Given the frequency of medical87 and nutritional complications, a comprehensive medical evaluation is the typical first step in treatment. Thereafter, psychotherapy, delivered either individually or in group format, is usually the cornerstone of BN interventions. Common approaches include cognitive-behavioral therapy and interpersonal psychotherapy. In cases in which the individual is experiencing medical complications of BN, is pregnant, or is unable to bring an entrenched binge-purge cycle under control on an outpatient basis, partial hospitalization or inpatient treatment is often warranted.
In 1996, the Food and Drug Administration (FDA) approved fluoxetine for the treatment of BN. Currently, this is the only FDA-approved medication for the treatment of any eating disorder.
Eating disorders not otherwise specified (EDNOS) is a diagnostic category that captures those individuals with eating disorders who do not meet criteria for AN or BN. The DSM IV lists six different examples of presentations of EDNOS:
all features of AN except amenorrhea;
all features of AN except remaining in a normal weight range;
all criteria for BN except frequency of binge eating or purging or duration of 3 months;
regular inappropriate compensatory behavior after eating small amounts of food;
chewing and spitting out food; and
binge eating disorder (BED).
Clinical reports suggest that individuals with EDNOS constitute the majority of individuals seeking professional help for an eating disorder.88, 89 This suggests that the nomenclature for eating disorders is imperfect. Moreover, our attempts to address the key questions of this evidence report for the global category of EDNOS indicated a paucity of investigations on the nature of the highly heterogeneous category of EDNOS and on the treatment and outcome of specific presentations of EDNOS. We redirected the task to focus on BED, the one category of EDNOS that has a corpus of research.
The symptom of binge eating was first recognized in a subset of obese individuals by Stunkard in 1959.90 BED has had a slow and controversial evolution in the psychiatric nosology for eating disorders.91–94 DSM IV currently includes BED as a disorder requiring further study.
| Diagnostic Criteria | |
|---|---|
| DSM IV Criteria for Binge Eating Disorder (307.50) |
|
DSM, Diagnostic and Statistical Manual.
Population-based studies suggest that between 0.7 percent and 3 percent of individuals in community samples meet criteria for BED.92, 97–99 Community studies of obese individuals have found a prevalence of BED between 5 percent and 8 percent.100, 101 Population-based studies of BED and the component behavior of binge eating report a relatively equal gender distribution,92, 99 few differences in prevalence across races or ethnic groups,102 and possibly increased risk associated with lower socioeconomic status.103, 104 In a population-based study of female twins, 37 percent of obese women (BMI ≥ 30) endorsed the symptom of binge eating,105 representing 2.7 percent of the female population studied.
In a community-based case-control study, Fairburn et al.106 found significant differences in exposure to risk factors between women with BED and healthy controls, but surprisingly few differences between women with BED and BN. In comparison to healthy controls, women with BED reported greater adverse childhood experiences, parental depression, personal vulnerability to depression, and exposure to negative comments about weight, shape, and eating.
BED has been shown to aggregate in families.107 Although heritability estimates for frank BED are not yet available, the heritability of binge eating in the absence of compensatory behaviors has been estimated to be 41 percent.108 In addition, binge eating has been explored as a potential intermediate behavioral phenotype in understanding the genetics of obesity. It has also been preliminarily identified in some studies as an important phenotypic characteristic of individuals with a mutation in the melanocortin 4 receptor (MC4R), a candidate gene that influences eating behavior,109 although this finding has not been replicated.110
Given that BED has only recently entered the psychiatric nomenclature, we have minimal population-based data on morbidity and mortality. The presence of binge eating or BED in obese individuals carries substantial risk. Obese individuals with binge eating or BED in clinical and community studies report earlier onsets of obesity and dieting,92, 111, 112 greater weight fluctuations,112 more cognitive features of disordered eating,113 lower self-esteem and self-efficacy,114 and higher scores on depression indices.114–117 Chapter 6 reviews eating-related, psychological, and biomarker-measured outcomes of BED in detail.
In the United States, treatment for BED is typically conducted on an outpatient basis. Psychological and dietary interventions aim to reduce binge eating and control weight.118 Common psychotherapeutic approaches include cognitive-behavioral and interpersonal psychotherapy; nutritional approaches include very low calorie diets and behavioral self-management strategies.118 Pharmacotherapy targeting both the core symptoms of binge eating and weight loss are also available as off-label interventions.119
Given that eating disorders are an important public health problem, the Agency for Healthcare Research and Quality (AHRQ), the National Institutes of Health's Office of Research on Women's Health, together with the Health Resources and Services Administration (HRSA), and in consultation with National Institute of Mental Health (NIMH), commissioned an evidence report through its Evidence Based Practice Program and assigned it to the RTI International-University of North Carolina Evidence-Based Practice Center (RTI-UNC EPC). The issue is also of particular concern to the American Psychiatric Association and the Laureate Psychiatric Clinic and Hospital, which nominated the topic.
Chapter 2 describes our methodological approach, including the development of key questions and their analytic framework, our search strategies, and inclusion/exclusion criteria. In Chapters 3 through 5, we separately present the results of our literature search and synthesis on the treatment of each disease (respectively, AN, BN, and BED). Chapter 6 documents our findings about outcomes associated with each disease. Chapter 7 further discusses our findings, grades the strength of the bodies of literature, highlights methodological shortcomings of the extant research, and offers recommendations for future research. Appendixes (available electronically at http://www.ahrq.gov provide a detailed description of our search strings (Appendix A *), our quality rating forms (Appendix B), detailed evidence tables (Appendix C), list of excluded studies (Appendix D), and acknowledgments including our Technical Expert Panel and peer reviewers (Appendix E).
We identified experts in the field of eating disorders to provide assistance throughout the project. The Technical Expert Panel (TEP) (seeAppendix E) contributes to AHRQ's broader goals of (1) creating and maintaining science partnerships as well as public-private partnerships and (2) meeting the needs of an array of potential customers and users of this product. The TEP served as both a resource and sounding board during the project. Our TEP comprised 10 individuals: three psychiatrists and two psychologists with eating disorder expertise; two nurses; one pediatric/adolescent medicine physician; one nutritionist; and one patient advocate.
To ensure accountability and scientifically relevant work, the TEP was called upon to provide guidance at all stages of the project. TEP members participated in conference calls and e-mail exchanges to
refine the analytic framework and key questions at the beginning of the project;
refine the scope of the project; and
discuss inclusion and exclusion criteria.
Because of their extensive knowledge of the literature on eating disorders, including numerous articles authored by TEP members, and their active involvement in professional organizations and as practitioners in the field, we also asked TEP members to participate in external peer review of the draft report.
We anticipate this report will be of value to members of the various professional organizations who treat eating disorders. These include the Academy for Eating Disorders, American Academy of Pediatrics, American Academy of Family Practice, American College of Obstetricians and Gynecologists, American Dietetics Association, American Psychiatric Association, American Psychological Association, International Association of Eating Disorders Professionals, National Association of Social Workers, and Society for Adolescent Medicine.
More generally, the report will assist these organizations in their mission to inform and educate practitioners. From this review, the National Institutes of Health can identify serious gaps in the research on eating disorders to guide funding policy. It can inform practitioners on the current evidence about outcomes associated with having these eating disorders and treating patients with them. Researchers will benefit from the concise analysis of the current status of the field, which will enable them to design future studies to address deficiencies in the field. Health educators can use this report to improve health communication. Finally, policymakers can use this report to allocate resources toward future research and initiatives that are likely to be successful.
In this chapter, we document the procedures that the RTI International - University of North Carolina at Chapel Hill Evidence-based Practice Center (RTI-UNC EPC) used to develop this comprehensive evidence report on the management and outcomes related to eating disorders. To provide a framework for the review, we first present the key questions and their underlying analytic framework. We then describe our strategy for identifying articles relevant to our key questions, our inclusion/exclusion criteria, and the process we used to abstract relevant information from eligible articles and generate our evidence tables. We also discuss our criteria for grading the quality of individual articles and the strength of the evidence as a whole. Last, we explain the peer review process.
This report spans key questions (KQs) regarding both treatment and outcomes of three eating disorders: anorexia nervosa (AN), bulimia nervosa (BN), and eating disorders not otherwise specified (EDNOS), which we refined to focus exclusively on binge eating disorder (BED) because of the lack of availability of data on other EDNOS conditions. We examine issues concerning treatment efficacy and disease outcomes separately for each disorder. The American Psychiatric Association and Laureate Psychiatric Clinic and Hospital initially offered these questions, and we put them into final form with input from our Technical Expert Panel (TEP).
1. What is the evidence for the efficacy of treatments or combination of treatments for each of the following eating disorders: AN, BN, and BED?
2. What is the evidence of harms associated with the treatment or combination of treatments for each of the following eating disorders: AN, BN, and BED?
3. What factors are associated with the efficacy of treatment among patients with the following eating disorders: AN, BN, and BED?
4. Does the efficacy of treatment for AN, BN, and BED differ by sex, gender, age, race, ethnicity, or cultural group?
5. What factors are associated with outcomes among individuals with the following eating disorders: AN, BN, and BED?
6. Do outcomes for AN, BN, and BED differ by sex, gender, age, race, ethnicity, or cultural group?
In the analytic framework for these questions (Figure 1
Also depicted on the framework are the six KQs discussed in this report. KQ 1 addresses the efficacy of available treatments for the three disorders; we categorize outcomes as eating-related outcomes that deal with the core behavioral and psychological pathology of the disorders, psychiatric or psychological outcomes that focus on the presence of comorbid depression and anxiety, and biomarker outcomes that reflect weight, body mass index (BMI), and other biological indices of the disorders. Treatment may include relapse, diagnostic crossover, and symptomatic change. KQ 2 explores the harms associated with both medication and psychological treatments for these disorders. KQs 3 and 4 highlight the roles of illness-related factors (e.g., comorbid depression, subtype of the eating disorders, early onset of illness) and illness-independent factors (e.g., sex, gender, race or ethnicity, age) in influencing the outcomes of treating these conditions.
KQ 5 addresses short- and long-term outcomes of the disorders. We apply information from observational, cohort, and case series investigations and focus on eating-related, psychiatric or psychological, and biological indices. Finally, KQ 6 highlights whether these outcomes differ by sex, gender, age, race or ethnicity, or cultural groups.
| Category | Criteria |
|---|---|
| Study population | Humans |
| All races, ethnicities, and cultural groups | |
| 10 years of age or older. | |
| Study settings and geography | All nations |
| Time period | Published from 1980 to the present |
| Publication criteria | All languages |
| Articles in print | |
| Articles in the “gray literature,” published in nonpeer-reviewed journals, or unobtainable during the review period were excluded. | |
| Admissible evidence (study design and other criteria) | Original research studies that provide sufficient detail regarding methods and results to enable use and adjustment of the data and results. |
| Anorexia nervosa must be diagnosed according to DSM III, DSM III-R, DSM IV, ICD-10, Feighner, or Russell criteria. | |
| Bulimia nervosa must be diagnosed according to DSM III-R, DSM IV, or ICD-10 criteria. | |
| Eating disorders not otherwise specified (binge eating disorder) must be diagnosed according to DSM IV criteria. | |
| Relevant outcomes: eating related, psychiatric or psychological, and biomarker measures; must be able to be abstracted from data presented in the papers. | |
| Eligible study designs include: | |
| Randomized controlled trials (RCTs): | |
| Double-blinded, single-blinded, and cross-over designs (data from prior to the first cross-over). | |
| Anorexia nervosa studies: initiated with 10 or more participants and followed for any length of time. | |
| Eating disorders not otherwise specified (binge eating disorder) studies: initiated with 10 or more participants and followed for any length of time. | |
| Bulimia nervosa studies: initiated with 30 or more patients and followed for a minimum of 3 months. | |
| Outcomes studies: | |
| Observational studies including prospective and retrospective cohort studies and case series studies, with and without comparison populations. | |
| Disease population must be followed for a minimum of 1 year. Disease population must include 50 or more participants at the time of the analysis. | |
We excluded data that combined diseases because such mixed information would preclude us from separately examining evidence on any one of the three conditions. We also excluded editorials, letters, and commentaries; articles that did not report outcomes related to our key questions; and studies that did not provide sufficient information to be abstracted. Studies were required to report on at least one of our outcomes categories of interest: eating, psychiatric and psychological, or biomarker measures.
We defined individuals as having one of the three disorders of interest according to specific diagnostic criteria. We examined the impact of treatment through a review of the RCT efficacy of treatment literature.
To address a TEP concern that the size of the available AN and BED literature was too limited to permit us to constrain this review based on sample size or followup duration, we included very small AN and BED RCT treatment studies in our review (10 or more participants) and did not require specified followup durations for a study to be included. The BN literature, however, is much more voluminous, which allowed us to limit the treatment studies to larger ones (i.e., those with 30 or more participants).
To help ensure that we were not measuring short-term fluctuations in disease symptoms, we required BN efficacy of treatment studies to follow patients for a minimum of 3 months. The decision to place more stringent requirements on the BN literature was made in consultation with our TEP. Because of financial and time considerations, we used a recently completed EPC report entitled Drug Class Review on Second Generation Antidepressants 121 as a starting point for our discussion of harms or side effects related to receiving treatment for AN, BN, and BED; we then supplemented this information with harms reported in the RCT studies meeting our inclusion criteria.
We examined outcomes related to having one of the three eating disorders through a review of observational studies; outcomes included eating, psychiatric or psychological, and biomarker variables and death. Although many participants followed in these studies have received treatment, the outcomes of interest relate not to efficacy of treatments but rather to disease levels and other problems that persist over time. To avoid reporting short-term fluctuations among the disease populations and to have sufficient sample sizes to observe changes over time, we limited our review to studies of 50 or more individuals, followed for a minimum of 1year, with or without comparison groups. Our TEP concurred with this plan.
For both the RCT and outcome literatures, we were unable to perform pooled meta-analyses. Given the absence of consensus definitions of remission, recovery, and relapse for eating disorders, as well as the overabundance of outcome measures, we judged meta-analysis to be both inadvisable and infeasible.
Databases and search terms. To identify the relevant literature for our review, we conducted systematic searches based on search terms, reviewed included studies by our TEP, and hand searched reference lists. We searched standard electronic databases such as MEDLINE®, the Cumulative Index to Nursing and Applied Health (CINAHL), PsycINFO, the Educational Resources Information Center (ERIC), the National AGRICultural OnLine Access (AGRICOLA), and Cochrane Collaboration libraries.
Based on inclusion/exclusion criteria specified above, we generated a list of Medical Subject Heading (MeSH) search terms, supplemented by key word searches of MEDLINE®. Comparable terms were used to search other databases. MeSH terms included anorexia, anorexia nervosa, and bulimia. Text terms included binge eating disorder. We limited our searches by type of study, including RCT, single-blind method, double-blind method, random allocation, longitudinal studies, and observational studies. For interventions, we used therapeutics or cognitive therapy or family therapy or drug therapy or therapy, computer-assisted. For outcomes of disease, we used outcome assessment (health care), treatment outcome, outcome and process assessment (health care), and recurrence. Finally, we asked our external peer reviewers for titles of articles that we may have missed.
Figure 2
We retained the following for our review to answer KQs about treatment efficacy: 35 articles on AN, 58 articles on BN, and 26 articles on BED. To answer KQs about disease outcomes, we retained 38 articles on AN, 14 articles on BN, 7 articles on both AN and BN, and 3 articles on BED.
Article selection process. Once we had identified articles through the electronic database search, review articles, and bibliographies, we examined titles and abstracts to determine whether the studies met our inclusion criteria. One reviewer initially evaluated abstracts for inclusion or exclusion. If one reviewer concluded that the article should be included, it was retained. Abstracts initially excluded from the study by one reviewer received a second review by senior project staff—Nancy Berkman, PhD, MLIR (Project Director), Cynthia Bulik, PhD (Scientific Director), or Gerald Gartlehner, MD, MPH (UNC Project Manager).
In all, 478 articles appeared to meet our inclusion criteria through abstract review, so we obtained the full articles. For the full article review, one senior reviewer read each article and determined if it met our eligibility criteria. Those articles that the reviewer determined did not meet our criteria were re-reviewed by a second senior reviewer to ensure agreement that the article should be excluded. We assigned each of these articles one or more reasons for exclusion.
The senior staff members for this systematic review jointly developed the evidence tables. We created two designs for the evidence tables, one for KQs 1 to 4 (treatment studies) and one for KQs 5 and 6 (outcome studies). They are intended to provide sufficient information for readers to understand the study and determine its quality; we emphasized presenting information essential to answering the main questions. The formats of the two sets of evidence tables were based on successful designs used for prior systematic reviews.
Columns in the evidence tables for treatment studies report baseline and outcome measures for eating-related, psychological or psychiatric, and biomarker variables. For each outcome measured, the tables present data in a consistent format. Given the large number of outcomes that these studies typically report, our evidence table entries are relatively long. In contrast, the outcome studies evidence tables are shorter. However, because of the appreciable variety of study approaches and outcomes reported in this literature the presentation of outcome data is, by necessity, less consistent than that for the treatment studies.
For this work, the RTI-UNC EPC team decided to abstract data from included articles directly into evidence tables; this system has worked effectively in many of our past reviews. Because we bypassed the use of data abstraction forms, we had significant efficiencies in production.
We trained data abstractors intensively, thoroughly familiarizing them with table designs, required information and formats, and examples of abstracted articles. As the work progressed, we shared various reporting requirements with abstractors to ensure that information appeared in a consistent and easily understandable manner.
For both the treatment and the outcomes literatures, the first reviewer (UNC faculty, postdoctoral psychology fellow, or psychology graduate student) initially entered data from the article into the evidence table. The second reviewer (Drs. Berkman, Bulik, Brownley, Carey, or Gartlehner) read the article and edited the initial table entry for accuracy, completeness, and consistency. All disagreements concerning the information reported in the evidence tables were reconciled by the two abstractors.
The final evidence tables are presented in their entirety in Appendix C *. Separate tables are included for treatment studies by disease and type of treatment intervention:
Appendix C also presents three evidence tables for outcome studies organized only by disease:
Within each evidence table, entries are listed alphabetically by the last name of the first author. Abbreviations and acronyms used in the tables appear in a glossary at the beginning of the appendix.
| Acronym and Full Name of Test | Description of Test and Subscales |
|---|---|
| ABOS: Anorectic Behaviour Scale for Inpatient Observation124 | Proxy-report (relatives) questionnaire to obtain information about patient's behaviors and attitudes; 3 factors: eating behaviors, concerns with weight and food, denial of proteins; bulimic-like behaviors; hyperactivity. |
| ABS: Anorectic Behavior Scale125 | Administrator-completed questionnaire about patient's behavior while in hospital; 8 items on resistance to eating, 8 items on methods of disposing of food, 6 items on overactivity. |
| ANSS: Anorexia Nervosa Symptom Score126 | Clinical rating scale with psychological, social, and physical severity scores and subscales. |
| BAT: Body Attitudes Test127,128 | Self-report questionnaire to measure subjective body experience and attitude towards one's body; 3 factors: negative attitudes about body size, lack of familiarity with one's own body, body dissatisfaction. |
| BDI: Beck Depression Inventory129 | One of the most widely used self-report measures for depression. It is a 21-item test presented in multiple choice format that measures the presence and degree of depression in adolescents and adults. |
| BEDCI: Binge Eating Disorder Clinical Interview130 | Structured clinical interview to establish the diagnosis of BED and both purging and nonpurging types of BN. |
| BES: Binge Eating Scale131 | Self-report measure of binge eating severity as measured by loss of control over eating behavior; 8 items on behavioral manifestations, 8 items on feelings and cognitions. |
| BIAQ: Body Image Avoidance Questionnaire132 | Self-report measure to assess avoidance of situations that provoke concern about physical appearance (including wearing tight fitting clothing, social outings, physical intimacy); 4 subscales: Eating Restraint, Clothing, Grooming/Weighing, Social Activities |
| BITE: Bulimic Investigation Test Edinburgh133 | Brief self-report questionnaire with 2 subscales designed to assess the symptoms and severity of binge eating episodes. |
| BSI: Brief Symptom Inventory134 | Brief self-report instrument to assess patients at intake for psychiatric problems; 9 Primary Symptom Dimensions: Somatization, Obsessive-Compulsive, Interpersonal Sensitivity, Depression, Anxiety, Hostility, Phobic Anxiety, Paranoid Ideation, Psychoticism; 3 Global Indices: Global Severity Index, Positive Symptom Distress Index, Positive Symptom Total. |
| BSQ: Body Shape Questionnaire135 | Self-report inventory to measure worries about weight and body shape. |
| BSQ-short version: Body Shape Questionnaire - Short Version136 | Self-report inventory to measure worries about weight and body shape. |
| BSS: Body Satisfaction Scale137 | Self-report instrument to assess body image satisfaction; 3 subscales: general, body, head. |
| Bulimic Thoughts Questionnaire138 | Self-report instrument of cognitive patterns and distortions associated with bulimic behavior. |
| CBCL: Child Behavior Checklist139 | Parent-report standardized assessment of behavioral problems and social competencies of children ages 4 to 18; 3 scores: total, internalizing behaviors (fearful, shy, anxious, inhibited), externalizing behaviors (aggressive, antisocial, under controlled). |
| CCEI: Crown-Crisp Experimental Index140 | Scale to measure neurotic symptomatology; 6 subscales: free-floating anxiety, phobic anxiety, obsessionality, somatic concomitants of anxiety, depression, hysterical personality. |
| CDI: Children's Depression Inventory141 | Brief self-report test to measure cognitive, affective, and behavioral signs of depression in persons 6 to 17 years of age; 5 factors: negative mood, interpersonal problems, ineffectiveness, anhedonia, negative self-esteem. |
| CDRS: Contour Drawing Rating Scale142 | Instrument to assess body size perception and dissatisfaction; 9 male and 9 female contour drawings shown to subjects who are asked to indicate which most closely resembles their current size and their ideal figure; the discrepancy is a measure of body dissatisfaction in 3 scores: real body, ideal body, body satisfaction index. |
| CGI or GIS: Clinical Global Impression143 | Clinician-rated scale to assess treatment response in psychiatric patients; 3 subscales: severity of illness (CGI-S), global improvement (CSI-G), efficacy index (CGI-EI). |
| DICA-R: Diagnostic Interview for Children and Adolescents - Revised144 | Semistructured clinical interview to determine Axis I psychiatric diagnoses in children and adolescents. |
| DIET: Dieter's Inventory of Eating Temptations145 | Self-report inventory to assess behavioral competence in 6 weight control situations: overeating, negative emotions, exercise, resisting temptation, positive social, food choice. |
| DSED: Diagnostic Survey for Eating Disorders146 | Self-report questionnaire to quantify frequency of disturbed behavior. |
| EAT: Eating Attitudes Test147 | Standardized self-report measure of symptoms and concern characteristics of eating disorders; 2 versions: EAT-26, EAT-40. |
| EDE: Eating Disorder Examination148 | Semistructured interview to measure specific psychopathology of anorexia nervosa and bulimia nervosa; 4 subscales: dietary restraint, eating concern, weight concern, shape concern. |
| EDE-Q4: Eating Disorders Evaluation Questionnaire - Version 4149 | Self-report assessment of thoughts and behaviors commonly found in eating disorders; 4 subscales: dietary restraint, eating concern, weight concern, shape concern. |
| EDI-1: Eating Disorder Inventory-1149 | Self-report questionnaire to measure psychiatric and behavioral traits commonly associated with eating disorders; 8 scales: drive for thinness, bulimia, body dissatisfaction, ineffectiveness, perfectionism, interpersonal distrust, interoceptive awareness, maturity fears. |
| EDI-2: Eating Disorder Inventory-2150 | Standardized self-report measure of psychiatric symptoms commonly associated with anorexia nervosa, bulimia nervosa, or other eating disorders; 8 subscales as for EDI-1, plus asceticism, impulse regulation, and social insecurity. |
| FACES III: Family Adaptability and Cohesion Evaluation Scales151 | Instrument to assess family adaptation and cohesion. Family cohesion assesses degree of separation or connection of family members to the family; 4 levels of family cohesion range from extreme low cohesion to extreme high cohesion: disengaged, separated, connected, enmeshed; 4 levels of adaptability: rigid, structured, flexible, chaotic. |
| FAM III: Family Assessment Measure152 | Self-report measure that assesses the strengths and weaknesses of functioning within a family; can be completed by pre-adolescents, adolescents, and adult family members (ages 10 years to adult); contains 7 subscales: Task Accomplishment, Role Performance, Communication, Affective Expression, Involvement, Control, Values and Norms. |
| FES: Family Environment Scale153 | Instrument to assess actual, preferred, and expected social environment of all types of families; 10 subscales: cohesion, expressiveness, conflict, independence, achievement, intellectual-cultural, active-recreation, moral-religious, organization, control. |
| FMPS: Frost Multidimensional Perfectionism Scale154 | Self-report measure of perfectionism; original measure had 6 subscales (Concern Over Mistakes, Personal Standards, Parental Expectations, Parental Criticism, Doubts About Actions, Organization). |
| FNE: Fear of Negative Evaluation155,156 | Scale to measure social anxiety about receiving negative evaluations from others; 2 subscales: Negative Expectations, Negative Public Evaluation. |
| Brief-FNE: Brief Fear of Negative Evaluation157 | Brief version of the original FNE. |
| FRS: Figure Rating Scale158 | Silhouette drawings of male and female adult body figures ranging from very thin to very large used as measure of personal body perception; 3 subscales: Real Body, Ideal Body, Body Satisfaction Index. |
| GAAS: Goldberg Anorectic Attitude Scale159 | Scale to measure short-term changes in anorectic cognitions across treatment including measures of hyperactivity, access, self-care, selective appetite, and denial of illness. |
| GAF: Global Assessment of Functioning16 | Clinician-derived instrument to measure the highest level of social and occupational functioning in the previous week and year; sometimes broken down into the GAF-F function score (not including symptoms) and the GAF-S symptom score (not including function). |
| GIS: Global Improvement Scale143 | See CGI (Clinical Global Improvement Scale). |
| HAM-A: Hamilton Anxiety Rating Scale160 | Semistructured interview to assess severity of anxiety symptomatology. |
| HAM-D or HDRS: Hamilton Depression Rating Scale161 | Semistructured interview to assess an array of behavioral, affective, and vegetative symptoms of depression. |
| HGSHS: Harvard Group Scale of Hypnotic Susceptibility, Form A162 | Measure of susceptibility to a wide range of hypnotic experiences, designed for assessing groups of subjects. |
| HRQ: Helping Relationship Questionnaire163 | Patient-rated instrument to measure therapeutic alliance. |
| HSCL: Hopkins Symptom Checklist134 | Self-report screening instrument to identify common psychiatric symptoms; 9 subscales: somatization, obsessive-compulsive symptoms, interpersonal sensitivity, depression, anxiety, anger or hostility, phobic anxiety, paranoid ideation, psychotic symptoms. |
| IBC: Interactive Behavior Code164 | A global interferential measure of communication, problem solving, and conflict, with 22 coded items rated by independent observers; summary scores are computed for negative and positive communication. |
| IIP: Inventory of Interpersonal Problems165 | Instrument to measure interpersonal problems and level of distress arising from interpersonal sources. |
| LCB: Locus of Control of Behavior166 | Instrument to measure the extent to which individuals believe they are responsible for personal problem behavior. |
| LIFE: Longitudinal Interval Continuation Evaluation167 | Semistructured interview and rating system to assess longitudinal course of psychiatric disorders in several areas: psychopathology, nonpsychiatric mental illness, treatment, psychosocial functioning, overall severity, narrative account. |
| MCMI: Millon Clinical Multiaxial Inventory168 | Lengthy test to diagnose 14 personality disorders and 10 clinical syndromes; scales: 14 Personality Pattern Scales, 10 Clinical Syndrome Scales, 3 Modifying Indices, 1 Validity Index. |
| MMPI: Minnesota Multiphasic Personality Inventory169 | Test of adult psychopathology; 8 Validity Scales, 5 Superlative Self-Presentation Subscales, 10 Clinical Scales, 9 Restructured Clinical (RC) Scales, 15 Content Scales, 27 Content Component Scales, 20 Supplementary Scales, 31 Clinical Subscales (Harris-Lingoes and Social Introversion Subscales), and various special or setting-specific indices. |
| MOCI: Maudsley Obsessive Compulsive Index170 | Self-report questionnaire to measure the presence of obsessional-compulsive behaviors; scores: total obsessional symptoms; checking; washing; doubting/conscientious; slowness/repetition. |
| MPS: Multidimensional Perfectionism Scale154 | Self-report instrument to assess perfectionism; 6 subscales: concern over mistakes, personal standards, parental expectations, parental criticism, doubts about action, organization. |
| M-R Scales: Morgan and Russell Scales171 | Structured interview to give a brief but thorough assessment of the central clinical features of anorexia nervosa; 5 subscales: eating behavior, menstrual state, mental state, relevant attitudes, socioeconomic state; sixth scale allows a self-progress rating. |
| M-R-H Scale; Morgan-Russell-Hayward Scale172 | Guided interview concerned with clinical features of anorexia nervosa to evaluate eating behavior, body weight, mental state, and other attitudes relevant to anorexia nervosa; 5 scales: nutrition, menses, mental state, psycho-sexual state, socioeconomic state; additional subscales include: food intake, concern at body image, body weight, menstrual pattern, disturbance of mental state, attitudes toward sexual matters, overt sexual behavior, attitude to menstruation, relationship with family, emancipation from family, personal contacts, social activities, employment record. |
| MRT: Vandenberg and Kuse's Adaptation of Shepard and Metzler's Three-dimensional Mental Rotations Test173 | Self-report test of visuospatial ability in which participants view a depiction of a 3-dimensional target figure and 4 test figures and determine which of the test figures are rotated versions of the target figure. |
| PARQ: Parent Adolescent Relationship Questionnaire174 | Instrument completed by parents and adolescents 10 through 19 years of age to measure relationship between parents and adolescents; 3 scales: Overt Conflict/Skill Deficits, Extreme Beliefs, Family Structure. |
| PGWB: Dupuy's Psychological General Well-being Index175 | Self-report inventory to measure self-representations of intrapersonal affective or emotional states reflecting a sense of subjective well-being or distress; 6 intrapersonal subscales: anxiety, depressed mood, positive well-being, self-control, general health, vitality. |
| PSE: Present State Examination176 | Global index of mental state disturbance. |
| PSR: Psychiatric Status Rating177 | Clinician-administered instrument to determine the severity of a range of psychiatric disorders that has been used to determine eating disorder outcomes. |
| QEWP-R: Questionnaire of Eating and Weight Patterns - Revised178 | Self-report questionnaire to assess a range of features and problems associated with obesity and eating disorders. |
| RAS: Rathus Assertiveness Schedule179 | Self-report instrument to measure assertiveness. |
| RSE: Rosenberg Self-Esteem Scale180 | Self-report instrument to measure overall self-esteem. |
| SADS-C: Schedule for Affective Disorders and Schizophrenia-Change Version181 | Structured interview to differentiate schizophrenia from mood disorders; 2 subscales: depression, mania. |
| SAMS (Situational Appetite Measures) Urge and SAMS Efficacy182 | Complementary scales to measure the strength of the urge to binge in 40 different situations and the degree of confidence in one's ability to resist a binge in those same 40 situations. |
| SAS: Social Adjustment Scale183 | Self-report questionnaire to assess social and work-related functions; 6 subscales: work, social and leisure, extended family, marital, prenatal, family unit. |
| SCFI: Standardized Clinical Family Interview184 | Standardized clinical interview used with families in which the interviewer tries to get responses from all family members and adopts a neutral style. Questions concern numerous areas of family life, mainly what sort of family it is, who does what, who is like whom, life cycle, roles and responsibilities, conflicts, decisions, discipline, relation to the environment. |
| SCI: Shapiro Control Inventory185 | Self-report measure of the psychological construct of control (comparable to Locus of Control scales) with 9 subscales. |
| SCID-I: Structured Clinical Interview I for the DSM IV186 | Structured diagnostic interview to assess presence of current or past DSM IV Axis I major psychiatric disorders. |
| SCL-90 R Symptom Checklist 90-Revised134 | General measure of psychopathology, including various forms of anxiety, depression, paranoia, psychotic features. Subscales: Global Severity Index (GSI) to measure overall psychological distress; Positive Symptom Distress Index to measure the intensity of symptoms; Positive Symptom Total of number of self-reported symptoms (Somatization, Obsessive-Compulsive, Interpersonal Sensitivity, Depression, Hostility, Phobic Anxiety, Paranoid Ideation, Psychoticism). |
| SDS: Zung Self-rating Depression Scale187 | Self-report assessment to quantify depression, using criteria of pervasive depressed affect and its physiological and psychological concomitants. |
| SF-36: Medical Outcomes Study Short Form Health Survey188 | Self-report questionnaire to assess health-related quality of life; 8 subscales: physical function, role physical, bodily pain, general health, mental health, role emotional, social function, vitality, 2 composite scores: physical health; mental health. |
| SIAB-P: Structured Interview for Anorexia and Bulimia Nervosa189 | Interview to assess severity of current eating disorder symptoms; 6 subscales: body image and ideal of slimness, social integration and sexuality, depression, obsessive compulsive syndromes and anxiety, bulimic symptoms, laxative abuse. |
| SMFQ: Short Mood and Feeling Questionnaire190 | Self-report measure of childhood and adolescent depression for children 8 to 16 years of age. |
| SOC: Stages of Change Scale191 | Self-report inventory to describe how respondents feel as they initiate counseling; 4 subscales: Precontemplation, Contemplation, Action, Maintenance. |
| SPAQ: Seasonal Patterns Assessment Questionnaire192 | Self-report instrument to rate the presence and severity of seasonal variation in mood, sleep, and eating-related variables; 2 added items monitor seasonal bingeing and purging patterns. |
| STAI: State Trait Anxiety Inventory193 | Standardized self-report assessment of both state and trait anxiety (2 subscales). |
| STAXI: State Trait Anger Expression Inventory194 | Self-report inventory to assess components of anger and anger expression of normal and abnormal personality. |
| STPI: State Trait Personality Inventory193 | Self-report personality inventory. |
| SUDS: Subjective Units of Distress195 | Self-report measure of intensity of subjective distress in response to a particular stimulus. |
| TAS-20: Toronto Alexithymia Scale196 | Self-report inventory to assess the alexithymia construct (difficulty recognizing, identifying, and communicating emotions; reduced fantasy capacity; and an externally oriented cognitive style); 2 dimensions: identifying feelings (DIF), describing feelings (DDF). |
| TCI: Temperament and Character Inventory197 | Self-report measure of temperament and character; 7 subscales: Novelty Seeking, Harm Avoidance, Reward Dependence, Persistence, Self-Directedness, Cooperativeness, Self-Transcendence. |
| TFEQ: Three-Factor Eating Questionnaire198 | Self-report inventory; 3 subscales: Cognitive-Restraint, Hunger, Disinhibition. Also known as the Eating Inventory. |
| WAIS: Wechsler Adult Intelligence Scale199 | Structured, clinician-administered general test of intelligence for persons 16 years of age and older; 6 Verbal tests: Information, Comprehension, Arithmetic, Digit Span, Similarities, Vocabulary; 5 Performance subtests: Picture Arrangement, Picture Completion, Block Design, Object Assembly, Digit Symbol. |
| WELSQ: Weight Efficacy Life Style Questionnaire200 | Self-report measure of confidence about successfully resisting the desire to eat; 5 situational subscales: Negative Emotions, Availability, Social Pressure, Physical Discomfort, Positive Activities. |
| WLFL: Work, Leisure and Family Life Questionnaire201 | Self-report instrument to measure social adjustment and functioning; 8 scales: work outside the home, housework, social and leisure activities, extended family, marital, parental-older children, parental-baby, family unit. |
| YBC-EDS and YBOCS-ED: Yale-Brown-Cornell Eating Disorder Scale202 | Interview to assess preoccupations and rituals associated with eating disorders: symptom checklist produces 3 dimensions of preoccupations and rituals (severity, motivation, ego syntonicity) and covers 18 general categories of rituals and preoccupations. |
| Y-BOCS-BE: Yale-Brown Obsessive Compulsive Scale Modified for Binge Eating203 | Clinician-rated inventory of obsessive-compulsive problems adapted for use with binge-eating disorder. |
| Y-BOCS Score: Yale-Brown Obsessive Compulsive Scale204 | Clinician-rated scale with separate subtotals for severity of obsessions and compulsions; 2 subscales: obsessions, compulsions. |
| Youth Self-Report139,205 | Self-report inventory on various behavior problems. |
Rating the quality of individual articles. For this systematic review, we developed our approach to assessing the quality of individual articles using domains and elements recommended in the evidence report by West and colleagues, Systems to Rate the Strength of Scientific Evidence.122 We developed two quality-rating forms, one for the treatment literature and the other for the outcomes literature. Quality rating forms did not differ by disease. We tested several drafts of these forms, revising them as needed to ensure that they efficiently captured the desired information. The final grading forms can be found in Appendix B.
We assessed the treatment literature through 25 items in 11 categories: (1) research aim/study question, (2) study population, (3) randomization, (4) blinding, (5) interventions, (6) outcomes, (7) statistical analysis, (8) results, (9) discussion, (10) external validity, and (11) funding/sponsorship. We did not exclude any studies with so-called fatal flaws, such as the approach to randomization. Rather, we reduced the study's overall score if a category was flawed or inadequate. Because patients and those administering interventions in the psychological treatment studies could not be blinded, we did not evaluate these items when studies included these interventions. However, we always evaluated whether the outcome assessor was blinded. Studies that were reported in more than one article were given the same quality grade.
We weighted each item equally and calculated a score out of 100 percent. We then collapsed those scores into three categories: poor, 0 percent to 59 percent; fair, 60 percent to 74 percent; and good, 75 percent or better.
For the outcomes literature, we used 17 items in 8 categories: (1) research aim/study question, (2) study population, (3) eating disorder diagnosis method, (4) study design, (5) statistical analysis, (6) results/outcome measurement, (7) external validity, and (8) discussion. As with the RCTs, we weighted each item equally. Rather than calculating a score out of 100 percent, however, we converted ratings for each item into numeric values of 0, 1, or 2, in which 0 = poor, 1 = fair, and 2 = good. Studies without comparison groups were not evaluated by items addressing this aspect of design. However, studies that included comparison groups were scored as “good” on one item, whereas those without were scored as “poor” on that item. We calculated the mean score for all graded items and we concluded that, overall, an article should be graded as poor with a rating < 1, fair with a rating ≥ 1 and < 1.5, and good with a rating of ≥ 1.5.
Each quality grade was the composite (averaged) rating of two independent evaluators. The only items reconciled between the evaluators were those in which one rater provided a score for the item and the other said the item was not applicable. In assessing quality of the treatment studies, we asked the two evaluators to discuss their results if the difference in their total scores was 20 points or greater, but we did not require them to come to agreement.
Rating the strength of the available evidence. We rated the strength of the evidence base for both interventions and disease outcomes separately for the three diseases, using a single scheme for all bodies of evidence. Starting with the West et al. report that compared various schemes for grading bodies of evidence,122 we based our evaluation on criteria developed by Greer et al.,123 which we deemed most applicable to the study designs in this review. It includes three domains: quality of the research, quantity of studies (including number of studies and adequacy of the sample size), and consistency of findings.
We graded the body of literature applicable to each of the six KQs separately. For the treatment literature, we further divided studies by whether the intervention was pharmaceutical, behavioral, or a combination. Three senior staff defined by consensus four strength-of-evidence categories, as follows:
I. Strong evidence base. The evidence is from studies of strong design; results are both clinically important and consistent with minor exceptions at most; results are free from serious doubts about generalizability, bias, or flaws in research design. Studies with negative results have sufficiently large samples to have adequate statistical power.
II. Moderate evidence base. The evidence is from studies of strong design, but some uncertainty remains because of inconsistencies or concern about generalizability, bias, research design flaws, or adequate sample size. Alternatively, the evidence is consistent but derives from studies of weaker design.
III. Weak evidence base. The evidence is from a limited number of studies of weaker design. Studies with strong design either have not been done or are inconclusive.
IV. No evidence base. No published literature.
Among the more important activities involved in producing a credible evidence report is conducting an unbiased and broadly based review of the draft report. External reviewers for this report included clinicians, representatives of professional societies and advocacy groups, and potential users of the report, including TEP members (see Appendix D †). We charged peer reviewers with commenting on the content, structure, and format of the evidence report and asked them to complete a peer review checklist. We revised the report, as appropriate, based on their comments.
This chapter presents results of our literature search and our findings for the key questions (KQs) regarding treatment for anorexia nervosa (AN). We examine evidence for the efficacy of various treatments or combinations of treatments for AN (KQ 1), harms associated with the treatment or combination of treatments for AN (KQ 2), factors associated with the efficacy of treatment for AN (KQ 3), and whether the efficacy of treatment for AN differs by sex, gender, age, race, ethnicity, or cultural groups (KQ 4).
We identified 32 studies published in 35 articles addressing treatment efficacy for AN; of these 15 were medication trials. We were unable to categorize medication studies into adolescent and adult trials given the paucity of medication trials focusing on adolescents.
We rated two medication trials as good,206 six as fair,207–213 and seven as poor (not discussed further).124, 214–219 Of the studies judged fair or good, the medications studied included second-generation antidepressants,206, 207 tricyclic antidepressants,208, 209 nutritional supplements,213 and hormones.210–212 Study designs included medication versus placebo (six trials), medication A versus medication B versus placebo (one), and medication versus waiting list or nonmedication control (one).
Eighteen of the 32 studies were behavioral intervention trials. In this report behavioral interventions refer to all forms of psychotherapy including cognitive, supportive, dynamic, family, individual, and group. One trial was of therapeutic warming.220 We rated two of these trials as good,221, 222 nine as fair,223–231 and six as poor (not discussed further).220, 232–236 Of the 11 trials reviewed here, six were conducted among adults and five among adolescents. Behavioral interventions studied include cognitive behavioral therapy (CBT),223–225 cognitive analytic therapy (CAT),226 focal psychoanalytic therapy,228 and various forms of family therapy.221, 222, 229–231, 237 The behavioral intervention trials used two designs: psychotherapy A versus psychotherapy B, and psychotherapy A versus psychotherapy B versus control.
| Reasons Contributing to Poor Ratings | Types of Intervention, Number of Times Flaw Was Detected, and Citations |
|---|---|
| Research Aim | |
| Hypothesis not clearly described | Medication-only trials: 0 |
| Behavioral intervention trials (adults): 0 | |
| Behavioral intervention trials (adolescents): 0 | |
| Study Population | |
| Characteristics not clearly described | Medication-only trials: 0 |
| Behavioral intervention trials (adults): 0 | |
| Behavioral intervention trials (adolescents): 0 | |
| No specific inclusion or exclusion criteria | Medication-only trials: 1214 |
| Behavioral intervention trials (adults): 1233 | |
| Behavioral intervention trials (adolescents): 0 | |
| Randomization | |
| Protections against influence not in place | Medication-only trials: 6124,214–216,218,219 |
| Behavioral intervention trials (adults): 1233 | |
| Behavioral intervention trials (adolescents): 0 | |
| Approach not described | Medication-only trial: 6124,214–216,218,219 |
| Behavioral intervention trials (adults): 1233 | |
| Behavioral intervention trials (adolescents): 1236 | |
| Whether randomization had a fatal flaw not known | Medication-only trials: 6124,214–216,218,219 |
| Behavioral intervention trials (adults): 1233 | |
| Behavioral intervention trials (adolescents): 2235,236 | |
| Comparison group(s) not similar at baseline | Medication-only trials: 3214,215,219 |
| Behavioral intervention trials (adults): 0 | |
| Behavioral intervention trials (adolescents): 1236 | |
| Blinding | |
| Study subjects | Medication-only trials: 4215–217,219 |
| Behavioral intervention trials (adults): N/A | |
| Behavioral intervention trials (adolescents): N/A | |
| Investigators | Medication-only trials: 6124,215–219 |
| Behavioral intervention trials (adults): 1220 | |
| Behavioral intervention trials (adolescents): 0 | |
| Outcomes assessors | Medication-only trials: 6124,215–219 |
| Behavioral intervention trials (adults): 3220,233,234 | |
| Behavioral intervention trials (adolescents): 2235,236 | |
| Interventions | |
| Interventions not clearly described | Medication-only trials: 0 |
| Behavioral intervention trials (adults): 0 | |
| Behavioral intervention trials (adolescents): 0 | |
| No reliable measurement of patient compliance | Medication-only trials: 5214–217,219 |
| Behavioral intervention trials (adults): 1220 | |
| Behavioral intervention trials (adolescents): 1235 | |
| Outcomes | |
| Results not clearly described | Medication-only trials: 0 |
| Behavioral intervention trials (adults): 2220,233 | |
| Behavioral intervention trials (adolescents): 0 | |
| Adverse events not reported | Medication-only trials: 3214,215,217 |
| Behavioral intervention trials (adults): 2233,234 | |
| Behavioral intervention trials (adolescents): 1235 | |
| Statistical Analysis | |
| Statistics inappropriate | Medication-only trials: 0 |
| Behavioral intervention trials (adults): 3220,232,233 | |
| Behavioral intervention trials (adolescents): 0 | |
| No controls for confounding (if needed) | Medication-only trials: 3214,218,219 |
| Behavioral intervention trials (adults): 2232,233 | |
| Behavioral intervention trials (adolescents): 2235,236 | |
| Intention-to-treat analysis not used | Medication-only trials: 5214,215,217–219 |
| Behavioral intervention trials (adults): 2220,233 | |
| Behavioral intervention trials (adolescents): 2235,236 | |
| Power analysis not done or not reported | Medication-only trials: 7124,214–219 |
| Behavioral intervention trials (adults): 4220,232–234 | |
| Behavioral intervention trials (adolescents): 1235 | |
| Results | |
| Loss to followup 26% or higher or not reported | Medication-only trials: 2214,215 |
| Behavioral intervention trials (adults): 1233 | |
| Behavioral intervention trials (adolescents): 0 | |
| Differential loss to followup 15% or higher or not reported | Medication-only trials: 1214,215 |
| Behavioral intervention trials (adults): 3220,233,234 | |
| Behavioral intervention trials (adolescents): 1236 | |
| Outcome measures not standard, reliable, or valid in all groups | Medication-only trials: 0 |
| Behavioral intervention trials (adults): 1220 | |
| Behavioral intervention trials (adolescents): 0 | |
| Discussion | |
| Results do not support conclusions, taking possible biases and limitations into account | Medication-only trials: 0 |
| Behavioral intervention trials (adults): 0 | |
| Behavioral intervention trials (adolescents): 0 | |
| Results not discussed within context of prior research | Medication-only trials: 0 |
| Behavioral intervention trials (adults): 0 | |
| External validity: population not representative of US population relevant to these treatments | Medication-only trials: 3215,217,218 |
| Behavioral intervention trials (adults): 1220 | |
| Behavioral intervention trials (adolescents): 0 | |
| Funding/sponsorship not reported | Medication-only trials: 6214–219 |
| Behavioral intervention trials (adults): 3220,232,234 | |
| Behavioral intervention trials (adolescents): 1235 | |
N/A, not applicable.
| Author | Total Enrollment | Total Dropouts | Group 1 Treatment (% dropout) | G2 Treatment (% dropout) | G3 Treatment (% dropout) | G4 Treatment (% dropout) |
|---|---|---|---|---|---|---|
| Medication Trials | ||||||
| Attia et al., 1998206 | 33 | 1 (+1 unreliable self-reporter) (3%) | Fluoxetine (NR) | Placebo (NR) | ||
| Kaye et al., 2001207 | 39 | 26 (66%) | Fluoxetine (16% at 30 days, 47% at 1 year) | Placebo (5% at 30 days, 85% at 1 year) | ||
| Biederman et al.,1985209 | 25 | 0 (0%) | Amitriptyline (0%) | Placebo (0%) | ||
| Halmi et al.,1986208 | 72 | 18 (25%) | Amitriptyline (30%) | Cyproheptadine (25%) | Placebo (20%) | |
| Hill, et al., 2000212 | 15 | 0 (0%) | Recombinant human growth hormone (0%) | Placebo (0%) | ||
| Klibanski et al., 1995210 | 48 | 4 (8%) | Estrogen/progestin (14%) | Control (4%) | ||
| Miller, Grieco, and Klibanski 2005211 | 38 | 5 (13%) | Testosterone (NR) | Placebo (NR) | ||
| Birmingham, Goldner, and Bakan1994213 | 54 | 19 (35%) | Zinc (39%) | Placebo (32%) | ||
| Behavioral Intervention Trials (Adult) | ||||||
| Channon et al., 1989225 | 24 | 3 (13%) | CBT (0%) | Behavioral treatment (13%) | Control (25%) | |
| McIntosh et al., 2005224 | 56 | 21 (38%) | CBT (37%) | Interpersonal psychotherapy (43%) | Nonspecific supportive clinical management (31%) | |
| Pike et al., 2003223 | 33 | 3 (9%) | CBT (0%) | Nutritional counseling (20%) | ||
| Dare et al., 2001228 | 84 | 30 (36%) | Focal psychotherapy (43%) | Family therapy (27%) | Cognitive analytic therapy (41%) | Routine (32%) |
| Treasure et al., 1995226 | 30 | 10 (33%) | Educational behavioral therapy (38%) | Cognitive analytic therapy (29%) | ||
| Crisp et al., 1991227 and Gowers et al.,1994238 | 90 | 17 (19%) | Inpatient (40%) | Outpatient psychotherapy/family therapy/dietary counseling (10%) | Group therapy (15%) | No further treatment (0%) |
| Behavioral Intervention Trials (Adolescent) | ||||||
| Eisler et al., 2000221 | 40 | 4 (10%) | Conjoint family therapy (11%) | Separated family therapy (10%) | ||
| Geist et al., 2000229 | 25 | 0 (0%) | Family therapy (0%) | Family group psychoeducation (0%) | ||
| Russell et al., 1987231 and Eisler et al., 1997239 | 80 | 28 (35%) | Family therapy (37%) | Individual therapy (33%) | ||
| Robin et al., 1994230 and Robin, Siegel, and Moye 1995237 | 24 | 2 (8%) | Behavioral family systems therapy (8%) | Ego-oriented individual therapy (8%) | ||
| Lock et al., 2005222 | 86 | 17 (20%) | Long-term treatment (24%) | Short-term treatment (16%) | ||
CBT, cognitive behavioral therapy; NR, not reported.
Of the 19 studies rated fair or good, 10 were conducted in the United States, six in the United Kingdom, two in Canada, and one in New Zealand. A total of 891 individuals participated in fair or good clinical trials for AN. One study failed to report sex. From those studies that reported sex, 861 women and 23 men participated. Seventeen studies failed to report ethnicity for participants. Of those that did, 123 participants were identified as white, eight as Asian and three as other ethnicity.
| Source, Treatment, Setting, and Quality Score | Major Outcome Measures | Significant Change Over Time Within Groups | Significant Differences Between Groups at Endpoint | Significant Differences Between Groups in Change Over Time |
|---|---|---|---|---|
| Attia et al., 1998206 | Eating: | Both groups experienced decreased clinician-rated ED symptoms and illness severity, ED concerns, depressed mood, obsessive-compulsive symptoms, and food preoccupation and rituals. Both groups increased percent IBW. | No statistics reported. | No differences on any measures. |
| Fluoxetine vs. placebo | AN behavior | |||
| Inpatient | BSQ | |||
| Good | CGI | |||
| EAT | ||||
| YBC-EDS | ||||
| Biomarker: | ||||
| IBW | ||||
| Psych: | ||||
| BDI | ||||
| CGI | ||||
| SCL-90 | ||||
| Kaye et al., 2001207 | Eating: | Fluoxetine completers experienced decreased anxious and depressed mood and increased percent ABW | No differences on any measures. | No differences on any measures. |
| Fluoxetine vs. placebo | YBC-EDS | |||
| Inpatient and outpatient | Biomarker: | |||
| Fair | ABW | |||
| Psych: | ||||
| HAM-A | ||||
| HDRS | ||||
| YBOCS | ||||
| Biederman et al., 1985209 | Eating: | No statistics reported. | No differences on any measures. | No statistics reported. |
| Amitriptyline vs. placebo | EAT | |||
| Inpatient and outpatient | Biomarker: | |||
| Fair | Weight | |||
| Psych: | ||||
| Global severity | ||||
| HSCL | ||||
| SADS-C | ||||
| Halmi et al., 1986208 | Eating: | No statistics reported. | Cyproheptadine associated with fewer days to target weight, higher caloric intake, and less depressed mood compared to placebo. | No statistics reported. |
| Amitriptyline vs. cyproheptadine vs. placebo | Caloric intake | BN subgroup: amitriptyline associated with improved tx efficacy compared to cyproheptadine; neither drug differed from placebo. | ||
| Inpatient | Biomarker: | For non-BN subgroup: cyproheptadine associated with improved tx efficacy compared to placebo. No other subgroup comparisons were significant. | ||
| Fair | Weight | |||
| Psych: | ||||
| HAM-D | ||||
| BDI | ||||
| SCL-90 | ||||
| Hill et al., 2000212 | Biomarker: | No statistics reported. | rhGH associated with fewer days to restoration of normal orthostatic response compared to placebo. | No statistics reported. |
| rhGH vs. placebo | Orthostasis | |||
| Inpatient | Weight | |||
| Good | ||||
| Klibanski et al., 1995210 | Eating: | No statistics reported. | No differences on any measures. | No differences on any measures. |
| Estrogen/progestin vs. nonmedication control | Recovery | |||
| Outpatient | Remission | |||
| Fair | Biomarker: | |||
| Bone density | ||||
| Percent Body fat | ||||
| Percent IBW | ||||
| Weight | ||||
| Miller et al., 2005211 | Biomarker: | No statistics reported. | Testosterone associated with less depressed mood compared to placebo. | Depressed mood increased less in testosterone-treated group. |
| Testosterone vs. placebo | BMI | |||
| Setting unknown | IBW | |||
| Fair | Psych: | |||
| BDI | ||||
| Birmingham et al., 1994213 | Biomarker: | No statistics reported. | No differences on any measures. | Zinc superior to placebo in rate of BMI increase. |
| Zinc vs. placebo | BMI | |||
| Inpatient | Percent body fat | |||
| Fair | Weight | |||
ABW, average body weight; AN, anorexia nervosa; BDI, Beck Depression Inventory; BMI, body mass index; BN, bulimia nervosa; BSQ, Body Shape Questionnaire; CGI, Clinical Global Impressions; EAT, Eating Attitudes Test; ED, eating disorders; HAM-A, Hamilton Anxiety Inventory; HAM-D, Hamilton Depression Inventory; HDRS, Hamilton Depression Rating Scale; HSCL, Hopkins Symptom Checklist; IBW, ideal body weight; Psych, psychiatric and psychological; rhGH, recombinant human grown hormone; SADS-C, Schedule for Affective Disorders and Schizophrenia-Change Version; SCL-90, (Hopkins) Symptom Checklist; tx, treatment; vs., versus; YBC-EDS, Yale-Brown-Cornell Eating Disorders Scale; YBOCS, Yale-Brown Obsessive-Compulsive scale.
Weight gain is the primary outcome variable in the treatment of AN. Secondary outcomes in this population include reduction of the psychological features of AN (e.g., body dissatisfaction and drive for thinness), reduction of associated behaviors such as overexercising, resumption of menses, and, in the bingeing and purging subtype, decreased binge eating and purging behaviors. Additional psychiatric outcomes include reduction in depression and anxiety.
Second-generation antidepressants. The term “second-generation antidepressants” is commonly used in the psychiatric and pharmacological literature to distinguish newer antidepressants such as selective norepinephrine reuptake inhibitors (SNRIs), selective serotonin reuptake inhibitors (SSRIs), bupropion, nefazodone, and trazodone from traditional or first-generation antidepressants such as tricyclic antidepressants and monoamine oxidase inhibitors. We adopted this term to be consistent in terminology with other research conducted in the area of psychopharmacology.
Fluoxetine. Two trials used fluoxetine at different stages of refeeding in AN patients. In an inpatient study, Attia et al.206 randomized 31 females between 16 and 45 years who had achieved weight restoration of at least 65 percent of ideal body weight (IBW) to fluoxetine (60 mg/day) or placebo. The mean BMI at randomization was 15 kg/m2. Patients continued to receive psychotherapy. No significant differences emerged between fluoxetine and placebo on weight gain (16 versus 13 pounds), psychological features of eating disorders, or depression or anxiety measures. Three percent of participants dropped out of fluoxetine treatment.
In the second study, patients were randomly assigned to either initiation on fluoxetine or placebo before inpatient discharge with a beginning dosage of 20 mg/day adjusted over 52 weeks to a maximum of 60 mg/day.207 The range of weight for all participants at randomization was 76 percent to 100 percent average body weight (ABW) with the majority above 90 percent. Outpatient psychotherapy was permitted. Dropout was considerable. Of 39 individuals randomized, only 13 remained at the 52-week endpoint (47 percent of fluoxetine and 85 percent of placebo). In this small group of completers, fluoxetine was associated with significantly greater weight gain, reduced anxiety, depression, obsessive-compulsive features, and eating-disorder-related symptoms.
Tricyclic antidepressants. Two trials of fair or good quality investigated tricyclic antidepressant medication use. Neither provided strong data supporting the use of these medications in treating AN patients.
Amitriptyline in doses up to 175 mg/day in 25 youth ages 11 to 17 years led to no significant differences in eating, mood, or weight outcomes in comparison to placebo.209 No patients dropped out in this trial. Halmi et al. compared amitriptyline (160 mg/day) versus cyproheptadine (32 mg/day) versus placebo in 72 females 13 to 36 years, determined to have AN according to the Diagnostic and Statistical Manual, third edition (DSM III).208 Daily caloric intake was significantly higher in cyproheptadine than placebo and significantly fewer days were needed to achieve target weight (in those who did) in both the amitriptyline and cyproheptadine groups, compared with placebo. Drop out was thirty percent in the amitriptyline group, 25 percent in the cyproheptadine group, and 20 percent in the placebo group.
Hormones. Investigators have studied three hormones in the treatment of AN: growth hormone (rGH), testosterone, and estrogen. Three weeks of transdermal testosterone (150 mg or 330 mg) administered to 38 patients with AN ages 18 to 50 led to greater decreases in depression in patients who were depressed at baseline, but differences in weight were not interpretable.211 Dropout was 13 percent overall.
Growth hormone (15 mg/kg/day) administered to 14 female and 1 male patient receiving inpatient care for AN led to fewer days to display normal orthostatic heart rate response to a standing challenge among the treatment group than among placebo group.212 No patient dropped out of this study.
Klibanski et al. compared estrogen/progesterone (0.625 mg Premarin® or 5 mg Provera® per day) versus nonmedication control in 48 females 16 to 43 years and found no differences between groups on bone density at 6 months.210 Dropout was 14 percent in hormone group and 4 percent in the nonmedication group.
Hormone treatment during the acute phase of AN illness does not appear to improve bone density.210 Scant, preliminary evidence suggests that rGH leads to faster normalization of orthostatic changes seen in AN212and that testosterone improves depression in individuals with AN and depressed mood.211
Nutritional supplements. The one study of nutritional supplements was performed in 54 female inpatients older than 15 years with 14 mg/day zinc. It provides preliminary evidence that zinc may increase the rate of increase in BMI.213 Dropout was 39 percent in zinc and 32 percent in placebo, suggesting that conclusions from this study must be viewed with great caution.
Summary of drug trials. All eight studies assessing the efficacy of medication interventions on AN examined weight gain; most reported on eating outcomes and some reported on additional symptom change.
Overall, none of the pharmacological interventions for AN had a significant impact on weight gain. Although tricyclic antidepressants may be associated with greater improvement in secondary mood outcomes, this outcome does not appear to be associated with improved weight gain. No trial has been adequately replicated.
Dropout rates for medication studies for AN are substantial, especially in outpatient trials. Conclusions drawn from studies with such high attrition must be reviewed with extreme caution.
Taken together, the literature regarding medication treatments for AN is sparse and inconclusive. The vast majority of studies had small sample sizes and rarely had adequate statistical power to allow for definitive conclusions. Many studies examined patients who were receiving additional treatments in conjunction with the study medication, including psychological interventions and concurrent pharmacological treatments. Some of these studies examined patients who were in inpatient settings, thus limiting generalizability to outpatient treatment. Only one conducted intention-to-treat analyses; the remaining studies reported completer analyses only. With one exception,209 no medication trials have focused on adolescent patients. Because followup was limited, assessing longer-term impact of interventions on such outcomes as bone density was impossible. Finally, only one male participated in any of these studies, thereby making it impossible to draw any conclusions about the pharmacological treatment of AN in boys and men.
| Source, Treatment, Setting, and Quality Score | Major Outcome Measures | Significant Change Over Time Within Groups | Significant Differences Between Groups at Endpoint | Significant Differences Between Groups in Change Over Time |
|---|---|---|---|---|
| Channon et al., 1989225 | Eating: | No statistics reported. | At 6-month FU, CBT associated with better psychosexual functioning than BT and BT was associated with greater improvement in menstrual functioning than CBT. | No statistics reported. |
| CBT vs. BT vs. ‘Usual care’ control | EDI | At 1-year FU, the BT group scored better than the CBT group on preferred weight. CBT and BT combined were associated with greater improvements on nutritional functioning than the control group. The control group showed greater improvements on drive for thinness than the combined CBT and BT groups. | ||
| Outpatient | M-R scale | |||
| Fair | Biomarker: | |||
| BMI | ||||
| M-R scale | ||||
| Psych: | ||||
| BDI | ||||
| MOCI | ||||
| M-R scale | ||||
| McIntosh et al., 2005224 | Eating: | Compared to IPT, NSCM associated with higher likelihood of ‘good’ global outcome. | NSCM superior to IPT in improving global functioning and eating restraint over 20 weeks. | |
| CBT vs. IPT vs. NSCM | EDE | NSCM superior to CBT in improving global functioning over 20 weeks. | ||
| Outpatient | EDI | CBT superior to IPT in improving eating restraint over 20 weeks. | ||
| Fair | Biomarker: | |||
| BMI | ||||
| Percent body fat | ||||
| Weight | ||||
| Psych: | ||||
| GAF | ||||
| HDRS | ||||
| Pike et al., 2003223 | Eating: | No statistics reported. | Compared to nutrition counseling, CBT associated with lower percentage tx failures, higher percentage ‘good’ outcome, and longer time (weeks) to relapse. | No statistics reported. |
| CBT vs. nutritional counseling | Recovery | |||
| Outpatient | Relapse | |||
| Fair | Tx failure | |||
| M-R scale | ||||
| Dare et al., 2001228 | Eating: | No statistics reported. | At 1-year FU, compared to routine tx, focal and family tx associated with higher weight; also, higher percentage of patients in focal and family tx were recovered or significantly improved (i.e., > 85% IBW, no/few menstrual or BN symptoms). | No statistics reported. |
| CAT vs. focal vs. family vs. ‘routine’ therapy | M-R scale | |||
| Outpatient | Recovery | |||
| Fair | Biomarker: | |||
| BMI | ||||
| Percent ABW | ||||
| M-R scale | ||||
| Psych: | ||||
| M-R scale | ||||
| Treasure et al., 1995226 | Eating: | No statistics reported. | Compared to EBT, CAT associated with higher self-rating of improvement. | No statistics reported. |
| CAT vs. EBT | M-R scales | |||
| Outpatient | Biomarker: | |||
| Fair | BMI | |||
| Weight | ||||
| Psych: | ||||
| M-R scales | ||||
| Self progress scale | ||||
| Crisp et al., 1991227 and Gowers et al., 1994238 | Eating: | At 1-year FU, global score and menstruation improved in all 4 groups, nutrition score improved in 3 active tx groups, and mental state improved in outpatient family/diet counseling group. | Compared to ‘no formal tx’, outpatient family/diet counseling associated with higher weight and BMI at 1- and 2-year FU. | Compared to ‘no formal tx,’ weight increased more at 1-year FU in all 3 active groups. |
| Inpatient tx vs. outpatient individual and family therapy and dietary counseling vs. group therapy vs. no formal tx | M-R scale | At 2-year FU, mental state improved in outpatient family/diet counseling; global score, menstruation, and nutrition improved in groups that received outpatient family/diet counseling and no formal tx. | Weight increased more at 2-year FU in outpatient family/diet counseling compared to ‘no formal tx’ group. | |
| Inpatient and outpatient | Remission | |||
| Fair | Biomarker: | |||
| BMI | ||||
| M-R scale | ||||
| Weight | ||||
| Psych: | ||||
| M-R scale | ||||
ABW, average body weight; BDI, Beck Depression Inventory; BMI, body mass index; BT, behavioral therapy; CAT, cognitive-analytic therapy; CBT, cognitive behavioral therapy; EBT, educational behavioral therapy; EDE, Eating Disorders Examination; EDI, Eating Disorders Inventory (EDI-2, Garner, 1991); FU, follow-up; GAF, Global Assessment of Functioning [DSM-IV]; HDRS, Hamilton Depression Rating Scale; IBW, ideal body weight; IPT, interpersonal therapy; MOCI, Maudsley Obsessional Compulsive Index; M-R, Morgan and Russell; NSCM, nonspecific supported clinical management, Psych, psychiatric and psychological; pt, patients; Tx, treatment, vs., versus.
| Source, Treatment, Setting, and Quality Score | Major Outcome Measures | Significant Change Over Time Within Groups | Significant Differences Between Groups at Endpoint | Significant Differences Between Groups in Change Over Time |
|---|---|---|---|---|
| Eisler et al., 2000221 | Eating: | No statistics reported. | No statistics reported. | CFT superior to SFT in reducing ED-related traits, depression, and obsessionality. |
| CFT vs. SFT | Bulimic symptoms | |||
| Outpatient | EAT | |||
| Good | EDI | |||
| Biomarker: | ||||
| Percent ABW | ||||
| BMI | ||||
| Weight | ||||
| Psych: | ||||
| MOCI | ||||
| SMFQ | ||||
| Depression | ||||
| Obsessionality | ||||
| Geist et al., 2000229 | Eating: | No statistics reported. | No differences on any measures. | No differences on any measures. |
| Family therapy vs. family group psycho-education | EDI | |||
| Inpatient | Biomarker: | |||
| Fair | Percent IBW | |||
| Psych: | ||||
| BSI | ||||
| CDI | ||||
| FAM III | ||||
| Russell et al., 1987231 and Eisler et al., 1997239 | Eating: | No statistics reported. | No statistics reported. | Among early onset, less chronic AN patients, family therapy superior to individual therapy in improving nutritional status, menstrual and psychosexual function, and weight over 1 year tx; family therapy also more likely associated with a ‘good’ outcome over 1-year tx and 5-year FU. |
| Family therapy vs. individual therapy | M-R scales | |||
| Outpatient | Readmit rate | |||
| Fair | Biomarker: | |||
| Percent ABW | ||||
| M-R scales | ||||
| Weight | ||||
| Psych: | ||||
| M-R scales | ||||
| Robin et al., 1994230 and Robin et al., 1995237 | Eating: | No statistics reported. | No differences on any measures. | BFST superior to EOIT in increasing BMI to post-tx and 1-year FU, and in improving mother's positive communication at FU. |
| BFST vs. EOIT | EAT | |||
| Outpatient and inpatient | EDI | |||
| Fair | Eating conflict | |||
| Biomarker: | ||||
| BMI | ||||
| Weight | ||||
| Menstruation | ||||
| Psych: | ||||
| BDI | ||||
| BSQ | ||||
| PARQ | ||||
| IBC | ||||
| Lock et al., 2005222 | Eating: | No differences on any measures. | No differences on any measures. | No differences on any measures among those with most severe YBC-EDS symptoms. |
| Long-term vs. short-term family therapy | EDE | Longer-term tx associated with better BMI outcome in those with most severe ED symptoms, and with better EDE global outcome in those with non-intact families. | ||
| Outpatient | YBC-EDS | |||
| Good | Biomarker: | |||
| BMI | ||||
| Weight | ||||
ABW, average body weight; AN, anorexia nervosa; BDI, Beck Depression Inventory; BFST, behavioral family systems therapy; BMI, body mass index; BSI, Brief Symptom Inventory; BSQ, Body Shape Questionnaire; CDI, Children's Depression Inventory; CFT, conjoint family therapy; EAT, Eating Attitudes Test; ED, eating disorders; EDE, Eating Disorders Examination; EDI, Eating Disorders Inventory; EOIT, ego-oriented individual therapy; FAM-III, Family Assessment Measure; FU, follow-up; IBC, Interaction Behavior Code; IBW, ideal body weight; MOCI, Maudsley Obsessional Compulsive Index; M-R, Morgan and Russell; PARQ, Parent Adolescent Relationship Questionnaire; Psych, psychiatric and psychological; SFT, separated family therapy; SMFQ, Short Mood and Feeling Questionnaire; tx, treatment; vs., versus; YBC-EDS, Yale-Brown-Cornell Eating Disorders Scale.
Behavioral interventions for adults with anorexia nervosa. In the psychotherapy trials for adults only and the combined adult and adolescent trials, investigators tested CBT (three trials), various types of nonspecific therapy (three), family therapies (two), CAT (two), dietary counseling (one), interpersonal psychotherapy (IPT) (one), behavioral therapy (BT) (one), and focal analytic therapy (one).
Cognitive behavioral therapy. CBT studies generally used a form of therapy tailored to AN that focused on cognitive and behavioral features associated with the maintenance of eating pathology. Of the three CBT studies, one followed inpatient weight restoration223 and two were done in the underweight state.224, 225 CBT significantly reduced relapse risk and increased the likelihood of good outcome compared to nutritional counseling based on nutritional education and food exchanges after inpatient weight restoration.223 Of those receiving CBT, a greater number of individuals with good outcomes were also receiving antidepressant medication.
One study of underweight AN outpatients compared CBT with IPT and nonspecific supportive clinical management (NSCM).224 IPT in the treatment of AN is based on IPT used for the treatment of depression240 and BN;241 it focuses on one of four interpersonal problem areas: interpersonal disputes, role transitions, grief, or interpersonal deficits. NSCM was designed for this study to mimic the type of treatment an individual could receive in the community from a provider familiar with the treatment of ED and incorporates elements of sound clinical management and supportive psychotherapy. In an intention-to-treat analysis, NSCM performed significantly better than IPT in producing global good outcome ratings; CBT outcomes fell in between and were not significantly different from the other two outcomes.224 The second study compared CBT with BT and a control group for 6 months.225 At 12-month followup, CBT showed no advantage over BT or control in eating, mood, or weight outcomes.
On the basis of one trial, preliminary evidence suggests that CBT delivered after weight restoration may help to decrease relapse. In contrast, when delivered during the acute phase of the illness, CBT does not appear to offer significant advantage over NSCM, which did offer advantage over IPT. No evidence suggests that nutritional counseling alone is efficacious in the treatment of AN.
Cognitive analytic therapy (CAT). The two studies that utilized CAT, a treatment which integrates psychodynamic with behavioral factors and focuses on interpersonal and transference issues, failed to find any advantage of CAT over educational behavioral therapy or focal family therapy in eating, mood, or weight outcomes.226, 228 Focal family therapy focused on eliminating the eating disorder from its controlling role in determining the relationship between the patient and other family members.
Family therapy. Of the three studies in this category, Dare et al. found family therapy to be superior to routine treatment but equivalent to a focal time-limited psychodynamic psychotherapy in increasing percentage of adult body weight, restoring menstruation, and decreasing bulimic symptoms; overall clinical improvement was modest, however.228
Crisp et al.227 found outpatient individual and family therapy with variable numbers of sessions to be superior to referral to a family physician for increased weight at 1- and 2-year followup.
The efficacy of family therapy in treating adults with AN has not yet been completely addressed. It may be more effective than medical management by a family physician and routine treatment; family therapy (including the family of origin) may be more effective in younger patients with shorter duration of illness. No studies have explored family therapy for adult patients that included the family of insertion (spouse and offspring of the patient) rather than the family of origin.
Family therapy. Four family therapy studies focused exclusively on adolescents and one combined adolescent and adult patients.231 Family therapy was more effective for younger patients with earlier onset than for older patients with a more chronic course in the United Kingdom trial performed by Russell et al.231 and the followup by Eisler et al.239 These studies did not yield evidence that the specific type of family therapy administered was helpful for the older more chronic group.228, 231 A form of family therapy focusing initially on parental control of re-nutrition delivered in two different manners revealed a significant advantage of conjoint therapy (family treated as a unit) over separated family therapy (parents and patient seen separately) on eating and mood outcomes but not on weight outcomes.221
In a second study, no differences emerged between family therapy and family psychoeducation on any outcomes at 16 weeks.229 For a specific form of family therapy, when delivered in conjunction with a common medical and dietary regimen, behavioral family systems therapy (BFST), also characterized initially by parents taking control of renutrition, Robin et al. found BFST to be superior to ego-oriented individual therapy in increasing BMI and restoring menstruation, although neither therapy was superior on eating or mood outcomes.230, 237 Addressing the issue of optimal duration of family therapy, Lock et al. randomized adolescents to either short (10 sessions over 6 months) or long (20 sessions over 12 months) manualized family therapy based on the initial parental control of refeeding model242 and found no differences on eating, psychiatric, or biomarker outcomes.222 Longer-term family therapy suggested that those with more severe eating-related obsessions and nonintact families did better with longer treatment. Finally, in the one study that included both adolescents and adults, family therapy was superior to individual therapy for adolescent patients with shorter duration of illness. This difference did not emerge for adult patients with longer duration of illness.231 Although few differences were observed across interventions, specific forms of family interventions did consistently show improvement over time with adolescent patients.
Summary of behavioral interventions for adults and adolescents with anorexia nervosa. Overall, one study of adults provides tentative evidence that CBT may reduce relapse risk for adults with AN after weight restoration has been accomplished.223 Sufficient evidence does not exist to determine whether CBT is effective during the acute phase of the illness (i.e., in the underweight state before weight restoration); one study found that a manualized nonspecific supportive treatment (NCSM) was more effective than CBT or IPT in terms of global outcome during the acute phase.224 The three family therapy studies provide no support for the efficacy of the type of family therapy delivered in adults with AN with longer duration of illness; the superiority of this approach for younger patients with a shorter illness course is based on one study.231 Two studies failed to find any benefit of CAT for eating, mood, or weight outcomes when compared to other treatments for this population.226, 228 No methodologically sound studies that systematically tested combinations of medication and psychotherapy were identified.
Serious methodological concerns arose with some of these trials. Two were very small (8 to 12 participants per group),225, 230 which does not provide adequate statistical power for the comparative analyses conducted. In addition, both had marked pretreatment differences between groups. Failure to control for contact time with a clinician while comparing multiple treatments, with some groups getting up to 80 percent more time in treatment than others, was another problem.228 In addition, only one group of researchers conducted a follow-up study to determine the long-term impact of their interventions.239
Five studies evaluated family therapy in adolescents with AN. Overall, family therapy based on principles of parental control of initial refeeding leads to clinically meaningful weight gain and psychological change. However, the majority of family therapy studies compares one form of family therapy to another form and were underpowered to detect significant differences between active similar treatments. One study suggested that family therapy was superior to a non-family therapy comparison intervention for adolescent patients with relatively short duration of illness.231 One additional study reported significantly greater weight gain at the end of treatment in family therapy than in ego-oriented individual therapy for adolescent AN patients.230 The other three studies all involved some sort of family treatment - either comparing conjoint to separated family therapy or comparing family therapy to family psychoeducation.221, 229 Conjoint therapy was superior to separated family therapy for improving eating and mood but not weight outcomes.221 Similarly, one study examining family therapy versus family psychoeducation found no differences between groups.229
Inadequate statistical power was a common problem among the behavioral interventions in AN, and power calculations were rarely reported. No studies had a pure no-treatment condition, which is appropriate given the gravity of the illness, although “usual” treatment took various forms. Many of these studies had adequate power to detect pre-post within-group differences or differences between no treatment and an active treatment, but few were adequately powered to detect differences across two or more treatment groups.
| Intervention | Adverse Events Reported* |
|---|---|
| Medication Trials | |
| Fluoxetine vs. placebo206 | Fluoxetine group: insomnia and agitation; blurred vision |
| Fluoxetine vs. placebo207 | No adverse events observed |
| Amitriptyline vs. cyproheptadine vs. placebo208 | Amitriptyline: drowsiness, excitement, confusion, increased motor activity, tachycardia, dry mouth, constipation. |
| Cyproheptadine: no consistent pattern observed | |
| Placebo: drowsiness, excitement, increased motor activity. | |
| Amitriptyline vs. placebo209 | Amitriptyline group: diaphoresis (2), drowsiness (6), dry mouth (4), blurred vision (1), urinary retention (1), hypotension (2), leucopenia (1) |
| Placebo: dry mouth (2), palpitations (1), dizziness (2) | |
| Estrogen vs. nonmedication control210 | Estrogen group: depression (1), hyperlipidemia (1) |
| Growth hormone vs. placebo212 | No adverse events observed |
| Testosterone vs. placebo211 | Testosterone group: Mild skin irritation at patch site (3), increased depression (1), increased fatigue and vertigo (1), nausea (1) |
| Placebo: Mild skin irritation at patch site (1) | |
| Zinc vs. placebo213 | NR |
| Behavioral Interventions Trials | |
| Behavioral family systems vs. ego-oriented individual230,237 | NR |
| CBT vs. behavioral therapy vs. control225 | NR |
| CBT vs. interpersonal psychotherapy vs. nonspecific supportive clinical management224 | No adverse events observed |
| CBT vs. nutritional counseling223 | CBT: Depression and suicidal ideation (1) |
| Nutritional: Depression and suicidal ideation (3) | |
| Cognitive analytical vs. educational behavioral226 | NR |
| Conjoint family vs. separated family221 | NR |
| Family therapy vs. family group psychoeducation229 | NR |
| Family therapy vs. nonspecific individual231,239 | NR |
| Focal psychotherapy vs. family therapy vs. cognitive analytical vs. routine treatment228 | NR |
| Inpatient + 12 individual/family vs. outpatient individual/family variable vs. 10 outpatient group vs. family physician vs. dietary counseling227,238 | NR |
| Short- vs. long-term family therapy222 | NR: Dropout attributed to other psychological problems |
CBT, cognitive behavioral therapy; NR, not reported; vs., versus.
If no numbers appear in parentheses, authors had only listed adverse events but not reported the number of cases.
For the trials using second-generation antidepressants, we refer to recent publications on the comparative effectiveness and tolerability of second-generation antidepressants.243 Common side effects associated with the use of second-generation antidepressants in major depressive disorder are nausea, headache, diarrhea, constipation, dizziness, fatigue, sweating, and sexual side effects. Rare but severe adverse events include hyponatremia, suicidality, and seizures. Up to 90 percent of patients experienced at least one adverse event during treatment. Overall, discontinuation rates attributed to adverse events did not differ significantly among individual drugs and ranged from 6 percent to 14 percent. The authors report no substantial differences in adverse events with respect to drugs that were also used in eating disorders trials (i.e., citalopram, fluoxetine, fluvoxamine, and sertraline).
Given the small sample sizes and completion rates of the two fluoxetine trials, we cannot draw definitive conclusions regarding whether harms associated with fluoxetine treatment in the underweight state differ in any way from treatment of normal-weight individuals with other psychiatric diagnoses. In these studies, Kaye et al. failed to report adverse events;207 Attia et al. reported one case of insomnia and agitation and one case of blurred vision.206
For tricyclic antidepressants, Halmi et al. reported sporadic cases of drowsiness, excitement, confusion, increased motor activity, tachycardia, dry mouth, and constipation associated with amitriptyline;208 however, the rate of adverse events did not differ from placebo.
The only specific adverse event associated with testosterone administration was skin irritation at the patch site. Estrogen administration yielded one case of depression and one of hyperlipidemia. No adverse effects were reported with either growth hormone or zinc administration.
We found no consistent factors associated with better or poorer treatment outcome across studies. In medication studies, individuals with the nonbulimic subtype of AN had better therapeutic outcomes on cyprohoptadine than amitriptyline and placebo.208 Bone density increased more in women with AN who were less than 70 percent of ideal body weight on estrogen replacement therapy.210 These subgroup analyses had very small samples, and conclusions should be regarded as tentative.
One observation that was an artifact of experimental design,223 post-weight restoration trial of CBT and nutritional counseling is related to patients being permitted to be on antidepressant medication. In one trial, a significantly higher percentage of CBT successes occurred among patients on medication. Miller et al.211 reported that 3 weeks of transdermal testosterone was superior in decreasing depression in individuals who were depressed at baseline.
In terms of family therapy, Lock et al. found that adolescents with severe eating-related obsessive-compulsive-related thinking and those who come from nonintact families benefitted from longer-term rather than shorter-term manual-based family therapy treatment.222 Eisler et al. found that families that scored higher on maternal criticism did better in separated rather than conjoint family therapy.221
Finally, with reference to weight gain, family therapy was more effective for AN patients whose illness began at an early age and had not become chronic.231, 239
The total number of individuals enrolled in the eight medication trials that reported the sex of the participants was 320. Of those, one was male. No medication studies reported differential outcome by age. With the exception of the one rGH trial212 and one amitriptyline trial,209 no medication studies have explicitly focused on the treatment of adolescent AN. Not a single medication study reported race or ethnicity of participants. Of the eight trials, seven were conducted in the United States and one in Canada. Based on these results, we conclude that no information exists regarding differential efficacy of pharmacotherapy interventions for AN by sex, gender, age, race, ethnicity, or cultural group.
The total number of individuals enrolled in the 11 psychotherapy trials was 572; of these, 22 were men or boys. Only two trials reported race or ethnicity of participants; they included eight Asian Americans, 10 Hispanic Americans, no African Americans, and three individuals of “other” race or ethnicity. In no instance were results analyzed specifically by race or ethnic group. No data exist regarding differential efficacy of psychotherapeutic treatment for AN by sex, gender, race, ethnicity, or cultural group.
In terms of age, scant evidence shows that interventions involving the family have greater efficacy for individuals below the age of 15 than for patients above that age. This information is based solely on studies by just one team of investigators who found family therapy to be more effective for adolescent AN patients with a shorter duration of illness than for adults with a more chronic course.231, 239 However, no definitive replications have been done. Moreover, no studies have explored the role of family therapy in adults focusing on the family of insertion rather than family of origin, which may be the relevant comparison, or other adaptation of family therapy for adults or adolescents.
This chapter presents results of our literature search and our findings for the four key questions (KQs) that pertain to bulimia nervosa (BN), including the efficacy of various treatments or combinations of treatments (KQ 1), harms associated with the treatment or combination of treatments (KQ 2), factors associated with the efficacy of treatment (KQ 3), and whether the efficacy of treatment differs by sex, gender, age, race, ethnicity, or cultural groups (KQ 4).
We identified 47 studies reported in 58 publications addressing treatment efficacy for BN. Of these, 14 were medication-only trials.244–257 We rated two of these trials as good,246, 248 9 as fair,244, 247, 249–255, 257 and three as poor.245, 256, 258 The drugs studied included second-generation antidepressants,244, 247–250, 252, 254, 255 tricyclic antidepressants,257 an anticonvulsant,251, 259 monoamine-oxidase inhibitors (MAOIs),253 and a 5HT3 antagonist.246
Six trials combined medication with behavioral interventions.260–265 Three used second-generation antidepressants,261, 262, 265 one used a tricyclic antidepressant,260 and two used both a second-generation antidepressant and a tricyclic antidepressant sequentially.263, 264 Of these, we rated two as good264, 265 and four as fair.260–263
We identified 19 behavioral intervention psychotherapy studies published in 24 articles.266–289 We rated three psychotherapy intervention trials as good,269, 270, 282 10 as fair,266, 273, 274, 276, 278, 280, 281, 283, 287, 288 and six as poor.275, 279, 284–286, 289 Of the 13 fair- and good-rated studies, 11 used some form of cognitive-behavioral therapy (CBT) in comparison to other interventions,266, 269, 270, 273, 274, 276, 278, 280, 283, 287, 288 one used dialectical behavior therapy (DBT),282 and one used nutritional management and stress management.281
We also identified five trials of various self-help methods.290–294 We rated four as fair290–293 and one as poor.294
Finally, we identified three studies of “other” interventions including active light,295 guided imagery,296 and crisis prevention.297 We rated all three studies as fair.
| Reasons Contributing to Poor Ratings | Types of Intervention, Number of Times Flaw Was Detected, and Citations |
|---|---|
| Research Aims | |
| Hypothesis not clearly described | Medication-only trials: 0 |
| Behavioral intervention and self-help trials: 0 | |
| Study Population | |
| Characteristics not clearly described | Medication-only trials: 0 |
| Behavioral intervention and self-help trials: 1289 | |
| No specific inclusion or exclusion criteria | Medication-only trials: 0 |
| Behavioral intervention and self-help trials: 0 | |
| Randomization | |
| Protections against influence not in place | Medication-only trials: 0 |
| Behavioral intervention and self-help trials: 1284 | |
| Approach not described | Medication-only trials: 1245 |
| Behavioral intervention and self-help trials: 4275,279,284,294,298 | |
| Whether randomization had a fatal flaw not known | Medication-only trials: 2245,256 |
| Behavioral intervention and self-help trials: 6275,279,284,286,289,294,298 | |
| Comparison group(s) not similar at baseline | Medication-only trials: 2245,256 |
| Behavioral intervention and self-help trials: 1289 | |
| Blinding | |
| Study subjects | Medication-only trials: 0 |
| Behavioral intervention and self-help trials: 1289 | |
| Investigators | Medication-only trials: 0 |
| Behavioral intervention and self-help trials: 1289 | |
| Outcomes assessors | Medication-only trials: 2245,256 |
| Behavioral intervention and self-help trials: 7275,279,284–286,289,294,298 | |
| Interventions | |
| Interventions not clearly described | Medication-only trials: 0 |
| Behavioral intervention and self-help trials: 0 | |
| No reliable measurement of patient compliance | Medication-only trials: 1256 |
| Behavioral intervention and self-help trials: 3279,285,289 | |
| Outcomes | |
| Results not clearly described | Medication-only trials: 0 |
| Behavioral intervention and self-help trials: 0 | |
| Adverse events not reported | Medication-only trials: 0 |
| Behavioral intervention and self-help trials: 6275,279,284–286,289 | |
| Statistical Analysis | |
| Statistics inappropriate | Medication-only trials: 0 |
| Behavioral intervention and self-help trials: 0 | |
| No controls for confounding (if needed) | Medication-only trials: 1245 |
| Behavioral intervention and self-help trials: 1289 | |
| Intention-to-treat analysis not used | Medication-only trials: 1256 |
| Behavioral intervention and self-help trials: 5275,284–286,289 | |
| Power analysis not done or not reported | Medication-only trials: 1245 |
| Behavioral intervention and self-help trials: 7275,279,284–286,289,294,298 | |
| Results | |
| Loss to followup 26% or higher or not reported | Medication-only trials: 0 |
| Behavioral intervention and self-help trials: 2289,294,298 | |
| Differential loss to followup 15% or higher or not reported | Medication-only trials: 1245 |
| Behavioral intervention and self-help trials: 3275,286,289 | |
| Outcome measures not standard, reliable, or valid in all groups | Medication-only trials: 0 |
| Behavioral intervention and self-help trials: 0 | |
| Discussion | |
| Results do not support conclusions, taking possible biases and limitations into account | Medication-only trials: 0 |
| Behavioral intervention and self-help trials: 0 | |
| Results not discussed within context of prior research | Medication-only trials: 1256 |
| Behavioral intervention and self-help trials: 0 | |
| External validity: population not representative of US population relevant to these treatments | Medication-only trials: 1256 |
| Behavioral intervention and self-help trials: 6279,284–286,289,294,298 | |
| Funding/sponsorship not reported | Medication-only trials: 0 |
| Behavioral intervention and self-help trials: 4279,285,286,289 | |
Of the 38 studies rated fair or good, 19 were conducted in the United States, five in Canada, four in Germany, three in the United Kingdom, two in Australia, and one each in Austria, Finland, New Zealand, and Norway. In addition, one multinational trial had US and Canadian sites; another had German and Australian sites.
Of the fair and good studies, three failed to report the age of participants; of the remainder, the age range of participants was 16 to 61 years with the majority of participants being adults. A total of 3,403 individuals participated in fair or good clinical trials for BN. From those that reported sex, 2,985 women and 23 men participated.
Thirty-one studies failed to report the race or ethnicity of participants. Of those that did, 1,203 participants were identified as white, 79 as nonwhite, 27 as African American, 40 as Hispanic American, 30 as Asian or Pacific Islander, and one as Native American.
| Author | Total Enrollment, N | Total Dropouts N (% dropout) | G1 Treatment (% Dropout) | G2 Treatment (% Dropout) | G3 Treatment (% Dropout) | G4 Treatment (% Dropout) G5 Treatment (% Dropout) |
|---|---|---|---|---|---|---|
| Medication Trial | ||||||
| Beumont et al., 1997244 | 67 | 27 (40%) | Fluoxetine (50%) | Placebo (30%) | ||
| Fichter et al., 1991248 | 39 | 0 (0%) | Fluoxetine (0%) | Placebo 0% | ||
| Fluoxetine BN Collaborative Study Group, 1992249 | 387 | 117 (30%) | Placebo (37%) | Fluoxetine, 20 mg (23%) | Fluoxetine, 60 mg (30%) | |
| Goldstein et al.,1995250 | 398 | 173 (43%) | Fluoxetine (40%) | Placebo (52%) | ||
| Kanerva et al., 1995252 | 50 | 4 (8%) | Fluoxetine (8%) | Placebo (8%) | ||
| Romano et al., 2002254 | 150 | 131 (87%) | Fluoxetine (83%) | Placebo (92%) | ||
| Fichter et al., 1996247 and Fichter et al., 1997299 | 72 | 24 (33%) | Fluvoxamine (51%) | Placebo (14%) | ||
| Pope et al., 1989255 | 46 | 4 (9%) | Trazodone (13%) | Placebo (4%) | ||
| Hoopes et al., 2003251 and Hedges et al., 2003259 | 68 | 28 (41%) | Topiramate (34%) | Placebo (47%) | ||
| Kennedy et al., 1993253 | 36 | 8 (21%) | Brofaromine (21%) | Placebo (24%) | ||
| Faris et al., 2000246 | 26 | 1 (4%) | Ondansetron (7%) | Placebo 0% | ||
| Walsh et al., 1991257 | 78 | 15 (19%) | Placebo (16%) | Desipramine (23%) | ||
| Medication Plus Behavior Intervention Trials | ||||||
| Goldbloom et al.,1997261 | 76 | 33 (43%) | Fluoxetine (39%) | CBT (35%) | Fluoxetine + CBT (55%) | |
| Mitchell et al., 2001262 | 91 | 2 (2%) | Placebo (5%) | Fluoxetine (0%) | Placebo + self-help manual (0%) | Fluoxetine + self-help manual (5%) |
| Walsh et al., 2004265 | 91 | 63 (69%) | Fluoxetine + guided self help (54%) | Placebo + guided self help (88%) | Fluoxetine (70%) | Placebo (64%) |
| Agras et al., 1992260 and Agras et al., 1994300 | 71 | 18 (25%) | Desipramine 16 weeks (NR) | Desipramine 24 weeks (NR) | Desipramine 16 weeks + CBT (NR) | Desipramine 24 weeks + CBT (NR) |
| CBT (NR) | ||||||
| Mitchell et al., 2002263 | 62 | 25 (40%) | IPT (32%) | Antidepressant medication (48%) | ||
| Walsh et al., 1997264 and Wilson et al., 1999301 | 120 | 41 (34%) | CBT + medication (NR) | CBT + Placebo (NR) | Supportive therapy + medication (NR) | Supportive therapy + placebo (NR) |
| Medication only (43%) | ||||||
| Behavioral Intervention Trials | ||||||
| Agras et al., 2000269 | 220 | 57 (26%) | CBT (28%) | IPT (24%) | ||
| Wolk and Devlin, 2001268 | 110 | 44 (40%) | CBT (NR) | IPT (NR) | ||
| Cooper and Steere, 1995274 | 31 | 4 (13%) | CBT (13%) | Behavioral therapy (13%) | ||
| Fairburn et al., 1991276 and Fairburn et al., 1993267 | 75 | 15 (20%) | CBT (16%) | Behavioral therapy (24%) | IPT (12%) | |
| Wilfley et al., 1993287 | 56 | 8 (14%) | CBT (33%) | IPT (11%) | Waiting list (0%) | |
| Wilson et al., 2002288 | 220 | Post treatment: 66 (30%), Follow up: 91 (41%) | CBT (NR) | IPT (NR) | ||
| Garner et al., 1993278 | 60 | 10 (17%) | CBT (17%) | Supportive expressive (17%) | ||
| Hsu et al., 2001280 | 100 | 27 (27%) | Nutritional therapy (39%) | Cognitive therapy (15%) | Cognitive and nutritional therapy (11%) | Sequential group (46%) |
| Sundgot-Borgen et al., 2002283 | 64 | 6 (9%) | Exercise (20%) | CBT (13%) | Nutrition (0%) | Waiting list (6%) |
| Healthy control (0%) | ||||||
| Chen et al., 2003273 | 60 | 16 (27%) | Individual CBT (27%) | Group CBT (27%) | ||
| Agras et al., 1989266 | 77 | 67 (13%) | Waiting list (5%) | Self monitoring (16%) | CBT (23%) | CBT + response prevention (6%) |
| Bulik et al., 1998270 and Bulik et al.,1998271 | 111 | 5 (5%) | Exposure to B-ERP (5%) | Exposure to P-ERP (6%) | Relaxation training (3%) | |
| Laessle et al., 1991281 | 55 | 7 (13%) | Nutritional management (19%) | Stress management (7%) | ||
| Safer, Telch, and Agras, 2001282 | 31 | 2 (6%) | Dialectical behavior therapy (13%) | Waiting list (7%) | ||
| Self-help Trials | ||||||
| Bailer et al., 2004290 | 81 | 25 (31%) | Self help (25%) | CBT (37%) | ||
| Carter et al., 2003291 | 85 | 20 (24%) | CBT (18%) | Nonspecific (25%) | Waiting list (28%) | |
| Durand and King, 2003292 | 68 | 14 (21%) | GP self-help (24%) | Specialist treatment (18%) | ||
| Thiels et al., 1998293 | 62 | 13 (21%) | CBT (13%) | Guided self change (29%) | ||
| Other Interventions | ||||||
| Braun et al., 1999295 | 34 | 10 (29%) | Active light (31%) | Dim light (28%) | ||
| Mitchell et al., 2004297 | 57 | 17 weeks: 9 (16%); | Crisis prevention 17 weeks: (10%), | Follow up 17 weeks: (22%), | ||
| 43 weeks: 16 (28%), | 43 weeks: (23%), | 43 weeks: (33%), | ||||
| 70 weeks: 23 (40%) | 70 weeks: (37%) | 70 weeks: (44%) | ||||
| Esplen et al., 1998296 | 58 | 8 (14%) | Guided imagery (14%) | Control (13%) | ||
B-ERP, exposure therapy with response prevention for bingeing; CBT, cognitive behavioral therapy; GP, general practitioner; IPT, interpersonal psychotherapy; N, number; NR, not reported; P-ERP, exposure therapy with response prevention for purging.
| Source, Treatment, Setting, and Quality Score | Major Outcome Measures | Significant Change Over Time Within Groups | Significant Differences Between Groups at Endpoint | Significant Differences Between Groups in Change Over Time |
|---|---|---|---|---|
| Beumont, Russell et al., 1997244 | Eating:
| Both groups decreased bulimic and vomiting episodes, ED concerns and symptoms; and worries about body shape at week 4. | Fluoxetine associated with lower restraint, weight concern, and shape concern at week 8 | Significant difference on weight at 8 weeks with weight decreasing in fluoxetine group and increasing in placebo group. |
| Fluoxetine vs. placebo | Biomarker:
| Both groups decreased bulimic and vomiting episodes; ED concerns and symptoms; worries about body shape; restraint, overeating, and concerns about eating, shape, and weight at week 8. | Fluoxetine group regained weight above baseline at FU while placebo group did not. | |
| Outpatient | Psych:
| Both groups decreased bulimic and vomiting episodes, restraint, overeating, and concerns about eating and shape at 3-month FU. Fluoxetine group increased weight at 3 month FU. | ||
| Fair | ||||
| Fichter et al., 1991248 | Eating:
| No statistics reported. | No differences on any measures. | No differences on any measures. |
| Fluoxetine vs. placebo | Biomarker:
| |||
| Inpatient | Psych:
| |||
| Good | ||||
| Fluoxetine BN Collaborative Study Group, 1992249 | Eating:
| No statistics reported. | Fluoxetine (60 mg) associated with greater reductions in binge eating and vomiting than fluoxetine (20 mg) or placebo. Fluoxetine (60 mg and 20 mg) associated with greater reductions in vomiting, weight, drive for thinness, bulimic intensity, carbohydrate craving, body dissatisfaction, and food and diet preoccupation than placebo. | No statistics reported. |
| Fluoxetine (20 mg) vs. fluoxetine (60 mg) vs. placebo | Biomarker:
| Fluoxetine (60 mg) associated with greater reductions in depressed mood, drive for thinness, oral control, and bulimia scores than placebo. | ||
| Outpatient | Psych:
| |||
| Fair | ||||
| Goldstein, Wilson, Thompson et al., 1995250 | Eating:
| No statistics reported. | Fluoxetine associated with greater median percentage reduction in vomiting (at weeks 1–10, 13, 16, and endpoint) and binge eating (at weeks 1–9, 13, 16, and endpoint); greater reduction in total bulimia symptoms, drive for thinness, global symptoms scores, and weight; greater tx response (≥ 50% improvement in bulimic episodes) | No statistics reported. |
| Fluoxetine vs. placebo | Biomarker:
| |||
| Outpatient | Psych:
| |||
| Fair | ||||
| Kanerva, Rissanen, and Sarna, 1994252 | Eating:
| At 4 weeks, fluoxetine group decreased anxious mood and state anxiety. | No statistics reported | Fluoxetine associated with greater reduction in depressed and anxious mood, bulimia and food preoccupation over 8 weeks. Difference in weight with decrease in fluoxetine group and increase in placebo group. |
| Fluoxetine vs. placebo | Biomarker:
| |||
| Outpatient | Psych:
| |||
| Fair | ||||
| Romano et al., 2002254 | Eating:
| Both groups worsened over the 52-week extended tx period. | No statistics reported. | Fluoxetine group had smaller mean increases in vomiting, binge eating, total ED behavior, ritual, preoccupation and symptom severity. Relapse occurred less frequently in the first 3 months of 52-week extended tx period. |
| Fluoxetine vs. placebo | Biomarker:
| |||
| Outpatient | Psych:
| |||
| Fair | ||||
| Fichter et al., 1996247 | Eating:
| No statistics reported. | Fluvoxamine associated with higher binge abstinence rate, reduced clinical severity, and lower relapse rate. | Fluvoxamine superior in limiting increases in bulimic behavior (urge to binge, vomiting), global ED symptoms (SIAB total), EDI bulimia scores, fear of losing control, obsessive-compulsive symptoms, and, global severity during 12 week post-discharge relapse prevention phase. |
| Fichter et al., 1997299 | Biomarker:
| |||
| Fluvoxamine vs. placebo | Psych:
| |||
| Inpatient and outpatient | ||||
| Fair | ||||
| Pope et al., 1989255 | Eating:
| Trazadone group decreased binge and purge frequencies and fear of eating at 6 wks. | Trazadone associated greater percent decrease in binge and vomit frequencies and decrease in fear of eating and increase in self-esteem | No statistics reported. |
| Trazadone vs. placebo | Psych:
| |||
| Outpatient | ||||
| Fair | ||||
| Hoopes et al., 2003;251 | Eating:
| No statistics reported. | Topiramate associated with greater percentage reduction in weekly number of binge and purge days, carbohydrate craving score, bulimic intensity, lower mean global symptoms and symptom intensity; and greater mean weight reduction. | Topiramate superior to placebo in reducing uncontrolled eating, body dissatisfaction, dieting, food preoccupation,and anxious mood, and in increasing patient-rated percent improved. |
| Hedges et al., 2003259 | Biomarker:
| Larger percentage of topiramate group achieved moderate (> 50% reduction) or marked (> 75% reduction) improvement in weekly binge/purge days. | ||
| Topiramate vs. placebo | Psych:
| |||
| Outpatient | ||||
| Fair | ||||
| Kennedy et al., 1993253 | Eating:
| No statistics reported. | Brofaromine associated with greater reduction in vomiting episodes. | No statistics reported |
| Brofaromine vs. placebo | Biomarker:
| A greater percentage of brofaromine group lost > 1 kg of weight. A greater percentage of placebo group gained > 1 kg of weight. | ||
| Outpatient | Psych:
| |||
| Fair | ||||
| Faris et al., 2000246 | Eating:
| Ondansetron group increased average number of normal meals, and decreased time spent engaging in BN behaviors at week 4. | Ondansetron associated with lower binge/purge frequency at week 4. | Ondansetron superior in reducing binge/vomit frequency and time spent engaging in BN behaviors and in increasing normal meals over 4 weeks. |
| Ondansetron vs. placebo | Biomarker:
| |||
| Inpatient and outpatient | ||||
| Good | ||||
| Walsh et al., 1991257 | Eating:
| No statistics reported. | Desipramine associated with fewer binge and vomiting episodes/week, fewer ED symptoms and body shape concerns, lower BMI, fewer symptoms of depression, global symptoms, and obsessive/compulsiveness, less hostility and trait anxiety. | No statistics reported. |
| Desipramine vs. placebo | Biomarker:
| |||
| Outpatient | Psych:
| |||
| Fair | ||||
BDI, Beck Depression Inventory; BITE, Bulimic Investigation Test Edinburgh; BMI, Body mass index; BSQ, Body Shape Questionnaire; CGI, Clinical Global Impression Scale; EAT, Eating Attitudes Test [EAT-26 items]; ED, Eating disorder; EDE, Eating Disorder Examination; EDI, Eating Disorders Inventory; FU, followup; HAM-A, Hamilton Anxiety Index; HAM-D (or HDRS), Hamilton Depression Rating Scale [HDRS-17 items, HDRS-21 items]; HRSD, Hamilton Rating Depression Scale; HSCL, Hopkins Symptom Check List (see SCL-90); kg, kilogram; PGI, Patient Global Impression; Psych, psychiatric and psychological; SCL, (Hopkins) Symptom Check List (SCL-90 items); SIAB, Structured Interview for Anorexia and Bulimia nervosa; STAI, Spielberger State-Trait Anxiety Inventory; tx, treatment; YBC-EDS, Yale-Brown-Cornell Eating Disorder Scale.
The medication-only trials used the following two designs: medication versus placebo (10) and medication (dose a) versus medication (dose b) versus placebo (1). The results of these studies are presented below by drug class.
Second-generation antidepressants. Fluoxetine. Six trials compared fluoxetine to placebo in outpatient and inpatient settings. The mean age of participants was mid-twenties; no studies of fluoxetine focused exclusively on adolescents.
Overall, fluoxetine (60 mg/day) administered for between 8 weeks and 16 weeks led to significant reductions in binge eating in most244, 249, 250, 254 but not all studies.248, 252 Fluoxetine (60 mg/day) also performed significantly better than fluoxetine (20 mg/day) in decreasing binge eating.249 No effect of fluoxetine (60 mg/day) compared with placebo was observed in the one study in which patients were already receiving intensive inpatient psychotherapy.248
Fluoxetine (60 mg/day) was superior to placebo in decreasing purging behavior,244, 249, 250, 254 although not in the inpatient setting.248
All six fluoxetine trials either failed to report abstinence rates (absence of binge eating and purging behaviors) or did not report whether abstinence rates differed significantly between drug and placebo groups.
With reference to eating-related attitudes, fluoxetine (60 mg/day) was associated with significant improvements in measures of restraint, weight concern, and food preoccupation and with Eating Disorders Inventory (EDI) subscale scores of bulimia, drive for thinness, and body dissatisfaction.244, 249, 250, 254 Again, the exception was the inpatient study.248
Fluoxetine had mixed results on depression and anxiety scores. Some studies showed greater efficacy than placebo in decreasing depression scores,249, 252 but others showed no advantage of fluoxetine.244, 248, 250, 254
One study explored the efficacy of fluoxetine (60 mg/day) versus placebo in preventing relapse of BN over 52 weeks.254 Relapse rates were significantly lower for those receiving fluoxetine (33 percent) than for those receiving placebo (51 percent). However, dropout was substantial during the observation period (83 percent in the fluoxetine group and 92 percent in the placebo group).
Drop-out rates in fluoxetine arms of these trials ranged from zero (in an inpatient study) to 50 percent (three studies had greater than 40 percent dropout). In one study, dropout was greater in the fluoxetine than in the placebo group,244 in three studies placebo had greater attrition,249, 250, 254 and one inpatient study reported no dropout in either group.248
Fluvoxamine. To compare maintenance of therapeutic gains and prevention of relapse of BN after inpatient treatment, Fichter et al. compared fluvoxamine (average dose 182 mg/day) with placebo for 19 weeks.247 Patients treated with fluvoxamine reported fewer urges to binge, lower frequency of vomiting, and lower depression scores than those receiving placebo. Both groups gained weight, with no differences between groups. Fluovoxamine was associated with a lower relapse rate. However, attrition was high (51 percent for those on fluovoxamine and 14 percent for those on placebo).
Trazodone. In a 6-week trial of trazodone (400 mg) versus placebo, trazodone led to significantly greater decreases in the frequency of binge eating and vomiting and decreased fear of eating.255 No differences in depression or anxiety were observed, although baseline levels were not indicative of severe depression.
Tricyclic antidepressants. One 6-week trial of desipramine (200–300 mg/day) versus placebo found the active drug to be significantly more effective than placebo in decreasing binge eating, vomiting, and scores on the Eating Attitudes Test (EAT) and Body Shape Questionnaire (BSQ).257 Abstinence rates from binge eating and purging did not differ between active drug and placebo. Both self-reported depression and anxiety were significantly decreased in the desipramine group compared with the placebo group; clinician-rated depression did not differ significantly. Patients in the desipramine group lost significantly more weight than those in the placebo group, who tended to gain weight. Dropout was 23 percent in the desipramine group and 16 percent in the placebo group.
Anticonvulsants. The single 10-week trial of the anticonvulsant topiramate (mean dose 100 mg/day) led to significantly greater reductions than placebo in the number of binge/purge days reported and in body dissatisfaction, drive for thinness, and EAT scores.251, 259 Abstinence rates from binge eating and purging were 22.6 percent for topiramate and 6 percent for placebo (not significantly different). Topiramate was associated with significant reductions in anxiety but not depression, and the topiramate group lost significantly more weight than the placebo group, who tended to gain weight. Dropout from topiramate treatment was 34 percent and 47 percent for placebo.
MAOI. One 8-week trial of brofaromine (mean dose 175 mg/day) revealed no differences between the active drug and placebo on binge eating or psychological features of the eating disorder.253 Brofaromine did lead to significant reductions in vomiting. Abstinence from binge eating and from vomiting were measured independently and did not differ between groups; no differences were observed on depression or anxiety scores, weight change, or drop-out rates (21 percent brofaromine and 24 percent placebo).
5HT3 antagonist. In a small 4-week trial of ondansetron versus placebo—self-administered when patients had an urge to binge or vomit—the active drug led to significantly greater decreases than placebo in binge and vomit frequencies and time spent in bulimic behavior, and to significant increases in normal meals.246 The investigators did not measure depression or anxiety, and they found no differences in weight change. One patient dropped out from ondansetron, none from placebo.
Summary of medication-only trials. Fluoxetine (60 mg/day) administered for 6 to 18 weeks has been shown in several fair- to good-rated trials to reduce the core bulimia symptoms of binge eating and purging and associated psychological features of the eating disorder in the short term. The 60 mg dose performs better than the 20 mg dose;249 it was also associated with prevention of relapse at 1 year in a study with considerable dropout.254 Considerable evidence exists for the use of 60 mg/day of fluoxetine to treat BN in the short term. Evidence for the long-term effectiveness of relatively brief medication treatment does not exist. The optimal duration of treatment and the optimal strategy for maintenance of treatment gains are unknown.
Single studies provide preliminary evidence of the efficacy of two other second-generation antidepressants, namely trazodone255 and fluvoxamine.247 Likewise, evidence from single studies provides preliminary evidence of the efficacy of desipramine257 and topiramate.251 One preliminary trial of ondansetron, a 5HT3 antagonist and antiemetic, led to an intriguing decrease in binge eating and vomiting when patients could self-administer when they had urges to binge or purge.246 This innovative study requires replication. One trial of brofaromine, an MAOI, showed a significantly greater effect on reducing vomiting than placebo.253
When reported, abstinence rates in medication-only trials suggest that medication treatment leads to abstinence in a minority of individuals. This finding indicates that although bulimia symptoms improved, they nonetheless persisted.
Drop-out rates in medication trials ranged from zero to 51 percent. No drug showed substantially greater attrition than others.
| Source, Treatment, Setting, and Quality Score | Major Outcome Measures | Significant Change Over Time Within Groups | Significant Differences Between Groups at Endpoint | Significant Differences Between Groups in Change Over Time |
|---|---|---|---|---|
| Goldbloom et al., 1997261 | Eating:
| Decreased shape and weight concerns in the fluoxetine and the fluoxetine + CBT groups. | At tx completion, CBT alone and fluoxetine + CBT associated with greater percent reduction in vomiting frequency, compared to fluoxetine alone. | No statistics reported. |
| Fluoxetine vs. CBT vs. fluoxetine + CBT | Biomarker:
| At 4 weeks post-tx, fluoxetine + CBT associated with fewer objective binge and vomit weekly episodes compared to fluoxetine alone. | ||
| Outpatient | Psych:
| CBT associated with fewer subjective binge episodes compared to fluoxetine alone. | ||
| Fair | Note: no sig diff in ITT analyses. | |||
| Mitchell et al., 2001262 | Eating:
| No statistics reported. | Fluoxetine, alone and with self-help, associated with greater percentage reduction in vomiting and greater clinician-rated and patient-rated clinical improvement, compared to self help plus placebo or placebo alone, at endpoint (16 week tx period). | No statistics reported. |
| Fluoxetine vs. placebo vs. self-help + placebo vs. fluoxetine + self-help | Psych:
| Self-help manual plus placebo or fluoxetine associated with greater percentage reduction in vomiting compared to placebo or fluoxetine with no self-help manual, at 4-week time point (after 2 weeks active tx). | ||
| Outpatient | ||||
| Fair | ||||
| Walsh et al., 2004265 | Eating:
| No statistics reported. | Fluoxetine associated with fewer objective bulimic and vomiting episodes and fewer vomiting days per month, less restraint, less depressed mood, and a lower general symptom index compared to placebo. Fluoxetine only and placebo groups greater decrease in bulimic episodes than self-help groups. | No statistics reported |
| Fluoxetine vs. placebo vs. guided self-help vs. fluoxetine + guided self-help | Biomarker:
| |||
| Outpatient | Psych:
| |||
| Good | ||||
| Agras et al., 1992;260 and Agras et al., 1994300 | Eating:
| No statistics reported. | No statistics reported. | Desipramine + CBT superior to medication alone in reducing binge and purge frequency at 16 and 32 weeks, and in reducing diet preoccupation over 16 weeks. |
| Desipramine (16 weeks) vs. desipramine (24 weeks) vs. desipramine + CBT (16 weeks) vs. desipramine + CBT (24 weeks) vs. CBT alone (24 weeks) | Biomarker:
| Desipramine + CBT superior to CBT alone in reducing hunger disinhibition over 24 weeks, and superior to medication alone in reducing diet preoccupation at 16 weeks. | ||
| Outpatient | Psych:
| CBT alone superior to desipramine alone for 16 or 24 wks in reducing binge and purge frequency at 16 wks. CBT alone or in combination with desipramine for 24 weeks, superior to desipramine for 16 weeks in reducing binge frequency at 1 year FU. | ||
| Fair | Desipramine + CBT for 24 weeks superior to desipramine for 16 weeks in reducing binge frequency, hunger, disinhibition, and diet preoccupation at 1 year FU. | |||
| Mitchell et al., 2002263 | Eating:
| No statistics reported. | No differences on any measures. | No statistics reported. |
| IPT vs. fluoxetine (16 weeks) or vs. fluoxetine (8 weeks) followed by desipramine (8 weeks) | Psych:
| |||
| Outpatient | ||||
| Fair | ||||
| Walsh et al., 1997264 and Wilson et al., 1999301 | Eating:
| All groups exhibited decreases in weekly bingeing and vomiting, EAT and BSQ scores, concerns about eating and eating restraint, global ED symptoms, and depressed mood. | No statistics reported. | CBT groups combined superior to supportive therapy groups combined in reducing binge and vomit episode frequencies. Behavioral interventions plus medication superior to behavioral interventions alone in reducing binge frequency, EAT scores, depressed mood, weight, and in increasing remission rate. CBT plus medication superior to medication alone in reducing binge and vomit frequencies, EAT scores, body image, and increasing remission rate by self-report. Medication alone superior to CBT alone in reducing BMI and weight. Medication alone superior to supportive therapy plus medication in reducing binge and vomit frequency. |
| CBT + placebo vs. CBT + medication (desipramine only or desipramine followed by fluoxetine) vs. Supportive therapy + placebo vs. Supportive therapy + medication vs. Medication alone | Biomarker:
| Weight and BMI decreased in 3 groups (CBT+ placebo, medication alone, and supportive therapy + medication). Anxiety decreased in each of the 3 groups receiving medication. | ||
| Outpatient | Psych:
| Importance of shape and weight concerns decreased in two groups (CBT plus placebo and supportive therapy plus medication). | ||
| Fair | ||||
BDI, Beck Depression Inventory; BES, Binge Eating Scale; BMI, body mass index; BSQ, Body Shape Questionnaire; CBT, cognitive behavior therapy; CGI, clinical global impression; EAT, Eating Attitudes Test; ED, eating disorders; EDE, eating disorders examination; EDI, eating disorder inventory; FU, followup; HAM-D, Hamilton Rating Score for Depression; ITT, intention-to-treat; IPT, interpersonal psychotherapy; PGI, patient global impression; Psych, psychiatric and psychological; RSE, Rosenberg Self-Esteem Questionnaire; SCL, (Hopkins) Symptom Checklist (SCL-53 items, SCL-90 items); TFEQ, Three Factor Eating Questionnaire; tx, treatment; vs., versus; YBC-ED, Yale-Brown-Cornell Eating Disorder Scale.
The total number of individuals enrolled in these combination trials was 1,895. The number of participants in the medication plus psychotherapy trials ranged from 71 to 120. No men participated in these trials. Participant ages ranged from 18 to 46. Three trials reported race or ethnicity of participants: 272 individuals were reported to be white, seven nonwhite, two Hispanic American, eight African American, and seven Asian. Five of these trials were conducted in the United States and one in Canada.
Second-generation antidepressants and CBT. Three trials used fluoxetine as the drug intervention. Comparing fluoxetine (60 mg/day) to CBT only to fluoxetine (60 mg/day) plus CBT in a 12-week trial, Goldbloom et al. used intention-to-treat analyses but found no difference across groups on eating related-measures.261 In completers, all three interventions led to significant improvement in core bulimic symptoms; however, both combined treatment and CBT alone led to greater decreases than fluoxetine alone in objective and subjective binges and vomiting episodes. Abstinence rates, depression scores, and weight did not differ across groups. Dropout was highest in combined treatment (55 percent) compared to the fluoxetine (39 percent) and CBT only groups (35 percent). The investigators did not provide long-term followup data.
Walsh et al. compared fluoxetine (60 mg/day) with placebo, each with or without self-help in the form of a cognitive-behavioral self-help book302 with instructions for use.265 Physicians and nurses in primary care provided the treatments. Fluoxetine (either alone or with self-help) was associated with significantly decreased objective binge episodes, vomiting, restrained eating, and depression. The self-help book had no independent effect. No differences emerged on weight change. Dropout was high: 54 percent in fluoxetine plus guided self-help to 88 percent in placebo plus guided self-help.
Using the same design but a different self-help manual, also based on principles of CBT, and administering treatment from a specialized eating disorders program, Mitchell et al. found fluoxetine to be associated with a significantly greater decrease than placebo in vomiting episodes but not binge eating episodes.262 No significant differences emerged in abstinence rates or depression. At the end of treatment, the investigators reported no independent effect of self-help. Dropout was low: none in fluoxetine only and fluoxetine plus self-help, 5 percent in placebo only and placebo plus self-help.
Tricyclic antidepressants and CBT. One complex trial compared desipramine treatment of different durations with or without CBT (16 versus 24 weeks) with CBT only.260 The 16-week combined treatment was better than drug only for decreasing binge eating and purging. Longer combined treatment was significantly better than drug only on binge eating, vomiting, dieting preoccupation, and hunger. Abstinence rates did not differ across groups. The authors did not report results concerning depression. Weight change did not differ significantly across groups. At 1-year followup, the combined 24-week intervention and CBT alone were both better than the 16-week drug only treatment in decreasing binge eating and vomiting. The 24-week combined treatment was also superior to 16-week drug only in decreasing binge frequency, dietary preoccupation, disinhibition, and hunger.300 In all but the medication-only group, between 78 percent and 100 percent of individuals who were abstinent at the end of treatment remained abstinent at followup. The overall drop-out rate was 25 percent.
Multiple drugs and CBT. Walsh et al. examined supportive psychotherapy, CBT, both with or without placebo and with or without medication, and medication alone in a five-group 16-week comparison.264, 301 They started patients on desipramine (mean dose 188 mg/day) and switched nonresponders to fluoxetine (60 mg/day) after 8 weeks. Analyses combining all arms of the study that included CBT versus all arms of the study that included supportive therapy indicated that CBT was superior to supportive therapy in reducing binge and vomit episode frequencies. Behavioral interventions plus medication were superior to behavioral interventions alone in reducing binge frequency, EAT scores, depressed mood, weight, and in increasing remission rate.
CBT plus medication was superior to medication alone in reducing binge and vomit frequencies, EAT scores, body image, and increasing remission rate by self-report. Medication alone was superior to CBT alone in reducing BMI and weight. Medication alone was superior to supportive therapy plus medication in reducing binge and vomit frequency. Medication led to significantly greater decreases in depression scores. CBT was associated with greater likelihood of remission. The overall drop-out rate was 34 percent.
Mitchell et al. randomized patients who did not respond to CBT to either interpersonal psychotherapy or fluoxetine (60 mg/day), which could be switched to desipramine in those who did not achieve abstinence.263 No difference in abstinence was observed between the two groups. Overall, the sequential second-level treatment was associated with high dropout.
Summary of medication plus psychotherapy trials. The combined medication plus behavioral intervention studies provide only preliminary evidence regarding the optimal combination of medication and psychotherapy or self-help. Given the variety of designs used and lack of replication, evidence remains weak. Combined CBT and fluoxetine and CBT alone led to greater decreases in binge eating and purging than fluoxetine alone in individuals who complete therapy.261 When delivered in the context of a specialist eating disorders program, both self-help and fluoxetine were associated with decreased vomiting; however, the addition of self-help to fluoxetine was not associated with increased efficacy.262 When these therapies were administered in a primary care setting, drop-out rates from fluoxetine (70 percent) and fluoxetine plus self-help (54 percent) were unacceptably high.265
The only study that looked at sequential treatment for individuals who did not respond to CBT revealed that the addition of interpersonal psychotherapy to fluoxetine (allowing the transition to desipramine) led to substantial attrition and minimal effects on subsequent abstinence rates. How best to treat individuals who do not respond to CBT or fluoxetine remains a major shortcoming of the literature.
| Source, Treatment, Setting, and Quality Score | Major Outcome Measures | Significant Change Over Time Within Groups | Significant Differences Between Groups at Endpoint | Significant Differences Between Groups in Change Over Time |
|---|---|---|---|---|
| Agras et al., 2000269 and Wolk and Devlin, 2001268 | Eating:
| No statistics reported. | CBT associated with higher percent remitted and percent recovered at end of tx (ITT analysis). | No statistics reported. |
| CBT vs. IPT | Biomarker:
| In completers-only analysis, CBT associated with fewer objective binges and purges; less eating restraint; and less weight, shape, and eating concerns at the end of tx. Stage of change predicted improvement in IPT but not CBT. | ||
| Outpatient | Psych:
| |||
| Good | ||||
| Cooper and Steere, 1995274 | Eating:
| No statistics reported. | Relapse rate lower in cognitive therapy group among those who were abstinent from binge-eating at end of tx and at 12 month FU. | Cognitive therapy superior to exposure therapy in reducing vomiting and depression between baseline and 12 month FU. |
| Cognitive therapy vs. exposure plus binge and purge response prevention | Biomarker:
| |||
| Outpatient | Psych:
| |||
| Fair | ||||
| Fairburn et al., 1991;276 Fairburn, Jones et al., 1993267 and Fairburn, Peveler et al., 1993277 | Eating:
| No statistics reported. | No statistics reported. | Over 18 week tx period, CBT superior to BT and IPT in reducing eating restraint, weight concerns, and overall ED psychopathology; CBT superior to IPT in reducing vomiting; and CBT superior to BT in reducing shape concerns. |
| CBT vs. BT vs. IPT | Biomarker:
| Over 12-month FU, CBT superior to BT in improving abstinence. | ||
| Outpatient | Psych:
| |||
| Fair | ||||
| Wilfley et al., 1993287 | Eating:
| CBT and IPT decreased binge frequency at 1 year FU. | No statistics reported. | Group CBT and group IPT superior to waiting-list in reducing binge frequency, and disinhibition over 16 weeks. |
| Group CBT vs. group IPT vs. waiting-list control | Psych:
| Group IPT superior to waiting-list in reducing restraint over 16 weeks. | ||
| Outpatient | ||||
| Fair | ||||
| Wilson et al., 2002288 | Eating:
| Both groups decreased shape and weight concerns at post-tx. | CBT showed greater mean reduction in eating restraint by tx week 6, greater improvements in self-efficacy by tx week 10, and a higher percentage reduction in binge eating at post-tx. | CBT superior in early (by week 6) improvement (reduction in frequency of vomit episodes) |
| CBT vs. IPT | Psych:
| |||
| Outpatient | ||||
| Fair | ||||
| Garner et al., 1993278 | Eating:
| No statistics reported. | No statistics reported. | Over 18 week tx period, CBT superior in reducing dieting, food preoccupation, eating concerns, restraint, attitudes toward shape, bulimia behaviors, depressed mood, global symptoms, and symptoms of borderline personality disorder and dysthymia; and in improving self-esteem. |
| CBT vs. supportive-expressive therapy | Biomarker:
| |||
| Outpatient | Psych:
| |||
| Fair | ||||
| Hsu et al., 2001280 | Eating:
| No statistics reported. | No statistics reported. | CNT superior to NT alone and to group support in binge/purge abstinence and in reducing drive for thinness and BN symptoms. |
| CT vs. NT vs. CT+NT (CNT) vs. group support (control) | Psych:
| CT superior to NT in reducing BN symptoms and CT superior to group support in reducing drive for thinness. | ||
| Outpatient | ||||
| Fair | ||||
| Sundgot-Borgen et al., 2002283 | Eating:
| Exercise group decreased percent body fat at post-tx and fat mass at 18-month FU. | Body dissatisfaction lower in CBT compared to nutritional counseling group at post tx. | No statistics reported. |
| Exercise vs. CBT vs. nutrition counseling vs. waiting-list vs. healthy controls | Biomarker:
| Laxative use lower in exercise than CBT group at post tx. | ||
| Outpatient | Vomit frequency, bulimia symptoms, and body dissatisfaction lower in CBT than nutritional counseling group at 6 month FU. Drive for thinness and laxative abuse lower in exercise than CBT group, at 6 month FU. | |||
| Fair | Binge episodes lower in exercise than in CBT at 18 month FU. | |||
| Chen et al., 2003273 | Eating:
| No statistics reported. | Higher rate of abstinence in individual CBT than group CBT at end of tx. | Group CBT superior to individual CBT in reducing state anxiety. |
| Individual CBT vs. group CBT | Biomarker:
| |||
| Outpatient | Psych:
| |||
| Fair | ||||
| Agras et al., 1989266 | Eating:
| Decreased purges/week in self-monitoring, CBT, and CBT+ response groups at end of 4-month tx. | CBT associated with higher abstinence rate compared to waiting-list at end of tx, and compared to self-monitoring and response prevention at 6 month FU. | CBT alone superior to waiting-list in reducing purging frequency, increasing purging abstinence and decreasing depressed mood, by end of treatment. |
| Waiting-list vs. Self-monitoring vs. CBT vs. CBT+ response prevention | Biomarker:
| CBT alone and CBT+ response prevention superior to waiting-list in reducing depressed mood by end of treatment. | ||
| Outpatient | Psych:
| |||
| Fair | ||||
| Bulik et al., 1998;270Bulik et al., 1998;271 Carter, McIntosh et al., 2003272 | Eating:
| P-ERP and relaxation groups improved body dissatisfaction at 3 yr FU | B-ERP associated with less drive for thinness, lower clinician-rated food restriction, body dissatisfaction, and depressed mood, lower subjective distress than relaxation training at 3 year FU. | Relaxation superior to B-ERP in reducing depressed mood and clinician-rated body dissatisfaction from post-tx to 2 year FU. |
| 8 weeks CBT followed by B-ERP tx vs. P-ERP tx vs. relaxation training | Psych:
| P-ERP associated with fewer ED psychological and behavioral measures.than relaxation training at 3 year FU. | Relaxation superior to P-ERP in reducing ED psych and behavioral traits and depressed mood from post-tx to 3 year FU. | |
| Outpatient | B-ERP associated with less food restriction, higher GAFS score than relax training at 12 month FU. | |||
| Good | ||||
| Laessle et al., 1991281 | Eating:
| No statistics reported. | No difference on any measures. | Nutritional management superior to stress management in increasing calorie consumption and decreasing binge frequency over first 3 weeks of tx, and in increasing binge abstinence rate through 6 and 12 months. |
| Nutritional management vs. stress management | Psych:
| Stress management superior to nutrition management in reducing trait anxiety over 3 months of tx. | ||
| Outpatient | ||||
| Fair | ||||
| Safer et al., 2001282 | Eating: | No statistics reported. | DBT superior in post-tx abstinence rate | DBT superior in reducing the number of binge and purge episodes measured in last 4 of 20 weeks of tx. |
| DBT vs. waiting-list | Binge episodes | |||
| Outpatient | EDE | |||
| Good | Emotional eating scale | |||
| Purge episodes | ||||
| Psych: | ||||
| BDI | ||||
| Positive and Negative Affect Schedule | ||||
B-ERP, exposure with response prevention to pre-binge cues; BDI, Beck Depression Inventory; BMI, Body mass index; BN, bulimia nervosa; BSQ, Body Shape Questionnaire; BT, Behavioral Therapy; CBT, Cognitive Behavioral Therapy; CNT, Cognitive nutritional therapy; CT, Cognitive Therapy; DBT, dialectical behavior therapy; EAT, Eating Attitudes Test; ED, Eating disorder; EDE, Eating Disorder Examination (EDE-12 items); EDI, Eating Disorders Inventory; FU, follow-up; GAFS, Global Assessment of Functioning Symptoms; HDRS, Hamilton Depression Rating Scale; IIP, Inventory of Interpersonal Problems; IPT, interpersonal psychotherapy; ITT, intention-to-treat; MADRS, Montgomery and Asberg Depression Rating Scale; NT, nutritional therapy; P-ERP, exposure with response prevention to pre-purge cues; PSE, Present State Examination; Psych, psychiatric and psychological; RSE, Rosenberg Self-Esteem Scale; SCL-90, (Hopkins) symptom checklist (SCL-90 items, SFL-90-R [SCL-90-revised]); STAI, Speilberger State-Trait Anxiety Inventory; SUDS, subjective units of distress; TFEQ, Three Factor Eating Questionnaire; tx, treatment.
Psychotherapy trials for bulimia nervosa. Cognitive Behavior Therapy. CBT focusing on cognitive and behavioral factors that maintain bulimic behaviors is the most widely studied intervention for BN. Eleven trials of various designs delivered CBT either individually or in group format. CBT was compared with interpersonal psychotherapy (IPT),269, 276, 287, 288 with supportive expressive therapy,278 with nutritional counseling,280, 283 and with exercise.283 One study compared individually with group-administered CBT.273 Several studies dismantled CBT by comparing complete CBT with behavioral therapy (BT) in the absence of a cognitive component,276 by comparing cognitive therapy only with exposure with response prevention only,274 and by exploring the additive efficacy of exposure with response prevention grafted onto a basis of cognitive therapy.271 Exposure with response prevention is defined as exposing individuals to their high-risk cues (e.g., prebinge cues or prepurge cues) and then preventing the response (e.g., binge eating or purging) until the urge to engage in the behavior subsides.
In comparisons of individually administered CBT and IPT tailored for BN, CBT was associated with a significantly greater probability of remission than IPT269 and with greater decreases in vomiting and restraint269, 276 and binge eating269 at the end of treatment. In one study at 1-year followup, these differences were no longer apparent.276 Neither CBT nor IPT led to greater improvements in mood or changes in weight. Changes in dietary restraint and in eating self-efficacy mediated change in binge and purge frequency.288 Being in the precontemplation stage of change was associated with failure to achieve remission at the end of treatment.268
When administered in group format, differences between CBT and IPT were less clear. Both group-administered treatments led to significantly greater decreases than waiting list on days binged, psychological features of the eating disorder, disinhibition, and restraint, with no differences observed between the active therapies.287
When compared directly, few differences emerged between group and individual administration of CBT. Both showed decreases in objective and subjective binge episodes, vomiting, laxative use, overexercise and EDI bulimia, drive for thinness, and body dissatisfaction subscale scores.273 Group CBT was associated with greater decreases in anxiety; individual CBT was associated with significantly higher rates of abstinence. From a cost-effectiveness perspective, the study concluded that group CBT was more economical, given the similarity of outcomes.
In the dismantling studies, which attempted to parse out the effects of various components of CBT, the cognitive component emerged as critical to therapeutic outcome. Complete CBT led to better eating-related outcomes than BT,276 to lower relapse than exposure with response prevention only,274 and to greater abstinence than a self-monitoring only intervention.266
Two studies examined the additive efficacy of exposure with response prevention. Agras and colleagues found no additive benefit of exposure to CBT.266 Bulik et al. first treated all patients with a core of cognitive therapy and then explored the added efficacy of three augmentation strategies: exposure with response prevention to prebinge cues, exposure with response prevention to prepurge cues, and a relaxation therapy control.270 They found no evidence that either exposure treatment led to greater improvement in binge eating and vomiting than the relaxation control.
In other comparisons, cognitive therapy performed better than support only; adding a cognitive component to nutritional counseling led to a significantly greater decrease in drive for thinness than nutritional therapy alone.280 CBT was superior to nutritional counseling alone in improving core binge eating, vomiting, laxative use, and body dissatisfaction. CBT also led to significantly greater decreases than supportive-expressive therapy (a nondirective psychodynamically oriented treatment) in EDI bulimia, EAT scores, food preoccupation, eating concerns, and depression.278 Exercise therapy was superior to CBT at 18-month followup in improving drive for thinness, laxative abuse, and binge eating.283
Overall, dropout from CBT delivered individually or in group format ranged from 6 percent to 37 percent. Typical rates were about one-quarter of individuals randomized.
Other behavioral interventions. A single study compared nutritional management (focusing on decreasing restraint, detailed nutritional self-monitoring, and stimulus control) to stress management (focusing on decreasing stressors that may trigger binge eating). Both treatments led to significant decreases in binge eating and vomiting; abstinence from binge eating was greater in nutritional management than stress management, although abstinence from vomiting did not differ. Stress management was associated with greater reductions in trait anxiety.281
Dialectical behavioral therapy (DBT). DBT focuses on emotional dysregulation as the core problem in BN with symptoms viewed as attempts to manage unpleasant emotional states. A small study showed that patients receiving DBT had significantly greater decreases in binge eating and purging than did those on a waiting list and that abstinence was greater at the end of treatment in the DBT than in the waiting list group.282
| Source, Treatment, Setting, and Quality Score | Major Outcome Measures | Significant Change Over Time Within Groups | Significant Differences Between Groups at Endpoint | Significant Differences Between Groups in Change Over Time |
|---|---|---|---|---|
| Bailer et al., 2004290 | Eating:
| No statistics reported. | Higher meal frequency in self-help at post-tx. | Self-help superior to CBT in reducing bulimia symptoms over 18 weeks. |
| Guided self-help vs. group CBT | Biomarker:
| Lower vomit frequency, depressed mood, laxative use, and bulimia symptoms, and higher BMI in self-help, at 1-year FU. | CBT superior to self-help in reducing drive for thinness over tx and FU periods. | |
| Outpatient | Psych:
| |||
| Fair | ||||
| Carter et al., 2003291 | Eating:
| Both self-help groups decreased binge and purge frequencies. | No differences on any measures. | CBT-based self-help superior to non-specific self-help and to waiting-list in reducing intense exercising. |
| CBT-based self-help vs. non-specific self-help vs. waiting-list | Psych:
| CBT-based self-help experienced a decrease in intense exercising. | ||
| Outpatient | ||||
| Fair | ||||
| Durand and King, 2003292 | Eating:
| No statistics reported. | No differences on any measures. | No differences on any measures. |
| General practice physician- based self-help vs. specialist-based self-help | Psych:
| |||
| Outpatient | ||||
| Fair | ||||
| Thiels et al., 1998293 | Eating:
| No statistics reported. | Lower BITE scores in guided self-change group. | No differences on any measures. |
| CBT vs. guided self-change | Biomarker:
| |||
| Outpatient | Psych:
| |||
| Fair | ||||
BAI, Beck Anxiety Inventory; BDI, Beck Depression Inventory; BITE, Bulimic Investigation Text Edinborough; BMI, Body mass index; CBT, Cognitive Behavioral Therapy; EDE, Eating Disorder Examination; EDI, Eating Disorders Inventory; FU, followup; HDRS, Hamilton Depression Rating Scale [HDRS-17 items, HDRS-21 items]; IIP, Inventory of Interpersonal Problems; Psych, psychiatric and psychological; tx, treatment.
Carter et al. compared CBT-based self-help302with nonspecific self-help, focusing on self-assertion for women, with a waiting list control group in a 2-month trial.291 Both self-help approaches led to significant decreases in objective binge episodes and purging; the waiting list did not. CBT-based self-help was associated with greater reductions in reducing intense exercise than nonspecific self-help or waiting list. No change in depression was observed. Abstinence and weight values were not reported.
To understand the feasibility and efficacy of self-help delivered in general practitioner (GP) offices, Durand and King compared GP-supported CBT-based self-help303 with specialist outpatient treatment.292 The duration of treatment was at the clinician's discretion. Patients in both groups reported significant decreases in scores on the Bulimic Investigation Test Edinburgh (BITE) and Eating Disorders Examination (EDE) total; however, binge eating and vomiting did not drop significantly. Both groups reported significant decreases in depression, but no treatment was superior. Weight change was not reported. Drop-out rates were similar across groups (24 percent in the GP group and 18 percent in specialist care).
A German study by Thiels et al. compared 16 weeks of CBT with guided self-change using a manual.293 Guided self-change included 16 sessions with a therapist encouraging use of the manual and addressing motivation, obstacles, and emergent crises. Significant decreases occurred in overeating, vomiting, BITE scores, and EAT scores for both groups combined. Only on BITE scores did the CBT group perform significantly better than the guided self-change group. Depression dropped in both treatment groups with no significant differences between groups. Dropout was 13 percent in CBT and 29 percent in guided self-change.
| Source, Treatment, Setting, and Quality Score | Major Outcome Measures | Significant Change Over Time Within Groups | Significant Differences Between Groups at Endpoint | Significant Differences Between Groups in Change Over Time |
|---|---|---|---|---|
| Braun et al., 1999295 | Eating:
| No statistics reported. | No differences on any measures. | Bright light superior to dim light (placebo) in reducing binge frequency over 3 week tx. |
| Bright light therapy vs. dim light/placebo | Psych:
| |||
| Outpatient | ||||
| Fair | ||||
| Mitchell et al., 2004297 | Eating:
| No differences on any measures. | No differences on any measures. | No differences on any measures. |
| Crisis prevention vs. usual follow-up | ||||
| Outpatient | ||||
| Fair | ||||
| Esplen et al., 1998296 | Eating:
| No statistics reported. | Higher abstinence rate in guided imagery compared to control group. | Guided imagery superior to control in reducing binge and purge frequencies, drive for thinness, bulimia symptoms, and body dissatisfaction over 6 week tx period. |
| Guided imagery vs. control (eating behavior journaling therapies) | ||||
| Outpatient | ||||
| Fair | ||||
BDI, Beck Depression Inventory; EAT, Eating Attitudes Test (EAT-26 items); EDI, Eating Disorders Inventory; HAM-D, Hamilton Depression Rating Scale; Psych, psychiatric and psychological; tx, treatment; YBC-EDS, Yale-Brown-Cornell Eating Disorder Scale
Light therapy. In a small 8-week trial of 10,000 lux white light (active light) versus 50 lux red light (control), individuals in the active light group showed significantly greater decreases in binge eating than individuals in the control group.295 Mood improved in both groups but no additional differences were observed for any other eating disorder, psychological, or biomarker outcome. The investigators did not provide long-term follow-up data. Given the size of this trial and the absence of followup, results should be viewed as preliminary.
Crisis prevention. Individuals who were abstinent after a trial of CBT were randomized to either a crisis prevention group in which they were able to contact their clinician to receive up to eight additional visits over 17 months if they felt their condition was deteriorating or a control follow-up-only group.297 The percentage of individuals who resumed binge eating and purging did not differ over the 17-month interval; however, none of the individuals in the crisis prevention group used any of their available calls despite the reappearance of bulimic symptoms.
Guided imagery. Esplen et al. conducted a 6-week trial of patients in a guided imagery group and a control journaling group.296 Guided imagery was based on developing self-comforting in BN.304 Guided imagery led to a significantly greater decrease in measures of binge eating, purging, EDI bulimia, drive for thinness, and body dissatisfaction. At the end of treatment, 21 percent of individuals in guided imagery and no individuals in the control condition were abstinent. Drop-out rates were comparable across groups.
Summary of behavioral interventions for bulimia nervosa. A large number of fair- to good-rated trials provide evidence that CBT administered individually or in group format is effective in reducing the core behavioral symptoms of binge eating and purging and psychological features of BN in both the short and the long term. One study suggests that CBT leads to more rapid reduction of symptoms than IPT.276 Another suggests that individual CBT confers no advantage over the more economical group CBT approach;273 although this finding is important for service delivery, it requires replication. The cognitive component of CBT appears to be the active ingredient for change, as behavioral interventions alone are not as effective.274, 276 Exposure with response prevention, either alone or as an added component to a core of cognitive therapy, offers no additional therapeutic advantage to basic CBT.270, 272, 274
Adding a cognitive component to nutritional intervention led to greater effectiveness in one study,280 and CBT led to better outcomes than a psychodynamically oriented supportive-expressive therapy.278 Preliminary evidence suggests that DBT is effective and worth additional study for the treatment of BN.282
Four studies provided mixed evidence regarding the efficacy of self-help methods for BN. One German and one Austrian study provide support for guided self-help in comparison to group CBT290 and individually administered CBT.293 The nature of the self-help approach (CBT oriented versus nonspecific) did not lead to different outcomes.291 Preliminary evidence from the United Kingdom indicates that GPs can successfully deliver self-help.292 No self-help trials conducted in the United States met our inclusion criteria. Overall, especially in the absence of control conditions, few conclusions can be drawn regarding the efficacy of self-help approaches for BN. Moreover, the term self-help must be considered carefully as many of the interventions labeled self-help included considerable contact with providers.
One report yielded preliminary evidence for treating BN with light leading to some short-term decreases in binge eating.295 One study provided some support for guided imagery compared to journaling, although long-term maintenance of treatment effects is unknown.296 Crisis prevention approaches do not appear to be effective in the treatment of BN, based on one study, as patients do not avail themselves of the opportunity to contact their therapists when symptoms reemerge.297
| Intervention | Adverse Event *† |
|---|---|
| Medication Trials | |
| Fluoxetine vs. placebo244 | Fluoxetine group: Insomnia, nausea, and shakiness significantly more common |
| Placebo group: depression more common | |
| Fluoxetine vs. placebo248 | Fluoxetine: significantly more trembling than placebo |
| Fluoxetine 60mg (F60) vs. fluoxetine 20mg (F20) vs. placebo (PL)249 | Side effects by treatment group: |
| Insomnia: F60 (30); F20 (23); PL (10); (P < 0.001) | |
| Nausea: F60 (28); F20 (20); PL (14); (P = 0.021) | |
| Asthenia: F60 (23); F20 (16); PL (11); (P = 0.039) | |
| Tremor: F60 (12); F20 (4); PL (0); (P < 0.001) | |
| Sweating: F60(7); F20 (4); PL (1); (P = 0.036) | |
| Urinary frequency: F60 (5); F20 (0); PL (2); (P = 0.012) | |
| Palpitation: F60(5); F20(1); PL(1); (P = 0.017) | |
| Yawn: F60 (5); F20(1); PL(1); (P = 0.017) | |
| Mydriasis: F60 (3); F20 (0); PL(0); (P = 0.018) | |
| Vasodilation: F60(1); F20 (4); PL (0); (P = 0.029) | |
| Fluoxetine (F) vs. placebo (PL)250 | Side effects by treatment group: |
| Insomnia: F (102); PL (19); (P < 0.05) | |
| Nausea: F (90); PL(13); (P < 0.001) | |
| Asthenia, F (63); PL (7); (P < 0.001) | |
| Anxiety: F (52); PL (9); (P < 0.05) | |
| Tremor: F (42); PL (2); (P < 0.001) | |
| Dizziness: F (37); PL (4); (P < 0.05) | |
| Yawning, F (36); PL (0); (P < 0.001) | |
| Sweating: F (28); PL (2); (P < 0.05) | |
| Decreased libido: F (19); PL (1); (P < 0.05) | |
| Depression: F (30); PL (19); (P < 0.05) | |
| Myalgia: F (14); PL (12); (P < 0.05) | |
| Emotional lability: F (8); PL (8); (P < 0.05) | |
| Conjunctivitis: F (1); PL (3); (P < 0.05) | |
| Fluoxetine vs. placebo252 | Fluoxetine: hand tremor (5) |
| Placebo: Palpitations (1) | |
| Fluoxetine vs. placebo254 | Fluoxetine: rhinitis (24) |
| Placebo: rhinitis (12); (P < 0.04) | |
| Fluvoxamine vs. placebo247,299 | Fluvoxamine: nausea, dizziness and drowsiness significantly more common in patients receiving fluvoxamine |
| Fluvoxamine: Drop outs due to general side effects (8) | |
| Trazodone vs placebo255 | Trazodone significantly more dizziness and drowsiness than placebo |
| Topiramate vs. placebo251,259 | Topiramate: Dropouts (1) facial rash and irritability |
| Placebo: Dropouts (2) | |
| Brofaromine vs. placebo253 | Brofaromine: nausea (2); sleep disturbance, nausea, dizziness |
| Placebo: headache (1); dry mouth, nausea | |
| Ondansetron vs. placebo246 | No adverse events observed |
| Desipramine vs. placebo257 | NR |
| Medication plus Behavioral Intervention Trials | |
| Fluoxetine vs. individual CBT vs. fluoxetine and individual CBT261 | Fluoxetine: Dropouts due to side effects (4) |
| Fluoxetine plus CBT: Dropouts due to side effects (2) | |
| Nature of side effects NR | |
| Fluoxetine vs. manual based self-help262 | NR |
| Fluoxetine plus guided self-help vs. placebo plus guided self help vs. fluoxetine vs. placebo265 | NR |
| Desipramine 16 wks vs. despipramine 24 wks vs. desipramine 16 wks plus CBT vs. CBT only260,300 | NR |
| Interpersonal psychotherapy vs. antidepressant (fluoxetine replaced by desipramine if no effect) in CBT nonresponders263 | NR |
| CBT plus medication vs. CBT plus placebo vs. Supportive therapy plus med vs. supportive therapy plus placebo264,301 | NR |
| Behavioral Intervention Trials | |
| CBT vs. Interpersonal psychotherapy269 | 9 withdrawn from treatment: 7 severe depression, 1 acute onset of panic disorder |
| CBT vs. exposure response prevention274 | NR |
| CBT vs. Behavior therapy vs. interpersonal psychotherapy267,276,277 | Behavior therapy: Drop out (1) severe weight loss |
| Group CBT vs. group Interpersonal psychotherapy vs. waiting list control287 | NR |
| CBT vs. interpersonal psychotherapy288 | NR |
| CBT vs. supportive-expressive therapy278 | NR |
| Cognitive therapy vs. nutritional therapy280 | NR |
| CBT vs. physical exercise vs. nutritional counseling283 | Exercise: injury (1) |
| Individual CBT vs. Group CBT273 | Alcohol abuse (2), AN (1), visual hallucinations (1). No indication of which group these participants were in. |
| CBT vs. CBT plus response prevention vs. self-monitoring vs. waiting-list266 | NR |
| CBT plus exposure with response prevention to pre-binge cues vs. CBT plus exposure to response prevention with pre-purge cues vs. CBT plus relaxation training270–272 | NR |
| Nutritional management vs. stress management281 | NR |
| DBT vs. waiting list282 | NR |
| Self-help Trials | |
| Guided self-help vs. group CBT290 | NR |
| Self-help manual vs. waiting list control291 | NR |
| Self-help intervention vs. clinic intervention292 | NR |
| CBT vs. guided self-change sessions293 | NR |
| Other Trials | |
| Active light vs. placebo dim light295 | |
| Crisis prevention vs. follow up297 | |
| Guided imagery vs. control296 | |
CBT, cognitive behavioral therapy; DBT, dialectical behavioral therapy; NR: not reported
If no numbers appear in parentheses, authors had only listed adverse events but not reported the number of cases.
P values indicate differences between groups; they are reported with they are provided by the author.
Side effects of MAOI administration were nausea, sleep disturbance, and dizziness. No hypertensive crises were reported, although this danger should always be considered in patients who experience uncontrollable eating episodes.121
A few medication trials for BN explored factors associated with outcome. Walsh et al. reported that patients with greater concern for body shape and weight and longer duration of illness had more favorable treatment responses.257 The Fluoxetine BN Collaborative Study group found that heavier patients had higher response rates in each treatment group.249
Behavioral interventions in BN provided better and reasonably consistent information about factors associated with treatment response. Several investigators reported two factors as associated with poor outcome: high frequency of binge eating270, 272, 274, 298, 301 and longer duration of illness.274, 298
Evidence was mixed or contradictory for other factors. Higher body dissatisfaction was associated with both poorer270 and better outcome.277 With respect to weight, a history of obesity was reported as a positive prognostic indicator270 and as a predictor of dropout.278 Better outcomes or more rapid response were associated with higher baseline depression, lower severity of binge eating,287 and greater attitudinal disturbance at baseline.277 Positive response was reported to be associated with a history of obesity, a history of alcoholism, and high scores for self-directedness270 and self-control.280 Poorer outcomes were associated with greater food restriction, higher depression, higher drive for thinness and bulimia scores on the EDI, and greater cue reactivity,270 low self-esteem,277 and precontemplation stage of change.268
Factors associated with positive response to self help included higher EDI perfectionism scores, higher Dimensional Assessment of Personality Pathology (DAPP) compulsivity scores, higher DAPP intimacy problem scores, and lower cognitive behavior knowledge scores.291
Higher soothing receptivity and ability to tolerate aloneness were associated with more positive outcomes in guided imagery therapy.296
The total number of individuals enrolled in the 18 trials of drugs or drug plus behavioral interventions was 1,941. Of those 67 were men. No studies reported differential outcome by age. Thirteen studies failed to report the race or ethnicity of participants. Of those that did, 793 participants were identified as white, 57 as nonwhite, 33 as Asian, 12 as Hispanic American, and eight as African American. Of the 18 trials, 12 were conducted in the United States. No study analyzed results separately by sex or by race or ethnicity. Based on these results, we conclude that no information exists regarding differential efficacy of medication only or combined medication plus behavioral interventions for BN by sex, gender, age, race, ethnicity, or cultural group.
The total number of individuals enrolled in behavioral intervention or other intervention trials was 1,462. Of those, two were men. Of the 18 trials, 14 failed to reported race or ethnicity of participants. From the remaining four trials, 410 subjects were identified as white; 22 as nonwhite; 28 Hispanic-American; 26 as Asian; Maori or Pacific Islander; 19 as African-American or Afro-Caribbean; and one as Native American. In no instance did the investigators analyze results separately by race or ethnic group. No data exist regarding differential efficacy of behavioral interventions for BN by sex, gender, age, race, ethnicity, or cultural group.
This chapter presents results of our literature search and our findings for the four key questions (KQs) pertaining to binge eating disorder (BED). KQ 1 sought evidence for the efficacy of various treatments or combinations of treatments for BED. KQ 2 sought evidence of harms associated with the treatment or combination of treatments for BED. KQ 3 addressed factors associated with the efficacy of treatment for BED. KQ 4 addressed whether the efficacy of treatment for BED differs by sex, gender, age, race, ethnicity, or cultural groups. We report first on specific details about the yields of the literature searches and characteristics of the studies, then on literature pertaining to treatment (KQ 1, KQ 2, and KQ 3). Summary tables presenting findings grouped by selected outcomes appear at the end of this chapter.
We identified 26 studies addressing treatment efficacy for BED. Nine were medication-only trials.305–313 We rated four of these trials as good,305, 307, 309, 312 and five as fair.306, 308, 310, 311, 313 One study of a medication no longer available in the United States (d-fenfluramine) is not discussed here.313 The medications studied included second-generation antidepressants,305–309 tricyclic antidepressants,310 an anticonvulsant,311 sibutramine,312 and d-fenfluramine.313
Four trials combined medication with behavioral interventions using second-generation antidepressants,314, 315 a tricyclic antidepressant,316 and orlistat.317 Of these, we rated two as good,315, 317 one as fair,316 and one as poor.314
We identified eight behavioral-intervention-only studies. Of these, we rated one trial as good,318 three as fair,319–321 and four as poor.322–325 Of the four fair or good studies, three used some form of cognitive behavioral therapy (CBT) in comparison to other interventions318–320 and one used dialectical behavior therapy (DBT).321
Three trials investigated various self-help methods.326–328 We rated one as good326 and two, which report on the same sample at two points in time, as fair.327, 328 Finally, one trial involved exercise, rated as poor,329 and another examined virtual reality therapy, rated as fair.330
| Reasons Contributing to Poor Ratings | Types of Intervention, Number of Times Flaw Was Detected, and Citations |
|---|---|
| Research Aim | |
| Hypothesis not clearly described | Medication-only trials: 0 |
| Psychotherapy trials: 0 | |
| Study Population | |
| Characteristics not clearly described | Medication-only trials: 0 |
| Psychotherapy trials: 0 | |
| No specific inclusion or exclusion criteria | Medication-only trials: 0 |
| Psychotherapy trials: 0 | |
| Randomization | |
| Protections against influence not in place | Medication-only trial: 1314 |
| Psychotherapy trials: 3322–324 | |
| Approach not described | Medication-only trials: 0 |
| Psychotherapy trials: 1324 | |
| Whether randomization had a fatal flaw not known | Medication-only trials: 0 |
| Psychotherapy trials: 4322–325 | |
| Comparison group(s) not similar at baseline | Medication-only trials: 0 |
| Psychotherapy trials: 0 | |
| Blinding | |
| Study subjects | Medication-only trials: 1314 |
| Psychotherapy trials: 0 | |
| Investigators | Medication-only trials: 1314 |
| Psychotherapy trials: 0 | |
| Outcomes assessors | Medication-only trial: 1314 |
| Psychotherapy trials: 4322–325 | |
| Interventions | |
| Interventions not clearly described | Medication-only trials: 0 |
| Psychotherapy trials: 0 | |
| No reliable measurement of patient compliance | Medication-only trials: 0 |
| Psychotherapy trials: 2322,323 | |
| Outcomes | |
| Results not clearly described | Medication-only trials: 0 |
| Psychotherapy trials: 0 | |
| Adverse events not reported | Medication-only trials: 0 |
| Psychotherapy trials: 3322–324 | |
| Statistical Analysis | |
| Statistics inappropriate | Medication-only trials: 0 |
| Psychotherapy trials: 1325 | |
| No controls for confounding (if needed) | Medication-only trials: 0 |
| Psychotherapy trials: 2323,325 | |
| Intention-to-treat analysis not used | Medication-only trials: 0 |
| Psychotherapy trials: 3323–325 | |
| Power analysis not done or not reported | Medication-only trial: 1314 |
| Psychotherapy trials: 3322–324 | |
| Results | |
| Loss to followup 26% or higher or not reported | Medication-only trials: 0 |
| Psychotherapy trials: 1325 | |
| Differential loss to followup 15% or higher or not reported | Medication-only trials: 0 |
| Psychotherapy trials: 2324,325 | |
| Outcome measures not standard, reliable, or valid in all groups | Medication-only trials: 0 |
| Psychotherapy trials: 0 | |
| Discussion | |
| Results do not support conclusions, taking possible biases and limitations into account | Medication-only trials: 0 |
| Psychotherapy trials: 0 | |
| Results not discussed within context of prior research | Medication-only trials: 0 |
| Psychotherapy trials: 0 | |
| External validity: population not representative of US population relevant to these treatments | Medication-only trials: 0 |
| Psychotherapy trials: 0 | |
| Funding/sponsorship not reported | Medication-only trial: 1314 |
| Psychotherapy trials: 0 | |
Of the 19 studies rated fair or good, 14 were conducted in the United States,305–309, 311, 315–318, 320, 321, 327, 328 and one each in Brazil,312 Germany,319 Italy,330 Switzerland,310and the United Kingdom.326 Five studies failed to report the age of participants; of the remainder, all focused on individuals 18 years of age or older (range, 18 to 65 years). With respect to sex, 1,132 individuals participated in fair or good clinical trials (984 women and 87 men; for 61 subjects, sex was not reported).
| Author | Total Enrollment, N | Total Dropouts, N (%) | G1 Treatment (% Dropout) | G2 Treatment (% Dropout) | G3 Treatment (% Dropout) | G4 Treatment (% Dropout) |
|---|---|---|---|---|---|---|
| Medication Trials | ||||||
| Arnold et al., 2002305 | 60 | 24 (40%) | Fluoxetine (57%) | Placebo (23%) | ||
| Hudson et al., 1998306 | 85 | 18 (21%) | Fluoxetine (NR) | Placebo (NR) | ||
| Pearlstein et al., 2003307 | 25 | 5 (20%) | Fluvoxamine (NR) | Placebo (NR) | ||
| McElroy et al., 2000309 | 34 | 8 (24%) | Sertaline (28%) | Placebo (19%) | ||
| McElroy et al., 2003308 | 38 | 7 (18%) | Citalopram (16%) | Placebo (21%) | ||
| Laederach-Hoffman et al., 1999310 | 31 | 2 (7%) | Imipramine (7%) | Placebo (6%) | ||
| McElroy et al., 2003311 | 61 | 26 (43%) | Topiramate (47%) | Placebo (39%) | ||
| Appolinario et al., 2003312 | 60 | 12 (20%) | Sibutramine (23%) | Placebo (17%) | ||
| Medication plus Behavioral Intervention Trials | ||||||
| Grilo, Masheb, and Wilson, 2005315 | 108 | 22 (20%) | Placebo (15%) | Fluoxetine (22%) | CBT + placebo (21%) | CBT + fluoxetine (23%) |
| Agras et al., 1994316 | 109 | 24 (22%) | Weight loss therapy (27%) | CBT + Weight loss (17%) | CBT + Weight loss + desipramine (23%) | |
| Grilo, Masheb, and Salant, 2005317 | 50 | 11 (22%) | Orlistat + CBT (24%) | Placebo + CBT (20%) | ||
| Behavioral Interventions | ||||||
| Gorin, Le Grange, and Stone, 2003320 | 94 | 32(34%) | Standard CBT (NR) | Standard CBT with spouse involvement (NR) | Waiting list control (NR) | |
| Hilbert and Tuschen-Caffier, 2004319 | 28 | 4 (14%) | CBT with a body exposure component (14%) | CBT with a cognitive restructuring component focused on body image (14%) | ||
| Wilfley et al., 2002318 | 162 | 29 (18%) | CBT (20%) | Interpersonal psychotherapy (16%) | ||
| Telch, Agras, and Linehan, 2001321 | 44 | 10 (23%) | Dialectical behavior therapy (18%) | Waiting list control (27%) | ||
| Self-help | ||||||
| Carter and Fairburn, 1998326 | 72 | 9 (12%) | Guided self-help (24%) | Pure self-help (0%) | Waiting list control (4%) | |
| Peterson et al., 1998328 | 50 (to active treatment) | 8 (16%) | Therapist-led (13%) | Partial self-help (11%) | Structured self-help (27%) | Waiting list control (0%) |
| Peterson et al., 2001327 | 51 | 7 (14%) | Therapist-led (NR) | Partial self-help (NR) | Structured self-help (NR) | |
| Riva et al., 2002330 | 20 | 0 (0%) | Virtual Reality (0%) | Psych-nutritional group (0%) | ||
CBT, cognitive behavioral therapy; G, group; N, number; NR, not reported.
| Source, Treatment, Setting, and Quality Score | Major Outcome Measures | Significant Change Over Time Within Groups | Significant Differences Between Groups at Endpoint | Significant Differences Between Groups in Change Over Time |
|---|---|---|---|---|
| Arnold et al., 2002305 | Eating:
| No statistics reported | Fluoxetine associated with lower illness severity and depressed mood, and less weight gain. | Fluoxetine superior in reducing binge frequency, illness severity, and depressed mood, and in controlling weight and BMI gain over 6 weeks. |
| Fluoxetine vs. placebo | Biomarker:
| |||
| Outpatient | Psych:
| |||
| Good | ||||
| Hudson et al., 1998306 | Eating:
| No statistics reported | No statistics reported | Fluvoxamine superior in reducing binge frequency, clinical severity, and BMI over 9 weeks. |
| Fluvoxamine vs. placebo | Biomarker:
| |||
| Outpatient | Psych:
| |||
| Fair | ||||
| Pearlstein et al., 2003307 | Eating:
| No statistics reported | No statistics reported | No differences on any measures |
| Fluvoxamine vs. placebo | Biomarker:
| |||
| Outpatient | Psych:
| |||
| Good | ||||
| McElroy et al., 2003311 | Eating:
| No statistics reported | No statistics reported. | Topiramate superior in reducing binge frequency, illness severity, eating-related obsessions, compulsions, BMI, and weight over 14 weeks. |
| Topiramate vs. placebo | Biomarker:
| |||
| Outpatient | Psych:
| |||
| Fair | ||||
| McElroy et al., 2003308 | Eating:
| No statistics reported | Citalopram asscociated with greater reduction in frequency of binge days, BMI, and weight. | Citalopram superior to placebo in the rate of reduction in frequency of binges, illness severity, binge eating related obsessions and compulsions, and weight over 6 weeks. |
| Citalopram vs. placebo | Biomarker:
| |||
| Outpatient | Psych:
| |||
| Fair | ||||
| Laederach-Hoffman et al., 1999310 | Eating:
| Imipramine decreased binge frequency and depressed mood over 8 weeks, and decreased depressed mood and weight at 32 week FU. | No statistics reported | Imipramine superior to placebo in decreasing binge frequency, depressed mood, and body weight over 8 weeks of active tx, and 32-week FU. |
| Imipramine vs. placebo (with dietary and psychological counseling | Biomarker:
| |||
| Outpatient | Psych:
| |||
| Fair | ||||
| McElroy, Casuto et al., 2000309 | Eating:
| No statistics reported | No statistics reported | Sertraline superior to placebo in reducing binge frequency, illness severity, and BMI, and in increasing global improvement over 6 weeks. |
| Sertraline vs. placebo | Biomarker:
| |||
| Outpatient | Psych:
| |||
| Good | ||||
| Appolinario et al., 2003312 | Eating:
| No statistics reported | Sibutramine associated with less depressed mood. | Sibutramine superior to placebo in reducing binge frequency and severity. |
| Sibutramine hydrochloride vs. placebo | Biomarker:
| Difference in weight at end of treatment with weight decreasing over treatment period in the sibutramine group but increasing in the placebo group. | ||
| Outpatient | Psych:
| |||
| Good | ||||
BDI, Beck Depression Inventory; BES, Binge Eating Scale; BMI, body mass index; CGI, Clinical Global Impressions; EDE, Eating Disorders Examination; FU, followup; HAM-D, Hamilton Depression Inventory; HDRS, Hamilton Depression Rating Scale; Psych, psychiatric and psychological; SCL-90, (Hopkins) Symptom Check List; SDS, Self-rating Depression Scale; vs., versus; YBOCS-BE, Yale-Brown Obsessive Compulsive Scale (modified for binge eating).
Second-generation antidepressants. Fluoxetine. One trial compared fluoxetine (average dose 71.3 mg/day) with placebo in 60 individuals meeting the Diagnostic and Statistical Manual for Psychiatric Disorders-Version IV (DSM IV) criteria for BED with three or more binges per week for 6 months and higher than 85 percent ideal body weight (IBW) in a 6-week flexible dose trial.305 Fluoxetine significantly decreased binges per week, severity of illness, and clinician-rated depression scores. It was associated with less weight gain than the placebo, although both groups gained weight during treatment. The investigators failed to report abstinence rates and long-term followup. Dropout was 57 percent in the fluoxetine group and 23 percent in the placebo group. Any inferences made from this study must be made with extreme caution because of the very high and differential attrition rate.
Other second-generation antidepressants. A 9-week trial compared fluvoxamine (50–300 mg/day) with placebo in 85 patients with BED, at least three binge eating episodes per week for 6 months, and higher than 85 percent of the midpoint of their ideal weight for height. Using intention-to-treat analyses, the investigators showed that patients on fluvoxamine had a significantly greater rate of reduction in binge frequency than those on placebo; however, the remission rate did not differ between groups.306 The rate of improvement in severity of illness but not in depression was greater in the fluvoxamine group than in the placebo group. Fluvoxamine led to a greater rate of reduction of body mass index (BMI); however, BMI at endpoint was not reported so the clinical significance of the weight change could not be evaluated. The investigators failed to report long-term followup. Overall dropout was 21 percent.
In a 12-week trial of fluvoxamine (average dose 239 mg/day) versus placebo in 20 patients with DSM-IV BED, investigators observed no differences between fluvoxamine and placebo on binge eating frequency, although both groups combined showed decreases in binge frequency at the end of treatment.307 Both groups combined had significant decreases in shape and weight concerns with no differences between them. Self-reported depression decreased similarly for both. Neither group showed significant weight change with treatment. The investigators failed to report long-term followup. Overall dropout was 20 percent.
McElroy et al. compared 6 weeks of sertraline (mean dose 187 mg/day) with placebo in 34 individuals with DSM-IV BED, at least three binge episodes per week for 6 months, and greater than 85 percent of IBW.309 Sertraline led to greater reduction in binges per week but not with complete remission when rated categorically. It was also associated with increased reduction in severity of illness but not with depression scores. The drug also led to greater reductions in weight; however, the investigators failed to report BMI at endpoint so the clinical significance of the weight change is unclear. The investigators failed to present long-term follow-up data. Dropout was 28 percent in the sertraline group and 19 percent in the placebo group.
In a 6-week trial of citalopram (40–60 mg/day) versus placebo in 38 individuals with BED, with three or more binge episodes per week for 6 months and more than 85 percent of IBW, the active drug led to a significantly greater rate of decrease of binge eating and binge eating days; however, the percentage of individuals remitted when measured categorically did not differ significantly.308 The citalopram group showed greater reductions in clinician-rated obsession and compulsion scores and in severity of illness and depression scores. The BMI rate of change was significantly greater in the citalopram group; patients lost on average 2.7 kg and those on placebo gained 5.2 kg during treatment. Although the rate of change data suggested more rapid response in the citalopram group, differences between the groups over time were not significant for the core outcome variables of binges per week or severity of illness. Dropout was 16 percent in the citalopram group and 21 percent in the placebo group.
Tricyclic antidepressants. Laederach-Hoffman et al. augmented standard bi-weekly diet counseling and psychological support with either impiramine (25 mg three times a day) or placebo in 31 individuals with DSM-IV BED and BMI greater than 27.5.310 Significantly greater reductions in binge eating episodes and Hamilton Depression Rating Scale (HAM-D) scores occurred in the impiramine group at 8 and 32 weeks. Body weight was significantly reduced in the imipramine group at 8 and 32 weeks (mean reduction of 2.1 kg at 8 weeks and 5.0 kg at 32 weeks); the placebo group gained weight. Abstinence rates were not reported. Low doses of imipramine when delivered in the context of psychological support and diet counseling led to maintenance of decreased binge eating, depression, and weight. Dropout was between 6 percent and 7 percent in both groups.
Anticonvulsants. One 14-week trial compared topiramate (average dose 212 mg/day) with placebo in 61 individuals with BED, BMI greater than 30, and a score greater than 15 on the Yale-Brown Obsessive Compulsive Scale for Binge Eating (YBOCS-BE).311 Patients receiving topiramate experienced a significantly greater rate of change and a significantly greater percentage reduction in binge episodes, binge days per week, and YBOC-BE. Severity of illness, but not depression scores, showed greater improvement in the topiramate group. Topiramate led to significantly greater and clinically meaningful weight loss (5.9 kg) than placebo (1.2 kg). No follow-up data were provided. The investigators failed to report abstinence rates or endpoint values, so estimating the magnitude of clinical significance of differences is difficult. Dropout was 47 percent in the topiramate group and 39 percent in the placebo group.
Sibutramine. A 12-week comparison of sibutramine (15 mg/day) with placebo in 60 individuals with DSM-IV BED and a Binge Eating Scale (BES) score of greater than or equal to 17 indicated that sibutramine produced significant decreases in binge days per week and BES scores than placebo.312 Sibitramine was also associated with a significant decrease in self-reported depression scores over the course of treatment. At week 12, the sibutramine group had lost on average 7.4 kg whereas the placebo group gained weight (a significant difference). The authors did not report abstinence rates or provide long-term follow-up data. Dropout was 23 percent in the sibutramine group and 17 percent in the placebo group.
Summary of medication-only trials. Treating BED in overweight individuals has two critical outcomes—decrease in binge eating and decrease in weight. Although not all BED studies explicitly sampled on the basis of weight, all focused on overweight individuals. Four selective serotonin reuptake inhibitors (SSRIs)—one serotonin, dopamine, and norepinephrine uptake inhibitor; one tricyclic antidepressant; one anticonvulsant; and one appetite suppressant—have been studied in BED. In short-term trials, SSRIs appear to lead to greater rates of reduction in target eating, psychiatric and weight symptoms, and severity of illness. However, in the absence of clear endpoint data, and in the absence of data regarding abstinence from binge eating, we cannot judge the magnitude of the clinical impact of these interventions. Moreover, lacking follow-up data after drug discontinuation, we do not know whether observed changes in binge eating, depression, and weight persist.
Low-dose imipramine as an augmentation strategy to standard dietary counseling and psychological support is associated with decreases in binge eating and weight that persist after discontinuation of the medication. This finding suggests a potentially promising pairing worth further investigation.
Both sibutramine and topiramate yielded promising results in terms of weight reduction for patients with BED: clinically significant reductions in BMI over the short term. The authors of these reports did not supply remission rates. Additional research is required to track patients after drug discontinuation to determine whether observed changes in eating behavior and weight persist.
Several studies reported rate of change of symptoms rather than actual differences in groups in change over time including endpoint values. Although rate of change is of interest, endpoint measures, including consistently defined abstinence rates, are critical to evaluate the clinical status of participants at the end of treatment.
Overall, drop-out rates were between 16 percent and 57 percent in the medication trials for BED. The high placebo response in BED is noteworthy.
| Source, Treatment, Setting, and Quality Score | Major Outcome Measures | Significant Change Over Time Within Groups | Significant Differences Between Groups at Endpoint | Significant Differences Between Groups in Change Over Time |
|---|---|---|---|---|
| Grilo, Masheb, Salant, 2005317 | Eating:
| No statistics reported | Greater percentage of CBT + orlistat group remitted and achieved at least 5 percent weight loss over 12 weeks. Group difference in weight loss maintained at 2-month FU | CBT + orlistat superior in total weight loss and in percent weight loss to post-tx over 12 weeks. |
| CBT + orlistat vs. CBT + placebo | Biomarker:
| |||
| Outpatient | Psych:
| |||
| Good | ||||
| Agras et al., 1994316 | Eating:
| No statistics reported | No statistics reported | CBT plus weight loss (with or without desipramine) superior to weight loss alone in reducing binge frequency over 12 weeks. |
| Weight loss therapy vs. CBT+weight loss therapy vs. CBT+weight loss therapy + desipramine | Biomarker:
| Significant difference between groups at 12 wks in change in weight over time with weight decreasing in weight loss group and increasing in CBT groups. By 3 month FU, CBT plus desipramine superior to CBT without desipramine in reducing weight. | ||
| Outpatient | Psych:
| |||
| Fair | ||||
| Grilo, Masheb, Wilson, 2005315 | Eating:
| No statistics reported | No statistics reported | CBT groups superior to placebo and fluoxetine alone in decreasing binge frequency, eating and shape concerns, global eating score, disinhibition, and rate of remission. |
| Fluoxetine vs. placebo vs. CBT + placebo vs. CBT + fluoxetine | Biomarker:
| CBT + fluoxetine superior to placebo alone and fluoxetine alone in decreasing weight concerns and hunger; superior to fluoxetine alone in reducing depressed mood and dietary restraint; superior to placebo in decreasing body dissatisfaction. | ||
| Outpatient | Psych:
| CBT + placebo superior to placebo alone and fluoxetine alone in decreasing depressed mood; superior to fluoxetine alone in decreasing dietary restraint, weight concerns, and body dissatisfaction. | ||
| Good | ||||
BDI, Beck Depression Inventory; BMI, Body mass index; BSQ, Body Shape Questionnaire; CBT, Cognitive Behavioral Therapy; EDE, Eating Disorders Examination; FU, followup; Psych, psychiatric and psychological; TFEQ, Three Factor Eating Questionnaire; Tx, treatment, vs., versus.
The number of participants in these combination trials ranged from 50 to 109. Age ranged from 21 to 65 years. Of these three trials, two reported the race or ethnicity of participants: 140 individuals were reported as white, 12 as African American, and six as Hispanic American.315, 317 The United States was the site of all three trials.
Second-generation antidepressants and CBT. Grilo et al. compared fluoxetine (60 mg/day) with placebo, both with and without CBT, in a 16-week trial.315 Treatment groups receiving CBT reported greater reductions in binge episodes, eating and shape concerns, disinhibition, and depression and greater remission rates than did the medication-only or placebo groups. Weight loss did not differ across groups; the authors did not report within-group weight loss over time. Dropout between groups was comparable (between 15 percent and 23 percent).
Tricyclic antidepressants and CBT. Agras et al. compared the effects of weight-loss treatment, CBT, and desipramine in 109 individuals with DSM IV BED. They randomly allocated participants to 9 months of weight-loss-only therapy, 3 months of CBT followed by 6 months of weight-loss therapy, or 3 months of CBT followed by 6 months of weight-loss therapy and desipramine (300 mg/day).316 Groups receiving CBT showed significant reduction in binge eating at 12 weeks but not at any later follow-up point. Likewise, any observed differences on self-report measures of eating pathology were no longer significantly different at 36 weeks. Changes in depression scores did not differ across groups. Initial weight loss was greater in the weight-loss therapy group. At 3-month followup, the greatest weight loss was seen in the group including CBT and desipramine (average reduction of 4.8 kg from baseline). Dropout from acute treatment was comparable across groups: from 27 percent in the weight-loss therapy group to 17 percent in the CBT plus weight-loss therapy group.
Orlistat and CBT. In a 12-week trial of orlistat (120 mg three times/day) with CBT and placebo with CBT in 50 individuals with DSM-IV BED and BMI > 30, the orlistat group had greater remission rates at the end of treatment but not at 2-month followup.317 The authors reported no differences in any other eating-related or depression measures. Individuals in the orlistat group experienced greater initial weight loss (-3.5 kg) than those in the placebo group (-1.6 kg), but that loss was not maintained at followup; at followup, however, the orlistat group was more likely to have achieved a weight loss of 5 percent or more. Dropout (about 20 percent) was comparable between groups.
Summary of medication plus psychotherapy trials. Adding CBT conferred benefit on remission rate, but not weight loss, over fluoxetine alone or placebo alone in one trial.315 Adding CBT to orlistat was associated with a greater decrease in weight during treatment, although this does not appear to be maintained at followup.317 In one trial, adding desipramine to CBT and weight loss therapy led to greater maintenance of weight loss over time.316 Combining medication and CBT may improve both binge eating and weight loss, although sufficient trialshave not been done to determine definitively which medications are best at producing and maintaining weight loss. Moreover, the optimal duration of medication treatment for sustained reductions in binge eating and maintenance of weight loss has not yet been addressed empirically.
| Source, Treatment, Setting, and Quality Score | Major Outcome Measures | Significant Change Over Time Within Groups | Significant Differences Between Groups at Endpoint | Significant Differences Between Groups in Change Over Time |
|---|---|---|---|---|
| Gorin et al., 2003320 | Eating:
| No statistics reported | Higher percent abstinent in CBT groups compared to waiting list. | CBT (with and without spouse involvement) superior to waiting list in decreasing number of binge days, disinhibition, hunger, depressed mood, and BMI over 12 weeks. |
| Group-based CBT vs. CBT with spouse involvement vs. waiting list | Biomarker:
| |||
| Outpatient | Psych:
| |||
| Fair | ||||
| Hilbert and Tuschen-Caffier, 2004319 | Eating:
| Binge frequency, depressed mood, shape and weight concerns, body dissatisfaction, and restraint decreased in both groups over time. | No differences in percent recovered. | No differences on any measures. |
| CBT+exposure vs. CBT+cognitive interventions for image disturbance | Biomarker:
| |||
| Outpatient | Psych:
| |||
| Fair | ||||
| Wilfley et al., 2002318 | Eating:
| Both interventions associated with decreased number of binge days and eating restraint at post-tx, 4- and 8-month FU. | Less restraint in CBT at post-tx and 4-month FU. | CBT superior in decreasing eating restraint at post-tx and 4, 8, and 12 month FU. |
| CBT vs. IPT | Biomarker:
| Both tx associated with decreased GSI total scores; shape, weight, and eating concerns, restraint, and depressed mood at post-tx. | ||
| Outpatient | Psych:
| |||
| Good | ||||
| Telch et al., 2001321 | Eating:
| No statistics reported | No statistics reported | DBT superior to waiting list control in decreasing number of binge episodes and binge days, binge severity, and weight, shape, and eating concerns. |
| DBT vs. waiting list | Biomarker:
| |||
| Outpatient | Psych:
| |||
| Fair | ||||
BDI, Beck Depression Inventory; BES, Binge Eating Scale; BMI, body mass index; CBT, Cognitive Behavioral Therapy; DBT, Dialectical Behavior Therapy; EDE, Eating Disorders Examination; EES, Emotional Eating Scale; FU, followup; GSI, General Severity Index (derived from BSI); PANAS, Positive and Negative Affect Schedule; Psych, psychiatric and psychological; RSE, Rosenberg Self-Esteem Scale; SCL-90, (Hopkins) Symptom Check ListTFEQ, Three Factor Eating Questionnaire; Tx, treatment, vs. versus.
| Source, Treatment, Setting, and Quality Score | Major Outcome Measures | Significant Change Over Time Within Groups | Significant Differences Between Groups at Endpoint | Significant Differences Between Groups in Change Over Time |
|---|---|---|---|---|
| Carter and Fairburn, 1998326 | Eating:
| Both self-help groups decreased binge eating, GSI, and EDE global at 12-week post-tx. | Both self-help groups associated with higher abstinence rates, less binge eating, and lower GSI, EDE global and restraint scores, compared to waiting list at post-tx. | Guided self-help superior to non-guided self-help and waiting list in reducing eating restraint over 12 weeks. |
| Guided self-help vs. non-guided self-help vs. waiting list | Biomarker:
| Guided self-help only decreased eating restraint at post-tx. | Guided self-help associated with less restraint and binge eating at 3 month FU and with less binge eating at 6 month FU compared to non-guided self-help. | |
| Outpatient | Psych:
| |||
| Good | ||||
| Peterson et al., 1998328 | Eating:
| No statistics reported | Abstinence rates for binge eating higher in each of the CBT groups compared to waiting list | CBT groups superior to waiting list in decreasing objective and total binge episodes/week, hours spent binge eating/week, binge severity, disinhibition, and hunger over 8 weeks. |
| Therapist-led group CBT vs. partial self-help group CBT vs. structured self-help group CBT vs. waiting list | Biomarker:
| |||
| Outpatient | Psych:
| |||
| Fair | ||||
| Peterson et al., 2001327 | Eating:
| No statistics reported | Abstinence from total binge episodes higher in structured self-help group versus therapist-led self-help and partial self-help groups. | No differences on any measures |
| Therapist-led group CBT vs. partial self-help group CBT vs. structured self-help group CBT | Biomarker:
| |||
| Outpatient | Psych:
| |||
| Fair | ||||
BDI, Beck Depression Inventory; BES, Binge Eating Scale; BMI, Body mass index; BSI, Brief Symptom Inventory; BSQ, Body Shape Questionnaire; CBT, Cognitive Behavioral Therapy; EDE, Eating Disorders Examination; FU, followup; GSI, General Severity Index (derived from BSI); HDRS, Hamilton Depression Rating Scale; Psych, psychiatric and psychological; RSE, Rosenberg Self-Esteem Scale; TFEQ, Three Factor Eating Questionnaire; Tx, treatment, vs., versus.
| Source, Treatment, Setting, and Quality Score | Major Outcome Measures | Significant Change Over Time Within Groups | Significant Differences Between Groups at Endpoint | Significant Differences Between Groups in Change Over Time |
|---|---|---|---|---|
| Riva et al., 2002 | Eating:
| Virtual reality tx associated with increased ideal body score and WELSQ total score, and decreased state anxiety. | No statistics reported | Virtual reality tx superior to psycho-nutritional tx in increasing WELSQ total score and in decreasing state anxiety and overeating. |
| Virtual reality-based tx for body image vs. CBT-based psycho-nutritional group therapy | Psych:
| |||
| Inpatient | ||||
| Fair |
BMI, Body mass index; BIAQ, Body Image Avoidance Questionnaire; BSS, Body Satisfaction Scale; CBT, cognitive behavioral therapy; CDRS, Contour Drawing Rating Scale; DIET, Dieter's Inventory of Eating Temptations; FRS, Figure Rating Scale; Psych, psychiatric and psychological; STAI, Spielberger State-Trait Anxiety Inventory; Tx, treatment; WELSQ, Weight Efficacy Life-style Questionnaire.
In behavioral intervention trials, CBT tailored for BED was the most commonly tested therapeutic approach; one study used DBT. The total number of individuals enrolled in psychotherapy, self-help, exercise, and virtual reality trials was 481 (449 women and 32 men). Of the eight trials identified, participants ranged in age from 18 to 65 years. Six trials reported the race and ethnicity of participants: in all, they involved 401 persons identified as white, 19 individuals as nonwhite, eight as African American or Afro-Caribbean, six as Hispanic American, one as Native American, and one as Asian. In no instance were results analyzed specifically by race or ethnic group. Of the eight trials, five were conducted in the United States and one each in Germany, the United Kingdom, and Italy.
Behavioral intervention trials for binge eating disorder. CBT. A 12-week trial of standard CBT tailored for BED compared with CBT and spousal involvement and with a waiting list control group in 94 individuals with a BMI of 25 or more showed that both active CBT groups had significant reductions in days binged, BMI, disinhibition, hunger, depression, and self-esteem than the controls and were more likely to be abstinent from binge eating at the end of treatment. Adding spousal involvement did not produce significantly greater improvements than standard CBT.320 Both CBT groups had significantly lower depression scores and BMI, but they did not differ from each other. The average BMI decrease from baseline to followup was 0.11 for CBT and 0.77 for CBT with spousal involvement, suggesting that CBT alone, with or without a spouse participating, did not yield substantial weight change. Overall, dropout was 34 percent.
Hilbert et al. studied 5 months of group CBT with body exposure treatment and group CBT with cognitive restructuring of negative body cognitions in 28 women with BED, using a broad inclusion criterion of at least one binge per week.319 Both groups showed decreases in binge eating, psychological aspects of binge eating, self-report binge eating scores, and decreases in self-report depression, but differences between groups were not statistically significant. Neither group experienced significant weight loss. Dropout was 14 percent in each group.
Looking at the efficacy of group psychotherapy, Wilfley et al. compared group CBT with group IPT in 20 sessions with 3 additional individual sessions in 162 individuals with BED and BMI levels between 27 and 48.318 Both therapies led to significant decreases in the number of days binged at the end of treatment and at 4-month followup. CBT led to greater improvements in Eating Disorders Examination Restraint scores at all time points. At 12 months, the groups did not differ in abstinence (CBT, 72 percent; IPT, 70 percent), severity of illness, or depression; both treatments led to significant reductions in these parameters. No participants in either group experienced reductions in BMI across treatment or follow-up periods. Dropout was 20 percent in CBT and 16 percent in IPT.
Dialectical behavioral therapy. Twenty weeks of DBT led to greater reduction in binge days, binge episodes, weight concern, shape concern, and eating concern than did being in a waiting list control group in 44 women with DSM-IV BED.321 Depression and anxiety scores did not differ. The authors did not report whether DBT was associated with significant change in weight, although no differences in weight loss emerged between groups during treatment. Dropout was 18 percent in the DBT group and 55 percent in the waiting list group.
Self-help trials. Carter and Fairburn compared guided self-help using a book302 combined with six to eight sessions with a facilitator with self-help-only using the same book in the absence of a facilitator and with waiting list controls in 72 women with BED with weekly binges.326 Both self-help approaches were more efficacious than the control arm in reducing the mean number of binge days and improving abstinence and cessation rates and EDE scores. At the end of treatment, both self-help groups showed significantly greater reductions in clinical severity than the control group. No group reported significant weight loss at any point. Comparisons of the two self-help groups yielded no differences in eating, depression, or BMI measures at any follow-up point. Dropout was 24 percent from guided self-help and 4 percent from the control group; self-help-only had no dropouts.
In a four-group comparison, Peterson et al. compared therapist-led self-help, partial self-help, structured self-help, and waiting list controls in 61 individuals with DSM IV BED.328 In therapist-led self-help, a doctoral-level therapist led both the psychoeducational component and group discussion; in the partial self-help group, participants viewed a 30-minute psychoeducational videotape and then participated in a therapist-led discussion; and in the structured self-help group, subjects viewed the 30-minute psychoeducational videotape and then led their own 30-minute discussion. All self-help groups performed better than controls on objective binges, total binges, hours spent bingeing, and self-reported eating attitudes. For abstinence rates, all self-help groups (68 percent to 87 percent) were better than controls (12.5 percent). The groups did not differ in depression scores or BMI changes. Dropout was higher in the structured self-help group (27 percent) than in the therapist-led (13 percent) and partial (11 percent) self-help groups.
The second report on this sample compared therapist-led self-help, partial self-help, and structured self-help in 51 individuals with DSM-IV BED.327 All three approaches led to significant decreases in objective binges, hours spent bingeing, and body dissatisfaction. Structured self-help led to significantly greater abstinence at the end of treatment but not at followup. Depression scores decreased over time but not differentially across groups. BMI changes did not differ across groups; the authors did not report whether significant decreases occurred within groups, but the numerical changes appeared to be minimal. Dropout was not reported.
Additional interventions for binge eating disorder. In an inpatient trial, Riva et al. compared virtual reality therapy to psychonutritional control in 20 women with DSM IV BED.330 Virtual reality therapy uses interactive three-dimensional visualization, a head-mounted display, and data gloves to modify body image perceptions. In this very small study with a large number of outcome measures, the investigators compared seven sessions of virtual reality plus a low-calorie diet and physical training with psychonutritional CBT, a low-calorie diet, and physical training. Virtual reality showed significant improvements in weight efficacy and diet scores. Abstinence did not differ significantly between groups and was 100 percent in each, most likely secondary to intensive inpatient treatment. Dropout was not reported.
Summary of behavioral interventions for binge eating disorder. Investigators most frequently chose to study CBT. However, no basic trial comparing individually administered CBT with waiting list, treatment as usual, or a second therapy was rated as fair or good.
The three fair- or good-rated trials that incorporated CBT provided treatment for between 12 weeks and 5 months. Collectively, these trials indicated two main findings. First, CBT is effective in reducing either the number of binge days or the actual number of reported binge episodes. Second, in comparison to waiting list controls, it leads to greater rates of abstinence when administered either individually or in group format, and this abstinence persists for up to 4 months post treatment. CBT also improves the psychological aspects of BED such as ratings of restraint, hunger, and disinhibition. Results are mixed as to whether CBT improves self-rated depression in this population. In all three studies CBT did not lead to decreases in weight. Whether the successful treatment of BED with CBT is associated with less weight gain (as opposed to actual weight loss) over time in individuals with BED has not yet been adequately addressed. Similarly, DBT (one trial) is associated with decreases in binge eating and psychological aspects of the disorder but not with definitive change in depression or anxiety or apparent weight loss.
Although CBT and DBT decrease binge eating and related psychological features of the disorder, they have no observable impact on the important outcome variable of weight loss. This is a somewhat puzzling finding as one would expect decreases in binge eating to be associated with weight loss. The reason for no weight loss is unclear. It is possible that calories previously consumed as binges may be distributed over nonbinge meals; or, how patients label binges and nonbinge meals may change with treatment. In any case, despite reported changes in eating patterns, little demonstrable weight change is achieved.
Self-help (three trials) is efficacious in decreasing binge days, binge eating episodes, and psychological features associated with BED. It also leads to greater abstinence from binge eating when compared to individuals randomized to a waiting list control condition; short-term abstinence rates approximate those seen in face-to-face psychotherapy trials. No self-help trials led to significant decreases in self-rated depression scores or weight in comparison to waiting list controls. Virtual reality therapy must be viewed as experimental; the intensive inpatient treatment associated with this trial invariably affects the perfect abstinence rates observed in both treatment groups. Observing any added efficacy of virtual reality therapy is difficult at best.
Overall dropout rates in behavioral interventions for BED were between 11 percent and 27 percent in active treatment groups.
| Intervention | Adverse Events Reported |
|---|---|
| Medication Trials*† | |
| Fluoxetine versus placebo305 | Fluoxetine group: sedation (5), dry mouth (11), headache (9), nausea (7), insomnia (7), diarrhea (6), fatigue (6), increased urinary frequency (4), sexual dysfunction (4). |
| Both groups: hand and foot swelling, palpitations, and apathy; (P = NS) | |
| Fluvoxamine versus placebo306 | Fluvoxamine group: insomnia, headache, nausea, asthenia, depression, dizziness, somnolence, abnormal dreams, dry mouth, nervousness, and decreased libido. Insomnia, nausea, and abnormal dreams significantly more common in fluvoxamine than placebo. |
| Fluvoxamine versus placebo307 | Fluvoxamine group: sedation (8); nausea (4); dry mouth (4); decreased libido (3) |
| Placebo group: sedation (3); nausea (1); dry mouth (3); decreased libido (0) (P = NR) | |
| Sertraline versus placebo309 | Sertraline group: insomnia (7) |
| Placebo group: insomnia (1) (P = 0.04) | |
| Citalopram versus placebo308 | Citalopram group: sweating (9) (P = 0.008); fatigue (5) (P = 0.05) |
| Placebo group: sweating (1); fatigue (0) | |
| Also reported: dry mouth, headache, diarrhea, nausea, sedation, insomnia, sexual dysfunction | |
| Imipramine versus placebo310 | Imipramine group: skin eruptions and an aversion to tablet intake (1) anticholinergic effects (7) |
| Placebo group: hunger, sweating, palpitations, arrhythmia, and general malaise (1); anticholinergic effects (3); (P < 0.05) | |
| Topiramate versus placebo311 | Topiramate group: headache, paresthesias and amenorrhea |
| Placebo: leg cramps, sedation and testicular soreness | |
| Sibutramine hydrochloride versus placebo312 | Sibutramine: dry mouth (22); headache (6); constipation (7) |
| Placebo: dry mouth (3); headache (14); constipation (0) (P < 0.01) | |
| All other adverse events did not differ significantly (i.e., nausea, insomnia, sudoresis, lumbar pain, depressive mood, flu syndrome, malaise, others) (P = NS) | |
| Medication Plus Behavioral Intervention | |
| Placebo versus fluoxetine versus CBT + placebo versus CBT + fluoxetine315 | NR |
| Weight loss treatment versus CBT versus desipramine316 | 8 subjects discontinued desipramine because of side effects |
| Orlistat plus CBT versus Placebo plus CBT317 | Orlistat + CBT: significantly more gastrointestinal events |
| Behavioral Interventions | |
| Standard CBT versus CBT with spouse involvement versus waiting list control320 | NR |
| CBT + exposure versus CBT + cognitive interventions for body image disturbances319 | NR |
| CBT versus IPT318 | NR |
| Dialectical behavioral therapy versus waiting list control321 | 3 women in DBT group were treated with either psychotherapy or medication for a major depressive episode. |
| Self-help | |
| Guided self-help versus pure self-help versus waiting list control326 | NR |
| Therapist-led versus partial self-help versus structured self-help versus waiting list control328 | NR |
| Therapist-led versus partial self-help versus structured self-help327 | NR |
| Other Behavioral Interventions | |
| Virtual reality based treatment versus psychonutritional control330 | No adverse events observed |
CBT, cognitive behavioral therapy; IPT, interpersonal psychotherapy; NR, not reported; NS, not significant, vs., versus.
If no numbers appear in parentheses, authors had only listed adverse events but not reported the number of cases.
P values indicate differences between groups, they are reported when provided by author.
No direct adverse events were reported for any psychotherapy trials for BED. In the DBT trial, three individuals required treatment for depression during the follow-up period.
Few studies reported on factors associated with efficacy of treatment in BED. Early abstinence from binge eating was associated with significantly greater weight loss in one study.316 In one self-help trial, higher initial self-esteem was associated with poorer outcome; however, the effect was small, accounting for 6 percent of the variance in outcome.326
The total number of individuals enrolled in the 12 drug or medication plus behavioral intervention trials was 680; of those, 55 were men. The age range of participants was reported in eight of the 12 studies; no study reported differential outcome by age. Of the seven studies that did report race or ethnicity, 374 participants were identified as white, 29 as nonwhite, 12 as African American, and six as Hispanic-American. Ten trials were conducted in the United States. No study analyzed results separately by sex, gender, race, or ethnicity. Based on these results, we conclude that no information exists about differential efficacy of pharmacotherapy interventions for BED by sex, age, gender, race, ethnicity, or cultural group.
The total number of individuals enrolled in psychotherapy, self-help, or other behavioral trials was 532; of those, 32 were men. Participants ranged in age from 18 to 64. No studies looked at BED treatment for children or adolescents. From the trials that reported race or ethnicity, participants included 450 whites, 19 nonwhites, eight African Americans or Afro-Caribbeans, six Hispanic-Americans, one Native American, and one Asian. In no instance did the investigators analyze results separately by race or ethnic group. No data exist regarding differential efficacy of psychotherapeutic treatment for BED by sex, age, gender, race, ethnicity, or cultural group.
This chapter presents the results of our literature search and findings for key questions (KQs) 5 and 6. KQ 5 asks what factors are associated with outcomes among individuals with the following eating disorders: anorexia nervosa (AN), bulimia nervosa (BN), and binge eating disorder (BED). KQ 6 asks whether outcomes for each of these disorders differ by sex, gender, age, race, ethnicity, or cultural groups.
We report our results separately for each disease in three main sections of this chapter. Use of the term “significant” means that differences over time or between groups were statistically significant at least at the P < 0.05 level.
We include literature that discusses more than one disease if findings do not combine individuals with different eating disorders. The review focuses on four main outcomes categories of interest: those related to eating, those involving psychiatric or psychological variables, those measured by biomarkers (e.g., weight, menstruation), and death. Many studies were conducted outside the United States, including Germany, England, Scotland, Sweden, China, Japan, New Zealand, and Australia. For that reason, we note in many cases below the setting (city, country) of the studies to emphasize the extent to which this literature is not directly generalizable to US populations and reflects variations across locales.
We include summary tables containing information on outcomes for studies that we rated fair or good. Similar to text, tables group studies by design: cohort (following a group of individuals, with the disease, identified from the community) or case series (following a group of individuals, with the disease, who received treatment) and whether a nondisease comparison group is followed as well. Articles that discuss results from the same study (the same sample for the same amount of time) are grouped in the same row. Finally, within these categories, we list studies alphabetically by author.
| Reasons Contributing to Poor Ratings | Types of Disease, Number of Times Flaw Was Detected, and Citations |
|---|---|
| Research Aim | |
| Hypothesis not clearly described | Anorexia Nervosa: 0 |
| Bulimia Nervosa: 0 | |
| Binge Eating Disorder: 0 | |
| Study Population | |
| Characteristics not clearly described | Anorexia Nervosa: 1331 |
| Bulimia Nervosa: 0 | |
| Binge Eating Disorder: 0 | |
| No specific inclusion or exclusion criteria | Anorexia Nervosa: 2331,332 |
| Bulimia Nervosa: 1333 | |
| Binge Eating Disorder: 0 | |
| Study groups not comparable to each other and/or to non-participants with regard to confounding factors or characteristics | Anorexia Nervosa: 0 |
| Bulimia Nervosa: 0 | |
| Binge Eating Disorder: 0 | |
| Eating Disorder Diagnosis Method | |
| Used independent clinician diagnosis or method used not reported | Anorexia Nervosa: 2331,334 |
| Bulimia Nervosa: 0 | |
| Binge Eating Disorder: 0 | |
| None used to diagnose patients similar in treatment/disease and comparison groups | Anorexia Nervosa: 0 |
| Bulimia Nervosa: 0 | |
| Binge Eating Disorder: 0 | |
| Study Design | |
| Participants drawn from a treatment program in one city or area not reported | Anorexia Nervosa: 5332,334–337 |
| Bulimia Nervosa: 1333 | |
| Binge Eating Disorder: 0 | |
| No comparison group | Anorexia Nervosa: 6332331,334–337 |
| Bulimia Nervosa: 1333 | |
| Binge Eating Disorder: 0 | |
| Statistical Analysis | |
| Statistics inappropriate | Anorexia Nervosa: 0 |
| Bulimia Nervosa: 0 | |
| Binge Eating Disorder: 0 | |
| No controls for confounding (if needed) | Anorexia Nervosa: 4331,332,335,336 |
| Bulimia Nervosa: 0 | |
| Binge Eating Disorder: 0 | |
| Power analysis not done or not reported | Anorexia Nervosa: 5331,332,334–336 |
| Bulimia Nervosa: 1333 | |
| Binge Eating Disorder: 0 | |
| Results/Outcome Measurement | |
| Outcome assessor not blinded or not reported | Anorexia Nervosa: 3331,332,337 |
| Bulimia Nervosa: 0 | |
| Binge Eating Disorder: 0 | |
| Outcome measures not standard, reliable, or valid in all groups | Anorexia Nervosa: 0 |
| Bulimia Nervosa: 0 | |
| Binge Eating Disorder: 0 | |
| Interpretation of statistical tests inappropriate | Anorexia Nervosa: 0 |
| Bulimia Nervosa: 0 | |
| Binge Eating Disorder: 0 | |
| External Validity | |
| Population not representative of US population relevant to these treatments | Anorexia Nervosa: 2331,336 |
| Bulimia Nervosa: 0 | |
| Binge Eating Disorder: 0 | |
| Discussion | |
| Results do not support conclusions, taking possible biases and limitations into account | Anorexia Nervosa: 0 |
| Bulimia Nervosa: 0 | |
| Binge Eating Disorder: 0 | |
| Results not discussed within context of prior research | Anorexia Nervosa: 0 |
| Bulimia Nervosa: 0 | |
| Binge Eating Disorder: 0 | |
Our discussion of AN outcomes includes 38 articles exclusively discussing individuals with AN3, 7, 177, 331, 332, 334–366 and seven articles discussing individuals with both AN and BN.367–373 First we discuss results for KQ 5, then KQ 6.
| Authors, Year | Country | |
|---|---|---|
| (Quality Score) | Sample Size | Outcomes |
| Prospective Cohort, Comparison Group | ||
| Gillberg et al., 1994345 | Sweden | Years followed (mean): 5 |
| (Good) | ||
| Råstam et al., 1995356 | Cases: 51 | ED dx at FU: AN: 6%; BN: 22%; EDNOS: 14%; None: 59% |
| (Good) | Comparisons: 51 | Recovered (M-R scale): 47% |
| M-R outcomes: Good: 41%; Intermediate: 35%; Poor: 24% | ||
| Nilsson et al., 1999362 | Sweden | Years followed (mean): 10 |
| (Good) | ||
| Råstam et al., 2003349 | Cases: 51 | ED dx at FU: AN: 6%; BN: 4%; EDNOS: 18%; Any ED: 27% |
| (Good) | Comparisons: 51 | M-R outcomes: Good: 49%; Intermediate: 41%; Poor: 10% |
| Wentz et al., 2001352 | ||
| (Good) | ||
| Case Series, Comparison Groups | ||
| Bulik et al., 2000342 | New Zealand | Years followed (mean): 12 |
| (Good) | ||
| Sullivan et al., 1998350 | Cases: 70 | Recovery outcomes: Fully: 30%, Partially: 49%, Chronically ill (current AN, BN or EDNOS): 21%, AN only: 10% |
| (Good) | Comparisons: 98 | |
| Halmi et al., 19917 | USA | Years followed (mean): 10 |
| (Fair) | Cases: 62 | ED dx at FU: AN: 3%, BN: 3%, Normal weight bulimia: 23%, EDNOS: 39%, No ED: 27%, Unknown: 5% |
| Comparisons: 62 | ||
| Case Series, No Comparison Groups | ||
| Ben-Tovim et al., 2001367 | Australia | Years followed (mean): 5 |
| (Good) | Cases: 92 | ED dx at FU: AN: 21%, BN: 5%, EDNOS: 9%, No ED: 59%, Unknown: 2%, Deceased: 3% |
| M-R-H Outcomes: Good: 34%, Intermediate: 54%, Poor: 13% | ||
| Dancyger et al., 1997353 | USA | Years followed (mean): 10 |
| (Fair) | Cases: 52 | Recovered: 31%, Good: 13%, Intermediate: 21%, Poor: 35% |
| Deter et al., 1994343 | Germany | Years followed, mean (range): 11.8 (9–19) |
| (Fair) | Cases: 75 | Good: 54%; Intermediate: 25% Poor:11%, Deceased: 11% |
| AN: 17% | ||
| Eckert et al., 1995338 | USA | Years followed, mean (range): 9.6 (8.5 – 10.5) |
| (Fair) | Cases: 76 | Recovered: 24%, Good: 26%, Intermediate: 32%, Poor: 12%, Deceased: 7% |
| ED dx at FU: No ED: 24%, EDNOS: 36%, BN: 22%, AN: 9%, AN/BN: 3% | ||
| Eddy et al., 2002 | USA | Years followed, median (range): 8 (8–12) |
| (Fair) | Cases: 136 | Full recovery (by subtype): Restricting pure: 46%, Restricting not pure: 22%, Binge/purge:39% |
| Relapse from full recovery (by subtype): Restricting pure: 31%, Restricting not pure: 47%, Binge/purge: 68% | ||
| Restricting subtype crossover to binge/purge subtype: 52% | ||
| Fichter et al., 1999339 | Germany | Years followed (mean): 6.2 |
| (Good) | Cases: 95 | M-R outcomes: Good: 27%, Intermediate: 25%, Poor: 42% Deceased: 6% |
| ED dx at FU: AN: 27%, BN: 17%, EDNOS: 2%, No ED: 55% | ||
| Halvorsen et al., 2004366 | Norway | Year followed, mean (range): 8.8 (3.5 – 14.5) |
| (Fair) | Cases: 51 | M-R outcomes: Good: 80%, Intermediate: 16%, Poor: 4% |
| No ED: 82%, AN: 2%, BN: 2%, EDNOS: 14% | ||
| Herzog et al., 1996370 | USA | Years followed (mean): 4 |
| (Good) | Cases: 76 | Full recovery (no symptoms for ≥ 8 wks): ANR: 8%; ANBP: 17% |
| Partial recovery (symptom reduction): ANR: 54%; ANBP: 81% | ||
| Herzog, Schellberg et al., 1997359 | Germany | Years followed, mean: 11.7 |
| (Fair) | Cases:69 | Average time to first recovery: 5.8 years |
| Herzog et al., 1999369 | USA | Years followed: Up to 11 (median = 7.5) |
| (Good) | Cases: 136 | Full recovery (no symptoms for ≥ 8 wks): ANR: 34%; ANBP: 32% Partial recovery (symptom reduction): ANR: 83%; ANBP: 82% |
| No remission: ANR: 17%; ANBP: 18% | ||
| Relapse after full recovery: 40% | ||
| Isager et al., 1985340 | Denmark | Years followed, mean (range): 12.5 (4 – 22) |
| (Fair) | Cases: 142 | Average annual hazard rate of relapse: 3% |
| Lee et al., 2003347 | Hong Kong | Years followed: 9 |
| (Fair) | ||
| Lee et al., 2005363 | Cases: 74 | M-R scale outcomes: Good: 62% (typical: 52.6%; atypical: 89.47%), Intermediate: 33% (typical: 42.11%; atypical: 5.26%), Poor: 5% (typical: 5.26%, atypical: 5.26%) |
| (Fair) | ED dx at FU: No ED: 46% (typical: 40.68%; atypical: 57.14%), AN: 15%, BN: 20% (typical: 25.42%; atypical: 4.76%), EDNOS: 19% (typical: 15.25%; atypical: 28.57%) | |
| Löwe et al., 2001348 | Germany | Years followed (mean): 21.3 |
| (Fair) | Cases: 63 | Full recovery: 51%, Partial recovery: 21%, Poor (including death): 26%, Unknown: 2% |
| Morgan et al., 1983355 | United Kingdom | Years followed, mean (range): 5.8 (4 – 8.5) |
| (Fair) | Cases: 78 | M-R Outcomes: Good: 58%, Intermediate: 19%, Poor: 19%, Deceased: 1%, unknown: 3% |
| Strober et al., 1997341 | USA | Years followed (range): 10 – 15 |
| (Fair) | Cases: 93 | Full recovery: 76%, Partial recovery: 86% |
| Dx of chronically ill at FU: AN restricting: 3%, AN binge eating: 1%, BN: 10% | ||
| Tanaka et al., 2001351 | Japan | Years followed, mean (range): 8.3 (4.0 – 17.7) |
| (Fair) | Cases: 61 | M-R outcomes: Good: 51%, Intermediate: 13%, Poor: 25%, |
| Deceased: 11% | ||
AN, anorexia nervosa; ANBP, anorexia nervosa binge eating and/or purging subtype; ANR, anorexia nervosa restricting subtype; BED, binge eating disorder; BN,bulimia nervosa; Dx, diagnosis; ED, eating disorder; EDE, Eating Disorder Examination; EDI, Eating Disorder Inventory; EDNOS, eating disorder-not otherwise specified; FU, followup; M-R scores: Morgan and Russell Scale; M-R-H Scale, Morgan-Russell-Hayward Scale; SIAB, Structured Interview for Anorexia and Bulimia Nervosa; Tx, treatment; USA, United States of America.
Many studies evaluate eating-related outcomes based on the general Morgan-Russell (M-R) scale or some modification of the scale, which evaluates weight (and menstruation in females), or the average M-R scale, which is a composite rating of subscales measuring nutritional status, mental status, sexual adjustment, menstrual functioning, and socioeconomic status. General scale categories are defined as good—normal body weight and regular menstruation—intermediate, amenorrhea or low body weight (i.e., weight less than 85 percent of average body weight [ABW]); and poor—amenorrhea and low body weight (i.e., less than 85% ABW).
Prospective cohort studies with comparison groups. We included one prospective cohort study with outcomes for individuals with AN in our review that reported results in several articles, after participants were followed for 5 years345, 356 and 10 years.349, 352, 362 AN participants were 51 residents of Göteborg, Sweden (including three males), born in 1970, who had been diagnosed with AN as adolescents. Comparisons were Göteborg residents matched to the AN group by age, sex, and school attended. Data from all articles discussing this study did not match exactly; therefore, we caution readers about ostensible trends across time based on data from different studies.
At 5-year followup, approximately one-half of the individuals with AN were considered recovered: 59 percent had no eating disorder (ED) diagnosis and 41 percent had a good outcome according to M-R scale criteria. However, 6 percent still had AN and the remainder had other eating disorders including BN (22 percent) and EDNOS (14 percent). The AN group also remained significantly more symptomatic than the nondisease comparison group on several measures such as dietary restriction, concern about body weight, worry about appearance, and Eating Attitudes Test (EAT) scores.
By 10 years, the M-R scale outcomes had improved. One-half of the cohort who had AN at baseline had a good outcome (49 percent); the percentage of the group with a poor outcome had declined from 24 percent at 5 years to 10 percent at 10 years. Still, 27 percent had an ED diagnosis at followup.
Case series studies with comparison groups. One case series study with a nondisease comparison group discussed results in two articles, Bulik et al.342 and Sullivan et al.350 For this study, investigators recontacted 70 women 12 years after referral for treatment (inpatient, outpatient, or assessment) for AN at one facility in Christchurch, New Zealand. The AN group was not limited to those with adolescent onset of the disease. The comparison group (N = 98) resided in the same city and was matched by age. Although 30 percent of individuals with AN at baseline were fully recovered, 21 percent continued to have an eating disorder at followup, with 10 percent continuing to meet Diagnostic and Statistical Manual, version III, Revised (DSM III-R) criteria for AN. The AN group also continued to exhibit worse eating-related outcomes through other measures. Controlling for age and current AN status, individuals in the AN group reported higher scores on the Eating Disorder Inventory (EDI) drive for thinness and perfectionism subscales and the Three Factor Eating Questionnaire Scale (TFEQ) cognitive restraint and hunger subscales. Similarly, Halmi et al., in a separate US study, found that almost 30 percent of the AN group were recovered at followup.7
Case series studies with no comparison groups. Among case series studies with no comparison group, we reviewed three studies limited to patients with adolescent AN onset.341, 366, 369, 370 Among a mix of 51 former outpatients and inpatients who were followed from 3.5 to 14.5 years in Norway, Halvorsen et al. found that three-quarters of participants no longer had an ED and had a good M-R general scale outcome score.366 Without controlling for the length of followup, patients who no longer had an ED were significantly less likely to be depressed or suffer from an anxiety disorder, with the exception of obsessive-compulsive disorder, which did not differ across groups.
Similarly, after following 95 patients for 10 to 15 years in the US who had all received inpatient treatment, Strober et al. found that three-quarters of participants had achieved full recovery (free of any symptoms of AN and BN for 8 consecutive weeks).341 Significant predictors of chronic AN (intermediate or poor outcome) were an extreme compulsive drive to exercise and a history of poor social relating preceding onset of illness. Significant predictors of a longer time to recovery were a more hostile attitude towards one's family and extreme compulsivity in daily routines. In both models, early onset of disease was not a significant predictor.
Using survival analysis, D. Herzog et al. found that a shorter duration of the intake AN episode was a significant predictor of recovery after four years. Other variables in the model that were not significant predictors included age at ED onset, bulimic behaviors, impulse-control behaviors, current depression, and other Axis I disorders.370 Again, at 7-year followup, the D. Herzog study found a shorter duration of intake episode and higher percentage of ABW at intake predicted both a shorter time to full recovery and a shorter time to partial recovery.369
D. Herzog and colleagues compared outcomes for restricting and for binge/purge subtypes of AN. Not all had received inpatient treatment. At up to 4-year followup, the authors found that the percentage of patients who were fully recovered (asymptomatic for at least 8 consecutive weeks) was greater in the AN-binge/purge subtype (17 percent) than in the AN-restricting subtype (8 percent).370 Corresponding to these descriptive differences, the AN-binge/purge group was significantly more likely to have recovered fully than the AN-restricting group (relative risk [RR], 4.6; 95% confidence interval [CI], 0.98–21.9). A much larger percentage achieved partial recovery (did not meet full criteria for AN but still experienced substantial symptomatology); 81 percent in the binge/purge subtype and 54 percent in the restricting group. At 7-year followup, differences between the groups in the percentage that had recovered had diminished; approximately one-third in both subgroups had fully recovered and more than 80 percent had partially recovered.369 Forty percent of patients relapsed after first recovery. After following the group for 8 years, differences in duration of disease and ABW predicted being in the binge/purge subtype but measures of impulsivity including a history of alcohol abuse, drug abuse, kleptomania, suicidality, or borderline personality did not.177
Through 8-year followup, crossover between the restricting and binge/purge subtypes was high. Of those with the restricting subtype 52 percent changed to the binge/purge subtype, with most of the crossover occurring in the first 5 years of followup.177 In contrast, Strober et al. found a lower rate of crossover (29 percent); the median time to onset of binge eating was 24 months.341
The remaining case series studies discussing eating-related outcomes are not limited to a sample of patients with adolescent onset of AN. First we report outcomes based on M-R scale criteria because they are the most common measures across studies.
A group of females who had all received inpatient treatment in Heidelberg, Germany, were followed up at several points in time. After 6 years, only 27 percent had a good M-R scale outcome, 25 percent had an intermediate outcome, and 42 percent had a poor outcome.339 However, at later followup points, more than 40 percent of living patients had good outcomes.338, 339, 353, 354
Among 74 women, 72 percent of whom had received inpatient treatment for AN, followed for an average of 9 years in Hong Kong, bivariate analyses comparing an M-R outcome of good and Shapiro Control Inventory measures found that a good M-R outcome was associated with a better overall general sense of control, a greater positive sense of control, and a lower negative sense of control.347, 363 A better M-R outcome was also associated with an initial diagnosis of atypical AN (no fat phobia). Using descriptive analyses, Tanaka et al. found, for patients who all had received inpatient treatment, that a good versus poor M-R outcome was associated with younger age at referral, younger age at admission, higher body mass index (BMI) at followup, higher minimum BMI, better menstrual functioning, and better mental state and psychosocial measures.351
Morgan and colleagues used bivariate analyses to report on UK patients followed from 4 to 8.5 years, one-half of whom had been hospitalized.355 They reported that lower general M-R outcome scores were associated with longer duration of food difficulties and longer duration of amenorrhea. Poorer average M-R outcome scores were associated with a longer duration of food difficulties, a longer duration of amenorrhea, greater family hostility towards the patient, a disturbed relationship between the patient and family, and personality difficulties.
Ben-Tovim et al. examined the characteristics of the Morgan-Russell-Hayward Scale (M-R-H scale), a modification of the M-R scale, after adding items related to binge eating and vomiting to a subscale concerning dietary and eating patterns, body concern, and body weight.367 Using multivariate analyses, the authors found that total M-R-H Scale outcomes were significantly related to the dietary and eating patterns, body concern, and body weight subscale mentioned above. Other subscales measuring menstrual pattern, mental state, psychosexual state, and work and family relations were not significant in the model. Significant predictors in a second model, predicting the same outcome, included subscale 2 at baseline of the disability adjustment scale (measuring overall behavior and social role functioning), the Flinders Medical Centre Symptom Score at baseline (measuring ED symptoms), the Body Attitudes Questionnaire Subscales (measuring a range of body-related attitudes), attractiveness at 6 months, and lastly, change in the salience of weight and shape over the first 6 months of treatment.
Studies also examined diagnostic outcomes, including the persistence of eating disorders over time. Results varied greatly across studies and were not related to length of time to followup. The percentage of individuals who continued to have an AN diagnosis at followup ranged from 9 percent to 29 percent across studies, an EDNOS diagnosis from 2 percent to 36 percent, and no eating disorder from 24 percent to 59 percent of participants.338, 339, 363, 367, 374
W. Herzog and colleagues measured change over time in the likelihood of first recovery in the Heidelberg case series, after following patients for a mean of 11.7 years.359 The average patient had a first recovery in 5.8 years, with a greater likelihood of recovering in the first 6 years than later. Significant predictors of first recovery in multivariate models were lower serum creatinine levels at baseline, less purging behavior, and the interaction of less purging and fewer social disturbances as measured by the Anorexia Nervosa Symptom Score (ANSS).
Löwe et al. followed this same group of patients for 21 years.348 Among the 63 patients, 51 percent showed a good outcome and full recovery, 21 percent were partially recovered, and 26 percent had a poor outcome and 2 percent were unknown. Poor long-term outcome (at 21 years since inpatient admission) was related to low BMI, severe psychological symptoms and social problems, higher EDI perfectionism and interpersonal trust scores, and lower hemoglobin and alkaline phosphatase levels (at 12 years since inpatient admission).
After following this group of patients for 12 years, both Deter and W. Herzog343 (N = 84, including deceased patients) and Deter et al.365 (N = 70) found that the persistence of AN symptoms was predicted by older age at onset, more somatic symptoms, more laxative use, low albumin levels, and a high value on a global prognosis score developed from the ANSS.343, 365 Baseline factors associated with relapsing versus having a persistent disorder include being younger, having a shorter disease duration, and less vomiting.343
Eckert et al. found, in descriptive analyses in a group of patients who had received inpatient treatment, that recovered patients were less likely to have major affective disorder, anxiety disorders, and phobias.338
Isager and colleagues measured relapse rates (lost 15 percent or more of weight gained during course of treatment in a year's time) among 151 patients (93 percent female) who had received treatment (inpatient or outpatient) in Copenhagen, Denmark.340 After following patients from 4 to 22 years, they found patients were experiencing a 3 percent average annual hazard rate of relapse. Relapse was greater among those whose duration of therapeutic contact was less than 1 year.
Other factors related to these types of outcomes include the following. Factors associated with poor Psychiatric Scale Ratings for AN outcomes in the Fichter and Quadflieg study included binge eating in the month before treatment, other mental illness diagnoses before treatment, and lower body weight at the end of treatment.339 In research conducted by Lee and colleagues, a group of atypical AN patients scored better at followup on the Eating Attitudes Test - 26 and the Eating Disorders Evaluation Questionnaire.347, 363 Typical versus atypical AN patients at followup had a lower sense of control in the domain of body and a stronger desire for control.
| Authors, Year | Country | |
|---|---|---|
| (Quality Score) | Sample Size | Outcomes |
| Prospective Cohort, Comparison Group | ||
| Gillberg et al., 1995346 | Sweden | Years followed (mean): 5 |
| (Good) | Cases: 51 | Diagnoses in AN group*: |
| Comparisons: 51 | OCD: 30%, Any cluster C: 37%, any SCID personality disorder: 41%, 2 or more SCID personality disorders: 24%, Asperger syndrome: 12%, any autistic-like condition: 20%; empathy disorder: 30%; OCPD/AS/Autistic-like condition at both age 16 and 21: 45% | |
| Ivarsson et al., 2000360 | Sweden | Years followed (mean): 10 |
| (Good) | Cases: 51 | Current diagnoses in AN group*: OCD:16%, axis I disorder (including ED): 53% autism spectrum disorder: 18%, cluster C: 22%, |
| Nilsson et al., 1999362 | Comparisons: 51 | Lifetime diagnoses in AN group*: Any affective disorder: 96% OCD: 35%, OCPD:55%, any anxiety disorder: 57%, Any Axis I (including and excluding ED): 100%, depressive disorder: 84%, cluster C: 63%, autism spectrum disorder: 24% |
| (Good) | ||
| Råstam et al., 2003349 | ||
| (Good) | ||
| Wentz et al., 2000361 | ||
| (Good) | ||
| Wentz et al., 2001352 | ||
| (Good) | ||
| Case Series, Comparisons Groups | ||
| Bulik et al., 2000342 | New Zealand | Years followed (mean): 12 |
| (Good) | Cases: 70 | Lifetime diagnoses (controlling for age)*: |
| Sullivan et al., 1998350 | Comparisons: 98 | Major depression: Cases: 51%; Comparisons: 36% |
| (Good) | Any mood disorder: Cases: 60%; Comparisons: 42%, | |
| Alcohol or any drug dependence: Cases: 30%; Comparisons: 12% | ||
| OCD: Cases: 16%; Comparisons: 2% | ||
| Separation anxiety disorder: Comparisons: 17%; Comparisons: 2% | ||
| Overanxious disorder: Comparisons: 37%; Comparisons: 3% | ||
| Any anxiety disorder: Comparisons: 60%; Comparisons: 33% | ||
| Halmi et al., 19917 | USA | Years followed: 10 |
| (Fair) | Cases: 62 | Lifetime diagnoses*: |
| Comparisons: 62 | Major depression: Cases: 68%; Comparisons: 21% | |
| Dysthymia: Cases: 32%; Comparisons: 3% | ||
| Obsessive-compulsive disorder: Cases: 25%; Comparisons: 6% | ||
| Agoraphobia: Cases: 14%; Comparisons: 3% | ||
| Social phobia: Cases: 32%; Comparisons: 3% | ||
| Current diagnoses*: | ||
| Major depression: Cases: 29%; Comparisons: 6% | ||
| OCD: Cases: 11%; Comparisons: 2% | ||
| Case Series, No Comparison Groups | ||
| Eddy et al., 2002177 | USA | Years followed (median): 8 |
| (Fair) | Cases: 246 | History of drug abuse at intake*: |
| AN restricting pure: 0%; AN restricting not pure: 13%; AN binge purge: 16% | ||
| Halvorsen et al., 2004366 | Norway | Years followed (mean): 8.8 |
| (Fair) | Cases: 51 | Diagnosis at followup: Depression: 22%; Anxiety (not OCD): 27%; OCD: 2% |
| Diagnoses at followup*: | ||
| Depression: No ED group: 13%; ED group: 56% | ||
| Anxiety disorder (no OCD): No ED group: 20%; ED group: 56% | ||
| Löwe et al., 2001348 | Germany | Years followed (mean): 21 |
| (Fair) | Cases: 63 | Mood disorders by Psychiatric Status Rating Scale outcomes*: |
| Good: 8%; Intermediate: 31%; Poor: 38% | ||
| Substance use disorders by Psychiatric Status Rating Scale outcomes*: | ||
| Good: 5%; Intermediate: 6%; Poor: 50% | ||
| Strober, Freeman et al., 1996358 | USA | Years followed: 10 |
| (Good) | Cases: 95 | Substance use disorder: Abuse: 12%; Dependence: 7% |
Difference between groups (P < 0.05)
AN, anorexia nervosa; AS, Asperger syndrome; CD, compulsive disorder; ED, eating disorder; OCD, obsessive-compulsive disorder; OCPD, obsessive-compulsive personality disorder; sig, significant or significantly; SCID, Structured Clinical Inventory for DSM-IV; USA, United States of America.
Prospective cohort studies with comparison groups. The one prospective cohort study that we reviewed followed individuals, at 5 and 10 years, with AN at baseline and compared them with a matched community comparison group in Göteborg, Sweden.346 At 5 years, the AN group was significantly more likely to have various personality disorders including obsessive-compulsive personality disorder, any Cluster C personality disorder (avoidant, dependent, obsessive-compulsive, or passive aggressive), any personality disorder, or two or more personality disorders as measured by the Structured Clinical Interview II for the DSM-IV (SCID II). In addition, individuals in the AN group had significantly greater rates of Asperger syndrome, any autistic-like condition, and empathy disorder than the comparison group.
At 10 years,349, 352, 361, 362 the AN group continued to be significantly more likely than the comparison group to currently have a personality disorder, Asperger syndrome disorder or autism spectrum disorder, and lifetime and current obsessive-compulsive disorder. The AN group was not more likely, however, to have an anxiety disorder, excluding obsessive-compulsive disorder.
Ivarsson et al. examined depressive disorders in the AN and comparison groups in these cohorts at both 5- and 10-year followup.360 The AN group had a higher lifetime prevalence of depression. Being in the AN group was the only significant predictor of depressive disorder at 5-year followup (odds ratio [OR], 7.7; 95% CI, 1.15–19.6). At 10 years, being in the AN group (OR, 4.03; 95% CI, 1.15–14.19) and having a depressive disorder at 5 years were significant predictors of current depressive disorder. The absence of a mood disorder was significantly associated with resolution of the eating disorder.
Case series studies with comparison groups. Two studies followed individuals with AN who had received treatment and a comparison group. Both found higher rates of lifetime major depression and OCD among the AN group.7, 342, 350 The study in Christchurch, New Zealand, which followed women for 12 years, found, after controlling for age, significant differences in the lifetime prevalence of several psychological disorders including major depression, mood disorders, obsessive-compulsive disorder, anxiety disorders, and drug dependence.342, 350 The study conducted by Halmi and colleagues also identified that significant differences in the rates of diagnosis of major depression and OCD continued to be true at 10-year followup in their AN case series.7
Case series studies with no comparison groups. Descriptively, Eddy et al. found that a history of drug abuse differed among AN subgroups; it was more likely among the binge/purge subtype (16 percent).177 Correspondingly, among patients who all had adolescent onset of AN, Strober et al., using stepwise regression, found that binge eating at treatment intake was the only significant predictor of the onset of a substance use disorder. Other variables included in the model, such as depression, anxiety, and weight, were not significant predictors.358
Also using stepwise regression, Dancyger et al. measured factors related to Minnesota Multiphasic Personality Inventory (MMPI) scores at 10-year followup on a population of women who had received inpatient treatment and were not limited to those with adolescent onset.353 Poorer overall outcomes were related to higher scores on three MMPI subscales: hypochondriasis, paranoia, and psychopathic deviate.
| Authors, Year | Country | |
|---|---|---|
| (Quality Score) | Sample Size | Outcomes |
| Prospective Cohorts, Comparison Groups | ||
| Gillberg et al., 1994344 | Sweden | Years followed (mean): 5 |
| (Good) | ||
| Gillberg et al., 1994345 | Cases: 51 | Near average body weight at FU*: Cases: 53%; Comparisons: 96% |
| (Good) | Comparisons: 51 | Extremely underweight:* Cases: 8%; Comparisons: 0% |
| Regular or cyclical menstruation*: Cases: 50%; Comparisons: 90% | ||
| Dysdiadochokinesis*: Cases: 20%; Comparisons: 2% | ||
| Råstam et al., 2003349 | Sweden | Years followed (mean): 10 |
| (Good) | ||
| Wentz et al., 2000361 | Cases: 51 | Mean weight: Cases: 62.3 kg; Comparisons: 63.7 kg |
| (Good) | Comparisons: 51 | Regular or cyclical menstruation*: Cases: 65%; Comparisons: 85% |
| Wentz et al., 2001352 | Dysdiadochokinesis*: Cases: 22%; Comparisons: 4% | |
| (Good) | ||
| Case Series, Comparison Group | ||
| Bulik, et al. 2000342 | New Zealand | Years followed (mean): 12 |
| (Good) | ||
| Sullivan et al., 1998342 | Cases: 70 | BMI*: Cases: 20.1 kg/m2; Comparisons: 25.6 kg/m2 |
| (Good) | Comparisons: 98 | |
| Case Series, No Comparison Group | ||
| Eckert et al., 1995338 | USA | Years followed (range): 8.5–10.5 |
| (Fair) | Cases: 76 | ABW at FU: <85%: 23%; 85%–115%: 73%; >115%: 3% |
| Regular menses: 48% | ||
| Löwe et al., 2001348 | Germany | Years followed (mean): 21 |
| (Fair) | Cases: 63 | BMI by Psychiatric Status Rating Scale outcomes*: |
| Good: 21.6; Intermediate: 19.7; Poor: 15.3 | ||
Difference between groups (P < 0.05).
ABW, percentage of average body weight; BMI: body mass index; diff, different; FU, Followup; IBW, ideal body weight; kg, kilograms; sig, significant or significantly; USA, United States of America.
Prospective cohort studies with comparison groups. At 5 years, the study of the Göteborg, Sweden, cohort found that the AN group still weighed significantly less than the non-ED comparison group; more of the AN group was appreciably underweight than the comparison, and while only half of the AN group were near average body weight, nearly all of the comparison group were at that weight.344, 345 Regular or cyclical menstruation was significantly less likely in the AN group, and a large percentage of the AN group had dysdiadochokinesis (an inability to execute rapidly alternating movements).
At 10 years, various measures of weight, including direct measures in kilograms, ABW, and mean BMI (body mass index), did not differ significantly between groups.349, 352, 361 However, a significantly larger percentage of the AN group still did not have normal menstrual function and continued to demonstrate dysdiadochokinesis.
Case series studies with comparison groups. The AN cohort in the Christchurch, New Zealand, study had significantly lower BMI than comparison participants when controlling for age and current AN status.344, 345 Desired BMI was also lower in the chronically ill AN group than in recovered individuals or the comparison group.
Case series studies with no comparison groups. Hebebrand et al. examined factors associated with BMI at 0 to 33.6 years followup.354 A BMI of less than 17.5 at followup (criterion cutoff for AN diagnosis) was related to lower BMI at referral, older age at referral, and younger age at followup; by contrast, age at disease onset was not a significant predictor. A higher BMI was also found to be significantly related to a better Psychiatric Status Rating Scale outcome at followup.348
Eckert et al. followed patients who had received inpatient treatment 10 years previously.338 Lower weight was associated with greater food faddishness, laxative abuse, body image disturbance, fear of getting fat, disturbance in sexual adjustment, worse psychological adjustment, disturbed menses, and other weight loss behavior.
| Authors, Year | Country | |
|---|---|---|
| Quality Score | Sample Size | Outcomes |
| Case Series*, No Comparison Groups | ||
| Birmingham et al., 20053 | Canada | Years followed (mean): 7 |
| (Fair) | Cases: 326 | Deaths: N=17 (Suicide: N=7, Pneumonia: N=2, Hypoglycemia: N=2, Liver disease: N=2, Cancer: N=2, Alcohol poisoning: N=1, Subdural hemorrhage: N=1) |
| SMR: 10.5 | ||
| Crisp et al., 1992357 | England and Scotland | Years followed (mean): 22 |
| (Fair) | Cases:168 | England: Deaths: N=4 (Anorexia: N=2; Suicide: N=1; Cancer: N=1) (SMR: 1.36 times more likely than women of same age, 1973-1989) |
| Scotland: Deaths N=8 (Anorexia: N=3; Suicide: N=4; Cancer N=1) (SMR: 4.71 times more likely than women of same age, 1973-1979) | ||
| Deter et al., 1994343 | Germany | Years followed, mean (range): 11.8 (9–19) |
| (Fair) and Herzog, Schellberg et al., 1997 | Cases: 75 at FU | Deaths: N=9 (AN complications: N=7; Suicide: N=2) |
| (Fair) | ||
| Eckert et al., 1995338 | USA | Years followed, mean (range): 9.6 (8.5–10.5) |
| (Fair) | Cases: 76 | Deaths: N=5 (all complications of AN; no suicides); SMR: 12.8 |
| Eddy et al., 2002177 | USA | Years followed, median (range): 8 (8–12) |
| (Fair) | Cases: 136 | Deaths by subtype: Restricting pure: 8%; Restricting not pure: 8%, Binge/purge: 6% |
| History of suicidality by subtype: Restricting pure: 4%; Restricting not pure: 29%; Binge/purge: 27% | ||
| Fichter et al., 1999339 | Germany | Years followed (mean): 6.2 |
| (Good) | Cases: 95 | Deaths: N=6 (Traffic accident during exercise: N=1; Cardiac and renal failure: N=2; Hypocalcemia: N=2; Cardiac failure and cachexia: N=1) |
| Franko et al., 2004368 | USA | Years followed (mean): 8.6 |
| (Good) | Cases: 136 | Suicide attempts during study period: 22% |
| Hebebrand et al., 1997354 | Germany | Years followed, mean (range): 9.5, 0–33.6 |
| (Fair) | Cases: 272 | Deaths: N=12 (Emaciation: N=10, Suicide: N=2) |
| Mortality rate by patient weight at referral: | ||
| < 13 kg/m2: 11%, ≥ 13 kg/m2: 0.6%; BMI < 13 at referral associated with higher likelihood of mortality | ||
| Herzog et al., 2000371 | USA | Years followed: 11 |
| (Fair) | Cases: 110 | Deaths: N=7 (Suicide: N=3; Acute alcohol intoxication: N=1; Cardiorespiratory failure, heptic failure, and cirrhosis: N=1; Cardiac arrhythmia and seizure disorder: N=1; Fungal pneumonia: N=1) |
| SMR (all deaths): 9.6; SMR (suicide): 58.1 | ||
| Isager et al., 1985340 | Denmark | Years followed, mean (range): 12.5 (4–22) |
| (Fair) | Cases: 142 | Deaths N=9 (Suicide: N=6, Malnutrition: N= 2, Unknown: N=1) |
| Keel et al., 2003372 | USA | Years followed (mean): 8.6 |
| (Fair) | Cases: 136 | Deaths: N=11; SMR: 11.6 |
| Suicide: N=4; Suicide SMR: 56.9 | ||
| Lee et al., 2003347 | Hong Kong | Years followed (mean): 9 |
| (Fair) | Cases: 80 | Deaths: N=3 (Suicide: N=2, Emaciation: N=1); SMR: 10.5 |
| Löwe et al., 2001348 | Germany | Years followed (mean): 21.3 |
| (Fair) | Cases: 63 at FU | Deaths: N=14 (12 directly due to AN) |
| Møller-Madsen et al., 1996364 | Denmark | Years followed, mean (range): 7.8 (< 1–17) |
| (Fair) | Cases: 853 | Deaths: N=50 (AN complications: N=13, Natural causes: N=11, Suicide: N=18, Accidents: N=2, Unknown causes or could not be determined: N=4) |
| SMR: Females: 9.2; SMR: Males: 8.2 | ||
| Females only < 1 year following treatment admission, SMR=30.5 | ||
| Patton, 1988373 | United Kingdom | Years followed (mean): 7.6 |
| (Fair) | Deaths: N = 11 (Suicide: N = 6; low weight: N = 5) | |
| Cases: 332 | Overall SMR: 6.01; Higher than expected | |
| SMR at 4-year FU: 5.76, Higher than expected | ||
| SMR at 8-year FU: 2.70, Normal level | ||
| Sullivan et al., 1998350 | New Zealand | Years followed: 12 |
| (Good) | Cases: 70 | Deaths: N = 1 (suidice) |
| Tanaka et al., 2001351 | Japan | Years followed, mean (range): 8.3 (4.0–17.7) |
| (Fair) | Cases: 61 at FU | Deaths: N=7 (Emaciation: N=3; Suicide: N=2; Murder: N=1; Burn: N=1) |
AN, anorexia nervosa; FU, Followup; N, number; sig, significant; SMR, standardized mortality ratio; Tx, treatment; USA, United States of America.
*In case series studies, sample size is as of the date of the analysis and therefore does not include deceased cases.
Prospective cohort studies with comparison groups. No deaths were reported in the Göteborg, Sweden, study through the 10-year followup.
Case series with no comparison groups. All mortality data were obtained from case series studies without a comparison group. Several studies calculated standardized mortality ratios (SMR), allowing for comparison to the population based on age, sex, and time when the patient population was drawn.
The SMRs were elevated in the AN groups and ranged from 9 to 13 across studies.3, 338, 347, 364, 371, 372 In one study, SMRs were significantly elevated in a female patient population through 14 years of followup (ranging from 30.5 at less than 1 year followup to approximately 6 for the remainder of the period). The SMR was no longer significantly elevated after 14 years.364
Only in two studies conducted in the United Kingdom were the SMRs lower. Crisp et al. examined mortality among females more than 20 years after they had received treatment for AN in either London, England (1968 to1973), or Aberdeen, Scotland (1965 to 1973).357 In England, women with AN were 1.36 times more likely to die than women of the same age in England and Wales between 1973 and 1979. In Scotland, women with AN were 4.71 times more likely to die than women of the same age in Scotland during the same period.
Patton and colleagues conducted a record review of 332 AN patients, mostly female (96 percent), who had received treatment at Royal Free Hospital in the United Kingdom between 1971 and 1981.373 The SMR at 4-year followup was 5.76, which was a significant elevation; at 8-year followup, the SMR was 2.7 (not significant). Predictors of mortality included weight less than 35 kilograms at presentation and more than one inpatient admission.
In one study that followed patients for 8.6 years, significant predictors of death (controlling for age and duration of illness before intake) included greater severity of alcohol use disorders, greater severity of substance use disorders, worse social adjustment, and worse global assessment of functioning (GAF) scores. Predictors of shorter time to death included longer duration of illness at treatment intake, affective disorder hospitalization at intake, suicidality associated with mental illness other than an ED, substance abuse, and worse severity of alcohol use over the course of the illness.372 Descriptively, Isager et al. found that deceased patients were significantly more likely to have been hospitalized.340
Suicide was a common cause of death. Among the group of females with adolescent AN onset who received ED treatment at the Massachusetts General Hospital or other Boston area clinics the SMR was 58.1, significantly higher than that for the population as a whole.371
Franko et al. reported predictors of suicide attempts among the women in the Boston cohort.368 Thirty percent of their patients had a history of suicide attempts before they entered the study; during the study, 22 percent of AN patients attempted suicide. A history of a suicide attempt at intake significantly predicted time to a future attempt in individuals with AN. Using multiple regression techniques, the authors determined that a first suicide attempt was predicted by a history of suicide attempts at intake, greater drug use, participation in individual therapy, use of neuroleptic medications, and older age at disease onset.
A history of suicidality was significantly different among patient subtypes in one study - lower in the pure restricting group than other groups.177 However, the groups did not differ in rates of death at 8-year (median) followup.
Summary of studies addressing KQ 5. One prospective cohort study following individuals who had AN and a healthy comparison group has been conducted. Limited to individuals with adolescent onset of their illness and comparisons in Göteborg, Sweden, this study found that, over a 10-year period, approximately one-half of the group had fully recovered; a small percentage continued to suffer from AN, and the remainder still had other eating disorders. The AN group no longer differed from the comparison group in terms of weight but these individuals continued to be more depressed than comparisons and to suffer from a variety of personality and obsessive-compulsive disorders, Asperger syndrome, and autism spectrum disorders.
Two case series studies, which gathered followup measures from individuals who had received treatment for AN and a nondisease comparison group, were reviewed. They concluded that individuals with AN continued to be more likely to have eating and comorbid psychiatric diagnoses years after treatment. In one study, lower desired body weight and lower desired and actual BMI continued in the AN group, after controlling for current AN status. Individuals in the AN group were also more likely to be depressed and to suffer from mood and anxiety disorders. The second study, limited to psychiatric outcomes, found continued higher rates of major depression and obsessive-compulsive disorder.
The remaining studies had no comparison groups. Rates of recovery and good outcomes varied across studies. Only a relatively small percentage of patients continued to be diagnosed with AN or BN at long-term followup, but many continued to have eating disorders, and relapse rates were high. We did not find evidence that age of disease onset was related to disease chronicity. A relatively large percentage of patients cross over from the restricting subtype to the binge/purge subtype of the disease, but results are mixed concerning which subtype has better eating outcomes.
Few studies examined psychiatric and psychological comorbidities independently of their relationship to eating disorder outcomes. Among those that did and had a comparison group, individuals with AN had a higher probability of having a depression and anxiety disorders diagnosis (including obsessive-compulsive disorder) than comparison individuals. Based on the results of one cohort study, individuals with AN may also be more likely to have Asperger syndrome or autism spectrum disorder. Among individuals with AN, substance abuse may be associated with binge eating.
Through at least 5 years of followup, individuals with AN are more likely to weigh less than comparisons and evidence suggests that their desired weight is lower. We did not find similar predictors of continued low weight in the AN case series studies and so are unable to draw conclusions concerning these relationships. However, some evidence exists that lower weight at treatment presentation is related to poorer outcomes.
The mortality risk is significantly greater among those diagnosed with AN than in the population as a whole. The risk of suicide is particularly pronounced, as is the risk of death early in the followup period. Increased risk is associated with alcohol and substance use disorders.
We examined whether AN outcomes differed by participants' sex, gender, age, race, ethnicity, or cultural groups. We found insufficient evidence to evaluate differences by sex or gender. Males were included in only 19 of 38 reviewed studies and were never more than 10 percent of the analysis sample in any one study. No study included any analyses examining differences controlling for sex or gender.
No study that we reviewed provided outcomes based on the age of the participant at followup. Some studies limited participants to those whose AN onset was during adolescence, but none compared outcomes of those with adolescent onset to those with older onset. However, six studies did include a measure of age at disease onset. Whether this is a significant factor in the course of AN is of particular interest in the field.
Results were mixed. Descriptively, Tanaka et al. found that a good M-R rating was related to younger age at referral;351 Deter and Herzog found that earlier onset of disease was a significant predictor of AN symptoms at 12-year followup.343 Suicide attempts were more likely among those whose disease began at an older age.368 In contrast, Strober et al. did not find age at onset to be a significant factor in predicting chronic AN (intermediate or poor outcomes) at 10- to 15-year followup.341 It was also not a predictor of time to recovery after 4 years in the Heidelberg case series.370 Lastly, although Hebebrand et al. found age at onset not to be significantly related to lower BMI at followup,354 they reported that older age at referral and younger age at followup predicted worse outcome.
Only two studies, both from the United States, reported the race or ethnicity of participants. Nonwhite subjects constituted 4 percent of the Boston, Massachusetts, case series368 and 7 percent of the case series from the University of California at Los Angeles.341, 358
Our discussion of BN outcomes includes 14 articles exclusively discussing individuals with BN70, 333, 375–385 and seven articles discussing individuals with both AN and BN.367–373 As above for AN, we first discuss results for KQ 5, then results for KQ 6.
| Authors, Year | Country | |
|---|---|---|
| (Quality Score) | Sample Size | Outcomes |
| Case Series, Comparison Groups | ||
| Fichter and Quadflieg, 2004378 | Germany | Years followed: 12 |
| (Fair) | Cases: 163 | Case diagnosis at 6 year FU: Recovered/no ED: 67%; AN: 4%; BN purge: 21%; BN nonpurging: 1%; BED: 1%; EDNOS: 1%;Deceased: 1% |
| Comparisons: 202 | Case diagnosis at 12 year FU: Recovered/no ED: 66%; AN: 2%; BN purge: 10%; BN nonpurging: 1%; BED: 2%; EDNOS: 14%; Deceased: 3% | |
| Case Series, No Comparison Groups | ||
| Ben-Tovim et al., 2001367 | Australia | Years followed: 5 |
| (Good) | Cases: 86 | Diagnosis at FU: AN: 1%; BN: 8%; EDNOS: 13%; No ED:74%; Unknown: 5%; Deceased: 0 |
| M-R-H Outcomes: Good: 76%; Intermediate: 19%; Poor: 2%; Unknown: 2% | ||
| Fairburn et al., 2000375 | United Kingdom | Years followed: 5 |
| (Good) | Cases: 92 | Diagnosis at FU: BN: 15%; BED: 7%; AN: 1%; EDNOS: 32% |
| Fairburn et al., 2003377 | Any DSM-IV ED: 49%; Remission: 35%; Relapse: 26% | |
| (Good) | ||
| Stice and Fairburn, 2003386 | ||
| (Fair) | ||
| Fichter and Quadflieg, 199770 | Germany | Years followed (mean): 6.2 |
| (Fair) | Cases: 185 | Diagnosis at 2 years FU: AN: 2%; BN: 36%; EDNOS: 8%; No ED: 55% |
| Diagnosis at 6 years FU: AN: 4%; BN: 21%; BED: 1%; EDNOS: 2%; No ED: 71% | ||
| Herzog et al., 1993380 | USA | Years followed: 1 |
| (Good) | Cases: 96 | First shift to subclinical BN diagnosis (loss of full criteria without considering duration): 86% |
| Partial recovery: 71%; Full recovery: 56% | ||
| Herzog et al., 1996370 | USA | Years followed: 4 |
| (Good) | Cases: 150 | Partial recovery: 88%; Full recovery: 57% |
| Herzog et al., 1999369 | USA | Years followed (Median): 7.5 |
| (Good) | Cases at baseline: 110 | Full recovery: 74%; Partial recovery: 98%; Relapse after full recovery: 35% |
| Jäger et al., 2004381 | Germany | Years followed: 8 |
| (Fair) | Cases: 80 | Diagnosis at FU: BN: 29%; EDNOS (bulimic): 9%; EDNOS (anorexic): 1%; No ED diagnosis: 61% |
| No binges per week at FU: 63% | ||
| Keel et al., 1999384 | USA | Years followed (mean): 11.5 |
| (Fair) | ||
| Keel, Mitchell, Davis et al., 2000383 | Cases: 173 | Diagnosis at FU: BN: 11%; AN:1%; BED: 1%; EDNOS: 19%; lifetime history of AN: 36%; lifetime history of BED: 11% |
| (Fair) | ||
| Keel, Mitchell, Miller et al., 2000385 | Narrow definition of remission: Full: 42%, Partial: 28% | |
| (Fair) | Broad definition of remission: Full: 47%, Partial: 23% | |
AN, anorexia nervosa; BED, binge eating disorder; BN, bulimia nervosa; DSM-IV, Diagnostic and Statistical Manual for Mental Disorders, Fourth Edition; ED, eating disorder; EDNOS, eating disorder not otherwise specified; FU, followup; M-R-H Scale, Morgan-Russell-Hayward Scale; USA, United States of America.
Case series studies with comparison groups. Female patients who had received inpatient treatment (N = 163), in Germany were followed for 12 years.378 The comparison group (N = 202) included females ages 18 to 30 who had never received treatment for an eating disorder. The Structured Inventory for Anorexic and Bulimic Syndromes, Expert-Rating version (SIAB-EX) was used to compare eating disorder symptoms between cases and comparisons at 12 year followup. The BN group as a whole was significantly more symptomatic than the comparison group, as were individuals with BN who were considered to be recovered.
Lifetime psychiatric comorbidity predicted a significantly higher probability of having any eating disorder at 2- and 6-year followup. This variable was no longer significant at 12 years. In contrast, after 12 years, greater lifetime psychiatric comorbidity significantly predicted a higher probability of having a global eating disorder outcome as measured by the Psychiatric Status Rating Scale (PSR) (OR, 3.71; 95% CI, 1.16–11.91). A lifetime history of AN and older age at disease onset also predicted a worse PSR at 12 years.378
Case series studies with no comparison groups. Fairburn and colleagues conducted 5- and 6-year followup assessments of females recruited for two psychotherapy trials in the United Kingdom.375–377, 386 The investigators recruited 102 patients with BN through general practitioners and psychiatrists with no limitations on age at disease onset.
After 5 years, by a variety of measures, the group had improved since baseline and had experienced a significant reduction, in the previous 3 months, in mean objective bulimic episodes, self-induced vomiting episodes, and laxative misuse.375 Eating Disorder Examination (EDE) interview measures that significantly improved included those measuring restraint, shape concern, weight concern, and eating concerns.
Fairburn et al. examined whether outcomes differed between persistent disease (at least two episodes of behavior at one or both of last two assessments) and remitted disease (not engaged in any relevant behavior over past 3 months); they focused solely on binge eating or compensatory behaviors.377 The persistence of binge eating behavior was related to baseline duration of disturbed eating, overvaluation of shape and weight, and worse social adjustment. None of the tested baseline factors predicted compensatory behavior. However, binge eating and compensatory behaviors were significant predictors of each other.
At 6-year followup, using multivariate analysis, Fairburn, Norman et al. determined that significant predictors of current AN or BN status (adjusted for the type of treatment received and the duration of followup) included paternal obesity (OR, 5.73; 95% CI, 1.56–21.1) and premorbid obesity (OR, 4.31; 95% CI, 1.35–13.7).376
Stice and Fairburn categorized their BN patients into dietary and dietary-depressive subtypes using cluster analysis.386 Compared with persons in the dietary subtype, those in the dietary-depressive subtype were significantly more likely to have lifetime psychiatric treatment for eating disorders at baseline and during followup, greater persistence of binge eating and compensatory behaviors, and diagnoses of major depression, panic disorder, obsessive-compulsive disorder, social phobia, generalized anxiety disorder, and agoraphobia.
D. Herzog and colleagues examined eating-related outcomes for a group of female patients who sought treatment at Massachusetts General Hospital and other Boston area ED programs.369, 370, 380 The authors examined levels and predictors of full and partial recovery at 1, 4, and 7 years. Full recovery was defined as 8 consecutive weeks of being asymptomatic; partial recovery was defined as not meeting full criteria for AN or BN but still experiencing significant symptomatology.
The percentage of the group that fully recovered increased over time. At 1 year, 56 percent were fully recovered;380 at 4 years, 57 percent were fully recovered;370 and at 7 years, 73 percent had achieved a full recovery at some point during followup.369 The trend was similar for partial recovery at some point during followup: 1 year, 71 percent;380 4 years, 91 percent;370 and 7 years, 98 percent.369 Recovery was not, however, necessarily persistent even if it covers 8 consecutive weeks. By 7 years, 35 percent had relapsed after achieving a full recovery.
The authors investigated predictors of recovery at each followup. At 1 year, ideal body weight (IBW) was not a significant predictor of time to partial recovery.380 Variables included in their models at both 4- and 7- year followup included duration of the current disorder episode, age at onset of the current eating disorder, age at onset of the first eating disorder, weight, binge and purge frequency, and the co-occurrence of various other disorders including those involving a lack of impulse control, depression, personality and any Axis I disorder. At both points, no significant predictors of recovery emerged from among these variables.369, 370
Ben-Tovim et al. analyzed results from 86 female BN patients who had been treated by an eating disorder specialist in Adelaide, South Australia, and followed for 5 years.367 Not all had inpatient stays and age at onset was not reported. Using multivariate analyses, they reported that total M-R-H scale outcomes were significantly related to subscales for dietary and eating patterns, body concern, and body weight rather than other subscales concerning menstrual pattern, mental state, psychosexual state or work and family relations. In a second multivariate model, M-R-H total scores were predicted by overall behavior and social functioning at baseline, feeling fat at study recruitment, attractiveness at 6 months, and change in depression over the first 6 months.
Jäger et al. compared outcomes of female patients who had received analytic inpatient and systemic outpatient treatment at a hospital in Germany.381 Over time, binges, bulimia severity, the number of episodes of food restriction, and EAT measures of bulimia and dieting significantly decreased in both treatment groups; in addition, the number of normal meals increased. The group receiving analytic inpatient treatment had a greater decline in the severity index and the number of restrictions than the group receiving systemic outpatient therapy.
Keel and colleagues examined eating-related outcomes for 173 females with a mean of 11.5 years following evaluation at the University of Minnesota's Eating Disorders Clinic.383–385 Members of the group had participated in one of two previous treatment studies. A particular interest in this study was comparing results based on different definitions of remission. Defining remission as freedom from disordered eating for at least 6 months and the absence of undue influence of shape and weight on self-evaluation, the authors reported that 42 percent were in full remission and 28 percent in partial remission. Using a broader definition of remission, including absence of disordered eating for at least 8 weeks with no restrictions based on the influence of weight and shape, they reported 47 percent were in full remission and 23 percent were in partial remission.384
The authors compared the relation between prognostic factors and two specifications of the outcome measure: categorical (full or partial remission vs. not in remission) and continuous (log of the number of months since last binge/purge episode).384, 385 The two models showed little difference in results. Significant factors in relation to both outcome specifications included lifetime substance use, baseline substance use, current mood, substance use, and impulse control disorders, and results on a multidimensional personality questionnaire. Prognostic factors that were not statistically significant in relation to either outcome specification included age at onset, duration of symptoms at baseline, baseline depression or anxiety disorder, and lifetime mood or anxiety disorder.
Keel et al. compared the association among six definitions of BN outcomes and a variety of other outcome measures and prognostic variables.383 Definitions of BN outcomes varied based on the duration of abstinence required for full remission or recovery, the number of categories in which outcomes were placed, and how the categories were combined. Full recovery ranged from 47 percent to 38 percent based on the required duration of abstinence in the specification. Other outcomes that were significantly related to the eating disorder outcome in all specifications included depression, body image disturbance, impulse control, and social adjustment. The analysis did not identify any prognostic factors that were statistically significant in relation to all six eating disorder specifications. However, substance abuse was significant in four of six specifications, age of presentation in three specifications, and age of onset in two.
Including 101 of the females from the University of Minnesota study discussed above, Keel et al. also examined the independence and relative strength of depression compared with bulimic symptoms in predicting body dissatisfaction at followup.382 Baseline depression was both independent of and superior to bulimic symptoms in predicting body dissatisfaction at followup, demonstrating a direct association between depression and body dissatisfaction that is independent of bulimic symptoms.
| Authors, Year | Country | |
|---|---|---|
| (Quality Score) | Sample Size | Outcomes |
| Case Series, Comparison Groups | ||
| Fichter and Quadflieg, 2004378 | Germany | Years followed: 12 |
| (Fair) | Cases: 163 at 12 year followup | Psychiatric comorbidity at followup: |
| Lifetime 79.7%; current: 41.1% | ||
| Mood disorders: Lifetime: 69.0%; current: 16.5% | ||
| Comparisons: 202 | Major depression: Lifetime: 58.2%; current: 10.8% | |
| Anxiety: Lifetime: 36.1%; current: 22.2% | ||
| Substance use: Lifetime 36.1%; current: 14.6% | ||
| Borderline personality disorder: 9.5% | ||
| Case Series, No Comparison Groups | ||
| Fichter and Quadflieg, 1997 | Germany | Years followed (mean): 6 |
| (Fair) | Cases: 185 | Psychiatric comorbidity at 2-year followup: |
| Borderline personality disorder: 5%; Substance abuse: 24%; Mood disorders: 30%; Anxiety disorders: 13% | ||
| Psychiatric comorbidity at 6-year followup: | ||
| Borderline personality disorder: 4%; Substance abuse: 21%; Mood disorders: 46%; Anxiety disorders: 32% | ||
| Stice and Fairburn, 2003 | United Kingdom | Years followed: 5 |
| (Fair) | Cases: 82 | Psychiatric comorbidity at followup:* |
| Major depression: Dietary: 61%; Dietary-depressive: 81% | ||
| Panic disorder: Dietary: 15%; Dietary-depressive: 33% | ||
| Obsessive-compulsive disorder: Dietary: 2%; Dietary-depressive: 25% | ||
| Generalized anxiety disorder: Dietary: 11%; Dietary-depressive: 47% | ||
| Agoraphobia: Dietary: 4%; Dietary-depressive: 36% | ||
Difference between groups (P < 0.05).
Prospective cohort studies with comparison groups. The Fichter and Quadflieg study that followed females with BN and a healthy comparison group recorded psychiatric comorbidities in the BN group only.70, 378 In the first 6 years after treatment, general psychopathology, as measured by the Symptom Checklist 90-Revised (SCL-90), found that symptoms were worse at 2-year followup but better at 6-year followup compared to the end of treatment.70 At 12 years, 80 percent of patients had a lifetime psychiatric disorder, and 41 percent had a psychiatric disorder in the month before assessment. Half of the patients had suffered from a lifetime mood disorder or major depression and 36 percent had suffered from an anxiety or substance use disorder.378
Case series studies with no comparison groups. The Jäger et al. study that reported 8-year outcomes following either analytic inpatient or systemic outpatient treatment found that depression had declined in both groups381 but that the decline was greater in those who received inpatient treatment.
| Authors, Year | Country | |
|---|---|---|
| (Quality Score) | Sample Size | Outcomes |
| Case Series, No Comparison Groups | ||
| Fairburn et al., 2000375 | England | Years followed: 5 |
| (Good) | Cases: 92 | Change over time: |
| Weight: 69.8 kg, BMI: 25.5 | ||
| Fichter and Quadflieg, 1997 | Germany | Years followed (mean): 6 |
| (Fair) | Cases: 185 | Weight at followup: Good (19<BMI<30): 74%; Intermediate (BMI 30–40 or 17.5–19): 17%; Poor (BMI<17.5 or >40): 9% |
| Gendall, Bulik et al., 2000379 | New Zealand | Years followed: 1 |
| (Good) | Cases: 82 | Irregular menses: 30.5% |
| Keel et al., 1999384 | USA | Years followed (mean): 11.5 |
| (Fair) | Cases: 173 | BMI: 22.1, Weight: 60.7 kg |
BMI, Body mass index, measured in kg/m; USA, kg, kilograms; United States of America.
Case series studies with no comparison groups: Gendall et al. followed 82 females for 1 year who had participated in outpatient treatment trials in New Zealand.379 At followup, approximately 31 percent of the female participants had irregular menses. In multivariate analyses, irregular menses (irregular or absent menstrual cycles within the past 3 months) were significantly related to a greater maximum-minimum weight difference and current smoking.
Several studies reported improvements over time in weight measures. After 5 years, Fairburn and colleagues found that participants' mean weight and BMI had increased.375 At 6-year followup, Fichter and Quadflieg found that 74 percent of their participants were in the good weight range.70 Similarly, Keel et al. measured differences in weight variables in 173 females followed for approximately 11 years.384 BMI, actual weight, desired weight, and highest weight all significantly increased over time.
| Authors, Year | Country | |
|---|---|---|
| Quality Score | Sample Size | Outcomes |
| Case Series Studies, Comparison Groups* | ||
| Fichter and Quadflieg, 2004378 | Germany | Years followed: 12 |
| (Fair) | Cases: 163 at 12 year followup | BN Cases Deaths: |
| 2 year followup: 0 | ||
| 6 year followup: 2 | ||
| Comparisons: 202 | 12 year followup: 4, SMR: 2.36 | |
| Franko et al., 2004368 | USA | Years followed: 8.6 |
| (Good) | Cases: 110 | Suicide attempts: 11% |
| Predictors of time to first suicide attempt (adjusted): | ||
| Group therapy; Younger age at onset; History of drug use disorder; Individual therapy; Paranoid personality disorder; Greater severity of laxative use | ||
| Herzog, et al., 2000371 | USA | Years followed: 11 |
| (Fair) | Cases: 110 | Loss to followup deaths: 0 |
| Keel et al., 2003372 | USA | Years followed (Median): 9 |
| (Fair) | Cases: 110 | Deaths: 1, SMR: 1.3 |
| Patton et al. 1988373 | USA | Years followed: 4–15 |
| (Fair) | Cases: 96 | Deaths: N=3 (2 car accidents, 1 low weight) |
| Crude mortality rate: 3.3, SMR: 9.38 | ||
BN, bulimia nervosa; SMR, standardized mortality ratio; USA, United States of America.
In case series studies, sample size is as of the date of the analysis and therefore does not include deceased cases.
Case series studies with comparison groups. In the Fichter and Quadflieg study, 2.5 percent of the BN group were deceased at 12-year followup.378 The SMR was 2.36, not significantly different from the rate expected in the population matched by age and sex.
Case series studies with no comparison groups. Franko et al. reported predictors of suicide attempts in a group of 110 women with BN who had been recruited because they sought treatment for eating disorders at Massachusetts General Hospital and other Boston area clinics.368 At baseline, 23 percent reported a history of suicide attempts before assessment, and 11 percent reported suicide attempts during the study. After approximately 9 years of followup, significant predictors of shorter time to first suicide attempt included receiving group therapy, receiving individual therapy, younger age at onset, a history of drug use disorder, paranoid personality disorder at intake, and greater severity of laxative use.
In a companion study, D. Herzog et al. followed this same group of women in Boston for 11 years to examine rates and causes of death.371 At the end of that time, none of the women were deceased.
Keel et al. measured the mortality rates among 110 females, also recruited in Boston, in the same manner as Herzog et al., but the parameters of the recruitment dates differed somewhat. Participants were followed for a median of 9 years.372 One individual died during the followup period. The SMR of 1.3 was not significantly different from what would be expected in the population as a whole.
Patton et al. measured mortality rates in patients in the United Kingdom who were followed for 4 to 15 years.373 Three patients died during the observation period, one from low weight. Again, the SMR was not statistically significant from what would be expected in the healthy population.
Summary of findings. All of the BN literature is case series, that is, studies that follow individuals over time who have received treatment. One study included a nondisease comparison group. Much of the emphasis in the BN literature concerned comparing various definitions of disease outcomes and diagnostic subtypes. Generally in these studies, more than half of the patients followed no longer had a BN diagnosis at the end of the study period. A substantial percentage continued to suffer from other eating disorders, but BN was not associated with an increased mortality risk. A limited number of analyses uncovered factors significantly associated with outcomes of this disease. Only depression was associated with worse outcomes consistently across studies.
In each of the BN outcomes studies except for Patton et al., all participants we reviewed were female.373 Four percent of the participants in the Patton et al. study were male; however, this study included both AN and BN populations, and the authors do not specify how many of the included males were in each disorder group.
Most studies did not report the race, ethnicity, or cultural group of the participants. Franko et al. reported that 4 percent of their sample was nonwhite, but they did not specify the distribution in the BN sample, relative to the AN sample.368 Johnson and colleagues reported that the modal race was white;333 Keel and colleagues reported that 1 percent of their sample was nonwhite.384 These investigators did not, however, report outcome differences by race, ethnicity, or cultural group. No outcome studies of BN controlled for the age of participants at entry; no studies were limited to individuals with adolescent onset of the disorder. We conclude that no evidence exists to determine whether outcomes for BN differ by any of these categories.
Given the recent addition of the provisional criteria for BED to the psychiatric nomenclature, three studies met our inclusion criteria for this section. All three studies were case series.387–389 One study included a comparison group.389 One study was conducted in the United States (rated as fair),388 one in Germany (rated as fair),387 and one in Italy (rated as fair).389
In KQ 5 we address outcomes of BED and factors associated with outcomes. We partitioned outcomes into eating-related outcomes, psychological outcomes, and biomarker outcomes (largely weight change).
Case series with comparison groups. The only case series with a comparison group explored a special population of individuals undergoing laparoscopic adjustable gastric banding.389 This is an important research question intended to determine whether individuals with BED who are obese are appropriate for bariatric surgery. In this large study of 130 BED patients versus 249 obese comparison individuals without BED, those with BED experienced more band adjustments and more pouch and esophageal dilatations than those without BED. The authors did not report on psychological outcomes. At 5 years, the groups did not differ on measures of either weight loss or weight regain. The authors did not report on any variations in disordered eating behavior that may have persisted after bariatric surgery.
Case series without comparison groups. Fichter et al.387 followed 62 cases with BED for 6 years; of these patients, 78 percent had no ED diagnosis, 6 percent continued to have a BED diagnosis, and a minority had developed BN or EDNOS over the followup interval. Over the 6-year interval, depression, anxiety, and obsessionality measures also improved. The authors did not report whether changes observed in BMI over time were significant. No additional factors associated with outcome were reported.
The second case series examined the impact of comorbid psychopathology and personality disorders on treatment outcome for BED.388 Individuals with cluster B personality disorders reported a greater number of binge days at 1-year followup. Neither binge frequency nor EDE global scores were related to other comorbid conditions. The authors did not report additional psychological or biomarker outcomes.
Summary of studies addressing KQ 5. Only sparse evidence addresses factors associated with BED outcomes. The three included studies have vastly different designs and research questions; more importantly, their findings do not converge.
KQ 6 addresses whether outcomes differ for BED by sex, gender, age, race, ethnicity, or cultural groups In all, 405 women and 134 men participated in outcome studies of BED. No study compared differential factors associated with outcome by sex or gender.
Only one study reported ethnicity:388 151 whites, five blacks, four Hispanics, and two Native Americans. This study did not report any differential outcomes by ethnicity.
All three studies were of adults. No outcome studies of BED in children have been performed. Nothing is known about differential outcome by age group.
This chapter discusses our findings about anorexia nervosa (AN), bulimia nervosa (BN), and binge eating disorder (BED), which derive from our systematic review of literature for six key questions (KQs). Four KQs dealt with evidence about treatment issues (Chapters 3, 4, and 5):
Efficacy of treatments or combination of treatments
Harms associated with the treatment or combination of treatments
Factors associated with the efficacy of treatment
Differences in efficacy of treatment by sex, gender, age, race, ethnicity, or cultural group.
Two other KQs covered the course and outcomes of these conditions (Chapter 6):
Factors associated with outcomes among individuals with these conditions
Differences in outcomes by sex, gender, age, race, ethnicity, or cultural group.
Our report focused on randomized controlled trials (RCTs) for AN, BN, and BED and on outcomes studies that included sample sizes of 50 or greater and included at least 1 year of follow-up. All studies were published since 1980.
In this chapter, we first review the quality of the literature and the strength of the evidence based on the outcomes of and treatment of eating disorders. The confidence that readers can have in our findings, conclusions, inferences, and research recommendations rests heavily on the quality of the research reviewed and the overall robustness of the evidence. We then discuss the major issues resolved (or not resolved) in treating and managing patients with these conditions, drawing as appropriate from the findings for all six questions. Following that section, we present our research recommendations. The chapter ends with a brief recapitulation of our conclusions.
In this section we review our main findings on treatments for AN, BN, and BED, with specific attention to medications only, behavioral or psychotherapy interventions only, combination approaches, and novel interventions. We also comment on issues relating to outcomes from the disorders, including mortality. Before presenting the findings, we document our approach to assessing the strength of these bodies of evidence. Interpreting our findings accurately requires appreciation of the considerable drawbacks to much of this literature.
As described in Chapter 2 and documented in both evidence and summary tables, we first applied rigorous selection criteria for articles and assessed the quality of each study. We then evaluated the strength of the bodies of evidence available to address each KQ for each disorder. The possible grades in our scheme are as follows:
Strong evidence. The evidence is from studies of strong design; results are both clinically important and consistent with minor exceptions at most; results are free from serious doubts about generalizability, bias, or flaws in research design. Studies with negative results have sufficiently large samples to have adequate statistical power.
Moderately strong evidence. The evidence is from studies of strong design, but some uncertainty remains because of inconsistencies or concern about generalizability, bias, research design flaws, or adequate sample size. Alternatively, the evidence is consistent but derives from studies of weaker design.
Weak evidence. The evidence is from a limited number of studies of weaker design. Studies with strong design either have not been done or are inconclusive.
No published literature (for those situations in which no study addressed the question).
| Interventions | KQ 1 | KQ 2 | KQ 3 | KQ 4 |
|---|---|---|---|---|
| Anorexia Nervosa | ||||
| Medication and Medication plus Behavioral Interventions | ||||
| Adults | III | III | III | IV |
| Adolescents | III | III | III | IV |
| Behavioral Interventions | ||||
| Adults | III | IV | III | IV |
| Adolescents | II | IV | III | IV |
| Bulimia Nervosa | ||||
| Medication and Medication plus Behavioral Interventions | ||||
| All ages | I | I | III | IV |
| Behavioral Interventions | ||||
| All ages | I | IV | II | IV |
| Self-help | ||||
| All ages | III | IV | III | IV |
| Other | ||||
| All ages | III | IV | III | IV |
| Binge Eating Disorder | ||||
| Medication and Medication plus Behavioral Interventions | ||||
| Adult | II | I | III | IV |
| Behavioral Interventions | ||||
| Adult | II | IV | III | IV |
| Self-help | ||||
| Adult | III | IV | III | IV |
| Other | ||||
| Adult | III | IV | III | IV |
Regarding harms of therapy (KQ 2), we gave strong ratings (I) to the literature on medication interventions for BN and BED. The evidence for harms of other interventions for all three disorders received ratings of either weak (III) or nonexistent (IV). Behavioral trials rarely reported harms associated with treatment.
KQ 3 dealt with factors associated with or influencing therapeutic outcome. With the exception of behavioral interventions for BN, which we rated moderate (II), we rated the literature for all three disorders as weak (III). Very few well-designed studies addressed those factors that lead to good or poor outcome in clinical trials.
Finally, KQ 4 addressed differences in treatment outcome by age, sex, gender, race, ethnicity, or cultural group. For all three disorders and all types of interventions, we rated the literature as nonexistent (IV). The treatment literature for eating disorders has virtually ignored all these factors.
| Eating Disorder | KQ 5 | KQ 6 |
|---|---|---|
| Anorexia nervosa | II | III |
| Bulimia nervosa | II | IV |
| Binge eating disorder | III | IV |
The AN outcomes literature includes one prospective cohort study (following individuals identified in the community) with a comparison group design and one case series study (following a treatment population) with a comparison group design. The remaining literature follows case series of patients without comparisons. Some studies use strong methodological designs that control for length of followup and the effect of independent predictors. However, results were not consistent across studies.
The BN outcomes literature included no prospective cohort studies but did include several studies with strong methodological designs, including one case series study with a comparison group. However, partially because the literature is inconsistent in the methodology used to measure outcome, few factors were found to be consistently related to outcomes and so uncertainty remains.
The BED literature included only three studies. Much of the data provided in these studies was descriptive and offered very limited information concerning factors related to outcomes.
We used the body of literature that met our inclusion criteria for answering KQ 5 to address KQ 6 concerning differences in outcomes by sex, gender, age, race, ethnicity, or cultural group. We graded the AN literature as weak (III) and the BN and BED literature as nonexistent (IV). The AN literature had limited evidence discussing the effect of age of onset on outcomes, but results were not conclusive. The AN literature yielded no evidence to evaluate differences in outcomes by any other KQ 6 criteria. No study addressed any of these concerns for BN and BED.
Our review supports and extends previous systematic reviews on treatment of eating disorders, including several Cochrane reports. Broadly, Cochrane reviews of AN treatment concur that the literature is weak, made no specific recommendations regarding AN treatment, and encouraged larger well-designed trials.390 For psychotherapy for BN and binge eating, a Cochrane review supported cognitive behavioral therapy (CBT) for BN, in individual or group format, and encouraged further study of self-help.391 For antidepressant treatment, Cochrane reviewers concluded that single antidepressant agents were clinically effective for BN in comparison to placebo, with greater remission rate but also greater dropouts. No differential effect regarding efficacy and tolerability among the various classes of antidepressants was reported.392 Examining combinations of psychotherapy and antidepressants for BN, another Cochrane review reported that combination treatments were superior to psychotherapy alone, that psychotherapy appeared to be more acceptable to participants, and that the addition of antidepressants to psychological treatments decreased the acceptability of the psychological intervention.393
In addition, guidelines from the National Institute of Clinical Effectiveness (NICE) in the United Kingdom (http://www.nice.org.uk/) concur that AN evidence is weak. The NICE authors assigned high grades to CBT for BN and BED and to antidepressants for BN. For both BN and BED, NICE recommended self-help as an initial treatment step.
Managing individuals with AN with medication only is inappropriate, based on evidence reviewed here. No pharmacological intervention for AN has a significant impact on weight gain or the psychological features of AN. Although mood may improve with tricyclic antidepressants, this outcome is not associated with improved weight gain. Moreover, medication treatment for AN is associated with high dropout rates, suggesting that the currently available medications are not acceptable to individuals with AN.
For BN, good evidence indicates that fluoxetine (60 mg/day) reduces core bulimic symptoms of binge eating and purging and associated psychological features of the eating disorder in the short term. Based on two studies, the 60 mg dose performs better than lower doses and may contribute to decreased relapse at 1 year; however, patients do not tend to remain on the drug. Preliminary evidence exists for other second-generation antidepressants (trazodone and fluvoxamine), an anticonvulsant (topiramate), and a tricyclic antidepressant (desipramine). Preliminary evidence exists that monoamine oxidase inhibitors (MAOIs) are associated with decreased vomiting in the treatment of BN, although diet should be closely monitored.
Medication trials for BED have focused primarily on overweight individuals with BED. In these individuals, desired outcomes are twofold: weight loss and abstinence from binge eating. The majority of medication research for BED reflects short-term trials. Preliminary efficacy has been shown for selective serotonin reuptake inhibitors (SSRIs), one serotonin, dopamine, and norepinephrine uptake inhibitor, one tricyclic antidepressant, one anticonvulsant, and one appetite suppressant. In the absence of abstinence data and long-term followup, however, we do not know whether observed changes in binge eating, depression, and weight persist.
For adult AN, we have tentative evidence that CBT reduces relapse risk for adults with AN after weight restoration has been accomplished. By contrast, we do not know the extent to which CBT is helpful in the acutely underweight state, as one study found that a manual-based form of nonspecific supportive clinical management was more effective than CBT and interpersonal psychotherapy (IPT) in terms of global outcomes during the acute phase. No replications of these studies exist.
Family therapy as currently practiced has no supportive evidence for adults with AN and a comparatively long duration of illness. Overall, family therapy focusing on parental control of renutrition is efficacious in treating younger patients with AN; these approaches lead to clinically meaningful weight gain and psychological improvement. Although most studies of family therapy compared one variant of family therapy with another, two studies produced results suggesting that family therapy was superior to an individual therapy for adolescent patients with shorter duration of illness.
For BN, evidence for CBT is strong. Although IPT is also as effective, at 1-year followup, based on one study, symptomatic change appears to be more rapid with CBT. This factor decreases the time that patients are exposed to the symptoms of BN. Dialectical behavioral therapy (DBT) and guided imagery both show preliminary promise for BN patients.
For BED, CBT decreases the target symptom of binge eating. It does not, as currently delivered, promote weight loss in overweight patients. DBT may hold promise for BED patients as well.
Although many of the medication trials for AN were conducted within the context of basic clinical management, no study that systematically studied medication plus psychotherapy for AN met our inclusion criteria.
For BN, the combined drug plus behavioral intervention studies provide only preliminary evidence regarding the optimal combination of medication and psychotherapy or self-help. Although some preliminary evidence exists for incremental efficacy with combined treatment, given the variety of designs used and lack of replication, evidence remains weak.
For BED, the combination of CBT plus medication may improve both binge eating and weight loss outcomes. Sufficient trials have not been done to determine definitively which medications are best at producing and maintaining weight loss in this population. Moreover, the optimal duration of medication treatment for abstinence from binge eating and sustained weight loss has not yet been addressed empirically, yet weight-loss effects of medication are generally known to cease when the medication is discontinued.394
Across the three disorders, we found evidence of various innovative approaches that seem to hold promise, especially for conditions as complex as these eating disorders. Nonetheless, nothing can be said definitively because the trials were small and inconclusive.
The AN outcomes literature clearly and consistently identified that the risk of death is significantly higher in the AN population than would be expected in the population in general. Life-threatening complications of the disease include not only those directly related to weight loss and other physical problems but also a significantly elevated risk of suicide.
Studies were inconsistent concerning whether deceased patients had been included in the analysis sample at followup. Therefore, factors related to poor outcomes did not always include mortality risk. Several studies identified factors related to death versus all other outcomes. Only by including death with other outcome categories can we determine if factors related to death differ from factors related to other poor outcomes.
Individuals with BN and BED were not identified as being at elevated risk of death.
Adequate sample sizes. Especially in AN clinical trials, sample size was often insufficient to draw conclusions regarding differential efficacy across groups. Even when investigators did power calculations, they often did not plan an adequate allowance for attrition. Given this limitation, researchers using designs that contrasted one approach with another most commonly observed no differences across interventions. This result was especially true in trials of behavioral interventions and even more so in those that included a large number of comparison groups.
Accurate power analyses should be conducted before starting any study and presented in the methods section. Larger multisite studies should be conducted as a means of bolstering patient numbers.
Subgroup analyses. Even in the face of small sample sizes, many authors conducted subgroup analyses on outcome variables, often in the absence of a priori hypotheses. In these small studies, the ability to discern even large differences between groups is limited, and some findings might arise by chance. Investigators must avail themselves of adequate statistical assistance to ensure against inappropriate analyses of this sort.
Attrition. Loss to followup and dropout from clinical trials is especially problematic in AN studies.395 Individuals with AN are often in denial, deeply fearful of weight gain (which is the key treatment outcome), and hesitant to take medication. High attrition compromises the integrity of outcome data; differential attrition between treatment intervention groups and comparison (e.g., usual-care or placebo) groups is even more damaging. In light of high attrition, researchers often reported completer analyses rather than intention-to-treat analyses, and the former practice can bias results.
Substantial attention needs to be paid to enhancing motivation for treatment in individuals with AN and to improving retention in clinical trials. Although dropout is somewhat lower in BN and BED studies than in AN studies, investigators should also address these factors in clinical trials for these disorders.
In general, the eating disorders literature suffers from insufficient rigor with respect to statistical design and analysis in both the planning and conduct of trials. This leads to both gaps and inaccuracies in reporting and interpreting results. Minimally, these problems call into question the validity of the conclusions that can be drawn from individual studies. More broadly, it limits cross-study comparisons and the systematic accumulation of findings that stand the test of time and replication. Ultimately, these problems will hinder the advancement of effective treatments.
Unclear randomization and allocation concealment. Randomization procedures were not of uniformly high standards in the AN, BN, and BED literatures. Many studies failed to report how investigators achieved randomization (if indeed they did achieve it). In many instances, clinical decisions interfered with the integrity of the randomization procedures. No studies reported procedures for allocation concealment.
Trial design challenges. A common problem involves lack of attention to the within-subject repeated design inherent in intervention and treatment trials. For example, studies often indicate the use of repeated-measures analysis but then actually report analysis of posttreatment outcome data only using a paired t-test to identify treatment group differences. In some cases, investigators include baseline data as a covariate (which is not explicitly identical to using a repeated-measures model); in other cases, they do not take baseline data into account at all.
In addition, authors sometimes compute a change (delta) score (posttreatment minus baseline) representing within-subject change over time. This is a reasonable (indeed, often preferable) analytic approach to understand pre-post differences. However, they then fail to account for baseline differences that could result in misinterpretation of mean within-group delta scores; an example is when higher baseline values are associated with smaller delta scores.
Overall, advances in this field demand more clarity in the description of analytical methods employed, including specifically the analytic models that have been determined a priori, and for the use of repeated measures models with appropriate inclusion of covariates. Attention to these recommendations should improve our ability to integrate information from disparate studies and to draw conclusions with higher yield with respect to the design and implementation of future interventions.
Duration of treatment and absence of followup. Only a very few studies included a dimension of differential duration of treatment in their designs. Assuming that a medication trial that lasts weeks is likely to have long-lasting effects on symptoms that have been present often for many years is unrealistic. Realistic duration of treatment and longer followup of patients in clinical trials for AN, BN, and BED are essential. In addition, strategies to develop continuation and maintenance treatments have not yet been addressed in this field. They are a critical next step in both medication and psychotherapy research.
Excessive diagnostic and outcome measures. The field of eating disorders has spawned an unusually large array of diagnostic and outcome assessment measures. The lack of consistency of measures renders comparisons across studies virtually impossible. This problem is an especially important barrier to standardizing measures of weight and weight change in outcome assessments and trials involving AN therapies, especially when age and sex corrections for body mass index (BMI) should be employed. Future efforts to refine and consolidate the number of measures would be a valuable contribution to the field.
Researchers should be careful not to include too many outcome measures in their designs. They need to avoid having many outcome variables at the expense of the most important behavioral indicators. Excessive numbers of outcome measures, especially those that may be closely related, lead to a higher likelihood of Type I errors and an inevitable focus on the minor significant findings that do emerge. This is especially detrimental to understanding the efficacy of therapeutic regimens when those findings are not the most clinically relevant dimensions or when their relevance to recovery is unknown.
Treatment of medical morbidities. Insufficient attention has been paid to addressing the optimal approach to treatment of serious long-term physical sequelae of AN and BN, most notably osteoporosis. We advise that measures of physical health issues be considered in the design of future trials.
Sociocultural context. Although the facilitating nature of sociocultural forces such as emphasis on thinness and unhealthy dieting have long been acknowledged, few treatment or outcome studies have attempted to measure the impact of these pernicious contextual factors. Although these variables are less tractable (for study design and conduct) than more readily measured factors such as eating-disordered behaviors, depression, anxiety, or biomarkers, greater attention to developing effective methods to measure these contextual factors may reveal important and often overlooked factors that influence recovery. This in turn may open new avenues for prevention, community education, policy, and strategies for maintenance of treatment gains.
Lack of definition of stage of illness, remission, recovery, and relapse. For AN, BN, and BED, investigators did not apply consensus definitions of stage of illness, remission, recovery, and relapse. Developing standardized definitions of these terms for each disorder and the means to evaluate them are high priorities for future research. Accomplishing this will require a concerted and orchestrated effort to bring researchers together to develop such definitions and reporting guidelines.
Reporting change as reduction in behaviors rather than abstinence or remission. Especially in the BN and BED literature, researchers commonly reported outcomes such as percentage reduction in binge days, percentage reduction in binges, or amount of time spent binge eating. Although these are potential indicators of therapeutic change, when used alone they can be misleading because individuals with high weekly binge eating can reduce this behavior by even as much as 50 percent but still be highly symptomatic. Depending on the disorder and core behaviors being targeted, future studies should report either abstinence from binge eating, vomiting, and other compensatory behaviors or absence of binge days for a specified duration of time (at least 1 month but preferably longer).
Statistical reporting. Frequently, authors do not report degrees of freedom, making it impossible to decipher the exact nature of the model being tested. Incomplete reporting of results derived from multivariate models is problematic. Authors should take care to report clearly any interaction, between-group, and within-group effects when they employ repeated designs.
Statistically significant differences versus clinically meaningful differences. Across all three disorders attention to distinguishing between statistically significant and clinically meaningful differences is insufficient. For example, significant differences in weight gain in AN and in weight loss in BED may be observed; however, the extent to which group differences as small as 1 kg to 2 kg truly represent clinically meaningful differences is rarely addressed. Definitions of what constitutes clinically meaningful differences in eating disorders are required.
This issue is even more complex when dealing with psychological features of the eating disorder or associated anxiety or depression. Although significant group differences may emerge in a parameter such as hunger, the extent to which this type of finding reflects improvement in the disorder and is a harbinger for remission remains unknown.
Gaps in the literature can be identified for the specific diseases and for broader issues of research across eating disorders. We first examine deficits in the evidence base for the main types of interventions (for one, two, or all three of the conditions), drawing on the points made above about the quality of articles or strength of evidence. We then turn to broader methods and related issues for the entire body of investigations in these conditions.
Medications. Discovering new medications that target the core biological and psychological features of AN, address adverse medical sequelae such as osteoporosis, and enhance motivation and retention in medication trials are critically needed steps. As noted, fluoxetine offers some benefits for BN patients. Additional studies are required to determine the long-term effectiveness of relatively brief medication trials, the optimal duration of medication treatment, and the optimal strategy for maintenance of treatment gains. In addition, work to identify and test novel medications that decrease the urge to purge (e.g., with antiemetics) or reduce the extent to which binge eating and purging are experienced as reinforcing is also warranted. Medication trials should focus on achieving abstinence from binge eating and purging, not merely reducing the frequency with which these behaviors occur. Efforts to improve retention in medication trials for BN are also warranted, as are additional studies combining medications and behavioral interventions.
For BED medication questions, future investigations should take care to report specifically and separately on two outcomes - weight loss and abstinence from binge eating - because weight loss is less applicable to individuals with BED who are of normal weight. Future BED studies should clearly distinguish between normal weight and overweight participants and address whether treatment goals include both cessation of binge eating and weight loss. The impact of high placebo response should be considered in future trials and designs modified accordingly (e.g., sufficiently long placebo run-in phases).
Across all three disorders, no effort has been made to study drug augmentation effects. All trials were monotherapy trials; only a few allowed sequential medication in nonresponders. Investigators should consider augmentation strategies in their future studies.
Behavioral interventions. Strategies for enhancing CBT to change both binge eating and weight loss should be included in the next generation of behavioral studies. They should also focus on strategies for enhancing efficacy of CBT and how best to treat CBT nonresponders. On the basis of preliminary trials, DBT also deserves further study.
Combination interventions. The absence of trials combining medications and behavioral interventions (e.g., psychotherapy) is a serious deficit in the AN literature, and it is striking given that treatment delivered in the community for AN patients is often some form of combination treatment. Future studies must address the efficacy of various combinations of treatments for individuals with AN. Future studies should further explore optimal combinations and how best to combine treatments for BN patients who do not respond to CBT or fluoxetine alone. For BED patients, the needed research centers more on which medications have the greatest efficacy for producing desired outcomes and the optimal duration of medication use.
Novel and “borrowed” interventions. Research on innovative medications and behavioral treatments are warranted, especially given the state of treatment of AN. Medications studied to date have either focused on peripheral symptoms such as depression or anxiety or attempted to capitalize on medication side effects such as weight gain, with the aim of aiding weight restoration in AN. Of special importance will be trials of novel medications that target core biological and cognitive features of the disorders and that are also acceptable to patients.
Similarly, psychotherapies applied to eating disorders have been borrowed from other fields such as depression (CBT and IPT), anxiety disorders (exposure with response prevention), and personality disorders (DBT). We should actively seek to further adapt psychotherapeutic interventions that are tailored to the unique core pathology of eating disorders (e.g., drive for thinness, body dissatisfaction, appetite dysregulation) and that are both efficacious and acceptable to the patients. New behavioral interventions that target motivation to change and encourage retention in treatment are required. Further dismantling of complex therapies such as CBT to determine the active therapeutic components is also warranted.
Other fields are benefiting from the application of new information technologies to the treatment of illness. Adequately powered clinical trials that include the use of email, the Internet, personal digital assistants, text messaging, and other technological advances to enhance treatment will add to future treatment development. These approaches may be well suited to disorders marked by shame, denial, and interpersonal deficits and where availability of specialty care is limited.
Multidisciplinary interventions. Specialist inpatient and partial hospitalization treatment of AN often reflects a multidisciplinary approach: medicine, psychiatry, psychology, nutrition, family therapy, and sometimes additional disciplines such as recreational therapy and occupational therapy. The majority of treatment trials have been monotherapeutic. When they are multidisciplinary, the actual component of multidisciplinarity was rarely a variable on which patients were randomized. Studies that directly address the therapeutic benefits of and optimal approach to multidisciplinary treatment are required.
Maintenance of gains after drug discontinuation. For all three disorders, investigators typically failed to provide adequate follow-up time for medication trials. This means they cannot determine the extent to which positive behavior changes seen during medication administration are maintained over time. At minimum, such studies should have at least 1 year of followup. Especially with BN and BED, for which evidence for the short-term efficacy of medication interventions exists, additional information on maintenance of treatment gains, prevention of relapse, and optimal duration of medication treatment are critical next phases for clinical trials.
Patients with anorexia nervosa. AN is a serious psychiatric illness. Treatment research on AN is particularly challenging given the characteristic denial of illness, high drop-out rates from treatment, and the limited population prevalence in any single catchment area. Despite the fact that this is the most challenging eating disorder to treat, our evidence base is scant. Studies tend to be small, inadequately powered, and hence inconclusive. Medications studied to date have either focused on peripheral symptoms such as depression or anxiety or attempted to capitalize on medication side effects such as weight gain, with the aim of aiding weight restoration in AN. Both medication and behavioral intervention trials tend to be derivative—using medications or behavioral interventions that are borrowed from other areas of medicine without focusing on the core symptoms of AN.
We noted above some specific gaps related to medication and psychotherapy interventions. We reiterate here the urgency of more, and better, research on this disease. Trials of novel medications that target the core cognitive symptoms and biological processes of AN and medical sequelae are especially needed to move the field forward.
The literature on AN has failed to distinguish sufficiently between interventions targeted at individuals before or after weight restoration and has failed to address the optimal approach to renutrition. Indeed, whether medication and behavioral interventions have different outcomes depending on weight status remains murky. Given that low weight and malnutrition can interfere with the efficacy of medication and the ability to process information in psychotherapy, the optimal timing of the administration of medications and therapy vis-a-vis weight restoration is a critical question that remains unaddressed.
Patients with eating disorders not otherwise specified (EDNOS). Several treatment centers have reported that the majority of individuals who seek treatment for an eating disorder receive a diagnosis of EDNOS.88, 89 EDNOS is a compound category illustrated in the Diagnostic and Statistical Manual, Version IV (DSM IV), by six examples including BED. Despite the patient characteristics that lead to this diagnosis, investigators appear to have ignored systematically those with EDNOS diagnoses. Given the preponderance of individuals with EDNOS diagnoses in treatment settings, this is a serious shortcoming of the literature.
In part, this gap reflects the greater clarity and homogeneity that investigators can achieve in clinical trials when they recruit only individuals with clearly defined AN or BN. However, the price of this clarity is generalizability and, ultimately, understanding the effectiveness of interventions tested. Although some trials have begun to expand inclusion criteria to reflect typical clinical practice, others have retained strict inclusion criteria. Only by further clarifying clinical syndromes within the current EDNOS category and investigating the optimal approach to treat these conditions will we be able to determine how best to treat the majority of treatment-seeking individuals.
Improved epidemiologic data are required to determine whether the frequency with which EDNOS is seen in the clinic reflects population prevalence rates of the various eating disorders. In addition, active strides should be taken to characterize the syndromes that are captured under the heading of EDNOS and to determine the best way to treat conditions that exist under that umbrella diagnostic category.
The need for additional attention to individuals with EDNOS was clearly shown through our review of the outcomes literature. EDNOS is a common outcome among individuals who formerly had AN or BN. However, virtually nothing is known about the persistence of these conditions.
Age and lifespan orientation. The treatment literatures on AN, BN, and BED differ in how they examine differential therapeutic outcomes by age group. For all three disorders, a more thoughtful lifespan approach is required to determine optimal approaches from childhood through older adulthood.
The AN literature is devoid of medication studies for adolescents; drug trials have focused exclusively on adults. Future medication trials should explore medication efficacy in adolescents and the differential efficacy of medications between adolescents and adults.
In contrast, behavioral interventions have focused more on adolescent patients, possibly because of the existence of various family therapy models that are well suited to the context within which adolescent AN arises. Nonetheless, behavioral interventions should pay greater attention to the appropriateness of various approaches across the lifespan (including duration of illness) and of adaptations that depend on age of the patient.
The extent to which CBT approaches to adolescent treatment of AN were adapted to match the developmental level of the patients is unknown. Likewise, approaches that are effective in adolescents may be inappropriate for adults, although developmentally appropriate adaptations may be worthy of study. For example, the relative efficacy of family therapy for adolescents with AN may signal the important role of the family. However, the family of relevance for an adult with AN may be her or his spouse and children rather than family of origin. Such permutations of the therapeutic approach have not yet been tested.
For BN, most commonly older adolescent and adult patients received the same treatment and researchers made no effort to explore differential outcome by age group. Future studies that delve more into mechanisms of treatment response should take care to explore differential age effects.
For BED, no medication or behavioral intervention trials exist for adolescents. No study enrolled patients younger than 18; many included individuals up to 65 without documenting age effects. The first step for BED research is to acquire epidemiologic data to determine the extent to which this disorder is a problem for adolescents. The second needed step is to explore differential outcomes by age.
Males and females. Although males suffer from eating disorders, they are underrepresented in clinical trials of AN and BN. When included, their numbers are usually too small to be analyzed separately. Clinical trials of BED often include a greater number of men; however, no study has reported on differential efficacy by sex.
This situation can be remedied, first, by better studies comparing the phenomenology of AN, BN, and BED in males and females. Second, more extensive epidemiological data can provide more accurate estimates of the actual sex ratio in the population. Third, efforts should be expanded to explore differential treatment needs and outcomes in males and females across the age spectrum. Fourth, we have no data on whether treatment for eating disorders is best conducted in mixed-sex or single-sex environments. Fifth, multisite trials can be designed to increase sample size of male participants.
We note that much of the literature to date deals with males and females (a construct related to sex and biology). Very little research, apparently, tries to deal with gender (a construct related to socialization and social roles). We believe that more attention to the difference between these ideas, and some effort to understand the impact of gender, and not simply sex, may be valuable in understanding treatment approaches and efficacy.
Race and ethnicity. The majority of the literature on AN fails even to report the race and ethnicity of participants. All descriptions of participants should include this critical parameter. Although the more recent BN and BED literature has improved on this point, no studies of medication or behavioral interventions have addressed the issue of whether treatment efficacy differs by race or ethnic background. This is a serious omission in the literature.
To remedy this shortcoming, we must collect adequate epidemiologic data to provide critically needed information about the frequency with which eating disorders occur across racial and ethnic groups. Such data would provide guidance for planning targeted recruitment in clinical trials and enable researchers to set priorities for approaches to incorporating race and ethnicity into both treatment and outcomes studies. In addition, further exploration of sociocultural factors (e.g., stigma) may also assist with understanding both underdetection and underrepresentation of racial and ethnic minorities in research studies.
Underserved populations. The literature on AN, BN, and BED is devoid of any mention of specific issues of gay, lesbian, transsexual, or transgender individuals. These parameters should be systematically recorded in both treatment and outcome studies.
The United States' contribution to the literature. The literatures on AN, BN, and BED are geographically imbalanced. Although the United States has contributed considerably to the literature on BN and BED, it has done much less on both the treatment and outcome literature for AN. Although outcome studies of AN may be more difficult in the United States because of the mobility of the population, large-scale multisite treatment trials are perhaps more feasible in the United States given the number of academic treatment centers, the generally shared language, and the size of the population base. The United States should expand its contribution to the global literature for the next phase of treatment studies, especially for AN.
In addition, the unique racial and ethnic composition of the United States could assist with addressing the vacuum of information regarding differential treatment outcome by race and ethnicity across AN, BN, and BED. For the outcomes literature, the majority of literature for AN comes from outside of the United States. The extent to which data from outside the United States accurately reflect outcomes in the United States is unclear.
Replication. The hallmark of good science is replication. One major weakness of the existing literature and a critical need for the future is replication. Once efficacious interventions are identified, adequately powered replication studies should be supported to confirm their effectiveness. Results of such studies would need to be careful to report findings using measures and statistical techniques that would allow for direct comparisons across trials.
Large multisite randomized controlled trials. The majority of eating disorders treatment studies are small, single-site trials. The average sample size of AN trials, 23, illustrates this point robustly. Future multisite trials will facilitate patient recruitment, enhance statistical power, enable meaningful subset analyses, buffer against high drop-out rates, and improve generalizability of results. Working in partnership with insurance companies to enable such trials in the current reimbursement milieu may be critical to success.
Generalizability and key treatment questions in the community. Clinical trials for AN in particular do not adequately reflect the type of treatment typically delivered in the community. Nor do clinical trials for AN address some of the key challenges facing clinicians who treat this disorder in inpatient and partial hospitalization or residential settings.
For low-weight patients with AN, the first treatment challenge is weight restoration. Guidelines from the American Psychiatric Association (APA) suggest that individuals at 75 percent of ideal body weight (IBW) or lower are candidates for inpatient weight restoration, although many other factors influence level of care decisions. When facilities are available, weight restoration occurs in hospital, followed by various levels of step-down marked by increasing autonomy and exposure to real-life eating and emotional situations.
No clinical trials for AN address the optimal approach to inpatient weight restoration that can achieve the most lasting gain. This also includes nutritional trials of optimal approaches to renutrition. No studies address the accuracy of the recommendation for hospitalization at 75 percent IBW. No studies address the optimal conditions under which a patient should be discharged from inpatient treatment and stepped down to less structured environments. Given the financial expense of prolonged inpatient hospitalizations and the toll on both patient and family, the conditions under which extended hospitalizations are superior to intensive outpatient management should be the focus of future studies.
Harms of treatment. Trials of medication or behavioral interventions for patients with AN, BN, and BED do not routinely describe the degree of medical compromise or strategies to monitor for potential harm in malnourished patients. Indeed, behavioral intervention trials often completely overlook the fact that their interventions may have adverse effects on patients. Especially given the high drop-out rates from AN trials, behavioral interventions should pay greater attention to both physical and psychological harms associated with interventions. All studies should report adverse events associated with interventions with these disorders. In addition, with AN, researchers should determine, especially within medication trials, whether adverse events differ between the underweight and the weight-restored state.
Outcomes research and treatment research. One serious gap in the evidence base about eating disorders is the absence of “cross talk” between the outcomes and the treatment literatures. Outcomes literature reveals intriguing problems that persist years after the onset of AN. One example is the presence of autism spectrum disorders reported in the Göteborg cohort.344–346, 349, 356 Such observations could provide critical information to individuals designing new interventions for AN. Targeting social information processing deficits, for example, could be one way to enhance AN treatment delivery. Paying greater attention to premorbid traits and traits that persist after recovery or through persistent illness may help to enhance treatment efficacy by identifying new treatment targets.
In addition, greater attention to demographic patterns in outcome studies such as typical age of recovery from AN may assist with better appraising where an individual entering treatment is in the course of her or his illness. This could assist with enhancing engagement in treatment and reducing the number of dropouts.
Prospective cohort studies and comparison groups. Virtually all the outcome results and relationships that we identified came from case series studies. This design limits generalizability beyond the specific treatment population being studied. Only one prospective cohort study has been conducted with individuals identified with AN; none has been done among persons with either BN or BED. Therefore, little evidence exists as to whether outcomes differ across treatment populations, individuals in the general population who suffer from these disorders, and those who may not meet threshold diagnostic criteria yet report symptoms or features of the disorders.
Of particular interest would be studies that address factors associated with successful outcomes in AN or BN; these should explore trajectories of recovery and how current diagnostic nosology captures those trajectories. For example, an individual with AN who is assessed 5 years after the onset of that illness may be given a diagnosis of EDNOS; this pattern fails to acknowledge that the patient is on a recovery trajectory from AN. The appropriateness of receiving a diagnostic label (EDNOS) different from the original diagnosis (AN), rather than a specific indicator such as AN in partial remission, has yet to be addressed adequately in the literature.
Tracing outcomes across diagnoses. Many individuals who at one time suffered from AN or BN continue to experience less severe eating disorders in later years. Use of dichotomous or simplistic measures of disease state is increasingly seen as uninformative. Additional research is needed that can sufficiently capture the factors associated with transitions in severity of eating disorder diagnoses.
Statistical methods for outcomes research. Outcomes studies vary in their statistical sophistication. At their best, studies used multivariate techniques to control for the influence of various independent variables on outcomes; they may also employ survival analyses techniques to control for differences in the length of time that patients were followed. At their more rudimentary state, many studies simply presented descriptive comparisons between a series of prognostic factors and outcomes of interest, or they employed techniques more appropriate for exploratory research (e.g., stepwise regression). We encourage investigators doing outcomes research (as contrasted with trials) to plan from the outset on using advanced statistical and analytic methodological approaches.
Impact of weight loss treatment on binge eating. Although not a focus of this review, with the ever-increasing obesity epidemic,396, 397 an important area of study will be the impact of various weight loss treatments on binge eating and on the development of eating disorders and eating-disordered behaviors. Programs developed for obesity prevention and treatment in both children and adults should be carefully monitored to ensure that no untoward effects emerge that increase eating-disordered behaviors.398–401
Cost-effectiveness analyses. Only rarely has the cost-effectiveness of interventions for AN, BN, and BED been addressed. At some point, however, some medications, behavioral approaches, or combination therapies will appear to be efficacious in trials or effective in broader trials or observational studies. Then, clinicians, insurers, health plan administrators, and others will want information on the relative cost-effectiveness of different therapeutic options. To provide information to address these questions, future studies should include data collection of costs and cost-effectiveness analyses in their designs.
The literature regarding treatment efficacy and outcomes for AN, BN, and BED is of highly variable quality. For AN, the literature on medications was sparse and inconclusive. No studies combining medication with behavioral interventions met inclusion criteria. Evidence suggests that specific forms of family therapy are efficacious in treating adolescents, and preliminary evidence suggests that CBT may reduce relapse risk for adults after weight restoration and that a manual-based form of nonspecific supportive clinical management may be effective in underweight adults.
For BN, fluoxetine (60 mg/day) decreases the core bulimic symptoms of binge eating and purging and associated psychological features in the short term. CBT administered individually or in groups reduced core behavioral symptoms and psychological features in both the short and long term. How best to treat individuals who do not respond to CBT or fluoxetine remains unknown.
In BED, CBT reduced binge eating and leads to greater rates of abstinence when administered either individually or in group format, persisting for up to 4 months after treatment; however, CBT does not lead to weight loss in individuals with BED. Medications may also play a role in the treatment of BED although further research addressing how best to achieve both abstinence from binge eating and weight loss in overweight patients is required.
Higher levels of depression and compulsivity were associated with poorer outcomes in AN; increased mortality was associated with concurrent alcohol and substance use disorders. Only depression was consistently associated with poorer outcomes in BN; BN was not associated with an increased risk of mortality. Because of sparse data, we could reach no conclusions concerning BED outcomes. We uncovered weak to no evidence to address sociodemographic differences in either treatment or outcomes for any of these disorders.
The quality of the literature about treatment efficacy and outcome for AN, BN, and BED is highly variable. In the treatment literature, the largest deficiency rests with treatment efficacy for AN; we rated this literature as the weakest.
Future AN studies require large numbers of participants, multiple sites, clear delineation of the age of participants, and interventions that are tailored to the unique core pathology and medical sequelae of the illness. For BN, future studies should address novel treatments for the disorder, optimal duration of intervention, and optimal approaches for those who do not respond to medication or CBT. For BED, future studies require better explication of how best to target both binge eating and weight loss goals, optimal duration of intervention, and prevention of relapse.
For all three disorders, exploring additional treatment approaches is warranted. In addition, research teams should pay greater attention to factors influencing outcome, harms associated with treatment, and differential efficacy by age, sex, gender, race, ethnicity, and cultural group. Consensus definitions of remission, recovery, and relapse are essential. For both treatment and outcome literature, greater attention is required to the presentations currently grouped under the heading of EDNOS.
Outcome studies, especially for BN and BED, should emphasize population-based cohort studies with comparison groups and plan for adequate durations of follow-up. Ongoing psychiatric epidemiology studies should routinely include assessments of eating disorders. Epidemiologic studies of BMI and obesity trends should include assessments of eating-disordered behavior. Population-based studies should include measures of disability and impairment associated with eating disorders. For both future treatment and outcome studies, researchers must carefully attend to issues of statistical power, research design, and sophistication and appropriateness of statistical analyses.
| #3 | Search “Eating Disorders”[MeSH] | 17336 |
| #4 | Search “Eating Disorders”[MeSH] Field: All Fields, Limits: Randomized Controlled Trial | 467 |
| #6 | Search “Anorexia”[MeSH] OR “Anorexia Nervosa”[MeSH] | 9631 |
| #7 | Search “Anorexia”[MeSH] OR “Anorexia Nervosa”[MeSH] Field: All Fields, Limits: Randomized Controlled Trial | 195 |
| #11 | Search “Bulimia”[MeSH] | 3624 |
| #12 | Search “Bulimia”[MeSH] Field: All Fields, Limits: Randomized Controlled Trial | 210 |
| #14 | Search “Therapeutics”[MeSH] | 1607160 |
| #23 | Search “Cognitive Therapy”[MeSH] OR “Family Therapy”[MeSH] OR “Drug Therapy”[MeSH] OR “Therapy, Computer-Assisted”[MeSH] | 289583 |
| #34 | Search “Randomized Controlled Trials”[MeSH] OR “Single-Blind Method”[MeSH] OR “Double-Blind Method”[MeSH] OR “Random Allocation”[MeSH] | 157518 |
| #35 | Search #3 AND #34 | 306 |
| #36 | Search #6 AND #34 | 146 |
| #37 | Search #11 AND #34 | 130 |
| #39 | Search #36 OR #7 | 272 |
| #40 | Search #38 OR #12 | 624 |
| #41 | Search #23 OR #14 | 1614410 |
| #42 | Search #41 AND #35 | 111 |
| #43 | Search #41 AND #36 | 45 |
| #44 | Search #41 AND #37 | 49 |
| #45 | Search relapse | 130475 |
| #48 | Search “Recurrence”[MeSH] OR “Patient Readmission”[MeSH] | 103204 |
| #49 | Search #48 AND #4 | 18 |
| #50 | Search #48 AND #6 | 95 |
| #51 | Search #48 AND #11 | 68 |
| #54 | Search “Outcome Assessment (Health Care)”[MeSH] OR “Treatment Outcome”[MeSH] OR “Outcome and Process Assessment (Health Care)”[MeSH] | 236323 |
| #55 | Search #54 AND #4 | 139 |
| #56 | Search #54 AND #6 | 341 |
| #57 | Search #54 AND #11 | 304 |
| #58 | Search “binge eating” | 1240 |
| #59 | Search #58 AND #34 | 50 |
| #60 | Search #58 AND #41 | 335 |
| #61 | Search #58 AND #48 | 22 |
| #65 | Search #60 AND #59 | 2 |
| #69 | Search #12 OR #37 | 274 |
| #3 | Search “Eating Disorders”[MeSH] | 17336 |
| #4 | Search “Eating Disorders”[MeSH] Field: All Fields, Limits: Randomized Controlled Trial | 467 |
| #6 | Search “Anorexia”[MeSH] OR “Anorexia Nervosa”[MeSH] | 9631 |
| #7 | Search “Anorexia”[MeSH] OR “Anorexia Nervosa”[MeSH] Field: All Fields, Limits: Randomized Controlled Trial | 195 |
| #11 | Search “Bulimia”[MeSH] | 3624 |
| #12 | Search “Bulimia”[MeSH] Field: All Fields, Limits: Randomized Controlled Trial | 210 |
| #14 | Search “Therapeutics”[MeSH] | 1607160 |
| #23 | Search “Cognitive Therapy”[MeSH] OR “Family Therapy”[MeSH] OR “Drug Therapy”[MeSH] OR “Therapy, Computer-Assisted”[MeSH] | 289583 |
| #24 | Search #3 AND #23 | 789 |
| #25 | Search #6 AND #23 | 463 |
| #26 | Search #11AND #23 | 0 |
| #27 | Search #11 AND #23 | 291 |
| #34 | Search “Randomized Controlled Trials”[MeSH] OR “Single-Blind Method”[MeSH] OR “Double-Blind Method”[MeSH] OR “Random Allocation”[MeSH] | 157518 |
| #35 | Search #3 AND #34 | 306 |
| #36 | Search #6 AND #34 | 146 |
| #37 | Search #11 AND #34 | 130 |
| #39 | Search #36 OR #7 | 272 |
| #40 | Search #38 OR #12 | 624 |
| #41 | Search #23 OR #14 | 1614410 |
| #42 | Search #41 AND #35 | 111 |
| #43 | Search #41 AND #36 | 45 |
| #44 | Search #41 AND #37 | 49 |
| #45 | Search relapse | 130475 |
| #48 | Search “Recurrence”[MeSH] OR “Patient Readmission”[MeSH] | 103204 |
| #49 | Search #48 AND #4 | 18 |
| #50 | Search #48 AND #6 | 95 |
| #51 | Search #48 AND #11 | 68 |
| #54 | Search “Outcome Assessment (Health Care)”[MeSH] OR “Treatment Outcome”[MeSH] OR “Outcome and Process Assessment (Health Care)”[MeSH] | 236323 |
| #55 | Search #54 AND #4 | 139 |
| #56 | Search #54 AND #6 | 341 |
| #57 | Search #54 AND #11 | 304 |
| #58 | Search “binge eating” | 1240 |
| #59 | Search #58 AND #34 | 50 |
| #60 | Search #58 AND #41 | 335 |
| #61 | Search #58 AND #48 | 22 |
| #65 | Search #60 AND #59 | 25 |
| #66 | Search #48 AND #3 | 186 |
| #67 | Search #54 AND #3 | 680 |
| #68 | Search #54 AND #58 | 134 |
| #1 | Search outcomes | 96219 |
| #10 | Search “Outcome Assessment (Health Care)”[MeSH] OR “Fatal Outcome”[MeSH] OR “Treatment Outcome”[MeSH] OR “Outcome and Process Assessment (Health Care)”[MeSH] OR “Weight Gain”[MeSH] OR “Osteoporosis”[MeSH] OR “Tooth Diseases”[MeSH] OR “Suicide”[MeSH] OR “Stomach Diseases”[MeSH] | 511077 |
| #17 | Search “Randomized Controlled Trial”[Publication Type] OR “Randomized Controlled Trials”[MeSH] OR “Double-Blind Method”[MeSH] OR “Single-Blind Method”[MeSH] AND “Random Allocation”[MeSH] OR “Longitudinal Studies”[MeSH]OR Observational Study | 492971 |
| #1 | Search anorexia [mh] OR anorexia nervosa [mh] or bulimia [mh] or “binge eating disorder” [tw] OR eating disorders [mh] OR “binge eating” [tw] | 17671 |
| #2 | Search coprophagia [mh] OR hyperphagia [mh] OR pica [mh] | 2392 |
| #3 | Search #1 NOT #2 | 15279 |
| #4 | Search adverse effects [subheading] OR harms [tw] OR “side effects” [tw] OR “adverse effects” [tw] OR death [mh] OR drug hypersensitivity [mh] OR drug toxicity [mh] OR seizures [mh] | 1211380 |
| #5 | Search #3 AND #4 | 1675 |
| #6 | Search therapeutics [mh] OR therapy [subheading] | 3923801 |
| #7 | Search #5 AND #6 | 1228 |
| #4 | Search “Eating Disorders”[MeSH]OR “binge eating” [tw] | 17669 |
| #8 | Search (“Anorexia”[MeSH] OR “Anorexia Nervosa”[MeSH]) OR “Bulimia”[MeSH] OR “binge eating disorder” [tw] | 11821 |
| #9 | Search #4 NOT #8 | 5848 |
| #10 | Search #4 NOT #8 Field: All Fields, Limits: Randomized Controlled Trial | 99 |
| #15 | Search (“Randomized Controlled Trial”[Publication Type] OR “Randomized Controlled Trials”[MeSH]) OR “Single-Blind Method”[MeSH] OR “Double-Blind Method”[MeSH] OR “Random Allocation”[MeSH] | 277468 |
| #16 | Search #9 AND #15 | 133 |
| #17 | Search #10 OR #16 | 133 |
| #18 | Search longitudinal studies [mh] OR observational study [mh] | 472680 |
| #20 | Search #18 AND #9 | 291 |
| #21 | Search #20 OR #16 | 400 |
| #27 | Search “Cognitive Therapy”[MeSH] OR “Therapeutics”[MeSH] OR “Family Therapy”[MeSH] OR “Drug Therapy”[MeSH] OR “Therapy, Computer-Assisted”[MeSH] | 1640745 |
| #28 | Search #27 AND #9 | 910 |
| #29 | Search #28 NOT #8 | 910 |
| #30 | Search #28 NOT #8 Field: All Fields, Limits: 5 Years | 277 |
| #31 | Search #28 NOT #30 | 633 |
| #32 | Search #28 NOT #30 Field: All Fields, Limits: 10 Years | 194 |
| #33 | Search #31 NOT #32 | 439 |
| #37 | Search “Recurrence”[MeSH] OR “Patient Readmission”[MeSH] | 104066 |
| #38 | Search #9 AND #37 | 49 |
| #45 | Search (“Outcome Assessment (Health Care)”[MeSH] OR “Treatment Outcome”[MeSH] OR “Outcome and Process Assessment (Health Care)”[MeSH]) OR “Weight Gain”[MeSH] OR “Osteoporosis”[MeSH] OR “Tooth Diseases”[MeSH] OR “Suicide”[MeSH] OR “Stomach Diseases”[MeSH] | 494808 |
| #46 | Search #9 AND #45 | 482 |
| #50 | Search “Coprophagia”[MeSH] OR “Hyperphagia”[MeSH] OR “Pica”[MeSH] | 2392 |
| #51 | Search #9 NOT #50 | 3922 |
| #52 | Search #9 NOT #50 Field: All Fields, Limits: Randomized Controlled Trial | 70 |
| #53 | Search #15 AND #51 Limits: Randomized Controlled Trial | 70 |
| #54 | Search #52 OR #53 Limits: Randomized Controlled Trial | 70 |
| #55 | Search #51 AND #18 Limits: Randomized Controlled Trial | 15 |
| #56 | Search #51 AND #18 Field: All Fields | 236 |




AA: African American
ABW: percentage of avg body wt (matched for age, gender, and height)
ADDM: adjustment disorder with depressed mood
ads: advertisements
aka: also known as
am: morning
AN: anorexia nervosa
ANBP: anorexia nervosa with binge eating and/or purging
ANCOVA: analysis of covariance
ANSS: anorexia nervosa symptom score
ANOVA: analysis of variance
ANR: restricting anorexia nervosa
AN-RDC: anorexia nervosa with concomitant major depression according to RDC
ANSS: Anorexia Nervosa Symptom Score
ASD: Autism spectrum disorder
avg: average
B-ERP: exposure with response prevention to pre-binge cues
BAI: Beck Anxiety Inventory
BAT: Body Attitudes Test
BP: blood pressure
BCE: bone collagen equivalents
BD: body dissatisfaction
BDI: Beck Depression Inventory
BE: binge eating episode
BEAQ: Binge Eating Adjective Checklist
BED: binge eating disorder
BES: Binge Eating Scale
BF: body fat
BFST: Behavioral family systems therapy
BIAQ: Body Image Avoidance Questionnaire
b.i.d.: twice a day
BITE: Bulimic Investigation Test Edinburgh
BMI: body mass index, measured in kg/m2
BN: bulimia nervosa
BPD: borderline personality disorder
BSI: Brief Symptom Inventory
BSQ: Body Shape Questionnaire
BSS: Body Satisfaction Scale
BT: Behavioral therapy
CA: California
CAT: cognitive analytical therapy
CFT: conjoint family therapy
CBCL: Child Behavior Checklist
CBT: Cognitive-behavioral therapy
CBT-E: Cognitive-behavioral therapy with exposure
CBT-C: Cognitive-behavioral therapy with cognitive interventions for treatment of body disturbance
CCEI: Crown Crisp Experimental Index
CDI: Children's Depression Inventory
CDRS: Contour Drawing Rating Scale
CFT: conjoint family therapy
CGI: Clinical Global Impression
CGI-S score: Clinical Global Impressions-Severity of Illness scores: 1 = normal, 2 = borderline, 3 = mildly ill, 4 = moderately ill, 5 = markedly ill, 6 = severely ill, 7 = among the most extremely ill.
Chi-square: χ2
CI: confidence interval
cm: centimeter
CNT: cognitive nutritional therapy
Co: company
CR: clinician rating
CT: Connecticut
CT: cognitive therapy
CUE: physiological cue assessment
d: day
DBT: Dialectical Behavior Therapy
DIET: Dieter's Inventory of Eating Temptations Questionnaire
Diff: Diff/Different
DSM IV: Diagnostic and Statistical Manual for Mental Disorders, Fourth Edition
DSM III: Diagnostic and Statistical Manual for Mental Disorders, Third Edition
DSM III-R: Diagnostic and Statistical Manual for Mental Disorders, Third Edition, Revised
DT: drive for thinness
Dx: diagnosis
EAT: Eating Attitudes Test
EB-IV: Diagnostic and Statistical Manual for Mental Disorders, eating behavior IV
EBT: educational behavioral therapy
ECG: electrocardiogram
ECT: Experimental Cognitive Therapy
ED: eating disorder
EDE: Eating Disorders Examination
EDE-Q: Eating Disorders Examination-Questionnaire
EDI: Eating Disorder Inventory
EDNOS: Eating disorder-not otherwise specified
EE: expressed emotion
EOIT: ego oriented individual therapy
ERP: exposure with response prevention
ES: effect size
et al: et alia
EWL: excess weightt loss
EXRP: exposure with response prevention
F: F-statistic
FAM-III: Family Assessment Measure
FBNCSG: Fluoxetine bulima nervosa collaborative study group
FH: family history
FL: Florida
FNE: Fear of Negative Evaluation
FRS: Figure Rating Scale
FU: FU
fx: function
g: grams
G: group
GAF: Global Assessment of Functioning Scale
GAF-S: Global Assessment of Functioning-Symptoms
GAF-F: Global Assessment of Functioning-Functionging
GCBT: group cognitive behavioral therapy
GEE: Generalized estimating equation
GI: gastrointestinal
GP: general practitioner
GSI: General Severity Index
HAM-A: Hamilton Rating Score for Anxiety
HAM-D: Hamilton Rating Score for Depression
HBT: Hypnobehavioral therapy
HDRS: Hamilton Depression Rating Scale (also HRSD: Hamilton Rating Scale for Depression)
HM: hazard multiplier
HRQ: Helping Relationship Questionnaire
HS: High School
HSCL: Hopkins Symptom Checklist
hr: hours
ht: height
Hx: history
IBC: Interaction Behavior Code
IBW: ideal body weight
ICBT: individual cognitive behavioral therapy
ICD: International Classification of Diseases
IDDB: Insulin dependent diabetes mellitus
IGF-1:
IL: Illinois
Inc.: Incorporated
info: information
IPT: Interpersonal psychotherapy
ITT: intention to treat
K 2 HPO 4 /cm 3: measure of bone mineral density (BMD)
kcal: kilocalories
Kg: kilograms
KS: Kansas
l: liter
LAGB: laparoscopic adjustable gastric banding
lb: pounds
LIFE: Longitudinal Interval FU Evaluation
Ltd.: limited
m: minutes
MA: Massachusetts
MADRS: Montgomery-Asberg Depression Rating Scale
MANCOVA: multivariate analysis of covariance
MANOVA: multivariate analysis of variance
MAOI: monoamine-oxidase inhibitors
max: maximum
MD: Maryland
MDD: major depressive disorder
MDE: major depressive episode
meds: medication(s)
MET: Motivational Enhancement therapy
mg: milligram
Mg: micrograms
MI: Michigan
Min: minimum
MKAT: measurement of bone specific alkaline phosphatase
mm Hg: millimeters mercury
MMPI: Minnesota Multiphasic Personality Inventory
MMPW: mean matched population wt
mmol: millimole
MN: Minnesota
MOCI: Maudsley Obsessive Compulsive Inventory
mo: month(s)
M-R Scores: Morgan and Russell scale
M-R-H Scale: Morgan-Russell-Hayward Scale
N: number
NA: not applicable
NATO: North Atlantic Treaty Organization
NBPD: non-borderline personality disorder
neg: negative
NG: nutritional groups
NIH: National Institutes of health
NIMH: National Institute of Mental Health
NJ: New Jersey
nM: nanomole
N: number
NC: North Carolina
NICHD: National Institute for Child Health and Development
NM: New Mexico
nmol: nanomile
NR: not reported
NS: not significant
NSMT: Non-specific Self Monitoring
NT: nutritional therapy
NY: New York
NYC: New York City
OBE: objective binge episode
OC: obsessive-compulsive
OCD: obsessive-compulsive disorder
OCPD: obsessive-compulsive personality disorder
outpt: outpatient
OR: odds ratio
P: p-value
P61: Patient's gloval impression
PA: Pennsylvania
PARQ: Parent Adolescent Relationship Questionnaire
P-ERP: exposure with response prevention to pre-purge cues
PE: psychoeducation
PGI: Patient Global Impression
PICP: C-terminal propeptide of type 1 collagen
pmol: picomole
po: per os (by mouth)
pos: positive
PSE: Present State Exam
PSR: Psychiatric Status Rating Scale
psych: psychological or psychiatric
PTSD: posttraumatic stress disorder
QEWPR: Questionnaire on Eating and Wt Patterns - Revised
RAN: restricting anorexia nervosa
RCT: randomized controlled trial
RDC: Research Diagnostic Criteria
RELAX: relaxation training
rhGh: recombinant human growth hormone
RI: Rhode Island
RP: response prevention
RM-ANOVA: repeated measures analysis of variance
RSE: Rosenberg Self Esteem Inventory
RSEQ: Rosenberg Self-Esteem Questionnaire
SADS-C: Schedule for Affective Disorders and Schizophrenia-Change Version
SAS: Social Adjustment Scale
SCI: Shapiro Control Inventory
SCID: Structured Clinical Interview for DSM IV
SCL-90: Hopkins Symptom Checklist-90
SD: standard deviation
SDS: Self-rating Depression Scale
SE: standard error
SEM: standara error of the mean
SES: socioeconomic status
SF-36: Short-Form 36-item quality of life questionnaire
RP: role physical component score
SF: social functioning component score
Vit: vitality component score
SFT: Separated family therapy
SIAB: Structured Interview for Anorexia Nervosa and Bulimic Syndromes
Sig: significant
SMFQ: Short Mood and Feeling Questionnaire
SMR: Standardized Mortality Ratio
SOC: stages of change
SPAQ: Seasonal Patterns Assessment Questionnaire
SR: Self-report
SRQ: Three Factor Eating Questionnaire
SRS: Self-Rating Depression Scale
SSRI: selective serotonin reuptake inhibitor
St: Saint
STAI: State/Trait Anxiety Inventory
STAXI: State Trait Anger Expression Inventory
SUD: substance use disorder
SUDS: Subjective units of distress
sx: symptoms
T: time
t.i.d.: three times a day
TAS-20: Toronto Alexithymia Scale
TCA: tricyclic antidepressants
TFEQ: Three Factor Eating Questionnaire
TN: Tennessee
TT 3: total testosterone
tx: treatment
U: university
UK: United Kingdom
USA: United States
UT: Utah
UTB: Urge to binge
UTP: Urge to purge
VAS: visual analog scale
vs: versus
WAIS: Wechsler Adult Intelligence Scale
WELSQ: Weight Efficacy Life Style Questionnaire
WLFL: Work, Life and Family Leisure Questionnaire
WI: Wisconsin
wk: week
wkly: weekly
WPIC: Western Psychiatric Institute and Clinic
wt: weight
X 2: chi square
YBC-ED: Yale-Brown-Cornell Eating Disorders Scale
Y-BOCS: Yale-Brown Obsessive Compulsive Scale
Y-BOCS-BE: Yale-Brown Obsessive Compulsive Scale Modified for Binge Eating
Yr: year
Yrs: years
Full Text Article Exclusion Criteria Codes for Database
| X1 | Sample size too small |
| X2 | No control or comparison group |
| X3 | No original data (e.g., letters, reviews, etc.) |
| X4 | Does not focus on subjects with primary problem of AN, BN, BED |
| X5 | Study published in abstract form only |
| X6 | Wrong study design (e.g., case series only) |
| X7 | Wrong (or no) outcome |
| X8 | Insufficient statistical analysis to make comparisons |
| X9 | Wrong year (i.e., outside of our inclusion period of 1980-2005) |
| X10 | Drug no longer on the market |
| X11 | Uses DSM-III definition for BN |
| X12 | Does not follow individuals (AN or BED) for at least 1 year |
| X13 | Does not follow BN patients 3 months |
| XL | Not retrievable from library |
Free Full text in PMC]