The Agency for Healthcare Research and Quality (AHRQ), formerly the Agency for Health Care Policy and Research, through its Evidence-based Practice Centers (EPCs), sponsors the development of evidence reports and technology assessments to assist public- and private-sector organizations in their efforts to improve the quality of health care in the United States. The reports and assessments provide organizations with comprehensive, science-based information on common, costly medical conditions and new health care technologies. The EPCs systematically review the relevant scientific literature on topics assigned to them by AHRQ and conduct additional analyses when appropriate prior to developing their reports and assessments.
To bring the broadest range of experts into the development of evidence reports and health technology assessments, AHRQ encourages the EPCs to form partnerships and enter into collaborations with other medical and research organizations. The EPCs work with these partner organizations to ensure that the evidence reports and technology assessments they produce will become building blocks for health care quality improvement projects throughout the Nation. The reports undergo peer review prior to their release.
AHRQ expects that the EPC evidence reports and technology assessments will inform individual health plans, providers, and purchasers as well as the health care system as a whole by providing important information to help improve health care quality.
We welcome written comments on this evidence report. They may be sent to: Director, Center for Practice and Technology Assessment, Agency for Healthcare Research and Quality, 6010 Executive Blvd., Suite 300, Rockville, MD 20852.
| John M. Eisenberg, M.D. | Douglas B. Kamerow, M.D. |
| Director | Director, Center for Practice and Technology Assessment |
| Agency for Healthcare Research and Quality | Agency for Healthcare Research and Quality |
| The authors of this report are responsible for its content. Statements in the report should not be construed as endorsement by the Agency for Healthcare Research and Quality or the U.S. Department of Health and Human Services of a particular drug, device, test, treatment, or other clinical service. |
This report was funded by the Agency for Healthcare Research and Quality (AHRQ) and developed with the help of many dedicated contributors. We are especially grateful for the input received from our partner organizations, the American Academy of Pediatrics (AAP) and the American Psychiatric Association. The members of the AAP Subcommittee on ADHD were especially helpful during the formulation of the research questions and the development of the data extraction forms. The support and commitment from our contacts at AHRQ were invaluable: Dr. David Atkins, the AHRQ Task Order Officer; Jacqueline Besteman; Al Deal; and the late Therese Dick, the AHRQ Contract Officer for this Task Order. It was with great sadness that we learned of Therese's untimely passing in the fall of 1998.
Thanks go to the San Franciso Cochrane Center for coordinating the peer review process, to the peer reviewers, and to Dr. Patricia Huston for her development of a coherent and constructive synthesis of the peer review commentaries. Several people provided us with information that was otherwise unavailable to us, including Drs. G. DuPaul, P. Jensen, and J. Swanson and members of the MTA Cooperative Group and the Canadian Coordinating Office for Health Technology Assessment (CCOHTA) research team. Special thanks to our colleagues at McMaster University for their enthusiasm, vision, and continued support.
To determine (a) the long-term and short-term effectiveness and safety of pharmacological and nonpharmacological interventions for attention-deficit/hyperactivity disorder (ADHD) in children and adults and (b) whether combined interventions are more effective than individual interventions.
MEDLINE (from 1966), CINAHL (from 1982), HEALTHStar (from 1975), PsycINFO (from 1984), EMBASE (from 1984), and the Cochrane Library searches were completed in November 1997. Reference lists of eligible studies and files of members of the research team and partner organizations were also searched.
Studies were selected if they focused on the treatment of ADHD in humans and were published in any language as a full report in peer-reviewed journals. Studies including conditions other than ADHD were reported if separate subgroup analyses for patients with ADHD were provided.
3 Two reviewers independently extracted data for 41 variables on general characteristics, along with detailed information on interventions, outcomes, and tests. Differences were resolved by consensus or by a third researcher. Studies were not combined quantitatively because the quality of reporting was low and heterogeneity existed across outcome measures and tests.
Seventy-eight studies (77 randomized controlled trials) met the inclusion criteria.
Twenty-three studies compared drugs and showed few, if any, differences among methylphenidate (MPH), dextroamphetamine (DEX), and pemoline; studies comparing stimulants with tricyclic antidepressants (2) were inconclusive.
Six studies compared drugs with nondrug interventions and showed consistently that stimulants, particularly MPH, may be more effective than nonpharmacological interventions.
Twenty studies compared combination therapies with a stimulant or a nondrug intervention alone; no additional beneficial effects for combination therapies were shown.
Nine studies compared tricyclic antidepressants with placebo and showed that desipramine may be more effective than placebo; no consistent effect was shown for imipramine.
Fourteen studies (13 in school children and 1 in adults) evaluated long-term therapy (>12 weeks) and showed a trend to general improvement regardless of treatment, but the length of followup was inadequate. MPH may reduce behavioral disturbance in children with ADHD while it is taken. Academic performance does not appear to be improved with stimulants.
Twelve studies evaluated treatment in adults with ADHD. For MPH vs. placebo, the results were contradictory. Antidepressants may be effective in adults, but no beneficial effect was seen with pemoline, nicotine, or phenylalanine compared with placebo.
Thirty-two reports (29 studies) evaluated adverse effects of drug therapy; many of the side effects associated with stimulant use appear to be relatively mild and of short duration and to respond to dosing or timing adjustments. Data are inadequate on the long-term effects and severity of adverse effects of most interventions.
This report describes rigorous systematic reviews on the treatment of ADHD, ready for incorporation into evidence-based clinical practice guidelines or performance measures. The report also provides a detailed description of the many limitations of the evidence available and provides recommendations to fill existing knowledge gaps. Studies on ADHD have low reporting quality, methodological flaws, and heterogeneity across outcome measures and tests. A detailed description is included of the many limitations of the available evidence plus recommendations to fill existing knowledge gaps. Fulfilling such knowledge gaps will not be easy and will require genuine collaboration among decisionmakers.
This document is in the public domain and may be used and reprinted without permission except those copyrighted materials noted for which further reproduction is prohibited without the specific permission of the copyright holders.
Jadad AR, Boyle M, Cunningham C, et al. Treatment of Attention-Deficit/Hyperactivity Disorder. Evidence Report/Technology Assessment No. 11 (Prepared by McMaster University under Contract No. 290-97-0017). AHRQ Publication No. 00-E005. Rockville, MD: Agency for Healthcare Research and Quality. November 1999
Attention-deficit/hyperactivity disorder (ADHD) is one of the most common disorders diagnosed in children and adolescents. The American Psychiatric Association (APA) describes the essential feature as "a persistent pattern of inattention and/or hyperactivity-impulsivity that is more frequent and severe than is typically observed in individuals at a comparable level of development." Terms that have been used to describe children with "distractability, impulsivity, and usually overactivity" include minimal brain dysfunction/damage (MBD), hyperkinetic reaction, and hyperkinesis.
ADHD has been surrounded by great controversy involving clinicians, teachers, policymakers, parents, and the media. The range of opinion regarding the validity of ADHD extends from those who do not believe it exists and regard it as a myth, to those who believe that there is genetic and physiological evidence supporting its existence.
Prevalence estimates of ADHD vary according to the methods of ascertainment, diagnostic criteria, informants, and population sampled. According to the Diagnostic and Statistical Manual of Mental Disorders IV (DSM-IV), the prevalence of ADHD in school-age children is 3 to 5 percent. However, prevalence studies using the two previous versions of the DSM (DSM-III and DSM-III-R) in the United States, Canada, United Kingdom, Germany, and New Zealand have shown rates that vary from 1.7 to 16.1 percent. Although it was previously thought that ADHD remitted before or during adolescence, it has been estimated that more than 70 percent of hyperactive children continue to meet criteria for ADHD as adolescents and up to 65 percent as adults.
Problems with the diagnosis and treatment of this condition can also arise because approximately 65 percent of ADHD patients may have at least one comorbid disorder in the form of anxiety, communication, mood, conduct, oppositional defiant, and learning disorders; Tourette's syndrome; and subnormal intelligence. ADHD has been associated with impaired academic achievement, rejection by peers, and family resentment and antagonism. The rich terminology may reflect the broad spectrum and the high frequency with which it is described with other comorbid conditions.
There is also variation and controversy around the treatment of ADHD, which often includes stimulant medication. To reduce inappropriate variation in treatment, major organizations in North America have developed, or are in the process of developing, practice parameters or clinical practice guidelines to guide treatment decisions.
In 1997, the Agency for Healthcare Research and Quality (AHRQ) charged the McMaster University Evidence-based Practice Centre (MU-EPC) with producing an evidence report on the treatment of ADHD. The objectives of this work were to conduct a comprehensive systematic review of the literature on the treatment of ADHD, with input from different groups of stakeholders, and to support guideline development initiatives, while building on existing work and focusing on answerable, clinically relevant questions.
A multidisciplinary research team was assembled, with participation of members of the nominating organizations, the American Academy of Pediatrics (AAP) and the APA, local experts, and research staff. After multiple consultations and the evaluation of published systematic reviews and meta-analyses, the following general questions were selected as the focus of the evidence report:
What is the evidence from comparative studies on the effectiveness and safety, both short and long term, of pharmacological and nonpharmacological interventions for ADHD in children and adults?
Are combined interventions more effective than individual interventions?
To answer these questions, while avoiding the duplication of work, making efficient use of the resources available, and ensuring maximum added value, the scope of the evidence report focused on the following seven categories of research studies:
Studies with drug-to-drug comparisons of pharmacological interventions.
Placebo-controlled studies evaluating the effect of tricyclic antidepressants.
Studies comparing pharmacological with nonpharmacological interventions (drug vs. nondrug studies).
Studies evaluating the effect of long-term therapies (>12 weeks).
Studies evaluating therapies for ADHD in adults (>18 years of age).
Studies evaluating therapies given in combination.
Studies evaluating adverse effects of pharmacological interventions.
Numerous systematic reviews and meta-analyses have examined placebo-controlled trials of stimulant medication and have established consistently the short-term efficacy of these agents for core symptoms. Consequently, placebo-controlled trials evaluating stimulant medication were reviewed in this report only if they met the criteria for inclusion in any of the other six categories. In addition, the report focuses on head-to-head comparisons of pharmacological interventions and on head-to-head comparisons of pharmacological and nonpharmacological interventions. This was identified as the area of prime interest to clinicians rather than effectiveness of stand-alone interventions-either pharmacological or nonpharmacological.
In preparing the Report, we followed current standards for assessing and distilling research evidence. Citations of individual studies were regarded as potentially eligible and selected for further evaluation if they were randomized controlled trials (RCTs) that focused on the treatment of ADHD in humans and if they were published in peer-reviewed journals, in any language, as a full report. If the studies included conditions other than ADHD, they were included only if they provided separate analyses for patients with ADHD. We acknowledge that randomized controlled trials are imperfect tools, but so far they are our best probe to evaluate health care interventions. Non-RCTs were included if they provided data on adverse effects of interest collected over more than 16 weeks. More refined inclusion and exclusion criteria defined within each of the seven categories of research studies are described in the body of the report.
Inclusion of a study in the evidence report was decided by two members of the research team, by consensus, and on the basis of the information available in the full published articles.
Citations of potentially eligible studies were identified through a systematic search of:
MEDLINE (from 1966), CINAHL (from 1982) and HEALTHStar (from 1975), PsycINFO (from 1984), and EMBASE (from 1984) using the search strategy described below. The searches were completed in November of 1997 using the terms behavioral symptoms, attention-deficit disorder with hyperactivity, attention-deficit, hyperactivity, cognition disorders, minimal brain damage, minimal brain dysfunction, hyperkinetic syndrome, hyperkinetic reaction, impulsivity or inattention, and random, clinical trial, comparative, case control, or cohort.
The Cochrane Library (issue 4, 1997).
The reference lists any eligible article identified in any of the above sources.
Web sites of organizations funding research on the treatment of ADHD.
Files of members of the research team and partner organizations.
Data extraction forms were especially developed and tested for this project. The local research team, partner organizations, and the Task Order Officer (TOO) were consulted and the forms approved for content. More than 41 different variables describing the general characteristics of the study were recorded. In addition, detailed information on interventions, outcomes, and tests were extracted from each of the full reports, independently by two reviewers, with differences resolved by consensus or by a third member of the research team.
Descriptive statistics were calculated for all the variables. Evidence tables were constructed to summarize, globally and question by question, all the information extracted from the study reports. The local research team, in consultation with members of the partner organizations and the TOO, evaluated the overall quantity and quality of the data available and decided that meta-analysis would be inappropriate to summarize the evidence on each of the research questions and for each of the main categories of interest. The main reasons for this decision were substantial clinical heterogeneity across the studies (e.g., therapies evaluated, patient populations, duration), inconsistency in outcome measurements, low methodological quality, and incomplete data reporting (see detailed descriptions within each category). Therefore, this report represents a systematic qualitative review of the existing evidence, emphasizing the implications for clinical practice and the opportunities for future research to fill existing knowledge gaps.
A total of 2,405 citations were identified by the search strategies. Ninety-two reports, describing 78 different studies, met all the inclusion criteria.
Overall, numerous deficiencies in the reporting of available RCTs limit the assessment of their validity, relevance, precision, and, therefore, their clinical application. Most studies did not clearly describe clinically important information such as the primary outcomes of interest, the presence of comorbid disorders, the characteristics of the patients' families, compliance with treatment, and baseline measurement of outcomes of interest. There was little information on the treatment of ADHD in minority groups.
The small sample size of most studies limited their power to detect meaningful clinically important differences among the interventions.
Ninety-seven percent of the reports of RCTs did not describe the method of randomization. Ninety-five percent did not describe efforts to conceal allocation from the investigators who recruited the patients into the study (e.g., allocation codes were obtained by telephone after a patient accepted to enter the study). Eighty-seven percent did not describe the number of withdrawals and dropouts and the reasons for such in each of the groups. These limitations increased the likelihood of biased results.
Comparison or synthesis of data across studies was limited by the low quality of reporting and by the large number and heterogeneity of outcome measures and tests used in the studies. Researchers often used modified versions of the same tests (e.g., Conners) across studies but did not provide enough information on the modifications. This led to the conclusion in many instances that there is a lack of evidence on the effectiveness of clinically important interventions. It is important to recognize that this is different from finding evidence of the lack of effectiveness of the same interventions.
The following is a description of the main conclusions from each of the seven categories of interest:
Drug vs. drug comparisons: The limited evidence available from studies comparing different stimulants suggests that there are few, if any, short-term differences in effectiveness among methylphenidate (MPH), dextroamphetamine, and pemoline. The studies comparing stimulants to tricyclic antidepressants had many limitations and presented conflicting results.
Drug vs. nondrug comparisons: Despite the limitations in the individual studies, the results indicate consistently that stimulants are more effective than nonpharmacological interventions when compared head-to-head.
Combination therapies: Evidence is lacking to support the superiority of combination therapy over stimulant alone or superiority of combination therapy over nondrug intervention alone. A recent large trial found that combined treatment offers modest additional benefits over single-component treatments for non-ADHD areas of functioning.
Tricyclic antidepressants vs. placebo: The studies on desipramine, despite their heterogeneous designs, small sample sizes, and variable quality, suggest that desipramine is more effective than placebo. The studies evaluating imipramine show inconsistent results.
Long-term therapy: All but one of the studies available were restricted to school-age children. Few studies followed children for a period of time equivalent to the length of time children typically remain on these treatments or reported side effects or used outcome measures that are situation specific (e.g., measure outcomes at home and at school). These studies show a trend to general improvement over time regardless of treatment and support the need for long-term placebo-controlled studies. MPH appears to reduce behavioral disturbance in ADHD children as long as it is taken. However, there is no information on the reasons so many children discontinue medication. The studies available provide little evidence for improvement in academic performance with stimulants, even though MPH treatment appears to produce consistent behavior improvement. The largest and most comprehensive study to date (Multimodal Treatment Study of Children with Attention-Deficit/Hyperactivity Disorder-the MTA Cooperative Group study) indicates that intensive behavior therapy, comprising child, family, and school-based interventions, adds little to the effects of long-term stimulant therapy. The MTA study also identified that the quality of supervision of medication may be an important factor in optimizing long-term therapeutic benefit. Lithium does not appear to be an effective alternative in patients who do not respond to stimulants.
Treatment of ADHD in adults: The few studies evaluating MPH vs. placebo show contradictory results. The study with the highest methodological scores suggests that MPH may be effective for the treatment of ADHD in adults. Antidepressants may be effective in adults. Studies (one each) comparing pemoline, nicotine, or phenylalanine with placebo did not produce evidence in favor of these medications. There were no studies designed to determine the proportion of adults with ADHD who will use and benefit from other interventions.
Adverse effects: Many of the side effects associated with stimulant use appear to be relatively mild and of short duration and respond to dosing or timing adjustments. However, data are inadequate on the long-term effects and severity of the adverse effects of most interventions. No comparative studies were identified with data on important adverse effects of interest, including potential for abuse of stimulants, liver toxicity due to pemoline, or major arrhythmia with tricyclic antidepressants in patients with ADHD.
Areas needing further research include the following:
Effective strategies are needed to improve the quality of the study designs and reports. Journals that publish articles on the treatment of ADHD could benefit by the endorsement of criteria such as those included in the Consolidation of the Standards of Reporting Trials (CONSORT) statement, which has been adopted by over 70 leading peer-reviewed journals.
Larger studies with more rigorous design and longer term followup are needed to establish the effectiveness and adverse effects of most interventions in both children and adults.
The field would benefit from empirical methodological research evidence indicating the added value of nonrandomized within-subject and single-subject research designs for direct head-to-head comparisons between psychosocial interventions and other treatments.
Research groups should make efforts to select a core set of validated and clinically relevant outcomes to be measured in all the studies in addition to any other outcomes of interest to the specific groups of researchers.
More rigorous studies are clearly needed to establish the relative effectiveness of stimulants and tricyclic antidepressants and to compare the effects of stimulants with clonidine, buproprion, or selective serotonin-reuptake inhibitors.
More definitive studies are needed to determine the added value of nondrug interventions when patients are already receiving stimulants, as well as the value of adding stimulants when nondrug interventions fail to achieve the desired outcomes. These studies, however, will require complex designs, substantial amounts of resources, and efficient collaboration among research groups. The MTA study is an example of this type of collaborative effort.
Studies are also needed to determine whether comorbid factors (e.g., anxiety and depressive disorders) influence response to treatment.
Studies are required to assess the severity of most adverse effects associated with stimulant medication and to evaluate, explicitly, the tradeoff between improvement in ADHD symptoms and signs and adverse effects. However, such a study is only worthwhile if the perspectives of all interested parties (parents, teachers, and patients) are included in the exercise.
Reports of effectiveness and adverse effects almost always come from parents and/or teachers. One study (of adolescents) showed some important differences between parents and adolescents in the side-effects profile reported by each group. Data need to be collected from children on effectiveness and adverse side effects in order to gain a better understanding of the implications of treatment from the patients' perspective.
A better understanding of the distinctions between "adverse effects" of therapy and concomitant characteristics of ADHD is needed. A number of reports discuss the high prevalence of "side effects" reported on placebo. Many of these may be associated with problem behaviors. Including such behaviors distorts the context for evaluating the importance of side effects.
There were very few female patients in the available studies. A possibility exists that effectiveness and adverse effects vary by gender. This is an issue that needs to be examined or at least discussed.
RCTs are of limited power to evaluate adverse effects, particularly rare ones or those that appear during long-term therapy. Only one comparative non-RCT with adequate data was found and it provided limited information. More observational studies are required (particularly case-control or cohort studies). Knowledge of adverse effects may also improve through more creative use of existing drug databases.
Few studies have been supported financially by sources other than the Government or pharmaceutical companies. There is a great opportunity for consumer groups to support more research activities, given the number of important questions that remain unanswered and the implications of the results of research on the public.
Conducting research on the treatment of ADHD is not easy, given the complexity of the disorder, the frequent presence of comorbidity, and the variety of interventions and outcomes available. Future research efforts will require commitment among different groups of stakeholders.
In summary, this report includes seven systematic reviews that incorporate state-of-the-art methodology, represent the most rigorous systematic review conducted to date, and are ready for incorporation into evidence-based clinical practice guidelines or performance measures. The report also provides a detailed description of the many limitations of the evidence available and provides recommendations to fill existing knowledge gaps. Filling such gaps will not be easy and will require highly innovative efforts and collaboration among different groups of decisionmakers. The MTA study confirms that large-scale, long-term collaboration among researchers is possible. If this field continues to produce small, incompletely reported studies with heterogeneous designs, instead of the high-quality collaborative efforts required, research in this area will continue to be abundant but will be of little value to guide most clinically relevant decisions.
Attention-deficit/hyperactivity disorder (ADHD) is one of the most common psychiatric disorders diagnosed in children and adolescents (American Academy of Child and Adolescent Psychiatry, 1997a; Goldman, Genel, Bezman et al., 1998), estimated to affect 3 to 5 percent of school age children (NIH Consensus Statement Online, 1998). The American Psychiatric Association (APA) has proposed a series of diagnostic criteria for ADHD (Appendix A), identifying as the essential feature "a persistent pattern of inattention and/or hyperactivity-impulsivity that is more frequent and severe than is typically observed in individuals at a comparable level of development" (American Psychiatric Association, 1994). Over the years, different terms have been used to describe children with "distractability, impulsivity and usually also overactivity," including minimal brain dysfunction/damage (MBD), hyperkinetic reaction, and hyperkinesis (American Academy of Child and Adolescent Psychiatry, 1997a). ADHD is not a unitary entity. Subtypes of ADHD can be constructed based on differences in phenomenology, comorbidity, and etiology. These different subtypes may differ in etiology, natural history, and response to treatment. The prominence of particular core "symptoms" (they are really signs) of ADHD-inattentiveness, hyperactivity, and impulsiveness-is the basis for the primary distinction among ADHD subtypes. The Diagnostic and Statistical Manual of Mental Disorders IV (DSM-IV) distinguishes among a primarily inattentive subtype, a hyperactive-impulsive subtype, and a combined subtype (American Psychiatric Association, 1994). Inattentive ADHD children may be lethargic, shy, and prone to daydream and have a higher incidence of learning disabilities. Hyperactive-impulsive ADHD subjects may be more disruptive and aggressive.
Prevailing opinions regarding the validity of ADHD vary from those who do not believe it exists and regard it as a myth (http://www.healthy.net/library/books/ullman/chap3.htm;">http://www.healthy.net/library/books/ullman/chap3.htm; Weinberg and Brumback, 1992) to those who believe that underlying genetic and physiological evidence supports its existence (Kewley, 1998). Several features of ADHD contribute to the controversy: (1) it is a clinical diagnosis for which no laboratory or radiological confirmatory tests or specific physical features exist; (2) diagnostic criteria have changed frequently; (3) there is no curative treatment, so patients require long-term therapies; (4) therapy often includes stimulant drugs that are thought to have abuse or diversion potential; and (5) substantial differences in the rate of diagnosis and treatment occur across countries, particularly in the United Kingdom, Australia, Canada, and the United States (American Academy of Child and Adolescent Psychiatry, 1997b; Goldman, Genel, Bezman et al., 1998; Kewley, 1998; NIH Consensus Statement Online, 1998; Orford, 1998).
Prevalence estimates of ADHD vary according to the methods of ascertainment, diagnostic criteria, informants, and population sampled (American Academy of Child and Adolescent Psychiatry, 1997a; Elia, Ambrosini, and Rapoport, 1999; Goldman, Genel, Bezman et al., 1998). According to the DSM-IV, the prevalence of ADHD among school-age children is 3 to 5 percent (American Psychiatric Association, 1994). However, prevalence studies using the two previous versions of the DSM (DSM-III and DSM-III-R) in the United States, Canada, United Kingdom, Germany, and New Zealand have shown rates that vary from 1.7 to 16.1 percent (Goldman, Genel, Bezman et al., 1998). ADHD has been identified as the most common cause of referrals to child and adolescent psychiatric health services (American Psychiatric Association, 1994). Consistently, the disorder is diagnosed more frequently in males than in females (American Academy of Child and Adolescent Psychiatry, 1997a; American Psychiatric Association, 1994). Even in children rated by teachers as meeting criteria for any subtype of ADHD, fewer girls than boys receive an ADHD diagnosis (Wolraich, Hannah, Pinnock et al., 1996).
Although it was previously thought that ADHD remitted before or during adolescence, it is now estimated that more than 70 percent of hyperactive children continue to meet criteria for ADHD as adolescents and up to 65 percent as adults (Barkley, 1996; Goldman, Genel, Bezman et al., 1998).
Problems with the diagnosis and treatment of ADHD can also arise because of the presence of comorbidity, in the form of anxiety disorders, communication disorders, mood disorders, conduct disorders (CDs), oppositional defiant disorder (ODD), learning disorders, Tourette's syndrome, and subnormal intelligence (American Psychiatric Association, 1994; Goldman, Genel, Bezman et al., 1998; Orford, 1998). In fact, it has been estimated that as many as 65 percent of children with ADHD have at least one comorbid disorder (Goldman, Genel, Bezman et al., 1998). ADHD with comorbid oppositional defiant and/or conduct disorder is the most prevalent comorbidity. Children with ADHD and comorbid ODD/CD are more likely to be males, live in families with higher rates of psychosocial adversity, and have a poorer prognosis characterized by antisocial behavior, substance use disorder, and poor scholastic success. ADHD with ODD/CD may represent a genetic subtype (Biederman, Faraone, and Keenan, 1992; Cadoret and Stewart, 1991). However, the evidence for differential response to medication of ADHD with and without comorbid ODD/CD is not strong (Barkley, McMurray, Edelbrock et al., 1989). Children with comorbid CD may be less likely to comply with treatment regimens. Similarly, their families may be less adherent to treatment.
Comorbid internalizing disorders such as anxiety or depressive disorders are also common (reviewed in Jensen, Martin, and Cantwell, 1997). Few differences in clinical or demographic features have been found between ADHD with and without comorbid anxiety: ADHD in patients with anxiety is associated with more psychosocial risk factors (Biederman, Milberger, Faraone et al., 1995) and possibly with less impulsivity (Pliszka, 1992). This combination does not appear to be a distinct genetic subtype of ADHD (Biederman, Faraone, and Keenan, 1992). However, evidence shows that children with a diagnosis of ADHD with comorbid anxiety may respond less well to stimulants and better to antidepressants (Jensen, Martin, and Cantwell, 1997).
The presence of ADHD has been associated with some individual and family features. Patients have impaired academic achievement and may be rejected by peers, and their family relationships are often characterized by resentment and antagonism (American Psychiatric Association, 1994). Increased levels of parental frustration, marital discord, and divorce occur in families who have children with ADHD (NIH Consensus Statement Online, 1998). Individuals with ADHD also tend to consume a disproportionate share of resources and attention from educators, the health care system, social service agencies, and the legal system. (NIH Consensus Statement Online, 1998).
Many different treatment modalities have been proposed for patients with ADHD, including, among others, pharmacological agents (e.g., stimulants, antidepressants), cognitive/behavioral techniques, dietary interventions, and combinations of any of the above. However, given the present debate around the existence of ADHD, the variation in estimates of prevalence, and the frequency with which comorbid disorders are identified, it is not surprising to find wide variation and controversy around its treatment. This variability in treatments in North America led to the development of practice parameters by the American Academy of Child and Adolescent Psychiatry (AACAP) (American Academy of Child and Adolescent Psychiatry, 1997b) and has motivated the American Academy of Pediatrics (AAP) to develop clinical practice guidelines. The APA is currently contemplating how to use the available information, including this report, in the development of quality improvement projects.
Since 1997, the Agency for Healthcare Research and Quality (AHRQ) has been supporting initiatives that could facilitate the production of clinical practice guidelines and quality improvement tools and inform consensus conferences. One of these efforts is a joint initiative involving the AAP and Technical Resources International, Inc., to gather and analyze evidence on the prevalence and diagnostic screening of ADHD.
In June 1997, the U.S. Department of Health and Human Services announced the start of a new program through which the AHRQ would award contracts to institutions in the United States and Canada to serve as Evidence-based Practice Centers (EPCs). The mission of these EPCs, of which McMaster University is one, is to review all the relevant scientific literature on health care topics assigned to them by AHRQ. The main task of the EPCs is to produce "evidence reports" that will serve as the scientific foundation for public and private sector organizations to develop tools and strategies for improving the quality of the health care services. The first set of topics was nominated by a group of public and private sector organizations in response to a solicitation published by AHRQ in November 1996. On August 21, 1997, the McMaster University EPC (MU-EPC) was invited to submit a proposal for the development of an evidence report on the treatment of ADHD. This topic had been nominated by the AAP and the APA. Dr. David Atkins was identified by AHRQ as the Task Order Officer (TOO) for this Task Order.
The MU-EPC was notified on September 30, 1997, that it was successful in its bid to undertake the development of an evidence report on the treatment of ADHD, which is described in this document. The objectives of this Task Order were to conduct a comprehensive systematic review of the literature on the treatment of ADHD and to support guideline development initiatives by the nominating organizations, while building on existing work and focusing on answerable, clinically relevant questions.
During the period in which this Task Order was in development, two professional organizations, the American Medical Association (AMA) and AACAP produced important reports on the treatment of ADHD (American Academy of Child and Adolescent Psychiatry, 1997a; Goldman, Genel, Bezman et al., 1998).
The report by the AMA was produced by the Council of Scientific Affairs. It had some elements of a systematic review, as it included a clear description of the research questions and provided a description of the search strategy used to identify the relevant literature. The report, however, did not describe the inclusion or exclusion criteria for articles, the methods to assess the validity of the information, or the strategies used for data synthesis. The report by the AMA addressed six questions to inform current professional and public concern about ADHD. Two of the questions relate to issues addressed by this Task Order:
What constitutes optimal treatment for ADHD, and how do stimulants fit into it?
What are the adverse consequences of using stimulants, and, in particular, what is known about the risks of abuse and diversion?
The report by the AACAP was produced by the Work Group on Quality Issues and approved by the AACAP Council in February 1997. It also included a description of the search strategy used to identify literature on the assessment and treatment of children, adolescents, and adults with ADHD. The report did not present the issues of interest as questions but was structured following clear sections by topics of interest. As in the AMA report, no description of the inclusion and exclusion criteria, the criteria to assess the validity of the information, or the strategies used to synthesize the information was included. Because reports were published in peer-reviewed journals, it is possible that the information was not provided because of space limitations.
The National Institutes of Health (NIH) held a consensus planning conference on ADHD in November 1998. The NIH has also funded a long-term, collaborative Multimodal Treatment Study of Children with ADHD (the MTA Collaborative Group study) to address issues emphasized by the Institute of Medicine study Research on Children and Adolescents with Mental, Behavioral, and Developmental Disorders, the National Institute of Mental Health (NIMH) National Plan for Research on Child and Adolescent Mental Disorders, the Healthy People 2000, and Healthy Children 2000. This study examined long-term (14 months with a 24-month followup) effectiveness of medication for treatment of ADHD vs. behavioral treatment vs. both and compared state-of-the-art treatment with routine community care. In a parallel-groups design, 576 children (age 7 to 9 years) with ADHD were thoroughly assessed and randomized to 4 conditions: (1) medication alone, (2) psychosocial treatment alone, (3) the combination of both, or (4) community comparison (Richters, Arnold, Jensen et al., 1995). The results of this study became available to us in early 1999 and have been incorporated into this report.
The research team for this report was also aware that a systematic review that included placebo-controlled studies evaluating the effect of stimulants had been commissioned by the Canadian Coordinating Office for Health Technology Assessment (CCOHTA) to a group of researchers at the University of British Columbia in Canada. Relevant results from that effort have also been incorporated into this Task Order Report.
A multidisciplinary research team was assembled, with participation of members of the nominating organizations (the subcommittee of the AAP on ADHD and the Deputy Medical Director of the APA), consumer groups, local experts, and research staff (Appendix B). The local experts and research staff constituted a "local research team." All others were regarded as partners. To identify the research questions for this Task Order, the group engaged in multiple consultations and considered the findings of a systematic review of published systematic reviews and meta-analyses (Appendix C). This process led to the identification of the following questions to be addressed by the evidence report:
What is the evidence from comparative studies on the effectiveness and safety, both short and long term, of pharmacological and nonpharmacological interventions for ADHD in children and adults?
Are combined interventions more effective than individual interventions?
An extensive period of time at the outset of the report process was spent consulting with members of the nominating partner groups, AHRQ, and the research team in order to refine the general questions. To answer these questions and avoid duplication of work, make efficient use of the resources available, and ensure maximum added clinical value, the scope of the evidence report focused on the following seven categories of research studies:
Studies with drug-to-drug comparisons of pharmacological interventions.
Placebo-controlled studies evaluating the effect of tricyclic antidepressants.
Studies comparing pharmacological with nonpharmacological interventions (drug vs. nondrug studies).
Studies evaluating the effect of long-term therapies (>12 weeks).
Studies evaluating therapies for ADHD in adults (>18 years of age).
Studies evaluating therapies given in combination.
Studies evaluating adverse effects of pharmacological interventions.
The above categories will be described in detail in the Methodology chapter. A separate category of studies comparing the short-term effect (<3 months) of stimulants with placebos was not included in the scope of work for this Task Order. Individual studies comparing stimulants with placebo were included in this report only if they met the inclusion criteria for any of the other categories. The main reason behind this decision was that, as mentioned above, another group had been commissioned to cover this issue. Most of the members of the research team considered that even if an additional review was conducted, the conclusions would not be different from those obtained by the researchers at University of British Columbia, given that most studies to date have shown consistently that stimulant medication improves core symptoms, at least in the short term (Kavale, 1982; Ottenbacher and Cooper, 1983; Thurber and Walker, 1983). Therefore, it was concluded that a new systematic review on this topic would have added little to the current state of knowledge and would have consumed all the resources available to answer other questions. Instead, the team decided to focus attention on studies comparing the long-term effects (>12 weeks) of stimulants with placebo and complement the findings with those of previous systematic reviews and the concurrent work conducted at the University of British Columbia. In summary, focusing the scope of work on the seven categories of studies described above was the strategy used by the research team to ensure maximum added value, avoid duplication of work, and present the unique contributions of this Task Order to current knowledge on the treatment of ADHD in children and adults.
Citations of individual studies were regarded as potentially eligible and selected for further evaluation if they met the following generic criteria:
They focused on the treatment of ADHD in humans.
They were published, in any language, in peer-reviewed journals as a full report.
Hard copies were obtained for all potentially eligible studies. They were independently assessed by two members of the research team who then decided, by consensus, whether to include the study in one or more of the categories covered in the evidence report. If the studies included conditions other than ADHD, they were included if separate analyses for patients with ADHD were provided.
With the exception of the systematic review of adverse effects, the focus was placed on evidence provided by randomized controlled trials (RCTs), the simplest and more powerful research design to evaluate the efficacy and effectiveness of interventions. The specific inclusion criteria for each of the categories listed below were agreed to by all the members of the research team.
A study was included in this category if it was an RCT that met all the generic eligibility criteria of this Task Order and if it included at least one of the following head-to-head comparisons:
Stimulants (methylphenidate [MPH], dextroamphetamine [DEX], or pemoline) vs. stimulants. Trials comparing the same drug (e.g., MPH vs. MPH) were included if different formulations (e.g., sustained-release vs. regular) or different enantiomers (e.g., l-MPH vs. d-MPH) were compared.
Stimulants (as above) vs. tricyclic antidepressants (desipramine, imipramine, or amitriptyline).
Stimulants (as above) vs. clonidine, buproprion, or selective serotonin-reuptake inhibitors (fluoxetine or paroxetine).
In this category, a study was included if it was an RCT that met all the generic eligibility criteria of this Task Order and if it included a comparison of placebo with amitriptyline, imipramine, or desipramine.
In this category, a study was included if it was an RCT that met all the generic eligibility criteria of this Task Order and if:
One of the study arms included only a stimulant drug (MPH, DEX, or pemoline).
One or more of the control arms included other modes of intervention such as behavior modification, dietary interventions, or other psychosocial intervention.
This category did not include comparisons of nondrug interventions against placebo or against other nondrug interventions.
In this category, a study was included if it was an RCT that met all the generic eligibility criteria of this Task Order and if:
One of the study arms included two or more interventions given in combination.
One of the study arms included a stimulant (MPH, DEX, or pemoline).
This section aimed to answer the second primary question of this evidence-based report: Are combined interventions more effective than therapy with single-stimulant medication?
ADHD is a persistent disorder, and many of its adverse consequences such as antisocial disorder and substance abuse do not arise until many years after the condition is first detected and treated. Consequently, treatments have to be of sufficient intensity and duration to have an impact on these adverse outcomes and followup has to be of sufficient duration to be able to detect several of the most important treatment outcomes. Few long-term followup studies of the effectiveness of treatment for ADHD have been conducted even in the area of medication trials. To be able to compare the differential impact of short- and long-term medication trials, an arbitrary distinction was made between studies that provided treatment for fewer than or more than 12 weeks. This criterion permitted identification of a sufficient number of "long-term" studies from which to draw preliminary conclusions. This distinction does not, of course, imply that 12 weeks of treatment defines optimal extended treatment.
Against this background, a study was included in the review if it met all the generic eligibility criteria of this Task Order and if treatments were evaluated under randomized conditions for 12 weeks or more in all the study arms. Studies were excluded if the treatments were given for fewer than 12 weeks, even if outcome measurements were obtained more than 12 weeks after randomization.
In this category, a study was included if it met all the generic eligibility criteria of this Task Order and if treatments were evaluated in patients older than 18 years of age.
In this category, a study was included if it was an RCT that met all the generic eligibility criteria of this Task Order and if it provided data on at least one of the adverse effects of interest in all the study arms. The adverse effects of interest were the following:
For stimulants: changes in appetite, effects on growth (both in terms of height and weight), somatic effects (headaches, abdominal pain, sleep dysfunction), mood changes (crying, irritability, sadness/depression, withdrawal), motor tics, and drug addiction.
For pemoline only: liver toxicity.
For tricyclic antidepressants: cardiac arrythmias.
Non-RCT studies were included if they met all of the other inclusion criteria and if they evaluated adverse effects associated with treatment for more than 12 weeks and included more than 10 patients. RCTs are usually of insufficient duration to detect rare adverse events or those that take long periods of time to become apparent (Levine, Walter, Lee et al., 1994; Sackett, Richardson, Rosenberg et al., 1997).
Citations of potentially eligible studies were identified through a systematic search of:
MEDLINE (from 1966), CINAHL (from 1982) and HEALTHStar (from 1975), PsycINFO (from 1984), and EMBASE (from 1984) using the search strategy described in Appendix D. All databases were searched from the date of their release to November of 1997. The strategy was designed for search of MEDLINE and was modified to meet the specific features of CINAHL, EMBASE and PsycINFO.
The Cochrane Library (issue 4, 1997).
The reference lists any eligible article identified in any of the above sources.
Web sites of organizations funding research on the treatment of ADHD.
Files of members of the research team and partner organizations.
Data extraction forms were especially developed and tested for this project (Appendix E). After consultation with all the members of the local research team, the partners, and the TOO, the forms were approved for content. Two reviewers extracted data independently from each of the full reports. Any differences were resolved by consensus and by referring to the information in the original report. Any differences that could not be resolved by the two reviewers who extracted the data were resolved by the Task Order Leader or by a member of the local research team designated by him.
The original reports were not masked because an empirical methodological study showed results indicating that masking was time consuming and did not have an important impact on the results of systematic reviews (Berlin, 1997).
The information extracted addressed 41 different aspects (Appendix E, pages 1 to 5) of the studies, which were selected by the team members a priori. Of these, 25 elements were regarded as essential to judge the validity of each of the studies. The selection of five of these elements (questions 37 to 41 in Appendix E) was supported by empirical methodological evidence from studies that showed a direct relationship between these elements and the likelihood of bias in RCTs. The remaining 20 elements were selected by consensus among all the research team members as clinically important but are not based on empirical methodological research.
These elements included questions 37 to 41 in Appendix E. Questions 37 to 39 describe all the items of the only validated scale to assess the methodological quality of RCTs (Jadad, Moore, Carroll et al., 1996) (Appendix F). This scale assesses whether the studies describe randomization, double blinding, and withdrawals. It produces a minimum score of 0 points and a maximum score of 5 points. The higher the score, the better the methodological quality of the RCT (Jadad, Moore, Carroll et al., 1996). Studies that are given scores of <2 points, including RCTs in mental health, have been shown to exaggerate the estimates of the effects of interventions, on average, by more than 30 percent (Moher, Pham, Jones et al., 1998). Measured by this scale, even when the trials are not double blind, as was the case in some trials in this review, they can still be awarded 3 points if the reports include a description of appropriate methods to generate the randomization sequence (2 points) and a detailed account of withdrawals and dropouts (1 point).
The likelihood of bias in the RCTs was also assessed by determining whether allocation of individuals to the different study groups had been concealed until after consent was obtained from prospective participants to be part of the study (question 40 in Appendix E). Studies in which allocation was unclear or inadequately concealed have been shown to exaggerate the estimates of the effects of interventions, on average, by more than 35 percent (Moher, Pham, Jones et al., 1998; Schulz, Chalmers, Hayes et al., 1995). It is important that the difference is recognized between biases that are the result of lack of allocation concealment and biases that arise from lack of blinding. Allocation concealment helps to prevent selection bias, protects the randomization sequence before and until the interventions are given to study participants, and can always be implemented. Blinding helps prevent ascertainment bias, protects the randomization sequence after allocation, and cannot always be implemented (Schulz, Chalmers, Hayes et al., 1995).
In addition, information on association between the authors and investigators and the pharmaceutical or related industry was also sought (question 41 in Appendix E). It has been shown that reports of trials sponsored by pharmaceutical companies are more likely to favor the experimental intervention over controls than trials not sponsored by pharmaceutical companies (Bero and Rennie, 1996; Cho and Bero, 1996).
The 20 remaining elements referred to patient characteristics (questions 18, 24, 25, 26, and 32 in Appendix E), sampling issues (questions 14, 15, 17, 19, 21, 22, and 31), diagnosis (questions 28, 29, and 30) and treatment issues (questions 23, 34, 35, and 36). The following is a brief description of these elements and the theoretical effects that they may have on the validity and applicability of research on the treatment of ADHD:
The outcome of the interventions may vary as a function of age (question 24 in Appendix E), gender (question 25), intellectual abilities (question 26), and family characteristics (question 32).
In addition, findings in a specific ethnic group may not apply to others (question 18). Professionals must determine whether the patients in the sample employed in a study are comparable to those patients to whom the intervention will be applied.
The Number of Eligible Patients (question 14 in Appendix E): This constitutes the sampling frame of subjects available for study. If more subjects are eligible than needed, probability sampling should be used to identify study participants so that the study findings are generalizable to a defined group. In addition to this, the investigator should report on the response to enlistment, including the number of subjects who decline and accept the invitation to participate. The ratio of these two groups is a good indicator of the level of acceptability associated with the treatment options. In studies where a high percentage of subjects decline, the possibility exists that treatment is only applicable to a very small group of patients with distinctive features.
Number of Patients Randomized and Analyzed (questions 15, 17): The proper denominator for evaluating treatment effectiveness is the number of patients randomized. Subject withdrawals and losses to followup often lead to fewer subjects for analysis. The magnitude and distribution of these losses across treatment groups will have important implications for understanding treatment acceptability and effectiveness. In the absence of this information, study results are virtually uninterpretable.
Treatment Setting (question 19): The study location (treatment setting) provides the context for carrying out an investigation. Embedded in context are many features (e.g., professional affiliation of the investigators; reputation, acceptability, and accessibility of the facility) that may have a bearing on both the characteristics of the subjects within a study catchment area and the acceptability and effectiveness of treatment.
Sample Origin (question 31): Subjects for treatment studies may come from clinic (inpatient and outpatient) and/or nonclinic populations. The origin of the sample is likely to have a strong bearing on the severity and complexity of the cases being treated and their prognosis. This influence may be independent of the diagnostic criteria used to determine subject eligibility. This information is important for assessing the applicability and generalizability of study findings.
Inclusion and Exclusion Criteria (questions 21, 22): Inclusion and exclusion criteria are used to determine patient eligibility for study enlistment. These criteria define the "target population" or the subjects for whom the study results are intended to apply. In the absence of these criteria, it is very difficult, if not impossible, to assess the limits of generalizability for a particular study. It also leaves the user of clinical studies in a quandary about the extent to which study findings apply to his or her patients.
Diagnostic Model Used (question 28 in Appendix E): Diagnostic criteria for ADHD have evolved over time. Various diagnostic models may well define samples that differ in natural history, severity, comorbidity, ADHD subtype, and response to treatment.
Comorbid Conditions (question 29, divided into two components, a and b): Information was extracted on whether comorbid disorders had been considered by the authors (question 29a) and on whether the patients included in the studies had comorbid disorders (question 29b). Emphasis was placed on ODD, CD, Tourette's syndrome, anxiety disorder, depressive disorder, learning disorder, and mental retardation. Information on the presence of comorbid disorders is important in order to judge the generalizability of the results of a study to a particular clinical setting. In addition, comorbid disorders may be associated with different responses to treatment or different levels of adherence to treatment.
Individual(s) Who Made the Diagnosis (question 30): In general, low agreement occurs among informants on the presence of the core symptoms of ADHD. Current diagnostic models accord considerable importance to evidence for pervasive symptoms. Reliance on a single informant may generate biased study samples differing in severity or comorbidity from those generated by other informants. For example, teacher-rated ADHD is more strongly associated with academic achievement than is parent-rated ADHD (Szatmari, Offord, and Boyle, 1989).
Identification of the Primary Outcome (question 23 in Appendix E): If the primary outcome is not specified a priori (or not specified at all) and all outcomes in a study are treated alike, authors are more likely to highlight those with the most striking results. In addition, the more outcomes that are analyzed, the greater the risk of finding false-positive, statistically significant results merely by chance. Identification of the primary outcome is also an essential step to estimate the power of the study to detect true-negative results.
Fidelity and Monitoring of Treatment (question 34): Fidelity reflects the extent to which treatment is delivered correctly. Fidelity also ensures that interventions that are not part of the treatment protocol are not inadvertently administered. Fidelity can be enhanced by training professionals who administer treatment, conduct treatment according to treatment manuals, self-monitor treatment administration, and conduct independent adherence checks. Monitoring and reporting fidelity allows the reader to determine whether potentially effective treatments were fairly tested. The treatment manuals developed to support clinical trials can assist the dissemination of effective treatments to community practitioners.
Measurement of Compliance with Treatment (question 35): Compliance reflects the extent to which patients correctly carry out treatment plans. Compliance might require the timely administration of correct doses of medication, the completion of parent training homework projects, or the consistent application of classroom behavior management strategies by teachers.
Availability of Baseline Test Scores in the Report (question 36): Baseline test scores permit identification of atypical samples (severity, comorbidity) and evaluation of the comparability of the subjects in various study groups at the outset.
In addition to the methodological information, data were also extracted on each of the arms, outcomes, and tests used in each of the studies (Appendix E, Study Arm Form and Outcome/Adverse Effects Form). The outcomes of interest selected a priori by the panel of experts (led by the AAP Task Force) included the following:
Core/Global "Symptoms": included global assessment of all symptoms, global assessment of core symptoms, and function performance.
Individual Core "Symptoms": included separate data for inattention (including inattention itself or listening), hyperactivity, and impulsivity (included impulsivity and self-control-related outcomes). Although these are often described as "symptoms" they are really "signs" of ADHD, as they are observed phenomena rather than subjective experiences of the patients. To conform to tradition, however, the term "symptom" will be used throughout the report.
School/academic Performance: included achievement tests, grades, verbal skills, reading, mathematics, spelling, and measures of social competence.
Depression/anxiety-related Outcomes: included measures of depression and anxiety, as well as emotional well-being, crying, sadness, global mood, and self-esteem.
Conduct/oppositional-disorder-related Outcomes: included specific measures for ODD, CD, aggressiveness, and other behavior disturbances.
Adverse Effects: as described above in the section inclusion/exclusion criteria.
Information on these outcomes was extracted regardless of the instrument or test used by the researchers within each of the studies. Only information gathered during the duration of the participants' being in the randomized groups was extracted. No data gathered after discontinuation of therapy were sought.
It is important to stress at this point that standards of quality in research are constantly being raised and that the assessment of the rigor of the studies included in this report was performed using current parameters. Early research that may be found liable to bias today could have been regarded as state of the art at the time it was conducted or published.
Descriptive statistics were calculated for all the fields of the database. Evidence tables were constructed to describe the most salient characteristics of the eligible studies. These tables, which can be found after the References in this report, summarize, globally and category by category, all the information that was extracted.
Within each of the categories, four sets of tables were produced. The first three sets were regarded as the main evidence tables to support the information included in the Findings chapter, whereas the fourth was regarded as a separate source of information for readers interested in the data extracted in a very detailed fashion. The sets of tables have the following general structure:
The first set contains a summary of the key characteristics of the studies reviewed, including the name of the first author, the year of publication of the study, the type of study design, the number of patients randomized (or the number of patients analyzed, if the number of patients randomized was not available), the diagnosis model, the interventions studied, the duration of exposure to each of the interventions, the quality scores, the number of elements extracted from the articles among those regarded as essential to judge the validity of each of the studies but that were not supported by empirical evidence (20 in total), the outcomes of interest measured (following the list provided above), and the key results for the outcomes of interest (expressed in terms of statistical significance set at p<0.05). Data on outcomes other than those identified a priori as being of interest were not reported in these tables. They were mentioned in the text of the report when they had clinical significance.
The second set of tables includes a summary of the presence or absence in each of the studies of each of the 20 elements extracted from the articles that were regarded as essential to judge the validity of each of the studies but that were not supported by empirical evidence. The last column of these tables provides a count of these elements by article. This count should not be regarded as a quality score because no evidence exists on the relative weights of each of these elements. The counts were included to provide the reader with additional information to guide his or her decisions. The cells of all the other columns only include 1 and 0, representing whether the element was present or not in the articles, respectively.
The third set of tables has the same structure as the second set, but instead of 1 and 0, the cells include the actual information provided in the study reports and extracted as part of the review.
The fourth set of tables focuses on the actual results of the studies. It contains the name of the first author, the year of publication, the interventions, the tests used to measure the outcomes, data on each of the outcomes of interest measured with each of the tests, and the statistical significance (p<0.05) of the results on the outcomes of interest across the study groups. This table, which was requested by the AAP subcommittee, was subdivided by group of outcomes to help establish whether the effect of the interventions varies by outcome (e.g., intervention A may improve core symptoms but not academic achievement). One limitation for the production and interpretation of this set of tables derives from the fact that many tests were used to measure the outcomes and that even when the same test was used, the authors often made modifications that were not clearly described in the reports. Because of the magnitude and complementary nature of the tables, they are included in the report as a separate section. To make the data extracted easier to understand, the information on the studies was organized in the tables following an alphabetical order. The studies are also presented in alphabetical order within the text. In addition, every effort was made to describe the studies according to their methodological quality, taking into account both the evidence-based criteria and the count of the elements that were not supported by empirical evidence.
The local research team, in consultation with members of the partner organizations and the TOO, evaluated the overall quantity and quality of the data available. This evaluation led to the conclusion that meta-analysis would be inappropriate to summarize the evidence on each of the research questions or for each of the main categories of interest. The main reasons for this decision were substantial clinical heterogeneity across the studies (e.g., therapies evaluated, patient populations, duration), inconsistency in outcome measurements, low methodological quality, and incomplete data reporting (see detailed descriptions within each category). The use of meta-analysis to synthesize this type of data has been associated with a greater chance of obtaining imprecise and potentially misleading results (Ioannidis, Cappelleri, and Lau, 1998). Therefore, this report represents a systematic qualitative review of the existing evidence, emphasizing the implications for clinical practice and the directions that future researchers could take to fill existing knowledge gaps.
Every effort was made to present the information obtained in each of the categories following a uniform format. Overall, each section begins with a general description of the most salient characteristics of the studies and ends with a summary of the main findings of the studies. The way in which the results from the individual studies are presented varies substantially across sections. This reflects the different perspectives that the questions provide on the treatment of ADHD. For instance, two of the sections focus on pharmacological interventions (e.g., drug-to-drug comparisons, and antidepressants vs. placebo), whereas others focus on combined interventions, length of therapy, specific populations (e.g., adults), and outcomes (e.g., adverse effects). The tables complementing the text in each of the sections, as mentioned above, follow the same format.
The production of the evidence report involved continuous communication among members of the local research team, the TOO, and representatives of the partner organizations.Three face-to-face meetings were held; and communication by telephone, fax, and electronic mail was frequent. The partner organizations provided input during each of the steps of the process, from the formulation of the questions through the literature search and data extraction, to data synthesis.
Potential peer-reviewers were identified in April 1998. Thirty-eight individuals were approached by the San Francisco Cochrane Center and asked if they would be willing to review this evidence report. Twenty of these people responded positively to this request, and the report was sent out to them in early July. The potential reviewers included 14 content experts (practicing physicians, psychologists, researchers, representatives of organizations interested in the treatment of ADHD), 3 consumer representatives, and 3 methodologists. All reviewers were sent a "Structured Format for Referee's Comments" (Appendix G) and encouraged to provide comments on the text. The comments were returned to our Criticism Editor, Dr. Patricia Huston, who synthesized them for the McMaster EPC. All comments were shared with the Task Order Officer. All comments were reviewed and, where feasible, incorporated into the final report. Comments were received from 15 of the 20 reviewers who were invited to comment (Appendix G).
The subset of RCTs was screened to determine the number of articles with evidence to answer the research questions of interest. The following is the number of reports found in each category (in parentheses, the number of trials described in the reports):
Drug vs. drug 22 (22)
Drug vs. nondrug 12 (6)
Combination therapies (stimulants plus others) 27 (20)
Tricyclic antidepressants vs. placebo 12 (9)
Long-term therapies (>12 weeks) 19 (14)
Treatment of ADHD in adults (>18 years of age) 13 (12)
Adverse effects of interest 32 (29) (one non-RCT)
A total of 92 reports were included in this project. Numerous overlaps exist among the categories. In addition, multiple reports of the same studies were found. Information from multiple reports of the same studies was combined and considered as a single data set. Once this was accounted for, 78 different studies were included. The evidence tables in this document are based on information from these 78 studies. The remaining articles were not included because they did not provide data on any of the categories or questions of interest for this project.
The evidence identified within each of the categories of interest will be described separately (see below).
An unpublished copy was obtained of the Canadian Coordinating Officer for Health Technology Assessment (CCOHTA)-funded report on the treatment of ADHD prepared by researchers at the University of British Columbia (Miller, Lee, Raina et al., 1998). As planned, its methodological quality was assessed and information on RCTs comparing stimulants with placebo was extracted.
The most salient characteristics of all the studies included (Evidence Tables A1 through A34) were the following:
Sixty-five of the 78 studies were identified in MEDLINE first. Other sources contributed little.
Most of the studies (48 of the 78) had a crossover design.
Most of the studies reported funding by a government agency (48 studies). The second source of funding was industry (22 studies). Charities and consumer groups funded few studies (seven studies).
The first study identified was published in 1971 and the last in 1999. One-half of the studies (34/78) were published since 1990.
All but six studies were published in North America.
More than one-half the studies had 3 to 5 authors (49 studies).
The lead author was typically a pediatrician or a psychiatrist.
All the studies but two described a single research center.
Seventy-five studies included men or boy patients, and 51 included women or girl patients.
DSM-IIIR and -IV criteria were used almost as often (28 studies) as DSM-I and -III (30 studies) for the diagnosis of ADHD.
The number of arms in the studies varied from 2 to 6; 54 studies had 2 or 3 arms.
Thirty-two different interventions were studied. Apart from placebo (64 studies), the most frequently evaluated intervention was MPH (55 studies), followed by DEX (18 studies), and cognitive/behavioral therapy (8 studies). Intervention with pemoline was evaluated in five studies.
No studies provided an analysis stratified by treatment setting.
Forty-eight different outcomes were measured in the 78 studies. The most frequently measured outcomes were inattention (29 studies) and hyperactivity (30 studies). Impulsivity was measured in 13 trials. Global assessment of symptoms was provided in 22 studies and combined scores for core symptoms, in 20.
Sixty-nine different tests were used to measure the outcomes. Conners was the most frequently used test, followed by spontaneous reports.
The reports of the studies included in the review had deficiencies that increase the likelihood of bias in their results:
One study (MTA Cooperative Group, 1999) was given the maximum quality score with the validated scale (Jadad, Moore, Carroll et al., 1996). The average quality score was 2 out of 5 points. Only five trials were given 4 points. Only 18 of the 77 randomized trials were given scores > 3 points. As mentioned above, it has been shown consistently that trials that receive < 2 points are more likely to exaggerate treatment effects (Berman, Ezzo, Hadhazy et al., forthcoming; Khan, Daya, and Jadad, 1996; Moher, Pham, Jones et al., 1998). The main reasons for the low scores were lack of description of the method of randomization (73/77 trials), lack of description of methods to achieve double blinding (59 trials), and poor description of withdrawals and dropouts (66 trials).
Seventy-three trials did not mention concealment of allocation to study groups.
An association with industry was reported in 27 studies. Ciba-Geigy was associated with 16 studies.
Most studies had one or more factors that limited their external validity or generalizability. Only one of the studies (MTA Cooperative Group, 1999) included information on all of the 20 clinically relevant elements selected a priori for extraction from the articles.
The following is a brief summary of the main problems found in the studies:
The total duration (including the time required to administer the interventions and assess their outcomes) was 12 weeks or less in 64 studies. Results obtained in short-term studies may not apply to the long treatment periods that are often required.
Overall, the description of the interventions evaluated in the studies, particularly nonpharmacological ones, was poor.
Fewer than one-third of the trials mentioned ethnic groups. The most frequently considered group was white, followed by African American (35 studies).
Twenty of the studies did not report the inclusion criteria or reported them unclearly. One-half of the trials did not report the exclusion criteria. Insufficient information on inclusion or exclusion criteria limits the ability of clinicians to compare their patients with those reported in published studies.
The primary outcome of interest to the researchers was not described in 61 of the studies. As mentioned above, the identification of the primary outcome, especially when it is done a priori, provides the basic elements for the calculation of the sample size required by the researchers to identify clinically important differences and minimize the risk of finding false-negative results. In addition, this is likely to reduce the tendency of researchers to give more prominence to secondary outcomes when the interventions do not produce the expected effects on the primary outcome of interest.
Most of the studies (64) included patients between the ages of 5 and 12 years.
Forty-three studies did not report information on comorbid disorders, which are frequently present in patients with diagnosis of ADHD. The presence of comorbid disorders may increase or reduce the effectiveness of interventions targeted to ADHD. In addition, these comorbid disorders could be improved or exacerbated by the same treatments.
Family characteristics were not mentioned in 45 studies.
The fidelity of treatments was not described in 52 studies, and compliance was not measured in 44 of the 78 studies. Knowing the adequacy with which treatments are administered by health care providers (fidelity) or the adherence of patients to the treatments (compliance) are two essential components to determine the generalizability of the results of any study.
Baseline measurements of ADHD symptoms were provided in 54 studies. The most frequently used test to generate baseline measurements was Conners (37 studies), followed by Weschler (23 studies). Information on baseline characteristics enables readers to compare their cases with those included in the studies.
The following seven sections provide a detailed description of the evidence and main findings within each of the categories of interest. Within each section, studies are discussed in detail when they provide information that could increase the understanding of the interventions under evaluation.
Full references for the studies cited can be found in the References chapter. The Bibliography contains all sources that were considered, including those used and not used.
Nine of the 22 studies diagnosed ADHD using DSM criteria. In two trials, the duration of the study arms was unclear (Matochik, Liebenauer, King et al., 1994; Srinivas, Hubbard, Quinn et al., 1992). The number of patients randomized to each of the study arms was not reported in eight studies. In the remaining 14 trials, a total of 650 patients were studied. The sample size of these studies ranged from 9 to 125. Of the 22 reports, only 5 were given quality scores of >3 points. Two studies described concealment of allocation (Arnold, Huestis, Smeltzer et al., 1976; Kauffman, Smith-Wright, Reese et al., 1981).
The number of clinically relevant elements present in each of the study reports ranged from 7 to 18 (Evidence Tables B2 and B3). Of the 22 reports, 6 reported the number of patients eligible for the study, 8 did not report inclusion criteria, and 15 did not report exclusion criteria. Eighteen trials did not report the primary outcome. In 19 trials, the fidelity of treatment was not reported. Only 6 trials provided measurements of compliance with treatment and 13 trials provided baseline scores.
The following were the main findings of the studies in this category:
DEX vs. levoamphetamine (Arnold, Huestis, Smeltzer et al., 1976): The main conclusion from this study is that both isomers were better than placebo but not significantly better than each other in terms of improvement of core symptoms and academic performance (Supplemental Tables B4 and B6). Only in one parent report of global symptoms was DEX better than levoamphetamine (Supplemental Table B4). Parents' ratings, but not teachers' or psychiatrists' ratings, showed significant placebo effects. This study included only 31 patients.
One trial used a triple-blind crossover design and was designed to look at compliance with treatment (Kauffman, Smith-Wright, Reese et al., 1981). This study found that the mean individual patient compliance during MPH administration was 65 percent (0 to 100 percent) and during DEX administration was 60 percent (20 to 83 percent). The authors stressed that poor compliance may be an important reason for the variability and discordance among the results of the studies in this area.
MPH vs. pemoline (Conners and Taylor, 1980; Stephens, Pelham, and Skinner, 1984). Both studies in this group were placebo-controlled, but only one reported data on effectiveness outcomes of interest (Conners and Taylor, 1980). This study received a score >3 points, provided information on 17 of the 20 clinically relevant elements, was published in 1980, and included 60 patients diagnosed as having minimal brain dysfunction. Patients were randomized to receive pemoline at 18.75 mg/d, MPH at 5 mg two times per day, or placebo for 8 weeks. Overall, the two drugs produced similar effects and were more effective than placebo (Evidence Tables B1, Supplemental Tables B4-B7).
DEX vs. pemoline (Conners, Taylor, Meo et al., 1972): In this study, 81 children with diagnosis of minimal brain dysfunction were randomized to receive pemoline at 25 mg, DEX at 10 mg, or placebo for 8 weeks. This study found both drugs to be better than placebo but found few differences in terms of effectiveness and adverse effects between them.
Only four studies, all comparing MPH with tricyclic antidepressants, were found. One trial compared MPH with desipramine (Rapport, Carlson, Kelly et al., 1993). This was a randomized, double-blind, crossover placebo-controlled study in which 16 patients were allocated to receive desipramine in titrated doses to achieve plasma levels between 125 and 225 mg/ml; MPH at 10, 15, or 20 mg; a combination of desipramine and MPH at the above levels; or placebo. The children included in this study had diagnoses of ADHD with depressive symptoms, ADHD and major depressive disorder, or major depressive disorder with prominent attentional symptoms. The main conclusions from this study were that MPH was consistently the most potent drug in affecting children's vigilance (Continuous Performance Test [CPT]) and ability to learn paired associations (Paired Associate Learning [PAL]). Additional improvements in performance were observed when desipramine was added. Desipramine alone, however, did not affect children's vigilance but did enhance paired association learning to a lesser extent than that achieved by MPH alone or in combination. The authors caution about the interpretation of their results because of the small sample size and the degree of psychopathology experienced by the children studied.
Three trials compared MPH with imipramine (Quinn and Rapoport, 1975; Spring, Yellin, and Greenberg, 1976; Werry, Aman, and Diamond, 1980). One did not provide data on effectiveness outcomes of interest (Spring, Yellin, and Greenberg, 1976). The other two trials provided inconsistent results. One showed nonsignificant differences between MPH and imipramine (Quinn and Rapoport, 1975), whereas the other showed some results in favor of imipramine (Werry, Aman, and Diamond, 1980).
In summary, the small group of studies comparing different formulations or enantiomers of the same stimulants suggests that there little difference between them, at least in the short term, but that the active medications are more effective than placebo. Similarly, despite the overall heterogeneity of the studies comparing different stimulants, they showed few, if any, differences among MPH, DEX, and pemoline. These findings agree with those of the AMA and AACAP reports. The sample size of the studies, however, is small.
Studies comparing stimulants with tricyclic antidepressants had many limitations and presented conflicting results. More rigorous studies are clearly needed to establish the relative effectiveness of stimulants and tricyclic antidepressants. Studies are required to compare the effects of stimulants with clonidine, buproprion, or selective serotonin-reuptake inhibitors.
Six studies met the selection criteria described previously. Five of these studies overlapped with those studies included in the combination therapies and will be described in more detail in the following section (Brown, Wynne, and Medenis, 1985; Conrad, Dworkin, Shai et al., 1971; Firestone, Crowe, Goodman et al., 1986; Klein and Abikoff, 1997; MTA Cooperative Group, 1999; see next section), and one study compared DEX with the dietary supplement Efamol (Arnold, Kleykamp, Votolato et al., 1989). This study did not describe the number of patients randomized, had a duration of 4 weeks, and provided information on 10 of the 20 clinically relevant elements.
Despite their limitations, the individual studies available suggest that stimulants may be more effective than nonpharmacological interventions when compared head to head.
Combined interventions included stimulants vs. a combination of two or more drugs, stimulants vs. stimulants plus nondrug interventions, and nondrug vs. stimulant plus nondrug interventions. For nondrug interventions, it was difficult to determine the exact nature of the behavioral interventions administered because of the limited amount of descriptive information available in the reports. (Refer to Evidence Tables D1--D3 and Supplemental Evidence Tables D4-D8).
The search yielded 20 studies that examined combination therapies and met the above selection criteria.
The number of clinically relevant elements found in the report of the studies varied from 10 to 20. Fifteen trials reported inclusion criteria, 9 reported the exclusion criteria, and only 4 reported the primary outcome of interest (Evidence Tables D2 and D3). Eighteen studies included MPH, 2 studies included DEX, and none assessed pemoline.
Three categories of comparisons were made by the 20 studies are described below.
Garfinkel, Webster, and Sloman, 1981: This study compared two doses of caffeine, combination of caffeine with MPH, MPH alone, and placebo. The sample consisted of six boys between 6 and 10 years of age attending a day treatment program at a psychiatric institution who had been referred because they were not progressing academically or socially while in a regular or special education class. The researchers found that the strongest effect was noted when caffeine was combined with MPH and that caffeine at a low dose was superior to caffeine at a high dose. This study is difficult to interpret because of the small sample size, sample selection, lack of measurement of compliance or adherence to regime, and use of teacher report only as an outcome measure.
Levy and Hobbes, 1996: This study examined the use of haloperidol in combination with MPH to determine whether haloperidol blocks the effect of MPH if given before MPH. The sample included 10 boys between 5 and 12 years of age who met the criteria for ADHD when DSM-III criteria were used but who also had comorbid ODD. Measure of outcome relied only on the Continuous Performance Test, and the result is difficult to interpret. MPH improves reaction time on the CPT, and this effect is eliminated by use of haloperidol before administration of MPH.
Schechter and Timmons, 1985: This study used a crossover design to compare DEX at 1.6 mg and 5.0 mg, DEX at 1.6 mg combined with caffeine at 300 mg, and caffeine at 300 mg alone. The sample included 10 patients between 6 and 10 years of age who were diagnosed as having hyperkinetic syndrome using criteria other than DSM. After 2 weeks, all treatments led to significant increase in reaction time, significant decrease in errors of commission, and decreased total scores on the Conners Abbreviated Parent Rating Scale (ACRS). The report did not mention whether there was significant difference between treatments. This study is limited not only by its poor rating of quality but also because of small sample size, short duration of intervention, lack of measurement of compliance or adherence to regime, fixed doses of DEX, and use of dependent measures that are of questionable clinical significance.
Of the five studies in this group, four evaluated MPH and one DEX. Four studies (Brown, Wynne, and Medenis, 1985; Conrad, Dworkin, Shai et al., 1971; Firestone, Crowe, Goodman et al., 1986; Klein and Abikoff, 1997) allowed direct head-to-head comparison of stimulant vs. nondrug intervention. The fifth study is Long, Richert, and Ashcraft, 1993. The studies are sorted alphabetically by author for ease of reading. The types of nondrug interventions vary widely and are highlighted in the summary.
Brown, Wynne, and Medenis, 1985 (stimulant vs. stimulant plus cognitive behavioral therapy): This study compared the effect of cognitive behavioral therapy (CBT), MPH alone, and combination therapy (MPH plus CBT). Thirty hyperactive boys with academic difficulties between 6 and 11 years of age were randomized. Outcome measures were very broad, focusing on core symptoms of ADHD and academic measures. The CBT program consisted of 24 1-hour individual sessions 2 times per week over a 3-month period and consultation sessions for both parents and teachers. The results showed that only those children in the two medication treatment conditions demonstrated improvement in both attentional and behavioral ratings and that the combination therapy was not any more effective than MPH alone.
Conrad, Dworkin, Shai et al., 1971 (stimulant vs. stimulant plus child training): This study examined the effect of DEX alone or with twice weekly "prescriptive perceptual-cognitive tutoring" (child training [CT]). The sample consisted of 68 children from a low socioeconomic class; age was not specified. The diagnosis of ADHD was based on the Schenectady Hyperkinetic Scale. The children were randomly assigned to placebo alone, DEX alone, DEX plus CT, or placebo plus CT. The duration of the interventions was not clearly reported. Neither experimental condition significantly influenced academic achievement as measured by the Wide Range Achievement Test.
Firestone, Crowe, Goodman et al., 1986 (stimulant vs. stimulant plus parent training): In this study, the use of parent training (PT) in the treatment of ADHD was evaluated in 73 children between 5 and 9 years of age. Children were assigned to PT plus MPH, PT plus placebo, or MPH alone. Parents were asked to read a book on child management and were then asked to join a parents' discussion group. The results indicate that at 3 months there were significant differences between those in the two medication groups in terms of hyperactivity index, conduct problem, and reaction time but not the verbal grade level. Parent training added very little. At the first- and second-year followup, there were no significant differences among the groups. Some of the limitations of this study include substantial attrition over time, no report of the number of patients randomized, low-quality score, lack of measurement of compliance with treatment, and little information as to the extent of parent participation in the program.
Klein and Abikoff, 1997 (stimulant vs. stimulant plus cognitive behavioral therapy): This study compared the effects of CBT, MPH alone, and CBT plus MPH together. The sample consisted of 89 children 6 to 12 years of age whose diagnosis of ADHD was based on the Conners Teacher Rating Scale. Children with conduct disorder were systematically excluded. Behavior therapy for home and school utilized operant conditioning principles individualized for each child. Dose titration for MPH was carried out up to a maximum of 60 mg/d for optimal efficacy. The results suggested that MPH alone and in combination with CBT were significantly better than CBT with placebo on most outcomes. Some differences between MPH alone and combination therapy include less attention-seeking behavior, better impulse control, and more cooperative behavior. Teachers and psychiatrists also rated the children more improved on the Clinical Global Improvement (CGI).
Long, Rickert, and Ashcraft, 1993 (stimulant vs. stimulant plus bibliotherapy): This study examined the use of bibliotherapy as an adjunct to stimulant medication. Bibliotherapy refers to written behavioral management protocol. The sample consisted of 32 children between the ages of 6 and 11 who were recruited concurrently from outpatient pediatric clinics. Children were assigned to MPH alone or with bibliotherapy. Results revealed that the intensity of behavior lessened for the bibliotherapy group even though the problems remained. One of the major limitations of this study is that there was no surveillance of medication compliance during the study which could have influenced the results.
There were 14 studies in this category. In total, 13 studies evaluated the effect of MPH and one the effect of DEX (Conrad, Dworkin, Shai et al., 1971). Five of these studies have been discussed in the section above (Brown, Wynne, and Medenis, 1985; Conrad, Dworkin, Shai et al., 1971; Firestone, Crowe, Goodman et al., 1986; Klein and Abikoff, 1997; Long, Rickert, and Ashcraft, 1993). The remaining nine are described below.
Borden and Brown, 1989 (cognitive behavior therapy vs. stimulant plus cognitive behavioral therapy): This study was designed to determine whether those treated with combination of medication and CBT would tend to make external attributions for problem solutions or whether more internal attributions would be observed with CBT alone. Thus, this study did not focus on the effects of interventions on the core symptoms or associated behavior/academic difficulties in children with ADHD.
Brown, Borden, Wynne et al., 1986 (cognitive behavior therapy vs. stimulant plus cognitive behavioral therapy): This study examined the effect of combined CBT and MPH. The study achieved a high-quality score and provided information on 14 of the 20 clinically important elements. The study included 40 patients diagnosed with ADHD using DSM-III criteria between 5 and 13 years of age and of average intellectual functioning. Both CBT and attention control (AC) involved 22 1-hour sessions over 3 months, but the difference was in the inclusion of specific problemsolving strategies that were taught in the CBT group. MPH was given at 0.3 mg/kg two times per day and was discontinued during pre- or postmeasurements. The results showed that no significant main treatment effects or interactions among treatments occurred. They also noted that the effects of stimulant medication were noticeable while children were receiving it. However, these effects dissipated rapidly upon discontinuation of the medication.
Brown, Borden, Wynne et al., 1988: This study was designed to determine the extent to which compliance with treatments occur in treatment groups rather than to determine the effects of interventions on the core symptoms or associated behavior/academic difficulties in children with ADHD.
Carlson, Pelham, Milich et al., 1992 (behavior therapy plus stimulant vs. regular classroom plus stimulant): Twenty-four boys between 6 and 12 years of age participating in an 8-week summer treatment program for ADHD were randomized to 2 classrooms (behavior management classroom [behavioral contingencies/modification, or BC/Mod] vs. regular classroom [AC]) and to placebo or MPH at a low dose or high dose. Children showed significantly higher rates of on-task behavior, lower rates of disruptive behavior, and higher rates of following rules when receiving MPH than when receiving placebo. The medication effects on on-task and disruptive behavior interacted with classroom setting such that in regular classroom settings, increasing the dose of MPH had a linear effect of increasing dosage. The study seems to suggest that low dosages of MPH are sufficient to maximally improve children's behavior when behavioral classroom techniques are employed.
Christensen, 1975 (behavior therapy vs. behavior therapy plus stimulants): This study examined the effect of behavior therapy alone (BC/Mod) or in combination with MPH. The subjects were 16 "hyperactive, institutionalized retarded" children between 9 and 16 years of age who were randomly assigned to 2 groups: MPH plus BC/Mod (main element of the program was a token reinforcement system) or placebo plus BC/Mod. The results showed that the BC/Mod procedures in the presence of the placebo were just as effective as the same procedures in the presence of MPH, suggesting no added effect of MPH. However, it is important to note that MPH was given one time per day and the dose was not individually titrated.
Hinshaw, Henker, and Whalen, 1984 (cognitive self-evaluation vs. cognitive self-evaluation plus stimulant): This study evaluated the cognitive self-evaluation procedures and MPH on social behavior in a playground setting. Twenty-four hyperactive boys between the ages of 8 and 13 who were already receiving treatment with MPH were randomized to "reinforcement alone" with MPH, "reinforcement alone" with placebo, "reinforced self-evaluation" (RSE) with MPH, or RSE with placebo. No attempts were made to standardize the medication dose, each boy remaining at his customary dosage. The results show that RSE in conjunction with MPH is superior to medication alone or RSE alone as judged by observations of social behavior on playground.
Hinshaw, Buhrmester, and Heller, 1989 (cognitive behavioral therapy vs. stimulant plus cognitive behavioral therapy): This study examined the effect of MPH on anger control in 24 boys between the ages of 6 and 12 who were participating in a 5-week summer research program. Children received CBT using "specific curriculum" three times per week to enhance anger control during verbal provocation and were also randomized to MPH. The results indicated that MPH exerts significant effects on the global rating of self-control and physical retaliation when the child was provoked, but the data were difficult to extract. Furthermore, the outcome measures were of questionable clinical significance.
Pelham, Carlson, Sams et al., 1993 (behavioral therapy plus stimulant vs. regular classroom plus stimulant): Thirty-one ADHD boys ranging in age from 5 to 10 years participated in an 8-week day treatment program. The boys alternated, on a weekly basis, between classrooms with behavior modification and no behavior modification and received in random order placebo or MPH at a low dose or high dose. Results revealed significant effects of both interventions with the mean effect size of medication being more than twice as great as that of the behavior modification. The addition of either dose of MPH resulted in improvement beyond the effects of behavior modification alone.
Solanto, Wender, and Bartell, 1997 (behavioral contingencies/modification vs. behavioral contingencies/modification plus stimulants): This study evaluated the effect of MPH in combination with BC/Mod on a measure of sustained attention (CPT) in 22 children between 6 and 10 years of age who met DSM-III-R criteria for ADHD. The results indicated that MPH improved sustained attention whereas behavioral intervention alone did not. Adding behavioral intervention to MPH did not lead to further improvement. Unfortunately, this study did not explore whether children with better CPT performance had improvements in other core symptoms of ADHD.
The largest and most comprehensive study conducted to date was designed to determine the effectiveness of different treatment strategies, involving careful titration and systematic testing of drugs other than MPH if needed to achieve an adequate response.
MTA Cooperative Group, 1999: Five-hundred-seventy-nine children, between the ages of 7 and 9, were randomly assigned to one of four 14-month-long conditions: (1) medication management (MedMgt) with monthly visits and supportive care; (2) intensive behavioral treatment (Beh) (including home- and school-based behavior management, direct behavioral management therapy for the child delivered through a summer camp program); (3) both treatments combined (Comb); or (4) assessment and referral back to community service providers (CC). Children's outcomes in clinical and functional domains were assessed throughout the study, using rating scales and direct observational measures. Comb and MedMgt treatments were clinically and statistically superior to intensive Beh and routine CC in reducing core ADHD and oppositional-aggressive symptoms. Moreover, the Comb and MedMgt approaches did not differ statistically or clinically from each other for ADHD or oppostional-aggressive symptoms. In several non-ADHD-symptom areas (parent-rated internalizing symptoms, social skills, consumer satisfaction, and reading), Comb provided modest incremental benefits over single-treatment approaches. MedMgt was superior to medication management provided in the community despite the fact that two-thirds of the CC subjects were treated with medication during the study period.
In summary, there is a lack of evidence supporting the superiority of combination therapy over stimulant alone or superiority of combination therapy over nondrug intervention alone. More research is needed to determine the added value of including nondrug interventions to the treatment scheme of patients who are already receiving stimulants, as well as the added value of treatment with stimulants when nondrug interventions fail to achieve the desired outcomes.
Five of these reports studied desipramine and three the effects of imipramine. One of these trials studied desipramine and clonidine (Singer, Brown, Quaskey et al., 1995). None of the studies evaluated the effects of amitriptyline.
Only one studied an adult population (Wilens, Biederman, Prince et al., 1996). The interventions in the trials were administered for 2 to 18 weeks (in 6 trials, it was 6 weeks or fewer). All the trials but one described the inclusion criteria and three described the exclusion criteria. None described the primary outcome measure (Evidence Tables E1-E3).
Following are descriptions of the main findings of these studies.
Biederman, Baldessarini, Wright et al., 1989a: This study was described in a group of three reports (Biederman, Baldessarini, Wright et al., 1989b; Biederman, Baldessarini, Wright et al., 1993). This study, which did not report the number of patients randomized, found significant improvements in parent and teacher ratings of primary symptoms and in depression in favor of desipramine.
Donnelly, Zametkin, Rapoport et al., 1986: In this study, Conners teacher ratings of primary symptoms improved and classroom activity level declined significantly, although scores on continuous performance and memory tasks were not significantly different. Significant increases in blood pressure and heart rate were noted in the desipramine group.
Gualtieri, Keenan, and Chandler, 1991: In this study, desipramine at three doses (low, medium, and high) was evaluated against placebo in 12 patients. Desipramine at the medium dose resulted in the best scores on "clinical significance," global rating, and teachers' reports of inpatient behavior. Of all individual school behavior, only "fidgety" behavior showed a meaningful linear trend. In addition, this study showed that patients were least "cheerful" on medication at high doses and that motor performance was adversely affected by the drug.
Singer, Brown, Quaskey et al., 1995: In this study, the effects of clonidine, desipramine, and placebo were compared in children with Tourette's syndrome and ADHD. On a global linear analog scale, parents rated children as significantly better on desipramine than on clonidine or placebo. Parent ratings on the Child Behavior Checklist hyperactivity subscale for boys on desipramine were significantly lower than for those on clonidine. Teacher ratings on the nervous overactive subscale were lower for boys during treatment with clonidine and desipramine than placebo.
Wilens, Biederman, Prince et al., 1996: In this study, 68 percent (13/19) of the participants showed a positive response to desipramine (very much or much improved plus at least a 30 percent reduction in rating scale symptoms). No participants showed a favorable response to placebo. Two participants receiving desipramine dropped out, and 10 had doses lowered as a result of adverse effects. Favorable response was not related to the presence of lifetime comorbidity of anxiety or depressive disorders.
Gualtieri and Evans, 1988: In this study, imipramine showed a dose-response relationship with increased scores on a selective attention task but reduced motor performance (finger-tapping speed) in nine children.
Werry, Aman, and Diamond, 1980: Compared with placebo, imipramine reduced impulsive responses and seat movement significantly, but it did not reduce omission errors or have an impact on self-esteem. Psychiatrists' evaluation of improvement using Clinical Global Improvement showed significantly better results with imipramine compared with those with MPH and placebo. Parent ratings using the Conners Parent Questionnaire showed inconsistent results. There were no differences among the three groups when teacher ratings were used.
Winsberg, Kupietz, Yepes et al., 1980: This study included 11 children who had failed to respond to MPH. No significant benefit was obtained either on parent and teacher ratings of the child's hyperactivity or on the patient's performance in a laboratory measure of sustained attention, except for improvements in attention measured by parents using Conners.
In summary, the studies on desipramine, regardless of their heterogeneous designs, small sample sizes, and variable quality, suggest that desipramine is more effective than placebo. The studies evaluating imipramine, however, show inconsistent results.
Only 4 of the 14 studies reported the adverse effects of long-term therapy. In general, clarity was lacking on medication status at the time of posttesting, the patients were selected by atypical criteria (only two of these studies used DSM-III-R and -IV criteria), the duration of the studies was not long compared with the time patients receive treatments in the clinical setting, the sample sizes were small, and patients were included whose comorbid conditions are not typical of the general population of ADHD children (e.g., all patients in Kupietz, Winsberg, Richardson, et al., 1988, had a comorbid reading disability; many patients in Gillberg, Melander, von Knorring et al., 1997, had comorbid autism). In addition, these studies used differing measures of academic performance or cognitive function.
The following is a brief description of the main findings of the studies in each category.
Brown, Borden, Wynne et al., 1986: This study, which included 40 children and adolescents, obtained a high-quality score and included information on 14 of the 20 clinically relevant elements selected for extraction. It showed no advantage of cognitive training over no training in terms of improvement of core symptoms, academic performance, and conduct/oppositional disorders.
Brown, Borden, Wynne et al., 1988: This study evaluated compliance with medication. Subjects were randomly assigned to a 3-month protocol of MPH at 0.3 mg/kg two times per day or inert placebo two times per day and then were also assigned randomly to a cognitive therapy or AC training program. There was a trend for parents of children receiving placebo to report that more doses of medication and more therapy sessions were missed.
Firestone, Crowe, Goodman et al., 1986: This study included children from 5 to 9 years of age referred to various outpatient clinics who met criteria for ADHD both at home and at school. The diagnostic procedure is not specified. Patients were randomly assigned to one of three groups: parent training plus placebo, MPH plus PT, or MPH only. A real asset of this study is the careful attention to enrollment, attrition, and dropout rates. Patients given MPH and the combined PT plus MPH showed equivalent improvement from pre- to posttest (3 months), and both of these arms showed greater improvement than the PT plus placebo arm in behavior and attention, but not in reading. No differences were detected after 12 and 24 months, perhaps as a result of numerous crossovers and dropouts. Subjects who switched to MPH showed improvement following the switch.
Kupietz, Winsberg, Richardson et al., 1988: All 58 subjects in this study met criteria for ADHD and for reading disability and therefore are not typical of ADHD children in most clinics. The study also obtained a high-quality score and included information on 16 of the clinically relevant elements extracted. Behavioral improvement was noted for patients given MPH compared with those given placebo. MPH at the higher doses yielded the greatest effect. Eleven patients dropped out because of the lack of benefit or because of adverse effects. There was no improvement in academic performance, apart from improvement in reading which was observed even though posttesting was conducted off medication. There was evidence of relapse between 3 and 6 months.
Brown, Wynne, and Medenis, 1985: In this study, CBT alone resulted in minimal improvement in attention and no improvement in behavior or academic performance. The MPH and the combined treatments resulted in improved attention and behavior but little improvement in academic performance. Combined treatment was not superior to MPH alone.
Gittelman-Klein, Klein, Katz et al., 1976: This study has been described in previous sections. Initially, the combination MPH plus THIOR was superior to MPH or THIOR alone, although combined treatment was not superior to MPH alone after 12 weeks. Adverse effects were more common with the combined treatment group. The duration of the effect of the combined treatment was longer than that of MPH only, resulting in greater improvement in parent ratings.
Quinn and Rapoport, 1975: Patients who had participated in two independent RCTs of MPH vs. placebo and imipramine vs. placebo were followed for 1 year. Children who had been assigned to placebo in the initial studies (6 weeks) were assigned to either MPH or imipramine. Treatments were provided by community physicians. Behavioral improvement was evident in both medicated groups but not in the unmedicated group according to teacher ratings; equivalent improvement in behavior was evident in all three groups according to parents. No improvement was evident on academic, intellectual, or cognitive measures. MPH appeared to be the more effective medication, since fewer subjects discontinued treatment as a result of adverse effects or lack of improvement.
Conrad, Dworkin, Shai et al., 1971: Eighty-one subjects were selected from children in kindergarten and first and second grade attending 7 schools in a lower class neighborhood in 1969. To qualify for the treatment program, children had to be rated in the top 19 percent on the teacher hyperactivity ratings and exhibit abnormalities on at least one of various measures of perceptual-cognitive impairment. Dosage of medication (or placebo) was initiated at 5 mg/d and increased to 10 to 20 mg/d based on response. Subjects who had not received at least a 60-day supply of medication were excluded from the analysis on the grounds that they were noncompliant. The tutoring program was elaborate, covered academic and emotional issues, was lengthy (2 times per week for 20 weeks), and was well described. Fidelity of the tutoring program was not assessed. Neither experimental condition influenced academic achievement significantly.
Gillberg, Melander, von Knorring et al., 1997: This study merits special mention because it is the only long-term trial conducted outside of North America and one of the four studies published since 1991. Several features of this study are unusual and weaken the ability to generalize to other samples of ADHD children. Reasons for withdrawal from the study suggest that subjects were either nonresponders or were having adverse effects of DEX during the initial unblinded assessment of DEX effects. This limits the validity of the results of the subsequent comparison of DEX and placebo. The description of comorbidity indicates that this sample was unlike those treated with psychostimulants in North America. Seven of 62 subjects had many autistic features, 10 had mild mental retardation, and 2 had Tourette's syndrome. Hallucinations occurred more often in medicated children than has been reported in other studies (3/61), but on the whole, adverse effects were few and relatively mild. Positive effects of the drug were present 15 months after treatment was started.
Greenhill, Rieder, Wender et al., 1973: Subjects were 9 severely hyperactive children, 6 to 16 years of age, who were unresponsive to drugs and psychotherapy and had no evidence of psychosis or medical illness. Lithium treatment was initiated while the children were inpatients and then continued for 3 months while they were outpatients. The trial consisted of an initial, nonrandomized phase in which placebo, DEX, and lithium were each administered for 1 week in a single-blind fashion. The second phase was conducted on an outpatient basis and consisted of a modified double-blind trial of 3-week phases of DEX, lithium, and placebo. Children showed no improvement or worsening of symptoms when given lithium. DEX was associated with improvement in hyperactivity and concentration compared with placebo as reported by parents, teachers, and psychologists and by several cognitive-perceptual measures. Tutoring accounted for improvement on the Information subtest of the Weschler Intelligence Scale for Children (WISC). The combination of medication and tutoring resulted in improvement in copying, motor pattern, reduction in errors on a cognitive task, reduction in hyperactivity, and increase in full-scale intelligence quotient (IQ). The interventions did not improve academic achievement.
Fehlings, Roberts, Humphries et al., 1991: This study included 26 boys, 7 to 13 years of age, who had been referred to a child development clinic. Patients in the CBT group showed greater improvement in parent-rated behavior and child-rated self-concept than those in the supportive therapy group, and these improvements persisted from posttreatment assessment at 4 months to the 5-month followup.
Linden, Habib, and Radojevic, 1996: Subjects were 18 children, age 5 to 15 years with a "primary" diagnosis of ADHD which was based on unspecified measures (e.g., "family interview"). Subjects were referrals to an outpatient psychology clinic. Inclusion and exclusion criteria were not clearly defined. Nine subjects were assigned to EEG biofeedback treatment over 6 months, and nine subjects to a wait-list control group. Adjunctive treatment was controlled, and subjects received no medication or treatment other than that provided in the trial throughout the study. Measures were taken before and after the 6-month treatment program. Active treatment resulted in significantly greater improvement in IQ and attentiveness than did control treatment.
MTA Cooperative Group, 1999: Five-hundred-seventy-nine children, age 7 to 9, were randomly assigned to one of four 14-month-long conditions: medication management (MedMgt) with monthly visits and supportive care, intensive behavioral treatment (Beh) (including home- and school-based behavior management, direct behavioral management therapy for the child delivered through a summer camp program), both treatments combined (Comb), or assessment and referral back to community service providers (CC). Children's outcomes in clinical and functional domains were assessed throughout the study, using rating scales and direct observational measures. Comb and MedMgt treatments were clinically and statistically superior to intensive Beh and routine CC in reducing core ADHD and oppositional-aggressive symptoms. Moreover, Comb and MedMgt approaches did not differ statistically or clinically from each other for ADHD or oppositional-aggressive symptoms. In several non-ADHD-symptom areas (parent-rated internalizing symptoms, social skills, consumer satisfaction, and reading), Comb provided modest incremental benefits over single-treatment approaches. MedMgt was superior to medication management provided in the community despite the fact that two-thirds of the CC subjects were treated with medication during the study period.
Overall, the studies in this category show a trend to general improvement over time regardless of treatment. MPH appears to reduce behavioral disturbance in ADHD children for as long as it is taken. However, the studies lack information on the reasons that so many children discontinue medication and provide little evidence for improvement in academic performance with stimulants, even though MPH treatment does appear to produce consistent behavior improvement. Lithium does not appear to be an effective alternative in subjects who do not respond to stimulants. A limitation of this category is that all but one study was restricted to school-age children. Few studies followed children for a time period equivalent to the duration children typically remain on these treatments in the clinical setting. There is little information on situation-specific outcome measures (e.g., outcomes measured at home and at school), and adverse effects are reported infrequently.
Five RCTs compared MPH with placebo. These studies had very heterogeneous designs and incomplete reporting and provided contradictory results. The only study that was given a quality score >3 was in this group (Spencer, Wilens, Biederman et al., 1995). It was a randomized, double-blind, crossover comparison of titrated doses of MPH with placebo. The dose of MPH was titrated to approximately 1 mg/kg. This study included 25 patients out of 85 who had been referred. On MPH, 78 percent of patients showed end-of-treatment CGI scores <2 and 30 percent reduction in individual rating scale scores. Only 4 percent of the participants met these criteria on placebo. Of the four additional studies comparing the effect of MPH with placebo in adults (Evans, Vallano, and Pelham, 1994; Mattes, Boswell, and Oliver, 1984; Reimherr, Wender, Ebert et al., 1984; Wender, Reimherr, Wood et al., 1985), one had a single-patient design (n-of-1 design) and lasted for 1 week (Evans, Vallano, and Pelham, 1994); one study evaluated the interventions for 2 weeks but did not report the number of patients randomized to each arm (Wender, Reimherr, Wood et al., 1985); and another did not include any of the outcomes of interest (Reimherr, Wender, Ebert et al., 1984). The remaining study (Mattes, Boswell, and Oliver, 1984) included an evaluation of two groups of adults, one with evidence of residual attention-deficit disorder (ADD) with hyperactivity and the other without. In this study, no overall benefit from MPH was evident.
Additional information describing these studies can be found in Supplemental Evidence Tables G5-G8.
DeVeaugh-Geiss and Joseph, 1980: This was a single-subject study (a 20-year-old man) and a double-blind investigation of his response to DEX, secobarbital, and placebo. The authors noted a reduction of anxiety and motor activity while the subject was taking DEX.
Matochik, Liebenauer, King et al., 1994: This study was primarily interested in the effects of MPH and DEX on cerebral glucose metabolism as measured by positron emission tomograph (PET) before and after treatment. Subjects were adults who met criteria for ADHD, met Utah criteria for ADHD in adulthood, had a definite childhood history of ADHD, had no psychiatric disorder, and were unmedicated prior to the study. Most had parents and/or children with ADHD. Medication dose was individually titrated and administered for 6 to 15 weeks. No differences in glucose metabolism with medication were observed. No direct comparison of MPH and DEX was conducted.
Conners, Levin, Sparrow et al., 1996: This 1-week trial found that nicotine at 7 mg delivered via transdermal patch yielded significant improvements in CGI ratings (56 percent much improved, 31 percent no change, and 12 percent unchanged or worse) and in self-ratings of vigor and concentration and improved performance on time estimation tasks, when compared with placebo.
Ernst, Liebenauer, Jons et al., 1996: This trial found that selegiline treatment was not more effective than placebo.
Wender, Reimherr, and Wood, 1981: This study examined the effects of pemoline vs. placebo. More than one-half the patients in the pemoline-treated group complained of moderate to severe adverse effects. Although the overall response to pemoline in the placebo and drug groups did not differ, adults with a childhood history of ADHD on parent reports showed a more favorable response than those without a childhood history of ADHD.
Wilens, Biederman, Prince et al., 1996: This study, also described in the tricyclic antidepressants section, found that 68 percent (13/19) of the participants showed a positive response to desipramine (very much or much improved) plus at least a 30 percent reduction in rating-scale symptoms. No participants showed a favorable response to placebo. Two participants receiving desipramine dropped out, and 10 had doses lowered as a result of adverse effects. Favorable response was not related to the presence of lifetime comorbidity of anxiety or depressive disorders.
Wood, Reimherr, and Wender, 1985: This was a crossover trial of phenylalanine that yielded a significant improvement in mood and mood ability over placebo, but the positive effects were lost within 3 months.
In conclusion, few RCTs are designed to evaluate the long-term effects of treatments for ADHD. Most of the studies available had incomplete reports, had small sample sizes, and were of short duration. The few studies evaluating MPH vs. placebo show contradictory results, but mostly in favor of MPH. The only study available with evaluation of the effect of DEX vs. placebo included few data of clinical relevance. Antidepressants may be effective in adults. Studies (one each) comparing pemoline, nicotine, or phenylalanine with placebo did not produce evidence in favor of these medications. No studies were designed to determine the proportion of adults with ADHD who will use, and benefit from, other interventions, particularly nonpharmacological.
The search yielded 33 reports of 28 RCTs and 1 non-RCT of pharmacological treatment for ADHD that provided data on adverse effects and met the inclusion criteria. Detailed information on the general characteristics of the 29 studies is included in Tables H1a-H1h. The earliest study was published in 1972 and the most recent, in 1997. Almost one-half the reports were published since 1993. Although the period of time during which the interventions were studied varied from 1 to 89 weeks, 19 evaluated adverse effects for 12 weeks or fewer, whereas 3 of the studies did not report the exact duration of followup (Fine and Johnston, 1993; Matochik, Liebenauer, King et al., 1994; Rapport, Carlson, Kelly et al., 1993). Unfortunately, 11 RCTs failed to indicate the number of patients randomized to 1 or more of the treatment arms, making it impossible in these studies to identify with confidence the proportion of patients adversely affected by treatment. Among the studies reporting on the number of patients randomized, substantial variation was seen in sample sizes, from 16 to 166 patients. Eight of the 17 RCTs that provided the number of patients randomized included 30 or fewer subjects. The failure to report clearly on the number of patients randomized was symptomatic of overall poor quality. The interventions evaluated were MPH in 25 RCTs, amphetamines (DEX or l-amphetamine [L-amph]) in 9 RCTs, pemoline in 2 RCTs, and antidepressants in 2 trials. The single non-RCT evaluated MPH.
The adverse effects examined most often across the studies included sleep disorders/disturbances (including nightmares), headaches, motor tics, decreased appetite/anorexia, abdominal pain, irritability, nausea, and fatigue/tiredness (Evidence Tables H1a-H1h and H2). No study evaluated the risk of addiction with stimulants, liver toxicity with pemoline, or cardiac arrhythmias with antidepressants.
Meta-analysis was deemed inappropriate in this category for the same generic reasons described in the Methodology chapter, but in addition, there was marked heterogeneity in the methods used to elicit, measure, and report adverse side effects-different concepts (differences in type as well as characterization: occurrence, frequency, severity), instruments, respondents, and method (elicitation vs. spontaneous report); very short exposure periods; little or no information on interactions between developmental stage and adverse effects; and imprecise estimates of adverse effects because of small sample sizes.
The following is a more detailed description of each of the adverse effects most frequently studied. Tables H1a-H1h provide a detailed summary of these adverse effects, according to the interventions evaluated in the studies.
Studies that provided data on nausea, fatigue and tiredness either showed no significance among the interventions evaluated in them or did not provide data on significance (Evidence Tables H1g and H1h).
Overall, as described above, the studies in this category show a trend to general improvement over time regardless of treatment. The studies suggest that stimulants reduce behavioral disturbance but provide little evidence for improvement in academic performance. The studies have major methodological deficiencies which are compounded by their restriction to school-age children, relatively short followup, and few data on adverse effects.
Three of the meta-analyses included in the review of reviews evaluated the effect of stimulants compared with placebo (Appendix C). Although these reviews had major methodological limitations, they showed consistently that stimulants improve behavior in children with diagnosis of ADHD in the short-term (Kavale, 1982; Ottenbacher and Cooper, 1983; Thurber and Walker, 1983).
The report of the systematic review conducted at the University of British Columbia was assessed and obtained a quality score of 6 out of 7 using Oxman and Guyatt's methodological index (Oxman and Guyatt, 1991). This review was based on an extensive search of seven databases to August 1997, contact with pharmaceutical companies, and hand search of journals.
The review focused on RCTs comparing stimulants with placebo that provided data on children's general "ADHD behaviors" obtained with 11 teacher measures: the Abbreviated Conners Teacher Rating Scale; the Iowa Conners Teacher Rating Scale; the Attentional Problems Subscale of the Achenbach Child Behavior Checklist (Teacher Report Form [TRF]); the Child Attention Profile (CAP) Questionnaire; School Situations Questionnaire (SSQ); Revised Behavior Problem Checklist (RBPC); ADD-H Comprehensive Teacher Rating Scale (ACTeRS); Attention-Deficit Disorder Evaluation Scale (ADDES); Children's Attention and Adjustment Survey (CAAS); ADHD Rating Scale; and the Swanson, Nolan, and Pelham (SNAP). Nine behavior rating scales used to measure parent opinions were: Conners Parent Rating Scale; Conners, Loney, and Milich (CLAM) Questionnaire; Achenbach Child Behavior Checklist Attentional Problems score; Home Situations Questionnaire (HSQ); Revised Behavior Problem Checklist; Attention-Deficit Disorder Evaluation Scale; Children's Attention and Adjustment Survey; ADHD Rating Scale; and SNAP.
The authors standardized data that had been obtained with different tools to a common measure across the studies. Despite the fact that the authors acknowledged the limitations of the data available, they performed meta-analysis using DerSimonian and Laird's method. The review included 16 RCTs evaluating efficacy of stimulant treatment using teacher rating scales, 2 of which did not show advantage for the stimulants. Twelve RCTs included parent rating scales, one of which did not show advantage for the stimulants. When all the studies were combined, the overall results indicated that drug therapy was more effective than placebo. The results from the analysis of seven RCTs of MPH and three of DEX was possible only with the Conners Teacher Rating Scale and showed that both drugs were better than placebo. The only trial comparing pemoline vs. placebo also showed advantage for patients receiving pemoline.
In summary, the results of studies comparing stimulants with placebo indicate that stimulants offer benefit at least in the short term. The few data on long-term effects suggest that stimulants may improve behavioral disturbances. Little evidence shows long-term benefit of stimulant therapy on academic performance, and there is a paucity of data for long-term adverse effects.
The main overall conclusions from studies included in the Task Order are the following:
Extensive research has been done on the treatment of ADHD for more than 25 years. However, few studies compare different pharmacological interventions head to head or with nonpharmacological treatments or evaluate the efficacy of antidepressants or combination therapies, the implications of long-term therapy, the treatment of ADHD in adults, or the adverse effects associated with the most frequently prescribed medications.
The field would benefit from empirical methodological research evidence indicating the added value of nonrandomized within-subject and single-subject research designs for direct head-to-head comparisons between psychosocial interventions and other treatments.
The reports of the studies available have numerous deficiencies that limit the assessment of their validity, relevance, precision, and, therefore, their clinical application. It was surprising, for instance, to find that most studies did not describe clearly important information such as the presence of comorbid disorders, the characteristics of the patients' families, the fidelity of treatments, compliance with treatment, or baseline measurement of outcomes of interest. Little information is available on the treatment of adult patients who belong to minority ethnic groups. In addition, the reporting of other important aspects of the studies was deficient.
Most of the studies available had small sample sizes and did not provide information on the primary outcome of interest to the researchers. Therefore, it was unclear on most occasions whether statistically nonsignificant results reflected a true lack of difference among interventions or false-negative results from underpowered studies. Research during the past 20 years on this issue in other areas indicates that the latter is the most plausible situation in most cases (Freiman, Chalmers, Smith et al., 1978; Moher, Dulberg, and Wells, 1994).
One of the most important challenges faced by researchers interested in ADHD and its treatment is to select a core set of uniform outcome measures to be used across studies in addition to those preferred by individual research groups. Standardized methods to obtain and report each of the outcomes, and the identification of clinically meaningful changes in the outcomes in response to treatment, would facilitate the interpretation of individual studies and the synthesis of data across studies.
More research is needed to understand the relative value of the different aspects of a study. For instance, it is unclear whether having information on family characteristics is more or less important than having information on comorbid disorders. More efforts should be made to understand the implications that different diagnostic models for ADHD and comorbid disorders could have on the outcomes of treatment. This area provides a fertile ground of research on these clinically and methodologically important issues. Any future research efforts may also benefit from more communication among clinicians, providers, and consumers.
Most reports lacked description of basic generic methodological aspects that increase their likelihood of bias. For instance, 97 percent of the articles did not describe the method of randomization, 95 percent did not provide information on concealment of allocation, and 87 percent did not describe the number of, or reasons for, withdrawals and dropouts in each of the groups. It is important to acknowledge that the studies may have included most of the elements in their protocols throughout the execution of the study but did not describe them in their report. Journal editors and peer reviewers should recognize the importance of these elements and encourage authors to include them in their reports. Most of the current problems that were encountered could be easily corrected if journal editors adopted evidence-based reporting recommendations such as the Consolidation of the Standards of Reporting Trials (CONSORT) statement (Begg, Cho, Eastwood et al., 1996) and kept track of new methodological developments that could increase the validity and applicability of research. The CONSORT statement was produced and published by an international group of clinical epidemiologists, biostatisticians, and journal editors in 1996. Its aim is to improve the standards of written reports of RCTs and to ensure that readers find all the information they require in the reports to interpret the trial results with confidence. This statement includes a checklist of 21 items and a flow diagram that authors can use to provide necessary information on the progress of patients through a study. The statement has already been adopted by over 70 major biomedical journals (Jadad and Rennie, 1998).
The approach used in this review for the assessment of the quality of the studies by no means represents the only or most appropriate way to assess trial quality. However, the authors of this Task Order report included the only validated tool available that also appears to produce robust and valid results in an increasing number of empirical methodological studies. In addition, the scale was not used in isolation. Instead, it was complemented with separate assessments of other components for which empirical evidence of a direct relationship with bias exists. In addition, separate assessments of 20 other components were added to provide readers with a much wider and more clinically relevant picture. Any future decision on the assessment of trial validity should be made in the light of new empirical methodological evidence.
The large number and heterogeneity of outcome measures and tests used in the studies limited efforts to compare and synthesize data across the studies included in this report.
Researchers often use modified versions of the same tests (e.g., Conners) across studies but provide little information on their modified versions. The field would benefit by the selection of a core set of validated and clinically relevant outcomes to be measured in all the studies in addition to any other outcomes of interest to the specific groups of researchers. Few studies have been supported financially by sources other than governments or pharmaceutical companies. A great opportunity exists for consumer groups to support more research activities, given the number of important questions that remain unanswered and the implications of the results of research on the public.
Future research efforts in the areas reviewed in this Task Order report will require collaboration and commitment among different groups of stakeholders. The MTA study funded by NIMH is an example of this type of collaboration.
The following is a description of the main implications for clinical practice and future research efforts from each of the seven categories of interest.
The small group of studies comparing different chemical structures and formulations of the same stimulants suggests that the different compounds are more effective than placebo but very similar to each other.
In agreement with the AMA and AACAP reports, it seems that few, if any, differences occur among MPH, DEX, and pemoline. In addition, the studies comparing stimulants with tricyclic antidepressants had many limitations and presented conflicting results. More rigorous studies are clearly needed to establish the relative effectiveness of stimulants and tricyclic antidepressants.
Studies are also required to compare the effects of stimulants with clonidine, buproprion, or selective serotonin-reuptake inhibitors.
The studies available indicate consistently that stimulants (particularly MPH) may be more effective than nonpharmacological interventions when compared head to head. The use of nondrug interventions as adjunct to treatment with stimulant awaits more definitive studies.
Evidence is lacking that supports the superiority of combination therapy over stimulant alone or superiority of combination therapy over nondrug interventions alone. Thus, more definitive studies are needed to determine the added value of nondrug interventions when patients are already receiving stimulants, as well as the value of adding stimulants when nondrug interventions fail to achieve the desired outcomes. These studies, however, will require complex designs, substantial amounts of resources, and efficient collaboration among research groups. The MTA study is an example of this type of study and represents an important contribution to knowledge in this area.
The studies on desipramine, regardless of their heterogeneous designs, small sample sizes, and variable quality, suggest that desipramine is more effective than placebo. The studies evaluating imipramine show inconsistent results. Given that this group of drugs may be considered by clinicians as the second line of treatment after stimulants, more rigorous research is needed to establish their role in the treatment of ADHD.
Despite the fact that most patients diagnosed with ADHD receive treatment for long periods of time, the research efforts made to date are of little value to guide most clinical decisions. More studies are needed in this area because of the persistence of the disorder (NIH Consensus Statement Online, 1998). Practically all the evidence available concentrates on school-age children. Even in this age group, few data are available on adverse effects, on academic achievement, or on situation-specific outcomes (e.g., at home and at school). This is compounded by frequent crossover from one treatment arm to another within studies with parallel design, high attrition rates, and poor description of the reasons for discontinuation of treatments.
When adverse drug reactions do occur, they are usually related to dose and there is no evidence that concludes long-term effects of therapeutic use of psychostimulants are harmful. Long-term therapy studies supply evidence that MPH reduces behavioral disturbance in ADHD children as long as it is taken. Lithium does not appear to be an effective alternative in subjects who do not respond to stimulants. The MTA confirmed the findings of previous studies demonstrating short-term benefits do continue during longer term treatment (MTA Cooperative Group, 1999). There is no information on the long-term outcomes of medication-treated ADHD individuals in terms of educational and occupational achievements, involvement with law enforcement agencies or other areas of social functioning (NIH Consensus Statement Online, 1998).
The studies available in this category are few, and most have incomplete reports and small sample sizes and are of short duration. Given the dearth of research evidence in this area, more studies are needed to establish the effectiveness and adverse effects of different interventions for the treatment of ADHD in adults. In the meantime, clinicians should be aware that the few studies evaluating MPH compared with placebo show contradictory results, that antidepressants may be effective in adults, and that the studies available do not support the use of pemoline, nicotine, or phenylalanine. Studies were not found that evaluated the effectiveness of nonpharmacological interventions for adults with diagnosis of ADHD.
The findings in this report agree with those in the AMA report in relation to the evidence available on adverse effects. Many of the side effects associated with stimulant use appear to be relatively "mild, short lived and responsive to dosing or timing adjustments" (Goldman, Genel, Bezman et al., 1998). However, data are inadequate on the long-term effects and severity of the adverse effects of most interventions. Timing is an additional element that should be taken into account in future studies (e.g., the effects of stimulants administered during puberty and/or concomitant growth spurts). It appears from the available commentaries in many of the studies that desirable changes in behavior brought about by stimulant medication far outweigh reported side effects. Although this may be true, no attempts have been made to evaluate the tradeoff explicitly (i.e., examine the differential utility) between behavioral changes and side effects. The risks and benefits of treatment with psychostimulants must also be measured (NIH Consensus Statement Online, 1998). However, this is only worth doing if the perspectives of all interested parties (parents, teachers, and patients ) are included in the exercise.
In addition, it is important to highlight the following issues/concerns:
No comparative studies were identified with data on addiction in relation to the treatment of ADHD with stimulants, liver toxicity resulting from pemoline administration, or major arrhythmia resulting from administration of tricyclic antidepressants.
Reports of adverse effects almost always come from parents and/or teachers. One study (of adolescents) showed some important differences between parents and adolescents in the side-effects profile reported by each group. Data need to be collected from children on adverse side effects in order to make an intelligible assessment of their importance from the patients' perspective.
A better understanding of the distinctions between "adverse effects" and concomitant characteristics of ADHD is needed. A number of reports discuss the high prevalence of "side effects" reported on placebo. Many of these may be associated problem behaviors. Including such behaviors distorts the context for evaluating the importance of side effects.
Few girls and women have been studied. It is possible that effectiveness and adverse effects vary by sex. This issue needs to be examined or at least discussed.
RCTs are limited to evaluate adverse effects, particularly rare ones or those that appear during long-term therapy. Only one comparative non-RCT with adequate data was found, and it also provided limited information. More observational studies are required (particularly case-control or cohort studies) to gain a better understanding of adverse effects associated with different treatments for ADHD. Knowledge of adverse effects may also improve through more creative use of existing drug databases.
The findings and conclusions of this evidence report are based on the information that was available in the published reports of the studies included. Additional information obtained directly from the authors could have overcome many of the reporting limitations described above. Contact with authors could have also led to reduction in the likelihood of publication bias through the identification of unpublished studies. The budget and time lines available, however, were insufficient to allow this.
The interpretability of these data included in most of the tables of this evidence report is limited. These data were obtained by the authors of the studies with many different instruments that were poorly described in most reports.
Another limitation of this report is that it does not include quantitative estimates of the relative effects of the interventions evaluated. However, meta-analysis was deemed inappropriate given the amount, heterogeneity, and quality of the data available. As mentioned above, the use of meta-analysis to synthesize this type of data has been associated with a greater chance of obtaining imprecise and potentially misleading results (Ioannidis, Cappelleri, and Lau, 1998). It is unclear, however, whether the problems found in this data set are greater or smaller than those in other areas. It is important, however, to recognize that qualitative systematic reviews can introduce other problems such as biased narrative description of the characteristics and findings of the studies included. By including detailed evidence tables in this report, it is hoped that readers will be able to replicate the findings, when appropriate.
It could also be argued that another limitation of this report is that it did not include a separate section for studies comparing the short-term effects of stimulants with placebo. This decision was motivated by the need to make efficient use of the resources available while ensuring the maximum added value from the Task Order report. Given the consistency of the findings of the individual studies included in this review, the three published systematic reviews, and the most recent unpublished review, as well as the limitations of the studies available, it is the belief of the authors that the decision to focus attention on the other seven categories resulted in the most efficient use of the resources available.
It could be argued that systematic reviews of behavioral interventions could benefit substantially by the inclusion of nonrandom within-subject and single-subject research designs. However, we regarded these study designs as too vulnerable to bias and of limited value for direct head-to-head comparisons between psychosocial therapies and interventions.
In many instances, it was concluded that there was a lack of evidence on the effectiveness of clinically important interventions. It is important to recognize that this is different from the lack of effectiveness of the same interventions.
This report represents a collective effort by representatives of different groups of stakeholders, including government agencies (AHRQ), professional organizations (AAP and APA), consumer groups (Children and Adults with Attention-Deficit Disorders [CHADD]), clinicians, and researchers with content and methodological expertise. Emphasis was placed on ensuring the relevance of the research questions, on building on existing knowledge, and on supporting future clinical practice guideline development or research efforts.
The report includes seven systematic reviews that incorporated state-of-the-art methodology and data that are ready for incorporation into evidence-based clinical practice guidelines or performance measures.
This report would best be used as a tool to guide future research priorities and as a platform from which to develop programs, guidelines, and policies rather than as a guideline for clinical practice. Our hope is that this report will be used to stimulate discussion, advance knowledge, and provide constructive challenge in an area of public concern: ADHD.
Evidence Tables A: General Characteristics of All Studies
Either (1) or (2):
Inattention: Six (or more) of the following symptoms of inattention have persisted for at least 6 months to a degree that is maladaptive and inconsistent with developmental level:
often fails to give close attention to details or makes careless mistakes in schoolwork, work, or other activities
often has difficulty sustaining attention in tasks or play activities
often does not seem to listen when spoken to directly
often does not follow through on instructions and fails to finish schoolwork, chores, or duties in the workplace (not due to oppositional behavior or failure to understand instructions)
often has difficulties organizing tasks and activities
often avoids, dislikes, or is reluctant to engage in tasks that require sustained mental effort (such as schoolwork or homework)
often loses things necessary for tasks or activities (e.g., toys, school assignments, pencils, books, or tools)
is often easily distracted by extraneous stimuli
is often forgetful in daily activities
Hyperactivity-impulsivity. Six (or more) of the following symptoms of hyperactivity-impulsivity have persisted for at least 6 months to a degree that is maladaptive and inconsistent with developmental level: Hyperactivity
often fidgets with hands or feet or squirms in seat
often leaves seat in classroom or in other situations in which remaining seated is expected
often runs about or climbs excessively in situations in which it is inappropriate (in adolescents or adults, may be limited to subjective feelings of restlessness)
often has difficulty playing or engaging in leisure activities quietly
is often "on the go" or acts as if "driven by a motor"
often talks excessively
Impulsivity
often blurts out answers before questions have been completed
often has difficulty awaiting turn
often interrupts or intrudes on others (e.g., butts into conversations or games)
Some hyperactive-impulsive or inattention symptoms that caused impairment were present before age 7 years.
Some impairment from the symptoms is present in two or more settings (e.g., at school [or work] and at home).
There must be clear evidence of clinically significant impairment in social, academic, or occupational functioning.
The symptoms do not occur exclusively during the course of a Pervasive Developmental Disorder, Schizophrenia, or other Psychotic Disorder and are not better accounted for by another mental disorder (e.g., Mood Disorder, Anxiety Disorder, Dissociative Disorder, or a Personality Disorder).
314.01 Combined Type: if both Criteria Al and A2 are met for the past 6 months (ICD-10 F90.0)
314.00 Predominantly Inattentive Type: if Criterion Al is met but Criterion A2 is not met for the past 6 months (ICD-10 F98.8)
314.01 Predominantly Hyperactive-impulsive Type: if Criterion A2 is met but Criterion Al is not met for the past 6 months (ICD-10 F90.0)
314.9 AD/HD NOS: symptoms are prominent but do not quite meet above criteria (ICD-10 F90.9)
CODING NOTE: For individuals (especially adolescents and adults) who currently have symptoms that no longer meet full criteria, "in Partial Remission" should be specified.
From:
Based on information from the Diagnostic and
Statistical Manual of Mental Health Disorders, Fourth Edition. Copyright 1994
American Psychiatric Association
The following individuals were consulted extensively and actively participated in all the steps of this project:
Alejandro (Alex) R. Jadad, MD DPhil, Professor, Clinical Epidemiology and Biostatistics (CE&B), McMaster University; Co-Director, Canadian Cochrane Centre; and Director of the MU-EPC was the Task Order Leader for this proposal. Dr. Jadad coordinated the design, execution and reporting of the Task Order. Dr. Jadad also led the section on drug vs. drug comparisons.
Michael Boyle, PhD, Professor and Research Coordinator in the Department of Psychiatry and Behavioural Neurosciences; Associate member of CE&B, McMaster University; and a member of the Centre for Studies of Children at Risk, Hamilton Health Sciences Corporation. Dr. Boyle provided methodological expertise to the project as well as knowledge of the diagnostic and assessment issues pertaining to children suffering from psychiatric disorders.He led the section on adverse effects and the discussion of issues related to sampling.
Charles (Chuck) Cunningham, PhD, is a Clinical Child Psychologist for the Child and Family Centre, Hamilton Health Sciences Corporation; and Professor, Department of Psychiatry and Behavioural Neurosciences; associate member CE&B, McMaster University. Dr. Cunningham is the Chair ofthe Professional Advisory Board for the organization Children and Adults with Attention-Deficit Disorders (CHADD) in Canada.He has been extensively involved in both clinical and research activities for children suffering from ADHD.Dr. Cunningham synthesized the data on antidepressants vs. placebo and on the treatment of ADHD in adults.
Marie Kim, MD, is an Assistant Professor in the Departments of Pediatrics at both McMaster University and University of Western Ontario.She is a pediatrician with expertise in neurodevelopmental issues and is currently a graduate student in the McMaster Health Research Methodology program. Dr. Kim led the development of the sections on combination therapy and nondrug vs. drug comparisons.
Russell Schachar, MD, is a Professor in the Department of Psychiatry at the University of Toronto; the Director of Child Psychiatry Research at the Hospital for Sick Children in Toronto; and Senior Scientist in the Research Institute at the Hospital for Sick Children in Toronto. He has published extensively in the field of pediatric psychiatry, including a systematic review of long-term studies (1996).Dr. Schachar led the section focusing on long-term studies.
Andrew (Andy) R. Willan, Professor, CE&B, McMaster University, is a biostatistican and has overseen the statistical direction and support for the Task Order.He provided advice on the design of the extraction forms and the programming and implementation of the technology that was used for data extraction, storage, and analysis.
Stuart MacLeod, MD, is the Director of the Father Sean O'Sullivan Resaerch Center at St. Joseph's Hospital, Hamilton; Professor, CE&B, Medicine and Pediatrics, McMaster University.Dr. MacLeod is a clinical pharmacologist and provided input on the pharmacological issues associated with the treatment of ADHD.
Deborah Cook, MD, MSc, Associate Professor, CE&B, McMaster University, has significant expertise in systematic reviews and provided methodological input to the design and execution of the review.
American Academy of Pediatrics (AAP): All members of the AAP Subcommittee on ADHD participated actively in the task order, providing input during the formulation of the research questions, the development of the data extraction forms, and support during the search for systematic reviews on the treatment of ADHD. Members of this Subcommittee include Dr. James Perrin (Co-Chair), Dr. Martin Stein (Co-Chair), Dr. Robert Amler, Dr. Thomas Blondis, Dr. Ronald Brown, Dr. Anthony DeSpirito, Dr. Heidi Feldman, Dr. Theodore Ganiats, Dr. Brian Grabert, Dr. Charles Homer, Dr. Bruce Meyer, Dr. Karen Pierce, Dr. Bennett Shaywitz, Dr. Mark Wolraich, and Ms. Carla Herrerias (coordinator).
American Psychiatry Association (APA): Dr. Deborah Zarin, Deputy Medical Director, American Psychiatric Association, provided background materials from the APA and acted as the liaison with that organization. She also provided input during the formulation of the research questions, the development of the data extraction forms, and support during the search for systematic reviews on the treatment of ADHD.
Carmela Vidic is a member of the Hamilton local chapter of CHADD.Ms. Vidic has made presentations to government officials and school boards on the topic of ADHD and is a social worker in her professional life. She is also the mother of three children, one of whom has been diagnosed with ADHD.
Lynda Booker, BA, is a Research Assistant and was the second reviewer/abstractor for the research articles included in this review. Ms. Booker was also involved with the development of search strategies, identification of eligible articles, and the presentation and design of many of the evidence tables. Ms. Booker also coordinated data extraction and synthesis for the review of reviews.
Mary Gauld, BA, has been the Research Coordinator for the Task Order. Working under the direction of the Task Order Leader, Ms. Gauld coordinated all aspects of the production of the evidence report, including supervision of other staff and ensuring planning and communication structures in keeping with the timelines set for various aspects of the Task Order.Ms. Gauld was also one of two reviewers and abstractors for the research articles included in this review.
Ritz Kakuma, BA,was the Office Assistant on the Task Order.Ms. Kakuma was primarily responsible for conducting the literature searches and managing the office files, as well as preparing and editing the various reports submitted to AHCPR.
Ann McKibbon, MLS,is a medical librarian in the Health Information Research Unit.She provided invaluable expertise and advise to the group in searching, reviewing, and abstracting the medical literature.
Debbie Robinson provided support as accounting clerk, monitoring University financial statements and assisting in the preparation of monthly reports and invoices to AHCPR.
Anne Snider, MEd, is the Coordinator of the MU-EPC and provided the overall coordination of the monthly reporting and financial management. Ms. Snider also acted as the liaison within the McMaster community, as well as with external bodies such as the AHCPR Contracts Office and subcontractors such as the San Francisco Cochrane Center.
Elizabeth Wiecek, MD, worked along with Andy Willan, our biostatistician, coordinating the development the database using traditional programming methods (SAS) along with the new technology.Together Drs. Willan and Wieczek developed a plan to use MicroSoft Office Version 8.0, MicroSoft Access program, and the TELEform Designer software for data extraction.
[This appendix includes a summary of the main findings and the list of the articles that were included. The full document is available upon request]
--The Agency for Health Care Policy and Research (AHCPR) charged the McMaster Evidence-based Practice Centre (EPC) to conduct a comprehensive systematic review of the literature about the treatment of ADHD, with input from different groups of stakeholders. One of the strategies used to avoid duplication of work included a critical appraisal of existing systematic reviews and meta-analyses.
--To identify and appraise published meta-analyses and systematic reviews on the treatment of ADHD and to produce an annotated bibliography.
--MEDLINE, CINAHL, HealthSTAR, PsycINFO, and EMBASE were searched from their first issue date to September 1998; The Cochrane database (1998 issue 3); selected Internet sites and the files of investigators.
--Review articles described as systematic reviews or meta-analyses or including methods section were selected independently by three reviewers.
--Two reviewers extracted, by consensus, information on the name, number and background of authors; publication year and language; funding, source of studies; years covered; type and number of studies included; language restrictions; methodological quality; and data on the population, interventions, and outcomes evaluated by the reviews.
--Thirteen reviews, published from 1982 to 1998, were included. Eight included meta-analysis and five a qualitative review. Nonpharmacological treatments were mentioned in six reviews and combination therapies in three. One review focused on the treatment of adults. Forty seven drugs and 20 adverse effects were mentioned. Most reviews had major methodological flaws.
--Most published systematic reviews and meta-analyses on the treatment of ADHD have limited value to guide clinical, policy, and research decisions. A rigorous systematic review following established methodological criteria is warranted.
Database: Medline <1966 -- 28-Nov-97>
Set Search
001 exp "Behavioral Symptoms (Non MeSH)"/
002 hyperactiv:.tw,sh.
003 exp Cognition Disorders/
004 1 and 2
005 2 and 3
006 exp attention deficit disorder with hyperactivity/
007 attention deficit:.tw,sh.
008 6 or 7
009 4 or 5
010 8 or 9
011 minimal brain damage:.tw,sh.
012 minimal brain dysfunction:.tw,sh.
013 hyperkinetic syndrome:.tw,sh.
014 hyperkinetic reaction:.tw,sh.
015 impulsivity:.tw,sh.
016 inattent:.tw,sh.
017 10 or 11 or 12 or 13 or 14 or 15 or 16
018 random:.tw,sh,pt.
019 clinical trial:.tw,sh,pt.
020 comparative:.tw,sh.
021 case-control:.tw,sh.
022 cohort:.tw,sh.
023 18 or 19 or 20 or 21 or 22
024 17 and 23
025 17 and 18
026 17 and 19
027 17 and 20
028 17 and 21
029 17 and 22
030 29

Jadad, Moore, Carroll et al. Controlled Clin Trials 1996;17:1-12.
A method to generate the sequence of randomization will be regarded as appropriate if it allowed each study participant to have the same chance of receiving each intervention and the investigators could not predict which treatment was next. Methods of allocation using date of birth, date of admission, hospital numbers, or alternation should be not regarded as appropriate.
A study must be regarded as double blind if the word "double blind" is used. The method will be regarded as appropriate if it is stated that neither the person doing the assessments nor the study participant could identify the intervention being assessed, or if in the absence of such a statement for use of active placebos identical placebos or dummies are mentioned.
Participants who were included in the study but did not complete the observation period or who were not included in the analysis must be described. The number and the reasons for withdrawal in each group must be stated. If there were no withdrawals, it should be stated in the article. If there is no statement on withdrawals, this item must be given no points.
The San Francisco Cochrane Center helped coordinate the peer review of this report. Our thanks go to Dr. Lisa Bero, Dr. Drummond Rennie and Ms. Andrea Clark.
(Please refer to page and paragraph number when making very specific comments.)
Are evidence report questions well formulated and easily understandable?
Is there a thorough search for relevant data using appropriate resources?
Are there unbiased, explicit searching strategies that are appropriately matched to the question?
Are appropriate inclusion and exclusion criteria used to select articles? Are selection criteria applied in a manner that limits bias? Are efforts made to identify unpublished data, if this is appropriate? Are reasons for excluding studies from the report stated?
Is the validity of individual studies addressed in a reliable manner? Are important parameters (e.g. setting, study population, study design) that could affect study results systematically addressed?
Is there a minimal amount of missing information regarding outcomes and other variables considered key to the interpretation of results? Are efforts made to reduce bias in the data collection process?
Are important parameters, such as study designs, considered in the synthesis? Are reasonable decisions made concerning whether and how to combine the data? Are results sensitive to changes in the way the analysis was done? Is precision of results reported? Are limitations and inconsistencies of studies stated? Are limitations of the review process stated?
Are implications for research discussed?
Are conclusions supported by the data reviewed? Is evidence appropriately interpreted as inconclusive (no evidence of effect) or as showing a particular strategy did not work (evidence of no effect)? Is a summary of pertinent findings provided?
Does the Executive Summary adequately summarize the report? Is the evidence report presented in a clear readable manner? If not, your suggestions are:
What are the major strengths of this report? What are the major limitations of this report?
Indicate here whether you have any conflicts of interest regarding the review of the Evidence Report.
-I certify that I have no affiliations with or involvement in any organization or entity with a direct financial interest in the subject matter of the Evidence Report (e.g. employment, consultancies, stock ownership, honoraria, expert testimony.)
Declare your conflict of interest here.
Reviewers of this Task Order :
| Name | Affiliation |
|---|---|
| Ronald Brown, PhD | Society for Pediatric Psychology Medical University of South Carolina Charleston, South Carolina |
| Mina Dulcan, MD | Children's Memorial Hospital Chicago, Illinois |
| Theodore Ganiats, MD (Methodology expert) | American Academy of Family Physicians UCSD School of Medicine LaJolla, California |
| Peter Jenssen, MD | NIH Associate Director for Child and Adolescent Research Washington, DC |
| Rachel Klein, MD | New York State Psychiatric Institute Columbia University New York, NY |
| Geraldine Macdonald (Methodology expert) | Centre for Professional Studies University of Bristol Bristol, United Kingdom |
| William Mahoney, MD | Professor, Pediatrics McMaster University Children's Hospital Hamilton, Ontario |
| James Perrin, MD | Co-Chair, AAP subcommittee on ADHD Boston, Massachusetts |
| Karen Pierce, MD | American Academy of Child and Adolescent Psychiatry Chicago, Illinois |
| Parminder Raina, Ph.D (Methodology expert) | Centre for Community & Child Health Research Co-Author, CCOHTA report Vancouver, British Columbia |
| Leonard Rapport, MD | Children's Hospital Boston, Massachusetts |
| Bennett Shaywitz, MD | Child Neurology Society New Haven, Connecticut |
| Martin Stein, MD | Co-chair, AAP subcommittee LaJolla, California |
| Richard Ward, MD | Faculty of Medicine, University of Calgary Calgary, Alberta, Canada |
| Deborah Zarin, MD | American Psychiatric Association Washington, DC |
| Criticism Editor Patricia Huston, MD, MPH | Ottawa Heart Institute Ottawa, Ontario, Canada |
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC] [PubMed]
Free Full text in PMC] [PubMed]
Free Full text in PMC]
Free Full text in PMC] [PubMed]
Free Full text in PMC] [PubMed]
Free Full text in PMC] [PubMed]
Free Full text in PMC] [PubMed]
Free Full text in PMC] [PubMed]
Free Full text in PMC] [PubMed]
Free Full text in PMC] [PubMed]AACAP: American Association of Child and Adolescent Psychiatry
AAP: American Academy of Pediatrics
AC: attention control
ACRS: Abbreviated Conners Rating Scale
ACTeRS: ADD-H Comprehensive Teachers' Rating Scale
AD: anxiety disorder
ADD: Attention-Deficit Disorder (DSM-I-III)
ADDES: Attention-Deficit Disorder Evaluation Scale
ADHD: Attention-Deficit/Hyperactivity Disorder (DSM-IIIR-IV)
AHRQ: Agency for Healthcare Research and Quality
AMA: American Medical Association
APA: American Psychiatric Association
Afro: African American
BC/Mod: behavioral contingencies/modification
BDI: Beck's depression inventory
Beh: behavioral treatment
BPC: Behavior Problem Checklist
CAAS: Children's Attention and Adjustment Survey
CAP: Child Attention Profile
CBCL: Child Behavior Check List
CBT: cognitive behavioral therapy
CC: community service providers
CCOHTA: Canadian Coordinating Office for Health Technology Assessment
CCT: Children's Checking Task
CD: conduct disorder
CDRS: Children's Depression Rating Scale
CEFT: Children's Embedded Figures Test
CGI: Clinical Global Improvement
CGP: corrected grade placement
CHADD: Children and Adults with Attention-Deficit Disorders
CLAM: Conners, Loney, and Milich
Clin: clinician
Comb: combined
Com: community
CONSORT: Consolidation of the Standards of Reporting Trials
CommPhy: community physicians
CPRS: Conners Parent Rating Scale
CPT: Continuous Performance Test
CT: child training
DD: depressive disorder
Des: desipramine
DETROIT: Detroit Tests of Learning Aptitude
DEX: dextroamphetamine, dexamphetamine
DSM: Diagnostic and Statistical Manual of Mental Disorders
DTS: Doctor's Target Symptoms Scale
Dx: Diagnosis
EEG: electroencephalogram
EFT: Embedded Figures Test
EPC: Evidence-based Practice Center
HAM-A: Hamilton Anxiety Rating Scale
HoIP: hospital inpatient
HoOP: hospital outpatient
HSQ: Home Situations Questionnaire
HySy: hyperkinetic syndrome
InMe: individual method
IGRS: Inpatient Global Rating Scale
IQ: intelligence quotient
KBIT: Kaufman brief intelligence
KIDDIE-SADS: Schedule for Affective Disorders and Schizophrenia for Children
L-amph: l-amphetamine
LD: learning disorder
MASC: Multidimensional Anxiety Scale for Children
MBD: minimal brain dysfunction/damage
MedMgt: medication management
MFFT: Matching Familiar Figures Test
MHOP: mental health clinic outpatient
MPH or MPH-reg: methylphenidate
MPH-SR: sustained-release methylphenidate
MRIQ: mental retardation/low IQ
MTA: Multimodal Treatment Study of Children with ADHD
MU-EPC: McMaster University Evidence-based Practice Center
NIH: National Institutes of Health
NIMH: National Institute of Mental Health
NM: none mentioned
NP: none present
NR: not reported
NS: not significant
ODD: oppositional defiant disorder
PAL: Paired Associate Learning
PC: pediatric clinic
PCOP: pediatric clinic outpatient
PET: positron emission tomograph
PGS: Physician's Global Rating Scale
POMS: profile of mood states
PT: parent training
PTSR: Physician's Target Symptom Ratings
RBPC: Revised Behavior Problem Checklist
RCT: randomized controlled trial
RSE: reinforced self-evaluation
SCL90: Derogitais Symptom Checklist 90
SCRS: Self Control Rating Scale
SD: standard deviation
SEM: standard error of the mean
SEP: Stimulus Equivalence Paradigm
SNAP: Swanson, Nolan, and Pelham
SSERS: Stimulant Side Effects Rating Scale
SSQ: School Situations Questionnaire
ST: supportive therapy
STESS: Subjective Treatment Emergent Symptom Scale
STM: Short Term Memory Task
STROOP: A test to measure interference in serial verbal reactions
Teach: teacher
TeDi: teacher diagnosis
THIOR: thioridazine
TIP: telephone interview probe
TOO: Task Order Officer
TOTS: Time on Task Scale
Touret: Tourettes's syndrome
TRF: Teacher Report Form
WAIS-R: Weschler Adult Intelligence Scaled-Revised
WISC: Wechsler Intelligence Scale for Children
WRAT: Wide Range Achievement Tests
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC] [PubMed]
Free Full text in PMC] [PubMed]
Free Full text in PMC] [PubMed]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]