NCBI » Bookshelf » Health Services/Technology Assessment Text (HSTAT) » AHRQ Evidence Reports » Pharmacological Treatment of Dementia
 
hserta
AHRQ Evidence Reports
public health

Chapter  97:  Pharmacological Treatment of Dementia

A140440

Prepared for:

Agency for Healthcare Research and Quality

U.S. Department of Health and Human Services

540 Gaither Road

Rockville, MD 20850

www.ahrq.gov

Contract No. 290-02-0020

Prepared by:

McMaster University Evidence-based Practice Center

McMaster University, Faculty of Health Sciences

Hamilton, Ontario, Canada

Investigators:

Pasqualina (Lina) Santaguida, PhD

Parminder Raina, PhD

Lynda Booker, BA

Christopher Patterson, MD

Fulvia Baldassarre, MSc

David Cowan, MD

Mary Gauld, BA

Mitch Levine, MD

Ayse Unsal, PhD

AHRQ Publication No. 04-E018-2

April 2004

ISBN: 1-58763-151-2

ISSN: 1530-4396

This report may be used, in whole or in part, as the basis for development of clinical practice guidelines and other quality enhancement tools, or a basis for reimbursement and coverage policies. AHRQ or U.S. Department of Health and Human Services endorsement of such derivative products may not be stated or implied.

AHRQ is the lead Federal agency charged with supporting research designed to improve the quality of health care, reduce its cost, address patient safety and medical errors, and braoden access to essential services. AHRQ sponsors and conducts research that provides evidence-based information on health care outcomes; quality; and cost, use and access. The information helps health care decisionmakers - patients and clinicians, health system leaders, and policymakers - make more informed decisions and improve the quality of health care services.

This document is in the public domain and may be used and reprinted without permission except those copyrighted materials noted for which further reproduction is prohibited without the specific permission of copyright holders.

Suggested Citation:

Santaguida PS, Raina P, Booker L, Patterson C, Baldassarre F, Cowan D, Gauld M, Levine M, Unsal A. Pharmacological Treatment of Dementia. Evidence Report/Technology Assessment No. 97 (Prepared by McMaster University Evidence-based Practice Center under Contract No. 290-02-0020). AHRQ Publication No. 04-E018-2. Rockville, MD: Agency for Healthcare Research and Quality. April 2004.

Prepared for:

Agency for Healthcare Research and Quality

U.S. Department of Health and Human Services

540 Gaither Road

Rockville, MD 20850

www.ahrq.gov

Contract No. 290-02-0020

Prepared by:

McMaster University Evidence-based Practice Center

McMaster University, Faculty of Health Sciences

Hamilton, Ontario, Canada

Investigators:

Pasqualina (Lina) Santaguida, PhD

Parminder Raina, PhD

Lynda Booker, BA

Christopher Patterson, MD

Fulvia Baldassarre, MSc

David Cowan, MD

Mary Gauld, BA

Mitch Levine, MD

Ayse Unsal, PhD

AHRQ Publication No. 04-E018-2

April 2004

ISBN: 1-58763-151-2

ISSN: 1530-4396

This report may be used, in whole or in part, as the basis for development of clinical practice guidelines and other quality enhancement tools, or a basis for reimbursement and coverage policies. AHRQ or U.S. Department of Health and Human Services endorsement of such derivative products may not be stated or implied.

AHRQ is the lead Federal agency charged with supporting research designed to improve the quality of health care, reduce its cost, address patient safety and medical errors, and braoden access to essential services. AHRQ sponsors and conducts research that provides evidence-based information on health care outcomes; quality; and cost, use and access. The information helps health care decisionmakers - patients and clinicians, health system leaders, and policymakers - make more informed decisions and improve the quality of health care services.

This document is in the public domain and may be used and reprinted without permission except those copyrighted materials noted for which further reproduction is prohibited without the specific permission of copyright holders.

Suggested Citation:

Santaguida PS, Raina P, Booker L, Patterson C, Baldassarre F, Cowan D, Gauld M, Levine M, Unsal A. Pharmacological Treatment of Dementia. Evidence Report/Technology Assessment No. 97 (Prepared by McMaster University Evidence-based Practice Center under Contract No. 290-02-0020). AHRQ Publication No. 04-E018-2. Rockville, MD: Agency for Healthcare Research and Quality. April 2004.

Preface

The Agency for Healthcare Research and Quality (AHRQ), through its Evidence-Based Practice Centers (EPCs), sponsors the development of evidence reports and technology assessments to assist public- and private-sector organizations in their efforts to improve the quality of health care in the United States. The reports and assessments provide organizations with comprehensive, science-based information on common, costly medical conditions and new health care technologies. The EPCs systematically review the relevant scientific literature on topics assigned to them by AHRQ and conduct additional analyses when appropriate prior to developing their reports and assessments.

To bring the broadest range of experts into the development of evidence reports and health technology assessments, AHRQ encourages the EPCs to form partnerships and enter into collaborations with other medical and research organizations. The EPCs work with these partner organizations to ensure that the evidence reports and technology assessments they produce will become building blocks for health care quality improvement projects throughout the Nation. The reports undergo peer review prior to their release.

AHRQ expects that the EPC evidence reports and technology assessments will inform individual health plans, providers, and purchasers as well as the health care system as a whole by providing important information to help improve health care quality.

We welcome written comments on this evidence report. They may be sent to: Director, Center for Outcomes and Evidence, Agency for Healthcare Research and Quality, 540 Gaither Road, Rockville, MD 20850.

Carolyn M. Clancy, M.D.

Director

Agency for Healthcare Research and Quality

Name of Funding Government Agency Director

Director

Name of Funding Government Agency

Jean Slutsky, P.A., M.S.P.H.

Acting Director, Center for Outcomes and Evidence

Agency for Healthcare Research and Quality

The authors of this report are responsible for its content. Statements in the report should not be construed as endorsement by the Agency for Healthcare Research and Quality or the U.S. Department of Health and Human Services of a particular drug, device, test, treatment, or other clinical service.

Acknowledgements

The authors of this report would like to thank and acknowledge the assistance of the following people:

Structured Abstract

Context. Dementia is a chronic progressive disease with no known cure. It affects cognition, behavior/mood, physical functions and activities of daily living, and caregiver burden. Therapeutic interventions for dementia aim to affect these domains.

Objectives. To review the evidence and answer the questions: Does pharmacotherapy for dementia syndromes improve cognitive symptoms and outcomes? Does pharmacotherapy delay cognitive deterioration or delay disease onset of dementia syndromes? Are certain drugs, including alternative medicines (non-pharmaceutical), more effective than others? Do certain patient populations benefit more from pharmacotherapy than others? What is the evidence base for the treatment of ischemic vascular dementia (VaD)?

Data sources. Studies were identified by searching the Cochrane Central trial registry, MEDLINE® , PreMedline®, EMBASE, AMED, CINAHL®, Ageline, and PsycINFO.

Study selection. English-language randomized controlled trials were selected if they evaluated pharmacological agents for adults with a diagnosis of dementia according to the criteria of International Classification of Diseases (ICD), Diagnostic and Statistical Manual of Mental Disorders (DSM) or National Institute of Neurological and Communicative Disorders and Stroke (NINCDS). Crossover trials and studies with a quality score < 3 on the Jadad Scale were excluded.

Data extraction. Data were extracted on type of dementia, severity of disease, setting, regimen of pharmacological agents, study duration, main outcome measures, adverse effects, and results. The quality of studies was assessed, and the quality of adverse effect reporting was assessed. Effect sizes were calculated and data were pooled when appropriate.

Data synthesis. (1) Efficacy: One hundred and eighty-six Randomized Controlled Trials (RCTs) evaluated 97 drugs. As expected the findings varied with the dementia population and the specific outcomes in the various domains. Those pharmacological agents that showed a consistent effect of benefit are as follows: A) Global assessment was improved by donepezil, galantamine, rivastigmine, velnacrine, cerebrolysin and idebenone; B) Cognition (general and specific) was improved by donepezil, galantamine, metrifonate (this drug has been withdrawn from use in North America because of safety concerns), nicergoline, physosligmine, rivastigmine, velnacrine, memantine, cerebrolysin, ginkgo biloba, idebenone and propentofylline; C) Behavior/mood was improved by haloperidol; D) Quality of life/Activities of Daily Living (ADL) was improved by donepezil, galantamine and posatirelin. In general, caregiver burden and quality of life/ADL were not frequently evaluated. (2) Delay disease: Cerebrolysin, selegiline plus vitamin E, and donepezil showed some significant effects in delaying disease progress in patients with mild to moderate and moderately severe Alzheimer's disease. (3) Head to head comparisons: Superiority was seen for sulphomucopolysaccharides over CDP-choline, donepezil over vitamin E, antagonic-stress over nicergoline, antagonic-stress over meclofenoxate, posatirelin over citicoline, and pyritinol over hydergine. (4) Patient populations: Stratified analyses included: age, gender, Apolipoprotein E (APOE) genotype, disease type, disease severity, race by location, care dependence, and presence of depression. Single populations of dementia subjects with Down's syndrome, and depression were evaluated. Evidence was inconclusive for this question. (5) Ischemic VaD: A total of 20 pharmacological interventions in 29 studies were applied to vascular dementias. Differences were suggested between multi-infarct dementia (MID) and Alzheimer's disease (AD) for 5′ -MTHF-trazodone, AD and VaD for citalopram, and AD and MID for Ginkgo biloba. Trials with VaD patients showed effects for memantine, nicergoline, pentoxyfylline, idebenone, donepezil and cerebrolysin.

Conclusions. Pharmacotherapy for dementia can improve symptoms and outcomes. Adverse events should be more systematically reported. Few studies evaluated delay in either disease onset or progression, but there was some evidence suggesting delay in progression. Few studies compared drugs with other drugs. Due to poor evaluation, data was limited to consider efficacy of pharmacotherapy in different subgroups of patients. Some agents have been shown to be effective in VaD patients.

Chapter 1. Introduction

This review focuses on the pharmacological treatment of dementia. Dementia is a syndrome of acquired cognitive defects sufficient to interfere with social or occupational functioning, which results from various central neurodegenerative and ischemic processes. Dementia has become a major public health problem due to its increasing prevalence, long duration, caregiver burden, and high financial cost of care. The prevalence of dementia varies as a function of the defining criteria as shown by Erkinjuntti et al. (1997),1 who showed a range from 3.1% using the International Classification of Diseases, Tenth Revision (ICD-10) criteria, up to 29.1% using the Diagnostic and Statistical Manual, Third Edition (DSM-III) criteria. Jorm et al. (1987)2 conducted a meta-analysis based on 22 international studies and found that the actual prevalence rates differed significantly from study to study. However, this meta-analysis demonstrated that the prevalence increased exponentially with age. The prevalence ranged from 0.7% for 60 – 64 year olds to 24% for people over the age of 85 years. In the United States, the prevalence of Alzheimer's disease (AD) is projected to quadruple to one in 45 Americans in the next 50 years3 across all ages. The Canadian Study of Health & Aging (CSHA)4 estimated the prevalence of dementia in Canada at 8% (approximately 252,600 cases) in 1991 among seniors over the age of 65 years. The prevalence of dementia increases to 34% among those aged 85 years or more.4 The age-standardized incidence of dementia in Canada has been estimated at 21.8 per thousand for females and 19.1 per thousand for males.5 The prevalence is expected to double to half a million cases in Canada by 2013.6 Because the world's population is progressively aging, especially in the developed nations, more people are falling into age groups where the prevalence of dementia is highest. From a clinical perspective, dementia predominately affects 1) cognition, 2) behavior/mood, 3) physical functions and activities of daily living, and 4) caregiver burden. Therapeutic interventions for dementia aim to affect these four primary domains.

Pharmacotherapy is often the primary intervention used to improve symptoms or delay the progression of dementia syndromes. The pharmacological agents used vary significantly with respect to their therapeutic actions. The most common pharmacological agents used in North America modify the activity of cholinesterases—enzymes, which degrade acetylcholine, a neurotransmitter that is critical to the neurons involved in cognition (e.g. memory, thought, and judgment). Other approaches include the use of anti-oxidants, which work by minimizing the effects of free radicals that are released through normal oxidative metabolism. These free radicals may cause neuronal damage and play a role in the development of dementia. Similarly, it is believed that inflammation contributes to nerve cell damage and dementia; hence anti-inflammatory drugs may act by decreasing inflammation, potentially reducing nerve degeneration, which may in turn slow or even prevent dementia illnesses.

Other pharmacological interventions that have been studied include cholesterol-lowering agents, anti-hypertensives, folic acid, hormones (e.g. estrogen), behavior and mood altering drugs, anti-amyloid strategies (e.g. immunization, aggregation inhibitors, and secretase inhibitors), transition metal chelators, nerve growth factors, and agents that target neurotransmitters other than acetylcholine and its receptors. The various pharmacotherapeutic agents available to treat problems associated with dementia have varying levels of evidence to support their efficacy. This report is a systematic evaluation of the evidence for pharmacological interventions in the treatment of dementia in the domains of cognition, global function, behavior/mood, quality of life/ADL, and caregiver burden.

Diagnosis of Dementia

Determination of disease onset presents considerable difficulty, as dementia, by definition, has an insidious and gradual progression. A number of diagnostic models have been used to classify dementia. In 1988 the National Institute of Neurological and Communicative Disorders and Stroke - Alzheimer's Disease and Related Disorders Association (NINCDS/ADRDA) research on diagnostic criteria were published for AD, which served to increase the validity and reliability of the clinical diagnosis.7, 8 Trials published prior to this time may reflect mixed populations other than AD. Other models used to diagnose dementia include: the International Classification of Diseases (ICD) version 9 or 10, the Diagnostic and Statistical Manual of Mental Disorders (DSM) III, III-R, and IV (American Psychiatric Association), and the NINCDS/ADRDA.9 The difficulty with these different diagnostic criteria for dementia is that they are not interchangeable. Erkinjuntti et al. (1997)1 compared six commonly used classification schemes (DSM-III, DSM-III-R, DSM-IV, ICD-9, ICD-10, and the Cambridge Examination for Mental Disorders in the Elderly (CAMDEX)). They showed that the prevalence of dementia can differ by a factor of 10 depending on the diagnostic criteria used. Two other studies have demonstrated that the prevalence of vascular dementia (VaD) varies with the classification system and therefore these criteria for diagnosis are not interchangeable.10, 11 Furthermore, there is controversy about the validity of the clinical classification of VaD, as autopsy confirmation often does not substantiate the clinical diagnosis.12, 13 The majority of dementias were actually AD with co-existing vascular and Parkinson's disease lesions.14 In contrast, the clinical accuracy of AD diagnosis is relatively high.7 The discovery that a long pre-clinical period precedes AD has led to the establishment of early diagnostic indices of dementia. This border zone between normality and dementia has been given numerous names and definitions, which include: benign senescent forgetfulness (BSF), age associated memory impairment (AAMI), age-consistent memory impairment (ACMI), age-associated cognitive decline (AACD), mild cognitive impairment (MCI), cognitive loss no dementia (CLOND), and cognitive impairment but not dementia (CIND). The prevalence for this pre-clinical or mild form of cognitive decline varies with the classification system used.15 Unfortunately, the classifications used to diagnose early mild cognitive decline are not interchangeable. MCI16 is emerging as the preferred term for this condition17 using the criteria of Petersen et al.16 Ritchie et al.18 (2001) estimated the prevalence of MCI to be 3.2% with an 11.1% conversion rate to dementia within a 3 year period.

Analytic Framework: Understanding Therapeutic Aims of Pharmacological Treatment

Dementia is a chronic progressive disease for which no known cure exists. Pharmacological interventions used to treat dementia are intended to achieve at least one of the following broad therapeutic aims:

Prevention of onset of the disease. In the context of this review, this applies to those at greatest risk (such as those with the clinical diagnosis of MCI) of conversion to a dementia syndrome.

Symptomatic treatment of the disease. Symptomatic benefit can be described as maintenance (or stabilization) or improvement of the current cognitive, behavioral, functional, or caregiver status only while on active treatment with the pharmacological intervention. Withdrawal of the pharmacological therapy results in a decline towards baseline or placebo levels of relevant outcomes.

Delay in the progression of the disease. A therapeutic intervention that brings about delay in the progression of the disease can be described as either 1) one that maintains (or stabilizes) or improves current cognitive, behavioral, functional, or caregiver status, which is sustained, or 2) one that can be shown to alter the rate of decline of the disease progression, even when the drug is withdrawn.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-phartrdemf1.jpg.

   Figure 1. Pathway for the progression of dementia and the ideal application of drug interventions within this framework

Figure 1 details the analytic framework for the progression of dementia and shows when various pharmacological interventions would ideally be administered for the intended therapeutic benefit within this pathway. The scope of this review did not include the evaluation of normal healthy aging populations. Rather, pharmacological agents intended for populations at increased risk of conversion to dementia syndromes, such as MCI, were eligible for evaluation in this systematic review.

Understanding Efficacy of Pharmacological Interventions in Dementia Trials

It has been suggested that dementia does not have uniformly accepted criteria for disease progression or consensus regarding the magnitude of clinically important changes.19, 20 With respect to the therapeutic aim, the practical consequences of these unresolved issues are that the same efficacy variables have been used to both show evidence of symptomatic benefit and demonstrate the effects on disease progression. Thus, the design of a clinical trial (rather than the outcome) is critical to demonstrating which of these two therapeutic outcomes (symptomatic benefit or delay in progression) is being achieved with the pharmacological agent.21

Irrespective of which therapeutic aim is being achieved by the pharmacological agent, the lack of consensus on these two issues has even more important implications when considering the definition of “efficacy” for either treatment goal. A change in a relevant outcome measure that is due to factors other than chance is deemed statistically significant. The criteria to determine efficacy solely on statistical significance have long been recognized as problematic from an interpretation perspective. Clinically meaningful change reflects a different level of “significance” and often requires consensus among experts within the field to establish what magnitude of change is important.20

Efficacy as Measured by Clinical versus Statistical Significance

The dementia literature is not consistent in the criteria used for establishing efficacy, and there is no consensus on the meaning of clinical significance in the changes observed.20, 22 In general, attempts are made to select an outcome measuring an important dementia attribute (such as cognition) and an additional outcome evaluating global change as observed by a clinician (with or without input from the caregiver). The outcomes selected to reflect these two domains vary, as do the number of attributes that are selected for evaluation.

The United States Food and Drug Administration (FDA) has established criteria for efficacy of dementia (specifically AD) drug interventions,23 which require the following: 1) a double blind, placebo-controlled trial, 2) subjects who meet established criteria for AD, 3) sufficient length of follow-up to appreciate a meaningful effect of the drug on cognition, and 4) a clinical change of sufficient magnitude to be recognized by a clinician. In establishing these criteria, it was assumed that the outcome measuring cognition was the primary change of interest, and that the global clinical evaluation would mirror the changes in the primary variable.24 In 1997, the European Medicine Evaluation Agency (EMEA) issued new guidelines that incorporated two new concepts for the treatment of AD.25 Firstly, the EMEA guidelines suggested a preference for a measure of functional abilities in addition to a global measure, and noted that behavioral outcomes were important from a clinical perspective. Secondly, a definition of “responders” should be included in all trials, such that the degree of improvement in their cognition (or stabilization) was pre-specified. However, the magnitude of the change reflecting a clinically meaningful change was not specifically stated in either of these two guidelines. Sufficient magnitude of the change would reflect a clinically important difference, and this would vary with the type of outcome selected.

Several authors have attempted to define “clinically” relevant change. Gutzmann et al. (2002)26 developed an Efficacy Index Score (EIS), which is a checklist that combines dropout as well as the relevant improvements individually across the three levels of assessment (cognitive function, activities of daily living and global function). Although, this summary score has not been validated relative to other traditional outcomes, it does present a unique example of determining efficacy in the context of anti-dementia drug interventions. Mayeux and Sano (1999)27 in reviewing drug interventions for dementia, evaluated efficacy as a percent of the change in the treatment group relative to baseline (corrected for any change in the placebo group) and contrasted this with the percent of dropouts related to adverse events. Disease progression was considered with respect to the outcomes of 1) time until death, 2) nursing home placement, 3) loss of ability to perform Activities of Daily Living (ADL), or 4) severe dementia. In the context of clinical trials seeking to establish efficacy of pharmacological interventions, the latter outcomes may be problematic to ascertain.

Evaluation of the natural history of AD established some threshold values for expected decline or progression of the disease. Using the Alzheimer's Disease Assessment Scale-Cognitive Section (ADAS-cog),28 Rosen demonstrated that a decline of 1.28 points occurred within 12 weeks, a decline of 3.5 points within 6 months, and Stern et al. (1994)29 showed a decline of 9 – 11 points by 1 year. Clinical experience would also suggest that the decline is not linear, with less deterioration in the early and later stages and the greatest acceleration in the middle severity category. The characteristics of the natural history of AD and other dementia types are best derived from longitudinal studies. Although, more details on the natural history of dementia are being reported, the fundamental difficulty still remains concerning the diversity of the outcome measures used to describe these changes. The picture of cognitive, behavioral, and functional decline will therefore vary with the outcome measure selected to describe it. Additionally, the diversity has a negative impact on comparisons of drug efficacy that can be made across trials.30

Efficacy and Outcome Measures Used in Pharmacological Intervention Trials

No specific set of commonly accepted outcomes that define efficacy or “clinical relevance” applies to all the pharmacological interventions that have been used to treat dementia. More than 175 outcome measures are listed in Appendix E. EMEA guidelines acknowledge that no single test encompasses the broad range of disease characteristics associated with AD; nor has there been convincing evidence that an ideal (or reference) instrument exists to capture cognitive, behavioral, functional, or caregiver status.25 The FDA has recommended that “dual efficacy” of dementia drug interventions be established by significant change in both a psychological measure and a global change measure. The outcomes used to measure these attributes within these two domains were not specified. In practice, there has been a general trend in North America toward using the outcomes ADAS-cog, the Mini-Mental State Examination (MMSE), and the Clinicians' Interview-Based Impression of Change-Plus (CIBIC+) to capture the two domains when evaluating drugs for AD populations. However, these frequently used outcome measures may not be the best choice with respect to capturing “clinically relevant change”. The psychometric instrument properties must also be taken into consideration. For example, it has been suggested that the ADAS-cog is weighted predominately to evaluate memory loss at the expense of other cognitive domains (especially executive control functions),31 which suggests that the face validity of this instrument may be in question. The generalizability of these results may be limited to dementia in which memory impairment is a key feature as the instrument is less sensitive to personality and executive dysfunction changes seen in a less typical dementia, such as frontotemporal dementia. The responsiveness (ability to detect change) of the CIBIC+ has not been well established.32 This suggests that some of the most established outcomes used to evaluate efficacy of pharmacological interventions are far from ideal.

Demers et al. (2000)33 critically appraised some of the most commonly used scales evaluating global assessment,32 quality of life/ADL,30 and behavior/mood34 with respect to the quality of their psychometric properties. Several important limitations were identified in these reviews for the measures they evaluated, and these include 1) a lack of responsiveness data, 2) diversity in the content of the scales (capturing various aspects of a domain, for example, behavior), and 3) limited studies on reliability and validity (which are sample specific). The literature evaluating outcome measures used in dementia trials would suggest that most instruments have significant limitations, or at least more data are required to establish the required properties for acceptability of the scales.

Given the current state of development of research on outcome measures used in dementia trials for determining efficacy, a dilemma is clearly at hand. Ideally, all outcomes used to evaluate efficacy should have demonstrated acceptable psychometric properties, such as reliability, validity (construct), and responsiveness. However, since none of these outcomes have been accepted as standards, the selection of the most appropriate outcome is purely arbitrary. Similarly, establishing a rationale to exclude studies based on the specific type of outcome measure would be arbitrary. For this reason, no exclusion criteria based on outcome measures were used as eligibility criteria for this study.

Efficacy and Potential Risk of Adverse Events

Increasing attention has been given to the potential for harm, and not just benefits, when considering the efficacy of drug interventions. Empirical evidence across diverse medical fields indicates that reporting of safety information, including milder adverse events, receives much less attention than the positive efficacy outcomes.35 Thus, an evaluation of the benefits of anti-dementia pharmacological agents alone may present a biased view of the overall benefit of the intervention. In the context of this systematic review, the type and frequency of adverse events associated with the use of a drug intervention will be scrutinized to a greater extent than previous reviews of anti-dementia drugs.

Capturing and evaluating adverse events is problematic. Typical randomized controlled trial (RCT) dose finding studies should consist of the comparison of several doses of a drug versus placebo; efficacy is demonstrated relative to a placebo group or relative to a different dose group. Ideally, the goal of early phase trials is to estimate the minimum effective dose or the maximum safe dose (or both). However, it is misleading to assume that drugs shown to be safe and effective in trials are safe and effective in all other circumstances.36 The nature of pre-market clinical trials makes it difficult to evaluate the benefits of drugs for the universe of potential users, as criteria restricting entry into the trial do not necessarily reflect dementia patients in general. By their nature, some adverse events are not easily anticipated, and therefore are not screened for in some trials. The implementation of pharmaco-vigilance systems attests to the need for further capture of potential adverse events not captured in trials. Adverse events may be hard to predict or anticipate and are captured only if a trial protocol was designed to measure these events. A limited number of standardized instruments exist to capture these events reliably. Unique to individuals with cognitive decline is the potential problem of validity of the self-report instrument, even if completed by the caregiver. Furthermore, many trials may be underpowered to detect adverse events with an incidence of 1/1000.37 Despite these limitations, quality criteria for the collection and reporting of adverse events have been identified.35, 37 An instrument to evaluate the quality of reporting adverse events has been developed and used in this report to determine the strength of the evidence for adverse events in the context of determining efficacy.

Efficacy and Intention to Treat Analysis

Determining efficacy in dementia trials evaluating pharmacological interventions may vary depending on the selection of the analysis type. In general, the types of analyses of primary data in trials fall into two main categories: 1) intention to treat analyses (ITT) or last observation carried forward (LOCF), and 2) observed case (OC) or completed trial (CT). The advantages of ITT over OC analyses have been well explicated.38 It is recognized that non-compliance is not a random event; thus, ITT analyses should be used to base principal conclusions of efficacy.39 In the context of some anti-dementia drug therapies, where dropout rates due to adverse events and other non-compliance reasons may be high, the ITT analysis minimizes bias and the potential for type I errors when considering treatment efficacy. However, the ITT analysis, while less biased, does tend to reduce treatment effects to the extent that there are dropouts and crossover patients. The optimal analysis, when there is a large loss to follow-up, is to conduct the analysis both ways and look for consistency.

Primary Objectives and Scope of Systematic Review

A large number of pharmacological interventions have been studied in dementia patients. These agents can be classified into three broad categories: 1) cholinergic neurotransmitter modifying agents, such as acetylcholinesterase inhibitors, 2) non-cholinergic neurotransmitter/ neuropeptide modifying agents, and 3) other pharmacological agents. Although only four agents have been approved by the FDA for the treatment of dementia, many other pharmacological agents are being evaluated in trials in off-label use. In both these circumstances, there was a need to determine the evidence to support claims of efficacy and to describe adverse events.

The Questions

Given the range of pharmacological agents that have been used to treat dementia, evaluation of all of these interventions in a systematic review (which afforded a consistent methodology) should serve as a meaningful contribution in this area. The purpose of this systematic review is to answer the following questions:

  1. Does pharmacotherapy for dementia syndromes improve cognitive symptoms and outcomes?

  2. Does pharmacotherapy delay cognitive deterioration or delay disease onset of dementia syndromes?

  3. Are certain drugs, including alternative medicines (non-pharmaceutical), more effective than others?

  4. Do certain patient populations benefit more from pharmacotherapy than others?

  5. What is the evidence base for the treatment of VaD?

This review considers different dementia populations (not just AD) and subjects from both community and institutional settings. The interventions were limited to pharmacological agents (including nutriceuticals), and these were not restricted to those that have received official approval in North America. The studies eligible in this systematic review were restricted to parallel RCTs, but the study outcomes were not limited to specific types.

The review will serve to evaluate the quality of the evidence and identify important gaps in the literature. Future recommendations will serve the dementia research community specifically. This evidence report will support the American College of Physicians (ACP) and the American Academy of Family Physicians (AAFP) in developing “best practices” and practice guidelines for the evidence-based treatment of dementia for providers, patients and the public.

Chapter 2. Methods

The Research Team

A multidisciplinary local research team representing geriatric and dementia epidemiology/ systematic review methods (P. Raina, PhD), pharmaco-epidemiology (M. Levine, MD, PhD), geriatric medicine/ dementia (D. Cowan, MD; C. Patterson, MD), rehabilitation/ systematic review methods (P. Santaguida, PT, PhD), and neuropsychology (A. Unsal, PhD) was assembled. The core research team, including experienced staff at the McMaster Evidence-based Practice Center (EPC) (F. Baldassarre, MSc; L. Booker, BA; M. Gauld, BA) participated in regular meetings and reached consensus on key methodological issues. An international Technical Expert Panel (TEP) was assembled to provide high-level content expertise in dementia and participated in conference calls on an as-needed basis. Participants in this panel were: Larry W. Chambers, PhD. Ottawa, ON, Canada; Thomas Cook, MD. (ACP appointee) Colorado Springs, CO, USA; Rachelle Doody, MD, PhD. Houston, TX, USA; John Feightner, MSc, MD. London, ON, Canada; Rodney Hornbake, MD. (ACP appointee) Hadlyme, CT, USA; David Hogan, MD. Calgary, AB, Canada; Roy Jones, MD. Bath, UK; and Holly Tuokko, PhD. Victoria, BC, Canada.

Topic Assessment and Refinement

Refinement of Questions

The first step during the topic assessment and refinement process was to organize a teleconference with the partner organization, the Task Order Officer (TOO), invited topic experts, and the McMaster team in order to define the magnitude of the topic addressed and to refine/clarify the preliminary research questions for this evidence report. It was agreed that this evidence report would focus on addressing the efficacy of pharmacotherapies for dementia syndromes. Regular teleconferences were held with the TOO, the partner, and technical experts throughout the data refinement and extraction phase.

Search Strategy

Table 1. Databases searched for relevant RCTs
Database searchedSearch datePeriod searched
Cochrane CentralFebruary 3, 20031st Quarter 2003
MEDLINE® & PreMedline®February 4, 20031998 to 2003 week 4
EMBASEFebruary 6, 20031998 to 2003 week 5
AMEDMarch 4, 20031985 to 2003 February
CINAHL®March 5, 20031982 to February 2003 week 3
AgelineMarch 6, 20031978 to 2002 December
PsycINFOMarch 7, 20031967 to 2002 December
Search strategies were developed and undertaken in the electronic databases listed in Table 1 for the time periods specified. The order of the databases in Table 1 also represents the sequence that the databases were searched. Appendix A details the search terms for all databases.

Expert opinion was sought on the most efficient search strategies to minimize noise in the collection of citations. Some of the medical subject headings (MeSH) used to select RCTs yielded a large number of non-RCT literature due to misclassification of the study design terms. For example, in previous indexing, terms like “longitudinal study” or “comparative study” were applied to RCTs; conversely, the MeSH terms “random” or “randomized” in the title or abstract were not consistently used. However, some recent methodological work has suggested that more specific search term approaches can be used, which increases the sensitivity and specificity of the search results.40 The Cochrane Central Trial Registry contains correctly re-classified RCT/Controlled Clinical Trial (CCT) trials that were misclassified in MEDLINE® and EMBASE from 1966 to 1998. All published RCTs to 1998 are contained within this database. Hence we commenced our search with the Cochrane Central Trial Registry database. For this reason, MEDLINE® and EMBASE were searched from 1998 forward for relevant studies, and all the other databases from their inception.

Specific drug names and manufacturer brands were considered as potential search terms. However, the local research team was in agreement that listing specific drug names would bias the yield to include only those pharmacological agents searched and would not capture newer drug therapies. Thus the recommendation was to not restrict the search to known pharmacological agents but to include whatever agents were in the literature.

In addition to the electronic databases, the bibliographies of retrieved papers were retrieved. Any citations recommended by the local research team, the TEP, or the peer reviewers were retrieved and screened.

Eligibility Criteria

Inclusion. Studies were included that contained the following criteria:

  1. Age: Studies involving dementia patients who were 18 years or older in age

  2. Diagnostic model used to determine dementia: The diagnosis of dementia using any of these criteria:

    1. ICD 9 or 10.41, 42

    2. DSM III, III-R, and IV.43, 44, 45

    3. NINCDS.9

    4. NINCDS-ADRDA9 or NINCDS-AIREN.46

  3. Diagnostic criteria used to determine cognitive impairment (pre-dementia): In the case of not yet diagnosed dementia, specific diagnostic categories were accepted for the following:

    1. mild cognitive impairment (MCI)..47

    2. cognitive impairment not dementia (CIND).48

    3. cognitive loss no dementia (CLoND).49

  4. Disease classifications for dementia: These included AD, senile dementia of the Alzheimer's type (SDAT), Lewy body disease, VaD, multi-infarct dementia (MID), AIDS/HIV dementia, Parkinson's disease dementia (PDD), progressive supranuclear palsy (PSP), mixed diagnosis dementia, encephalopathy, Mesulam syndrome, progressive non-fluent aphasia, Binswanger disease, subcortical leukoencephalopathy, circumscribed lobar brain atrophy, Pick disease, amyloid beta-protein (not Down's syndrome or trisomy), cerebral amyloid angiopathy, neurofibrillary tangles, threads, senile plaques, corticobasil ganglionic degeneration, cerebral autosomal dominant ischemia with subcortical leukoencephalopathy (CADISIL), Huntington's disease with dementia, hydrocephalus (for additional terms used in the search strategy, see Appendix A).

  5. Severity classification: This was accepted in whichever classification system the studies specified. The majority of studies specified threshold criteria using the MMSE as follows: mild > 22, moderate 14 – 21, and 10 – 14 as severe. Many studies used the definition of mild to moderate as a range from 10 to 26 based on criteria established by Folstein et al.50 Some studies specified a category (i.e. mild to moderate) but did not report the baseline MMSE values for the groups compared.

Some studies specified two categories (mild to moderate) and (moderate to severe) based on the DSM-III-R criteria. Cambridge Examination for Mental Disorders in the Elderly (CAMDEX) specifies levels of severity (minimal, mild, moderate, severe). Similarly, some studies reported a category of severity without stating which method was used. In these instances, the category of severity specified was accepted as reported by the study authors.

Exclusion. Studies that had populations with any of the characteristics listed below were excluded.

  1. Dementia disease classification: i) alcohol caused dementia/ Korsakoff's syndrome, ii) Creutzfeldt-Jakob syndrome, c) spongiform encephalopathy, iii) hypothyroidism, iv) vitamin B12 deficiency, v) neurosyphilis.

  2. Dementia diagnosed using only Lowb, Hachinski (specific for VaD) criteria.51

  3. All organically caused dementias which includes “Delirium, Dementia, Amnesic Disorders, and Cognitive Disorder Otherwise Specified. The predominant disturbance is a clinically significant deficit in cognition that represents a significant change from a previous level of functioning. For each disorder in this section, the etiology is either a general medical condition (although the specific general medical condition may not be identifiable) or a substance (i.e., a drug of abuse, medication, or toxin), or a combination of these factors.”43

  4. Temporary dementia (e.g. side effect of anesthesia) classified as follows: Delirium: a delirium is characterized by a disturbance of consciousness and a change in cognition that develop over a short period of time. The disorders included in the “Delirium” section are listed according to presumed etiology: delirium due to a general medical condition, substance-induced delirium (i.e. due to a drug of abuse, a medication, or toxin exposure), delirium due to multiple etiologies, or delirium not otherwise specified (if the etiology is indeterminate).

  5. Normal or healthy volunteers: studies that deal with healthy people (i.e. prevention is limited to people who have any form of the above); volunteer study population

  6. General population of elderly persons.

  7. Study subjects selected for depression (some patients may have dementia but not all) and where there is no stratified analysis by disease subgroup (i.e. the dementia subjects).

Study Design. Eligible studies included parallel design RCTs only. Although crossover trials are suitable for chronic diseases, they may be prone to period effects or period by treatment interactions. Period effects are systematic changes in the outcome that apply to all patients due to temporal changes in the disease or to the measurement instrument. Period by treatment interactions occur when the efficacy of the intervention varies by period. Additionally, a carry-over effect may occur if there is not an adequate washout period. Apart from the weaknesses of this design, some limitations arise when considering the potential for meta-analytic analyses. Traditionally, first period data from a crossover trial are abstracted and can be potentially combined with parallel trials for analyses of a pooled estimate; the reporting of the study results (positive or negative) would also be based on this first period data. In a preliminary phase of the review, several crossover trials were examined. Many did not report first period data, which precluded any potential for combining with parallel trials; many trials also did not undertake statistical tests during the first experience, thus making it difficult to report the direction and significance of the first period findings. Finally, because this systematic review was considering a variety of drug interventions administered over differing time intervals, period effects might be an important source of bias. For all these reasons, the decision was made to exclude crossover trials from this systematic review.

Language of Publication. Studies published in the English language were eligible. The scope and resources of this review did not permit translation of studies published in other languages.

Sample Size. No sample size restrictions were applied.

Treatment Interventions. Drug interventions were eligible in the following manner:

  1. Pharmacological agents: all types of pharmacological treatment were considered in this review, including food supplements (as defined by the FDA). Government approval was not a requirement, and as such, off-label use of drugs (i.e. drugs approved for other conditions but used in the treatment of dementia) were eligible for this review.

  2. Dose: all doses and dosing schedules and any mode of administration (oral, subdermal, transdermal, intravenous, suppository, or intra-muscular injection) were considered.

  3. Treatment period: the period of treatment must equal or exceed 1 day.

  4. Follow-up length: Any duration of follow-up was eligible. Different drugs require different time periods to show an effect. For example, antidepressant and anti-psychotic medications may take a month or more to be effective. Some dementia drugs take a minimum of 2 months. For interventions such as vitamin E or Ginkgo biloba, the time to effect is not well established. Thus, an absolute limit to the minimum number of months of follow-up could not be applied to all potential interventions. It was anticipated that many studies with some of the most recent pharmacological agents (i.e. donepezil) would have a minimum follow-up of 24 weeks.

Study Outcomes. No specific set of commonly accepted outcomes that define efficacy or “clinical relevance” were applicable to all the pharmacological interventions that have been used to treat dementia. The literature evaluating outcome measures in dementia trials would suggest that most instruments have significant limitations or at least more data are required to establish the required properties for acceptability of the scales. Since none of the outcomes used in dementia trials have been accepted as standards (no consensus), the selection of the most appropriate or clinically relevant outcome is purely arbitrary. Similarly, establishing a rationale to exclude studies based on the specific type of outcome measure would be arbitrary. For this reason, no exclusion criteria based on outcome measures were used as eligibility criteria for this study; rather the domains of interest for inclusion have been identified.

Studies with the following outcomes were included:

  • General cognitive function (e.g., ADAS-cog).

  • Specific cognitive function (e.g., Weschler Memory Tests).

  • Global clinical assessment (e.g., CIBIC).

  • Behavior/mood (disturbances characterized by agitation, wandering, sleep cycle disturbance, depression, obsessive compulsive activities) (e.g., Behavioral Pathology in Alzheimer's Disease Rating Scale (BEHAVE_AD)).

  • Quality of life/ADL (e.g., Instrumental Activities of Daily Living (IADL)).

  • Effects on primary caregiver (also referred to as caregiver burden).

  • Safety as measured by the incidence of adverse effects (e.g., particularly serious adverse events).

  • Acceptability of treatment as measured by withdrawal rate from trial due to side effects of the medication. (e.g., dropouts due to adverse events).

  • Mortality.

  • Dependency or Rate of Institutionalization/ or continued residence in own home.

  • Use of services.

Studies with the following outcomes were excluded as follows:

  • Studies which reported only biological/ physiological outcomes, such as plasma levels, changes on functional imaging, or electroencephalography (EEG) activity, were noted but not assessed as efficacy measures.

  • Outcomes reported in the trials should reflect changes in the person with dementia. If the study population did not all have dementia, only data subgrouped for dementia was examined.

Minimum quality threshold score for eligibility.

Exclusion part I: Pre-Jadad score. Studies were also screened to determine a minimum threshold for quality, sometimes described as “fatal flaws” in the trial design. Specifically, all studies had to include at least some mention of the term “randomization” or “withdrawal(s)” in the text of the paper. Trials that did not at least mention these components were excluded, as they possessed a fatal flaw.

Exclusion part II: Post-Jadad score. The methodological quality of the primary studies was assessed using the modified Jadad scale for RCTs52 (Appendix B). The reliability of this modified scale was shown to be high, as measured by the intraclass correlation coefficient (ICC = 0.90).52 Each study was evaluated by two reviewers, and the level of agreement was determined statistically. The first three items on the scale rate elements that have been shown to bias meta-analytic results. These include randomization, blinding, and withdrawal. If these items alone are considered, the maximum score is 5. Any study that did not score 3 or more on the scale was excluded from the review. Therefore, this review abstracts detailed data only from studies that achieved moderate to high ratings on the quality scale.

Evaluating the methodological quality of studies and rating the strength of the evidence.

Quality of the RCT. The methodological quality of the primary studies was assessed using the modified Jadad scale for RCTs.52

Quality of reporting adverse events. The potential for risk, or adverse events, was an important component to consider with respect to efficacy. The Jadad scale for quality does not take into account factors associated with adequate collection and reporting of adverse events as detailed by Ioannidis and Lau (2002).35 Therefore, a summary checklist was developed to determine the potential quality in the collection and reporting of adverse events (Appendix B). This score was used to evaluate the relative quality of the adverse events reported.

Data Collection and Reliability of Study Selection

During the identification phase, two independent reviewers evaluated the title and abstract for eligibility; those meeting the criteria were retrieved as well as those that reported insufficient information to determine eligibility. Two independent reviewers examined the full text of these articles (passing from the title and abstract phase). All studies meeting eligibility criteria were reviewed to assess quality and abstracted according to predetermined criteria. The articles were grouped according to the pharmacological agent used in the intervention.

A team of study assistants was trained in the eligibility criteria for the purposes of this systematic review. Standardized forms and a guide explaining the criteria were developed from previous templates (Appendix B). Two reviewers were used for the identification, selection, validity, and abstraction phases of the systematic review. Disagreements were resolved by consensus. The reviewers were experienced EPC staff with post-graduate training in research methods. The reviewers and abstractors would consult with more senior members of the TEP for content expertise or methods-related issues.

Summarizing Results: Descriptive and Analytic Approaches

It was expected that studies of the pharmacological agents used in the management of dementia would be quite diverse with respect to the intended therapeutic effect. For these studies, evidence and summary tables (Appendix C) were constructed to describe the more salient characteristics of the included studies.

Meta-analysis

Statistical meta-analysis was not appropriate for all outcomes or interventions. Before calculating a pooled effect measure, the reasonableness of pooling was assessed on clinical and biological grounds, in terms of clinical homogeneity. Tabular summaries of key characteristics, participants, interventions, and outcomes were considered. A priori, it was decided that pooled estimates would be undertaken for studies with the same pharmacological intervention and the same outcome measure and that a minimum of three studies was necessary for pooling for a specific outcome*. Consideration was given to the similarity of study populations when selecting studies to be included in the pooled estimates. Although many studies evaluated multiple outcomes, data necessary for meta-analysis were not provided in all eligible trials. When sufficient data were provided to estimate the weighted mean difference (WMD), then a meta-analysis was undertaken. WMD was selected as the pooled estimate (versus the standardized mean difference) because the outcome measures did not differ between studies eligible for pooled estimates. For WMD, the difference between the treated and control groups are weighted by the inverse of the variance.

Analysis was undertaken in RevMan 4.2 (Review Manager, Cochrane Collaboration, 2003), and the random-effects model was used to conduct our analyses. In cases where heterogeneity existed, the results of the random-effects model only were considered for interpretation of the results of the pooled estimate. RevMan 4.2 automatically tests the homogeneity of the results of the individual studies for each comparison of dichotomous or continuous data. Tests of homogeneity are formal statistical analyses for examining whether the observed variation in study results is compatible with the variation expected by chance alone. The more significant the results of the test (the smaller the p-value), the more likely that the observed differences were due to unknown factors likely not controlled for in the study. Sensitivity analysis or meta-regression was not undertaken to assess the extent to which the methodological quality of studies, population characteristics, dose, etc., accounted for variation in the primary outcome.

Power Analyses

Power analyses were conducted for select pharmacological interventions reporting non-significant findings for all primary outcomes reported in the paper. In addition, if the trial reported the outcomes of MMSE, ADAS-cog, or the CIBIC+, the power for these was also estimated. It was assumed that the desired level of significance was set to alpha equal to 0.05. Adequate power was defined as at least 80% power.

Peer Review Process

A list of potential peer reviewers was created at the outset of the study. During the course of the project, additional names were added to this list by the McMaster Center and Agency for Healthcare Research and Quality (AHRQ). In May 2003, the individuals on the list were approached by the McMaster team and asked if they would act as peer reviewers of this evidence report. A total of 26 experts agreed and received a copy of the draft report and a copy of the “Structured Format for Referee's Comments” (Appendix D). A list of the reviewers' names and their affiliation is provided in Appendix D. In addition, a criticism editor, Dr. Patricia Huston, who is external to the McMaster EPC, was asked to review the draft report and synthesize the peer review comments. The report from the criticism editor was then used to prioritize the incorporation of peer review comments into the final version of this evidence report.

Chapter 3. Results

In this chapter, the presentations of the main results of the systematic review are organized according to the five questions that were addressed. The first question, concerning efficacy of the pharmacological interventions, contains results from all eligible studies. Subsets of trials were then selected from this larger set to address the remaining four questions (see Chapter 2 Methods).

Eligible Studies

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-phartrdemf2.jpg.

   Figure 2. Flow diagram showing the final number of studies meeting the eligibility criteria

Figure 2 shows the final yield of eligible studies for evaluation, and the inclusion/exclusion criteria are listed in Chapter 2. Approximately 10.5% of identified studies met the eligibility criteria in the title and abstract phase. Similarly, 14.7% of the full text screened citations were eligible for full data abstraction. Several trials were identified as “companion papers”, indicating that results for these related studies were based on the same study subjects. These related studies were evaluated and a main publication was selected (usually the first chronological publication), and the remaining trials were searched for any additional data for abstraction; the “companion papers” were not considered as unique studies. English-language reports only were included in this review.53 Although this is acknowledged as a possible source of bias, the overall proportion of potentially eligible non-English studies for review in title and abstract was small (7%).

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-phartrdemf3.jpg.

   Figure 3. Proportion of studies as a function of year of publication

Figure 3 indicates the distribution of eligible studies as a function of publication year grouped into approximately 5-year intervals. The largest proportion of studies (83%) was published within the last 11 years, with the greatest number from 1997 forward. This may have some implications for future systematic reviews with respect to the years searched.

This systematic review yielded a total of 97 pharmacological agents used in the treatment of dementia from 186 unique studies. These 97 interventions have been classified according to three broad categories: 1) cholinergic neurotransmitter modifying agents, 2) non-cholinergic neurotransmitter/ neuropeptide modifying agents, and 3) other agents.

Question 1: Does pharmacotherapy for dementia syndromes improve cognitive symptoms and outcomes?

The largest number of eligible citations evaluated was cholinergic neurotransmitter modifying agents (n = 72). The remaining citations were distributed amongst the non-cholinergic neurotransmitter/neuropeptide modifying agents (n = 61) and other agents (n = 76) categories. Some studies evaluated agents in more than one category. The results for all pharmacological agents are presented in this chapter in summary format with descriptive text and an overall summary table (OST) for each drug located at the end of this chapter. The specific details abstracted from each individual study are presented within Appendix C (guide to the results tables) and organized into these same three therapeutic effect classification groups and by pharmacological agent. These Evidence Tables are available on-line at http://www.ahrq.gov/clinic/epcindex.htm.

Appendix C contains three sets of tables with key study descriptors as follows:

Key characteristics. Summarizes the following aspects of each study: features (author, year published, funding source, modified Jadad scale quality score, number randomized, number completing the trial, subgroup analysis), population characteristics (diagnosis, criteria for diagnosis, disease severity, percent male, age, dwelling, and differentiating demographics), intervention (doses, titration scheme, and intervention period), and a complete list of outcomes administered in the study protocol.

Study results. Details the changes observed (the magnitude of theses changes, the comparison groups analyzed, and the findings of any statistical testing) for those outcomes for which appropriate data was reported (for up to three time periods if available). When reported in studies, baseline measures, particularly MMSE score, were also detailed in these tables.

Study adverse events. Lists the specific types of adverse events (side effects, adverse reactions, and serious events) reported, any statistically significant differences between groups, the proportion of withdrawals due to adverse events, and the quality rating score (based on a checklist devised at the McMaster EPC and on the work of Ioannidis and Lau (2002)35) specific to the collection and reporting of these adverse events.

Interpretation of the Results in the Overall Summary Tables (OST) for Individual Studies

Table 9. Guide to Overall Summary Tables - Outcome Measures Classified by Domain
General cognitive function measureSpecific cognitive function measureGlobal AssessmentBehavior/MoodQuality of Life/ADL/FunctionCaregiver BurdenOther
ADAS-Cog (also ADAS-11)ACPTR-AVLADASABIDABSCATSCAUST
AMTSBabcock Story recallRMADCS-CGICABSRADCS-ADLCSSSAS
BCRSBarbizet VisuospatialRPMADSACESADFACSCSIAIMS
CamCOGBLMRey Memory TestAGS-EADAS-Non-cogADLSCBBARS/ BAS
CASIBNTSet testBf-SAFBSADL-CERP
CETMBSRTSnodgrass Picture Naming TaskBGPBDIADL-PDSESRS
IQCODEBSVSRT-DRBlessed-D/ BDRSBEHAVE-ADBIFinger Tapping
MCPTBVRSWFITCAPEBPRSDependency ScaleTest
MMSECCASSSSWFTCDR-NHBRMSDADSAS
MMMSECategory FluencySKTCDR-SBBRDSFASTUPDR
SMMSECDTTKCGAECERAD-BRSDFIM
CMMSECNTBTMTCGICMAIIADL
MQControlled Challenge Word AssociationWMS (MQ)CGICCS or CSDDIDDD
RMTCOWATWMS-RRCGRSDSCSNAA
RVMCVLTZVTCIBICDSSNAI
SIBDigit Span TestCICIC+Facial BehaviorOARS-ADL
SMQDSSTDBDSGSPDS
SMSTEFRDMRHAM-APSMS
TP, TPATFCMTDRSHAM-DPSQI
WAISFIGTEISHDRSQoL
FOMFCCAHDS-RQoL-P
GAGSFRSIPSC-EQoL-C
Grooved Pegboard TestGERRILPRSSF-36
Letter CancellationGBSMAACL-RSIP
Letter FluencyGDSMADRSTime to functional decline
LMTGISMOSES
LNNBGPI-ENAB
MAEHDSNMS
MEMTHISNOSGER-IADL
MNLTMAC-FNPI-NH
NCTNOSGERNSL
NDTNOSIEOAS
NLTNPI (NPI-4, NPI10)PANSS-EC
NMICPDRSPOMS
NSTPGIRRMBPC
OLTPlutchik CGSRPT
OMDRRAGSSBI
RGRSSHGRT
SCAGSRT
Stockton GRSVHB
TSI
VRGI
To facilitate the presentation of information within the OST (found at the end of this chapter), the outcomes reported in eligible studies were classified into seven domains: 1) general cognition scales, 2) specific cognition tests (neuropsychological tests evaluating specific attributes of cognition, such as short- and long-term memory, word fluency, etc.), 3) global assessment, 4) behavior/mood, 5) quality of life/ADL, 6) caregiver burden, and 7) other. The EPC research team reached consensus on the classification of the various outcome measures within these seven domains. For example, the ADAS-cog and MMSE were classified as “general cognition scales”, and the BEHAVE-AD and NOSGER were placed in the “behavior and mood” domain (see Appendix C guide to the results tables). The complete list of outcomes that were reported in the studies evaluated in this review and the domains that they were classified within is found in Appendix E. Table 9 in the report presents a guide to the overall summary tables by domain.

For each of the outcomes reported by a study, four interpretations of the results were possible. The four options for interpretation are as follows:

SC = significant change. Demonstrated by statistical significance (p ≤ 0.05) for the primary outcomes from an ITT analysis comparing treatment and placebo groups, or comparing differences among dose groups.

NS = not significant. The corollary of SC indicating no statistical significance.

MX = mixed results. Primary outcomes within the same domain show opposite or inconclusive statistical significance; for example, in the general cognition domain, half the studies show significant change and the other half show no significance).

NR = not reported. Outcome was collected but not statistically evaluated or not reported in the publication.

NT = not tested. No outcomes in this domain were tested.

Secondary outcome results were reported in the absence of any primary outcome data (for the domain of interest) and were demarcated with a (2°) in the OST. Similarly, analyses other than ITT were denoted with an asterisk (*) in the OST. If the report describes my subgroup analyses, the word SUBGROUP appears in the “other” column.

Adverse events were not always clearly described in many studies. A priori, we selected 5 generic symptoms (nausea, dizziness, agitation, eating disorder, and diarrhea) and selected to detail the ranges amongst studies for both placebo and treatment groups for these symptoms. The percent of withdrawals for both groups due to adverse events was reported. Adverse events reported to be statistically significant are highlighted for the reader. The details in addition to the quality score rating will assist the reader in evaluating the potential for harm.

Statistical Analysis

Power Analyses and Measures of Effect for combined studies. Power analyses (PW) were for individual trials for select pharmacological interventions (donepezil, galantamine, tacrine, rivastigmine, memantine, estrogen, carnitine, ginkgo biloba, selegiline) for all primary outcomes. In addition, if the trial reported the outcomes of the MMSE, ADAS-cog, or CIBIC+, power was also estimated (for individual trials of pharmacological interventions that had a minimum of three trials with a common outcome). Quantitative meta-analyses were undertaken in interventions that had a minimum of three trials using the same outcome scales and which provided sufficient data to permit calculation of effect sizes (as an Odds Ratio (OR), Relative Risk (RR) and Weighted mean difference (WMD)). The random-effects model results are presented to the reader.

Quantitative and Descriptive Analyses

Results of cholinergic neurotransmitter modifying agents (CNMA)

Table 2. List of Cholinergic neurotransmitter modifying agents and the number of studies vs. placebo for each of these. Asterisk (*) indicates report of a drug vs. drug trial [comparator drug(s) in brackets]
DrugNumber of studies vs. placebo
Antagonic Stress *[Meclofenoxate] *[Nicergoline]0**
Carnitine6
Donepezil *[Vitamin E]10*
Eptastigmine2
Galantamine6
Huperzine-A *[Tablet Capsule]1*
Linopirdine2
Mexofenoxate *[Antagonic stress]0*
Metrifonate9
Nicergoline*[Antagonic Stress]4*
Physostigmine4
Posatirelin *[Citicoline]4*
Rivastigmine6
Sabeluzole1
Tacrine *[Idebenone] *[Silymarin]6**
Velnacrine3
A total of 70 studies evaluating 16 cholinergic neurotransmitter modifying agents were eligible for review (Table 2). Six studies directly compared different drugs, and these trials are considered separately in the section that addresses question three. Overall results for each of the trials for each of the interventions are detailed in the OST located at the end of this chapter and organized by drug. All other study details are available in Evidence Tables 1 through 93 in Appendix C.

Carnitine (also known as acetyl-L-carnitine, gamma-trimethyl-β-acetylbutyrobetaine (Alcar). See Evidence Tables 1 through 8 at http://www.ahrq.gov/clinic/epcindex.htm.

Number of studies. A total of six studies54, 55, 56, 57, 58, 59 evaluating carnitine were included in this review. Four of the reports were published from 1990 to 199256, 57, 58, 59 while the remaining two, both by same authors, were published in 1996 and 2000.55, 54

Design/methodology. A total of 925 subjects were evaluated in these six studies comparing carnitine and placebo. The range of study sample sizes was from 30 to 431 subjects. Quality scores (out of 8 points) ranged from moderate57 (5) to high58, 59 (7), and all of the studies were partially or totally funded by industry.

Populations. All trials were conducted on AD patients, and all but one study used the NINCDS criteria for diagnosis. None of the trials reported including patients with severe dementia; all were classified as mild to moderate.

One trial58 had a mix of community and institutional patients, and one study reported using a community sample.56 The mean age of the samples ranged from 5954 to 79 years,59 with the majority reporting mean age greater than 70 years. One study56 did not report mean age. Three studies specified the baseline MMSE54, 55, 57 (range 16.1 to 20.6) and one trial specified the modified MMSE56 (mean 35) demonstrating no differences between placebo and treatment groups.

Intervention. The dose varied from 2 to 3 grams per day, and treatment duration was either 24 weeks56, 57, 59 or 52 weeks.54, 55, 58 One study57 did not report the dose used. No titration period was used for this drug in any of the studies.

Primary outcomes. All of the studies measured cognition as a main outcome; half of the trials also measured outcomes in the behavior/mood domain.58, 54, 55 All but one of the studies58 used a quality of life/ADL or functional status measure. Only one study55 evaluated caregiver burden. Of the four studies59, 58, 54, 55 reporting the findings from a global measure, only one55 used the Caregiver-rated Global Impression of Change (CGIC).

Analysis. Half of the studies reported ITT analyses58, 54, 55 and the remaining trials reported OC results.57, 59, 56 The ability to combine results was limited with only three studies having the common outcomes of ADAS-cog, modified MMSE (mMMSE), and Clinical Dementia Rating (CDR).

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-phartrdemf4.jpg.

   Figure 4. Weighted Mean Difference (WMD) from the Random Effects Model (Random) for the MMSE comparing carnitine versus placebo

Results and interpretation. See Summary Table 1. The four studies that evaluated “general cognitive function” did not find statistically significant differences in this domain. For those trials that provided sufficient data to estimate power, three trials54, 55, 56 were underpowered for the MMSE (PW = 0.15 to 0.19), and two trials54, 55 for the ADAS-cog (PW = 0.08 to 0.09) and the CDR (PW = 0.06 to 0.11). Meta-analysis was undertaken for the MMSE scores; although favoring a treatment effect, the pooled effect size (WMD = 0.55) was modest and zero was contained within the confidence interval (Figure 4) for the random effects models. The pooled estimate favoring treatment may suggest some potential for benefit in general cognitive function, but this must be verified in future research.

For “specific cognitive tests”, two out of four studies did not detect statistical differences relative to placebo, and the remaining two showed mixed results (see Summary Table 1). No significant differences were found in the domains of global assessment, behavior/mood, and quality of life/ADL; power could not be evaluated for the majority of these outcomes in the trials (insufficient data reported to permit calculation).

Four57, 59, 56, 58 of the six trials scored 3 out of 5 on our quality scale for rating adverse events. Two trials55, 54 did not adequately report adverse events (score = 1), but tested for statistical differences between groups. Withdrawal rates due to adverse events ranged from 0 – 3% in all studies, with the exception of a single trial59 where the percentage was 22% (placebo) and 44% (treatment). The high rates in this trial are likely related to the small sample size (n = 36). This same trial59 was also the sole study reporting dizziness and anxiety (confusion, depression). In general, gastrointestinal symptoms (Evidence Table 8) were the most frequently reported adverse events, but most studies did not test for statistical differences in the rates between the groups. The percent of subjects reporting of a priori selected symptoms across all studies are as follows: 1) nausea (placebo = 6 – 14%, all doses carnitine = 28%), 2) dizziness (not reported as an event for either placebo or treatment group), 3) agitation (placebo = 6%, all doses carnitine = 7%), 4) diarrhea (not reported as an event for either placebo or treatment group), and 4) eating disorder (not reported as an event for either placebo or treatment group).

Donepezil. See Evidence Tables 9 through 21 at http://www.ahrq.gov/clinic/epcindex.htm.

Number of studies. A total of 11 studies60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70 evaluating donepezil were eligible for this systematic review. One study 70 compared donepezil to vitamin E rather than placebo. All were published within the last 6 years (n = 1, 1996), (n = 2, 1998), (n = 1, 1999), (n = 5, 2001), (n = 2, 2002). Three of these studies66, 67, 69 were undertaken by the same research group at different time periods and had unrelated cohorts of patients.

Design/methodology. A total of 3239 subjects (range of study sample size, 30 – 893) were included in these trials. The modified Jadad scale quality scores ranged from 561 to 8.63, 64 All studies were funded by industry sponsors with the exception of a single trial70 that did not specify their source of support.

Populations. All but one study60 used the NINCDS criteria to diagnose dementia. Eight studies included only AD patients, one study included only VaD,68 and the remaining two included Parkinson's disease dementia (PDD65) and AD patients with cardiovascular disease.64 A single trial included subjects with Down's syndrome and AD.60 The severity of the dementia patients was described as mild to moderate in five studies,65, 66, 60, 62, 70 mild to moderately severe in two studies,69, 67 probable in two studies,61, 68 and moderate to severe in two studies.63, 64

Some studies specified that the dementia patients were recruited from the community,68, 63, 60 one study from institutional setting,64 and the remaining did not specify the living arrangements. Mean ages of the study subjects ranged from 54 to 85.7 years with most studies representing ages in the upper to mid 70s.

Six studies 61, 62, 65, 66, 67, 69 specified the race of the subjects, and of these, the overwhelming sample was Caucasian (range from 92 – 100%). All but one study68 specified the proportion of men recruited, the range being from 18 – 46%. Four of these studies presented some results stratified by gender,62 age,64 APOE genotype,62 baseline MMSE,63, 64 patients with Down's syndrome,60 and the use of psychoactive drugs.63 Three studies specified the baseline MMSE61, 64, 70 demonstrating no differences between placebo and treatment group, and the mean values varied from 14 to 16.

Intervention. Five studies evaluated a 10 mg dose given once daily,60, 61, 62, 63, 66 two studies 5 mg daily,64, 69 and four studies compared 5 mg and 10 mg dose groups.65, 68, 67, 70 Titration periods observed included 7 days65, 67 and 4 weeks,70, 68 these were not specified in the remaining studies. The total duration of the drug (including titration) varied from 12,65 15,67 23/24,60, 63, 64, 65, 66, 68 and 54/5661, 62 weeks.

Primary outcomes. Specific cognitive tests and caregiver burden were not evaluated in these studies. Nine studies used the MMSE, and six studies the ADAS-cog.

Analysis. All but one of the studies60 comparing donepezil to placebo used ITT analysis. The study using OC analysis showed no statistical difference between treatments. It had the smallest sample size of 30 subjects and was underpowered (PW = 0.16) for the behavioral measure used, NPI.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-phartrdemf5.jpg.

   Figure 5. Weighted Mean Difference (WMD) from the Random Effects Model (Random) for the MMSE comparing donepezil and placebo

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-phartrdemf6.jpg.

   Figure 6. Weighted Mean Difference (WMD) from the Random Effects Model (Random) for the ADAS-cog comparing donepezil versus placebo

Results and interpretation. See Summary Table 2. For the 10 trials60, 61, 62, 63, 64, 65, 66, 67, 69, 68 comparing donepezil to placebo, two studies60, 64 did not show a positive effect for the domain of general cognition. However, both of these studies evaluated the outcomes in this domain as secondary outcomes, with one trial64 lacking sufficient power for the MMSE (PW = 0.69); for the other trial, power could not be evaluated. For the eight trials61, 62, 63, 64, 66, 69, 67, 68 showing a positive effect on general cognition, all but one trial used the MMSE as an outcome, which allowed for a pooled effect size estimate (Figure 5); we assumed that the VaD patients in one trial65 could be combined with the other dementia populations. Figure 5 shows a consistent treatment effect for improvement in general cognitive function as measured by the MMSE, and the overall effect was statistically significant. Figure 6 shows the four trials65, 66, 67, 68 that used the ADAS-cog to measure general cognitive function change. A consistent effect favoring treatment was evident, and the test for overall effect was statistically significant. It should be noted that some of the values used in the pooled estimates for the MMSE and the ADAS-cog were derived from figures showing means and confidence intervals in the trial reports, thus introducing some imprecision into these estimates.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-phartrdemf7.jpg.

   Figure 7. Weighted Mean Difference (WMD) from the Random Effects Model (Random) for the CIBC+ (continuous data) comparing donepezil versus placebo

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-phartrdemf8.jpg.

   Figure 8. Relative Risk (RR) from the Random Effects Model (Random) for the CIBIC+ (dichotomous data probability of improving) for a 5 mg dose of donepezil

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-phartrdemf9.jpg.

   Figure 9. Relative Risk (RR) from the Fixed Effect Model (fixed) for the CIBIC+ (dichotomous data [improved versus not]) for a 10 mg dose of donepezil

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-phartrdemf10.jpg.

   Figure 10. Weighted Mean Difference (WMD) from the Random Effects Model (Random) for the Clinical Dementia Rating (CDR) comparing donepezil versus placebo

Ten studies60, 62, 63, 64, 70, 65, 66, 67, 69, 68 evaluated global assessment, and with the exception of three trials,60, 69, 70 all studies showed a statistically significant difference in this domain. The overall effect for the CIBIC (Figure 7), and the CIBIC + (Figure 8, expressed as a proportion of improved versus not improved) were estimated for the 5 mg dose of donepezil. Figure 9 shows the summary estimate for the three studies65, 66, 68 that evaluated the 10 mg dose of donepezil; heterogeneity was significant (p = 0.007) in this meta-analysis, but the overall effect was significant (p = 0.002). Similarly, Figure 10 shows the summary estimate for the Clinical Dementia Rating (CDR) global assessment measure. A radial plot of these three studies was undertaken and suggests that one trial64 could be an important source of the heterogeneity. Summary estimate was also calculated for the Neuropsychiatric Inventory (NPI), which was classified in the behavior domain. It was noted that two of the three studies that evaluated this outcome were lacking sufficient power60, 64 (PW = 0.11, PW = 0.16). The test for heterogeneity was significant but the test for overall effect was not. Thus, the results of the summary estimates for the NPI outcome are problematic. Two global assessment outcomes, the CIBIC+ and the CDR, show a consistent effect favoring the drug treatment at 5 mg; the evidence is inconsistent for the 10 mg dose.

Five of the eight studies60, 61, 63, 64, 65, 66, 67, 69 measuring Activities of Daily Living (ADL) found a significant difference in the various outcomes used to assess ADL, but none of these could be combined into a summary estimate. It should be noted that the majority of trials selected these ADL variables as secondary outcomes. Behavior outcomes were not significant or showed mixed results for the three studies that evaluated this domain but these lacked sufficient power. Only one study collected caregiver stress and health service utilization outcomes63 but did not report these data.

Quality scores for reporting adverse events varied from 1 to 4 but the majority of trials scored 3 or greater (n=7). One recently published study68 scored 1, with no events detailed. Withdrawal due to adverse events ranged from 0 – 18% for treatment groups and 0 – 11% for placebo (see Evidence Table 21). Four studies65, 68, 67, 66 were able to demonstrate a dose effect, with increasing frequency of events as dosage increased. One study62 reported significant differences between treatment and placebo for balance problems and asthenia (neurological fatigue). Fatigue was shown to be significant in two other studies.61, 66 Four 65, 61, 66, 67 out of six studies testing differences between groups were statistically significant for diarrhea (placebo = 3 – 21%, all doses donepezil = 0 – 38%), nausea and vomiting (placebo = 4 – 9%, all doses donepezil = 4 – 25%). The other a priori symptom reported was agitation and frequencies for placebo varied from 0 – 8% and for all doses from 3 – 19%; but these were not shown to be statistically different. No serious adverse events requiring hospitalization were reported or shown to differ statistically between groups.

Galantamine. See Evidence Tables 22 through 29 at http://www.ahrq.gov/clinic/epcindex.htm.

Number of studies. Six studies of galantamine71, 72, 73, 74, 75, 76 were included in this review. All compared galantamine with placebo and were published between 2000 and 2002 (from six different authors).

Design/methodology. The sample sizes for subjects ranged from 28573 to 97874 with 3530 subjects evaluated in total. All quality scores were high, with either 7 or 8 on the Jadad scale. Funding sources for these studies varied; one study did not report funding source,74 one was funded by a non-industry source,75 one was partially funded by industry,71 and three were funded by industry.72, 73

Populations. All but one of the studies71 included AD patients only, and this single study mixed VaD and AD patients. All subjects in these studies were classified as mild to moderate.

One study specified that the subjects were from the community.73 All studies included from 36 – 53% male subjects, and the mean age ranged from 72.2 to 76.8 years. Three studies71, 74, 76 specified race, and with the largest proportions being white (range from 91.5 – 99.9%). Two studies evaluated subgroups, based on baseline MMSE75 and APOE genotype.75, 76

Intervention. All studies had a titration period, starting at 4 mg per day71, 73 or 8 mg per day.72, 74, 75, 76 Four studies increased the dose weekly,71, 72, 75, 76 and one study increased every 2 to 3 days.73 All studies had a treatment dose of 24 mg per day. Three studies73, 75, 76 included other doses ranging from 16 mg per day to 36 mg per day. Trials were undertaken for a minimum of 3 and maximum of 6 months.

Primary outcomes. All domains were measured except for specific cognitive tests and caregiver burden. All studies used the ADAS-cog and CIBIC or CGIC measures as primary outcomes. None reported baseline mean MMSE values.

Analysis. All but one71 of the studies reported ITT analysis, and the results of this study did not differ from the others.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-phartrdemf11.jpg.

   Figure 11. Weighted Mean Difference (WMD) from the Random Effects Model (Random) for the ADAS-cog comparing galantamine at 24 mg dose versus placebo

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-phartrdemf12.jpg.

   Figure 12. Weighted Mean Difference (WMD) from the Random Effects Model (Random) for the ADAS-cog comparing galantamine at 32 mg dose versus placebo

Results and interpretation. See Summary Table 3. Five of the six trials71, 72, 73, 74, 75, 76 that evaluated general cognitive function showed a significant effect. One trial73 showed mixed effects with the ADAS-cog showing some improvement at the 24 mg but not the 32 mg dose level . Figures 11 and 12 show the pooled estimate for the ADAS-cog for five studies for 24 and 32 mg doses; one trial71 was excluded from the pooled estimate as the population of this study was thought to be a source of heterogeneity. However, the test for heterogeneity for the 24 mg dose (Figure 11) was significant (p = 0.001) despite omitting this study, but the overall effect was significant (p = 0.0005); this estimate should be interpreted with caution. The pooled estimate for the 32 mg dose (Figure 12) showed a consistent effect favoring treatment and was significant (p < 0.00001).

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-phartrdemf13.jpg.

   Figure 13. Relative Risk (RR) from the Random Effects Model (Random) for the CIBIC comparing galantamine at 24 mg dose versus placebo

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-phartrdemf14.jpg.

   Figure 14. Relative Risk (RR) from the Fixed Effects Mode Fixed l for CIBIC comparing galantamine at 32 mg dose versus placebo

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-phartrdemf15.jpg.

   Figure 15. Weighted Mean Difference (WMD) from the Random Effects Model (Random) for the DAD comparing galantamine at 24 mg dose versus placebo

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-phartrdemf16.jpg.

   FIgure 16. Weighted Mean Difference (WMD) from the Random Effects Model (Random) for the DAD comparing galantamine at 32 mg dose versus placebo

All studies evaluated global assessment with the CIBIC+ with one exception.73 All studies with the exception of this study showed a significant difference between placebo and treatment for both the 24 and 32 mg doses. Figures 13 and 14 show the pooled estimates for the CIBIC+ for these two dosages and suggest an overall effect size of equivalent magnitude for either dose. Similarly, all studies evaluated quality of life/ADL with a variety of different outcome measures; four studies71, 72, 74, 75 showed statistically significant differences between groups, and two studies did not.76, 73 Figures 15 and 16 shows the results of pooling the estimates for the Disability Assessment for Dementia (DAD)77 outcome in those studies that provided sufficient data. In this instance, we included the trial with mixed dementia populations71 to have a minimum of three studies required for a pooled estimate. A consistent, statistically significant effect favoring treatment is evident; the higher dose of 32 mg shows a slightly larger effect size relative to 24 mg. Two of the three studies that reported on behavior/mood outcomes showed statistically significant differences (Summary Table 3).

Five71, 73, 74, 75, 76 of the six trials scored 3 out of 5 on our quality scale for rating adverse events. One trial72 scored 4. Withdrawal rates due to adverse events ranged from 4 – 9% for placebo and 8 – 27% for the treatment group. One study73 showed a dose response for adverse events. Although, most trials did not report testing for differences between groups, two trials76, 75 reported a statistical significant difference in weight loss between the placebo and treatment group. Statistical differences for aberrant hematology were not significant in any of the five studies that evaluated this (Evidence Table 29). The most common types of adverse events reported were gastrointestinal symptoms (nausea and vomiting, diarrhea), eating disorders/weight loss, and dizziness (four studies, see Evidence Table 29). The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea and vomiting (placebo = 3 – 13%, all doses = 6 – 44%), 2) dizziness (placebo = 3 – 11%, all doses = 4 – 19%), 3) diarrhea (placebo = 2 – 10%, all doses = 4 – 19%), 4) agitation (placebo = 1 – 9%, all doses = 6 – 15%), and 5) eating disorder (placebo = 0 – 6%, all doses = 4 – 20%).

Metrifonate. See Evidence Tables 30 through 40 at http://www.ahrq.gov/clinic/epcindex.htm.

Number of studies. Nine studies78, 79, 80, 81, 82, 83, 84, 85, 86 were eligible for this systematic review. Studies were published from 1996 to 1999, and all studies compared metrifonate to placebo.

Design/methodology. Overall, 2759 subjects were evaluated. Sample sizes randomized ranged from 27 to 605 subjects. Quality of the studies ranged from 580 to 8,81 with the majority of studies earning 6 points.82, 83, 79, 85, 78 Two studies were non-industry-funded78, 79 and one did not specify the source of funding,80 while the other six studies were partially or wholly funded by industry.

Populations. The subjects in all included trials were classified as having mild to moderate AD. Not all trials specified the source of recruitment or the racial composition of subjects. Three studies specified a community sample,86, 79, 85 and three trials reported the racial composition l,83, 86, 84 which was greater than 90% white in all cases. Mean age for all of the studies ranged from 71.4 to 75.0 years, with one study not reporting the mean age.80

Intervention. All but one study86 reported the loading dose, which varied from 0.5 mg per kg to 5.0 mg per kg. Following this initial loading period, the maintenance dose varied from 0.65 mg per kg to 4 mg per kg and 50 mg per day. The duration of the study treatments varied from 21 days to 26 weeks.

Primary outcomes. All outcome domains were evaluated with the exception of caregiver burden. ADAS-cog, CIBIC+, and MMSE were most frequently used as outcomes.

Analysis. Four trials reported OC analyses78, 79, 80, 85 and the remaining reported ITT analyses.

Results and interpretation. See Summary Table 4. A consistent positive change in cognitive function was found in all studies that reported this outcome (n = 8). One study85 tested cognitive function, global assessment, and behavioral outcomes and reported the baseline endpoint scores but did not test for differences between treatment and placebo groups. Four of the eight studies reporting global assessment outcomes showed statistically significant differences between groups, and two83, 86 showed mixed results. The remaining two trials showed no significant results for global assessment, but they were secondary rather than primary outcomes. All studies that evaluated behavior/mood and quality of life/ADL outcomes (with the exception of one trial86) showed no significant findings or mixed findings; it should be noted that all were secondary outcomes. There were not enough similar outcomes reported to complete a pooled analysis for metrifonate.

With the exception of a single study, quality scores for reporting adverse events were greater than 3 and generally well reported. However, only one trial 83 tested for differences between groups and found nausea and vomiting, diarrhea, and muscle and joint disorder to be significantly different. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea and vomiting (placebo = 3 – 14%, all doses = 2 – 50%), 2) dizziness (placebo = 1%, all doses = 3 – 4%), 3) diarrhea (placebo = 4 – 14%, all doses = 11–19%), 4) agitation (placebo = 2 – 14%, all doses = 8 – 33%), and none reported eating disorder as an adverse event. Withdrawal rates due to adverse outcomes varied from 0 – 9% for placebo and 0 – 12% for treatment groups. Some studies indicated arrhythmia80, 82, 83, 85 and hypotension82, 85 and hematological abnormality82 but did not test for differences between groups. The majority of studies reported that laboratory tests including liver function and hematology were within normal limits. Overall, it was difficult to determine which types of adverse events reported had the potential to cause serious harm. This is some concern as metrifonate is no longer used as a therapy for dementia due to its potential for serious adverse events that include: respiratory paralysis, bradychardia, severe leg cramps and dyspepsia.87

Nicergoline. See Evidence Tables 41 through 47 at http://www.ahrq.gov/clinic/epcindex.htm.

Number of studies. Four studies88, 89, 90, 91 compared the effect of nicergoline to placebo, and one study,92 published in 1994, compared nicergoline to antagonic-stress. The four placebo comparison studies were published in 1995,91 1997,89, 90 and 2001.88

Design/methodology. Sample sizes in the controlled studies varied from 108 to 346, with the total number of subjects included totaling 705. The drug versus drug study had only 62 subjects. The placebo trials all had quality scores of 6 points, while the non-placebo trial had a quality score of 5. Funding sources were reported only in two studies,88, 90 and both were industry-funded.

Populations. These studies had a very mixed population of dementia patients. Two included AD only,88, 92 one trial MID only,89 one trial included both senile dementia of Alzheimer type (SDAT) and MID,91 and one trial included PDD, VaD, and mixed dementia.90 All subjects had mild to moderate dementia. Studies included 38 – 55% male subjects; one study91 did not report the gender proportions. Mean age of subjects ranged from 69.3 to 73.7 years with one study not reporting91 this value. One study91 compared SDAT patients to MID patients.

Intervention. All trials versus placebo used 60 mg per day, but duration varied from 2 months,91 6 months,88, 89, 92 and 12 months.90

Primary outcomes. Caregiver burden was the only domain not evaluated by at least one of the studies. Three trials89, 90, 91 specified baseline MMSE, and this varied from 20 to 22.

Analysis. Two of the trials reported OC analyses90, 91 and two reported ITT analyses89, 88; the trial comparing nicergoline to antagonic-stress presented OC analysis only.

Results and interpretation. See Summary Table 5. There was a consistent positive effect for improvement in general cognitive function as all four studies showed a statistically significant difference. The evidence for benefit in the global assessment domain is inconclusive as only two of the trials89, 90 found significant differences and two trials88, 93 had mixed results (see Summary Table 5). Two trials91, 88 measured behavior/mood and found no significant difference. A single trial88 evaluated quality of life/ADL; although two outcomes in this domain were used (both as secondary measures), none was significant relative to placebo. There were not enough similar outcomes reported to complete a pooled analysis for nicergoline.

Quality scores for reporting adverse events varied from 2 to 5 for these four trials, and none tested for differences between groups. Withdrawal due to adverse events varied from 0 – 8% for placebo and 0 to 9% for the treatment group. The trial with the lowest quality reporting score 89 reported the most number of different events (up to 23 event types). With the exception of headache, which was reported in all four trials, it was difficult to determine which types of adverse events most characterized exposure to this pharmacological agent. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 3%, all doses = 3%), 2) dizziness (placebo = 1 – 2%, all doses = 0% or not reported), 3) diarrhea (placebo = 2 – 6%, all doses = 2 – 4%, 4) agitation (placebo = 5%, all doses = not reported), and none reported eating disorder as an adverse event.

Physostigmine. See Evidence Tables 48 through 53 at http://www.ahrq.gov/clinic/epcindex.htm.

Number of studies. Four studies were eligible for our review,94, 95, 96, 97 all comparing physostigmine to placebo only. The studies were from 1996, 1999, and 2000 with two of the studies being by the same author.95, 96

Design/methodology. Sample size ranged from 17697 to 47596 with an overall total of 1198 subjects. Quality scores were 5,96 6,94, 95 and 797 out of 8 possible points. Two studies were industry-funded,97, 96 one was partially funded by industry,95 and one did not report funding source.94

Populations. All subjects had a diagnosis of mild to moderate or probable AD. Only one study96 reported that all subjects were drawn from the community. Mean age ranged from 68.6 to 73.4 years, and the proportion of male subjects varied from 39.8 – 63%.

Intervention. Treatment schedules varied across the studies. One study94 used a patch (30 and 60 mg), one trial used 30 mg (15 mg twice daily),97 one trial had a washout and titration every 3 weeks to 30 or 36 mg per day,96 and another trial titrated weekly to 15 mg twice daily.95 Duration of treatment ranged from 6 weeks95 to 24 weeks.94, 96

Primary outcomes. All studies used the ADAS-cog as a primary outcome, and none reported caregiver burden or behavior/mood measures. No studies reported baseline MMSE scores.

Analysis. All but one trial94 used ITT analysis.

Results and interpretation. See Summary Table 6. Although all four trials measured general cognitive function, only three reported the results. All of these were statistically significant (see Summary Table 6) for the ADAS-cog, with change scores varying from 0.95 to 2.9 (change from baseline). Two trials 95, 96 found significant change for global assessment outcomes; the remaining two showed mixed results97 and non-significance.94 Behavior/mood was only measured in one trial,94 and the effect was not reported. Three of the trials included measures of quality of life/ADL as secondary outcomes95, 96, 97 and all found no significant difference from placebo but these were secondary outcomes and may reflect a lack of power. There were not enough similar outcomes reported to complete a pooled analysis for physostigmine.

The quality scores for reporting adverse events were generally low, scoring 1 or 2 out of 5. Withdrawal rates due to adverse events varied from 1 – 5% for placebo and 12 – 55%in the treatment group, with one study97 not reporting rates. The high withdrawal rates were in studies with sample sizes that varied from181 to 475 subjects. A single study97 tested for differences between groups, and found that dizziness, tremor, weight loss, asthenia (varying from 6 – 22% for all doses), confusion, delirium, and respiratory problems were significantly different. The cluster of reported types of adverse events suggests that gastrointestinal problems (abdominal pain, diarrhea) (placebo = 1 – 9%, all doses = 13 – 28%), nausea and vomiting (placebo = 1 – 9%, all doses = 9 – 75%) and eating disorder (placebo = 2 – 6%, all doses = 5 – 16%) were most frequently reported. Dizziness (placebo = 4 – 13%, all doses = 11 – 38%) and agitation (placebo = 6 – 16%, all doses = 4 – 8%) were also reported. No events deemed serious enough for hospitalization were reported.

Posatirelin. See Evidence Tables 54 through 59 at http://www.ahrq.gov/clinic/epcindex.htm.

Number of studies. Four studies98, 99, 100, 101 compared posatirelin to placebo, and one of these 101 also compared it to citicoline. One study was published in each of the years from 1995 to 1998, and all studies were conducted in Italy. Two studies were by the same author.100, 101

Design/methodology. Populations randomized in the studies varied from 136100 to 36099 with a total of 931 in all trials. Quality scores ranged from 598 to 7100 out of a possible 8 points. Three of the four studies did not report the source of their funding, but one trial100 reported partial funding by industry.

Populations. No two studies included exactly the same populations; one trial had only AD,101 one trial had only VaD,100 one trial had mixed AD and VaD,99 and another trial had mixed AD, VaD, and PDD.98 This latter trial98 compared populations in a subgroup analysis of AD versus VaD. All studies evaluated populations with mild to moderate disease.

The mean age of the subjects ranged from 69.4100 to 78.898 with the percentage of male subjects varying from 34101 to 66%.100 One study100 included a dementia population who also had hypertension.

Intervention. A dose of 10 mg per day was used in all studies, and treatment interval varied from 3 to 4 months.

Primary outcomes. General cognitive function was evaluated in all studies using the intellectual impairment Gottfries-Bråne-Steen (GBS) subscale and the MMSE was used in one trial.101 Specific cognitive function was evaluated in one trial.100 Quality of life/ADL was evaluated with the ADL subscale of the GBS, and behavior/mood was evaluated with the emotional impairment subscale of the GBS. GBS total score was assumed to be a measure of quality of life/ADL rather than global assessment. None of the trials reported baseline MMSE scores.

Analysis. Two of the studies100, 101 used OC analysis to report outcomes.

Results and interpretation. See Summary Table 7. All four studies evaluated general cognitive function, and three of these trials100, 99, 98 reported significant differences using the intellectual impairment subscale of the GBS as a measure of this attribute. One study100 measured this same outcome and the MMSE, but did not report results for the latter outcome. One study101 measured reported changes within a treatment relative to baseline and not relative to placebo; this study demonstrated superiority for posatirelin relative to citicoline (a third comparison group), but did not test for differences between the placebo group. Showing non-inferiority of citicoline in this trial does not establish efficacy with respect to placebo. Statistically significant changes were also shown for the domain of quality of life/ADL as measured by the GBS total score or GBS ADL (FactorII) subscales in three of the studies.98, 99, 100 A single trial100 evaluated global assessment using the TP Global scale in VaD subjects. There were not enough similar outcomes reported to complete a pooled analysis for posatirelin.

Quality scores for reporting adverse events varied from 2 to 4. Withdrawal rates due to adverse events ranged from 0 – 3%in placebo and 0 – 4%in the treatment group. One trial100 did not report the rate of withdrawal. None of the studies tested for significant differences between groups. All studies reported the presence of agitation, and at least three studies reported arrhythmia, nausea/vomiting, headache, rash/skin disorder, and sleep disorder; there is no evidence to suggest that these differed between placebo and treatment group. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 3%, all doses = 1 – 4%), 2) dizziness (placebo = not reported, all doses = 1%), 3) diarrhea (placebo = 2%, all doses = 2%), 4) agitation (placebo = 1 – 5%, all doses = 1 – 5%), and none reported eating disorder as an adverse event. No serious adverse events were reported.

Rivastigmine. See Evidence Tables 60 through 67 at http://www.ahrq.gov/clinic/epcindex.htm.

Number of studies. Six studies were eligible for this review, all comparing rivastigmine to placebo. Studies were published in 1998,102, 103 1999,104, 105 2000,106 and 2001.107

Design/methodology. Six studies evaluated 2071 subjects in total, with studies ranging from 27107 to 725104 subjects. The quality of studies varied from 5 to 8, with three studies106, 104, 103 scoring 8 points and one107 earning 5 points. All studies were funded by industry sponsors.

Populations. Four studies were evaluated in AD patients,107, 105, 104, 103 one trial102 dementia of the Alzheimer's type (DAT), and one study106 Lewy body dementia subjects for mild to moderately severe subjects. One study103 reported a subgroup analysis by vascular risk. One study reported a community sample in their trial.104 Mean age for the studies ranged from 69.4102 to 75.9 years.107 Two studies107, 105 did not report the ratio of male subjects in their study, and the other four varied from 39 – 56%. One trial104 reported co-morbidity of diabetes, hypertension and arthritis; one trial103 reported concurrent medication use for cardiovascular, gastrointestinal and analgesic aids.

Intervention. Doses for rivastigmine varied from 1 mg103 to 12 mg,106, 105, 104 and treatment duration varied from 14104 to 26107, 103 weeks. All studies titrated the dose of drug over a period ranging from 2 weeks106 to 12 weeks.107

Primary outcomes. The ADAS-cog and CIBIC+ were evaluated in half the studies. Baseline MMSE was reported in two trials,103, 106 and the mean scores varied from 18 to 20. Specific cognitive function, behavior/mood, and quality of life/ADL were infrequently evaluated, and caregiver burden was not evaluated in the trials.

Analysis. Trials were evenly divided between ITT analyses106, 104, 103 and OC.107, 105, 102

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-phartrdemf17.jpg.

   Figure 17. Weighted Mean Difference (WMD) from the Random Effects Model (Random) for the ADAS-cog comparing rivastigmine versus placebo

Results and interpretation. See Summary Table 8. Although, general cognitive function was evaluated in five studies, only four reported findings, which were all statistically significant; one of these was borderline significant (p = 0.054) (see Summary Table 8). Two of these studies104, 103 represented two sites (North American and European) of the same protocol. Although the same protocol was used, one study103 found significance for both high (6 – 12 mg) and lower (1 – 4 mg) dosages, but the other trial did not show significance for the lower dose, likely due to lack of power for this outcome (PW = 0.67). Interestingly, at the 12-week midpoint, the low dose groups in both these studies appeared to be worse than placebo for all primary outcome measures, but then migrated to improvement at the 26-week endpoint. For those studies105, 104, 103 that reported ADAS-cog change scores from baseline for the treatment group, mean change values varied from -2.75 to 0.26. Figure 17 shows the pooled estimate for those trials that provided sufficient data and represents the 12 mg dose. However, the test for homogeneity was significant suggesting that the pooled estimate should be interpreted with caution (albeit a significant overall effect). For those studies that reported MMSE change scores from baseline for the treatment group, mean values varied from 0.0 to 0.6; a single trial103 showed a decline of 7.9 points relative to baseline for the placebo after 26 weeks (other trials reporting MMSE scores did not report such marked change).

Two trials evaluated specific cognitive tests; one trial106 showed statistical significance for the CCASSS but not for the MMSE (general cognitive test). Similarly, another trial105 found the Weschler Logical memory (instant recall) to be significant but the ADAS-cog (general cognitive outcome) was borderline significant.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-phartrdemf18.jpg.

   Figure 18. Relative Risk (RR) from the Random Effects Model (Random) for the CIBIC+ comparing rivastigmine versus placebo

With respect to global changes, five of six studies showed significant changes, and from these, three studies104, 103, 102 were for the high dose only. One of these studies102 defined the high dose as 6 mg per day, which was the minimum dose level for the other two studies. One study107 showed a statistical difference for the two deterioration categories (5 – 7) in the CIBIC+, but not in the improvement categories when comparing treatment and placebo groups. Figure 18 shows the pooled estimate for the CIBIC+ in those studies that provided sufficient information. A consistent effect favoring treatment is shown, but the two smaller trials display large confidence intervals (Figure 18). There was no clear trend in the domains of behavior/mood and quality of life/ADL as not all studies evaluated these domains.

Quality scores for reporting adverse events varied from 2 to 5. Withdrawal rates due to adverse events ranged from 4 – 11% in the placebo and 11 – 27% in the treatment group. One trial107 did not report the withdrawal rates or the types of adverse events observed. Two trials103, 104 demonstrated a dose response; however, one of these trials104 showed significant differences for nausea and vomiting only, and the other trial103 showed significant for all the adverse events reported. With respect to the types of adverse events, the majority of studies reported dizziness, nausea and vomiting, eating disorder/weight loss, and headache. It should be noted that one study105 allowed intentional prescribed anti-emetic drugs to increase the tolerance of subjects taking rivastigmine. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 3 – 10%, all doses = 8 – 58%), 2) dizziness (placebo = 0 – 7%, all doses = 6 – 20 %), 3) diarrhea (placebo = 2 – 9%, all doses = 7 – 17%), 4) eating disorder (placebo = 4 – 8%, all doses = 4 – 19%), and 5) agitation was not reported. No serious adverse events were reported.

Tacrine. See Evidence Tables 68 through 77 at http://www.ahrq.gov/clinic/epcindex.htm.

Number of studies. Eight studies evaluating tacrine were eligible for this review; tacrine was compared to placebo in six trials108, 109, 110, 111, 112, 113 (lecithin was assumed to be like placebo) and to other drugs in two trials.114, 26 One study114 compared two arms with tacrine, one with silymarin added and one placebo arm. The other non-placebo controlled trial compared tacrine with idebenone.26 These drug versus drug trials were published in 1999 and 2002. The placebo controlled trials were published 1991,111 1994,108, 109, 113 1996,110 and 1999.112

Design/methodology. The placebo studies evaluated 994 patients in total with a range of 13111 to 663108 subjects per study. Quality scores out of 8 points were evenly distributed with two studies each having scores of 5, 6 and 7. Both drug versus drug trials had scores of 7. One study109 was not funded by industry, and one trial114 did not report its source of funding; the other six studies had at least partial, if not full, industry support.

Populations. All studies included AD subjects; one trial26 also included PDD patients. Subjects in all studies had mild to moderate or probable disease. Five of the studies113, 109, 111, 114, 26 reported that their sample was from the community. The mean age ranged from 68110 to 75 years,113 and the percentage of male subjects in the studies varied from 13110 – 54%.113 One study reported race with 100% white subjects.26

Intervention. See Summary Table 9. All placebo-controlled trials used a titration period to get to maximum dose, from 11 days113 to 18 weeks.112, 108 Treatment doses varied from 80 mg per day110 to 160 mg per day.108 Treatment duration was either 12/13 weeks,110, 113, 111 or 30/36 weeks112, 108, 109 for all placebo-controlled studies. The trial versus Idebenone26 was for 60 weeks and the trial with silymarin114 was for 15 weeks.

Primary outcomes. All six of our identified domains were evaluated by at least one trial. All trials measured cognition; however, sufficient data to permit pooled analyses could not be adequately abstracted from all these studies that had similar outcomes. Baseline MMSE varied from 14 to 18 in the fours trials112, 110, 109, 114 that reported this.

Analysis. Five studies108, 112, 113, 114, 26 reported ITT analysis and three did not.109, 111, 110

Results and interpretation. Of the six placebo-controlled studies, only one trial108 showed statistical significance for general cognitive function as measured by the ADAS-cog (ES = -0.268). Three doses (80 mg,120 mg,160 mg) were compared in this trial, and the 120 and 160 mg per day were shown to be statistically significant (approximately a mean change of 2 points on the ADAS-cog). One trial112 showed mixed results for the two outcomes used (CASI and MMSE) to evaluate general cognitive function; this trial was underpowered for both these outcomes (PW = 0.22 and 0.26, respectively). Three trials109, 110, 111 found no statistical differences between treatment and placebo but had small sample sizes, ranging from 12 to 32 subjects, and were likely underpowered (insufficient reporting to estimate power). A fourth trial113 also found no statistical difference (p = 0.55) for general cognitive function, but the study duration was 12 weeks. It should be noted that this study used an 80 mg dose, which was shown to have no benefit relative to higher doses of 120 and 160 mg.108 A single trial109 of small sample size evaluated specific cognitive tests and did not show statistical differences.

Three studies evaluated global assessment, and two 113, 108 found statistical significance; the trial showing no benefit112 also showed inconclusive findings for general cognitive function as well (PW = 0.05 for the CGIC). Four trials108, 109, 110, 113 evaluated behavior/mood and showed no difference between groups. Two trials109, 111 with small sample sizes measured quality of life/ADL and showed no significant changes; lack of sufficient power cannot be ruled out. There were not enough similar outcomes reported to complete a pooled analysis for tacrine.

The quality scores for reporting adverse events varied from 1 to 3. The proportion of subjects withdrawing due to adverse events ranged from 0 – 12% for placebo and 0 – 55% in the treatment group. The higher rates of withdrawal were associated with higher doses. Elevated alanine transaminase (ALT) or hepatic abnormality (placebo = 4 – 13%, all doses = 7 – 67%) was reported in six studies, suggesting the potential for serious liver damage. None of these trials tested for differences between treatment and placebo with respect to adverse events. Five of the studies reported nausea and vomiting (placebo = 0 – 9%, all doses = 9 – 37%); gastrointestinal problems and dizziness (placebo = 0 – 16%, all doses = 4 – 14%) was also noted in several studies. Frequencies of other a priori symptoms of interest are as follows: 1) agitation (placebo = 5 – 12%, all doses = 5 – 9%), and 2) diarrhea (placebo = 0 – 13%, all doses = 4 – 18%).

Velnacrine. See Evidence Tables 78 through 82 at http://www.ahrq.gov/clinic/epcindex.htm.

Number of studies. Three studies evaluated velnacrine versus placebo, and these were published in 1991,115 1995,116 and 1996.117

Design/methodology. A total of 774 subjects were studied with sample sizes ranging from 16115 to 449.116 Quality scores out of a possible 8 points varied from scores of 6117, 115 or 7.116 All studies were sponsored by industry.

Populations. The characteristics of the populations all included probable AD subjects. The mean age of the participants ranged from 70.5 to 72.8 years and the percentage of male subjects ranged from 31 – 41%. Location of recruitment was not specified.

Intervention. The doses given for this drug overlapped between the studies, but they were on different schedules (once, twice, or three times per day). None of the studies had a titration period. One study117 compared four doses (three daily doses of 10 mg, 25 mg, 50 mg, and 75 mg). One study116 compared doses of 150 mg or 225 mg per day. The other study used a dose of 100 mg twice daily and had the smallest sample size.

Primary outcomes. General cognitive function was evaluated in all studies with the ADAS-cog, and none specified baseline MMSE values. A variety of outcomes were used to evaluate global assessment. At least one of these trials evaluated the other outcomes domains

Analysis. Only one of the trials116 used an ITT analysis.

Results and interpretation. See Summary Table 10. One115 trial was of very small sample size (n = 16) and of a 2 weeks duration, compared to the other two studies117, 116 with 15 or 24 weeks. Similarly, this trial evaluated two outcomes (specific cognitive function and global assessment) and showed mixed or non-significant results, likely a function of being underpowered. The two remaining studies117, 116 had sample sizes over 300 subjects and showed statistical significance for the domain of general cognitive function using the ADAS-cog. The magnitude of the change reported varied from -2.0 at 12 weeks and then -1.0 116 at 24 weeks for the 225 mg dose group only; a mean change of 2.15117 for the 75 mg (three times daily) as observed at the study endpoint of 15 weeks (no other dosage group was reported for this study). The trial116 evaluating doses of 150 and 225 given once daily showed significant changes for 225 mg per day but not for 150 mg per day at endpoint (24 weeks), whereas, the trial with 75 mg twice daily did show significant change for general cognitive function.

All studies included assessment of global functioning, for which two117, 116 found significant differences, and one115 had mixed results. Behavior/mood was evaluated in only one study117 as a secondary outcome with an OC analysis, and no significant effect was found. Similarly, quality of life/ADL was measured in two studies as secondary outcomes, which produced opposite results (not significant117 and significant116). One of the trials116 measured effects on caregiver burden as a secondary outcome and found a significant effect. There were not enough similar outcomes reported to complete a pooled analysis for velnacrine.

Quality scores for reporting adverse events were 3 for all studies. Withdrawal rates varied from 0 – 22% for the placebo group and 5 – 33% for the treatment group. None of the studies reported a dose response. None of the studies tested for statistical differences between the placebo and treatment groups. Two studies reported aberrant hematology and hepatic abnormality116, 117; for these two studies the rate of occurrence were 2 – 21% for placebo, and 32 – 40% for all doses. All studies reported diarrhea and nausea and vomiting. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 0 – 4%, all doses = 3 – 8%), 2) dizziness (placebo = 3%, all doses = 0 – 8%), 3) diarrhea (placebo = 3%, all doses = 2 – 33%), 4) agitation (placebo = 4%, all doses = 1 – 4%), and 5) eating disorder (placebo = 1%, all doses 2 – 4%).

Various cholinergic neurotransmitter modifying agents. See Evidence Tables 83 through 93 at http://www.ahrq.gov/clinic/epcindex.htm. See Summary Table 11.

The remaining agents classified as cholinergic neurotransmitter modifying agents were grouped according to the number of studies for the purposes of presentation:

Cholinergic pharmacological agents that had two trials eligible for this review and were compared to placebo.

Eptastigmine (Evidence Tables 83, 84, 85, 93). Two trials118, 119 evaluated eptastigmine in patients with mild to moderate AD (103 patients for 4 weeks and in 491 patients for 24 weeks). Both trials were industry-funded. The trial that used an ITT analysis118 and had the longer duration (24 weeks) showed significant change in the three domains: general cognitive function, global assessment, and quality of life/ADL. The OC analysis119 of the patients treated for 4 weeks showed no significant effect. One trial used the ADAS-cog as a primary outcome118 and showed a small increase of 1.05 and 0.41 for the 15 and 20 mg thrice daily doses, respectively, relative to the placebo group (which increased by 2.6 points). The CIBIC+ was significant for the higher dose group only in this same trial. The evidence of benefit for eptastigmine remains inconclusive given the lack of consistency between studies.

Linopirdine (Evidence Tables 83, 87, 88, 93). Two 1997 trials120, 121 evaluated linopirdine in patients with mild to moderate AD patients for 4 or 6 weeks at 40 or 30 mg thrice daily. Both were at least partially industry-funded. One trial120 included 382 patients on 30 mg dose during a 6 month trial and used an ITT analysis; this study showed statistically significant findings for general cognitive function alone as measured with the ADAS-cog (mean change 2.0 points); global assessment, quality of life/ADL, and behavior/mood were not significant. All outcomes evaluated in the second trial121 were not significant, even though OC analysis was used.

Cholinergic pharmacological agents with only one trial eligible for this systematic review.

Huperzine-A (Evidence Tables 83, 92, 93). This study122 showed a statistically significant benefit relative to placebo in an OC analysis of all domains that were evaluated: general cognitive function, behavior/mood, and quality of life/ADL. The study population was 103 Asian patients with mild to moderate AD, who were treated for 8 weeks.

Sabeluzole (Evidence Tables 83, 89, 93). This study123 included 39 patients with mild to moderate AD and lasted 48 weeks. General cognitive function as measured by the ADAS-cog showed approximately a 5 point increase compared to a 7 point increase for placebo. The OC analysis showed no significant difference from placebo in general cognition.

Results of non-cholinergic neurotransmitter/neuropeptide modifying agents (NCNMA)

Table 3. List of Non-cholinergic neurotransmitter/neuropeptide modifying agents and the number of studies vs. placebo for each of these. Asterisk (*) indicates report of a drug vs. drug trial [comparator drug(s) in brackets]
DrugNumber of studies vs. placebo
Alaproclate1
Alprazolam *[Lorazepam]0*
Anapsos1
BMY (Nootropic)1
Carbamazepine2
Citalopram *[Mianserin] *[Perphenazine]2**
Diphenhydramine *[Haloperidol, Oxazepam]0*
Divalproex2
Fluoxetine *[Haloperidol] *[Amitriptyline]2**
Fluvoxamine1
Haloperidol **[Risperidone] *[Loxapine] *[ Diphenhydramine Oxazepam ] *[Fluoxetine] *[ Tiapride] *[ Trazodone]4*******
Imipramine *[Paroxetine]1*
Lisuride1
Lorazepam *[Alprazolam] *[Olanzapine]1**
Loxapine *[Haloperidol] *[Thioridazine]1**
Lu25-1091
Maprotiline1
Melperone *[Tiapride]0*
Memantine3
Mianserin *[Citalopram]0*
Minaprine1
Moclobemide1
Naftidrofuryl1
Olanzapine *[Lorazepam]2*
Oxazepam *[Diphenhydramine Haloperidol]0*
Paroxetine *[Imipramine]0*
Perphenazine *[Citalopram]1*
Phosphatidylserine2
Risperidone **[Haloperidol]1**
Selegiline *[Vitamin E]7*
Sertraline2
Thioridazine *[Loxapine]1*
Tiapride *[Haloperidol] *[Melperone]1**
Trazodone *[Haloperidol] *[5′-MTHF]1**
Xanomeline1
A total of 35 drugs in 50 studies were classified as non-cholinergic neurotransmitter/neuropeptide modifying agents. These pharmacological agents can be seen in Table 3. Sixteen of these studies involved direct comparisons to other drugs and these are considered separately in the section addressing Question Three. Overall results for each of the trials each intervention are detailed in OST located at the end of this chapter and organized by drug. All other study details are available in Evidence Tables 95 through 161 in the Appendices.

Haloperidol. See Evidence Tables 94 through 103 at http://www.ahrq.gov/clinic/epcindex.htm.

Number of studies. Five studies124, 125, 126, 127, 128 evaluating haloperidol relative to placebo as well as another drug were included in this review. Three additional trials129, 130, 131 (from 1990, 1993, and 2001) compared haloperidol to another drug and did not include a placebo group (these are detailed in question 3). One study was published in each of 1982, 1997, and 1999; two were from 2000.

Design/methodology. Sample sizes for the placebo-controlled studies were generally small with samples of 15,128 64,126 149,125 344,127 and 306124 for an overall total sample size of 622 subjects. All but one of the studies had a quality score of 6 out of a possible 8 points; the other study127 had a score of 7 points. One study124 did not indicate a funding source, three studies125, 126, 127 indicated some industry funding, although none showed total industry funding, and one study128 had no industry funding.

Populations. Populations evaluated in the studies included three with only mild to moderate or probable AD,124, 125, 128 one with PDD and MID,126 and one with PDD, VaD, and mixed dementia127 (which reported subgroup information about VaD versus all subjects). Two placebo studies126 reported the presence of subjects with severe disease. Two trials126, 127 studied institutionalized patients while one128 looked at community subjects. Ages in the studies ranged from a mean of 72.7 to 81.0 years, and 33 – 49% of subjects were male.

Intervention. Haloperidol doses ranged from 3 mg to 20 mg per day for a treatment period of 3 weeks,124 6 weeks,128 10 weeks,126 12 weeks127 or 16 weeks.125 The other drugs that haloperidol was compared to included fluoxetine,128 loxapine,126 risperidone,127 tiapride,124 trazodone & BMT125 in the placebo controlled studies; loxapine,129 risperidone,131 oxazepam & diphenhydramine130 were evaluated in the head to head comparisons.

Primary outcomes. All studies evaluated behavioral outcomes, and at least one study evaluated the effect of haloperidol in each of the other domains included in this review with the exception of specific cognitive function. None of the studies reported baseline MMSE values.

Analysis. Two studies124, 127 reported ITT analysis and three128, 126, 125 did not.

Results and interpretation. See Summary Table 12. Of the five studies that had a placebo group, only three trials evaluated general cognitive function. One trial127 did not report the results for this domain and two showed no significant difference.124, 125 Three trials124, 126, 127 found statistical differences for outcomes in the behavior/mood domain, and two trials128, 125 showed no change. One of these non-significant trials128 evaluating behavior had a very small sample size (n = 12) and was likely underpowered. Four trials evaluated global function, and the two studies124, 125 that reported findings based on the CGIC and CGI showed both improvement and no benefit, suggesting inconsistent evidence for this domain; it should be noted that one of these trials lasted for only 3 weeks.124 One trial evaluated quality of life using the IADL and showed statistical difference in favor of the placebo. Two trials125, 128 evaluated caregiver burden and showed no effect; one of these studies128 had very small sample size and was likely underpowered. There were not enough similar outcomes reported to complete a pooled analysis for haloperidol.

The quality scores for reporting adverse events varied from 1 to 5. Only three of five studies reported withdrawal rates; the proportion of subjects withdrawing due to adverse events ranged from 5 – 17% for placebo and 17 – 33% in the treatment group. One trial124 showed a dose response effect, but the study only lasted for 3 weeks. Three trials tested for differences between treatment and placebo with respect to extrapyramidal symptoms (placebo = 17 – 32%, all doses = 34 – 97%), and two124, 125 found significant differences. One study125 found significant differences between groups for balance-related problems. Although reported by only two trials, the range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 3%, all doses = not reported), and 2) dizziness (placebo = 24%, all doses = 21%), 3) no frequencies were reported for agitation, diarrhea, or eating disorder.

Memantine. See Evidence Tables 104 through 108 at http://www.ahrq.gov/clinic/epcindex.htm.

Number of studies. Three studies comparing memantine to placebo were eligible for review. One study was published in 1999132 and the other two133, 134 in 2002.

Design/methodology. Sample sizes ranged from 166 to 579 for a total population evaluated of 1066 subjects. Two studies132, 133 earned 6 points out of 8 for the quality score while the other134 earned 7 points. One report132 did not indicate the source of funding, and the other two had industry support or funding.

Populations. Two studies133, 134 included VaD patients only, one of which134 analyzed subgroups based on MMSE, type of VaD, and gender. The other study132 included VaD, DAT and PDD patients and did subgroup analysis comparing VaD to DAT and grouping for care dependence. One trial132 included patients with severe disease and was the only study to report that all of their subjects were institutionalized. One study134 included only community subjects and the other study133 did not report source of patients. Study subjects had a mean age of 71.2,132 76.4,133 and 77.4134 years, and 42 – 53% were male.

Intervention. Two studies133, 134 had a 4 week titration period with a final dose of 20 mg per day for the remaining 24 week study duration. The third study used a 2-week titration period with a final dose of 10 mg per day for the remaining 10 weeks of the study.

Primary outcomes. All studies evaluated global function. The ADAS-cog was evaluated in two studies134, 133 and showed smaller changes of decline relative to placebo by approximately 1.5 points. All studies measured global function with the CGI-C but did not provide variance data to permit the calculation of the pooled estimates. Although, all trials measured MMSE, none reported baseline values for this outcome. Only one trial132 evaluated behavior/mood and quality of life/ADL. No study evaluated specific cognitive function or caregiver burden.

Analysis. All studies performed ITT analysis.

Results and interpretation. See Summary Table 13. Two studies134, 133 in subjects with mild to moderate VaD showed significant findings for general cognitive function but not global assessment. The power could be estimated for one of these trials133 and was found to be below acceptable levels (PW= 0.60). The third memantine trial132 in this review evaluated mixed dementia populations (including some VaD) with moderate to severe dementia and found significant differences for global function, behavior/mood, and quality of life/ADL, but did not evaluate general cognitive function. It should be noted that this trial132 used half the dose of memantine for half the study duration in patients with greater disease severity, and had approximately half the sample size of the other two trials evaluated in this systematic review. There were not enough similar outcomes reported to complete a pooled analysis for Memantine.

The quality scores for reporting adverse events varied from 3 to 4. Only two of three studies reported withdrawal rates; the proportion of subjects withdrawing due to adverse events ranged from 7 – 13% for placebo and 9 – 12% in the treatment group. One trial133 tested for differences between treatment and placebo, and none of the comparisons were significant. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 3%, all does = 5%), 2) dizziness (placebo = 3 – 8%, all doses = 6 – 11%), 3) diarrhea (placebo = 4%, all doses = 4%), 4) agitation (placebo = 7 – 8%, all doses = 4 – 5%), and none reported eating disorder as an adverse event.

Selegiline. See Evidence Tables 109 through 116 at http://www.ahrq.gov/clinic/epcindex.htm.

Number of studies. Six studies evaluated the effect of selegiline compared to placebo.135, 136, 137, 138, 139, 140 A single study135 compared selegiline to vitamin E, placebo and the combination of selegiline plus vitamin E . The studies were published in 1991,136 1992,137 1993,138 1997,135 1998,140 and 1999.139

Design/methodology. Sample sizes ranged from 10137 to 341135 with a total population evaluated of 733 subjects. Study quality scores were 5,140, 135, 137 6,139, 138 and 7.136 Three trials139, 137, 136 did not report the source of funding, and the other three140, 135, 138 had some industry support.

Populations. Studies included patients with mild to moderate PDD, DAT, and AD. Two subgroup analyses based on the results of the clock drawing test139 and the GDS result136 were reported. One study reported that the included patients were institutionalized.139 Mean age of the subjects in the trials ranged from 68.6 to 83.0 years and all had male subjects (29 – 74%).

Intervention. All trials used the same dose, 10 mg per day, with three of the trials135, 138, 137 giving the drug in two 5 mg doses. One trial140 reported a titration period of 7 days. The duration of the trials varied with treatment times of 2 months,137 3 months,136 6 months,139, 140 15 months,138 and 24 months.135

Primary outcomes. Quality of life/ADL and caregiver burden were not evaluated in any of the studies.

Analysis. Two studies135, 140 carried out ITT analysis.

Results and interpretation. See Summary Table 14. Five of the six trials evaluated general cognitive function, and of these, only four reported their findings. Two of the trials138, 140 showed non-significant findings, but these had very small sample sizes (10 and 41 subjects) and were likely underpowered. Two trials137, 139 showed mixed results, and one of these was likely underpowered.137 One trial136 found significant changes for specific cognitive tests (Sternberg Memory tests). Similarly, this same trial showed significant differences for global assessment and behavior/mood. This is the only trial that showed consistently positive findings across domains tested, and it also had the highest quality score (7). However, the other studies evaluating specific cognitive functions, global assessment, and behavior/mood did not show consistent results (non-significant or mixed findings). There were not enough similar outcomes reported to complete a pooled analysis for selegiline.

There is some evidence that shows that selegiline and selegiline combined with vitamin E, increases the time to important functional decline milestones135 using time to event in the survival analysis. The results of this study showed that the vitamin E, selegiline, and combined groups were statistically different (i.e., declined less) from the placebo group in analyses that included baseline MMSE score as a covariate (not significant when excluded). The median survival was 230 days (vitamin E), 215 days (selegiline), and 145 days (combined group). Moreover, the vitamin E group showed a statistically significant difference for the endpoint of institutionalization, and the other treatment groups did not. Thus, the findings of this study suggest that selegiline and vitamin E may delay clinically important deterioration in patients with moderately severe AD; this delay varied from 20 to 32 weeks. It should be noted that this study evaluated subjects over a 2 year period, the longest of any dementia trial; moreover, the population was moderate to severe with respect to severity.

The quality scores for reporting adverse events varied from 0 to 3. The proportion of subjects withdrawing due to adverse events ranged from 0 – 4% for placebo and 0 – 9% in the treatment group. Two trials137, 138 did not report any adverse events. Only one trial135 tested for differences between the treatment and placebo groups and showed that balance (worse) and falls were significantly different between groups (particularly the group with selegiline combined with vitamin E (22%) versus placebo (5%)). However, when adjusted for multiple comparisons, these were no longer significant. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 2%, all doses = 0%), 2) dizziness (placebo = 2 – 20%, all doses = 0 – 30%), and 3) agitation (placebo = 4 – 16%, all doses = 4 – 23%); no trial diarrhea or reported eating disorder as an adverse event.

Various non-cholinergic neurotransmitter/neuropeptide modifying agents. See Evidence Tables 117 through 160 at http://www.ahrq.gov/clinic/epcindex.htm. See Summary Table 15.

Ten non-cholinergic neurotransmitter neuropeptide modifying agents versus placebo were studied in only two included trials:

Anapsos (Evidence Tables 117, 119, 120, 160). Anapsos versus placebo was reported for a total of 114 patients with AD or VaD in reports published 1993141 and 2000.142 Both studies were partially funded by industry, and varied in the drug dose and duration; one trial142 used 360 mg per day or 720 mg per day for 4 weeks, and the other trial141 used 300 mg three times a day for 12 weeks. They each reported only one domain, and showed a significant change for general cognitive function142 and no significant results for global assessment.141

Carbamazepine (Evidence Tables 117, 121, 122, 160). Two trials 143, 144 evaluated carbamazepine in a total of 72 patients. The 1998 study144 included a mixed severity population of institutionalized patients with non-industry-funding but also some financial support from industry. Both studies titrated up from 100 mg per day to 300 mg per day for 6 weeks. They evaluated all domains except caregiver burden. The trial using OC143 population showed no significant effect for all outcomes tested but was likely underpowered (n = 16). The trial using ITT144 showed a significant change in global assessment and behavior/mood. The evidence for benefit remains inconclusive given the lack of consistency between trials.

Citalopram (Evidence Tables 117, 123, 124, 160). Citalopram was evaluated in a total of 183 patients with mixed dementias including AD, VaD, mixed, MID, PDD. One trial145 was non-industry-funded and the other146 did not report funding source. Treatment was 20 mg per day for two weeks in both trials, with one continuing for 2 more weeks with 30 mg per day. One trial146 measured the global effect and had mixed results. Both studies measured behavior/mood: one145 showing significant change and the other146 showing no significant change.

Divalproex sodium (Evidence Tables 117, 125, 126, 160). Divalproex sodium was evaluated in 229 subjects with mixed populations of VaD and AD who were treated for 6 weeks with increasing dosages until 20 mg per kg daily147 or until side effects appeared.148 These trials were both industry-supported or funded and included 56 or 173 institutionalized patients with probable or possible disease. Both trials showed no significant change in cognition and behavior/mood, while only one study148 measured quality of life/ADL and found no significant difference. Both trials did a global assessment; one study found no significant difference,148 and the other147 found a significant change in favor of placebo.

Fluoxetine (Evidence Tables 117, 140, 141, 160). Fluoxetine was studied in a total of 56 AD patients using 3 or 20 mg per day128 or a titration from 10 to 40 mg per day149 for 6 weeks. All patients included in one study149 also had major or minor depression. One study128 was not industry-funded, and the other149 did not indicate the funding source. Overall, the two studies evaluated general cognition, behavior/mood, quality of life/ADL, and caregiver burden; no significant differences between the drug and placebo were found.

Loxapine (Evidence Tables 117, 147, 159, 160). Loxapine was evaluated in two trials150, 126 from 1982 and included a total of 124 patients with MID and PDD. One trial126 reported moderate to severe disease. The mean age in the other trial150 was 83.0 years compared to 72.7 years in the trial with severe patients. Both studies were partially funded by industry and lasted 8 or 10 weeks. Only two domains were evaluated: global assessment and behavior /mood. No significant difference was shown in one trial,150 while the other trial126 showed a significant difference for behavior/mood.

Olanzapine (Evidence Tables 117, 135, 146, 160). Olanzapine was evaluated by two industry-funded trials in a total of 478 institutionalized patients with AD, VaD, and mixed dementia. One study151 used 10 or 15 mg per day for 6 weeks and the other152 used 12.5 mg maximum for one day. Both studies showed no significant change in general cognition. Both showed a significant change in measures of behavior/mood. One study152 evaluated global assessment and found no significant differences.

Phosphatidylserine (Evidence Tables 117, 136, 137, 160). Two industry-funded trials studied a total of 193 patients with AD or PDD. One study153 included institutionalized patients with mild to severe AD and a mean age of 62.1 years, and the other154 included community patients with mild to moderate AD or PDD and a mean age of 71.0 years. Both studies did subgroup analysis based on severity of illness. The study of institutionalized patients153 found significant change in the domain of general cognition and global assessment. The study with community patients found significant change in a global assessment but no significant change in a measure of quality of life/ADL.

Risperidone (Evidence Tables 117, 142, 144, 160). Two studies evaluated risperidone for 12 weeks in 625 AD, VaD, or mixed dementia patients with moderate to severe disease155 and in 344 PDD, VaD, or mixed dementia patients with severe disease.127 The studies were industry-funded or supported, and both did subgroup analysis: one by disease and the other by gender, age, race, and diagnosis. Both trials showed a significant change in a global assessment. One study155 found a significant change in behavior/mood, and the other study127 had mixed results for that domain. There was no significant change in cognition or quality of life/ADL according to one of the trials.127

Sertraline (Evidence Tables 117, 138, 139, 160). Sertraline was evaluated in two studies: one trial156 for 8 weeks in 31 late-stage institutionalized AD patients with major depression (mean age 89.0 years), and the other trial157 for 13 weeks in a community sample of 22 patients with mild to moderate AD and depression (mean age 77.0 years). Both studies found no significant differences in cognition. The trial156 in subjects with severe disease found no significant difference in behavior/mood; the second trial157 had mixed results for this same domain. The study in patients with mild to moderate disease showed significant change for a global assessment and no significance for quality of life/ADL.

Non-cholinergic neurotransmitter/neuropeptide modifying interventions (NCNMA). Fifteen drugs in this drug grouping were compared to placebo in only one included trial. Eight of these trials showed a significant difference from placebo (See Evidence Tables 117, 120, 128, 129, 133, 134, 143, 145, 149, 160 at http://www.ahrq.gov/clinic/epcindex.htm): Alaproclate158 for 4 weeks in 43 institutionalized patients with mild to severe PDD, MID and mixed dementia was better for quality of life/ADL. Imipramine159 for 8 weeks in a community sample of 61 PDD and AD patients with depression was better for global assessment. Lisuride160 for 8 weeks in 22 patients with mild to moderately severe AD was better for cognition. Minaprine161 for 12 weeks in an institutionalized sample of MID or SDAT patients showed mixed results for behavior. Moclobemide162 for 6 weeks in 511 patients with mild to moderate AD who were from both the community and institutions was better for cognition and behavior/mood. Naftidrofuryl163 for 6 months in 378 patients with mild to severe VaD or mixed dementia was better for cognition and global assessment. Tiapride124 for 3 weeks in 306 institutionalized AD patients with aggressiveness or irritability was better for behavior/mood. Trazodone125 for 16 weeks in 149 AD patients from the community was better for quality of life/ADL. Xanomeline164 in 343 community AD patients for 6 months was better for cognition, global assessment, and quality of life/ADL.

Seven trials (See Evidence Tables 119, 127, 130, 131, 132, 146, 147, 148, 160 at http://www.ahrq.gov/clinic/epcindex.htm) found no significant differences from placebo when evaluating perphenazine,145 thoridazine,150 fluvoxamine,165 lorazepam,152 LU25,166 maprotiline,167 and minaprine161.

Results of other agents

Table 4. List of Other pharmacological agents and the number of studies vs. placebo for each of these. Asterisk (*) indicates report of a drug vs. drug trial [comparator drug(s) in brackets]
DrugNumber of studies vs. placebo
Aniracetam1
5′-MTHF *[Trazodone]0*
Amitriptyline *[Fluoxetine]0*
Ateroid1
Buflomedil1
Cerebrolysin6
Choro-San1
Choto-San1
Citicoline *[Posatirelin] *[Sulphomucopolysaccharides]0**
Cyclandelate2
Denbufylline1
Desferrioxamine1
Diclofenac1
Ergokryptine (CMB 36-733)1
Ergokryptine (Dek)1
Estrogens5
Ginkgo Biloba3
Glycosaminoglycan Polysulfate1
Guanfacine1
Hydergine *[Pyritinol]1*
Hydroxychloroquine1
Idebenone *[Tacrine]4*
Indomethacin1
Misoprostol *[Diclofenac]0*
Monosialotetrahexosylganglioside (GM-1)1
N-Acetylcysteine1
Nimesulide1
Nimodipine2
Nizatidine1
Nootropic1
ORG 27662
Oxiracetam5
Pentoxifylline *[Sulodexide]3*
Piracetam1
Prednisone1
Propentofylline4
Pyritinol *[Hydergine]0*
Silymarin + Tacrine *[Placebo + Tacrine]0*
Simvastatin1
Sulphomucopolysaccharides *[Citicoline]0*
Sulodexide *[Pentoxifylline]0*
Thiamine1
Vasopressin (DDAVP)1
Vincamine1
Vitamin E *[Donepezil] *[Selegiline]1**
Xantinolnicotinate1
A total of 72 studies representing 46 different other agents were eligible for this review and these can be seen in Table 4. Twenty-two of these interventions were evaluated in a single trial and only briefly summarized in this chapter; greater detail is provided in Evidence Tables 161 through 249.

Cerebrolysin. See Evidence Tables 161 through 168 at http://www.ahrq.gov/clinic/epcindex.htm.

Number of studies. Six included studies 168, 169, 170, 171, 172, 173 compared cerebrolysin to placebo. One report was from 1994,172 one from 1999,171 two from 2000,169, 170 one from 2001,168 and one from 2002.173

Design/methodology. The sample size in the studies ranged from 53169 to 192173 with a total of 819 subjects. The quality of studies varied from scoring 6171 to 8168, 173 points out of a possible 8 points. One study172 did not indicate the source of funding, one trial had non-industry funding,173 and the four remaining trials were funded by industry.168, 169, 170, 171

Populations. All but one of the six studies included AD patients; one study171 evaluated patients who had mild to moderate VaD. Mean ages of the subjects in the studies ranged from 69.7171 to 74.1 years.173 The proportion of males in the trials varied from 34 – 69%, and only one trial173 specified the proportion of Caucasians.

Intervention. All of the studies used the same dose of cerebrolysin, 30 ml per day, for 5 days per week. One trial172 was for 28 days, four studies169, 170, 171 lasted 4 weeks, one trial168 lasted 16 weeks, and one trial173 lasted 24 weeks.

Primary outcomes. Most studies evaluated general cognitive function and three trials168, 169, 173 used the ADAS-cog. Baseline MMSE was reported in a single trial173 with a score of 21. All studies evaluated global function, and at least two studies evaluated one outcome in each of the remaining domains with the exception of caregiver burden.

Analysis. All but one173 of the studies used ITT analysis.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-phartrdemf19.jpg.

   Figure 19. Weighted Mean Difference (WMD) from the Random Effects Model (Random) for the ADAS-cog comparing cerebrolysin versus placebo

Results and interpretation. See Summary Table 16. Four of the five studies that evaluated general cognitive function showed significant differences.169, 168, 170, 171 Figure 19 displays the pooled estimate for those studies for which the appropriate data could be extracted for the ADAS-cog. Although a summary estimate was calculated, the test for heterogeneity was positive, suggesting the estimate should be interpreted with caution. Moreover, the overall estimate was not significant. One study173 showed no significant difference in MMSE or ADAS-cog. This was the only study to report non-industry funding and coincidentally the only study to use OC population analysis. Three studies used specific cognitive measures, two of which172, 171 found significant differences and one of which170 showed mixed results.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-phartrdemf20.jpg.

   Figure 20. Odd Ratio (OR) from the Random Effects Model (Random) for the CGI comparing cerebrolysin versus placebo

All trials evaluated global assessment, and all except one trial171 reported a significant difference. Figure 20 shows the pooled estimate for the CGI. The pooled estimate was calculated, the test for heterogeneity was positive, suggesting the estimate should be interpreted with caution. However, the overall estimate is significant. Three trials reported results for a measure of behavior/mood, one showing significant effects168 and the other two171 showing none. All trials carried out evaluations of quality of life/ADL measures; one did not report the effect,168 one had mixed results170 and the other four showed no significant difference.169, 173, 172, 171 No study measured caregiver burden.

Two169, 172 of the six trials scored 5 out of 5 on our quality scale for rating adverse events, yet they did not report any adverse events. Two studies173, 168 scored 4, and the other two trials scored 3171 and 2.170 All the studies with scores equals to 4 or less tested for statistical differences in adverse events between placebo and treatment groups. Withdrawals due to adverse events were not reported in one study,170 and were 1% in two studies.173, 168 Three studies169, 172, 171 reported no withdrawals. A significant difference between treatment and control group was reported in one study173 for weight change, anxiety, and headache. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 10 – 24%, all doses = 3 – 21%), 2) dizziness (placebo = 0 – 12, all doses = 1 – 8%), and 3) agitation (placebo = 1%, all doses = 0%), and none reported diarrhea or eating disorder as an adverse event.

Estrogens. See Evidence Tables 169 through 175 at http://www.ahrq.gov/clinic/epcindex.htm.

Number of studies. Five studies174, 175, 176, 177, 178 evaluated estrogens for dementia patients: one published in 1999,178 three in 2000,176, 177, 175 and one in 2001.174 None compared estrogens to another drug.

Design/methodology. The number of subjects included in the studies ranged from 15178 to 120 subjects177 with a total of 247 patients. Quality of the studies ranged from 5174 to 8178 points out of a possible 8 points. All studies were partially or fully funded by industry.

Populations. Four of the studies176, 177, 175, 174 included patients with mild to moderate AD, and one study178 included moderate to severe dementia patients who were all institutionalized. Only one of the studies178 included male subjects. Mean age ranged from 71.8175 to 80.0 years174 in the AD studies, and it was 83.8 in the dementia study.178

Intervention. One of the studies with AD patients used 0.10 mg per day174 for 8 weeks, and the others used 1.25 mg per day for 12 weeks,175 16 weeks,176 and 52 weeks.177 The study178 including subjects with severe disease used 2.5 mg per day for 4 weeks.

Primary outcomes. At least one study evaluated each of the included domains with the exception of caregiver burden.

Analysis. Two of the studies177, 175 performed ITT analysis and the other three used OC analysis.

Results and interpretation. See Summary Table 17. Three176, 177, 175 trials evaluated general cognitive function and all showed non-significant findings; two trials176, 177 lacked sufficient power (PW = 0.10, PW = 0.44) for the ADAS-cog. Attempts were made to combine the ADAS-cog, but the random-effects model was positive for heterogeneity and the overall effect was not significant. Two trials174, 177 evaluated specific cognitive function, and only one of these, using the Stroop Color Word Interference Test (SCWIT) measure, showed significant differences.174 Global assessment was undertaken in all trials and found to be not significant in any of these trials. For those trials where power could be estimated,176, 177, 175 there was insufficient power for the CGIC, CDR, and CIBIC+ outcomes. For the outcomes of behavior/mood and quality of life/ADL, none of the trials reported significant differences; power for the outcomes used in the trials could not be estimated. Overall, the evidence that estrogen affected general and specific cognitive function, global assessment, behavior/mood, and quality of life/ADL is inconclusive. There were not enough similar outcomes reported to complete a pooled analysis for estrogens.

One178 of the five trials scored 5 out of 5 on our quality scale for rating adverse events, and surprisingly, this same trial did not report any adverse event. Two trials176, 177 scored 3; one trial175 scored 2, and one174 scored 1. This latter study reported adverse events, but did not test for significant differences between groups. Withdrawal rates due to adverse events ranged from 0 – 5% for placebo and 0 –14% for the treatment group. The most frequently reported adverse event was vaginal bleeding,175, 177, 176 and a single trial175 reported a significant difference between placebo and treatment group for vaginal bleeding. It was not clear from the descriptions provided in the study if they had ascertained whether vaginal bleeding was present prior to the trial commencement. Nausea was the single a priori symptom of interest that was reported and by a single trial; frequencies varied from 0% for the placebo group and 4% for the treatment group.

Ginkgo biloba. See Evidence Tables 176 through 180 at http://www.ahrq.gov/clinic/epcindex.htm.

Number of studies. Three studies179, 180, 181 evaluating Ginkgo biloba were eligible to be included in this review. All of the studies compared the drug to placebo only. One of the studies was reported in 1996181 and two were reported in 1997.179, 180

Design/methodology. The studies included evaluated 20 subjects,180 216 subjects,181 and 327 subjects179 (totaling 563 subjects). Two of the reports179, 181 scored 8 quality points out of a possible 8 points, and the other180 earned 6 points. One study did not indicate the funding source,180 and the other two had industry funding.

Populations. All of the studies included a mix of dementia diagnoses as follows: 1) mild to moderately severe AD and MID,179 2) mild to moderate DAT and PDD180, and 3) mild to moderate DAT and MID181 in community dwelling patients. Two of the studies reported subgroup analysis, one comparing diagnoses and comparing effects based on baseline MMSE score179 and the other based on diagnosis.181 The patients in these trials had mean ages of 64.6,180 69.0,179 and 69.6 years.181

Intervention. Two of the trials gave 240 mg per day for 3 months180 and 6 months,181 and the other trial179 gave 40 mg three times daily for 12 months.

Primary outcomes. None of the studies reported on quality of life/ADL or caregiver burden.

Analysis. All of the trials used an ITT analysis.

Results and interpretation. See Summary Table 18. Two of the three trials evaluated general cognitive function, and only one of these showed significant results.179 Two studies181, 180 showed positive results with specific cognitive function. The results for global assessment are inconsistent as only one trial had positive findings,181 one study had mixed results,179 and one trial180 showed non-significant results. This latter study had a very small sample size and lacked sufficient power for some outcomes. Only one trial181 reported behavior/mood outcomes and found no difference between groups. None of the studies evaluated quality of life/ADL and caregiver burden. There were not enough similar outcomes reported to complete a pooled analysis for ginkgo biloba.

One180 of the three trials scored 5 out of 5 on our quality scale for rating adverse events. One study181 scored 4, and one trial179 scored 3. Two studies181, 180 had no withdrawals due to adverse events, and one trial179 had a withdrawal rate of 6% for both placebo and treatment groups. Two studies179, 180 did not report any adverse event. One study181 reported a statistically significant difference between the treatment and the placebo group for skin disorders. The same study reported gastrointestinal and headache adverse effects, but did not test for statistical differences between the placebo and the treatment group. None of the trials reported any of the a priori symptoms of interest.

Idebenone. See Evidence Tables 181 through 187 at http://www.ahrq.gov/clinic/epcindex.htm.

Number of studies. Four studies182, 183, 184, 185 were included in this review that evaluated idebenone versus placebo, and one study26 compared idebenone to tacrine but not to placebo. The placebo trials were published in 1992,184 1994,182 1997,185 and 1998,183 the tacrine trial was published in 2002 by the same author as a previous placebo trial.183

Design/methodology. Sample sizes in the studies ranged from 92182 to 450 subjects183 with a total of 950 patients in the placebo-controlled studies. The study comparing idebenone with tacrine included 203 subjects, but a large number withdrew, and only 44 completed the trial. One of the trials185 earned 5 points out of a possible 8 points on the quality scale, two of the trials earned 6 points,182, 183 and one earned 7 points.184 None of the placebo studies reported their funding source. The tacrine study earned 7 points on the quality scale and was partially funded by industry.

Populations. The studies included patients with AD, MID, PDD, and DAT. Two of the trials182, 183 reported that the subjects had mild to moderately severe disease and the remainder reported mild to moderate disease. Two of the studies reported subgroup analysis based on disease severity.185, 183 Mean ages in the studies ranged from 69.9183 to 73.6 years.184

Intervention. Dosing schemes were 30 or 90 mg per day for 6 months,185 30 mg three times per day for 3 months,182 45 mg twice daily for 4 months,184 and 120 mg three times per day for 12 months.183 The tacrine trial used 360 mg per day for 14 months.

Primary outcomes. Caregiver burden was the only domain in this review that was not evaluated in at least one of the studies.

Analysis. Two of the studies183, 185 used ITT analysis while the other two used OC analysis. The tacrine trial used ITT analysis.

Results and interpretation. See Summary Table 19. Three trials183, 184, 185 found significant differences for general cognitive function. Two trials185, 183 used the ADAS-cog and reported changes that varied from -4.5 to -4.9 for placebo versus -4.4 to -8.8 for the treatment group. The doses varied in these two trials from 90 to 360 mg per day. A single trial182 evaluated specific cognitive function and showed inconsistent findings. Three trials182, 183, 185 evaluated global assessment and all found significant differences relative to placebo. A single trial185 evaluated behavior/mood and was statistically significant, even though it was a secondary outcome. Two trials183, 184 evaluated quality of life/ADL and were both statistically significant. No study evaluated caregiver burden. These findings suggest some evidence of benefit for general cognitive function, global assessment, and quality of life/ADL. There were not enough similar outcomes reported to complete a pooled analysis for idebenone.

Quality scores for reporting adverse events varied from 1 to 5. Rates of withdrawal due to adverse events varied from 0 – 5% for the placebo group and 0 – 5% in the treatment group; a single trial183 did not report withdrawal rates. Two trials183, 185 tested for statistical differences between groups and found no differences. Although no clear pattern emerges, three studies identified at least one balance-related adverse event across studies. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 2%, all doses = 2 – 11%), 2) dizziness (placebo = not reported, all doses = 2%), and 3) not reported for diarrhea, agitation, or eating disorder as an adverse event.

Oxiracetam. See Evidence Tables 188 through 194 at http://www.ahrq.gov/clinic/epcindex.htm.

Number of studies. Five trials186, 187, 188, 189, 190 included in this review evaluated oxiracetam versus placebo. The studies were published in 1988,188 1989,187 and 1992.190, 189, 186

Design/methodology. A total of 554 patients were included in the studies, ranging from 30 patients188 to 289 patients.187 Four of the studies earned 6 points out of a possible 8 points on the quality scale, and the other study189 earned 4 points. Two of the studies189, 187 did not report the source of their funding, and the other three trials had partial industry funding.

Populations. The trials included a mixture of diagnoses, including AD, PDD, mixed dementia, and MID, and none of the studies reported severe disease. One of the studies187 performed subgroup analysis based on diagnosis, comparing MID to PDD. The mean age of the subjects included in the trials ranged from 62.0188 to 73.8 years.189

Intervention. All of the trials used a dose of 800 mg twice daily, for a duration of 12186, 187 to 26 weeks.189

Primary outcomes. At least one trial evaluating oxiracetam evaluated one of the outcome domains examined in this review with the exception of caregiver burden. A single trial189 reported baseline MMSE at 22 for both placebo and treatment groups.

Analysis. None of the trials used ITT analysis.

Results and interpretation. See Summary Table 20. Three trials187, 190, 189 out of the five studies tested for outcomes on general cognitive function. Only two of these trials187, 190 reported the findings, which were both significant, even though the NMIC and MMSE were used to measure this attribute. Three trials186, 188, 190 evaluated specific cognitive function and showed mixed results. A single large trial187 evaluated global assessment and found significant differences between groups using the Blessed Dementia Scale (Italian version). Three trials187, 188, 190 evaluated behavior/mood with the IPSC-E, and of these, a single trial190 did not show significant differences. One trial189 reported on Beck Depression Inventory (BDI) but did not show statistical comparisons. Similarly, three trials186, 189, 190 evaluated quality of life/ADL, and a single trial189 showed no significant findings. No study evaluated caregiver burden. There were not enough similar outcomes reported to complete a pooled analysis for oxiracetam.

The quality scores for reporting adverse events varied from 2 to 5. The proportion of withdrawals due to adverse events varied form 0 – 9% for the placebo group and 0 – 6% for the treatment group. No clear pattern for adverse events is evident, but three of the five studies reported gastrointestinal related problems, primarily associated with abdominal pain. Although, only single trials evaluated the range of frequencies of the a priori symptoms of interest are as follows: 1) dizziness (placebo = not reported, all doses = 11%), and 2) agitation (placebo = 1%, all doses = not reported); no trial reported nausea, eating disorder, or diarrhea as an adverse event.

Pentoxifylline. See Evidence Tables 195 through 200 at http://www.ahrq.gov/clinic/epcindex.htm.

Number of studies. Three trials191, 192, 193 in this review evaluated pentoxifylline versus placebo. One trial,194 published in 1997, compared pentoxifylline to sulodexide rather than placebo. The placebo trials were published in 1987,193 1992,192 and 1996.191

Design/methodology. The studies included 36 patients,193 64 patients192 and 289 patients.191 The sulodexide trial included 93 patients. All placebo trials had 6 points out of a possible 8 points on the quality scale and had partial or full industry funding. The sulodexide trial earned 5 points on the quality scale and did not report the source of funding.

Populations. The three placebo-controlled trials included patients with mild to moderate MID, and one trial193 also included PDD patients. The sulodexide trial had only patients with mild to moderate VaD. Subgroup analysis was performed in two trials, looking at MID versus PDD diagnosis193 and grouping by vascular change versus discrete stroke.192 The mean age of the studies ranged from 69.7191 to 77.0 years.193

Intervention. All of the studies gave 1200 mg per day of pentoxifylline; one study gave the drug once a day for 9 months,191 one study gave 400 mg three times per day for 9 months,192 and one gave 400 mg three times per day for 3 months.193 The sulodexide study gave the drug once a day for 6 months.

Primary outcomes. At least one trial evaluated one of the outcome domains examined in this review with the exception of caregiver burden.

Analysis. A single191 trial used an ITT analysis.

Results and interpretation. See Summary Table 21. All three placebo trials showed non-significant findings for any primary outcome evaluated on all subjects in the study. It should be noted that two of these trials192, 193 had very small sample sizes (n = 38, n =28) that were evaluated in the OC analyses; this suggests that the trials lacked sufficient power to evaluate multiple outcomes. Knezevic et al.191 had a large sample size (n = 289) and employed an ITT analysis; all primary outcomes evaluated were not significant. The evidence for all outcomes considered in this review are inconclusive for pentoxifylline. There were not enough similar outcomes reported to complete a pooled analysis for pentoxifylline.

The quality scores for reporting adverse events were generally low, varying from 1 to 3. Withdrawal rates due to adverse events varied from 0 – 25% in the placebo group and 0 – 22% in the treatment group. The two studies that reported adverse events indicated the presence of gastrointestinal disturbances, including abdominal pain or nausea and vomiting (placebo = 7% and all doses = 14%). None of the trials reported dizziness, agitation, eating disorder or diarrhea.

Propentofylline. See Evidence Tables 201 through 206 at http://www.ahrq.gov/clinic/epcindex.htm.

Number of studies. Four studies195, 196, 197, 198 in this review evaluated propentofylline versus placebo. The first trial was published in 1990.198 Two, by the same author, were published in 1996196 and 1998.197 One was published in 1997.195

Design/methodology. The number of subjects in the studies ranged from 30 subjects197, 196 to 260 subjects,195 with a total of 510 subjects. Three of the studies197, 195, 196 earned 5 points out of a possible 8 points on the quality scale, and the other study198 earned 6 points. Only one study indicated the source of funding for the trial,196 and it was industry-supported.

Populations. The trials included subjects with mild to moderate AD only,197 VaD only,196 mild dementia only,198 and mild to moderate combined AD and VaD.195 Two trials presented subgroup analysis: one for AD versus VaD,195 and one based on MMSE baseline score. The mean age in the studies ranged from 64.8 197 to 72.4 years.195

Intervention. All four studies gave 300 mg three times a day for 3 months with the exception of one trial195 which had a duration of 12 months.

Primary outcomes. At least one trial evaluated one outcome in each of the domains examined in this review with the exception of caregiver burden. Baseline MMSE was reported in two trials197, 198 and varied from 20 and 21 for both placebo and treatment groups.

Analysis. One of the studies195 used an ITT analysis.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-phartrdemf21.jpg.

   Figure 21. Weighted Mean Difference (WMD) from the Random Effects Model (Random) for the MMSE change score comparing propentofylline versus placebo

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-phartrdemf22.jpg.

   Figure 22. Weighted Mean Difference (WMD) from the Random Effects Model (Random) for the DSST change score comparing propentofylline versus placebo

Results and interpretation. See Summary Table 22. All four trials evaluated general cognitive function and the pooled estimate can be seen in Figure 21. Two of the trials197, 196 had small sample sizes and these trials had the widest confidence intervals. The test for heterogeneity did not exceed our threshold of 0.10 for significance; the overall summary effect was significant. Figure 22 shows the pooled estimate for the DSST, a measure of specific cognitive function; this pooled estimate should be interpreted with caution as the test for heterogeneity was significant and the overall effect was not significant. Thus, there is some evidence of benefit for general cognitive function, and inconclusive evidence for specific cognitive function as measured by the DSST. Similarly, there is inconclusive evidence for global assessment. Behavior/mood outcomes (using the NAB) were evaluated by a single trial195 and shown to be significantly different; this same trial evaluated quality of life/ADL (using the NAA) and showed no significant difference.

The quality scores for reporting adverse events varied from 1 to 4. The percentage of withdrawals varied from 0 – 13% for the placebo group and 0 – 12% for the treatment group. None of the trials tested for differences between groups. Three of the trials195, 197, 198 reported gastrointestinal events that included abdominal pain, constipation, and nausea and vomiting (placebo = 2%, all doses = 7%). Dizziness (placebo = 3 – 5%, all doses = 1 – 6%) was the only other a priori symptom of interest.

Various other agents. See Evidence Tables 207 through 249 at http://www.ahrq.gov/clinic/epcindex.htm. See Summary Table 23.

Interventions with two studies included for review. Six other agents were compared with placebo in only two included trials:

Choto-san (Evidence Tables 207, 213, 214, 249) Two studies compared Choto-San with placebo in patients with VaD. Both studies included all Asian subjects and co-morbid disorders were present in both studies. Each study lasted 12 week, with doses of drug at 7.5 g three times a day199 and 2.5 g three times a day.200 The studies were published in 1994200 and 1997199 and both had a quality score of 5 out of 8 points. Both studies measured global assessment and behavior/mood and disagreed on both results. One study199 showed a significant change in global assessment and no significant difference in behavior/mood while the other200 showed mixed results for global assessment and a significant difference for behavior/mood.

Cyclandelate (Evidence Tables 207, 215, 216, 249). Cyclandelate was evaluated in two studies: one201 in 139 AD patients and another202 in 196 PDD, VaD, and mixed dementia patients. The mixed population study202 had a subgroup analysis based on the MMSE, ADAS-cog, and treatment center. The AD patients received 400 mg four times per day for 16 weeks and the mixed population study used 800 mg twice a day for 24 weeks. Only caregiver burden was not evaluated by either study and only global assessment was evaluated by both studies. The study with AD patients201 showed a significant change in global assessment and in behavior/mood and mixed results in the specific cognitive function measures. The study with the mixed population202 showed no significant difference in global assessment or general cognitive measures or quality of life/ADL or function.

Ergokryptine (Evidence Tables 207, 212, 218, 249). Two trials, which were not similar, evaluated ergokryptine. One trial203 that did not indicate the source of funding included 125 PDD patients and treated them with a dose titrated up to 2 mg per day for 8 weeks. The other trial,204 which was industry funded, treated 215 AD patients with a dose titrated up to 20 mg twice a day for one year. Neither study reported caregiver burden, but both reported general cognitive function and global assessment. They differed on the results of both of those domains: one study204 showed significant change in cognition and mixed results in global assessment, and the other study203 showing no significant difference and significant change, respectively. Significant change was demonstrated in specific cognitive measures in the study with AD patients.204 Mixed results were shown for behavior/mood outcomes, and no significant difference was seen for quality of life/ADL in the study with PDD patients.203

Hydroxychloroquine/Nimesulide (Evidence Tables 207, 226, 229, 249). No significant difference from placebo was seen in either of two studies205, 206 for cognition, behavior/mood, or quality of life/ADL. Global assessment was evaluated in one of the studies206 and there was no significant difference found. One trial205 included minimal to mild AD patients and the other206 included mild to moderate AD patients One study205 treated patients for 18 months with a dose of 400 or 200 mg per day based on weight. The other study treated for 3 months with 100 mg twice a day. Both studies had non-industry funding, and one study205 also had industry support.

Nimodipine (Evidence Tables 207, 230, 231, 249). Nimodipine was evaluated in one study207 with 259 mild to moderate MID patients receiving 30 mg twice a day for 26 weeks. The other study208 evaluated 178 patients with mild to moderately severe MID and PDD receiving 90 mg per day for 12 weeks. The trials received industry funding or support and were published ten years apart, in 1990208 and in 2000.207 The trial using ITT analysis207 showed no significant difference in the domains of general cognitive function measures, specific cognitive measures, global assessment, and quality of life/ADL. The trial using OC208 analysis found significant differences in the domains of general cognitive function measures, specific cognitive measures, global assessment, and behavior/mood.

ORG2766 (ACTH peptides) (Evidence Tables 207, 233, 234, 249) Org2766 versus placebo was reported for a total of 233 patients with AD or primary degenerative senile dementia (PDSD) in reports from 1985209 and 1986.210, 209 One study209 was industry-supported and used 20 mg twice daily for 6 months, and the other was non-industry-funded210 and used 80 mg twice daily for 1 month. One study210 found a statistical difference between drug and placebo in the domains of specific cognitive function measures and in global assessment, and the other study209 found no significant difference in the domains they evaluated: global assessment and behavior.

Interventions with only one trial included for review. Twenty-two drugs in this drug grouping were compared to placebo in only one included trial. Eleven of these trials showed a significant difference from placebo and are summarized briefly here. See Evidence Tables 207 to 240, 244, 248 and 249 for greater detail concerning the trials.

Drugs compared to placebo in one trial only (Evidence Tables 207, 208, 209, 210, 217, 220, 222, 224, 237, 238, 240, 248, 249) Aniracetam was better for cognition and global assessment in 109 community patients for 6 months, Ateroid211 in 155 PDD, MID or SDAT patients for 12 weeks was better for general cognition. Desferrioxamine212 in 48 probable AD patients for 2 years was better for behavior/mood. Glycosaminoglycan polysulfate213 in 155 moderate to severe PDD or MID patients for 12 weeks was better for behavior/mood. Guanfacine214 in 29 mild to moderate AD or PDD patients for 13 weeks was better for specific cognitive measures and global assessment. Nootropic agent BMY215 in 69 mild to moderate AD patients for 12 weeks was better for general cognitive measures. Thiamine216 in 15 mild to moderate AD patients for 12 months was better for general and specific cognitive function measures. Vincamine217 for 12 weeks in 152 institutionalized patients with mild to moderate PDD or VaD was better for global assessment. Vitamin E135 in 341 moderate AD patients for 2 years was better for delaying institutionalization. Deamino-D-arginine-vasopressin218 in 14 PDD patients was better for behavior and had mixed results for global assessment. Xantinolnicotinate219 in 313 mild to moderate AD or MID patients for 12 weeks was better for specific cognitive function measures and global assessment.

Twelve trials (Evidence Tables 207, 211, 219, 221, 223, 225, 227, 228, 232, 235, 236, 239, 244, 249) found no significant differences from placebo or mixed results when evaluating buflomedil,220 citicoline,101 denbufylline,221 diclofenac and misoprostol,222 hydergine,223 indomethacin,224 monosialotetrahexosylgan,225 N-acetylcysteine,226 nizatidine,227 piracetam,228 prednisone,229 and simvastatin.230

Question 2: Does pharmacotherapy delay cognitive deterioration or delay disease onset of dementia syndromes?

Delay of Onset of Dementia

The concept of “delay onset” was operationalized to imply delay in conversion from a cognitive disturbance state, classified as MCI, CLOND or CIND, to a true dementia state. No studies with this population met the final eligibility criteria, although four trials231, 232, 233, 234 advanced to the full text screening stage. The lack of studies eligible for evaluation in this systematic review points to a gap in the literature for pharmacological interventions (attempting to demonstrate a delay in disease onset) in MCI-type populations.

Delay of Progression

In general, very few studies evaluated patients who were classified as “severe”. Five studies126, 208, 129, 178, 132 had moderate to severe groups of dementia patients, and only one trial reported all three levels163 of the disease spectrum. The interventions evaluated in these trials were estrogen, haloperidol, glycosaminoglycan polysulfate, memantine, and naftidrofuryl. This suggests that there is a bias in the trials eligible in this systematic review towards evaluating mild to moderate disease; this in turn reflects the underlying assumption that the less severe groups are most likely to benefit from drug trials. Since so few studies have evaluated the more severe groups, this assumption may require some empirical justification. Therefore, delay in progression has not been considered in severe patients.

The selected studies used two approaches for showing “delaying disease progression”. The first method for evaluating the potential for a drug to delay disease progression used longer-term follow-up; survival analyses (time to a relevant event) were then used to show differences between the two groups. The second design approach used withdrawal from treatment for a period and continued monitoring of the treatment and placebo groups (to demonstrate a deviation of the treatment group from the natural history as represented by the placebo group). Such designs have been termed withdrawal, active-extension, randomized withdrawal, randomized start, and staggered start.235, 236, 19, 21 From our 186 included studies, we then further selected a subgroup of papers that had the potential to demonstrate delay in disease progression through the use of one of these two designs. Therefore, any eligible trial that employed a survival analysis or a two-period approach, where the pharmacological agent was withdrawn during one of the periods, was selected for further evaluation to answer this question.

Survival Analyses

Two studies135, 61 using survival analyses were identified. In a 2-year study135 that compared placebo to three other groups (selegiline, selegiline with vitamin E, and vitamin E), time to the development of significant dementia milestones (death, institutionalization, loss of ability to perform ADL, or score on scale indicating severe dementia) was used as the time to event in the survival analysis. The results of this study showed that the vitamin E, selegiline, and combined groups were statistically different (i.e. declined less) from the placebo group in analyses that included baseline MMSE score as a covariate (not significant when excluded). The median survival was 230 days (vitamin E), 215 days (selegiline), and 145 days (combined group). Moreover, the vitamin E group showed a statistically significant difference for the endpoint of institutionalization, and the other treatment groups did not. There were no statistical differences between groups with respect to adverse events. Thus, the findings of this study suggest that selegiline and vitamin E may delay clinically important deterioration in patients with moderately severe AD; this delay varied from 20 to 32 weeks. The second study61 used survival analyses to evaluated the time to the development of severe functional impairments in a comparison of placebo and donepezil with a follow-up of 54 weeks. The results of the Kaplan-Meier analysis showed a mean number of days to significant functional decline of 252 days for placebo and 357 days for the donepezil group (mean difference of 100 days). The treatment group was 38% less likely to decline over a 1-year period. Both these studies demonstrated some delay in disease progress varying from 100 to 230 days for these three different pharmacological agents.

Staggered Withdrawal

Delay in disease progression can also be evaluated using a “time to return to baseline” following withdrawal of treatment. Similarly, staggering the start of the treatment parallels the staggered withdrawal and can be used to evaluate disease progression. In this design approach, the time to return to baseline is compared to the placebo group, which represents the natural course of the disease. Of the studies that were eligible for this research question used a classic withdrawal design (withdrawal in period II after the intervention was administered); none of these studies were able to maintain double blinding after the withdrawal of the intervention. Justification for the selection of the length of the washout or follow-up period was not consistently provided (which possibly reflects the lack of a priori aim to show delay in progression).

Table 5. Studies that withdrew the treatment agent but maintained at least single blinding
StudyDrugScheduleResult
Ruether 2001Cerebrolysin4w drug + 8w washout + 4w drug + 12w washoutAll patients:
ADAS-noncog, maintained difference from placebo
NAI returned to baseline
Subgroup MMSE<20:
ADAS Noncog, CGI, ADAS-cog, SKT maintained difference from placebo
Nyth 1990Citalopram4w drug + 8 w open drug + 4w new random drugNR
Rogers 1996Donepezil12w drug + 2w SB Pl washout5 mg maintained effect, 3 mg no maintenance of effect for ADAS-cog (NS)
Rogers 1998bDonepezil24w drug + 6w SB and placebo washoutReturn to placebo levels for ADAS-cog, MMSE, CIBIC (all NS)
Wilcock 2002Memantine2w SB and placebo + 28w drug + 2w SB placebo washoutNR
McKeith 2000Rivastigmine20w drug + 3w restReturn to placebo levels for NPI and computerized cognitive assessment (NS)
Antuono 1995Velnacrine2w SB placebo + 24w drug + 6w SB placebo washoutReturn to placebo levels for the ADAS-cog but SC for CGIC remained for washout
Bodick 1997Xanomeline24w drug + 4 w SB placeboSC at week 24 with CNTB
No differences vs. placebo at w4 of washout
Table 6. Studies that withdrew treatment and did not specify if blinding for washout or extension was maintained
StudyDrugScheduleResult
Dehlin 1985Alaproclate2w placebo + 4w drug + 2w placeboSC for GBS intellectual subscale at w4 of treatment
No significant difference at w2 of washout
Cutler 1993BMY 21,50212w drug + 4w placebo washoutTreatment showed no significant change and follow-up showed no change
Amaducci 1988Phosphatidylserine3m drug + 21 m follow-upSC remained for severe disease patients, not moderate
Raskind 1997Metrifonate26w drug + 8w follow-upNR
Parnetti 1995Posatirelin90d IM + 30d follow-up placebo“Maintained positive effect” but specific numbers not reported
Agid 1998Rivastigmine10w drug + 2w placebo washoutNR
Tables 5 and 6 detail any study that attempted to withdraw the drug in the treatment group and then continue observations over time. All studies that reported outcomes after the drug trial endpoint subsequently interrupted protocol and switched to “open-label” circumstances. In open-label conditions, blinding was broken and greater proportions of patients withdrew from the study as the follow-up increased. From a methodological perspective, these data were considered to be biased and would not meet our review eligibility criteria. However, we summarize in these Tables the same observations that were reported in all the studies eligible for this systematic review.

In Table 5, single blinding was maintained in a placebo-controlled trial of cerebrolysin, which had an 8- and 12-week follow-up and showed continuing statistical differences after drug withdrawal. The remaining drug interventions listed in Table 5 suggest that the treatment provided predominately symptomatic relief lasting 2 to 6 weeks and then returning to placebo levels. Similarly, the pharmacological agents in Table 6 suggest that treatment provided only symptomatic relief.

Question 3: Are certain drugs, including alternative medicines (non-pharmaceutical), more effective than others?

From a methodological perspective, addressing the question of being “more effective” requires head to head comparisons of pharmacological interventions. If one intervention (Drug A) has been shown to be effective relative to placebo of a specified effect size, and a second intervention (Drug B) has been shown to be effective at a lower magnitude relative to placebo, it does not necessarily follow that Drug A is more effective than Drug B. Comparisons of the relative effectiveness of certain drugs can only be evaluated in the context of head to head evaluation within the same trial. Those studies undertaken as direct comparisons are summarized below.

Head to Head Comparisons

A total of 26125, 152, 129, 237, 130, 238, 239, 145, 150, 124, 114, 131, 127, 240, 241, 135, 242, 101, 243, 70, 194, 128, 244, 26, 92, 245 studies compared efficacy of two or more pharmacological agents relative to each other. In general, few drugs showed statistically significant differences relative to each other. Those that did include the following (drug performing better is listed first):

  1. Sulphomucopolysaccharides versus CDP-choline238 - Significant differences were seen in favor of sulphomucopolysaccharides in measures of behavior and global assessment in 30 institutionalized patients with mild to moderate MID.

  2. Donepezil and vitamin E70 - Significant differences were seen in favor of donepezil in general cognitive function in 54 patients with mild AD.

  3. Antagonic stress versus nicergoline92 - Significant differences were seen in favor of antagonic stress in cognition as well as a global assessments in 62 subjects with mild to moderate AD.

  4. Antagonic stress versus meclofenate242 - Significant differences were seen in favor of antagonic stress in measures of cognition and global assessment in 63 patients with mild to moderate AD.

  5. Posatirelin versus citicoline101 - Significant differences were seen in favor of posatirelin in general cognitive measure and mood in 222 community living patients with mild to moderate AD.

  6. Pyritinol versus hydergine243 - A significant difference was found in favor of pyritinol in a global assessment measure in 102 Hispanic patients with mild to moderate AD.

  7. Idebenone26 versus tacrine-Mixed results were observed; the Efficacy Index Score showing a significant benefit over tacrine, while the global assessment showed no difference in 203 AD patients, 44 of whom completed the study.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-phartrdemf23.jpg.

   Figure 23. Weighted Mean Difference (WMD) from the Random Effects Model (Random) for the ADAS-cog comparing donepezil versus placebo

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-phartrdemf24.jpg.

   Figure 24. Weighted Mean Difference (WMD) from the Random Effects Model (Random) for the ADAS-cog comparing galantamine versus placebo

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-phartrdemf25.jpg.

   Figure 25. Weighted Mean Difference (WMD) from the Random Effects Model (Random) for the ADAS-cog comparing rivastigmine versus placebo

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-phartrdemf26.jpg.

   Figure 26. Weighted Mean Difference (WMD) from the Fixed Effects Model (Fixed) for the ADAS-cog comparing tacrine versus placebo

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-phartrdemf27.jpg.

   Figure 27. Relative Risk (RR) from the Random Effects Model (Random) for the CIBIC comparing donepezil versus placebo

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-phartrdemf28.jpg.

   Figure 28. Relative Risk (RR) from the Random Effects Model (Random) for the CIBIC comparing galantamine versus placebo

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-phartrdemf29.jpg.

   Figure 29. Relative comparison of effect sizes for studies using the CIBIC rivastigmine versus placebo

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-phartrdemf30.jpg.

   Figure 30. Relative comparison of effect sizes for studies using the CIBIC comparing tacrine versus placebo

Relative comparisons of FDA approved drugs for the treatment of dementia. Although no head to head trials compared drugs that are likely to be used in current practice in the United States, it was recognized that an assessment of the relative effectiveness of those drugs approved for the treatment of dementia would be of interest to clinicians. Four drug interventions that are currently approved for the treatment of dementia include donepezil, galantamine, rivastigmine, and tacrine. We caution the reader that inferences drawn from the following figures are limited because these FDA-approved drugs were not compared within the same study. The evidence for benefits and harms has been previously discussed in this report. The pooled estimates (WMD and RR) of two outcomes (ADAS-cog, CIBIC) frequently used in clinical practice have been presented together to illustrate the relative benefit of these approved drugs (Figures 23 to 30). For the purposes of this relative comparison, the pooled estimate reflecting the largest effect size (i.e. the dose showing the greatest magnitude) was selected. Several relevant details should be noted before comparing these estimates as follows: 1) the 5 mg dose of donepezil was selected because the magnitude of the pooled estimate was largest, 2) the 32 mg dose of galantamine had the largest pooled estimate, 3) the rivastigmine pooled estimate for the ADAS-cog was significant for heterogeneity, so the pooled estimate should be considered with great caution, and 4) none of the studies that evaluated tacrine and measured the CIBIC reported sufficient data to estimate an effect size; hence the effect size of the CGIC was substituted for comparison.

Question 4: Do certain patient populations benefit more from pharmacotherapy than others?

The following studies contained stratified analyses of outcomes for different clinical populations. A total of 22245, 211, 146, 179, 181, 132, 134, 142, 183, 185, 56, 168, 201, 63, 64, 55, 173, 62, 76, 75, 60, 159 studies addressed this question. During data abstraction, these trials were identified if the methods sections (including analyses) stated that stratified analyses were undertaken. Eight different variables were identified for which stratified analyses were reported. These included: age, gender, APOE genotype, disease type, disease severity (as determined by MMSE/ ADAS-cog threshold levels), treatment center, care dependence, and presence of depression. Of these 22 studies, seven trials245, 211, 146, 179, 181, 132, 134 evaluated disease type (AD, PDD, SDAT, MID, VaD). They will be discussed in Question 5 (see below).

Table 7. Studies with stratified analyses
CITATIONDRUGSUBGROUPDRUG EFFECT
Alvarez 2000AnapsosDisease severitySC in ADAS-cog in patients with mild cognitive deterioration and with AD
NS in patients with VD
Gutzmann 1998IdebenoneDisease severityNR
Weyer 1997IdebenoneDisease severitySC for ADAS Total
ADAS total score ≥ 20
Sano 1992CarnitineMMSELow mMMSE group SC on the SRT and CSF levels of drug
High mMMSE group NS neuropsychological test scores, CGI ratings and CSF levels of drug
Ruether 2001CerebrolysinMMSESubgroup MMSE < 20: SC in CGI, ADAS-cog, NAI and ADAS-Noncog. Suggests it's because this group had reduced placebo response.
Schellenberg 1997CyclandelateMMSE, ADAS-cog, Treatment centerNR
Feldman 2001DonepezilMMSE Psychoactive drug useNR
Tariot 2001aDonepezilMMSE (10–26) AgeMMSE group: SC greater differences than for the whole group for MMSE, GDR
Older patients group: SC for MMSE, CDR
Thal 1996aCarnitineAgeSC age-by-treatment interaction on the ADAS-cog ITT population
Patients ≤ 65 years significant difference in decline for ADAS-cog favoring Carnitine but not for CDR
Patients > 65 years NS
Panisset 2002CerebrolysinAPOE genotypeNS association of the APOE e4 status and response to study drug
Winblad 2001bDonepezilAPOE genotype GenderNS difference for the subgroups
Raskind 2000GalantamineAPOE genotypeNS
Wilcock 2000GalantamineAPOE genotypeNS for APOE group
MMSESC for MMSE < 18
Prasher 2002DonepezilDown syndrome ONLY in trialNR
Reifler 1989ImipramineDepressionDepressed patients SC higher HAM-D scale score. For MMSE patients with AD + depression had higher scores initially and improved significantly more over time

SC = Significant change

NS = Not statistically significant

NR = Not reported

Table 7 details the 15142, 26, 185, 56, 246, 201, 63, 64, 55, 173, 62, 76, 75, 60, 159 studies that provided stratified analyses other than for disease type. For disease severity, no clear pattern emerges. For APOE, no significant difference was noted between groups173, 62, 76, 75 for the three interventions that included cerebrolysin, donepezil, and galantamine. For age, Thal et al.55 conducted a post-hoc analysis to assess the effect of age on the rate of decline. Patients were categorized according to age (< 65 years, 65 years and older). The results of the study indicate that a subgroup of patients, aged 65 years or younger, may benefit more from carnitine as compared to older subjects. Specifically, in the younger population, the significant difference between the treatment and the placebo group was observed for ADAS-cog but not for CDR.

In general, very few studies examined the efficacy of drugs with respect to dementia by population characteristics. Three additional studies attempted to evaluate unique populations or population characteristics. Prasher et al.60 evaluated subjects who had Down's Syndrome with dementia and were treated with donepezil, and found none of the outcomes to be significant; this study had a sample size of 30 subjects and was underpowered. Ban et al.213 conducted a multicenter, placebo-controlled, double-blind study with Hispanic and Italian populations. This study was not designed to specifically evaluate the efficacy of glycosaminoglycan polysulfate by ethnicity. However, the study included centers from Mexico, Panama, Naples, and Trieste. This study examined whether the changes encountered in the different outcome measures could be related to center effect, but no statistically significant center effect was found. While this study suggests that ethnicity may have minimal impact, future studies should specifically assess the impact of racial composition on the efficacy of drugs.

Question 5: What is the evidence for the treatment of VaD?

Summary Table 25 details the results of studies in which patients had VaD, or stratified data were presented with respect to VaD subgroups identified as VaD or MID. The trial details for all these studies are provided in evidence tables of key study characteristics, evidence tables of study results, and evidence tables of adverse events found in Appendix C; summary results of trials were also discussed in the results sections of Question 1.

A total of 20 pharmacological interventions in 29 studies211, 220, 238, 171, 200, 199, 146, 68, 181, 184, 133, 134, 132, 161, 89, 91, 93, 247, 187, 191, 192, 194, 193, 100, 98, 196, 195, 245, 217 were applied specifically to dementias classified as VaD. Sixteen studies evaluated populations entirely composed of patients with VaD (or MID), and the remaining 13 trials had VaD as a subgroup. The majority of these pharmacological interventions (n = 14) were represented by a single trial, limiting the extent of the evidence; these included ateroid, buflomedil, cerebrolysin, sulphomucopolysaccharides (CDP choline), citalopram, donepezil, Ginkgo biloba, idebenone, minaprine, nimodipine, nicergoline, oxiracetam, 5-THF (trazodone), vincamine, and xantinolnicotinate. Surprisingly, four of these trials did not report any results relative to placebo, and these included buflomedil, Ginkgo biloba, oxiracetam, and 5-THF (trazodone); all but one of these trials220 evaluated subgroups of VaD patients and likely did not posses sufficient power to evaluate differences. Six interventions had more than a single trial, and these included Choto-san (n = 2), memantine (n = 3), nicergoline (n = 2), pentoxifylline (n = 4), posatirelin (n = 2), and propentofylline (n = 2).

Several of the trials with sample sizes greater than 100 subjects showed significant differences in general cognitive function: ateroid, cerebrolysin, donepezil, idebenone, and nicergoline. Similarly, these larger sample studies showed statistical differences for global assessment: Choto-san, donepezil, memantine, nicergoline, propentofylline, vincamine, and xantinolnicotinate. Findings for other outcome domains were inconclusive, as these were rarely evaluated (see Summary Table 25).

Table 8. Studies evaluating vascular dementia patients relative to other dementias
CITATIONDRUGSUBGROUPDRUG EFFECT
Passeri 19935′-MTHF Trazodone (TRZ)AD vs. MIDEquivalence study
When patients with AD were analyzed separately the same pattern of response to MTHF and TRZ was found in the HDRS and RVM as when they were analyzed together with patients with MID.
MID as separate group: HDRS was significantly reduced vs. baseline after 8 weeks of treatment in the TRZ group and only at the end of the follow-up period in the MTHF group. RVM remained unchanged in MID pts in both treatment groups.
Ban 1991bAteroidPDD vs. MIDNR
Nyth 1990CitalopramAD/SDAT vs. VaDA period:
No improvement in the VaD group
SC in the AD/SDAT group in emotional bluntness, confusion, irritability, anxiety, fear-panic, depressed mood, and restlessness. MADRS scores significantly reduced
B period:
AD/SDAT group SC in emotional bluntness at week 8. NS at week 4 and 12.
NS for the VaD group.
LeBars 1997Ginkgo bilobaAD vs. MID+AD MMSEAD subgroup: SC for ADAS-cog and GERRI
Kanowski 1996Ginkgo bilobaDAT vs. MIDImprovements at 24 weeks of treatment in comparison to baseline values were consistently slightly greater in the DAT group than in the MID group. Calculation of descriptive p-values seemed inappropriate due to the very small number of patients with MID in the sample.
Winblad 1999MemantineAD/VaD Care dependenceNR for differences between dementia types
Care dependence: Patients with < 20 points on the CGI and BGP
Care dependence subscore shows slightly higher response rates than those with >20 points in the memantine group.
Wilcock 2002MinaprineSDAT vs. MIDThe largest treatment effect occurred in patients with baseline MMSE score < 15 (p = 0.04) and in those without cerebrovascular macro-lesions (p = 0.002)

SC = Significant change

NS = Not statistically significant

NR = Not reported

Table 8 below lists the studies that undertook comparisons between VaD populations and other dementia types. Although, not consistent across all trials, three of the studies suggests possible differences between 1) MID and AD for 5′-MTHF-trazodone,245 2) AD/SDAT and VaD for citalopram,146 and 3) DAT and MID for Ginkgo biloba.181

Summary Evidence Tables

Chapter 4. Discussion

This systematic review was undertaken primarily to evaluate the efficacy of pharmacological agents in the treatment of dementia. The studies were limited to parallel design RCTs with quality scores greater than 3 on the Jadad scale. The interventions were not limited to those currently on label by the FDA; it was of interest to cast a wide net and capture reports of pharmacological agents that are used off-label for the treatment of dementia. Since a variety of agents with different therapeutic effects were evaluated, the outcomes were not restricted to a specific subset of all available outcomes used in the dementia literature. The psychometric properties of some of the most commonly used outcomes have been critically appraised and found to be limited. Moreover, there is no current consensus as to which domains, and the outcomes within these, that best reflect clinically important change.

Strength of the Evidence

The studies eligible for review in this dementia report represent the highest form of evidence. This strongly suggests that these trials are more likely to be “well-designed, well conducted studies in representative populations that assess the effects of health outcomes”.53 The high quality scores also indicate that the studies evaluated in this systematic review have a relatively high level of internal validity. The characteristics of the population and the interventions were detailed to assist the reader in evaluating the degree of external validity. Similarly, attempts were made to highlight “consistency” in the evidence as well as the quantity of evidence and the magnitude of the reported changes.

Although, there is greater understanding on evaluating the evidence for the “benefits” of therapies, there is less clarity on determining the potential for harms from pharmacological interventions for treating dementia. With respect to adverse events and the potential for serious harms, greater variability in systematic collection and reporting of these were observed in the dementia pharmacological literature. Evaluation of the potential for harm is considered with three main points: 1) the most frequently reported adverse events across studies for a specific drug, 2) the overall withdrawal rate due to adverse events for both the control and treatment groups, and 3) the range of frequencies reported for a subset of symptoms (nausea, diarrhea, dizziness, agitation, eating disorder) selected a priori and evaluated for all pharmacological interventions.

At present there is no coherent framework that captures the disease processes present in dementia patients for the range of outcomes evaluated in this systematic review. This report details the highest evidence from both a design and internal validity perspective. It is our view that determining the clinical relevance (external validity) of such high-quality evidence must ultimately be reached by consensus amongst multidisciplinary experts within the decision-making body that will use this evidence for such purposes as developing practice guidelines.

Question 1: Does pharmacotherapy for dementia syndromes improve cognitive symptoms and outcomes?

Summary of the Systematic Review Results

A total of 97 interventions in 186 studies were eligible for evaluation in this systematic review and were distributed as follows:

  • A total of 16 different cholinergic neurotransmitter modifying pharmacological agents in 72 studies.

  • A total of 35 non-cholinergic neurotransmitter/neuropeptide modifying agents in 61 studies.

  • A total of 46 other pharmacological agents in 76 studies*.

  • two studies compared two NCNMAs with an OTHER.

  • two studies compared a CNMA with an OTHER.

  • one study compared a CNMA with two OTHERS.

  • two studies compared an NCNMA with an OTHER.

The evidence for all these pharmacological agents was presented in great detail in Chapter 3 and in Evidence Tables of Key Study Characteristics, Tables of Study Results, and Tables of Study Adverse Events contained in Appendix C. Conclusions regarding those pharmacological agents that had a minimum of three trials are summarized here. The summary of the pharmacological agents that had fewer than three trials can be found in Chapter 3.

Summary of Cholinergic Neurotransmitter Modifying Agents

Carnitine. Six trials evaluated carnitine in 925 subjects with mild to moderate severity, recruited predominately from the community at doses of 2 to 3 g for either 24 or 52 weeks. Evidence of benefit is conflicting for the domains of cognition. Most studies were not statistically significant and the lack of sufficient power may have been an important factor. Similarly, no significant differences were found in the domains of global assessment, behavior/mood, and quality of life/ADL; power could not be evaluated for the majority of these outcomes.

Four of the six studies scored 3 for quality on reporting adverse events. Withdrawal rates due to adverse events varied from 0 – 3% (excluding results from one outlier trial248), and gastrointestinal symptoms were the most frequently reported types of adverse events. The percent of subjects reporting the a priori symptoms of interest across all studies were as follows: 1) nausea (placebo = 6 – 14%, all doses carnitine = 28%), and 2) agitation (placebo = 6%, all doses carnitine = 7%). Dizziness, diarrhea, or eating disorder were not reported by any study. No serious adverse events requiring hospitalization and associated with carnitine were reported.

Donepezil. Ten trials in 3239 subjects evaluated the efficacy of donepezil compared to placebo, and one trial compared donepezil to a group given vitamin E. The majority of studies (n = 8) evaluated AD patients, for which half were recruited from the community (other studies did not specify). The subjects had predominately mild to moderate disease and doses of 5 or 10 mg were used with varying duration from 12 to 56 weeks.

There is consistent evidence of benefit in the domains of general cognitive function and global assessment. The combined effect sizes for the ADAS-cog and the CIBIC were estimated. Evidence is inconsistent for a dose response in these domains based on the three studies that evaluated two different doses (5 and 10 mg); the benefit was of similar magnitude for both dose groups for global assessment outcomes. Similarly, two of the three studies that evaluated behavior/mood outcomes (NPI) showed no statistically significant changes relative to placebo; these trials lacked sufficient power to detect a difference. There is some evidence of benefit in ADL outcomes, although this outcome domain was evaluated with a variety of instruments. Caregiver burden outcomes was evaluated in a single study that did not report the findings for this domain.

Adverse events quality scores were 3 or greater for the majority of studies (n=7). Four trials provided evidence of a dose response for adverse events. One study showed a statistical difference for balance-related problems and asthenia (neurological fatigue) between placebo and treatment groups. Withdrawal due to adverse events ranged from 0 – 18% for treatment groups and 0 – 11% for placebo. Four out of six studies testing differences between groups were statistically significant for diarrhea (placebo = 3 – 21%, all doses donepezil = 0 – 38%), nausea and vomiting (placebo = 4 – 9%, all doses donepezil = 4 – 25%). The other a priori symptom reported was agitation and frequencies for placebo varied from 0 – 8% and for all doses from 3 – 19%; but these were not shown to be statistically different.

Galantamine. Six trials in 3530 subjects evaluated the efficacy of galantamine compared to placebo. Doses of 24 and 32 mg were evaluated in half of these studies. Five studies evaluated AD patients and there was limited information regarding whether the subjects were from the general community or institutional settings. All studies recruited subjects with mild to moderate disease and the drug was administered with varying duration of 3 or 6 months.

Evidence of benefit is consistent in the domains of general cognitive function, global assessment, quality of life/ADL. Two of the three studies that evaluated, behavior/mood found statistically significant differences. A small dose effect was evident in the ADL domain when comparing the pooled estimates of the DAD; no dose effect was observed for outcomes in the global assessment domain, and dose effect could not be evaluated for the general cognition domain. The caregiver burden domain was not evaluated in any trials.

Five of the six trials scored 3 out of 5 on our quality scale for rating adverse events. Withdrawal rates due to adverse events ranged from 4 – 9% for placebo and 8 – 27% for the treatment group. One study showed a dose response for adverse events. Although, most trials did not report testing for differences between groups, two trials reported a statistically significant difference in weight loss with the treatment group having more than the placebo group. The most common types of adverse events reported were gastrointestinal symptoms (nausea and vomiting, diarrhea), eating disorders/weight loss, and dizziness. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea and vomiting (placebo = 3 – 13%, all doses = 6 – 44%), 2) dizziness (placebo = 3 – 11%, all doses = 4 – 19%), 3) diarrhea (placebo = 2 – 10%, all doses = 4 – 19%), 4) agitation (placebo = 1 – 9%, all doses = 6 – 15%), and 5) eating disorder (placebo = 0 – 6%, all doses = 4 – 20%).

Metrifonate. Nine studies compared metrifonate to placebo in 2759 subjects with mild to moderate AD (likely from community settings as the majority of studies did not specify this). Metrifonate dosages evaluated varied from 50 to 80 mg, and study duration ranged from 21 days to 26 weeks duration.

All but one study showed metrifonate to have a consistent positive effect on measures of general cognitive function; none of the studies evaluated any specific cognitive function measures. The effects on global assessment were less consistent, but suggested a positive effect in four of the eight studies that reported this outcome. Evidence for effect in the domains of behavior/mood and quality of life/ADL were not significant in the majority of studies that evaluated these domains, however these were primarily evaluated as secondary outcomes and likely lacked sufficient power.

With the exception of a single study, quality scores for reporting adverse events were greater than 3. However, only one trial83tested for differences between groups and found nausea and vomiting, diarrhea, and muscle and joint disorder to be significantly different. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea and vomiting (placebo = 3 – 14%, all doses = 2 – 50%), 2) dizziness (placebo = 1%, all doses = 3 – 4%), 3) diarrhea (placebo = 4 – 14%, all doses = 11 – 19%), 4) agitation (placebo = 2 – 14%, all doses = 8 – 33%), and none reported eating disorder as an adverse event. Withdrawal due to adverse events varied from 0 – 9% for placebo and 0 – 12% for the treatment group. Overall, it was difficult to determine which types of adverse events reported had the potential to cause serious harm. This is noteworthy as metrifonate has been withdrawn from use in North America, and Bayer has suspended Phase III trials,87 because some patients in clinical trials have experienced serious muscle weakness . This decision was based on the results of an experimental study showing risk of respiratory paralysis with the use of metrifonate. Other adverse events of concern included severe leg cramps, dyspepsia, and bradycardia. None of the studies we reviewed indicated that if present, these events differed significantly between groups. It is not clear if this inconsistency is a function of the methods used to collect and report adverse events or a limitation of RCTs as a source of detecting serious adverse events when incidence is low.

Nicergoline. Four trials in 705 subjects compared nicergoline to placebo and one trial compared it to antagonic-stress in mixed populations that included AD, MID, PDD, VaD, mixed dementia, and SDAT, which were classified as mild to moderate in severity.

All placebo-controlled trials found a positive effect for general cognitive outcomes, but half the results were based on OC analyses. The evidence was mixed for benefit in the domain of global assessments. No significant differences were found for behavior/mood, and quality of life/ADL outcomes, but these were evaluated in few studies and as secondary outcomes (suggesting that sufficient power was an issue).

Quality scores for reporting adverse events varied from 2 to 5 for these four trials, and none tested for differences between groups. Withdrawal due to adverse events varied from 0 – 8% for placebo and 0 – 9% for the treatment group. With the exception of headache, which was reported in all four trials, it was difficult to determine which types of adverse events most characterized exposure to this pharmacological agent. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 3%, all doses = 3%), 2) dizziness (placebo = 1–2%, all doses = 0% or not reported), 3) diarrhea (placebo = 2 – 6%, all doses = 2 – 4%), 4) agitation (placebo = 5%, all doses = not reported), and none reported eating disorder as an adverse event.

Physostigmine. Four studies of 1198 subjects with mild to moderate AD evaluated physostigmine administered in patch and oral form (30 to 60 mg dose) for study duration varying from 6 to 24 weeks. All subjects were recruited from the community.

There is evidence that physostigmine has a statistically significant effect on general cognitive function, as three of the four studies showed improvement. Evidence for an effect on global function was mixed with no consistent benefit. Similarly, for quality of life/ADL outcomes, all three studies that evaluated this domain were not statistically significant but these were secondary outcomes and may reflect a lack of power. Behavior/mood and caregiver burden were not tested in these trials.

The quality scores for reporting adverse events were generally low, scoring 1 or 2 out of 5. Withdrawal rates due to adverse events varied from 1 – 5% for placebo and 12 – 55% in the treatment group, with one study not reporting rates. The high withdrawal rates were in studies with sample sizes that varied from181 to 475 subjects. A single study tested for differences between groups, and found that dizziness, tremor, weight loss, asthenia, confusion, delirium, and respiratory problems were significantly different. The cluster of reported types of adverse events suggests that gastrointestinal problems (abdominal pain, diarrhea) (placebo = 1 – 9%, all doses = 13 – 28%), nausea and vomiting (placebo = 1 – 9%, all doses = 9 – 75%) and eating disorder (placebo = 2 – 6%, all doses = 5 – 16%) were most frequently reported. Dizziness (placebo = 4 – 13%, all doses = 11 – 38%) and agitation (placebo = 6 – 16%, all doses = 4 – 8%) were also reported.

Posatirelin. Four trials evaluated posatirelin in 931 subjects in a variety of mild to moderate dementia populations (AD, PDD, VaD) using a dose of 10 mg per day over 3 months duration.

Three of the four trials showed statistical significance for general cognitive function and quality of life/ADL (as measured by GBS subscales for these domains). The evidence remains inconsistent for benefit in global assessment (evaluated in only one trial) and behavior/mood (mixed results). Caregiver burden and specific cognitive function were not evaluated in any trial.

Quality scores for reporting adverse events varied from 2 to 4. Withdrawal rates due to adverse events ranged from 0 – 3% in placebo and 0 – 4% in the treatment group. None of the studies tested for significant differences between groups. All studies reported the presence of agitation, and at least three studies reported arrhythmia, nausea/vomiting, headache, rash/skin disorder, and sleep disorder. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 3%, all doses = 1 – 4%), 2) dizziness (placebo = not reported, all doses = 1%), 3) diarrhea (placebo = 2%, all doses = 2%), 4) agitation (placebo = 1 – 5%, all doses = 1 – 5%), and none reported eating disorder as an adverse event.

Rivastigmine. Six studies evaluated 2071 subjects and three of these studies were limited to AD patients only. Doses for rivastigmine varied from 1 to 12 mg, and treatment ranged from 14 to 26 weeks and only one study specified a community sample.

The evidence shows that general cognitive function improves with rivastigmine at a dose of 12 mg, but there is mixed results for efficacy at lower doses. Two trials also evaluated specific cognitive function but the results were not consistent within studies (between general and specific measures) and between studies for these domains. There is consistent evidence of benefit for the outcome of global assessment but the dosage at which this is significant varies highly between studies. In the domains of behavior/mood and quality of life/ADL, the findings were not statistically significant nor consistent; most of these analyses were not based on intention to treat analysis and lack of sufficient power cannot be ruled out. Caregiver burden outcomes were not evaluated by any trial.

Quality scores for reporting adverse events varied from 2 to 5. Withdrawal rates due to adverse events ranged from 4 – 11% in the placebo and 11 – 27% in the treatment group. Two trials demonstrated a dose response; however, one of these trials showed significant differences for nausea and vomiting only, and the other trial showed significant difference for all the adverse events reported. The majority of studies reported dizziness, nausea and vomiting, eating disorder/weight loss, and headache. It should be noted that one study allowed intentional prescribed anti-emetic drugs to increase the tolerance of subjects taking rivastigmine. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 3 – 10%, all doses = 8 – 58%), 2) dizziness (placebo = 0 – 7%, all doses = 6 – 20 %), 3) diarrhea (placebo = 2 – 9%, all doses = 7 – 17%), 4) eating disorder (placebo = 4 – 8%, all doses = 4 – 19%), and 5) agitation was not reported.

Tacrine. Six studies108, 109, 110, 111, 112, 113 evaluated tacrine in 994 subjects predominately with mild to moderate AD at doses of 80 to 160 mg lasting either 12 – 13 or 3 – 36 weeks in duration. Two other studies114, 26 involving 425 patients were non-placebo controlled studies.

A single trial108 was found to show benefit for general cognitive function with a small effect and this was based on a series of related publications. The five trials showing no benefit for general cognitive function comprised small sample sizes and much shorter study duration. Overall, the evidence for benefit for general cognitive function is limited to this single trial. There is evidence for benefit in global function from two of the three trials that evaluated this domain. Changes in behavior/mood, quality of life/ADL domains, specific cognitive function, and caregiver burden were all not significant, but lack of sufficient power cannot be ruled out.

The quality scores for reporting adverse events varied from 1 to 3. The proportion of subjects withdrawing due to adverse events ranged from 0 – 12% for placebo and 0 – 55% in the treatment group. The higher rates of withdrawal were associated with higher doses. Elevated alanine transaminase (ALT) or hepatic abnormality (placebo = 4 – 13%, all doses tacrine = 7 – 67%) was reported in six studies, raising concerns for the potential for serious liver damage. None of these trials tested for differences between treatment and placebo with respect to adverse events. Five of the studies reported nausea and vomiting (placebo = 0 – 9%, all doses = 9 – 37%); gastrointestinal problems; dizziness (placebo = 0 – 16%, all doses = 4 – 14%) was also noted in several studies. Frequencies of other a priori symptoms of interest are as follows: 1) agitation (placebo = 5 – 12%, all doses = 5 – 9%), and 2) diarrhea (placebo = 0 – 13%, all doses = 4 – 18%). There is evidence for the potential for serious adverse events associated with liver function in six trials.

Velnacrine. Three studies evaluated the effects of velnacrine in 774 AD patients with a diagnosis of AD. The doses that were shown to effect significant changes were 75 mg twice daily and 225 mg daily in studies with a 15 and 24 week duration. Location of recruitment was not specified.

Statistically significant effects were observed for general cognitive function, and global assessment in the two studies with sample sizes over 300 subjects. Behavior/mood and caregiver burden showed some benefit in one trial116 at the highest dose only. Quality of life/ADL was tested as a secondary outcome and showed mixed findings.

Quality scores for reporting adverse events were 3 for all studies. Withdrawal rates varied from 0 – 22% for the placebo group and 5 – 33% for the treatment group. None of the studies reported a dose response. None of the studies tested for statistical differences between the placebo and treatment groups. Two studies reported aberrant hematology and hepatic abnormality;116, 117 for these two studies the rates of occurrence were 2 – 21% for placebo, and 32 – 40% for all doses. All studies reported diarrhea and nausea and vomiting. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 0 – 4%, all doses = 3 – 8%), 2) dizziness (placebo = 3%, all doses = 0 – 8%), 3) diarrhea (placebo = 3%, all doses = 2 – 33%), 4) agitation (placebo = 4%, all doses = 1 – 4%), and 5) eating disorder (placebo = 1%, all doses 2 – 4%). The potential for serious liver effects was not well specified in these trials.

Summary of Non-cholinergic Neurotransmitter/Neuropeptide Modifying Agents

Haloperidol. Five studies evaluated the effect of haloperidol relative to placebo in a total of 622 subjects with mild to moderate disease and included AD patients124, 125, 128 and mixed populations (MID/VaD/ PDD).126, 127 One trial128 had only 15 patients, and one trial124 lasted only three weeks. Two studies recruited subjects from institutions; one from the community and two did not specify.

Mixed results were observed for improvement in global assessment. In three of the trials there was benefit in the domain of behavior/mood which reached statistical significance. Two trials evaluated caregiver burden and found no statistically significant differences but lack of sufficient power cannot be ruled out. Few studies evaluated outcomes in quality of life/ADL. Haloperidol did not affect general cognitive function in two trials and was not evaluated in the other studies.

The quality scores for reporting adverse events varied from 1 to 5 and only three of five studies reported withdrawal rates; the proportion of subjects withdrawing due to adverse events ranged from 5% to 17% for placebo and 17 – 33% in the treatment group. One trial124 showed a dose-response effect but the study only lasted for three weeks. Three trials tested for differences between treatment and placebo with respect to extra pyramidal symptoms (placebo = 17 – 32%, all does = 34 – 97%), and two found statistically significant differences.124, 125 One study125 found significant differences between groups for balance-related problems. Although reported by only two trials, the range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 3%, all doses = not reported, and 2) dizziness (placebo = 24%, all doses = 21%). No frequencies were reported for agitation, diarrhea, or eating disorder.

Memantine. Three trials evaluated memantine in 1066 patients, primarily with VaD, with 10 or 20 mg doses lasting 12 or 28 weeks. Disease severity was moderate to severe in a single study132 and mild to moderate in the remaining two studies133, 134. One study included patients that were institutionalized, one from the community and the third did not specify.

There is consistent evidence of benefit for general cognitive function in the two studies that evaluated this domain. The findings for global assessment are mixed. The sole trial that evaluated mixed dementia populations (including some VaD) with moderate to severe dementia found significant differences for global function, behavior/mood, and quality of life/ADL outcomes, but did not evaluate general cognitive function. It should be noted that this trial with mixed populations used half the dose of memantine for half the study duration in patients with greater disease severity, and had approximately half the sample size of the other two trials evaluated in this systematic review. Despite a lower dose, a smaller number of more severely affected patients, and a shorter duration, a statistically significant difference was found.

The quality scores for reporting adverse events varied from 3 to 4. Only two of three studies reported withdrawal rates; the proportion of subjects withdrawing due to adverse events ranged from 7% to 13% for placebo and 9 – 12% in the treatment group. A single trial tested for differences between treatment and placebo, and none of the comparisons were significant. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 3%, all doses = 5%), 2) dizziness (placebo = 3 – 8%, all doses = 6 – 11%), 3) diarrhea (placebo = 4%, all doses = 4%), 4) agitation (placebo = 7 – 8%, all doses = 4 – 5%), and none reported eating disorder as an adverse event.

Selegiline. Six trials135, 136, 249, 138, 139, 140 evaluated selegiline in 733 patients with AD, PDD, and DAT with 10 mg per day and a study duration of 60 days or 2 years. Only one study reported that patients were from institutional settings.

All but one trial that evaluated general cognition showed no statistically significant changes. A single trial found statistical improvements in specific cognitive tests (Sternberg Memory tests); this trial also showed statistically significant improvements in global assessment and behavior/mood. Only this trial, which had the highest quality score (7), showed consistently positive findings across domains tested. Three of the five trials that evaluated part or all of these domains had very small sample sizes and were likely underpowered, possibly accounting for the inconsistent findings. There is some evidence that selegiline and selegiline combined with vitamin E, increases the time to important functional decline milestones; this is based on a single study.

The quality scores for reporting adverse events varied from 0 to 3. The proportion of subjects withdrawing due to adverse events ranged from 0 – 4% for placebo and 0 – 9% in the treatment group. Only one trial135 tested for differences between the treatment and placebo groups and showed that balance and falls were significantly different (worse) between groups (22% for the group with selegiline combined with vitamin E versus 5% in the placebo). When adjusted for multiple comparisons, these were no longer significant. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 2%, all doses = 0%), 2) dizziness (placebo = 2 – 20%, all doses = 0 – 30%), and 3) agitation (placebo = 4 – 16%, all doses = 4 – 23%); no trial reported diarrhea or eating disorder as an adverse event.

Summary of Other Pharmacological Agents

Cerebrolysin. Six studies evaluated the effect of cerebrolysin in a total of 819 subjects. All but one of the trials171 included only AD patients with mild to moderate disease. All of the studies used the same dose of cerebrolysin, 30 ml per day for 5 days per week for 4 to 24 weeks. Location of recruitment was not specified.

Cerebrolysin showed a statistically significant effect on cognition in four out of five studies. Although, a pooled estimate for the ADAS-cog was calculated, the model was positive for heterogeneity and the overall estimate was not significant. The results for specific cognitive tests for the three trials that evaluated this domain were inconsistent. Global assessment measures showed a significant effect in five of the trials. This model was also positive for heterogeneity but significant for an overall effect. Two out of three studies showed an effect for behavior/mood, and none of the six studies showed an effect on quality of life/ADL. No study measured caregiver burden.

Two of the six trials scored 5 out of 5 on our quality scale for rating adverse events, yet they did not report any adverse events. Two studies scored 4, and the other two trials scored 3 and 2. All the studies with scores equals to 4 or less tested for statistical differences in adverse events between placebo and treatment groups. Withdrawals due to adverse events were not reported in one study170 and were 1% in two studies173, 168 and none withdrew in three studies.169, 172, 173 One study reported significant differences between treatment and control group173 for weight change, anxiety, and headache. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 10 – 24%, all doses = 3 – 21%), 2) dizziness (placebo = 0 – 12, all doses = 1 – 8%), and 3) agitation (placebo = 1%, all doses = 0%), and none reported diarrhea or eating disorder as an adverse event.

Estrogen. Five studies evaluated estrogens for dementia in 247 patients with primarily mild to moderate AD, with the exception of one study178 that included moderate to severe dementia patients who were all institutionalized. One of the studies with AD patients provided 0.10 mg per day174 by skin patch for 8 weeks and the others used 1.25 mg per day for 12 to 52 weeks.177 The study including severe subjects used 2.5 mg per day for 4 weeks.178

Three trials evaluated general cognitive function and all showed non-significant findings; two of these trials lacked sufficient power for the ADAS-cog. Similarly, two trials evaluated specific cognitive function but results were mixed. Most of the outcomes evaluated in the domains of global assessment, behavior/mood, and quality of life/ADL were secondary outcomes and none showed significance; lack of power could be a factor in these trials.

One of the five trials scored 5 out of 5 on our quality scale for rating adverse events, and surprisingly, this same trial did not report any adverse event. Withdrawal rates due to adverse events ranged from 0 – 5% for placebo and 0 –14% for the treatment group. The most frequently reported adverse event was vaginal bleeding and a single trial reported a significant difference between placebo and treatment group for this symptom. It was not clear from the descriptions provided in the study if they had ascertained whether vaginal bleeding was present prior to the trial commencement. Nausea was the single a priori symptom of interest that was reported and by a single trial; frequencies varied from 0% for the placebo group and 4% for the treatment group.

Ginkgo biloba. Three trials evaluated Ginkgo biloba, 120 to 240 mg per day for 3 to 12 months, in a total of 563 subjects with mixed dementias of mild to moderate severity.

The largest trial179 had the longest treatment interval but the lowest daily dosage and reported a significant effect for general cognitive function but had mixed findings for global assessment. A second large trial181 found positive changes for neuropsychological tests, global assessment, and behavior outcomes with double the dosage of the previously described trial and half the treatment interval. In this same RCT, clinical efficacy was assessed by using a responder analysis, with therapy response being defined as response in at least two of the three variables: CGI (global function), SKT (special cognitive function), and NAB (ADL). A single trial evaluated behavior/mood and was not significant. No trial evaluated caregiver burden or quality of life/ADL.

All three trials scored 3 or greater on the quality scale for rating adverse events. Two studies had no withdrawals due to adverse events, and one trial had a withdrawal rate of 6% for both placebo and treatment groups. Two studies did not report any adverse event. One study reported a statistically significant difference between the treatment and the placebo group for skin disorders. The same study reported gastrointestinal and headache adverse effects, but did not test for statistical differences between the placebo and the treatment group. None of the trials reported the presence of the a priori symptoms of interest.

Idebenone. Four studies185, 183, 182, 184 evaluated the drug idebenone in 1153 subjects of mixed dementia populations of mild to moderate severity; one of these trials26 evaluated idebenone relative to tacrine. Doses varied from 30 mg per day to 360 mg per day, and the treatment interval ranged from 90 days to 60 weeks.

There was evidence of benefit for general cognitive function and global assessment. Several studies evaluated behavior/mood and quality of life/ADL and these outcomes were found to be significantly different. None of the trials evaluated caregiver burden.

Quality scores for reporting adverse events varied from 1 to 5. Rates of withdrawal due to adverse events varied from 0 – 5% for the placebo group and 0 – 5% in the treatment group; a single trial183 did not report withdrawal rates. Two trials183, 185 tested for statistical differences between groups and found no differences. Although no clear pattern emerges, three studies identified at least one balance-related adverse event most consistently reported across studies. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 2%, all doses = 2 – 11%), 2) dizziness (placebo = not reported, all doses = 2%), and 3) not reported for diarrhea, agitation, or eating disorder as an adverse event.

Oxiracetam. Five studies186, 187, 188, 189, 190 evaluated oxiracetam in 554 subjects with different dementia syndromes of mild to moderate severity. All analyses were observed cases and not ITT. All studies used 1600 mg daily, with one exception189 where the dose ranged between 1600 – 2400 mg per day. The treatment interval ranged from 90 days to 26 weeks.

All outcomes shown to be positive for this drug were based on observed case evaluation. The two trials that evaluate general cognitive function showed benefit. The findings for specific cognitive function were mixed. A single trial evaluated global assessment and showed statistically significant change. Behavior/mood, and quality of life/ADL outcomes showed mixed results. No study evaluated caregiver burden.

The quality scores for reporting adverse events varied from 2 to 5. The proportion of withdrawals due to adverse events varied form 0 – 9% for the placebo group and 0 – 6% for the treatment group. No clear pattern for adverse events is evident, but three of the five studies reported gastrointestinal related problems, primarily associated with abdominal pain. Although, only single trials evaluated the range of frequencies of the a priori symptoms of interest are as follows: 1) dizziness (placebo = not reported, all doses = 11%), and 2) agitation (placebo = 1%, all doses = not reported); no trial reported nausea, eating disorder, or diarrhea as an adverse event.

Pentoxifylline. Three placebo-controlled studies193, 192, 191 evaluated pentoxifylline and one study194 compared pentoxifylline to sulodexide, with a total of 482 subjects with predominately MID. The total dose administered in all studies was 1200 mg per day but varied from 400 mg three times per day to 1200 mg once per day. The treatment intervals ranged from 12 to 36 weeks.

All three placebo trials showed non-significant findings for any primary outcome evaluated on all subjects in the study. It should be noted that two of these trials had very small sample sizes (n = 38, n =28) that were evaluated in the OC analyses; this suggests that the trials lacked sufficient power to evaluate multiple outcomes. The remaining trial had a large sample size (n = 289) and employed an ITT analysis; all primary outcomes evaluated were not significant.

The quality scores for reporting adverse events were generally low, varying from 1 to 3. Withdrawal rates due to adverse events varied from 0 – 25% in the placebo group and 0 – 22% in the treatment group. The two studies that reported adverse events indicated the presence of gastrointestinal disturbances, including abdominal pain or nausea and vomiting (placebo = 7% and all doses = 14%). None of the trials reported dizziness, agitation, eating disorder or diarrhea.

Propentofylline. Four trials197, 196, 250, 198 using propentofylline in 510 patients with AD and VaD were included. A dose of 900 mg per day was consistent across all studies, and the treatment duration ranged from 3 to 12 months.

The two studies with small sample sizes (n = 30) showed no significant results for any outcome evaluated but lack of power cannot be ruled out. There were two trials that found benefit in general cognitive function based on the MMSE. The results for specific cognitive function as measured by the DSST were mixed, as were those for global assessment. Behavior/ mood outcomes were evaluated by a single trial195 and shown to be significant; this same trial evaluated quality of life/ADL and showed no significant difference. No trial evaluated caregiver burden.

The quality scores for reporting adverse events varied from 1 to 4. The percentage of withdrawals varied from 0 – 13% for the placebo group and 0 – 12% for the treatment group. None of the trials tested for differences between groups. Three of the trials195, 197, 198 reported gastrointestinal events that included abdominal pain, constipation, and nausea and vomiting (placebo = 2%, all doses = 7%). Dizziness (placebo = 3 – 5%, all doses = 1 – 6%) was the only other a priori symptom of interest.

Methodological Issues and Limitations in Assessing Efficacy of Dementia Agents

Definition of clinically significant or meaningful difference. The stance undertaken in this review has been cautious with regards to interpreting “clinically significant” differences within and across studies. This systematic review has highlighted some of the concerns expressed in the literature on pharmacological efficacy research in dementia. Ultimately, clinical significance is a complex issue, and its definition can vary across individuals and groups of individuals. Wherever possible, attempts were made to identify the magnitude of differences in the studies and the limitations of the data from some of these primary studies.

In drug development programs, an ordered series of trials are undertaken: dose tolerance (phase I), dose finding (phase II), dose efficacy (Phase III), and post-marketing (phase IV). However, due to the pressures on pharmaceutical companies to develop drugs quickly and cost-efficiently, a drug may move into the next phase of development before evidence of the previous phase is known.251 Even when phase III trials are carried out in an adequate manner, the interpretation of the efficacy results is hampered by multiple p-values, disagreement over the need for multiplicity corrections, and the potential for conflicting evidence from trials of different sizes.251 Some of these difficulties can be minimized by measuring a single primary efficacy variable at one point in time and using a p-value of less than 0.025 (one-tailed, as the aim is for the statistical test to determine if the drug performs better than the placebo or low dose).252 This presumes that good dose-response data exist, identifying a single dose level as the best candidate for further evaluation. Lastly, interpreting differences on the basis of statistical significance has long been recognized as problematic. Clinically meaningful change reflects a different level of “significance” and often requires consensus among experts within the field for these criteria.

Issues of diagnosis and severity. Three methodological issues related to population classifications have limited the inferences that can be garnered from this systematic review. The first issue concerns the classification models used for diagnosing dementia; they are not interchangeable among the various types of dementia and the “pre-clinical” forms of slight cognitive impairment. Moreover, there are still concerns about the accuracy of these criteria. For example, in the American Academy of Neurology's (AAN) recent evidence-based review of dementia case definitions, none met the AAN's highest evidence standard. A clinical diagnosis of AD is only 28% specific after age 79 years. Similarly, no dementia screening measure is accurate enough to be recommended by the American Society of Internal Medicine.253 The AAN specifically faulted the emphasis on memory function in dementia case definitions.254 Yet tests like the ADAS-cog emphasize memory loss at the expense of other cognitive domains, especially executive control function,31 and many anti-dementia treatment strategies target neurotransmitters and structures (like acetylcholine and the hippocampus), which mediate memory test performance.

A second consideration in defining populations of dementia patients concerns determination of severity level. The MMSE, although frequently used, may not best capture severity. Many studies were observed to define the severity (mild, moderate, severe) of dementia populations based on the MMSE. For example, a range from 10 to 26 has been used to define a mild to moderate severity level.50 Given that the maximum and minimum instrument scores are 0 and 30, this suggests that the extreme ends of the spectrum, particularly the “severe” end (i.e. <10), represent a very narrow proportion of patients. These two broad categories (mild to moderate and severe) may not actually reflect the cognitive and functional differences in a clinically meaningful manner. The MMSE does not address issues of executive control function (as required by the DSM-IV dementia case definition), which is known to be a good predictor of functional status. From a research perspective, a better classification reflecting disease severity may be an important factor for stratification and determining the efficacy of pharmacological interventions.

Outcome issues. The studies evaluated in our review used 181 different outcomes across seven domains. This raises the issue of which of these outcomes are considered by clinicians to be most “clinically relevant”. Let us assume that the most clinically relevant outcomes for all the drug interventions for dementia are the ADAS-cog and the MMSE because they are very commonly reported in studies.

In this dementia review, numerous studies did not measure outcomes evaluating cognition, as the intended effect of the drug was not always in the domain of cognition (e.g. neuroleptics for behavior control). Moreover, a large number of the studies that used important clinical cognition outcomes, such as the MMSE, did so only to establish baseline severity, or they used it as a secondary outcome. This presents us with some difficulty in the consistency of reporting on this limited set of “clinically relevant” outcomes. There is also the issue of which domain (i.e. cognitive function versus ADL versus behavior) is the most clinically relevant. The FDA guidelines suggest cognition and global assessment; the EMEA guidelines suggest the addition of an ADL or quality of life/ADL measure as being most clinically relevant. Thus, some consensus work needs to be done among experts in the field to determine the most clinically relevant outcomes and domains. For example, the choice of most clinically relevant outcome may depend upon type and stage of dementia (e.g. for mild AD, neuropsychological outcomes may be are the most important domain while for severe AD, behavior may be the most relevant outcome), which may challenge the achievement of consensus.

To our knowledge, no specific set of outcomes that define “clinical relevance” applies to all the drug interventions we evaluated. The FDA has recommended that “dual efficacy” of dementia drug interventions be established by significant change in both a psychological measure and a global change measure. The outcomes measuring these attributes within these two domains were not specified. However, there was a general trend for using the outcomes ADAS-cog and CIBIC+ to capture these two attributes when evaluating drugs for AD populations.

Ideally, all outcomes should have demonstrated acceptable psychometric properties, such as reliability, validity (construct), and responsiveness. We did not a priori evaluate the properties of outcomes reported in the eligible studies. In some cases, these outcomes were developed in non-English languages but the original study was reported in English. In considering the psychometric properties of some of the outcome instruments used, the attribute of responsiveness is critical, and some have suggested that this has not been adequately evaluated in many outcome measures.33, 30, 255

We might envision a clinically relevant pharmacological treatment as one that has made a real difference, where the change is both relevant and important to the patient or to clinicians. This fundamentally shows the difference between clinically significant (relevant and important) versus statistically significant (associated with probabilities), where the latter determines that the results are not due to chance. Moreover, a clinically important change will vary depending on whether importance is defined from the patient or clinician perspective.

Five different levels of responsiveness (ability to detect change) of outcome measures have been defined:256 1) Minimal change potentially detectable (essentially an attribute of the scoring method of the outcome), 2) Minimal change actually detectable beyond measurement error of the instrument (also defined as Minimum Detectable Change (MDI) or Reliability Change Index (RCI), which includes the Standard Error of the Measurement (SEM)), 3) Observed change (often reported as the standardized response mean (SRM) or effect size (ES). 4) Observed change in those estimated to have improved; the key to understanding change in this instance is that an external standard is used to determine whom has improved (often reported as comparison between groups that have improved versus those who have not; the improved group can be defined by either patient and/or clinician or a combination), and 5) Observed change in those estimated to have important improvement (often reported as the minimal clinically important difference and can be determined by the patient or clinician, or a combination of both).

Consider the ADAS-cog and the CIBIC+: The minimal change detectable is 1/70 = 0.0143 for the ADAS-cog and 1/7 = 0.143 for the CIBIC+, suggesting that the ADAS-cog can detect smaller increments of change relative to the CIBIC+. Thus different instruments have differing sensitivities to detecting change. There is scant literature on the responsiveness of outcome measures as defined in number 4 above, observed change in those that have improved, or as in number 5 above, observed change in those estimated to have important improvement. Thus, we have identified a significant gap in the literature with regard to estimating clinically important changes. Much greater consideration of issues of responsiveness should be given in future research in efficacy trials of pharmacological agents. Greater understanding of clinically important change suggests that some of our current judgments of efficacy are limited as these important differences need to be established.

Analysis issues. The inability to estimate the power of a study to detect a difference presented significant limitations in interpreting those studies that showed no significant differences. Similarly, the lack of sufficient data for estimating effect size limited the ability to show the magnitude of the change. It is recommended that future trials evaluating the efficacy of pharmacological agents adhere to the CONSORT guidelines in order to provide sufficient data to estimate power and effect size for all relevant outcomes.

Although the difficulty of maintaining adherence to long-term drug interventions among dementia patients is acknowledged, the ITT analysis should continue to be the analysis of choice in trials. Ideally, both ITT and OC analyses should be presented. If both suggested the same conclusion, confidence in the study results would be increased.

Problems with funding/ sponsorship exclusively from drug companies. The sponsorship of studies by for-profit organizations has led to bias towards the publishing of positive results.257 These findings suggest that there are powerful disincentives for pharmaceutical companies to publish negative trials. This is contrary to what academic-based, non-industry funded trials show, where the publication of negative trials are more likely.

A recent evaluation of FDA databases for antidepressant drugs258 in the US, suggested that less than half of antidepressant trials were negative, which does not correspond to the published literature. In this systematic review, no attempts were made to contact industry for unpublished trials, which introduces the possibility of a bias associated with not reporting negative trials. Additionally, we did not contact authors who did not specify funding sources for their studies. Future research on the efficacy of pharmacological agents to treat dementia should indicate all sources of funding and who undertook the study analyses.

Adverse events. In this systematic review, the type and frequency of adverse events associated with the use of a drug intervention were scrutinized and reported to a greater extent than previous reviews of anti-dementia drugs. Attempts were made to weigh the potential for harm against the benefits when determining the efficacy of pharmacological interventions. Empirical evidence across diverse medical fields indicates that reporting of safety information (including milder adverse events) receives much less attention than the positive efficacy outcomes.35 Thus, it was recognized that an evaluation of the benefits of anti-dementia pharmacological agents alone may present a biased view of the efficacy of the intervention.

The ability to capture and evaluate adverse events proved to be difficult for several reasons. For example, although metrifonate had good evidence of positive effects on cognitive function, it was banned from use due to the risk of respiratory paralysis. The description of serious adverse events in the trials we evaluated did not capture this type of event, nor did different studies identify “serious events” in a consistent manner. This points to several fundamental limitations. The first of these relates to the limitation associated with the RCT design itself, which is less likely than the longitudinal cohort study designs to capture serious adverse events that are rare. Secondly, many trials were of relatively short duration and captured “idealized” dementia populations. Many of these trials were from pre-marketing studies contracted by pharmaceutical companies in carefully controlled research settings. Dementia patients seen in practice may have more complex medical illnesses and are at greater risk for potential side effects. In addition, drugs used in “polypharmacy” have even greater potential for pharmacological interactions. Furthermore, practitioners may prescribe these pharmacological agents for wider indications than originally intended, or may not refrain from withholding the drug from certain high-risk subgroups, leading to increased risk of adverse events. Thus, published rates of adverse events in well-controlled trials may underestimate true rates seen in practice.

Thirdly, by their nature, some adverse events are not easily anticipated, and therefore are not screened for in some trials. Adverse events may be hard to predict or anticipate but can be captured only if a trial protocol was designed to measure these events. This problem is compounded by the lack of consistency in what constitutes “serious” events or how the severity of the typical events is rated. A limited number of standardized instruments exist to capture these events reliably, but the overwhelming majority of studies in this systematic review did not use these instruments. Furthermore, capturing information from individuals with cognitive decline can create problems; the validity of the self-report instrument, even if completed by the caregiver, can be problematic. More research on the reliable collection of adverse events in dementia populations (with compromised cognition) may be required.

A fourth consideration concerns the issue of off-label use of pharmacological agents. Given that only four drugs are currently approved by the FDA for the treatment of dementia, the other 97 interventions evaluated in this review are classified as “off label use” but many are not approved by the FDA and not, therefore, available. For some of these off-label medications the potential mechanism of action on the disease process has not been fully established (if even considered), yet they have been applied to dementia populations. This off-label use of pharmacological agents may present further difficulties in evaluating adverse events.

Question 2: Does pharmacotherapy delay cognitive deterioration or delay disease onset of dementia syndromes?

Summary of Systematic Review Results

Few studies evaluated delay of onset or delay in disease progression. A definite gap for evaluating disease onset (as defined by the selection of populations at risk such as MCI populations) has been identified in this review.

Conversely, the need for good evaluation of disease progression in trials was also identified. In general, few studies evaluated subjects in more severe state of the disease. This suggests that a bias exists towards evaluating mild to moderate disease in the trials eligible in this systematic review. This in turn reflects the underlying assumption that the less severe groups are most likely to benefit from drug trials. Since so few studies have evaluated the more severe groups, this assumption may require some empirical justification in future research. Those studies that evaluated severe patients showed some potential for benefit. Future research in this area may require some consensus regarding the classification of severity levels.

Three studies evaluating cerebrolysin,168 selegiline and vitamin E,135 and donepezil61 have shown significant effects in delaying disease progress in mild to moderate61, 168 and moderately severe disease in patients with AD. This delay in progress was expressed in terms of delay in days to primary event135, 61 or statistical differences between placebo at a specified time interval.168 Although these two trials coincidentally evaluated dementia patients over the longest time interval, it did not withdraw the drug at the end of the study. Theoretically, conclusive evidence of disease delay would be demonstrated if the treatment groups did not return to the level of the placebo. Thus, distinguishing between symptomatic and disease modifying effects is not possible unless the drug is withdrawn and the treatment group(s) are observed for these changes.

When studies attempted to evaluate disease progression, long-term (1 year or greater) trials continued in an “open-label fashion”, where blinding was no longer maintained. This limits the confidence that bias did not affect the subsequent changes in the outcomes. It was observed that increasing levels of dropout (for a variety of reasons) also plagued these open-label phases of evaluation. From a practical perspective, maintaining adherence in longer-term trials in dementia patients are challenging,19 particularly for those in the placebo arm or for those interventions that have a high proportion of adverse events.

A number of trial designs have been proposed to capture delay in disease progression versus symptomatic treatment. Some of these trial designs include withdrawal of treatment, active-extension, randomized withdrawal, randomized start, and staggered start designs.235, 236, 19, 21 One important aspect of these designs is the selection of an adequate washout period or an adequate follow-up period. In addition, longer evaluation with survival analyses may be a good strategy to evaluate delay in disease progress for some drugs. One advantage of this design is the selection of clinically relevant milestones (functional changes over time), which was utilized in two studies 63, 136; the selection of such events may merit greater consideration in future trials evaluating delay. A more critical analysis of the staggered/start/stagger withdrawal design in comparison to the survival analysis would be helpful. Also, one could provide a more extensive analysis of the data on propentofylline and vitamin E,136 which represent the most extensive efforts to use the stagger/start/stagger withdrawal and survivor analysis approaches, respectively. Future research seeking to establish efficacy should clearly specify if symptomatic treatment or delay in progression is the therapeutic aim. This is important for determining specifically if efficacy is considered with respect to these two aims. Accordingly, a design that can establish this aim should be selected.

Methodological Issues

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-phartrdemf31.jpg.

   Figure 31. Delay of symptomatic treatment effects

Determining symptomatic treatment versus affecting delay in disease progress. Figure 31 depicts hypothetical responses of dementia patients to two similar pharmacological interventions relative to placebo. In this example, the placebo group changes over time were modeled according to the natural history of AD as described by Stern et al. (1994)29; the progressive decline of the AD subjects may not be representative of all dementia types. For simplicity's sake, the decline is assumed to be linear, although the literature has suggested the rate of decline varies between the different types of dementia and within each of these groups as a function of the disease severity.21, 259 The two drugs depicted in Figure 31 are similar in that they have the identical titration (approximately 8 weeks) and washout periods. In this hypothetical scenario, the drugs are both withdrawn at 6 months (DW) and the washout periods have ended at 8 months. Within the active treatment period (first 6 months), the response to Drug I depicts the maintenance or stabilization of cognition function relative to the placebo, whereas the response to Drug II suggests improvement (or restoration) of cognition for a short period. However, the rapid decline of cognition scores within the two treatment groups to the level of placebo at 8 months (end of the washout period (EW)) suggests that the treatment effect was symptomatic relief. Upon withdrawal for subjects exposed to either Drug I (maintenance or stabilization) or Drug II (improvement relative to baseline and placebo), the cognition scores declined to the same rate of placebo, and thus no delays in disease progression were demonstrated.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-phartrdemf32.jpg.

   Figure 32. Dealy in disease progression treatment effects

In contrast, Figure 32 shows a delayed rate of decline relative to placebo after the withdrawal of the pharmacological interventions. The response depicted for the treatment group exposed to Drug I shows that cognitive function is maintained until the drug is withdrawn (DW) and then the rate of decline is slower relative to the placebo (different slope of change) following the washout period. The response of the treatment group exposed to Drug II would suggest that cognition is improved for an interval (relative to baseline and placebo); when the drug is withdrawn, the rate of decline in cognitive function approximates that of the placebo group but is offset by approximately 6 months. Comparison of the slopes of the decline of cognition (Figure 32) would indicate a greater rate of decline for Drug II relative to Drug I, but both exemplify delay in progression of the disease effects. Theoretically, the treatment group rates of decline will never meet the decline rate of the placebo group when true disease modification has been effected by the pharmacological agent.

In these two Figures (31, 32), the responses depicted have been idealized to clearly show the differences between symptomatic treatments versus disease modifying treatments. Additionally, two examples of the rate of decline as a characteristic of delay in disease progress have been explicated. However, in practical terms the most effective time interval for bringing about meaningful change in cognition (or other important outcomes), and the best time period to observe whether or not the effect is maintained (or lost), is not known. The difficulty in estimating these ideal time intervals is further compounded when the uncertainty of the rate of cognitive change (or decline) is considered.21 It is likely that treatment effects may not be equal across all stages of the disease (mild, moderate, severe) or between the various types of dementia diagnoses.

The evidence provided in dementia trials to demonstrate the three broad therapeutic aims of pharmacological interventions has been expressed in a variety of comparisons. Ideally the changes due to the pharmacological intervention would be expressed in terms of differences between the treatment and placebo groups. Surprisingly, many trials have reported statistical significance between baseline and endpoint of the treatment group(s) as evidence of a therapeutic effect. Change has been described as “improvement” relative to the baseline for either of the treatment or control groups. Although, it is unlikely that AD subjects would ever improve spontaneously relative to baseline, it may be possible in some dementias. Additionally, the magnitude of the “improvement” is dependent on the time interval for which the differences were estimated. Consider Drug II in Figure 31 at the 4- and 6-month intervals; clearly, the magnitude of the difference is greatest at 4 months. Similarly, the evidence for “stabilization” or estimates for “delay in progression” was dependent on the interval used for evaluation.

Question 3: Are certain drugs, including alternative medicines (including non-pharmaceutical) more effective than others?

Summary of Systematic Review Results

What may be most relevant to clinicians are head to head comparisons of the cholinergic modifying neurotransmitter pharmacological agents, particularly those currently approved for the treatment of dementia (tacrine, rivastigmine, galantamine, donepezil) in the United States. The evidence for each of these drugs has been extensively detailed, and the relative merits and handicaps of each were outlined in chapter 3. Relative effectiveness as demonstrated by effect sizes for the ADAS-cog and the CIBIC were also shown in chapter 3. Although, the psychometric properties of these two outcomes are well accepted, comparison across the populations in these pooled estimates may not lend themselves to direct comparison across these four different specific drugs. Thus, inferences about the relative effectiveness of these four medications specific for the treatment of dementia should be made cautiously as head to head comparisons were not undertaken.

Relative efficacy must be evaluated in direct comparison trials

From a methodological perspective, addressing the question of being “more effective” requires head to head comparisons of pharmacological interventions.

An evaluation of the trials that undertook direct head to head comparison of two distinct pharmacological agents was limited because only seven trials were identified. Although, these trials may have shown some relative benefit of one drug versus another, the clinical relevance of these particular agents is limited as none of the drugs currently approved by the FDA specifically for the treatment of dementia is represented in these eligible studies. Moreover, these studies are essentially limited to single trials and are not sufficiently strong to base recommendations on the relative effectiveness of drugs. Head to head comparison studies are beginning to appear in abstract form only and a significant gap in the literature has been identified.

Question 4: Do certain patient populations benefit more from pharmacotherapy than others?

Summary of Systematic Review Results

In general, very few trials examined the efficacy of dementia drugs across different populations or population characteristics. From the 13 studies that reported stratified analyses, eight different variables were identified, which included age, gender, APOE genotype, disease type, disease severity (as determined by MMSE/ ADAS-cog threshold levels), treatment center, care dependence, and presence of depression. Additionally, three trials were identified that evaluated efficacy in 1) patients with Down's syndrome and dementia, 2) different ethnicities as a function of treatment center in a multicenter trial, and 3) depressed patients. Given the relatively small number of trials evaluating these variables within different populations and different pharmacological interventions, the findings of this review are limited with respect to these patient variables. These reflect merely what has been reported in the literature rather than variables of importance with respect to efficacy of pharmacological therapies. A significant gap in the literature has been identified.

Representativeness of populations in the drug trials

The study population characteristics were detailed for the trials evaluated. A recent study,22 suggests that many “real world” dementia patients in Ontario would not have met the eligibility criteria for participation in several of the cholinesterase inhibitor studies. This study highlights an important limitation of the pharmacological literature in that dementia patients recruited are not representative of the general dementia population. Additionally, clinicians and researchers should note that when a when a drug is approved for use, it is for a specific indication and a specific patient population. Evidence for one type of patient population may not necessarily be applied to another population. This is critical information to have when establishing clinical practice guidelines.

Question 5: What is the evidence-base for the treatment of vascular dementia?

Summary of Systematic Review Results

A total of 20 pharmacological interventions in 29 studies211, 220, 238, 171, 200, 199, 146, 68, 181, 184, 133, 134, 132, 161, 89, 91, 93, 247, 187, 191, 192, 194, 193, 100, 98, 196, 195, 245, 217 were applied specifically to VaD classified dementias. The majority of these pharmacological interventions (n = 14) were represented by single trials, these interventions included ateroid, buflomedil, cerebrolysin, sulphomucopolysaccharides (CDP choline), citalopram, donepezil, Ginkgo biloba, idebenone, minaprine, nimodipine, oxiracetam, 5-THF (trazodone), vincamine, and xantinolnicotinate. Six interventions had more than a single trial, and these included Choto-san (n = 2), memantine (n = 3), nicergoline(n = 2), pentoxifylline (n = 4), posatirelin (n = 2), and propentofylline (n = 2). In general, when the drug interventions were shown to be effective, it was in the domains of cognitive function (both general and specific) and global assessment. Other domains were less frequently evaluated. Several trials attempted to test for differences between VaD groups and other dementia types.

Diagnosis Classification of VaD

Erkinjuntti et al (1997)1 compared six commonly used classification schemes (DSM-III, DSM-III-R, DSM-IV, ICD-9, ICD-10 and the CAMDEX) and demonstrated that the prevalence of dementia can differ by a factor of 10 depending on the diagnostic criteria used. Two other studies have demonstrated that the prevalence of VaD varies with the classification system; therefore these criteria for diagnosis are not interchangeable.10, 11

There is controversy about the validity of the clinical classification of VaD, as autopsy confirmation often does not substantiate the clinical diagnosis.12, 13 The majority of dementias were actually AD with co-existing VaD and PDD lesions.14 In contrast, the clinical accuracy of AD diagnosis is relatively high.7 Future research in vascular dementia should attempt to better distinguish this subgroup.

Determining Clinical Relevance

With rare exceptions, dementias are inevitably progressive and eventually lead to severe cognitive deficits, functional impairment, and often behavioral problems, unless death supervenes from intercurrent disease. The trajectories, sequence of clinical features, and burden on caregivers vary depending upon the type of dementia. For example, cognitive decline typically precedes functional impairment and behavioral disturbances in AD, while behavior and/or language problems typically announce the onset of frontotemporal degeneration.

Physicians and other health care practitioners have numerous roles in the management of individuals with dementia. These include identification, assessment and staging, classification, and prognostication, in addition to treatment of the individual and caregiver and planning for future disabilities (e.g. arranging alternatives to driving, assigning power of attorney and compiling living wills/advance directives).

Given these multiple tasks, how is the treating physician to interpret the results of therapeutic trials, which mostly deal with the pharmacological treatment of individuals with predominantly one type of dementia (AD) in the mild to moderate stages?

The traditional view of most physicians is that treatment success is measured by reversal of a disease, which is not a realistic goal in dementia. (While the older literature suggested that as many as 15 to 30% of dementias were “reversible,” more recent studies indicate that at most a few percent of dementias presenting to physicians are potentially reversible.)

Thus, the treating practitioner must begin by setting a realistic goal for therapeutic intervention. Symptom relief, alleviation of caregiver burden, prevention of complications (such as injury prevention or avoidance of aspiration pneumonia), and delay in progression of disease might be potential treatment targets. From this list, only symptom relief and delay in progression could be inferred from the studies examined in this systematic evidence review.

Outside the specialty clinic or clinical trial setting, most physicians have limited time and resources to expend on their patients with dementia. Few will have access to psychometrists or other individuals capable of administering extensive assessment instruments such as those used in clinical trials (e.g. ADAS-cog). Thus the typical practitioner must be able to complete a brief assessment, which provides sufficient information to determine whether a treatment is 1) indicated and 2) effective.

Deciding if a treatment is indicated depends upon the correct diagnosis (does this person have a dementia, and if so what type?), potential contraindications to the treatment (e.g. active peptic ulcer or heart block in the case of cholinesterase inhibitors), and the severity of disease. Determination of severity of dementia has given rise to several global scores such as the Global Deterioration Scale (GDS)260 and the Clinical Dementia Rating (CDR)261 In practice, the Mini-Mental State Examination (MMSE)50 (a short, 30-item, cognitive screening test) is frequently used as a measure of severity. Not only is it part of the usual diagnostic protocol for suspected dementia, but it also has the advantage of being included in the entry criteria of many of the RCTs of anti-dementia medications. It is therefore useful for determining whether a patient fulfils the appropriate severity criterion for therapeutic intervention.

With regard to deciding whether a treatment is effective, much has been written about the relative importance of statistically significant and clinically significant changes in measures of cognition, function, and behavior in dementia. A distinction must be drawn between clinically detectable change and clinically meaningful change. While psychometric measures (standardized instruments, which are highly reliable and relatively free from the influence of judgment) may detect changes too small to be appreciated by the clinician, clinometric tools (measures that are based on a clinical judgment about an individual patient262) may be considered more relevant to practice. Results expressed as a change from baseline measured by clinometric instruments such as the Clinicians Interview Based Impression of Change (CIBIC) or its derivative the CIBIC plus, which incorporates observations of the caregiver, mimic clinical practice more closely than most psychometric tools. The CIBIC aims to cover multiple domains relevant to the clinician (i.e. cognitive, functional, and behavioral). Clinicians may therefore interpret statistically significant changes on the CIBIC or similar scales with more confidence than changes on the many psychometric scales used in therapeutic trials. However, if an effect size of ~0.5 or greater is included in the analysis of psychometric outcomes, one can be reasonably confident of a robust response to the treatment under investigation.

Another measure of efficacy is the response rate—the percentage of study participants who experience an improvement (defined as a change of a specific magnitude on one or more scales.) This figure is useful for the clinician who may then indicate to the individual with dementia the chances of a positive outcome from the planned treatment.

Clinicians are faced with a bewildering array of results from clinical trials. Convergence of results (different studies of the same medication showing similar results) or studies of drugs in the same class showing similar results may help to reassure clinicians that the results are genuine. Conversely, when trials show differing results, clinicians should be especially vigilant in accepting only the results of the more positive trials.

As always, the translation of clinical trial evidence into practice demands careful scrutiny by the practitioner. Attention to external validity (is my patient sufficiently similar to those in the clinical trial that I can expect the same result from treatment?), interpretation of the outcome measures (clinically as well as statistically significant benefit), and weighing potential risks against potential benefits remain the responsibility of the treating practitioner.

Limitations of the McMaster AHRQ Review

A systematic review that has evaluated 91 pharmacological interventions in 186 RCTs with high internal validity has several limitations. The studies selected for this review are English-language trials. Based on our search results, we estimate that we could have potentially retrieved 1385 foreign-language articles (after de-duplication 1213) distributed among databases as follows: 346 from Cochrane Central, 444 from EMBASE, 559 from MEDLINE/PreMedline® 36 from other databases before review for title and abstract. If we assumed the same rate of potentially eligible studies for these non-English studies, an additional 16 non-English studies may have been eligible for review. It is possible that agents, such as Ginkgo biloba, may have had important trials published in non-English languages. The budget and timelines available, however, were a limiting factor to obtaining, translating, and abstracting non-English trials.

Secondly, no contact with authors of the eligible trials was undertaken to collect additional unpublished studies or provide results/data that were not presented in the published article. Although contact with the original authors of the trials (to supplement the missing information from the included studies) could have compensated for many of the reporting challenges we encountered, this strategy was not feasible given the timeline of this systematic review. Our experience at the McMaster EPC suggests that the majority of authors do not respond in a timely fashion if at all. Additionally, efforts were not made to contact industry for unpublished trials. It is likely that industry sponsors of trials that are not published in the public domain are under no obligation to share trials (particularly negative trials). Not contacting authors of eligible trials for additional data and not attempting to locate unpublished trials (either by other authors/ experts or by industry) may introduce publication bias in this systematic review.

Thirdly, we employed two eligibility criteria that may account for some differences in acceptance of well-known studies. The first of these was a minimum threshold for quality score as determined by the modified Jadad scale. Despite the fact that this scale has excellent reliability and content validity, some may argue that the threshold score of 3 is arbitrary and may have unnecessarily eliminated studies of historical importance. It is our view that given the amount of literature available, all efforts should be aimed at selecting only the trials with the highest internal validity rather than selecting the largest number of eligible trials.

The second eligibility criteria concerned the exclusion of crossover trials. Although crossover trials are suitable for chronic diseases, they may be prone to period effects or period-by-treatment interactions. Period effects are systematic changes in the outcome that apply to all patients due to temporal changes in the disease or to the measurement instrument. Period-by-treatment interactions occur when the efficacy of the intervention varies by period. This is a significant concern for studies that attempt to show disease modification and are carried out over a longer period of time. Additionally, a carry-over effect may occur if the washout period is not adequate. In addition to the weaknesses of this design, some limitations arise when considering the potential for meta-analytic analyses. Traditionally, first period data from a crossover trial are abstracted and can be potentially combined with parallel trials for analyses of a pooled estimate; the reporting of the study results (positive or negative) would also be based on this first period data. In a preliminary phase of the review, several crossover trials were examined. It was noted that many did not report first period data, which precludes any potential for combining with parallel trials; many trials also did not undertake statistical tests during the first experience, thus making it difficult to report the direction of the findings, even if the trial could be combined. Finally, the TEP considered the fact that this systematic review was evaluating a variety of drug interventions administered over differing time intervals, and so period effects might be an important source of bias. For all these reasons, the TEP made the decision to exclude crossover trials from this systematic review. Thus, this review is limited to evidence based on high-quality parallel trials only.

A final limitation to our study was the use of a checklist developed to address the issue of quality of reporting adverse events. The Jadad scale was not designed to evaluate the quality of reporting adverse events. Thus, when determining the “harms” or risks associated with an intervention, the quality or “internal validity” of collecting and reporting these adverse events needed to be evaluated. Although our checklist has face validity, it has not undergone formal psychometric testing.

Future Research Recommendations

The findings of this report suggest several important areas for future research on pharmacological treatments for dementia. These include:

Analytic Framework of the intended aim of the therapy on the disease

  • Better conceptualization and research design to capture “delay in progression”.

  • Clearer consensus on defining efficacy (benefits and clinically important change).

  • Longer term studies (> 12 months).

Potential for bias

  • Clarification of the role of industry sponsorship; one recommendation should be that all studies are required to disclose such information in future, including who analyzed the results.

  • More concerted effort to incorporate unpublished studies and negative trials in future reviews.

Population

  • Inclusion of the spectrum of severity in the patient populations (there is nothing to suggest that severe patients may not benefit from pharmacotherapy aimed at cognitive function improvement).

  • The need for validation of trials and testing processes within cultures other than the traditional white population.

  • Examining the efficacy of interventions in different sub-populations (age, disease severity levels, etc.).

  • Better measurement and reporting of important patient characteristics (including baseline cognition scores, co-morbid conditions, the use of other medications, etc.).

  • Inclusion of MCI type groups of subjects to evaluate “delay of onset”.

Outcomes

  • Expansion of outcomes collected to include more than just cognitive function, and especially include caregiver burden and quality of life/ADL.

  • Clear operational definitions for determining critical outcomes (delay to onset, delay to progression, important effect size, etc).

  • Better understanding of how outcomes perform cross-culturally.

  • Production of other diagnostic instruments to detect both onset and responses to therapies across varied cultural groups.

  • Improvement in the reporting of adverse events to evaluate harm.

Analysis

  • Appropriate analytical strategies that take into account intention to treat (ITT)/ last observation carried forward (LOCF) analyses; where possible both observed case and ITT/LOCF analyses should be presented.

  • Sufficient data to estimate effect size, taking into account variability in both treated and control populations on the primary measures.

  • Reporting the power of the study when findings are non-significant.

Intervention

  • Undertake more studies with direct comparison of drugs to determine the relative efficacy of agents.

  • Improved description of the titration process.

  • Improved collection of adverse events undertaken in a systematic fashion with standardized instruments.

Acronyms and Abbreviations

AAMIAge-Associated Memory Impairment
ABIDAgitated Behavior Inventory for Dementia
ABSAdaptive Behavior Scale
ACESAgitation-Calmness Evaluation Scale
ACFPAmerican College of Family Physicians
AChEAcetycholinesterase
ACP-ASIMAmerican College of Physicians - American Society of Internal Medicine
ACPTAuditory Continuous Performance Test
ACTHAdrenocorticotropic homone
ADAlzheimer's Disease
ADASAlzheimer's Disease Assessment Scale
ADAS-11; ADAS-13Alzheimer's Disease Assessment Scale (11 and 13 items)
ADAS-CogAlzheimer's Disease Assessment Scale-Cognitive and Non-Cognitive Sections
ADAS-NonCog
ADCS-ADLAlzheimer's Disease Cooperative Study - Activities of Daily Living
ADCS-CGICAlzheimer's Disease Cooperative Study - Clinical Global Impression of Change
ADFACSAlzheimer's Disease Functional Assessment and Change Scale
ADL; ADLCActivities of Daily Living (Checklist)
ADL-BDRSActivities of Daily Living-Blessed Dementia Rating Scale
ADL-PDSActivities of Daily Living- Progressive Deterioration Scale
ADSAlzheimer's Deficit Scale
ADSSAlzheimer's Disease Symptomatology Scale
AFBSAversive Feeding Behavior Scale
AGGRAggressiveness subscale of the Personality Psychopathology Five (PSY-5) Scales
AGS-EAssessment of Global Symptomatology-Elderly
AHRQAgency for Healthcare Research and Quality
AIDSAcquired Immune Deficiency Syndrome
AIMSAbnormal Involuntary Movement Scale
ALCARCarnitine
AMEDAllied and Complementary Medicine Database
AMIAttention Matrices
AMPAAlpha-amino-3-hydroxy-5-methyl-4-isoxazole proprionic acid
AMTSAbbreviated Mental Test Score
APOEApolipoprotein E gene
BADLBasic Activities of Daily Living
BARS, BASBarnes Akathisia Rating Scale
BCRSBrief Cognitive Rating Scale
BDIBeck Depression Inventory
BDRSBlessed Dementia Rating Scale
BEHAVE - ADBehavioral Pathology in Alzheimer's Disease Rating Scale
BePUBerlin rating scale for psychomotoric restlessness
Bf-SZerssen Adjective Mood Scale (German test: Befindlichkeitsskala)
BGPBehavioural Rating Scale for Geriatric Patients
BGPBehavioural Rating Scale for Geriatric Patients
BIBarthel Index
bidTwice a day
BL-ABlessed A scale
Blessed-D BDRSBlessed Dementia Rating Scale
BMIBody Mass Index
BMICTBlessed Memory Information and Concentration Test
BMYNootropic agent; Bristol-Myers Squibb
BNTBoston Naming Test
BPRSBrief Psychiatric Rating Scale
BRMSBech-Rafaelsen Mania Scale
BRSDBehavioral Rating Scale for Dementia
BSRTBabcock Story Recall Test
BSRTBuschke Selective Reminding Test
BSSBehavioral Syndromes Scale for Dementia
BTTBlock Tapping Test
CADISILCerebral Autosomal Dominant Ischemia with Subcortical Leukoencephalopathy
CAMCOGCognitive section of the Cambridge Examination for Mental Disorders in the Elderly
CAMDEXCambridge Examination for Mental Disorders in the Elderly
CAMTOTCAMCOG Total Score
CANTABCambridge Automated Neuropsychological Test Assessment Battery
CAPEClifton Assessment Procedures for the Elderly
CASEClifton Assessment Scale for the Elderly
CASICognitive Abilities Screening Instrument
CATSCaregiver's Activity Time Survey
CAUSTCanadian Utilization of Service Tracking questionnaire
CBCComplete Blood Count
CBM 36–7332-methyl-alpha-ergokryptine
CCASSSComputerized Cognitive Assessment System Speed Score
CCTControlled Clinical Trial
CDR, CDRSClinical Dementia Rating; Clinical Dementia Rating Scale
CDR-NHClinical Dementia Rating - Nursing Home Version
CDR-SBClinical Dementia Rating - Sum of Boxes
CDTClock-Drawing Test
CEBClinical Epidemiology and Biostatistics
CERADConsortium to Establish a Registry for Alzheimer's Disease
CERECerebrolysin
CETMDynamic measure of comprehension process (Spilich)
CGAEClinical Global Assessment of Efficacy
CGC+Clinical Global Change-Plus
CGIClinical Global Impression
cGICCaregiver-rated Global Impression of Change
CGICClinical Global Impression of Change
CGI-GIGlobal Improvement
CGI-CGCClinical Global Impression-Clinical Global Change
CGI-S; CGI-S/CClinical Global Impression-Severity/Change
CGRSClinicians' Global Rating Score
chisqChi-Square Test
chisq M-HMantel-Haenszel Chi-Square Test
CIConfidence interval
CIBIClinician's Interview-Based Impression
CIBICClinician's Interview-Based Impression of Change
CIBIC+Clinician's Interview Based Impression of Change plus Caregiver
CIBIS+Clinician's Interview-Based Impression of Severity with Caregiver Input
CINAHLCumulative Index to Nursing & Allied Health Literature ®
CINDCognitive Impairment Not yet Diagnosed
CLEXClinical Examination
CloNDCognitive Loss No Dementia
CMAICohen Mansfield Agitation Inventory
CNTBComputerized Neuropsychological Test Battery
COSTARTCoding Symbols for a Thesaurus of Adverse Reaction Terms
COWATControlled Oral Word Association Test
CPRSComprehensive Psychopathological Rating Scale
CPTCognitive Performance Test
CSDDCornell Scale for Depression in Dementia
CSGDSCollateral Source Geriatric Depression Scale
CSICaregiver Stress Inventory
CSSCaregiver Stress Scale
CTComputerized Tomography
CVDCerebrovascular Disease
CVLTCalifornia Verbal Learning Test
dday
dEffect Size Value - (d) is the average amount of change in standard deviation units achieved by individuals in a treated group versus the change achieved by members of a control/comparison group for a particular study
DADDisability Assessment for Dementia
DATDementia Alzheimer's Type
D-BDelay relative to Baseline
DBDSDementia Behavior Disturbance Scale
DCTDigit Copying Test
DDAVPDeamino-D-Arginine-Vasopressin
DEKDihydroergokryptine
DfDegrees of Freedom
DMRDementia Questionnaire for Mentally Retarded Persons
DMSEDelayed Matching-to-Sample Exam
D-PDelay relative to Placebo
DPZDonepezil
DRSDementia Rating Scale
DSCSDepressive Symptoms Collateral Source
DSMDiagnostic and Statistical Manual of Mental Disorders (Edition III, III-R, IV)
DSPTDigit Span Test
DSSDepressive Signs Scale
DST; DSSTDigit Symbol (Substitution) Test
DTICDiscovering Things in Common
e.g.,example
ECGElectrocardiogram
EEGElectroencephalography
EFREmotional Face Recognition
EISEfficacy Index Score
EMBASEExcerpta Medica Database
EPSExtrapyramidal Symptoms
ERPEvent-Related Potential
ESRSExtrapyramidal Symptom Rating Scale
FASTFunctional Assessment Staging
FCCAFinal Comprehensive Consensus Assessment
FCMTFigure Copy/ Memory Test
FDAFood and Drug Administration
FDG-PETPositron Emission Tomography with 18-fluorodeoxyglucoseis
FIGTFigure Detection Test
FIMFunctional Independence Measure
FRSFunctional Rating Scale test
ggram
GABAGamma-aminobutyric acid
GBSGottfries-Bråne-Steen
GBS-SDSGottfries-Bråne-Steen - Scale for Dementia Syndromes
GDSGlobal Deterioration Scale
GERRIGeriatric Evaluation by Relative's Rating Instrument
GISGlobal Improvement Scale
GM-1Monosialoganglioside
GMS-AGeriatric Mental State questionnaire
GPI-EGeneral Psychiatric Impression-Elderly
GSGestalt Scale
hhour
HAM-A; HARSHamilton Anxiety Rating Scale
HAM-D; HDRSHamilton Depression Rating Scale
HDS-RHasegawa Dementia Scale-Revised
HISHachinski Ischemic Score
HIVHuman Immunodeficiency Virus
HMIIHachinski-Marshall Ischaemic Index
HVLTHopkins Verbal Learning Test
IADLInstrumental Activities of Daily Living
ICCItem Characteristic Curve analysis
ICDInternational Classification of Diseases (Version 9 or 10)
IDDDInterview for Deterioration in Daily Living Activities in Dementia-complex task
IFIndustry Funded
IMIntramuscular
I-PImprovement relative to Placebo
IPSC-ERaskin's and Crook's Inventory of Psychic and Somatic Complaints for the Elderly
IQCODEInformant Questionnaire on Cognitive Decline in the Elderly
ISIndustry provided Supplies
ITTIntention-to-treat
IUInternational Units
kgkilogram
KOLTKendrick Object Learning Test
LASLuria Alternating Series
lbspounds
LFTLiver Function Test
LMTLogical Memory Test
LNNBLuria-Nebraska Neuropsychological Battery
LOCFLast Observation Carried Forward
LPRSLondon Psychogeriatric Rating Scale
LRULipasemic Releasing Units
mmonth
Mmale
MAACL-RMultiple Affect Adjective Checklist-Revised
MACFMicrotubule Actin Crosslinking Factor
MADR-SMontgomery-Asberg Depression Rating Scale
MCIMild Cognitive Impairment
MCPTModified Continuous Performance Test
MDBMental Deterioration Battery
MeSHMedical Subject Heading
μgmicrogram
mgmilligram
MIDMulti Infarct Dementia
MinMinimal
MITTModified Intention-to-treat
mlmilliliter
MMSE (MMSE-CE)Mini-Mental Status Exam (estimated score)
CMMSECantonese MMSE
MMMSEModified MMSE
SMMSEStandardized MMSE
MNLTModified Names Learning Test
ModModerate
Modly SevModerately Severe
MQMemory Quotient
MRIMagnetic Resonance Imaging
MRSMagnetic Resonance Spectroscopy
MU-EPCMcMaster University Evidence-based Practice Center
MWFMattis Word Fluency
MXMixed results
nnumber included in study
NNo
NANot available
NAANuremberg gerontopsychological inventory for Assessing Activities of daily living
NABNurnberger-Alters-Beobachtungs-Skala
NACN-Acetylcysteine
NAINuremberg Age Inventory
NARTNelson Adult Reading Test
NCTNumber Connection Test
NDTNew Dot Test
NINon-Industry funding source
NIMCSNewcastle Memory, Information and Concentration Scale
NINCDSNational Institute of Neurological and Communicative Disorders and Stroke
NINCDS-ADRDANational Institute of Neurological and Communicative Disorders and Stroke - Alzheimer's Disease and Related Disorders Association
NINDS-AIRENNational Institute of Neurological Disorders and Stroke - Association Internationale pour la Recherche et l'Enseignement en Neurosciences
NLTNames Learning Test
NMDAN-methyl-D-aspartate
NMICSNewcastle Memory, Information and Concentration Scale
NMSNowlis Mood Scale
NNINumber Needed to Intervene
NOSGERNurses Observation Scale for Geriatric Patients
NOSGER-IADLNurses Observation Scale for Geriatric Patients - Instrumental Activities of Daily Living subscale
NOSIENurses Observation Scale for Inpatients
NPINeuropsychiatric Inventory
(NPI-4, NPI-10)Subscores 4,10
NPI-NHNeuropsychiatric Inventory - Nursing Home Version
NRNot Reported
NRSMGNon-Randomised Studies Methods Group
NSNot significant
NSLNeuropsychological Aging Self-Evaluation - List for Age Symptoms
NSTNon-Stress Test
NTNot tested
OARS - ADLOlder Americans Resource Scale
OASOvert Aggression Scale
OCObserved Cases
OLTObject Learning Test
OMDROculomotor Delayed Response
OROdds Ratio
ORG 2766Adrenocorticotropic hormone derivative
OXIROxiracetam
ozounce
pp value
P300Electrophysiological potential that is indicator of associative and cognitive processes and latency in decision making processes
PADPresenile Alzheimer's Disease
PADLPerformance of Activities of Daily Living
PANSS-ECPositive and Negative Syndrome Scale-Excited Component
PDParkinson's Disease
PDDProgressive Degenerative Dementia
PDSProgressive Deterioration Scale
PDSDPrimary Degenerative Senile Dementia
PETPositron Emission Tomography
PGIRPatient's Global Improvement Rating
PIPartially funded by Industry
PICDPresenile Idiopathic Cognitive Decline
POMSProfile of Mood States
PRLProlactin
PSMSPhysical Self-Maintenance Scale
PSPProgressive Supranuclear Palsy
PSQIPittsburgh Sleep Quality Index
qidFour times daily
QoLQuality of Life
RCorrelation Coefficient
RAResearch Assistant
RAGS; RAGS-ERelative's Assessment of Global Symptomatology (Elderly)
RAPSUScale for psychomotoric agitation
R-AVLRey auditory-verbal-learning test
RCTRandomized Controlled Trial
RDSRapid Disability Scale
RefManReference Manager Version 10®
RGRSRelatives' Global Rating Score
RMBPCRevised Memory and Behavior Problems Checklist
RMTRey Memory Test; Randt Memory Test
RMT-A&RRandt Memory Test - Acquisition and Recall
RMT-DRRandt Memory Test - Delayed Recall
RMT-MIRandt Memory Test - Memory Index
RPMRaven's Progressive Matrices
RPTRivermead Behavioural Memory Test-Profile Score
RRRelative Risk
RTReaction Time
RTIResearch Triangle Institute
SADSSchedule for Affective Disorders and Schizophrenia
SASSimpson-Angus Scale
SAS-GSelf Assessment Scale - Geriatric
S-BStabilization relative to Baseline
SBISpontaneous Behavior Interview
SCSignificant change
SCAGSandoz Clinical Assessment - Geriatric
SCBScreen for Caregiver Burden
SCWITStroop Color Word Interference Test
SDStandard Deviation
SDATSenile Dementia of the Alzheimer's Type
SEMStandard Error
SevSevere
SF-36Medical Outcomes Study Short-Form 36-Item Health Survey
SGRSStockton Geriatric Rating Scale
SHGRSStuard Hospital Geriatric Rating Scale
SIBSevere Impairment Battery
SIPSickness Impact Profile
SKTSyndrome Kurz test; Syndrome Short Test
SMQSquire's Memory Questionnaire
SMSTSternberg's Memory Scanning Test
SPECT-TcHMPAOSingle Photon Emission Computed Tomography with hexamethylpropyleneamineoxime
SPETSingle Photon Emission Tomography
SRTSelective Reminding Procedure
SRT-DRSelective Reminding Procedure-Delayed Recall
SWFTSemantic Word Fluency Test
TEPTechnical Expert Panel
TESSTreatment Emergent Symptom Scale
TESS-DOTESDosage Record and Treatment Emergent Symptom Scales
tidThree times daily
TKToken Test
TOOTask Order Officer
TPToulouse Piéron
TPATToulouse-Pieron Attention Test
TSITest for Severe Impairment
UKUnited Kingdom
UKUSide effect rating scale
UPDRSUnified Parkinson's Disease Rating Scale
USUnited States
VaDVascular Dementia
VAMSVisual Analog Mood Scale
VASVisual Analogue Scales
VHBVideorecorder Home-Behavioral assessment
vs.versus
wweek
WAISWechsler Adult Intelligence Scale
WAIS-DIDeterioration Index
WAIS-DSPTWechsler Adult Intelligence Scale - Digit Span Test
WAIS-DSSTWechsler Adult Intelligence Scale -Digit Symbol Substitution Test
WAIS-DTICWechsler Adult Intelligence Scale - Discovering Things in Common
WAIS-VOCWechsler Adult Intelligence Scale-Vocabulary Subset
WHOWorld Health Organization
WLMWord List Memory test
WMS-MQWechsler Memory Scale-Memory Learning Restoration
WMS-RWechsler Memory Scale-Revised
x2chi-square
yyear
Yyes
ZVTZahlen-Verbindungs Test -Trail Making Test

Appendix A Search strategies

Database: Cochrane Central Register of Controlled Trials <4th Quarter 2002> on OVID

Search strategy executed on February 3, 2003

Same search executed on February 4th, 2003 on the online version of the Cochrane Library for the new references <1st Quarter 2003>:

  1. (mild cognitive impairment or MCI).tw.

  2. ((cognitive impairment not dementia) or CIND).tw.

  3. ((cognitive loss not dementia) or CLOND).tw.

  4. Delirium, Dementia, Amnestic, Cognitive Disorders/

  5. exp Amnesia/

  6. Cognition Disorders/

  7. exp Dementia/

  8. exp tauopathies/

  9. dement:.tw.

  10. Alzheimer:.tw.

  11. Huntington disease/

  12. Lewy: ajd8 bod:.tw.

  13. ((cognit: or memory or mental:) adj8 (decli: or impair: or los: or deteriorat:)).tw.

  14. (chronic adj8 cerebrovascular).tw.

  15. supra-nuclear palsy.tw.

  16. (normal pressure hydrocephalus adj8 shunt:).tw.

  17. benign senescent forgetfulness.tw.

  18. (cerebr: adj8 deteriorat:).tw.

  19. cerebr: ajd8 insufficien:.tw.

  20. (confusion: or confused).tw.

  21. (pick: adj8 disease).tw.

  22. (creutzfeldt: or JCD: or CJD:).tw.

  23. (Huntington: or Huntingdon).tw.

  24. Binswanger:.tw.

  25. brain atrophy.tw.

  26. exp Cerebral Amyloid Angiopathy/

  27. neurofibrillary tangles/

  28. senile plaques/

  29. neuropil threads/

  30. spongiform encephalopathy.tw.

  31. exp Hypothyroidism/

  32. neurosyphilis/

  33. exp amyloid beta-protein/ not (Down syndrome/ or trisomy 21.tw.)

  34. (CADISIL or cerebral autosomal dominant ischemia with subcortical leukoencephalopathy).tw.

  35. (corticobasil ganglionic degeneration or cortical basal degeneration or corticabasal ganglionic degeneration).tw.

  36. multisystem atrophy.tw.

  37. exp alcohol amnestic disorder/

  38. (alcohol adj3 amnestic).tw.

  39. or/1–38

Database: Pre-MEDLINE, MEDLINE on OVID

Search strategy executed on February 4, 2003

  1. randomized controlled trial.pt.

  2. controlled clinical trial.pt.

  3. controlled clinical trials/

  4. (clinical trials, phase II or clinical trials, phase III or clinical trials, phase IV or multicenter studies).sh.

  5. random allocation.sh.

  6. double blind method.sh.

  7. cross-over studies.sh.

  8. single-blind method.sh.

  9. clinical trial.pt.

  10. (clin: adj25 trial:).ti,ab.

  11. ((singl: or doubl: or trebl: or tripl:) adj25 (blind: or mask:)).ti,ab.

  12. placebos.sh.

  13. placebo:.ti,ab.

  14. random:.ti,ab.

  15. or/1–14

  16. comparative study.sh.

  17. exp evaluation studies/

  18. follow up studies.sh.

  19. prospective studies.sh.

  20. or/16–19

  21. (tu or th).xs.

  22. treatment outcome/

  23. exp therapeutics/

  24. or/21–23

  25. 20 and 24

  26. 15 or 25

  27. (mild cognitive impairment or MCI).tw.

  28. ((cognitive impairment not dementia) or CIND).tw.

  29. ((cognitive loss not dementia) or CLOND).tw.

  30. exp dementia/

  31. exp tauopathies/

  32. (dement: or alzheimer:).tw.

  33. amentia.tw.

  34. frontotemporal lobar degeneration.tw.

  35. hiv-associated cognitive motor complex.tw.

  36. encephalopathy, aids.tw.

  37. encephalopathy, hiv.tw.

  38. mesulam syndrome.tw.

  39. progressive nonfluent aphasia.tw.

  40. binswanger disease.tw.

  41. binswanger encephalopathy.tw.

  42. leukoencephalopathy, subcortical.tw.

  43. subcortical arteriosclerotic encephalopathy.tw.

  44. chronic progressive subcortical encephalopathy.tw. or alcohol amnestic disorder/ or alcohol induced disorders, nervous system/ or (alcohol adj3 amnestic).tw. or (alcohol adj2 dysmestic).tw. or (ethanol adj3 nervous system disorders).tw. or ethyl alcohol abuse neurologic syndromes.tw.

  45. (lewy: bod: adj8 disease).tw.

  46. brain atrophy, circumscribed lobar.tw.

  47. (pick: adj8 disease).tw.

  48. exp amyloid beta-protein/ not (down syndrome/ or trisomy 21.tw.)

  49. exp cerebral amyloid angiopathy/

  50. neurofilament proteins/

  51. tau proteins/

  52. neurofibrillary tangles/

  53. neuropil threads/

  54. senile plaques/

  55. (Corticobasil ganglionic degeneration or cortical basal degeneration or cortica).tw.

  56. (CADISIL or Cerebral autosomal dominant ischemia with subcortical leukoencephalopathy).tw.

  57. Multisystems atrophy.tw.

  58. huntington disease/

  59. hydrocephalus, normal pressure/

  60. Creutzfeldt-Jakob syndrome/

  61. spongiform encephalopathy.tw.

  62. (cjd or jcd).tw.

  63. Creutzfeldt-Jakob disease.tw.

  64. spongiform encephalopathy.tw.

  65. exp Hypothyroidism/

  66. exp Vitamin B 12 Deficiency/

  67. exp Neurosyphilis/

  68. or/27–67

  69. 26 and 68

  70. animal.sh.

  71. 69 not 70

  72. not (comment or editorial or news or letter).pt.

  73. and eng.la.

  74. limit 73 to yr=1998-2003

Database: EMBASE <1996 to 2003 Week 5> on OVID

Search strategy executed on February 6, 2003

  1. (mild cognitive impairment or MCI).tw.

  2. ((cognitive impairment not dementia) or CIND).tw.

  3. ((cognitive loss not dementia) or CLOND).tw.

  4. exp dementia/

  5. (dement: or alzheimer:).tw.

  6. amentia.tw.

  7. frontotemporal lobar degeneration.tw.

  8. hiv-associated cognitive motor complex.tw.

  9. encephalopathy, aids.tw.

  10. encephalopathy, hiv.tw.

  11. mesulam syndrome.tw.

  12. progressive nonfulent aphasia.tw.

  13. binswanger disease.tw.

  14. binswanger encephalopathy.tw.

  15. leukoencephalopathy, subcortical.tw.

  16. subcortical arteriosclerotic encephalopathy.tw.

  17. chronic progressive subcortical encephalopathy.tw.

  18. exp Korsakoff psychosis/ or exp Wernicke Korsakoff syndrome/

  19. (alcohol adj3 amnestic).tw.

  20. (alcohol adj2 dysmnestic).tw.

  21. (ethanol adj3 nervous system disorders).tw.

  22. ethyl alcohol abuse neurologic syndromes.tw.

  23. (Lewy: bod: adj8 disease).tw.

  24. brain atrophy, circumscribed lobar.tw.

  25. (Pick: adj8 disease).tw.

  26. exp brain atrophy/ or exp brain cortex atrophy/ or exp brain degeneration/ or exp corticobasal degeneration/ or exp lewy body/ or exp neurofibrillary tangle/ or exp neuropil thread/ or exp organic brain syndrome/

  27. exp amyloid beta-protein/ not (exp Down syndrome/ or trisomy 21.tw.)

  28. exp Vascular Amyloidosis/

  29. exp Neurofilament Protein/

  30. Tau Protein/

  31. Neurofibrillary Tangle/

  32. Neuropil Thread/

  33. Senile Plaque/

  34. (corticobasil ganglionic degeneration or cortical basal degeneration or corticobasal degeneration).tw.

  35. (CADISIL or Cerebral autosomal dominant ischemia with subcortical leukoencephalopathy).tw.

  36. multisystems atrophy.tw.

  37. Normotensive Hydrocephalus/

  38. Creutzfeldt Jakob Disease/

  39. exp Brain Spongiosis/

  40. spongiform encephalopathy.tw.

  41. (CJD or JCD).tw.

  42. Creutzfeldt-Jakob disease.tw.

  43. exp Hypothyroidism/

  44. Cyanocobalamin Deficiency/

  45. Neurosyphilis/

  46. or/1–45

  47. multicenter study/ or phase 2 clinical trial/ or phase 3 clinical trial/ or phase 4 clinical trial/ or randomized controlled trial/ or exp postmarketing surveillance/

  48. randomization/

  49. crossover procedure/ or double blind procedure/ or experimental design/ or latin square design/ or parallel design/ or single blind procedure/

  50. (clin: adj25 trial:).ti,ab.

  51. ((singl: or doubl: or trebl: or tripl:) adj25 (blin: or mask:)).ti,ab.

  52. Placebo/

  53. placebo:.ti,ab.

  54. random:.ti,ab.

  55. exp comparative study/ or exp drug comparison/

  56. exp “evaluation and follow up”/

  57. longitudinal study/ or major clinical study/ or prospective study/

  58. or/47–54

  59. or/55–57

  60. (tu or th).fs.

  61. exp treatment outcome/

  62. exp therapy/

  63. or/60–62

  64. 59 and 63

  65. 58 or 64

  66. 65 and 46

  67. exp animal/

  68. 66 not 67

  69. 68 not (comment or editorial or news or letter or conference paper).pt.

  70. limit 69 to English language

  71. limit 70 to yr=1998-2003

Database: AMED (Allied and Complementary Medicine) <1985 to February 2003> on OVID

Search strategy executed on March 4, 2003

  1. exp clinical trials/ or double blind method/ or random allocation/

  2. clinical trial.pt.

  3. (clin: adj25 trial:).ti,ab.

  4. ((singl: or doubl: or trebl: or tripl:) adj25 (blind or mask:)).ti,ab.

  5. placebos.sh.

  6. placebo:.ti,ab.

  7. random:.ti,ab.

  8. or/1–7

  9. (mild cognitive impairment or MCI).tw.

  10. ((cognitive impairment not dementia) or CIND).tw.

  11. ((cognitive loss not dementia) or CLOND).tw.

  12. exp dementia/

  13. (dement: or Alzheimer:).tw.

  14. amentia.tw.

  15. frontotemporal lobar degeneration.tw.

  16. hiv-associated cognitive motor complex.tw.

  17. (encephalopathy, aids or encephalopathy, hiv).tw.

  18. mesulam syndrome.tw.

  19. progressive nonfluent aphasia.tw.

  20. binswanger disease.tw.

  21. binswanger encephalopathy.tw.

  22. leukoencephalopathy, subcortical.tw.

  23. subcortical arteriosclerotic encephalopathy.tw.

  24. chronic progressive subcortical encephalopathy.tw.

  25. (alcohol adj3 amnestic).tw.

  26. (alcohol adj2 dysmnestic).tw.

  27. (ethanol adj3 nervous system disorders).tw.

  28. ethyl alcohol abuse neurologic syndromes.tw.

  29. (Lewy: bod: adj8 disease).tw.

  30. brain atrophy, circumscribed lobar.tw.

  31. (Pick: adj8 disease).tw.

  32. (corticobasil ganglionic degeneration or cortical basal degeneration or cortica).tw.

  33. (CADISIL or cerebral autosomal dominant ischemia with subcortical leukoencephalopathy).tw.

  34. Multisystems atrophy.tw.

  35. spongiform encephalopathy.tw.

  36. (cjd or Jcd).tw.

  37. Creutzfeldt-Jakob disease.tw.

  38. hypothyroidism/

  39. or/9–38

  40. 8 and 39

  41. 40 not (comment or editorial or news or letter).pt.

  42. 41 and English.lg.

Database: CINAHL <1982 to February Week 3 2003> on OVID

Search strategy executed on March 5, 2003

  1. crossover design/ or empirical research/ or experimental studies/ or exp clinical trials/ or community trials/ or factorial design/ or quantitative studies/

  2. clinical trial.pt.

  3. (clin: adj25 trial:).ti,ab.

  4. ((singl: or doubl: or trebl: or tripl:) adj25 (blind: or mask:)).ti,ab.

  5. Placebos/

  6. placebo:.ti,ab.

  7. random:.ti,ab.

  8. Study Design/

  9. or/1–8

  10. (mild cognitive impairment or MCI).tw.

  11. ((cognitive impairment not dementia) or CIND).tw.

  12. ((cognitive loss not dementia) or CLOND).tw.

  13. exp Dementia/

  14. (dement: or Alzheimer:).tw.

  15. amentia.tw.

  16. frontotemporal lobar degeneration.tw.

  17. hiv-associated cognitive motor complex.tw.

  18. (encephalopathy, aids or encephalopathy, hiv).tw.

  19. mesulam syndrome.tw.

  20. progressive nonfulent aphasia.tw.

  21. Binswanger disease.tw.

  22. Binswanger encephalopathy.tw.

  23. leukoencephalopathy, subcortical.tw.

  24. (chronic progressive subcortical encephalopathy or (alcohol adj3 amnestic) or (alcohol adj2 dysmestic) or (ethanol adj3 nervous system disorders) or ethyl alcohol abuse neurologic syndromes).tw.

  25. Lewy body disease.tw.

  26. brain atrophy, circumscribed lobar.tw.

  27. Pick: disease.tw.

  28. (corticobasil ganglionic degeneration or cortical basal degeneration or cortica).tw.

  29. (CADASIL or cerebral autosomal dominant ischemia with subcortical leukoencephalopathy).tw.

  30. multisystems atrophy.tw.

  31. Huntington's Disease/

  32. Creutzfeldt-Jakob Syndrome/

  33. spongiform encephalopathy.tw.

  34. (cjd or jcd).tw.

  35. Creutzfeldt-Jakob disease.tw.

  36. spongiform encephalopathy.tw.

  37. exp Hypothyroidism/

  38. Neurosyphilis/

  39. or/10–38

  40. 9 and 39

  41. 40 not (editorial or letter or proceedings).pt.

  42. limit 41 to English

Database: Ageline <1978 to December 2002> on SILVERPLATTER

Search strategy executed on March 6, 2003

#1 randomized controlled trials in DE

#2 controlled clinical trials in de

#3 random allocation

#4 controlled clinical trial*

#5 randomized controlled trial*

#6 random allocation

#7 double blind method

#8 single blind method

#9 #1 or #2 or #3 or #4 or #5 or #6 or #7 or #8

#10 clinical trial*

#11 (clin* near trial*) in TI

#12 (clin* near trial*) in AB

#13 (singl* or doubl* or trebl* or tripl*) near (blind* or mask*)

#14 (#13 in TI) or (#13 in AB)

#15 placebo*

#16 Placebo* in TI

#17 placebo* in AB

#18 random* in TI

#19 random* in AB

#20 research design

#21 #1 or #2 or #3 or #4 or #5 or #6 or #7 or #8 or #9 or #10 or #11 or #12 or #13 or #14 or #15 or #16 or #17 or #18 or #19 or #20

#22 mild cognitive impairment

#23 (cognitive impairment and dementia) or CIND

#24 (cognitive loss not dementia) or CLOND

#25 (explode 'Dementia-' in DE) or (explode 'Early-Onset-Dementia' in DE) or (explode 'Vascular-Dementia' in DE)

#26 dement* or alzheimer*

#27 amentia*

#28 frontotemporal lobar degeneration

#29 hiv-associated cognitive motor complex

#30 aids associated encephalopathy

#31 hiv associated encephalopathy

#32 mesulam syndrome

#33 progressive nonfluent aphasia

#34 binswanger disease

#35 binswanger encephalopathy

#36 leukoencephalopathy subcortical

#37 subcortical arteriosclerotic encephalopathy

#38 chronic progressive subcortical encephalopthy

#39 alcohol near amnestic

#40 alcohol amnestic disorder

#41 alcohol induced disorders

#42 alcohol near dysmnestic

#43 ethanol near nervous system disorders

#44 lewy* bod* near disease

#45 ethyl alcohol abuse neurologic syndromes

#46 brain atrophy lobar

#47 Pick* near disease*

#48 cerebral amyloid angiopathy

#49 neurofilament protein*

#50 tau protein*

#51 neurofibrillary tangles

#52 neuropil threads

#53 senile plaque*

#54 corticobasil ganglionic degeneration or cortical basal degeneration or cortica

#55 cadisil

#56 cerebral autosomal dominant ischemia with subcortical leukoencephalopathy

#57 multisystems atrophy

#58 explode 'Huntingtons-Disease' in DE

#59 normal pressure hydrocephalus

#60 Creutzfeldt-Jakob syndrome

#61 spongiform encephalopathy

#62 cjd or jcd

#63 Creutzfeldt-Jakob disease

#64 hypothyroidism

#65 vitamin b12 deficiency

#66 neurosyphilis

#67 #22 or #23 or #24 or #25 or #26 or #27 or #28 or #29 or #30 or #31 or #32 or #33 or #34 or #35 or #36 or #37 or #38 or #39 or #40 or #41 or #42 or #43 or #44 or #45 or #46 or #47 or #48 or #49 or #50 or #51 or #52 or #53 or #54 or #55 or #56 or #57 or #58 or #59 or #60 or #61 or #62 or #63 or #64 or #65 or #66

#68 #67 and #21

* #69 #68 and (DT=JOURNAL-ARTICLE)

Database: PsychINFO <1967 TO 2002/12> on SILVERPLATTER

Search strategy executed on March 7, 2003

#1 randomized controlled trial in PT

#2 controlled clinical trial in PT

#3 controlled clinical trials

#4 random allocation

#5 (clinical trial*) in DE,SU

#6 (random allocation) in DE,SU

#7 (double blind method) in DE,SU

#8 (cross-over studies) in DE,SU

#9 (single-blind method) in DE,SU

#10 (clinical trial) in PT

#11 ((clin* near trial*)) in TI

#12 (placebo*) in DE,SU

#13 (placebo*) in TI

#14 (random*) in TI

#15 #1 or #2 or #3 or #4 or #5 or #6 or #7 or #8 or #9 or #10 or #11 or #12 or #13 or #14

#16 mild cognitive impairment

#17 MCI

#18 (cognitive impairment not dementia) or CIND

#19 (cognitive loss not dementia) or CLOND

#20 (explode 'AIDS-Dementia-Complex' in DE) or (explode 'Alzheimers-Disease' in DE) or (explode 'Dementia-with-Lewy-Bodies' in DE) or (explode 'Dementia-' in DE) or (explode 'General-Paresis' in DE) or (explode 'Multi-Infarct-Dementia' in DE) or (explode 'Presenile-Dementia' in DE) or (explode 'Senile-Dementia' in DE) or (explode 'Vascular-Dementia' in DE)

#21 dement* or Alzheimer*

#22 amentia

#23 HIV-associated cognitive motor complex

#24 encephalopathy aids

#25 encephalopathy hiv

#26 mesulam syndrome

#27 progressive nonfluent aphasia

#28 binswanger disease

#29 binswanger encephalopathy

#30 leukoencephalopathy subcortical

#31 subcortical arteriosclerotic encephalopathy

#32 chronic progressive subcortical encephalopathy

#33 alcohol near amenstic

#34 alcohol near dysmnestic

#35 ethanol near (nervous system disorders)

#36 ethyl alcohol abuse neurologic syndromes

#37 lewy* bod* near disesase

#38 brain atrophy circumscribed lobar

#39 Pick* near disease

#40 cerebral amyloid angiopathy

#41 (neurofilament proteins) in DE,SU

#42 (tau proteins) in DE,SU

#43 (neurofibrillary tangles) in DE,SU

#44 (neuropil threads) in DE,SU

#45 (senile plaque*) in DE,SU

#46 (corticobasil ganglionic degeneration) or (cortical basal degeneration)

#47 cadisil or (cerebral autosomal dominant ischemia with subcortical leukoencephalopathy) (0 records)

#48 multisystems atrohpy (0 records)

#49 'Huntingtons-Disease' in DE (883 records)

#50 hydrocephalus normal pressure (140 records)

#51 'Creutzfeldt-Jakob-Syndrome' in DE (109 records)

#52 spongiform encephalopathy (39 records)

#53 cjd or jcd (80 records)

#54 Creutzfeldt-Jakob disease (183 records)

#55 explode 'Hypothyroidism-' in DE (260 records)

#56 explode 'Neurosyphilis-' in DE (39 records)

#57 vitamin B12 deficien* (50 records)

#58 frontotemporal lobar degeneration (15 records)

#59 #16 or #17 or #18 or #19 or #20 or #21 or #22 or #23 or #24 or #25 or #26 or #27 or #28 or #29 or #30 or #31 or #32 or #33 or #34 or #35 or #36 or #37 or #38 or #39 or #40 or #41 or #42 or #43 or #44 or #45 or #46 or #47 or #48 or #49 or #50 or #51 or #52 or #53 or #54 or #55 or #56 or #57 or #58 (27527 records)

#60 #15 and #59 (338 records)

* #61 #60 and (LA=ENGLISH) and (PO=HUMAN) (322 records)

Appendix B Forms

graphic element

USING THE FORM

  1. Be sure to fill in the DM ID#, the Name of the First Author and Your Initials in the three boxes at the top right of the form.

  2. If a paper should be excluded, fill in the “EXCLUDE” box and fill in the box for the reason for exclusion that occurs first in the list of 7 reasons for exclusion.

  3. The boxes for “KEEP ANYWAY as SPECIAL' or "should check for full article” can also be checked if it is an excluded article but may be useful to our review as background or clarification of it appears to be a companion paper for another report that is likely in our review.

  4. If you choose to exclude the paper, the details for included papers do not need to be filled in.

  5. If a paper should be included, fill in the “INCLUDE” box and fill in the information for ONLY TWO of the subsequent categories listed: Diagnosis of interest and Treatments randomized. Ignore Population analyzed, Outcomes reported and Other........we may use them later for grouping.

  6. If you are not sure if a paper qualifies for inclusion and want it to be looked at by our clinicians or methodologists, mark “CONSULTATION REQUIRED”

  7. If a paper is excluded because the Dementia population is not defined by DSM, NINCDS OR ICD-10 criteria, save it for consultation and mark "population' beside the Consultation Required box.

EXCLUDING ARTICLES

  1. Complete report must be in English to be included. If there is only an English abstract, exclude the article.

  2. Only full reports will be included. If the article is a letter, comment, editorial, news, abstract, proceedings of a meeting or any other brief description, exclude the article. If it seems that the study would otherwise be included, check the box “Should check for full article”.

  3. All dementia populations will be accepted at this stage if they are documented by DSM III, DSM III-R, DSM IV, NINCDS-ADRDA, ICD-9, ICD-10. The population studied may include those with mild cognitive impaired (MCI), cognitive impairment, not Dementia (CIND), cognitive loss, not Dementia (CLOND). If the author cites the article by McKhann as the criteria for diagnosis, it can be included because that is the criteria for NINCDS.

  4. Articles included should look at treatment of disease, cognition, behaviour, or quality of life, time to deterioration, depression, falls etc. Exclude if outcomes reported are ONLY neurophysiologic or neuroimaging (eg EEG)

  5. Exclude if not a report of a randomized controlled trial.

  6. Outcomes reported should be for subjects with Dementia. If the entire population does not have Dementia, only data sub-grouped for Dementia will be examined. Exclude if there are no outcomes of interest reported for specifically Dementia subjects.

  7. Any previously unmentioned, compelling reason to exclude the study should be specified.

INCLUDING ARTICLES

  1. If you are unclear about whether the diagnosis is an included one, mark the referral box and pass along for a consult. If entire population is demented, mark boxes for all specific diagnoses included in study outcomes. If not all of population is demented, mark boxes for all specific diagnoses included in subgroup analysis of outcomes. If diagnosis is not listed as a choice, but is an included diagnosis, specify on line provided.

  2. Specify treatments if they were randomly provided to the dementia population.

DETAIL ABOUT DISEASE TERMS (terms from the literature search)

NOT

INCLUDE

*Keep articles aside in a group if the intervention is directed toward the caregiver or caregiver/patient dyad.

graphic element

graphic element

Table A

Key Characteristics
REF ID #Author YearFunding SourceQuality ScoreInterventionsCriteria for DiagnosisDiagnosisDisease SeverityTotal Number RandomizedNumber Completing TrialMean age (range) % Male (M) PopulationHighest DoseTreatment PeriodOutcomes MeasuredOutcome reports stratified by patient characteristic
GUIDE TO DATA EXTRACTION TABLE
REF ID #Enter number written on top of first page of article.
Author / YearEnter last name of first author on one line and year of publication on the second line.
Funding SourceEnter one of the following codes: IF (Industry Funded) PI (Partially funded by Industry) IS (Industry provided Supplies) NI (Non-Industry funding source) NR (not reported) Use more than one code if necessary.
Quality scoreEnter the Modified Jadad score for Alzheimer's Disease (out of 8 points)
InterventionsEnter name of drugs used in trial. Use the most commonly recognized name (eg use Tacrine instead of generic name). If more than one drug is used, put one on each line. If a dose response trial is reported, treat as one drug at the highest dose. If placebo is used, enter as first drug.
Criteria for DiagnosisIndicate what criteria were used for diagnosis. It should be one of NINCDS, DSMIII, DSMIII-R, DSMIV, ICD-10
DiagnosisEnter all dementia diagnoses included in the trial.
 PDD = PRIMARY DEGENERATIVE DEMENTIA
 MID = MULTI- INFARCT DEMENTIA
 AD = DAT = SDAT = ALZHEIMER'S DEMENTIA
 MIXED
 VaD = VASCULAR DEMENTIA
 DEMENTIA
Disease SeverityUse the descriptive terms as used in the paper.
Total Number randomizedGive the number in all groups that were initially randomized.
Number completing trialGive the number in all groups that completed the treatment. If ITT population is given, report also.
Mean age (range)Enter whatever information is given in paper for whole population
% Male (M)Compute this figure if possible for those randomized at baseline
PopulationGive any special inclusion criteria (or existing factor in population) which may affect the external validity (eg comorbid disorders, race, setting)
Highest DoseGive the dose per day. If the dose was titrated up to individual doses, give the highest dose used and enter details of titration on the second line.
Record as reported in paper - make sure to note if dose is by weight or a set amount and report how often given (preferably by day).
Treatment PeriodThis should be the length of time for which the subjects received drug treatment. Use the longest period if there is more than one. Note here if there is an open extension
Outcomes MeasuredList all of the tests reported. If a battery of psychological tests were done, list the name of the battery.
If physical tests are reported (e.g. blood tests, scans) list in very general terms (e.g. blood levels, EEG).
Outcome reports stratified by patient characteristicEnter a Y if any of the data is reported in any way and is stratified by any patient characteristic (e.g. gender, age, race, genotype, education) and describe what the characteristic is. Otherwise, enter N.

Table B

Evidence Results
DRUG
REF ID #Author YearAnalysis GroupsTest UsedResult ValueP ValueResult ValueP ValueResult ValueP value
BaselineMid-Point: (specify)Final: (specify) 24w
BaselineMid-Point: (specify)Final: (specify)
GUIDE TO DATA EXTRACTION TABLE B
REF ID #Enter number written on top of first page of article.
Author / YearEnter last name of first author on one line and year on the second line
Analysis groups andUse one line for each intervention that will be reported on. Number them as follows (1] 2] 3] )
InterventionsMake the Control condition # 1 and Drug treatment conditions # 2 and so forth.
If results are reported as change or relationship between baseline and later time-points or between placebo and drug conditions, these will be added as additional Analysis Groups (e.g. [ 3] Baseline vs 28w Placebo) would be entered as the third Analysis Group and would be for the score that reports the change in value between the one at baseline and the one at 28 weeks for the placebo group. The Analysis Group [ 5] Placebo vs Tacrine 28w) would be the fifth Analysis Group and be used for the score that reports the change in value between the placebo condition and the drug condition at 28 weeks.
If subgroup analysis has been reported, an Analysis Group can be created to report the results.
For Analysis Groups that refer to the drug used, enter the name of drug used in trial. Use the most commonly recognized name (eg use Tacrine instead of generic name). If more than one drug is used, put one on each line. If variable dosing is used within an arm, report the higher dose. Put the Placebo condition first in the list
Test UsedList all psychological and functional tests used that have extractable data (cognition, behavior, functional, global). Do not report on physiological measurements such as blood levels. Enter the primary outcomes first in the list and bold the font on the test name.
Baseline, Mid-Point, FinalEnter number of hours, days, weeks, months, years from baseline measurement to current measurement. Use abbreviations (h = hours, d = days, w = weeks, m = months, y = years)
If more than 3 time points given, use the most central one for Time 2.
Result ValuesEnter the value for Mean ± Standard Deviation for each arm for each time-point for each test. If Standard Error is used, use an *.
If % is used, enter the % sign after the value.
If the test consists of subsections (eg a battery that also reports the total), just use the total score unless a sub-score is a primary outcome.
Give P value if provided. If no P value available, but CI reported, put CI in P value column.
Overall Summary Table Interpretation
SC= statistically significant CHANGE at the alpha = 0.05
= based on the PRIMARY outcomes (no need to specify as 1° because this is the default). If the paper does not specify primary or secondary ASSUME primary for all reported outcomes
= report secondary outcomes ONLY when there is NO primary variable for that domain AND indicate with (2°) in front of the result code
= based on the ITT analyses results ONLY; if ITT results were not reported in the paper, then indicate with an asterick (*) located behind the result code (i.e. NS*)
= this change is ONLY relative to placebo (within group findings are reported in Table B..so not necessary to recapitulate this in summary table)
= in those instances where there is A PRIORI hypotheses for subgroup analyses and there are statistical results reported, then indicate with a symbol (i.e. # or ^) that the subgroup analyses were SC or NS and specify with respect to what factor (i.e. Vascular dementia versus not, or gender, etc)
NS= not statistically significant effect for primary or secondary outcomes (some additional domains are tested with the secondary outcomes)
NT= outcomes were not tested reflecting in this domain
MX= mixed results for two primary outcomes (i.e one was significant and the other variable was not significant) and do not represent a SUBGROUP analysis
= indicates that two measures within the same domain show conflicting results (one outcome is significant and the other is not significant)
Reporting Safety information in Randomized controlled trials (Ioannidis and Lau, 2002)
DOMAIN recommendation Ioannidis and LauOUR QUESTIONSTATUS
FREQUENCY of WITHDRAWALS due to adverse events (AE)Do the authors specify the number of patients withdrawn from the study due to AE per study arm and per type of AE that caused withdrawalYN
Unclear
YN
Unclear
FREQUENCY of AE (can be stated as a count or as a proportion for either CLINICAL AE or LABORATORY-DEFINED TOXICITY)Do the authors provide the number of AE with respect to severity (reference to a known scale of toxicity such as mild, moderate, severe, life threatening or grades 1 to 3, etc) per study arm and per type of specific AE (i.e. diarrhea, headache, etc)YN
Unclear
YN
Unclear
Was the recording of the AE (i.e. surveillance) ACTIVE or PASSIVEIs the surveillance ACTIVE (actively monitor the presence of absence of AE during the study…do not rely on methods that are PASSIVE (sometimes called spontaneous reporting)YN
Unclear
Describe a SCHEDULE for collection of safety infoOptional
1) Do the authors specify the schedule for collection of safety information?
FREQUENCY of SERIOUS AE (i.e. results in death, requires inpatient hospitalization, persistent or significant disability or is life threatening, WHO 2001)Are exact numbers for high-grade (serious and life threatening) clinical AE laboratory toxicity reported.YN
Unclear
SEVERITY of each AE (i.e. mild, moderate, or severe headache)Have each of the AE been reported with respect to a severity continuum (i.e. mild diarrhea, severe headache, etc) ?YN
Have some of the AE been reported with respect to a severity grade?Unclear
Optional:
YN
Unclear
Description of UNUSUAL or NOT PREVIOUSLY RECORDED AEHas a detailed description of cases of unusual or not previously recorded AE effects been presented?YN
Unclear
STANDARDIZED SCALES used to capture AE.Do the authors report the use of widely known, standardized scales for AE?YN
Specify scale:Unclear
If the scale is new, do the authors provide definitions for the grades of severityOptional:
YN
Unclear
Identify specific SAFETY TESTS or QUESTIONNAIRES used for data collectionDo the authors identify specific safety tests or questionnaires used for data collectionYN
Unclear

THRESHOLD SCORING for ADVERSE EVENTS:

  1. Scoring:

    • YES = 1

    • NO = 0

    • Unclear = consult with other rater to reach consensus

  2. For the first 3 questions a MINIMUM score of 3 is required to proceed to the subsequent 5 questions

  3. If all patients were accounted for (i.e. no withdrawals), then we assume a score of 2 for the WITHDRAWAL due to AE question

  4. If no mention of serious (see definition) is mentioned in the paper, then we will ASSUME that they were NOT monitored (rather than not reported)

Appendix C Evidence tables