Figure 1. Pathway for the progression of dementia and the ideal application of drug interventions within this framework
The Agency for Healthcare Research and Quality (AHRQ), through its Evidence-Based Practice Centers (EPCs), sponsors the development of evidence reports and technology assessments to assist public- and private-sector organizations in their efforts to improve the quality of health care in the United States. The reports and assessments provide organizations with comprehensive, science-based information on common, costly medical conditions and new health care technologies. The EPCs systematically review the relevant scientific literature on topics assigned to them by AHRQ and conduct additional analyses when appropriate prior to developing their reports and assessments.
To bring the broadest range of experts into the development of evidence reports and health technology assessments, AHRQ encourages the EPCs to form partnerships and enter into collaborations with other medical and research organizations. The EPCs work with these partner organizations to ensure that the evidence reports and technology assessments they produce will become building blocks for health care quality improvement projects throughout the Nation. The reports undergo peer review prior to their release.
AHRQ expects that the EPC evidence reports and technology assessments will inform individual health plans, providers, and purchasers as well as the health care system as a whole by providing important information to help improve health care quality.
We welcome written comments on this evidence report. They may be sent to: Director, Center for Outcomes and Evidence, Agency for Healthcare Research and Quality, 540 Gaither Road, Rockville, MD 20850.
Carolyn M. Clancy, M.D.
Director
Agency for Healthcare Research and Quality
Name of Funding Government Agency Director
Director
Name of Funding Government Agency
Jean Slutsky, P.A., M.S.P.H.
Acting Director, Center for Outcomes and Evidence
Agency for Healthcare Research and Quality
The authors of this report are responsible for its content. Statements in the report should not be construed as endorsement by the Agency for Healthcare Research and Quality or the U.S. Department of Health and Human Services of a particular drug, device, test, treatment, or other clinical service.
The authors of this report would like to thank and acknowledge the assistance of the following people:
Sarah Tanner-Bouma, Eileen Estrabillo, Connie Freeborn, Naomi Kirshenbaum, Elizabeth Simmons, Rob Stevens, and Hongmei Wu for assistance with data extraction and preparation of data tables;
Carol Lefebvre for her assistance in developing the search strategies;
Angela Eady for her assistance in developing the search strategies and editing the final version of the document;
Bin Zhu, Gary Foster, and Ayaz Hyder for their assistance with the meta-analysis;
Amy Steep and Tom Flemming of the McMaster University Health Sciences library for helping us with our searches, retrievals, and development of methods to access the library from off-site;
Dan Ellington from ISI Researchsoft for answering questions regarding the management of our references and citations;
Roxanne Cheeseman, Laurie Cocking, Susan Hanna, and Jessica Cheeseman for their assistance in the preparation and distribution of the manuscript.
Context. Dementia is a chronic progressive disease with no known cure. It affects cognition, behavior/mood, physical functions and activities of daily living, and caregiver burden. Therapeutic interventions for dementia aim to affect these domains.
Objectives. To review the evidence and answer the questions: Does pharmacotherapy for dementia syndromes improve cognitive symptoms and outcomes? Does pharmacotherapy delay cognitive deterioration or delay disease onset of dementia syndromes? Are certain drugs, including alternative medicines (non-pharmaceutical), more effective than others? Do certain patient populations benefit more from pharmacotherapy than others? What is the evidence base for the treatment of ischemic vascular dementia (VaD)?
Data sources. Studies were identified by searching the Cochrane Central trial registry, MEDLINE® , PreMedline®, EMBASE, AMED, CINAHL®, Ageline, and PsycINFO.
Study selection. English-language randomized controlled trials were selected if they evaluated pharmacological agents for adults with a diagnosis of dementia according to the criteria of International Classification of Diseases (ICD), Diagnostic and Statistical Manual of Mental Disorders (DSM) or National Institute of Neurological and Communicative Disorders and Stroke (NINCDS). Crossover trials and studies with a quality score < 3 on the Jadad Scale were excluded.
Data extraction. Data were extracted on type of dementia, severity of disease, setting, regimen of pharmacological agents, study duration, main outcome measures, adverse effects, and results. The quality of studies was assessed, and the quality of adverse effect reporting was assessed. Effect sizes were calculated and data were pooled when appropriate.
Data synthesis. (1) Efficacy: One hundred and eighty-six Randomized Controlled Trials (RCTs) evaluated 97 drugs. As expected the findings varied with the dementia population and the specific outcomes in the various domains. Those pharmacological agents that showed a consistent effect of benefit are as follows: A) Global assessment was improved by donepezil, galantamine, rivastigmine, velnacrine, cerebrolysin and idebenone; B) Cognition (general and specific) was improved by donepezil, galantamine, metrifonate (this drug has been withdrawn from use in North America because of safety concerns), nicergoline, physosligmine, rivastigmine, velnacrine, memantine, cerebrolysin, ginkgo biloba, idebenone and propentofylline; C) Behavior/mood was improved by haloperidol; D) Quality of life/Activities of Daily Living (ADL) was improved by donepezil, galantamine and posatirelin. In general, caregiver burden and quality of life/ADL were not frequently evaluated. (2) Delay disease: Cerebrolysin, selegiline plus vitamin E, and donepezil showed some significant effects in delaying disease progress in patients with mild to moderate and moderately severe Alzheimer's disease. (3) Head to head comparisons: Superiority was seen for sulphomucopolysaccharides over CDP-choline, donepezil over vitamin E, antagonic-stress over nicergoline, antagonic-stress over meclofenoxate, posatirelin over citicoline, and pyritinol over hydergine. (4) Patient populations: Stratified analyses included: age, gender, Apolipoprotein E (APOE) genotype, disease type, disease severity, race by location, care dependence, and presence of depression. Single populations of dementia subjects with Down's syndrome, and depression were evaluated. Evidence was inconclusive for this question. (5) Ischemic VaD: A total of 20 pharmacological interventions in 29 studies were applied to vascular dementias. Differences were suggested between multi-infarct dementia (MID) and Alzheimer's disease (AD) for 5′ -MTHF-trazodone, AD and VaD for citalopram, and AD and MID for Ginkgo biloba. Trials with VaD patients showed effects for memantine, nicergoline, pentoxyfylline, idebenone, donepezil and cerebrolysin.
Conclusions. Pharmacotherapy for dementia can improve symptoms and outcomes. Adverse events should be more systematically reported. Few studies evaluated delay in either disease onset or progression, but there was some evidence suggesting delay in progression. Few studies compared drugs with other drugs. Due to poor evaluation, data was limited to consider efficacy of pharmacotherapy in different subgroups of patients. Some agents have been shown to be effective in VaD patients.
This review focuses on the pharmacological treatment of dementia. Dementia is a syndrome of acquired cognitive defects sufficient to interfere with social or occupational functioning, which results from various central neurodegenerative and ischemic processes. Dementia has become a major public health problem due to its increasing prevalence, long duration, caregiver burden, and high financial cost of care. The prevalence of dementia varies as a function of the defining criteria as shown by Erkinjuntti et al. (1997),1 who showed a range from 3.1% using the International Classification of Diseases, Tenth Revision (ICD-10) criteria, up to 29.1% using the Diagnostic and Statistical Manual, Third Edition (DSM-III) criteria. Jorm et al. (1987)2 conducted a meta-analysis based on 22 international studies and found that the actual prevalence rates differed significantly from study to study. However, this meta-analysis demonstrated that the prevalence increased exponentially with age. The prevalence ranged from 0.7% for 60 – 64 year olds to 24% for people over the age of 85 years. In the United States, the prevalence of Alzheimer's disease (AD) is projected to quadruple to one in 45 Americans in the next 50 years3 across all ages. The Canadian Study of Health & Aging (CSHA)4 estimated the prevalence of dementia in Canada at 8% (approximately 252,600 cases) in 1991 among seniors over the age of 65 years. The prevalence of dementia increases to 34% among those aged 85 years or more.4 The age-standardized incidence of dementia in Canada has been estimated at 21.8 per thousand for females and 19.1 per thousand for males.5 The prevalence is expected to double to half a million cases in Canada by 2013.6 Because the world's population is progressively aging, especially in the developed nations, more people are falling into age groups where the prevalence of dementia is highest. From a clinical perspective, dementia predominately affects 1) cognition, 2) behavior/mood, 3) physical functions and activities of daily living, and 4) caregiver burden. Therapeutic interventions for dementia aim to affect these four primary domains.
Pharmacotherapy is often the primary intervention used to improve symptoms or delay the progression of dementia syndromes. The pharmacological agents used vary significantly with respect to their therapeutic actions. The most common pharmacological agents used in North America modify the activity of cholinesterases—enzymes, which degrade acetylcholine, a neurotransmitter that is critical to the neurons involved in cognition (e.g. memory, thought, and judgment). Other approaches include the use of anti-oxidants, which work by minimizing the effects of free radicals that are released through normal oxidative metabolism. These free radicals may cause neuronal damage and play a role in the development of dementia. Similarly, it is believed that inflammation contributes to nerve cell damage and dementia; hence anti-inflammatory drugs may act by decreasing inflammation, potentially reducing nerve degeneration, which may in turn slow or even prevent dementia illnesses.
Other pharmacological interventions that have been studied include cholesterol-lowering agents, anti-hypertensives, folic acid, hormones (e.g. estrogen), behavior and mood altering drugs, anti-amyloid strategies (e.g. immunization, aggregation inhibitors, and secretase inhibitors), transition metal chelators, nerve growth factors, and agents that target neurotransmitters other than acetylcholine and its receptors. The various pharmacotherapeutic agents available to treat problems associated with dementia have varying levels of evidence to support their efficacy. This report is a systematic evaluation of the evidence for pharmacological interventions in the treatment of dementia in the domains of cognition, global function, behavior/mood, quality of life/ADL, and caregiver burden.
Determination of disease onset presents considerable difficulty, as dementia, by definition, has an insidious and gradual progression. A number of diagnostic models have been used to classify dementia. In 1988 the National Institute of Neurological and Communicative Disorders and Stroke - Alzheimer's Disease and Related Disorders Association (NINCDS/ADRDA) research on diagnostic criteria were published for AD, which served to increase the validity and reliability of the clinical diagnosis.7, 8 Trials published prior to this time may reflect mixed populations other than AD. Other models used to diagnose dementia include: the International Classification of Diseases (ICD) version 9 or 10, the Diagnostic and Statistical Manual of Mental Disorders (DSM) III, III-R, and IV (American Psychiatric Association), and the NINCDS/ADRDA.9 The difficulty with these different diagnostic criteria for dementia is that they are not interchangeable. Erkinjuntti et al. (1997)1 compared six commonly used classification schemes (DSM-III, DSM-III-R, DSM-IV, ICD-9, ICD-10, and the Cambridge Examination for Mental Disorders in the Elderly (CAMDEX)). They showed that the prevalence of dementia can differ by a factor of 10 depending on the diagnostic criteria used. Two other studies have demonstrated that the prevalence of vascular dementia (VaD) varies with the classification system and therefore these criteria for diagnosis are not interchangeable.10, 11 Furthermore, there is controversy about the validity of the clinical classification of VaD, as autopsy confirmation often does not substantiate the clinical diagnosis.12, 13 The majority of dementias were actually AD with co-existing vascular and Parkinson's disease lesions.14 In contrast, the clinical accuracy of AD diagnosis is relatively high.7 The discovery that a long pre-clinical period precedes AD has led to the establishment of early diagnostic indices of dementia. This border zone between normality and dementia has been given numerous names and definitions, which include: benign senescent forgetfulness (BSF), age associated memory impairment (AAMI), age-consistent memory impairment (ACMI), age-associated cognitive decline (AACD), mild cognitive impairment (MCI), cognitive loss no dementia (CLOND), and cognitive impairment but not dementia (CIND). The prevalence for this pre-clinical or mild form of cognitive decline varies with the classification system used.15 Unfortunately, the classifications used to diagnose early mild cognitive decline are not interchangeable. MCI16 is emerging as the preferred term for this condition17 using the criteria of Petersen et al.16 Ritchie et al.18 (2001) estimated the prevalence of MCI to be 3.2% with an 11.1% conversion rate to dementia within a 3 year period.
Dementia is a chronic progressive disease for which no known cure exists. Pharmacological interventions used to treat dementia are intended to achieve at least one of the following broad therapeutic aims:
Prevention of onset of the disease. In the context of this review, this applies to those at greatest risk (such as those with the clinical diagnosis of MCI) of conversion to a dementia syndrome.
Symptomatic treatment of the disease. Symptomatic benefit can be described as maintenance (or stabilization) or improvement of the current cognitive, behavioral, functional, or caregiver status only while on active treatment with the pharmacological intervention. Withdrawal of the pharmacological therapy results in a decline towards baseline or placebo levels of relevant outcomes.
Delay in the progression of the disease. A therapeutic intervention that brings about delay in the progression of the disease can be described as either 1) one that maintains (or stabilizes) or improves current cognitive, behavioral, functional, or caregiver status, which is sustained, or 2) one that can be shown to alter the rate of decline of the disease progression, even when the drug is withdrawn.
Figure 1
It has been suggested that dementia does not have uniformly accepted criteria for disease progression or consensus regarding the magnitude of clinically important changes.19, 20 With respect to the therapeutic aim, the practical consequences of these unresolved issues are that the same efficacy variables have been used to both show evidence of symptomatic benefit and demonstrate the effects on disease progression. Thus, the design of a clinical trial (rather than the outcome) is critical to demonstrating which of these two therapeutic outcomes (symptomatic benefit or delay in progression) is being achieved with the pharmacological agent.21
Irrespective of which therapeutic aim is being achieved by the pharmacological agent, the lack of consensus on these two issues has even more important implications when considering the definition of “efficacy” for either treatment goal. A change in a relevant outcome measure that is due to factors other than chance is deemed statistically significant. The criteria to determine efficacy solely on statistical significance have long been recognized as problematic from an interpretation perspective. Clinically meaningful change reflects a different level of “significance” and often requires consensus among experts within the field to establish what magnitude of change is important.20
The dementia literature is not consistent in the criteria used for establishing efficacy, and there is no consensus on the meaning of clinical significance in the changes observed.20, 22 In general, attempts are made to select an outcome measuring an important dementia attribute (such as cognition) and an additional outcome evaluating global change as observed by a clinician (with or without input from the caregiver). The outcomes selected to reflect these two domains vary, as do the number of attributes that are selected for evaluation.
The United States Food and Drug Administration (FDA) has established criteria for efficacy of dementia (specifically AD) drug interventions,23 which require the following: 1) a double blind, placebo-controlled trial, 2) subjects who meet established criteria for AD, 3) sufficient length of follow-up to appreciate a meaningful effect of the drug on cognition, and 4) a clinical change of sufficient magnitude to be recognized by a clinician. In establishing these criteria, it was assumed that the outcome measuring cognition was the primary change of interest, and that the global clinical evaluation would mirror the changes in the primary variable.24 In 1997, the European Medicine Evaluation Agency (EMEA) issued new guidelines that incorporated two new concepts for the treatment of AD.25 Firstly, the EMEA guidelines suggested a preference for a measure of functional abilities in addition to a global measure, and noted that behavioral outcomes were important from a clinical perspective. Secondly, a definition of “responders” should be included in all trials, such that the degree of improvement in their cognition (or stabilization) was pre-specified. However, the magnitude of the change reflecting a clinically meaningful change was not specifically stated in either of these two guidelines. Sufficient magnitude of the change would reflect a clinically important difference, and this would vary with the type of outcome selected.
Several authors have attempted to define “clinically” relevant change. Gutzmann et al. (2002)26 developed an Efficacy Index Score (EIS), which is a checklist that combines dropout as well as the relevant improvements individually across the three levels of assessment (cognitive function, activities of daily living and global function). Although, this summary score has not been validated relative to other traditional outcomes, it does present a unique example of determining efficacy in the context of anti-dementia drug interventions. Mayeux and Sano (1999)27 in reviewing drug interventions for dementia, evaluated efficacy as a percent of the change in the treatment group relative to baseline (corrected for any change in the placebo group) and contrasted this with the percent of dropouts related to adverse events. Disease progression was considered with respect to the outcomes of 1) time until death, 2) nursing home placement, 3) loss of ability to perform Activities of Daily Living (ADL), or 4) severe dementia. In the context of clinical trials seeking to establish efficacy of pharmacological interventions, the latter outcomes may be problematic to ascertain.
Evaluation of the natural history of AD established some threshold values for expected decline or progression of the disease. Using the Alzheimer's Disease Assessment Scale-Cognitive Section (ADAS-cog),28 Rosen demonstrated that a decline of 1.28 points occurred within 12 weeks, a decline of 3.5 points within 6 months, and Stern et al. (1994)29 showed a decline of 9 – 11 points by 1 year. Clinical experience would also suggest that the decline is not linear, with less deterioration in the early and later stages and the greatest acceleration in the middle severity category. The characteristics of the natural history of AD and other dementia types are best derived from longitudinal studies. Although, more details on the natural history of dementia are being reported, the fundamental difficulty still remains concerning the diversity of the outcome measures used to describe these changes. The picture of cognitive, behavioral, and functional decline will therefore vary with the outcome measure selected to describe it. Additionally, the diversity has a negative impact on comparisons of drug efficacy that can be made across trials.30
No specific set of commonly accepted outcomes that define efficacy or “clinical relevance” applies to all the pharmacological interventions that have been used to treat dementia. More than 175 outcome measures are listed in Appendix E. EMEA guidelines acknowledge that no single test encompasses the broad range of disease characteristics associated with AD; nor has there been convincing evidence that an ideal (or reference) instrument exists to capture cognitive, behavioral, functional, or caregiver status.25 The FDA has recommended that “dual efficacy” of dementia drug interventions be established by significant change in both a psychological measure and a global change measure. The outcomes used to measure these attributes within these two domains were not specified. In practice, there has been a general trend in North America toward using the outcomes ADAS-cog, the Mini-Mental State Examination (MMSE), and the Clinicians' Interview-Based Impression of Change-Plus (CIBIC+) to capture the two domains when evaluating drugs for AD populations. However, these frequently used outcome measures may not be the best choice with respect to capturing “clinically relevant change”. The psychometric instrument properties must also be taken into consideration. For example, it has been suggested that the ADAS-cog is weighted predominately to evaluate memory loss at the expense of other cognitive domains (especially executive control functions),31 which suggests that the face validity of this instrument may be in question. The generalizability of these results may be limited to dementia in which memory impairment is a key feature as the instrument is less sensitive to personality and executive dysfunction changes seen in a less typical dementia, such as frontotemporal dementia. The responsiveness (ability to detect change) of the CIBIC+ has not been well established.32 This suggests that some of the most established outcomes used to evaluate efficacy of pharmacological interventions are far from ideal.
Demers et al. (2000)33 critically appraised some of the most commonly used scales evaluating global assessment,32 quality of life/ADL,30 and behavior/mood34 with respect to the quality of their psychometric properties. Several important limitations were identified in these reviews for the measures they evaluated, and these include 1) a lack of responsiveness data, 2) diversity in the content of the scales (capturing various aspects of a domain, for example, behavior), and 3) limited studies on reliability and validity (which are sample specific). The literature evaluating outcome measures used in dementia trials would suggest that most instruments have significant limitations, or at least more data are required to establish the required properties for acceptability of the scales.
Given the current state of development of research on outcome measures used in dementia trials for determining efficacy, a dilemma is clearly at hand. Ideally, all outcomes used to evaluate efficacy should have demonstrated acceptable psychometric properties, such as reliability, validity (construct), and responsiveness. However, since none of these outcomes have been accepted as standards, the selection of the most appropriate outcome is purely arbitrary. Similarly, establishing a rationale to exclude studies based on the specific type of outcome measure would be arbitrary. For this reason, no exclusion criteria based on outcome measures were used as eligibility criteria for this study.
Increasing attention has been given to the potential for harm, and not just benefits, when considering the efficacy of drug interventions. Empirical evidence across diverse medical fields indicates that reporting of safety information, including milder adverse events, receives much less attention than the positive efficacy outcomes.35 Thus, an evaluation of the benefits of anti-dementia pharmacological agents alone may present a biased view of the overall benefit of the intervention. In the context of this systematic review, the type and frequency of adverse events associated with the use of a drug intervention will be scrutinized to a greater extent than previous reviews of anti-dementia drugs.
Capturing and evaluating adverse events is problematic. Typical randomized controlled trial (RCT) dose finding studies should consist of the comparison of several doses of a drug versus placebo; efficacy is demonstrated relative to a placebo group or relative to a different dose group. Ideally, the goal of early phase trials is to estimate the minimum effective dose or the maximum safe dose (or both). However, it is misleading to assume that drugs shown to be safe and effective in trials are safe and effective in all other circumstances.36 The nature of pre-market clinical trials makes it difficult to evaluate the benefits of drugs for the universe of potential users, as criteria restricting entry into the trial do not necessarily reflect dementia patients in general. By their nature, some adverse events are not easily anticipated, and therefore are not screened for in some trials. The implementation of pharmaco-vigilance systems attests to the need for further capture of potential adverse events not captured in trials. Adverse events may be hard to predict or anticipate and are captured only if a trial protocol was designed to measure these events. A limited number of standardized instruments exist to capture these events reliably. Unique to individuals with cognitive decline is the potential problem of validity of the self-report instrument, even if completed by the caregiver. Furthermore, many trials may be underpowered to detect adverse events with an incidence of 1/1000.37 Despite these limitations, quality criteria for the collection and reporting of adverse events have been identified.35, 37 An instrument to evaluate the quality of reporting adverse events has been developed and used in this report to determine the strength of the evidence for adverse events in the context of determining efficacy.
Determining efficacy in dementia trials evaluating pharmacological interventions may vary depending on the selection of the analysis type. In general, the types of analyses of primary data in trials fall into two main categories: 1) intention to treat analyses (ITT) or last observation carried forward (LOCF), and 2) observed case (OC) or completed trial (CT). The advantages of ITT over OC analyses have been well explicated.38 It is recognized that non-compliance is not a random event; thus, ITT analyses should be used to base principal conclusions of efficacy.39 In the context of some anti-dementia drug therapies, where dropout rates due to adverse events and other non-compliance reasons may be high, the ITT analysis minimizes bias and the potential for type I errors when considering treatment efficacy. However, the ITT analysis, while less biased, does tend to reduce treatment effects to the extent that there are dropouts and crossover patients. The optimal analysis, when there is a large loss to follow-up, is to conduct the analysis both ways and look for consistency.
A large number of pharmacological interventions have been studied in dementia patients. These agents can be classified into three broad categories: 1) cholinergic neurotransmitter modifying agents, such as acetylcholinesterase inhibitors, 2) non-cholinergic neurotransmitter/ neuropeptide modifying agents, and 3) other pharmacological agents. Although only four agents have been approved by the FDA for the treatment of dementia, many other pharmacological agents are being evaluated in trials in off-label use. In both these circumstances, there was a need to determine the evidence to support claims of efficacy and to describe adverse events.
Given the range of pharmacological agents that have been used to treat dementia, evaluation of all of these interventions in a systematic review (which afforded a consistent methodology) should serve as a meaningful contribution in this area. The purpose of this systematic review is to answer the following questions:
Does pharmacotherapy for dementia syndromes improve cognitive symptoms and outcomes?
Does pharmacotherapy delay cognitive deterioration or delay disease onset of dementia syndromes?
Are certain drugs, including alternative medicines (non-pharmaceutical), more effective than others?
Do certain patient populations benefit more from pharmacotherapy than others?
What is the evidence base for the treatment of VaD?
This review considers different dementia populations (not just AD) and subjects from both community and institutional settings. The interventions were limited to pharmacological agents (including nutriceuticals), and these were not restricted to those that have received official approval in North America. The studies eligible in this systematic review were restricted to parallel RCTs, but the study outcomes were not limited to specific types.
The review will serve to evaluate the quality of the evidence and identify important gaps in the literature. Future recommendations will serve the dementia research community specifically. This evidence report will support the American College of Physicians (ACP) and the American Academy of Family Physicians (AAFP) in developing “best practices” and practice guidelines for the evidence-based treatment of dementia for providers, patients and the public.
A multidisciplinary local research team representing geriatric and dementia epidemiology/ systematic review methods (P. Raina, PhD), pharmaco-epidemiology (M. Levine, MD, PhD), geriatric medicine/ dementia (D. Cowan, MD; C. Patterson, MD), rehabilitation/ systematic review methods (P. Santaguida, PT, PhD), and neuropsychology (A. Unsal, PhD) was assembled. The core research team, including experienced staff at the McMaster Evidence-based Practice Center (EPC) (F. Baldassarre, MSc; L. Booker, BA; M. Gauld, BA) participated in regular meetings and reached consensus on key methodological issues. An international Technical Expert Panel (TEP) was assembled to provide high-level content expertise in dementia and participated in conference calls on an as-needed basis. Participants in this panel were: Larry W. Chambers, PhD. Ottawa, ON, Canada; Thomas Cook, MD. (ACP appointee) Colorado Springs, CO, USA; Rachelle Doody, MD, PhD. Houston, TX, USA; John Feightner, MSc, MD. London, ON, Canada; Rodney Hornbake, MD. (ACP appointee) Hadlyme, CT, USA; David Hogan, MD. Calgary, AB, Canada; Roy Jones, MD. Bath, UK; and Holly Tuokko, PhD. Victoria, BC, Canada.
The first step during the topic assessment and refinement process was to organize a teleconference with the partner organization, the Task Order Officer (TOO), invited topic experts, and the McMaster team in order to define the magnitude of the topic addressed and to refine/clarify the preliminary research questions for this evidence report. It was agreed that this evidence report would focus on addressing the efficacy of pharmacotherapies for dementia syndromes. Regular teleconferences were held with the TOO, the partner, and technical experts throughout the data refinement and extraction phase.
| Database searched | Search date | Period searched |
|---|---|---|
| Cochrane Central | February 3, 2003 | 1st Quarter 2003 |
| MEDLINE® & PreMedline® | February 4, 2003 | 1998 to 2003 week 4 |
| EMBASE | February 6, 2003 | 1998 to 2003 week 5 |
| AMED | March 4, 2003 | 1985 to 2003 February |
| CINAHL® | March 5, 2003 | 1982 to February 2003 week 3 |
| Ageline | March 6, 2003 | 1978 to 2002 December |
| PsycINFO | March 7, 2003 | 1967 to 2002 December |
Expert opinion was sought on the most efficient search strategies to minimize noise in the collection of citations. Some of the medical subject headings (MeSH) used to select RCTs yielded a large number of non-RCT literature due to misclassification of the study design terms. For example, in previous indexing, terms like “longitudinal study” or “comparative study” were applied to RCTs; conversely, the MeSH terms “random” or “randomized” in the title or abstract were not consistently used. However, some recent methodological work has suggested that more specific search term approaches can be used, which increases the sensitivity and specificity of the search results.40 The Cochrane Central Trial Registry contains correctly re-classified RCT/Controlled Clinical Trial (CCT) trials that were misclassified in MEDLINE® and EMBASE from 1966 to 1998. All published RCTs to 1998 are contained within this database. Hence we commenced our search with the Cochrane Central Trial Registry database. For this reason, MEDLINE® and EMBASE were searched from 1998 forward for relevant studies, and all the other databases from their inception.
Specific drug names and manufacturer brands were considered as potential search terms. However, the local research team was in agreement that listing specific drug names would bias the yield to include only those pharmacological agents searched and would not capture newer drug therapies. Thus the recommendation was to not restrict the search to known pharmacological agents but to include whatever agents were in the literature.
In addition to the electronic databases, the bibliographies of retrieved papers were retrieved. Any citations recommended by the local research team, the TEP, or the peer reviewers were retrieved and screened.
Inclusion. Studies were included that contained the following criteria:
Age: Studies involving dementia patients who were 18 years or older in age
Diagnostic model used to determine dementia: The diagnosis of dementia using any of these criteria:
Diagnostic criteria used to determine cognitive impairment (pre-dementia): In the case of not yet diagnosed dementia, specific diagnostic categories were accepted for the following:
Disease classifications for dementia: These included AD, senile dementia of the Alzheimer's type (SDAT), Lewy body disease, VaD, multi-infarct dementia (MID), AIDS/HIV dementia, Parkinson's disease dementia (PDD), progressive supranuclear palsy (PSP), mixed diagnosis dementia, encephalopathy, Mesulam syndrome, progressive non-fluent aphasia, Binswanger disease, subcortical leukoencephalopathy, circumscribed lobar brain atrophy, Pick disease, amyloid beta-protein (not Down's syndrome or trisomy), cerebral amyloid angiopathy, neurofibrillary tangles, threads, senile plaques, corticobasil ganglionic degeneration, cerebral autosomal dominant ischemia with subcortical leukoencephalopathy (CADISIL), Huntington's disease with dementia, hydrocephalus (for additional terms used in the search strategy, see Appendix A).
Severity classification: This was accepted in whichever classification system the studies specified. The majority of studies specified threshold criteria using the MMSE as follows: mild > 22, moderate 14 – 21, and 10 – 14 as severe. Many studies used the definition of mild to moderate as a range from 10 to 26 based on criteria established by Folstein et al.50 Some studies specified a category (i.e. mild to moderate) but did not report the baseline MMSE values for the groups compared.
Some studies specified two categories (mild to moderate) and (moderate to severe) based on the DSM-III-R criteria. Cambridge Examination for Mental Disorders in the Elderly (CAMDEX) specifies levels of severity (minimal, mild, moderate, severe). Similarly, some studies reported a category of severity without stating which method was used. In these instances, the category of severity specified was accepted as reported by the study authors.
Exclusion. Studies that had populations with any of the characteristics listed below were excluded.
Dementia disease classification: i) alcohol caused dementia/ Korsakoff's syndrome, ii) Creutzfeldt-Jakob syndrome, c) spongiform encephalopathy, iii) hypothyroidism, iv) vitamin B12 deficiency, v) neurosyphilis.
Dementia diagnosed using only Lowb, Hachinski (specific for VaD) criteria.51
All organically caused dementias which includes “Delirium, Dementia, Amnesic Disorders, and Cognitive Disorder Otherwise Specified. The predominant disturbance is a clinically significant deficit in cognition that represents a significant change from a previous level of functioning. For each disorder in this section, the etiology is either a general medical condition (although the specific general medical condition may not be identifiable) or a substance (i.e., a drug of abuse, medication, or toxin), or a combination of these factors.”43
Temporary dementia (e.g. side effect of anesthesia) classified as follows: Delirium: a delirium is characterized by a disturbance of consciousness and a change in cognition that develop over a short period of time. The disorders included in the “Delirium” section are listed according to presumed etiology: delirium due to a general medical condition, substance-induced delirium (i.e. due to a drug of abuse, a medication, or toxin exposure), delirium due to multiple etiologies, or delirium not otherwise specified (if the etiology is indeterminate).
Normal or healthy volunteers: studies that deal with healthy people (i.e. prevention is limited to people who have any form of the above); volunteer study population
General population of elderly persons.
Study subjects selected for depression (some patients may have dementia but not all) and where there is no stratified analysis by disease subgroup (i.e. the dementia subjects).
Study Design. Eligible studies included parallel design RCTs only. Although crossover trials are suitable for chronic diseases, they may be prone to period effects or period by treatment interactions. Period effects are systematic changes in the outcome that apply to all patients due to temporal changes in the disease or to the measurement instrument. Period by treatment interactions occur when the efficacy of the intervention varies by period. Additionally, a carry-over effect may occur if there is not an adequate washout period. Apart from the weaknesses of this design, some limitations arise when considering the potential for meta-analytic analyses. Traditionally, first period data from a crossover trial are abstracted and can be potentially combined with parallel trials for analyses of a pooled estimate; the reporting of the study results (positive or negative) would also be based on this first period data. In a preliminary phase of the review, several crossover trials were examined. Many did not report first period data, which precluded any potential for combining with parallel trials; many trials also did not undertake statistical tests during the first experience, thus making it difficult to report the direction and significance of the first period findings. Finally, because this systematic review was considering a variety of drug interventions administered over differing time intervals, period effects might be an important source of bias. For all these reasons, the decision was made to exclude crossover trials from this systematic review.
Language of Publication. Studies published in the English language were eligible. The scope and resources of this review did not permit translation of studies published in other languages.
Sample Size. No sample size restrictions were applied.
Treatment Interventions. Drug interventions were eligible in the following manner:
Pharmacological agents: all types of pharmacological treatment were considered in this review, including food supplements (as defined by the FDA). Government approval was not a requirement, and as such, off-label use of drugs (i.e. drugs approved for other conditions but used in the treatment of dementia) were eligible for this review.
Dose: all doses and dosing schedules and any mode of administration (oral, subdermal, transdermal, intravenous, suppository, or intra-muscular injection) were considered.
Treatment period: the period of treatment must equal or exceed 1 day.
Follow-up length: Any duration of follow-up was eligible. Different drugs require different time periods to show an effect. For example, antidepressant and anti-psychotic medications may take a month or more to be effective. Some dementia drugs take a minimum of 2 months. For interventions such as vitamin E or Ginkgo biloba, the time to effect is not well established. Thus, an absolute limit to the minimum number of months of follow-up could not be applied to all potential interventions. It was anticipated that many studies with some of the most recent pharmacological agents (i.e. donepezil) would have a minimum follow-up of 24 weeks.
Study Outcomes. No specific set of commonly accepted outcomes that define efficacy or “clinical relevance” were applicable to all the pharmacological interventions that have been used to treat dementia. The literature evaluating outcome measures in dementia trials would suggest that most instruments have significant limitations or at least more data are required to establish the required properties for acceptability of the scales. Since none of the outcomes used in dementia trials have been accepted as standards (no consensus), the selection of the most appropriate or clinically relevant outcome is purely arbitrary. Similarly, establishing a rationale to exclude studies based on the specific type of outcome measure would be arbitrary. For this reason, no exclusion criteria based on outcome measures were used as eligibility criteria for this study; rather the domains of interest for inclusion have been identified.
Studies with the following outcomes were included:
General cognitive function (e.g., ADAS-cog).
Specific cognitive function (e.g., Weschler Memory Tests).
Global clinical assessment (e.g., CIBIC).
Behavior/mood (disturbances characterized by agitation, wandering, sleep cycle disturbance, depression, obsessive compulsive activities) (e.g., Behavioral Pathology in Alzheimer's Disease Rating Scale (BEHAVE_AD)).
Quality of life/ADL (e.g., Instrumental Activities of Daily Living (IADL)).
Effects on primary caregiver (also referred to as caregiver burden).
Safety as measured by the incidence of adverse effects (e.g., particularly serious adverse events).
Acceptability of treatment as measured by withdrawal rate from trial due to side effects of the medication. (e.g., dropouts due to adverse events).
Mortality.
Dependency or Rate of Institutionalization/ or continued residence in own home.
Use of services.
Studies with the following outcomes were excluded as follows:
Studies which reported only biological/ physiological outcomes, such as plasma levels, changes on functional imaging, or electroencephalography (EEG) activity, were noted but not assessed as efficacy measures.
Outcomes reported in the trials should reflect changes in the person with dementia. If the study population did not all have dementia, only data subgrouped for dementia was examined.
Minimum quality threshold score for eligibility.
Exclusion part I: Pre-Jadad score. Studies were also screened to determine a minimum threshold for quality, sometimes described as “fatal flaws” in the trial design. Specifically, all studies had to include at least some mention of the term “randomization” or “withdrawal(s)” in the text of the paper. Trials that did not at least mention these components were excluded, as they possessed a fatal flaw.
Exclusion part II: Post-Jadad score. The methodological quality of the primary studies was assessed using the modified Jadad scale for RCTs52 (Appendix B). The reliability of this modified scale was shown to be high, as measured by the intraclass correlation coefficient (ICC = 0.90).52 Each study was evaluated by two reviewers, and the level of agreement was determined statistically. The first three items on the scale rate elements that have been shown to bias meta-analytic results. These include randomization, blinding, and withdrawal. If these items alone are considered, the maximum score is 5. Any study that did not score 3 or more on the scale was excluded from the review. Therefore, this review abstracts detailed data only from studies that achieved moderate to high ratings on the quality scale.
Evaluating the methodological quality of studies and rating the strength of the evidence.
Quality of the RCT. The methodological quality of the primary studies was assessed using the modified Jadad scale for RCTs.52
Quality of reporting adverse events. The potential for risk, or adverse events, was an important component to consider with respect to efficacy. The Jadad scale for quality does not take into account factors associated with adequate collection and reporting of adverse events as detailed by Ioannidis and Lau (2002).35 Therefore, a summary checklist was developed to determine the potential quality in the collection and reporting of adverse events (Appendix B). This score was used to evaluate the relative quality of the adverse events reported.
During the identification phase, two independent reviewers evaluated the title and abstract for eligibility; those meeting the criteria were retrieved as well as those that reported insufficient information to determine eligibility. Two independent reviewers examined the full text of these articles (passing from the title and abstract phase). All studies meeting eligibility criteria were reviewed to assess quality and abstracted according to predetermined criteria. The articles were grouped according to the pharmacological agent used in the intervention.
A team of study assistants was trained in the eligibility criteria for the purposes of this systematic review. Standardized forms and a guide explaining the criteria were developed from previous templates (Appendix B). Two reviewers were used for the identification, selection, validity, and abstraction phases of the systematic review. Disagreements were resolved by consensus. The reviewers were experienced EPC staff with post-graduate training in research methods. The reviewers and abstractors would consult with more senior members of the TEP for content expertise or methods-related issues.
It was expected that studies of the pharmacological agents used in the management of dementia would be quite diverse with respect to the intended therapeutic effect. For these studies, evidence and summary tables (Appendix C) were constructed to describe the more salient characteristics of the included studies.
Statistical meta-analysis was not appropriate for all outcomes or interventions. Before calculating a pooled effect measure, the reasonableness of pooling was assessed on clinical and biological grounds, in terms of clinical homogeneity. Tabular summaries of key characteristics, participants, interventions, and outcomes were considered. A priori, it was decided that pooled estimates would be undertaken for studies with the same pharmacological intervention and the same outcome measure and that a minimum of three studies was necessary for pooling for a specific outcome*. Consideration was given to the similarity of study populations when selecting studies to be included in the pooled estimates. Although many studies evaluated multiple outcomes, data necessary for meta-analysis were not provided in all eligible trials. When sufficient data were provided to estimate the weighted mean difference (WMD), then a meta-analysis was undertaken. WMD was selected as the pooled estimate (versus the standardized mean difference) because the outcome measures did not differ between studies eligible for pooled estimates. For WMD, the difference between the treated and control groups are weighted by the inverse of the variance.
Analysis was undertaken in RevMan 4.2 (Review Manager, Cochrane Collaboration, 2003), and the random-effects model was used to conduct our analyses. In cases where heterogeneity existed, the results of the random-effects model only were considered for interpretation of the results of the pooled estimate. RevMan 4.2 automatically tests the homogeneity of the results of the individual studies for each comparison of dichotomous or continuous data. Tests of homogeneity are formal statistical analyses for examining whether the observed variation in study results is compatible with the variation expected by chance alone. The more significant the results of the test (the smaller the p-value), the more likely that the observed differences were due to unknown factors likely not controlled for in the study. Sensitivity analysis or meta-regression was not undertaken to assess the extent to which the methodological quality of studies, population characteristics, dose, etc., accounted for variation in the primary outcome.
Power analyses were conducted for select pharmacological interventions reporting non-significant findings for all primary outcomes reported in the paper. In addition, if the trial reported the outcomes of MMSE, ADAS-cog, or the CIBIC+, the power for these was also estimated. It was assumed that the desired level of significance was set to alpha equal to 0.05. Adequate power was defined as at least 80% power.
A list of potential peer reviewers was created at the outset of the study. During the course of the project, additional names were added to this list by the McMaster Center and Agency for Healthcare Research and Quality (AHRQ). In May 2003, the individuals on the list were approached by the McMaster team and asked if they would act as peer reviewers of this evidence report. A total of 26 experts agreed and received a copy of the draft report and a copy of the “Structured Format for Referee's Comments” (Appendix D). A list of the reviewers' names and their affiliation is provided in Appendix D. In addition, a criticism editor, Dr. Patricia Huston, who is external to the McMaster EPC, was asked to review the draft report and synthesize the peer review comments. The report from the criticism editor was then used to prioritize the incorporation of peer review comments into the final version of this evidence report.
In this chapter, the presentations of the main results of the systematic review are organized according to the five questions that were addressed. The first question, concerning efficacy of the pharmacological interventions, contains results from all eligible studies. Subsets of trials were then selected from this larger set to address the remaining four questions (see Chapter 2 Methods).
Figure 2
Figure 3
This systematic review yielded a total of 97 pharmacological agents used in the treatment of dementia from 186 unique studies. These 97 interventions have been classified according to three broad categories: 1) cholinergic neurotransmitter modifying agents, 2) non-cholinergic neurotransmitter/ neuropeptide modifying agents, and 3) other agents.
The largest number of eligible citations evaluated was cholinergic neurotransmitter modifying agents (n = 72). The remaining citations were distributed amongst the non-cholinergic neurotransmitter/neuropeptide modifying agents (n = 61) and other agents (n = 76) categories. Some studies evaluated agents in more than one category. The results for all pharmacological agents are presented in this chapter in summary format with descriptive text and an overall summary table (OST) for each drug located at the end of this chapter. The specific details abstracted from each individual study are presented within Appendix C (guide to the results tables) and organized into these same three therapeutic effect classification groups and by pharmacological agent. These Evidence Tables are available on-line at http://www.ahrq.gov/clinic/epcindex.htm.
Appendix C contains three sets of tables with key study descriptors as follows:
Key characteristics. Summarizes the following aspects of each study: features (author, year published, funding source, modified Jadad scale quality score, number randomized, number completing the trial, subgroup analysis), population characteristics (diagnosis, criteria for diagnosis, disease severity, percent male, age, dwelling, and differentiating demographics), intervention (doses, titration scheme, and intervention period), and a complete list of outcomes administered in the study protocol.
Study results. Details the changes observed (the magnitude of theses changes, the comparison groups analyzed, and the findings of any statistical testing) for those outcomes for which appropriate data was reported (for up to three time periods if available). When reported in studies, baseline measures, particularly MMSE score, were also detailed in these tables.
Study adverse events. Lists the specific types of adverse events (side effects, adverse reactions, and serious events) reported, any statistically significant differences between groups, the proportion of withdrawals due to adverse events, and the quality rating score (based on a checklist devised at the McMaster EPC and on the work of Ioannidis and Lau (2002)35) specific to the collection and reporting of these adverse events.
| General cognitive function measure | Specific cognitive function measure | Global Assessment | Behavior/Mood | Quality of Life/ADL/Function | Caregiver Burden | Other | |
|---|---|---|---|---|---|---|---|
| ADAS-Cog (also ADAS-11) | ACPT | R-AVL | ADAS | ABID | ABS | CATS | CAUST |
| AMTS | Babcock Story recall | RM | ADCS-CGIC | ABSR | ADCS-ADL | CSS | SAS |
| BCRS | Barbizet Visuospatial | RPM | ADS | ACES | ADFACS | CSI | AIMS |
| CamCOG | BLM | Rey Memory Test | AGS-E | ADAS-Non-cog | ADL | SCB | BARS/ BAS |
| CASI | BNT | Set test | Bf-S | AFBS | ADL-C | ERP | |
| CETM | BSRT | Snodgrass Picture Naming Task | BGP | BDI | ADL-PDS | ESRS | |
| IQCODE | BSV | SRT-DR | Blessed-D/ BDRS | BEHAVE-AD | BI | Finger Tapping | |
| MCPT | BVR | SWFIT | CAPE | BPRS | Dependency Scale | Test | |
| MMSE | CCASSS | SWFT | CDR-NH | BRMS | DAD | SAS | |
| MMMSE | Category Fluency | SKT | CDR-SB | BRDS | FAST | UPDR | |
| SMMSE | CDT | TK | CGAE | CERAD-BRSD | FIM | ||
| CMMSE | CNTB | TMT | CGI | CMAI | IADL | ||
| MQ | Controlled Challenge Word Association | WMS (MQ) | CGIC | CS or CSDD | IDDD | ||
| RMT | COWAT | WMS-RR | CGRS | DSCS | NAA | ||
| RVM | CVLT | ZVT | CIBIC | DSS | NAI | ||
| SIB | Digit Span Test | CICIC+ | Facial Behavior | OARS-ADL | |||
| SMQ | DSST | DBDS | GS | PDS | |||
| SMST | EFR | DMR | HAM-A | PSMS | |||
| TP, TPAT | FCMT | DRS | HAM-D | PSQI | |||
| WAIS | FIGT | EIS | HDRS | QoL | |||
| FOM | FCCA | HDS-R | QoL-P | ||||
| GAGS | FRS | IPSC-E | QoL-C | ||||
| Grooved Pegboard Test | GERRI | LPRS | SF-36 | ||||
| Letter Cancellation | GBS | MAACL-R | SIP | ||||
| Letter Fluency | GDS | MADRS | Time to functional decline | ||||
| LMT | GIS | MOSES | |||||
| LNNB | GPI-E | NAB | |||||
| MAE | HDS | NMS | |||||
| MEMT | HIS | NOSGER-IADL | |||||
| MNLT | MAC-F | NPI-NH | |||||
| NCT | NOSGER | NSL | |||||
| NDT | NOSIE | OAS | |||||
| NLT | NPI (NPI-4, NPI10) | PANSS-EC | |||||
| NMIC | PDRS | POMS | |||||
| NST | PGIR | RMBPC | |||||
| OLT | Plutchik CGS | RPT | |||||
| OMDR | RAGS | SBI | |||||
| RGRS | SHGRT | ||||||
| SCAG | SRT | ||||||
| Stockton GRS | VHB | ||||||
| TSI | |||||||
| VRGI | |||||||
For each of the outcomes reported by a study, four interpretations of the results were possible. The four options for interpretation are as follows:
SC = significant change. Demonstrated by statistical significance (p ≤ 0.05) for the primary outcomes from an ITT analysis comparing treatment and placebo groups, or comparing differences among dose groups.
NS = not significant. The corollary of SC indicating no statistical significance.
MX = mixed results. Primary outcomes within the same domain show opposite or inconclusive statistical significance; for example, in the general cognition domain, half the studies show significant change and the other half show no significance).
NR = not reported. Outcome was collected but not statistically evaluated or not reported in the publication.
NT = not tested. No outcomes in this domain were tested.
Secondary outcome results were reported in the absence of any primary outcome data (for the domain of interest) and were demarcated with a (2°) in the OST. Similarly, analyses other than ITT were denoted with an asterisk (*) in the OST. If the report describes my subgroup analyses, the word SUBGROUP appears in the “other” column.
Adverse events were not always clearly described in many studies. A priori, we selected 5 generic symptoms (nausea, dizziness, agitation, eating disorder, and diarrhea) and selected to detail the ranges amongst studies for both placebo and treatment groups for these symptoms. The percent of withdrawals for both groups due to adverse events was reported. Adverse events reported to be statistically significant are highlighted for the reader. The details in addition to the quality score rating will assist the reader in evaluating the potential for harm.
Power Analyses and Measures of Effect for combined studies. Power analyses (PW) were for individual trials for select pharmacological interventions (donepezil, galantamine, tacrine, rivastigmine, memantine, estrogen, carnitine, ginkgo biloba, selegiline) for all primary outcomes. In addition, if the trial reported the outcomes of the MMSE, ADAS-cog, or CIBIC+, power was also estimated (for individual trials of pharmacological interventions that had a minimum of three trials with a common outcome). Quantitative meta-analyses were undertaken in interventions that had a minimum of three trials using the same outcome scales and which provided sufficient data to permit calculation of effect sizes (as an Odds Ratio (OR), Relative Risk (RR) and Weighted mean difference (WMD)). The random-effects model results are presented to the reader.
| Drug | Number of studies vs. placebo |
|---|---|
| Antagonic Stress *[Meclofenoxate] *[Nicergoline] | 0** |
| Carnitine | 6 |
| Donepezil *[Vitamin E] | 10* |
| Eptastigmine | 2 |
| Galantamine | 6 |
| Huperzine-A *[Tablet Capsule] | 1* |
| Linopirdine | 2 |
| Mexofenoxate *[Antagonic stress] | 0* |
| Metrifonate | 9 |
| Nicergoline*[Antagonic Stress] | 4* |
| Physostigmine | 4 |
| Posatirelin *[Citicoline] | 4* |
| Rivastigmine | 6 |
| Sabeluzole | 1 |
| Tacrine *[Idebenone] *[Silymarin] | 6** |
| Velnacrine | 3 |
Number of studies. A total of six studies54, 55, 56, 57, 58, 59 evaluating carnitine were included in this review. Four of the reports were published from 1990 to 199256, 57, 58, 59 while the remaining two, both by same authors, were published in 1996 and 2000.55, 54
Design/methodology. A total of 925 subjects were evaluated in these six studies comparing carnitine and placebo. The range of study sample sizes was from 30 to 431 subjects. Quality scores (out of 8 points) ranged from moderate57 (5) to high58, 59 (7), and all of the studies were partially or totally funded by industry.
Populations. All trials were conducted on AD patients, and all but one study used the NINCDS criteria for diagnosis. None of the trials reported including patients with severe dementia; all were classified as mild to moderate.
One trial58 had a mix of community and institutional patients, and one study reported using a community sample.56 The mean age of the samples ranged from 5954 to 79 years,59 with the majority reporting mean age greater than 70 years. One study56 did not report mean age. Three studies specified the baseline MMSE54, 55, 57 (range 16.1 to 20.6) and one trial specified the modified MMSE56 (mean 35) demonstrating no differences between placebo and treatment groups.
Intervention. The dose varied from 2 to 3 grams per day, and treatment duration was either 24 weeks56, 57, 59 or 52 weeks.54, 55, 58 One study57 did not report the dose used. No titration period was used for this drug in any of the studies.
Primary outcomes. All of the studies measured cognition as a main outcome; half of the trials also measured outcomes in the behavior/mood domain.58, 54, 55 All but one of the studies58 used a quality of life/ADL or functional status measure. Only one study55 evaluated caregiver burden. Of the four studies59, 58, 54, 55 reporting the findings from a global measure, only one55 used the Caregiver-rated Global Impression of Change (CGIC).
Analysis. Half of the studies reported ITT analyses58, 54, 55 and the remaining trials reported OC results.57, 59, 56 The ability to combine results was limited with only three studies having the common outcomes of ADAS-cog, modified MMSE (mMMSE), and Clinical Dementia Rating (CDR).
Results and interpretation. See Summary Table 1. The four studies that evaluated “general cognitive function” did not find statistically significant differences in this domain. For those trials that provided sufficient data to estimate power, three trials54, 55, 56 were underpowered for the MMSE (PW = 0.15 to 0.19), and two trials54, 55 for the ADAS-cog (PW = 0.08 to 0.09) and the CDR (PW = 0.06 to 0.11). Meta-analysis was undertaken for the MMSE scores; although favoring a treatment effect, the pooled effect size (WMD = 0.55) was modest and zero was contained within the confidence interval (Figure 4
Number of studies. A total of 11 studies60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70 evaluating donepezil were eligible for this systematic review. One study 70 compared donepezil to vitamin E rather than placebo. All were published within the last 6 years (n = 1, 1996), (n = 2, 1998), (n = 1, 1999), (n = 5, 2001), (n = 2, 2002). Three of these studies66, 67, 69 were undertaken by the same research group at different time periods and had unrelated cohorts of patients.
Design/methodology. A total of 3239 subjects (range of study sample size, 30 – 893) were included in these trials. The modified Jadad scale quality scores ranged from 561 to 8.63, 64 All studies were funded by industry sponsors with the exception of a single trial70 that did not specify their source of support.
Populations. All but one study60 used the NINCDS criteria to diagnose dementia. Eight studies included only AD patients, one study included only VaD,68 and the remaining two included Parkinson's disease dementia (PDD65) and AD patients with cardiovascular disease.64 A single trial included subjects with Down's syndrome and AD.60 The severity of the dementia patients was described as mild to moderate in five studies,65, 66, 60, 62, 70 mild to moderately severe in two studies,69, 67 probable in two studies,61, 68 and moderate to severe in two studies.63, 64
Some studies specified that the dementia patients were recruited from the community,68, 63, 60 one study from institutional setting,64 and the remaining did not specify the living arrangements. Mean ages of the study subjects ranged from 54 to 85.7 years with most studies representing ages in the upper to mid 70s.
Six studies 61, 62, 65, 66, 67, 69 specified the race of the subjects, and of these, the overwhelming sample was Caucasian (range from 92 – 100%). All but one study68 specified the proportion of men recruited, the range being from 18 – 46%. Four of these studies presented some results stratified by gender,62 age,64 APOE genotype,62 baseline MMSE,63, 64 patients with Down's syndrome,60 and the use of psychoactive drugs.63 Three studies specified the baseline MMSE61, 64, 70 demonstrating no differences between placebo and treatment group, and the mean values varied from 14 to 16.
Intervention. Five studies evaluated a 10 mg dose given once daily,60, 61, 62, 63, 66 two studies 5 mg daily,64, 69 and four studies compared 5 mg and 10 mg dose groups.65, 68, 67, 70 Titration periods observed included 7 days65, 67 and 4 weeks,70, 68 these were not specified in the remaining studies. The total duration of the drug (including titration) varied from 12,65 15,67 23/24,60, 63, 64, 65, 66, 68 and 54/5661, 62 weeks.
Primary outcomes. Specific cognitive tests and caregiver burden were not evaluated in these studies. Nine studies used the MMSE, and six studies the ADAS-cog.
Analysis. All but one of the studies60 comparing donepezil to placebo used ITT analysis. The study using OC analysis showed no statistical difference between treatments. It had the smallest sample size of 30 subjects and was underpowered (PW = 0.16) for the behavioral measure used, NPI.
Five of the eight studies60, 61, 63, 64, 65, 66, 67, 69 measuring Activities of Daily Living (ADL) found a significant difference in the various outcomes used to assess ADL, but none of these could be combined into a summary estimate. It should be noted that the majority of trials selected these ADL variables as secondary outcomes. Behavior outcomes were not significant or showed mixed results for the three studies that evaluated this domain but these lacked sufficient power. Only one study collected caregiver stress and health service utilization outcomes63 but did not report these data.
Number of studies. Six studies of galantamine71, 72, 73, 74, 75, 76 were included in this review. All compared galantamine with placebo and were published between 2000 and 2002 (from six different authors).
Design/methodology. The sample sizes for subjects ranged from 28573 to 97874 with 3530 subjects evaluated in total. All quality scores were high, with either 7 or 8 on the Jadad scale. Funding sources for these studies varied; one study did not report funding source,74 one was funded by a non-industry source,75 one was partially funded by industry,71 and three were funded by industry.72, 73
Populations. All but one of the studies71 included AD patients only, and this single study mixed VaD and AD patients. All subjects in these studies were classified as mild to moderate.
One study specified that the subjects were from the community.73 All studies included from 36 – 53% male subjects, and the mean age ranged from 72.2 to 76.8 years. Three studies71, 74, 76 specified race, and with the largest proportions being white (range from 91.5 – 99.9%). Two studies evaluated subgroups, based on baseline MMSE75 and APOE genotype.75, 76
Intervention. All studies had a titration period, starting at 4 mg per day71, 73 or 8 mg per day.72, 74, 75, 76 Four studies increased the dose weekly,71, 72, 75, 76 and one study increased every 2 to 3 days.73 All studies had a treatment dose of 24 mg per day. Three studies73, 75, 76 included other doses ranging from 16 mg per day to 36 mg per day. Trials were undertaken for a minimum of 3 and maximum of 6 months.
Primary outcomes. All domains were measured except for specific cognitive tests and caregiver burden. All studies used the ADAS-cog and CIBIC or CGIC measures as primary outcomes. None reported baseline mean MMSE values.
Analysis. All but one71 of the studies reported ITT analysis, and the results of this study did not differ from the others.
Number of studies. Nine studies78, 79, 80, 81, 82, 83, 84, 85, 86 were eligible for this systematic review. Studies were published from 1996 to 1999, and all studies compared metrifonate to placebo.
Design/methodology. Overall, 2759 subjects were evaluated. Sample sizes randomized ranged from 27 to 605 subjects. Quality of the studies ranged from 580 to 8,81 with the majority of studies earning 6 points.82, 83, 79, 85, 78 Two studies were non-industry-funded78, 79 and one did not specify the source of funding,80 while the other six studies were partially or wholly funded by industry.
Populations. The subjects in all included trials were classified as having mild to moderate AD. Not all trials specified the source of recruitment or the racial composition of subjects. Three studies specified a community sample,86, 79, 85 and three trials reported the racial composition l,83, 86, 84 which was greater than 90% white in all cases. Mean age for all of the studies ranged from 71.4 to 75.0 years, with one study not reporting the mean age.80
Intervention. All but one study86 reported the loading dose, which varied from 0.5 mg per kg to 5.0 mg per kg. Following this initial loading period, the maintenance dose varied from 0.65 mg per kg to 4 mg per kg and 50 mg per day. The duration of the study treatments varied from 21 days to 26 weeks.
Primary outcomes. All outcome domains were evaluated with the exception of caregiver burden. ADAS-cog, CIBIC+, and MMSE were most frequently used as outcomes.
Analysis. Four trials reported OC analyses78, 79, 80, 85 and the remaining reported ITT analyses.
With the exception of a single study, quality scores for reporting adverse events were greater than 3 and generally well reported. However, only one trial 83 tested for differences between groups and found nausea and vomiting, diarrhea, and muscle and joint disorder to be significantly different. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea and vomiting (placebo = 3 – 14%, all doses = 2 – 50%), 2) dizziness (placebo = 1%, all doses = 3 – 4%), 3) diarrhea (placebo = 4 – 14%, all doses = 11–19%), 4) agitation (placebo = 2 – 14%, all doses = 8 – 33%), and none reported eating disorder as an adverse event. Withdrawal rates due to adverse outcomes varied from 0 – 9% for placebo and 0 – 12% for treatment groups. Some studies indicated arrhythmia80, 82, 83, 85 and hypotension82, 85 and hematological abnormality82 but did not test for differences between groups. The majority of studies reported that laboratory tests including liver function and hematology were within normal limits. Overall, it was difficult to determine which types of adverse events reported had the potential to cause serious harm. This is some concern as metrifonate is no longer used as a therapy for dementia due to its potential for serious adverse events that include: respiratory paralysis, bradychardia, severe leg cramps and dyspepsia.87
Number of studies. Four studies88, 89, 90, 91 compared the effect of nicergoline to placebo, and one study,92 published in 1994, compared nicergoline to antagonic-stress. The four placebo comparison studies were published in 1995,91 1997,89, 90 and 2001.88
Design/methodology. Sample sizes in the controlled studies varied from 108 to 346, with the total number of subjects included totaling 705. The drug versus drug study had only 62 subjects. The placebo trials all had quality scores of 6 points, while the non-placebo trial had a quality score of 5. Funding sources were reported only in two studies,88, 90 and both were industry-funded.
Populations. These studies had a very mixed population of dementia patients. Two included AD only,88, 92 one trial MID only,89 one trial included both senile dementia of Alzheimer type (SDAT) and MID,91 and one trial included PDD, VaD, and mixed dementia.90 All subjects had mild to moderate dementia. Studies included 38 – 55% male subjects; one study91 did not report the gender proportions. Mean age of subjects ranged from 69.3 to 73.7 years with one study not reporting91 this value. One study91 compared SDAT patients to MID patients.
Intervention. All trials versus placebo used 60 mg per day, but duration varied from 2 months,91 6 months,88, 89, 92 and 12 months.90
Primary outcomes. Caregiver burden was the only domain not evaluated by at least one of the studies. Three trials89, 90, 91 specified baseline MMSE, and this varied from 20 to 22.
Analysis. Two of the trials reported OC analyses90, 91 and two reported ITT analyses89, 88; the trial comparing nicergoline to antagonic-stress presented OC analysis only.
Quality scores for reporting adverse events varied from 2 to 5 for these four trials, and none tested for differences between groups. Withdrawal due to adverse events varied from 0 – 8% for placebo and 0 to 9% for the treatment group. The trial with the lowest quality reporting score 89 reported the most number of different events (up to 23 event types). With the exception of headache, which was reported in all four trials, it was difficult to determine which types of adverse events most characterized exposure to this pharmacological agent. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 3%, all doses = 3%), 2) dizziness (placebo = 1 – 2%, all doses = 0% or not reported), 3) diarrhea (placebo = 2 – 6%, all doses = 2 – 4%, 4) agitation (placebo = 5%, all doses = not reported), and none reported eating disorder as an adverse event.
Number of studies. Four studies were eligible for our review,94, 95, 96, 97 all comparing physostigmine to placebo only. The studies were from 1996, 1999, and 2000 with two of the studies being by the same author.95, 96
Design/methodology. Sample size ranged from 17697 to 47596 with an overall total of 1198 subjects. Quality scores were 5,96 6,94, 95 and 797 out of 8 possible points. Two studies were industry-funded,97, 96 one was partially funded by industry,95 and one did not report funding source.94
Populations. All subjects had a diagnosis of mild to moderate or probable AD. Only one study96 reported that all subjects were drawn from the community. Mean age ranged from 68.6 to 73.4 years, and the proportion of male subjects varied from 39.8 – 63%.
Intervention. Treatment schedules varied across the studies. One study94 used a patch (30 and 60 mg), one trial used 30 mg (15 mg twice daily),97 one trial had a washout and titration every 3 weeks to 30 or 36 mg per day,96 and another trial titrated weekly to 15 mg twice daily.95 Duration of treatment ranged from 6 weeks95 to 24 weeks.94, 96
Primary outcomes. All studies used the ADAS-cog as a primary outcome, and none reported caregiver burden or behavior/mood measures. No studies reported baseline MMSE scores.
Analysis. All but one trial94 used ITT analysis.
The quality scores for reporting adverse events were generally low, scoring 1 or 2 out of 5. Withdrawal rates due to adverse events varied from 1 – 5% for placebo and 12 – 55%in the treatment group, with one study97 not reporting rates. The high withdrawal rates were in studies with sample sizes that varied from181 to 475 subjects. A single study97 tested for differences between groups, and found that dizziness, tremor, weight loss, asthenia (varying from 6 – 22% for all doses), confusion, delirium, and respiratory problems were significantly different. The cluster of reported types of adverse events suggests that gastrointestinal problems (abdominal pain, diarrhea) (placebo = 1 – 9%, all doses = 13 – 28%), nausea and vomiting (placebo = 1 – 9%, all doses = 9 – 75%) and eating disorder (placebo = 2 – 6%, all doses = 5 – 16%) were most frequently reported. Dizziness (placebo = 4 – 13%, all doses = 11 – 38%) and agitation (placebo = 6 – 16%, all doses = 4 – 8%) were also reported. No events deemed serious enough for hospitalization were reported.
Number of studies. Four studies98, 99, 100, 101 compared posatirelin to placebo, and one of these 101 also compared it to citicoline. One study was published in each of the years from 1995 to 1998, and all studies were conducted in Italy. Two studies were by the same author.100, 101
Design/methodology. Populations randomized in the studies varied from 136100 to 36099 with a total of 931 in all trials. Quality scores ranged from 598 to 7100 out of a possible 8 points. Three of the four studies did not report the source of their funding, but one trial100 reported partial funding by industry.
Populations. No two studies included exactly the same populations; one trial had only AD,101 one trial had only VaD,100 one trial had mixed AD and VaD,99 and another trial had mixed AD, VaD, and PDD.98 This latter trial98 compared populations in a subgroup analysis of AD versus VaD. All studies evaluated populations with mild to moderate disease.
The mean age of the subjects ranged from 69.4100 to 78.898 with the percentage of male subjects varying from 34101 to 66%.100 One study100 included a dementia population who also had hypertension.
Intervention. A dose of 10 mg per day was used in all studies, and treatment interval varied from 3 to 4 months.
Primary outcomes. General cognitive function was evaluated in all studies using the intellectual impairment Gottfries-Bråne-Steen (GBS) subscale and the MMSE was used in one trial.101 Specific cognitive function was evaluated in one trial.100 Quality of life/ADL was evaluated with the ADL subscale of the GBS, and behavior/mood was evaluated with the emotional impairment subscale of the GBS. GBS total score was assumed to be a measure of quality of life/ADL rather than global assessment. None of the trials reported baseline MMSE scores.
Analysis. Two of the studies100, 101 used OC analysis to report outcomes.
Quality scores for reporting adverse events varied from 2 to 4. Withdrawal rates due to adverse events ranged from 0 – 3%in placebo and 0 – 4%in the treatment group. One trial100 did not report the rate of withdrawal. None of the studies tested for significant differences between groups. All studies reported the presence of agitation, and at least three studies reported arrhythmia, nausea/vomiting, headache, rash/skin disorder, and sleep disorder; there is no evidence to suggest that these differed between placebo and treatment group. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 3%, all doses = 1 – 4%), 2) dizziness (placebo = not reported, all doses = 1%), 3) diarrhea (placebo = 2%, all doses = 2%), 4) agitation (placebo = 1 – 5%, all doses = 1 – 5%), and none reported eating disorder as an adverse event. No serious adverse events were reported.
Number of studies. Six studies were eligible for this review, all comparing rivastigmine to placebo. Studies were published in 1998,102, 103 1999,104, 105 2000,106 and 2001.107
Design/methodology. Six studies evaluated 2071 subjects in total, with studies ranging from 27107 to 725104 subjects. The quality of studies varied from 5 to 8, with three studies106, 104, 103 scoring 8 points and one107 earning 5 points. All studies were funded by industry sponsors.
Populations. Four studies were evaluated in AD patients,107, 105, 104, 103 one trial102 dementia of the Alzheimer's type (DAT), and one study106 Lewy body dementia subjects for mild to moderately severe subjects. One study103 reported a subgroup analysis by vascular risk. One study reported a community sample in their trial.104 Mean age for the studies ranged from 69.4102 to 75.9 years.107 Two studies107, 105 did not report the ratio of male subjects in their study, and the other four varied from 39 – 56%. One trial104 reported co-morbidity of diabetes, hypertension and arthritis; one trial103 reported concurrent medication use for cardiovascular, gastrointestinal and analgesic aids.
Intervention. Doses for rivastigmine varied from 1 mg103 to 12 mg,106, 105, 104 and treatment duration varied from 14104 to 26107, 103 weeks. All studies titrated the dose of drug over a period ranging from 2 weeks106 to 12 weeks.107
Primary outcomes. The ADAS-cog and CIBIC+ were evaluated in half the studies. Baseline MMSE was reported in two trials,103, 106 and the mean scores varied from 18 to 20. Specific cognitive function, behavior/mood, and quality of life/ADL were infrequently evaluated, and caregiver burden was not evaluated in the trials.
Analysis. Trials were evenly divided between ITT analyses106, 104, 103 and OC.107, 105, 102
Results and interpretation. See Summary Table 8. Although, general cognitive function was evaluated in five studies, only four reported findings, which were all statistically significant; one of these was borderline significant (p = 0.054) (see Summary Table 8). Two of these studies104, 103 represented two sites (North American and European) of the same protocol. Although the same protocol was used, one study103 found significance for both high (6 – 12 mg) and lower (1 – 4 mg) dosages, but the other trial did not show significance for the lower dose, likely due to lack of power for this outcome (PW = 0.67). Interestingly, at the 12-week midpoint, the low dose groups in both these studies appeared to be worse than placebo for all primary outcome measures, but then migrated to improvement at the 26-week endpoint. For those studies105, 104, 103 that reported ADAS-cog change scores from baseline for the treatment group, mean change values varied from -2.75 to 0.26. Figure 17
Two trials evaluated specific cognitive tests; one trial106 showed statistical significance for the CCASSS but not for the MMSE (general cognitive test). Similarly, another trial105 found the Weschler Logical memory (instant recall) to be significant but the ADAS-cog (general cognitive outcome) was borderline significant.
With respect to global changes, five of six studies showed significant changes, and from these, three studies104, 103, 102 were for the high dose only. One of these studies102 defined the high dose as 6 mg per day, which was the minimum dose level for the other two studies. One study107 showed a statistical difference for the two deterioration categories (5 – 7) in the CIBIC+, but not in the improvement categories when comparing treatment and placebo groups. Figure 18
Quality scores for reporting adverse events varied from 2 to 5. Withdrawal rates due to adverse events ranged from 4 – 11% in the placebo and 11 – 27% in the treatment group. One trial107 did not report the withdrawal rates or the types of adverse events observed. Two trials103, 104 demonstrated a dose response; however, one of these trials104 showed significant differences for nausea and vomiting only, and the other trial103 showed significant for all the adverse events reported. With respect to the types of adverse events, the majority of studies reported dizziness, nausea and vomiting, eating disorder/weight loss, and headache. It should be noted that one study105 allowed intentional prescribed anti-emetic drugs to increase the tolerance of subjects taking rivastigmine. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 3 – 10%, all doses = 8 – 58%), 2) dizziness (placebo = 0 – 7%, all doses = 6 – 20 %), 3) diarrhea (placebo = 2 – 9%, all doses = 7 – 17%), 4) eating disorder (placebo = 4 – 8%, all doses = 4 – 19%), and 5) agitation was not reported. No serious adverse events were reported.
Number of studies. Eight studies evaluating tacrine were eligible for this review; tacrine was compared to placebo in six trials108, 109, 110, 111, 112, 113 (lecithin was assumed to be like placebo) and to other drugs in two trials.114, 26 One study114 compared two arms with tacrine, one with silymarin added and one placebo arm. The other non-placebo controlled trial compared tacrine with idebenone.26 These drug versus drug trials were published in 1999 and 2002. The placebo controlled trials were published 1991,111 1994,108, 109, 113 1996,110 and 1999.112
Design/methodology. The placebo studies evaluated 994 patients in total with a range of 13111 to 663108 subjects per study. Quality scores out of 8 points were evenly distributed with two studies each having scores of 5, 6 and 7. Both drug versus drug trials had scores of 7. One study109 was not funded by industry, and one trial114 did not report its source of funding; the other six studies had at least partial, if not full, industry support.
Populations. All studies included AD subjects; one trial26 also included PDD patients. Subjects in all studies had mild to moderate or probable disease. Five of the studies113, 109, 111, 114, 26 reported that their sample was from the community. The mean age ranged from 68110 to 75 years,113 and the percentage of male subjects in the studies varied from 13110 – 54%.113 One study reported race with 100% white subjects.26
Primary outcomes. All six of our identified domains were evaluated by at least one trial. All trials measured cognition; however, sufficient data to permit pooled analyses could not be adequately abstracted from all these studies that had similar outcomes. Baseline MMSE varied from 14 to 18 in the fours trials112, 110, 109, 114 that reported this.
Analysis. Five studies108, 112, 113, 114, 26 reported ITT analysis and three did not.109, 111, 110
Results and interpretation. Of the six placebo-controlled studies, only one trial108 showed statistical significance for general cognitive function as measured by the ADAS-cog (ES = -0.268). Three doses (80 mg,120 mg,160 mg) were compared in this trial, and the 120 and 160 mg per day were shown to be statistically significant (approximately a mean change of 2 points on the ADAS-cog). One trial112 showed mixed results for the two outcomes used (CASI and MMSE) to evaluate general cognitive function; this trial was underpowered for both these outcomes (PW = 0.22 and 0.26, respectively). Three trials109, 110, 111 found no statistical differences between treatment and placebo but had small sample sizes, ranging from 12 to 32 subjects, and were likely underpowered (insufficient reporting to estimate power). A fourth trial113 also found no statistical difference (p = 0.55) for general cognitive function, but the study duration was 12 weeks. It should be noted that this study used an 80 mg dose, which was shown to have no benefit relative to higher doses of 120 and 160 mg.108 A single trial109 of small sample size evaluated specific cognitive tests and did not show statistical differences.
Three studies evaluated global assessment, and two 113, 108 found statistical significance; the trial showing no benefit112 also showed inconclusive findings for general cognitive function as well (PW = 0.05 for the CGIC). Four trials108, 109, 110, 113 evaluated behavior/mood and showed no difference between groups. Two trials109, 111 with small sample sizes measured quality of life/ADL and showed no significant changes; lack of sufficient power cannot be ruled out. There were not enough similar outcomes reported to complete a pooled analysis for tacrine.
The quality scores for reporting adverse events varied from 1 to 3. The proportion of subjects withdrawing due to adverse events ranged from 0 – 12% for placebo and 0 – 55% in the treatment group. The higher rates of withdrawal were associated with higher doses. Elevated alanine transaminase (ALT) or hepatic abnormality (placebo = 4 – 13%, all doses = 7 – 67%) was reported in six studies, suggesting the potential for serious liver damage. None of these trials tested for differences between treatment and placebo with respect to adverse events. Five of the studies reported nausea and vomiting (placebo = 0 – 9%, all doses = 9 – 37%); gastrointestinal problems and dizziness (placebo = 0 – 16%, all doses = 4 – 14%) was also noted in several studies. Frequencies of other a priori symptoms of interest are as follows: 1) agitation (placebo = 5 – 12%, all doses = 5 – 9%), and 2) diarrhea (placebo = 0 – 13%, all doses = 4 – 18%).
Number of studies. Three studies evaluated velnacrine versus placebo, and these were published in 1991,115 1995,116 and 1996.117
Design/methodology. A total of 774 subjects were studied with sample sizes ranging from 16115 to 449.116 Quality scores out of a possible 8 points varied from scores of 6117, 115 or 7.116 All studies were sponsored by industry.
Populations. The characteristics of the populations all included probable AD subjects. The mean age of the participants ranged from 70.5 to 72.8 years and the percentage of male subjects ranged from 31 – 41%. Location of recruitment was not specified.
Intervention. The doses given for this drug overlapped between the studies, but they were on different schedules (once, twice, or three times per day). None of the studies had a titration period. One study117 compared four doses (three daily doses of 10 mg, 25 mg, 50 mg, and 75 mg). One study116 compared doses of 150 mg or 225 mg per day. The other study used a dose of 100 mg twice daily and had the smallest sample size.
Primary outcomes. General cognitive function was evaluated in all studies with the ADAS-cog, and none specified baseline MMSE values. A variety of outcomes were used to evaluate global assessment. At least one of these trials evaluated the other outcomes domains
Analysis. Only one of the trials116 used an ITT analysis.
All studies included assessment of global functioning, for which two117, 116 found significant differences, and one115 had mixed results. Behavior/mood was evaluated in only one study117 as a secondary outcome with an OC analysis, and no significant effect was found. Similarly, quality of life/ADL was measured in two studies as secondary outcomes, which produced opposite results (not significant117 and significant116). One of the trials116 measured effects on caregiver burden as a secondary outcome and found a significant effect. There were not enough similar outcomes reported to complete a pooled analysis for velnacrine.
Quality scores for reporting adverse events were 3 for all studies. Withdrawal rates varied from 0 – 22% for the placebo group and 5 – 33% for the treatment group. None of the studies reported a dose response. None of the studies tested for statistical differences between the placebo and treatment groups. Two studies reported aberrant hematology and hepatic abnormality116, 117; for these two studies the rate of occurrence were 2 – 21% for placebo, and 32 – 40% for all doses. All studies reported diarrhea and nausea and vomiting. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 0 – 4%, all doses = 3 – 8%), 2) dizziness (placebo = 3%, all doses = 0 – 8%), 3) diarrhea (placebo = 3%, all doses = 2 – 33%), 4) agitation (placebo = 4%, all doses = 1 – 4%), and 5) eating disorder (placebo = 1%, all doses 2 – 4%).
The remaining agents classified as cholinergic neurotransmitter modifying agents were grouped according to the number of studies for the purposes of presentation:
Cholinergic pharmacological agents that had two trials eligible for this review and were compared to placebo.
Cholinergic pharmacological agents with only one trial eligible for this systematic review.
| Drug | Number of studies vs. placebo |
|---|---|
| Alaproclate | 1 |
| Alprazolam *[Lorazepam] | 0* |
| Anapsos | 1 |
| BMY (Nootropic) | 1 |
| Carbamazepine | 2 |
| Citalopram *[Mianserin] *[Perphenazine] | 2** |
| Diphenhydramine *[Haloperidol, Oxazepam] | 0* |
| Divalproex | 2 |
| Fluoxetine *[Haloperidol] *[Amitriptyline] | 2** |
| Fluvoxamine | 1 |
| Haloperidol **[Risperidone] *[Loxapine] *[ Diphenhydramine Oxazepam ] *[Fluoxetine] *[ Tiapride] *[ Trazodone] | 4******* |
| Imipramine *[Paroxetine] | 1* |
| Lisuride | 1 |
| Lorazepam *[Alprazolam] *[Olanzapine] | 1** |
| Loxapine *[Haloperidol] *[Thioridazine] | 1** |
| Lu25-109 | 1 |
| Maprotiline | 1 |
| Melperone *[Tiapride] | 0* |
| Memantine | 3 |
| Mianserin *[Citalopram] | 0* |
| Minaprine | 1 |
| Moclobemide | 1 |
| Naftidrofuryl | 1 |
| Olanzapine *[Lorazepam] | 2* |
| Oxazepam *[Diphenhydramine Haloperidol] | 0* |
| Paroxetine *[Imipramine] | 0* |
| Perphenazine *[Citalopram] | 1* |
| Phosphatidylserine | 2 |
| Risperidone **[Haloperidol] | 1** |
| Selegiline *[Vitamin E] | 7* |
| Sertraline | 2 |
| Thioridazine *[Loxapine] | 1* |
| Tiapride *[Haloperidol] *[Melperone] | 1** |
| Trazodone *[Haloperidol] *[5′-MTHF] | 1** |
| Xanomeline | 1 |
Number of studies. Five studies124, 125, 126, 127, 128 evaluating haloperidol relative to placebo as well as another drug were included in this review. Three additional trials129, 130, 131 (from 1990, 1993, and 2001) compared haloperidol to another drug and did not include a placebo group (these are detailed in question 3). One study was published in each of 1982, 1997, and 1999; two were from 2000.
Design/methodology. Sample sizes for the placebo-controlled studies were generally small with samples of 15,128 64,126 149,125 344,127 and 306124 for an overall total sample size of 622 subjects. All but one of the studies had a quality score of 6 out of a possible 8 points; the other study127 had a score of 7 points. One study124 did not indicate a funding source, three studies125, 126, 127 indicated some industry funding, although none showed total industry funding, and one study128 had no industry funding.
Populations. Populations evaluated in the studies included three with only mild to moderate or probable AD,124, 125, 128 one with PDD and MID,126 and one with PDD, VaD, and mixed dementia127 (which reported subgroup information about VaD versus all subjects). Two placebo studies126 reported the presence of subjects with severe disease. Two trials126, 127 studied institutionalized patients while one128 looked at community subjects. Ages in the studies ranged from a mean of 72.7 to 81.0 years, and 33 – 49% of subjects were male.
Intervention. Haloperidol doses ranged from 3 mg to 20 mg per day for a treatment period of 3 weeks,124 6 weeks,128 10 weeks,126 12 weeks127 or 16 weeks.125 The other drugs that haloperidol was compared to included fluoxetine,128 loxapine,126 risperidone,127 tiapride,124 trazodone & BMT125 in the placebo controlled studies; loxapine,129 risperidone,131 oxazepam & diphenhydramine130 were evaluated in the head to head comparisons.
Primary outcomes. All studies evaluated behavioral outcomes, and at least one study evaluated the effect of haloperidol in each of the other domains included in this review with the exception of specific cognitive function. None of the studies reported baseline MMSE values.
Analysis. Two studies124, 127 reported ITT analysis and three128, 126, 125 did not.
The quality scores for reporting adverse events varied from 1 to 5. Only three of five studies reported withdrawal rates; the proportion of subjects withdrawing due to adverse events ranged from 5 – 17% for placebo and 17 – 33% in the treatment group. One trial124 showed a dose response effect, but the study only lasted for 3 weeks. Three trials tested for differences between treatment and placebo with respect to extrapyramidal symptoms (placebo = 17 – 32%, all doses = 34 – 97%), and two124, 125 found significant differences. One study125 found significant differences between groups for balance-related problems. Although reported by only two trials, the range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 3%, all doses = not reported), and 2) dizziness (placebo = 24%, all doses = 21%), 3) no frequencies were reported for agitation, diarrhea, or eating disorder.
Number of studies. Three studies comparing memantine to placebo were eligible for review. One study was published in 1999132 and the other two133, 134 in 2002.
Design/methodology. Sample sizes ranged from 166 to 579 for a total population evaluated of 1066 subjects. Two studies132, 133 earned 6 points out of 8 for the quality score while the other134 earned 7 points. One report132 did not indicate the source of funding, and the other two had industry support or funding.
Populations. Two studies133, 134 included VaD patients only, one of which134 analyzed subgroups based on MMSE, type of VaD, and gender. The other study132 included VaD, DAT and PDD patients and did subgroup analysis comparing VaD to DAT and grouping for care dependence. One trial132 included patients with severe disease and was the only study to report that all of their subjects were institutionalized. One study134 included only community subjects and the other study133 did not report source of patients. Study subjects had a mean age of 71.2,132 76.4,133 and 77.4134 years, and 42 – 53% were male.
Intervention. Two studies133, 134 had a 4 week titration period with a final dose of 20 mg per day for the remaining 24 week study duration. The third study used a 2-week titration period with a final dose of 10 mg per day for the remaining 10 weeks of the study.
Primary outcomes. All studies evaluated global function. The ADAS-cog was evaluated in two studies134, 133 and showed smaller changes of decline relative to placebo by approximately 1.5 points. All studies measured global function with the CGI-C but did not provide variance data to permit the calculation of the pooled estimates. Although, all trials measured MMSE, none reported baseline values for this outcome. Only one trial132 evaluated behavior/mood and quality of life/ADL. No study evaluated specific cognitive function or caregiver burden.
Analysis. All studies performed ITT analysis.
The quality scores for reporting adverse events varied from 3 to 4. Only two of three studies reported withdrawal rates; the proportion of subjects withdrawing due to adverse events ranged from 7 – 13% for placebo and 9 – 12% in the treatment group. One trial133 tested for differences between treatment and placebo, and none of the comparisons were significant. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 3%, all does = 5%), 2) dizziness (placebo = 3 – 8%, all doses = 6 – 11%), 3) diarrhea (placebo = 4%, all doses = 4%), 4) agitation (placebo = 7 – 8%, all doses = 4 – 5%), and none reported eating disorder as an adverse event.
Number of studies. Six studies evaluated the effect of selegiline compared to placebo.135, 136, 137, 138, 139, 140 A single study135 compared selegiline to vitamin E, placebo and the combination of selegiline plus vitamin E . The studies were published in 1991,136 1992,137 1993,138 1997,135 1998,140 and 1999.139
Design/methodology. Sample sizes ranged from 10137 to 341135 with a total population evaluated of 733 subjects. Study quality scores were 5,140, 135, 137 6,139, 138 and 7.136 Three trials139, 137, 136 did not report the source of funding, and the other three140, 135, 138 had some industry support.
Populations. Studies included patients with mild to moderate PDD, DAT, and AD. Two subgroup analyses based on the results of the clock drawing test139 and the GDS result136 were reported. One study reported that the included patients were institutionalized.139 Mean age of the subjects in the trials ranged from 68.6 to 83.0 years and all had male subjects (29 – 74%).
Intervention. All trials used the same dose, 10 mg per day, with three of the trials135, 138, 137 giving the drug in two 5 mg doses. One trial140 reported a titration period of 7 days. The duration of the trials varied with treatment times of 2 months,137 3 months,136 6 months,139, 140 15 months,138 and 24 months.135
Primary outcomes. Quality of life/ADL and caregiver burden were not evaluated in any of the studies.
Analysis. Two studies135, 140 carried out ITT analysis.
There is some evidence that shows that selegiline and selegiline combined with vitamin E, increases the time to important functional decline milestones135 using time to event in the survival analysis. The results of this study showed that the vitamin E, selegiline, and combined groups were statistically different (i.e., declined less) from the placebo group in analyses that included baseline MMSE score as a covariate (not significant when excluded). The median survival was 230 days (vitamin E), 215 days (selegiline), and 145 days (combined group). Moreover, the vitamin E group showed a statistically significant difference for the endpoint of institutionalization, and the other treatment groups did not. Thus, the findings of this study suggest that selegiline and vitamin E may delay clinically important deterioration in patients with moderately severe AD; this delay varied from 20 to 32 weeks. It should be noted that this study evaluated subjects over a 2 year period, the longest of any dementia trial; moreover, the population was moderate to severe with respect to severity.
The quality scores for reporting adverse events varied from 0 to 3. The proportion of subjects withdrawing due to adverse events ranged from 0 – 4% for placebo and 0 – 9% in the treatment group. Two trials137, 138 did not report any adverse events. Only one trial135 tested for differences between the treatment and placebo groups and showed that balance (worse) and falls were significantly different between groups (particularly the group with selegiline combined with vitamin E (22%) versus placebo (5%)). However, when adjusted for multiple comparisons, these were no longer significant. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 2%, all doses = 0%), 2) dizziness (placebo = 2 – 20%, all doses = 0 – 30%), and 3) agitation (placebo = 4 – 16%, all doses = 4 – 23%); no trial diarrhea or reported eating disorder as an adverse event.
Ten non-cholinergic neurotransmitter neuropeptide modifying agents versus placebo were studied in only two included trials:
| Drug | Number of studies vs. placebo |
|---|---|
| Aniracetam | 1 |
| 5′-MTHF *[Trazodone] | 0* |
| Amitriptyline *[Fluoxetine] | 0* |
| Ateroid | 1 |
| Buflomedil | 1 |
| Cerebrolysin | 6 |
| Choro-San | 1 |
| Choto-San | 1 |
| Citicoline *[Posatirelin] *[Sulphomucopolysaccharides] | 0** |
| Cyclandelate | 2 |
| Denbufylline | 1 |
| Desferrioxamine | 1 |
| Diclofenac | 1 |
| Ergokryptine (CMB 36-733) | 1 |
| Ergokryptine (Dek) | 1 |
| Estrogens | 5 |
| Ginkgo Biloba | 3 |
| Glycosaminoglycan Polysulfate | 1 |
| Guanfacine | 1 |
| Hydergine *[Pyritinol] | 1* |
| Hydroxychloroquine | 1 |
| Idebenone *[Tacrine] | 4* |
| Indomethacin | 1 |
| Misoprostol *[Diclofenac] | 0* |
| Monosialotetrahexosylganglioside (GM-1) | 1 |
| N-Acetylcysteine | 1 |
| Nimesulide | 1 |
| Nimodipine | 2 |
| Nizatidine | 1 |
| Nootropic | 1 |
| ORG 2766 | 2 |
| Oxiracetam | 5 |
| Pentoxifylline *[Sulodexide] | 3* |
| Piracetam | 1 |
| Prednisone | 1 |
| Propentofylline | 4 |
| Pyritinol *[Hydergine] | 0* |
| Silymarin + Tacrine *[Placebo + Tacrine] | 0* |
| Simvastatin | 1 |
| Sulphomucopolysaccharides *[Citicoline] | 0* |
| Sulodexide *[Pentoxifylline] | 0* |
| Thiamine | 1 |
| Vasopressin (DDAVP) | 1 |
| Vincamine | 1 |
| Vitamin E *[Donepezil] *[Selegiline] | 1** |
| Xantinolnicotinate | 1 |
Number of studies. Six included studies 168, 169, 170, 171, 172, 173 compared cerebrolysin to placebo. One report was from 1994,172 one from 1999,171 two from 2000,169, 170 one from 2001,168 and one from 2002.173
Design/methodology. The sample size in the studies ranged from 53169 to 192173 with a total of 819 subjects. The quality of studies varied from scoring 6171 to 8168, 173 points out of a possible 8 points. One study172 did not indicate the source of funding, one trial had non-industry funding,173 and the four remaining trials were funded by industry.168, 169, 170, 171
Populations. All but one of the six studies included AD patients; one study171 evaluated patients who had mild to moderate VaD. Mean ages of the subjects in the studies ranged from 69.7171 to 74.1 years.173 The proportion of males in the trials varied from 34 – 69%, and only one trial173 specified the proportion of Caucasians.
Intervention. All of the studies used the same dose of cerebrolysin, 30 ml per day, for 5 days per week. One trial172 was for 28 days, four studies169, 170, 171 lasted 4 weeks, one trial168 lasted 16 weeks, and one trial173 lasted 24 weeks.
Primary outcomes. Most studies evaluated general cognitive function and three trials168, 169, 173 used the ADAS-cog. Baseline MMSE was reported in a single trial173 with a score of 21. All studies evaluated global function, and at least two studies evaluated one outcome in each of the remaining domains with the exception of caregiver burden.
Analysis. All but one173 of the studies used ITT analysis.
Results and interpretation. See Summary Table 16. Four of the five studies that evaluated general cognitive function showed significant differences.169, 168, 170, 171 Figure 19
All trials evaluated global assessment, and all except one trial171 reported a significant difference. Figure 20
Two169, 172 of the six trials scored 5 out of 5 on our quality scale for rating adverse events, yet they did not report any adverse events. Two studies173, 168 scored 4, and the other two trials scored 3171 and 2.170 All the studies with scores equals to 4 or less tested for statistical differences in adverse events between placebo and treatment groups. Withdrawals due to adverse events were not reported in one study,170 and were 1% in two studies.173, 168 Three studies169, 172, 171 reported no withdrawals. A significant difference between treatment and control group was reported in one study173 for weight change, anxiety, and headache. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 10 – 24%, all doses = 3 – 21%), 2) dizziness (placebo = 0 – 12, all doses = 1 – 8%), and 3) agitation (placebo = 1%, all doses = 0%), and none reported diarrhea or eating disorder as an adverse event.
Number of studies. Five studies174, 175, 176, 177, 178 evaluated estrogens for dementia patients: one published in 1999,178 three in 2000,176, 177, 175 and one in 2001.174 None compared estrogens to another drug.
Design/methodology. The number of subjects included in the studies ranged from 15178 to 120 subjects177 with a total of 247 patients. Quality of the studies ranged from 5174 to 8178 points out of a possible 8 points. All studies were partially or fully funded by industry.
Populations. Four of the studies176, 177, 175, 174 included patients with mild to moderate AD, and one study178 included moderate to severe dementia patients who were all institutionalized. Only one of the studies178 included male subjects. Mean age ranged from 71.8175 to 80.0 years174 in the AD studies, and it was 83.8 in the dementia study.178
Intervention. One of the studies with AD patients used 0.10 mg per day174 for 8 weeks, and the others used 1.25 mg per day for 12 weeks,175 16 weeks,176 and 52 weeks.177 The study178 including subjects with severe disease used 2.5 mg per day for 4 weeks.
Primary outcomes. At least one study evaluated each of the included domains with the exception of caregiver burden.
Analysis. Two of the studies177, 175 performed ITT analysis and the other three used OC analysis.
One178 of the five trials scored 5 out of 5 on our quality scale for rating adverse events, and surprisingly, this same trial did not report any adverse event. Two trials176, 177 scored 3; one trial175 scored 2, and one174 scored 1. This latter study reported adverse events, but did not test for significant differences between groups. Withdrawal rates due to adverse events ranged from 0 – 5% for placebo and 0 –14% for the treatment group. The most frequently reported adverse event was vaginal bleeding,175, 177, 176 and a single trial175 reported a significant difference between placebo and treatment group for vaginal bleeding. It was not clear from the descriptions provided in the study if they had ascertained whether vaginal bleeding was present prior to the trial commencement. Nausea was the single a priori symptom of interest that was reported and by a single trial; frequencies varied from 0% for the placebo group and 4% for the treatment group.
Number of studies. Three studies179, 180, 181 evaluating Ginkgo biloba were eligible to be included in this review. All of the studies compared the drug to placebo only. One of the studies was reported in 1996181 and two were reported in 1997.179, 180
Design/methodology. The studies included evaluated 20 subjects,180 216 subjects,181 and 327 subjects179 (totaling 563 subjects). Two of the reports179, 181 scored 8 quality points out of a possible 8 points, and the other180 earned 6 points. One study did not indicate the funding source,180 and the other two had industry funding.
Populations. All of the studies included a mix of dementia diagnoses as follows: 1) mild to moderately severe AD and MID,179 2) mild to moderate DAT and PDD180, and 3) mild to moderate DAT and MID181 in community dwelling patients. Two of the studies reported subgroup analysis, one comparing diagnoses and comparing effects based on baseline MMSE score179 and the other based on diagnosis.181 The patients in these trials had mean ages of 64.6,180 69.0,179 and 69.6 years.181
Intervention. Two of the trials gave 240 mg per day for 3 months180 and 6 months,181 and the other trial179 gave 40 mg three times daily for 12 months.
Primary outcomes. None of the studies reported on quality of life/ADL or caregiver burden.
Analysis. All of the trials used an ITT analysis.
One180 of the three trials scored 5 out of 5 on our quality scale for rating adverse events. One study181 scored 4, and one trial179 scored 3. Two studies181, 180 had no withdrawals due to adverse events, and one trial179 had a withdrawal rate of 6% for both placebo and treatment groups. Two studies179, 180 did not report any adverse event. One study181 reported a statistically significant difference between the treatment and the placebo group for skin disorders. The same study reported gastrointestinal and headache adverse effects, but did not test for statistical differences between the placebo and the treatment group. None of the trials reported any of the a priori symptoms of interest.
Number of studies. Four studies182, 183, 184, 185 were included in this review that evaluated idebenone versus placebo, and one study26 compared idebenone to tacrine but not to placebo. The placebo trials were published in 1992,184 1994,182 1997,185 and 1998,183 the tacrine trial was published in 2002 by the same author as a previous placebo trial.183
Design/methodology. Sample sizes in the studies ranged from 92182 to 450 subjects183 with a total of 950 patients in the placebo-controlled studies. The study comparing idebenone with tacrine included 203 subjects, but a large number withdrew, and only 44 completed the trial. One of the trials185 earned 5 points out of a possible 8 points on the quality scale, two of the trials earned 6 points,182, 183 and one earned 7 points.184 None of the placebo studies reported their funding source. The tacrine study earned 7 points on the quality scale and was partially funded by industry.
Populations. The studies included patients with AD, MID, PDD, and DAT. Two of the trials182, 183 reported that the subjects had mild to moderately severe disease and the remainder reported mild to moderate disease. Two of the studies reported subgroup analysis based on disease severity.185, 183 Mean ages in the studies ranged from 69.9183 to 73.6 years.184
Intervention. Dosing schemes were 30 or 90 mg per day for 6 months,185 30 mg three times per day for 3 months,182 45 mg twice daily for 4 months,184 and 120 mg three times per day for 12 months.183 The tacrine trial used 360 mg per day for 14 months.
Primary outcomes. Caregiver burden was the only domain in this review that was not evaluated in at least one of the studies.
Analysis. Two of the studies183, 185 used ITT analysis while the other two used OC analysis. The tacrine trial used ITT analysis.
Quality scores for reporting adverse events varied from 1 to 5. Rates of withdrawal due to adverse events varied from 0 – 5% for the placebo group and 0 – 5% in the treatment group; a single trial183 did not report withdrawal rates. Two trials183, 185 tested for statistical differences between groups and found no differences. Although no clear pattern emerges, three studies identified at least one balance-related adverse event across studies. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 2%, all doses = 2 – 11%), 2) dizziness (placebo = not reported, all doses = 2%), and 3) not reported for diarrhea, agitation, or eating disorder as an adverse event.
Number of studies. Five trials186, 187, 188, 189, 190 included in this review evaluated oxiracetam versus placebo. The studies were published in 1988,188 1989,187 and 1992.190, 189, 186
Design/methodology. A total of 554 patients were included in the studies, ranging from 30 patients188 to 289 patients.187 Four of the studies earned 6 points out of a possible 8 points on the quality scale, and the other study189 earned 4 points. Two of the studies189, 187 did not report the source of their funding, and the other three trials had partial industry funding.
Populations. The trials included a mixture of diagnoses, including AD, PDD, mixed dementia, and MID, and none of the studies reported severe disease. One of the studies187 performed subgroup analysis based on diagnosis, comparing MID to PDD. The mean age of the subjects included in the trials ranged from 62.0188 to 73.8 years.189
Intervention. All of the trials used a dose of 800 mg twice daily, for a duration of 12186, 187 to 26 weeks.189
Primary outcomes. At least one trial evaluating oxiracetam evaluated one of the outcome domains examined in this review with the exception of caregiver burden. A single trial189 reported baseline MMSE at 22 for both placebo and treatment groups.
Analysis. None of the trials used ITT analysis.
The quality scores for reporting adverse events varied from 2 to 5. The proportion of withdrawals due to adverse events varied form 0 – 9% for the placebo group and 0 – 6% for the treatment group. No clear pattern for adverse events is evident, but three of the five studies reported gastrointestinal related problems, primarily associated with abdominal pain. Although, only single trials evaluated the range of frequencies of the a priori symptoms of interest are as follows: 1) dizziness (placebo = not reported, all doses = 11%), and 2) agitation (placebo = 1%, all doses = not reported); no trial reported nausea, eating disorder, or diarrhea as an adverse event.
Number of studies. Three trials191, 192, 193 in this review evaluated pentoxifylline versus placebo. One trial,194 published in 1997, compared pentoxifylline to sulodexide rather than placebo. The placebo trials were published in 1987,193 1992,192 and 1996.191
Design/methodology. The studies included 36 patients,193 64 patients192 and 289 patients.191 The sulodexide trial included 93 patients. All placebo trials had 6 points out of a possible 8 points on the quality scale and had partial or full industry funding. The sulodexide trial earned 5 points on the quality scale and did not report the source of funding.
Populations. The three placebo-controlled trials included patients with mild to moderate MID, and one trial193 also included PDD patients. The sulodexide trial had only patients with mild to moderate VaD. Subgroup analysis was performed in two trials, looking at MID versus PDD diagnosis193 and grouping by vascular change versus discrete stroke.192 The mean age of the studies ranged from 69.7191 to 77.0 years.193
Intervention. All of the studies gave 1200 mg per day of pentoxifylline; one study gave the drug once a day for 9 months,191 one study gave 400 mg three times per day for 9 months,192 and one gave 400 mg three times per day for 3 months.193 The sulodexide study gave the drug once a day for 6 months.
Primary outcomes. At least one trial evaluated one of the outcome domains examined in this review with the exception of caregiver burden.
Analysis. A single191 trial used an ITT analysis.
The quality scores for reporting adverse events were generally low, varying from 1 to 3. Withdrawal rates due to adverse events varied from 0 – 25% in the placebo group and 0 – 22% in the treatment group. The two studies that reported adverse events indicated the presence of gastrointestinal disturbances, including abdominal pain or nausea and vomiting (placebo = 7% and all doses = 14%). None of the trials reported dizziness, agitation, eating disorder or diarrhea.
Number of studies. Four studies195, 196, 197, 198 in this review evaluated propentofylline versus placebo. The first trial was published in 1990.198 Two, by the same author, were published in 1996196 and 1998.197 One was published in 1997.195
Design/methodology. The number of subjects in the studies ranged from 30 subjects197, 196 to 260 subjects,195 with a total of 510 subjects. Three of the studies197, 195, 196 earned 5 points out of a possible 8 points on the quality scale, and the other study198 earned 6 points. Only one study indicated the source of funding for the trial,196 and it was industry-supported.
Populations. The trials included subjects with mild to moderate AD only,197 VaD only,196 mild dementia only,198 and mild to moderate combined AD and VaD.195 Two trials presented subgroup analysis: one for AD versus VaD,195 and one based on MMSE baseline score. The mean age in the studies ranged from 64.8 197 to 72.4 years.195
Intervention. All four studies gave 300 mg three times a day for 3 months with the exception of one trial195 which had a duration of 12 months.
Primary outcomes. At least one trial evaluated one outcome in each of the domains examined in this review with the exception of caregiver burden. Baseline MMSE was reported in two trials197, 198 and varied from 20 and 21 for both placebo and treatment groups.
Analysis. One of the studies195 used an ITT analysis.
The quality scores for reporting adverse events varied from 1 to 4. The percentage of withdrawals varied from 0 – 13% for the placebo group and 0 – 12% for the treatment group. None of the trials tested for differences between groups. Three of the trials195, 197, 198 reported gastrointestinal events that included abdominal pain, constipation, and nausea and vomiting (placebo = 2%, all doses = 7%). Dizziness (placebo = 3 – 5%, all doses = 1 – 6%) was the only other a priori symptom of interest.
Interventions with two studies included for review. Six other agents were compared with placebo in only two included trials:
The concept of “delay onset” was operationalized to imply delay in conversion from a cognitive disturbance state, classified as MCI, CLOND or CIND, to a true dementia state. No studies with this population met the final eligibility criteria, although four trials231, 232, 233, 234 advanced to the full text screening stage. The lack of studies eligible for evaluation in this systematic review points to a gap in the literature for pharmacological interventions (attempting to demonstrate a delay in disease onset) in MCI-type populations.
In general, very few studies evaluated patients who were classified as “severe”. Five studies126, 208, 129, 178, 132 had moderate to severe groups of dementia patients, and only one trial reported all three levels163 of the disease spectrum. The interventions evaluated in these trials were estrogen, haloperidol, glycosaminoglycan polysulfate, memantine, and naftidrofuryl. This suggests that there is a bias in the trials eligible in this systematic review towards evaluating mild to moderate disease; this in turn reflects the underlying assumption that the less severe groups are most likely to benefit from drug trials. Since so few studies have evaluated the more severe groups, this assumption may require some empirical justification. Therefore, delay in progression has not been considered in severe patients.
The selected studies used two approaches for showing “delaying disease progression”. The first method for evaluating the potential for a drug to delay disease progression used longer-term follow-up; survival analyses (time to a relevant event) were then used to show differences between the two groups. The second design approach used withdrawal from treatment for a period and continued monitoring of the treatment and placebo groups (to demonstrate a deviation of the treatment group from the natural history as represented by the placebo group). Such designs have been termed withdrawal, active-extension, randomized withdrawal, randomized start, and staggered start.235, 236, 19, 21 From our 186 included studies, we then further selected a subgroup of papers that had the potential to demonstrate delay in disease progression through the use of one of these two designs. Therefore, any eligible trial that employed a survival analysis or a two-period approach, where the pharmacological agent was withdrawn during one of the periods, was selected for further evaluation to answer this question.
Two studies135, 61 using survival analyses were identified. In a 2-year study135 that compared placebo to three other groups (selegiline, selegiline with vitamin E, and vitamin E), time to the development of significant dementia milestones (death, institutionalization, loss of ability to perform ADL, or score on scale indicating severe dementia) was used as the time to event in the survival analysis. The results of this study showed that the vitamin E, selegiline, and combined groups were statistically different (i.e. declined less) from the placebo group in analyses that included baseline MMSE score as a covariate (not significant when excluded). The median survival was 230 days (vitamin E), 215 days (selegiline), and 145 days (combined group). Moreover, the vitamin E group showed a statistically significant difference for the endpoint of institutionalization, and the other treatment groups did not. There were no statistical differences between groups with respect to adverse events. Thus, the findings of this study suggest that selegiline and vitamin E may delay clinically important deterioration in patients with moderately severe AD; this delay varied from 20 to 32 weeks. The second study61 used survival analyses to evaluated the time to the development of severe functional impairments in a comparison of placebo and donepezil with a follow-up of 54 weeks. The results of the Kaplan-Meier analysis showed a mean number of days to significant functional decline of 252 days for placebo and 357 days for the donepezil group (mean difference of 100 days). The treatment group was 38% less likely to decline over a 1-year period. Both these studies demonstrated some delay in disease progress varying from 100 to 230 days for these three different pharmacological agents.
Delay in disease progression can also be evaluated using a “time to return to baseline” following withdrawal of treatment. Similarly, staggering the start of the treatment parallels the staggered withdrawal and can be used to evaluate disease progression. In this design approach, the time to return to baseline is compared to the placebo group, which represents the natural course of the disease. Of the studies that were eligible for this research question used a classic withdrawal design (withdrawal in period II after the intervention was administered); none of these studies were able to maintain double blinding after the withdrawal of the intervention. Justification for the selection of the length of the washout or follow-up period was not consistently provided (which possibly reflects the lack of a priori aim to show delay in progression).
| Study | Drug | Schedule | Result |
|---|---|---|---|
| Ruether 2001 | Cerebrolysin | 4w drug + 8w washout + 4w drug + 12w washout | All patients: |
| ADAS-noncog, maintained difference from placebo | |||
| NAI returned to baseline | |||
| Subgroup MMSE<20: | |||
| ADAS Noncog, CGI, ADAS-cog, SKT maintained difference from placebo | |||
| Nyth 1990 | Citalopram | 4w drug + 8 w open drug + 4w new random drug | NR |
| Rogers 1996 | Donepezil | 12w drug + 2w SB Pl washout | 5 mg maintained effect, 3 mg no maintenance of effect for ADAS-cog (NS) |
| Rogers 1998b | Donepezil | 24w drug + 6w SB and placebo washout | Return to placebo levels for ADAS-cog, MMSE, CIBIC (all NS) |
| Wilcock 2002 | Memantine | 2w SB and placebo + 28w drug + 2w SB placebo washout | NR |
| McKeith 2000 | Rivastigmine | 20w drug + 3w rest | Return to placebo levels for NPI and computerized cognitive assessment (NS) |
| Antuono 1995 | Velnacrine | 2w SB placebo + 24w drug + 6w SB placebo washout | Return to placebo levels for the ADAS-cog but SC for CGIC remained for washout |
| Bodick 1997 | Xanomeline | 24w drug + 4 w SB placebo | SC at week 24 with CNTB |
| No differences vs. placebo at w4 of washout | |||
| Study | Drug | Schedule | Result |
|---|---|---|---|
| Dehlin 1985 | Alaproclate | 2w placebo + 4w drug + 2w placebo | SC for GBS intellectual subscale at w4 of treatment |
| No significant difference at w2 of washout | |||
| Cutler 1993 | BMY 21,502 | 12w drug + 4w placebo washout | Treatment showed no significant change and follow-up showed no change |
| Amaducci 1988 | Phosphatidylserine | 3m drug + 21 m follow-up | SC remained for severe disease patients, not moderate |
| Raskind 1997 | Metrifonate | 26w drug + 8w follow-up | NR |
| Parnetti 1995 | Posatirelin | 90d IM + 30d follow-up placebo | “Maintained positive effect” but specific numbers not reported |
| Agid 1998 | Rivastigmine | 10w drug + 2w placebo washout | NR |
From a methodological perspective, addressing the question of being “more effective” requires head to head comparisons of pharmacological interventions. If one intervention (Drug A) has been shown to be effective relative to placebo of a specified effect size, and a second intervention (Drug B) has been shown to be effective at a lower magnitude relative to placebo, it does not necessarily follow that Drug A is more effective than Drug B. Comparisons of the relative effectiveness of certain drugs can only be evaluated in the context of head to head evaluation within the same trial. Those studies undertaken as direct comparisons are summarized below.
A total of 26125, 152, 129, 237, 130, 238, 239, 145, 150, 124, 114, 131, 127, 240, 241, 135, 242, 101, 243, 70, 194, 128, 244, 26, 92, 245 studies compared efficacy of two or more pharmacological agents relative to each other. In general, few drugs showed statistically significant differences relative to each other. Those that did include the following (drug performing better is listed first):
Sulphomucopolysaccharides versus CDP-choline238 - Significant differences were seen in favor of sulphomucopolysaccharides in measures of behavior and global assessment in 30 institutionalized patients with mild to moderate MID.
Donepezil and vitamin E70 - Significant differences were seen in favor of donepezil in general cognitive function in 54 patients with mild AD.
Antagonic stress versus nicergoline92 - Significant differences were seen in favor of antagonic stress in cognition as well as a global assessments in 62 subjects with mild to moderate AD.
Antagonic stress versus meclofenate242 - Significant differences were seen in favor of antagonic stress in measures of cognition and global assessment in 63 patients with mild to moderate AD.
Posatirelin versus citicoline101 - Significant differences were seen in favor of posatirelin in general cognitive measure and mood in 222 community living patients with mild to moderate AD.
Pyritinol versus hydergine243 - A significant difference was found in favor of pyritinol in a global assessment measure in 102 Hispanic patients with mild to moderate AD.
Idebenone26 versus tacrine-Mixed results were observed; the Efficacy Index Score showing a significant benefit over tacrine, while the global assessment showed no difference in 203 AD patients, 44 of whom completed the study.
The following studies contained stratified analyses of outcomes for different clinical populations. A total of 22245, 211, 146, 179, 181, 132, 134, 142, 183, 185, 56, 168, 201, 63, 64, 55, 173, 62, 76, 75, 60, 159 studies addressed this question. During data abstraction, these trials were identified if the methods sections (including analyses) stated that stratified analyses were undertaken. Eight different variables were identified for which stratified analyses were reported. These included: age, gender, APOE genotype, disease type, disease severity (as determined by MMSE/ ADAS-cog threshold levels), treatment center, care dependence, and presence of depression. Of these 22 studies, seven trials245, 211, 146, 179, 181, 132, 134 evaluated disease type (AD, PDD, SDAT, MID, VaD). They will be discussed in Question 5 (see below).
| CITATION | DRUG | SUBGROUP | DRUG EFFECT |
|---|---|---|---|
| Alvarez 2000 | Anapsos | Disease severity | SC in ADAS-cog in patients with mild cognitive deterioration and with AD |
| NS in patients with VD | |||
| Gutzmann 1998 | Idebenone | Disease severity | NR |
| Weyer 1997 | Idebenone | Disease severity | SC for ADAS Total |
| ADAS total score ≥ 20 | |||
| Sano 1992 | Carnitine | MMSE | Low mMMSE group SC on the SRT and CSF levels of drug |
| High mMMSE group NS neuropsychological test scores, CGI ratings and CSF levels of drug | |||
| Ruether 2001 | Cerebrolysin | MMSE | Subgroup MMSE < 20: SC in CGI, ADAS-cog, NAI and ADAS-Noncog. Suggests it's because this group had reduced placebo response. |
| Schellenberg 1997 | Cyclandelate | MMSE, ADAS-cog, Treatment center | NR |
| Feldman 2001 | Donepezil | MMSE Psychoactive drug use | NR |
| Tariot 2001a | Donepezil | MMSE (10–26) Age | MMSE group: SC greater differences than for the whole group for MMSE, GDR |
| Older patients group: SC for MMSE, CDR | |||
| Thal 1996a | Carnitine | Age | SC age-by-treatment interaction on the ADAS-cog ITT population |
| Patients ≤ 65 years significant difference in decline for ADAS-cog favoring Carnitine but not for CDR | |||
| Patients > 65 years NS | |||
| Panisset 2002 | Cerebrolysin | APOE genotype | NS association of the APOE e4 status and response to study drug |
| Winblad 2001b | Donepezil | APOE genotype Gender | NS difference for the subgroups |
| Raskind 2000 | Galantamine | APOE genotype | NS |
| Wilcock 2000 | Galantamine | APOE genotype | NS for APOE group |
| MMSE | SC for MMSE < 18 | ||
| Prasher 2002 | Donepezil | Down syndrome ONLY in trial | NR |
| Reifler 1989 | Imipramine | Depression | Depressed patients SC higher HAM-D scale score. For MMSE patients with AD + depression had higher scores initially and improved significantly more over time |
SC = Significant change
NS = Not statistically significant
NR = Not reported
In general, very few studies examined the efficacy of drugs with respect to dementia by population characteristics. Three additional studies attempted to evaluate unique populations or population characteristics. Prasher et al.60 evaluated subjects who had Down's Syndrome with dementia and were treated with donepezil, and found none of the outcomes to be significant; this study had a sample size of 30 subjects and was underpowered. Ban et al.213 conducted a multicenter, placebo-controlled, double-blind study with Hispanic and Italian populations. This study was not designed to specifically evaluate the efficacy of glycosaminoglycan polysulfate by ethnicity. However, the study included centers from Mexico, Panama, Naples, and Trieste. This study examined whether the changes encountered in the different outcome measures could be related to center effect, but no statistically significant center effect was found. While this study suggests that ethnicity may have minimal impact, future studies should specifically assess the impact of racial composition on the efficacy of drugs.
A total of 20 pharmacological interventions in 29 studies211, 220, 238, 171, 200, 199, 146, 68, 181, 184, 133, 134, 132, 161, 89, 91, 93, 247, 187, 191, 192, 194, 193, 100, 98, 196, 195, 245, 217 were applied specifically to dementias classified as VaD. Sixteen studies evaluated populations entirely composed of patients with VaD (or MID), and the remaining 13 trials had VaD as a subgroup. The majority of these pharmacological interventions (n = 14) were represented by a single trial, limiting the extent of the evidence; these included ateroid, buflomedil, cerebrolysin, sulphomucopolysaccharides (CDP choline), citalopram, donepezil, Ginkgo biloba, idebenone, minaprine, nimodipine, nicergoline, oxiracetam, 5-THF (trazodone), vincamine, and xantinolnicotinate. Surprisingly, four of these trials did not report any results relative to placebo, and these included buflomedil, Ginkgo biloba, oxiracetam, and 5-THF (trazodone); all but one of these trials220 evaluated subgroups of VaD patients and likely did not posses sufficient power to evaluate differences. Six interventions had more than a single trial, and these included Choto-san (n = 2), memantine (n = 3), nicergoline (n = 2), pentoxifylline (n = 4), posatirelin (n = 2), and propentofylline (n = 2).
| CITATION | DRUG | SUBGROUP | DRUG EFFECT |
|---|---|---|---|
| Passeri 1993 | 5′-MTHF Trazodone (TRZ) | AD vs. MID | Equivalence study |
| When patients with AD were analyzed separately the same pattern of response to MTHF and TRZ was found in the HDRS and RVM as when they were analyzed together with patients with MID. | |||
| MID as separate group: HDRS was significantly reduced vs. baseline after 8 weeks of treatment in the TRZ group and only at the end of the follow-up period in the MTHF group. RVM remained unchanged in MID pts in both treatment groups. | |||
| Ban 1991b | Ateroid | PDD vs. MID | NR |
| Nyth 1990 | Citalopram | AD/SDAT vs. VaD | A period: |
| No improvement in the VaD group | |||
| SC in the AD/SDAT group in emotional bluntness, confusion, irritability, anxiety, fear-panic, depressed mood, and restlessness. MADRS scores significantly reduced | |||
| B period: | |||
| AD/SDAT group SC in emotional bluntness at week 8. NS at week 4 and 12. | |||
| NS for the VaD group. | |||
| LeBars 1997 | Ginkgo biloba | AD vs. MID+AD MMSE | AD subgroup: SC for ADAS-cog and GERRI |
| Kanowski 1996 | Ginkgo biloba | DAT vs. MID | Improvements at 24 weeks of treatment in comparison to baseline values were consistently slightly greater in the DAT group than in the MID group. Calculation of descriptive p-values seemed inappropriate due to the very small number of patients with MID in the sample. |
| Winblad 1999 | Memantine | AD/VaD Care dependence | NR for differences between dementia types |
| Care dependence: Patients with < 20 points on the CGI and BGP | |||
| Care dependence subscore shows slightly higher response rates than those with >20 points in the memantine group. | |||
| Wilcock 2002 | Minaprine | SDAT vs. MID | The largest treatment effect occurred in patients with baseline MMSE score < 15 (p = 0.04) and in those without cerebrovascular macro-lesions (p = 0.002) |
SC = Significant change
NS = Not statistically significant
NR = Not reported
This systematic review was undertaken primarily to evaluate the efficacy of pharmacological agents in the treatment of dementia. The studies were limited to parallel design RCTs with quality scores greater than 3 on the Jadad scale. The interventions were not limited to those currently on label by the FDA; it was of interest to cast a wide net and capture reports of pharmacological agents that are used off-label for the treatment of dementia. Since a variety of agents with different therapeutic effects were evaluated, the outcomes were not restricted to a specific subset of all available outcomes used in the dementia literature. The psychometric properties of some of the most commonly used outcomes have been critically appraised and found to be limited. Moreover, there is no current consensus as to which domains, and the outcomes within these, that best reflect clinically important change.
The studies eligible for review in this dementia report represent the highest form of evidence. This strongly suggests that these trials are more likely to be “well-designed, well conducted studies in representative populations that assess the effects of health outcomes”.53 The high quality scores also indicate that the studies evaluated in this systematic review have a relatively high level of internal validity. The characteristics of the population and the interventions were detailed to assist the reader in evaluating the degree of external validity. Similarly, attempts were made to highlight “consistency” in the evidence as well as the quantity of evidence and the magnitude of the reported changes.
Although, there is greater understanding on evaluating the evidence for the “benefits” of therapies, there is less clarity on determining the potential for harms from pharmacological interventions for treating dementia. With respect to adverse events and the potential for serious harms, greater variability in systematic collection and reporting of these were observed in the dementia pharmacological literature. Evaluation of the potential for harm is considered with three main points: 1) the most frequently reported adverse events across studies for a specific drug, 2) the overall withdrawal rate due to adverse events for both the control and treatment groups, and 3) the range of frequencies reported for a subset of symptoms (nausea, diarrhea, dizziness, agitation, eating disorder) selected a priori and evaluated for all pharmacological interventions.
At present there is no coherent framework that captures the disease processes present in dementia patients for the range of outcomes evaluated in this systematic review. This report details the highest evidence from both a design and internal validity perspective. It is our view that determining the clinical relevance (external validity) of such high-quality evidence must ultimately be reached by consensus amongst multidisciplinary experts within the decision-making body that will use this evidence for such purposes as developing practice guidelines.
A total of 97 interventions in 186 studies were eligible for evaluation in this systematic review and were distributed as follows:
A total of 16 different cholinergic neurotransmitter modifying pharmacological agents in 72 studies.
A total of 35 non-cholinergic neurotransmitter/neuropeptide modifying agents in 61 studies.
A total of 46 other pharmacological agents in 76 studies*.
two studies compared two NCNMAs with an OTHER.
two studies compared a CNMA with an OTHER.
one study compared a CNMA with two OTHERS.
two studies compared an NCNMA with an OTHER.
The evidence for all these pharmacological agents was presented in great detail in Chapter 3 and in Evidence Tables of Key Study Characteristics, Tables of Study Results, and Tables of Study Adverse Events contained in Appendix C. Conclusions regarding those pharmacological agents that had a minimum of three trials are summarized here. The summary of the pharmacological agents that had fewer than three trials can be found in Chapter 3.
Carnitine. Six trials evaluated carnitine in 925 subjects with mild to moderate severity, recruited predominately from the community at doses of 2 to 3 g for either 24 or 52 weeks. Evidence of benefit is conflicting for the domains of cognition. Most studies were not statistically significant and the lack of sufficient power may have been an important factor. Similarly, no significant differences were found in the domains of global assessment, behavior/mood, and quality of life/ADL; power could not be evaluated for the majority of these outcomes.
Four of the six studies scored 3 for quality on reporting adverse events. Withdrawal rates due to adverse events varied from 0 – 3% (excluding results from one outlier trial248), and gastrointestinal symptoms were the most frequently reported types of adverse events. The percent of subjects reporting the a priori symptoms of interest across all studies were as follows: 1) nausea (placebo = 6 – 14%, all doses carnitine = 28%), and 2) agitation (placebo = 6%, all doses carnitine = 7%). Dizziness, diarrhea, or eating disorder were not reported by any study. No serious adverse events requiring hospitalization and associated with carnitine were reported.
Donepezil. Ten trials in 3239 subjects evaluated the efficacy of donepezil compared to placebo, and one trial compared donepezil to a group given vitamin E. The majority of studies (n = 8) evaluated AD patients, for which half were recruited from the community (other studies did not specify). The subjects had predominately mild to moderate disease and doses of 5 or 10 mg were used with varying duration from 12 to 56 weeks.
There is consistent evidence of benefit in the domains of general cognitive function and global assessment. The combined effect sizes for the ADAS-cog and the CIBIC were estimated. Evidence is inconsistent for a dose response in these domains based on the three studies that evaluated two different doses (5 and 10 mg); the benefit was of similar magnitude for both dose groups for global assessment outcomes. Similarly, two of the three studies that evaluated behavior/mood outcomes (NPI) showed no statistically significant changes relative to placebo; these trials lacked sufficient power to detect a difference. There is some evidence of benefit in ADL outcomes, although this outcome domain was evaluated with a variety of instruments. Caregiver burden outcomes was evaluated in a single study that did not report the findings for this domain.
Adverse events quality scores were 3 or greater for the majority of studies (n=7). Four trials provided evidence of a dose response for adverse events. One study showed a statistical difference for balance-related problems and asthenia (neurological fatigue) between placebo and treatment groups. Withdrawal due to adverse events ranged from 0 – 18% for treatment groups and 0 – 11% for placebo. Four out of six studies testing differences between groups were statistically significant for diarrhea (placebo = 3 – 21%, all doses donepezil = 0 – 38%), nausea and vomiting (placebo = 4 – 9%, all doses donepezil = 4 – 25%). The other a priori symptom reported was agitation and frequencies for placebo varied from 0 – 8% and for all doses from 3 – 19%; but these were not shown to be statistically different.
Galantamine. Six trials in 3530 subjects evaluated the efficacy of galantamine compared to placebo. Doses of 24 and 32 mg were evaluated in half of these studies. Five studies evaluated AD patients and there was limited information regarding whether the subjects were from the general community or institutional settings. All studies recruited subjects with mild to moderate disease and the drug was administered with varying duration of 3 or 6 months.
Evidence of benefit is consistent in the domains of general cognitive function, global assessment, quality of life/ADL. Two of the three studies that evaluated, behavior/mood found statistically significant differences. A small dose effect was evident in the ADL domain when comparing the pooled estimates of the DAD; no dose effect was observed for outcomes in the global assessment domain, and dose effect could not be evaluated for the general cognition domain. The caregiver burden domain was not evaluated in any trials.
Five of the six trials scored 3 out of 5 on our quality scale for rating adverse events. Withdrawal rates due to adverse events ranged from 4 – 9% for placebo and 8 – 27% for the treatment group. One study showed a dose response for adverse events. Although, most trials did not report testing for differences between groups, two trials reported a statistically significant difference in weight loss with the treatment group having more than the placebo group. The most common types of adverse events reported were gastrointestinal symptoms (nausea and vomiting, diarrhea), eating disorders/weight loss, and dizziness. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea and vomiting (placebo = 3 – 13%, all doses = 6 – 44%), 2) dizziness (placebo = 3 – 11%, all doses = 4 – 19%), 3) diarrhea (placebo = 2 – 10%, all doses = 4 – 19%), 4) agitation (placebo = 1 – 9%, all doses = 6 – 15%), and 5) eating disorder (placebo = 0 – 6%, all doses = 4 – 20%).
Metrifonate. Nine studies compared metrifonate to placebo in 2759 subjects with mild to moderate AD (likely from community settings as the majority of studies did not specify this). Metrifonate dosages evaluated varied from 50 to 80 mg, and study duration ranged from 21 days to 26 weeks duration.
All but one study showed metrifonate to have a consistent positive effect on measures of general cognitive function; none of the studies evaluated any specific cognitive function measures. The effects on global assessment were less consistent, but suggested a positive effect in four of the eight studies that reported this outcome. Evidence for effect in the domains of behavior/mood and quality of life/ADL were not significant in the majority of studies that evaluated these domains, however these were primarily evaluated as secondary outcomes and likely lacked sufficient power.
With the exception of a single study, quality scores for reporting adverse events were greater than 3. However, only one trial83tested for differences between groups and found nausea and vomiting, diarrhea, and muscle and joint disorder to be significantly different. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea and vomiting (placebo = 3 – 14%, all doses = 2 – 50%), 2) dizziness (placebo = 1%, all doses = 3 – 4%), 3) diarrhea (placebo = 4 – 14%, all doses = 11 – 19%), 4) agitation (placebo = 2 – 14%, all doses = 8 – 33%), and none reported eating disorder as an adverse event. Withdrawal due to adverse events varied from 0 – 9% for placebo and 0 – 12% for the treatment group. Overall, it was difficult to determine which types of adverse events reported had the potential to cause serious harm. This is noteworthy as metrifonate has been withdrawn from use in North America, and Bayer has suspended Phase III trials,87 because some patients in clinical trials have experienced serious muscle weakness . This decision was based on the results of an experimental study showing risk of respiratory paralysis with the use of metrifonate. Other adverse events of concern included severe leg cramps, dyspepsia, and bradycardia. None of the studies we reviewed indicated that if present, these events differed significantly between groups. It is not clear if this inconsistency is a function of the methods used to collect and report adverse events or a limitation of RCTs as a source of detecting serious adverse events when incidence is low.
Nicergoline. Four trials in 705 subjects compared nicergoline to placebo and one trial compared it to antagonic-stress in mixed populations that included AD, MID, PDD, VaD, mixed dementia, and SDAT, which were classified as mild to moderate in severity.
All placebo-controlled trials found a positive effect for general cognitive outcomes, but half the results were based on OC analyses. The evidence was mixed for benefit in the domain of global assessments. No significant differences were found for behavior/mood, and quality of life/ADL outcomes, but these were evaluated in few studies and as secondary outcomes (suggesting that sufficient power was an issue).
Quality scores for reporting adverse events varied from 2 to 5 for these four trials, and none tested for differences between groups. Withdrawal due to adverse events varied from 0 – 8% for placebo and 0 – 9% for the treatment group. With the exception of headache, which was reported in all four trials, it was difficult to determine which types of adverse events most characterized exposure to this pharmacological agent. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 3%, all doses = 3%), 2) dizziness (placebo = 1–2%, all doses = 0% or not reported), 3) diarrhea (placebo = 2 – 6%, all doses = 2 – 4%), 4) agitation (placebo = 5%, all doses = not reported), and none reported eating disorder as an adverse event.
Physostigmine. Four studies of 1198 subjects with mild to moderate AD evaluated physostigmine administered in patch and oral form (30 to 60 mg dose) for study duration varying from 6 to 24 weeks. All subjects were recruited from the community.
There is evidence that physostigmine has a statistically significant effect on general cognitive function, as three of the four studies showed improvement. Evidence for an effect on global function was mixed with no consistent benefit. Similarly, for quality of life/ADL outcomes, all three studies that evaluated this domain were not statistically significant but these were secondary outcomes and may reflect a lack of power. Behavior/mood and caregiver burden were not tested in these trials.
The quality scores for reporting adverse events were generally low, scoring 1 or 2 out of 5. Withdrawal rates due to adverse events varied from 1 – 5% for placebo and 12 – 55% in the treatment group, with one study not reporting rates. The high withdrawal rates were in studies with sample sizes that varied from181 to 475 subjects. A single study tested for differences between groups, and found that dizziness, tremor, weight loss, asthenia, confusion, delirium, and respiratory problems were significantly different. The cluster of reported types of adverse events suggests that gastrointestinal problems (abdominal pain, diarrhea) (placebo = 1 – 9%, all doses = 13 – 28%), nausea and vomiting (placebo = 1 – 9%, all doses = 9 – 75%) and eating disorder (placebo = 2 – 6%, all doses = 5 – 16%) were most frequently reported. Dizziness (placebo = 4 – 13%, all doses = 11 – 38%) and agitation (placebo = 6 – 16%, all doses = 4 – 8%) were also reported.
Posatirelin. Four trials evaluated posatirelin in 931 subjects in a variety of mild to moderate dementia populations (AD, PDD, VaD) using a dose of 10 mg per day over 3 months duration.
Three of the four trials showed statistical significance for general cognitive function and quality of life/ADL (as measured by GBS subscales for these domains). The evidence remains inconsistent for benefit in global assessment (evaluated in only one trial) and behavior/mood (mixed results). Caregiver burden and specific cognitive function were not evaluated in any trial.
Quality scores for reporting adverse events varied from 2 to 4. Withdrawal rates due to adverse events ranged from 0 – 3% in placebo and 0 – 4% in the treatment group. None of the studies tested for significant differences between groups. All studies reported the presence of agitation, and at least three studies reported arrhythmia, nausea/vomiting, headache, rash/skin disorder, and sleep disorder. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 3%, all doses = 1 – 4%), 2) dizziness (placebo = not reported, all doses = 1%), 3) diarrhea (placebo = 2%, all doses = 2%), 4) agitation (placebo = 1 – 5%, all doses = 1 – 5%), and none reported eating disorder as an adverse event.
Rivastigmine. Six studies evaluated 2071 subjects and three of these studies were limited to AD patients only. Doses for rivastigmine varied from 1 to 12 mg, and treatment ranged from 14 to 26 weeks and only one study specified a community sample.
The evidence shows that general cognitive function improves with rivastigmine at a dose of 12 mg, but there is mixed results for efficacy at lower doses. Two trials also evaluated specific cognitive function but the results were not consistent within studies (between general and specific measures) and between studies for these domains. There is consistent evidence of benefit for the outcome of global assessment but the dosage at which this is significant varies highly between studies. In the domains of behavior/mood and quality of life/ADL, the findings were not statistically significant nor consistent; most of these analyses were not based on intention to treat analysis and lack of sufficient power cannot be ruled out. Caregiver burden outcomes were not evaluated by any trial.
Quality scores for reporting adverse events varied from 2 to 5. Withdrawal rates due to adverse events ranged from 4 – 11% in the placebo and 11 – 27% in the treatment group. Two trials demonstrated a dose response; however, one of these trials showed significant differences for nausea and vomiting only, and the other trial showed significant difference for all the adverse events reported. The majority of studies reported dizziness, nausea and vomiting, eating disorder/weight loss, and headache. It should be noted that one study allowed intentional prescribed anti-emetic drugs to increase the tolerance of subjects taking rivastigmine. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 3 – 10%, all doses = 8 – 58%), 2) dizziness (placebo = 0 – 7%, all doses = 6 – 20 %), 3) diarrhea (placebo = 2 – 9%, all doses = 7 – 17%), 4) eating disorder (placebo = 4 – 8%, all doses = 4 – 19%), and 5) agitation was not reported.
Tacrine. Six studies108, 109, 110, 111, 112, 113 evaluated tacrine in 994 subjects predominately with mild to moderate AD at doses of 80 to 160 mg lasting either 12 – 13 or 3 – 36 weeks in duration. Two other studies114, 26 involving 425 patients were non-placebo controlled studies.
A single trial108 was found to show benefit for general cognitive function with a small effect and this was based on a series of related publications. The five trials showing no benefit for general cognitive function comprised small sample sizes and much shorter study duration. Overall, the evidence for benefit for general cognitive function is limited to this single trial. There is evidence for benefit in global function from two of the three trials that evaluated this domain. Changes in behavior/mood, quality of life/ADL domains, specific cognitive function, and caregiver burden were all not significant, but lack of sufficient power cannot be ruled out.
The quality scores for reporting adverse events varied from 1 to 3. The proportion of subjects withdrawing due to adverse events ranged from 0 – 12% for placebo and 0 – 55% in the treatment group. The higher rates of withdrawal were associated with higher doses. Elevated alanine transaminase (ALT) or hepatic abnormality (placebo = 4 – 13%, all doses tacrine = 7 – 67%) was reported in six studies, raising concerns for the potential for serious liver damage. None of these trials tested for differences between treatment and placebo with respect to adverse events. Five of the studies reported nausea and vomiting (placebo = 0 – 9%, all doses = 9 – 37%); gastrointestinal problems; dizziness (placebo = 0 – 16%, all doses = 4 – 14%) was also noted in several studies. Frequencies of other a priori symptoms of interest are as follows: 1) agitation (placebo = 5 – 12%, all doses = 5 – 9%), and 2) diarrhea (placebo = 0 – 13%, all doses = 4 – 18%). There is evidence for the potential for serious adverse events associated with liver function in six trials.
Velnacrine. Three studies evaluated the effects of velnacrine in 774 AD patients with a diagnosis of AD. The doses that were shown to effect significant changes were 75 mg twice daily and 225 mg daily in studies with a 15 and 24 week duration. Location of recruitment was not specified.
Statistically significant effects were observed for general cognitive function, and global assessment in the two studies with sample sizes over 300 subjects. Behavior/mood and caregiver burden showed some benefit in one trial116 at the highest dose only. Quality of life/ADL was tested as a secondary outcome and showed mixed findings.
Quality scores for reporting adverse events were 3 for all studies. Withdrawal rates varied from 0 – 22% for the placebo group and 5 – 33% for the treatment group. None of the studies reported a dose response. None of the studies tested for statistical differences between the placebo and treatment groups. Two studies reported aberrant hematology and hepatic abnormality;116, 117 for these two studies the rates of occurrence were 2 – 21% for placebo, and 32 – 40% for all doses. All studies reported diarrhea and nausea and vomiting. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 0 – 4%, all doses = 3 – 8%), 2) dizziness (placebo = 3%, all doses = 0 – 8%), 3) diarrhea (placebo = 3%, all doses = 2 – 33%), 4) agitation (placebo = 4%, all doses = 1 – 4%), and 5) eating disorder (placebo = 1%, all doses 2 – 4%). The potential for serious liver effects was not well specified in these trials.
Haloperidol. Five studies evaluated the effect of haloperidol relative to placebo in a total of 622 subjects with mild to moderate disease and included AD patients124, 125, 128 and mixed populations (MID/VaD/ PDD).126, 127 One trial128 had only 15 patients, and one trial124 lasted only three weeks. Two studies recruited subjects from institutions; one from the community and two did not specify.
Mixed results were observed for improvement in global assessment. In three of the trials there was benefit in the domain of behavior/mood which reached statistical significance. Two trials evaluated caregiver burden and found no statistically significant differences but lack of sufficient power cannot be ruled out. Few studies evaluated outcomes in quality of life/ADL. Haloperidol did not affect general cognitive function in two trials and was not evaluated in the other studies.
The quality scores for reporting adverse events varied from 1 to 5 and only three of five studies reported withdrawal rates; the proportion of subjects withdrawing due to adverse events ranged from 5% to 17% for placebo and 17 – 33% in the treatment group. One trial124 showed a dose-response effect but the study only lasted for three weeks. Three trials tested for differences between treatment and placebo with respect to extra pyramidal symptoms (placebo = 17 – 32%, all does = 34 – 97%), and two found statistically significant differences.124, 125 One study125 found significant differences between groups for balance-related problems. Although reported by only two trials, the range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 3%, all doses = not reported, and 2) dizziness (placebo = 24%, all doses = 21%). No frequencies were reported for agitation, diarrhea, or eating disorder.
Memantine. Three trials evaluated memantine in 1066 patients, primarily with VaD, with 10 or 20 mg doses lasting 12 or 28 weeks. Disease severity was moderate to severe in a single study132 and mild to moderate in the remaining two studies133, 134. One study included patients that were institutionalized, one from the community and the third did not specify.
There is consistent evidence of benefit for general cognitive function in the two studies that evaluated this domain. The findings for global assessment are mixed. The sole trial that evaluated mixed dementia populations (including some VaD) with moderate to severe dementia found significant differences for global function, behavior/mood, and quality of life/ADL outcomes, but did not evaluate general cognitive function. It should be noted that this trial with mixed populations used half the dose of memantine for half the study duration in patients with greater disease severity, and had approximately half the sample size of the other two trials evaluated in this systematic review. Despite a lower dose, a smaller number of more severely affected patients, and a shorter duration, a statistically significant difference was found.
The quality scores for reporting adverse events varied from 3 to 4. Only two of three studies reported withdrawal rates; the proportion of subjects withdrawing due to adverse events ranged from 7% to 13% for placebo and 9 – 12% in the treatment group. A single trial tested for differences between treatment and placebo, and none of the comparisons were significant. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 3%, all doses = 5%), 2) dizziness (placebo = 3 – 8%, all doses = 6 – 11%), 3) diarrhea (placebo = 4%, all doses = 4%), 4) agitation (placebo = 7 – 8%, all doses = 4 – 5%), and none reported eating disorder as an adverse event.
Selegiline. Six trials135, 136, 249, 138, 139, 140 evaluated selegiline in 733 patients with AD, PDD, and DAT with 10 mg per day and a study duration of 60 days or 2 years. Only one study reported that patients were from institutional settings.
All but one trial that evaluated general cognition showed no statistically significant changes. A single trial found statistical improvements in specific cognitive tests (Sternberg Memory tests); this trial also showed statistically significant improvements in global assessment and behavior/mood. Only this trial, which had the highest quality score (7), showed consistently positive findings across domains tested. Three of the five trials that evaluated part or all of these domains had very small sample sizes and were likely underpowered, possibly accounting for the inconsistent findings. There is some evidence that selegiline and selegiline combined with vitamin E, increases the time to important functional decline milestones; this is based on a single study.
The quality scores for reporting adverse events varied from 0 to 3. The proportion of subjects withdrawing due to adverse events ranged from 0 – 4% for placebo and 0 – 9% in the treatment group. Only one trial135 tested for differences between the treatment and placebo groups and showed that balance and falls were significantly different (worse) between groups (22% for the group with selegiline combined with vitamin E versus 5% in the placebo). When adjusted for multiple comparisons, these were no longer significant. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 2%, all doses = 0%), 2) dizziness (placebo = 2 – 20%, all doses = 0 – 30%), and 3) agitation (placebo = 4 – 16%, all doses = 4 – 23%); no trial reported diarrhea or eating disorder as an adverse event.
Cerebrolysin. Six studies evaluated the effect of cerebrolysin in a total of 819 subjects. All but one of the trials171 included only AD patients with mild to moderate disease. All of the studies used the same dose of cerebrolysin, 30 ml per day for 5 days per week for 4 to 24 weeks. Location of recruitment was not specified.
Cerebrolysin showed a statistically significant effect on cognition in four out of five studies. Although, a pooled estimate for the ADAS-cog was calculated, the model was positive for heterogeneity and the overall estimate was not significant. The results for specific cognitive tests for the three trials that evaluated this domain were inconsistent. Global assessment measures showed a significant effect in five of the trials. This model was also positive for heterogeneity but significant for an overall effect. Two out of three studies showed an effect for behavior/mood, and none of the six studies showed an effect on quality of life/ADL. No study measured caregiver burden.
Two of the six trials scored 5 out of 5 on our quality scale for rating adverse events, yet they did not report any adverse events. Two studies scored 4, and the other two trials scored 3 and 2. All the studies with scores equals to 4 or less tested for statistical differences in adverse events between placebo and treatment groups. Withdrawals due to adverse events were not reported in one study170 and were 1% in two studies173, 168 and none withdrew in three studies.169, 172, 173 One study reported significant differences between treatment and control group173 for weight change, anxiety, and headache. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 10 – 24%, all doses = 3 – 21%), 2) dizziness (placebo = 0 – 12, all doses = 1 – 8%), and 3) agitation (placebo = 1%, all doses = 0%), and none reported diarrhea or eating disorder as an adverse event.
Estrogen. Five studies evaluated estrogens for dementia in 247 patients with primarily mild to moderate AD, with the exception of one study178 that included moderate to severe dementia patients who were all institutionalized. One of the studies with AD patients provided 0.10 mg per day174 by skin patch for 8 weeks and the others used 1.25 mg per day for 12 to 52 weeks.177 The study including severe subjects used 2.5 mg per day for 4 weeks.178
Three trials evaluated general cognitive function and all showed non-significant findings; two of these trials lacked sufficient power for the ADAS-cog. Similarly, two trials evaluated specific cognitive function but results were mixed. Most of the outcomes evaluated in the domains of global assessment, behavior/mood, and quality of life/ADL were secondary outcomes and none showed significance; lack of power could be a factor in these trials.
One of the five trials scored 5 out of 5 on our quality scale for rating adverse events, and surprisingly, this same trial did not report any adverse event. Withdrawal rates due to adverse events ranged from 0 – 5% for placebo and 0 –14% for the treatment group. The most frequently reported adverse event was vaginal bleeding and a single trial reported a significant difference between placebo and treatment group for this symptom. It was not clear from the descriptions provided in the study if they had ascertained whether vaginal bleeding was present prior to the trial commencement. Nausea was the single a priori symptom of interest that was reported and by a single trial; frequencies varied from 0% for the placebo group and 4% for the treatment group.
Ginkgo biloba. Three trials evaluated Ginkgo biloba, 120 to 240 mg per day for 3 to 12 months, in a total of 563 subjects with mixed dementias of mild to moderate severity.
The largest trial179 had the longest treatment interval but the lowest daily dosage and reported a significant effect for general cognitive function but had mixed findings for global assessment. A second large trial181 found positive changes for neuropsychological tests, global assessment, and behavior outcomes with double the dosage of the previously described trial and half the treatment interval. In this same RCT, clinical efficacy was assessed by using a responder analysis, with therapy response being defined as response in at least two of the three variables: CGI (global function), SKT (special cognitive function), and NAB (ADL). A single trial evaluated behavior/mood and was not significant. No trial evaluated caregiver burden or quality of life/ADL.
All three trials scored 3 or greater on the quality scale for rating adverse events. Two studies had no withdrawals due to adverse events, and one trial had a withdrawal rate of 6% for both placebo and treatment groups. Two studies did not report any adverse event. One study reported a statistically significant difference between the treatment and the placebo group for skin disorders. The same study reported gastrointestinal and headache adverse effects, but did not test for statistical differences between the placebo and the treatment group. None of the trials reported the presence of the a priori symptoms of interest.
Idebenone. Four studies185, 183, 182, 184 evaluated the drug idebenone in 1153 subjects of mixed dementia populations of mild to moderate severity; one of these trials26 evaluated idebenone relative to tacrine. Doses varied from 30 mg per day to 360 mg per day, and the treatment interval ranged from 90 days to 60 weeks.
There was evidence of benefit for general cognitive function and global assessment. Several studies evaluated behavior/mood and quality of life/ADL and these outcomes were found to be significantly different. None of the trials evaluated caregiver burden.
Quality scores for reporting adverse events varied from 1 to 5. Rates of withdrawal due to adverse events varied from 0 – 5% for the placebo group and 0 – 5% in the treatment group; a single trial183 did not report withdrawal rates. Two trials183, 185 tested for statistical differences between groups and found no differences. Although no clear pattern emerges, three studies identified at least one balance-related adverse event most consistently reported across studies. The range of frequencies of the a priori symptoms of interest are as follows: 1) nausea (placebo = 2%, all doses = 2 – 11%), 2) dizziness (placebo = not reported, all doses = 2%), and 3) not reported for diarrhea, agitation, or eating disorder as an adverse event.
Oxiracetam. Five studies186, 187, 188, 189, 190 evaluated oxiracetam in 554 subjects with different dementia syndromes of mild to moderate severity. All analyses were observed cases and not ITT. All studies used 1600 mg daily, with one exception189 where the dose ranged between 1600 – 2400 mg per day. The treatment interval ranged from 90 days to 26 weeks.
All outcomes shown to be positive for this drug were based on observed case evaluation. The two trials that evaluate general cognitive function showed benefit. The findings for specific cognitive function were mixed. A single trial evaluated global assessment and showed statistically significant change. Behavior/mood, and quality of life/ADL outcomes showed mixed results. No study evaluated caregiver burden.
The quality scores for reporting adverse events varied from 2 to 5. The proportion of withdrawals due to adverse events varied form 0 – 9% for the placebo group and 0 – 6% for the treatment group. No clear pattern for adverse events is evident, but three of the five studies reported gastrointestinal related problems, primarily associated with abdominal pain. Although, only single trials evaluated the range of frequencies of the a priori symptoms of interest are as follows: 1) dizziness (placebo = not reported, all doses = 11%), and 2) agitation (placebo = 1%, all doses = not reported); no trial reported nausea, eating disorder, or diarrhea as an adverse event.
Pentoxifylline. Three placebo-controlled studies193, 192, 191 evaluated pentoxifylline and one study194 compared pentoxifylline to sulodexide, with a total of 482 subjects with predominately MID. The total dose administered in all studies was 1200 mg per day but varied from 400 mg three times per day to 1200 mg once per day. The treatment intervals ranged from 12 to 36 weeks.
All three placebo trials showed non-significant findings for any primary outcome evaluated on all subjects in the study. It should be noted that two of these trials had very small sample sizes (n = 38, n =28) that were evaluated in the OC analyses; this suggests that the trials lacked sufficient power to evaluate multiple outcomes. The remaining trial had a large sample size (n = 289) and employed an ITT analysis; all primary outcomes evaluated were not significant.
The quality scores for reporting adverse events were generally low, varying from 1 to 3. Withdrawal rates due to adverse events varied from 0 – 25% in the placebo group and 0 – 22% in the treatment group. The two studies that reported adverse events indicated the presence of gastrointestinal disturbances, including abdominal pain or nausea and vomiting (placebo = 7% and all doses = 14%). None of the trials reported dizziness, agitation, eating disorder or diarrhea.
Propentofylline. Four trials197, 196, 250, 198 using propentofylline in 510 patients with AD and VaD were included. A dose of 900 mg per day was consistent across all studies, and the treatment duration ranged from 3 to 12 months.
The two studies with small sample sizes (n = 30) showed no significant results for any outcome evaluated but lack of power cannot be ruled out. There were two trials that found benefit in general cognitive function based on the MMSE. The results for specific cognitive function as measured by the DSST were mixed, as were those for global assessment. Behavior/ mood outcomes were evaluated by a single trial195 and shown to be significant; this same trial evaluated quality of life/ADL and showed no significant difference. No trial evaluated caregiver burden.
The quality scores for reporting adverse events varied from 1 to 4. The percentage of withdrawals varied from 0 – 13% for the placebo group and 0 – 12% for the treatment group. None of the trials tested for differences between groups. Three of the trials195, 197, 198 reported gastrointestinal events that included abdominal pain, constipation, and nausea and vomiting (placebo = 2%, all doses = 7%). Dizziness (placebo = 3 – 5%, all doses = 1 – 6%) was the only other a priori symptom of interest.
Definition of clinically significant or meaningful difference. The stance undertaken in this review has been cautious with regards to interpreting “clinically significant” differences within and across studies. This systematic review has highlighted some of the concerns expressed in the literature on pharmacological efficacy research in dementia. Ultimately, clinical significance is a complex issue, and its definition can vary across individuals and groups of individuals. Wherever possible, attempts were made to identify the magnitude of differences in the studies and the limitations of the data from some of these primary studies.
In drug development programs, an ordered series of trials are undertaken: dose tolerance (phase I), dose finding (phase II), dose efficacy (Phase III), and post-marketing (phase IV). However, due to the pressures on pharmaceutical companies to develop drugs quickly and cost-efficiently, a drug may move into the next phase of development before evidence of the previous phase is known.251 Even when phase III trials are carried out in an adequate manner, the interpretation of the efficacy results is hampered by multiple p-values, disagreement over the need for multiplicity corrections, and the potential for conflicting evidence from trials of different sizes.251 Some of these difficulties can be minimized by measuring a single primary efficacy variable at one point in time and using a p-value of less than 0.025 (one-tailed, as the aim is for the statistical test to determine if the drug performs better than the placebo or low dose).252 This presumes that good dose-response data exist, identifying a single dose level as the best candidate for further evaluation. Lastly, interpreting differences on the basis of statistical significance has long been recognized as problematic. Clinically meaningful change reflects a different level of “significance” and often requires consensus among experts within the field for these criteria.
Issues of diagnosis and severity. Three methodological issues related to population classifications have limited the inferences that can be garnered from this systematic review. The first issue concerns the classification models used for diagnosing dementia; they are not interchangeable among the various types of dementia and the “pre-clinical” forms of slight cognitive impairment. Moreover, there are still concerns about the accuracy of these criteria. For example, in the American Academy of Neurology's (AAN) recent evidence-based review of dementia case definitions, none met the AAN's highest evidence standard. A clinical diagnosis of AD is only 28% specific after age 79 years. Similarly, no dementia screening measure is accurate enough to be recommended by the American Society of Internal Medicine.253 The AAN specifically faulted the emphasis on memory function in dementia case definitions.254 Yet tests like the ADAS-cog emphasize memory loss at the expense of other cognitive domains, especially executive control function,31 and many anti-dementia treatment strategies target neurotransmitters and structures (like acetylcholine and the hippocampus), which mediate memory test performance.
A second consideration in defining populations of dementia patients concerns determination of severity level. The MMSE, although frequently used, may not best capture severity. Many studies were observed to define the severity (mild, moderate, severe) of dementia populations based on the MMSE. For example, a range from 10 to 26 has been used to define a mild to moderate severity level.50 Given that the maximum and minimum instrument scores are 0 and 30, this suggests that the extreme ends of the spectrum, particularly the “severe” end (i.e. <10), represent a very narrow proportion of patients. These two broad categories (mild to moderate and severe) may not actually reflect the cognitive and functional differences in a clinically meaningful manner. The MMSE does not address issues of executive control function (as required by the DSM-IV dementia case definition), which is known to be a good predictor of functional status. From a research perspective, a better classification reflecting disease severity may be an important factor for stratification and determining the efficacy of pharmacological interventions.
Outcome issues. The studies evaluated in our review used 181 different outcomes across seven domains. This raises the issue of which of these outcomes are considered by clinicians to be most “clinically relevant”. Let us assume that the most clinically relevant outcomes for all the drug interventions for dementia are the ADAS-cog and the MMSE because they are very commonly reported in studies.
In this dementia review, numerous studies did not measure outcomes evaluating cognition, as the intended effect of the drug was not always in the domain of cognition (e.g. neuroleptics for behavior control). Moreover, a large number of the studies that used important clinical cognition outcomes, such as the MMSE, did so only to establish baseline severity, or they used it as a secondary outcome. This presents us with some difficulty in the consistency of reporting on this limited set of “clinically relevant” outcomes. There is also the issue of which domain (i.e. cognitive function versus ADL versus behavior) is the most clinically relevant. The FDA guidelines suggest cognition and global assessment; the EMEA guidelines suggest the addition of an ADL or quality of life/ADL measure as being most clinically relevant. Thus, some consensus work needs to be done among experts in the field to determine the most clinically relevant outcomes and domains. For example, the choice of most clinically relevant outcome may depend upon type and stage of dementia (e.g. for mild AD, neuropsychological outcomes may be are the most important domain while for severe AD, behavior may be the most relevant outcome), which may challenge the achievement of consensus.
To our knowledge, no specific set of outcomes that define “clinical relevance” applies to all the drug interventions we evaluated. The FDA has recommended that “dual efficacy” of dementia drug interventions be established by significant change in both a psychological measure and a global change measure. The outcomes measuring these attributes within these two domains were not specified. However, there was a general trend for using the outcomes ADAS-cog and CIBIC+ to capture these two attributes when evaluating drugs for AD populations.
Ideally, all outcomes should have demonstrated acceptable psychometric properties, such as reliability, validity (construct), and responsiveness. We did not a priori evaluate the properties of outcomes reported in the eligible studies. In some cases, these outcomes were developed in non-English languages but the original study was reported in English. In considering the psychometric properties of some of the outcome instruments used, the attribute of responsiveness is critical, and some have suggested that this has not been adequately evaluated in many outcome measures.33, 30, 255
We might envision a clinically relevant pharmacological treatment as one that has made a real difference, where the change is both relevant and important to the patient or to clinicians. This fundamentally shows the difference between clinically significant (relevant and important) versus statistically significant (associated with probabilities), where the latter determines that the results are not due to chance. Moreover, a clinically important change will vary depending on whether importance is defined from the patient or clinician perspective.
Five different levels of responsiveness (ability to detect change) of outcome measures have been defined:256 1) Minimal change potentially detectable (essentially an attribute of the scoring method of the outcome), 2) Minimal change actually detectable beyond measurement error of the instrument (also defined as Minimum Detectable Change (MDI) or Reliability Change Index (RCI), which includes the Standard Error of the Measurement (SEM)), 3) Observed change (often reported as the standardized response mean (SRM) or effect size (ES). 4) Observed change in those estimated to have improved; the key to understanding change in this instance is that an external standard is used to determine whom has improved (often reported as comparison between groups that have improved versus those who have not; the improved group can be defined by either patient and/or clinician or a combination), and 5) Observed change in those estimated to have important improvement (often reported as the minimal clinically important difference and can be determined by the patient or clinician, or a combination of both).
Consider the ADAS-cog and the CIBIC+: The minimal change detectable is 1/70 = 0.0143 for the ADAS-cog and 1/7 = 0.143 for the CIBIC+, suggesting that the ADAS-cog can detect smaller increments of change relative to the CIBIC+. Thus different instruments have differing sensitivities to detecting change. There is scant literature on the responsiveness of outcome measures as defined in number 4 above, observed change in those that have improved, or as in number 5 above, observed change in those estimated to have important improvement. Thus, we have identified a significant gap in the literature with regard to estimating clinically important changes. Much greater consideration of issues of responsiveness should be given in future research in efficacy trials of pharmacological agents. Greater understanding of clinically important change suggests that some of our current judgments of efficacy are limited as these important differences need to be established.
Analysis issues. The inability to estimate the power of a study to detect a difference presented significant limitations in interpreting those studies that showed no significant differences. Similarly, the lack of sufficient data for estimating effect size limited the ability to show the magnitude of the change. It is recommended that future trials evaluating the efficacy of pharmacological agents adhere to the CONSORT guidelines in order to provide sufficient data to estimate power and effect size for all relevant outcomes.
Although the difficulty of maintaining adherence to long-term drug interventions among dementia patients is acknowledged, the ITT analysis should continue to be the analysis of choice in trials. Ideally, both ITT and OC analyses should be presented. If both suggested the same conclusion, confidence in the study results would be increased.
Problems with funding/ sponsorship exclusively from drug companies. The sponsorship of studies by for-profit organizations has led to bias towards the publishing of positive results.257 These findings suggest that there are powerful disincentives for pharmaceutical companies to publish negative trials. This is contrary to what academic-based, non-industry funded trials show, where the publication of negative trials are more likely.
A recent evaluation of FDA databases for antidepressant drugs258 in the US, suggested that less than half of antidepressant trials were negative, which does not correspond to the published literature. In this systematic review, no attempts were made to contact industry for unpublished trials, which introduces the possibility of a bias associated with not reporting negative trials. Additionally, we did not contact authors who did not specify funding sources for their studies. Future research on the efficacy of pharmacological agents to treat dementia should indicate all sources of funding and who undertook the study analyses.
Adverse events. In this systematic review, the type and frequency of adverse events associated with the use of a drug intervention were scrutinized and reported to a greater extent than previous reviews of anti-dementia drugs. Attempts were made to weigh the potential for harm against the benefits when determining the efficacy of pharmacological interventions. Empirical evidence across diverse medical fields indicates that reporting of safety information (including milder adverse events) receives much less attention than the positive efficacy outcomes.35 Thus, it was recognized that an evaluation of the benefits of anti-dementia pharmacological agents alone may present a biased view of the efficacy of the intervention.
The ability to capture and evaluate adverse events proved to be difficult for several reasons. For example, although metrifonate had good evidence of positive effects on cognitive function, it was banned from use due to the risk of respiratory paralysis. The description of serious adverse events in the trials we evaluated did not capture this type of event, nor did different studies identify “serious events” in a consistent manner. This points to several fundamental limitations. The first of these relates to the limitation associated with the RCT design itself, which is less likely than the longitudinal cohort study designs to capture serious adverse events that are rare. Secondly, many trials were of relatively short duration and captured “idealized” dementia populations. Many of these trials were from pre-marketing studies contracted by pharmaceutical companies in carefully controlled research settings. Dementia patients seen in practice may have more complex medical illnesses and are at greater risk for potential side effects. In addition, drugs used in “polypharmacy” have even greater potential for pharmacological interactions. Furthermore, practitioners may prescribe these pharmacological agents for wider indications than originally intended, or may not refrain from withholding the drug from certain high-risk subgroups, leading to increased risk of adverse events. Thus, published rates of adverse events in well-controlled trials may underestimate true rates seen in practice.
Thirdly, by their nature, some adverse events are not easily anticipated, and therefore are not screened for in some trials. Adverse events may be hard to predict or anticipate but can be captured only if a trial protocol was designed to measure these events. This problem is compounded by the lack of consistency in what constitutes “serious” events or how the severity of the typical events is rated. A limited number of standardized instruments exist to capture these events reliably, but the overwhelming majority of studies in this systematic review did not use these instruments. Furthermore, capturing information from individuals with cognitive decline can create problems; the validity of the self-report instrument, even if completed by the caregiver, can be problematic. More research on the reliable collection of adverse events in dementia populations (with compromised cognition) may be required.
A fourth consideration concerns the issue of off-label use of pharmacological agents. Given that only four drugs are currently approved by the FDA for the treatment of dementia, the other 97 interventions evaluated in this review are classified as “off label use” but many are not approved by the FDA and not, therefore, available. For some of these off-label medications the potential mechanism of action on the disease process has not been fully established (if even considered), yet they have been applied to dementia populations. This off-label use of pharmacological agents may present further difficulties in evaluating adverse events.
Few studies evaluated delay of onset or delay in disease progression. A definite gap for evaluating disease onset (as defined by the selection of populations at risk such as MCI populations) has been identified in this review.
Conversely, the need for good evaluation of disease progression in trials was also identified. In general, few studies evaluated subjects in more severe state of the disease. This suggests that a bias exists towards evaluating mild to moderate disease in the trials eligible in this systematic review. This in turn reflects the underlying assumption that the less severe groups are most likely to benefit from drug trials. Since so few studies have evaluated the more severe groups, this assumption may require some empirical justification in future research. Those studies that evaluated severe patients showed some potential for benefit. Future research in this area may require some consensus regarding the classification of severity levels.
Three studies evaluating cerebrolysin,168 selegiline and vitamin E,135 and donepezil61 have shown significant effects in delaying disease progress in mild to moderate61, 168 and moderately severe disease in patients with AD. This delay in progress was expressed in terms of delay in days to primary event135, 61 or statistical differences between placebo at a specified time interval.168 Although these two trials coincidentally evaluated dementia patients over the longest time interval, it did not withdraw the drug at the end of the study. Theoretically, conclusive evidence of disease delay would be demonstrated if the treatment groups did not return to the level of the placebo. Thus, distinguishing between symptomatic and disease modifying effects is not possible unless the drug is withdrawn and the treatment group(s) are observed for these changes.
When studies attempted to evaluate disease progression, long-term (1 year or greater) trials continued in an “open-label fashion”, where blinding was no longer maintained. This limits the confidence that bias did not affect the subsequent changes in the outcomes. It was observed that increasing levels of dropout (for a variety of reasons) also plagued these open-label phases of evaluation. From a practical perspective, maintaining adherence in longer-term trials in dementia patients are challenging,19 particularly for those in the placebo arm or for those interventions that have a high proportion of adverse events.
A number of trial designs have been proposed to capture delay in disease progression versus symptomatic treatment. Some of these trial designs include withdrawal of treatment, active-extension, randomized withdrawal, randomized start, and staggered start designs.235, 236, 19, 21 One important aspect of these designs is the selection of an adequate washout period or an adequate follow-up period. In addition, longer evaluation with survival analyses may be a good strategy to evaluate delay in disease progress for some drugs. One advantage of this design is the selection of clinically relevant milestones (functional changes over time), which was utilized in two studies 63, 136; the selection of such events may merit greater consideration in future trials evaluating delay. A more critical analysis of the staggered/start/stagger withdrawal design in comparison to the survival analysis would be helpful. Also, one could provide a more extensive analysis of the data on propentofylline and vitamin E,136 which represent the most extensive efforts to use the stagger/start/stagger withdrawal and survivor analysis approaches, respectively. Future research seeking to establish efficacy should clearly specify if symptomatic treatment or delay in progression is the therapeutic aim. This is important for determining specifically if efficacy is considered with respect to these two aims. Accordingly, a design that can establish this aim should be selected.
Determining symptomatic treatment versus affecting delay in disease progress. Figure 31
In contrast, Figure 32
What may be most relevant to clinicians are head to head comparisons of the cholinergic modifying neurotransmitter pharmacological agents, particularly those currently approved for the treatment of dementia (tacrine, rivastigmine, galantamine, donepezil) in the United States. The evidence for each of these drugs has been extensively detailed, and the relative merits and handicaps of each were outlined in chapter 3. Relative effectiveness as demonstrated by effect sizes for the ADAS-cog and the CIBIC were also shown in chapter 3. Although, the psychometric properties of these two outcomes are well accepted, comparison across the populations in these pooled estimates may not lend themselves to direct comparison across these four different specific drugs. Thus, inferences about the relative effectiveness of these four medications specific for the treatment of dementia should be made cautiously as head to head comparisons were not undertaken.
From a methodological perspective, addressing the question of being “more effective” requires head to head comparisons of pharmacological interventions.
An evaluation of the trials that undertook direct head to head comparison of two distinct pharmacological agents was limited because only seven trials were identified. Although, these trials may have shown some relative benefit of one drug versus another, the clinical relevance of these particular agents is limited as none of the drugs currently approved by the FDA specifically for the treatment of dementia is represented in these eligible studies. Moreover, these studies are essentially limited to single trials and are not sufficiently strong to base recommendations on the relative effectiveness of drugs. Head to head comparison studies are beginning to appear in abstract form only and a significant gap in the literature has been identified.
In general, very few trials examined the efficacy of dementia drugs across different populations or population characteristics. From the 13 studies that reported stratified analyses, eight different variables were identified, which included age, gender, APOE genotype, disease type, disease severity (as determined by MMSE/ ADAS-cog threshold levels), treatment center, care dependence, and presence of depression. Additionally, three trials were identified that evaluated efficacy in 1) patients with Down's syndrome and dementia, 2) different ethnicities as a function of treatment center in a multicenter trial, and 3) depressed patients. Given the relatively small number of trials evaluating these variables within different populations and different pharmacological interventions, the findings of this review are limited with respect to these patient variables. These reflect merely what has been reported in the literature rather than variables of importance with respect to efficacy of pharmacological therapies. A significant gap in the literature has been identified.
The study population characteristics were detailed for the trials evaluated. A recent study,22 suggests that many “real world” dementia patients in Ontario would not have met the eligibility criteria for participation in several of the cholinesterase inhibitor studies. This study highlights an important limitation of the pharmacological literature in that dementia patients recruited are not representative of the general dementia population. Additionally, clinicians and researchers should note that when a when a drug is approved for use, it is for a specific indication and a specific patient population. Evidence for one type of patient population may not necessarily be applied to another population. This is critical information to have when establishing clinical practice guidelines.
A total of 20 pharmacological interventions in 29 studies211, 220, 238, 171, 200, 199, 146, 68, 181, 184, 133, 134, 132, 161, 89, 91, 93, 247, 187, 191, 192, 194, 193, 100, 98, 196, 195, 245, 217 were applied specifically to VaD classified dementias. The majority of these pharmacological interventions (n = 14) were represented by single trials, these interventions included ateroid, buflomedil, cerebrolysin, sulphomucopolysaccharides (CDP choline), citalopram, donepezil, Ginkgo biloba, idebenone, minaprine, nimodipine, oxiracetam, 5-THF (trazodone), vincamine, and xantinolnicotinate. Six interventions had more than a single trial, and these included Choto-san (n = 2), memantine (n = 3), nicergoline(n = 2), pentoxifylline (n = 4), posatirelin (n = 2), and propentofylline (n = 2). In general, when the drug interventions were shown to be effective, it was in the domains of cognitive function (both general and specific) and global assessment. Other domains were less frequently evaluated. Several trials attempted to test for differences between VaD groups and other dementia types.
Erkinjuntti et al (1997)1 compared six commonly used classification schemes (DSM-III, DSM-III-R, DSM-IV, ICD-9, ICD-10 and the CAMDEX) and demonstrated that the prevalence of dementia can differ by a factor of 10 depending on the diagnostic criteria used. Two other studies have demonstrated that the prevalence of VaD varies with the classification system; therefore these criteria for diagnosis are not interchangeable.10, 11
There is controversy about the validity of the clinical classification of VaD, as autopsy confirmation often does not substantiate the clinical diagnosis.12, 13 The majority of dementias were actually AD with co-existing VaD and PDD lesions.14 In contrast, the clinical accuracy of AD diagnosis is relatively high.7 Future research in vascular dementia should attempt to better distinguish this subgroup.
With rare exceptions, dementias are inevitably progressive and eventually lead to severe cognitive deficits, functional impairment, and often behavioral problems, unless death supervenes from intercurrent disease. The trajectories, sequence of clinical features, and burden on caregivers vary depending upon the type of dementia. For example, cognitive decline typically precedes functional impairment and behavioral disturbances in AD, while behavior and/or language problems typically announce the onset of frontotemporal degeneration.
Physicians and other health care practitioners have numerous roles in the management of individuals with dementia. These include identification, assessment and staging, classification, and prognostication, in addition to treatment of the individual and caregiver and planning for future disabilities (e.g. arranging alternatives to driving, assigning power of attorney and compiling living wills/advance directives).
Given these multiple tasks, how is the treating physician to interpret the results of therapeutic trials, which mostly deal with the pharmacological treatment of individuals with predominantly one type of dementia (AD) in the mild to moderate stages?
The traditional view of most physicians is that treatment success is measured by reversal of a disease, which is not a realistic goal in dementia. (While the older literature suggested that as many as 15 to 30% of dementias were “reversible,” more recent studies indicate that at most a few percent of dementias presenting to physicians are potentially reversible.)
Thus, the treating practitioner must begin by setting a realistic goal for therapeutic intervention. Symptom relief, alleviation of caregiver burden, prevention of complications (such as injury prevention or avoidance of aspiration pneumonia), and delay in progression of disease might be potential treatment targets. From this list, only symptom relief and delay in progression could be inferred from the studies examined in this systematic evidence review.
Outside the specialty clinic or clinical trial setting, most physicians have limited time and resources to expend on their patients with dementia. Few will have access to psychometrists or other individuals capable of administering extensive assessment instruments such as those used in clinical trials (e.g. ADAS-cog). Thus the typical practitioner must be able to complete a brief assessment, which provides sufficient information to determine whether a treatment is 1) indicated and 2) effective.
Deciding if a treatment is indicated depends upon the correct diagnosis (does this person have a dementia, and if so what type?), potential contraindications to the treatment (e.g. active peptic ulcer or heart block in the case of cholinesterase inhibitors), and the severity of disease. Determination of severity of dementia has given rise to several global scores such as the Global Deterioration Scale (GDS)260 and the Clinical Dementia Rating (CDR)261 In practice, the Mini-Mental State Examination (MMSE)50 (a short, 30-item, cognitive screening test) is frequently used as a measure of severity. Not only is it part of the usual diagnostic protocol for suspected dementia, but it also has the advantage of being included in the entry criteria of many of the RCTs of anti-dementia medications. It is therefore useful for determining whether a patient fulfils the appropriate severity criterion for therapeutic intervention.
With regard to deciding whether a treatment is effective, much has been written about the relative importance of statistically significant and clinically significant changes in measures of cognition, function, and behavior in dementia. A distinction must be drawn between clinically detectable change and clinically meaningful change. While psychometric measures (standardized instruments, which are highly reliable and relatively free from the influence of judgment) may detect changes too small to be appreciated by the clinician, clinometric tools (measures that are based on a clinical judgment about an individual patient262) may be considered more relevant to practice. Results expressed as a change from baseline measured by clinometric instruments such as the Clinicians Interview Based Impression of Change (CIBIC) or its derivative the CIBIC plus, which incorporates observations of the caregiver, mimic clinical practice more closely than most psychometric tools. The CIBIC aims to cover multiple domains relevant to the clinician (i.e. cognitive, functional, and behavioral). Clinicians may therefore interpret statistically significant changes on the CIBIC or similar scales with more confidence than changes on the many psychometric scales used in therapeutic trials. However, if an effect size of ~0.5 or greater is included in the analysis of psychometric outcomes, one can be reasonably confident of a robust response to the treatment under investigation.
Another measure of efficacy is the response rate—the percentage of study participants who experience an improvement (defined as a change of a specific magnitude on one or more scales.) This figure is useful for the clinician who may then indicate to the individual with dementia the chances of a positive outcome from the planned treatment.
Clinicians are faced with a bewildering array of results from clinical trials. Convergence of results (different studies of the same medication showing similar results) or studies of drugs in the same class showing similar results may help to reassure clinicians that the results are genuine. Conversely, when trials show differing results, clinicians should be especially vigilant in accepting only the results of the more positive trials.
As always, the translation of clinical trial evidence into practice demands careful scrutiny by the practitioner. Attention to external validity (is my patient sufficiently similar to those in the clinical trial that I can expect the same result from treatment?), interpretation of the outcome measures (clinically as well as statistically significant benefit), and weighing potential risks against potential benefits remain the responsibility of the treating practitioner.
A systematic review that has evaluated 91 pharmacological interventions in 186 RCTs with high internal validity has several limitations. The studies selected for this review are English-language trials. Based on our search results, we estimate that we could have potentially retrieved 1385 foreign-language articles (after de-duplication 1213) distributed among databases as follows: 346 from Cochrane Central, 444 from EMBASE, 559 from MEDLINE/PreMedline® 36 from other databases before review for title and abstract. If we assumed the same rate of potentially eligible studies for these non-English studies, an additional 16 non-English studies may have been eligible for review. It is possible that agents, such as Ginkgo biloba, may have had important trials published in non-English languages. The budget and timelines available, however, were a limiting factor to obtaining, translating, and abstracting non-English trials.
Secondly, no contact with authors of the eligible trials was undertaken to collect additional unpublished studies or provide results/data that were not presented in the published article. Although contact with the original authors of the trials (to supplement the missing information from the included studies) could have compensated for many of the reporting challenges we encountered, this strategy was not feasible given the timeline of this systematic review. Our experience at the McMaster EPC suggests that the majority of authors do not respond in a timely fashion if at all. Additionally, efforts were not made to contact industry for unpublished trials. It is likely that industry sponsors of trials that are not published in the public domain are under no obligation to share trials (particularly negative trials). Not contacting authors of eligible trials for additional data and not attempting to locate unpublished trials (either by other authors/ experts or by industry) may introduce publication bias in this systematic review.
Thirdly, we employed two eligibility criteria that may account for some differences in acceptance of well-known studies. The first of these was a minimum threshold for quality score as determined by the modified Jadad scale. Despite the fact that this scale has excellent reliability and content validity, some may argue that the threshold score of 3 is arbitrary and may have unnecessarily eliminated studies of historical importance. It is our view that given the amount of literature available, all efforts should be aimed at selecting only the trials with the highest internal validity rather than selecting the largest number of eligible trials.
The second eligibility criteria concerned the exclusion of crossover trials. Although crossover trials are suitable for chronic diseases, they may be prone to period effects or period-by-treatment interactions. Period effects are systematic changes in the outcome that apply to all patients due to temporal changes in the disease or to the measurement instrument. Period-by-treatment interactions occur when the efficacy of the intervention varies by period. This is a significant concern for studies that attempt to show disease modification and are carried out over a longer period of time. Additionally, a carry-over effect may occur if the washout period is not adequate. In addition to the weaknesses of this design, some limitations arise when considering the potential for meta-analytic analyses. Traditionally, first period data from a crossover trial are abstracted and can be potentially combined with parallel trials for analyses of a pooled estimate; the reporting of the study results (positive or negative) would also be based on this first period data. In a preliminary phase of the review, several crossover trials were examined. It was noted that many did not report first period data, which precludes any potential for combining with parallel trials; many trials also did not undertake statistical tests during the first experience, thus making it difficult to report the direction of the findings, even if the trial could be combined. Finally, the TEP considered the fact that this systematic review was evaluating a variety of drug interventions administered over differing time intervals, and so period effects might be an important source of bias. For all these reasons, the TEP made the decision to exclude crossover trials from this systematic review. Thus, this review is limited to evidence based on high-quality parallel trials only.
A final limitation to our study was the use of a checklist developed to address the issue of quality of reporting adverse events. The Jadad scale was not designed to evaluate the quality of reporting adverse events. Thus, when determining the “harms” or risks associated with an intervention, the quality or “internal validity” of collecting and reporting these adverse events needed to be evaluated. Although our checklist has face validity, it has not undergone formal psychometric testing.
The findings of this report suggest several important areas for future research on pharmacological treatments for dementia. These include:
Analytic Framework of the intended aim of the therapy on the disease
Better conceptualization and research design to capture “delay in progression”.
Clearer consensus on defining efficacy (benefits and clinically important change).
Longer term studies (> 12 months).
Potential for bias
Clarification of the role of industry sponsorship; one recommendation should be that all studies are required to disclose such information in future, including who analyzed the results.
More concerted effort to incorporate unpublished studies and negative trials in future reviews.
Population
Inclusion of the spectrum of severity in the patient populations (there is nothing to suggest that severe patients may not benefit from pharmacotherapy aimed at cognitive function improvement).
The need for validation of trials and testing processes within cultures other than the traditional white population.
Examining the efficacy of interventions in different sub-populations (age, disease severity levels, etc.).
Better measurement and reporting of important patient characteristics (including baseline cognition scores, co-morbid conditions, the use of other medications, etc.).
Inclusion of MCI type groups of subjects to evaluate “delay of onset”.
Outcomes
Expansion of outcomes collected to include more than just cognitive function, and especially include caregiver burden and quality of life/ADL.
Clear operational definitions for determining critical outcomes (delay to onset, delay to progression, important effect size, etc).
Better understanding of how outcomes perform cross-culturally.
Production of other diagnostic instruments to detect both onset and responses to therapies across varied cultural groups.
Improvement in the reporting of adverse events to evaluate harm.
Analysis
Appropriate analytical strategies that take into account intention to treat (ITT)/ last observation carried forward (LOCF) analyses; where possible both observed case and ITT/LOCF analyses should be presented.
Sufficient data to estimate effect size, taking into account variability in both treated and control populations on the primary measures.
Reporting the power of the study when findings are non-significant.
Intervention
Undertake more studies with direct comparison of drugs to determine the relative efficacy of agents.
Improved description of the titration process.
Improved collection of adverse events undertaken in a systematic fashion with standardized instruments.
| AAMI | Age-Associated Memory Impairment |
| ABID | Agitated Behavior Inventory for Dementia |
| ABS | Adaptive Behavior Scale |
| ACES | Agitation-Calmness Evaluation Scale |
| ACFP | American College of Family Physicians |
| AChE | Acetycholinesterase |
| ACP-ASIM | American College of Physicians - American Society of Internal Medicine |
| ACPT | Auditory Continuous Performance Test |
| ACTH | Adrenocorticotropic homone |
| AD | Alzheimer's Disease |
| ADAS | Alzheimer's Disease Assessment Scale |
| ADAS-11; ADAS-13 | Alzheimer's Disease Assessment Scale (11 and 13 items) |
| ADAS-Cog | Alzheimer's Disease Assessment Scale-Cognitive and Non-Cognitive Sections |
| ADAS-NonCog | |
| ADCS-ADL | Alzheimer's Disease Cooperative Study - Activities of Daily Living |
| ADCS-CGIC | Alzheimer's Disease Cooperative Study - Clinical Global Impression of Change |
| ADFACS | Alzheimer's Disease Functional Assessment and Change Scale |
| ADL; ADLC | Activities of Daily Living (Checklist) |
| ADL-BDRS | Activities of Daily Living-Blessed Dementia Rating Scale |
| ADL-PDS | Activities of Daily Living- Progressive Deterioration Scale |
| ADS | Alzheimer's Deficit Scale |
| ADSS | Alzheimer's Disease Symptomatology Scale |
| AFBS | Aversive Feeding Behavior Scale |
| AGGR | Aggressiveness subscale of the Personality Psychopathology Five (PSY-5) Scales |
| AGS-E | Assessment of Global Symptomatology-Elderly |
| AHRQ | Agency for Healthcare Research and Quality |
| AIDS | Acquired Immune Deficiency Syndrome |
| AIMS | Abnormal Involuntary Movement Scale |
| ALCAR | Carnitine |
| AMED | Allied and Complementary Medicine Database |
| AMI | Attention Matrices |
| AMPA | Alpha-amino-3-hydroxy-5-methyl-4-isoxazole proprionic acid |
| AMTS | Abbreviated Mental Test Score |
| APOE | Apolipoprotein E gene |
| BADL | Basic Activities of Daily Living |
| BARS, BAS | Barnes Akathisia Rating Scale |
| BCRS | Brief Cognitive Rating Scale |
| BDI | Beck Depression Inventory |
| BDRS | Blessed Dementia Rating Scale |
| BEHAVE - AD | Behavioral Pathology in Alzheimer's Disease Rating Scale |
| BePU | Berlin rating scale for psychomotoric restlessness |
| Bf-S | Zerssen Adjective Mood Scale (German test: Befindlichkeitsskala) |
| BGP | Behavioural Rating Scale for Geriatric Patients |
| BGP | Behavioural Rating Scale for Geriatric Patients |
| BI | Barthel Index |
| bid | Twice a day |
| BL-A | Blessed A scale |
| Blessed-D BDRS | Blessed Dementia Rating Scale |
| BMI | Body Mass Index |
| BMICT | Blessed Memory Information and Concentration Test |
| BMY | Nootropic agent; Bristol-Myers Squibb |
| BNT | Boston Naming Test |
| BPRS | Brief Psychiatric Rating Scale |
| BRMS | Bech-Rafaelsen Mania Scale |
| BRSD | Behavioral Rating Scale for Dementia |
| BSRT | Babcock Story Recall Test |
| BSRT | Buschke Selective Reminding Test |
| BSS | Behavioral Syndromes Scale for Dementia |
| BTT | Block Tapping Test |
| CADISIL | Cerebral Autosomal Dominant Ischemia with Subcortical Leukoencephalopathy |
| CAMCOG | Cognitive section of the Cambridge Examination for Mental Disorders in the Elderly |
| CAMDEX | Cambridge Examination for Mental Disorders in the Elderly |
| CAMTOT | CAMCOG Total Score |
| CANTAB | Cambridge Automated Neuropsychological Test Assessment Battery |
| CAPE | Clifton Assessment Procedures for the Elderly |
| CASE | Clifton Assessment Scale for the Elderly |
| CASI | Cognitive Abilities Screening Instrument |
| CATS | Caregiver's Activity Time Survey |
| CAUST | Canadian Utilization of Service Tracking questionnaire |
| CBC | Complete Blood Count |
| CBM 36–733 | 2-methyl-alpha-ergokryptine |
| CCASSS | Computerized Cognitive Assessment System Speed Score |
| CCT | Controlled Clinical Trial |
| CDR, CDRS | Clinical Dementia Rating; Clinical Dementia Rating Scale |
| CDR-NH | Clinical Dementia Rating - Nursing Home Version |
| CDR-SB | Clinical Dementia Rating - Sum of Boxes |
| CDT | Clock-Drawing Test |
| CEB | Clinical Epidemiology and Biostatistics |
| CERAD | Consortium to Establish a Registry for Alzheimer's Disease |
| CERE | Cerebrolysin |
| CETM | Dynamic measure of comprehension process (Spilich) |
| CGAE | Clinical Global Assessment of Efficacy |
| CGC+ | Clinical Global Change-Plus |
| CGI | Clinical Global Impression |
| cGIC | Caregiver-rated Global Impression of Change |
| CGIC | Clinical Global Impression of Change |
| CGI-GI | Global Improvement |
| CGI-CGC | Clinical Global Impression-Clinical Global Change |
| CGI-S; CGI-S/C | Clinical Global Impression-Severity/Change |
| CGRS | Clinicians' Global Rating Score |
| chisq | Chi-Square Test |
| chisq M-H | Mantel-Haenszel Chi-Square Test |
| CI | Confidence interval |
| CIBI | Clinician's Interview-Based Impression |
| CIBIC | Clinician's Interview-Based Impression of Change |
| CIBIC+ | Clinician's Interview Based Impression of Change plus Caregiver |
| CIBIS+ | Clinician's Interview-Based Impression of Severity with Caregiver Input |
| CINAHL | Cumulative Index to Nursing & Allied Health Literature ® |
| CIND | Cognitive Impairment Not yet Diagnosed |
| CLEX | Clinical Examination |
| CloND | Cognitive Loss No Dementia |
| CMAI | Cohen Mansfield Agitation Inventory |
| CNTB | Computerized Neuropsychological Test Battery |
| COSTART | Coding Symbols for a Thesaurus of Adverse Reaction Terms |
| COWAT | Controlled Oral Word Association Test |
| CPRS | Comprehensive Psychopathological Rating Scale |
| CPT | Cognitive Performance Test |
| CSDD | Cornell Scale for Depression in Dementia |
| CSGDS | Collateral Source Geriatric Depression Scale |
| CSI | Caregiver Stress Inventory |
| CSS | Caregiver Stress Scale |
| CT | Computerized Tomography |
| CVD | Cerebrovascular Disease |
| CVLT | California Verbal Learning Test |
| d | day |
| d | Effect Size Value - (d) is the average amount of change in standard deviation units achieved by individuals in a treated group versus the change achieved by members of a control/comparison group for a particular study |
| DAD | Disability Assessment for Dementia |
| DAT | Dementia Alzheimer's Type |
| D-B | Delay relative to Baseline |
| DBDS | Dementia Behavior Disturbance Scale |
| DCT | Digit Copying Test |
| DDAVP | Deamino-D-Arginine-Vasopressin |
| DEK | Dihydroergokryptine |
| Df | Degrees of Freedom |
| DMR | Dementia Questionnaire for Mentally Retarded Persons |
| DMSE | Delayed Matching-to-Sample Exam |
| D-P | Delay relative to Placebo |
| DPZ | Donepezil |
| DRS | Dementia Rating Scale |
| DSCS | Depressive Symptoms Collateral Source |
| DSM | Diagnostic and Statistical Manual of Mental Disorders (Edition III, III-R, IV) |
| DSPT | Digit Span Test |
| DSS | Depressive Signs Scale |
| DST; DSST | Digit Symbol (Substitution) Test |
| DTIC | Discovering Things in Common |
| e.g., | example |
| ECG | Electrocardiogram |
| EEG | Electroencephalography |
| EFR | Emotional Face Recognition |
| EIS | Efficacy Index Score |
| EMBASE | Excerpta Medica Database |
| EPS | Extrapyramidal Symptoms |
| ERP | Event-Related Potential |
| ESRS | Extrapyramidal Symptom Rating Scale |
| FAST | Functional Assessment Staging |
| FCCA | Final Comprehensive Consensus Assessment |
| FCMT | Figure Copy/ Memory Test |
| FDA | Food and Drug Administration |
| FDG-PET | Positron Emission Tomography with 18-fluorodeoxyglucoseis |
| FIGT | Figure Detection Test |
| FIM | Functional Independence Measure |
| FRS | Functional Rating Scale test |
| g | gram |
| GABA | Gamma-aminobutyric acid |
| GBS | Gottfries-Bråne-Steen |
| GBS-SDS | Gottfries-Bråne-Steen - Scale for Dementia Syndromes |
| GDS | Global Deterioration Scale |
| GERRI | Geriatric Evaluation by Relative's Rating Instrument |
| GIS | Global Improvement Scale |
| GM-1 | Monosialoganglioside |
| GMS-A | Geriatric Mental State questionnaire |
| GPI-E | General Psychiatric Impression-Elderly |
| GS | Gestalt Scale |
| h | hour |
| HAM-A; HARS | Hamilton Anxiety Rating Scale |
| HAM-D; HDRS | Hamilton Depression Rating Scale |
| HDS-R | Hasegawa Dementia Scale-Revised |
| HIS | Hachinski Ischemic Score |
| HIV | Human Immunodeficiency Virus |
| HMII | Hachinski-Marshall Ischaemic Index |
| HVLT | Hopkins Verbal Learning Test |
| IADL | Instrumental Activities of Daily Living |
| ICC | Item Characteristic Curve analysis |
| ICD | International Classification of Diseases (Version 9 or 10) |
| IDDD | Interview for Deterioration in Daily Living Activities in Dementia-complex task |
| IF | Industry Funded |
| IM | Intramuscular |
| I-P | Improvement relative to Placebo |
| IPSC-E | Raskin's and Crook's Inventory of Psychic and Somatic Complaints for the Elderly |
| IQCODE | Informant Questionnaire on Cognitive Decline in the Elderly |
| IS | Industry provided Supplies |
| ITT | Intention-to-treat |
| IU | International Units |
| kg | kilogram |
| KOLT | Kendrick Object Learning Test |
| LAS | Luria Alternating Series |
| lbs | pounds |
| LFT | Liver Function Test |
| LMT | Logical Memory Test |
| LNNB | Luria-Nebraska Neuropsychological Battery |
| LOCF | Last Observation Carried Forward |
| LPRS | London Psychogeriatric Rating Scale |
| LRU | Lipasemic Releasing Units |
| m | month |
| M | male |
| MAACL-R | Multiple Affect Adjective Checklist-Revised |
| MACF | Microtubule Actin Crosslinking Factor |
| MADR-S | Montgomery-Asberg Depression Rating Scale |
| MCI | Mild Cognitive Impairment |
| MCPT | Modified Continuous Performance Test |
| MDB | Mental Deterioration Battery |
| MeSH | Medical Subject Heading |
| μg | microgram |
| mg | milligram |
| MID | Multi Infarct Dementia |
| Min | Minimal |
| MITT | Modified Intention-to-treat |
| ml | milliliter |
| MMSE (MMSE-CE) | Mini-Mental Status Exam (estimated score) |
| CMMSE | Cantonese MMSE |
| MMMSE | Modified MMSE |
| SMMSE | Standardized MMSE |
| MNLT | Modified Names Learning Test |
| Mod | Moderate |
| Modly Sev | Moderately Severe |
| MQ | Memory Quotient |
| MRI | Magnetic Resonance Imaging |
| MRS | Magnetic Resonance Spectroscopy |
| MU-EPC | McMaster University Evidence-based Practice Center |
| MWF | Mattis Word Fluency |
| MX | Mixed results |
| n | number included in study |
| N | No |
| NA | Not available |
| NAA | Nuremberg gerontopsychological inventory for Assessing Activities of daily living |
| NAB | Nurnberger-Alters-Beobachtungs-Skala |
| NAC | N-Acetylcysteine |
| NAI | Nuremberg Age Inventory |
| NART | Nelson Adult Reading Test |
| NCT | Number Connection Test |
| NDT | New Dot Test |
| NI | Non-Industry funding source |
| NIMCS | Newcastle Memory, Information and Concentration Scale |
| NINCDS | National Institute of Neurological and Communicative Disorders and Stroke |
| NINCDS-ADRDA | National Institute of Neurological and Communicative Disorders and Stroke - Alzheimer's Disease and Related Disorders Association |
| NINDS-AIREN | National Institute of Neurological Disorders and Stroke - Association Internationale pour la Recherche et l'Enseignement en Neurosciences |
| NLT | Names Learning Test |
| NMDA | N-methyl-D-aspartate |
| NMICS | Newcastle Memory, Information and Concentration Scale |
| NMS | Nowlis Mood Scale |
| NNI | Number Needed to Intervene |
| NOSGER | Nurses Observation Scale for Geriatric Patients |
| NOSGER-IADL | Nurses Observation Scale for Geriatric Patients - Instrumental Activities of Daily Living subscale |
| NOSIE | Nurses Observation Scale for Inpatients |
| NPI | Neuropsychiatric Inventory |
| (NPI-4, NPI-10) | Subscores 4,10 |
| NPI-NH | Neuropsychiatric Inventory - Nursing Home Version |
| NR | Not Reported |
| NRSMG | Non-Randomised Studies Methods Group |
| NS | Not significant |
| NSL | Neuropsychological Aging Self-Evaluation - List for Age Symptoms |
| NST | Non-Stress Test |
| NT | Not tested |
| OARS - ADL | Older Americans Resource Scale |
| OAS | Overt Aggression Scale |
| OC | Observed Cases |
| OLT | Object Learning Test |
| OMDR | Oculomotor Delayed Response |
| OR | Odds Ratio |
| ORG 2766 | Adrenocorticotropic hormone derivative |
| OXIR | Oxiracetam |
| oz | ounce |
| p | p value |
| P300 | Electrophysiological potential that is indicator of associative and cognitive processes and latency in decision making processes |
| PAD | Presenile Alzheimer's Disease |
| PADL | Performance of Activities of Daily Living |
| PANSS-EC | Positive and Negative Syndrome Scale-Excited Component |
| PD | Parkinson's Disease |
| PDD | Progressive Degenerative Dementia |
| PDS | Progressive Deterioration Scale |
| PDSD | Primary Degenerative Senile Dementia |
| PET | Positron Emission Tomography |
| PGIR | Patient's Global Improvement Rating |
| PI | Partially funded by Industry |
| PICD | Presenile Idiopathic Cognitive Decline |
| POMS | Profile of Mood States |
| PRL | Prolactin |
| PSMS | Physical Self-Maintenance Scale |
| PSP | Progressive Supranuclear Palsy |
| PSQI | Pittsburgh Sleep Quality Index |
| qid | Four times daily |
| QoL | Quality of Life |
| R | Correlation Coefficient |
| RA | Research Assistant |
| RAGS; RAGS-E | Relative's Assessment of Global Symptomatology (Elderly) |
| RAPSU | Scale for psychomotoric agitation |
| R-AVL | Rey auditory-verbal-learning test |
| RCT | Randomized Controlled Trial |
| RDS | Rapid Disability Scale |
| RefMan | Reference Manager Version 10® |
| RGRS | Relatives' Global Rating Score |
| RMBPC | Revised Memory and Behavior Problems Checklist |
| RMT | Rey Memory Test; Randt Memory Test |
| RMT-A&R | Randt Memory Test - Acquisition and Recall |
| RMT-DR | Randt Memory Test - Delayed Recall |
| RMT-MI | Randt Memory Test - Memory Index |
| RPM | Raven's Progressive Matrices |
| RPT | Rivermead Behavioural Memory Test-Profile Score |
| RR | Relative Risk |
| RT | Reaction Time |
| RTI | Research Triangle Institute |
| SADS | Schedule for Affective Disorders and Schizophrenia |
| SAS | Simpson-Angus Scale |
| SAS-G | Self Assessment Scale - Geriatric |
| S-B | Stabilization relative to Baseline |
| SBI | Spontaneous Behavior Interview |
| SC | Significant change |
| SCAG | Sandoz Clinical Assessment - Geriatric |
| SCB | Screen for Caregiver Burden |
| SCWIT | Stroop Color Word Interference Test |
| SD | Standard Deviation |
| SDAT | Senile Dementia of the Alzheimer's Type |
| SEM | Standard Error |
| Sev | Severe |
| SF-36 | Medical Outcomes Study Short-Form 36-Item Health Survey |
| SGRS | Stockton Geriatric Rating Scale |
| SHGRS | Stuard Hospital Geriatric Rating Scale |
| SIB | Severe Impairment Battery |
| SIP | Sickness Impact Profile |
| SKT | Syndrome Kurz test; Syndrome Short Test |
| SMQ | Squire's Memory Questionnaire |
| SMST | Sternberg's Memory Scanning Test |
| SPECT-TcHMPAO | Single Photon Emission Computed Tomography with hexamethylpropyleneamineoxime |
| SPET | Single Photon Emission Tomography |
| SRT | Selective Reminding Procedure |
| SRT-DR | Selective Reminding Procedure-Delayed Recall |
| SWFT | Semantic Word Fluency Test |
| TEP | Technical Expert Panel |
| TESS | Treatment Emergent Symptom Scale |
| TESS-DOTES | Dosage Record and Treatment Emergent Symptom Scales |
| tid | Three times daily |
| TK | Token Test |
| TOO | Task Order Officer |
| TP | Toulouse Piéron |
| TPAT | Toulouse-Pieron Attention Test |
| TSI | Test for Severe Impairment |
| UK | United Kingdom |
| UKU | Side effect rating scale |
| UPDRS | Unified Parkinson's Disease Rating Scale |
| US | United States |
| VaD | Vascular Dementia |
| VAMS | Visual Analog Mood Scale |
| VAS | Visual Analogue Scales |
| VHB | Videorecorder Home-Behavioral assessment |
| vs. | versus |
| w | week |
| WAIS | Wechsler Adult Intelligence Scale |
| WAIS-DI | Deterioration Index |
| WAIS-DSPT | Wechsler Adult Intelligence Scale - Digit Span Test |
| WAIS-DSST | Wechsler Adult Intelligence Scale -Digit Symbol Substitution Test |
| WAIS-DTIC | Wechsler Adult Intelligence Scale - Discovering Things in Common |
| WAIS-VOC | Wechsler Adult Intelligence Scale-Vocabulary Subset |
| WHO | World Health Organization |
| WLM | Word List Memory test |
| WMS-MQ | Wechsler Memory Scale-Memory Learning Restoration |
| WMS-R | Wechsler Memory Scale-Revised |
| x2 | chi-square |
| y | year |
| Y | yes |
| ZVT | Zahlen-Verbindungs Test -Trail Making Test |
Search strategy executed on February 3, 2003
Same search executed on February 4th, 2003 on the online version of the Cochrane Library for the new references <1st Quarter 2003>:
(mild cognitive impairment or MCI).tw.
((cognitive impairment not dementia) or CIND).tw.
((cognitive loss not dementia) or CLOND).tw.
Delirium, Dementia, Amnestic, Cognitive Disorders/
exp Amnesia/
Cognition Disorders/
exp Dementia/
exp tauopathies/
dement:.tw.
Alzheimer:.tw.
Huntington disease/
Lewy: ajd8 bod:.tw.
((cognit: or memory or mental:) adj8 (decli: or impair: or los: or deteriorat:)).tw.
(chronic adj8 cerebrovascular).tw.
supra-nuclear palsy.tw.
(normal pressure hydrocephalus adj8 shunt:).tw.
benign senescent forgetfulness.tw.
(cerebr: adj8 deteriorat:).tw.
cerebr: ajd8 insufficien:.tw.
(confusion: or confused).tw.
(pick: adj8 disease).tw.
(creutzfeldt: or JCD: or CJD:).tw.
(Huntington: or Huntingdon).tw.
Binswanger:.tw.
brain atrophy.tw.
exp Cerebral Amyloid Angiopathy/
neurofibrillary tangles/
senile plaques/
neuropil threads/
spongiform encephalopathy.tw.
exp Hypothyroidism/
neurosyphilis/
exp amyloid beta-protein/ not (Down syndrome/ or trisomy 21.tw.)
(CADISIL or cerebral autosomal dominant ischemia with subcortical leukoencephalopathy).tw.
(corticobasil ganglionic degeneration or cortical basal degeneration or corticabasal ganglionic degeneration).tw.
multisystem atrophy.tw.
exp alcohol amnestic disorder/
(alcohol adj3 amnestic).tw.
or/1–38
Database: Pre-MEDLINE, MEDLINE on OVID
Search strategy executed on February 4, 2003
randomized controlled trial.pt.
controlled clinical trial.pt.
controlled clinical trials/
(clinical trials, phase II or clinical trials, phase III or clinical trials, phase IV or multicenter studies).sh.
random allocation.sh.
double blind method.sh.
cross-over studies.sh.
single-blind method.sh.
clinical trial.pt.
(clin: adj25 trial:).ti,ab.
((singl: or doubl: or trebl: or tripl:) adj25 (blind: or mask:)).ti,ab.
placebos.sh.
placebo:.ti,ab.
random:.ti,ab.
or/1–14
comparative study.sh.
exp evaluation studies/
follow up studies.sh.
prospective studies.sh.
or/16–19
(tu or th).xs.
treatment outcome/
exp therapeutics/
or/21–23
20 and 24
15 or 25
(mild cognitive impairment or MCI).tw.
((cognitive impairment not dementia) or CIND).tw.
((cognitive loss not dementia) or CLOND).tw.
exp dementia/
exp tauopathies/
(dement: or alzheimer:).tw.
amentia.tw.
frontotemporal lobar degeneration.tw.
hiv-associated cognitive motor complex.tw.
encephalopathy, aids.tw.
encephalopathy, hiv.tw.
mesulam syndrome.tw.
progressive nonfluent aphasia.tw.
binswanger disease.tw.
binswanger encephalopathy.tw.
leukoencephalopathy, subcortical.tw.
subcortical arteriosclerotic encephalopathy.tw.
chronic progressive subcortical encephalopathy.tw. or alcohol amnestic disorder/ or alcohol induced disorders, nervous system/ or (alcohol adj3 amnestic).tw. or (alcohol adj2 dysmestic).tw. or (ethanol adj3 nervous system disorders).tw. or ethyl alcohol abuse neurologic syndromes.tw.
(lewy: bod: adj8 disease).tw.
brain atrophy, circumscribed lobar.tw.
(pick: adj8 disease).tw.
exp amyloid beta-protein/ not (down syndrome/ or trisomy 21.tw.)
exp cerebral amyloid angiopathy/
neurofilament proteins/
tau proteins/
neurofibrillary tangles/
neuropil threads/
senile plaques/
(Corticobasil ganglionic degeneration or cortical basal degeneration or cortica).tw.
(CADISIL or Cerebral autosomal dominant ischemia with subcortical leukoencephalopathy).tw.
Multisystems atrophy.tw.
huntington disease/
hydrocephalus, normal pressure/
Creutzfeldt-Jakob syndrome/
spongiform encephalopathy.tw.
(cjd or jcd).tw.
Creutzfeldt-Jakob disease.tw.
spongiform encephalopathy.tw.
exp Hypothyroidism/
exp Vitamin B 12 Deficiency/
exp Neurosyphilis/
or/27–67
26 and 68
animal.sh.
69 not 70
not (comment or editorial or news or letter).pt.
and eng.la.
limit 73 to yr=1998-2003
Database: EMBASE <1996 to 2003 Week 5> on OVID
Search strategy executed on February 6, 2003
(mild cognitive impairment or MCI).tw.
((cognitive impairment not dementia) or CIND).tw.
((cognitive loss not dementia) or CLOND).tw.
exp dementia/
(dement: or alzheimer:).tw.
amentia.tw.
frontotemporal lobar degeneration.tw.
hiv-associated cognitive motor complex.tw.
encephalopathy, aids.tw.
encephalopathy, hiv.tw.
mesulam syndrome.tw.
progressive nonfulent aphasia.tw.
binswanger disease.tw.
binswanger encephalopathy.tw.
leukoencephalopathy, subcortical.tw.
subcortical arteriosclerotic encephalopathy.tw.
chronic progressive subcortical encephalopathy.tw.
exp Korsakoff psychosis/ or exp Wernicke Korsakoff syndrome/
(alcohol adj3 amnestic).tw.
(alcohol adj2 dysmnestic).tw.
(ethanol adj3 nervous system disorders).tw.
ethyl alcohol abuse neurologic syndromes.tw.
(Lewy: bod: adj8 disease).tw.
brain atrophy, circumscribed lobar.tw.
(Pick: adj8 disease).tw.
exp brain atrophy/ or exp brain cortex atrophy/ or exp brain degeneration/ or exp corticobasal degeneration/ or exp lewy body/ or exp neurofibrillary tangle/ or exp neuropil thread/ or exp organic brain syndrome/
exp amyloid beta-protein/ not (exp Down syndrome/ or trisomy 21.tw.)
exp Vascular Amyloidosis/
exp Neurofilament Protein/
Tau Protein/
Neurofibrillary Tangle/
Neuropil Thread/
Senile Plaque/
(corticobasil ganglionic degeneration or cortical basal degeneration or corticobasal degeneration).tw.
(CADISIL or Cerebral autosomal dominant ischemia with subcortical leukoencephalopathy).tw.
multisystems atrophy.tw.
Normotensive Hydrocephalus/
Creutzfeldt Jakob Disease/
exp Brain Spongiosis/
spongiform encephalopathy.tw.
(CJD or JCD).tw.
Creutzfeldt-Jakob disease.tw.
exp Hypothyroidism/
Cyanocobalamin Deficiency/
Neurosyphilis/
or/1–45
multicenter study/ or phase 2 clinical trial/ or phase 3 clinical trial/ or phase 4 clinical trial/ or randomized controlled trial/ or exp postmarketing surveillance/
randomization/
crossover procedure/ or double blind procedure/ or experimental design/ or latin square design/ or parallel design/ or single blind procedure/
(clin: adj25 trial:).ti,ab.
((singl: or doubl: or trebl: or tripl:) adj25 (blin: or mask:)).ti,ab.
Placebo/
placebo:.ti,ab.
random:.ti,ab.
exp comparative study/ or exp drug comparison/
exp “evaluation and follow up”/
longitudinal study/ or major clinical study/ or prospective study/
or/47–54
or/55–57
(tu or th).fs.
exp treatment outcome/
exp therapy/
or/60–62
59 and 63
58 or 64
65 and 46
exp animal/
66 not 67
68 not (comment or editorial or news or letter or conference paper).pt.
limit 69 to English language
limit 70 to yr=1998-2003
Database: AMED (Allied and Complementary Medicine) <1985 to February 2003> on OVID
Search strategy executed on March 4, 2003
exp clinical trials/ or double blind method/ or random allocation/
clinical trial.pt.
(clin: adj25 trial:).ti,ab.
((singl: or doubl: or trebl: or tripl:) adj25 (blind or mask:)).ti,ab.
placebos.sh.
placebo:.ti,ab.
random:.ti,ab.
or/1–7
(mild cognitive impairment or MCI).tw.
((cognitive impairment not dementia) or CIND).tw.
((cognitive loss not dementia) or CLOND).tw.
exp dementia/
(dement: or Alzheimer:).tw.
amentia.tw.
frontotemporal lobar degeneration.tw.
hiv-associated cognitive motor complex.tw.
(encephalopathy, aids or encephalopathy, hiv).tw.
mesulam syndrome.tw.
progressive nonfluent aphasia.tw.
binswanger disease.tw.
binswanger encephalopathy.tw.
leukoencephalopathy, subcortical.tw.
subcortical arteriosclerotic encephalopathy.tw.
chronic progressive subcortical encephalopathy.tw.
(alcohol adj3 amnestic).tw.
(alcohol adj2 dysmnestic).tw.
(ethanol adj3 nervous system disorders).tw.
ethyl alcohol abuse neurologic syndromes.tw.
(Lewy: bod: adj8 disease).tw.
brain atrophy, circumscribed lobar.tw.
(Pick: adj8 disease).tw.
(corticobasil ganglionic degeneration or cortical basal degeneration or cortica).tw.
(CADISIL or cerebral autosomal dominant ischemia with subcortical leukoencephalopathy).tw.
Multisystems atrophy.tw.
spongiform encephalopathy.tw.
(cjd or Jcd).tw.
Creutzfeldt-Jakob disease.tw.
hypothyroidism/
or/9–38
8 and 39
40 not (comment or editorial or news or letter).pt.
41 and English.lg.
Database: CINAHL <1982 to February Week 3 2003> on OVID
Search strategy executed on March 5, 2003
crossover design/ or empirical research/ or experimental studies/ or exp clinical trials/ or community trials/ or factorial design/ or quantitative studies/
clinical trial.pt.
(clin: adj25 trial:).ti,ab.
((singl: or doubl: or trebl: or tripl:) adj25 (blind: or mask:)).ti,ab.
Placebos/
placebo:.ti,ab.
random:.ti,ab.
Study Design/
or/1–8
(mild cognitive impairment or MCI).tw.
((cognitive impairment not dementia) or CIND).tw.
((cognitive loss not dementia) or CLOND).tw.
exp Dementia/
(dement: or Alzheimer:).tw.
amentia.tw.
frontotemporal lobar degeneration.tw.
hiv-associated cognitive motor complex.tw.
(encephalopathy, aids or encephalopathy, hiv).tw.
mesulam syndrome.tw.
progressive nonfulent aphasia.tw.
Binswanger disease.tw.
Binswanger encephalopathy.tw.
leukoencephalopathy, subcortical.tw.
(chronic progressive subcortical encephalopathy or (alcohol adj3 amnestic) or (alcohol adj2 dysmestic) or (ethanol adj3 nervous system disorders) or ethyl alcohol abuse neurologic syndromes).tw.
Lewy body disease.tw.
brain atrophy, circumscribed lobar.tw.
Pick: disease.tw.
(corticobasil ganglionic degeneration or cortical basal degeneration or cortica).tw.
(CADASIL or cerebral autosomal dominant ischemia with subcortical leukoencephalopathy).tw.
multisystems atrophy.tw.
Huntington's Disease/
Creutzfeldt-Jakob Syndrome/
spongiform encephalopathy.tw.
(cjd or jcd).tw.
Creutzfeldt-Jakob disease.tw.
spongiform encephalopathy.tw.
exp Hypothyroidism/
Neurosyphilis/
or/10–38
9 and 39
40 not (editorial or letter or proceedings).pt.
limit 41 to English
Database: Ageline <1978 to December 2002> on SILVERPLATTER
Search strategy executed on March 6, 2003
#1 randomized controlled trials in DE
#2 controlled clinical trials in de
#3 random allocation
#4 controlled clinical trial*
#5 randomized controlled trial*
#6 random allocation
#7 double blind method
#8 single blind method
#9 #1 or #2 or #3 or #4 or #5 or #6 or #7 or #8
#10 clinical trial*
#11 (clin* near trial*) in TI
#12 (clin* near trial*) in AB
#13 (singl* or doubl* or trebl* or tripl*) near (blind* or mask*)
#14 (#13 in TI) or (#13 in AB)
#15 placebo*
#16 Placebo* in TI
#17 placebo* in AB
#18 random* in TI
#19 random* in AB
#20 research design
#21 #1 or #2 or #3 or #4 or #5 or #6 or #7 or #8 or #9 or #10 or #11 or #12 or #13 or #14 or #15 or #16 or #17 or #18 or #19 or #20
#22 mild cognitive impairment
#23 (cognitive impairment and dementia) or CIND
#24 (cognitive loss not dementia) or CLOND
#25 (explode 'Dementia-' in DE) or (explode 'Early-Onset-Dementia' in DE) or (explode 'Vascular-Dementia' in DE)
#26 dement* or alzheimer*
#27 amentia*
#28 frontotemporal lobar degeneration
#29 hiv-associated cognitive motor complex
#30 aids associated encephalopathy
#31 hiv associated encephalopathy
#32 mesulam syndrome
#33 progressive nonfluent aphasia
#34 binswanger disease
#35 binswanger encephalopathy
#36 leukoencephalopathy subcortical
#37 subcortical arteriosclerotic encephalopathy
#38 chronic progressive subcortical encephalopthy
#39 alcohol near amnestic
#40 alcohol amnestic disorder
#41 alcohol induced disorders
#42 alcohol near dysmnestic
#43 ethanol near nervous system disorders
#44 lewy* bod* near disease
#45 ethyl alcohol abuse neurologic syndromes
#46 brain atrophy lobar
#47 Pick* near disease*
#48 cerebral amyloid angiopathy
#49 neurofilament protein*
#50 tau protein*
#51 neurofibrillary tangles
#52 neuropil threads
#53 senile plaque*
#54 corticobasil ganglionic degeneration or cortical basal degeneration or cortica
#55 cadisil
#56 cerebral autosomal dominant ischemia with subcortical leukoencephalopathy
#57 multisystems atrophy
#58 explode 'Huntingtons-Disease' in DE
#59 normal pressure hydrocephalus
#60 Creutzfeldt-Jakob syndrome
#61 spongiform encephalopathy
#62 cjd or jcd
#63 Creutzfeldt-Jakob disease
#64 hypothyroidism
#65 vitamin b12 deficiency
#66 neurosyphilis
#67 #22 or #23 or #24 or #25 or #26 or #27 or #28 or #29 or #30 or #31 or #32 or #33 or #34 or #35 or #36 or #37 or #38 or #39 or #40 or #41 or #42 or #43 or #44 or #45 or #46 or #47 or #48 or #49 or #50 or #51 or #52 or #53 or #54 or #55 or #56 or #57 or #58 or #59 or #60 or #61 or #62 or #63 or #64 or #65 or #66
#68 #67 and #21
* #69 #68 and (DT=JOURNAL-ARTICLE)
Database: PsychINFO <1967 TO 2002/12> on SILVERPLATTER
Search strategy executed on March 7, 2003
#1 randomized controlled trial in PT
#2 controlled clinical trial in PT
#3 controlled clinical trials
#4 random allocation
#5 (clinical trial*) in DE,SU
#6 (random allocation) in DE,SU
#7 (double blind method) in DE,SU
#8 (cross-over studies) in DE,SU
#9 (single-blind method) in DE,SU
#10 (clinical trial) in PT
#11 ((clin* near trial*)) in TI
#12 (placebo*) in DE,SU
#13 (placebo*) in TI
#14 (random*) in TI
#15 #1 or #2 or #3 or #4 or #5 or #6 or #7 or #8 or #9 or #10 or #11 or #12 or #13 or #14
#16 mild cognitive impairment
#17 MCI
#18 (cognitive impairment not dementia) or CIND
#19 (cognitive loss not dementia) or CLOND
#20 (explode 'AIDS-Dementia-Complex' in DE) or (explode 'Alzheimers-Disease' in DE) or (explode 'Dementia-with-Lewy-Bodies' in DE) or (explode 'Dementia-' in DE) or (explode 'General-Paresis' in DE) or (explode 'Multi-Infarct-Dementia' in DE) or (explode 'Presenile-Dementia' in DE) or (explode 'Senile-Dementia' in DE) or (explode 'Vascular-Dementia' in DE)
#21 dement* or Alzheimer*
#22 amentia
#23 HIV-associated cognitive motor complex
#24 encephalopathy aids
#25 encephalopathy hiv
#26 mesulam syndrome
#27 progressive nonfluent aphasia
#28 binswanger disease
#29 binswanger encephalopathy
#30 leukoencephalopathy subcortical
#31 subcortical arteriosclerotic encephalopathy
#32 chronic progressive subcortical encephalopathy
#33 alcohol near amenstic
#34 alcohol near dysmnestic
#35 ethanol near (nervous system disorders)
#36 ethyl alcohol abuse neurologic syndromes
#37 lewy* bod* near disesase
#38 brain atrophy circumscribed lobar
#39 Pick* near disease
#40 cerebral amyloid angiopathy
#41 (neurofilament proteins) in DE,SU
#42 (tau proteins) in DE,SU
#43 (neurofibrillary tangles) in DE,SU
#44 (neuropil threads) in DE,SU
#45 (senile plaque*) in DE,SU
#46 (corticobasil ganglionic degeneration) or (cortical basal degeneration)
#47 cadisil or (cerebral autosomal dominant ischemia with subcortical leukoencephalopathy) (0 records)
#48 multisystems atrohpy (0 records)
#49 'Huntingtons-Disease' in DE (883 records)
#50 hydrocephalus normal pressure (140 records)
#51 'Creutzfeldt-Jakob-Syndrome' in DE (109 records)
#52 spongiform encephalopathy (39 records)
#53 cjd or jcd (80 records)
#54 Creutzfeldt-Jakob disease (183 records)
#55 explode 'Hypothyroidism-' in DE (260 records)
#56 explode 'Neurosyphilis-' in DE (39 records)
#57 vitamin B12 deficien* (50 records)
#58 frontotemporal lobar degeneration (15 records)
#59 #16 or #17 or #18 or #19 or #20 or #21 or #22 or #23 or #24 or #25 or #26 or #27 or #28 or #29 or #30 or #31 or #32 or #33 or #34 or #35 or #36 or #37 or #38 or #39 or #40 or #41 or #42 or #43 or #44 or #45 or #46 or #47 or #48 or #49 or #50 or #51 or #52 or #53 or #54 or #55 or #56 or #57 or #58 (27527 records)
#60 #15 and #59 (338 records)
* #61 #60 and (LA=ENGLISH) and (PO=HUMAN) (322 records)

USING THE FORM
Be sure to fill in the DM ID#, the Name of the First Author and Your Initials in the three boxes at the top right of the form.
If a paper should be excluded, fill in the “EXCLUDE” box and fill in the box for the reason for exclusion that occurs first in the list of 7 reasons for exclusion.
The boxes for “KEEP ANYWAY as SPECIAL' or "should check for full article” can also be checked if it is an excluded article but may be useful to our review as background or clarification of it appears to be a companion paper for another report that is likely in our review.
If you choose to exclude the paper, the details for included papers do not need to be filled in.
If a paper should be included, fill in the “INCLUDE” box and fill in the information for ONLY TWO of the subsequent categories listed: Diagnosis of interest and Treatments randomized. Ignore Population analyzed, Outcomes reported and Other........we may use them later for grouping.
If you are not sure if a paper qualifies for inclusion and want it to be looked at by our clinicians or methodologists, mark “CONSULTATION REQUIRED”
If a paper is excluded because the Dementia population is not defined by DSM, NINCDS OR ICD-10 criteria, save it for consultation and mark "population' beside the Consultation Required box.
EXCLUDING ARTICLES
Complete report must be in English to be included. If there is only an English abstract, exclude the article.
Only full reports will be included. If the article is a letter, comment, editorial, news, abstract, proceedings of a meeting or any other brief description, exclude the article. If it seems that the study would otherwise be included, check the box “Should check for full article”.
All dementia populations will be accepted at this stage if they are documented by DSM III, DSM III-R, DSM IV, NINCDS-ADRDA, ICD-9, ICD-10. The population studied may include those with mild cognitive impaired (MCI), cognitive impairment, not Dementia (CIND), cognitive loss, not Dementia (CLOND). If the author cites the article by McKhann as the criteria for diagnosis, it can be included because that is the criteria for NINCDS.
Articles included should look at treatment of disease, cognition, behaviour, or quality of life, time to deterioration, depression, falls etc. Exclude if outcomes reported are ONLY neurophysiologic or neuroimaging (eg EEG)
Exclude if not a report of a randomized controlled trial.
Outcomes reported should be for subjects with Dementia. If the entire population does not have Dementia, only data sub-grouped for Dementia will be examined. Exclude if there are no outcomes of interest reported for specifically Dementia subjects.
Any previously unmentioned, compelling reason to exclude the study should be specified.
INCLUDING ARTICLES
If you are unclear about whether the diagnosis is an included one, mark the referral box and pass along for a consult. If entire population is demented, mark boxes for all specific diagnoses included in study outcomes. If not all of population is demented, mark boxes for all specific diagnoses included in subgroup analysis of outcomes. If diagnosis is not listed as a choice, but is an included diagnosis, specify on line provided.
Specify treatments if they were randomly provided to the dementia population.
DETAIL ABOUT DISEASE TERMS (terms from the literature search)
NOT
Not normal or healthy volunteers
Not general population of elderly persons
Not selected for depression (some may have dementia but not all)…BUT… If subgroup analysis may have been done, it should be marked “Retrieve”.
INCLUDE
Alzheimer's disease by DSM, NINCDS OR ICD
Dementia defined by DSM, NINCDS OR ICD
MCI - mild cognitive impairment
CIND - cognitive impairment, not Dementia
CLOND - cognitive loss, not Dementia
*Keep articles aside in a group if the intervention is directed toward the caregiver or caregiver/patient dyad.


| REF ID # | Author Year | Funding Source | Quality Score | Interventions | Criteria for Diagnosis | Diagnosis | Disease Severity | Total Number Randomized | Number Completing Trial | Mean age (range) % Male (M) Population | Highest Dose | Treatment Period | Outcomes Measured | Outcome reports stratified by patient characteristic |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| REF ID # | Enter number written on top of first page of article. |
| Author / Year | Enter last name of first author on one line and year of publication on the second line. |
| Funding Source | Enter one of the following codes: IF (Industry Funded) PI (Partially funded by Industry) IS (Industry provided Supplies) NI (Non-Industry funding source) NR (not reported) Use more than one code if necessary. |
| Quality score | Enter the Modified Jadad score for Alzheimer's Disease (out of 8 points) |
| Interventions | Enter name of drugs used in trial. Use the most commonly recognized name (eg use Tacrine instead of generic name). If more than one drug is used, put one on each line. If a dose response trial is reported, treat as one drug at the highest dose. If placebo is used, enter as first drug. |
| Criteria for Diagnosis | Indicate what criteria were used for diagnosis. It should be one of NINCDS, DSMIII, DSMIII-R, DSMIV, ICD-10 |
| Diagnosis | Enter all dementia diagnoses included in the trial. |
PDD = PRIMARY DEGENERATIVE DEMENTIA | |
MID = MULTI- INFARCT DEMENTIA | |
AD = DAT = SDAT = ALZHEIMER'S DEMENTIA | |
MIXED | |
VaD = VASCULAR DEMENTIA | |
DEMENTIA | |
| Disease Severity | Use the descriptive terms as used in the paper. |
| Total Number randomized | Give the number in all groups that were initially randomized. |
| Number completing trial | Give the number in all groups that completed the treatment. If ITT population is given, report also. |
| Mean age (range) | Enter whatever information is given in paper for whole population |
| % Male (M) | Compute this figure if possible for those randomized at baseline |
| Population | Give any special inclusion criteria (or existing factor in population) which may affect the external validity (eg comorbid disorders, race, setting) |
| Highest Dose | Give the dose per day. If the dose was titrated up to individual doses, give the highest dose used and enter details of titration on the second line. |
| Record as reported in paper - make sure to note if dose is by weight or a set amount and report how often given (preferably by day). | |
| Treatment Period | This should be the length of time for which the subjects received drug treatment. Use the longest period if there is more than one. Note here if there is an open extension |
| Outcomes Measured | List all of the tests reported. If a battery of psychological tests were done, list the name of the battery. |
| If physical tests are reported (e.g. blood tests, scans) list in very general terms (e.g. blood levels, EEG). | |
| Outcome reports stratified by patient characteristic | Enter a Y if any of the data is reported in any way and is stratified by any patient characteristic (e.g. gender, age, race, genotype, education) and describe what the characteristic is. Otherwise, enter N. |
| DRUG | |||||||||
| REF ID # | Author Year | Analysis Groups | Test Used | Result Value | P Value | Result Value | P Value | Result Value | P value |
| Baseline | Mid-Point: (specify) | Final: (specify) 24w | |||||||
| Baseline | Mid-Point: (specify) | Final: (specify) | |||||||
| REF ID # | Enter number written on top of first page of article. |
| Author / Year | Enter last name of first author on one line and year on the second line |
| Analysis groups and | Use one line for each intervention that will be reported on. Number them as follows (1] 2] 3] ) |
| Interventions | Make the Control condition # 1 and Drug treatment conditions # 2 and so forth. |
| If results are reported as change or relationship between baseline and later time-points or between placebo and drug conditions, these will be added as additional Analysis Groups (e.g. [ 3] Baseline vs 28w Placebo) would be entered as the third Analysis Group and would be for the score that reports the change in value between the one at baseline and the one at 28 weeks for the placebo group. The Analysis Group [ 5] Placebo vs Tacrine 28w) would be the fifth Analysis Group and be used for the score that reports the change in value between the placebo condition and the drug condition at 28 weeks. | |
| If subgroup analysis has been reported, an Analysis Group can be created to report the results. | |
| For Analysis Groups that refer to the drug used, enter the name of drug used in trial. Use the most commonly recognized name (eg use Tacrine instead of generic name). If more than one drug is used, put one on each line. If variable dosing is used within an arm, report the higher dose. Put the Placebo condition first in the list | |
| Test Used | List all psychological and functional tests used that have extractable data (cognition, behavior, functional, global). Do not report on physiological measurements such as blood levels. Enter the primary outcomes first in the list and bold the font on the test name. |
| Baseline, Mid-Point, Final | Enter number of hours, days, weeks, months, years from baseline measurement to current measurement. Use abbreviations (h = hours, d = days, w = weeks, m = months, y = years) |
| If more than 3 time points given, use the most central one for Time 2. | |
| Result Values | Enter the value for Mean ± Standard Deviation for each arm for each time-point for each test. If Standard Error is used, use an *. |
| If % is used, enter the % sign after the value. | |
| If the test consists of subsections (eg a battery that also reports the total), just use the total score unless a sub-score is a primary outcome. | |
| Give P value if provided. If no P value available, but CI reported, put CI in P value column. |
| SC | = statistically significant CHANGE at the alpha = 0.05 |
| = based on the PRIMARY outcomes (no need to specify as 1° because this is the default). If the paper does not specify primary or secondary ASSUME primary for all reported outcomes | |
| = report secondary outcomes ONLY when there is NO primary variable for that domain AND indicate with (2°) in front of the result code | |
| = based on the ITT analyses results ONLY; if ITT results were not reported in the paper, then indicate with an asterick (*) located behind the result code (i.e. NS*) | |
| = this change is ONLY relative to placebo (within group findings are reported in Table B..so not necessary to recapitulate this in summary table) | |
| = in those instances where there is A PRIORI hypotheses for subgroup analyses and there are statistical results reported, then indicate with a symbol (i.e. # or ^) that the subgroup analyses were SC or NS and specify with respect to what factor (i.e. Vascular dementia versus not, or gender, etc) | |
| NS | = not statistically significant effect for primary or secondary outcomes (some additional domains are tested with the secondary outcomes) |
| NT | = outcomes were not tested reflecting in this domain |
| MX | = mixed results for two primary outcomes (i.e one was significant and the other variable was not significant) and do not represent a SUBGROUP analysis |
| = indicates that two measures within the same domain show conflicting results (one outcome is significant and the other is not significant) |
| DOMAIN recommendation Ioannidis and Lau | OUR QUESTION | STATUS | |
|---|---|---|---|
| FREQUENCY of WITHDRAWALS due to adverse events (AE) | Do the authors specify the number of patients withdrawn from the study due to AE per study arm and per type of AE that caused withdrawal | Y | N |
| Unclear | |||
| Y | N | ||
| Unclear | |||
| FREQUENCY of AE (can be stated as a count or as a proportion for either CLINICAL AE or LABORATORY-DEFINED TOXICITY) | Do the authors provide the number of AE with respect to severity (reference to a known scale of toxicity such as mild, moderate, severe, life threatening or grades 1 to 3, etc) per study arm and per type of specific AE (i.e. diarrhea, headache, etc) | Y | N |
| Unclear | |||
| Y | N | ||
| Unclear | |||
| Was the recording of the AE (i.e. surveillance) ACTIVE or PASSIVE | Is the surveillance ACTIVE (actively monitor the presence of absence of AE during the study…do not rely on methods that are PASSIVE (sometimes called spontaneous reporting) | Y | N |
| Unclear | |||
| Describe a SCHEDULE for collection of safety info | Optional | ||
| 1) Do the authors specify the schedule for collection of safety information? | |||
| FREQUENCY of SERIOUS AE (i.e. results in death, requires inpatient hospitalization, persistent or significant disability or is life threatening, WHO 2001) | Are exact numbers for high-grade (serious and life threatening) clinical AE laboratory toxicity reported. | Y | N |
| Unclear | |||
| SEVERITY of each AE (i.e. mild, moderate, or severe headache) | Have each of the AE been reported with respect to a severity continuum (i.e. mild diarrhea, severe headache, etc) ? | Y | N |
| Have some of the AE been reported with respect to a severity grade? | Unclear | ||
| Optional: | |||
| Y | N | ||
| Unclear | |||
| Description of UNUSUAL or NOT PREVIOUSLY RECORDED AE | Has a detailed description of cases of unusual or not previously recorded AE effects been presented? | Y | N |
| Unclear | |||
| STANDARDIZED SCALES used to capture AE. | Do the authors report the use of widely known, standardized scales for AE? | Y | N |
| Specify scale: | Unclear | ||
| If the scale is new, do the authors provide definitions for the grades of severity | Optional: | ||
| Y | N | ||
| Unclear | |||
| Identify specific SAFETY TESTS or QUESTIONNAIRES used for data collection | Do the authors identify specific safety tests or questionnaires used for data collection | Y | N |
| Unclear | |||
THRESHOLD SCORING for ADVERSE EVENTS:
Scoring:
YES = 1
NO = 0
Unclear = consult with other rater to reach consensus
For the first 3 questions a MINIMUM score of 3 is required to proceed to the subsequent 5 questions
If all patients were accounted for (i.e. no withdrawals), then we assume a score of 2 for the WITHDRAWAL due to AE question
If no mention of serious (see definition) is mentioned in the paper, then we will ASSUME that they were NOT monitored (rather than not reported)