NCBI » Bookshelf » Health Services/Technology Assessment Text (HSTAT) » AHRQ Evidence Reports » Management of Treatment-Resistant Epilepsy, Volumes 1 and 2
 
hserta
AHRQ Evidence Reports
public health

Chapter  77:  Management of Treatment-Resistant Epilepsy, Volumes 1 and 2

A115370

Prepared for:

Agency for Healthcare Research and Quality

U.S. Department of Health and Human Services

www.ahrq.gov

Contract No. 290-97-0020

Prepared by:

ECRI Evidence-based Practice Center, Plymouth Meeting, PA

Investigators

Richard Chapell, Ph.D.

James Reston, Ph.D.

David Snyder, Ph.D.

Jonathan Treadwell, Ph.D.

Stephen Tregear, Ph.D.

Charles Turkelson, Ph.D

AHRQ Publication No. 03-0028

May 2003

ISBN: 1-58763-081-8

ISSN: 1530-4396

This document is in the public domain and may be used and reprinted without permission except those copyrighted materials noted for which further reproduction is prohibited without the specific permission of copyright holders.

Suggested Citation:

Chapell R, Reston J, Snyder D. Management of Treatment-Resistant Epilepsy. Evidence Report/Technology Assessment No. 77. (Prepared by the ECRI Evidence-based Practice Center under Contract No 290-97-0020.) AHRQ Publication No. 03-0028. Rockville, MD: Agency for Healthcare Research and Quality. May 2003.

This report may be used, in whole or in part, as the basis for development of clinical practice guidelines and other quality enhancement tools, or a basis for reimbursement and coverage policies. AHRQ or U.S. Department of Health and Human Services endorsement of such derivative products may not be stated or implied.

AHRQ is the lead Federal agency charged with supporting research designed to improve the quality of health care, reduce its cost, address patient safety and medical errors, and broaden access to essential services. AHRQ sponsors and conducts research that provides evidence-based information on health care outcomes; quality; and cost, use, and access. The information helps health care decisionmakers—patients and clinicians, health system leaders, and policymakers—make more informed decisions and improve the quality of health care services.

Prepared for:

Agency for Healthcare Research and Quality

U.S. Department of Health and Human Services

www.ahrq.gov

Contract No. 290-97-0020

Prepared by:

ECRI Evidence-based Practice Center, Plymouth Meeting, PA

Investigators

Richard Chapell, Ph.D.

James Reston, Ph.D.

David Snyder, Ph.D.

Jonathan Treadwell, Ph.D.

Stephen Tregear, Ph.D.

Charles Turkelson, Ph.D

AHRQ Publication No. 03-0028

May 2003

ISBN: 1-58763-081-8

ISSN: 1530-4396

This document is in the public domain and may be used and reprinted without permission except those copyrighted materials noted for which further reproduction is prohibited without the specific permission of copyright holders.

Suggested Citation:

Chapell R, Reston J, Snyder D. Management of Treatment-Resistant Epilepsy. Evidence Report/Technology Assessment No. 77. (Prepared by the ECRI Evidence-based Practice Center under Contract No 290-97-0020.) AHRQ Publication No. 03-0028. Rockville, MD: Agency for Healthcare Research and Quality. May 2003.

This report may be used, in whole or in part, as the basis for development of clinical practice guidelines and other quality enhancement tools, or a basis for reimbursement and coverage policies. AHRQ or U.S. Department of Health and Human Services endorsement of such derivative products may not be stated or implied.

AHRQ is the lead Federal agency charged with supporting research designed to improve the quality of health care, reduce its cost, address patient safety and medical errors, and broaden access to essential services. AHRQ sponsors and conducts research that provides evidence-based information on health care outcomes; quality; and cost, use, and access. The information helps health care decisionmakers—patients and clinicians, health system leaders, and policymakers—make more informed decisions and improve the quality of health care services.

Preface

The Agency for Healthcare Research and Quality (AHRQ), through its Evidence-Based Practice Centers (EPCs), sponsors the development of evidence reports and technology assessments to assist public- and private-sector organizations in their efforts to improve the quality of health care in the United States. The reports and assessments provide organizations with comprehensive, science-based information on common, costly medical conditions and new health care technologies. The EPCs systematically review the relevant scientific literature on topics assigned to them by AHRQ and conduct additional analyses when appropriate prior to developing their reports and assessments.

To bring the broadest range of experts into the development of evidence reports and health technology assessments, AHRQ encourages the EPCs to form partnerships and enter into collaborations with other medical and research organizations. The EPCs work with these partner organizations to ensure that the evidence reports and technology assessments they produce will become building blocks for health care quality improvement projects throughout the Nation. The reports undergo peer review prior to their release.

AHRQ expects that the EPC evidence reports and technology assessments will inform individual health plans, providers, and purchasers as well as the health care system as a whole by providing important information to help improve health care quality.

We welcome written comments on this evidence report. They may be sent to: Director, Center for Practice and Technology Assessment, Agency for Healthcare Research and Quality, 6010 Executive Blvd., Suite 300, Rockville, MD 20852.

Carolyn M. Clancy, M.D.

Director

Agency for Healthcare Research and Quality

Jean Slutsky, P.A., M.S.P.H.

Acting Director, Center for Practice and Technology Assessment

Agency for Healthcare Research and Quality

The authors of this report are responsible for its content. Statements in the report should not be construed as endorsement by the Agency for Healthcare Research and Quality or the U.S. Department of Health and Human Services of a particular drug, device, test, treatment, or other clinical service.

Structured Abstract

Objectives. This report, commissioned at the request of the Centers for Disease Control and Prevention and the Social Security Administration, addresses in an evidence-based fashion diagnosis of and interventions for treatment-resistant epilepsy (TRE). It addresses drug and surgical treatments, as well as service-related interventions.

Search Strategy. We systematically searched 23 electronic databases, including PubMed® and EMBASE. Search dates ranged from 1985 to January 1, 2002 for all but drug topics, which ranged from 1975 to January 1, 2002. We employed different search strategies for each of the nine key questions addressed. Our searches identified 11,111 articles.

Selection Criteria. We retrieved 2,356 articles, and included 357, according to a priori criteria accounting for the quality and relevance of available studies.

Data Collection and Analysis. We employed a “best evidence” synthesis that used the best available, not the best possible evidence. Case control studies were the most common design for diagnostic topics, RCTs were most common for antiepileptic drug (AED) strategies, and the surgical literature was nearly all retrospective case series. The quality of these studies was systematically considered. We computed summary statistics in meta-analyses of RCTs of multiple AED therapy (polytherapy) and computed thresholds for effectiveness in meta-analyses of sequential AED monotherapy and uncontrolled surgical studies.

Main results. There is no widely used definition of TRE. Lack of high quality studies precludes an evidence-based determination of the most effective diagnostic for rediagnosing or re-evaluating patients. Nevertheless, up to 35 percent of patients (but probably fewer) diagnosed with TRE may also have nonepileptic seizures, or not have epilepsy at all. Not all patients diagnosed with TRE receive optimized therapy, but the number of these patients cannot be determined. Initiation of sequential monotherapy appears to result in seizure increases in many patients, and whether sequential monotherapy causes any patients to become seizure-free is not clear. Polytherapy can reduce seizure frequency, but some patients experience intolerable adverse effects. Drug reduction may cause seizure increases without additional benefit. Results of the AED studies assessed in this report may not be generalizable to drugs not examined in the studies we included. Temporal lobe surgery eliminates seizures in many patients. Hemispherectomy and frontal lobe surgery eliminate seizures in an indeterminate number of patients. Corpus callosotomy reduces seizure frequency but generally does not eliminate seizures. Vagal nerve stimulation affords some seizure reduction. There was insufficient evidence to assess other treatments. Epilepsy is associated with increased all-cause mortality and death from drowning. The link between sudden death and seizure frequency is uncertain. Generalized tonic-clonic seizures seem associated with an increased risk of death.

Conclusions. Some patients diagnosed with treatment-resistant epilepsy are misdiagnosed or not receiving optimized AED treatment. Effective treatments are available, but all have disadvantages. There are many weaknesses in the current literature, particularly in studies of diagnostics and nondrug, nonsurgical interventions. Better-designed studies in these areas are needed.

Summary

Overview

In this report, we evaluate and synthesize the published literature on diagnosis of, and medical and nonmedical interventions for treatment-resistant epilepsy. This report was commissioned upon the request of the Centers for Disease Control and Prevention and the Social Security Administration.

Epilepsy is a common, serious neurologic condition. An International League Against Epilepsy (ILAE) Commission Report from 1997 estimated the prevalence of active epilepsy as 40 to 100 in 10,000 and the incidence of unprovoked seizures as 2 to 7 per 10,000. However, precise estimates of prevalence and incidence are complicated by differences in the way investigators define epileptic and nonepileptic seizures (NES), and by the fact that prevalence is typically estimated using retrospective methods.

In addition to the immediate, debilitating effects of seizures, epilepsy also interferes with daily activities, and persons with epilepsy may have to contend with the increased possibility of accidental injury and even death. Psychiatric disorders may also be more common in people with epilepsy.

Persons with epilepsy often have impaired physical, psychological, and social functioning, which may lead to economic loss and diminished quality of life. A survey of 1,023 people with epilepsy published in 2000 showed that compared to U.S. Census Bureau norms, respondents received less education, were less likely to be employed, and were more likely to be members of low-income households.

Reporting the Evidence

This evidence report addresses nine key research questions encompassing 49 technologies, including several service-related interventions. However, the quantity and quality of published literature was insufficient to permit an evidence-based evaluation of 39 of these technologies. We therefore evaluated one diagnostic technology, three antiepileptic drug (AED) strategies, five surgical procedures, and one nondrug, nonsurgical intervention. In addition, we also surveyed the definitions of treatment-resistant epilepsy in the published clinical literature, with particular emphasis on the definitions reported in clinical studies.

The outcomes we considered depended upon the key research questions. We used 16 patient-oriented outcomes to evaluate the effects of treatment, and all reported measures of diagnostic test performance. We also examined the rates of all-cause mortality and cause-specific mortality among persons with epilepsy.

Methodology

To obtain information for this report, we systematically searched 23 electronic databases, including PubMed® and EMBASE. In general, literature searches covered the years 1985 to January 1, 2002. For topics on AEDs, we searched for studies published between 1975 and January 1, 2002. We employed these earlier search dates to ensure that we captured data on standard drug treatments, which are likely to be in relatively older literature.

We employed different search strategies for each of the nine key research questions. Searches were implemented by first developing a list of Medical Subject Headings (MeSH) terms, publication types, and textword combinations. This list included the concepts inherent in each of the key research questions. These searches identified 11,111 articles. From these identified articles, we retrieved 2,356 potentially relevant articles to determine whether they met the a priori criteria tailored for each Key Research Question.

Three hundred forty-eight articles met these inclusion criteria. We next evaluated these articles to determine whether they contained design flaws so severe that their results were uninterpretable. Such articles were excluded. In addition, we excluded articles if there were fewer than five published studies on a given intervention or diagnostic, and none of the studies was a randomized controlled trial with 50 or more patients in the treatment arm. We adopted this latter criterion because of the difficulty in reaching firm evidence-based conclusions from a relatively small literature base comprised of studies of less than optimal design. As a result, 299 articles are included in this evidence report for key research questions 2–9. One hundred eighty-five articles for key research question 1 (on definitions of treatment-resistant epilepsy) were selected from all of the articles included in key research questions 2–6, from available clinical guidelines, and from a random sample of 100 review articles.

We employed a “best evidence” synthesis in this Evidence Report. Thus, for each key research question, we used the best available evidence, not the best possible evidence. Consequently, studies of several designs were included in this report. Diagnostic case-control studies are the most common design for diagnostic topics, randomized controlled trials (RCT) are most commonly used for evaluating AED strategies, and the surgical literature is comprised almost exclusively of retrospective case series.

We evaluated the internal validity of all included studies using checklists of biases that could potentially affect their results. In considering study design, we assumed that randomized controlled trials provide results with the least potential for bias. This was followed, in order of increasing potential for bias, by controlled studies of other design, studies that measured patient outcomes before and after some intervention, and uncontrolled studies. Among each type of study, we considered blinded studies to have lower potential for bias than nonblinded studies, and prospective studies to have lower potential for bias than retrospective studies.

In parts of this report, we used a systematic narrative review supplemented by numerous de novo calculations. These include calculations that index the statistical power of nonsignificant studies, various statistics (e.g., chi-square tests), crude mortality ratios, and other quantities, as appropriate. The majority of this evidence report is, however, meta-analytic.

We performed random effects meta-analyses on data from RCTs examining polytherapy AED treatment. We used sensitivity analyses to evaluate how robust the results of these analyses were. Sensitivity analyses consisted of removing the largest and smallest studies from the meta-analysis, and removing the studies with the largest and smallest effects. Each of the trials in these meta-analyses is an instance of polytherapy, rather than a direct study of this strategy. However, combining these trials into a single analysis of polytherapy can provide an approximate estimate of the effect of adding a single new AED to patients' regimens.

We performed threshold analyses on data from uncontrolled studies of sequential monotherapy and surgery. For sequential monotherapy, we employed random effects models, whereas for surgery we employed fixed effects models. We used random effects models for analyses of sequential monotherapy because of the heterogeneity among results of trials using different AEDs. In our threshold analyses, we meta-analytically compared the improvement rate in treated patients to increasing rates of improvement in a hypothetical “control” group. Starting at 0 percent, we increased the rate of improvement in the “control” patients until the difference in improvement between the treated and “control” groups was no longer statistically significant. This value is the threshold. Where possible, we provide context for these thresholds by supplementing them with historical data obtained from published articles.

We also report the percentage of patients who improved after the intervention (as given by the meta-analytic results when improvement in the control group is 0 percent), but note that this percentage is not a measure of the net effectiveness of the intervention. Some patients may have improved without treatment. Nevertheless, this percentage is informative because it represents the proportion of patients likely to improve, regardless of the cause of their improvement.

When heterogeneity among study results was found in a threshold analysis, we attempted to “explain” the source of the heterogeneity using meta-regression. Because of the lack of strong a priori hypotheses about the reasons for this heterogeneity, we constructed multiple meta-regression models for each instance in which heterogeneity was found. The post hoc nature of these analyses led us to adopt stringent criteria for identifying models for further exploration. These explorations consisted of threshold analyses of the regression intercepts.

Findings

Question 1: What are the definitions of treatment-resistant epilepsy used in the literature?

  • Treatment resistance is infrequently defined in the literature. Less than one third of the surveyed publications reported any definition of this term.

  • When treatment resistance was defined, definitions typically included the number of AEDs a patient tried before being considered treatment-resistant. Some definitions also included seizure frequency, duration of illness, and whether AEDs were administered at maximum tolerable doses.

  • Drug trials tended to require fewer failures of AED treatment compared to surgical trials. This is because a very thorough assessment of drug regimens is usually attempted before surgery is considered. Assessments are usually less thorough when giving a patient another AED.

  • Despite the fact that reports of clinical trials and review articles regularly use terms such as “intractable,” “refractory,” or “treatment-resistant” to describe patients for whom one or more treatments have failed, no consensus exists as to precisely what these terms mean.

Question 2: Which methods of rediagnosing or reevaluating treatment-resistant epilepsy lead to, or can be expected to lead to improved patient outcomes?

We partitioned this question into four subquestions. The first two subquestions addressed differential diagnosis of epileptic seizures from nonepileptic seizures. The remaining two subquestions addressed the differential diagnosis of different seizure types. Whether we addressed some questions depended on the findings for previous questions.

Question 2A: Do all patients diagnosed with epilepsy that is deemed to be treatment-resistant truly have epilepsy?

This question attempts to gauge the extent of the need for rediagnosis among patients thought to have treatment-resistant epilepsy. Our evaluation of the published literature suggests the following:

  • Meta-analysis suggests that up to 35 percent of patients originally diagnosed with treatment-resistant epilepsy either do not have epilepsy, or they have a combination of both epileptic and nonepileptic seizures. Because this number is derived from studies that enrolled patients suspected of having nonepileptic seizures, the actual number is probably lower.

  • None of the studies included in the above-mentioned meta-analysis contained pediatric patients. Thus, the prevalence of pediatric patients diagnosed with treatment resistant epilepsy and who either do not have epilepsy or have a combination of both epileptic and nonepileptic seizures is unknown.

  • These findings suggest that some patients enrolled in studies included in this Evidence Report may not have epilepsy. If this is the case, then our estimates of the efficacy of the interventions that we address may be imprecise.

Question 2B: Which diagnostic modalities are useful in differentiating seizure types commonly mistaken for epilepsy from true epileptic seizures?

  • A paucity of high-quality evidence limited our ability to draw evidence-based conclusions about measurement of serum prolactin levels as a diagnostic tool. Consequently, we were precluded from developing diagnostic decision-model algorithms that take into account the realities of clinical practice, where a differential diagnosis is based on information from many diagnostic technologies, not just information from a single diagnostic in isolation.

  • The only relevant diagnostic supported by a sufficient quantity of literature to allow evidence-based analysis was serum prolactin. The relatively low quality of this literature, however, precludes firm evidence-based conclusions. Rather, this literature only allows the conclusion that serum prolactin levels could plausibly distinguish epileptic seizures from some nonepileptic seizures. Further research is required to determine whether the performance of this test is sufficient to warrant its use in clinical practice.

  • Despite the importance of video-electroencephalography (vEEG) in diagnostic protocols aimed at differentiating epileptic seizures from nonepileptic seizures, we do not draw evidence-based conclusions regarding the diagnostic performance of this technology in the present report because less than five high quality studies were identified. The fact that evidence-based conclusions were not drawn should not be interpreted as evidence that this technology is not effective or useful. Indeed, vEEG may very well have an important role in diagnostic algorithms designed to differentiate patients with epilepsy from patients with nonepileptic seizure disorders. Until more high-quality studies become available, however, the diagnostic performance characteristics of vEEG and its place in such diagnostic algorithms cannot be determined.

Question 2C: Is seizure type in patients with treatment-resistant epilepsy misdiagnosed in some patients?

  • There were too few acceptable studies addressing this question to permit analysis.

Question 2D: Which diagnostic modalities are useful in differentiating between different seizure types?

  • Because no evidence-based conclusions could be reached for Question 2C, the diagnostic modalities that are most useful in differentiating between different seizure types could not be determined.

Question 3: Is there evidence that patients with treatment-resistant epilepsy are not optimized at their current level of treatment?

  • Not all patients with treatment-resistant epilepsy receive optimized AED treatment.

  • The percentage of patients with treatment-resistant epilepsy who are not receiving optimized therapy is difficult to estimate. This is because of a lack of relevant, large, population-based studies. Further, many studies of AEDs do not report whether patients comply with their AED regimens.

Question 4: Which drug treatment strategy, (A) sequential monotherapy, (B) polytherapy, or (C) optimized current therapy leads to improved outcomes for patients with treatment-resistant epilepsy, and (D) What are the relative improvements obtained with each strategy?

Based on the recommendation of the partners, for the purposes of this question, sequential monotherapy is defined as changing a patient's drug regimen from one or many AEDs to a single, different AED. Polytherapy is defined as changing a patient's drug regimen from one or many AEDs to a different multiple-AED regimen. In this report, all polytherapy trials were trials of a single add-on AED. Optimized current therapy was defined as changing the dose and/or the frequency of administration. Based on the recommendation of the partners, we also included the removal of one or more drugs within this definition.

Question 4A: Sequential monotherapy

  • During long-term studies, an estimated 89 percent of patients continued to have seizures when switched to monotherapy. The remaining 11 percent of patients were seizure-free during the studies. When short-term studies were included, 16 percent of patients were seizure-free. However, because these data come from studies that indirectly addressed this issue, whether sequential monotherapy is directly responsible for these patients becoming seizure-free cannot be determined.

  • An estimated 16 percent of patients experienced a doubling of monthly seizure frequency during studies of sequential monotherapy.

  • An estimated 14 percent of patients experienced a doubling of two-day seizure frequency during studies of sequential monotherapy.

  • Sequential monotherapy required the removal of patients' prior AEDs, and in some patients the increases in seizure frequency were likely caused by this removal. Increases may be more likely in the subset of patients who switched from multiple AEDs to a single AED, but available data do not address this possibility.

  • These findings suggest that sequential monotherapy is more likely to increases seizures than to eliminate seizures.

  • One cannot determine the side effects (or their rates) associated with sequential monotherapy because no studies compared the adverse effects experienced by patients during sequential monotherapy with the adverse effects they had been experiencing during their prestudy drug regimens. Many patients (53 percent to 95 percent) experienced mild adverse reactions to the new monotherapy drug.

  • An estimated 5 percent of patients exited studies of sequential monotherapy due to adverse effects.

  • The findings listed above are applicable only to the drugs and doses examined in this report.

  • There was insufficient evidence to draw firm conclusions about the influence of sequential monotherapy on quality of life, mood, cognitive function, functional status/ability, ability to return to (or remain in) work, ability to return to (or remain in) school, ability to hold a driver's license, or mortality.

Question 4B: Polytherapy

  • Adding certain AEDs to a patient's drug regimen has potential advantages and disadvantages. Patients who receive these add-on drugs are more likely to experience reductions in seizures compared to patients who receive an add-on placebo. However, recipients of these drugs are also more likely to experience adverse effects leading to trial exit than are placebo recipients (8 percent vs. 4 percent). Many patients (55 percent to 94 percent) experienced mild adverse effects while taking the new drugs.

  • The preceding estimates of the effect of add-on therapy are based on random-effects meta-analyses that combined different AEDs. These estimates serve as approximate guides for future research on polytherapy. However, their generalizability may be limited to the drugs and doses in the included trials. Further, the apparent effectiveness of an add-on drug may depend on concurrent medications. Thus, the results may not be applicable to patients receiving other concurrent medications. Also, the results of these trials cannot be generalized to other implementations of the polytherapy strategy (e.g., the addition of two drugs).

  • Insufficient evidence was available to draw firm conclusions about the influence of polytherapy on quality of life, mood, cognitive function, functional status/ability, ability to return to (or remain in) work, ability to return to (or remain in) school, ability to hold a driver's license, or mortality.

Question 4C: Optimized Current Therapy

  • Drug reduction may lead to increases in seizure frequency in at least some patients. Although some patients experience reduced seizure frequency, these reductions were likely due to regression to the mean. The only other explanation is that the withdrawn drugs were somehow causing seizures. Given that the patients included in these studies had been on their baseline AED regimens for some time, this seems implausible.

  • Convincing evidence is lacking to suggest that drug reduction improves quality of life, mood, cognitive function, or that it reduces the occurrence of drug related adverse events. Thus, the available evidence suggests that implementation of the drug-reduction strategy, at least with the AEDs considered in this report, may lead to increases in seizure frequency and provide little benefit.

  • Due to limited data, no evidence-based conclusions could be drawn about optimized current therapy that employed dose increases or changes in frequency of administration.

Question 4D: Comparing AED Strategies

  • No included studies directly compared the three AED strategies. Because of the different goals of optimized therapy and the other two AED strategies, these interventions cannot be compared. Differences in the severity of disease of patients given polytherapy and sequential monotherapy preclude quantitative comparison. However, sequential monotherapy was more likely to be harmful than to be beneficial. The reverse was true for polytherapy. These qualitative conclusions suggest that polytherapy may be clinically preferable to sequential monotherapy.

Question 5: Which methods of nondrug treatment for epilepsy after initial treatment failure lead to improved outcomes for patients with treatment-resistant epilepsy?

Question 5A: Surgical Interventions
Temporal Lobe Surgery

  • Threshold analyses of retrospective data suggest that 2 years after temporal lobe surgery, 55 percent of patients are completely seizure-free, and 68 percent are free of complex partial seizures. The retrospective case series design of the studies reporting these outcomes prevents stating that these rates are the direct result of surgery, because some patients may have become seizure-free without surgery. However, 50 percent of similar patients who did not receive surgery in similarly designed studies would have to be seizure-free before concluding that surgery did not improve this outcome. Similarly, 65 percent of similar patients who did not receive surgery would have to be free of complex partial seizures before concluding that surgery had no effect on complex partial seizures. To put these thresholds in context, published data from one RCT suggest that only 8 percent of patients who do not receive surgery become seizure-free. This suggests that many patients are seizure-free because of temporal lobe surgery.

  • Meta-analysis did not reveal any relationship between whether a patient becomes seizure-free after temporal lobe surgery and the patient's age at surgery, age at seizure onset, side of surgery, or the presence of simple partial seizures. Larger studies are required to prove that there is no relationship between these patient characteristics and the outcome of surgery.

  • The rate of new cases of depression after surgery ranges from 4 percent to 24 percent. Why this range is so wide is not clear, and whether surgery was responsible for these new cases cannot be determined.

  • Threshold analysis suggests that 3 percent of patients develop psychosis after surgery. However, data from one trial with similar patients who did not receive surgery suggest that as many as 2 percent of these patients develop psychosis. Two percent is also the threshold at which a relationship between surgery and the onset of psychosis becomes statistically nonsignificant. Therefore, surgery cannot be assumed responsible for new cases of psychosis.

  • Threshold analysis suggests that after temporal lobe surgery, approximately 13 percent of patients experience clinically significant increases in IQ and 10 percent of patients experience clinically significant decreases in IQ. The threshold analysis suggests that surgery may not be responsible for these changes if 10 percent of similar patients who did not receive surgery experienced an increase in IQ, and 7 percent of similar patients who did not receive surgery experienced a decrease in IQ. Data from one trial suggest that without surgery, 5 percent of patients experience a decrease and 5 percent of patients experience an increase in IQ. Therefore, if there is an effect of surgery on IQ, it does not affect large numbers of patients.

  • Approximately 2 percent of patients will experience permanent complications from temporal lobe surgery, primarily some form of partial paralysis. Data reported in studies of temporal lobe surgery reporting deaths due to surgery suggest that approximately 0.24 percent of patients will die because of the surgical procedure.

  • There was insufficient evidence to draw firm conclusions about the influence of temporal lobe surgery on quality of life, memory, functional status or ability, ability to return to (or remain in) work, ability to return to (or remain in) school, or ability to hold a driver's license.

Corpus Callosotomy

  • Threshold analyses suggest that 2 years after corpus callosotomy, 20 percent of patients have achieved a 90 percent or better reduction in overall seizure frequency. The retrospective case series design of the studies reporting this outcome prevents stating that these rates are the direct result of surgery, because some patients may achieve a 90 percent reduction in seizure frequency without surgery. However, 15 percent of similar patients who did not receive surgery would have to experience a 90 percent or better reduction before concluding that surgery did not improve this outcome. No studies were available to provide context for these figures. Given the severity of patients' conditions, however, surgery is the most likely cause of these seizure reductions.

  • Despite the improvements seen in some patients, 16 percent of patients will achieve no reduction in overall seizure frequency or show an increase in seizure frequency after corpus callosotomy.

  • Threshold analysis suggests that 2 years after corpus callosotomy, 26 percent of patients will be free of their most disabling seizures. However, 20 percent of similar patients who did not receive surgery would have to become free of their most disabling seizures before concluding that surgery did not improve this outcome. No studies were available to provide context for these figures. Given the severity of patients' conditions, however, surgery is the most likely cause of these seizure reductions.

  • Approximately 3.6 percent of patients will experience serious complications after corpus callosotomy, primarily some form of partial paralysis, disconnection syndrome, or language difficulty. The precise mortality rate associated with this procedure is uncertain.

  • There was insufficient evidence to draw firm conclusions about the influence of corpus callosotomy on quality of life, mood, cognitive function, functional status/ability, ability to return to (or remain in) work, ability to return to (or remain in) school, or ability to hold a driver's license.

Frontal Lobe Surgery

  • Studies of frontal lobe surgery report that 2 years after surgery, 20 percent to 100 percent of patients will be “seizure-free” depending on how this outcome is defined. These variations in outcome reporting prevented any meaningful threshold analysis.

  • Approximately 8.4 percent of patients will experience some type of complication after frontal lobe surgery, primarily some form of partial paralysis. However, this figure may be inaccurate because only two studies reported complications. Data reported in three studies of frontal lobe surgery reported only one death among 96 patients. These data are insufficient to estimate the true death rate from this type of surgery.

  • There was insufficient evidence to draw firm conclusions about the influence of frontal lobe surgery on quality of life, mood, cognitive function, functional status/ability, ability to return to (or remain in) work, ability to return to (or remain in) school, or ability to hold a driver's license.

Hemispherectomy

  • Three studies reported that between 40 percent and 70 percent of patients who receive hemispherectomy are seizure-free 2 years after surgery. Approximately 7 percent of patients may receive no benefit from this surgery. The paucity of literature on this topic means that these rates are not precise. Given the severity of patients' conditions, however, surgery is the most likely cause of this improvement.

  • Ten studies reported only two serious permanent complications from surgery (0.8 percent). However, given the small number of patients examined in these 10 studies, this may not be a reliable estimate. Among the same studies, the percentage of patients developing a mild or transient complication was 21 percent. Data reported in 11 studies of hemispherectomy suggest that approximately 2.6 percent of patients (26 deaths per 1,000 patients) will die because of the surgical procedure.

  • There was insufficient evidence to draw firm conclusions about the influence of hemispherectomy on quality of life, mood, cognitive function, functional status/ability, ability to return to (or remain in) work, or ability to return to (or remain in) school.

Multiple Subpial Transection

  • Reported percentages of patients who are seizure-free six or more months after multiple subpial transection vary from 0 percent to 75 percent, depending on how “seizure-free” is defined. Similarly, the estimates for patients who do not benefit from this surgery vary from 0 percent to 42 percent. Consequently, the data are inconsistent across studies and do not allow for firm evidence-based conclusions as to the exact proportion of patients who will become seizure-free or who will not benefit from multiple subpial transection.

  • Nine studies reporting serious permanent complications from surgery estimated that approximately 5.9 percent of patients experience these types of complications, particularly aphasia or dysphasia. Although no deaths were reported in any of these studies, they may be reported in future studies.

  • There was insufficient evidence to draw firm conclusions about the influence of multiple subpial transection on quality of life, mood, cognitive function, functional status/ability, ability to return to (or remain in) work, ability to return to (or remain in) school, or ability to hold a driver's license.

Other Surgery

  • Too few studies were available to allow for an evidence-based evaluation of parietal or occipital lobe surgery.

Question 5B: Nondrug, Nonsurgical Interventions

  • Trends from two RCTs suggest that vagal nerve stimulation (VNS), when applied as an adjunct intervention, safely provides limited seizure frequency reduction in some patients with treatment-resistant epilepsy. The precise degree of seizure reduction depends upon the specific measure of seizure frequency.

  • Currently available evidence does not suggest a dramatic effect of VNS on quality of life.

  • There was insufficient evidence to draw firm conclusions about the influence of VNS on mood, cognitive function, functional status/ability, ability to return to (or remain in) work, ability to return to (or remain in) school, or ability to hold a driver's license.

  • Too few studies were available to allow for an evidence-based evaluation of ketogenic diets, chiropractic procedures, acupuncture, hyperbaric oxygen therapy, herbal medicine and homeopathy, cranial realignment, magnetic therapy, electrical brain stimulation, and vitamin B6 therapy.

Question 6: Which social, psychological or psychiatric services for treatment-resistant epilepsy lead to, or can be expected to lead to improved patient outcomes?

  • There were too few acceptable studies addressing this question to permit analysis.

Question 7: What characteristics of treatment-resistant epilepsy interfere with ability to obtain and maintain employment, or attend and perform well in school?

  • There were too few acceptable studies addressing this question to permit analysis.

Question 8: What is the mortality rate of patients with treatment-resistant epilepsy?

  • Persons with treatment-resistant epilepsy are approximately 2 to 10 times more likely to die compared to people in the general population. This excess mortality in persons with treatment-resistant epilepsy is largest among younger individuals.

  • Sudden unexpected death appears to be a major cause of death among patients with treatment-resistant epilepsy, representing 6 percent to 55 percent of the total deaths in studies that reported relevant data.

  • Drowning rates are higher among treatment-resistant patients with epilepsy compared to the general population. Higher quality evidence is needed to determine the precise magnitude of the difference in drowning rates.

  • There is insufficient evidence to determine whether accident-related mortality, or mortality due to pneumonia, aspiration, suicide or cancer is higher among persons with epilepsy compared to the general population.

Question 9: Is there a correlation between the number and/or type of seizure and sudden death?

  • Generalized tonic-clonic seizures appear to increase the risk of sudden death.

  • The relationship between overall seizure frequency and sudden death is uncertain.

Future Research

Our analysis suggests that at least some patients receiving treatment for epilepsy either do not have epilepsy or have another condition in addition to epilepsy that also causes seizures or seizure-like events. Studies that clearly describe the diagnostic procedures used to confirm that patients actually have epilepsy are needed and would present a more accurate assessment of the efficacy of the treatment under study. Our analysis also suggests that some patients receive AEDs at less than the maximum tolerable dose. Future studies could ensure that patients are truly treatment-resistant by enrolling only subjects who are optimized and compliant with their current therapy.

In the absence of a control group, the effects of treatment cannot be differentiated from placebo effects, regression to the mean, extraneous events, or other threats to internal validity. Although there are situations in which controlled trials are impractical, controlled trials are needed to provide a more accurate picture of the effects of treatment.

Studies with inadequate numbers of patients cannot detect clinically meaningful differences in outcomes between treatment groups. When designing clinical trials, a priori power analysis calculations can be used as a guide to ensure that sufficient numbers of patients are enrolled so that the proposed trial can uncover clinically meaningful relationships between treatments and outcomes.

Many publications do not contain sufficient information to enable the reader to accurately judge the evidence. Some confusion could be alleviated if seizure-free outcome measurements were standardized. A well-reported trial would include seizure frequency as well as a measure of data dispersion, both at baseline and at several followup periods.

Studies of diagnostics

The lack of an accepted gold standard for the differential diagnosis of epileptic seizures from nonepileptic seizures makes evaluating the utility of any given diagnostic problematic. This is because of the difficulty in verifying that the diagnostic decisions that result from the use of the test are correct. Given this lack of an acceptable gold standard, attempting to determine whether the use of a diagnostic improves patient outcomes may offer a fruitful avenue for future research. Such an approach requires determining whether the use of the diagnostic of interest ultimately leads to improved patient outcomes and, as a consequence, requires a prospective, randomized controlled trial.

Because a diagnosis of epilepsy is not made based on the findings of a single diagnostic technology, studies are needed to evaluate the effectiveness of different clinical algorithms that utilize data collected from combinations of diagnostic technologies. Again, this approach would require a prospective, randomized controlled trial.

Studies of treatment

In the literature on drug strategies, an important direction for future research involves direct comparisons between the drug strategies for treatment-resistant epilepsy. None of the studies included in our assessment of drug strategies made direct comparisons between sequential monotherapy and polytherapy. Ideally, a trial would randomize patients to different drug strategies, and compare seizure frequency outcomes as well as adverse effects of treatment.

Another area for future research on drugs concerns the adverse effects patients experience from their pretrial drug regimens and changes in these adverse effects on the new treatment regime. Changes in the frequency and severity of the adverse effects associated with each drug treatment strategy need to be evaluated, because patients and clinicians seek to reduce adverse effects as well as seizure frequency.

Prospective studies of surgical interventions are needed. This approach would allow seizure and nonseizure-related outcome measures to be recorded at multiple followup periods (1 year, 2 year, 5 year, etc.) rather than the single mean or median followup reported in most retrospective studies. Better reporting of patient characteristics is also needed and, if possible, individual patient characteristics should be reported when study sizes are small (less than 20 patients). Studies reporting standardized quality of life measures, validated for patients with epilepsy, would help in determining the effect of surgery on this important nonseizure-related outcome. Studies reporting other types of nonseizure-related outcome measures, such as employment, education, and cognitive function data, are also needed.

Higher quality controlled trials are particularly lacking for the nonmedical treatments such as education and training in skills that may help prevent seizures or enable patients to better adapt to seizures. This area constitutes another important direction for future research.

Studies of patient characteristics related to employment and school

Reporting of employment and schooling status among patients with treatment-resistant epilepsy is particularly lacking in both the medical and nonmedical treatment literature. The ideal study design to address this question would be a prospective cohort study using multiple regression techniques to evaluate the potential correlation between specific patient characteristics and the ability to work or attend school both before and after treatment. This is an area in particular need of future research and higher quality studies.

Studies of mortality

The present literature has a number of large (mostly retrospective) studies that have calculated standardized mortality rates (SMRs) for overall mortality, but few studies have calculated separate SMRs for specific causes of death or subgroups of specific ages. To generate meaningful data, cohort studies must enroll sufficient numbers of patients and follow the patients for sufficient periods. The most useful study of mortality among patients with treatment-resistant epilepsy would be a large prospective study that followed patients for several years. In addition to calculating an SMR for overall mortality, the study would calculate SMRs for specific causes of death, especially those that could be related to epilepsy (such as accidents, drowning, and motor vehicle accidents).

Large prospective studies where all suspected sudden unexpected death in epilepsy (SUDEP) cases receive an autopsy are needed. An autopsy is particularly important because it provides the best evidence that the death did not have an explainable cause. This would increase the accuracy of estimates of SUDEP rates for different age subgroups of patients with treatment-resistant epilepsy.

More prospective case-control studies using multiple regression analysis would be useful to address the potential relationship between SUDEP and seizure type or frequency. Future studies would ideally include a hundred patients or more to ensure that there is adequate statistical power to detect correlations. Multiple regression analysis is needed to reduce the effect of possible confounding variables and increase the likelihood that an observed statistically significant correlation represents an actual causal relationship.

Chapter 1. Introduction

Scope and Objectives of this Report

The objective of this report is to evaluate and synthesize, in an evidence-based fashion, the published literature on the management of treatment-resistant epilepsy. Epilepsy is a condition characterized by recurrent, unprovoked seizures. The term “seizure” is an inclusive generic term that encompasses the clinical manifestations of epilepsy as well as other disorders. Epileptic seizures arise from the abnormal discharge of electrical activity by cerebral neurons, and result in loss of consciousness, alterations in perception or impairment of psychic functions, convulsive movements, disturbances of sensation, or some combination of these events.1 Nonepileptic seizures (NES), such as psychogenic (hysterical) seizures and seizures associated with syncope, are not caused by an abnormal neuronal discharge.

The first part of this Evidence Report deals with how the published literature defines “treatment-resistant” epilepsy. An acceptable definition for treatment-resistant epilepsy could aid in the identification and management of these patients. The terms “medically intractable” epilepsy and “refractory” epilepsy are frequently used to describe patients with uncontrolled seizures that do not respond to appropriate antiepileptic drugs (AEDs).2, 3 Other terms, with or without descriptive details, are also used.

This Evidence Report then considers the methods of diagnosis used to determine if a patient has treatment-resistant epilepsy. This section examines the possibility that some patients diagnosed with treatment-resistant epilepsy in fact have conditions other than epilepsy. We assessed diagnostic procedures that differentiate epileptic seizures from nonepileptic seizures and diagnostic procedures that aid in the diagnosis of epilepsy. Diagnostic procedures used to localize epileptogenic foci prior to surgery are not addressed in this report.

An evaluation of treatment interventions for patients with treatment-resistant epilepsy, specifically pharmacological and surgical procedures, as well as some nondrug/nonsurgical interventions, comprises a large part of this report. Rather than evaluate separate drugs, this report looks at which drug treatment strategy (sequential monotherapy, polytherapy, or optimized current therapy) may benefit patients with treatment-resistant epilepsy. In addition, we also examined a variety of surgical procedures and nondrug, nonsurgical interventions.

The final sections of this Evidence Report examine the potential impact of special services on patients with treatment-resistant epilepsy and the effect of treatment-resistant epilepsy on employment, education, and mortality. These areas are critical in the continuing management of patients with treatment-resistant epilepsy and in evaluating interventions beyond their effect on seizure frequency. The literature was examined for information on occupational, speech, and physical therapies, patient education, neuropsychological evaluation, and psychiatric consultation and treatment.

This report was prepared at the request of the Centers for Disease Control and Prevention and the Social Security Administration in an effort to evaluate the diagnostic procedures available for identifying patients with treatment-resistant epilepsy, identify the characteristics of patients with treatment-resistant epilepsy that interfere with employment and schooling, and assess the potential benefits of the available medical and nonmedical interventions for patients with treatment-resistant epilepsy. The patient population of interest in this report includes infants, children, and adults with treatment-resistant epilepsy.

Epilepsy and Treatment-Resistant Epilepsy

Neurobiology

An epileptic seizure can be defined clinically as an intermittent, stereotyped, disturbance of consciousness, behavior, emotion, motor function, or sensation that results from abnormal cortical neuronal discharge and recur without provocation.4 The discharge may result in an almost instantaneous loss of consciousness, alteration of perception or impairment of psychic function, convulsive movements, disturbance of sensation, or some combination of these events.1 The onset of the abnormal neuronal discharge may be widespread and bilateral or it may be localized. In the latter instance, the abnormal discharge arises from an assemblage of excitable neurons, called a focus, in any part of the cerebral cortex that may or may not be associated with a visible lesion.a Cortical excitation may then spread to the adjacent cortex and to the contralateral cortex through interhemispheric pathways as well as to subcortical areas such as the basal ganglion, thalamus, and brainstem. Clinical manifestations of a seizure occur when the excitation reaches these areas. In rare instances, death may occur due to sustained cessation of respiration, derangement of cardiac action, or some unknown cause.

Etiology and Pathology of Epilepsy

The focal cortical lesions responsible for the abnormal discharges associated with epileptic seizures may arise from a variety of causes. Epilepsies in which no pathological lesion can be found are referred to as primary or idiopathic epilepsies and include certain generalized tonic-clonic and absence seizure conditions.1 The underlying cause of these seizure types is probably genetic. Secondary or symptomatic epilepsies are associated with a discernable lesion. Secondary epilepsies include simple partial seizures and complex partial seizures. The lesions may be zones of neuronal loss and scarring (sclerosis or gliosis), vascular malformations, tumors, or cortical dysplasia. Many patients with temporal lobe epilepsy have a condition called mesial temporal sclerosis (MTS) that is characterized by a loss of volume and scarring in the hippocampus and adjacent gyri on one or both sides. Posttraumatic epilepsy may occur after head trauma, brain surgery, and various infections that are responsible for creating focal lesions that result in epilepsy.

Signs, Symptoms, and Characteristics of Epilepsy and Treatment-Resistant Epilepsy

Epilepsy seizures may be classified according to etiology, site of origin, clinical form, frequency, or electrophysiologic characteristics.1 Classification schemes have repeatedly changed, but the most commonly used scheme is the one adopted by the Commission on Classification and Terminology of the International League Against Epilepsy.5 This classification is based primarily on the clinical form of the seizure and its electroencephalographic (EEG) features. Seizures are divided into partial seizures (a focal or localized onset can be discerned) and generalized seizures (bilateral origin and diffuse cerebral cortical involvement from the onset). Partial seizures that develop into generalized seizures are referred to as secondarily generalized seizures. Partial seizures (also called focal seizures) are further classified as simple when consciousness is maintained, and complex if consciousness is altered or lost. Simple partial seizures can be motor, sensory, autonomic, or psychic. Simple partial seizures are also called auras and may be a precursor to a complex seizure or may constitute the entire seizure. Generalized seizures may be convulsive or nonconvulsive. The common convulsive type is the tonic-clonic seizure and the common nonconvulsive type is the absence seizure that is characterized by a brief lapse of consciousness.

Patients with treatment-resistant epilepsy may experience one or more of the various types of epileptic seizures depending on the etiology of their condition. The following is a brief description of the clinical characteristics of each type of epileptic seizure that a patient with treatment-resistant epilepsy may experience. Adams and Victor's Principles of Neurology was used a guide in preparing these descriptions.1

Generalized tonic-clonic seizures are sometimes preceded by subjective phenomena (prodromes) that may take the form of psychological changes or myoclonic jerks of the trunk or limbs. Most often, the seizures occur without warning and the patient loses consciousness and falls to the ground. The seizure starts with an initial flexion of the trunk, opening of the mouth and eyelids, and upward deviation of the eyes. The tonic phase follows with protracted extension of the back and neck and then the arms and legs. Breathing is also impaired. The tonic phase lasts for 10 to 20 seconds. The clonic phase follows the tonic phase with a mild generalized tremor that represents relaxation of the tonic contractions. The rate of contractions gradually lessens over 30 seconds. Breathing is still impaired until the end of the clonic phase. Finally, all movements end. Upon regaining consciousness, the patient usually experiences a period of disorientation and/or fatigue.

Absence seizures are very brief, always associated with impaired consciousness, and may or may not have accompanying abnormal motor activity. Some are so short as to resemble daydreaming. The seizure comes without warning and consists of a sudden interruption of consciousness. These patients usually do not fall. After 2 to 10 seconds the patients become conscious again and resume their preseizure activity. Absence seizures are the most common form of epileptic seizure of childhood and rarely begin before 4 years of age or after puberty. Absence seizures tend to occur frequently, with as many as several hundred occurring in a single day. This type of seizure may be the only type seen in childhood and the attacks diminish in frequency with age and eventually disappear. However, absence seizures may be replaced in some instances by generalized tonic-clonic seizures.

Partial seizures are the product of focal abnormalities in some part of the cerebral cortex. Such lesions are often associated with focal EEG abnormalities. Simple partial seizures most often derive from a focal area in the sensory and motor cortex, while complex partial seizures most often derive from the temporal lobe of one side. Somatosensory seizures usually have a focus in the precentral or postcentral convolution. The sensation is described as numbness or tingling that usually starts in the lips, fingers, or toes and spreads to adjacent parts of the body. Visceral seizures (vague feelings in the thorax and abdomen) are some of the most frequent simple partial seizures. Other less common types of simple partial seizures include visual seizures (sensation of darkness and flashes of light), auditory hallucinations (buzzing or roaring in the ears), vertiginous sensations, and olfactory hallucinations (abnormal odors).

Loss of consciousness distinguishes complex partial seizures from simple partial seizures. However, unlike generalized tonic-clonic seizures, the patient suffers a period of altered behavior and consciousness with no later recollection, instead of a complete loss of control of thought and action. Any type of complex partial seizure may develop into other forms of secondary generalized seizures. Complex partial seizures are not exclusive to any age group but show an increased incidence in adolescence and adult years.

The Lennox-Gastaut Syndrome is a neurological syndrome characterized by frequent seizures of various types. These patients have intellectual impairment and serious neurologic disease. The syndrome is usually diagnosed between 2 and 6 years of age.

Temporal lobe epilepsy, which is typically manifested by complex partial seizures described above, commonly starts before 10 years of age. By adolescence or early adulthood, this condition often becomes treatment-resistant.6 Recurrent seizures may cause damage in the hippocampus resulting in a progressive loss in hippocampal volume as long as the seizures persist.7

Arroyos, Brodie, Avanzini, et al.3 suggest that treatment-resistant epilepsy may be a distinct condition within epilepsy characterized by progressive neuronal, cognitive, and psychosocial deterioration. These authors point out that several markers have been advanced as indicators of treatment-resistant epilepsy. These include age at onset younger than 1 year, type of epilepsy (partial epilepsies or catastrophic epilepsies of childhood), failure of the first AED, use of more than two drugs, duration of treatment without achieving control, and specific pathologies. However, these markers may not be especially sensitive or specific for treatment-resistant epilepsy.3

Epidemiology of Epilepsy

Epilepsy is among the most common serious neurologic condition, with a prevalence rate 10 times higher than multiple sclerosis and 100 times higher than motor neuron disease.8 An International League Against Epilepsy (ILAE) Commission Report from 1997 gives the prevalence of active epilepsy as 40 to100 in 10,000.9 The ILAE prevalence and incidence rates are based on a review of selected studies, mostly from developed countries, by Sander and Shorvon .10 Both rates varied widely among the studies included in this review. Their review found prevalence rates that ranged from 1.5 to 31 per 1,000 and incidence rates that varied from 11 to 134 per 100,000. The authors believe that the highest prevalence and incidence rates are more accurate because studies with these findings used more intensive and sophisticated case ascertainment methods. They further state that the low rates found in some studies were probably due to deficiencies in patient reporting (physical manifestations are transient and not observed by a clinician, patients are unaware of or deny their condition) and in diagnosis of epilepsy (syncope and psychogenic seizures are misdiagnosed as epilepsy, diagnostic criteria are unspecified or loosely defined). Sander and Shorvon10 also believe that retrospective review of patient medical records in most case ascertainment studies also leads to an underestimate of prevalence and incidence.

Burden of Illness

In addition to the immediate effects of seizures, patients with epilepsy may also have to contend with the following burdens:4, 11, 12

  • interference with normal activities

  • increased possibility of accidental injury and even death (addressed in Key Question 8 in this report)

  • impaired physical, psychological, and social functioning

  • economic loss

  • diminished quality of life

  • psychiatric disorders, in particular, anxiety and depression13

Technologies Assessed in this Report

Table 1. Technologies addressed
Drug StrategiesSurgeryNondrug, NonsurgicalService-related InterventionsDiagnostics
Sequential monotherapyTemporal lobeVagal nerve stimulation (VNS)Multidisciplinary neurobehavioral treatmentsBlood prolactin
PolytherapyHemispherectomyKetogenic dietsEEG biofeedbackRoutine-EEG
Drug reductionCorpus callosotomyChiropracticEpilepsy educationVideo-EEG
Maximum tolerable dosageMultiple subpial transectionAcupunctureVocational servicesAmbulatory-EEG
Dose frequency optimizationFrontal lobeHyperbaric oxygenPhysical exerciseBlood creatine kinase
Parietal lobeHerbal medicine and homeopathyMedical resonance therapy musicComputed tomography
Occipital lobeCranial realignmentSahaja yogaMagnetic resonance imaging
Magnetic therapyMeditationSingle Photon Emission Computed Tomography
Electrical brain stimulationSelf-help group (group therapy)Minnesota Multiphasic Personal Inventory
Vitamin B6CounselingProvocation techniques
Progressive muscle relaxationTilt table
End-tidal CO2 biofeedbackAuditory evoked potentials
Systematic desensitizationHypnotic recall
Tongue biting

Note: Bolded, italicized technologies are those addressed in this report. The remaining technologies were not addressed due to insufficient literature

In this report, we considered 49 different technologies. These are comprised of 14 diagnostic technologies, 5 drug treatment strategies, 7 types of surgery, 10 nondrug, nonsurgical interventions, and 13 service-related interventions. For each of these technologies, we considered 16 different patient-oriented outcomes (for a list of these outcomes, see the Article Inclusion Criteria in the Methodology section of this Evidence Report). However, there was sufficient literature to address only ten of these technologies. Table 1 lists each of the technologies considered in this Evidence Report. The technologies for which there was sufficient literature are shown in bold italicized type.

Chapter 2. Methodology

Defining Treatment-Resistant Epilepsy

There is no generally accepted definition of treatment-resistant epilepsy. However, for the purposes of retrieving articles for this report, an operational definition of this term was needed. Accordingly, we defined treatment resistance as failure of one or more AEDs at a maximum tolerable dose to provide complete seizure relief.

This definition was based on consensus obtained during a 1-day meeting with an Expert Panel and subsequent discussions with Technical Experts, during discussions with the two agencies that requested this report, the Centers for Disease Control and Prevention (CDC) and the Social Security Administration (SSA), and in consultation with the Agency for Healthcare Research and Quality (AHRQ).

Many articles did not provide sufficient information to allow a determination of the definition employed (see our conclusions to Question 1). Consequently, and after consultation with the Technical Experts, CDC and SSA, we included articles even if they only stated that the enrolled patients were “treatment-resistant”, or used some other synonym (see Question 1 for examples of such synonyms).

Expert Panel

At the beginning of this project, we worked with an Expert Panel that assisted in defining the scope of this Evidence Report, developing its questions, defining the outcomes of interest, and developing the criteria for retrieving and including articles. The involvement of this panel consisted of their participation in a 1-day meeting with ECRI, AHRQ, and representatives of SSA and CDC.

To establish the Expert Panel, we solicited nine organizations to nominate individuals who could serve as its members. All solicitations were preapproved by AHRQ, and all nine organizations nominated an individual. Thus, the Expert Panel was comprised of individuals from the following organizations:

  • American Academy of Neurology

  • American Academy of Pediatrics

  • American College of Occupational and Environmental Medicine

  • American Epilepsy Society

  • Child Neurology Society

  • Citizens United for Research in Epilepsy

  • Epilepsy Foundation/Penn Epilepsy Center

  • National Association of Epilepsy Centers

  • Society for Behavioral and Cognitive Neurology

The participation of these individuals and organizations in this project does not imply their endorsement of the findings of this Evidence Report.

Technical Experts

Subsequent to the 1-day meeting, the Expert Panel was disbanded, and a group of Technical Experts was formed. We collaborated with this group to further refine this project's scope, questions, outcomes of interest, and criteria for retrieving and including articles. The Technical Experts also served as a source of information throughout the project. Collaboration with these Experts was accomplished through telephone conversations and e-mail.

The Technical Experts were comprised of all of the members of the Expert Panel and representatives from:

  • Columbia Presbyterian Medical Center

  • Harborview Medical Center

  • Johns Hopkins School of Medicine

  • Strategic Health Institute

As with the Expert Panel, the participation of these individuals and organizations in this project does not imply their endorsement of the findings of this Evidence Report.

Key Questions

This report addresses nine Questions arrived at through the discussions with the Expert Panel, Technical Experts, and representatives from AHRQ, SSA and CDC. These are:

Question #1: What are the definitions of treatment-resistant epilepsy used in the literature?

Question #2: Which methods of rediagnosing or re-evaluating treatment-resistant epilepsy lead to, or can be expected to lead to improved patient outcomes?

Question #3: Is there evidence that patients with treatment-resistant epilepsy are not optimized at their current level of treatment?

Question #4: Which drug treatment strategy, 1) sequential monotherapy, 2) polytherapy, or 3) optimized current therapy leads to improved outcomes for patients with treatment-resistant epilepsy, and what are the relative improvements obtained with each strategy?

Question #5: Which methods of nondrug treatment for epilepsy after initial treatment failure lead to improved outcomes for patients with treatment-resistant epilepsy?

Question #6: Which social, psychological or psychiatric services for treatment-resistant epilepsy lead to, or can be expected to lead to improved patient outcomes?

Question #7: What characteristics of treatment-resistant epilepsy interfere with ability to obtain and maintain employment, or attend and perform well in school?

Question #8: What is the mortality rate of patients with treatment-resistant epilepsy?

Question #9: Is there a correlation between the number and/or type of seizure and sudden death?

Causal Pathway

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf1.jpg.

   FIgure 1. Causal pathway

The scope of this report can be illustrated by a causal pathway. More specifically, this pathway illustrates the Key Questions and the relationships among them. It also illustrates items that are beyond the scope of this Evidence Report. This pathway is shown in Figure 1. The rectangles in this figure depict the primary clinical “events,” from presentation of a patient (who has certain symptoms that may be at least partly diagnostic and/or prognostic) to the outcomes that the patient experiences (e.g., improves/does not improve). This pathway proceeds in an approximate chronological order that is depicted by solid arrows that connect the rectangles in Figure 1. Because these arrows connect two rectangles, they are termed “links.” The numbers next to each link represent the number of the question that addresses that link.

Several boxes represent endpoints in the causal pathway. These are identified by double borders. Patients reaching these endpoints do not go on to additional treatments or diagnostic procedures. Although boxes with no arrows emerging from them represent end points in terms of reporting in published studies, the patients themselves may go on to receive additional treatments and experience further outcomes. The outcomes examined in this Evidence Report were determined by the Expert Panel, Technical Experts, CDC, and SSA. Two outcomes, death and performance in school or work, are broken out from other outcomes because they are specifically addressed by their own questions.

The dashed lines in the figure “overarch” several rectangles. We have drawn these lines as dashed because they do not depict the sequence of events in the clinical pathway. In general, these lines portray questions about how patient characteristics (including clinical findings) may influence outcomes.

Theoretically, a question can be derived by drawing a line between any two rectangles in Figure 1. Therefore, rectangles not connected by solid or dashed lines are beyond the scope of this Evidence Report.

Literature Searches

Electronic Database Searches

To obtain information for this report, we systematically searched 23 electronic databases. These were:

  • Center for International Rehabilitation Research Information and Exchange (CIRRIE) (searched November 30, 2001)

  • Cochrane Database of Systematic Reviews (through 2001, Issue 4)

  • Cochrane Registry of Clinical Trials (through 2001, Issue 4)

  • Cochrane Review Methodology Database (through 2001, Issue 4)

  • Cumulative Index to Nursing and Allied Health (CINAHL) (1988 through January 11, 2002)

  • Database of Reviews of Effectiveness (Cochrane Library) (through 2001, Issue 4)

  • ECRI Health Devices Alerts (1977 through January 2002)

  • ECRI Health Devices Sourcebase (through January 2002)

  • ECRI Healthcare Standards (1975 through January 2002)

  • ECRI International Health Technology Assessment (IHTA) (1990 through January 2002)

  • ECRI Library Catalog (through January 2002)

  • ECRI TARGET (through January 2002)

  • Embase (Excerpta Medica) (1975 through January 2002)

  • ERIC (Educational Resources Information Center) (searched January 8, 2002)

  • Health and Psychosocial Instruments (HAPI) (through April 27, 2001)

  • LocatorPlus (through January 2002)

  • NDA Pipeline (searched November 1, 2001)

  • PsycINFO (1975 through January 31, 2002)

  • PubMed® (MEDLINE®, PreMEDLINE®, HealthSTAR) (1975 through January 2002)

  • Rehabdata (searched April 24, 2001)

  • U.K. National Health Service (NHS) Economic Evaluation Database (EED) (through January 2002)

  • U.S. Centers for Medicare & Medicaid Services (CMS) (formerly HCFA) (through January 2002)

  • U.S. National Guidelines ClearinghouseTM (NGC) (through January 2002)

Search Strategies

We employed different searches for different sections of the report, including different searches for different questions. The strategies for these different searches, given in PubMed® /MEDLINE® syntax, are provided in Appendix A.

Other Sources

In addition to the above searches, we also reviewed the bibliographies and reference lists of all studies included in this Evidence Report, and searched Current Contents—Clinical Medicine® on a weekly basis.

Article Inclusion Criteria

To be included in this Evidence Report, an article had to meet specific a priori criteria. Some of these criteria were specific to each question, and these are listed at the beginning of the discussion of each question in the Results section of this Evidence Report. Some criteria were common to all questions except Questions #1 and #9. These common inclusion criteriab were:

  1. The article described a study that enrolled (unless otherwise noted in certain questions) only patients with treatment-resistant epilepsy or, if other patients were enrolled, data from treatment-resistant patients were separately presented.

  2. Only full-length articles were included. We did not include meeting abstracts because they are often preliminary reports of results, and they seldom contain sufficient detail to allow evaluation of study design.

  3. The article described a study that must (unless otherwise noted in certain questions) have been published in 1985 or later. We adopted this criterion in accordance with the wishes of the Expert Panel, which stated that treatments for epilepsy, the technologies associated with the diagnosis of the disease, and the classification of its seizure types have substantially changed from what they were prior to 1985.

  4. Articles had to be English-language. We adopted this criterion out of consideration of the time and budget allotted for this project.

  5. The article described a study that enrolled 10 or more patients. Smaller studies may be of unusual methods or patients. Therefore, their findings may not be applicable to other patients or to settings outside the ones in which the study was conducted. Further, small surgical series may represent studies conducted by physicians who have comparatively little experience with the procedure.

  6. The article described a study that quantitatively reported an outcome or diagnostic test result of interest. Qualitative expressions of results do not allow conclusions to be drawn about how well a treatment works (i.e., about effect sizes) and provide little assurance that results were rigorously evaluated by the investigators.

  7. As per the desires of the Expert Panel and Technical Experts, a study of an intervention must have reported data on one or more of the following outcomes:

    1. Outcomes related to seizure frequency:

      • Absolute seizure frequency

      • Percentage change in seizure frequency from baseline

      • Proportion of patients seizure-free

      • Proportion of patients with >50 percent reduction in seizure frequency from baseline

      • Engel Classification

      • Rundown time to seizure-free

      • Seizure-free period

      • Proportion of patients with any reduction in seizures

      • Proportion of patients with any increase in seizures

      • Proportion of patients exiting a trial due to harmful seizure increases

    2. Nonseizure frequency outcomes:

      • Quality of life

      • Mood (we used this as a general term to describe a range of outcomes, from depression to psychosis)

      • Functional status/ability

      • Cognitive Function

      • Ability to stay in or return to work

      • Ability to stay in or return to school

      • Ability to hold a driver's license

      • Adverse events

      • Mortality

  8. Articles that present data pertaining to quality of life, mood, or cognitive function must have used a validated psychometric instrument. For the purposes of this Evidence Report, a validated psychometric instrument is an instrument for which there is evidence in the peer-reviewed literature to demonstrate that it has construct validity (it measures what it purports to measure) and good reliability (e.g. test-retest reliability; inter-rater reliability). Ideally, the instrument would have been validated using patients with epilepsy. However, given the scarcity of such data on quality of life and psychological status, we decided a priori not to require that all psychometric instruments be validated in a population of patients with epilepsy.

  9. If there were fewer than five studies of a given intervention or diagnostic, and none of these studies was a randomized controlled trial (RCT) that enrolled 50 or more patients in the treatment group, we did not include any studies of the diagnostic or intervention. Where an RCT with 50 or more patients did exist, we included that RCT even if there were fewer than five studies. We adopted the criterion partly out of consideration of the time and budget allotted for this project. However, this criterion also reduces the potential that publication bias will influence our conclusions. Conclusions drawn from small literature bases could be overturned by the results of only one or two unpublished studies. Therefore, requiring that five studies be available before analyzing a given intervention or diagnostic helps to reduce publication bias.

  10. When five or more controlled studies addressed a given intervention or diagnostic, we included only controlled studies. Otherwise, uncontrolled studies were included.

  11. Except for surgical topics (where the great majority of studies were retrospective) when five or more prospective studies addressed a given intervention or diagnostic, we included only the prospective studies. Otherwise, retrospective studies were included.

  12. When there were several publications describing the same trial, only the largest and most recent publication was included. This avoids double counting patients and the consequent distortion of measurements of effect size. We included earlier studies that reported data not in later publications, and earlier publications that contained data from more patients than later publications.

    All surgery trials meeting the inclusion criteria were further examined for patient overlap by cross-matching years of patient enrollment within articles published by the same surgery center. When articles presented overlapping patient populations, the article containing the most recent patient enrollment periods was included. When patient populations did not overlap, all articles from a single center were included.

  13. The seizure types examined in a study were classified according to the International League Against Epilepsy's International Classification of Epileptic Seizures, published in 1981,5, 14 or the article used terminology that was consistent with this classification.

As stated above, these criteria do not apply to Key Questions 1 and 9. We provide the criteria relevant to these questions in the text associated with them.

We reviewed the abstracts of articles identified by our searches against the inclusion criteria to determine whether we would retrieve it. Five research analysts independently performed this task, and each analyst worked on different questions. We retrieved an article whenever there was uncertainty about whether it met the inclusion criteria. We also retrieved articles when an abstract was not present in the search results, but the title of the article suggested that it might be relevant.

Once an article was retrieved, it was examined to determine whether it met the appropriate inclusion criteria. Articles that met these criteria were first examined for “fatal flaws” that precluded interpreting their results. Such articles were excluded. When an article was excluded for design flaws, we presented the reason(s) for its exclusion in Evidence Tables associated with each of the nine Key Questions in this report.

We then examined the remaining articles for design flaws that could potentially bias their results. Wherever possible, we empirically evaluated studies for the presence of bias. When data-driven evaluations of study quality were not possible, we documented and explained a study's potential for bias.

Articles Identified

We identified 11,111 articles with our searches. We retrieved 2,356 of these for Questions 2–9 according to a priori criteria. Three hundred fifty-seven articles remained after evaluating whether the full article met these criteria. After evaluating these latter articles for design flaws so severe that their results could not be interpreted, and after determining whether there were too few articles to permit a firm evidence-based conclusion (see above for Article Inclusion Criteria), 305 articles remained, and were included in this Evidence Report for Questions 2–9. One hundred eighty-five articles for Question #1 (on definitions of treatment-resistant epilepsy) were randomly sampled from these 305 articles.

Table 2. Number of articles in the Evidence Report
Key Question NumberRetrievedMet CriteriaIncluded
1185185185
24703410
32012020
46545750
5804206206
686120
74850
8611410
93299
Totals =2541542490

Note: Articles for Question 1 were also used to address other questions. Because these articles were evaluated twice, each for a different purpose, they are double-counted in the totals shown in the table. The article count in the text does not incorporate this double-counting

The number of articles retrieved, that met the inclusion criteria for each question, and were included in the Evidence Report are shown in Table 2.

Statistical Methods

Meta-Analyses

We performed meta-analyses of data from RCTs and uncontrolled trials. We performed meta-analyses of RCTs to estimate an average effect of treatment. We performed meta-analytic threshold analyses of uncontrolled studies to determine whether an intervention was plausibly effective. All meta-analyses, except meta-regressions, were performed with software programs developed by ECRI. This software has been extensively validated using published examples and hand calculations. Meta-regressions were performed using SPSS (version 10.1) Statistical Software (Copyright © SPSS Inc., 1989-2000).

Meta-analyses of Randomized Controlled Trials

Our meta-analyses of RCTs are exclusively comprised of random effects models. These analyses appear only in Question 4, and only in regard to the polytherapy drug strategy. We employed random effects models because of the types of trials that addressed this question. These trials were each of a single AED, and not all trials studied the same drug. Therefore, no assumption is made that these trials were all drawn from a single population.

An important aspect of these meta-analyses is that each of the trials is an instance of polytherapy, rather than a study of polytherapy, per se. This is because a planned study of polytherapy would investigate more than one drug and means that each trial in each meta-analysis in Question 4 represents only one way polytherapy might be tested. However, combining these trials into a single meta-analysis of polytherapy means that this analysis approximates a single trial that directly studied this strategy of drug administration.

Another important aspect of the RCTs in our meta-analyses is that some of the trials consisted of more than two groups and therefore have more than two effect sizes. These effects are not independent of each other. Ideally, sophisticated statistical techniques would be used (e.g., general linear modeling or hierarchical regression) to account for this dependence. However, in the present case, there was too little published information to permit such analyses. Therefore, we conducted two analyses on the data from any given set of trials. In the first analysis, we evaluated the effects of the intervention by comparing outcomes in the group that received a study's highest drug dose to outcomes in the placebo group. In the second analysis, we compared outcomes in the group that received a study's lowest drug dose to outcomes in the placebo group. Additional details about the specific analyses we conducted are provided in Question 4.

Because the results of these analyses are not independent, if a meta-analysis of a low dose of drug is statistically significant, a subsequent finding that the effects of high doses are also statistically significant does not provide “twice as much” evidence that the treatment is effective. Further, because the data allow computation of only a meta-analytic summary statistic, and do not permit attempts to explain the cause(s) of any between-studies heterogeneity, some information loss accompanies all of these analyses.

Our random effects analyses were performed as described by DerSimonian and Laird.15 As the measure of the effectiveness of treatment, we employed Cohen's h, the difference between the arcsine transform of two proportions divided by the pooled standard error.

We did not compare the effectiveness of different drugs. This conforms to the wishes of the Expert Panel and Technical Experts, who advised that such comparisons were of secondary importance.

Meta-analyses of Uncontrolled Studies

As mentioned above, our meta-analytic threshold analyses of uncontrolled studies are not intended to produce a summary statistic. Rather, we performed these analyses to assist readers in determining whether an intervention is plausibly effective. In these threshold analyses, we meta-analytically compared the improvement rate in treated patients to increasing rates of improvement in a hypothetical control group. Starting at 0 percent, we increased the rate of improvement in the “control” patients until the difference in improvement between the treated and “control” groups was no longer statistically significant. This value is the threshold. Thus, the threshold is the proportion of untreated patients who would have to improve to render the effect of treatment statistically nonsignificant. Except for analyses of sequential monotherapy, all of these analyses employed fixed effects models. In analyses of sequential monotherapy, we employed random effects models due to the use of different drugs as monotherapy. Where possible, we provide context for these thresholds by supplementing them with historical data on “control” patients obtained from published articles.

We also report the percentage of patients who improved after the intervention, but note that this percentage is not the difference between improvement in a treated and a control group and, therefore, is not the net effectiveness of the intervention. Nevertheless, this percentage is informative because it represents the proportion of patients likely to improve, regardless of the cause of their improvement. We estimated this percentage by back-transforming the summary statistic from the meta-analysis into a percentage.

We conducted these analyses using Cohen's h as the test statistic. In all analyses, we assumed that the number of patients in the “control” group equaled the number of patients in the treated group. We chose Cohen's h because, under these conditions, the Q statistic and each study's standardized residual remain constant as the proportion of improved patients in the “control” group increases.

Meta-regression

Whenever statistically significant heterogeneity among the study results was detected in our threshold analyses, we attempted to “explain” the heterogeneity using meta-regression. We used the Q statistic to determine whether an analysis was heterogeneous. Because this statistic is conservative,16, 17 we adopted a p-value of 0.10 (as opposed to the traditional significance level of 0.05) as the critical value for statistical significance.18

Typically, there was no strong a priori hypothesis to “explain” this heterogeneity. Therefore, we generated a set of regression models for any meta-analysis in which we found statistically significant heterogeneity. We constructed this set by first computing all possible models containing one predictor variable. The number of available predictor variables was often limited by incomplete reporting. This is because we required that at least 90 percent of the studies report the value of a given variable before we entered it into a meta-regression. When more than 90 percent of studies, but less than 100 percent of studies reported the value of a given variable, we assumed the mean value of the variable for the missing data.

After generating all possible one-predictor models, we generated all possible two-predictor models except those containing the coefficients that were not significant in the one-predictor models. Finally, we constructed all three-predictor models except those containing a pair of coefficients that were nonsignificant in the two-predictor models. We only constructed three-predictor models when (QE1-QE2)/QE0 >0.25, where QE0 is the value of QE when there were no predictors in the regression model, QE1 is the value of QE when there was one predictor in the regression model, and QE2 is the value of QE when there were two predictors in the model. We employed this rule to avoid over fitting data from small numbers of studies. Constructing multiple models also assisted in detecting multicolinearity.

For the purposes of constructing models, we set the alpha level required for significance of the regression coefficients at 0.10. This is anticonservative, but allows for examination of a broader range of models than would an alpha of 0.05.

In the text of the Evidence Report, we consider a model to be a plausible “explanation” of variability only if: (1) it was the only model in a set to produce a statistically nonsignificant (p >0.10) QE, (2) all coefficients in the model were statistically significant and, (3) adding another predictor variable to the model caused the value of QE to decrease by less than 25 percent with respect to the value of QE with no predictors in the model. For interpreting models, we used the traditional alpha level of 0.05 for the regression coefficients.

Other Meta-analyses

To ensure there were no systematic biases in the enrollment of patients in RCTs, we conducted, wherever possible, meta-analyses of the characteristics of patients enrolled in them. In these analyses, we compared the characteristics of patients in the control groups to the characteristics of patients who received the intervention. For example, in one such meta-analysis, we sought to determine whether females tended to be enrolled more in the control groups than in the experimental groups of studies of AEDs. These analyses employed fixed effects models, and we used Cohen's h or Hedges' d, as appropriate, to estimate the between-group differences.

Recognizing that meta-analysis has low statistical power to detect influences of patient characteristics on outcomes,19, 20 we extended our fixed effects analysis on surgical outcomes by performing appropriate meta-analyses of data from nested case-control studies. These studies reported the proportion of patients with a given characteristic who had successful or unsuccessful surgery, or they separately reported a continuous variable for patients who had successful or unsuccessful surgery. We performed meta-analyses on proportions using Cohen's h as the test statistic. For studies reporting continuous variables, we computed each study's appropriate point-biserial correlation coefficient and then meta-analytically evaluated these coefficients.

We performed analyses of data from nested case-control studies using only patient-level data. Although meta-analysis of such data using more sophisticated modeling techniques is preferable (e.g., hierarchical models), this was beyond the scope of the present project.

Sensitivity Analyses

We used sensitivity analysis to test whether our meta-analytic summary statistics were robust. We employed four such analyses for each summary statistic. These were recalculations of the meta-analytic summary statistic with: (1) the largest study removed, (2) the smallest study removed, (3) the study with the largest effect removed, and (4) the study with the smallest effect removed from the meta-analysis.

Other Computations

In addition to computing the above-described meta-analytic statistics, we performed numerous other statistical computations. We note each of these computations in the text of this Evidence Report and/or in footnotes to the in-text tables and Evidence Tables. Briefly, the computations we performed included:

  1. Statistical power analyses. Studies that do not contain a sufficient number of patients cannot detect statistically significant differences between groups, even when these differences are clinically meaningful. Therefore, when appropriate, we computed the smallest between-group difference that any given controlled study had the power to detect.

  2. Determinations of whether there were statistically significant differences between the characteristics of patients in the groups of any given study. This is particularly important for studies that are not randomized, because the patients in the different groups of such studies may not be comparable. Further, although other studies may report that they were randomized, the randomization protocol may not have been adequately followed or the study may not have been truly randomized (i.e., randomization may have been nonstochastic). These departures from randomization can manifest themselves in pretreatment between-group differences in patient characteristics. We recognize that in a properly randomized trial, such differences can arise from chance. However, searching for such differences in the context of a systematic review is justifiable because there is no other way to audit whether the randomization was, indeed, accomplished.

  3. Computation of pretreatment effect sizes. Departures from randomization can also manifest themselves as a statistically significant difference in the outcome between groups prior to the administration of treatment. For example, if the seizure frequencies experienced by patients in different groups were significantly different before treatment, the study may not have been truly randomized.

  4. Verification of 2 × 2 tables reported in studies of diagnostic tests. Because peer-reviewed published articles often contain errors in reported results, we attempted to verify the calculations in each article. If an error was found, we corrected the data and included it in our analysis.

  5. Computations of t-tests, chi-square tests, Fisher's exact text, odds ratios (OR), and their 95 percent confidence intervals (CI). Some studies included in this Evidence Report did not report the results of statistical tests that were important for answering the questions. We computed these statistics when such studies reported sufficient data.

  6. Computations of crude (CMRs) and standardized mortality ratios (SMRs). These quantities, which are useful for comparing the mortality rates among persons with epilepsy and those who do not have epilepsy, were not reported in all studies but some studies reported sufficient data to allow us to compute them. CMRs and SMRs are calculated as the number of observed deaths divided by the number of expected deaths. However, SMRs are standardized according to the age distribution of the study population and the age-specific death rates in the country of interest. If a study reports only a mean age or age range for their patient group, then only a crude estimate can be made as to the number of expected deaths. Therefore, caution is required in interpreting crude mortality ratios. We discuss the reasons required for this caution in detail in Question 8.

  7. Numerous other calculations of descriptive statistics for patient characteristics (e.g., mean age) and outcomes (e.g., seizure frequency) were performed when patient-level data were reported.

Methods of Evaluating Literature Quality

Studies of Interventions

Our evaluation of the quality of interventional studies employed three tools. The first was an evidence hierarchy we used to determine which studies to retrieve, include and, in certain questions, to determine whether there was greater potential for bias in some included studies than in others. The second tool we used was a checklist for evaluating each study's internal validity. Finally, we considered the difficulties inherent in certain outcome measurements.

Evidence Hierarchies

We did not restrict the studies included in this Evidence Report to RCTs. Rather, ours is a “best evidence” synthesis in which we accept the best available evidence, not the best possible evidence. Performing such an analysis on studies of surgery for epilepsy is particularly important. This is because withholding treatment to perform such an RCT may be unethical.

To determine the best available evidence, we used an evidence hierarchy. This hierarchy served, in part, to determine which studies we would include and retrieve. In some cases, we also used this hierarchy to evaluate a study's potential for bias. This hierarchy, shown in order of study designs with the least potential for bias to those with the greatest potential for bias was:

  • Randomized controlled trials

  • Controlled clinical trials

  • Studies that made measurements before and after treatment (pre/post studies)

  • Uncontrolled studies

Within each level of the hierarchy, we assumed blinded studies to have lower potential for bias compared to nonblinded studies, and prospective studies to have lower potential for bias compared to retrospective studies. Wherever possible, we empirically evaluated these assumptions, and the assumption that studies lower in the hierarchy had results that were different (i.e. were biased) from studies higher in the hierarchy.

Internal Validity Checklist

The internal validity of an interventional study represents the degree of confidence one can have in whether the intervention caused a change in the outcome of interest. The confidence in this causal relationship can be weakened by a number of biases. Because of this, we employed a second tool, geared to evaluate the potential difficulties with each included study's internal validity. This tool was a checklist of such potential difficulties, and was a modification of the scheme of Cook and Campbell.21 Thus, we evaluated each study to determine whether any of the potential biases listed below was present.

We stress that these are potential biases. The existence of a potential bias does not necessarily mean that a study's results were affected by the bias. We view this question as one that can be empirically determined.

Selection bias

Selection bias is relevant only to studies with control groups. This bias occurs when there are differences between the patients in the different arms of the study at the start of the study. These differences may lead to posttreatment differences in outcome that are not due to treatment. Random assignment of patients to the study arms protects against this bias, but the fact that a study states that assignment was random does not guarantee that randomization protocols were adequately followed. This is a concern when the method of randomization is not reported. In such instances, the method of randomization may not be truly stochastic.

Investigator bias

This bias can occur in studies that are not blinded. In such nonblinded studies, investigators are aware of who is receiving a particular treatment and who is not. This knowledge may influence the measurement of patient outcomes, especially when these outcomes rely on a degree of subjectivity. This bias can affect nonblinded studies of any design.

Patient bias

This bias can occur in open studies or in blinded studies in which blinding has been broken. As a result, patients are aware that they are, or are not, receiving a treatment. This knowledge may influence the way they report an outcome of interest. Given that many seizure frequency and quality of life outcomes rely heavily on patient reports (e.g., seizure diaries), this bias is particularly relevant to the studies considered in this Evidence Report. This bias affects nonblinded studies of any design.

Attrition bias

Attrition refers to the loss of patients before outcome measurements can be recorded. Patients may no longer return to the clinic because they have moved away, have improved to the extent that they believe they no longer need to see a physician, or have died. Because those who completed the study may not be representative of the entire group of patients who entered the study, analyses based only on study “completers” may be biased. This bias can affect studies of any design. In this report, we did not set any limit for attrition beyond which we would not consider the study in our assessment. Exiting a trial before completion was considered an important outcome in our evaluation of studies of drug treatment. In our evaluation of vagal nerve stimulation, we specifically looked for any influence of attrition on outcomes.

Measurement bias

Measurement bias occurs when the method used to measure a particular outcome systematically over- or underestimates the true effect of treatment on that outcome. For example, general health status instruments (e.g., SF-36) may be less sensitive than disease-specific instruments for detecting small changes in health status that are important to patients.22 This bias can affect studies of any design. In the epilepsy literature, the use of seizure diaries to measure seizure frequency may be a source of potential measurement bias. We discuss why in the section entitled “Validity of Seizure-related Outcomes” (below).

Regression bias

This bias, also known as regression to the mean, can occur when there are patients who, upon entry into a study, have relatively good (or relatively poor) performance on an outcome. For example, patients may enter a study when their condition is at its worst. When the disease is not progressive, these patients are unlikely to be so ill upon subsequent measurement, even in the absence of treatment. Patients with extremely high pretreatment seizure frequencies may experience reductions in seizure frequency, even without treatment.23 This bias affects studies of all designs except well-designed RCTs.

Extraneous event bias

This bias occurs when events other than the intervention of interest cause improvements in health outcomes. For example, in an uncontrolled longitudinal study, treatment may incorrectly appear to cause an improvement in health outcomes if patients are given new, effective methods of patient management. This bias can affect studies of all designs including RCTs. RCTs will be affected if the new methods are not uniformly applied to all patients.

Sampling bias

Sampling bias occurs when a study either does not include all enrolled patients who received the treatment of interest, or does not include a random sample of the enrolled patients who received the treatment of interest.

Maturation bias

Maturation bias occurs if individuals improved because of developmental maturation, and not because of treatment. In the present Evidence Report, we only evaluate studies for potential maturation bias if they used followup periods longer than 1 year.

Sample specification bias

In the context of the present report, sample specification bias occurs if a study enrolled some patients who were not treatment-resistant. For example, studies that did not explicitly state that all enrolled patients were experiencing seizures despite prior treatment with at least one AED given at maximum tolerable dose may not have exclusively enrolled patients who met the definition of treatment resistance that was suggested by the Expert Panel and Technical Experts (see the section entitled “Defining Treatment Resistant Epilepsy”). Sample specification bias is a lesser issue for studies of surgical interventions for epilepsy than for the other interventions discussed in this Evidence Report. This is because of the relatively extensive presurgical evaluations that surgical candidates receive.

Statistical power

Studies with low statistical power do not have the ability to find statistically significant differences between groups or between one test and a subsequent test. As such, the failure of a low power study to find a statistically significant difference does not always imply that an intervention is ineffective. This is because the study may not have had the power to detect clinically important differences.

Determining whether a study has sufficient power to detect clinically important differences is subjective. This is because determination of what a clinically important difference is ultimately requires the opinions of patients. Often, these opinions are not well studied. Consequently, we have refrained from making such judgments and, instead, provided the reader with sufficient information to make their own. We accomplish this by computing the smallest between-group percentage difference that a statistically nonsignificant study could have detected. The reader then needs to compare this percentage to the percentage change deemed clinically important. Assume, for example, that we computed that a study only had the power to detect a 30 percent decline in seizures as statistically significant. The reader may decide that, in fact, a 10 percent decline in seizure rates is clinically significant, and then note that the study did not have the statistical power to detect this difference. This would mean that the results of the study were not informative.

Because our consideration of a study's power involves de novo calculations, we consider power in the “synthesis of study results” section of each question, and not in the section devoted to evaluating a study's internal validity.

Validity of Seizure-related Outcomes

There are several commonly reported ways to measure seizures, including mean and median frequencies, the proportion of patients who experience a reduction in seizures greater than a certain percent (e.g., the proportion of patients who experience a greater than 50 percent reduction in frequency), and the percentage of patients who exit a trial due to seizure increases.

All of these outcomes depend upon patient reports, often in the form of seizure diaries. One difficulty with these diaries is that they rely on the objectivity, and memory of the individual responsible for keeping the diary. This affects the accuracy of records of all seizures (see “Measurement bias” above), and accurate recording of auras may be particularly problematic.

Another problem with the way in which seizure frequencies are reported is that not all outcome measurements capture what happens to all patients. For example, a common way to report seizure frequency is to report the percentage of patients who had a 50 percent (or some other percentage) or greater reduction in seizure frequency after treatment. This type of outcome only captures information about patients whose seizure rates decreased. It does not capture information about patients who experienced increases in seizure frequency. In fact, expressing results in this way can be quite misleading. For example, assume a study that reported that seizure frequency reduced by 50 percent or more in 40 percent of their patients. Seizure rates may have actually increased in the remaining 60 percent of patients and, if this were true, the treatment might actually be harmful.

In contrast, measures of absolute seizure frequency do capture information about all patients because the results of patients who became worse are combined with the results of those who improved. However, absolute seizure frequencies are not normally distributed, so a simple average is not an appropriate summary of the data. As a result, many studies report median seizure frequencies. While medians are an appropriate measure of the central tendency of such data, they pose technical difficulties. In particular, methods for combining medians in a meta-analysis are not well developed. One way around this difficulty is to transform seizure frequencies by using a natural log transform. This would render the data normally distributed and allow for computation of meaningful averages and measures of dispersion. Published studies rarely report such results.

Another way of reporting results is to provide the number or proportion of patients who exited a trial due to changes in seizure type and/or frequency. Although trial exit per se can be accurately recorded, it is also based on seizure frequencies and, therefore, typically depends on the accuracy of seizure diaries. Another difficulty with this outcome is that it is relatively insensitive to small or moderate increases in seizure frequency. This is particularly true because a common criterion that investigators set for exiting a monotherapy drug trial is a doubling of seizure frequency.

Studies of Diagnostics

The biases that can affect studies of diagnostics are different from those that can affect studies of interventions. The checklist we employ for determining whether a diagnostic study was potentially affected by a bias incorporated items suggested by Lijmer, Mol, Heisterkamp, et al.,24 Irwig, Tosteson, Gastsonis, et al.,25 Gann,26 Begg and Greenes,27 and Ransohoff and Feinstein.28 These biases are:

Spectrum bias

This bias occurs when there are differences between populations in the spectrum of disease presentation and severity. In the present report, it manifests itself in diagnostic case-control studies in which “cases” (in this instance, patients with epileptic seizures alone) and “controls” (patients with nonepileptic seizures) were selected for inclusion because they were known to have epileptic or nonepileptic seizures prior to the study. Such studies therefore enrolled cases that are relatively easy to diagnosis, and did not enroll cases that are more difficult to diagnose. The effects of spectrum bias have recently been demonstrated empirically by Lijmer, Mol, Heisterkamp et al.24 who found that the diagnostic odds ratio was approximately three times greater in diagnostic case-control studies compared to studies of the same diagnostic carried out using unbiased populations.

Imperfect reference standard bias

This bias occurs when a reference standard against which the diagnostic performance of the diagnostic of interest was measured is not perfect (not a true “gold standard”).

Differential reference standard bias

In the context of this Evidence Report, this bias occurs when patients allocated to the epileptic and nonepileptic seizure groups were not diagnosed using the same reference standard. For example, patients with epileptic seizures may have been diagnosed in a neurology department using a diagnostic such as video-EEG, but patients with syncopal seizures may have been diagnosed in a cardiac department using a diagnostic such as a tilt-table.

Prevalence bias

Prevalence bias occurs when the numbers of cases and controls in a case-control study are artificially chosen to be equal. This artificial prevalence introduces a bias that influences the positive predictive value (PPV) and negative predictive value (NPV) in a manner described by Bayes' theorem.29

Interpretation bias

This bias occurs when the results of the test of interest are subjective and can be influenced by factors that are unrelated to the disease of interest.

Patient bias

This bias may occur in diagnostic studies when patients are aware of their diagnostic group allocation. This bias is a particular problem when the diagnostic of interest involves patient input. For example, the Minnesota Multiphasic Personality Inventory (MMPI) has been proposed as a means of differentiating patients with epileptic seizures from patients with psychogenic seizures. This instrument requires patient input. Patients' awareness of their diagnostic group allocation may influence their input.

Investigator bias

This bias may occur in diagnostic studies when investigators interpreting the results of the diagnostic of interest are not blinded to the diagnostic group allocation of the patients in the study. This is a particular problem when the investigator is required to “interpret” the findings of a diagnostic test. For example, the interpretation of a CT scan requires that an investigator interpret the image. If the investigator is aware of the diagnostic categorization of the patient, his interpretation of the CT image may be influenced.

Verification bias

This bias is only relevant to studies that used followup to confirm the accuracy of the diagnostic of interest and occurs when only one group of patients is followed. This group typically consists of only those with a positive diagnosis. For example, only those diagnosed by the test of interest might be followed up.

Diagnostic yield bias

This bias may occur when only a subset of patients enrolled in a study is reassessed. For example, some patients do not experience a seizure during re-evaluation, so diagnostic data cannot be collected from them. If these patients are somehow different from patients in whom a diagnostic reassessment was possible (for example, the subgroup contained a higher proportion of patients with nonepileptic seizures), then this may lead to a biased estimate of prevalence.

Studies of Mortality

Two questions in this Evidence Report, Questions #8 and #9, concern epilepsy-related mortality rates. Studies examining mortality rates may be biased by factors that are different from those that bias the results of other kinds of studies. Therefore, we examined studies of mortality for the following potential biases:

Sample specification bias

See the above definition of this bias.

Sampling bias

See the above definition of this bias.

Cause validation bias

This bias may occur in studies that did not determine the cause of death by autopsy. For example, investigators may assume that epilepsy is the cause of death in patients with epilepsy who died suddenly. This bias will artificially inflate death rates due to epilepsy.

Mortality ratio bias

This bias occurs in studies that did not present standardized mortality ratios or in studies that did not present sufficient information to allow us to calculate these ratios. Other methods of computing mortality do not allow mortality rates to be standardized by age, which could bias mortality differences in either direction.

Control selection bias

This bias affects only Question #9 regarding sudden unexplained death. It occurs when studies of mortality use an inappropriate control group. For example, in a case-control study of sudden unexpected death where all of the cases were children with epilepsy, the control group should not consist of adults with epilepsy. This would increase the likelihood of finding a spurious relationship between sudden death and a variable that may be unrelated to sudden death (e.g. childhood epilepsies are likely to differ from epilepsies that afflict adults in ways that may be unrelated to the risk of sudden death). An appropriate control group would be living children with epilepsy.

Statistical control bias

This bias also affects only Question #9. It may occur if studies evaluating a relationship between two variables did not use a statistical method that adjusts for the possible effects of other variables. For example, regression techniques are often useful for determining the influence of a variable on an outcome. When such techniques are not used, the magnitude of the relationship between a variable and the outcome may be misestimated.

External Validity

We evaluated each study's external validity (generalizability) according to patient characteristics appropriate to each question. These characteristics are provided as we address each question. We did not evaluate external validity when evidence-based conclusions could not be reached.

Presentation of Results

Evidence Tables vs. Tables

The results of our analyses of internal and external validity and of our meta-analyses are presented in two types of tables. Evidence Tables contain detailed information on each of the studies used in an assessment and the results of meta-analyses of these studies. The Evidence Tables tend to be large and are therefore contained in a separate volume of this report. They are organized according to the Key Questions addressed in this report. Other tables appear in the Results chapter following the discussion of each intervention being assessed. These tables are intended to provide a brief listing and description of the studies and the outcomes reported in these studies. Tables addressing the internal validity of studies used in an assessment are presented in Appendix B.

Figures

Figures are also presented in the Results chapter after the discussion of each intervention assessed in this report. Figures are designed to present summary information in the form of a forest plot (array of study effect sizes usually with a summary estimate), graphs of threshold analyses and meta-regressions, or other appropriate graphical presentations.

Peer Review

Internal Review

Throughout the preparation of this report, the five analysts and the Project Manager held numerous meetings to determine the strategy and methods of analysis. The Project Manager then individually reviewed each completed section of the report, and suggested changes. Upon completion of these changes, the individual sections were assembled into an initial draft report that was again reviewed by the Project Manager. Subsequent to changes made in response to this draft, it was distributed to the five analysts in the project team for review. Suggested changes were reviewed by the Project Manager and discussions were held among the project team to determine which suggestions would be incorporated. Upon incorporation of the appropriate changes, the draft report was sent for external review.

External Review

To select peer-reviewers for the draft Evidence Report, ECRI prepared a list of 27 potential reviewers. This list was submitted to AHRQ, which approved all reviewers. Letters inviting these individuals to review the draft report were then mailed. Twenty-five individuals responded to these letters, 19 agreed to review the draft Evidence Report, and nine individuals returned reviews.

Upon receipt of reviews, ECRI revised the draft report accordingly. ECRI also prepared a document describing the disposition of all substantive reviewer comments and supplied this document to AHRQ for review and approval.

Chapter 3. Results

Definitions of Treatment-Resistant Epilepsy

In this section of the Evidence Report, we addressed Key Question #1: What are the definitions of treatment-resistant epilepsy used in the literature?

The purpose of this question is to catalogue the definitions of treatment-resistant epilepsy that appear in the published literature. To address this question, we abstracted the phrase or sentence used to describe treatment-resistant patients with epilepsy in clinical studies, in clinical practice guidelines, and in reviews that met the inclusion criteria listed below. To tally the number of publications defining treatment resistance, we considered even the least specific of definitions (Evidence Tables 13). However, a synonym was not considered a definition. If patients were described with any of the following terms, and those terms were not further defined, we considered the definition to be “Not Reported”:

  • Medically intractable

  • Medication-resistant

  • Medically refractory

  • Medically resistant

  • Medically uncontrolled

  • Drug-resistant

  • Refractory

  • Intractable

  • Pharmaco-resistant

  • Chronic treatment-resistant

  • Inadequately controlled

  • Uncontrolled

  • Poorly controlled

  • Therapy-resistant

Because the majority of studies and reviews did not report a definition, we also examined the patient inclusion criteria that were used in published studies. Although these criteria do not comprise a formal explicit definition of treatment-resistant epilepsy, they can be used to determine whether there is a consistently applied implicit definition of this term. Such implicit definitions, however, are less informative than explicit definitions. This is because inclusion criteria are constructed to meet the specific demands of the study rather than to address the general concept of what constitutes treatment-resistant epilepsy.

Question specific inclusion criteria

As noted in the Methodology section, the general inclusion criteria listed in that section do not apply to this question. Rather, we included:

  1. Any clinical study that was evaluated in Questions 2 to 6, that enrolled at least 50 patients, and that was published in 1996 or later. All such studies meeting the initial inclusion criteria for each question were included, regardless of whether they were later excluded from the analysis of that question. We abstracted definitions from clinical studies in an effort to obtain a broad sample of definitions. We did not include articles retrieved for Key Questions 7 – 9, because the nature of these studies made them less likely to include definitions of treatment-resistant epilepsy.

  2. A random sample of 100 review articles on treatment-resistant epilepsy published between 1996 and 2001, inclusive. We chose this random sample by using a random number generator to assign a random number to each of the 298 review articles identified in our searches for this Evidence Report. We chose to use a random sample rather than a comprehensive dataset out of consideration of the time and budget for this project.

  3. Any evidence-based clinical practice guidelines identified during our searches. We termed a guideline as “evidence-based” if it was included in the National Guidelines Clearinghouse (NGC).c

Evidence base

Table 3. Definitions of treatment resistance
SourceNumber of Publications Selected for QuestionNumber of Publications Reporting DefinitionsPercentage of Publications Reporting Definitions
Question 26350%
Question 310440%
Question 426727%
Question 539923%
Question 611100%
Research Articles Total822429%
Treatment Guidelines3133%
Review Articles1002121%
Grand Total1854625%
For this question, we included 82 published clinical studies, 100 randomly selected review articles and 3 clinical practice guidelines. Thus, we examined 185 publications. The number of publications reporting a definition are listed in Table 3.

Design and conduct of included studies

This question addresses definitions, not an intervention or diagnostic. As such, an evaluation of the quality of the literature is not relevant.

Definitions in Included Articles

Of the 82 clinical studies that met our inclusion criteria for this question, only 24 (29 percent) reported an explicit definition of “intractable”, “refractory”, “treatment-resistant,” or any similar term. The remainder merely stated that the patients they enrolled had treatment-resistant epilepsy (or some equivalent term) without defining that term. Of the 24 articles reporting a definition, five definitions did not include any specific information (e.g. “incompletely controlled by existing therapy”).30 One study defined treatment resistance in terms of seizure frequency with no mention of treatment.31

Of the remaining 19 studies, 15 reported the number of AEDs patients tried before being considered treatment-resistant. Two studies required at least one AED, four required at least two, and four required three. Five were nonspecific (e.g. “multiple”). Six of the studies named the AEDs that they required patients to have tried before being considered treatment-resistant.

Three definitions mentioned intolerable side effects or ineffectiveness at maximum tolerated dose as a reason to consider drug treatment unsuccessful, four included seizure frequency as part of the definition, six included duration of symptoms, and one mentioned monitoring serum drug levels (Evidence Table 1). None of the studies mentioned auras. Because only 29 percent of the initial 82 articles reported definitions, a common definition of treatment-resistant epilepsy does not seem to be used in the literature. Even among the studies reporting a definition, no consensus can be discerned.

Definitions in Clinical Practice Guidelines

Of the three guidelines identified by our searches, only one reported a definition of treatment-resistant epilepsy. This guideline defined a patient with treatment-resistant epilepsy as having “inadequately controlled seizures or significant side effects for whom no options had been available.32” The reported definitions from guidelines are listed in Evidence Table 2.

Definitions in Review Articles

Of the 100 review articles surveyed, 79 articles defined treatment-resistant epilepsy as the presence of uncontrolled seizures or a similar term. Two of the remaining definitions were not specific.33, 34 Of the remaining 19 reviews, eight reported the number of AEDs patients tried before being declared treatment-resistant. Three of the eight reviews required at least two AEDs and three required three. Two of the eight reviews were not specific (e.g. “several”). Only one of the reviews named the AEDs that they required patients to have tried before being considered treatment-resistant.

Twelve reviews mentioned intolerable side effects or ineffectiveness at maximum tolerated dose as a reason to consider drug treatment unsuccessful. Four reviews mentioned frequent seizures as part of their definition, but none of these quantified what was meant by “frequent.” Rather, their effect on the ability of the patient to lead a normal life was considered the proper criterion in three of these four reviews.

Four definitions included duration of symptoms, with one simply stating that duration was not a criterion.35 Three mentioned monitoring serum AED levels, with one stating that dosage of AEDs should be increased to the maximum tolerated regardless of serum concentrations.36 None of the reviews mentioned auras as part of their definitions. Reported definitions are listed in Evidence Table 3. No consensus definition of treatment-resistant epilepsy can be inferred from the available information.

Definitions Implied by Inclusion Criteria and Patient Characteristics in Clinical Studies

Because definitions were infrequently reported, we examined the inclusion/exclusion criteria and the characteristics of patients in clinical studies to determine the characteristics of patients deemed to have treatment-resistant epilepsy. These characteristics may imply a definition. However, the requirements of a trial are not necessarily the same as the requirements of a patient seeking treatment. A patient experiencing one seizure a year may be considered treatment-resistant but is unlikely to be included in a clinical trial. Thus, patient inclusion criteria may be biased toward enrolling more severely ill patients.

In addition to listing inclusion/exclusion criteria of studies (Evidence Table 4), we examined, at the request of the Expert Panel and Technical Experts, whether these criteria differed depending on the purpose of the trial or the target population of the intervention being studied (Evidence Table 5).

Of the 82 clinical studies included, eight specifically examined pediatric patients, two focused on Lennox-Gastaut syndrome, while two examined mesial temporal sclerosis (MTS), and one examined non-MTS focal lesions.

There were 19 drug trials for US Food and Drug Administration (FDA) approval and 17 additional drug trials that were not performed for this purpose. There were seven nonsurgical studies of nondrug treatments and 39 surgery trials. The surgery trials can be further broken down into control patients (2), temporal lobe surgery (27), hemispherectomy (2), frontal lobe resection (3), multiple subpial transection (2), and corpus callosotomy (3). As can be seen in Evidence Table 5, only nine of 82 studies (11 percent) reported whether AEDs were given until the maximum tolerated dose was reached before treatment was considered a failure. The majority of these studies (six) were drug studies that were not performed in order to obtain FDA approval. However, six studies is still a minority (35 percent) of the 17 non-FDA drug studies meeting the inclusion criteria. Only 13 studies (16 percent) required a minimum duration of illness before patients were considered treatment-resistant. We considered this number too small for a meaningful analysis of whether different types of studies required different durations of illness.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf2.jpg.

   Figure 2. Minimum number of AEDs: different patient types

*“>1” indicates that the study used a nonspecific term such as “several”

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf3.jpg.

   Figure 3. Minimum number of AEDs: different treatments

*“>1” indicates that the study used a nonspecific term such as “several”

In contrast, 44 studies (54 percent) required patients to have tried a minimum number of AEDs before being considered treatment-resistant. In Figures 2 and 3, we examine whether a study of a particular type of treatment, or patient has a different requirement for the minimum number of AEDs compared to other studies. To be included in this summary, a subgroup of studies had to include at least five studies reporting such a requirement.

In those studies that reported a minimum number of AEDs, the majority required at least one AED. This proportion did not differ dramatically from the proportion in studies of pediatric patients or studies in which there was no special patient group.

All FDA drug studies and most non-FDA drug studies reported that a minimum number of AEDs must have been tried without success before a patient was considered treatment-resistant. In both cases, the minimum number was nearly always one. Most studies of surgery (80 percent) did not report a minimum number of AEDs that had been tried. However, when a number was reported, it was always greater than one. This difference between drug and surgical trials probably reflects differences in trial qualifications rather than differences in definitions of treatment resistance.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf4.jpg.

   Figure 4. Minimum baseline seizure frequency: different patient types

Nearly half (49 percent) of the studies reported a minimum seizure frequency before patients were considered treatment-resistant. This number ranged from less than 1 per month to 60 per month. Some studies (2.4 percent) were not specific about the precise number required, reporting only that seizures were “frequent” or some equivalent term. Pediatric studies differed from studies in which no special group was examined (Figure 4) in that a higher proportion of studies required a minimum seizure frequency (75 percent, as opposed to 49 percent) and the required seizure frequency tended to be lower. Among pediatric studies, 38 percent required a minimum seizure frequency of less than two per month, while among studies of no special group, 36 percent required a minimum of two to five.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf5.jpg.

   Figure 5. Minimum baseline seizure frequency: different treatments

When studies of different treatments are compared, studies of surgery seldom (8 percent) reported a minimum seizure frequency (Figure 5), while drug studies conducted to obtain FDA approval always reported these data.

Rediagnosing and Reevaluating Treatment-Resistant Epilepsy

In this section of the Evidence Report, we addressed Key Question #2: Which methods of rediagnosing or reevaluating treatment-resistant epilepsy lead to, or can be expected to lead to improved patient outcomes?

There are three primary roles for diagnostics in the management of patients with epilepsy. The first is to determine whether the patient is experiencing epileptic or nonepileptic seizures. Once a firm diagnosis of epilepsy has been established, the second role is to aid in the classification of epileptic seizures into seizure type. The third role of a diagnostic in the management of patients with epilepsy is to aid in the lateralization and localization of epileptic foci prior to epilepsy surgery. In this section of the report, we address the first two of these roles and how they apply to the subpopulation of patients with treatment-resistant epilepsy. We did not address the third role, which was in keeping with the desires of the Technical Experts and the Expert Panel.

We partitioned Question 2 into four subquestions (A - D). The first two subquestions address the differential diagnosis of epileptic seizures from nonepileptic seizures. The remaining two subquestions address the differential diagnosis of different seizures types. Whether we addressed some questions depended on the findings from previous questions.

We examined the peer-reviewed literature to determine whether there was evidence to suggest that some patients with a diagnosis of treatment-resistant epilepsy were misdiagnosed and their seizures were either not epileptic or they consisted of a combination of epileptic and nonepileptic seizures (Question 2A). If such evidence was found, we then examined the literature to determine which diagnostic technologies were likely to aid in the differential diagnosis of epileptic seizures from nonepileptic seizures (Question 2B).

Similarly, we examined the peer-reviewed literature to determine whether there was evidence to suggest that some patients with treatment-resistant epilepsy were diagnosed with an incorrect seizure type (Question 2C). If such evidence was found, we then examined the literature to determine which diagnostic technologies were likely to aid in the differential diagnosis of one seizure type from another (Question 2D).

Do all patients diagnosed with epilepsy that is deemed to be treatment-resistant truly have epilepsy?

To address Question 2A, we looked for studies that attempted to estimate the prevalence of patients with nonepileptic seizures among populations of patients with a diagnosis of treatment-resistant epilepsy. These nonepileptic seizures may have been the sole seizure type experienced by a patient (in which case the patient was misdiagnosed), or they may have occurred in addition to true epileptic seizures (in which case the patient was correctly diagnosed with epilepsy but the additional diagnosis describing the nonepileptic seizures was missed). In the former case, patients would not be expected to respond satisfactorily to treatment with AEDs. In the latter case, the epileptic seizures may be well controlled by AEDs, and the seizures experienced by the patient are nonepileptic in nature. In either case, such patients would, unless given a new diagnosis, remain incorrectly labeled as exclusively having treatment-resistant epilepsy.

Question specific inclusion criteria

Articles were included for Question 2A if they met the general criteria for inclusion presented in the Methodology section and the article reported that patients originally diagnosed as having epilepsy at the time of enrollment into the study were considered treatment-resistant. Studies that enrolled patients with known nonepileptic seizures (either alone or in combination with epileptic seizures), in addition to patients considered to have treatment-resistant epilepsy alone, cannot be used to answer Question 2A. Consequently, such studies were not considered for inclusion in this section of the report unless data from these patients were presented separately. We did not exclude studies that enrolled patients with a diagnosis of treatment-resistant epilepsy but who were suspected of having nonepileptic seizures. This is because all patients who were enrolled in such studies did have a diagnosis of treatment-resistant epilepsy on entry into the study and such studies do contain information on the accuracy of the original diagnosis of epilepsy.

Excluded studies

We did not exclude any of the articles that met both the general criteria for inclusion in this report and the question-specific inclusion criterion for reasons related to poor quality.

Evidence base

Table 4. Evidence base for determining if patients diagnosed with treatment-resistant epilepsy actually have epilepsy
ReferenceStudy DesignCountry in Which Study PerformedSize (N)MulticenterNumber of Centers
Zaidi (2000)38Cross-sectional case seriesUnited Kingdom74Yes2a
Holmes (1998)39Cross-sectional case seriesUnited States379No1
Henry (1997)37Cross-sectional case seriesUnited States145No1
Arnold (1996)40Cross-sectional case seriesUnited States45No1
Slater (1995)41Cross-sectional case seriesUnited States101No1
a

Patients enrolled at two centers but diagnostic reassessment was performed at a single study center

Five articles met both the general inclusion and the question-specific inclusion criterion presented above. These five articles are listed in Table 4. Details of these studies are presented in Evidence Tables 6 through 9.

All five articles in the evidence base were cross-sectional, case series. In these studies, a series of patients (total N = 744) were given diagnostic reassessment in order to determine the prevalence of patients with nonepileptic seizures among specific subgroups of patients, all of who were considered, prior to reassessment, to have treatment-resistant epilepsy.

Four of the five articles included in Table 4 described studies that were carried out at a single center. The remaining article described a study in which patients were recruited at two different centers. However, all patients in this latter study had their diagnosis reassessed at a single study center by a single diagnostic team.

Design and conduct of included studies

The following section presents the findings of our systematic assessment of the quality of the evidence base on the prevalence of patients with nonepileptic seizures (alone or in combination with epileptic seizures) among patients with a diagnosis of treatment-resistant epilepsy. This systematic assessment consists of an appraisal of each study's internal and external validity.

Internal validity

The internal validity of a study designed to measure the prevalence of some disease in a population of interest can be weakened by a number of potential biases. Sampling bias is not a concern in these studies because patients were consecutively enrolled during a fixed period. Reference standard bias is a concern in all of the studies because at present no stand alone “gold-standard” for diagnosing epilepsy is available for routine use in clinical practice. Thus, in practice, the differential diagnosis of epileptic seizures is based on a clinical judgment made by one or more specialists. This judgment is based on information from many sources. These sources include medical history, routine-EEG, ambulatory EEG, video EEG, imaging data, cardiac monitoring data, etc.

The potential biases in each study included in the evidence base for this question are discussed in greater detail in Appendix B.

External validity

The generalizability of a study's results were evaluated by examining the study's inclusion and exclusion criteria, and by evaluating the characteristics of the patients actually enrolled in the study. Details of the inclusion/exclusion criteria used by each of the relevant studies, along with the characteristics of the patients actually recruited by these studies, are presented in Evidence Tables 7 and 8.

The ability to draw conclusions about the generalizability of the studies addressing this question is limited because details on patient characteristics were incompletely reported. All five of the studies included in the present evidence base were carried out at specialist referral centers (three were specialist electrophysiology centers, one was a specialist neurosurgery center and one was a specialist epilepsy center). Such patients are unlikely to be representative of the general population of patients with treatment-resistant epilepsy. In addition, none of the five studies included children less than sixteen years of age. Therefore, the prevalence data extracted from the studies included in the present evidence base may not be generalizable to pediatric populations.

In four of the studies, some of the patients were referred to the specialist center for a diagnostic reassessment because their original seizure diagnosis was deemed questionable. Estimates of the prevalence of patients with an incorrect diagnosis based on data collected from these studies are likely to lead to an overestimate of the true extent of the misdiagnosis problem as it occurs in the more general population of patients with treatment-resistant epilepsy. In the remaining study, the study sample consisted of patients who were all considered candidates for epilepsy surgery. Not all patients with treatment-resistant epilepsy are surgical candidates and, thus, the findings of this study can only be generalized to a very select population of patients.

Synthesis of study results

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf6.jpg.

   Figure 6. Prevalence of nonepileptic seizures

Prevalence of nonepileptic seizures among patients diagnosed with treatment-resistant epilepsy

NES: Non-epileptic seizure

ES: Epileptic seizure

The prevalence of nonepileptic seizures among the patients in each of the five studies used to address Question 2A are presented in Evidence Table 9 and are summarized in Figure 6. This figure demonstrates that between 8.3 percent and 37.6 percent of patients believed to have treatment-resistant epilepsy turned out to either not have epilepsy or to suffer from a combination of both epileptic and nonepileptic seizures. With the exception of the study of Henry and Drury,37 the majority of these patients suffered from nonepileptic seizures alone. Only a small proportion of patients had a combination of both epileptic and nonepileptic seizures (Range: 0 percent to 1.0 percent).

The patients examined in the study of Henry and Drury37 were undergoing presurgical evaluation. Such patients were probably assessed more often and/or more completely by clinicians who specialize in epilepsy compared to most other patients with a diagnosis of treatment-resistant epilepsy. Consequently, that no patients in their sample suffered from nonepileptic seizures alone is not a surprise. However, that 8.3 percent of the patients in this study experienced a combination of epileptic and nonepileptic seizures is surprising. Followup of these patients revealed that the changes in patient management that resulted from the reassessment led to a complete cessation of seizures in three patients (25 percent). Whether these three patients would have been identified prior to surgery had the study not been performed is unknown. However, they may possibly have undergone unnecessary surgery.

To determine an overall estimate of the prevalence of patients with nonepileptic seizures among patients who, prior to re-evaluation, were considered to have treatment-resistant epilepsy, we performed a meta-analysis. This meta-analysis did not include prevalence data abstracted from the study of Henry and Drury37 for reasons explained above. The results of this homogenous (Q = 0.28; p = 0.96439) fixed-effects meta-analysis are presented in Evidence Table 10.

This meta-analysis shows that the proportion of patients who were misdiagnosed as having treatment-resistant epilepsy was substantial (35 percent; CI: 29 percent to 41 percent). Therefore, a problem of misdiagnosis clearly exists in clinical practice. However, these findings do not accurately represent the proportion of misdiagnosed patients in the overall population of patients with treatment-resistant epilepsy because none of the studies included in the meta-analysis were population-based. Also, the patients included in the four studies that we did meta-analyze represent a subpopulation of patients referred for specialist evaluation of their seizures.

Which diagnostic modalities are useful in differentiating seizure types commonly mistaken for epilepsy from true epileptic seizures?

Based on the results of Question 2A, we addressed Question 2B by evaluating the evidence on the diagnostic technologies most commonly used to differentiate epileptic seizures from nonepileptic seizures. As stated above, in clinical practice, the differential diagnosis of epileptic seizures is usually based on information from many sources (medical history, etc.). The clinical diagnosis is seldom based on one diagnostic technology alone. Ultimately then, to answer this question, diagnostic performance data from each analysis of each individual diagnostic technology must be combined into a single decision model which better describes the true clinical picture. As will be seen, a paucity of available evidence precluded the construction of such a model. Thus, we were limited to an analysis of the clinical utility of individual “stand alone” diagnostic technologies.

Question specific inclusion criteria

In addition to employing the general inclusion criteria, we included articles if they met the following criteria:

  1. The study must have evaluated the effectiveness of a diagnostic technology used for the differential diagnosis of epileptic seizures from nonepileptic seizures.

  2. The patients enrolled in the study were not restricted to only those with treatment-resistant epilepsy. Because the intent of Question 2B is to determine the utility of those diagnostic technologies that have been used to differentiate epileptic seizures from nonepileptic seizures, addressing this question requires a study that enrolls both kinds of patients.

  3. The study must have either reported diagnostic test performance characteristics (e.g., sensitivity and specificity) or presented data in a format that allows calculation of test performance characteristics based on a comparison with some “reference” standard.d Alternatively, the study must have included followup data that allow conclusions about the effects of using a diagnostic on patient outcomes.

Number of articles addressing each diagnostic

Table 5. Articles addressing each diagnostic
DiagnosticNumber of Articles
Blood Prolactin Levels8
Minnesota Multiphasic Personality Inventory6
Video-EEG6
Ambulatory-EEG5
Provocation techniques4
Routine EEG4
Creatinine kinase levels3
Tilt table2
Auditory evoked potentials1
Hypnotic recall1
Magnetic Resonance Imaging1
Single Photon Emission Computed Tomography1
Tongue biting1
Computed Tomography0
Forty-three articles met the inclusion criteria for Question 2B. The numbers of articles that address each of the diagnostics meeting the inclusion criteria are presented in Table 5. A full list of articles and the diagnostics that they addressed are presented in Evidence Table 11.

The most common type of excluded article reported on a case-series study in which a group of patients was diagnosed with a given modality, and this diagnosis was then used to influence medical management. However, none of these studies reported whether these management changes led to improvements in patient outcomes. Although this study design is seen by some as being a legitimate design for the assessment of a diagnostic,42 this assumes that the diagnostic test was accurate. Requiring the assumption that a diagnostic be accurate in order to assess the accuracy of that same diagnostic is circular reasoning. It also assumes perfect sensitivity and specificity of the test, which is not possible. This sort of study design is particularly common in the literature on EEG technologies (i.e. routine EEG, ambulatory-EEG, and video-EEG).

Of the fourteen diagnostics considered, only four (blood prolactin levels, the MMPI, video-EEG, and ambulatory-EEG) were addressed by five or more studies. As per the general inclusion criteria specified in the Methodology section, data about diagnostics not addressed by at least five studies are not considered further in this report. Consequently, we do not include further information about provocation techniques, routine EEG, creatinine kinase levels, tilt tables, auditory evoked potentials, hypnotic recall, MRI, SPECT, tongue biting, or CT in this report.

Blood Prolactin Level Monitoring

Interest in the use of blood prolactin levels as a diagnostic tool for the differentiation of epileptic seizures from nonepileptic seizures began in 1976 when Ohman, Walinder, Balldin et al.43 reported that blood prolactin levels rose following epileptic seizures induced by electroconvulsive therapy. Blood levels of prolactin were found to peak approximately 30 to 40 minutes after the seizure occurred and then decline to normal preseizure levels. These findings were confirmed by Trimble44 who demonstrated that blood prolactin levels increased following spontaneous epileptic seizures and noted that blood prolactin levels did not rise following a nonepileptic, psychogenic seizure (also known as a pseudoseizure, hysterical seizure, or psychological seizure). Since then, a number of reports have been published that have assessed the relationship between blood or plasma prolactin levels in patients with epilepsy and a number of different nonepileptic seizure types.44–57 In the following section of the report, we assess the evidence related to the value of measuring blood prolactin levels in differentiating epileptic seizures from nonepileptic seizures.

Excluded articles

We excluded three studies for reasons of quality. These studies, and the reasons for which they were excluded, are listed in Evidence Table 12.

Evidence base

Following the exclusion of the articles, five articles describing five separate studies that enrolled 305 patients remained. Details on each study (study design characteristics, patient characteristics, and study results) are presented in Evidence Tables 13 through 21.

Design and conduct of included studies

The following section presents the findings of our systematic assessment of the quality of the evidence on the diagnostic utility of blood prolactin level measurements in the differentiation of epileptic seizures from nonepileptic seizures.

Internal validity

All five studies included in the present evidence base utilized a diagnostic case-control study design. The case-control study design is commonly used in the early stages of the evaluation of a diagnostic and is particularly susceptible to a number of biases that lead to overestimation of a test's true diagnostic performance.24–28 No studies presented patient outcome data. Thus, no direct determination is possible about whether the use of blood prolactin level measurements will lead to improvements in patient outcome. However, a reasonable assumption is that a good diagnostic test will allow patients' nonepileptic seizures to be identified and treated more appropriately, thus leading to improved patient outcomes.

Imperfect reference standard bias (all five studies), prevalence bias, and spectrum bias (four studies each) were the most common potential biases in the studies of blood prolactin measurements. Patient bias, diagnostic yield bias, and verification bias were not present in these studies. These potential biases with respect to this question are discussed in detail in Appendix B.

External validity

Complete details of the inclusion/exclusion criteria used by each of the studies that comprise the present evidence base, along with the characteristics of the patients actually recruited by these studies are presented in Evidence Tables 17 and 18.

Details of both study inclusion/exclusion criteria and the patient characteristics included in the relevant studies were incompletely reported. Four of the five articles described the inclusion/exclusion criteria. Four articles reported on age, no article reported on sex distribution, one article reported on the duration of disease, no article reported on seizure frequency, only one article reported the number of patients who had cognitive or developmental deficits, and only one article reported the number of AEDs used by patients in the study.

Synthesis of study results

The assessment of study quality presented above indicates that, given the present evidence, definitive conclusions cannot be drawn about whether blood prolactin level measurements have a useful role in differentiating epileptic seizures from nonepileptic seizures. Acknowledging this, we have instead evaluated the available data with the aim of determining the plausibility of blood prolactin measurements having a role in differentiating epileptic seizures from nonepileptic seizures.

Table 6. Differential diagnoses of seizures
Epileptic Seizure Type Nonepileptic Seizure Type
ReferenceMixed ESGTCSCPSSPSMixed NESPsySSynS
Lusic (1999)53[check]From[check]
Anzola (1993)45[check]From[check]
Zelnik (1991)56[check]From[check]
Mishra (1990)57[check]From[check]
Wroe (1989)50[check][check][check][check]From[check]
Lusic (1999)53[check]From[check]

CPS Complex partial seizure

ES Epileptic seizure

FS Febrile seizure

GTCS Generalized tonic-clonic seizure

NES Nonepileptic seizure

PsyS Psychogenic seizure

SPS Simple partial seizure

SynS Syncopal seizure

Not all of the studies in the present evidence base evaluated the ability of blood prolactin level measurements to differentiate epileptic seizures from the same type of nonepileptic seizure. The specific differential diagnoses assessed by each of the studies included in this section of the report are presented in Table 6.

These studies primarily assessed the ability of blood prolactin level measurements to differentiate several epileptic seizure types (mixed seizures, generalized tonic-clonic seizures, complex partial seizures, and simple partial seizures) from two paroxysmal seizure disorders that are often misdiagnosed as epileptic. These two nonepileptic seizure types were syncopal seizures and psychogenic seizures. Thus, the findings of this assessment are not applicable to the differentiation of epileptic seizures from any other nonepileptic seizure type.

Differentiating epileptic seizures from syncopal seizures

This section summarizes the findings of the studies that reported on the diagnostic utility of blood prolactin level measurement in differentiating epileptic seizures from syncopal seizures. Three of the five studies (Anzola,45 Lusic, Pintaric, Hozo, et al.,53 and Zelnik, Kahana, Rafael, et al.56) presented data on this differentiation. Of these, two (Lusic, Pintaric, Hozo, et al.53 and Anzola45) presented dichotomous diagnostic performance data that allowed sensitivity, specificity, PPV and NPV of the test at a predetermined threshold to be directly determined. These terms are defined in Evidence Table 19. Data from these studies are presented in Evidence Table 20.

Typically, diagnostic performance data is captured by Receiver Operator Characteristic (ROC) curves that describe the trade off between sensitivity and specificity. If enough studies report appropriate data, the data can be meta-analytically combined into a single summary ROC (SROC) curve. Because the present data set consisted of only three studies, we have not attempted such a meta-analysis. As mentioned earlier, Zelnik, Kahana, Rafael, et al.56 did not present their diagnostic performance data in a typical 2 by 2 format. Instead, they summarized their data in the form of mean (and standard deviation) blood prolactin levels.

Because the present data could not be meta-analyzed, and because ROC curves synthesized from the available continuous data sets may be misleading, we have instead summarized these data as the effect size, Hedges' d. Although this effect size cannot be used to describe the diagnostic performance of blood prolactin measurements, it does allow diagnostic data to be compared and contrasted among studies that reported this data in different formats.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf7.jpg.

   Figure 7. Blood prolactin: discrimination between epileptic and syncopal seizures

The data from the three studies reporting the diagnostic utility of blood prolactin level measurement in differentiating epileptic seizures from syncopal seizures is summarized in Figure 7. The effect sizes with confidence intervals that overlap zero indicate that the diagnostic test did not discriminate epileptic seizures from syncopal seizures any better than chance. Thus, data from Lusic, Pintaric, Hozo, et al.53 did not show that blood prolactin level measurements were useful in differentiating epileptic seizures from nonepileptic seizures. The studies by Anzola45 and Zelnik, Kahana, Rafael, et al.,56 however, both found that the test did discriminate these two seizure types from one another statistically significantly better than chance.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf8.jpg.

   Figure 8. Differences in threshold when evaluating test performance in studies of blood prolactin measurement

Exploration suggests that Lusic, Pintaric, Hozo, et al.53 did not find that the test was significantly better than chance because the performance characteristics data were collected at a threshold that was not optimal for the test. This is illustrated in Table 5, which shows three point estimates plotted in ROC space. These point estimates, along with their confidence intervals, came from the dichotomous data presented by Lusic, Pintaric, Hozo, et al.53 and Anzola45 (Evidence Table 20). All three point estimates can conceivably originate from a single underlying ROC curve (Figure 8).e However, because Lusic, Pintaric, Hozo, et al.53 chose to use a lower threshold compared to any of the thresholds used by Anzola45, the point estimate falls nearer the chance line. Thus, given the available data, blood prolactin measurements may plausibly provide information that aids in differentiating epileptic seizures from syncopal seizures. Further data are required, however, before stating that this test performs well enough to be used in actual clinical practice.

Differentiating epileptic seizures from psychogenic seizures

Two of the five included studies attempted to use blood prolactin levels to differentiate epileptic seizures from psychogenic seizures. These data are presented in Evidence Table 20 and 21. Wroe, Henlet, John et al.50 presented dichotomous diagnostic performance data that allowed the sensitivity, specificity, PPV and NPV of the test at a predetermined threshold to be directly determined. Mishra, Gahlaut, and Kumar57 presented summary statistics (means, standard deviations, etc.) that describe the distributions of blood prolactin levels in the two diagnostic groups.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf9.jpg.

   Figure 9. Blood prolactin: discrimination between epileptic and psychogenic seizures

We summarized the available data from these two studies in terms of Hedges' d. This summary, shown in Figure 9, suggests that blood prolactin measures can plausibly provide information that aids in differentiating epileptic seizures from psychogenic seizures.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf10.jpg.

   Figure 10. Blood prolactin: discrimination between different epileptic seizure types and psychogenic seizures

Data abstracted from Mishra (1990), GTCS: Generalized tonic-clonic seizures, CPS: Complex partial seizures, SPS: Simple partial seizures

In addition to the data presented in Figure 9, Mishra, Gahlaut, and Kumar57 also presented data about the effectiveness of blood prolactin level measurement in differentiating three types of epileptic seizures (generalized tonic-clonic seizures; complex partial seizures; and simple partial seizures) from psychogenic seizures. These data, presented in Figure 10, suggest that while blood prolactin level measurements may be of some use in differentiating generalized tonic-clonic seizures and complex partial seizures from psychogenic seizures, the test appears to have little or no value in differentiating patients with simple partial seizures from those with psychogenic seizures. The data from Mishra, Gahlaut, and Kumar57 show that blood levels of prolactin do not increase following simple partial seizures. Thus, blood prolactin levels will probably have little value in differentiating simple partial epileptic seizures from syncopal seizures as well.

Minnesota Multiphasic Personality Inventory

The Minnesota Multiphasic Personality Inventory-2 (MMPI-2) and its predecessor (MMPI) instruments are among the most widely used and widely researched tests of adult psychopathology. These instruments provide a broad psychological profile across a number of domains. Some investigators believe that the psychological profile of a patient with psychogenic seizures (either alone or in combination with epileptic seizures) may be different from that of patients with epileptic seizures alone.

Excluded studies

Not all of the articles that met the general and subquestion specific inclusion criteria were included in the evidence base for this diagnostic. We list the studies that were excluded for reasons of quality in Evidence Table 22, along with an explanation as to why they were excluded.

After the exclusion of the two articles listed in the table, four studies remained. As per the general inclusion criteria specified in the Methodology section, data about treatments not addressed by at least five included studies (or at least one large RCT with 50 or more patients in each study arm) are not considered further. Consequently, we do not further assess the MMPI-2 or MMPI in this report.

Video-EEG

Video-EEG monitoring is used in clinical practice to verify the seizure type, to localize the area of seizure onset if surgery is being considered, and to verify the diagnosis of epilepsy if the diagnosis is in doubt. Video-EEG monitoring consists of the simultaneous recording of EEG brain wave activity combined with time synchronized video recording of the patient. This diagnostic procedure is performed on an in-patient basis, and requires highly specialized equipment and dedicated space. Patients are monitored for extended periods in order to capture typical seizure events on video and simultaneously capture EEG activity during that event. In some centers, patients' medications are withdrawn in order to increase the chance of recording a seizure.

Excluded studies

We list the studies that were excluded for reasons of quality in Evidence Table 23, along with an explanation as to why they were excluded.

After the exclusion of the two articles listed in the table, four studies remained. As per the general inclusion criteria specified in the Methodology section, data about diagnostics not addressed by at least five included studies (or at least one large RCT with 50 or more patients in each study arm) are not considered. Consequently, we do not further consider video-EEG in this report.

Ambulatory-EEG

Ambulatory-EEG monitoring is the recording of EEG brain wave activity remotely from the hospital environment. Ambulatory-EEG, like video-EEG, has the advantage over routine EEG of allowing EEG traces to be recorded continuously over long periods. This increases the chance of recording an ictal event. Unlike video-EEG, no video record of seizure events is available, and patients or their caregivers must accurately record the occurrence of a typical seizure event in order to temporally compare the occurrence of a seizure with the EEG trace.

Early clinical investigations documented the ability of ambulatory-EEG to record identifiable focal and generalized epileptiform activity. In 1983, a cassette tape ambulatory-EEG system was introduced. This system had continuous 8-channel recording capability, real-time identification, gain adjustment, and filter adjustments. Since then, improvements in computer technology have led to the development of instruments that can perform portable continuous recording of more than 16 channels with sampling rates of over 200 Hz.

Excluded studies

Not all of the articles that met the general and subquestion specific inclusion criteria were included in the evidence base for this diagnostic. We list the studies that were excluded for reasons of quality in Evidence Table 24 along with an explanation as to why they were excluded.

After the exclusion of the two articles listed in the table, three studies remained. As per the general inclusion criteria specified in the Methodology section, data about treatments not addressed by at least five included studies (or at least one RCT with more than 50 patients in each study arm) are not considered. Consequently, we do not further consider ambulatory-EEG in this report.

Comments on EEG Technologies

That the evidence base for the three EEG technologies (routine ictal and interictal EEG, ambulatory-EEG and video-EEG) did not reach the required minimum of five acceptable studies is surprising. Even with the expansion of the inclusion criteria so that we included articles published between 1980 and 1985 and articles describing retrospective studies, we did not reach five studies. This may be because all three of these EEG technologies are commonly used as aids in the diagnosis of epilepsy, and video-EEG is considered by many to be the “gold standard” for the differentiation of epileptic seizures from nonepileptic seizures.58 Some reviews of the use of video-EEG in the differential diagnosis of epileptic seizures from nonepileptic seizures include studies in which provocation was used in an attempt to induce a seizure that was then captured by video-EEG, citing such studies as evidence of the effectiveness of video-EEG in combination with provocation. Without exception, however, these studies used video-EEG as a “reference standard” against which the effectiveness of provocation was measured. Thus, such studies cannot be considered as part of the evidence base for the diagnostic utility of video-EEG and instead form the evidence base for seizure provocation techniques.59

Two previous technology assessments looked at the clinical utility of video-EEG and addressed much the same issue being addressed in this report by subquestion 2B.60, 61 These technology assessments and the relevant references mentioned in each assessment are presented in Evidence Table 25. Also in this Evidence Table is an indication as to whether each article was included in the current report and, if the article was not included, an explanation as to why.

Evidence Table 25 shows that none of the articles included in the previous two technology assessments met the inclusion criteria for the current report. The primary reason for not being included in the present report was that the studies utilized a case series design in which a group of patients were evaluated with video-EEG and a diagnosis or change in diagnosis was made based on the information gained from the assessment. No reference standards were used against which to compare the effectiveness of video-EEG, nor were patients followed up in order to verify the accuracy of the diagnosis. Thus, the investigators in these studies made the implicit assumption that video-EEG did accurately differentiate epileptic seizures from nonepileptic seizures. In other words, the investigators assumed that false-negative (making an incorrect diagnosis of non-epileptic seizure) and false-positive decisions (making an incorrect diagnosis of epileptic seizures) will not occur when video-EEG is used. Such assumptions, though they may be true for some seizure typesf, do not always hold true. Both assumptions rely on the supposition that an abnormal EEG always accompanies a true epileptic seizure. While this may be true for many seizure manifestations, this is not always the case.

For example, a number of studies of patients with implanted electrodes have demonstrated that epileptic seizures originating in the medial or orbital surface of the frontal lobe, the parietal lobe, or the temporal lobe, often occur in the absence of a measurable EEG abnormality when the EEG is performed using scalp electrodes.62–65 These types of seizures may arguably be relatively rare. However, given that the appearance of a nonepileptic seizure is often very similar to epileptic seizures originating in the medial or orbital surface of the frontal lobe, the parietal lobe, or the temporal lobe,66 these are the very patients who are the most likely to be misdiagnosed as having epileptic seizures. Thus, some false-negative decisions must be assumed to occur when video-EEG is used.

The fact that evidence-based conclusions were not drawn in the present report regarding the ability of vEEG to differentiate epileptic seizures from nonepileptic seizures should not be interpreted as evidence that this technology is not effective or useful. Indeed, vEEG may very well have an important role in diagnostic algorithms that are designed to make such a differential diagnosis. Until more high quality studies become available, however, the diagnostic performance characteristics of vEEG and its place in such diagnostic algorithms cannot be determined.

Is seizure type in some patients with treatment-resistant epilepsy misdiagnosed in some patients?

There are two purposes to the present question. First, to establish whether there is evidence in the peer-reviewed literature to indicate that some patients believed to suffer from a specific seizure type actually suffer from a different seizure type (either alone or in combination with the originally diagnosed seizure type) and would, therefore, not be expected to respond satisfactorily to their current treatment regimen. The second purpose is to quantify, if relevant, the prevalence of these patients among the population of patients thought to suffer from a particular seizure type.

Question specific inclusion criteria

Articles were included for Question 2C if they met the general criteria for inclusion presented in the Methodology section, and if the article reported on a study that enrolled patients originally diagnosed as having a specific type of epileptic seizure (partial seizure, generalized seizure, absence seizure, etc).

Evidence base

No studies addressed Question 2C and met both the general and subquestion specific inclusion criteria listed above. Consequently, Question 2C could not be answered.

Which diagnostic modalities are useful in differentiating between different seizure types?

Because Question 2C cannot be answered in an evidence-based fashion, Question 2D could not be addressed in an evidence-based fashion.

Optimization of Antiepileptic Drugs

In this section of the Evidence Report, we addressed Key Question #3: Is there evidence that patients with treatment-resistant epilepsy are not optimized at their current level of treatment?

In the present question, we address whether patients described as having treatment-resistant epilepsy are receiving optimal dosages of the AED regimen prescribed for them. The available evidence for this question is derived from two types of studies. The first type is comprised of studies that assessed, using drug level monitoring, whether patients were truly treatment-resistant or at an otherwise optimized level of drug therapy. The second type is comprised of drug treatment studies that presented information on the pretrial or baseline status of the patients enrolled in a clinical trial. Patients in a clinical trial often receive optimized treatment as part of the trial and are therefore not representative of patients who are maintained on AEDs in clinical practice. Thus, the pretrial status of the patients is the best indication of whether drug optimization was part of routine clinical practice. We examined these groups of studies because they were the most likely type of studies to report the information necessary to address this question.

For the purposes of the present question, a drug regimen was defined as not optimized if a study reported enrolling any patients whose prior drug regimen: 1) had not been titrated, 2) was not in the therapeutic range, or 3) produced side effects. If a study reported that some patients had not received the maximal tolerated dosage, we considered this evidence of lack of titration, and therefore definitive evidence of lack of optimization. If the therapeutic range was not defined, we defined the range as either the therapeutic range of the maintenance dose or the blood concentration.g We considered patients receiving more than one AED to be in the upper end of the therapeutic range if at least one AED dosage or blood level was in this range. The Expert Panel and the Technical Experts formulated criteria 1 and 3. The second criterion, as originally suggested by the Technical Experts, specified only the upper end of the therapeutic range. This is a more stringent way to define optimization and may be inaccurate, as the maximum tolerable dosage for some patients may be below the upper end of the therapeutic range. We modified this criterion for this reason, and because several studies reported that not all patients were receiving drug doses in the therapeutic range. However, the possibility remains that certain patients outside the therapeutic range may have been optimized. Thus, the second and third criterion suggest the possibility of nonoptimization but do not provide definitive evidence of its existence. Therefore, we separately report patients who were not in the upper end of the therapeutic range

Question specific inclusion criteria

In addition to the general inclusion criteria (see Methodology section), we used the following criteria to determine whether a study was included:

  1. The study must have reported information indicating that some patients in the study did not meet at least one of the criteria for optimization described above. Thus, we are seeking only evidence of nonoptimization, not a percentage of patients who are optimized.

  2. The study must have been published in 1975 or later. This ensured the use of evidence on standard AEDs as well as evidence on newer agents.

  3. All drugs in the study must have been cleared for marketing in the United States by the Food and Drug Administration. If the study included some patients on non-FDA approved drugs, it was required to report data separately from patients on FDA-approved drugs, and we abstracted results only from the latter group of patients. This criterion was determined by the Expert Panel and the Technical Experts.

Excluded studies

We did not exclude any studies for reasons of quality.

Evidence base

We included 20 studies, all of which suggested that at least some patients may not have been optimized prior to study enrollment (Evidence Table 26). Six studies were conducted with the goal of assessing medical intractability or lack of optimized therapy through drug level monitoring; the remaining 14 studies were drug treatment studies that presented pretrial or baseline information concerning drug optimization.

Design and conduct of included studies

Internal validity

Since this question does not involve an analysis of results, but merely reporting of patient status before entering a study, there is only one relevant issue concerning the internal validity of these studies. An apparently nonoptimized drug regimen could result not only from an inadequate drug dosage, but also from a patient's lack of compliance with the prescribed regimen. If the nonoptimized patients in a study were actually noncompliant, this would alter the assumption that nonoptimization was primarily due to prescription of nonoptimized drug regimens. Four studies required that all patients in the study were compliant.67-70 Two studies reported that some patients were suspected of noncompliance,71, 72 while in one study 8 of 35 patients admitted noncompliance (this study was not excluded because clearly other patients in the study were not optimized).73 The remaining 13 studies reported no information concerning compliance.

External validity

Of the 20 studies mentioned above, seven were conducted in the United States and the remaining 13 were conducted in other countries (Evidence Table 26). Four of seven United States studies and six of 13 studies from other countries evaluated only adult patients. Two United States studies and six studies from other countries evaluated a study group of adult and pediatric patients. One United States study evaluated only pediatric patients, and one study from outside of the United States provided no information on the age range of its patient population.

Synthesis of study results

The summary of results is broken down according to the three criteria described in the introduction to this question. The relevant data for this question are presented in Evidence Table 26.

Did the study report any patients whose prior drug regimen had not been titrated?

Six studies (two United States and four outside the United States) reported information indicating that the prior drug regimen of some patients had not been titrated. The lack of reporting of titration information does not necessarily mean that few patients have their drug regimens titrated in clinical practice. Titration may be a common practice, but study investigators may not report it.

Did the study report any patients whose prior drug regimen was not in the therapeutic range?

Ten of 20 studies (three United States and seven from other countries) presented information indicating that the prior drug regimen was not in the therapeutic range for at least some patients in the studies. We further examined whether some studies enrolled patients whose prior drug regimen was not in the upper end of the therapeutic range.

Sixteen of 20 studies (five United States and 11 from other countries) presented information indicating that the prior drug regimen for some patients was not in the upper end of the therapeutic range.

Did the study report that there were any patients whose prior drug regimen produced side effects?

Four studies (two United States and two outside the United States) presented information indicating that drug side effects occurred in at least some patients on a prior drug regimen.

Drug Treatment Strategies

In this section of the Evidence Report, we addressed Key Question #4: Which drug treatment strategy, 1) sequential monotherapy, 2) polytherapy, or 3) optimized current therapy leads to improved outcomes for patients with treatment-resistant epilepsy, and what are the relative improvements obtained with each strategy?

In this question, we address three drug treatment strategies that could potentially benefit patients with treatment-resistant epilepsy. By definition, patients with treatment-resistant epilepsy have already received AEDs that were ineffective. Therefore, in the present question we are addressing whether any changes in patients' drug regimens can potentially reduce their seizures.

We define sequential monotherapy as switching patients to a single AED that none of the patients had yet received. According to the desires of the Partners, patients could have been receiving multiple prior drugs before initiation of monotherapy. Polytherapy is defined as the simultaneous administration of more than one AED. It typically involves the addition of a single novel AED to patients' drug regimens (referred to as “add-on” treatment). Finally, we define optimized current therapy as altering the dose of at least one drug in patients' drug regimens, or removing at least one drug from patients' drug regimens. In optimized current therapy, drug dose can be altered by changing the total daily dose, the number of doses in a given day, or the drug preparation (such as a slow-release preparation). According to the desires of the Partners, optimized current therapy can also consist of the removal of a drug from patients' regimens. We address each of these three strategies in separate subquestions.

This question addresses the safety and efficacy of drug strategies, not of particular drugsh. However, the literature on monotherapy is comprised primarily of trials that examine the effects of changing patients' treatment from a number of AEDs to a singlespecific drug (e.g., topiramate). Similarly, the literature on polytherapy is comprised primarily of trials that involve adding a specific drug (that patients had not previously received) to their existing regimen, and the literature on optimized current therapy is comprised primarily of trials that removed a specific drug from patients' drug regimens. Thus, the literature is comprised primarily of certain specific implementations of these strategies. Although the findings of the individual trials have limited generalizability, when considered in aggregate (as below) they provide the best available estimates of the effectiveness of the three strategies.

Question specific inclusion criteria

Although we divided this question into four subsections (one for each treatment strategy, and one for comparisons between strategies), we employed the same inclusion criteria for studies of each strategy. Thus, in addition to the general inclusion criteria described in the Methodology section, we included trials for this question if they met all of the following criteria:

  1. The trial must have been published in 1975 or later. For school- and work-related outcomes, the trial must have been published in 1985 or later. Including trials since 1975, as well as more recent trials, facilitated incorporation of data from standard AEDs that may no longer be the focus of clinical research.

  2. Before the trial, patients must have received unsuccessful treatment with at least one of the following drugs: carbamazepine, ethosuximide, phenobarbital, phenytoin, primidone, or valproate. The Technical Experts determined that these six drugs are standard AEDs.

  3. All drugs received by patients during the trial must be cleared for marketing in the United States by the U.S. Food and Drug Administration (FDA). If the trial included some patients on non-FDA-approved drugs, it was required to report data separately from patients on FDA-approved drugs, and we abstracted results only from the latter group of patients. We included trials employing off-label usage of drugs for the treatment of epilepsy. Confining the question to only FDA-approved drugs was in accordance with the wishes of the Expert Panel and Technical Experts.

  4. If a trial reported that some patients had been noncompliant, then results must have been reported separately for patients who were compliant. Noncompliant patients may not be treatment-resistant, and their seizure rates may drop during a trial. Whether the improvement in noncompliant patients was due to better compliance or to the beneficial effect of the trial drug cannot be determined. Therefore, the outcomes of noncompliant patients were not included.

  5. If there were five or more placebo-controlled randomized trials on a specific drug treatment strategy, then other trials with other designs (e.g., trials that used a low AED dose as a control) were not considered. We adopted this criterion because results of placebo-controlled randomized trials are more easily interpreted compared to results of trials that employed other control groups.

  6. Trial must be a Phase II or III efficacy trial. Earlier trials (Phase I) were not primarily intended to reduce seizures, and later trials (Phase IV) involved drugs whose effectiveness had already been documented by other trials.

  7. Trials that used a crossover design must have reported results for the first period (i.e., before the crossover), or must have reported that seizure frequencies returned to baseline at the end of the washout period. In a crossover trial, the use of a drug at the start of a trial may have potentially influenced the effectiveness of a different drug used later in the trial. If seizure frequency returned to baseline at the end of the washout period, then the evidence suggests that the first drug is no longer active, and data for the second drug are interpretable. However, if a return to baseline was not reported, then we only abstracted data for the first period.

To include the maximum number of potentially relevant studies, we did not require studies to report patients' seizure frequencies at baseline. However, baseline seizure frequencies do provide a measure of the severity of patients' initial conditions, thus aiding in the interpretation of study findings. For example, suppose a treatment eliminated all seizures. Such an outcome would be more impressive if patients' baseline seizure frequencies were 20 per month than if frequencies were only five per month. Because the baseline frequency helps place the study results in proper context, ideally all studies would report this frequency.

Number of articles on each intervention

Applying the inclusion criteria yielded 55 studies describing the three drug strategies. There were 14 studies of sequential monotherapy, 30 studies of polytherapy, and 11 studies of optimized current therapy.

Sequential Monotherapy

Sequential monotherapy involves administering a single AED not yet received by any of the patients. Patients can receive any number of AEDs prior to the initiation of the new drug. However, all prior AEDs must be withdrawn from patients' drug regimens in order to investigate the effect of the novel monotherapy drug. In this section, we describe the evidence base for sequential monotherapy, assess the quality of these trials with respect to both internal and external validity, and analyze the trials' results for all relevant outcomes.

Excluded studies

Fourteen studies met the inclusion criteria. One of the 14 studies was excluded because the authors only reported a qualitative description of treatment efficacy.74

Evidence base

The evidence base for sequential monotherapy consists of 13 studies that enrolled 1,542 patients.

Design and conduct of included studies

Table 7. Drugs and doses in studies of sequential monotherapy
Felbamate Gabapentin Lamotrigine Oxcarbazepine Primidone Tiagabine Topiramate Valproate
Reference3600 mg/day2400 mg/day3600 mg/day500 mg/day2400 mg/day750 mg/day36 mg/day1000 mg/day150 μg/ml
Sachdeo (2001)68[check]
Beydoun (2000)86[check]
Kanner (2000)87[check]
Schachter (1999)79[check]
Gilliam (1998)76[check]
Bergey (1997)78[check]
Beydoun (1997)85[check]
Beydoun (1997)83[check]
Sachdeo (1997)84[check]
Devinsky (1995)75[check]
Schachter (1995)88[check]
Theodore (1995)89[check]
Faught (1993)77[check]
Totals311131111

mg/day Maximum dose in milligrams per day

μg/ml Maximum dose in micrograms per milliliter

Relevant design aspects of the 13 included studies appear in Evidence Tables 27 through 30. To assess the effect of sequential monotherapy, the ideal study would have randomly assigned patients to receive either a new drug as monotherapy, or to receive the same drug regimens used before the trial. None of the 13 studies employed this design. Twelve studies were randomized and controlled, but patients in the control groups did not receive their prestudy drug regimens. Instead, all prestudy drugs were withdrawn. In three studies, patients in the control groups received a placebo alone, and in the other nine studies, patients in the control groups received a low dose of a drug. Because these control groups do not address whether sequential monotherapy causes an improvement over patient's prestudy drug regimens, we did not abstract data from the control groups. Instead, we abstracted data from only the high-dose active-drug group in each of the 12 controlled studies. The 13th study did not have a control group, thus we abstracted data from the single group in that study. Among the 13 studies, eight drugs were given as monotherapy: felbamate (three studies), oxcarbazepine (three studies), gabapentin (two studies), lamotrigine (one study), primidone (one study), tiagabine (one study), topiramate (one study), and valproate (one study) (Table 7).

Internal validity

In evaluating internal validity, we determined whether the results were potentially biased by the threats to validity that are discussed in the Methodology section. Although other questions in this report consider the potential for attrition bias, we do not consider it here because attrition was a study outcome. As discussed earlier, the control groups of these studies are not relevant to the question. Consequently, for the purpose of this report, the studies can be viewed as case series and susceptible to several threats to internal validity (see Appendix B). All were potentially affected by both regression bias and extraneous event bias. Further, most studies were potentially affected by sample specification bias (12/13 studies) and measurement bias (10/11 studies that reported the method of seizure measurement).

External validity

In our appraisal of the external validity of studies of sequential monotherapy, we considered aspects of patient enrollment as well as the actual characteristics of patients in the studies. All patient characteristics appear in Evidence Tables 31 through 34. All 13 studies enrolled patients because of seizure type (partial seizures), thus the results of these studies are not applicable to the treatment of generalized seizures. Three studies enrolled adults only,75–77 and the remaining 10 studies enrolled both children and adults. The mean age of patients in the studies ranged from 33.4 to 37 years. The proportion of patients who were female ranged from 0.43 to 0.63 and was greater than 0.50 in nine of the 11 studies that reported this characteristic. Median seizure frequency ranged from 5.5 to 13.4 seizures per month, and mean seizure frequency ranged from 6.3 to 70.7 seizures per month. The proportion of patients receiving two or more prior AEDs ranged from 0 to 1. The proportion was less than 0.5 in nine of the 11 studies that reported this patient characteristic. As a whole, then, the results of these studies apply primarily to adults with treatment-resistant epilepsy who experience partial seizures.

Synthesis of study results

Table 8. Outcomes in studies of sequential monotherapy
ReferenceSeizure FrequencyAdverse EffectsQuality of LifeMoodCognitive FunctionAbility to Return to WorkAbility to Return to SchoolAbility to Hold a Driver's LicenseMortality
Sachdeo (2001)68[check][check][check]
Beydoun (2000)86[check][check]
Kanner (2000)87[check][check]
Schachter (1999)79[check][check]
Gilliam (1998)76[check][check]
Bergey (1997)78[check][check][check]
Beydoun (1997)85[check][check][check]
Beydoun (1997)83[check][check][check][check][check][check]
Sachdeo (1997)84[check][check][check]
Devinsky (1995)75[check][check]
Schachter (1995)88[check][check][check][check][check]
Theodore (1995)89[check][check][check]
Faught (1993)77[check][check]
Totals13132320005
In this section, we assess the results separately for each of the following outcomes: seizure frequency, adverse effects, quality of life, mood, cognitive function, functional status/ability, ability to return to work, ability to return to school, ability to hold a driver's license, and mortality. We included freedom from seizures as a seizure frequency outcome. The outcomes reported by each study are listed in Table 8. All outcomes from all of the studies appear in Evidence Tables 35 through 38. Seizure frequency and adverse effects were each reported by all 13 studies.

In cases where meta-analysis was feasible, we used random effects models. We employed these models because the included studies investigated different drugs. Therefore, these studies cannot be viewed as having been sampled from a population of studies with a fixed mean. Random effects models employ statistical methods that are most applicable when studies use different variations of a treatment.

Also, these studies reported data on an intent-to-treat basis. This was particularly important because they employed a priori exit criteria (such as doubling of monthly seizure frequency) to limit harms to patients. If any patient met an exit criterion, investigators removed the patient from the study and reinstituted the patient's prior AED regimen. Consequently, the analyses described below included all patients who were randomized to receive high-dose monotherapy.

Seizure frequency

Table 9. Seizure frequency outcomes in studies of sequential monotherapy
ReferenceMean Absolute Seizure FrequencyMedian Absolute Seizure FrequencyMedian Absolute Difference From BaselineMedian Percent Difference From BaselineNumber of Patients Seizure-freeNumber of Patients With ≥75 Percent ReductionNumber of Patients With ≥50 Percent ReductionNumber of Patients With Any ReductionMean Rank of Seizure FrequencyNumber of Patients With Doubling of Monthly Seizure FrequencyNumber of Patients With Doubling of Highest Consecutive Two-Day Seizure FrequencyMedian Time to ExitMean Time on MonotherapyRisk Ratio of Time to Exit
Sachdeo (2001)68[check][check][check]
Beydoun (2000)86[check][check][check][check][check]
Kanner (2000)87[check][check]
Schachter (1999)79[check][check][check]
Gilliam (1998)76[check]
Bergey (1997)78[check][check]
Beydoun (1997)85[check][check][check][check][check][check]
Beydoun (1997)83[check][check][check][check]
Sachdeo (1997)84[check][check][check][check][check][check]
Devinsky (1995)75[check]
Schachter (1995)88[check][check][check]
Theodore (1995)89[check]
Faught (1993)77[check][check]
Totals12116241155442
Details of the seizure frequency results are presented in Evidence Table 35. The studies we included for this question reported 14 different measures of seizure frequency (Table 9). Only three seizure frequency outcomes were reported by five or more studies: the percentage of patients who were seizure-free during the study, the percentage of patients whose monthly seizure frequency doubled during the study (vs. baseline), and the percentage of patients whose highest two-day seizure frequency doubled during the study (vs. baseline). We emphasize that, because seizure frequency changes over time, a study's length of followup influences seizure frequency measurements. For example, a given patient is more likely to be seizure-free during a short-term study than a long-term study. Therefore, for each outcome, we considered the length of followup in the studies that reported the outcome.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf11.jpg.

   Figure 11. Threshold analysis: sequential monotherapy and seizure freedom

Percentage of patients who were seizure-free during the study. This outcome was reported by six studies and ranged from 9 percent to 28 percent. Because of the lack of relevant control groups, we performed a threshold analysis (see the Methodology section for a discussion of this approach). In this analysis, we compared the results obtained in patients who received sequential monotherapy to those of a synthetic control group in which we varied the percentage of seizure-free patients. The percentage at which the difference between the monotherapy and “control” group became statistically nonsignificant is the threshold. The results of this analysis appear in Figure 11. Each summary estimate in the figure is based on Cohen's h. The summary estimate calculated at the 0 percent point on the graph (no patients in a synthetic control group were seizure-free) was 0.81 (CI: 0.64 to 0.98, p <0.000001) and corresponded to 16 percent (CI: 10 percent to 22 percent)i of patients experiencing no seizures during sequential monotherapy. The summary estimate became nonsignificant (no statistically significant difference between monotherapy and control patients in the number of patients becoming seizure-free) when the proportion of patients in the synthetic control group reached 10 percent.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf12.jpg.

   Figure 12. Threshold analysis: monotherapy and seizure freedom (long-term studies)

We next performed a second threshold analysis of these data to test the effect of followup period on seizure-free status. Two of the six studies that reported seizure freedom employed short followup times (8 days in Bergey, Morris, Rosenfeld, et al.,78 and 10 days in Schacter, Vasquez, Fisher, et al.).79 In the other four studies, patients were followed for at least 16 weeks. The percentage of patients who were seizure-free was greater than 25 percent in both short-term studies, but was less than 14 percent in all of the long-term studies. Therefore, our second threshold analysis included only the four studies with longer followup times (Figure 12). The summary estimate calculated at the 0 percent point on the graph (no patients in a synthetic control group were seizure-free) was 0.67 (CI: 0.47 to 0.87, p <0.000001) and corresponded to 11 percent (CI: 5 percent to 18 percent) of patients experiencing no seizures during sequential monotherapy. The summary estimate became nonsignificant (no statistically significant difference between monotherapy and control patients in the number of patients becoming seizure-free) when the proportion of patients in the synthetic control group reached 6 percent.

In summary, approximately 11 percent of patients are seizure-free during long-term studies of sequential monotherapy. However, given the designs of these studies, whether the new drug actually caused any of the patients to become seizure-free during the study is not clear. Further, seizure frequencies change from month to month, and some patients with treatment-resistant epilepsy may experience periods without seizures, and some patients may have been misdiagnosed (Question 2). These latter patients may be more likely to become seizure-free. Even if some patients become seizure-free as a result of sequential monotherapy, the majority of patients (approximately 89 percent) continue to have seizures despite receiving a new drug as monotherapy. A firm conclusion about whether sequential monotherapy produces any new seizure-free patients would require the use of a relevant control group (i.e., continuation of prior drug regimens).

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf13.jpg.

   Figure 13. Threshold analysis: monotherapy and doubling of monthly seizure frequency

Percentage of patients whose monthly seizure frequency doubled. Freedom from seizures measures the percentage of patients who experienced maximum benefit. In contrast, seizure doubling indexes the percentage of patients who experienced significant harm. This outcome was reported by five studies and ranged from 9 percent to 29 percent. All five studies followed patients for at least 16 weeks, thus the concern about study duration does not apply to seizure doubling. Due to the lack of relevant control groups, we performed a threshold analysis of this outcome (Figure 13). Each summary estimate in the figure is based on Cohen's h. The summary estimate calculated at the 0 percent point on the graph (no patients in a synthetic control group had a doubling of monthly seizure frequency) was 0.82 (CI: 0.64 to 0.99, p <0.000001) and corresponded to 16 percent (CI: 10 percent to 23 percent) of patients experiencing a doubling of monthly seizure frequency. The summary estimate became nonsignificant (no statistically significant difference between monotherapy and control patients in the number of patients experiencing a doubling of monthly seizure frequency) when the proportion of patients in the synthetic control group reached 10 percent.

Sequential monotherapy cannot be directly considered the cause of the doubling in seizure frequency because of the lack of a true control group. However, three factors do suggest a causal relation. First, at the beginning of the sequential monotherapy studies, all prestudy drugs were removed from patients' regimens and replaced with a new AED. Presumably, the original AEDs were already reducing seizure frequency. The removal of these drugs, therefore, may have caused seizures to increase. Second, a doubling of monthly seizure frequency was set by investigators as an a priori exit criterion. A doubling of seizure frequency resulted in immediate removal from the study, and all prestudy drugs were reinstituted. This suggests that investigators believed that the doubling of monthly seizure frequency was being caused by sequential monotherapy. Third, given the possibility that patients enter drug trials when they are relatively sick, they would not be expected to become even worse. Instead, based on regression-to-the-mean, reductions in seizure frequency would be expected. Each of these factors suggest that sequential monotherapy caused dramatic increases in seizures in some patients. A definitive conclusion about this possibility would require randomization of patients to either sequential monotherapy or a continuation of the prestudy drug regimen.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf14.jpg.

   Figure 14. Threshold analysis: monotherapy and doubling of two-day seizure frequency

Percentage of patients whose highest two-day seizure frequency doubled. This outcome also measures the percentage of patients who experienced significant harm during studies of sequential monotherapy. It was reported by five studies, and ranged from 4 percent to 23 percent. All five studies followed patients for at least 16 weeks. Due to the lack of relevant control groups, we performed a threshold analysis of this outcome (Figure 14). Each summary estimate in the figure is based on Cohen's h. The summary estimate calculated at the 0 percent point on the graph (no patients in a synthetic control group had a doubling of two-day seizure frequency) was 0.76 (CI: 0.56 to 0.96, p <0.000001) and corresponded to 14 percent (CI: 8 percent to 21 percent) of patients experiencing a doubling of two-day seizure frequency. The summary estimate became nonsignificant (no statistically significant difference between monotherapy and control patients in the number of patients experiencing a doubling of two-day seizure frequency) when the proportion of patients in the synthetic control group reached 8 percent.

As stated previously, whether sequential monotherapy was the cause of doubling of two-day seizure frequency cannot be determined without a true control group. However, the same three factors discussed above apply to this outcome as well. Based on these factors, sequential monotherapy in some patients appears to have caused doubling in two-day seizure frequency. Combining the estimates of the two seizure increase outcomes that result in exiting a trial, 30 percent of patients in studies of sequential monotherapy experience a doubling in either monthly seizure frequency or two-day seizure frequency. Therefore, some patients may be experiencing large seizure increases as a direct result of sequential monotherapy. To provide a definite answer, randomizing patients to receive either sequential monotherapy or a continuation of the prestudy drug regimen would be necessary.

Adverse effects

In clinical practice, a physician prescribing an AED for a patient with treatment-resistant epilepsy must consider not only the possible reduction of seizure frequency, but also the possible adverse effects of the new drug. Before entering studies, patients with treatment-resistant epilepsy were already experiencing adverse effects from their prestudy antiepileptic drug regimens. None of the studies reported these patients' prestudy adverse effects, and none reported whether the adverse effects observed during the study were more or less severe compared to patients' prestudy adverse effects. This latter outcome would have been informative because patients (and physicians) seek to reduce adverse effects as well as seizure frequency.

Table 10. Overview of adverse effects of sequential monotherapy
ReferenceDrugDose in mg/dayPercent of Patients Who Experienced Any Adverse EffectName of Most Commonly Experienced Adverse EffectPercent of Patients Who Experienced This Adverse Effect
Sachdeo (2001)68Oxcarbazepine2400NRHeadache11% (5/45)
Beydoun (2000)86Oxcarbazepine2400NRDizziness46% (19/41)
Kanner (2000)87Primidone75053% (16/30)Irritability37% (11/30)
Schachter (1999)79Oxcarbazepine240091% (46/51)Nervous system45% (23/51)
Gilliam (1998)76Lamotrigine50075% (57/76)Dizziness20% (15/76)
Bergey (1997)78Gabapentin360073% (29/40)Ataxia20% (8/40)
Beydoun (1997)85Valproate150 μG/mLNRTremor64% (61/96)
Beydoun (1997)83Gabapentin240088% (80/91)Dizziness25% (23/91)
Sachdeo (1997)84Topiramate1000NRParesthesia58% (14/24)
Devinsky (1995)75Felbamate3600NRNRNR
Schachter (1995)88Tiagabine3695% (91/96)Dizziness35% (34/96)
Theodore (1995)89Felbamate3600NRNRNR
Faught (1993)77Felbamate3600NRHeadache34% (19/56)

mg/day Milligrams per day

NR Not reported

μG/ml Micrograms per milliliter

All 13 included studies of sequential monotherapy reported adverse effects of the new drug treatment. The overall percentage of patients who experienced any side effects was reported by six studies and ranged from 53 percent to 95 percent (Table 10). Dizziness was the most common adverse effect in four studies, and headache was the most common adverse effect in two studies. All details of the adverse effects in the 13 studies appear in Evidence Table 36.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf15.jpg.

   Figure 15. Threshold analysis: monotherapy and trial exits due to adverse effects

Percentage of patients who exited trials due to adverse effects. To summarize the available data on adverse effects, we focused on whether the adverse effects in a given patient were severe enough to warrant discontinuation of the new drug (i.e., trial exit). This outcome is a marker of treatment failure. All 13 included trials reported the percentage of patients who exited trials due to adverse effects, and it ranged from 0% to 29%. As with seizure frequency, due to the lack of relevant control groups we performed a threshold analysis (Figure 15). Each summary estimate in the figure is based on Cohen's h. The summary estimate calculated at the 0 percent point on the graph (no patients in a synthetic control group exited due to adverse effects) was 0.47 (CI: 0.24 to 0.71, p <0.000073) and corresponded to 5 percent (CI: 1 percent to 12 percent) of patients exiting trials due to adverse effects. The summary estimate became nonsignificant (no statistically significant difference between monotherapy and control patients in the number of patients exiting trials due to adverse effects) when the proportion of patients in the synthetic control group reached 2 percent.

Quality of life

Only two studies of sequential monotherapy reported quality of life outcomes (Evidence Table 37).80, 81 Evidence Table 39 lists the scales and subscales used to measure quality of life in these studies. Due to the small number of studies, we did not perform meta-analyses of the results. There were no statistically significant changes in quality of life in either of the two studies. The lack of statistical significance may have been due to insufficient power. An estimate of the power of pre- vs. posttests would require knowledge of the correlation between baseline and outcome measurements. However, the authors did not report these correlations and therefore the power of this study to detect statistically significant quality of life changes could not be determined. Many of the subscales showed a nonsignificant improvement over baseline. However, these subscales are not independent (i.e., improvement on one subscale is likely to result in improvement in another subscale). Therefore, firm evidence-based conclusion about the influence of sequential monotherapy on quality of life cannot be based on these data.

Mood

Three studies of sequential monotherapy reported outcomes related to mood (Evidence Table 37).80–82 Evidence Table 40 lists the scales and subscales used to measure mood in these studies. Each of the three studies investigated a different drug for sequential monotherapy. Two of the three studies used the same set of scales (Dodrill, Arnett, Hayes, et al.81 and Dodrill, Arnett, Shu, et al.80). As with quality of life, the small number of studies precluded any meta-analysis. None of the subscales in the study by Dodrill, Arnett, Hayes, et al.81 showed statistically significant changes in mood. In the study by Dodrill, Arnett, Shu, et al.,80 one of eight subscales (the Vigor-Activity subscale) exhibited a statistically significant decrement from baseline in mood. As discussed in the quality of life section, insufficient power may have prevented these studies from detecting changes in mood. However, incomplete reporting in the published literature prevents investigating this possibility.

Ketter, Malow, Flamini, et al.82 reported that mood and psychiatric symptom scores changed after 2 weeks of sequential monotherapy in patients given felbamate. After the removal of all AEDs, patients' mood scores significantly worsened relative to baseline for seven of the 11 subscales. At both week 1 and week 2 of felbamate monotherapy, the decrements persisted. Thus, the initiation of felbamate monotherapy did not return patients' mood scores to baseline. With longer followup, patients' mood scores may potentially have returned to baseline or even improved over baseline. However, these findings suggest that the first phase of sequential monotherapy (i.e., the drug reduction phase) may cause significant worsening of mood and psychiatric symptom scores. However, because there was only one study reporting such changes, firm evidence-based conclusions cannot be drawn about the general effect of sequential monotherapy on mood.

Cognitive function

Only two studies of sequential monotherapy reported cognitive function (Evidence Table 38).80, 81 The two studies used the same subscales for measuring cognitive function (Evidence Table 41). Due to the small number of studies that reported the effect of sequential monotherapy on cognitive function, we did not perform meta-analyses of these data. In the study described by Dodrill, Arnett, Hayes, et al.81, none of the 19 cognitive function subscales were significantly different from baseline. Of the 19 subscales in the study described by Dodrill, Arnett, Shu, et al.,80 four showed a statistically significant improvement from baseline. These results may have been caused by a practice effect and not by tiagabine (see the discussion of instrumentation bias in the Methodology section). The power of these studies to detect changes in cognitive function could not be calculated because the authors did not report the correlations between baseline and outcome measurements.

Functional status/ability

No studies of sequential monotherapy reported this outcome.

Ability to return to work

No studies of sequential monotherapy reported this outcome.

Ability to return to school

No studies of sequential monotherapy reported this outcome.

Ability to hold a driver's license

No studies of sequential monotherapy reported this outcome.

Mortality

Five of the 13 studies of sequential monotherapy (38 percent) reported whether any patients died during the study. Three of the five studies reported that no patients died,78, 83, 84 and the other two studies each reported one death.68, 85 The authors did not attribute either death to the treatment. The mortality rates in these five studies ranged from 0 percent to 2 percent (Evidence Table 42).

Polytherapy

Polytherapy is defined as the administration of a multiple-drug regimen in which at least one of the drugs is novel to each patient. As with sequential monotherapy, patients received any number of drugs prior to the initiation of a new drug. Most polytherapy interventions involve the addition of a single novel drug to patients' regimens (referred to as “add-on” treatment). In this section, we describe the evidence base for polytherapy, assess the quality of these trials with respect to both internal and external validity, and analyze the trials' results for all relevant outcomes.

Excluded studies

Thirty trials of polytherapy met the inclusion criteria. None were excluded for quality reasons.

Evidence base

The evidence base contained 30 trials that enrolled 4,834 patients.

Design and conduct of included studies

Table 11. Drugs and doses in trials of polytherapy
ReferenceDrugTrial Dose(s)aTotal Number of Trials That Used This Drug/Dose Combination
Faught (2001)94Zonisamide4001
Ben-Menachem (2000)95Levetiracetam30002
Betts (2000)96Levetiracetam2000, 40001, 1
Cereghino (2000)97Levetiracetam1000, 30001, 2
Glauser (2000)98OxcarbazepineTailored to weight1
Appleton (1999)99GabapentinTailored to weight1
Biton (1999)100TopiramateTailored to weight3
Duchowny (1999)30LamotrigineTailored to weight2
Elterman (1999)101TopiramateTailored to weight3
Korean Topiramate Study Group (1999)102Topiramate6004
Sachdeo (1999)103TopiramateTailored to weight3
Uthman (1998)104Tiagabine16, 32, 561, 2, 1
Sachdeo (1997)105Tiagabine32b, 32c2, 1
Ben-Menachem (1996)90Topiramate8002
Chadwick (1996)106Gabapentin12003
Faught (1996)91Topiramate200, 400, 6001, 2, 4
Privitera (1996)107Topiramate600, 800, 10004, 2, 1
Sharief (1996)108Topiramate4002
Tassinari (1996)109Topiramate6004
Willmore (1996)110ValproateTailored to weight1
Anhut (1994)111Gabapentin900, 12002, 3
Messenheimer (1994)112Lamotrigine4001
Bourgeois (1993)113Felbamate36001
Felbamate Study Group (1993)114FelbamateTailored to weight1
Matsuo (1993)115Lamotrigine300, 5001, 1
McLean (1993)116Topiramate600, 1200, 18004, 1, 1
Schmidt (1993)117ZonisamideTailored to weight1
Sivenius (1991)118Gabapentin9002
UK Gabapentin Study Group (1990)119Gabapentin12003
Jawad (1989)70LamotrigineTailored to weight2
a

Maximum dose in milligrams per day

b

Based on 16 milligrams twice per day

c

Based on 8 milligrams four times a day

Aspects of the trial designs appear in Evidence Tables 43 through 46, and the patient characteristics appear in Evidence Tables 47 through 53. All 30 trials were randomized, placebo-controlled, add-on trials. In these trials, patients continued to take their pretrial drug regimens, and either a placebo or a new drug was added to those regimens. Nine add-on drugs were investigated in these trials: topiramate (9 trials), gabapentin (6 trials), lamotrigine (4 trials), levetiracetam (3 trials), tiagabine (2 trials), zonisamide (2 trials), felbamate (2 trials), oxcarbazepine (1 trial), and valproate (1 trial). No single dose of a given drug was used in all trials of that drug (Table 11). Of the 29 drug doses, 20 (69 percent) were employed by only one trial, and no drug dose was employed by more than four trials. Ten trials (33 percent) individualized the dose to each patient based on weight. These observations highlight the wide variation among trials' implementations of the polytherapy strategy.

Internal validity

For each trial of polytherapy, we determined whether the results were potentially biased by the factors noted in the Methodology section. Other questions in this report consider the potential for attrition bias, but for polytherapy, we did not consider it because attrition was a study outcome. All 30 trials of polytherapy were randomized and placebo-controlled. Thus, they were free from many potential threats to internal validity (see Appendix B). We meta-analytically tested selection bias with respect to several patient characteristics, and trials were free from potential selection bias in all but two cases (Evidence Table 54 through 58; also see discussion in Appendix B). All of the trials were free from five potential biases (sampling, regression, investigator, patient, and extraneous event). However, all of the trials had potential measurement bias. In addition, 90 percent of the trials had sample specification bias.

External validity

In our appraisal of the external validity of trials of polytherapy, we considered aspects of patient enrollment as well as the actual characteristics of patients in the trials (Evidence Tables 43 to 53). Twenty-seven trials (90 percent) enrolled patients because of seizure type: 25 for partial seizures, and two for generalized seizures. Two trials enrolled only patients with Lennox-Gastaut syndrome, and one trial included patients with any seizure type or syndrome. Six trials enrolled children only, 23 trials enrolled adults only, and one trial enrolled both children and adults. In the six trials of children, the mean age ranged from 7.9 to 13.0, and in the 23 trials of adults, the mean age ranged from 29.4 to 38.0. The proportion of patients who were female ranged from 0.14 to 0.69, and was less than 0.50 in 21 of the 28 trials that reported this characteristic. Median seizure frequency ranged from 1 to 80 seizures per month, and mean seizure frequency ranged from 7.3 to 68.7 seizures per month. The proportion of patients who had received two or more prior AEDs ranged from 0 to 0.81. This proportion was greater than 0.5 in 15 of the 18 trials that reported this patient characteristic. As a whole, then, the characteristics of the patients in these studies are not particularly unusual.

Synthesis of study results

Outcomes in trials of polytherapy
ReferenceSeizure FrequencyAdverse EffectsQuality of LifeMoodCognitive FunctionAbility to Return to WorkAbility to Return to SchoolAbility to Hold a Driver's LicenseMortality
Faught (2001)94[check][check]
Ben-Menachem (2000)95[check][check]
Betts (2000)96[check][check]
Cereghino (2000)97[check][check][check][check]
Glauser (2000)98[check][check][check]
Appleton (1999)99[check][check][check]
Biton (1999)100[check][check]
Duchowny (1999)30[check][check]
Elterman (1999)101[check][check][check]
KTSG (1999)102[check][check]
Sachdeo (1999)103[check][check]
Uthman (1998)104[check][check][check][check][check]
Sachdeo (1997)105[check][check]
Ben-Menachem (1996)90[check][check][check]
Chadwick (1996)106[check][check][check]
Faught (1996)91[check][check][check]
Privitera (1996)107[check][check][check]
Sharief (1996)108[check][check]
Tassinari (1996)109[check][check]
Willmore (1996)110[check][check]
Anhut (1994)111[check][check][check]
Messenheimer (1994)112[check][check]
Bourgeois (1993)113[check][check]
FSG (1993)114[check][check]
Matsuo (1993)115[check][check]
McLean (1993)116[check][check]
Schmidt (1993)117[check][check]
Sivenius (1991)118[check][check]
UKGSG (1990)119[check][check]
Jawad (1989)70[check][check]
Totals30302110009
In this section, we assess the results separately for each of the relevant outcomes (Table 12). All reported outcomes appear in Evidence Tables 59 through 62. Seizure frequency and adverse effects were each reported by all 30 trials, whereas the other outcomes were not commonly reported.

In cases where meta-analyses were conducted, we used random-effects models because, as shown in Table 12, the trials employed a variety of drugs and doses. The trials are therefore not derived from a population of trials with a fixed mean. Our meta-analytic syntheses of trial results yield approximate estimates of the typical effect of adding a new AED to patients' prior AED regimens. However, these estimates have limited generalizability because the effect of a new AED may depend on the other AEDs in patients' regimens. Each trial employed a control group of patients who received an add-on placebo, but the prior regimens were different among different trials (and among patients in a single trial). Thus, the 30 trials did not administer the exact same “control” treatment. Because the treatments and controls differ across trials, the summary effect sizes from random-effects meta-analyses can only be used as approximate estimates of the effect of adding a new AED and may be best suited for use as starting points in future research. The actual effect on seizure frequency or adverse effects in any single patient is likely to depend on the specific drug to be added as well as characteristics of the AEDs already in use.

We performed all meta-analyses on an intent-to-treat basis. This means that we included all randomized patients in our analyses, not solely the patients who completed the trials. If a patient exited early from a trial and the authors did not report the relevant outcome for that patient, we assumed that seizure frequency did not decrease for that patient. This is a reasonable assumption because all patients who respond to a drug would likely be reported as responders.

Seizure frequency

Seizure frequency outcomes in trials of polytherapy
Absolute Monthly Seizure Frequency Absolute Difference From Baseline Absolute Percent Difference From Baseline Number of Patients With
ReferenceMeanMedianMeanMedianMeanMedianSeizure-free≥75 Percent Reduction≥50 Percent Reduction≥25 Percent ReductionAny ReductionAny Increase≥50 Percent Increase≥75 Percent IncreaseMean Response RatioaMean Adjustment Response Ratioa
Faught (2001)94[check][check][check][check]
Ben-Menachem (2000)95[check][check][check][check]
Betts (2000)96[check][check][check][check]
Cereghino (2000)97[check][check][check][check]
Glauser (2000)98[check][check][check]
Appleton (1999)99[check][check][check][check][check][check][check][check][check]
Biton (1999)100[check][check][check][check]
Duchowny (1999)30[check][check][check][check]
Elterman (1999)101[check][check][check][check]
KTSG (1999)102[check][check][check][check][check]
Sachdeo (1999)103[check][check][check][check]
Uthman (1998)104[check][check][check][check][check]
Sachdeo (1997)105[check][check][check]
Ben-Menachem (1996)90[check][check][check][check][check][check]
Chadwick (1996)106[check][check]
Faught (1996)91[check][check][check][check]
Privitera (1996)107[check][check][check]
Sharief (1996)108[check][check][check][check]
Tassinari (1996)109[check][check][check][check][check]
Willmore (1996)110[check][check][check][check][check][check][check][check]
Anhut (1994)111[check][check][check][check][check][check]
Messenheimer (1994)112[check]
Bourgeois (1993)113
FSG (1993)114[check][check][check][check][check]
Matsuo (1993)115[check][check][check][check][check][check][check]
McLean (1993)116[check][check][check][check][check][check]
Schmidt (1993)117[check][check][check][check][check]
Sivenius (1991)118[check][check][check][check][check][check][check]
UKGSG (1990)119[check][check][check][check][check][check][check][check]
Jawad (1989)70[check][check]
Totalsb211132241817278564224
a

The response ratio is the ratio (T-B)/(T+B) where T is the number of seizures a month during treatment and B is the number of seizures a month during baseline. Some authors adjusted the response ratio in order to account for differences between centers in multi-center trials (using ANOVA).

The included trials reported 20 different measures of seizure frequency (Table 13). Seven measures were reported by five or more trials. One was a measure of absolute seizure frequency (median percentage reduction), and the remaining six were dichotomous measures. We did not analyze two of the dichotomous measures (75 percent or more reduction and 25 percent or more reduction) because they provided data that was effectively captured by other dichotomous measures (seizure-freedom, 50 percent or more reduction, and any reduction). The use of multiple seizure types and multiple study intervals necessitated that we adopt two selection rules for abstracting data from an included study.j First, if a study reported the same seizure frequency measure for more than one seizure type, we selected the most general type for inclusion in any meta-analyses.k Second, if a study reported the same seizure frequency measure for different study intervals, we selected the longest interval for inclusion in any meta-analysesl These selection rules permitted us to focus our analyses on the most general and widely reported seizure frequency measures.

In considering meta-analyses of the seizure frequency outcomes, nine of the 30 trials each contained three or more groups of patients. From each of these nine trials, therefore, multiple effect sizes can be computed (e.g., dose 1 vs. placebo, and dose 2 vs. placebo). Multiple effect sizes within a single trial are statistically dependent. Ideally, we would analyze these data using general linear models that account for this dependence. This, however, was precluded by the relative paucity of data. Therefore, to avoid this dependence, for each meta-analysis we selected only one drug dose from each trial. Thus, the effect size we computed for each trial was based on the difference between outcomes in one add-on drug group and the add-on placebo group. In some meta-analyses (“high-dose”), we selected the highest-dose group in each trial, whereas in other meta-analyses (“low-dose”), we selected the lowest-dose group in each trial. Trials with only one add-on drug group appear in both the high-dose and low-dose meta-analyses. Consequently, the high-dose meta-analysis was not independent of the low-dose meta-analysis.

A comparison between the results of a high-dose meta-analysis with those of the corresponding low-dose meta-analysis can be viewed as a form of sensitivity analysis. Larger effect sizes may be expected a priori in the high-dose meta-analysis (i.e., larger effects with higher doses). Performing both analyses permits us to estimate the robustness of the results. Although this approach allows us to estimate the effect of high- and low-dose polytherapy, it has the disadvantage that each meta-analysis uses only a subset of the available data. Consequently, some information is lost in our analysis.

Median percentage reduction. Twenty-four of the 30 included trials reported the median percentage reduction in seizures. However, none of these trials reported the dispersion about these medians (e.g., variances, standard deviations, interquartile ranges). Therefore, effect sizes could not be calculated and a meta-analysis was not conducted with these data.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf16.jpg.

   Figure 16. Median percentage reduction in seizures after polytherapy

Note: In this plot, positive numbers represent reductions in seizures, whereas negative numbers represent increases in seizures.

Because the median percentage of seizure reduction was a commonly reported seizure frequency outcome, we plotted a summary of the published findings. This plot (Figure 16) depicts the 24 statistical comparisons to placebo that were reported.m Twenty-two of these 24 comparisons were statistically significant in favor of the add-on drug. The remaining two comparisons also favored the add-on drug, but the differences were not statistically significant. The range of medians was -18 percent to 13 percent for the groups that received add-on placebo, and 13 percent to 51 percent for the add-on drug groups (as a convention in the epilepsy literature, negative numbers represent percentage increases from baseline, and positive numbers represent percentage decreases from baseline).

The data in Figure 16 provide evidence for a placebo effect: 14 of the 17 placebo groups (82 percent) had a median percentage reduction that was greater than zero (i.e., a beneficial effect indicated by the rightward shift on the x-axis). This percentage is significantly larger than 50 percent (two-tailed sign test, p = 0.013). The size of this placebo effect cannot be estimated because the trials did not report dispersion statistics for median percentage reduction. The observed medians, however, do indicate that patients with treatment-resistant epilepsy have fewer seizures when a placebo is added to their drug regimens. This placebo effect does not influence our investigations of polytherapy because all trials were placebo-controlled and therefore all effect sizes involved comparisons to placebo groups. However, the placebo effect does underscore the need for placebo controls in treatment trials involving patients with epilepsy, because if a treated group improves, part of that improvement may be due to the initiation of any medical intervention rather than to the intervention itself.

Seizure-freedom. Eighteen of the 30 included trials reported the percentage of patients who became seizure-free. Two of these trials, however, only reported seizure-freedom for a severe type of partial seizure (secondarily generalized seizures) that was experienced by only a subset of patients before the trials.90, 91 They did not report freedom from a seizure type that all patients had experienced before the trial. Thus, we analyzed the 16 trials of polytherapy that did report the latter kind of seizure-freedom results.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf17.jpg.

   Figure 17. Forest plot: polytherapy and seizure-freedom (high-dose)

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf18.jpg.

   Figure 18. Forest plot: polytherapy and seizure-freedom (low-dose)

We first performed the high-dose meta-analysis. The percentage of patients who were seizure-free ranged from 0 percent to 9 percent in the high-dose groups and from 0 percent to 2 percent in the add-on placebo groups. The effect sizes are plotted in Figure 17, and the details of the random-effects meta-analysis appear in Evidence Table 63. The random-effects summary statistic (Cohen's h) was 0.29 (CI: 0.20 to 0.37). Patients who received a high-dose of add-on drug were statistically significantly more likely to become seizure-free compared to patients who received add-on placebo. The estimated summary percentages were 5 percent for the high-dose groups (CI: 3 percent to 7 percent) and 1 percent for the placebo groups (CI: 0 percent to 1.4 percent) as calculated from the back-transformed Cohen's h. Similar results were observed in the low-dose meta-analysis for seizure-freedom (Figure 18 and Evidence Table 64). The summary Cohen's h (0.28, CI: 0.20 to 0.36) was only slightly lower than the high-dose meta-analysis. The estimated summary percentage was 5 percent (CI: 3 percent to 7 percent) in the low-dose groups.

We performed four sensitivity analyses separately for the high-dose and low-dose meta-analyses. The sensitivity analyses involved recalculating the meta-analysis after separately removing the trial with the largest effect size, the smallest effect size, the largest sample size, and the smallest sample size. None of the four sensitivity analyses overturned our findings (Evidence Table 65 and Table 66).

In summary, the evidence suggests that adding a drug to patient's regimens increases the likelihood of becoming seizure-free. This finding occurred in both the high-dose and low-dose groups, and multiple sensitivity analyses did not overturn the results. However, seizure-freedom was analyzable for only 16 of the 30 trials of polytherapy.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf19.jpg.

   Figure 19. Forest plot: polytherapy and 50 percent seizure reduction (high-dose)

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf20.jpg.

   Figure 20. Forest plot: polytherapy and 50 percent seizure reduction (low-dose)

50 percent reduction. Twenty-seven trials reported the percentage of patients who experienced 50 percent or more reduction in seizures. As with seizure-freedom, we performed both a high-dose meta-analysis and a low-dose meta-analysis. The range was 13 percent to 50 percent in the high-dose groups and 0 percent to 25 percent in the placebo groups. A plot of the effect sizes appears in Figure 19, and the statistical details of the meta-analysis are in Evidence Table 67. The random effects summary statistic (Cohen's h) was 0.52 (CI: 0.43 to 0.62). Patients who received a high-dose of add-on drug were significantly more likely to experience 50 percent reduction compared to patients who received add-on placebo. The estimated summary percentages were 35 percent for the high-dose groups (CI: 31 percent to 38 percent) and 13 percent for the placebo groups (CI: 10 percent to 15 percent). We observed similar results with the low-dose meta-analysis (Figure 20 and Evidence Table 68). The random-effects summary Cohen's h was 0.45 (CI: 0.35 to 0.55), and the estimated summary percentage for the low-dose groups was 31 percent (CI: 27 percent to 36 percent).

As described previously, we performed four sensitivity analyses for both the high-dose and low-dose meta-analyses. None of these analyses overturned our findings (Evidence Tables 69 and 70).

These studies suggest that when a drug is added to patients' drug regimens, approximately one-third of patients will experience a 50 percent or more reduction in seizures. As mentioned above, however, the generalizability of this finding may be limited.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf21.jpg.

   FIgure 21. Forest plot: polytherapy and any seizure reduction (high-dose)

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf22.jpg.

   Figure 22. Forest plot: polytherapy and any seizure reduction (low-dose)

Any reduction. Five trials reported the percentage of patients who experienced any reduction in seizures. As with other measures, we performed both a high-dose meta-analysis and a low-dose meta-analysis. The range was 61 percent to 80 percent in the high-dose groups and 41 percent to 72 percent in the placebo groups. A plot of the effect sizes appears in Figure 21, and the statistical details of the meta-analysis are in Evidence Table 71. The random effects summary statistic (Cohen's h) was 0.37 (CI: 0.19 to 0.55). Patients who received a high-dose of add-on drug were significantly more likely to experience a reduction compared to patients who received add-on placebo. The estimated summary percentages were 70 percent for the high-dose groups (CI: 61 percent to 77 percent) and 52 percent for the placebo groups (CI: 44 percent to 61 percent). We obtained similar results with the low-dose meta-analysis (Figure 22 and Evidence Table 72). The random-effects summary Cohen's h was 0.31 (CI: 0.15 to 0.47), and the estimated summary percentage for the low-dose groups was 67 percent (CI: 59 percent to 74 percent).

We performed the four sensitivity analyses for both the high-dose and low-dose meta-analyses. None of these overturned our findings (Evidence Table 73 and 74).

These studies suggest that when certain AEDs are added to patients' drug regimens, approximately two-thirds of patients will experience some reduction in seizures. This analysis, like the previous one, may have limited generalizability.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf23.jpg.

   Figure 23. Forest plot: polytherapy and any seizure increase (high-dose)

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf24.jpg.

   FIgure 24. Forest plot: polytherapy and any seizure increase (low-dose)

Any increase. Six trials reported the percentage of patients who experienced any increase in seizures. One of these trials, however, reported this outcome for a specific seizure type that was experienced by only a subset of patients before the trial. Thus, we analyzed seizure increase data from the other five trials. As with other measures, we performed both a high-dose meta-analysis and a low-dose meta-analysis. The range was 16 percent to 38 percent in the high-dose groups and 28 percent to 44 percent in the placebo groups. A plot of the effect sizes appears in Figure 23, and the statistical details of the meta-analysis are in Evidence Table 75. The random effects summary statistic (Cohen's h) was 0.38 (CI: 0.23 to 0.53). Patients who received a high-dose of add-on drug were significantly less likely to experience an increase compared to patients who received add-on placebo. The estimated summary percentages were 21 percent for the high-dose groups (CI: 15 percent to 28 percent) and 39 percent for the placebo groups (CI: 32 percent to 46 percent). We observed similar results with the low-dose meta-analysis (Figure 24 and Evidence Table 76). The random-effects summary Cohen's h was 0.39 (CI: 0.22 to 0.57), and the estimated summary percentage for the low-dose groups was 20 percent (CI: 15 percent to 27 percent).

We performed the four sensitivity analyses for both the high-dose and low-dose meta-analyses. None of these analyses overturned our findings (Evidence Table 77 and 78).

These data suggest that when certain AEDs are added to patients' drug regimens, approximately 20 percent of patients will experience an increase in seizures. This analysis, like the previous ones, may have limited generalizability.

Adverse effects

Table 14. Overview of adverse effects of polytherapy
ReferenceDrug and Dose (mg/day)Percent of Patients Who Experienced Any Adverse EventName of Most Commonly Experienced Adverse EventPercent of Patients Who Experienced This Adverse Event
Faught (2001)94Zonisamide 400NRSomnolence15% (18/118)
Ben-Menachem (2000)95Levetiracetam 300055% (100/181)Asthenia13.8% (25/181)
Betts (2000)96Levetiracetam 200083% (35/42)Asthenia31% (13/42)
Betts (2000)96Levetiracetam 400084% (32/38)Somnolence45% (17/38)
Cereghino (2000)97Levetiracetam 100089% (87/98)Infection28% (27/98)
Cereghino (2000)97Levetiracetam 300089% (90/101)Infection27% (27/101)
Glauser (2000)98Oxcarbazepine 180091% (125/138)Vomiting36% (50/138)
Appleton (1999)99Gabapentin 1800NRViral infection11% (13/119)
Biton (1999)100Topiramate 400NRUpper respiratory tract infection41% (16/39)
Duchowny (1999)30Lamotrigine 75094% (92/98)Somnolence24% (24/98)
Elterman (1999)101Topiramate 400NRUpper respiratory tract infection41% (17/41)
KTSG (1999)102Topiramate 60081% (74/91)Anorexia21% (19/91)
Sachdeo (1999)103Topiramate 600NRSomnolence42% (20/48)
Uthman (1998)104Tiagabine 16NRNervous system69% (42/61)
Uthman (1998)104Tiagabine 32NRNervous system70% (62/88)
Uthman (1998)104Tiagabine 56NRNervous system77% (44/57)
Sachdeo (1997)105Tiagabine 32NRNervousness10.5% (11/105)
Ben-Menachem (1996)90Topiramate 800NRFatigue79% (22/28)
Chadwick (1996)106Gabapentin 120067% (39/58)Somnolence12% (7/58)
Faught (1996)91Topiramate 200NRDizziness36% (16/45)
Faught (1996)91Topiramate 400NRDizziness33% (15/45)
Faught (1996)91Topiramate 600NRDizziness35% (16/46)
Privitera (1996)107Topiramate 600NRFatigue38% (18/48)
Privitera (1996)107Topiramate 800NRAbnormal thinking44% (21/48)
Privitera (1996)107Topiramate 1000NRDizziness38% (18/47)
Sharief (1996)108Topiramate 400NRSomnolence35% (8/23)
Tassinari (1996)109Topiramate 600NRHeadache27% (8/30)
Willmore (1996)110Valproate 90 mg/kgNRNausea48% (37/77)
Anhut (1994)111Gabapentin 90063% (33/52)Somnolence22% (24/111)
Anhut (1994)111Gabapentin 120068% (76/111)Somnolence13% (7/52)
Messenheimer (1994)112Lamotrigine 400NRRash7% (3/44)
Bourgeois (1993)113Felbamate 3600NRHeadache40% (12/30)
FSG (1993)114Felbamate 3600NRAnorexia49% (18/37)
Matsuo (1993)115Lamotrigine 300NRHeadache32% (23/71)
Matsuo (1993)115Lamotrigine 500NRDizziness54% (39/72)
McLean (1993)116Gabapentin 60088% (89/101)Dizziness25% (13/53)
McLean (1993)116Gabapentin 120091% (49/54)Somnolence36% (36/101)
McLean (1993)116Gabapentin 180087% (46/53)Somnolence20% (11/54)
Schmidt (1993)117Zonisamide 20 mg/kg59% (42/71)Fatigue23% (16/71)
Sivenius (1991)118Gabapentin 900NRDrowsiness25% (4/16)
UKGSG (1990)119Gabapentin 120062% (38/61)Somnolence14.8% (9/61)
Jawad (1989)70Lamotrigine 400NRNRNR

mg/day Milligrams per day

NR Not reported

mg/kg Milligrams per kilogram

All 30 included studies of polytherapy reported adverse effects of the new drug treatment. The overall percentage of patients who experienced any side effects was reported by 16 studies, and ranged from 55 percent to 94 percent (Table 14). Somnolence was the most common adverse effect in nine studies, and dizziness was the most common adverse effect in four studies. All details of the adverse effects in the 30 studies appear in Evidence Table 60.

To summarize the available data on adverse effects, we focused on whether the adverse effects in a given patient were severe enough to warrant discontinuation of the new drug (i.e., trial exit). In trials of polytherapy, an add-on drug may be more likely or less likely to be discontinued due to adverse effects compared to add-on placebo.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf25.jpg.

   Figure 25. Forest plot: polytherapy and trial exits due to adverse effects (high-dose)

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf26.jpg.

   Figure 26. Forest plot: polytherapy and trial exits due to adverse effects (low-dose)

Percentage of patients exiting trials due to adverse effects. All 30 trials of polytherapy reported this outcome. We meta-analyzed these data using the same methods that we used to analyze seizure frequency. The effect sizes are plotted in Figure 25, and the details of the high-dose meta-analysis appear in Evidence Table 79. The random-effects summary Cohen's h was significantly negative (-0.18, CI: -0.26 to -0.11). Thus, patients in the high-dose groups were significantly more likely to exit trials due to adverse effects compared to patients in placebo groups. The estimated summary percentages were 8 percent for the high-dose groups (CI: 6 percent to 10 percent) and 4 percent for the placebo groups (CI: 2 percent to 5 percent). Similar results were observed for the low-dose meta-analysis (Figure 26 and Evidence Table 80). The random-effects summary statistic was -0.16 (CI: -0.23 to -0.08), and the estimated summary percentage for the low-dose groups was 7 percent (CI: 5 percent to 9 percent).

We performed the same sensitivity analyses and they did not overturn any of our findings (Evidence Table 81 and 82).

Thus, adding a certain AED to a patient's drug regimen is more likely to cause adverse effects resulting in trial exit compared to adding a placebo. This finding persisted through multiple sensitivity analyses.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf27.jpg.

   Figure 27. Tradeoff between seizure frequency and adverse effects

Tradeoff between seizure frequency and adverse effects. We next evaluated the tradeoff between seizure frequency and adverse effects in trials of polytherapy. In the section on seizure frequency, we concluded that adding a drug to a patient's regimen is more likely to reduce seizures compared to adding a placebo. However, in the section on adverse effects, we concluded that adding a drug is also more likely to cause adverse effects resulting in trial exit. To illustrate the tradeoff, we constructed a scatterplot in which the horizontal axis represented the effect size for 50 percent seizure reduction and the vertical axis represented the effect size for exiting the trial due to adverse effects (Figure 27). We inverted the vertical axis so that the ideal drug would fall in the upper right quadrant of the plot (corresponding to fewer seizures and fewer adverse effects). Forty groups of patients who received an add-on drug are included in the plot (corresponding to the 27 trials that reported both 50 percent seizure reduction and adverse effect attrition). Thirty-one of 40 patient groups (78 percent) were in the lower right quadrant (fewer seizures and more adverse effects), and seven groups (18 percent) were in the upper right quadrant. This plot demonstrates the tradeoff between seizure frequency and adverse effects. However, reductions in both seizure frequencies and side effects also seem to occur.

Quality of life

Only two of the included trials of polytherapy reported quality of life (Evidence Table 61).92, 93 The two trials used different scales to measure quality of life (Evidence Table 83). Due to the small number of trials, we did not perform meta-analyses of the results. Instead, we created plots indicating the trials' results for all reported subscales. In the trial by Cramer, Arrigo, Van Hammee, et al.,92 four of the nine subscales of quality of life showed a statistically significant advantage of levetiracetam over placebo. Each of the other five subscales showed a nonsignificant advantage of levetiracetam. A statistical power analysis of this trial was not possible due to the lack of reporting of measures of dispersion. The results of this trial suggest that polytherapy with levetiracetam improves some aspects of quality of life. In the trial by Dodrill, Arnett, Sommerville, et al.,93 no statistically significant effect was found on any of the 10 subscales of quality of life. A statistical power analysis of this trial could not be conducted because these were pre-post comparisons and the authors did not report the correlations between baseline and outcome measurements. Because only two trials reported quality of life outcomes after polytherapy, evidence-based conclusions could not be made about the influence of polytherapy on quality of life.

Mood

One trial of polytherapy (add-on tiagabine) reported mood outcomes (Evidence Table 61).93 The trial used eight subscales to measure mood (Evidence Table 84). None of the eight subscales showed a statistically significant improvement in mood after add-on tiagabine. A statistical power analysis of this trial was not possible because the authors did not report the correlations between baseline and outcome measurements. Because this is only one trial, drawing any evidence-based conclusions about whether polytherapy affects mood is not possible.

Cognitive function

Only one of the included trials of polytherapy reported cognitive function (Evidence Table 62).93 The subscales for measuring cognitive function in this trial appear in Evidence Table 85. Because there was only one trial, we created a plot indicating its results for all reported subscales. Of the 19 subscales of cognitive function, only one (the Benton Visual Retention test, Form F) demonstrated a statistically significant effect. Patients in the placebo group improved from baseline more compared to patients who received tiagabine. A power analysis of this trial could not be conducted because the authors did not report the correlations between baseline and outcome measurements. Because only one trial addressed this issue, evidence-based conclusions cannot be made about the influence of polytherapy on cognitive function.

Functional status/ability

No trials of polytherapy reported this outcome.

Ability to return to work

No trials of polytherapy reported this outcome.

Ability to return to school

No trials of polytherapy reported this outcome.

Ability to hold a driver's license

No trials of polytherapy reported this outcome.

Mortality

Nine of the 30 trials of polytherapy (30 percent) reported whether any patients died during the trial. The mortality results of these trials are listed in Evidence Table 86. The mortality rates ranged from 0 percent to 2 percent. Five of the nine trials reported that no patients died, three trials each reported one death, and one trial reported two deaths. None of the authors attributed the deaths to the add-on drugs.

Optimization of Current Drug Therapy

The previous two parts of the present question addressed strategies related to the use of new AEDs or new combinations of AEDs in patients with treatment-resistant epilepsy. In this section, we assess strategies designed to optimize the effectiveness of a patient's current drug regimen. Strategies designed to optimize current drug therapy seek to improve patient outcomes by either; (1) reducing seizure frequency without increasing the incidence (or intensity) of the side effects associated with AED treatment or, (2) by reducing the side effects of AED without increasing seizure frequency, seizure severity, or the onset of a new seizure type. Ideally, a drug regimen would both decrease seizure frequency and reduce side effects. However, as shown above, this rarely occurs in patients with treatment-resistant epilepsy, and a trade-off exists between the intensity of drug treatment and the incidence and severity of associated side effects. Thus, in order for an optimization strategy to be of value, it must either lead to reductions in seizure frequency or reductions in side effects (and/or improvements in quality of life, cognitive function, and mood) while not leading to increases in the other.

Published literature describes three different methods for optimizing drug therapy in patients with treatment-resistant epilepsy: (1) increasing the dose of the current drug (or drugs) to maximum tolerable levels, (2) modifying the frequency of dosing, and (3) reducing the total number of drugs. In this subquestion, we evaluate the literature pertaining to all three of these strategies.

Number of studies addressing each drug optimization strategy

Eleven included articles addressed one of the three drug optimization strategies presented above (Evidence Table 87). Eight of the eleven articles described studies that assessed the drug reduction strategy, two articles described studies that assessed the maximum tolerable dose strategy, and one article described a study that assessed the dosing frequency strategy.

As discussed in the Methodology section of this report, only treatment strategies that were addressed by at least five acceptable studies were evaluated. One of the three drug-optimization strategies, the drug reduction strategy, was addressed by enough studies to meet this criterion. The prerequisite number of studies did not address the remaining two strategies, even when the inclusion criteria were relaxed to allow for the inclusion of retrospective studies. Consequently, we do not include further information concerning implementation of either the maximal tolerable dose or the optimized dosing frequency strategies.

Drug Reduction Strategy

The goal of drug reduction strategy is to reduce the number of AEDs without increasing seizure frequency above some unacceptable level. As implied above, this strategy is based on the (reasonable) assumption that reducing the number of AEDs taken by a patient should result in reduced side effects, which will lead to increased quality of life, improved cognitive function, improvements in mood, and reduced costs.n

Excluded articles

We excluded one of the eight articles that both met the general and question-specific inclusion criteria. This article and the reason for its exclusion are presented in Evidence Table 88.

Evidence base

After the exclusion of one study, seven articles remained.120–126 These studies included data collected from 311 patients. Details of the studies described by these articles are presented in Evidence Tables 89 through 98.

All of the studies included in the present evidence base were prospective, three were controlled,121, 122, 126 and the remaining four utilized a case series design.120, 123–125 Two of the three controlled trials were single-blinded,o and not randomized.121, 126 The remaining controlled trial was randomized and double blinded.122 However, this study randomized patients within the drug reduction arm to drug reduction at either a slow rate or a fast rate, and patients were not randomly allocated to the two principal arms of the study, the drug reduction and the control arms. Since the primary objective of the present subquestion is to determine whether implementation of the drug reduction strategy leads to improved patient outcomes, this study, for the purposes of this section of the report, must be considered a nonrandomized controlled trial.

Design and conduct of included studies

This section presents the findings of our systematic assessment of the quality of the seven studies that assessed the effectiveness of the drug reduction strategy. This systematic assessment consisted of an appraisal of both the internal and external validity of each included study.

Internal validity

Measurement bias, regression to the mean, extraneous event bias, and sample specification bias were potentially present in all seven studies. Patient reporting bias and investigator reporting bias may have been present in six studies. Selection bias potential affected the three controlled trials. Sampling bias may have been present in the six studies that did not report how patients were enrolled in the study. Attrition bias was a potential factor in one study with more than a 10 percent attrition rate. These potential biases with respect to this question are discussed in detail in Appendix B.

External validity

Details of the patient characteristics that were reported by each of the articles in the present evidence base are presented in Evidence Tables 93 through 98.

The range of ages covered by each of the studies in the present evidence base tended to be broad, and, although no study exclusively enrolled adults, six of the seven studies enrolled mainly adults.120–122, 124–126 The remaining study enrolled solely children.123 We were unable to determine the upper age of the patients in the study described by Callaghan, O'Dwyer, and Keating124 because of inconsistent reporting (the reported mean patient age was 26 years but the range was reported as 6 to 24 years). The duration of epilepsy suffered by the patients in the included studies varied considerably with durations ranging from less than 1 year to well over 60 years.

The proportion of females in each of the studies included in the present evidence base varied considerably between studies (from under 25 percent to over 80 percent). One study did not report the sex ratio (Schmidt125).

Two of the seven studies included for this question did not restrict their patient sample by age or seizure type.124, 126 The remaining five studies enrolled patients because they were considered representative of a specific subpopulation of patients with treatment-resistant epilepsy. Three of the studies recruited institutionalized patients with severe epilepsy and multiple cognitive and/or behavioral deficits.120–122 Two of the studies recruited patients because they suffered from a particular seizure type.120, 125

Although all studies included in the present evidence base investigated a common optimization strategy (the drug reduction strategy), each study did so in a different way. For example, the aim of Specht, Boenigk, Wolf, et al.123 was to evaluate the effects of the removal of all patients in their study from a single drug (clonazepam), whereas the aim of the study by Callaghan, O'Dwyer, and Keating124 was to reduce all patients in their study from polytherapy to monotherapy or, if this was not possible, to two AEDs. Because the evidence base pertaining to drug reduction strategy was small, quantitative analyses could not be performed that would indicate whether the findings of the individual studies were similar. Without evidence to demonstrate such similarity, conclusions about the effectiveness of the drug reduction strategy as a whole are not possible. Instead, each variation of the drug reduction strategy must be considered separately, and the findings of each individual study may only be generalized to patients with characteristics similar to those included in that study.

Synthesis of study results

The assessment of study quality presented above indicates that, given the present evidence base, definitive conclusions cannot be drawn about whether implementation of the drug reduction strategy is effective in improving outcome in patients with treatment-resistant epilepsy. Acknowledging this, we have instead evaluated the available data with the aim of determining whether the implementation of this strategy may plausibly be effective in improving outcome among patients with treatment-resistant epilepsy.

Table 15. Outcomes in studies of drug reduction
Reported Seizure Outcomes Reported Nonseizure Outcomes
ReferenceDifference in Absolute Seizure FrequencyPercent Change in Seizure FrequencyPercent Patients Seizure-freePercent of Patients With >50 Percent Reduction in Seizure RatePercent of Patients With an Increase in Seizure FrequencyMoodCognitive FunctionAdverse EventsMortality
Controlled trials performed outside of the United States
May (1992)121[check]a[check][check]
Duncan (1990)122[check][check]b[check]c[check]d[check]
Thompson (1982)126[check][check][check][check]
Case series performed in the United States
Mirza (1993)120[check][check][check][check]
Case series performed outside of the United States
Specht (1989)123[check]e[check]e[check]e[check]e[check]e[check][check]
Callaghan (1984)124[check]?[check][check][check]
Schmidt (1983)125[check]e[check]e[check]e[check]e[check]e[check][check]
Number of articles addressing outcome4f24432357
a

May, Bulmahn, Wolhlhuter et al.121 reported that no statistically significant between groups differences in seizure frequency were seen but did not present any data

b

Mood data abstracted from Kendrick, Duncan, and Trimble136

c

Cognitive function data abstracted from Duncan, Shorvon, and Trimble137

d

Adverse events data abstracted from Duncan, Shorvon, and Trimble138

e

Data calculated by ECRI from individual patient data

f

Does not include May, Bulmahn, Wolhlhuter et al.121 (see footnote a above).

Not all of the outcomes listed by the Technical Expert Panel (see question-specific inclusion criteria above) were reported on in all of the articles in the present evidence base. The reported outcome measures and the articles that contained data pertaining to these outcome measures are presented in Table 15.

Seizure frequency outcomes

As stated previously, the goal of the drug reduction strategy is to remove a drug (or drugs), thereby reducing the occurrence of (or the risk for) adverse effects associated with the use of AEDs. This goal must be accomplished without increasing seizure frequency to unacceptably high levels. Although reductions in seizure frequency are desirable and may indeed occur, they are not, in this instance, to be expected. Consequently, studies needed only to demonstrate that implementation of the drug reduction strategy resulted in other benefits such as reductions in adverse events, increases in cognitive function, increases in quality of life, reduced cost, etc. This means that trials that evaluate changes in seizure frequency that result from drug reduction strategy must also demonstrate, through hypothesis testing, that clinically meaningful increases in seizure frequency did not occur.

In such trials, which are akin to studies of therapeutic equivalence,p classical hypothesis testing (with the usual null hypothesis that there is no difference between the interventions) is inappropriate.127–131 This is because the desired result of a bioequivalence study would be to prove the null hypothesis by showing that no increases in seizure frequency occurred in the treatment group when compared to the comparison group.q An alternative hypothesis allows meaningful statistical analyses to be performed. In this instance, the alternative hypothesis is that seizure frequency increases in the treatment group will be less than a prespecified level, δ, above the seizure frequencies seen in the comparison group (HA: XDRS-Xc<δ; where HA = alternative hypothesis). Thus, to demonstrate that implementation of a drug reduction strategy does not lead to increases in seizure frequency, any difference in seizure frequency between the treatment group and the comparison group (along with its 95 percent confidence intervals) must fall entirely below δ. Confidence intervals that extend above δ indicate that the alternative hypothesis has not been refuted and implementation of the strategy may lead to increases in seizure frequency.

Table 16. Possible decisions based on hypothesis test
Testing H0: XC = XDRSTesting H0': XDRS≥Xc + δ
True Difference XDRS - XcFail to Reject (good for DRS)Reject (bad for DRS)Reject (good for DRS)Fail to Reject (bad for DRS)
XDRS - Xc = 0 (good for DRS)Correct DecisionType I errorCorrect DecisionType II error
XDRS - Xc = δ (bad for DRS)Type II errorCorrect DecisionType I errorCorrect Decision

Adapted from Blackwelder128

DRS Drug reduction study

H0 Null hypothesis (standard)

H0' Null hypothesis (therapeutic equivalence studies)

Xc Mean seizure frequency in control group

XDRS Mean seizure frequency in drug reduction therapy group

δ Predefined difference in mean seizure frequency above which use of drug reduction study is unacceptable

As is the case with conventional hypothesis testing, a study should be designed with adequate power to avoid the possibility of making Type II statistical errors. As shown in Table 16, when performing hypothesis testing using the alternative hypotheses, the “standard” rules of a Type I error and a Type II error become reversed. Thus, a Type I error is made if the difference XDRS-Xc is less than δ when, in fact, the difference is greater than or equal to δ, and a Type II error is made when the difference is greater than or equal to δ when it is actually less than δ.

Given the information above, the seizure outcomes of importance in this evaluation are those that assess increases in seizure frequency. Outcomes that assess improvements in seizure frequency (proportion of patients seizure-free, proportion of patients achieving a greater than 50 percent decrease in seizure frequency), though interesting, are of secondary importance. As a result, we have focused this section of the report on three seizure frequency outcomes (absolute seizure frequency, percentage change in seizure frequency, and proportion of patients with an increase in seizure frequency). Data pertaining to the remaining seizure frequency outcomes are summarized in Evidence Table 99 but are not discussed further.

Absolute seizure frequency.Two of the three controlled trials included for this subquestion presented data on (mean or median) absolute seizure frequency. Two studies are too few to allow a quantitative analysis to be performed. As a result, we present the findings of our semi-quantitative analysis of the available data. These data are presented in Evidence Table 99.

As discussed above, to demonstrate that seizure frequency does not increase in patients using a drug reduction strategy, the strategy must be shown not to cause clinically important increases in seizure frequency (XDRS- Xc ≤ δ). This requires the authors to explicitly state what they consider a meaningful increase in seizure frequency (δ). Based on this seizure frequency, they should then state the size of the study (power) necessary to overturn the null hypothesis that seizure frequency in patients who received drug reduction will increase above this predefined seizure frequency.r Neither of the two controlled trials that reported this outcome stated what they considered to be a clinically important difference in seizure frequency, nor did they perform a power analysis.s

All of the statistical analyses presented in these two articles tested the traditional null hypothesis that no between-groups differences in seizure frequency exist. Thus, their analyses essentially attempted to prove the null hypothesis that there was no change in seizure frequency. As discussed above, this is inappropriate.

Because the investigators did not determine the power of their study and because their statistical analyses were not appropriate for the clinical question of interest, the seizure frequency analyses in the articles are of limited value. However, summary data from these studies may still be used to provide some useful information. This can be accomplished by calculating the mean difference in seizure frequency (and its CI) between the drug-reduction group and the control group for each study. The upper CI of this difference can then be used to determine the maximum magnitude of increase in seizure frequency that will not lead to the alternative hypothesis (HA: XDRS - Xc ≤ δ) being accepted over the null hypothesis (H0: XDRS - Xc >δ). Such an approach, however, requires that the study report seizure frequency data in such a way that a difference can be calculated.

Duncan, Shorvon, and Trimble122 summarized their seizure frequency data in terms of mean seizure frequencies along with its standard deviation. The range cannot be used to calculate a valid standardized between-groups difference. No other measures of dispersion were reported. As a result, the seizure frequency data presented by Duncan, Shorvon, and Trimble122cannot be used to determine whether implementation of the drug reduction strategy leads to clinically important increases in seizure frequency.

Unlike Duncan, Shorvon, and Trimble,122 Thompson and Trimble126 presented mean seizure frequency along with its dispersion (expressed in terms of standard deviations). However, the analysis described above still cannot be performed, because the technique is sensitive to pretreatment differences in seizure frequency. Although no statistically significant between-groups differences in seizure frequency data were detected at pretreatment, a between-groups differences in seizure frequency at baseline did exist, and these differences were large enough to lead to biased posttreatment effect size estimates. For example, the mean pretreatment frequency for partial seizures in the drug reduction group was 21.1 (SD: 34.6) seizures per week compared to 6.8 (SD: 9.7) per week in the control group. Thus, patients in the drug reduction arm were experiencing more than three times the number of seizures per week compared to the patients in the control arm at study onset. Consequently, the study is biased against finding that the implementation of the drug reduction strategy will lead to increases in seizure frequency.

To summarize, the data from the currently available controlled trials could not be used to draw evidence-based conclusions about whether or not implementation of the drug reduction strategy leads to increases in seizure frequency.

Although none of the four included case series reported on this outcome, two studies did present individual patient data that allowed us to summarize the seizure frequency data both pre and post implementation of the drug reduction strategy. These data are presented in Evidence Table 99. They do not suggest that seizure frequencies increase following implementation of the drug reduction strategy. However, because these data originate from two uncontrolled studies and, because seizure frequency in patients with treatment-resistant epilepsy commonly demonstrates regression to the mean (see Methodology section),23, 132 this observation does not provide convincing evidence to support the contention that implementation of the drug reduction strategy does not lead to increases in seizure frequency.

As will be seen in the following sections, other seizure frequency-based outcomes suggest that drug reduction strategy may lead to large increases in seizure frequency in some patients.

Mean or median percentage change in seizure frequency. None of the three included controlled trials presented data on the percentage change in seizure frequency following implementation of the drug reduction strategy. Thus, conclusions about this outcome can only be based on case series data.

Two of the four case-series studies presented individual patient data that allowed us to assess this outcome. These data show that the median percentage change in seizure frequency from baseline was -0.09 percentt (Range: -100 percent to 412 percent) in the study by Specht, Boenigk, Wolf, et al.123 and -0.12 percent (Range: -100 percent to 2,678 percent) in the study by Schmidt.125 In both studies, more than 50 percent of the patients experienced a reduction in seizure frequency following implementation of the drug reduction strategy, and just under 50 percent of the patients experienced an increase in seizure frequency from baseline (43 percent of patients in the study by Specht, Boenigk, Wolf, et al.123 and 47 percent of patients in the study of Schmidt125). The proportion of patients who experience an increase in seizure frequency from baseline is addressed in more detail in the following section of the report.

Thus, these data suggest that a high proportion of patients (close to 50 percent) may experience increases in seizure frequency following the implementation of the drug reduction strategy. The data also suggest that some patients may experience decreases in seizure frequency. Given that regression to the mean is known to influence seizure frequency data,23,132 some of these observed reductions in seizure frequency were probably a manifestation of this bias. The only other possible explanation is that the withdrawn drug was somehow causing seizures.

Proportion of patients with an increase in seizure frequency. None of the three included controlled trials presented data on the proportion of patients with an increase in seizure frequency following implementation of the drug reduction strategy. Thus, conclusions on this outcome can only be based on data from case series.

One of the four case series presented data on increases in seizure frequency. Callaghan, O'Dwyer, and Keating124 reported that three of the 35 patients (9 percent) included in their study demonstrated an increase in seizure frequency. This information, however, is of limited value because the authors did not define what they meant by “worse.” Consequently, the magnitude of the reported increase in seizure frequency in these three patients cannot be determined, and no conclusions can be drawn as to whether these increases were clinically important.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf28.jpg.

   Figure 28. Increase in seizure frequency and drug reduction strategies

Percentage presented in parentheses is the actual proportion of patients with seizure frequencies greater than the percent increase in seizure frequency shown on the X-axis. The diamond and error bars represent the effect size and 95% CI.

Two other articles presented individual patient data that allowed us to calculate the proportion of patients with an increase in seizure frequency (Schmidt125 and Specht, Boenigk, Wolf, et al.123). Because the magnitude of a clinically important increase in seizure frequency remains ambiguous, we believed that arbitrarily reporting the proportion of patients above any single frequency was inappropriate. Instead, we calculated the proportion of patients that demonstrated increases in seizure frequency above a series of percentage increases from baseline (thresholds). These data, which are presented in Evidence Table 99, are summarized in Figure 28.

This figure shows that a statistically significant proportion of patients in both case series exhibited large (>100 percent) increases in seizure frequency when compared to baseline (27.8 percent of patients in the study by Schmidt125 and 8.6 percent in the study by Specht, Boenigk, Wolf, et al.123). In neither study did these patients have unusually low seizure frequency rates at the onset of the study, suggesting that implementation of the drug reduction strategy will result in increased seizure frequency in a significant proportion of patients.

Mood

Two of the three controlled trials presented data on changes in mood following the implementation of a drug reduction strategy. Two studies are too few to allow a quantitative analysis of the available data to be performed. As a result, we present the findings of our semi-quantitative analysis of the available data. These data are presented in Evidence Table 99.

Both Thompson and Trimble126 and Duncan, Shorvon, and Trimble122 presented mood data collected using two validated self-administered psychometric instruments. These instruments were the Middlesex Hospital Questionnaire (MHQ) and the Mood Adjective Checklist (MACL).

The MHQ is a self-administered questionnaire that measures six domains and provides a composite score. This instrument is commonly used as an aid in the diagnosis of clinical depression. The six domains that are assessed include: Free-floating anxiety (F-FA), phobic anxiety (PHO), obsessive-compulsive (OBS), somatic anxiety (SOM), depressive traits (DEP), and hysteric (HYS) traits. Although Duncan, Shorvon, and Trimble122 presented data for all six domains, Thompson and Trimble126 only reported on two (F-FA and DEP). Neither Duncan, Shorvon, and Trimble122 nor Thompson and Trimble126 found a statistically significant between-groups difference in any of the domains measured using the MHQ following completion of drug reduction. Nor were any trends in the data detected that would indicate that mood either improved or deteriorated following drug reduction.

The MACL is a standardized scale commonly used to detect alterations in mood across five domains. These domains provide measures of anxiety, fatigue, hostility, vigor, and depression, along with a composite score. Although both studies measured mood alterations using this instrument, only Thompson and Trimble126 presented relevant data in their article. Again, as was the case with reporting of the data obtained using the MHQ, Thompson and Trimble126 did not report data for all of the measured domains (in this case data for the domain “hostility” was not reported) and no explanation was provided as to why this was the case. Analysis of data abstracted from Thompson and Trimble126 did not find statistically significant between-group differences in any of the domains measured using the MACL. Nor were any trends in the data detected that would indicate that mood either improved or deteriorated following drug reduction. This finding was corroborated by Duncan, Shorvon, and Trimble122 who reported that, “There were no statistically significant differences between the four groupsu on the anxiety, depression, fatigue, vigor, or hostility subscales of the Mood Adjective Checklist.”122

Cognitive function

All three of the controlled trials included for the present subquestion presented data on changes in cognitive function following implementation of a drug reduction strategy when compared to a control group comprised of patients who were maintained on their current polytherapy drug regimen. Three studies are too few to allow a quantitative analysis to be performed. As a result, we present the findings of our semi-quantitative analysis of the available data. These data are presented in Evidence Table 99.

All three studies measured cognitive function using a series of standardized clinical tests. These tests included tests of concentration and attention, memory, and tests of psychomotor performance.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf29.jpg.

   Figure 29. Drug reduction strategies and tests of concentration/attention

Tests of concentration/attention. All three controlled trials measured concentration/attention before and after the implementation of the drug reduction strategy. These data are summarized in Figure 29. May, Bulmahn, Wohlhuter et al.121 used the d2 test and the modified version of the Frankfurt Concentration Test for Children (FCTC). Duncan, Shorvon, and Trimble122 used the Letter Cancellation Task (LCT), and Thompson and Trimble126 used the Stroop test (ST) and a test of visual scanning speed (VSS).

Data from only one of the three studies, Duncan, Shorvon, and Trimble,122 suggested that concentration improvement was statistically significant among patients who had undergone drug-reduction when compared to patients in the control group. The only statistically significant posttreatment benefit was seen in patients who were removed from sodium valproate (t = 4.245; p = 0.000108v) followed by patients who were removed from phenytoin (t = 1.965; p = 0.056). Assessment of the pretreatment LCT data, however, suggested the presence of selection bias, with patients who were removed from sodium valproate having statistically significantly higher baseline LCT scores compared to those in the control group (t = 3.404; p = 0.00140). Thus, the posttreatment between-groups difference was essentially the same as the pretreatment difference.

No such bias was found to have affected the LCT scores on removal of phenytoin and these data suggest that removal of phenytoin may lead to an improvement in concentration/attention in some patients. However, interpretation of the importance of a mean improvement of 18 points is difficult because the authors did not indicate if such a between-groups difference was clinically important.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf30.jpg.

   Figure 30. Drug reduction strategies and the Frankfurt Concentration Test for Children

Pre- and posttreatment Frankfurt Concentration Test for Children data from May (1992)121

May, Bulmahn, Wohlhuter et al.121 argued that their FCTC data showed a statistically significant between-groups difference in patients in the drug reduction arm (all of whom had phenytoin removed). Figure 30 shows graphically their reported pre-and posttreatment FCTC data. The data, as presented in the article, can lead to different conclusions. Changes in FCTC score seen from baseline between the two arms of their study were compared instead of the posttreatment data alone. Because FCTC scores improved in the reduction group and declined in the control group, the comparison found a significant between-groups difference. As shown in Figure 30, the changes in FCTC could reasonably be argued to be due to regression to the mean rather than an effect of treatment.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf31.jpg.

   Figure 31. Drug reduction strategies and tests of memory

Abbreviations:

LGT: Lern- und Gedachtnis Test

DSF: Digit scan forwards

DSB: Digit scan backwards

PHT: Phenytoin vs. Control

CBZ: Carbamezapine vs. Control

VPA: Valprioc acid vs. Control

IR: Immediate recall

DR: Delayed recall

R: Recognition

Memory. All three controlled trials measured memory before and after the implementation of the drug reduction strategy. May, Bulmahn, Wohlhuter et al.121 measured memory using a digit span and an immediate recall of pictures, and a delayed-recall task at the end of the test session that were taken from the Lern- und Gedachtnis-test (LGT-3). Duncan, Shorvon, and Trimble122 measured memory using a digit span task derived from the Wechsler Adult Intelligence Scale. Thompson and Trimble126 used an immediate-recall and delayed-recall of pictures task that they developed and validated themselves.133–135 Data on the effects of drug reduction on memory collected in these studies are summarized in Figure 31.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf32.jpg.

   Figure 32. Drug reduction strategies and digital scanning score

Data from Duncan (1990)122 showing effects of valprioc acid removal on digital scanning score

When considered as a whole, these data do not provide evidence that drug reduction leads to improved memory. Although the data from Duncan, Shorvon, and Trimble122 suggest that drug reduction may lead to statistically significant improvements in short-term memory (as measured by digital scanning backwards) in some patients who were removed from sodium valproate, these results may be biased. This is illustrated by Figure 32, which shows that a pretreatment difference in short-term memory existed between patients removed from sodium valproate and patients in the control group. Although this difference was not statistically significant (t = 1.842; p = 0.072; Hedges' d = 0.53; CI: -0.05 to 1.11), it is large enough to have biased the posttreatment between-groups effect size data. Indeed, Duncan, Shorvon, and Trimble122 reported that their statistical analyses showed that removal of phenytoin, carbamazepine, or sodium valproate did not lead to improvements in short-term memory.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf33.jpg.

   Figure 33. Drug reduction strategies and tests of psychomotor function

FT: Finger tapping

DH: Dominant hand

NDH: Non-dominant hand

PB: Pegboard

PRF: Pursuit Rotor Failure

PFD: Pursuit Failure Duration

PHT: Phenytoin vs. Control

CBZ: Carbamezapine vs. Control

VPA: Valprioc acid vs. Control

Psychomotor function. All three controlled trials measured psychomotor function before and after drug reduction. May, Bulmahn, Wohlhuter et al.121 used the pegboard, a pursuit rotor, and tapping. Duncan, Shorvon, and Trimble122 used tapping alone, as did Thompson and Trimble.126 Data on the effects of drug reduction on psychomotor function in these studies are summarized in Figure 33. These posttreatment, between-groups effect sizes do not provide evidence that implementation of the drug reduction strategy results in improved psychomotor function.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf34.jpg.

   Figure 34. Drug reduction strategies and psychomotor function

Pre-and posttreatment psychomotor function data presented by May (1992)121

Again, these findings contradict the conclusions drawn by the authors. May, Bulmahn, Wohlhuter et al.121 reported that their data demonstrated a statistically significant improvement in psychomotor function when measured using finger tapping (with the dominant hand) and pursuit rotor failure (again using the dominant hand). Figure 34 shows graphically the pre-and posttreatment data reported in these studies. These data can lead to different conclusions. As stated above, the discrepancy is due to comparing the changes in psychomotor function from baseline between the two arms of the study instead of comparing the posttreatment data alone. As shown by such a comparison in Figure 34, changes in psychomotor function in the drug reduction arm of the study could reasonably be argued to be due to regression to the mean rather than treatment.

Adverse events

Identification of treatment-related morbidities can only be achieved by comparing reported adverse event rates in patients who underwent drug reduction against a control group comprised of patients who were maintained on their current treatment regimen. Although case series identify possible adverse events that may be associated with a treatment, their data cannot be used to draw evidence-based conclusions about whether these adverse events are a consequence of the drug reduction strategy. As a result, we only considered adverse events data abstracted from controlled trials in this section of the report. However, adverse events data abstracted from the four case series included in the present evidence base are tabled in Evidence Table 100.

One of the three controlled trials reported relevant data on adverse events. Patients included in the drug reduction arm of the study by Duncan, Shorvon, and Trimble122 suffered no additional adverse events compared to patients included in the control arm. Thus, although the patients undergoing drug reduction did experience some adverse events, these adverse events cannot be attributed to drug reduction strategy used in this study.

Mortality

No patients were reported to have died during any of the seven included studies. Thus, no evidence exists to suggest that implementation of the drug reduction strategy leads to increased mortality.

Comparisons of Drug Strategies

None of the trials that met the inclusion criteria directly compared the drug strategies. Indirectly comparing two of the drug strategies is possible if the patients enrolled in all of the trials of all strategies were similar. The drug reduction strategy cannot be compared with the other two strategies, because of the differing intentions of investigators in these latter trials. The intent of the trials of polytherapy and sequential monotherapy was to reduce seizures without causing adverse effects, whereas the intent of the trials of drug reduction therapy was to reduce the number of drugs without increasing seizures.

To determine whether trials of polytherapy and sequential monotherapy enrolled similar patients, we compared the number of drugs given to patients receiving these strategies. Differences in the number of drugs likely mean that the severity of epilepsy in patients who received polytherapy was different from that in patients who received sequential monotherapy. Among 11 trials of sequential monotherapy that reported this percentage, two (18 percent) reported that more than half of the patients were receiving two or more prior AEDs. By contrast, among the 18 trials of polytherapy that reported this percentage, 16 (89 percent) reported that more than half of the patients were receiving two or more prior AEDs. These percentages are significantly different (χ2(1) =14.5, p = 0.00014). Thus, patients who received polytherapy had more severe epilepsy compared to patients who received sequential monotherapy. This difference precludes comparison of the quantitative results of the two strategies.

A qualitative comparison, however, suggests that polytherapy is clinically preferable to sequential monotherapy. In the section on sequential monotherapy, the evidence indicated that some patients had harmful increases in seizures as a direct result of the treatment, and whether sequential monotherapy caused any patients to become seizure-free could not be determined. In short, sequential monotherapy appeared more likely to be harmful than beneficial. By contrast, the reverse was true for polytherapy. Adding a drug reduced seizures by 50 percent in many patients, whereas adverse effects causing trial exits were rare. By inference, this suggests that polytherapy is preferable to sequential monotherapy.

Further, as discussed above, patients who received polytherapy had been receiving more drugs before the trial, thus they likely had more severe epilepsy. Patients with more severe epilepsy are, by definition, more difficult to treat. Thus, even in a more difficult-to-treat population, polytherapy helped many patients. This finding underscores the qualitative conclusion that polytherapy is preferable to sequential monotherapy for patients with treatment-resistant epilepsy.

Nondrug Treatments

In this section of the Evidence Report, we addressed Key Question #5: Which methods of nondrug treatment for epilepsy after initial treatment failure lead to improved outcomes for patients with treatment-resistant epilepsy? This question is divided into two parts. The first part addresses surgical interventions and the second addresses nondrug, nonsurgical interventions.

Surgical Interventions

In this section, we address the efficacy of surgical intervention when treatment with AEDs has failed to produce adequate seizure control. Patients who receive surgery have been determined to be treatment-resistant as part of their presurgical evaluation. For most patients, only a single surgical option will be available due to the nature and location of the lesion or condition responsible for generating their seizures. In patients undergoing temporal lobectomy, hemispherectomy, or corpus callosotomy, some variations in procedures are available.

A list of the specific surgical interventions and outcome measures addressed under surgical interventions is presented in the following section.

Question specific inclusion criteria

We included articles if they met the general inclusion criteria detailed in the Methodology section, and if they met the following question-specific criteria:

  1. All seizure frequency outcomes were reported before and 2 or more years after surgery, except for studies of multiple subpial transection (MST). This followup period was recommended by the Expert Panel and Technical Experts, who noted that because surgery is irreversible, relatively long-term data are of primary interest. However, because MST is a relatively new surgical procedure with a limited reference base, a minimum of 6 months followup was used to increase the size of the evidence base for this procedure.

  2. The study was published in 1985 or later. This cutoff was used because the Expert Panel and Technical Experts noted that surgical treatments for epilepsy have substantially changed since this date.

  3. One of the following specific interventions, as recommended by the Expert Panel and Technical Experts, was examined:

    1. Anterior temporal lobe resection

    2. Frontal lobe resection

    3. Parietal lobe resection

    4. Occipital lobe resection

    5. Cerebral hemispherectomy

    6. Corpus callosotomy

    7. MST separate from or in combination with other resections

Number of studies addressing surgical intervention

The order of the material presented in this section differs from that presented in the discussion of other interventions. This change in organization was necessary because we required a minimum 2-year followup period for most outcome measures used to evaluate surgery. The only exceptions were outcomes for mood (depression and psychosis), cognitive function (IQ and memory), and complications and mortality related to surgery. We shortened the minimum required followup time for these outcomes because they may manifest themselves relatively early after surgery.

Different studies make up the evidence base for each outcome. We will separately discuss each outcome and its specific evidence base under each surgical intervention rather than examining all of the studies in the evidence base for a single intervention. This section, number of studies addressing surgical intervention, presents an overview of all of the studies meeting our inclusion criteria for each surgical intervention examined under Key Question #5. The actual evidence base for each intervention and outcome will be discussed separately later in this report.

Table 17. Epilepsy surgery studies
InterventionTotal Number of Studiesa
Temporal Lobe Surgery105
Corpus Callosotomy26
Frontal Lobe Surgery18
Hemispherectomy11
Multiple Subpial Transection10
Parietal Lobe Surgery2
Occipital Lobe Surgery2
Surgical Controls12
a

Seven studies reported on more than one surgery category and are therefore double counted in this table

One hundred and seventy-nine studies met our inclusion criteria for surgical intervention. We provide a listing of each study meeting the inclusion criteria for each surgical intervention in Evidence Table 102 and a summary in Table 17. Only two studies each were found to meet our inclusion criteria for parietal lobe and occipital lobe surgery for the treatment of epilepsy. Consequently, we did not assess these interventions.

Evidence Tables 103 to 108 provide general information on each of the studies examined in this report organized into tables according to surgical intervention or reporting of control patients. The information in these tables includes the years during which the studies were conducted, the country in which the study was conducted, the primary center where the surgery was performed, if the study was conducted in multiple centers, whether patients were selected retrospectively or prospectively, and the study design.

Temporal Lobe Surgery

Temporal lobe surgery is intended to eliminate complex partial seizures by removing the lesion or epileptogenic area responsible for the development of these seizures. Complex partial seizures with or without secondary generalization are the most common seizure type associated with temporal lobe epilepsy.139 The second most common seizure type is a simple partial seizure, which is commonly experienced as the patient's typical aura.

Temporal lobe surgery candidates constitute the largest group of epilepsy surgery patients.140 Preoperative evaluation determines the type of lesion (tumor, vascular malformation, mesial temporal sclerosis, or other known or unknown etiology). The actual procedure depends on the location of the lesion (deep or superficial) and the extent to which tissue is to be removed.141–143 An en bloc anterior temporal lobectomy is a standardized operative procedure in which 4.5 to 5.0 cm of the anterior lateral temporal lobe neocortex is removed along with the amygdala, the anterior aspect of the parahippocampal gyrus, and the hippocampus in the medial portion of the temporal lobe. Neocortical lesionectomy is used when the lesion, usually a tumor or vascular malformation, is contained entirely in the neocortex of the temporal lobe. Selective amygdalohippocampectomy (AH) involves the removal of the amygdala and hippocampus only. Intraoperative EEG readings may be used to “tailor” the extent of tissue resection by defining a zone of frequent interictal spiking. The use of this technique may result in more or less tissue being removed compared to the “standard” approach. Another modification to the standard approach is to remove less than 4.5 cm of the anterior temporal lobe and is referred to as “partial” resection. The Evidence Tables pertaining to temporal lobe surgery will refer to these procedures as standard, tailored, partial, amygdalohippocampectomy, and neocortex.

Seizure-free

Several outcome measurements examined in other questions of this report, such as changes in the proportion of patients experiencing at least a 50 percent reduction in seizure frequency, are not included in our examination of surgical intervention because they are rarely (if ever) reported in studies of epilepsy surgery.

Excluded studies

We excluded one study of temporal lobe surgery reporting seizure-free outcome measures from the evidence base because of poor quality. This study and the reason for its exclusion are listed in Evidence Table 109.

Evidence base

Among the 105 studies of temporal lobe surgery meeting our inclusion criteria, 73 reported some sort of seizure-free outcome measurement. Studies of temporal lobe surgery used four different outcome measurements when reporting a patient as “seizure-free.” Each outcome measurement results in a different set of patients being considered “seizure-free” and therefore the data collected under each outcome measurement must be evaluated separately.

Table 18. Temporal lobe surgery: seizure-free outcome reporting
ReferenceUndefinedNo AurasWith AurasEngel Class I
Bouilleret (2002)165[check][check]
Alsaadi (2001)167[check]
Boling (2001)169[check]
Hennessy (2001)171[check]
Hennessy (2001)148[check]
Jan (2001)174[check]
Kanemoto (2001)176[check]
Schramm (2001)178[check]
Sotero de Menezes (2001)180[check][check]
Verma (2001)182[check]
Wilson (2001)184[check][check]
Dupont (2000)186[check][check]
Eberhardt (2000)188[check][check][check]
Foldvary (2000)190[check]
Holmes (2000)192[check]
Iannelli (2000)194[check]
Markand (2000)151[check][check][check]
Rao (2000)197[check]
Robinson (2000)199[check]
Assaf (1999)201[check]
Eriksson (1999)203[check]
Henry (1999)204[check]
Holmes (1999)206[check]
Mathern (1999)208[check]
Mitchell (1999)210[check]
Rossi (1999)212[check]
Salanova (1999)214[check]
Son (1999)166[check]
Maher (1998)168[check][check]
Radhakrishnan (1998)170[check]
Szabo (1998)172[check][check]
Bizzi (1997)173[check]
Cappabianca (1997)175[check]
Casazza (1997)177[check][check]
Ho (1997)179[check]
Keene (1997)181[check]
Kilpatrick (1997)183[check][check]
McLachlan (1997)185[check]
Schwartz (1997)187[check][check]
Silander (1997)189[check]
Sisodiya (1997)191[check][check][check]
Adam (1996)193[check][check][check]
Goldstein (1996)195[check]
Holmes (1996)196[check]
Sirven (1996)198[check]
Acciarri (1995)200[check]
Berkovic (1995)202[check]
Davies (1995)163[check]
Jooma (1995)205[check]
Jooma (1995)207[check]
Liu (1995)209[check]
Renowden (1995)211[check]
Thadani (1995)213[check]
Vossler (1995)215[check]
Blume (1994)146[check][check]
Guldvog (1994)160[check]
Guldvog (1994)161[check]
Berkovic (1991)216[check]
Hopkins (1991)217[check]
Rasmussen (1991)218[check]
Wieser (1991)219[check]
Bidzinski (1990)162[check]
Mizrahi (1990)152[check]
Walczak (1990)164[check]
Yeh (1990)220[check]
Sperling (1989)221[check]
Estes (1988)222[check]
Bladin (1987)223[check]
Cutfield (1987)147[check]
Drake (1987)224[check]
Lieb (1986)225[check]
Meyer (1986)226[check][check]
Delgado-Escueta (1985)153[check][check][check]
The most often used outcome measurement among the 73 studies in our evidence base was Engel class I, which was reported in 33 studies (Table 18). Engel class I is part of a four-part system for evaluating the success of surgery in patients with epilepsy.144 In this class, patients are considered “seizure-free” if they fit into one of four categories. The four categories are completely seizure-free since surgery (free of both complex and simple partial seizures); aura only since surgery (the patient is free of complex partial seizures but still has simple partial seizures); some seizures after surgery, but seizure-free for at least 2 years; and atypical generalized convulsions with AED withdrawal only.

The other three outcome measurements for “seizure-free” all assume that patients are free of complex partial seizures at the time of examination, but differ on whether they consider a patient “seizure-free” if they still have simple partial seizures (auras). Twenty studies specifically considered patients as “seizure-free” if they were free of both complex partial seizures and simple partial seizures (Table 18). In this report, we will refer to this group of patients as seizure-free with no auras. Twenty-six studies specifically considered patients as “seizure-free” if they were free of complex partial seizures, but patients could still have simple partial seizures and be considered “seizure-free” (Table 18). Therefore, this outcome measurement combines patients who are free of both complex and simple partial seizures with patients who are free of complex partial seizures but still have auras. In this report, we will refer to this group of patients as seizure-free with auras. Studies using the fourth outcome measurement, “seizure-free” did not state whether such patients experienced auras. Sixteen studies used this outcome measurement (Table 18). Since these studies do not report if their “seizure-free” patients do or do not have auras, these studies are probably a combination of studies reporting seizure-free with no auras and studies reporting seizure-free with auras. In this report, we will refer to this group of patients as seizure-free undefined.

Studies using Engel class I, which has the least restrictive means of determining if a patient is seizure-free, may be expected to report the largest percentage of seizure-free patients. Seizure-free with auras is similar to Engel class I, but somewhat more restrictive. Studies using this outcome measurement may be expected to report slightly fewer patients as seizure-free compared to Engel class I. Seizure-free with no auras is the most restrictive, and studies using this outcome measurement may be expected to report the smallest percentage of seizure-free patients. Because studies using seizure-free undefined may be a combination of studies using seizure-free with no auras and seizure-free with auras, these studies may be expected to report a percentage of seizure-free patients somewhere between studies reporting seizure-free no auras and studies reporting seizure-free with auras.

The 73 studies of temporal lobe surgery examined 3,978 patients. Twenty studies with 734 patients reported seizure-free with no auras, 26 studies with 1,396 patients reported seizure-free with auras, 16 studies with 977 patients reported seizure-free undefined and 33 studies with 1,549 patients reported Engel class I. If a study reported separate outcome and patient information according to a specific age group, type of surgery, or pathology, these data are presented separately in Evidence Table 110 and are considered a separate study in any analysis. Sixteen studies reported more than one of the four seizure-free categories, but no studies reporting seizure-free as undefined with respect to auras also reported one of the other categories. Of the studies that reported more than one outcome, five studies reported seizure-free with no auras, seizure-free with auras, and Engel class I, nine studies reported seizure-free with no auras and seizure-free with auras, and twelve studies reported Engel class I with either seizure-free with no auras or seizure-free with auras.

Table 19. Temporal lobe surgery: seizure-free outcome reporting in “control” patients
Seizure-Free Outcome Measurements
ReferenceUndefinedNo AurasWith AurasEngel Class I
Bauer (2001)227[check]
Kumlien (2001)228[check]
Wiebe (2001)145[check][check]
Markand (2000)151[check]
Holmes (1998)229[check]
Wolf (1998)230[check][check][check]
McLachlan (1997)185[check]
Hermanns (1996)231[check]
Vickrey (1995)232[check][check]
Guldvog (1991)233Reported changes in seizure frequency only
Huttenlocher (1990)234[check]
Harbord (1987)235[check]
In addition to the studies of temporal lobe surgery, our evidence base also includes 12 studies that report seizure frequency outcome measurements for a total of 749 surgery “control” patients. Table 19 presents a listing of the seizure-free categories used by each of these studies. Seven reported seizure-free without reference to auras, three studies reported both seizure-free with no auras and seizure-free with auras, one study reported only seizure-free with no auras, and a single study reported Engel class I along with seizure-free with no auras and seizure-free with auras.

Design and conduct of included studies

Once a patient has been identified as a suitable candidate for surgery, withholding surgery may be considered unethical. Consequently, the literature on surgical interventions consists mainly of uncontrolled trials in which all patients receive a single treatment and patients are not randomized to a nonsurgery group or a group receiving an alternative treatment approach. These studies generally do not provide a control group against which to evaluate the efficacy of surgery. The remainder of this section presents an assessment of the quality of the evidence base used to draw conclusions about the effectiveness of temporal lobe surgery in patients with treatment-resistant epilepsy. Our assessment consists of an appraisal of each study's internal and external validity.

Internal validity

Table 20. Temporal lobe surgery: study designs
Study designs for studies of temporal lobe surgery reporting seizure-free outcomes
Prospective Nested Case-controlled StudiesaRetrospective Nested Case-controlled StudiesProspective Case Series StudiesRetrospective Case Series Studies
530533
a

Nested case-controlled studies are defined as any study reporting patient characteristics (age at treatment, age at seizure onset, duration of epilepsy prior to treatment, etc.) separately for patients with good outcomes (seizure-free, Engel Class I, etc.) and patients with poor outcomes. Nested case-controlled studies are also considered case series studies because all patients received the same treatment.

Internal validity refers to the strength of the presumed causal relationship between the intervention and the outcome of interest.21 For studies of surgical intervention in treatment-resistant epilepsy, one presumed relationship is between the surgical removal of tissue and changes in posttreatment seizure frequency. Table 20 lists the study designs in the evidence base for seizure-free outcome measurements after temporal lobe surgery. These studies are exclusively case series. Case series have a number of biases that can weaken the internal validity of a study. These biases can be ruled out if they are considered implausible in the particular context of a given study or they are plausible but did not actually occur.21 Specific aspects of internal validity are discussed in the Methodology section of this document.

All of the studies discussed in this section on seizure frequency outcomes potentially have the following biases: extraneous event bias, investigator reporting bias, and patient reporting bias. Attrition bias and maturation bias are of specific importance to studies of surgery.

Attrition bias refers to the loss of patients, for any reason, before the minimum 2-year followup period. All studies with retrospective patient enrollment have this bias because they only record outcomes for patients with the minimum 2-year followup period. Only 10 of the 73 studies of seizure frequency outcomes had prospective as opposed to retrospective patient enrollment. The effect of attrition bias in the surgical studies considered in this report was limited by the requirement that studies report consecutive patients.

Maturation bias refers to individuals who received surgery but would have eventually “outgrown” the disease without surgical intervention. This seems implausible since surgery candidates often wait for more than a year before undergoing surgery and individuals may wait on average for 20 years from the onset of seizures before considering a surgical option.145 A randomized controlled trial of temporal lobe surgery reported that 8 percent of control patients became free of complex partial seizures during a 1-year waiting period prior to surgery.145 This finding suggests that maturation may occur, but that it affects only a small proportion of surgical patients.

External validity

As previously discussed, candidates for epilepsy surgery must complete an extensive presurgical evaluation to determine their suitability for surgery. Patients with temporal lobe epilepsy usually have a specific focal lesion and experience complex partial seizures with or without secondary generalized seizures.139 Therefore, the patients in published studies of surgery for temporal lobe epilepsy should be representative of all patients considering this surgery. However, many publications of epilepsy surgery select a specific patient population based on age or pathology, or use only one variation of a surgical technique. The results of these studies may or may not be generalizable to all temporal lobe surgery patients.

The specific patient characteristics of temporal lobe surgery patients reported in each publication in the evidence base for seizure-free outcomes are presented in Evidence Table 110. Age at surgery, age at seizure onset, and duration of epilepsy prior to surgery are commonly reported patient characteristics.

Among the 21 studies reporting seizure-free with no auras, 20 reported a mean age at surgery. The mean age at surgery in these studies varied from 9.4 years to 35 years, with only two studies having a mean less than 20 years of age. The range for age at surgery varied from 3 years to 62 years of age. Two studies examined only patients who were less than 20 years of age. Age at seizure onset was reported in 10 studies. The mean age at onset in these studies varied from 4 to 21 years of age with a range of less than a year to 44 years of age. Duration of epilepsy prior to surgery was reported in 11 studies. The mean duration varied from 6 to 19 years and the range varied from 1 to 45 years.

The patient characteristics for studies reporting seizure-free with auras were similar to the studies reporting seizure-free with no auras. Among the 26 studies reporting seizure-free with auras, 23 reported a mean age at surgery. The mean age at surgery varied from 8.3 years to 37 years with five studies having a mean less than 20 years of age. The range for age at surgery varied from 1 year to 86 years of age. Four studies examined only patients who were less than 20 years of age. Age at seizure onset was reported in 15 studies. The mean age at onset varied from 1 to 25 years of age with a range of less than a year to 62 years of age. Duration of epilepsy prior to surgery was reported in 11 studies. The mean duration varied from 5 to 26 years and the range varied from less than a year to 81 years.

Based on the distribution of patient characteristics, this evidence base seems to be generalizable to temporal lobe surgery patients in clinical practice.

Synthesis of study results

We will separately discuss each of the four “seizure-free” outcome measurements because, as mentioned above, each outcome measurement refers to a different group of “seizure-free” patients. We begin our analysis with studies reporting patients as seizure-free with no auras. This is the most restrictive group, but the ultimate goal of surgery is to be completely seizure-free. Next, we analyze studies reporting patients as seizure-free with auras. This patient population is free of complex partial seizures. Our analysis of the studies reporting Engel class I follows our analysis of the more restrictive “seizure-free” outcome measurements. Studies that did not report if auras were considered in their calculation of the number of patients who were seizure-free after surgery are analyzed last.

Meta-analytic threshold analysis of studies reporting seizure-free with no auras

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf35.jpg.

   Figure 35. Forest plot: temporal lobe surgery and seizure-free with no auras

A scale is not shown because the effect sizes were not calculated with actual control groups

Evidence Table 111 presents the actual patient counts, percentages, and calculated effect sizes for each study used in this analysis. The individual study effect sizes (Cohen's h) presented in this Evidence Table were based on no patients in a synthetic control group becoming seizure-free with no auras. Figure 35 presents a forest plot of these effect sizes to show the extent of variation between studies, but no scale is provided because these effect sizes were not calculated using actual control groups.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf36.jpg.

   Figure 36. Threshold analysis: temporal lobe surgery and seizure-free with no auras

The results of our threshold analysis of studies reporting seizure-free with no auras appear in Figure 36. Each summary estimate in the threshold analysis is based on Cohen's h. The summary estimate calculated at the 0 percent point (no patients in a synthetic control group became seizure-free with no auras) was 1.67 (CI: 1.57 to 1.77, p <0.000001) and corresponded to 55 percent (CI: 50 percent to 60 percent)w of patients becoming completely seizure-free after surgery. The summary estimate became nonsignificant (no statistically significant difference between surgery and control patients in the number of patients becoming seizure-free) when the proportion of patients in the synthetic control group reached 50 percent. There was no statistically significant heterogeneity among the studies in the threshold analysis (Q = 11.9, p = 0.92).

This analysis suggests that, after temporal lobe surgery, approximately 55 percent of patients will be completely seizure-free. However, this calculation was based on no patients in similar studies becoming seizure-free without surgery, so it does not estimate the percentage of patients who become seizure-free because of surgery. Some patients may become seizure-free without surgery. Readers are asked to consider the plausibility of 50 percent of temporal lobe epilepsy patients becoming completely seizure-free without benefit of surgery. They should also consider the above-noted difficulties with the internal validity of these studies, difficulties that could cause the threshold to decrease.

Meta-analytic threshold analysis of studies reporting seizure-free with auras

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf37.jpg.

   Figure 37. Forest plot: temporal lobe surgery and seizure-free with auras

A scale is not shown because the effect sizes were not calculated with actual control groups

MTS = Patients with mesial temporal sclerosis

Evidence Table 112 presents the actual patient counts, percentages, and calculated effect sizes for each study used in this analysis. The individual study effect sizes (Cohen's h) presented in the Evidence Table were based on no patients in a synthetic control group becoming seizure-free with auras. Figure 37 presents a forest plot of these effect sizes to show the extent of variation between studies, but no scale is provided because these effect sizes were not calculated using actual control groups.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf38.jpg.

   Figure 38. Threshold analysis: temporal lobe surgery and seizure-free with auras

The results of our threshold analysis of studies reporting seizure-free with auras appear in Figure 38. The summary estimate calculated at the 0 percent point was 1.95 (CI: 1.87 to 2.02, p <0.000001) and corresponded to 68 percent (CI: 65 percent to 72 percent) of patients becoming free of complex partial seizures after surgery. The summary estimate became nonsignificant when the proportion of patients in the synthetic control group reached 65 percent. There was no statistically significant heterogeneity among the studies in the threshold analysis (Q = 24.2, p = 0.57).

This analysis suggests that, after temporal lobe surgery, approximately 68 percent of patients will be free of complex partial seizures (some patients may still have auras). However, this calculation was based on no patients becoming seizure-free without surgery, so it does not estimate net health benefit of surgery. Some patients may become seizure-free without surgery. The threshold analysis suggests that approximately 65 percent of patients in similarly designed studies would have to become seizure-free without surgery before surgery could be considered ineffective. Readers are asked to consider the plausibility of temporal lobe epilepsy patients achieving this threshold level without benefit of surgery. They should also consider the above-noted difficulties with the internal validity of these studies, difficulties that could cause the threshold to decrease.

To evaluate the plausibility of these threshold levels occurring among surgical candidates who do not receive surgery, we examined seizure rates in the available literature on such patients. Of the twelve studies reporting seizure-free outcome measurements for surgery control patients, only three reported both seizure-free with no auras and seizure-free with auras (Evidence Table 113). An additional study reported just seizure-free with no auras. Estimates of the percentage of control patients likely to become seizure-free with no auras varied from 0 percent to 20 percent. The estimates for seizure-free with auras varied from 7.5 percent to 27 percent. These differences in seizure rates are most likely due to differences in the patients considered in each study. Patients may have refused surgery or were considered unsuitable for surgery and then were reported as “control” patients. Several studies did not report the reasons why patients did not receive surgery (Evidence Table 114). Therefore, although these data suggest that temporal lobe surgery is effective, the patients in these studies may not be comparable to the surgical patients from the studies used in our meta-analysis.

Comparison of meta-analytic threshold results to findings of a randomized controlled trial of temporal lobe surgery

An RCT conducted by Wiebe, Blume, Girvin, et al.145 at the London Health Sciences Center at the University of Western Ontario examined seizure-free outcomes in patients who were randomized to temporal lobe surgery or required to wait 1 year before receiving surgery. A 1-year wait before undergoing preoperative investigations is routine practice at this institution. Therefore, randomizing patients to a wait list of 1 year was considered ethical. Patients were older than 16 years of age and continued to have at least monthly seizures despite the use of one or more AEDs. Patients randomized to surgery underwent a standard anterior temporal lobectomy. All patients were evaluated every 3 months for 1 year, and two epileptologists who were blinded to the identity of the patients and their treatment groups judged the adequacy of treatment through written clinical information.

Both seizure-free with no auras and seizure-free with auras were used to define the seizure-free status of the patients in this study. This study was not included in our analyzes of seizure-free data because the followup period was only 1 year. In a group of 40 control patients, one patient became seizure-free with no auras and two additional patients became seizure-free with auras for a total of three seizure-free patients (7.5 percent). Based on this study's findings, the synthetic control group levels of 50 percent and 65 percent needed to overturn the results of our threshold analysis seem unlikely to be achieved in a clinical setting.

Wiebe, Blume, Girvin, et al.145 reported that among the 40 surgery patients 38 percent were completely free of seizures and 58 percent were free of seizures impairing awareness (seizure-free with or without auras). These results are somewhat lower than our meta-analytic estimates of 55 percent (CI: 50 percent to 60 percent) and 68 percent (CI: 65 percent to 72 percent), respectively, based on studies with a minimum 2 year followup. These results do fit within the range of results reported for studies that were included in the analysis (Evidence Table 111 and 112).

Factors that may influence seizure-free outcomes

The lack of statistically significant heterogeneity among the effect sizes in the studies reporting seizure-free no auras and seizure-free with auras indicates that several covariates, such as the surgical procedures, country where the study was performed, and specific pathology reported by each study, did not have large influences on the success of surgery.x For example, if tailored temporal lobectomy had produced many more completely seizure-free patients compared to standard temporal lobectomy, then our meta-analysis of studies reporting seizure-free no auras would have shown significant heterogeneity. This was not the case. The same can be said for studies examining only specific pathologies. We did not find that studies examining only patients with mesial temporal sclerosis, tumors, or vascular malformations had differing effect sizes. However, during the original organization of this project, the Expert Panel expressed an interest in knowing if certain study level factors influenced surgical outcomes. Therefore, we regrouped studies according to specific covariates (United States versus other countries, studies of mesial temporal sclerosis only versus studies examining various pathologies, and studies of standard temporal lobectomy versus studies of tailored temporal lobectomy versus studies of other surgical procedures). Evidence Table 115 and 116 show the summary effect size estimates based on seizure-free with no auras and seizure-free with aura outcome measurements, respectively. The recalculated summary estimates showed no statistically significant effect of any of these covariates.

Meta-analytic threshold analysis of studies reporting Engel class I

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf39.jpg.

   Figure 39. Forest plot: temporal lobe surgery and Engel Class I

A scale is not shown because the effect sizes were not calculated with actual control groups

Evidence Table 117 presents the actual patient counts, percentages, and calculated effect sizes for each study in this analysis. The individual study effect sizes (Cohen's h) presented in the Evidence Table were based on no patients in a synthetic control group achieving Engel class I. Figure 39 presents a forest plot of these effect sizes to show the extent of variation between studies, but no scale is provided because these effect sizes were not calculated using actual control groups.

Our threshold analysis of studies reporting Engel class I found statistically significant heterogeneity among the effect sizes indicating a large amount of variation among study results (Q = 77.7, p = 0.00002). Therefore, the summary estimates in any threshold analysis of these data were not calculated. Rather, we sought to “explain” the source(s) of heterogeneity using meta-regression.

Despite the heterogeneity, all of the effect sizes (based on a Cohen's h with no control patients achieving Engel class I) in these studies were statistically significant. Therefore, these studies indicate that temporal lobe surgery is effective in producing seizure-free patients. The heterogeneity prevents an accurate estimation of the overall percentage of patients likely to achieve Engel class I status after surgery.

Meta-regression. In our meta-regression of the 33 studies reporting Engel class I, we again computed Cohen's h assuming a synthetic control group that did not experience any changes in the outcome of interest. Our prior analysis of studies reporting seizure-free no auras and seizure-free with auras suggested that the type of surgical procedures used in each study and the pathology examined in each study does not influence the estimate of the number of patients likely to become seizure-free. Therefore, we did not enter surgical procedures or pathology into this meta-regression. We instead looked for sources of heterogeneity due to differences in usage of the Engel classification system between countries and possible shifts in usage over time. Usage refers to differences in the interpretation of which patients belong in Engel class I. We entered into the meta-regression whether the study was performed in the United States, the year the study started, and the year the study ended. The data used in the meta-regression is presented in Evidence Table 118 and the results of the meta-regression appear in Evidence Table 119.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf40.jpg.

   Figure 40. Meta-regression: temporal lobe surgery and Engel class I

None of the three variables in our meta-regression explained the heterogeneity when used in one-, two-, or three-predictor models. Figure 40 graphically presents the results of the meta-regression. The dotted line on the graph represents the level of reduction in heterogeneity needed to obtain a statistically insignificant QE in any of the models. The meta-regressions failed to reach or pass this line. Therefore, the heterogeneity among studies using Engel class I is not explained by differences in usage between the United States and other countries or due to shifts in usage over time. Consequently, a summary estimate that is adjusted for the sources of heterogeneity among study results could not be derived, and there is no ready explanation for why the results of these studies differ.

Meta-analytic threshold analysis of studies reporting seizure-free undefined

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf41.jpg.

   Figure 41. Forest plot: temporal lobe surgery and seizure-free undefined

A scale is not shown because the effect sizes were not calculated with actual control groups

Evidence Table 120 presents the actual patient counts, percentages, and calculated effect sizes for each study in this analysis. The individual study effect sizes (Cohen's h) presented in the Evidence Table were based on no patients in a synthetic control group becoming seizure-free undefined. Figure 41 presents a forest plot of these effect sizes to show the extent of variation between studies, but no scale is provided because these effect sizes were not calculated using actual control groups.

Our threshold analysis of studies reporting seizure-free undefined found statistically significant heterogeneity among the effect sizes (Q = 43.4, p = 0.00002). Therefore, the summary estimates in any threshold analysis of these data were not calculated. Rather, we sought to “explain” the source(s) of heterogeneity using meta-regression.

Despite the heterogeneity, all of the effect sizes calculated from studies reporting seizure-free undefined were statistically significant. Therefore, these studies indicate that temporal lobe surgery is effective in producing seizure-free patients. The heterogeneity prevents an accurate estimation of the overall percentage of patients likely to achieve seizure-free status after surgery.

Meta-regression. In our meta-regression of the 16 studies reporting seizure-free undefined, we computed Cohen's h again assuming a synthetic control group that did not experience any changes in the outcome of interest. Our prior analysis of studies reporting seizure-free no auras and seizure-free with auras indicates that the type of surgical procedures used in each study and the pathology examined in each study did not influence the estimate of the number of patients who were likely to become seizure-free. Therefore, we did not enter surgical procedures or pathology into this meta-regression. Since this outcome is probably a combination of patients who are seizure-free no auras and seizure-free with auras, the heterogeneity is most likely due to differences in usage between studies. We therefore looked for sources of heterogeneity due to differences in usage between countries and possible shifts in usage over time. We entered into the meta-regression whether the study was performed in the United States, the year the study started, and the year the study ended. The data used in the meta-regression is presented in Evidence Table 121 and the results of the meta-regression appear in Evidence Table 122.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf42.jpg.

   Figure 42. Meta-regression: temporal lobe surgery and seizure-free undefined

None of the three variables in our meta-regression explained the heterogeneity when used in one-, two-, or three-predictor models. Figure 42 graphically presents the results of the meta-regression. The dotted line on the graph represents the level of reduction in heterogeneity needed to obtain a statistically insignificant QE in any of the models. The meta-regressions failed to reach or pass this line. Therefore, the heterogeneity among studies using seizure-free undefined is not explained by differences in usage between the United States and other countries or due to shifts in usage over time. Consequently, a summary estimate that is adjusted for the sources of heterogeneity among study results could not be derived, and there is no ready explanation for why the results of these studies differ.

Analysis of nested case-control studies

Within any single study, seizure-free outcome measures may have been analyzed by the authors for variables that influenced the success of surgery. We term studies that reported these findings as nested case-control studies. Unlike actual controlled studies, no patients in these studies are untreated. Rather, following treatment, patients are divided into those with successful outcomes and those without, and then various patient characteristics or other variables are compared for differences between the successful patients and nonsuccessful patients. Variables commonly examined for their influence on surgical success are age at surgery, age at first seizure, duration of epilepsy prior to surgery, gender, location of surgery (left vs. right temporal lobe), and type of pathology. Evidence Tables 123 and 124 present the findings, both statistically significant and nonsignificant, reported by each of the nine nested case-control studies in our evidence base for seizure-free measurements. Nested case-control studies using multiple or logistic regression to control for covariates in their analysis provide a more reliable estimate of the correlation between surgical success and patient characteristics compared to studies using univariate approaches. For this reason, our evidence table listed whether a study used multiple regression or a univariate test (t-test or chi-square test) in their analysis. Among the nine studies, two reported using multiple regression (Blume, Desai, Girvin, et al.,146 and Cutfield and Wrightson147). Blume, Desai, Girvin, et al.,146 in a study of 125 patients, found that younger age at surgery favored outcomes that are more successful. Cutfield and Wrightson,147 in a study of 26 patients, did not find any patient characteristics that favored successful surgery. Only one of the seven studies using univariate procedures, Hennessy, Elwes, Honavar, et al.,148 also found that younger age significantly favored successful surgery.

Meta-analysis of patient characteristics

Dodrill, Van Belle, and Wilkus149 have pointed out that small sample sizes have lead to inconsistency in the conclusions reached about the significance of most variables believed to influence surgical outcomes. Therefore, many individual nested case-control studies may not be able to detect clinically meaningful effects.

Table 21. Temporal lobe surgery: individual patient data
Studies of temporal lobe surgery reporting individual patient data for patients with successful and nonsuccessful surgery
ReferenceAge at TreatmentAge at Seizure OnsetDuration of Epilepsy Prior to TreatmentGenderSide of SurgerySimple Partial SeizuresSecondarily Generalized Seizures
Bouilleret (2002)165[check][check][check][check][check][check]
Hennessy (2001)148[check][check][check][check]
Hennessy (2001)171[check]
Sotero de Menezes (2001)180[check][check]
Verma (2001)182[check][check][check][check][check]
Eberhardt (2000)188[check][check][check]
Holmes (1999)206[check][check][check]
Szabo (1998)172[check][check][check][check][check]
Kilpatrick (1997)183[check][check][check]
Schwartz (1997)187[check][check][check]
Sisodiya (1997)191[check][check][check]
Adam (1996)193[check][check]
Goldstein (1996)195[check]
Jooma (1995)205[check][check][check]
Liu (1995)209[check][check][check][check][check][check]
Vossler (1995)215[check][check][check][check][check]
Blume (1994)146[check]
Berkovic (1991)216[check][check][check][check][check][check][check]
Hopkins (1991)217[check][check][check]
Mizrahi (1990)152[check][check][check][check]
Yeh (1990)220[check][check][check][check][check][check][check]
Estes (1988)222[check]
Drake (1987)224[check][check][check][check][check][check]
Delgado-Escueta (1985)153[check][check][check][check][check][check][check]
To address this difficulty, we performed several separate meta-analyses. Table 21 presents a list of the 24 studies of temporal lobe surgery that provided data for these analyses. All of these studies were included in the previous meta-analyses examining the efficacy of surgery based on one of the four outcome measurements for reporting patients as seizure-free. At least five studies reported one or more of the following continuous variables: individual patient data for age at surgery, age at seizure onset, or duration of epilepsy prior to surgery. At least five studies separately reported one of the following dichotomous variables for patients who received successful and nonsuccessful surgery: the number of males versus female, left side surgeries versus right side surgeries, patients with simple partial seizures versus patients without simple partial seizures, or patients with secondarily generalized seizures versus patients without secondarily generalized seizures. Success was based on any of the four “seizure-free” outcome measurements.

We calculated a point-biserial correlation (rpb) from the individual patient data in each study reporting the age at surgery, age at seizure onset, and duration of epilepsy prior to surgery, and then combined these in a separate meta-analysis for each variable. The coefficient was calculated so that a positive correlation indicated that an older age or longer duration favored a successful outcome and a negative result indicated that a younger age or shorter duration favored a successful outcome. For the other patient characteristics, we calculated Cohen's h so that a positive effect size indicated that males, the left side, patients with simple partial seizures, or patients with secondarily generalized seizures had more successful surgery compared to females, the right side, patients without simple partial seizures, or patients without secondarily generalized seizures.

Our summary estimates are not adjusted for the influence of the other potentially important covariates in a study. An analysis using hierarchical modeling would be useful to search for factors that influence surgical outcomes by combining the patient-level data across studies, but such an analysis is beyond the scope of this report.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf43.jpg.

   Figure 43. Forest plot: temporal lobe surgery and patient age at surgery

Age at surgery. In our first meta-analysis of “predictors” of surgical success, we sought to determine whether different outcomes were obtained in patients of different ages at the time they receive surgery. Individual ages at surgery for patients with successful and nonsuccessful surgery were reported in 18 studies with 297 patients. Evidence Table 125 presents the definition used for successful surgery and the point-biserial correlation calculated in each of the 18 studies. Figure 43 presents a forest plot of the correlations. The meta-analysis produced a summary estimate that was not statistically significant (rpb = 0.02, CI: -0.11 to 0.14, p = 0.81) suggesting that age at surgery had no influence on the success of surgery in these studies. The effect sizes in this meta-analysis were not heterogeneous (Q = 10.7, p = 0.91).

We performed a sensitivity analysis to show that a single study did not have excessive influence over the results of the analysis. This ensures that our conclusion (no effect of age on success of surgery) cannot be overturned by the removal of just one study. The summary estimate and other statistics did not change because of the sensitivity analysis. The correlation between surgical success and age at surgery changed by no more than 0.02 due to removal of studies during the sensitivity analysis. The summary estimate remained statistically nonsignificant. The results of the sensitivity analysis and the original meta-analysis are presented in Evidence Table 126.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf44.jpg.

   Figure 44. Forest plot: temporal lobe surgery and patient age at onset of seizures

Age at seizure onset. In our second meta-analysis of “predictors” of surgical success, we sought to determine whether different outcomes were obtained in patients of different ages at seizure onset. Individual ages at seizure onset for patients with successful and nonsuccessful surgery were reported in 13 studies with 207 patients. Evidence Table 127 presents the definition used for successful surgery and the point-biserial correlation in each of the 13 studies. Figure 44 presents a forest plot of the correlations. The meta-analysis produced a summary estimate that was not statistically significant (rpb = -0.11, CI: -0.26 to 0.04, p = 0.16) suggesting that age at seizure onset had no influence on the success of surgery in these studies. The effect sizes in this meta-analysis were not heterogeneous (Q = 7.2, p = 0.89).

We performed a sensitivity analysis to ensure that a single study did not have excessive influence over the results of the analysis. The summary estimate and other statistics did not change because of the sensitivity analysis. The correlation coefficient changed by no more than 0.03 due to removal of studies during the sensitivity analysis. The summary estimate remained statistically nonsignificant. The results of the sensitivity analysis and the original meta-analysis are presented in Evidence Table 128.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf45.jpg.

   Figure 45. Forest plot: temporal lobe surgery and duration of epilepsy prior to surgery

Duration of epilepsy prior to surgery. In this meta-analysis, we sought to determine whether different outcomes were obtained in patients with different durations of epilepsy prior to the time they receive surgery. Individual durations of epilepsy prior to surgery for patients with successful and nonsuccessful surgery were reported in 12 studies with 192 patients. Evidence Table 129 presents the definition used for successful surgery and the point-biserial correlation in each of the 12 studies. Figure 45 presents a forest plot of the effect sizes. The meta-analysis produced a summary estimate that was not statistically significant (rpb = 0.15, CI: -0.01 to 0.30, p = 0.06) suggesting that duration of epilepsy prior to surgery did not influence the success of surgery in these studies. The effect sizes in this meta-analysis were not heterogeneous (Q = 15.9, p = 0.20).

We then performed sensitivity analyses on these results. When the study with the largest negative effect size (favors shorter duration of epilepsy) was removed, the summary estimate became statistically significant (rpb = 0.20, CI: 0.04 to 0.35, p = 0.02). The effect sizes remained homogenous when this study was removed (Q = 9.4, p = 0.58). Thus, without this study in the analysis, patients with a longer duration of epilepsy prior to surgery appear to have a slightly better chance of having successful surgery compared to patients with a shorter duration of epilepsy prior to surgery. The removal of other studies during the sensitivity analysis changed the correlation by no more than 0.03. Therefore, patients with a longer duration of epilepsy prior to surgery appear to have a tendency towards better outcomes after surgery, but this tendency is not robust. The results of the sensitivity analysis as well as the original meta-analysis are presented in Evidence Table 130.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf46.jpg.

   Figure 46. Forest plot: temporal lobe surgery and male and female patients

Studies reported the success of surgery among male and female patients

Gender. We next investigated whether a greater percentage of males compared to females had successful surgery. The number of male and female patients among patients with successful and nonsuccessful surgery was reported in 15 studies with 306 patients. Evidence Table 131 presents the individual number of male and female patients and the number of successful surgeries in each, the definition used for successful surgery, and the Cohen's h in each of the 15 studies. Figure 46 presents a forest plot of these effects. The meta-analysis produced a statistically significant Q statistic (27.9, p = 0.015), so the summary effect size is not meaningful. Two of the 15 studies showed a statistically significant increase in the number of female patients with successful outcomes compared to male patients. Of the remaining 13 studies, eight favored male patients and five favored female patients, although none of these studies showed a statistically significant difference.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf47.jpg.

   Figure 47. Meta-regression: temporal lobe surgery and male and female patients

To “explain” this heterogeneity, we performed 36 meta-regressions (see the Methodology section for a description of our approach to meta-regression). Of these, no one-predictor model explained the heterogeneity, and five two-predictor models did. No clear “best” model was obvious among these five models. Consequently, no obvious explanation for the variation among these studies is apparent, and why surgery is more or less successful in males compared to females in these studies is unclear. All of the study and patient characteristics used in our meta-regression are presented in Evidence Table 132. The meta-regressions are presented in Evidence Table 133 and Figure 47.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf48.jpg.

   FIgure 48. Forest plot: temporal lobe surgery and location of surgery

Studies reported the success of surgery among patients with left side and right side surgery

Location of surgery. In this meta-analysis, we sought to determine whether surgery was more successful in patients who had surgery in the left temporal lobe or the right temporal lobe. The percentage of left-sided and right-sided operations among patients with successful and nonsuccessful surgery was reported in 19 studies with 404 patients. Evidence Table 134 presents the number of left-sided and right-sided operations and the number of successful patients in each, the definition used for successful surgery, and the Cohen's h calculated in each of the 19 studies. Figure 48 presents a forest plot of these effects. The meta-analysis produced a summary estimate that was not statistically significant (-0.07, CI: -0.27 to 0.13, p = 0.49), suggesting that location of surgery had little or no influence on the success of surgery. The effect sizes in this meta-analysis were not heterogeneous (Q = 17.9, p = 0.46).

The summary estimate and other statistics did not change because of the sensitivity analysis. The back-transformed estimate for the difference between the percentage of left side surgery patients who achieved successful surgery and the percentage of right side surgery patients who achieved successful surgery was 0 regardless of the studies that were removed during the sensitivity analysis. The sensitivity analysis and the original meta-analysis are presented in Evidence Table 135.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf49.jpg.

   Figure 49. Forest plot: temporal lobe surgery and simple partial seizures

Studies reported the success of surgery in patients with and without simple partial seizures (SPS)

Simple partial seizures. We next compared surgical success rates in patients with simple partial seizures to success rates in patients without simple partial seizures. The number of patients with simple partial seizures among patients with successful and nonsuccessful surgery was reported in five studies with 131 patients. Evidence Table 136 presents the number of patients with and without simple partial, the number of successful surgeries in each, the definition used for successful surgery, and the Cohen's h calculated in each of the five studies. Figure 49 presents a forest plot of these effects. The meta-analysis produced a summary estimate that was not statistically significant (0.10, CI: -0.30 to 0.51, p = 0.62), suggesting that the presence of simple partial seizures had no influence on the success of surgery. The effect sizes in this meta-analysis were not heterogeneous (Q = 7.1, p = 0.13).

The summary estimate and other statistics showed only small changes because of the sensitivity analysis. The back-transformed estimates for the difference between the percentage of patients with simple partial seizures who achieved successful surgery and the percentage of patients without simple partial seizures who achieved successful surgery varied between -9 and 1 as studies were removed during the sensitivity analysis. The summary estimate did not become statistically significant when studies were removed. The sensitivity analysis and the original meta-analysis are presented in Evidence Table 137.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf50.jpg.

   Figure 50. Forest plot: temporal lobe surgery and secondarily generalized seizures

Studies reported the success of surgery among patients with and without secondarily generalized seizures (SGS)

MTS = Patients with mesial temporal sclerosis

Secondarily generalized seizures. In our final meta-analysis on characteristics that may “predict” successful temporal lobe surgery, we examined whether patients with secondarily generalized seizures had different outcomes compared to patients without secondarily generalized seizures. The number of patients with or without secondarily generalized seizures among patients with successful and nonsuccessful surgery was reported in seven studies with 256 patients. Evidence Table 138 presents the individual number of patients with and without secondarily generalized seizures and the number of successful surgeries in each, the definition used for successful surgery, and the Cohen's h calculated in each of the seven studies. Figure 50 presents a forest plot of these effects. The meta-analysis produced a statistically significant Q statistic (31.8, p = 0.00002) so the summary effect size was not meaningful.

Two studies reported that patients without secondarily generalized seizures have better outcomes, one study reported that patients with secondarily generalized seizures have better outcomes, and four studies reported no differences in outcomes between patients with or without secondarily generalized seizures.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf51.jpg.

   Figure 51. Meta-regression: temporal lobe surgery and secondarily generalized seizures

To “explain” this heterogeneity, we performed 51 meta-regressions. Of these, no models explained the heterogeneity. Consequently, no obvious reason is apparent for why some studies had different results compared to other studies, and whether surgery is more or less successful in patients with secondarily generalized seizures is unclear. All of the study and patient characteristics used in our meta-regression are presented in Evidence Table 139. The results of the meta-regression are presented in Evidence Table 140 and Figure 51.

Quality of Life Outcome Measurements
Evidence base

The Epilepsy Surgery Inventory global score was reported in one study with 47 patients150 and the Quality of Life in Epilepsy global score was reported in one study with 90 patients.151

Design and conduct of included studies

All of the previously discussed biases about the internal validity of studies that reported seizure-free outcome measures potentially apply to the studies that reported quality of life outcome measurements.

Synthesis of study results

Evidence Table 141 presents a summary of the findings in each of the studies reporting quality of life measurements. No statistically significant change was found between the baseline Epilepsy Surgery Inventory overall score and the overall score 2 years after surgery.150 However, the authors did report that patients with low baseline scores showed the greatest improvement after surgery. This suggests the presence of regression to the mean. Patients who received surgery did show a statistically significant improvement in the Quality of Life in Epilepsy global score 2 years after surgery both compared to baseline and a control group of patients.151 The entire improvement in global score was contributed by patients who became completely seizure-free. Once again, though, regression to the mean cannot be ruled out as an explanation for these results.

Employment Outcome Measurements
Evidence base

Table 22. Temporal lobe surgery: employment studies
ReferenceNumber of PatientsCountryYears Study ConductedMean Age at SurgeryYoungest PatientOldest Patient
Boling (2001)16918Canada1981-1999545064
Reeves (1997)154134United States1988-199131
Sperling (1995)23673United States1986-199033.2
Mizrahi (1990)15222United States1980-198621736
Delgado-Escueta (1985)15315United States1972-198326.51239
Among the 105 studies of temporal lobe surgery in our evidence base, five reported some form of employment data that met our inclusion criteria. Studies must have reported the number of patients not able to obtain work prior to surgery and the number of patients able to obtain work after surgery, or must have reported the number of patients working prior to surgery and the number of patients not able to remain at work after surgery. The five studies had 318 patients. Four of the studies were conducted in the United States and the fifth study was from Canada. Table 22 presents a listing of the five studies reporting employment data.

Design and conduct of included studies

All of the previously discussed biases about the internal validity of studies that reported seizure-free outcome measures potentially apply to the studies that reported employment outcome measurements. In particular, the lack of a precise definition of who is employed may lead to inconsistencies in the reporting of this outcome.

Synthesis of study results

Although each of the five included studies evaluated more than 10 patients, in three studies fewer than 10 patients were reported to be in the “not able to obtain work prior to surgery” category or in the “working prior to surgery” category (Evidence Table 142). The other patients in the study were not actively seeking employment, were of preschool age or in school. Therefore, we did not perform a meta-analysis of these data. The studies do show that some patients who were unable to obtain employment prior to surgery do find employment after surgery. In the two studies with more than 20 patients unable to obtain work prior to surgery, 7 out of 20 patients and 15 out of 28 patients were able to obtain employment after surgery. In two studies with more than 30 patients, 57 out of 67 patients and 30 out of 33 patients working prior to surgery were able to maintain employment after surgery. A third study with 13 patients showed that nine patients remained working after surgery. While 85 percent (96 out of 113) of the patients in these latter three studies were able to remain employed after surgery, 15 percent of the patients were not able to maintain their employment.

Education Outcome Measurements
Evidence base

Return to (or ability to remain in) school was reported in two studies with 37 patients, however only one study had more than 10 patients of school age.152, 153

Design and conduct of included studies

All of the previously discussed cautions about the internal validity of studies that reported seizure-free outcome measures apply to the two studies that reported education outcome measurements.

Synthesis of study results

Evidence Table 143 presents a summary of the findings in the two studies reporting education outcome measurements. These studies reported that all patients attending school prior to surgery remained in school after surgery.152, 153

Ability to Obtain a Driver's License Outcome Measurements
Evidence base

Only Reeves, So, Evans et al.,154 who studied 134 patients, reported on the ability of patients to obtain a drivers license after surgery.

Design and conduct of included studies

All of the previously discussed cautions about the internal validity of studies that reported seizure-free outcome measures apply to this study.

Synthesis of study results

Evidence Table 144 presents a summary of the findings in the study reporting ability to obtain a driver's license. Surgery was reported to have produced a statistically significant increase in the number of patients able to drive.154

Mood Outcome Measurements - Depression

Epilepsy has been associated with an increased incidence and prevalence of behavioral disorders and in particular with anxiety and depression.4, 155 New cases of depression have been associated with temporal lobe surgery156 and the National Institutes of Health Consensus Development Conference Statement: Surgery for Epilepsy has recommended that symptoms of anxiety and depression be assessed following surgery.140 The following section evaluates studies that reported new cases of depression after temporal lobe surgery.

Evidence base

Table 23. Temporal lobe surgery: new cases of depression after surgery
ReferenceNumber of PatientsCountryYears Study ConductedNumber of Cases
Kanemoto (2001)17652Japan1987-19992
Kohler (2001)23758United States1986-19996
Nees (2001)23850England1992-199414
Wiebe (2001)14536Canada1996-20007
Anhoury (2000)239109England1988-199726
Derry (2000)24039Canada1996-19984
Altshuler (1999)15649United States1974-19905
Ring (1998)24160England1995-19967
Naylor (1994)24237Denmark1987-19912
Bladin (1992)243107Australia1975-19915
Among the 105 studies of temporal lobe surgery meeting our inclusion criteria, 10 reported whether their patients experienced new cases of depression after surgery. These patients had not been diagnosed with clinical depression prior to surgery. The 10 studies examined 597 patients. Table 23 lists the studies. Evidence Table 145 provides study information including the methods of diagnosis for depression reported in each study. Five studies used the Diagnostic and Statistical Manual of Mental Disorders, 3rd or 4th edition (DSM-III, IV) criteria, one study used the International Classification of Disease 10th revision (ICD-10), two studies used the Center for Epidemiological Studies-Depression Scale (CES-D), and two studies reported diagnose by a psychiatrist. Only the RCT by Wiebe, Blume, Girvin, et al.145 provided data on a control group comparable to the patients receiving surgery.

Although these 10 studies reported new cases of depression after surgery, they did not report the actual number of patients who were either clinically depressed or free of depression prior to surgery. Patients were not excluded from surgery for clinical depression in these studies. Therefore, our analysis uses the total number of patients receiving surgery rather than the actual number of patients free of depression prior to surgery.

Design and conduct of included studies
Internal validity

All but one of the 10 studies in the evidence base for new cases of depression are uncontrolled studies of case series design. Therefore, these uncontrolled studies have the same concerns with regard to internal validity as previously discussed with regard to seizure-free outcomes. Attrition bias may not be a major concern in these studies because all patients were examined during the relatively short followup periods (no more than 1 year).

Depression occurs in patients with epilepsy, both before and after surgery. Therefore, the lack of control patients in most of these studies prevents any determination of whether the effect of surgery is to increase or decrease the incidence of depression. The analysis of the studies can only provide an estimate of the number of patients likely to experience depression after surgery.

External validity

The specific patient characteristics of temporal lobe surgery patients reported in each study are presented in Evidence Table 146. The patients in these studies were between 20 and 50 years old at the time of surgery, the mean age of seizure onset was between 9 and 16 years of age, and the mean duration of epilepsy prior to surgery was approximately 18 years.

Based on the distribution of patient characteristics, this evidence base seems to be generalizable to temporal lobe surgery patients in clinical practice.

Synthesis of study results
Meta-analytic threshold analysis of depression outcome measurements

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf52.jpg.

   Figure 52. Forest plot: temporal lobe surgery and new cases of depression

A scale is not shown because the effect sizes were not calculated with actual control groups

Evidence Table 147 presents the actual patient counts, percentages, and calculated effect sizes for each study in this analysis. The individual study effect sizes (Cohen's h) presented in the Evidence Table were based on no patients in a synthetic control group becoming clinically depressed after surgery. Figure 52 presents a forest plot of these effect sizes to show the extent of variation between studies, but no scale is provided because these effect sizes were not calculated using actual control groups.

All of the studies reported a statistically significant occurrence of new cases with a range of 4 percent to 24 percent. Our threshold analysis of studies reporting new cases of depression found statistically significant heterogeneity among the study results (Q = 18.0, p = 0.035). Therefore, we did not compute the summary estimates in any threshold analysis of these data. Rather, we sought to “explain” the source(s) of heterogeneity using meta-regression.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf53.jpg.

   Figure 53. Meta-regression: temporal lobe surgery and new cases of depression

Meta-regression. To “explain” this heterogeneity, we performed 13 meta-regressions. Of these, no models explained the heterogeneity. Consequently, no obvious reason is apparent to explain why some studies reported more new cases of depression than other studies, and whether surgery is more or less responsible for new cases of depression is unclear. All of the study and patient characteristics used in our meta-regression are presented in Evidence Table 148. The results of the meta-regression are presented in Evidence Table 149 and Figure 53.

Because all but one study lacked a control group, these studies do not provide evidence that surgery was directly responsible for the new cases of depression or that surgery reduced the incidence of depression. This is highlighted by the results of the one RCT among these studies. Wiebe, Blume, Girvin, et al.,145 using the Center for Epidemiological Studies-Depression Scale, reported that 8 out of 40 control patients (20 percent) developed depression during the year preceding their surgical.

Mood Outcome Measurements - Psychosis

Besides depression, treatment-resistant epilepsy has been associated with a variety of psychiatric disorders.155 Surgery for treatment-resistant epilepsy may also have psychiatric consequences. The following section evaluates studies that reported new cases of psychotic disorders (primarily schizophrenia and bipolar disorder) after temporal lobe surgery.

Evidence base

Table 24. Temporal lobe surgery: new cases of psychosis after surgery
ReferenceNumber of PatientsCountryYears Study ConductedNumber of Cases
Kanemoto (2001)17652Japan1987-19997
Wiebe (2001)14536Canada1996-20001
Anhoury (2000)239109England1988-19973
Blumer (1998)24444United States1994-19952
Naylor (1994)24237Denmark1987-19910
Bladin (1992)243107Australia1975-19913
Among the 105 studies of temporal lobe surgery meeting our inclusion criteria, six reported whether their patients experienced new cases of psychosis after surgery. The six studies examined 385 patients. Four of the six studies are also part of the evidence base for depression discussed earlier. Table 24 lists the studies and Evidence Table 150 provides study information including the methods of diagnosis in each. Four of the studies reported using specific criterion, while two studies reported using evaluations by a psychiatrist only. Only the RCT by Wiebe, Blume, Girvin, et al.145 provided data from a control group.

Although these six studies reported new cases of psychosis after surgery, they did not report the actual number of patients who had a psychotic disorder or were free of psychotic disorders prior to surgery. Two of the studies excluded patients who had chronic psychosis and the remaining four studies did not exclude patients with psychiatric disorders. Therefore, our analysis uses the total number of patients receiving surgery rather the actual number of patients free of psychosis prior to surgery.

Design and conduct of included studies
Internal validity

All but one of the six studies in the evidence base for assessing new cases of psychosis are uncontrolled studies of case series design. Therefore, these studies have the same concerns with regard to internal validity as previously discussed with regard to seizure-free outcomes. In particular, variations in the use of any of the specific criteria, or variations in individual psychiatrists could lead to inconsistencies in the reporting of this outcome. Attrition bias is not a concern because all patients were examined after surgery.

Psychosis can occur in patients with epilepsy, both before and after surgery. Therefore, the lack of control patients in most of these studies prevents any determination of whether the effect of surgery is to increase or decrease the incidence of psychosis. Our analysis of these studies can only provide an estimate of the number of patients likely to experience psychosis after surgery.

External validity

The specific patient characteristics of temporal lobe surgery patients reported in each study are presented in Evidence Table 151. The patients in these studies were between approximately 20 to 40 years old at the time of surgery, the mean age of seizure onset was between 10 and 15 years of age, and the mean duration of epilepsy prior to surgery was approximately 18 years.

Based on the distribution of patient characteristics, this evidence base seems to be generalizable to temporal lobe surgery patients in clinical practice.

Synthesis of study results
Meta-analytic threshold analysis of psychosis outcome measurements

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf54.jpg.

   Figure 54. Forest plot: temporal lobe surgery and new cases of psychosis

Evidence Table 152 presents the actual patient counts, percentages, and calculated effect sizes for each study in this analysis. The individual study effect sizes (Cohen's h) presented in the Evidence Table were based on a control group in which no patients develop psychosis. Figure 54 presents a forest plot of these effect sizes to show the extent of variation between studies, but no scale is provided because these effect sizes were not calculated using actual control groups.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf55.jpg.

   Figure 55. Threshold analysis: temporal lobe surgery and new cases of psychosis

The results of our threshold analysis of studies reporting new cases of psychosis appear in Figure 55. Each summary estimate in the graph is Cohen's h. The summary estimate calculated at the 0 percent point (no patients in a synthetic control group developed psychosis after surgery) was 0.37 (CI: 0.23 to 0.51, p <0.000001) and corresponded to 3 percent (CI: 1 percent to 6 percent) of patients developing psychosis after surgery. The summary estimate became nonsignificant (no statistically significant difference between surgery and control) when the proportion of patients in the synthetic control group reached 2 percent. There was no statistically significant heterogeneity in the threshold analysis (Q = 6.5, p = 0.26).

Because all but one study lacked a control group, these studies do not provide evidence that surgery was directly responsible for the new cases of psychosis or that surgery caused an increase or decrease in the incidence of psychosis. This can be seen in the one RCT among these studies. Wiebe, Blume, Girvin, et al.145 reported that 1 out of 40 control patients (2.5 percent) developed psychosis during the year preceding their surgical treatment compared to 1 out of 36 surgery patients (2.8 percent). This percentage of new cases of psychosis among control patients suggests that surgery may not be responsible for all new cases of psychosis after surgery. Nevertheless, our analysis provides an estimate of the number of new cases that may be expected after temporal lobe surgery, regardless of cause.

Cognitive Function Outcome Measurements - IQ

Treatment-resistant epilepsy may be associated with a slow progressive cognitive deterioration. A study of 209 patients with temporal lobe epilepsy reported that patients with a duration of greater than 30 years performed worse on full scale IQ tests compared to patients with less than 30 years duration.157 Due to the nature of the procedure, patients contemplating temporal lobe surgery may also be concerned with the potential for loss of intellectual functioning after surgery. The following section evaluates studies that reported both the number of patients to have a significant change in IQ (increase or decrease) and the pre- and postsurgery mean IQs. The authors of these studies defined a clinically significant increase or decrease in IQ as a change of at least one to two standard errors, and our analysis, therefore, incorporated this definition.

Evidence base

Table 25. Temporal lobe surgery: changes in IQ
Studies of temporal lobe surgery reporting both the number of patients with IQ changes after surgery and the pretreatment and posttreatment mean IQ.
ReferenceNumber of PatientsCountryYears Study ConductedNumber of DecreasesNumber of IncreasesMean Age at SurgeryYoungest PatientOldest Patient
Miranda (2001)24550Canada1976-19987713.36.418.3
Robinson (2000)19921United States1993-19981415.49.421.7
Westerveld (2000)24682United States8714.4617
Chelune (1993)15896United States1990-19918829.4
Ivnik (1988)247141United States1972-1987132728
Powell (1985)24859England1973-198410825.515
Among the 105 studies of temporal lobe surgery meeting our inclusion criteria, six reported if their patients experienced a significant decrease or increase in IQ after surgery as well as reported the mean pretest and posttest IQ. The six studies examined 449 patients. Table 25 lists the studies, and Evidence Table 153 provides study information including the methods used in each. We abstracted and analyzed the verbal IQ scores from each study because these data were reported in all six studies. Only the study by Chelune, Nagle, Lueders, et al.158 provided data on a control group.

Design and conduct of included studies
Internal validity

All but one of the six studies in the evidence base for assessing changes in IQ are uncontrolled case series. Therefore, these studies have the same concerns with regard to internal validity as previously discussed with regard to seizure-free outcome reporting. Investigator bias and patient reporting bias may be reduced (but not eliminated) due to the use of a standardized intelligence test (Wechsler Intelligence Scale) and a predefined cutoff determining when a patient's IQ has undergone a significant change. Attrition bias is not a concern because all patients were examined after surgery.

Decreases in IQ scores can occur in patients with epilepsy, both before and after surgery. Therefore, the lack of control patients in most of these studies limits our ability to determine whether surgery decreased IQ scores. However, since increases in IQ scores are unlikely to occur spontaneously, any increase in IQ scores after surgery are likely to be a consequence of surgery. Our analysis provides an estimate of the number of patients likely to experience either an increase or decrease in IQ after surgery.

External validity

The specific patient characteristics of temporal lobe surgery patients reported in each study are presented in Evidence Table 154. Three of the studies examined only children and adolescents while the other three studies examined only adults. The children were approximately 5 to 15 years old at the time of surgery, while the adults were between 20 and 40 years old at the time of surgery. The mean age of seizure onset was approximately 5 years of age for the children and approximately 10 to 15 years for the adults. The mean duration of epilepsy prior to surgery was approximately 10 years in all six studies with a broad range of between 1 to 17 years.

Based on the distribution of patient characteristics, this evidence base seems to be generalizable to temporal lobe surgery patients in clinical practice.

Synthesis of study results
Meta-analytic threshold analysis of decreases in IQ after surgery

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf56.jpg.

   Figure 56. Forest plot: temporal lobe surgery and decreases in IQ after surgery

Studies reported individuals with significant decreases in IQ after surgery

Evidence Table 155 presents the actual patient counts, percentages, and calculated effect sizes for each study in this analysis. The individual study effect sizes (Cohen's h) presented in the Evidence Table were based on a control group in which no patients experience a clinically significant decrease in IQ. Figure 56 presents a forest plot of these effect sizes to show the extent of variation between studies, but no scale is provided because these effect sizes were not calculated using actual control groups.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf57.jpg.

   Figure 57. Threshold analysis: temporal lobe surgery and decreases in IQ after surgery

The results of our threshold analysis of studies reporting patients with clinically significant decreases in IQ after surgery appear in Figure 57. Each summary estimate in the graph is Cohen's h. The summary estimate calculated at the 0 percent point (no patients in a synthetic control group showed a significant decrease in IQ) was 0.65 (CI: 0.52 to 0.78, p <0.000001) and corresponded to 10 percent (CI: 7 percent to 14 percent) of patients experiencing a clinically significant decrease (equal to 1 to 2 standard deviation units) in IQ after surgery. The summary estimate became nonsignificant (no statistically significant difference between surgery and control) when the proportion of patients in the synthetic control group reached 7 percent. There was no statistically significant heterogeneity in the threshold analysis (Q = 2.3, p = 0.81).

Meta-analytic threshold analysis of increases in IQ after surgery

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf58.jpg.

   Figure 58. Forest plot: temporal lobe surgery and increases in IQ after surgery

Studies reported individuals with significant increases in IQ after surgery

Evidence Table 155 presents the actual patient counts, percentages, and calculated effect sizes for each study in this analysis. The individual study effect sizes (Cohen's h) presented in the Evidence Table were based on a control group in which no patients experience a clinically significant increase in IQ. Figure 58 presents a forest plot of these effect sizes to show the extent of variation between studies, but no scale is provided because these effect sizes were not calculated using actual control groups.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf59.jpg.

   Figure 59. Threshold analysis: temporal lobe surgery and increases in IQ after surgery

The results of our threshold analysis of studies reporting patients with clinically significant increases in IQ after surgery appear in Figure 59. Each summary estimate in the graph is Cohen's h. The summary estimate calculated at the 0 percent point (no patients in a synthetic control group showed a significant increase in IQ) was 0.74 (CI: 0.61 to 0.88, p <0.000001) and corresponded to 13 percent (CI: 9 percent to 18 percent) of patients experiencing a clinically significant increase in IQ after surgery. The summary estimate became nonsignificant (no statistically significant difference between surgery and control) when the proportion of patients in the synthetic control group reached 10 percent. There was no statistically significant heterogeneity among the effect sizes in the threshold analysis (Q = 4.3, p = 0.51).

Because all but one study lacked a control group, these studies do not provide evidence that surgery was directly responsible for the decreases in individual patient IQ scores, although a case can be made that surgery is responsible for any increases in IQ. The one study with a control group, Chelune, Nagle, Lueders, et al.,158 reported two patients with clinically significant increases and two patients with clinically significant decreases in verbal IQ out of 40 control patients (5 percent each) compared to eight patients with clinically significant increases and eight patients with clinically significant decreases in verbal IQ out of 96 surgery patients (8.3 percent each). These percentages for increases and decreases among the control patients are lower than the percentages needed to overturn the conclusions of our threshold analysis suggesting that surgery may plausibly be responsible for changes in IQ.

Meta-analysis of changes in mean IQ after surgery

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf60.jpg.

   Figure 60. Forest plot: temporal lobe surgery and changes in mean IQ

Studies reported both presurgery and postsurgery mean IQ

We also performed a meta-analysis of the data on mean pretest and posttest verbal IQ scores from these same studies (Evidence Table 156). We excluded from the analysis one study that did not report a measure of dispersion for the means. This analysis used Hedges'd as an effect size. A forest plot of the results of this meta-analysis is presented in Figure 60. The meta-analysis produced a summary estimate that was not statistically significant (-0.05, CI: -0.21 to 0.11, p = 0.53), suggesting no dramatic changes in mean IQ after surgery. The effect sizes in this meta-analysis were not heterogeneous (Q = 1.5, p = 0.82).

The summary estimate showed only small changes during the sensitivity analysis and remained statistically nonsignificant. The results of the sensitivity analysis as well as the original meta-analysis are presented in Evidence Table 157.

Cognitive Function Outcome Measurements - Memory

Temporal lobe surgery usually requires the removal of the hippocampus, a part of the brain important to memory capacity. Therefore, memory function is at risk whenever this procedure is performed.159 The following section evaluates studies that reported both the number of patients with a significant change in memory function (increase or decrease) and the pre- and postsurgery mean memory scores.

Evidence base

Table 26. Temporal lobe surgery: changes in memory
Studies of temporal lobe surgery reporting individual changes in patient memory after surgery
ReferenceNumber of PatientsCountryYears Study ConductedNumber of DecreasesNumber of IncreasesMean Age at SurgeryYoungest PatientOldest Patient
Canizares (2000)24933Spain1998-199931030.9
Chelune (1993)15896United States1990-199128129.4
Ivnik (1988)247141United States1972-1987484828
Ojemann (1985)25013United States1983-19838328.91749
Powell (1985)24859England1973-198481325.515
Among the 105 studies of temporal lobe surgery meeting our inclusion criteria, five studies reported individual changes in one of the measurements in the Wechsler Memory Scale as well as reported the mean score before and after surgery. The five studies had 342 patients. Only two of the five studies reported the same portion of the Wechsler Memory Scale. Therefore, we did not perform a meta-analysis of these data. One study, Chelune, Nagle, Lueders, et al.,158 provided data on a control group. Table 26 presents a listing of the five studies reporting memory changes. Study information and the portion of the Wechsler Memory Scale used in each study are presented in Evidence Table 158.

Synthesis of study results

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf61.jpg.

   Figure 61. Temporal lobe surgery: changes in memory after surgery

Studies reported individuals with significant changes in memory after surgery

Evidence Table 159 presents the finding for the five studies in the evidence base for this section. Patients experienced both increases and decreases in memory function, but the individual percentages in each study varied widely (Figure 61). The range of patients who showed an increase was 1 percent to 34 percent and the range of patients who showed a decrease was 9 percent to 62 percent.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf62.jpg.

   Figure 62. Forest plot: temporal lobe surgery and changes in memory

To further explore these data, we calculated individual study results (using Cohen's h) by assuming that a control group would have no patients experiencing an increase or decrease in memory score. Statistically significant effects indicate that the percentage of patients experiencing an increase or decrease in memory score was significantly different from zero. All five studies showed statistically significant percentages of patients with memory decreases, and four studies showed statistically significant percentages of patients with memory increases (Figure 62).

Complications Due to Surgery

Serious permanent complications and transient complications are an inherent part of surgery. Temporal lobe surgery can result in various forms of paralysis due to obstruction of blood vessels or other damage to brain tissue. The following section evaluates studies that reported cases of serious permanent complications. We considered moderate to severe permanent neurological deficits, especially hemiplegia, to be serious complications. We considered all other reported surgical complications to be mild or transient. Development of postsurgical depression or psychosis, and declines in IQ or memory are not considered in this section because we examined them separately (see above).

Evidence base

Table 27. Temporal lobe surgery: complications due to surgery
ReferenceNYears Study ConductedCountryPermenant Complications
Boling (2001)169181981-1999Canada0
Schramm (2001)178611993-1999Germany0
Sotero de Menezes(2001)180151978-1993United States1
Wiebe (2001)145361996-2000Canada2
Iannelli (2000)194371981-1997Italy1
Rao (2000)1971641995-1998India1
Robinson (2000)199211993-1998United States0
Wurm (2000)251161997-1998Austria1
Altshuler (1999)156491974-1990United States5
Leung (1999)252111994-1998Hong Kong1
Parrent (1999)253191994-1997Canada0
Salanova (1999)2141451984-1995United States2
Son (1999)166711994-1999South Korea2
Visudhiphan (1999)255141993-1998Thailand0
Radhakrishnan (1998)1701751988-1991United States2
Wyllie (1998)257721990-1996United States0
Bizzi (1997)173141990-1994United States1
Blume (1997)259141977-1994Canada0
Kilpatrick (1997)183361993-1995Australia0
Adam (1996)193301991-1994France2
Acciarri (1995)200101975-1992Italy0
Davies (1995)163121969-1988England1
Jooma (1995)205301985-1992United States2
Liu (1995)209221983-1990United States2
Wyler (1995)142701990-1992United States3
Blume (1994)1461251974-1989Canada1
Guldvog (1994)160641952-1988Norway0
Guldvog (1994)161351949-1988Norway6
Hopkins (1991)217111978-1988Australia0
Bidzinski (1990)1623201957-1988Poland2
Mackenzie (1990)254301983-1989Australia0
Mizrahi (1990)152221980-1986United States0
Walczak (1990)1641001964-1985United States1
So (1989)256481973-1987Canada0
Cutfield (1987)147261961-1980New Zealand0
Drake (1987)224161974-1986Canada0
King (1986)258231981-1983United States0
Meyer (1986)226501970-1983United States0
Carey (1985)260241975-1984Ireland0
Delgado-Escueta(1985)153151972-1983United States1
Among the 105 studies of temporal lobe surgery meeting our inclusion criteria, 40 studies reported on complications due to surgery. The 40 studies examined 2091 patients (Table 27). We abstracted data on serious permanent complications only if the publication specifically reported such a complication or specifically reported that no such complications occurred. We abstracted data on mild or transient complications only from studies reporting data on serious permanent complications. Six of the 40 studies did not report on the occurrence of mild or transient complications.

Design and conduct of included studies
Internal validity

The complications reported by these studies could only have occurred because of surgery, so the internal validity with regard to the cause and effect is not in question. However, some potential biases are still present. Investigator reporting bias may have affected the reporting of mild or transient complications because they may not be regarded as important by some investigators. Attrition bias is not a concern because all patients were examined after surgery. Maturation bias is also not a concern when reporting complications.

External validity

The specific patient characteristics of temporal lobe surgery patients reported in each study are presented in Evidence Table 160. The 40 studies in the evidence base cover a wide range of patient ages at surgery, onset of seizures, and duration of epilepsy. Eleven studies enrolled patients with a mean age at surgery of less than 20 years with no patient exceeding 22 years of age. Twenty-eight studies enrolled patients with a mean age at surgery of greater than 20 years with youngest and oldest ages that varied between 1 year and 86 years. Twelve studies had a mean age of seizure onset of less than 10 years of age and 15 studies had a mean age of seizures onset after 10 years of age. The range of seizure onset varied from less than 1 year of age to 62 years of age. Ten studies reported a mean duration of epilepsy of less than 10 years and 18 studies reported a mean duration of epilepsy of greater than 10 years. The range for duration of epilepsy varied between less than 1 year and 81 years.

Of the 40 studies, three included patients who received surgery starting in the 1940s and 1950s160–162 and three included patients who received surgery starting in the 1960s (Table 27).147, 163, 164

Based on the distribution of patient characteristics, this evidence base seems to be generalizable to temporal lobe surgery patients in clinical practice.

Synthesis of study results

Evidence Table 161 presents a study-by-study list of the complications reported in each of the forty studies in the evidence base. Among the 2,091 temporal lobe surgery patients, 42 serious permanent complications were reported. This corresponds to 2 percent of the patients or 20 serious complications per 1,000 surgery patients. If the six studies which included patients from the 1940s, 1950s, and 1960s are removed, the number of serious complications was 32 out of 1,534 patients or 2.1 percent of patients.

Seventy-nine mild or transient complications were reported among 1,339 patients, which correspond to 6 percent or 59 complications per 1,000 surgery patients. The number of mild or transient complications may be underestimated by these data because of differences in reporting these complications across studies. Clinician judgment as to the importance of reporting various mild or transient complications will likely vary across studies, whereas, the occurrence of permanent paralysis will usually warrant reporting.

Surgery-related Mortality

Any surgical procedure may result in such serious complications that death results. The following section evaluates studies that reported deaths due to temporal lobe surgery.

Evidence base

Table 28. Temporal lobe surgery: surgery-related mortality
ReferenceNYears Study ConductedCountryDeaths
Boling (2001)169181981-1999Canada0
Schramm (2001)178611993-1999Germany0
Wiebe (2001)145361996-2000Canada0
Iannelli (2000)194371981-1997Italy0
Rao (2000)1971641995-1998India1
Robinson (2000)199221993-1998United States0
Wurm (2000)251161997-1998Austria0
Altshuler (1999)156491974-1990United States0
Leung (1999)252111994-1998Hong Kong0
Parrent (1999)253191994-1997Canada0
Salanova (1999)2141451984-1995United States0
Son (1999)166711994-1999South Korea0
Visudhiphan (1999)255141993-1998Thailand0
Wyllie (1998)257721990-1996United States1
Bizzi (1997)173141990-1994United States0
Blume (1997)259141977-1994Canada0
Kilpatrick (1997)183361993-1995Australia0
Adam (1996)193301991-1994France0
Acciarri (1995)200101975-1992Italy0
Berkovic (1995)2021351986-1991Australia0
Davies (1995)163121969-1988England0
Liu (1995)209221983-1990United States0
Wyler (1995)142701990-1992United States0
Blume (1994)1461251974-1989Canada0
Guldvog (1994)160641952-1988Norway0
Guldvog (1994)161351949-1988Norway0
Bladin (1992)2431071975-1991Australia0
Elwes (1991)2611081976-1987England1
Hopkins (1991)217111978-1988Australia0
Bidzinski (1990)1623201957-1988Poland2
Mizrahi (1990)152221980-1986United States0
Yeh (1990)220121982-1986Japan0
So (1989)256481973-1987Canada0
Cutfield (1987)147261961-1980New Zealand0
Drake (1987)224161974-1986Canada0
Meyer (1986)226501970-1983United States0
Carey (1985)260241975-1984Ireland0
Among the 105 studies of temporal lobe surgery meeting our inclusion criteria, 38 studies reported a death due to surgery or specifically reported that no deaths occurred due to surgery. The 38 studies examined 2,065 patients (Table 28). Only four of these studies were not included in the evidence base for our analysis on complications (see above).

We abstracted only deaths specifically reported to be caused by surgery. Deaths as a result of invasive presurgical diagnostic procedures were not included.

Design and conduct of included studies
Internal validity

The deaths reported here could only have occurred through surgery. Investigator reporting bias, attrition bias, and maturation bias are not a concern when reporting surgery-related mortality

External validity

The specific patient characteristics of temporal lobe surgery patients reported in each study are presented in Evidence Table 162. The 38 studies in the evidence base cover a wide range of ages at surgery, onset of seizures, and duration of epilepsy. Three studies enrolled patients with a mean age at surgery of less than 10 years and seven studies had a mean age at surgery between 10 and 15 years. The oldest patients in these ten studies did not exceed 22 years of age. Twenty-six studies enrolled patients with a mean age at surgery of greater than 21 years with ranges that varied between the youngest patients being 1 year of age to the oldest patient being 74 years of age. Mean age of seizure onset, reported in 22 studies, was between 2 and 20 years of age. The range of seizure onset varied from less than 1 year of age to 49 years of age. Mean duration of epilepsy, reported in 24 studies, was between 2 and 20 years. The range for duration of epilepsy varied from less than 1 year to 53 years.

Of the 38 studies, three included patients who received surgery starting in the 1940's and 1950's and two included patients who received surgery starting in the 1960's (Table 28).

Based on the distribution of patient characteristics, this evidence base seems to be generalizable to temporal lobe surgery patients in clinical practice.

Synthesis of study results

Table 30. Corpus callosotomy: individual patient data
Studies of corpus callosotomy reporting individual patient data for patients with successful and nonsuccessful surgery
ReferenceAge at TreatmentAge at Seizure OnsetDuration of Epilepsy Prior to Treatment
Sakas (1996)268[check][check]
Claverie (1995)274[check]
Nordgren (1991)279[check][check][check]
Marino (1990)276[check][check][check]
Murro (1988)277[check][check][check]
Purves (1988)278[check][check][check]
Spencer (1988)267[check][check][check]
Among the 2.065 temporal lobe surgery patients, five deaths were reported (0.24 percent or 2.4 deaths per 1,000 patients). The five deaths were reported in four studies, all of which had more than 70 patients (Table 30). The study reporting two deaths enrolled patients from 1957 to 1988. Six studies with 70 or more patients reported no deaths. Twenty-eight studies had a samplee size of less than 70 patients. With a potential incidence rate of 1 or 2 deaths per 1,000 patients, studies with small sample sizes are not likely to report a death due to surgery. If the studies with patients from the 1940's, 1950's and 1960's are removed, three surgery-related deaths occured among 1,608 patients (0.19 percent)

Corpus Callosotomy

Resection of the corpus callosum is intended as a palliative procedure that reduces the frequency of seizures that could lead to injury or seriously interfere with life-style.262–264 These patients typically have multifocal, unresectable, or unlocalized lesions.263 Candidates for this procedure include both children and adult patients with atonic, tonic, and tonic-clonic seizures.263 These patients typically have daily to weekly seizures of multiple types that occur despite therapeutic blood levels of AEDs for at least 2 years prior to surgery.265

Corpus callosotomy is not expected to eliminate all seizures. Under these circumstances, reduction in overall seizure frequency and in specific seizure frequencies are the most valuable outcome measurement for establishing if corpus callosotomy has been effective. Due to the complicated nature of the surgery, an assessment of surgical complications and deaths due to surgery is also necessary to judge the effectiveness of corpus callosum resection.

Percent Reduction in Overall Seizure Frequency
Excluded studies

We excluded one study of corpus callosotomy reporting seizure frequency outcome measures from the evidence base. This study and the reason for its exclusion are listed in Evidence Table 163.

Evidence base

Table 29. Corpus callosotomy and seizure frequency outcomes
Method of reporting effect of surgery on seizure frequency
ReferenceTypes of Seizures EvaluatedPercent Reduction in Seizure FrequencyActual Change in Mean Seizure FrequencySeizure-free
Kwan (2001)270All seizure types[check]
Maehara (2001)271Disabling generalized seizures[check]
Matsuzaka (1999)272Most disabling seizures[check][check]
McInerney (1999)273Most disabling seizures[check][check]
Sakas (1996)268Drop attacks and generalized tonic-clonic seizures[check][check]
Claverie (1995)274All seizure types[check]
Reutens (1993)275All seizure types[check]
Marino (1990)276All seizure types[check][check]
Murro (1988)277All seizure types[check][check][check]
Purves (1988)278All seizure types[check]
Spencer (1988)267All seizure types[check][check][check]
Gates (1987)265All seizure types[check][check][check]
Among the 26 studies of corpus callosotomy meeting our inclusion criteria, 12 reported an outcome measurement related to seizure frequency. Three hundred and forty-nine patients were examined in these studies. Table 29 presents a list of the seizure outcome categories used by each of the 12 studies. The most common means of reporting the effect of surgery on seizure occurrence was to classify patients into groups based on their percentage reduction in the frequency of all seizures or one particular seizure type. Eight studies considered the percentage reduction in all seizure types, while the remaining four studies measured only changes in the most disabling seizure, in disabling generalized seizures, or in drop attacks and generalized tonic-clonic seizures. Four studies reported the number of patients who were seizure-free for all seizure types.

Design and conduct of included studies
Internal validity

As noted for temporal lobe surgery, withholding surgery may be unethical, so the evidence base for corpus callosotomy consists mainly of uncontrolled trials. Indeed, none of the studies in the evidence base for corpus callosotomy employed a control group. Rather, all studies were case series. Therefore, all of the 12 studies in the evidence base may have biases that reduce internal validity as previously discussed for temporal lobe surgery. However, these patients have daily to weekly seizures of multiple types that occur despite therapeutic blood levels of AEDs.265 Therefore, given the severe nature of the seizure activity in individuals considering this type of surgery, explanations for seizure reduction other than the effect of surgery may be considered implausible.

External validity

As stated earlier, patients being considered for resection of the corpus callosum experience characteristics seizures due to the multifocal nature of the lesions responsible for the seizures. Therefore, the patients in published studies of corpus callosotomy should be representative of all patients receiving this surgery. However, differences may exist across studies with regard to age or pathology.

The specific characteristics of corpus callosotomy patients reported in each study are presented in Evidence Table 164. In two studies, the patients were all less than 20 years of age at the time of surgery. In the other 10 studies, the mean age at the time of surgery varied between 20 to 30 years of age, with patient ages ranging from a youngest of about 5 years to an oldest of about 50 years of age. The mean age of seizure onset was less than 10 years of age in the nine studies reporting this patient characteristic. The age of seizure onset ranged from birth to 26 years of age. The mean duration of epilepsy prior to surgery was less than 10 years in one study and between 13 and 21 years in another eight studies. The range for duration of epilepsy prior to surgery was less than a year to 50 years.

Based on the distribution of patient characteristics, this evidence base seems to be generalizable to corpus callosotomy patients in clinical practice.

Synthesis of study results

As with temporal lobe surgery, several categories are used to describe reductions in seizure frequency. Evidence Table 165 presents the data from 12 studies organized according to the percentage of seizure frequency reduction. Ten studies reported a category of 90 percent or greater reduction. Nine studies reported frequency data that could be organized into categories of seizure reductions of greater than or equal to 90 percent, 75 percent to 90 percent, 50 percent to 75 percent, less than 50 percent, and no change or worse. One study reported only the number of patients in the 90 percent reduction group. Two studies did not report a 90 percent reduction category; one study separated patients above and below a 50 percent reduction in seizure frequency and the other reported the number of patients to achieve better than a 75 percent reduction. This last study also reported the number patients with no change or who became worse, but the remaining patients could have been anywhere between 1 percent and 74 percent.

As previously mentioned, the types of seizures being evaluated are not the same across studies. In four of the 12 studies reporting a percentage reduction in seizure frequency, only a single specific type of seizure was considered (Table 29).

Meta-analytic threshold analysis of 90 percent reduction in seizure frequency

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf63.jpg.

   Figure 63. Forest plot: corpus callosotomy and reduction in seizure frequency

Studies reported patients with at least a 90 percent reduction in seizure frequency after surgery

A scale is not shown because the effect sizes were not calculated with actual control groups

Five studies reported the number of patients with a greater than or equal to 90 percent reduction for all seizure types. Evidence Table 166 presents the actual patient counts, percentages, and calculated effect sizes for each study used in this analysis. The individual study effect sizes (Cohen's h) presented in the Evidence Table were based on a control group in which no patients experience a 90 percent reduction in seizure frequency. Figure 63 presents a forest plot of these effect sizes to show the extent of variation between studies, but no scale is provided because these effect sizes were not calculated using actual control groups.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf64.jpg.

   Figure 64. Threshold analysis: corpus callosotomy and reduction in seizure frequency

The results of our threshold analysis appear in Figure 64. The summary estimate calculated at the 0 percent point (no patients in a synthetic control group showed a 90 percent reduction in seizure frequency) was 0.94 (CI: 0.70 to 1.18, p <0.000001). This summary estimate corresponded to 20 percent (CI: 12 percent to 31 percent)y of patients experiencing a 90 percent reduction in seizure frequency after surgery. The summary estimate became nonsignificant (no statistically significant difference between surgery and control) when the proportion of patients in the synthetic control group reached 15 percent. There was no statistically significant heterogeneity in the threshold analysis (Q = 2.9, p = 0.58).

Meta-analysis of no change or increase in seizure frequency

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf65.jpg.

   Figure 65. Forest plot: corpus callosotomy and no benefit from surgery

Studies reported patients who had no change or an increase in seizure frequency

Seven studies reported the number of patients who experienced no change or became worse for all seizure types, and we performed a meta-analysis of these studies. Evidence Table 167 presents the actual patient counts, percentage, and calculated effect sizes (Cohen's h) for each study used in this analysis. We calculated each study's Cohen's h presented in the Evidence Table under the assumption that no surgical patients would have been included under the category of no change or became worse. Figure 65 presents a forest plot of the effect sizes. We did not conduct a threshold analysis because control patients are expected to experience the outcome we are meta-analyzing, no change or an increase in seizure frequency.

Our meta-analysis produced a statistically significant summary estimate (0.83, CI: 0.62 to 1.03, p <0.000001) that corresponds to 16 percent (CI: 9 percent to 24 percent) of patients with no change or an increase in seizure frequency after surgery. There was no statistically significant heterogeneity in the threshold analysis (Q = 9.0, p = 0.17).

Analysis of seizure-free outcome measurements

Four studies reported the number of patients who became completely seizure-free (no auras) after resection of the corpus callosum. Of the 85 patients examined in these four studies, only five patients became seizure-free (6 percent). The range among the individual studies was 0 percent to 14 percent. Evidence Table 168 presents the data from these studies.

Analysis of presurgery and postsurgery seizure frequency outcome measurements

Pre- and postsurgery seizure frequency data were reported in only three studies. Therefore, we did not perform a meta-analysis of these data. All three studies reported a reduction in mean seizure frequency after surgery. Mean presurgery seizure frequency ranged from 110 to 178 seizures per month. The mean postsurgery seizure frequency dropped to a range of 20 to 78 per month. Because each of these studies reported individual patient data for seizure frequency, we looked for significant changes in seizure frequency in each study using a paired t-test. Two of the three studies showed statistically significant reductions in seizure frequency after surgery (p = 0.014 and 0.015) and the third showed a reduction close to being statistically significant (p = 0.065). These studies suggest that corpus callosotomy can be effective in reducing absolute seizure frequency. Evidence Table 169 presents the data abstracted from these studies and the results of our paired t-test calculations.

Analysis of nested case-control studies

Four nested case-control studies of corpus callosotomy presented an evaluation of patient characteristics that could potentially influence surgical outcomes. Evidence Table 170 presents the findings reported by each of the nested case-control studies in our evidence base for seizure frequency outcome measurements. One of the four studies used multiple regression, but did not assess age at surgery, age at seizure onset, or duration of epilepsy prior to treatment.

Meta-analysis of patient characteristics

As mentioned previously in the section on temporal lobe surgery, each nested case-control study may have been too small (i.e., had too little power) to detect clinically meaningful correlations between patient characteristics and successful surgery. To address this, we performed meta-analyses that combined individual patient data across studies. At least five studies reported individual patient data for one or more of the following continuous variables: age at surgery, age at seizure onset, or duration of epilepsy prior to surgery. We calculated a point-biserial correlation for each study and combined these in a meta-analysis. The coefficient was calculated so that a positive correlation indicated that an older age or longer duration favored a successful outcome and a negative correlation indicated that a younger age or shorter duration favored a successful outcome. Table 30 presents a list of the studies of corpus callosotomy that reported characteristics for patients with successful and nonsuccessful surgery. All of these studies were included in the previous meta-analyses examining the efficacy of surgery.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf66.jpg.

   Figure 66. Forest plot: corpus callosotomy and patient age at surgery

Age at surgery. Our first meta-analysis looks at whether different outcomes were obtained in patients of different ages at the time they receive surgery. Individual ages at surgery for patients with successful and nonsuccessful surgery were reported in six studies with 120 patients. Evidence Table 171 presents the definition used for successful surgery and the point-biserial correlation calculated for each of the six studies. Figure 66 presents a forest plot of the effect sizes.

The meta-analysis produced a summary estimate that was not statistically significant (rpb = 0.14, CI: -0.05 to 0.32, p = 0.16) suggesting that age at surgery had no influence on the success of surgery in these studies. The effect sizes in this meta-analysis were not heterogeneous (Q = 4.1, p = 0.54).

We performed a sensitivity analysis to determine whether a single study had excessive influence over the results of the analysis. The summary estimate and other statistics did not change markedly because of the sensitivity analysis. The correlation changed by no more than 0.04 due to removal of studies during the sensitivity analysis. The summary estimates remained nonsignificant. The results of the sensitivity analysis and the original meta-analysis are presented in Evidence Table 172.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf67.jpg.

   Figure 67. Forest plot: corpus callosotomy and patient age at onset of seizures

Age at seizure onset. In our second meta-analysis, we sought to determine whether different outcomes were obtained in patients of different ages at seizure onset. Individual ages at seizure onset for patients with successful and nonsuccessful surgery were reported in five studies with 105 patients. Evidence Table 173 presents the definition used for successful surgery and the point-biserial correlations calculated for each of the five studies. Figure 67 presents a forest plot of the effect sizes. The meta-analysis produced a summary estimate that was not statistically significant (rpb = 0.04, CI: -0.16 to 0.24, p = 0.70) suggesting that age at seizure onset had little or no influence on the success of surgery in these studies. The effect sizes in this meta-analysis were not heterogeneous (Q = 2.6, p = 0.64).

We performed a sensitivity analysis to ensure that a single study did not have excessive influence over the results of the analysis. The summary estimate and other statistics did not change markedly because of the sensitivity analysis. The point-biserial correlation varied between 0.0 and 0.12 due to removal of studies during the sensitivity analysis. The summary estimates remained nonsignificant. The results of the sensitivity analysis as well as the original meta-analysis are presented in Evidence Table 174.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is er-trepilf68.jpg.

   Figure 68. Forest plot: corpus callosotomy and duration of epilepsy prior to surgery

Duration of epilepsy prior to surgery. In our third meta-analysis, we sought to determine whether different outcomes were obtained in patients with different durations of epilepsy prior to surgery. Individual durations of epilepsy prior to surgery for patients with successful and nonsuccessful surgery were reported in five studies with 105 patients. These same five studies reported individual patient age at onset of seizures. Evidence Table 175 presents the definitions used for successful surgery and the point-biserial correlations calculated for each of the five studies. Figure 68 presents a forest plot of the effect sizes. The meta-analysis produced a summary estimate that was not statistically significant (rpb = 0.15, CI: -0.05 to 0.34, p = 0.15) suggesting that duration of epilepsy prior to surgery had no influence on the success of surgery in these studies. The effect sizes in this meta-analysis were not heterogeneous (Q = 6.2, p = 0.18).

We performed a sensitivity analysis to ensure that a single study did not have excessive influence over the results of the analysis. The summary estimate and other statistics did not change markedly because of the sensitivity analysis. The point-biserial correlation varied between 0.06 and 0.21 due to removal of studies during the sensitivity analysis. The summary estimates remained nonsignificant. The results of the sensitivity analysis as well as the original meta-analysis are presented in Evidence Table 176.

Changes in the Frequency of Specific Seizure Types

In the previous section, we analyzed data on overall seizure reduction and estimated that, after corpus callosotomy, only 20 percent of patients are likely to exhibit a 90 percent reduction in all seizure types. The benefits of corpus callosotomy may also be determined from surgery's effect on the most disabling seizures experienced by the patients, or from surgery's effect on specific types of seizures. The most disabling seizures are primarily generalized seizures that can result in falls and injuries. The specific types of seizures for which corpus resection may have a beneficial effect are generalized tonic/clonic seizures, atonic seizures, and tonic seizures.266

Evidence base

Table 31. Corpus callosotomy and specific seizure types
Studies reported patients who were free of specific types after surgery
ReferenceSpecified Most Disabling SeizurePatients with Generalized Tonic-Clonic SeizuresPatients with Atonic Seizures
Kwan (2001)270[check][check]
Maehara (2001)271[check][check]
Matsuzaka (1999)272[check]
McInerney (1999)273[check][check][check]
Sakas (1996)268