The Agency for Healthcare Research and Quality (AHRQ), through its Evidence-Based Practice Centers (EPCs), sponsors the development of evidence reports and technology assessments to assist public- and private-sector organizations in their efforts to improve the quality of health care in the United States. The reports and assessments provide organizations with comprehensive, science-based information on common, costly medical conditions and new health care technologies. The EPCs systematically review the relevant scientific literature on topics assigned to them by AHRQ and conduct additional analyses when appropriate prior to developing their reports and assessments.
To bring the broadest range of experts into the development of evidence reports and health technology assessments, AHRQ encourages the EPCs to form partnerships and enter into collaborations with other medical and research organizations. The EPCs work with these partner organizations to ensure that the evidence reports and technology assessments they produce will become building blocks for health care quality improvement projects throughout the Nation. The reports undergo peer review prior to their release.
AHRQ expects that the EPC evidence reports and technology assessments will inform individual health plans, providers, and purchasers as well as the health care system as a whole by providing important information to help improve health care quality.
We welcome written comments on this evidence report. They may be sent to: Director, Center for Practice and Technology Assessment, Agency for Healthcare Research and Quality, 6010 Executive Blvd., Suite 300, Rockville, MD 20852.
Carolyn M. Clancy, M.D.
Director
Agency for Healthcare Research and Quality
Jean Slutsky, P.A., M.S.P.H.
Acting Director, Center for Practice and Technology Assessment
Agency for Healthcare Research and Quality
The authors of this report are responsible for its content. Statements in the report should not be construed as endorsement by the Agency for Healthcare Research and Quality or the U.S. Department of Health and Human Services of a particular drug, device, test, treatment, or other clinical service.
Objectives. This report, commissioned at the request of the Centers for Disease Control and Prevention and the Social Security Administration, addresses in an evidence-based fashion diagnosis of and interventions for treatment-resistant epilepsy (TRE). It addresses drug and surgical treatments, as well as service-related interventions.
Search Strategy. We systematically searched 23 electronic databases, including PubMed® and EMBASE. Search dates ranged from 1985 to January 1, 2002 for all but drug topics, which ranged from 1975 to January 1, 2002. We employed different search strategies for each of the nine key questions addressed. Our searches identified 11,111 articles.
Selection Criteria. We retrieved 2,356 articles, and included 357, according to a priori criteria accounting for the quality and relevance of available studies.
Data Collection and Analysis. We employed a “best evidence” synthesis that used the best available, not the best possible evidence. Case control studies were the most common design for diagnostic topics, RCTs were most common for antiepileptic drug (AED) strategies, and the surgical literature was nearly all retrospective case series. The quality of these studies was systematically considered. We computed summary statistics in meta-analyses of RCTs of multiple AED therapy (polytherapy) and computed thresholds for effectiveness in meta-analyses of sequential AED monotherapy and uncontrolled surgical studies.
Main results. There is no widely used definition of TRE. Lack of high quality studies precludes an evidence-based determination of the most effective diagnostic for rediagnosing or re-evaluating patients. Nevertheless, up to 35 percent of patients (but probably fewer) diagnosed with TRE may also have nonepileptic seizures, or not have epilepsy at all. Not all patients diagnosed with TRE receive optimized therapy, but the number of these patients cannot be determined. Initiation of sequential monotherapy appears to result in seizure increases in many patients, and whether sequential monotherapy causes any patients to become seizure-free is not clear. Polytherapy can reduce seizure frequency, but some patients experience intolerable adverse effects. Drug reduction may cause seizure increases without additional benefit. Results of the AED studies assessed in this report may not be generalizable to drugs not examined in the studies we included. Temporal lobe surgery eliminates seizures in many patients. Hemispherectomy and frontal lobe surgery eliminate seizures in an indeterminate number of patients. Corpus callosotomy reduces seizure frequency but generally does not eliminate seizures. Vagal nerve stimulation affords some seizure reduction. There was insufficient evidence to assess other treatments. Epilepsy is associated with increased all-cause mortality and death from drowning. The link between sudden death and seizure frequency is uncertain. Generalized tonic-clonic seizures seem associated with an increased risk of death.
Conclusions. Some patients diagnosed with treatment-resistant epilepsy are misdiagnosed or not receiving optimized AED treatment. Effective treatments are available, but all have disadvantages. There are many weaknesses in the current literature, particularly in studies of diagnostics and nondrug, nonsurgical interventions. Better-designed studies in these areas are needed.
In this report, we evaluate and synthesize the published literature on diagnosis of, and medical and nonmedical interventions for treatment-resistant epilepsy. This report was commissioned upon the request of the Centers for Disease Control and Prevention and the Social Security Administration.
Epilepsy is a common, serious neurologic condition. An International League Against Epilepsy (ILAE) Commission Report from 1997 estimated the prevalence of active epilepsy as 40 to 100 in 10,000 and the incidence of unprovoked seizures as 2 to 7 per 10,000. However, precise estimates of prevalence and incidence are complicated by differences in the way investigators define epileptic and nonepileptic seizures (NES), and by the fact that prevalence is typically estimated using retrospective methods.
In addition to the immediate, debilitating effects of seizures, epilepsy also interferes with daily activities, and persons with epilepsy may have to contend with the increased possibility of accidental injury and even death. Psychiatric disorders may also be more common in people with epilepsy.
Persons with epilepsy often have impaired physical, psychological, and social functioning, which may lead to economic loss and diminished quality of life. A survey of 1,023 people with epilepsy published in 2000 showed that compared to U.S. Census Bureau norms, respondents received less education, were less likely to be employed, and were more likely to be members of low-income households.
This evidence report addresses nine key research questions encompassing 49 technologies, including several service-related interventions. However, the quantity and quality of published literature was insufficient to permit an evidence-based evaluation of 39 of these technologies. We therefore evaluated one diagnostic technology, three antiepileptic drug (AED) strategies, five surgical procedures, and one nondrug, nonsurgical intervention. In addition, we also surveyed the definitions of treatment-resistant epilepsy in the published clinical literature, with particular emphasis on the definitions reported in clinical studies.
The outcomes we considered depended upon the key research questions. We used 16 patient-oriented outcomes to evaluate the effects of treatment, and all reported measures of diagnostic test performance. We also examined the rates of all-cause mortality and cause-specific mortality among persons with epilepsy.
To obtain information for this report, we systematically searched 23 electronic databases, including PubMed® and EMBASE. In general, literature searches covered the years 1985 to January 1, 2002. For topics on AEDs, we searched for studies published between 1975 and January 1, 2002. We employed these earlier search dates to ensure that we captured data on standard drug treatments, which are likely to be in relatively older literature.
We employed different search strategies for each of the nine key research questions. Searches were implemented by first developing a list of Medical Subject Headings (MeSH) terms, publication types, and textword combinations. This list included the concepts inherent in each of the key research questions. These searches identified 11,111 articles. From these identified articles, we retrieved 2,356 potentially relevant articles to determine whether they met the a priori criteria tailored for each Key Research Question.
Three hundred forty-eight articles met these inclusion criteria. We next evaluated these articles to determine whether they contained design flaws so severe that their results were uninterpretable. Such articles were excluded. In addition, we excluded articles if there were fewer than five published studies on a given intervention or diagnostic, and none of the studies was a randomized controlled trial with 50 or more patients in the treatment arm. We adopted this latter criterion because of the difficulty in reaching firm evidence-based conclusions from a relatively small literature base comprised of studies of less than optimal design. As a result, 299 articles are included in this evidence report for key research questions 2–9. One hundred eighty-five articles for key research question 1 (on definitions of treatment-resistant epilepsy) were selected from all of the articles included in key research questions 2–6, from available clinical guidelines, and from a random sample of 100 review articles.
We employed a “best evidence” synthesis in this Evidence Report. Thus, for each key research question, we used the best available evidence, not the best possible evidence. Consequently, studies of several designs were included in this report. Diagnostic case-control studies are the most common design for diagnostic topics, randomized controlled trials (RCT) are most commonly used for evaluating AED strategies, and the surgical literature is comprised almost exclusively of retrospective case series.
We evaluated the internal validity of all included studies using checklists of biases that could potentially affect their results. In considering study design, we assumed that randomized controlled trials provide results with the least potential for bias. This was followed, in order of increasing potential for bias, by controlled studies of other design, studies that measured patient outcomes before and after some intervention, and uncontrolled studies. Among each type of study, we considered blinded studies to have lower potential for bias than nonblinded studies, and prospective studies to have lower potential for bias than retrospective studies.
In parts of this report, we used a systematic narrative review supplemented by numerous de novo calculations. These include calculations that index the statistical power of nonsignificant studies, various statistics (e.g., chi-square tests), crude mortality ratios, and other quantities, as appropriate. The majority of this evidence report is, however, meta-analytic.
We performed random effects meta-analyses on data from RCTs examining polytherapy AED treatment. We used sensitivity analyses to evaluate how robust the results of these analyses were. Sensitivity analyses consisted of removing the largest and smallest studies from the meta-analysis, and removing the studies with the largest and smallest effects. Each of the trials in these meta-analyses is an instance of polytherapy, rather than a direct study of this strategy. However, combining these trials into a single analysis of polytherapy can provide an approximate estimate of the effect of adding a single new AED to patients' regimens.
We performed threshold analyses on data from uncontrolled studies of sequential monotherapy and surgery. For sequential monotherapy, we employed random effects models, whereas for surgery we employed fixed effects models. We used random effects models for analyses of sequential monotherapy because of the heterogeneity among results of trials using different AEDs. In our threshold analyses, we meta-analytically compared the improvement rate in treated patients to increasing rates of improvement in a hypothetical “control” group. Starting at 0 percent, we increased the rate of improvement in the “control” patients until the difference in improvement between the treated and “control” groups was no longer statistically significant. This value is the threshold. Where possible, we provide context for these thresholds by supplementing them with historical data obtained from published articles.
We also report the percentage of patients who improved after the intervention (as given by the meta-analytic results when improvement in the control group is 0 percent), but note that this percentage is not a measure of the net effectiveness of the intervention. Some patients may have improved without treatment. Nevertheless, this percentage is informative because it represents the proportion of patients likely to improve, regardless of the cause of their improvement.
When heterogeneity among study results was found in a threshold analysis, we attempted to “explain” the source of the heterogeneity using meta-regression. Because of the lack of strong a priori hypotheses about the reasons for this heterogeneity, we constructed multiple meta-regression models for each instance in which heterogeneity was found. The post hoc nature of these analyses led us to adopt stringent criteria for identifying models for further exploration. These explorations consisted of threshold analyses of the regression intercepts.
Treatment resistance is infrequently defined in the literature. Less than one third of the surveyed publications reported any definition of this term.
When treatment resistance was defined, definitions typically included the number of AEDs a patient tried before being considered treatment-resistant. Some definitions also included seizure frequency, duration of illness, and whether AEDs were administered at maximum tolerable doses.
Drug trials tended to require fewer failures of AED treatment compared to surgical trials. This is because a very thorough assessment of drug regimens is usually attempted before surgery is considered. Assessments are usually less thorough when giving a patient another AED.
Despite the fact that reports of clinical trials and review articles regularly use terms such as “intractable,” “refractory,” or “treatment-resistant” to describe patients for whom one or more treatments have failed, no consensus exists as to precisely what these terms mean.
We partitioned this question into four subquestions. The first two subquestions addressed differential diagnosis of epileptic seizures from nonepileptic seizures. The remaining two subquestions addressed the differential diagnosis of different seizure types. Whether we addressed some questions depended on the findings for previous questions.
This question attempts to gauge the extent of the need for rediagnosis among patients thought to have treatment-resistant epilepsy. Our evaluation of the published literature suggests the following:
Meta-analysis suggests that up to 35 percent of patients originally diagnosed with treatment-resistant epilepsy either do not have epilepsy, or they have a combination of both epileptic and nonepileptic seizures. Because this number is derived from studies that enrolled patients suspected of having nonepileptic seizures, the actual number is probably lower.
None of the studies included in the above-mentioned meta-analysis contained pediatric patients. Thus, the prevalence of pediatric patients diagnosed with treatment resistant epilepsy and who either do not have epilepsy or have a combination of both epileptic and nonepileptic seizures is unknown.
These findings suggest that some patients enrolled in studies included in this Evidence Report may not have epilepsy. If this is the case, then our estimates of the efficacy of the interventions that we address may be imprecise.
A paucity of high-quality evidence limited our ability to draw evidence-based conclusions about measurement of serum prolactin levels as a diagnostic tool. Consequently, we were precluded from developing diagnostic decision-model algorithms that take into account the realities of clinical practice, where a differential diagnosis is based on information from many diagnostic technologies, not just information from a single diagnostic in isolation.
The only relevant diagnostic supported by a sufficient quantity of literature to allow evidence-based analysis was serum prolactin. The relatively low quality of this literature, however, precludes firm evidence-based conclusions. Rather, this literature only allows the conclusion that serum prolactin levels could plausibly distinguish epileptic seizures from some nonepileptic seizures. Further research is required to determine whether the performance of this test is sufficient to warrant its use in clinical practice.
Despite the importance of video-electroencephalography (vEEG) in diagnostic protocols aimed at differentiating epileptic seizures from nonepileptic seizures, we do not draw evidence-based conclusions regarding the diagnostic performance of this technology in the present report because less than five high quality studies were identified. The fact that evidence-based conclusions were not drawn should not be interpreted as evidence that this technology is not effective or useful. Indeed, vEEG may very well have an important role in diagnostic algorithms designed to differentiate patients with epilepsy from patients with nonepileptic seizure disorders. Until more high-quality studies become available, however, the diagnostic performance characteristics of vEEG and its place in such diagnostic algorithms cannot be determined.
There were too few acceptable studies addressing this question to permit analysis.
Because no evidence-based conclusions could be reached for Question 2C, the diagnostic modalities that are most useful in differentiating between different seizure types could not be determined.
Not all patients with treatment-resistant epilepsy receive optimized AED treatment.
The percentage of patients with treatment-resistant epilepsy who are not receiving optimized therapy is difficult to estimate. This is because of a lack of relevant, large, population-based studies. Further, many studies of AEDs do not report whether patients comply with their AED regimens.
Based on the recommendation of the partners, for the purposes of this question, sequential monotherapy is defined as changing a patient's drug regimen from one or many AEDs to a single, different AED. Polytherapy is defined as changing a patient's drug regimen from one or many AEDs to a different multiple-AED regimen. In this report, all polytherapy trials were trials of a single add-on AED. Optimized current therapy was defined as changing the dose and/or the frequency of administration. Based on the recommendation of the partners, we also included the removal of one or more drugs within this definition.
During long-term studies, an estimated 89 percent of patients continued to have seizures when switched to monotherapy. The remaining 11 percent of patients were seizure-free during the studies. When short-term studies were included, 16 percent of patients were seizure-free. However, because these data come from studies that indirectly addressed this issue, whether sequential monotherapy is directly responsible for these patients becoming seizure-free cannot be determined.
An estimated 16 percent of patients experienced a doubling of monthly seizure frequency during studies of sequential monotherapy.
An estimated 14 percent of patients experienced a doubling of two-day seizure frequency during studies of sequential monotherapy.
Sequential monotherapy required the removal of patients' prior AEDs, and in some patients the increases in seizure frequency were likely caused by this removal. Increases may be more likely in the subset of patients who switched from multiple AEDs to a single AED, but available data do not address this possibility.
These findings suggest that sequential monotherapy is more likely to increases seizures than to eliminate seizures.
One cannot determine the side effects (or their rates) associated with sequential monotherapy because no studies compared the adverse effects experienced by patients during sequential monotherapy with the adverse effects they had been experiencing during their prestudy drug regimens. Many patients (53 percent to 95 percent) experienced mild adverse reactions to the new monotherapy drug.
An estimated 5 percent of patients exited studies of sequential monotherapy due to adverse effects.
The findings listed above are applicable only to the drugs and doses examined in this report.
There was insufficient evidence to draw firm conclusions about the influence of sequential monotherapy on quality of life, mood, cognitive function, functional status/ability, ability to return to (or remain in) work, ability to return to (or remain in) school, ability to hold a driver's license, or mortality.
Adding certain AEDs to a patient's drug regimen has potential advantages and disadvantages. Patients who receive these add-on drugs are more likely to experience reductions in seizures compared to patients who receive an add-on placebo. However, recipients of these drugs are also more likely to experience adverse effects leading to trial exit than are placebo recipients (8 percent vs. 4 percent). Many patients (55 percent to 94 percent) experienced mild adverse effects while taking the new drugs.
The preceding estimates of the effect of add-on therapy are based on random-effects meta-analyses that combined different AEDs. These estimates serve as approximate guides for future research on polytherapy. However, their generalizability may be limited to the drugs and doses in the included trials. Further, the apparent effectiveness of an add-on drug may depend on concurrent medications. Thus, the results may not be applicable to patients receiving other concurrent medications. Also, the results of these trials cannot be generalized to other implementations of the polytherapy strategy (e.g., the addition of two drugs).
Insufficient evidence was available to draw firm conclusions about the influence of polytherapy on quality of life, mood, cognitive function, functional status/ability, ability to return to (or remain in) work, ability to return to (or remain in) school, ability to hold a driver's license, or mortality.
Drug reduction may lead to increases in seizure frequency in at least some patients. Although some patients experience reduced seizure frequency, these reductions were likely due to regression to the mean. The only other explanation is that the withdrawn drugs were somehow causing seizures. Given that the patients included in these studies had been on their baseline AED regimens for some time, this seems implausible.
Convincing evidence is lacking to suggest that drug reduction improves quality of life, mood, cognitive function, or that it reduces the occurrence of drug related adverse events. Thus, the available evidence suggests that implementation of the drug-reduction strategy, at least with the AEDs considered in this report, may lead to increases in seizure frequency and provide little benefit.
Due to limited data, no evidence-based conclusions could be drawn about optimized current therapy that employed dose increases or changes in frequency of administration.
No included studies directly compared the three AED strategies. Because of the different goals of optimized therapy and the other two AED strategies, these interventions cannot be compared. Differences in the severity of disease of patients given polytherapy and sequential monotherapy preclude quantitative comparison. However, sequential monotherapy was more likely to be harmful than to be beneficial. The reverse was true for polytherapy. These qualitative conclusions suggest that polytherapy may be clinically preferable to sequential monotherapy.
Threshold analyses of retrospective data suggest that 2 years after temporal lobe surgery, 55 percent of patients are completely seizure-free, and 68 percent are free of complex partial seizures. The retrospective case series design of the studies reporting these outcomes prevents stating that these rates are the direct result of surgery, because some patients may have become seizure-free without surgery. However, 50 percent of similar patients who did not receive surgery in similarly designed studies would have to be seizure-free before concluding that surgery did not improve this outcome. Similarly, 65 percent of similar patients who did not receive surgery would have to be free of complex partial seizures before concluding that surgery had no effect on complex partial seizures. To put these thresholds in context, published data from one RCT suggest that only 8 percent of patients who do not receive surgery become seizure-free. This suggests that many patients are seizure-free because of temporal lobe surgery.
Meta-analysis did not reveal any relationship between whether a patient becomes seizure-free after temporal lobe surgery and the patient's age at surgery, age at seizure onset, side of surgery, or the presence of simple partial seizures. Larger studies are required to prove that there is no relationship between these patient characteristics and the outcome of surgery.
The rate of new cases of depression after surgery ranges from 4 percent to 24 percent. Why this range is so wide is not clear, and whether surgery was responsible for these new cases cannot be determined.
Threshold analysis suggests that 3 percent of patients develop psychosis after surgery. However, data from one trial with similar patients who did not receive surgery suggest that as many as 2 percent of these patients develop psychosis. Two percent is also the threshold at which a relationship between surgery and the onset of psychosis becomes statistically nonsignificant. Therefore, surgery cannot be assumed responsible for new cases of psychosis.
Threshold analysis suggests that after temporal lobe surgery, approximately 13 percent of patients experience clinically significant increases in IQ and 10 percent of patients experience clinically significant decreases in IQ. The threshold analysis suggests that surgery may not be responsible for these changes if 10 percent of similar patients who did not receive surgery experienced an increase in IQ, and 7 percent of similar patients who did not receive surgery experienced a decrease in IQ. Data from one trial suggest that without surgery, 5 percent of patients experience a decrease and 5 percent of patients experience an increase in IQ. Therefore, if there is an effect of surgery on IQ, it does not affect large numbers of patients.
Approximately 2 percent of patients will experience permanent complications from temporal lobe surgery, primarily some form of partial paralysis. Data reported in studies of temporal lobe surgery reporting deaths due to surgery suggest that approximately 0.24 percent of patients will die because of the surgical procedure.
There was insufficient evidence to draw firm conclusions about the influence of temporal lobe surgery on quality of life, memory, functional status or ability, ability to return to (or remain in) work, ability to return to (or remain in) school, or ability to hold a driver's license.
Threshold analyses suggest that 2 years after corpus callosotomy, 20 percent of patients have achieved a 90 percent or better reduction in overall seizure frequency. The retrospective case series design of the studies reporting this outcome prevents stating that these rates are the direct result of surgery, because some patients may achieve a 90 percent reduction in seizure frequency without surgery. However, 15 percent of similar patients who did not receive surgery would have to experience a 90 percent or better reduction before concluding that surgery did not improve this outcome. No studies were available to provide context for these figures. Given the severity of patients' conditions, however, surgery is the most likely cause of these seizure reductions.
Despite the improvements seen in some patients, 16 percent of patients will achieve no reduction in overall seizure frequency or show an increase in seizure frequency after corpus callosotomy.
Threshold analysis suggests that 2 years after corpus callosotomy, 26 percent of patients will be free of their most disabling seizures. However, 20 percent of similar patients who did not receive surgery would have to become free of their most disabling seizures before concluding that surgery did not improve this outcome. No studies were available to provide context for these figures. Given the severity of patients' conditions, however, surgery is the most likely cause of these seizure reductions.
Approximately 3.6 percent of patients will experience serious complications after corpus callosotomy, primarily some form of partial paralysis, disconnection syndrome, or language difficulty. The precise mortality rate associated with this procedure is uncertain.
There was insufficient evidence to draw firm conclusions about the influence of corpus callosotomy on quality of life, mood, cognitive function, functional status/ability, ability to return to (or remain in) work, ability to return to (or remain in) school, or ability to hold a driver's license.
Studies of frontal lobe surgery report that 2 years after surgery, 20 percent to 100 percent of patients will be “seizure-free” depending on how this outcome is defined. These variations in outcome reporting prevented any meaningful threshold analysis.
Approximately 8.4 percent of patients will experience some type of complication after frontal lobe surgery, primarily some form of partial paralysis. However, this figure may be inaccurate because only two studies reported complications. Data reported in three studies of frontal lobe surgery reported only one death among 96 patients. These data are insufficient to estimate the true death rate from this type of surgery.
There was insufficient evidence to draw firm conclusions about the influence of frontal lobe surgery on quality of life, mood, cognitive function, functional status/ability, ability to return to (or remain in) work, ability to return to (or remain in) school, or ability to hold a driver's license.
Three studies reported that between 40 percent and 70 percent of patients who receive hemispherectomy are seizure-free 2 years after surgery. Approximately 7 percent of patients may receive no benefit from this surgery. The paucity of literature on this topic means that these rates are not precise. Given the severity of patients' conditions, however, surgery is the most likely cause of this improvement.
Ten studies reported only two serious permanent complications from surgery (0.8 percent). However, given the small number of patients examined in these 10 studies, this may not be a reliable estimate. Among the same studies, the percentage of patients developing a mild or transient complication was 21 percent. Data reported in 11 studies of hemispherectomy suggest that approximately 2.6 percent of patients (26 deaths per 1,000 patients) will die because of the surgical procedure.
There was insufficient evidence to draw firm conclusions about the influence of hemispherectomy on quality of life, mood, cognitive function, functional status/ability, ability to return to (or remain in) work, or ability to return to (or remain in) school.
Reported percentages of patients who are seizure-free six or more months after multiple subpial transection vary from 0 percent to 75 percent, depending on how “seizure-free” is defined. Similarly, the estimates for patients who do not benefit from this surgery vary from 0 percent to 42 percent. Consequently, the data are inconsistent across studies and do not allow for firm evidence-based conclusions as to the exact proportion of patients who will become seizure-free or who will not benefit from multiple subpial transection.
Nine studies reporting serious permanent complications from surgery estimated that approximately 5.9 percent of patients experience these types of complications, particularly aphasia or dysphasia. Although no deaths were reported in any of these studies, they may be reported in future studies.
There was insufficient evidence to draw firm conclusions about the influence of multiple subpial transection on quality of life, mood, cognitive function, functional status/ability, ability to return to (or remain in) work, ability to return to (or remain in) school, or ability to hold a driver's license.
Too few studies were available to allow for an evidence-based evaluation of parietal or occipital lobe surgery.
Trends from two RCTs suggest that vagal nerve stimulation (VNS), when applied as an adjunct intervention, safely provides limited seizure frequency reduction in some patients with treatment-resistant epilepsy. The precise degree of seizure reduction depends upon the specific measure of seizure frequency.
Currently available evidence does not suggest a dramatic effect of VNS on quality of life.
There was insufficient evidence to draw firm conclusions about the influence of VNS on mood, cognitive function, functional status/ability, ability to return to (or remain in) work, ability to return to (or remain in) school, or ability to hold a driver's license.
Too few studies were available to allow for an evidence-based evaluation of ketogenic diets, chiropractic procedures, acupuncture, hyperbaric oxygen therapy, herbal medicine and homeopathy, cranial realignment, magnetic therapy, electrical brain stimulation, and vitamin B6 therapy.
There were too few acceptable studies addressing this question to permit analysis.
There were too few acceptable studies addressing this question to permit analysis.
Persons with treatment-resistant epilepsy are approximately 2 to 10 times more likely to die compared to people in the general population. This excess mortality in persons with treatment-resistant epilepsy is largest among younger individuals.
Sudden unexpected death appears to be a major cause of death among patients with treatment-resistant epilepsy, representing 6 percent to 55 percent of the total deaths in studies that reported relevant data.
Drowning rates are higher among treatment-resistant patients with epilepsy compared to the general population. Higher quality evidence is needed to determine the precise magnitude of the difference in drowning rates.
There is insufficient evidence to determine whether accident-related mortality, or mortality due to pneumonia, aspiration, suicide or cancer is higher among persons with epilepsy compared to the general population.
Generalized tonic-clonic seizures appear to increase the risk of sudden death.
The relationship between overall seizure frequency and sudden death is uncertain.
Our analysis suggests that at least some patients receiving treatment for epilepsy either do not have epilepsy or have another condition in addition to epilepsy that also causes seizures or seizure-like events. Studies that clearly describe the diagnostic procedures used to confirm that patients actually have epilepsy are needed and would present a more accurate assessment of the efficacy of the treatment under study. Our analysis also suggests that some patients receive AEDs at less than the maximum tolerable dose. Future studies could ensure that patients are truly treatment-resistant by enrolling only subjects who are optimized and compliant with their current therapy.
In the absence of a control group, the effects of treatment cannot be differentiated from placebo effects, regression to the mean, extraneous events, or other threats to internal validity. Although there are situations in which controlled trials are impractical, controlled trials are needed to provide a more accurate picture of the effects of treatment.
Studies with inadequate numbers of patients cannot detect clinically meaningful differences in outcomes between treatment groups. When designing clinical trials, a priori power analysis calculations can be used as a guide to ensure that sufficient numbers of patients are enrolled so that the proposed trial can uncover clinically meaningful relationships between treatments and outcomes.
Many publications do not contain sufficient information to enable the reader to accurately judge the evidence. Some confusion could be alleviated if seizure-free outcome measurements were standardized. A well-reported trial would include seizure frequency as well as a measure of data dispersion, both at baseline and at several followup periods.
The lack of an accepted gold standard for the differential diagnosis of epileptic seizures from nonepileptic seizures makes evaluating the utility of any given diagnostic problematic. This is because of the difficulty in verifying that the diagnostic decisions that result from the use of the test are correct. Given this lack of an acceptable gold standard, attempting to determine whether the use of a diagnostic improves patient outcomes may offer a fruitful avenue for future research. Such an approach requires determining whether the use of the diagnostic of interest ultimately leads to improved patient outcomes and, as a consequence, requires a prospective, randomized controlled trial.
Because a diagnosis of epilepsy is not made based on the findings of a single diagnostic technology, studies are needed to evaluate the effectiveness of different clinical algorithms that utilize data collected from combinations of diagnostic technologies. Again, this approach would require a prospective, randomized controlled trial.
In the literature on drug strategies, an important direction for future research involves direct comparisons between the drug strategies for treatment-resistant epilepsy. None of the studies included in our assessment of drug strategies made direct comparisons between sequential monotherapy and polytherapy. Ideally, a trial would randomize patients to different drug strategies, and compare seizure frequency outcomes as well as adverse effects of treatment.
Another area for future research on drugs concerns the adverse effects patients experience from their pretrial drug regimens and changes in these adverse effects on the new treatment regime. Changes in the frequency and severity of the adverse effects associated with each drug treatment strategy need to be evaluated, because patients and clinicians seek to reduce adverse effects as well as seizure frequency.
Prospective studies of surgical interventions are needed. This approach would allow seizure and nonseizure-related outcome measures to be recorded at multiple followup periods (1 year, 2 year, 5 year, etc.) rather than the single mean or median followup reported in most retrospective studies. Better reporting of patient characteristics is also needed and, if possible, individual patient characteristics should be reported when study sizes are small (less than 20 patients). Studies reporting standardized quality of life measures, validated for patients with epilepsy, would help in determining the effect of surgery on this important nonseizure-related outcome. Studies reporting other types of nonseizure-related outcome measures, such as employment, education, and cognitive function data, are also needed.
Higher quality controlled trials are particularly lacking for the nonmedical treatments such as education and training in skills that may help prevent seizures or enable patients to better adapt to seizures. This area constitutes another important direction for future research.
Reporting of employment and schooling status among patients with treatment-resistant epilepsy is particularly lacking in both the medical and nonmedical treatment literature. The ideal study design to address this question would be a prospective cohort study using multiple regression techniques to evaluate the potential correlation between specific patient characteristics and the ability to work or attend school both before and after treatment. This is an area in particular need of future research and higher quality studies.
The present literature has a number of large (mostly retrospective) studies that have calculated standardized mortality rates (SMRs) for overall mortality, but few studies have calculated separate SMRs for specific causes of death or subgroups of specific ages. To generate meaningful data, cohort studies must enroll sufficient numbers of patients and follow the patients for sufficient periods. The most useful study of mortality among patients with treatment-resistant epilepsy would be a large prospective study that followed patients for several years. In addition to calculating an SMR for overall mortality, the study would calculate SMRs for specific causes of death, especially those that could be related to epilepsy (such as accidents, drowning, and motor vehicle accidents).
Large prospective studies where all suspected sudden unexpected death in epilepsy (SUDEP) cases receive an autopsy are needed. An autopsy is particularly important because it provides the best evidence that the death did not have an explainable cause. This would increase the accuracy of estimates of SUDEP rates for different age subgroups of patients with treatment-resistant epilepsy.
More prospective case-control studies using multiple regression analysis would be useful to address the potential relationship between SUDEP and seizure type or frequency. Future studies would ideally include a hundred patients or more to ensure that there is adequate statistical power to detect correlations. Multiple regression analysis is needed to reduce the effect of possible confounding variables and increase the likelihood that an observed statistically significant correlation represents an actual causal relationship.
The objective of this report is to evaluate and synthesize, in an evidence-based fashion, the published literature on the management of treatment-resistant epilepsy. Epilepsy is a condition characterized by recurrent, unprovoked seizures. The term “seizure” is an inclusive generic term that encompasses the clinical manifestations of epilepsy as well as other disorders. Epileptic seizures arise from the abnormal discharge of electrical activity by cerebral neurons, and result in loss of consciousness, alterations in perception or impairment of psychic functions, convulsive movements, disturbances of sensation, or some combination of these events.1 Nonepileptic seizures (NES), such as psychogenic (hysterical) seizures and seizures associated with syncope, are not caused by an abnormal neuronal discharge.
The first part of this Evidence Report deals with how the published literature defines “treatment-resistant” epilepsy. An acceptable definition for treatment-resistant epilepsy could aid in the identification and management of these patients. The terms “medically intractable” epilepsy and “refractory” epilepsy are frequently used to describe patients with uncontrolled seizures that do not respond to appropriate antiepileptic drugs (AEDs).2, 3 Other terms, with or without descriptive details, are also used.
This Evidence Report then considers the methods of diagnosis used to determine if a patient has treatment-resistant epilepsy. This section examines the possibility that some patients diagnosed with treatment-resistant epilepsy in fact have conditions other than epilepsy. We assessed diagnostic procedures that differentiate epileptic seizures from nonepileptic seizures and diagnostic procedures that aid in the diagnosis of epilepsy. Diagnostic procedures used to localize epileptogenic foci prior to surgery are not addressed in this report.
An evaluation of treatment interventions for patients with treatment-resistant epilepsy, specifically pharmacological and surgical procedures, as well as some nondrug/nonsurgical interventions, comprises a large part of this report. Rather than evaluate separate drugs, this report looks at which drug treatment strategy (sequential monotherapy, polytherapy, or optimized current therapy) may benefit patients with treatment-resistant epilepsy. In addition, we also examined a variety of surgical procedures and nondrug, nonsurgical interventions.
The final sections of this Evidence Report examine the potential impact of special services on patients with treatment-resistant epilepsy and the effect of treatment-resistant epilepsy on employment, education, and mortality. These areas are critical in the continuing management of patients with treatment-resistant epilepsy and in evaluating interventions beyond their effect on seizure frequency. The literature was examined for information on occupational, speech, and physical therapies, patient education, neuropsychological evaluation, and psychiatric consultation and treatment.
This report was prepared at the request of the Centers for Disease Control and Prevention and the Social Security Administration in an effort to evaluate the diagnostic procedures available for identifying patients with treatment-resistant epilepsy, identify the characteristics of patients with treatment-resistant epilepsy that interfere with employment and schooling, and assess the potential benefits of the available medical and nonmedical interventions for patients with treatment-resistant epilepsy. The patient population of interest in this report includes infants, children, and adults with treatment-resistant epilepsy.
An epileptic seizure can be defined clinically as an intermittent, stereotyped, disturbance of consciousness, behavior, emotion, motor function, or sensation that results from abnormal cortical neuronal discharge and recur without provocation.4 The discharge may result in an almost instantaneous loss of consciousness, alteration of perception or impairment of psychic function, convulsive movements, disturbance of sensation, or some combination of these events.1 The onset of the abnormal neuronal discharge may be widespread and bilateral or it may be localized. In the latter instance, the abnormal discharge arises from an assemblage of excitable neurons, called a focus, in any part of the cerebral cortex that may or may not be associated with a visible lesion.a Cortical excitation may then spread to the adjacent cortex and to the contralateral cortex through interhemispheric pathways as well as to subcortical areas such as the basal ganglion, thalamus, and brainstem. Clinical manifestations of a seizure occur when the excitation reaches these areas. In rare instances, death may occur due to sustained cessation of respiration, derangement of cardiac action, or some unknown cause.
The focal cortical lesions responsible for the abnormal discharges associated with epileptic seizures may arise from a variety of causes. Epilepsies in which no pathological lesion can be found are referred to as primary or idiopathic epilepsies and include certain generalized tonic-clonic and absence seizure conditions.1 The underlying cause of these seizure types is probably genetic. Secondary or symptomatic epilepsies are associated with a discernable lesion. Secondary epilepsies include simple partial seizures and complex partial seizures. The lesions may be zones of neuronal loss and scarring (sclerosis or gliosis), vascular malformations, tumors, or cortical dysplasia. Many patients with temporal lobe epilepsy have a condition called mesial temporal sclerosis (MTS) that is characterized by a loss of volume and scarring in the hippocampus and adjacent gyri on one or both sides. Posttraumatic epilepsy may occur after head trauma, brain surgery, and various infections that are responsible for creating focal lesions that result in epilepsy.
Epilepsy seizures may be classified according to etiology, site of origin, clinical form, frequency, or electrophysiologic characteristics.1 Classification schemes have repeatedly changed, but the most commonly used scheme is the one adopted by the Commission on Classification and Terminology of the International League Against Epilepsy.5 This classification is based primarily on the clinical form of the seizure and its electroencephalographic (EEG) features. Seizures are divided into partial seizures (a focal or localized onset can be discerned) and generalized seizures (bilateral origin and diffuse cerebral cortical involvement from the onset). Partial seizures that develop into generalized seizures are referred to as secondarily generalized seizures. Partial seizures (also called focal seizures) are further classified as simple when consciousness is maintained, and complex if consciousness is altered or lost. Simple partial seizures can be motor, sensory, autonomic, or psychic. Simple partial seizures are also called auras and may be a precursor to a complex seizure or may constitute the entire seizure. Generalized seizures may be convulsive or nonconvulsive. The common convulsive type is the tonic-clonic seizure and the common nonconvulsive type is the absence seizure that is characterized by a brief lapse of consciousness.
Patients with treatment-resistant epilepsy may experience one or more of the various types of epileptic seizures depending on the etiology of their condition. The following is a brief description of the clinical characteristics of each type of epileptic seizure that a patient with treatment-resistant epilepsy may experience. Adams and Victor's Principles of Neurology was used a guide in preparing these descriptions.1
Generalized tonic-clonic seizures are sometimes preceded by subjective phenomena (prodromes) that may take the form of psychological changes or myoclonic jerks of the trunk or limbs. Most often, the seizures occur without warning and the patient loses consciousness and falls to the ground. The seizure starts with an initial flexion of the trunk, opening of the mouth and eyelids, and upward deviation of the eyes. The tonic phase follows with protracted extension of the back and neck and then the arms and legs. Breathing is also impaired. The tonic phase lasts for 10 to 20 seconds. The clonic phase follows the tonic phase with a mild generalized tremor that represents relaxation of the tonic contractions. The rate of contractions gradually lessens over 30 seconds. Breathing is still impaired until the end of the clonic phase. Finally, all movements end. Upon regaining consciousness, the patient usually experiences a period of disorientation and/or fatigue.
Absence seizures are very brief, always associated with impaired consciousness, and may or may not have accompanying abnormal motor activity. Some are so short as to resemble daydreaming. The seizure comes without warning and consists of a sudden interruption of consciousness. These patients usually do not fall. After 2 to 10 seconds the patients become conscious again and resume their preseizure activity. Absence seizures are the most common form of epileptic seizure of childhood and rarely begin before 4 years of age or after puberty. Absence seizures tend to occur frequently, with as many as several hundred occurring in a single day. This type of seizure may be the only type seen in childhood and the attacks diminish in frequency with age and eventually disappear. However, absence seizures may be replaced in some instances by generalized tonic-clonic seizures.
Partial seizures are the product of focal abnormalities in some part of the cerebral cortex. Such lesions are often associated with focal EEG abnormalities. Simple partial seizures most often derive from a focal area in the sensory and motor cortex, while complex partial seizures most often derive from the temporal lobe of one side. Somatosensory seizures usually have a focus in the precentral or postcentral convolution. The sensation is described as numbness or tingling that usually starts in the lips, fingers, or toes and spreads to adjacent parts of the body. Visceral seizures (vague feelings in the thorax and abdomen) are some of the most frequent simple partial seizures. Other less common types of simple partial seizures include visual seizures (sensation of darkness and flashes of light), auditory hallucinations (buzzing or roaring in the ears), vertiginous sensations, and olfactory hallucinations (abnormal odors).
Loss of consciousness distinguishes complex partial seizures from simple partial seizures. However, unlike generalized tonic-clonic seizures, the patient suffers a period of altered behavior and consciousness with no later recollection, instead of a complete loss of control of thought and action. Any type of complex partial seizure may develop into other forms of secondary generalized seizures. Complex partial seizures are not exclusive to any age group but show an increased incidence in adolescence and adult years.
The Lennox-Gastaut Syndrome is a neurological syndrome characterized by frequent seizures of various types. These patients have intellectual impairment and serious neurologic disease. The syndrome is usually diagnosed between 2 and 6 years of age.
Temporal lobe epilepsy, which is typically manifested by complex partial seizures described above, commonly starts before 10 years of age. By adolescence or early adulthood, this condition often becomes treatment-resistant.6 Recurrent seizures may cause damage in the hippocampus resulting in a progressive loss in hippocampal volume as long as the seizures persist.7
Arroyos, Brodie, Avanzini, et al.3 suggest that treatment-resistant epilepsy may be a distinct condition within epilepsy characterized by progressive neuronal, cognitive, and psychosocial deterioration. These authors point out that several markers have been advanced as indicators of treatment-resistant epilepsy. These include age at onset younger than 1 year, type of epilepsy (partial epilepsies or catastrophic epilepsies of childhood), failure of the first AED, use of more than two drugs, duration of treatment without achieving control, and specific pathologies. However, these markers may not be especially sensitive or specific for treatment-resistant epilepsy.3
Epilepsy is among the most common serious neurologic condition, with a prevalence rate 10 times higher than multiple sclerosis and 100 times higher than motor neuron disease.8 An International League Against Epilepsy (ILAE) Commission Report from 1997 gives the prevalence of active epilepsy as 40 to100 in 10,000.9 The ILAE prevalence and incidence rates are based on a review of selected studies, mostly from developed countries, by Sander and Shorvon .10 Both rates varied widely among the studies included in this review. Their review found prevalence rates that ranged from 1.5 to 31 per 1,000 and incidence rates that varied from 11 to 134 per 100,000. The authors believe that the highest prevalence and incidence rates are more accurate because studies with these findings used more intensive and sophisticated case ascertainment methods. They further state that the low rates found in some studies were probably due to deficiencies in patient reporting (physical manifestations are transient and not observed by a clinician, patients are unaware of or deny their condition) and in diagnosis of epilepsy (syncope and psychogenic seizures are misdiagnosed as epilepsy, diagnostic criteria are unspecified or loosely defined). Sander and Shorvon10 also believe that retrospective review of patient medical records in most case ascertainment studies also leads to an underestimate of prevalence and incidence.
In addition to the immediate effects of seizures, patients with epilepsy may also have to contend with the following burdens:4, 11, 12
interference with normal activities
increased possibility of accidental injury and even death (addressed in Key Question 8 in this report)
impaired physical, psychological, and social functioning
economic loss
diminished quality of life
psychiatric disorders, in particular, anxiety and depression13
| Drug Strategies | Surgery | Nondrug, Nonsurgical | Service-related Interventions | Diagnostics |
|---|---|---|---|---|
| Sequential monotherapy | Temporal lobe | Vagal nerve stimulation (VNS) | Multidisciplinary neurobehavioral treatments | Blood prolactin |
| Polytherapy | Hemispherectomy | Ketogenic diets | EEG biofeedback | Routine-EEG |
| Drug reduction | Corpus callosotomy | Chiropractic | Epilepsy education | Video-EEG |
| Maximum tolerable dosage | Multiple subpial transection | Acupuncture | Vocational services | Ambulatory-EEG |
| Dose frequency optimization | Frontal lobe | Hyperbaric oxygen | Physical exercise | Blood creatine kinase |
| Parietal lobe | Herbal medicine and homeopathy | Medical resonance therapy music | Computed tomography | |
| Occipital lobe | Cranial realignment | Sahaja yoga | Magnetic resonance imaging | |
| Magnetic therapy | Meditation | Single Photon Emission Computed Tomography | ||
| Electrical brain stimulation | Self-help group (group therapy) | Minnesota Multiphasic Personal Inventory | ||
| Vitamin B6 | Counseling | Provocation techniques | ||
| Progressive muscle relaxation | Tilt table | |||
| End-tidal CO2 biofeedback | Auditory evoked potentials | |||
| Systematic desensitization | Hypnotic recall | |||
| Tongue biting | ||||
Note: Bolded, italicized technologies are those addressed in this report. The remaining technologies were not addressed due to insufficient literature
There is no generally accepted definition of treatment-resistant epilepsy. However, for the purposes of retrieving articles for this report, an operational definition of this term was needed. Accordingly, we defined treatment resistance as failure of one or more AEDs at a maximum tolerable dose to provide complete seizure relief.
This definition was based on consensus obtained during a 1-day meeting with an Expert Panel and subsequent discussions with Technical Experts, during discussions with the two agencies that requested this report, the Centers for Disease Control and Prevention (CDC) and the Social Security Administration (SSA), and in consultation with the Agency for Healthcare Research and Quality (AHRQ).
Many articles did not provide sufficient information to allow a determination of the definition employed (see our conclusions to Question 1). Consequently, and after consultation with the Technical Experts, CDC and SSA, we included articles even if they only stated that the enrolled patients were “treatment-resistant”, or used some other synonym (see Question 1 for examples of such synonyms).
At the beginning of this project, we worked with an Expert Panel that assisted in defining the scope of this Evidence Report, developing its questions, defining the outcomes of interest, and developing the criteria for retrieving and including articles. The involvement of this panel consisted of their participation in a 1-day meeting with ECRI, AHRQ, and representatives of SSA and CDC.
To establish the Expert Panel, we solicited nine organizations to nominate individuals who could serve as its members. All solicitations were preapproved by AHRQ, and all nine organizations nominated an individual. Thus, the Expert Panel was comprised of individuals from the following organizations:
American Academy of Neurology
American Academy of Pediatrics
American College of Occupational and Environmental Medicine
American Epilepsy Society
Child Neurology Society
Citizens United for Research in Epilepsy
Epilepsy Foundation/Penn Epilepsy Center
National Association of Epilepsy Centers
Society for Behavioral and Cognitive Neurology
The participation of these individuals and organizations in this project does not imply their endorsement of the findings of this Evidence Report.
Subsequent to the 1-day meeting, the Expert Panel was disbanded, and a group of Technical Experts was formed. We collaborated with this group to further refine this project's scope, questions, outcomes of interest, and criteria for retrieving and including articles. The Technical Experts also served as a source of information throughout the project. Collaboration with these Experts was accomplished through telephone conversations and e-mail.
The Technical Experts were comprised of all of the members of the Expert Panel and representatives from:
Columbia Presbyterian Medical Center
Harborview Medical Center
Johns Hopkins School of Medicine
Strategic Health Institute
As with the Expert Panel, the participation of these individuals and organizations in this project does not imply their endorsement of the findings of this Evidence Report.
This report addresses nine Questions arrived at through the discussions with the Expert Panel, Technical Experts, and representatives from AHRQ, SSA and CDC. These are:
Question #1: What are the definitions of treatment-resistant epilepsy used in the literature?
Question #2: Which methods of rediagnosing or re-evaluating treatment-resistant epilepsy lead to, or can be expected to lead to improved patient outcomes?
Question #3: Is there evidence that patients with treatment-resistant epilepsy are not optimized at their current level of treatment?
Question #4: Which drug treatment strategy, 1) sequential monotherapy, 2) polytherapy, or 3) optimized current therapy leads to improved outcomes for patients with treatment-resistant epilepsy, and what are the relative improvements obtained with each strategy?
Question #5: Which methods of nondrug treatment for epilepsy after initial treatment failure lead to improved outcomes for patients with treatment-resistant epilepsy?
Question #6: Which social, psychological or psychiatric services for treatment-resistant epilepsy lead to, or can be expected to lead to improved patient outcomes?
Question #7: What characteristics of treatment-resistant epilepsy interfere with ability to obtain and maintain employment, or attend and perform well in school?
Question #8: What is the mortality rate of patients with treatment-resistant epilepsy?
Question #9: Is there a correlation between the number and/or type of seizure and sudden death?
The scope of this report can be illustrated by a causal pathway. More specifically, this pathway illustrates the Key Questions and the relationships among them. It also illustrates items that are beyond the scope of this Evidence Report. This pathway is shown in Figure 1
Several boxes represent endpoints in the causal pathway. These are identified by double borders. Patients reaching these endpoints do not go on to additional treatments or diagnostic procedures. Although boxes with no arrows emerging from them represent end points in terms of reporting in published studies, the patients themselves may go on to receive additional treatments and experience further outcomes. The outcomes examined in this Evidence Report were determined by the Expert Panel, Technical Experts, CDC, and SSA. Two outcomes, death and performance in school or work, are broken out from other outcomes because they are specifically addressed by their own questions.
The dashed lines in the figure “overarch” several rectangles. We have drawn these lines as dashed because they do not depict the sequence of events in the clinical pathway. In general, these lines portray questions about how patient characteristics (including clinical findings) may influence outcomes.
To obtain information for this report, we systematically searched 23 electronic databases. These were:
Center for International Rehabilitation Research Information and Exchange (CIRRIE) (searched November 30, 2001)
Cochrane Database of Systematic Reviews (through 2001, Issue 4)
Cochrane Registry of Clinical Trials (through 2001, Issue 4)
Cochrane Review Methodology Database (through 2001, Issue 4)
Cumulative Index to Nursing and Allied Health (CINAHL) (1988 through January 11, 2002)
Database of Reviews of Effectiveness (Cochrane Library) (through 2001, Issue 4)
ECRI Health Devices Alerts (1977 through January 2002)
ECRI Health Devices Sourcebase (through January 2002)
ECRI Healthcare Standards (1975 through January 2002)
ECRI International Health Technology Assessment (IHTA) (1990 through January 2002)
ECRI Library Catalog (through January 2002)
ECRI TARGET (through January 2002)
Embase (Excerpta Medica) (1975 through January 2002)
ERIC (Educational Resources Information Center) (searched January 8, 2002)
Health and Psychosocial Instruments (HAPI) (through April 27, 2001)
LocatorPlus (through January 2002)
NDA Pipeline (searched November 1, 2001)
PsycINFO (1975 through January 31, 2002)
PubMed® (MEDLINE®, PreMEDLINE®, HealthSTAR) (1975 through January 2002)
Rehabdata (searched April 24, 2001)
U.K. National Health Service (NHS) Economic Evaluation Database (EED) (through January 2002)
U.S. Centers for Medicare & Medicaid Services (CMS) (formerly HCFA) (through January 2002)
U.S. National Guidelines ClearinghouseTM (NGC) (through January 2002)
We employed different searches for different sections of the report, including different searches for different questions. The strategies for these different searches, given in PubMed® /MEDLINE® syntax, are provided in Appendix A.
In addition to the above searches, we also reviewed the bibliographies and reference lists of all studies included in this Evidence Report, and searched Current Contents—Clinical Medicine® on a weekly basis.
To be included in this Evidence Report, an article had to meet specific a priori criteria. Some of these criteria were specific to each question, and these are listed at the beginning of the discussion of each question in the Results section of this Evidence Report. Some criteria were common to all questions except Questions #1 and #9. These common inclusion criteriab were:
The article described a study that enrolled (unless otherwise noted in certain questions) only patients with treatment-resistant epilepsy or, if other patients were enrolled, data from treatment-resistant patients were separately presented.
Only full-length articles were included. We did not include meeting abstracts because they are often preliminary reports of results, and they seldom contain sufficient detail to allow evaluation of study design.
The article described a study that must (unless otherwise noted in certain questions) have been published in 1985 or later. We adopted this criterion in accordance with the wishes of the Expert Panel, which stated that treatments for epilepsy, the technologies associated with the diagnosis of the disease, and the classification of its seizure types have substantially changed from what they were prior to 1985.
Articles had to be English-language. We adopted this criterion out of consideration of the time and budget allotted for this project.
The article described a study that enrolled 10 or more patients. Smaller studies may be of unusual methods or patients. Therefore, their findings may not be applicable to other patients or to settings outside the ones in which the study was conducted. Further, small surgical series may represent studies conducted by physicians who have comparatively little experience with the procedure.
The article described a study that quantitatively reported an outcome or diagnostic test result of interest. Qualitative expressions of results do not allow conclusions to be drawn about how well a treatment works (i.e., about effect sizes) and provide little assurance that results were rigorously evaluated by the investigators.
As per the desires of the Expert Panel and Technical Experts, a study of an intervention must have reported data on one or more of the following outcomes:
Outcomes related to seizure frequency:
Absolute seizure frequency
Percentage change in seizure frequency from baseline
Proportion of patients seizure-free
Proportion of patients with >50 percent reduction in seizure frequency from baseline
Engel Classification
Rundown time to seizure-free
Seizure-free period
Proportion of patients with any reduction in seizures
Proportion of patients with any increase in seizures
Proportion of patients exiting a trial due to harmful seizure increases
Nonseizure frequency outcomes:
Quality of life
Mood (we used this as a general term to describe a range of outcomes, from depression to psychosis)
Functional status/ability
Cognitive Function
Ability to stay in or return to work
Ability to stay in or return to school
Ability to hold a driver's license
Adverse events
Mortality
Articles that present data pertaining to quality of life, mood, or cognitive function must have used a validated psychometric instrument. For the purposes of this Evidence Report, a validated psychometric instrument is an instrument for which there is evidence in the peer-reviewed literature to demonstrate that it has construct validity (it measures what it purports to measure) and good reliability (e.g. test-retest reliability; inter-rater reliability). Ideally, the instrument would have been validated using patients with epilepsy. However, given the scarcity of such data on quality of life and psychological status, we decided a priori not to require that all psychometric instruments be validated in a population of patients with epilepsy.
If there were fewer than five studies of a given intervention or diagnostic, and none of these studies was a randomized controlled trial (RCT) that enrolled 50 or more patients in the treatment group, we did not include any studies of the diagnostic or intervention. Where an RCT with 50 or more patients did exist, we included that RCT even if there were fewer than five studies. We adopted the criterion partly out of consideration of the time and budget allotted for this project. However, this criterion also reduces the potential that publication bias will influence our conclusions. Conclusions drawn from small literature bases could be overturned by the results of only one or two unpublished studies. Therefore, requiring that five studies be available before analyzing a given intervention or diagnostic helps to reduce publication bias.
When five or more controlled studies addressed a given intervention or diagnostic, we included only controlled studies. Otherwise, uncontrolled studies were included.
Except for surgical topics (where the great majority of studies were retrospective) when five or more prospective studies addressed a given intervention or diagnostic, we included only the prospective studies. Otherwise, retrospective studies were included.
When there were several publications describing the same trial, only the largest and most recent publication was included. This avoids double counting patients and the consequent distortion of measurements of effect size. We included earlier studies that reported data not in later publications, and earlier publications that contained data from more patients than later publications.
All surgery trials meeting the inclusion criteria were further examined for patient overlap by cross-matching years of patient enrollment within articles published by the same surgery center. When articles presented overlapping patient populations, the article containing the most recent patient enrollment periods was included. When patient populations did not overlap, all articles from a single center were included.
The seizure types examined in a study were classified according to the International League Against Epilepsy's International Classification of Epileptic Seizures, published in 1981,5, 14 or the article used terminology that was consistent with this classification.
As stated above, these criteria do not apply to Key Questions 1 and 9. We provide the criteria relevant to these questions in the text associated with them.
We reviewed the abstracts of articles identified by our searches against the inclusion criteria to determine whether we would retrieve it. Five research analysts independently performed this task, and each analyst worked on different questions. We retrieved an article whenever there was uncertainty about whether it met the inclusion criteria. We also retrieved articles when an abstract was not present in the search results, but the title of the article suggested that it might be relevant.
Once an article was retrieved, it was examined to determine whether it met the appropriate inclusion criteria. Articles that met these criteria were first examined for “fatal flaws” that precluded interpreting their results. Such articles were excluded. When an article was excluded for design flaws, we presented the reason(s) for its exclusion in Evidence Tables associated with each of the nine Key Questions in this report.
We then examined the remaining articles for design flaws that could potentially bias their results. Wherever possible, we empirically evaluated studies for the presence of bias. When data-driven evaluations of study quality were not possible, we documented and explained a study's potential for bias.
We identified 11,111 articles with our searches. We retrieved 2,356 of these for Questions 2–9 according to a priori criteria. Three hundred fifty-seven articles remained after evaluating whether the full article met these criteria. After evaluating these latter articles for design flaws so severe that their results could not be interpreted, and after determining whether there were too few articles to permit a firm evidence-based conclusion (see above for Article Inclusion Criteria), 305 articles remained, and were included in this Evidence Report for Questions 2–9. One hundred eighty-five articles for Question #1 (on definitions of treatment-resistant epilepsy) were randomly sampled from these 305 articles.
| Key Question Number | Retrieved | Met Criteria | Included |
|---|---|---|---|
| 1 | 185 | 185 | 185 |
| 2 | 470 | 34 | 10 |
| 3 | 201 | 20 | 20 |
| 4 | 654 | 57 | 50 |
| 5 | 804 | 206 | 206 |
| 6 | 86 | 12 | 0 |
| 7 | 48 | 5 | 0 |
| 8 | 61 | 14 | 10 |
| 9 | 32 | 9 | 9 |
| Totals = | 2541 | 542 | 490 |
Note: Articles for Question 1 were also used to address other questions. Because these articles were evaluated twice, each for a different purpose, they are double-counted in the totals shown in the table. The article count in the text does not incorporate this double-counting
We performed meta-analyses of data from RCTs and uncontrolled trials. We performed meta-analyses of RCTs to estimate an average effect of treatment. We performed meta-analytic threshold analyses of uncontrolled studies to determine whether an intervention was plausibly effective. All meta-analyses, except meta-regressions, were performed with software programs developed by ECRI. This software has been extensively validated using published examples and hand calculations. Meta-regressions were performed using SPSS (version 10.1) Statistical Software (Copyright © SPSS Inc., 1989-2000).
Our meta-analyses of RCTs are exclusively comprised of random effects models. These analyses appear only in Question 4, and only in regard to the polytherapy drug strategy. We employed random effects models because of the types of trials that addressed this question. These trials were each of a single AED, and not all trials studied the same drug. Therefore, no assumption is made that these trials were all drawn from a single population.
An important aspect of these meta-analyses is that each of the trials is an instance of polytherapy, rather than a study of polytherapy, per se. This is because a planned study of polytherapy would investigate more than one drug and means that each trial in each meta-analysis in Question 4 represents only one way polytherapy might be tested. However, combining these trials into a single meta-analysis of polytherapy means that this analysis approximates a single trial that directly studied this strategy of drug administration.
Another important aspect of the RCTs in our meta-analyses is that some of the trials consisted of more than two groups and therefore have more than two effect sizes. These effects are not independent of each other. Ideally, sophisticated statistical techniques would be used (e.g., general linear modeling or hierarchical regression) to account for this dependence. However, in the present case, there was too little published information to permit such analyses. Therefore, we conducted two analyses on the data from any given set of trials. In the first analysis, we evaluated the effects of the intervention by comparing outcomes in the group that received a study's highest drug dose to outcomes in the placebo group. In the second analysis, we compared outcomes in the group that received a study's lowest drug dose to outcomes in the placebo group. Additional details about the specific analyses we conducted are provided in Question 4.
Because the results of these analyses are not independent, if a meta-analysis of a low dose of drug is statistically significant, a subsequent finding that the effects of high doses are also statistically significant does not provide “twice as much” evidence that the treatment is effective. Further, because the data allow computation of only a meta-analytic summary statistic, and do not permit attempts to explain the cause(s) of any between-studies heterogeneity, some information loss accompanies all of these analyses.
Our random effects analyses were performed as described by DerSimonian and Laird.15 As the measure of the effectiveness of treatment, we employed Cohen's h, the difference between the arcsine transform of two proportions divided by the pooled standard error.
We did not compare the effectiveness of different drugs. This conforms to the wishes of the Expert Panel and Technical Experts, who advised that such comparisons were of secondary importance.
As mentioned above, our meta-analytic threshold analyses of uncontrolled studies are not intended to produce a summary statistic. Rather, we performed these analyses to assist readers in determining whether an intervention is plausibly effective. In these threshold analyses, we meta-analytically compared the improvement rate in treated patients to increasing rates of improvement in a hypothetical control group. Starting at 0 percent, we increased the rate of improvement in the “control” patients until the difference in improvement between the treated and “control” groups was no longer statistically significant. This value is the threshold. Thus, the threshold is the proportion of untreated patients who would have to improve to render the effect of treatment statistically nonsignificant. Except for analyses of sequential monotherapy, all of these analyses employed fixed effects models. In analyses of sequential monotherapy, we employed random effects models due to the use of different drugs as monotherapy. Where possible, we provide context for these thresholds by supplementing them with historical data on “control” patients obtained from published articles.
We also report the percentage of patients who improved after the intervention, but note that this percentage is not the difference between improvement in a treated and a control group and, therefore, is not the net effectiveness of the intervention. Nevertheless, this percentage is informative because it represents the proportion of patients likely to improve, regardless of the cause of their improvement. We estimated this percentage by back-transforming the summary statistic from the meta-analysis into a percentage.
We conducted these analyses using Cohen's h as the test statistic. In all analyses, we assumed that the number of patients in the “control” group equaled the number of patients in the treated group. We chose Cohen's h because, under these conditions, the Q statistic and each study's standardized residual remain constant as the proportion of improved patients in the “control” group increases.
Whenever statistically significant heterogeneity among the study results was detected in our threshold analyses, we attempted to “explain” the heterogeneity using meta-regression. We used the Q statistic to determine whether an analysis was heterogeneous. Because this statistic is conservative,16, 17 we adopted a p-value of 0.10 (as opposed to the traditional significance level of 0.05) as the critical value for statistical significance.18
Typically, there was no strong a priori hypothesis to “explain” this heterogeneity. Therefore, we generated a set of regression models for any meta-analysis in which we found statistically significant heterogeneity. We constructed this set by first computing all possible models containing one predictor variable. The number of available predictor variables was often limited by incomplete reporting. This is because we required that at least 90 percent of the studies report the value of a given variable before we entered it into a meta-regression. When more than 90 percent of studies, but less than 100 percent of studies reported the value of a given variable, we assumed the mean value of the variable for the missing data.
After generating all possible one-predictor models, we generated all possible two-predictor models except those containing the coefficients that were not significant in the one-predictor models. Finally, we constructed all three-predictor models except those containing a pair of coefficients that were nonsignificant in the two-predictor models. We only constructed three-predictor models when (QE1-QE2)/QE0 >0.25, where QE0 is the value of QE when there were no predictors in the regression model, QE1 is the value of QE when there was one predictor in the regression model, and QE2 is the value of QE when there were two predictors in the model. We employed this rule to avoid over fitting data from small numbers of studies. Constructing multiple models also assisted in detecting multicolinearity.
For the purposes of constructing models, we set the alpha level required for significance of the regression coefficients at 0.10. This is anticonservative, but allows for examination of a broader range of models than would an alpha of 0.05.
In the text of the Evidence Report, we consider a model to be a plausible “explanation” of variability only if: (1) it was the only model in a set to produce a statistically nonsignificant (p >0.10) QE, (2) all coefficients in the model were statistically significant and, (3) adding another predictor variable to the model caused the value of QE to decrease by less than 25 percent with respect to the value of QE with no predictors in the model. For interpreting models, we used the traditional alpha level of 0.05 for the regression coefficients.
To ensure there were no systematic biases in the enrollment of patients in RCTs, we conducted, wherever possible, meta-analyses of the characteristics of patients enrolled in them. In these analyses, we compared the characteristics of patients in the control groups to the characteristics of patients who received the intervention. For example, in one such meta-analysis, we sought to determine whether females tended to be enrolled more in the control groups than in the experimental groups of studies of AEDs. These analyses employed fixed effects models, and we used Cohen's h or Hedges' d, as appropriate, to estimate the between-group differences.
Recognizing that meta-analysis has low statistical power to detect influences of patient characteristics on outcomes,19, 20 we extended our fixed effects analysis on surgical outcomes by performing appropriate meta-analyses of data from nested case-control studies. These studies reported the proportion of patients with a given characteristic who had successful or unsuccessful surgery, or they separately reported a continuous variable for patients who had successful or unsuccessful surgery. We performed meta-analyses on proportions using Cohen's h as the test statistic. For studies reporting continuous variables, we computed each study's appropriate point-biserial correlation coefficient and then meta-analytically evaluated these coefficients.
We performed analyses of data from nested case-control studies using only patient-level data. Although meta-analysis of such data using more sophisticated modeling techniques is preferable (e.g., hierarchical models), this was beyond the scope of the present project.
We used sensitivity analysis to test whether our meta-analytic summary statistics were robust. We employed four such analyses for each summary statistic. These were recalculations of the meta-analytic summary statistic with: (1) the largest study removed, (2) the smallest study removed, (3) the study with the largest effect removed, and (4) the study with the smallest effect removed from the meta-analysis.
In addition to computing the above-described meta-analytic statistics, we performed numerous other statistical computations. We note each of these computations in the text of this Evidence Report and/or in footnotes to the in-text tables and Evidence Tables. Briefly, the computations we performed included:
Statistical power analyses. Studies that do not contain a sufficient number of patients cannot detect statistically significant differences between groups, even when these differences are clinically meaningful. Therefore, when appropriate, we computed the smallest between-group difference that any given controlled study had the power to detect.
Determinations of whether there were statistically significant differences between the characteristics of patients in the groups of any given study. This is particularly important for studies that are not randomized, because the patients in the different groups of such studies may not be comparable. Further, although other studies may report that they were randomized, the randomization protocol may not have been adequately followed or the study may not have been truly randomized (i.e., randomization may have been nonstochastic). These departures from randomization can manifest themselves in pretreatment between-group differences in patient characteristics. We recognize that in a properly randomized trial, such differences can arise from chance. However, searching for such differences in the context of a systematic review is justifiable because there is no other way to audit whether the randomization was, indeed, accomplished.
Computation of pretreatment effect sizes. Departures from randomization can also manifest themselves as a statistically significant difference in the outcome between groups prior to the administration of treatment. For example, if the seizure frequencies experienced by patients in different groups were significantly different before treatment, the study may not have been truly randomized.
Verification of 2 × 2 tables reported in studies of diagnostic tests. Because peer-reviewed published articles often contain errors in reported results, we attempted to verify the calculations in each article. If an error was found, we corrected the data and included it in our analysis.
Computations of t-tests, chi-square tests, Fisher's exact text, odds ratios (OR), and their 95 percent confidence intervals (CI). Some studies included in this Evidence Report did not report the results of statistical tests that were important for answering the questions. We computed these statistics when such studies reported sufficient data.
Computations of crude (CMRs) and standardized mortality ratios (SMRs). These quantities, which are useful for comparing the mortality rates among persons with epilepsy and those who do not have epilepsy, were not reported in all studies but some studies reported sufficient data to allow us to compute them. CMRs and SMRs are calculated as the number of observed deaths divided by the number of expected deaths. However, SMRs are standardized according to the age distribution of the study population and the age-specific death rates in the country of interest. If a study reports only a mean age or age range for their patient group, then only a crude estimate can be made as to the number of expected deaths. Therefore, caution is required in interpreting crude mortality ratios. We discuss the reasons required for this caution in detail in Question 8.
Numerous other calculations of descriptive statistics for patient characteristics (e.g., mean age) and outcomes (e.g., seizure frequency) were performed when patient-level data were reported.
Our evaluation of the quality of interventional studies employed three tools. The first was an evidence hierarchy we used to determine which studies to retrieve, include and, in certain questions, to determine whether there was greater potential for bias in some included studies than in others. The second tool we used was a checklist for evaluating each study's internal validity. Finally, we considered the difficulties inherent in certain outcome measurements.
We did not restrict the studies included in this Evidence Report to RCTs. Rather, ours is a “best evidence” synthesis in which we accept the best available evidence, not the best possible evidence. Performing such an analysis on studies of surgery for epilepsy is particularly important. This is because withholding treatment to perform such an RCT may be unethical.
To determine the best available evidence, we used an evidence hierarchy. This hierarchy served, in part, to determine which studies we would include and retrieve. In some cases, we also used this hierarchy to evaluate a study's potential for bias. This hierarchy, shown in order of study designs with the least potential for bias to those with the greatest potential for bias was:
Randomized controlled trials
Controlled clinical trials
Studies that made measurements before and after treatment (pre/post studies)
Uncontrolled studies
Within each level of the hierarchy, we assumed blinded studies to have lower potential for bias compared to nonblinded studies, and prospective studies to have lower potential for bias compared to retrospective studies. Wherever possible, we empirically evaluated these assumptions, and the assumption that studies lower in the hierarchy had results that were different (i.e. were biased) from studies higher in the hierarchy.
The internal validity of an interventional study represents the degree of confidence one can have in whether the intervention caused a change in the outcome of interest. The confidence in this causal relationship can be weakened by a number of biases. Because of this, we employed a second tool, geared to evaluate the potential difficulties with each included study's internal validity. This tool was a checklist of such potential difficulties, and was a modification of the scheme of Cook and Campbell.21 Thus, we evaluated each study to determine whether any of the potential biases listed below was present.
We stress that these are potential biases. The existence of a potential bias does not necessarily mean that a study's results were affected by the bias. We view this question as one that can be empirically determined.
Selection bias is relevant only to studies with control groups. This bias occurs when there are differences between the patients in the different arms of the study at the start of the study. These differences may lead to posttreatment differences in outcome that are not due to treatment. Random assignment of patients to the study arms protects against this bias, but the fact that a study states that assignment was random does not guarantee that randomization protocols were adequately followed. This is a concern when the method of randomization is not reported. In such instances, the method of randomization may not be truly stochastic.
This bias can occur in studies that are not blinded. In such nonblinded studies, investigators are aware of who is receiving a particular treatment and who is not. This knowledge may influence the measurement of patient outcomes, especially when these outcomes rely on a degree of subjectivity. This bias can affect nonblinded studies of any design.
This bias can occur in open studies or in blinded studies in which blinding has been broken. As a result, patients are aware that they are, or are not, receiving a treatment. This knowledge may influence the way they report an outcome of interest. Given that many seizure frequency and quality of life outcomes rely heavily on patient reports (e.g., seizure diaries), this bias is particularly relevant to the studies considered in this Evidence Report. This bias affects nonblinded studies of any design.
Attrition refers to the loss of patients before outcome measurements can be recorded. Patients may no longer return to the clinic because they have moved away, have improved to the extent that they believe they no longer need to see a physician, or have died. Because those who completed the study may not be representative of the entire group of patients who entered the study, analyses based only on study “completers” may be biased. This bias can affect studies of any design. In this report, we did not set any limit for attrition beyond which we would not consider the study in our assessment. Exiting a trial before completion was considered an important outcome in our evaluation of studies of drug treatment. In our evaluation of vagal nerve stimulation, we specifically looked for any influence of attrition on outcomes.
Measurement bias occurs when the method used to measure a particular outcome systematically over- or underestimates the true effect of treatment on that outcome. For example, general health status instruments (e.g., SF-36) may be less sensitive than disease-specific instruments for detecting small changes in health status that are important to patients.22 This bias can affect studies of any design. In the epilepsy literature, the use of seizure diaries to measure seizure frequency may be a source of potential measurement bias. We discuss why in the section entitled “Validity of Seizure-related Outcomes” (below).
This bias, also known as regression to the mean, can occur when there are patients who, upon entry into a study, have relatively good (or relatively poor) performance on an outcome. For example, patients may enter a study when their condition is at its worst. When the disease is not progressive, these patients are unlikely to be so ill upon subsequent measurement, even in the absence of treatment. Patients with extremely high pretreatment seizure frequencies may experience reductions in seizure frequency, even without treatment.23 This bias affects studies of all designs except well-designed RCTs.
This bias occurs when events other than the intervention of interest cause improvements in health outcomes. For example, in an uncontrolled longitudinal study, treatment may incorrectly appear to cause an improvement in health outcomes if patients are given new, effective methods of patient management. This bias can affect studies of all designs including RCTs. RCTs will be affected if the new methods are not uniformly applied to all patients.
Sampling bias occurs when a study either does not include all enrolled patients who received the treatment of interest, or does not include a random sample of the enrolled patients who received the treatment of interest.
Maturation bias occurs if individuals improved because of developmental maturation, and not because of treatment. In the present Evidence Report, we only evaluate studies for potential maturation bias if they used followup periods longer than 1 year.
In the context of the present report, sample specification bias occurs if a study enrolled some patients who were not treatment-resistant. For example, studies that did not explicitly state that all enrolled patients were experiencing seizures despite prior treatment with at least one AED given at maximum tolerable dose may not have exclusively enrolled patients who met the definition of treatment resistance that was suggested by the Expert Panel and Technical Experts (see the section entitled “Defining Treatment Resistant Epilepsy”). Sample specification bias is a lesser issue for studies of surgical interventions for epilepsy than for the other interventions discussed in this Evidence Report. This is because of the relatively extensive presurgical evaluations that surgical candidates receive.
Studies with low statistical power do not have the ability to find statistically significant differences between groups or between one test and a subsequent test. As such, the failure of a low power study to find a statistically significant difference does not always imply that an intervention is ineffective. This is because the study may not have had the power to detect clinically important differences.
Determining whether a study has sufficient power to detect clinically important differences is subjective. This is because determination of what a clinically important difference is ultimately requires the opinions of patients. Often, these opinions are not well studied. Consequently, we have refrained from making such judgments and, instead, provided the reader with sufficient information to make their own. We accomplish this by computing the smallest between-group percentage difference that a statistically nonsignificant study could have detected. The reader then needs to compare this percentage to the percentage change deemed clinically important. Assume, for example, that we computed that a study only had the power to detect a 30 percent decline in seizures as statistically significant. The reader may decide that, in fact, a 10 percent decline in seizure rates is clinically significant, and then note that the study did not have the statistical power to detect this difference. This would mean that the results of the study were not informative.
Because our consideration of a study's power involves de novo calculations, we consider power in the “synthesis of study results” section of each question, and not in the section devoted to evaluating a study's internal validity.
There are several commonly reported ways to measure seizures, including mean and median frequencies, the proportion of patients who experience a reduction in seizures greater than a certain percent (e.g., the proportion of patients who experience a greater than 50 percent reduction in frequency), and the percentage of patients who exit a trial due to seizure increases.
All of these outcomes depend upon patient reports, often in the form of seizure diaries. One difficulty with these diaries is that they rely on the objectivity, and memory of the individual responsible for keeping the diary. This affects the accuracy of records of all seizures (see “Measurement bias” above), and accurate recording of auras may be particularly problematic.
Another problem with the way in which seizure frequencies are reported is that not all outcome measurements capture what happens to all patients. For example, a common way to report seizure frequency is to report the percentage of patients who had a 50 percent (or some other percentage) or greater reduction in seizure frequency after treatment. This type of outcome only captures information about patients whose seizure rates decreased. It does not capture information about patients who experienced increases in seizure frequency. In fact, expressing results in this way can be quite misleading. For example, assume a study that reported that seizure frequency reduced by 50 percent or more in 40 percent of their patients. Seizure rates may have actually increased in the remaining 60 percent of patients and, if this were true, the treatment might actually be harmful.
In contrast, measures of absolute seizure frequency do capture information about all patients because the results of patients who became worse are combined with the results of those who improved. However, absolute seizure frequencies are not normally distributed, so a simple average is not an appropriate summary of the data. As a result, many studies report median seizure frequencies. While medians are an appropriate measure of the central tendency of such data, they pose technical difficulties. In particular, methods for combining medians in a meta-analysis are not well developed. One way around this difficulty is to transform seizure frequencies by using a natural log transform. This would render the data normally distributed and allow for computation of meaningful averages and measures of dispersion. Published studies rarely report such results.
Another way of reporting results is to provide the number or proportion of patients who exited a trial due to changes in seizure type and/or frequency. Although trial exit per se can be accurately recorded, it is also based on seizure frequencies and, therefore, typically depends on the accuracy of seizure diaries. Another difficulty with this outcome is that it is relatively insensitive to small or moderate increases in seizure frequency. This is particularly true because a common criterion that investigators set for exiting a monotherapy drug trial is a doubling of seizure frequency.
The biases that can affect studies of diagnostics are different from those that can affect studies of interventions. The checklist we employ for determining whether a diagnostic study was potentially affected by a bias incorporated items suggested by Lijmer, Mol, Heisterkamp, et al.,24 Irwig, Tosteson, Gastsonis, et al.,25 Gann,26 Begg and Greenes,27 and Ransohoff and Feinstein.28 These biases are:
This bias occurs when there are differences between populations in the spectrum of disease presentation and severity. In the present report, it manifests itself in diagnostic case-control studies in which “cases” (in this instance, patients with epileptic seizures alone) and “controls” (patients with nonepileptic seizures) were selected for inclusion because they were known to have epileptic or nonepileptic seizures prior to the study. Such studies therefore enrolled cases that are relatively easy to diagnosis, and did not enroll cases that are more difficult to diagnose. The effects of spectrum bias have recently been demonstrated empirically by Lijmer, Mol, Heisterkamp et al.24 who found that the diagnostic odds ratio was approximately three times greater in diagnostic case-control studies compared to studies of the same diagnostic carried out using unbiased populations.
This bias occurs when a reference standard against which the diagnostic performance of the diagnostic of interest was measured is not perfect (not a true “gold standard”).
In the context of this Evidence Report, this bias occurs when patients allocated to the epileptic and nonepileptic seizure groups were not diagnosed using the same reference standard. For example, patients with epileptic seizures may have been diagnosed in a neurology department using a diagnostic such as video-EEG, but patients with syncopal seizures may have been diagnosed in a cardiac department using a diagnostic such as a tilt-table.
Prevalence bias occurs when the numbers of cases and controls in a case-control study are artificially chosen to be equal. This artificial prevalence introduces a bias that influences the positive predictive value (PPV) and negative predictive value (NPV) in a manner described by Bayes' theorem.29
This bias occurs when the results of the test of interest are subjective and can be influenced by factors that are unrelated to the disease of interest.
This bias may occur in diagnostic studies when patients are aware of their diagnostic group allocation. This bias is a particular problem when the diagnostic of interest involves patient input. For example, the Minnesota Multiphasic Personality Inventory (MMPI) has been proposed as a means of differentiating patients with epileptic seizures from patients with psychogenic seizures. This instrument requires patient input. Patients' awareness of their diagnostic group allocation may influence their input.
This bias may occur in diagnostic studies when investigators interpreting the results of the diagnostic of interest are not blinded to the diagnostic group allocation of the patients in the study. This is a particular problem when the investigator is required to “interpret” the findings of a diagnostic test. For example, the interpretation of a CT scan requires that an investigator interpret the image. If the investigator is aware of the diagnostic categorization of the patient, his interpretation of the CT image may be influenced.
This bias is only relevant to studies that used followup to confirm the accuracy of the diagnostic of interest and occurs when only one group of patients is followed. This group typically consists of only those with a positive diagnosis. For example, only those diagnosed by the test of interest might be followed up.
This bias may occur when only a subset of patients enrolled in a study is reassessed. For example, some patients do not experience a seizure during re-evaluation, so diagnostic data cannot be collected from them. If these patients are somehow different from patients in whom a diagnostic reassessment was possible (for example, the subgroup contained a higher proportion of patients with nonepileptic seizures), then this may lead to a biased estimate of prevalence.
Two questions in this Evidence Report, Questions #8 and #9, concern epilepsy-related mortality rates. Studies examining mortality rates may be biased by factors that are different from those that bias the results of other kinds of studies. Therefore, we examined studies of mortality for the following potential biases:
See the above definition of this bias.
See the above definition of this bias.
This bias may occur in studies that did not determine the cause of death by autopsy. For example, investigators may assume that epilepsy is the cause of death in patients with epilepsy who died suddenly. This bias will artificially inflate death rates due to epilepsy.
This bias occurs in studies that did not present standardized mortality ratios or in studies that did not present sufficient information to allow us to calculate these ratios. Other methods of computing mortality do not allow mortality rates to be standardized by age, which could bias mortality differences in either direction.
This bias affects only Question #9 regarding sudden unexplained death. It occurs when studies of mortality use an inappropriate control group. For example, in a case-control study of sudden unexpected death where all of the cases were children with epilepsy, the control group should not consist of adults with epilepsy. This would increase the likelihood of finding a spurious relationship between sudden death and a variable that may be unrelated to sudden death (e.g. childhood epilepsies are likely to differ from epilepsies that afflict adults in ways that may be unrelated to the risk of sudden death). An appropriate control group would be living children with epilepsy.
This bias also affects only Question #9. It may occur if studies evaluating a relationship between two variables did not use a statistical method that adjusts for the possible effects of other variables. For example, regression techniques are often useful for determining the influence of a variable on an outcome. When such techniques are not used, the magnitude of the relationship between a variable and the outcome may be misestimated.
We evaluated each study's external validity (generalizability) according to patient characteristics appropriate to each question. These characteristics are provided as we address each question. We did not evaluate external validity when evidence-based conclusions could not be reached.
The results of our analyses of internal and external validity and of our meta-analyses are presented in two types of tables. Evidence Tables contain detailed information on each of the studies used in an assessment and the results of meta-analyses of these studies. The Evidence Tables tend to be large and are therefore contained in a separate volume of this report. They are organized according to the Key Questions addressed in this report. Other tables appear in the Results chapter following the discussion of each intervention being assessed. These tables are intended to provide a brief listing and description of the studies and the outcomes reported in these studies. Tables addressing the internal validity of studies used in an assessment are presented in Appendix B.
Figures are also presented in the Results chapter after the discussion of each intervention assessed in this report. Figures are designed to present summary information in the form of a forest plot (array of study effect sizes usually with a summary estimate), graphs of threshold analyses and meta-regressions, or other appropriate graphical presentations.
Throughout the preparation of this report, the five analysts and the Project Manager held numerous meetings to determine the strategy and methods of analysis. The Project Manager then individually reviewed each completed section of the report, and suggested changes. Upon completion of these changes, the individual sections were assembled into an initial draft report that was again reviewed by the Project Manager. Subsequent to changes made in response to this draft, it was distributed to the five analysts in the project team for review. Suggested changes were reviewed by the Project Manager and discussions were held among the project team to determine which suggestions would be incorporated. Upon incorporation of the appropriate changes, the draft report was sent for external review.
To select peer-reviewers for the draft Evidence Report, ECRI prepared a list of 27 potential reviewers. This list was submitted to AHRQ, which approved all reviewers. Letters inviting these individuals to review the draft report were then mailed. Twenty-five individuals responded to these letters, 19 agreed to review the draft Evidence Report, and nine individuals returned reviews.
Upon receipt of reviews, ECRI revised the draft report accordingly. ECRI also prepared a document describing the disposition of all substantive reviewer comments and supplied this document to AHRQ for review and approval.
In this section of the Evidence Report, we addressed Key Question #1: What are the definitions of treatment-resistant epilepsy used in the literature?
Medically intractable
Medication-resistant
Medically refractory
Medically resistant
Medically uncontrolled
Drug-resistant
Refractory
Intractable
Pharmaco-resistant
Chronic treatment-resistant
Inadequately controlled
Uncontrolled
Poorly controlled
Therapy-resistant
Because the majority of studies and reviews did not report a definition, we also examined the patient inclusion criteria that were used in published studies. Although these criteria do not comprise a formal explicit definition of treatment-resistant epilepsy, they can be used to determine whether there is a consistently applied implicit definition of this term. Such implicit definitions, however, are less informative than explicit definitions. This is because inclusion criteria are constructed to meet the specific demands of the study rather than to address the general concept of what constitutes treatment-resistant epilepsy.
As noted in the Methodology section, the general inclusion criteria listed in that section do not apply to this question. Rather, we included:
Any clinical study that was evaluated in Questions 2 to 6, that enrolled at least 50 patients, and that was published in 1996 or later. All such studies meeting the initial inclusion criteria for each question were included, regardless of whether they were later excluded from the analysis of that question. We abstracted definitions from clinical studies in an effort to obtain a broad sample of definitions. We did not include articles retrieved for Key Questions 7 – 9, because the nature of these studies made them less likely to include definitions of treatment-resistant epilepsy.
A random sample of 100 review articles on treatment-resistant epilepsy published between 1996 and 2001, inclusive. We chose this random sample by using a random number generator to assign a random number to each of the 298 review articles identified in our searches for this Evidence Report. We chose to use a random sample rather than a comprehensive dataset out of consideration of the time and budget for this project.
Any evidence-based clinical practice guidelines identified during our searches. We termed a guideline as “evidence-based” if it was included in the National Guidelines Clearinghouse (NGC).c
| Source | Number of Publications Selected for Question | Number of Publications Reporting Definitions | Percentage of Publications Reporting Definitions |
|---|---|---|---|
| Question 2 | 6 | 3 | 50% |
| Question 3 | 10 | 4 | 40% |
| Question 4 | 26 | 7 | 27% |
| Question 5 | 39 | 9 | 23% |
| Question 6 | 1 | 1 | 100% |
| Research Articles Total | 82 | 24 | 29% |
| Treatment Guidelines | 3 | 1 | 33% |
| Review Articles | 100 | 21 | 21% |
| Grand Total | 185 | 46 | 25% |
This question addresses definitions, not an intervention or diagnostic. As such, an evaluation of the quality of the literature is not relevant.
Of the 82 clinical studies that met our inclusion criteria for this question, only 24 (29 percent) reported an explicit definition of “intractable”, “refractory”, “treatment-resistant,” or any similar term. The remainder merely stated that the patients they enrolled had treatment-resistant epilepsy (or some equivalent term) without defining that term. Of the 24 articles reporting a definition, five definitions did not include any specific information (e.g. “incompletely controlled by existing therapy”).30 One study defined treatment resistance in terms of seizure frequency with no mention of treatment.31
Of the remaining 19 studies, 15 reported the number of AEDs patients tried before being considered treatment-resistant. Two studies required at least one AED, four required at least two, and four required three. Five were nonspecific (e.g. “multiple”). Six of the studies named the AEDs that they required patients to have tried before being considered treatment-resistant.
Of the 100 review articles surveyed, 79 articles defined treatment-resistant epilepsy as the presence of uncontrolled seizures or a similar term. Two of the remaining definitions were not specific.33, 34 Of the remaining 19 reviews, eight reported the number of AEDs patients tried before being declared treatment-resistant. Three of the eight reviews required at least two AEDs and three required three. Two of the eight reviews were not specific (e.g. “several”). Only one of the reviews named the AEDs that they required patients to have tried before being considered treatment-resistant.
Twelve reviews mentioned intolerable side effects or ineffectiveness at maximum tolerated dose as a reason to consider drug treatment unsuccessful. Four reviews mentioned frequent seizures as part of their definition, but none of these quantified what was meant by “frequent.” Rather, their effect on the ability of the patient to lead a normal life was considered the proper criterion in three of these four reviews.
Because definitions were infrequently reported, we examined the inclusion/exclusion criteria and the characteristics of patients in clinical studies to determine the characteristics of patients deemed to have treatment-resistant epilepsy. These characteristics may imply a definition. However, the requirements of a trial are not necessarily the same as the requirements of a patient seeking treatment. A patient experiencing one seizure a year may be considered treatment-resistant but is unlikely to be included in a clinical trial. Thus, patient inclusion criteria may be biased toward enrolling more severely ill patients.
Of the 82 clinical studies included, eight specifically examined pediatric patients, two focused on Lennox-Gastaut syndrome, while two examined mesial temporal sclerosis (MTS), and one examined non-MTS focal lesions.
*“>1” indicates that the study used a nonspecific term such as “several”
*“>1” indicates that the study used a nonspecific term such as “several”
In those studies that reported a minimum number of AEDs, the majority required at least one AED. This proportion did not differ dramatically from the proportion in studies of pediatric patients or studies in which there was no special patient group.
All FDA drug studies and most non-FDA drug studies reported that a minimum number of AEDs must have been tried without success before a patient was considered treatment-resistant. In both cases, the minimum number was nearly always one. Most studies of surgery (80 percent) did not report a minimum number of AEDs that had been tried. However, when a number was reported, it was always greater than one. This difference between drug and surgical trials probably reflects differences in trial qualifications rather than differences in definitions of treatment resistance.
Nearly half (49 percent) of the studies reported a minimum seizure frequency before patients were considered treatment-resistant. This number ranged from less than 1 per month to 60 per month. Some studies (2.4 percent) were not specific about the precise number required, reporting only that seizures were “frequent” or some equivalent term. Pediatric studies differed from studies in which no special group was examined (Figure 4
When studies of different treatments are compared, studies of surgery seldom (8 percent) reported a minimum seizure frequency (Figure 5
In this section of the Evidence Report, we addressed Key Question #2: Which methods of rediagnosing or reevaluating treatment-resistant epilepsy lead to, or can be expected to lead to improved patient outcomes?
There are three primary roles for diagnostics in the management of patients with epilepsy. The first is to determine whether the patient is experiencing epileptic or nonepileptic seizures. Once a firm diagnosis of epilepsy has been established, the second role is to aid in the classification of epileptic seizures into seizure type. The third role of a diagnostic in the management of patients with epilepsy is to aid in the lateralization and localization of epileptic foci prior to epilepsy surgery. In this section of the report, we address the first two of these roles and how they apply to the subpopulation of patients with treatment-resistant epilepsy. We did not address the third role, which was in keeping with the desires of the Technical Experts and the Expert Panel.
We partitioned Question 2 into four subquestions (A - D). The first two subquestions address the differential diagnosis of epileptic seizures from nonepileptic seizures. The remaining two subquestions address the differential diagnosis of different seizures types. Whether we addressed some questions depended on the findings from previous questions.
We examined the peer-reviewed literature to determine whether there was evidence to suggest that some patients with a diagnosis of treatment-resistant epilepsy were misdiagnosed and their seizures were either not epileptic or they consisted of a combination of epileptic and nonepileptic seizures (Question 2A). If such evidence was found, we then examined the literature to determine which diagnostic technologies were likely to aid in the differential diagnosis of epileptic seizures from nonepileptic seizures (Question 2B).
Similarly, we examined the peer-reviewed literature to determine whether there was evidence to suggest that some patients with treatment-resistant epilepsy were diagnosed with an incorrect seizure type (Question 2C). If such evidence was found, we then examined the literature to determine which diagnostic technologies were likely to aid in the differential diagnosis of one seizure type from another (Question 2D).
To address Question 2A, we looked for studies that attempted to estimate the prevalence of patients with nonepileptic seizures among populations of patients with a diagnosis of treatment-resistant epilepsy. These nonepileptic seizures may have been the sole seizure type experienced by a patient (in which case the patient was misdiagnosed), or they may have occurred in addition to true epileptic seizures (in which case the patient was correctly diagnosed with epilepsy but the additional diagnosis describing the nonepileptic seizures was missed). In the former case, patients would not be expected to respond satisfactorily to treatment with AEDs. In the latter case, the epileptic seizures may be well controlled by AEDs, and the seizures experienced by the patient are nonepileptic in nature. In either case, such patients would, unless given a new diagnosis, remain incorrectly labeled as exclusively having treatment-resistant epilepsy.
Articles were included for Question 2A if they met the general criteria for inclusion presented in the Methodology section and the article reported that patients originally diagnosed as having epilepsy at the time of enrollment into the study were considered treatment-resistant. Studies that enrolled patients with known nonepileptic seizures (either alone or in combination with epileptic seizures), in addition to patients considered to have treatment-resistant epilepsy alone, cannot be used to answer Question 2A. Consequently, such studies were not considered for inclusion in this section of the report unless data from these patients were presented separately. We did not exclude studies that enrolled patients with a diagnosis of treatment-resistant epilepsy but who were suspected of having nonepileptic seizures. This is because all patients who were enrolled in such studies did have a diagnosis of treatment-resistant epilepsy on entry into the study and such studies do contain information on the accuracy of the original diagnosis of epilepsy.
We did not exclude any of the articles that met both the general criteria for inclusion in this report and the question-specific inclusion criterion for reasons related to poor quality.
| Reference | Study Design | Country in Which Study Performed | Size (N) | Multicenter | Number of Centers |
|---|---|---|---|---|---|
| Zaidi (2000)38 | Cross-sectional case series | United Kingdom | 74 | Yes | 2a |
| Holmes (1998)39 | Cross-sectional case series | United States | 379 | No | 1 |
| Henry (1997)37 | Cross-sectional case series | United States | 145 | No | 1 |
| Arnold (1996)40 | Cross-sectional case series | United States | 45 | No | 1 |
| Slater (1995)41 | Cross-sectional case series | United States | 101 | No | 1 |
Patients enrolled at two centers but diagnostic reassessment was performed at a single study center
All five articles in the evidence base were cross-sectional, case series. In these studies, a series of patients (total N = 744) were given diagnostic reassessment in order to determine the prevalence of patients with nonepileptic seizures among specific subgroups of patients, all of who were considered, prior to reassessment, to have treatment-resistant epilepsy.
The following section presents the findings of our systematic assessment of the quality of the evidence base on the prevalence of patients with nonepileptic seizures (alone or in combination with epileptic seizures) among patients with a diagnosis of treatment-resistant epilepsy. This systematic assessment consists of an appraisal of each study's internal and external validity.
The internal validity of a study designed to measure the prevalence of some disease in a population of interest can be weakened by a number of potential biases. Sampling bias is not a concern in these studies because patients were consecutively enrolled during a fixed period. Reference standard bias is a concern in all of the studies because at present no stand alone “gold-standard” for diagnosing epilepsy is available for routine use in clinical practice. Thus, in practice, the differential diagnosis of epileptic seizures is based on a clinical judgment made by one or more specialists. This judgment is based on information from many sources. These sources include medical history, routine-EEG, ambulatory EEG, video EEG, imaging data, cardiac monitoring data, etc.
The potential biases in each study included in the evidence base for this question are discussed in greater detail in Appendix B.
The ability to draw conclusions about the generalizability of the studies addressing this question is limited because details on patient characteristics were incompletely reported. All five of the studies included in the present evidence base were carried out at specialist referral centers (three were specialist electrophysiology centers, one was a specialist neurosurgery center and one was a specialist epilepsy center). Such patients are unlikely to be representative of the general population of patients with treatment-resistant epilepsy. In addition, none of the five studies included children less than sixteen years of age. Therefore, the prevalence data extracted from the studies included in the present evidence base may not be generalizable to pediatric populations.
In four of the studies, some of the patients were referred to the specialist center for a diagnostic reassessment because their original seizure diagnosis was deemed questionable. Estimates of the prevalence of patients with an incorrect diagnosis based on data collected from these studies are likely to lead to an overestimate of the true extent of the misdiagnosis problem as it occurs in the more general population of patients with treatment-resistant epilepsy. In the remaining study, the study sample consisted of patients who were all considered candidates for epilepsy surgery. Not all patients with treatment-resistant epilepsy are surgical candidates and, thus, the findings of this study can only be generalized to a very select population of patients.
Prevalence of nonepileptic seizures among patients diagnosed with treatment-resistant epilepsy
NES: Non-epileptic seizure
ES: Epileptic seizure
The patients examined in the study of Henry and Drury37 were undergoing presurgical evaluation. Such patients were probably assessed more often and/or more completely by clinicians who specialize in epilepsy compared to most other patients with a diagnosis of treatment-resistant epilepsy. Consequently, that no patients in their sample suffered from nonepileptic seizures alone is not a surprise. However, that 8.3 percent of the patients in this study experienced a combination of epileptic and nonepileptic seizures is surprising. Followup of these patients revealed that the changes in patient management that resulted from the reassessment led to a complete cessation of seizures in three patients (25 percent). Whether these three patients would have been identified prior to surgery had the study not been performed is unknown. However, they may possibly have undergone unnecessary surgery.
This meta-analysis shows that the proportion of patients who were misdiagnosed as having treatment-resistant epilepsy was substantial (35 percent; CI: 29 percent to 41 percent). Therefore, a problem of misdiagnosis clearly exists in clinical practice. However, these findings do not accurately represent the proportion of misdiagnosed patients in the overall population of patients with treatment-resistant epilepsy because none of the studies included in the meta-analysis were population-based. Also, the patients included in the four studies that we did meta-analyze represent a subpopulation of patients referred for specialist evaluation of their seizures.
Based on the results of Question 2A, we addressed Question 2B by evaluating the evidence on the diagnostic technologies most commonly used to differentiate epileptic seizures from nonepileptic seizures. As stated above, in clinical practice, the differential diagnosis of epileptic seizures is usually based on information from many sources (medical history, etc.). The clinical diagnosis is seldom based on one diagnostic technology alone. Ultimately then, to answer this question, diagnostic performance data from each analysis of each individual diagnostic technology must be combined into a single decision model which better describes the true clinical picture. As will be seen, a paucity of available evidence precluded the construction of such a model. Thus, we were limited to an analysis of the clinical utility of individual “stand alone” diagnostic technologies.
In addition to employing the general inclusion criteria, we included articles if they met the following criteria:
The study must have evaluated the effectiveness of a diagnostic technology used for the differential diagnosis of epileptic seizures from nonepileptic seizures.
The patients enrolled in the study were not restricted to only those with treatment-resistant epilepsy. Because the intent of Question 2B is to determine the utility of those diagnostic technologies that have been used to differentiate epileptic seizures from nonepileptic seizures, addressing this question requires a study that enrolls both kinds of patients.
The study must have either reported diagnostic test performance characteristics (e.g., sensitivity and specificity) or presented data in a format that allows calculation of test performance characteristics based on a comparison with some “reference” standard.d Alternatively, the study must have included followup data that allow conclusions about the effects of using a diagnostic on patient outcomes.
| Diagnostic | Number of Articles |
|---|---|
| Blood Prolactin Levels | 8 |
| Minnesota Multiphasic Personality Inventory | 6 |
| Video-EEG | 6 |
| Ambulatory-EEG | 5 |
| Provocation techniques | 4 |
| Routine EEG | 4 |
| Creatinine kinase levels | 3 |
| Tilt table | 2 |
| Auditory evoked potentials | 1 |
| Hypnotic recall | 1 |
| Magnetic Resonance Imaging | 1 |
| Single Photon Emission Computed Tomography | 1 |
| Tongue biting | 1 |
| Computed Tomography | 0 |
The most common type of excluded article reported on a case-series study in which a group of patients was diagnosed with a given modality, and this diagnosis was then used to influence medical management. However, none of these studies reported whether these management changes led to improvements in patient outcomes. Although this study design is seen by some as being a legitimate design for the assessment of a diagnostic,42 this assumes that the diagnostic test was accurate. Requiring the assumption that a diagnostic be accurate in order to assess the accuracy of that same diagnostic is circular reasoning. It also assumes perfect sensitivity and specificity of the test, which is not possible. This sort of study design is particularly common in the literature on EEG technologies (i.e. routine EEG, ambulatory-EEG, and video-EEG).
Of the fourteen diagnostics considered, only four (blood prolactin levels, the MMPI, video-EEG, and ambulatory-EEG) were addressed by five or more studies. As per the general inclusion criteria specified in the Methodology section, data about diagnostics not addressed by at least five studies are not considered further in this report. Consequently, we do not include further information about provocation techniques, routine EEG, creatinine kinase levels, tilt tables, auditory evoked potentials, hypnotic recall, MRI, SPECT, tongue biting, or CT in this report.
Interest in the use of blood prolactin levels as a diagnostic tool for the differentiation of epileptic seizures from nonepileptic seizures began in 1976 when Ohman, Walinder, Balldin et al.43 reported that blood prolactin levels rose following epileptic seizures induced by electroconvulsive therapy. Blood levels of prolactin were found to peak approximately 30 to 40 minutes after the seizure occurred and then decline to normal preseizure levels. These findings were confirmed by Trimble44 who demonstrated that blood prolactin levels increased following spontaneous epileptic seizures and noted that blood prolactin levels did not rise following a nonepileptic, psychogenic seizure (also known as a pseudoseizure, hysterical seizure, or psychological seizure). Since then, a number of reports have been published that have assessed the relationship between blood or plasma prolactin levels in patients with epilepsy and a number of different nonepileptic seizure types.44–57 In the following section of the report, we assess the evidence related to the value of measuring blood prolactin levels in differentiating epileptic seizures from nonepileptic seizures.
The following section presents the findings of our systematic assessment of the quality of the evidence on the diagnostic utility of blood prolactin level measurements in the differentiation of epileptic seizures from nonepileptic seizures.
All five studies included in the present evidence base utilized a diagnostic case-control study design. The case-control study design is commonly used in the early stages of the evaluation of a diagnostic and is particularly susceptible to a number of biases that lead to overestimation of a test's true diagnostic performance.24–28 No studies presented patient outcome data. Thus, no direct determination is possible about whether the use of blood prolactin level measurements will lead to improvements in patient outcome. However, a reasonable assumption is that a good diagnostic test will allow patients' nonepileptic seizures to be identified and treated more appropriately, thus leading to improved patient outcomes.
Imperfect reference standard bias (all five studies), prevalence bias, and spectrum bias (four studies each) were the most common potential biases in the studies of blood prolactin measurements. Patient bias, diagnostic yield bias, and verification bias were not present in these studies. These potential biases with respect to this question are discussed in detail in Appendix B.
Details of both study inclusion/exclusion criteria and the patient characteristics included in the relevant studies were incompletely reported. Four of the five articles described the inclusion/exclusion criteria. Four articles reported on age, no article reported on sex distribution, one article reported on the duration of disease, no article reported on seizure frequency, only one article reported the number of patients who had cognitive or developmental deficits, and only one article reported the number of AEDs used by patients in the study.
The assessment of study quality presented above indicates that, given the present evidence, definitive conclusions cannot be drawn about whether blood prolactin level measurements have a useful role in differentiating epileptic seizures from nonepileptic seizures. Acknowledging this, we have instead evaluated the available data with the aim of determining the plausibility of blood prolactin measurements having a role in differentiating epileptic seizures from nonepileptic seizures.
| Epileptic Seizure Type | Nonepileptic Seizure Type | |||||||
|---|---|---|---|---|---|---|---|---|
| Reference | Mixed ES | GTCS | CPS | SPS | Mixed NES | PsyS | SynS | |
| Lusic (1999)53 | ![]() | From | ![]() | |||||
| Anzola (1993)45 | ![]() | From | ![]() | |||||
| Zelnik (1991)56 | ![]() | From | ![]() | |||||
| Mishra (1990)57 | ![]() | From | ![]() | |||||
| Wroe (1989)50 | ![]() | ![]() | ![]() | ![]() | From | ![]() | ||
| Lusic (1999)53 | ![]() | From | ![]() | |||||
CPS Complex partial seizure
ES Epileptic seizure
FS Febrile seizure
GTCS Generalized tonic-clonic seizure
NES Nonepileptic seizure
PsyS Psychogenic seizure
SPS Simple partial seizure
SynS Syncopal seizure
These studies primarily assessed the ability of blood prolactin level measurements to differentiate several epileptic seizure types (mixed seizures, generalized tonic-clonic seizures, complex partial seizures, and simple partial seizures) from two paroxysmal seizure disorders that are often misdiagnosed as epileptic. These two nonepileptic seizure types were syncopal seizures and psychogenic seizures. Thus, the findings of this assessment are not applicable to the differentiation of epileptic seizures from any other nonepileptic seizure type.
Typically, diagnostic performance data is captured by Receiver Operator Characteristic (ROC) curves that describe the trade off between sensitivity and specificity. If enough studies report appropriate data, the data can be meta-analytically combined into a single summary ROC (SROC) curve. Because the present data set consisted of only three studies, we have not attempted such a meta-analysis. As mentioned earlier, Zelnik, Kahana, Rafael, et al.56 did not present their diagnostic performance data in a typical 2 by 2 format. Instead, they summarized their data in the form of mean (and standard deviation) blood prolactin levels.
Because the present data could not be meta-analyzed, and because ROC curves synthesized from the available continuous data sets may be misleading, we have instead summarized these data as the effect size, Hedges' d. Although this effect size cannot be used to describe the diagnostic performance of blood prolactin measurements, it does allow diagnostic data to be compared and contrasted among studies that reported this data in different formats.
The data from the three studies reporting the diagnostic utility of blood prolactin level measurement in differentiating epileptic seizures from syncopal seizures is summarized in Figure 7
Exploration suggests that Lusic, Pintaric, Hozo, et al.53 did not find that the test was significantly better than chance because the performance characteristics data were collected at a threshold that was not optimal for the test. This is illustrated in Table 5, which shows three point estimates plotted in ROC space. These point estimates, along with their confidence intervals, came from the dichotomous data presented by Lusic, Pintaric, Hozo, et al.53 and Anzola45 (Evidence Table 20). All three point estimates can conceivably originate from a single underlying ROC curve (Figure 8
We summarized the available data from these two studies in terms of Hedges' d. This summary, shown in Figure 9
Data abstracted from Mishra (1990), GTCS: Generalized tonic-clonic seizures, CPS: Complex partial seizures, SPS: Simple partial seizures
The Minnesota Multiphasic Personality Inventory-2 (MMPI-2) and its predecessor (MMPI) instruments are among the most widely used and widely researched tests of adult psychopathology. These instruments provide a broad psychological profile across a number of domains. Some investigators believe that the psychological profile of a patient with psychogenic seizures (either alone or in combination with epileptic seizures) may be different from that of patients with epileptic seizures alone.
After the exclusion of the two articles listed in the table, four studies remained. As per the general inclusion criteria specified in the Methodology section, data about treatments not addressed by at least five included studies (or at least one large RCT with 50 or more patients in each study arm) are not considered further. Consequently, we do not further assess the MMPI-2 or MMPI in this report.
Video-EEG monitoring is used in clinical practice to verify the seizure type, to localize the area of seizure onset if surgery is being considered, and to verify the diagnosis of epilepsy if the diagnosis is in doubt. Video-EEG monitoring consists of the simultaneous recording of EEG brain wave activity combined with time synchronized video recording of the patient. This diagnostic procedure is performed on an in-patient basis, and requires highly specialized equipment and dedicated space. Patients are monitored for extended periods in order to capture typical seizure events on video and simultaneously capture EEG activity during that event. In some centers, patients' medications are withdrawn in order to increase the chance of recording a seizure.
After the exclusion of the two articles listed in the table, four studies remained. As per the general inclusion criteria specified in the Methodology section, data about diagnostics not addressed by at least five included studies (or at least one large RCT with 50 or more patients in each study arm) are not considered. Consequently, we do not further consider video-EEG in this report.
Ambulatory-EEG monitoring is the recording of EEG brain wave activity remotely from the hospital environment. Ambulatory-EEG, like video-EEG, has the advantage over routine EEG of allowing EEG traces to be recorded continuously over long periods. This increases the chance of recording an ictal event. Unlike video-EEG, no video record of seizure events is available, and patients or their caregivers must accurately record the occurrence of a typical seizure event in order to temporally compare the occurrence of a seizure with the EEG trace.
Early clinical investigations documented the ability of ambulatory-EEG to record identifiable focal and generalized epileptiform activity. In 1983, a cassette tape ambulatory-EEG system was introduced. This system had continuous 8-channel recording capability, real-time identification, gain adjustment, and filter adjustments. Since then, improvements in computer technology have led to the development of instruments that can perform portable continuous recording of more than 16 channels with sampling rates of over 200 Hz.
After the exclusion of the two articles listed in the table, three studies remained. As per the general inclusion criteria specified in the Methodology section, data about treatments not addressed by at least five included studies (or at least one RCT with more than 50 patients in each study arm) are not considered. Consequently, we do not further consider ambulatory-EEG in this report.
That the evidence base for the three EEG technologies (routine ictal and interictal EEG, ambulatory-EEG and video-EEG) did not reach the required minimum of five acceptable studies is surprising. Even with the expansion of the inclusion criteria so that we included articles published between 1980 and 1985 and articles describing retrospective studies, we did not reach five studies. This may be because all three of these EEG technologies are commonly used as aids in the diagnosis of epilepsy, and video-EEG is considered by many to be the “gold standard” for the differentiation of epileptic seizures from nonepileptic seizures.58 Some reviews of the use of video-EEG in the differential diagnosis of epileptic seizures from nonepileptic seizures include studies in which provocation was used in an attempt to induce a seizure that was then captured by video-EEG, citing such studies as evidence of the effectiveness of video-EEG in combination with provocation. Without exception, however, these studies used video-EEG as a “reference standard” against which the effectiveness of provocation was measured. Thus, such studies cannot be considered as part of the evidence base for the diagnostic utility of video-EEG and instead form the evidence base for seizure provocation techniques.59
For example, a number of studies of patients with implanted electrodes have demonstrated that epileptic seizures originating in the medial or orbital surface of the frontal lobe, the parietal lobe, or the temporal lobe, often occur in the absence of a measurable EEG abnormality when the EEG is performed using scalp electrodes.62–65 These types of seizures may arguably be relatively rare. However, given that the appearance of a nonepileptic seizure is often very similar to epileptic seizures originating in the medial or orbital surface of the frontal lobe, the parietal lobe, or the temporal lobe,66 these are the very patients who are the most likely to be misdiagnosed as having epileptic seizures. Thus, some false-negative decisions must be assumed to occur when video-EEG is used.
The fact that evidence-based conclusions were not drawn in the present report regarding the ability of vEEG to differentiate epileptic seizures from nonepileptic seizures should not be interpreted as evidence that this technology is not effective or useful. Indeed, vEEG may very well have an important role in diagnostic algorithms that are designed to make such a differential diagnosis. Until more high quality studies become available, however, the diagnostic performance characteristics of vEEG and its place in such diagnostic algorithms cannot be determined.
There are two purposes to the present question. First, to establish whether there is evidence in the peer-reviewed literature to indicate that some patients believed to suffer from a specific seizure type actually suffer from a different seizure type (either alone or in combination with the originally diagnosed seizure type) and would, therefore, not be expected to respond satisfactorily to their current treatment regimen. The second purpose is to quantify, if relevant, the prevalence of these patients among the population of patients thought to suffer from a particular seizure type.
Articles were included for Question 2C if they met the general criteria for inclusion presented in the Methodology section, and if the article reported on a study that enrolled patients originally diagnosed as having a specific type of epileptic seizure (partial seizure, generalized seizure, absence seizure, etc).
No studies addressed Question 2C and met both the general and subquestion specific inclusion criteria listed above. Consequently, Question 2C could not be answered.
Because Question 2C cannot be answered in an evidence-based fashion, Question 2D could not be addressed in an evidence-based fashion.
In this section of the Evidence Report, we addressed Key Question #3: Is there evidence that patients with treatment-resistant epilepsy are not optimized at their current level of treatment?
In the present question, we address whether patients described as having treatment-resistant epilepsy are receiving optimal dosages of the AED regimen prescribed for them. The available evidence for this question is derived from two types of studies. The first type is comprised of studies that assessed, using drug level monitoring, whether patients were truly treatment-resistant or at an otherwise optimized level of drug therapy. The second type is comprised of drug treatment studies that presented information on the pretrial or baseline status of the patients enrolled in a clinical trial. Patients in a clinical trial often receive optimized treatment as part of the trial and are therefore not representative of patients who are maintained on AEDs in clinical practice. Thus, the pretrial status of the patients is the best indication of whether drug optimization was part of routine clinical practice. We examined these groups of studies because they were the most likely type of studies to report the information necessary to address this question.
For the purposes of the present question, a drug regimen was defined as not optimized if a study reported enrolling any patients whose prior drug regimen: 1) had not been titrated, 2) was not in the therapeutic range, or 3) produced side effects. If a study reported that some patients had not received the maximal tolerated dosage, we considered this evidence of lack of titration, and therefore definitive evidence of lack of optimization. If the therapeutic range was not defined, we defined the range as either the therapeutic range of the maintenance dose or the blood concentration.g We considered patients receiving more than one AED to be in the upper end of the therapeutic range if at least one AED dosage or blood level was in this range. The Expert Panel and the Technical Experts formulated criteria 1 and 3. The second criterion, as originally suggested by the Technical Experts, specified only the upper end of the therapeutic range. This is a more stringent way to define optimization and may be inaccurate, as the maximum tolerable dosage for some patients may be below the upper end of the therapeutic range. We modified this criterion for this reason, and because several studies reported that not all patients were receiving drug doses in the therapeutic range. However, the possibility remains that certain patients outside the therapeutic range may have been optimized. Thus, the second and third criterion suggest the possibility of nonoptimization but do not provide definitive evidence of its existence. Therefore, we separately report patients who were not in the upper end of the therapeutic range
In addition to the general inclusion criteria (see Methodology section), we used the following criteria to determine whether a study was included:
The study must have reported information indicating that some patients in the study did not meet at least one of the criteria for optimization described above. Thus, we are seeking only evidence of nonoptimization, not a percentage of patients who are optimized.
The study must have been published in 1975 or later. This ensured the use of evidence on standard AEDs as well as evidence on newer agents.
All drugs in the study must have been cleared for marketing in the United States by the Food and Drug Administration. If the study included some patients on non-FDA approved drugs, it was required to report data separately from patients on FDA-approved drugs, and we abstracted results only from the latter group of patients. This criterion was determined by the Expert Panel and the Technical Experts.
We did not exclude any studies for reasons of quality.
Since this question does not involve an analysis of results, but merely reporting of patient status before entering a study, there is only one relevant issue concerning the internal validity of these studies. An apparently nonoptimized drug regimen could result not only from an inadequate drug dosage, but also from a patient's lack of compliance with the prescribed regimen. If the nonoptimized patients in a study were actually noncompliant, this would alter the assumption that nonoptimization was primarily due to prescription of nonoptimized drug regimens. Four studies required that all patients in the study were compliant.67-70 Two studies reported that some patients were suspected of noncompliance,71, 72 while in one study 8 of 35 patients admitted noncompliance (this study was not excluded because clearly other patients in the study were not optimized).73 The remaining 13 studies reported no information concerning compliance.
Six studies (two United States and four outside the United States) reported information indicating that the prior drug regimen of some patients had not been titrated. The lack of reporting of titration information does not necessarily mean that few patients have their drug regimens titrated in clinical practice. Titration may be a common practice, but study investigators may not report it.
Ten of 20 studies (three United States and seven from other countries) presented information indicating that the prior drug regimen was not in the therapeutic range for at least some patients in the studies. We further examined whether some studies enrolled patients whose prior drug regimen was not in the upper end of the therapeutic range.
Sixteen of 20 studies (five United States and 11 from other countries) presented information indicating that the prior drug regimen for some patients was not in the upper end of the therapeutic range.
Four studies (two United States and two outside the United States) presented information indicating that drug side effects occurred in at least some patients on a prior drug regimen.
In this section of the Evidence Report, we addressed Key Question #4: Which drug treatment strategy, 1) sequential monotherapy, 2) polytherapy, or 3) optimized current therapy leads to improved outcomes for patients with treatment-resistant epilepsy, and what are the relative improvements obtained with each strategy?
In this question, we address three drug treatment strategies that could potentially benefit patients with treatment-resistant epilepsy. By definition, patients with treatment-resistant epilepsy have already received AEDs that were ineffective. Therefore, in the present question we are addressing whether any changes in patients' drug regimens can potentially reduce their seizures.
We define sequential monotherapy as switching patients to a single AED that none of the patients had yet received. According to the desires of the Partners, patients could have been receiving multiple prior drugs before initiation of monotherapy. Polytherapy is defined as the simultaneous administration of more than one AED. It typically involves the addition of a single novel AED to patients' drug regimens (referred to as “add-on” treatment). Finally, we define optimized current therapy as altering the dose of at least one drug in patients' drug regimens, or removing at least one drug from patients' drug regimens. In optimized current therapy, drug dose can be altered by changing the total daily dose, the number of doses in a given day, or the drug preparation (such as a slow-release preparation). According to the desires of the Partners, optimized current therapy can also consist of the removal of a drug from patients' regimens. We address each of these three strategies in separate subquestions.
This question addresses the safety and efficacy of drug strategies, not of particular drugsh. However, the literature on monotherapy is comprised primarily of trials that examine the effects of changing patients' treatment from a number of AEDs to a singlespecific drug (e.g., topiramate). Similarly, the literature on polytherapy is comprised primarily of trials that involve adding a specific drug (that patients had not previously received) to their existing regimen, and the literature on optimized current therapy is comprised primarily of trials that removed a specific drug from patients' drug regimens. Thus, the literature is comprised primarily of certain specific implementations of these strategies. Although the findings of the individual trials have limited generalizability, when considered in aggregate (as below) they provide the best available estimates of the effectiveness of the three strategies.
Although we divided this question into four subsections (one for each treatment strategy, and one for comparisons between strategies), we employed the same inclusion criteria for studies of each strategy. Thus, in addition to the general inclusion criteria described in the Methodology section, we included trials for this question if they met all of the following criteria:
The trial must have been published in 1975 or later. For school- and work-related outcomes, the trial must have been published in 1985 or later. Including trials since 1975, as well as more recent trials, facilitated incorporation of data from standard AEDs that may no longer be the focus of clinical research.
Before the trial, patients must have received unsuccessful treatment with at least one of the following drugs: carbamazepine, ethosuximide, phenobarbital, phenytoin, primidone, or valproate. The Technical Experts determined that these six drugs are standard AEDs.
All drugs received by patients during the trial must be cleared for marketing in the United States by the U.S. Food and Drug Administration (FDA). If the trial included some patients on non-FDA-approved drugs, it was required to report data separately from patients on FDA-approved drugs, and we abstracted results only from the latter group of patients. We included trials employing off-label usage of drugs for the treatment of epilepsy. Confining the question to only FDA-approved drugs was in accordance with the wishes of the Expert Panel and Technical Experts.
If a trial reported that some patients had been noncompliant, then results must have been reported separately for patients who were compliant. Noncompliant patients may not be treatment-resistant, and their seizure rates may drop during a trial. Whether the improvement in noncompliant patients was due to better compliance or to the beneficial effect of the trial drug cannot be determined. Therefore, the outcomes of noncompliant patients were not included.
If there were five or more placebo-controlled randomized trials on a specific drug treatment strategy, then other trials with other designs (e.g., trials that used a low AED dose as a control) were not considered. We adopted this criterion because results of placebo-controlled randomized trials are more easily interpreted compared to results of trials that employed other control groups.
Trial must be a Phase II or III efficacy trial. Earlier trials (Phase I) were not primarily intended to reduce seizures, and later trials (Phase IV) involved drugs whose effectiveness had already been documented by other trials.
Trials that used a crossover design must have reported results for the first period (i.e., before the crossover), or must have reported that seizure frequencies returned to baseline at the end of the washout period. In a crossover trial, the use of a drug at the start of a trial may have potentially influenced the effectiveness of a different drug used later in the trial. If seizure frequency returned to baseline at the end of the washout period, then the evidence suggests that the first drug is no longer active, and data for the second drug are interpretable. However, if a return to baseline was not reported, then we only abstracted data for the first period.
To include the maximum number of potentially relevant studies, we did not require studies to report patients' seizure frequencies at baseline. However, baseline seizure frequencies do provide a measure of the severity of patients' initial conditions, thus aiding in the interpretation of study findings. For example, suppose a treatment eliminated all seizures. Such an outcome would be more impressive if patients' baseline seizure frequencies were 20 per month than if frequencies were only five per month. Because the baseline frequency helps place the study results in proper context, ideally all studies would report this frequency.
Applying the inclusion criteria yielded 55 studies describing the three drug strategies. There were 14 studies of sequential monotherapy, 30 studies of polytherapy, and 11 studies of optimized current therapy.
Sequential monotherapy involves administering a single AED not yet received by any of the patients. Patients can receive any number of AEDs prior to the initiation of the new drug. However, all prior AEDs must be withdrawn from patients' drug regimens in order to investigate the effect of the novel monotherapy drug. In this section, we describe the evidence base for sequential monotherapy, assess the quality of these trials with respect to both internal and external validity, and analyze the trials' results for all relevant outcomes.
Fourteen studies met the inclusion criteria. One of the 14 studies was excluded because the authors only reported a qualitative description of treatment efficacy.74
The evidence base for sequential monotherapy consists of 13 studies that enrolled 1,542 patients.
| Felbamate | Gabapentin | Lamotrigine | Oxcarbazepine | Primidone | Tiagabine | Topiramate | Valproate | ||
|---|---|---|---|---|---|---|---|---|---|
| Reference | 3600 mg/day | 2400 mg/day | 3600 mg/day | 500 mg/day | 2400 mg/day | 750 mg/day | 36 mg/day | 1000 mg/day | 150 μg/ml |
| Sachdeo (2001)68 | ![]() | ||||||||
| Beydoun (2000)86 | ![]() | ||||||||
| Kanner (2000)87 | ![]() | ||||||||
| Schachter (1999)79 | ![]() | ||||||||
| Gilliam (1998)76 | ![]() | ||||||||
| Bergey (1997)78 | ![]() | ||||||||
| Beydoun (1997)85 | ![]() | ||||||||
| Beydoun (1997)83 | ![]() | ||||||||
| Sachdeo (1997)84 | ![]() | ||||||||
| Devinsky (1995)75 | ![]() | ||||||||
| Schachter (1995)88 | ![]() | ||||||||
| Theodore (1995)89 | ![]() | ||||||||
| Faught (1993)77 | ![]() | ||||||||
| Totals | 3 | 1 | 1 | 1 | 3 | 1 | 1 | 1 | 1 |
mg/day Maximum dose in milligrams per day
μg/ml Maximum dose in micrograms per milliliter
In evaluating internal validity, we determined whether the results were potentially biased by the threats to validity that are discussed in the Methodology section. Although other questions in this report consider the potential for attrition bias, we do not consider it here because attrition was a study outcome. As discussed earlier, the control groups of these studies are not relevant to the question. Consequently, for the purpose of this report, the studies can be viewed as case series and susceptible to several threats to internal validity (see Appendix B). All were potentially affected by both regression bias and extraneous event bias. Further, most studies were potentially affected by sample specification bias (12/13 studies) and measurement bias (10/11 studies that reported the method of seizure measurement).
| Reference | Seizure Frequency | Adverse Effects | Quality of Life | Mood | Cognitive Function | Ability to Return to Work | Ability to Return to School | Ability to Hold a Driver's License | Mortality |
|---|---|---|---|---|---|---|---|---|---|
| Sachdeo (2001)68 | ![]() | ![]() | ![]() | ||||||
| Beydoun (2000)86 | ![]() | ![]() | |||||||
| Kanner (2000)87 | ![]() | ![]() | |||||||
| Schachter (1999)79 | ![]() | ![]() | |||||||
| Gilliam (1998)76 | ![]() | ![]() | |||||||
| Bergey (1997)78 | ![]() | ![]() | ![]() | ||||||
| Beydoun (1997)85 | ![]() | ![]() | ![]() | ||||||
| Beydoun (1997)83 | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() | |||
| Sachdeo (1997)84 | ![]() | ![]() | ![]() | ||||||
| Devinsky (1995)75 | ![]() | ![]() | |||||||
| Schachter (1995)88 | ![]() | ![]() | ![]() | ![]() | ![]() | ||||
| Theodore (1995)89 | ![]() | ![]() | ![]() | ||||||
| Faught (1993)77 | ![]() | ![]() | |||||||
| Totals | 13 | 13 | 2 | 3 | 2 | 0 | 0 | 0 | 5 |
In cases where meta-analysis was feasible, we used random effects models. We employed these models because the included studies investigated different drugs. Therefore, these studies cannot be viewed as having been sampled from a population of studies with a fixed mean. Random effects models employ statistical methods that are most applicable when studies use different variations of a treatment.
Also, these studies reported data on an intent-to-treat basis. This was particularly important because they employed a priori exit criteria (such as doubling of monthly seizure frequency) to limit harms to patients. If any patient met an exit criterion, investigators removed the patient from the study and reinstituted the patient's prior AED regimen. Consequently, the analyses described below included all patients who were randomized to receive high-dose monotherapy.
| Reference | Mean Absolute Seizure Frequency | Median Absolute Seizure Frequency | Median Absolute Difference From Baseline | Median Percent Difference From Baseline | Number of Patients Seizure-free | Number of Patients With ≥75 Percent Reduction | Number of Patients With ≥50 Percent Reduction | Number of Patients With Any Reduction | Mean Rank of Seizure Frequency | Number of Patients With Doubling of Monthly Seizure Frequency | Number of Patients With Doubling of Highest Consecutive Two-Day Seizure Frequency | Median Time to Exit | Mean Time on Monotherapy | Risk Ratio of Time to Exit |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Sachdeo (2001)68 | ![]() | ![]() | ![]() | |||||||||||
| Beydoun (2000)86 | ![]() | ![]() | ![]() | ![]() | ![]() | |||||||||
| Kanner (2000)87 | ![]() | ![]() | ||||||||||||
| Schachter (1999)79 | ![]() | ![]() | ![]() | |||||||||||
| Gilliam (1998)76 | ![]() | |||||||||||||
| Bergey (1997)78 | ![]() | ![]() | ||||||||||||
| Beydoun (1997)85 | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() | ||||||||
| Beydoun (1997)83 | ![]() | ![]() | ![]() | ![]() | ||||||||||
| Sachdeo (1997)84 | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() | ||||||||
| Devinsky (1995)75 | ![]() | |||||||||||||
| Schachter (1995)88 | ![]() | ![]() | ![]() | |||||||||||
| Theodore (1995)89 | ![]() | |||||||||||||
| Faught (1993)77 | ![]() | ![]() | ||||||||||||
| Totals | 1 | 2 | 1 | 1 | 6 | 2 | 4 | 1 | 1 | 5 | 5 | 4 | 4 | 2 |
Percentage of patients who were seizure-free during the study. This outcome was reported by six studies and ranged from 9 percent to 28 percent. Because of the lack of relevant control groups, we performed a threshold analysis (see the Methodology section for a discussion of this approach). In this analysis, we compared the results obtained in patients who received sequential monotherapy to those of a synthetic control group in which we varied the percentage of seizure-free patients. The percentage at which the difference between the monotherapy and “control” group became statistically nonsignificant is the threshold. The results of this analysis appear in Figure 11
We next performed a second threshold analysis of these data to test the effect of followup period on seizure-free status. Two of the six studies that reported seizure freedom employed short followup times (8 days in Bergey, Morris, Rosenfeld, et al.,78 and 10 days in Schacter, Vasquez, Fisher, et al.).79 In the other four studies, patients were followed for at least 16 weeks. The percentage of patients who were seizure-free was greater than 25 percent in both short-term studies, but was less than 14 percent in all of the long-term studies. Therefore, our second threshold analysis included only the four studies with longer followup times (Figure 12
In summary, approximately 11 percent of patients are seizure-free during long-term studies of sequential monotherapy. However, given the designs of these studies, whether the new drug actually caused any of the patients to become seizure-free during the study is not clear. Further, seizure frequencies change from month to month, and some patients with treatment-resistant epilepsy may experience periods without seizures, and some patients may have been misdiagnosed (Question 2). These latter patients may be more likely to become seizure-free. Even if some patients become seizure-free as a result of sequential monotherapy, the majority of patients (approximately 89 percent) continue to have seizures despite receiving a new drug as monotherapy. A firm conclusion about whether sequential monotherapy produces any new seizure-free patients would require the use of a relevant control group (i.e., continuation of prior drug regimens).
Percentage of patients whose monthly seizure frequency doubled. Freedom from seizures measures the percentage of patients who experienced maximum benefit. In contrast, seizure doubling indexes the percentage of patients who experienced significant harm. This outcome was reported by five studies and ranged from 9 percent to 29 percent. All five studies followed patients for at least 16 weeks, thus the concern about study duration does not apply to seizure doubling. Due to the lack of relevant control groups, we performed a threshold analysis of this outcome (Figure 13
Sequential monotherapy cannot be directly considered the cause of the doubling in seizure frequency because of the lack of a true control group. However, three factors do suggest a causal relation. First, at the beginning of the sequential monotherapy studies, all prestudy drugs were removed from patients' regimens and replaced with a new AED. Presumably, the original AEDs were already reducing seizure frequency. The removal of these drugs, therefore, may have caused seizures to increase. Second, a doubling of monthly seizure frequency was set by investigators as an a priori exit criterion. A doubling of seizure frequency resulted in immediate removal from the study, and all prestudy drugs were reinstituted. This suggests that investigators believed that the doubling of monthly seizure frequency was being caused by sequential monotherapy. Third, given the possibility that patients enter drug trials when they are relatively sick, they would not be expected to become even worse. Instead, based on regression-to-the-mean, reductions in seizure frequency would be expected. Each of these factors suggest that sequential monotherapy caused dramatic increases in seizures in some patients. A definitive conclusion about this possibility would require randomization of patients to either sequential monotherapy or a continuation of the prestudy drug regimen.
Percentage of patients whose highest two-day seizure frequency doubled. This outcome also measures the percentage of patients who experienced significant harm during studies of sequential monotherapy. It was reported by five studies, and ranged from 4 percent to 23 percent. All five studies followed patients for at least 16 weeks. Due to the lack of relevant control groups, we performed a threshold analysis of this outcome (Figure 14
As stated previously, whether sequential monotherapy was the cause of doubling of two-day seizure frequency cannot be determined without a true control group. However, the same three factors discussed above apply to this outcome as well. Based on these factors, sequential monotherapy in some patients appears to have caused doubling in two-day seizure frequency. Combining the estimates of the two seizure increase outcomes that result in exiting a trial, 30 percent of patients in studies of sequential monotherapy experience a doubling in either monthly seizure frequency or two-day seizure frequency. Therefore, some patients may be experiencing large seizure increases as a direct result of sequential monotherapy. To provide a definite answer, randomizing patients to receive either sequential monotherapy or a continuation of the prestudy drug regimen would be necessary.
In clinical practice, a physician prescribing an AED for a patient with treatment-resistant epilepsy must consider not only the possible reduction of seizure frequency, but also the possible adverse effects of the new drug. Before entering studies, patients with treatment-resistant epilepsy were already experiencing adverse effects from their prestudy antiepileptic drug regimens. None of the studies reported these patients' prestudy adverse effects, and none reported whether the adverse effects observed during the study were more or less severe compared to patients' prestudy adverse effects. This latter outcome would have been informative because patients (and physicians) seek to reduce adverse effects as well as seizure frequency.
| Reference | Drug | Dose in mg/day | Percent of Patients Who Experienced Any Adverse Effect | Name of Most Commonly Experienced Adverse Effect | Percent of Patients Who Experienced This Adverse Effect |
|---|---|---|---|---|---|
| Sachdeo (2001)68 | Oxcarbazepine | 2400 | NR | Headache | 11% (5/45) |
| Beydoun (2000)86 | Oxcarbazepine | 2400 | NR | Dizziness | 46% (19/41) |
| Kanner (2000)87 | Primidone | 750 | 53% (16/30) | Irritability | 37% (11/30) |
| Schachter (1999)79 | Oxcarbazepine | 2400 | 91% (46/51) | Nervous system | 45% (23/51) |
| Gilliam (1998)76 | Lamotrigine | 500 | 75% (57/76) | Dizziness | 20% (15/76) |
| Bergey (1997)78 | Gabapentin | 3600 | 73% (29/40) | Ataxia | 20% (8/40) |
| Beydoun (1997)85 | Valproate | 150 μG/mL | NR | Tremor | 64% (61/96) |
| Beydoun (1997)83 | Gabapentin | 2400 | 88% (80/91) | Dizziness | 25% (23/91) |
| Sachdeo (1997)84 | Topiramate | 1000 | NR | Paresthesia | 58% (14/24) |
| Devinsky (1995)75 | Felbamate | 3600 | NR | NR | NR |
| Schachter (1995)88 | Tiagabine | 36 | 95% (91/96) | Dizziness | 35% (34/96) |
| Theodore (1995)89 | Felbamate | 3600 | NR | NR | NR |
| Faught (1993)77 | Felbamate | 3600 | NR | Headache | 34% (19/56) |
mg/day Milligrams per day
NR Not reported
μG/ml Micrograms per milliliter
Percentage of patients who exited trials due to adverse effects. To summarize the available data on adverse effects, we focused on whether the adverse effects in a given patient were severe enough to warrant discontinuation of the new drug (i.e., trial exit). This outcome is a marker of treatment failure. All 13 included trials reported the percentage of patients who exited trials due to adverse effects, and it ranged from 0% to 29%. As with seizure frequency, due to the lack of relevant control groups we performed a threshold analysis (Figure 15
Ketter, Malow, Flamini, et al.82 reported that mood and psychiatric symptom scores changed after 2 weeks of sequential monotherapy in patients given felbamate. After the removal of all AEDs, patients' mood scores significantly worsened relative to baseline for seven of the 11 subscales. At both week 1 and week 2 of felbamate monotherapy, the decrements persisted. Thus, the initiation of felbamate monotherapy did not return patients' mood scores to baseline. With longer followup, patients' mood scores may potentially have returned to baseline or even improved over baseline. However, these findings suggest that the first phase of sequential monotherapy (i.e., the drug reduction phase) may cause significant worsening of mood and psychiatric symptom scores. However, because there was only one study reporting such changes, firm evidence-based conclusions cannot be drawn about the general effect of sequential monotherapy on mood.
No studies of sequential monotherapy reported this outcome.
No studies of sequential monotherapy reported this outcome.
No studies of sequential monotherapy reported this outcome.
No studies of sequential monotherapy reported this outcome.
Polytherapy is defined as the administration of a multiple-drug regimen in which at least one of the drugs is novel to each patient. As with sequential monotherapy, patients received any number of drugs prior to the initiation of a new drug. Most polytherapy interventions involve the addition of a single novel drug to patients' regimens (referred to as “add-on” treatment). In this section, we describe the evidence base for polytherapy, assess the quality of these trials with respect to both internal and external validity, and analyze the trials' results for all relevant outcomes.
Thirty trials of polytherapy met the inclusion criteria. None were excluded for quality reasons.
The evidence base contained 30 trials that enrolled 4,834 patients.
| Reference | Drug | Trial Dose(s)a | Total Number of Trials That Used This Drug/Dose Combination |
|---|---|---|---|
| Faught (2001)94 | Zonisamide | 400 | 1 |
| Ben-Menachem (2000)95 | Levetiracetam | 3000 | 2 |
| Betts (2000)96 | Levetiracetam | 2000, 4000 | 1, 1 |
| Cereghino (2000)97 | Levetiracetam | 1000, 3000 | 1, 2 |
| Glauser (2000)98 | Oxcarbazepine | Tailored to weight | 1 |
| Appleton (1999)99 | Gabapentin | Tailored to weight | 1 |
| Biton (1999)100 | Topiramate | Tailored to weight | 3 |
| Duchowny (1999)30 | Lamotrigine | Tailored to weight | 2 |
| Elterman (1999)101 | Topiramate | Tailored to weight | 3 |
| Korean Topiramate Study Group (1999)102 | Topiramate | 600 | 4 |
| Sachdeo (1999)103 | Topiramate | Tailored to weight | 3 |
| Uthman (1998)104 | Tiagabine | 16, 32, 56 | 1, 2, 1 |
| Sachdeo (1997)105 | Tiagabine | 32b, 32c | 2, 1 |
| Ben-Menachem (1996)90 | Topiramate | 800 | 2 |
| Chadwick (1996)106 | Gabapentin | 1200 | 3 |
| Faught (1996)91 | Topiramate | 200, 400, 600 | 1, 2, 4 |
| Privitera (1996)107 | Topiramate | 600, 800, 1000 | 4, 2, 1 |
| Sharief (1996)108 | Topiramate | 400 | 2 |
| Tassinari (1996)109 | Topiramate | 600 | 4 |
| Willmore (1996)110 | Valproate | Tailored to weight | 1 |
| Anhut (1994)111 | Gabapentin | 900, 1200 | 2, 3 |
| Messenheimer (1994)112 | Lamotrigine | 400 | 1 |
| Bourgeois (1993)113 | Felbamate | 3600 | 1 |
| Felbamate Study Group (1993)114 | Felbamate | Tailored to weight | 1 |
| Matsuo (1993)115 | Lamotrigine | 300, 500 | 1, 1 |
| McLean (1993)116 | Topiramate | 600, 1200, 1800 | 4, 1, 1 |
| Schmidt (1993)117 | Zonisamide | Tailored to weight | 1 |
| Sivenius (1991)118 | Gabapentin | 900 | 2 |
| UK Gabapentin Study Group (1990)119 | Gabapentin | 1200 | 3 |
| Jawad (1989)70 | Lamotrigine | Tailored to weight | 2 |
Maximum dose in milligrams per day
Based on 16 milligrams twice per day
Based on 8 milligrams four times a day
| Reference | Seizure Frequency | Adverse Effects | Quality of Life | Mood | Cognitive Function | Ability to Return to Work | Ability to Return to School | Ability to Hold a Driver's License | Mortality |
|---|---|---|---|---|---|---|---|---|---|
| Faught (2001)94 | ![]() | ![]() | |||||||
| Ben-Menachem (2000)95 | ![]() | ![]() | |||||||
| Betts (2000)96 | ![]() | ![]() | |||||||
| Cereghino (2000)97 | ![]() | ![]() | ![]() | ![]() | |||||
| Glauser (2000)98 | ![]() | ![]() | ![]() | ||||||
| Appleton (1999)99 | ![]() | ![]() | ![]() | ||||||
| Biton (1999)100 | ![]() | ![]() | |||||||
| Duchowny (1999)30 | ![]() | ![]() | |||||||
| Elterman (1999)101 | ![]() | ![]() | ![]() | ||||||
| KTSG (1999)102 | ![]() | ![]() | |||||||
| Sachdeo (1999)103 | ![]() | ![]() | |||||||
| Uthman (1998)104 | ![]() | ![]() | ![]() | ![]() | ![]() | ||||
| Sachdeo (1997)105 | ![]() | ![]() | |||||||
| Ben-Menachem (1996)90 | ![]() | ![]() | ![]() | ||||||
| Chadwick (1996)106 | ![]() | ![]() | ![]() | ||||||
| Faught (1996)91 | ![]() | ![]() | ![]() | ||||||
| Privitera (1996)107 | ![]() | ![]() | ![]() | ||||||
| Sharief (1996)108 | ![]() | ![]() | |||||||
| Tassinari (1996)109 | ![]() | ![]() | |||||||
| Willmore (1996)110 | ![]() | ![]() | |||||||
| Anhut (1994)111 | ![]() | ![]() | ![]() | ||||||
| Messenheimer (1994)112 | ![]() | ![]() | |||||||
| Bourgeois (1993)113 | ![]() | ![]() | |||||||
| FSG (1993)114 | ![]() | ![]() | |||||||
| Matsuo (1993)115 | ![]() | ![]() | |||||||
| McLean (1993)116 | ![]() | ![]() | |||||||
| Schmidt (1993)117 | ![]() | ![]() | |||||||
| Sivenius (1991)118 | ![]() | ![]() | |||||||
| UKGSG (1990)119 | ![]() | ![]() | |||||||
| Jawad (1989)70 | ![]() | ![]() | |||||||
| Totals | 30 | 30 | 2 | 1 | 1 | 0 | 0 | 0 | 9 |
We performed all meta-analyses on an intent-to-treat basis. This means that we included all randomized patients in our analyses, not solely the patients who completed the trials. If a patient exited early from a trial and the authors did not report the relevant outcome for that patient, we assumed that seizure frequency did not decrease for that patient. This is a reasonable assumption because all patients who respond to a drug would likely be reported as responders.
| Absolute Monthly Seizure Frequency | Absolute Difference From Baseline | Absolute Percent Difference From Baseline | Number of Patients With | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Reference | Mean | Median | Mean | Median | Mean | Median | Seizure-free | ≥75 Percent Reduction | ≥50 Percent Reduction | ≥25 Percent Reduction | Any Reduction | Any Increase | ≥50 Percent Increase | ≥75 Percent Increase | Mean Response Ratioa | Mean Adjustment Response Ratioa |
| Faught (2001)94 | ![]() | ![]() | ![]() | ![]() | ||||||||||||
| Ben-Menachem (2000)95 | ![]() | ![]() | ![]() | ![]() | ||||||||||||
| Betts (2000)96 | ![]() | ![]() | ![]() | ![]() | ||||||||||||
| Cereghino (2000)97 | ![]() | ![]() | ![]() | ![]() | ||||||||||||
| Glauser (2000)98 | ![]() | ![]() | ![]() | |||||||||||||
| Appleton (1999)99 | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() | |||||||
| Biton (1999)100 | ![]() | ![]() | ![]() | ![]() | ||||||||||||
| Duchowny (1999)30 | ![]() | ![]() | ![]() | ![]() | ||||||||||||
| Elterman (1999)101 | ![]() | ![]() | ![]() | ![]() | ||||||||||||
| KTSG (1999)102 | ![]() | ![]() | ![]() | ![]() | ![]() | |||||||||||
| Sachdeo (1999)103 | ![]() | ![]() | ![]() | ![]() | ||||||||||||
| Uthman (1998)104 | ![]() | ![]() | ![]() | ![]() | ![]() | |||||||||||
| Sachdeo (1997)105 | ![]() | ![]() | ![]() | |||||||||||||
| Ben-Menachem (1996)90 | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() | ||||||||||
| Chadwick (1996)106 | ![]() | ![]() | ||||||||||||||
| Faught (1996)91 | ![]() | ![]() | ![]() | ![]() | ||||||||||||
| Privitera (1996)107 | ![]() | ![]() | ![]() | |||||||||||||
| Sharief (1996)108 | ![]() | ![]() | ![]() | ![]() | ||||||||||||
| Tassinari (1996)109 | ![]() | ![]() | ![]() | ![]() | ![]() | |||||||||||
| Willmore (1996)110 | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() | ||||||||
| Anhut (1994)111 | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() | ||||||||||
| Messenheimer (1994)112 | ![]() | |||||||||||||||
| Bourgeois (1993)113 | ||||||||||||||||
| FSG (1993)114 | ![]() | ![]() | ![]() | ![]() | ![]() | |||||||||||
| Matsuo (1993)115 | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() | |||||||||
| McLean (1993)116 | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() | ||||||||||
| Schmidt (1993)117 | ![]() | ![]() | ![]() | ![]() | ![]() | |||||||||||
| Sivenius (1991)118 | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() | |||||||||
| UKGSG (1990)119 | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() | ||||||||
| Jawad (1989)70 | ![]() | ![]() | ||||||||||||||
| Totalsb | 2 | 11 | 1 | 3 | 2 | 24 | 18 | 17 | 27 | 8 | 5 | 6 | 4 | 2 | 2 | 4 |
The response ratio is the ratio (T-B)/(T+B) where T is the number of seizures a month during treatment and B is the number of seizures a month during baseline. Some authors adjusted the response ratio in order to account for differences between centers in multi-center trials (using ANOVA).
In considering meta-analyses of the seizure frequency outcomes, nine of the 30 trials each contained three or more groups of patients. From each of these nine trials, therefore, multiple effect sizes can be computed (e.g., dose 1 vs. placebo, and dose 2 vs. placebo). Multiple effect sizes within a single trial are statistically dependent. Ideally, we would analyze these data using general linear models that account for this dependence. This, however, was precluded by the relative paucity of data. Therefore, to avoid this dependence, for each meta-analysis we selected only one drug dose from each trial. Thus, the effect size we computed for each trial was based on the difference between outcomes in one add-on drug group and the add-on placebo group. In some meta-analyses (“high-dose”), we selected the highest-dose group in each trial, whereas in other meta-analyses (“low-dose”), we selected the lowest-dose group in each trial. Trials with only one add-on drug group appear in both the high-dose and low-dose meta-analyses. Consequently, the high-dose meta-analysis was not independent of the low-dose meta-analysis.
A comparison between the results of a high-dose meta-analysis with those of the corresponding low-dose meta-analysis can be viewed as a form of sensitivity analysis. Larger effect sizes may be expected a priori in the high-dose meta-analysis (i.e., larger effects with higher doses). Performing both analyses permits us to estimate the robustness of the results. Although this approach allows us to estimate the effect of high- and low-dose polytherapy, it has the disadvantage that each meta-analysis uses only a subset of the available data. Consequently, some information is lost in our analysis.
Median percentage reduction. Twenty-four of the 30 included trials reported the median percentage reduction in seizures. However, none of these trials reported the dispersion about these medians (e.g., variances, standard deviations, interquartile ranges). Therefore, effect sizes could not be calculated and a meta-analysis was not conducted with these data.
Note: In this plot, positive numbers represent reductions in seizures, whereas negative numbers represent increases in seizures.
Seizure-freedom. Eighteen of the 30 included trials reported the percentage of patients who became seizure-free. Two of these trials, however, only reported seizure-freedom for a severe type of partial seizure (secondarily generalized seizures) that was experienced by only a subset of patients before the trials.90, 91 They did not report freedom from a seizure type that all patients had experienced before the trial. Thus, we analyzed the 16 trials of polytherapy that did report the latter kind of seizure-freedom results.
In summary, the evidence suggests that adding a drug to patient's regimens increases the likelihood of becoming seizure-free. This finding occurred in both the high-dose and low-dose groups, and multiple sensitivity analyses did not overturn the results. However, seizure-freedom was analyzable for only 16 of the 30 trials of polytherapy.
These studies suggest that when a drug is added to patients' drug regimens, approximately one-third of patients will experience a 50 percent or more reduction in seizures. As mentioned above, however, the generalizability of this finding may be limited.
These studies suggest that when certain AEDs are added to patients' drug regimens, approximately two-thirds of patients will experience some reduction in seizures. This analysis, like the previous one, may have limited generalizability.
These data suggest that when certain AEDs are added to patients' drug regimens, approximately 20 percent of patients will experience an increase in seizures. This analysis, like the previous ones, may have limited generalizability.
| Reference | Drug and Dose (mg/day) | Percent of Patients Who Experienced Any Adverse Event | Name of Most Commonly Experienced Adverse Event | Percent of Patients Who Experienced This Adverse Event |
|---|---|---|---|---|
| Faught (2001)94 | Zonisamide 400 | NR | Somnolence | 15% (18/118) |
| Ben-Menachem (2000)95 | Levetiracetam 3000 | 55% (100/181) | Asthenia | 13.8% (25/181) |
| Betts (2000)96 | Levetiracetam 2000 | 83% (35/42) | Asthenia | 31% (13/42) |
| Betts (2000)96 | Levetiracetam 4000 | 84% (32/38) | Somnolence | 45% (17/38) |
| Cereghino (2000)97 | Levetiracetam 1000 | 89% (87/98) | Infection | 28% (27/98) |
| Cereghino (2000)97 | Levetiracetam 3000 | 89% (90/101) | Infection | 27% (27/101) |
| Glauser (2000)98 | Oxcarbazepine 1800 | 91% (125/138) | Vomiting | 36% (50/138) |
| Appleton (1999)99 | Gabapentin 1800 | NR | Viral infection | 11% (13/119) |
| Biton (1999)100 | Topiramate 400 | NR | Upper respiratory tract infection | 41% (16/39) |
| Duchowny (1999)30 | Lamotrigine 750 | 94% (92/98) | Somnolence | 24% (24/98) |
| Elterman (1999)101 | Topiramate 400 | NR | Upper respiratory tract infection | 41% (17/41) |
| KTSG (1999)102 | Topiramate 600 | 81% (74/91) | Anorexia | 21% (19/91) |
| Sachdeo (1999)103 | Topiramate 600 | NR | Somnolence | 42% (20/48) |
| Uthman (1998)104 | Tiagabine 16 | NR | Nervous system | 69% (42/61) |
| Uthman (1998)104 | Tiagabine 32 | NR | Nervous system | 70% (62/88) |
| Uthman (1998)104 | Tiagabine 56 | NR | Nervous system | 77% (44/57) |
| Sachdeo (1997)105 | Tiagabine 32 | NR | Nervousness | 10.5% (11/105) |
| Ben-Menachem (1996)90 | Topiramate 800 | NR | Fatigue | 79% (22/28) |
| Chadwick (1996)106 | Gabapentin 1200 | 67% (39/58) | Somnolence | 12% (7/58) |
| Faught (1996)91 | Topiramate 200 | NR | Dizziness | 36% (16/45) |
| Faught (1996)91 | Topiramate 400 | NR | Dizziness | 33% (15/45) |
| Faught (1996)91 | Topiramate 600 | NR | Dizziness | 35% (16/46) |
| Privitera (1996)107 | Topiramate 600 | NR | Fatigue | 38% (18/48) |
| Privitera (1996)107 | Topiramate 800 | NR | Abnormal thinking | 44% (21/48) |
| Privitera (1996)107 | Topiramate 1000 | NR | Dizziness | 38% (18/47) |
| Sharief (1996)108 | Topiramate 400 | NR | Somnolence | 35% (8/23) |
| Tassinari (1996)109 | Topiramate 600 | NR | Headache | 27% (8/30) |
| Willmore (1996)110 | Valproate 90 mg/kg | NR | Nausea | 48% (37/77) |
| Anhut (1994)111 | Gabapentin 900 | 63% (33/52) | Somnolence | 22% (24/111) |
| Anhut (1994)111 | Gabapentin 1200 | 68% (76/111) | Somnolence | 13% (7/52) |
| Messenheimer (1994)112 | Lamotrigine 400 | NR | Rash | 7% (3/44) |
| Bourgeois (1993)113 | Felbamate 3600 | NR | Headache | 40% (12/30) |
| FSG (1993)114 | Felbamate 3600 | NR | Anorexia | 49% (18/37) |
| Matsuo (1993)115 | Lamotrigine 300 | NR | Headache | 32% (23/71) |
| Matsuo (1993)115 | Lamotrigine 500 | NR | Dizziness | 54% (39/72) |
| McLean (1993)116 | Gabapentin 600 | 88% (89/101) | Dizziness | 25% (13/53) |
| McLean (1993)116 | Gabapentin 1200 | 91% (49/54) | Somnolence | 36% (36/101) |
| McLean (1993)116 | Gabapentin 1800 | 87% (46/53) | Somnolence | 20% (11/54) |
| Schmidt (1993)117 | Zonisamide 20 mg/kg | 59% (42/71) | Fatigue | 23% (16/71) |
| Sivenius (1991)118 | Gabapentin 900 | NR | Drowsiness | 25% (4/16) |
| UKGSG (1990)119 | Gabapentin 1200 | 62% (38/61) | Somnolence | 14.8% (9/61) |
| Jawad (1989)70 | Lamotrigine 400 | NR | NR | NR |
mg/day Milligrams per day
NR Not reported
mg/kg Milligrams per kilogram
To summarize the available data on adverse effects, we focused on whether the adverse effects in a given patient were severe enough to warrant discontinuation of the new drug (i.e., trial exit). In trials of polytherapy, an add-on drug may be more likely or less likely to be discontinued due to adverse effects compared to add-on placebo.
Thus, adding a certain AED to a patient's drug regimen is more likely to cause adverse effects resulting in trial exit compared to adding a placebo. This finding persisted through multiple sensitivity analyses.
Tradeoff between seizure frequency and adverse effects. We next evaluated the tradeoff between seizure frequency and adverse effects in trials of polytherapy. In the section on seizure frequency, we concluded that adding a drug to a patient's regimen is more likely to reduce seizures compared to adding a placebo. However, in the section on adverse effects, we concluded that adding a drug is also more likely to cause adverse effects resulting in trial exit. To illustrate the tradeoff, we constructed a scatterplot in which the horizontal axis represented the effect size for 50 percent seizure reduction and the vertical axis represented the effect size for exiting the trial due to adverse effects (Figure 27
No trials of polytherapy reported this outcome.
No trials of polytherapy reported this outcome.
No trials of polytherapy reported this outcome.
No trials of polytherapy reported this outcome.
The previous two parts of the present question addressed strategies related to the use of new AEDs or new combinations of AEDs in patients with treatment-resistant epilepsy. In this section, we assess strategies designed to optimize the effectiveness of a patient's current drug regimen. Strategies designed to optimize current drug therapy seek to improve patient outcomes by either; (1) reducing seizure frequency without increasing the incidence (or intensity) of the side effects associated with AED treatment or, (2) by reducing the side effects of AED without increasing seizure frequency, seizure severity, or the onset of a new seizure type. Ideally, a drug regimen would both decrease seizure frequency and reduce side effects. However, as shown above, this rarely occurs in patients with treatment-resistant epilepsy, and a trade-off exists between the intensity of drug treatment and the incidence and severity of associated side effects. Thus, in order for an optimization strategy to be of value, it must either lead to reductions in seizure frequency or reductions in side effects (and/or improvements in quality of life, cognitive function, and mood) while not leading to increases in the other.
Published literature describes three different methods for optimizing drug therapy in patients with treatment-resistant epilepsy: (1) increasing the dose of the current drug (or drugs) to maximum tolerable levels, (2) modifying the frequency of dosing, and (3) reducing the total number of drugs. In this subquestion, we evaluate the literature pertaining to all three of these strategies.
As discussed in the Methodology section of this report, only treatment strategies that were addressed by at least five acceptable studies were evaluated. One of the three drug-optimization strategies, the drug reduction strategy, was addressed by enough studies to meet this criterion. The prerequisite number of studies did not address the remaining two strategies, even when the inclusion criteria were relaxed to allow for the inclusion of retrospective studies. Consequently, we do not include further information concerning implementation of either the maximal tolerable dose or the optimized dosing frequency strategies.
The goal of drug reduction strategy is to reduce the number of AEDs without increasing seizure frequency above some unacceptable level. As implied above, this strategy is based on the (reasonable) assumption that reducing the number of AEDs taken by a patient should result in reduced side effects, which will lead to increased quality of life, improved cognitive function, improvements in mood, and reduced costs.n
All of the studies included in the present evidence base were prospective, three were controlled,121, 122, 126 and the remaining four utilized a case series design.120, 123–125 Two of the three controlled trials were single-blinded,o and not randomized.121, 126 The remaining controlled trial was randomized and double blinded.122 However, this study randomized patients within the drug reduction arm to drug reduction at either a slow rate or a fast rate, and patients were not randomly allocated to the two principal arms of the study, the drug reduction and the control arms. Since the primary objective of the present subquestion is to determine whether implementation of the drug reduction strategy leads to improved patient outcomes, this study, for the purposes of this section of the report, must be considered a nonrandomized controlled trial.
This section presents the findings of our systematic assessment of the quality of the seven studies that assessed the effectiveness of the drug reduction strategy. This systematic assessment consisted of an appraisal of both the internal and external validity of each included study.
Measurement bias, regression to the mean, extraneous event bias, and sample specification bias were potentially present in all seven studies. Patient reporting bias and investigator reporting bias may have been present in six studies. Selection bias potential affected the three controlled trials. Sampling bias may have been present in the six studies that did not report how patients were enrolled in the study. Attrition bias was a potential factor in one study with more than a 10 percent attrition rate. These potential biases with respect to this question are discussed in detail in Appendix B.
The range of ages covered by each of the studies in the present evidence base tended to be broad, and, although no study exclusively enrolled adults, six of the seven studies enrolled mainly adults.120–122, 124–126 The remaining study enrolled solely children.123 We were unable to determine the upper age of the patients in the study described by Callaghan, O'Dwyer, and Keating124 because of inconsistent reporting (the reported mean patient age was 26 years but the range was reported as 6 to 24 years). The duration of epilepsy suffered by the patients in the included studies varied considerably with durations ranging from less than 1 year to well over 60 years.
The proportion of females in each of the studies included in the present evidence base varied considerably between studies (from under 25 percent to over 80 percent). One study did not report the sex ratio (Schmidt125).
Two of the seven studies included for this question did not restrict their patient sample by age or seizure type.124, 126 The remaining five studies enrolled patients because they were considered representative of a specific subpopulation of patients with treatment-resistant epilepsy. Three of the studies recruited institutionalized patients with severe epilepsy and multiple cognitive and/or behavioral deficits.120–122 Two of the studies recruited patients because they suffered from a particular seizure type.120, 125
Although all studies included in the present evidence base investigated a common optimization strategy (the drug reduction strategy), each study did so in a different way. For example, the aim of Specht, Boenigk, Wolf, et al.123 was to evaluate the effects of the removal of all patients in their study from a single drug (clonazepam), whereas the aim of the study by Callaghan, O'Dwyer, and Keating124 was to reduce all patients in their study from polytherapy to monotherapy or, if this was not possible, to two AEDs. Because the evidence base pertaining to drug reduction strategy was small, quantitative analyses could not be performed that would indicate whether the findings of the individual studies were similar. Without evidence to demonstrate such similarity, conclusions about the effectiveness of the drug reduction strategy as a whole are not possible. Instead, each variation of the drug reduction strategy must be considered separately, and the findings of each individual study may only be generalized to patients with characteristics similar to those included in that study.
The assessment of study quality presented above indicates that, given the present evidence base, definitive conclusions cannot be drawn about whether implementation of the drug reduction strategy is effective in improving outcome in patients with treatment-resistant epilepsy. Acknowledging this, we have instead evaluated the available data with the aim of determining whether the implementation of this strategy may plausibly be effective in improving outcome among patients with treatment-resistant epilepsy.
| Reported Seizure Outcomes | Reported Nonseizure Outcomes | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Reference | Difference in Absolute Seizure Frequency | Percent Change in Seizure Frequency | Percent Patients Seizure-free | Percent of Patients With >50 Percent Reduction in Seizure Rate | Percent of Patients With an Increase in Seizure Frequency | Mood | Cognitive Function | Adverse Events | Mortality |
| Controlled trials performed outside of the United States | |||||||||
| May (1992)121 | a | ![]() | ![]() | ||||||
| Duncan (1990)122 | ![]() | b | c | d | ![]() | ||||
| Thompson (1982)126 | ![]() | ![]() | ![]() | ![]() | |||||
| Case series performed in the United States | |||||||||
| Mirza (1993)120 | ![]() | ![]() | ![]() | ![]() | |||||
| Case series performed outside of the United States | |||||||||
| Specht (1989)123 | e | e | e | e | e | ![]() | ![]() | ||
| Callaghan (1984)124 | ![]() | ? | ![]() | ![]() | ![]() | ||||
| Schmidt (1983)125 | e | e | e | e | e | ![]() | ![]() | ||
| Number of articles addressing outcome | 4f | 2 | 4 | 4 | 3 | 2 | 3 | 5 | 7 |
May, Bulmahn, Wolhlhuter et al.121 reported that no statistically significant between groups differences in seizure frequency were seen but did not present any data
Mood data abstracted from Kendrick, Duncan, and Trimble136
Cognitive function data abstracted from Duncan, Shorvon, and Trimble137
Adverse events data abstracted from Duncan, Shorvon, and Trimble138
Data calculated by ECRI from individual patient data
Does not include May, Bulmahn, Wolhlhuter et al.121 (see footnote a above).
As stated previously, the goal of the drug reduction strategy is to remove a drug (or drugs), thereby reducing the occurrence of (or the risk for) adverse effects associated with the use of AEDs. This goal must be accomplished without increasing seizure frequency to unacceptably high levels. Although reductions in seizure frequency are desirable and may indeed occur, they are not, in this instance, to be expected. Consequently, studies needed only to demonstrate that implementation of the drug reduction strategy resulted in other benefits such as reductions in adverse events, increases in cognitive function, increases in quality of life, reduced cost, etc. This means that trials that evaluate changes in seizure frequency that result from drug reduction strategy must also demonstrate, through hypothesis testing, that clinically meaningful increases in seizure frequency did not occur.
In such trials, which are akin to studies of therapeutic equivalence,p classical hypothesis testing (with the usual null hypothesis that there is no difference between the interventions) is inappropriate.127–131 This is because the desired result of a bioequivalence study would be to prove the null hypothesis by showing that no increases in seizure frequency occurred in the treatment group when compared to the comparison group.q An alternative hypothesis allows meaningful statistical analyses to be performed. In this instance, the alternative hypothesis is that seizure frequency increases in the treatment group will be less than a prespecified level, δ, above the seizure frequencies seen in the comparison group (HA: XDRS-Xc<δ; where HA = alternative hypothesis). Thus, to demonstrate that implementation of a drug reduction strategy does not lead to increases in seizure frequency, any difference in seizure frequency between the treatment group and the comparison group (along with its 95 percent confidence intervals) must fall entirely below δ. Confidence intervals that extend above δ indicate that the alternative hypothesis has not been refuted and implementation of the strategy may lead to increases in seizure frequency.
| Testing H0: XC = XDRS | Testing H0': XDRS≥Xc + δ | |||
|---|---|---|---|---|
| True Difference XDRS - Xc | Fail to Reject (good for DRS) | Reject (bad for DRS) | Reject (good for DRS) | Fail to Reject (bad for DRS) |
| XDRS - Xc = 0 (good for DRS) | Correct Decision | Type I error | Correct Decision | Type II error |
| XDRS - Xc = δ (bad for DRS) | Type II error | Correct Decision | Type I error | Correct Decision |
Adapted from Blackwelder128
DRS Drug reduction study
H0 Null hypothesis (standard)
H0' Null hypothesis (therapeutic equivalence studies)
Xc Mean seizure frequency in control group
XDRS Mean seizure frequency in drug reduction therapy group
δ Predefined difference in mean seizure frequency above which use of drug reduction study is unacceptable
As discussed above, to demonstrate that seizure frequency does not increase in patients using a drug reduction strategy, the strategy must be shown not to cause clinically important increases in seizure frequency (XDRS- Xc ≤ δ). This requires the authors to explicitly state what they consider a meaningful increase in seizure frequency (δ). Based on this seizure frequency, they should then state the size of the study (power) necessary to overturn the null hypothesis that seizure frequency in patients who received drug reduction will increase above this predefined seizure frequency.r Neither of the two controlled trials that reported this outcome stated what they considered to be a clinically important difference in seizure frequency, nor did they perform a power analysis.s
All of the statistical analyses presented in these two articles tested the traditional null hypothesis that no between-groups differences in seizure frequency exist. Thus, their analyses essentially attempted to prove the null hypothesis that there was no change in seizure frequency. As discussed above, this is inappropriate.
Because the investigators did not determine the power of their study and because their statistical analyses were not appropriate for the clinical question of interest, the seizure frequency analyses in the articles are of limited value. However, summary data from these studies may still be used to provide some useful information. This can be accomplished by calculating the mean difference in seizure frequency (and its CI) between the drug-reduction group and the control group for each study. The upper CI of this difference can then be used to determine the maximum magnitude of increase in seizure frequency that will not lead to the alternative hypothesis (HA: XDRS - Xc ≤ δ) being accepted over the null hypothesis (H0: XDRS - Xc >δ). Such an approach, however, requires that the study report seizure frequency data in such a way that a difference can be calculated.
Duncan, Shorvon, and Trimble122 summarized their seizure frequency data in terms of mean seizure frequencies along with its standard deviation. The range cannot be used to calculate a valid standardized between-groups difference. No other measures of dispersion were reported. As a result, the seizure frequency data presented by Duncan, Shorvon, and Trimble122cannot be used to determine whether implementation of the drug reduction strategy leads to clinically important increases in seizure frequency.
Unlike Duncan, Shorvon, and Trimble,122 Thompson and Trimble126 presented mean seizure frequency along with its dispersion (expressed in terms of standard deviations). However, the analysis described above still cannot be performed, because the technique is sensitive to pretreatment differences in seizure frequency. Although no statistically significant between-groups differences in seizure frequency data were detected at pretreatment, a between-groups differences in seizure frequency at baseline did exist, and these differences were large enough to lead to biased posttreatment effect size estimates. For example, the mean pretreatment frequency for partial seizures in the drug reduction group was 21.1 (SD: 34.6) seizures per week compared to 6.8 (SD: 9.7) per week in the control group. Thus, patients in the drug reduction arm were experiencing more than three times the number of seizures per week compared to the patients in the control arm at study onset. Consequently, the study is biased against finding that the implementation of the drug reduction strategy will lead to increases in seizure frequency.
To summarize, the data from the currently available controlled trials could not be used to draw evidence-based conclusions about whether or not implementation of the drug reduction strategy leads to increases in seizure frequency.
As will be seen in the following sections, other seizure frequency-based outcomes suggest that drug reduction strategy may lead to large increases in seizure frequency in some patients.
Mean or median percentage change in seizure frequency. None of the three included controlled trials presented data on the percentage change in seizure frequency following implementation of the drug reduction strategy. Thus, conclusions about this outcome can only be based on case series data.
Two of the four case-series studies presented individual patient data that allowed us to assess this outcome. These data show that the median percentage change in seizure frequency from baseline was -0.09 percentt (Range: -100 percent to 412 percent) in the study by Specht, Boenigk, Wolf, et al.123 and -0.12 percent (Range: -100 percent to 2,678 percent) in the study by Schmidt.125 In both studies, more than 50 percent of the patients experienced a reduction in seizure frequency following implementation of the drug reduction strategy, and just under 50 percent of the patients experienced an increase in seizure frequency from baseline (43 percent of patients in the study by Specht, Boenigk, Wolf, et al.123 and 47 percent of patients in the study of Schmidt125). The proportion of patients who experience an increase in seizure frequency from baseline is addressed in more detail in the following section of the report.
Thus, these data suggest that a high proportion of patients (close to 50 percent) may experience increases in seizure frequency following the implementation of the drug reduction strategy. The data also suggest that some patients may experience decreases in seizure frequency. Given that regression to the mean is known to influence seizure frequency data,23,132 some of these observed reductions in seizure frequency were probably a manifestation of this bias. The only other possible explanation is that the withdrawn drug was somehow causing seizures.
Proportion of patients with an increase in seizure frequency. None of the three included controlled trials presented data on the proportion of patients with an increase in seizure frequency following implementation of the drug reduction strategy. Thus, conclusions on this outcome can only be based on data from case series.
One of the four case series presented data on increases in seizure frequency. Callaghan, O'Dwyer, and Keating124 reported that three of the 35 patients (9 percent) included in their study demonstrated an increase in seizure frequency. This information, however, is of limited value because the authors did not define what they meant by “worse.” Consequently, the magnitude of the reported increase in seizure frequency in these three patients cannot be determined, and no conclusions can be drawn as to whether these increases were clinically important.
Percentage presented in parentheses is the actual proportion of patients with seizure frequencies greater than the percent increase in seizure frequency shown on the X-axis. The diamond and error bars represent the effect size and 95% CI.
This figure shows that a statistically significant proportion of patients in both case series exhibited large (>100 percent) increases in seizure frequency when compared to baseline (27.8 percent of patients in the study by Schmidt125 and 8.6 percent in the study by Specht, Boenigk, Wolf, et al.123). In neither study did these patients have unusually low seizure frequency rates at the onset of the study, suggesting that implementation of the drug reduction strategy will result in increased seizure frequency in a significant proportion of patients.
Both Thompson and Trimble126 and Duncan, Shorvon, and Trimble122 presented mood data collected using two validated self-administered psychometric instruments. These instruments were the Middlesex Hospital Questionnaire (MHQ) and the Mood Adjective Checklist (MACL).
The MHQ is a self-administered questionnaire that measures six domains and provides a composite score. This instrument is commonly used as an aid in the diagnosis of clinical depression. The six domains that are assessed include: Free-floating anxiety (F-FA), phobic anxiety (PHO), obsessive-compulsive (OBS), somatic anxiety (SOM), depressive traits (DEP), and hysteric (HYS) traits. Although Duncan, Shorvon, and Trimble122 presented data for all six domains, Thompson and Trimble126 only reported on two (F-FA and DEP). Neither Duncan, Shorvon, and Trimble122 nor Thompson and Trimble126 found a statistically significant between-groups difference in any of the domains measured using the MHQ following completion of drug reduction. Nor were any trends in the data detected that would indicate that mood either improved or deteriorated following drug reduction.
The MACL is a standardized scale commonly used to detect alterations in mood across five domains. These domains provide measures of anxiety, fatigue, hostility, vigor, and depression, along with a composite score. Although both studies measured mood alterations using this instrument, only Thompson and Trimble126 presented relevant data in their article. Again, as was the case with reporting of the data obtained using the MHQ, Thompson and Trimble126 did not report data for all of the measured domains (in this case data for the domain “hostility” was not reported) and no explanation was provided as to why this was the case. Analysis of data abstracted from Thompson and Trimble126 did not find statistically significant between-group differences in any of the domains measured using the MACL. Nor were any trends in the data detected that would indicate that mood either improved or deteriorated following drug reduction. This finding was corroborated by Duncan, Shorvon, and Trimble122 who reported that, “There were no statistically significant differences between the four groupsu on the anxiety, depression, fatigue, vigor, or hostility subscales of the Mood Adjective Checklist.”122
All three studies measured cognitive function using a series of standardized clinical tests. These tests included tests of concentration and attention, memory, and tests of psychomotor performance.
Tests of concentration/attention. All three controlled trials measured concentration/attention before and after the implementation of the drug reduction strategy. These data are summarized in Figure 29
Data from only one of the three studies, Duncan, Shorvon, and Trimble,122 suggested that concentration improvement was statistically significant among patients who had undergone drug-reduction when compared to patients in the control group. The only statistically significant posttreatment benefit was seen in patients who were removed from sodium valproate (t = 4.245; p = 0.000108v) followed by patients who were removed from phenytoin (t = 1.965; p = 0.056). Assessment of the pretreatment LCT data, however, suggested the presence of selection bias, with patients who were removed from sodium valproate having statistically significantly higher baseline LCT scores compared to those in the control group (t = 3.404; p = 0.00140). Thus, the posttreatment between-groups difference was essentially the same as the pretreatment difference.
No such bias was found to have affected the LCT scores on removal of phenytoin and these data suggest that removal of phenytoin may lead to an improvement in concentration/attention in some patients. However, interpretation of the importance of a mean improvement of 18 points is difficult because the authors did not indicate if such a between-groups difference was clinically important.
Pre- and posttreatment Frankfurt Concentration Test for Children data from May (1992)121
Abbreviations:
LGT: Lern- und Gedachtnis Test
DSF: Digit scan forwards
DSB: Digit scan backwards
PHT: Phenytoin vs. Control
CBZ: Carbamezapine vs. Control
VPA: Valprioc acid vs. Control
IR: Immediate recall
DR: Delayed recall
R: Recognition
Data from Duncan (1990)122 showing effects of valprioc acid removal on digital scanning score
FT: Finger tapping
DH: Dominant hand
NDH: Non-dominant hand
PB: Pegboard
PRF: Pursuit Rotor Failure
PFD: Pursuit Failure Duration
PHT: Phenytoin vs. Control
CBZ: Carbamezapine vs. Control
VPA: Valprioc acid vs. Control
Pre-and posttreatment psychomotor function data presented by May (1992)121
One of the three controlled trials reported relevant data on adverse events. Patients included in the drug reduction arm of the study by Duncan, Shorvon, and Trimble122 suffered no additional adverse events compared to patients included in the control arm. Thus, although the patients undergoing drug reduction did experience some adverse events, these adverse events cannot be attributed to drug reduction strategy used in this study.
No patients were reported to have died during any of the seven included studies. Thus, no evidence exists to suggest that implementation of the drug reduction strategy leads to increased mortality.
None of the trials that met the inclusion criteria directly compared the drug strategies. Indirectly comparing two of the drug strategies is possible if the patients enrolled in all of the trials of all strategies were similar. The drug reduction strategy cannot be compared with the other two strategies, because of the differing intentions of investigators in these latter trials. The intent of the trials of polytherapy and sequential monotherapy was to reduce seizures without causing adverse effects, whereas the intent of the trials of drug reduction therapy was to reduce the number of drugs without increasing seizures.
To determine whether trials of polytherapy and sequential monotherapy enrolled similar patients, we compared the number of drugs given to patients receiving these strategies. Differences in the number of drugs likely mean that the severity of epilepsy in patients who received polytherapy was different from that in patients who received sequential monotherapy. Among 11 trials of sequential monotherapy that reported this percentage, two (18 percent) reported that more than half of the patients were receiving two or more prior AEDs. By contrast, among the 18 trials of polytherapy that reported this percentage, 16 (89 percent) reported that more than half of the patients were receiving two or more prior AEDs. These percentages are significantly different (χ2(1) =14.5, p = 0.00014). Thus, patients who received polytherapy had more severe epilepsy compared to patients who received sequential monotherapy. This difference precludes comparison of the quantitative results of the two strategies.
A qualitative comparison, however, suggests that polytherapy is clinically preferable to sequential monotherapy. In the section on sequential monotherapy, the evidence indicated that some patients had harmful increases in seizures as a direct result of the treatment, and whether sequential monotherapy caused any patients to become seizure-free could not be determined. In short, sequential monotherapy appeared more likely to be harmful than beneficial. By contrast, the reverse was true for polytherapy. Adding a drug reduced seizures by 50 percent in many patients, whereas adverse effects causing trial exits were rare. By inference, this suggests that polytherapy is preferable to sequential monotherapy.
Further, as discussed above, patients who received polytherapy had been receiving more drugs before the trial, thus they likely had more severe epilepsy. Patients with more severe epilepsy are, by definition, more difficult to treat. Thus, even in a more difficult-to-treat population, polytherapy helped many patients. This finding underscores the qualitative conclusion that polytherapy is preferable to sequential monotherapy for patients with treatment-resistant epilepsy.
In this section of the Evidence Report, we addressed Key Question #5: Which methods of nondrug treatment for epilepsy after initial treatment failure lead to improved outcomes for patients with treatment-resistant epilepsy? This question is divided into two parts. The first part addresses surgical interventions and the second addresses nondrug, nonsurgical interventions.
In this section, we address the efficacy of surgical intervention when treatment with AEDs has failed to produce adequate seizure control. Patients who receive surgery have been determined to be treatment-resistant as part of their presurgical evaluation. For most patients, only a single surgical option will be available due to the nature and location of the lesion or condition responsible for generating their seizures. In patients undergoing temporal lobectomy, hemispherectomy, or corpus callosotomy, some variations in procedures are available.
A list of the specific surgical interventions and outcome measures addressed under surgical interventions is presented in the following section.
We included articles if they met the general inclusion criteria detailed in the Methodology section, and if they met the following question-specific criteria:
All seizure frequency outcomes were reported before and 2 or more years after surgery, except for studies of multiple subpial transection (MST). This followup period was recommended by the Expert Panel and Technical Experts, who noted that because surgery is irreversible, relatively long-term data are of primary interest. However, because MST is a relatively new surgical procedure with a limited reference base, a minimum of 6 months followup was used to increase the size of the evidence base for this procedure.
The study was published in 1985 or later. This cutoff was used because the Expert Panel and Technical Experts noted that surgical treatments for epilepsy have substantially changed since this date.
One of the following specific interventions, as recommended by the Expert Panel and Technical Experts, was examined:
Anterior temporal lobe resection
Frontal lobe resection
Parietal lobe resection
Occipital lobe resection
Cerebral hemispherectomy
Corpus callosotomy
MST separate from or in combination with other resections
The order of the material presented in this section differs from that presented in the discussion of other interventions. This change in organization was necessary because we required a minimum 2-year followup period for most outcome measures used to evaluate surgery. The only exceptions were outcomes for mood (depression and psychosis), cognitive function (IQ and memory), and complications and mortality related to surgery. We shortened the minimum required followup time for these outcomes because they may manifest themselves relatively early after surgery.
Different studies make up the evidence base for each outcome. We will separately discuss each outcome and its specific evidence base under each surgical intervention rather than examining all of the studies in the evidence base for a single intervention. This section, number of studies addressing surgical intervention, presents an overview of all of the studies meeting our inclusion criteria for each surgical intervention examined under Key Question #5. The actual evidence base for each intervention and outcome will be discussed separately later in this report.
| Intervention | Total Number of Studiesa |
|---|---|
| Temporal Lobe Surgery | 105 |
| Corpus Callosotomy | 26 |
| Frontal Lobe Surgery | 18 |
| Hemispherectomy | 11 |
| Multiple Subpial Transection | 10 |
| Parietal Lobe Surgery | 2 |
| Occipital Lobe Surgery | 2 |
| Surgical Controls | 12 |
Seven studies reported on more than one surgery category and are therefore double counted in this table
Temporal lobe surgery is intended to eliminate complex partial seizures by removing the lesion or epileptogenic area responsible for the development of these seizures. Complex partial seizures with or without secondary generalization are the most common seizure type associated with temporal lobe epilepsy.139 The second most common seizure type is a simple partial seizure, which is commonly experienced as the patient's typical aura.
Temporal lobe surgery candidates constitute the largest group of epilepsy surgery patients.140 Preoperative evaluation determines the type of lesion (tumor, vascular malformation, mesial temporal sclerosis, or other known or unknown etiology). The actual procedure depends on the location of the lesion (deep or superficial) and the extent to which tissue is to be removed.141–143 An en bloc anterior temporal lobectomy is a standardized operative procedure in which 4.5 to 5.0 cm of the anterior lateral temporal lobe neocortex is removed along with the amygdala, the anterior aspect of the parahippocampal gyrus, and the hippocampus in the medial portion of the temporal lobe. Neocortical lesionectomy is used when the lesion, usually a tumor or vascular malformation, is contained entirely in the neocortex of the temporal lobe. Selective amygdalohippocampectomy (AH) involves the removal of the amygdala and hippocampus only. Intraoperative EEG readings may be used to “tailor” the extent of tissue resection by defining a zone of frequent interictal spiking. The use of this technique may result in more or less tissue being removed compared to the “standard” approach. Another modification to the standard approach is to remove less than 4.5 cm of the anterior temporal lobe and is referred to as “partial” resection. The Evidence Tables pertaining to temporal lobe surgery will refer to these procedures as standard, tailored, partial, amygdalohippocampectomy, and neocortex.
Several outcome measurements examined in other questions of this report, such as changes in the proportion of patients experiencing at least a 50 percent reduction in seizure frequency, are not included in our examination of surgical intervention because they are rarely (if ever) reported in studies of epilepsy surgery.
Among the 105 studies of temporal lobe surgery meeting our inclusion criteria, 73 reported some sort of seizure-free outcome measurement. Studies of temporal lobe surgery used four different outcome measurements when reporting a patient as “seizure-free.” Each outcome measurement results in a different set of patients being considered “seizure-free” and therefore the data collected under each outcome measurement must be evaluated separately.
| Reference | Undefined | No Auras | With Auras | Engel Class I |
|---|---|---|---|---|
| Bouilleret (2002)165 | ![]() | ![]() | ||
| Alsaadi (2001)167 | ![]() | |||
| Boling (2001)169 | ![]() | |||
| Hennessy (2001)171 | ![]() | |||
| Hennessy (2001)148 | ![]() | |||
| Jan (2001)174 | ![]() | |||
| Kanemoto (2001)176 | ![]() | |||
| Schramm (2001)178 | ![]() | |||
| Sotero de Menezes (2001)180 | ![]() | ![]() | ||
| Verma (2001)182 | ![]() | |||
| Wilson (2001)184 | ![]() | ![]() | ||
| Dupont (2000)186 | ![]() | ![]() | ||
| Eberhardt (2000)188 | ![]() | ![]() | ![]() | |
| Foldvary (2000)190 | ![]() | |||
| Holmes (2000)192 | ![]() | |||
| Iannelli (2000)194 | ![]() | |||
| Markand (2000)151 | ![]() | ![]() | ![]() | |
| Rao (2000)197 | ![]() | |||
| Robinson (2000)199 | ![]() | |||
| Assaf (1999)201 | ![]() | |||
| Eriksson (1999)203 | ![]() | |||
| Henry (1999)204 | ![]() | |||
| Holmes (1999)206 | ![]() | |||
| Mathern (1999)208 | ![]() | |||
| Mitchell (1999)210 | ![]() | |||
| Rossi (1999)212 | ![]() | |||
| Salanova (1999)214 | ![]() | |||
| Son (1999)166 | ![]() | |||
| Maher (1998)168 | ![]() | ![]() | ||
| Radhakrishnan (1998)170 | ![]() | |||
| Szabo (1998)172 | ![]() | ![]() | ||
| Bizzi (1997)173 | ![]() | |||
| Cappabianca (1997)175 | ![]() | |||
| Casazza (1997)177 | ![]() | ![]() | ||
| Ho (1997)179 | ![]() | |||
| Keene (1997)181 | ![]() | |||
| Kilpatrick (1997)183 | ![]() | ![]() | ||
| McLachlan (1997)185 | ![]() | |||
| Schwartz (1997)187 | ![]() | ![]() | ||
| Silander (1997)189 | ![]() | |||
| Sisodiya (1997)191 | ![]() | ![]() | ![]() | |
| Adam (1996)193 | ![]() | ![]() | ![]() | |
| Goldstein (1996)195 | ![]() | |||
| Holmes (1996)196 | ![]() | |||
| Sirven (1996)198 | ![]() | |||
| Acciarri (1995)200 | ![]() | |||
| Berkovic (1995)202 | ![]() | |||
| Davies (1995)163 | ![]() | |||
| Jooma (1995)205 | ![]() | |||
| Jooma (1995)207 | ![]() | |||
| Liu (1995)209 | ![]() | |||
| Renowden (1995)211 | ![]() | |||
| Thadani (1995)213 | ![]() | |||
| Vossler (1995)215 | ![]() | |||
| Blume (1994)146 | ![]() | ![]() | ||
| Guldvog (1994)160 | ![]() | |||
| Guldvog (1994)161 | ![]() | |||
| Berkovic (1991)216 | ![]() | |||
| Hopkins (1991)217 | ![]() | |||
| Rasmussen (1991)218 | ![]() | |||
| Wieser (1991)219 | ![]() | |||
| Bidzinski (1990)162 | ![]() | |||
| Mizrahi (1990)152 | ![]() | |||
| Walczak (1990)164 | ![]() | |||
| Yeh (1990)220 | ![]() | |||
| Sperling (1989)221 | ![]() | |||
| Estes (1988)222 | ![]() | |||
| Bladin (1987)223 | ![]() | |||
| Cutfield (1987)147 | ![]() | |||
| Drake (1987)224 | ![]() | |||
| Lieb (1986)225 | ![]() | |||
| Meyer (1986)226 | ![]() | ![]() | ||
| Delgado-Escueta (1985)153 | ![]() | ![]() | ![]() | |
Studies using Engel class I, which has the least restrictive means of determining if a patient is seizure-free, may be expected to report the largest percentage of seizure-free patients. Seizure-free with auras is similar to Engel class I, but somewhat more restrictive. Studies using this outcome measurement may be expected to report slightly fewer patients as seizure-free compared to Engel class I. Seizure-free with no auras is the most restrictive, and studies using this outcome measurement may be expected to report the smallest percentage of seizure-free patients. Because studies using seizure-free undefined may be a combination of studies using seizure-free with no auras and seizure-free with auras, these studies may be expected to report a percentage of seizure-free patients somewhere between studies reporting seizure-free no auras and studies reporting seizure-free with auras.
| Seizure-Free Outcome Measurements | ||||
|---|---|---|---|---|
| Reference | Undefined | No Auras | With Auras | Engel Class I |
| Bauer (2001)227 | ![]() | |||
| Kumlien (2001)228 | ![]() | |||
| Wiebe (2001)145 | ![]() | ![]() | ||
| Markand (2000)151 | ![]() | |||
| Holmes (1998)229 | ![]() | |||
| Wolf (1998)230 | ![]() | ![]() | ![]() | |
| McLachlan (1997)185 | ![]() | |||
| Hermanns (1996)231 | ![]() | |||
| Vickrey (1995)232 | ![]() | ![]() | ||
| Guldvog (1991)233 | Reported changes in seizure frequency only | |||
| Huttenlocher (1990)234 | ![]() | |||
| Harbord (1987)235 | ![]() | |||
Once a patient has been identified as a suitable candidate for surgery, withholding surgery may be considered unethical. Consequently, the literature on surgical interventions consists mainly of uncontrolled trials in which all patients receive a single treatment and patients are not randomized to a nonsurgery group or a group receiving an alternative treatment approach. These studies generally do not provide a control group against which to evaluate the efficacy of surgery. The remainder of this section presents an assessment of the quality of the evidence base used to draw conclusions about the effectiveness of temporal lobe surgery in patients with treatment-resistant epilepsy. Our assessment consists of an appraisal of each study's internal and external validity.
| Study designs for studies of temporal lobe surgery reporting seizure-free outcomes | |||
|---|---|---|---|
| Prospective Nested Case-controlled Studiesa | Retrospective Nested Case-controlled Studies | Prospective Case Series Studies | Retrospective Case Series Studies |
| 5 | 30 | 5 | 33 |
Nested case-controlled studies are defined as any study reporting patient characteristics (age at treatment, age at seizure onset, duration of epilepsy prior to treatment, etc.) separately for patients with good outcomes (seizure-free, Engel Class I, etc.) and patients with poor outcomes. Nested case-controlled studies are also considered case series studies because all patients received the same treatment.
All of the studies discussed in this section on seizure frequency outcomes potentially have the following biases: extraneous event bias, investigator reporting bias, and patient reporting bias. Attrition bias and maturation bias are of specific importance to studies of surgery.
Attrition bias refers to the loss of patients, for any reason, before the minimum 2-year followup period. All studies with retrospective patient enrollment have this bias because they only record outcomes for patients with the minimum 2-year followup period. Only 10 of the 73 studies of seizure frequency outcomes had prospective as opposed to retrospective patient enrollment. The effect of attrition bias in the surgical studies considered in this report was limited by the requirement that studies report consecutive patients.
Maturation bias refers to individuals who received surgery but would have eventually “outgrown” the disease without surgical intervention. This seems implausible since surgery candidates often wait for more than a year before undergoing surgery and individuals may wait on average for 20 years from the onset of seizures before considering a surgical option.145 A randomized controlled trial of temporal lobe surgery reported that 8 percent of control patients became free of complex partial seizures during a 1-year waiting period prior to surgery.145 This finding suggests that maturation may occur, but that it affects only a small proportion of surgical patients.
As previously discussed, candidates for epilepsy surgery must complete an extensive presurgical evaluation to determine their suitability for surgery. Patients with temporal lobe epilepsy usually have a specific focal lesion and experience complex partial seizures with or without secondary generalized seizures.139 Therefore, the patients in published studies of surgery for temporal lobe epilepsy should be representative of all patients considering this surgery. However, many publications of epilepsy surgery select a specific patient population based on age or pathology, or use only one variation of a surgical technique. The results of these studies may or may not be generalizable to all temporal lobe surgery patients.
Among the 21 studies reporting seizure-free with no auras, 20 reported a mean age at surgery. The mean age at surgery in these studies varied from 9.4 years to 35 years, with only two studies having a mean less than 20 years of age. The range for age at surgery varied from 3 years to 62 years of age. Two studies examined only patients who were less than 20 years of age. Age at seizure onset was reported in 10 studies. The mean age at onset in these studies varied from 4 to 21 years of age with a range of less than a year to 44 years of age. Duration of epilepsy prior to surgery was reported in 11 studies. The mean duration varied from 6 to 19 years and the range varied from 1 to 45 years.
The patient characteristics for studies reporting seizure-free with auras were similar to the studies reporting seizure-free with no auras. Among the 26 studies reporting seizure-free with auras, 23 reported a mean age at surgery. The mean age at surgery varied from 8.3 years to 37 years with five studies having a mean less than 20 years of age. The range for age at surgery varied from 1 year to 86 years of age. Four studies examined only patients who were less than 20 years of age. Age at seizure onset was reported in 15 studies. The mean age at onset varied from 1 to 25 years of age with a range of less than a year to 62 years of age. Duration of epilepsy prior to surgery was reported in 11 studies. The mean duration varied from 5 to 26 years and the range varied from less than a year to 81 years.
Based on the distribution of patient characteristics, this evidence base seems to be generalizable to temporal lobe surgery patients in clinical practice.
We will separately discuss each of the four “seizure-free” outcome measurements because, as mentioned above, each outcome measurement refers to a different group of “seizure-free” patients. We begin our analysis with studies reporting patients as seizure-free with no auras. This is the most restrictive group, but the ultimate goal of surgery is to be completely seizure-free. Next, we analyze studies reporting patients as seizure-free with auras. This patient population is free of complex partial seizures. Our analysis of the studies reporting Engel class I follows our analysis of the more restrictive “seizure-free” outcome measurements. Studies that did not report if auras were considered in their calculation of the number of patients who were seizure-free after surgery are analyzed last.
A scale is not shown because the effect sizes were not calculated with actual control groups
The results of our threshold analysis of studies reporting seizure-free with no auras appear in Figure 36
This analysis suggests that, after temporal lobe surgery, approximately 55 percent of patients will be completely seizure-free. However, this calculation was based on no patients in similar studies becoming seizure-free without surgery, so it does not estimate the percentage of patients who become seizure-free because of surgery. Some patients may become seizure-free without surgery. Readers are asked to consider the plausibility of 50 percent of temporal lobe epilepsy patients becoming completely seizure-free without benefit of surgery. They should also consider the above-noted difficulties with the internal validity of these studies, difficulties that could cause the threshold to decrease.
A scale is not shown because the effect sizes were not calculated with actual control groups
MTS = Patients with mesial temporal sclerosis
The results of our threshold analysis of studies reporting seizure-free with auras appear in Figure 38
This analysis suggests that, after temporal lobe surgery, approximately 68 percent of patients will be free of complex partial seizures (some patients may still have auras). However, this calculation was based on no patients becoming seizure-free without surgery, so it does not estimate net health benefit of surgery. Some patients may become seizure-free without surgery. The threshold analysis suggests that approximately 65 percent of patients in similarly designed studies would have to become seizure-free without surgery before surgery could be considered ineffective. Readers are asked to consider the plausibility of temporal lobe epilepsy patients achieving this threshold level without benefit of surgery. They should also consider the above-noted difficulties with the internal validity of these studies, difficulties that could cause the threshold to decrease.
An RCT conducted by Wiebe, Blume, Girvin, et al.145 at the London Health Sciences Center at the University of Western Ontario examined seizure-free outcomes in patients who were randomized to temporal lobe surgery or required to wait 1 year before receiving surgery. A 1-year wait before undergoing preoperative investigations is routine practice at this institution. Therefore, randomizing patients to a wait list of 1 year was considered ethical. Patients were older than 16 years of age and continued to have at least monthly seizures despite the use of one or more AEDs. Patients randomized to surgery underwent a standard anterior temporal lobectomy. All patients were evaluated every 3 months for 1 year, and two epileptologists who were blinded to the identity of the patients and their treatment groups judged the adequacy of treatment through written clinical information.
Both seizure-free with no auras and seizure-free with auras were used to define the seizure-free status of the patients in this study. This study was not included in our analyzes of seizure-free data because the followup period was only 1 year. In a group of 40 control patients, one patient became seizure-free with no auras and two additional patients became seizure-free with auras for a total of three seizure-free patients (7.5 percent). Based on this study's findings, the synthetic control group levels of 50 percent and 65 percent needed to overturn the results of our threshold analysis seem unlikely to be achieved in a clinical setting.
A scale is not shown because the effect sizes were not calculated with actual control groups
Our threshold analysis of studies reporting Engel class I found statistically significant heterogeneity among the effect sizes indicating a large amount of variation among study results (Q = 77.7, p = 0.00002). Therefore, the summary estimates in any threshold analysis of these data were not calculated. Rather, we sought to “explain” the source(s) of heterogeneity using meta-regression.
Despite the heterogeneity, all of the effect sizes (based on a Cohen's h with no control patients achieving Engel class I) in these studies were statistically significant. Therefore, these studies indicate that temporal lobe surgery is effective in producing seizure-free patients. The heterogeneity prevents an accurate estimation of the overall percentage of patients likely to achieve Engel class I status after surgery.
None of the three variables in our meta-regression explained the heterogeneity when used in one-, two-, or three-predictor models. Figure 40
A scale is not shown because the effect sizes were not calculated with actual control groups
Our threshold analysis of studies reporting seizure-free undefined found statistically significant heterogeneity among the effect sizes (Q = 43.4, p = 0.00002). Therefore, the summary estimates in any threshold analysis of these data were not calculated. Rather, we sought to “explain” the source(s) of heterogeneity using meta-regression.
Despite the heterogeneity, all of the effect sizes calculated from studies reporting seizure-free undefined were statistically significant. Therefore, these studies indicate that temporal lobe surgery is effective in producing seizure-free patients. The heterogeneity prevents an accurate estimation of the overall percentage of patients likely to achieve seizure-free status after surgery.
None of the three variables in our meta-regression explained the heterogeneity when used in one-, two-, or three-predictor models. Figure 42
Dodrill, Van Belle, and Wilkus149 have pointed out that small sample sizes have lead to inconsistency in the conclusions reached about the significance of most variables believed to influence surgical outcomes. Therefore, many individual nested case-control studies may not be able to detect clinically meaningful effects.
| Studies of temporal lobe surgery reporting individual patient data for patients with successful and nonsuccessful surgery | |||||||
|---|---|---|---|---|---|---|---|
| Reference | Age at Treatment | Age at Seizure Onset | Duration of Epilepsy Prior to Treatment | Gender | Side of Surgery | Simple Partial Seizures | Secondarily Generalized Seizures |
| Bouilleret (2002)165 | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() | |
| Hennessy (2001)148 | ![]() | ![]() | ![]() | ![]() | |||
| Hennessy (2001)171 | ![]() | ||||||
| Sotero de Menezes (2001)180 | ![]() | ![]() | |||||
| Verma (2001)182 | ![]() | ![]() | ![]() | ![]() | ![]() | ||
| Eberhardt (2000)188 | ![]() | ![]() | ![]() | ||||
| Holmes (1999)206 | ![]() | ![]() | ![]() | ||||
| Szabo (1998)172 | ![]() | ![]() | ![]() | ![]() | ![]() | ||
| Kilpatrick (1997)183 | ![]() | ![]() | ![]() | ||||
| Schwartz (1997)187 | ![]() | ![]() | ![]() | ||||
| Sisodiya (1997)191 | ![]() | ![]() | ![]() | ||||
| Adam (1996)193 | ![]() | ![]() | |||||
| Goldstein (1996)195 | ![]() | ||||||
| Jooma (1995)205 | ![]() | ![]() | ![]() | ||||
| Liu (1995)209 | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() | |
| Vossler (1995)215 | ![]() | ![]() | ![]() | ![]() | ![]() | ||
| Blume (1994)146 | ![]() | ||||||
| Berkovic (1991)216 | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() |
| Hopkins (1991)217 | ![]() | ![]() | ![]() | ||||
| Mizrahi (1990)152 | ![]() | ![]() | ![]() | ![]() | |||
| Yeh (1990)220 | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() |
| Estes (1988)222 | ![]() | ||||||
| Drake (1987)224 | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() | |
| Delgado-Escueta (1985)153 | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() |
We calculated a point-biserial correlation (rpb) from the individual patient data in each study reporting the age at surgery, age at seizure onset, and duration of epilepsy prior to surgery, and then combined these in a separate meta-analysis for each variable. The coefficient was calculated so that a positive correlation indicated that an older age or longer duration favored a successful outcome and a negative result indicated that a younger age or shorter duration favored a successful outcome. For the other patient characteristics, we calculated Cohen's h so that a positive effect size indicated that males, the left side, patients with simple partial seizures, or patients with secondarily generalized seizures had more successful surgery compared to females, the right side, patients without simple partial seizures, or patients without secondarily generalized seizures.
Our summary estimates are not adjusted for the influence of the other potentially important covariates in a study. An analysis using hierarchical modeling would be useful to search for factors that influence surgical outcomes by combining the patient-level data across studies, but such an analysis is beyond the scope of this report.
Age at surgery. In our first meta-analysis of “predictors” of surgical success, we sought to determine whether different outcomes were obtained in patients of different ages at the time they receive surgery. Individual ages at surgery for patients with successful and nonsuccessful surgery were reported in 18 studies with 297 patients. Evidence Table 125 presents the definition used for successful surgery and the point-biserial correlation calculated in each of the 18 studies. Figure 43
Age at seizure onset. In our second meta-analysis of “predictors” of surgical success, we sought to determine whether different outcomes were obtained in patients of different ages at seizure onset. Individual ages at seizure onset for patients with successful and nonsuccessful surgery were reported in 13 studies with 207 patients. Evidence Table 127 presents the definition used for successful surgery and the point-biserial correlation in each of the 13 studies. Figure 44
Duration of epilepsy prior to surgery. In this meta-analysis, we sought to determine whether different outcomes were obtained in patients with different durations of epilepsy prior to the time they receive surgery. Individual durations of epilepsy prior to surgery for patients with successful and nonsuccessful surgery were reported in 12 studies with 192 patients. Evidence Table 129 presents the definition used for successful surgery and the point-biserial correlation in each of the 12 studies. Figure 45
Studies reported the success of surgery among male and female patients
To “explain” this heterogeneity, we performed 36 meta-regressions (see the Methodology section for a description of our approach to meta-regression). Of these, no one-predictor model explained the heterogeneity, and five two-predictor models did. No clear “best” model was obvious among these five models. Consequently, no obvious explanation for the variation among these studies is apparent, and why surgery is more or less successful in males compared to females in these studies is unclear. All of the study and patient characteristics used in our meta-regression are presented in Evidence Table 132. The meta-regressions are presented in Evidence Table 133 and Figure 47
Studies reported the success of surgery among patients with left side and right side surgery
Studies reported the success of surgery in patients with and without simple partial seizures (SPS)
Studies reported the success of surgery among patients with and without secondarily generalized seizures (SGS)
MTS = Patients with mesial temporal sclerosis
Two studies reported that patients without secondarily generalized seizures have better outcomes, one study reported that patients with secondarily generalized seizures have better outcomes, and four studies reported no differences in outcomes between patients with or without secondarily generalized seizures.
To “explain” this heterogeneity, we performed 51 meta-regressions. Of these, no models explained the heterogeneity. Consequently, no obvious reason is apparent for why some studies had different results compared to other studies, and whether surgery is more or less successful in patients with secondarily generalized seizures is unclear. All of the study and patient characteristics used in our meta-regression are presented in Evidence Table 139. The results of the meta-regression are presented in Evidence Table 140 and Figure 51
The Epilepsy Surgery Inventory global score was reported in one study with 47 patients150 and the Quality of Life in Epilepsy global score was reported in one study with 90 patients.151
All of the previously discussed biases about the internal validity of studies that reported seizure-free outcome measures potentially apply to the studies that reported quality of life outcome measurements.
| Reference | Number of Patients | Country | Years Study Conducted | Mean Age at Surgery | Youngest Patient | Oldest Patient |
|---|---|---|---|---|---|---|
| Boling (2001)169 | 18 | Canada | 1981-1999 | 54 | 50 | 64 |
| Reeves (1997)154 | 134 | United States | 1988-1991 | 31 | ||
| Sperling (1995)236 | 73 | United States | 1986-1990 | 33.2 | ||
| Mizrahi (1990)152 | 22 | United States | 1980-1986 | 21 | 7 | 36 |
| Delgado-Escueta (1985)153 | 15 | United States | 1972-1983 | 26.5 | 12 | 39 |
All of the previously discussed biases about the internal validity of studies that reported seizure-free outcome measures potentially apply to the studies that reported employment outcome measurements. In particular, the lack of a precise definition of who is employed may lead to inconsistencies in the reporting of this outcome.
Return to (or ability to remain in) school was reported in two studies with 37 patients, however only one study had more than 10 patients of school age.152, 153
All of the previously discussed cautions about the internal validity of studies that reported seizure-free outcome measures apply to the two studies that reported education outcome measurements.
Only Reeves, So, Evans et al.,154 who studied 134 patients, reported on the ability of patients to obtain a drivers license after surgery.
All of the previously discussed cautions about the internal validity of studies that reported seizure-free outcome measures apply to this study.
Epilepsy has been associated with an increased incidence and prevalence of behavioral disorders and in particular with anxiety and depression.4, 155 New cases of depression have been associated with temporal lobe surgery156 and the National Institutes of Health Consensus Development Conference Statement: Surgery for Epilepsy has recommended that symptoms of anxiety and depression be assessed following surgery.140 The following section evaluates studies that reported new cases of depression after temporal lobe surgery.
| Reference | Number of Patients | Country | Years Study Conducted | Number of Cases |
|---|---|---|---|---|
| Kanemoto (2001)176 | 52 | Japan | 1987-1999 | 2 |
| Kohler (2001)237 | 58 | United States | 1986-1999 | 6 |
| Nees (2001)238 | 50 | England | 1992-1994 | 14 |
| Wiebe (2001)145 | 36 | Canada | 1996-2000 | 7 |
| Anhoury (2000)239 | 109 | England | 1988-1997 | 26 |
| Derry (2000)240 | 39 | Canada | 1996-1998 | 4 |
| Altshuler (1999)156 | 49 | United States | 1974-1990 | 5 |
| Ring (1998)241 | 60 | England | 1995-1996 | 7 |
| Naylor (1994)242 | 37 | Denmark | 1987-1991 | 2 |
| Bladin (1992)243 | 107 | Australia | 1975-1991 | 5 |
Although these 10 studies reported new cases of depression after surgery, they did not report the actual number of patients who were either clinically depressed or free of depression prior to surgery. Patients were not excluded from surgery for clinical depression in these studies. Therefore, our analysis uses the total number of patients receiving surgery rather than the actual number of patients free of depression prior to surgery.
All but one of the 10 studies in the evidence base for new cases of depression are uncontrolled studies of case series design. Therefore, these uncontrolled studies have the same concerns with regard to internal validity as previously discussed with regard to seizure-free outcomes. Attrition bias may not be a major concern in these studies because all patients were examined during the relatively short followup periods (no more than 1 year).
Depression occurs in patients with epilepsy, both before and after surgery. Therefore, the lack of control patients in most of these studies prevents any determination of whether the effect of surgery is to increase or decrease the incidence of depression. The analysis of the studies can only provide an estimate of the number of patients likely to experience depression after surgery.
Based on the distribution of patient characteristics, this evidence base seems to be generalizable to temporal lobe surgery patients in clinical practice.
A scale is not shown because the effect sizes were not calculated with actual control groups
All of the studies reported a statistically significant occurrence of new cases with a range of 4 percent to 24 percent. Our threshold analysis of studies reporting new cases of depression found statistically significant heterogeneity among the study results (Q = 18.0, p = 0.035). Therefore, we did not compute the summary estimates in any threshold analysis of these data. Rather, we sought to “explain” the source(s) of heterogeneity using meta-regression.
Meta-regression. To “explain” this heterogeneity, we performed 13 meta-regressions. Of these, no models explained the heterogeneity. Consequently, no obvious reason is apparent to explain why some studies reported more new cases of depression than other studies, and whether surgery is more or less responsible for new cases of depression is unclear. All of the study and patient characteristics used in our meta-regression are presented in Evidence Table 148. The results of the meta-regression are presented in Evidence Table 149 and Figure 53
Because all but one study lacked a control group, these studies do not provide evidence that surgery was directly responsible for the new cases of depression or that surgery reduced the incidence of depression. This is highlighted by the results of the one RCT among these studies. Wiebe, Blume, Girvin, et al.,145 using the Center for Epidemiological Studies-Depression Scale, reported that 8 out of 40 control patients (20 percent) developed depression during the year preceding their surgical.
Besides depression, treatment-resistant epilepsy has been associated with a variety of psychiatric disorders.155 Surgery for treatment-resistant epilepsy may also have psychiatric consequences. The following section evaluates studies that reported new cases of psychotic disorders (primarily schizophrenia and bipolar disorder) after temporal lobe surgery.
Although these six studies reported new cases of psychosis after surgery, they did not report the actual number of patients who had a psychotic disorder or were free of psychotic disorders prior to surgery. Two of the studies excluded patients who had chronic psychosis and the remaining four studies did not exclude patients with psychiatric disorders. Therefore, our analysis uses the total number of patients receiving surgery rather the actual number of patients free of psychosis prior to surgery.
All but one of the six studies in the evidence base for assessing new cases of psychosis are uncontrolled studies of case series design. Therefore, these studies have the same concerns with regard to internal validity as previously discussed with regard to seizure-free outcomes. In particular, variations in the use of any of the specific criteria, or variations in individual psychiatrists could lead to inconsistencies in the reporting of this outcome. Attrition bias is not a concern because all patients were examined after surgery.
Psychosis can occur in patients with epilepsy, both before and after surgery. Therefore, the lack of control patients in most of these studies prevents any determination of whether the effect of surgery is to increase or decrease the incidence of psychosis. Our analysis of these studies can only provide an estimate of the number of patients likely to experience psychosis after surgery.
Based on the distribution of patient characteristics, this evidence base seems to be generalizable to temporal lobe surgery patients in clinical practice.
Evidence Table 152 presents the actual patient counts, percentages, and calculated effect sizes for each study in this analysis. The individual study effect sizes (Cohen's h) presented in the Evidence Table were based on a control group in which no patients develop psychosis. Figure 54
The results of our threshold analysis of studies reporting new cases of psychosis appear in Figure 55
Because all but one study lacked a control group, these studies do not provide evidence that surgery was directly responsible for the new cases of psychosis or that surgery caused an increase or decrease in the incidence of psychosis. This can be seen in the one RCT among these studies. Wiebe, Blume, Girvin, et al.145 reported that 1 out of 40 control patients (2.5 percent) developed psychosis during the year preceding their surgical treatment compared to 1 out of 36 surgery patients (2.8 percent). This percentage of new cases of psychosis among control patients suggests that surgery may not be responsible for all new cases of psychosis after surgery. Nevertheless, our analysis provides an estimate of the number of new cases that may be expected after temporal lobe surgery, regardless of cause.
Treatment-resistant epilepsy may be associated with a slow progressive cognitive deterioration. A study of 209 patients with temporal lobe epilepsy reported that patients with a duration of greater than 30 years performed worse on full scale IQ tests compared to patients with less than 30 years duration.157 Due to the nature of the procedure, patients contemplating temporal lobe surgery may also be concerned with the potential for loss of intellectual functioning after surgery. The following section evaluates studies that reported both the number of patients to have a significant change in IQ (increase or decrease) and the pre- and postsurgery mean IQs. The authors of these studies defined a clinically significant increase or decrease in IQ as a change of at least one to two standard errors, and our analysis, therefore, incorporated this definition.
| Studies of temporal lobe surgery reporting both the number of patients with IQ changes after surgery and the pretreatment and posttreatment mean IQ. | ||||||||
|---|---|---|---|---|---|---|---|---|
| Reference | Number of Patients | Country | Years Study Conducted | Number of Decreases | Number of Increases | Mean Age at Surgery | Youngest Patient | Oldest Patient |
| Miranda (2001)245 | 50 | Canada | 1976-1998 | 7 | 7 | 13.3 | 6.4 | 18.3 |
| Robinson (2000)199 | 21 | United States | 1993-1998 | 1 | 4 | 15.4 | 9.4 | 21.7 |
| Westerveld (2000)246 | 82 | United States | 8 | 7 | 14.4 | 6 | 17 | |
| Chelune (1993)158 | 96 | United States | 1990-1991 | 8 | 8 | 29.4 | ||
| Ivnik (1988)247 | 141 | United States | 1972-1987 | 13 | 27 | 28 | ||
| Powell (1985)248 | 59 | England | 1973-1984 | 10 | 8 | 25.5 | 15 | |
All but one of the six studies in the evidence base for assessing changes in IQ are uncontrolled case series. Therefore, these studies have the same concerns with regard to internal validity as previously discussed with regard to seizure-free outcome reporting. Investigator bias and patient reporting bias may be reduced (but not eliminated) due to the use of a standardized intelligence test (Wechsler Intelligence Scale) and a predefined cutoff determining when a patient's IQ has undergone a significant change. Attrition bias is not a concern because all patients were examined after surgery.
Decreases in IQ scores can occur in patients with epilepsy, both before and after surgery. Therefore, the lack of control patients in most of these studies limits our ability to determine whether surgery decreased IQ scores. However, since increases in IQ scores are unlikely to occur spontaneously, any increase in IQ scores after surgery are likely to be a consequence of surgery. Our analysis provides an estimate of the number of patients likely to experience either an increase or decrease in IQ after surgery.
Based on the distribution of patient characteristics, this evidence base seems to be generalizable to temporal lobe surgery patients in clinical practice.
Studies reported individuals with significant decreases in IQ after surgery
The results of our threshold analysis of studies reporting patients with clinically significant decreases in IQ after surgery appear in Figure 57
Studies reported individuals with significant increases in IQ after surgery
The results of our threshold analysis of studies reporting patients with clinically significant increases in IQ after surgery appear in Figure 59
Because all but one study lacked a control group, these studies do not provide evidence that surgery was directly responsible for the decreases in individual patient IQ scores, although a case can be made that surgery is responsible for any increases in IQ. The one study with a control group, Chelune, Nagle, Lueders, et al.,158 reported two patients with clinically significant increases and two patients with clinically significant decreases in verbal IQ out of 40 control patients (5 percent each) compared to eight patients with clinically significant increases and eight patients with clinically significant decreases in verbal IQ out of 96 surgery patients (8.3 percent each). These percentages for increases and decreases among the control patients are lower than the percentages needed to overturn the conclusions of our threshold analysis suggesting that surgery may plausibly be responsible for changes in IQ.
Studies reported both presurgery and postsurgery mean IQ
Temporal lobe surgery usually requires the removal of the hippocampus, a part of the brain important to memory capacity. Therefore, memory function is at risk whenever this procedure is performed.159 The following section evaluates studies that reported both the number of patients with a significant change in memory function (increase or decrease) and the pre- and postsurgery mean memory scores.
| Studies of temporal lobe surgery reporting individual changes in patient memory after surgery | ||||||||
|---|---|---|---|---|---|---|---|---|
| Reference | Number of Patients | Country | Years Study Conducted | Number of Decreases | Number of Increases | Mean Age at Surgery | Youngest Patient | Oldest Patient |
| Canizares (2000)249 | 33 | Spain | 1998-1999 | 3 | 10 | 30.9 | ||
| Chelune (1993)158 | 96 | United States | 1990-1991 | 28 | 1 | 29.4 | ||
| Ivnik (1988)247 | 141 | United States | 1972-1987 | 48 | 48 | 28 | ||
| Ojemann (1985)250 | 13 | United States | 1983-1983 | 8 | 3 | 28.9 | 17 | 49 |
| Powell (1985)248 | 59 | England | 1973-1984 | 8 | 13 | 25.5 | 15 | |
Studies reported individuals with significant changes in memory after surgery
To further explore these data, we calculated individual study results (using Cohen's h) by assuming that a control group would have no patients experiencing an increase or decrease in memory score. Statistically significant effects indicate that the percentage of patients experiencing an increase or decrease in memory score was significantly different from zero. All five studies showed statistically significant percentages of patients with memory decreases, and four studies showed statistically significant percentages of patients with memory increases (Figure 62
Serious permanent complications and transient complications are an inherent part of surgery. Temporal lobe surgery can result in various forms of paralysis due to obstruction of blood vessels or other damage to brain tissue. The following section evaluates studies that reported cases of serious permanent complications. We considered moderate to severe permanent neurological deficits, especially hemiplegia, to be serious complications. We considered all other reported surgical complications to be mild or transient. Development of postsurgical depression or psychosis, and declines in IQ or memory are not considered in this section because we examined them separately (see above).
| Reference | N | Years Study Conducted | Country | Permenant Complications |
|---|---|---|---|---|
| Boling (2001)169 | 18 | 1981-1999 | Canada | 0 |
| Schramm (2001)178 | 61 | 1993-1999 | Germany | 0 |
| Sotero de Menezes(2001)180 | 15 | 1978-1993 | United States | 1 |
| Wiebe (2001)145 | 36 | 1996-2000 | Canada | 2 |
| Iannelli (2000)194 | 37 | 1981-1997 | Italy | 1 |
| Rao (2000)197 | 164 | 1995-1998 | India | 1 |
| Robinson (2000)199 | 21 | 1993-1998 | United States | 0 |
| Wurm (2000)251 | 16 | 1997-1998 | Austria | 1 |
| Altshuler (1999)156 | 49 | 1974-1990 | United States | 5 |
| Leung (1999)252 | 11 | 1994-1998 | Hong Kong | 1 |
| Parrent (1999)253 | 19 | 1994-1997 | Canada | 0 |
| Salanova (1999)214 | 145 | 1984-1995 | United States | 2 |
| Son (1999)166 | 71 | 1994-1999 | South Korea | 2 |
| Visudhiphan (1999)255 | 14 | 1993-1998 | Thailand | 0 |
| Radhakrishnan (1998)170 | 175 | 1988-1991 | United States | 2 |
| Wyllie (1998)257 | 72 | 1990-1996 | United States | 0 |
| Bizzi (1997)173 | 14 | 1990-1994 | United States | 1 |
| Blume (1997)259 | 14 | 1977-1994 | Canada | 0 |
| Kilpatrick (1997)183 | 36 | 1993-1995 | Australia | 0 |
| Adam (1996)193 | 30 | 1991-1994 | France | 2 |
| Acciarri (1995)200 | 10 | 1975-1992 | Italy | 0 |
| Davies (1995)163 | 12 | 1969-1988 | England | 1 |
| Jooma (1995)205 | 30 | 1985-1992 | United States | 2 |
| Liu (1995)209 | 22 | 1983-1990 | United States | 2 |
| Wyler (1995)142 | 70 | 1990-1992 | United States | 3 |
| Blume (1994)146 | 125 | 1974-1989 | Canada | 1 |
| Guldvog (1994)160 | 64 | 1952-1988 | Norway | 0 |
| Guldvog (1994)161 | 35 | 1949-1988 | Norway | 6 |
| Hopkins (1991)217 | 11 | 1978-1988 | Australia | 0 |
| Bidzinski (1990)162 | 320 | 1957-1988 | Poland | 2 |
| Mackenzie (1990)254 | 30 | 1983-1989 | Australia | 0 |
| Mizrahi (1990)152 | 22 | 1980-1986 | United States | 0 |
| Walczak (1990)164 | 100 | 1964-1985 | United States | 1 |
| So (1989)256 | 48 | 1973-1987 | Canada | 0 |
| Cutfield (1987)147 | 26 | 1961-1980 | New Zealand | 0 |
| Drake (1987)224 | 16 | 1974-1986 | Canada | 0 |
| King (1986)258 | 23 | 1981-1983 | United States | 0 |
| Meyer (1986)226 | 50 | 1970-1983 | United States | 0 |
| Carey (1985)260 | 24 | 1975-1984 | Ireland | 0 |
| Delgado-Escueta(1985)153 | 15 | 1972-1983 | United States | 1 |
The complications reported by these studies could only have occurred because of surgery, so the internal validity with regard to the cause and effect is not in question. However, some potential biases are still present. Investigator reporting bias may have affected the reporting of mild or transient complications because they may not be regarded as important by some investigators. Attrition bias is not a concern because all patients were examined after surgery. Maturation bias is also not a concern when reporting complications.
Based on the distribution of patient characteristics, this evidence base seems to be generalizable to temporal lobe surgery patients in clinical practice.
Seventy-nine mild or transient complications were reported among 1,339 patients, which correspond to 6 percent or 59 complications per 1,000 surgery patients. The number of mild or transient complications may be underestimated by these data because of differences in reporting these complications across studies. Clinician judgment as to the importance of reporting various mild or transient complications will likely vary across studies, whereas, the occurrence of permanent paralysis will usually warrant reporting.
Any surgical procedure may result in such serious complications that death results. The following section evaluates studies that reported deaths due to temporal lobe surgery.
| Reference | N | Years Study Conducted | Country | Deaths |
|---|---|---|---|---|
| Boling (2001)169 | 18 | 1981-1999 | Canada | 0 |
| Schramm (2001)178 | 61 | 1993-1999 | Germany | 0 |
| Wiebe (2001)145 | 36 | 1996-2000 | Canada | 0 |
| Iannelli (2000)194 | 37 | 1981-1997 | Italy | 0 |
| Rao (2000)197 | 164 | 1995-1998 | India | 1 |
| Robinson (2000)199 | 22 | 1993-1998 | United States | 0 |
| Wurm (2000)251 | 16 | 1997-1998 | Austria | 0 |
| Altshuler (1999)156 | 49 | 1974-1990 | United States | 0 |
| Leung (1999)252 | 11 | 1994-1998 | Hong Kong | 0 |
| Parrent (1999)253 | 19 | 1994-1997 | Canada | 0 |
| Salanova (1999)214 | 145 | 1984-1995 | United States | 0 |
| Son (1999)166 | 71 | 1994-1999 | South Korea | 0 |
| Visudhiphan (1999)255 | 14 | 1993-1998 | Thailand | 0 |
| Wyllie (1998)257 | 72 | 1990-1996 | United States | 1 |
| Bizzi (1997)173 | 14 | 1990-1994 | United States | 0 |
| Blume (1997)259 | 14 | 1977-1994 | Canada | 0 |
| Kilpatrick (1997)183 | 36 | 1993-1995 | Australia | 0 |
| Adam (1996)193 | 30 | 1991-1994 | France | 0 |
| Acciarri (1995)200 | 10 | 1975-1992 | Italy | 0 |
| Berkovic (1995)202 | 135 | 1986-1991 | Australia | 0 |
| Davies (1995)163 | 12 | 1969-1988 | England | 0 |
| Liu (1995)209 | 22 | 1983-1990 | United States | 0 |
| Wyler (1995)142 | 70 | 1990-1992 | United States | 0 |
| Blume (1994)146 | 125 | 1974-1989 | Canada | 0 |
| Guldvog (1994)160 | 64 | 1952-1988 | Norway | 0 |
| Guldvog (1994)161 | 35 | 1949-1988 | Norway | 0 |
| Bladin (1992)243 | 107 | 1975-1991 | Australia | 0 |
| Elwes (1991)261 | 108 | 1976-1987 | England | 1 |
| Hopkins (1991)217 | 11 | 1978-1988 | Australia | 0 |
| Bidzinski (1990)162 | 320 | 1957-1988 | Poland | 2 |
| Mizrahi (1990)152 | 22 | 1980-1986 | United States | 0 |
| Yeh (1990)220 | 12 | 1982-1986 | Japan | 0 |
| So (1989)256 | 48 | 1973-1987 | Canada | 0 |
| Cutfield (1987)147 | 26 | 1961-1980 | New Zealand | 0 |
| Drake (1987)224 | 16 | 1974-1986 | Canada | 0 |
| Meyer (1986)226 | 50 | 1970-1983 | United States | 0 |
| Carey (1985)260 | 24 | 1975-1984 | Ireland | 0 |
We abstracted only deaths specifically reported to be caused by surgery. Deaths as a result of invasive presurgical diagnostic procedures were not included.
The deaths reported here could only have occurred through surgery. Investigator reporting bias, attrition bias, and maturation bias are not a concern when reporting surgery-related mortality
Based on the distribution of patient characteristics, this evidence base seems to be generalizable to temporal lobe surgery patients in clinical practice.
| Studies of corpus callosotomy reporting individual patient data for patients with successful and nonsuccessful surgery | |||
|---|---|---|---|
| Reference | Age at Treatment | Age at Seizure Onset | Duration of Epilepsy Prior to Treatment |
| Sakas (1996)268 | ![]() | ![]() | |
| Claverie (1995)274 | ![]() | ||
| Nordgren (1991)279 | ![]() | ![]() | ![]() |
| Marino (1990)276 | ![]() | ![]() | ![]() |
| Murro (1988)277 | ![]() | ![]() | ![]() |
| Purves (1988)278 | ![]() | ![]() | ![]() |
| Spencer (1988)267 | ![]() | ![]() | ![]() |
Resection of the corpus callosum is intended as a palliative procedure that reduces the frequency of seizures that could lead to injury or seriously interfere with life-style.262–264 These patients typically have multifocal, unresectable, or unlocalized lesions.263 Candidates for this procedure include both children and adult patients with atonic, tonic, and tonic-clonic seizures.263 These patients typically have daily to weekly seizures of multiple types that occur despite therapeutic blood levels of AEDs for at least 2 years prior to surgery.265
Corpus callosotomy is not expected to eliminate all seizures. Under these circumstances, reduction in overall seizure frequency and in specific seizure frequencies are the most valuable outcome measurement for establishing if corpus callosotomy has been effective. Due to the complicated nature of the surgery, an assessment of surgical complications and deaths due to surgery is also necessary to judge the effectiveness of corpus callosum resection.
| Method of reporting effect of surgery on seizure frequency | ||||
|---|---|---|---|---|
| Reference | Types of Seizures Evaluated | Percent Reduction in Seizure Frequency | Actual Change in Mean Seizure Frequency | Seizure-free |
| Kwan (2001)270 | All seizure types | ![]() | ||
| Maehara (2001)271 | Disabling generalized seizures | ![]() | ||
| Matsuzaka (1999)272 | Most disabling seizures | ![]() | ![]() | |
| McInerney (1999)273 | Most disabling seizures | ![]() | ![]() | |
| Sakas (1996)268 | Drop attacks and generalized tonic-clonic seizures | ![]() | ![]() | |
| Claverie (1995)274 | All seizure types | ![]() | ||
| Reutens (1993)275 | All seizure types | ![]() | ||
| Marino (1990)276 | All seizure types | ![]() | ![]() | |
| Murro (1988)277 | All seizure types | ![]() | ![]() | ![]() |
| Purves (1988)278 | All seizure types | ![]() | ||
| Spencer (1988)267 | All seizure types | ![]() | ![]() | ![]() |
| Gates (1987)265 | All seizure types | ![]() | ![]() | ![]() |
As noted for temporal lobe surgery, withholding surgery may be unethical, so the evidence base for corpus callosotomy consists mainly of uncontrolled trials. Indeed, none of the studies in the evidence base for corpus callosotomy employed a control group. Rather, all studies were case series. Therefore, all of the 12 studies in the evidence base may have biases that reduce internal validity as previously discussed for temporal lobe surgery. However, these patients have daily to weekly seizures of multiple types that occur despite therapeutic blood levels of AEDs.265 Therefore, given the severe nature of the seizure activity in individuals considering this type of surgery, explanations for seizure reduction other than the effect of surgery may be considered implausible.
As stated earlier, patients being considered for resection of the corpus callosum experience characteristics seizures due to the multifocal nature of the lesions responsible for the seizures. Therefore, the patients in published studies of corpus callosotomy should be representative of all patients receiving this surgery. However, differences may exist across studies with regard to age or pathology.
Based on the distribution of patient characteristics, this evidence base seems to be generalizable to corpus callosotomy patients in clinical practice.
Studies reported patients with at least a 90 percent reduction in seizure frequency after surgery
A scale is not shown because the effect sizes were not calculated with actual control groups
The results of our threshold analysis appear in Figure 64
Studies reported patients who had no change or an increase in seizure frequency
Our meta-analysis produced a statistically significant summary estimate (0.83, CI: 0.62 to 1.03, p <0.000001) that corresponds to 16 percent (CI: 9 percent to 24 percent) of patients with no change or an increase in seizure frequency after surgery. There was no statistically significant heterogeneity in the threshold analysis (Q = 9.0, p = 0.17).
Age at surgery. Our first meta-analysis looks at whether different outcomes were obtained in patients of different ages at the time they receive surgery. Individual ages at surgery for patients with successful and nonsuccessful surgery were reported in six studies with 120 patients. Evidence Table 171 presents the definition used for successful surgery and the point-biserial correlation calculated for each of the six studies. Figure 66
The meta-analysis produced a summary estimate that was not statistically significant (rpb = 0.14, CI: -0.05 to 0.32, p = 0.16) suggesting that age at surgery had no influence on the success of surgery in these studies. The effect sizes in this meta-analysis were not heterogeneous (Q = 4.1, p = 0.54).
Age at seizure onset. In our second meta-analysis, we sought to determine whether different outcomes were obtained in patients of different ages at seizure onset. Individual ages at seizure onset for patients with successful and nonsuccessful surgery were reported in five studies with 105 patients. Evidence Table 173 presents the definition used for successful surgery and the point-biserial correlations calculated for each of the five studies. Figure 67
Duration of epilepsy prior to surgery. In our third meta-analysis, we sought to determine whether different outcomes were obtained in patients with different durations of epilepsy prior to surgery. Individual durations of epilepsy prior to surgery for patients with successful and nonsuccessful surgery were reported in five studies with 105 patients. These same five studies reported individual patient age at onset of seizures. Evidence Table 175 presents the definitions used for successful surgery and the point-biserial correlations calculated for each of the five studies. Figure 68
In the previous section, we analyzed data on overall seizure reduction and estimated that, after corpus callosotomy, only 20 percent of patients are likely to exhibit a 90 percent reduction in all seizure types. The benefits of corpus callosotomy may also be determined from surgery's effect on the most disabling seizures experienced by the patients, or from surgery's effect on specific types of seizures. The most disabling seizures are primarily generalized seizures that can result in falls and injuries. The specific types of seizures for which corpus resection may have a beneficial effect are generalized tonic/clonic seizures, atonic seizures, and tonic seizures.266