NCBI » Bookshelf » Health Services/Technology Assessment Text (HSTAT) » AHRQ Evidence Reports » Treatment of Degenerative Lumbar Spinal Stenosis Volume 1: Evidence Report: Evidence Report/Technology Assessment Number 32
 
hserta
AHRQ Evidence Reports
public health

Chapter  32:  Treatment of Degenerative Lumbar Spinal Stenosis Volume 1: Evidence Report: Evidence Report/Technology Assessment Number 32

A47141

Prepared for:
Agency for Healthcare Research and Quality
Department of Health and Human Services
U.S. Public Health Service
2101 East Jefferson Street
Rockville, MD 20852


www.ahrq.gov



Contract No. 290-97-0020

Prepared by:
ECRI, Plymouth Meeting, PA
Health Technology Assessment Group

AHRQ Publication No. 01-E048

June 2001

ISBN 1-58763-051-6
ISSN 1530-4396

This report may be used, in whole or in part, as the basis for development of clinical practice guidelines and other quality enhancement tools, or as a basis for reimbursement and coverage policies. Endorsement by the Agency for Healthcare Research and Quality (AHRQ) or the Department of Health and Human Services (DHHS) of such derivative products may not be stated or implied.

Prepared for:
Agency for Healthcare Research and Quality
Department of Health and Human Services
U.S. Public Health Service
2101 East Jefferson Street
Rockville, MD 20852


www.ahrq.gov



Contract No. 290-97-0020

Prepared by:
ECRI, Plymouth Meeting, PA
Health Technology Assessment Group

AHRQ Publication No. 01-E048

June 2001

ISBN 1-58763-051-6
ISSN 1530-4396

This report may be used, in whole or in part, as the basis for development of clinical practice guidelines and other quality enhancement tools, or as a basis for reimbursement and coverage policies. Endorsement by the Agency for Healthcare Research and Quality (AHRQ) or the Department of Health and Human Services (DHHS) of such derivative products may not be stated or implied.

Preface

The Agency for Healthcare Research and Quality (AHRQ), through its Evidence-Based Practice Centers (EPCs), sponsors the development of evidence reports and technology assessments to assist public- and private-sector organizations in their efforts to improve the quality of health care in the United States. The reports and assessments provide organizations with comprehensive, science-based information on common, costly medical conditions and new health care technologies. The EPCs systematically review the relevant scientific literature on topics assigned to them by AHRQ and conduct additional analyses when appropriate prior to developing their reports and assessments.

To bring the broadest range of experts into the development of evidence reports and health technology assessments, AHRQ encourages the EPCs to form partnerships and enter into collaborations with other medical and research organizations. The EPCs work with these partner organizations to ensure that the evidence reports and technology assessments they produce will become building blocks for health care quality improvement projects throughout the Nation. The reports undergo peer review prior to their release.

AHRQ expects that the EPC evidence reports and technology assessments will inform individual health plans, providers, and purchasers as well as the health care system as a whole by providing important information to help improve health care quality.

We welcome written comments on this evidence report. They may be sent to: Acting Director, Center for Practice and Technology Assessment, Agency for Healthcare Research and Quality, 6010 Executive Blvd., Suite 300, Rockville, MD 20852.

Acting DirectorJohn M. Eisenberg, M.D.
Center for Practice and Technology AssessmentDirector
Agency for Healthcare Research and QualityAgency for Healthcare Research and Quality

The authors of this report are responsible for its content. Statements in the report should not be construed as endorsement by the Agency for Healthcare Research and Quality or the U.S. Department of Health and Human Services of a particular drug, device, test, treatment, or other clinical service.

Structured Abstract

Objectives

This report assesses the clinical evidence describing the natural history, diagnosis, and treatment of degenerative lumbar spinal stenosis (LSS). LSS causes low back pain, radiculopathy, and neurogenic claudication and impedes normal physical activity. LSS is commonly diagnosed in the elderly. Medicare records indicate that the rate of LSS surgery in the United States is between 30 and 132 per 100,000.

Search strategy

A comprehensive data set was obtained by searching MEDLINE®, EMBASE®, and 23 additional electronic databases; the gray literature; and the World Wide Web and by hand-searching article bibliographies. Date ranges spanned from database inception to July 2000. Our searches yielded 4,788 items for evaluation.

Selection criteria

Broad inclusion criteria were used to ensure that no relevant information was overlooked. All controlled studies of natural history and conservative treatments, and all clinical studies of any design with relevant quantitative data on diagnostics and surgical treatment were included in our assessment.

Data analysis

We performed meta-analyses on combinable data using the standardized difference between means (Hedges' d) as the test statistic and checked for heterogeneity using the Q test and each study's standardized residual. The [union or logical sum] statistics, graphical representations, and binomial effect size displays were used to interpret the results.

Our quantitative analyses of data that did not permit combination consisted of performing de novo statistical analyses of published data. Effect sizes (Hedges' d) were calculated, and statistical power analyses were performed.

Main results

Our meta-analyses, based on the few available studies, suggest that patients with congenitally narrower spinal canals are more likely to exhibit LSS symptoms and that patients with symptomatic LSS may have smaller canals. Considerable overlap exists between the spinal diameters of patients with and without LSS diagnoses. Some circumstantial evidence indicates that the development of symptomatic LSS may be influenced by patient age, weight, and occupation and osteoarthritis of the hips.

Definitive conclusions cannot be made about the efficacy of diagnostic imaging methods or of conservative or surgical treatments for LSS. Evaluation of conservative treatment trials is complicated by the lack of patient inclusion criteria restricted to lumbar spinal stenosis. Evaluation of specific surgical techniques is difficult because relevant controlled studies often used demonstrably different patient groups. Trials that compare conservative to surgical treatment also fail to examine patients with similar clinical signs and symptoms. One randomized controlled trial provides evidence that patients with severe symptoms will benefit more from surgery than conservative therapy. Data from a single cohort study suggest that at one year after treatment, patients with moderate pain will benefit more from surgery than from conservative treatment.

Conclusions

Definitive evidence-based statements about the diagnosis and treatment of LSS await the results of well-designed clinical trials. Available data imply that patients with narrower spinal canals may be more likely to develop some symptoms of LSS. The relationship between degree of stenosis and severity of symptoms is unclear and cannot be quantified. Some evidence suggests that patients with moderate to severe symptoms will benefit more from surgery than from conservative treatment.

This document is in the public domain and may be used and reprinted without permission except for copyrighted materials noted for which further reproduction is prohibited without the specific permission of copyright holders. AHRQ appreciates citation as to source and the suggested format is as follows:

ECRI Health Technology Assessment Group. Treatment of Degenerative Lumbar Spinal Stenosis. Evidence Report/Technology Assessment No. 32 (Prepared by ECRI under Contract No. 290-97-0020). AHRQ Publication No. 01-E048. Rockville (MD): Agency for Healthcare Research and Quality; June 2001.

Summary

Overview

The purpose of this report is to assess, in an evidence-based fashion, the efficacy of methods for the diagnosis and treatment of degenerative lumbar spinal stenosis. Degenerative lumbar spinal stenosis is defined as a focal narrowing of the spinal canal, although there is some variation among investigators about the precise amount of narrowing that must occur before the canal is considered stenotic. The general term "spinal stenosis" can be applied to three root compression mechanisms alone or in combination: (1) disk protrusion or herniation, (2) osteotic overgrowth into the spinal canal or the foramina through which the roots pass laterally, and (3) vertebral slippage or spondylolisthesis. Although symptoms overlap for these three mechanisms, the second category, osteotic stenosis, is specifically termed spinal stenosis; this category is the focus of this evidence report, with spondylolisthesis also being addressed.

In extreme cases, lumbar stenosis can cause cauda equina syndrome, a syndrome characterized by neuromuscular dysfunction, and may result in permanent nerve damage. Because many studies excluded patients with cauda equina syndrome, we were not able to consider evidence related to it; and, therefore, consideration of cauda equina syndrome is beyond the scope of this evidence report. This report, therefore, focuses on less extreme manifestations of lumbar spinal stenosis and considers the evidence surrounding all aspects of this condition.

Incidence and prevalence data on lumbar spinal stenosis come from several studies. In a Swedish study that defined spinal stenosis as a canal of 11 mm or less, the annual incidence of spinal stenosis observed among patients referred to orthopedic departments was approximately 5 per 100,000 inhabitants. The National Low Back Pain Study examined records for 2,374 patients with chronic low back pain who sought help from orthopedic surgeons and neurosurgeons at eight academic medical centers across the United States. From this study's data, we calculated that among the patients seeking treatment for low back problems, 35 percent had osteo-related root compression and were possible candidates for bone-removing surgery. However, the severity of disease was not reported in this study. Thus, the proportion of these patients with disease severe enough to indicate surgery is not known.

Additional data from the National Ambulatory Medical Care Survey and the National Spine Network indicate that among patients with low back pain who see a specialist, 13 percent to14 percent may have spinal stenosis. The same references also show that among patients with low back pain who see a general physician, 3 percent to 4 percent may have spinal stenosis.

The longitudinal Framingham Heart Study provides rates of degenerative vertebral slippage. This study found that 1 percent of men and 1.5 percent of women already had vertebral slippage at the baseline measurement at the mean age of 54 years. Over the following 25 years, 11 percent (23/217) of men and 25 percent (100/400) of women developed degenerative vertebral slippage.

Patients with symptomatic spinal stenosis typically have chronic low back pain and pain and weakness in the legs that limit standing and walking to brief durations and short distances. This places limitations on their ability to carry out self-supporting daily activities as well as work, social, and recreational activities. This lack of activity may lead to obesity and general physical deterioration that may eventually result in the onset of cardiovascular and other serious health problems. These activity restrictions may also lead to depression and other psychological problems.

More severe stenosis can result in cauda equina syndrome. A common belief is that untreated spinal stenosis can result in severe symptoms and may become permanent and unresponsive to medical or surgical treatment. However, except for acute onset of symptoms seen among patients with herniated disks, none of the studies that met our inclusion criteria examined how often these consequences occur among patients with lumbar spinal stenosis.

Reporting the Evidence

The present evidence report focuses on nine key questions. These are:

  1. What is the relationship between each relevant patient characteristic and the presence and/or intensity of each of the patient signs, symptoms, and conditions of lumbar spinal stenosis?

  2. Which relevant patient characteristics are associated with an increased likelihood of focal narrowing of the spinal canal?

  3. What is the relationship between the degree of stenosis and the presence and/or intensity of each of the signs, symptoms, and patient conditions?

  4. What is the relationship between the signs and symptoms and other features of the patient history and physical and the results of the imaging examination? Implicit in this question is an examination of the criteria for diagnosis of spinal stenosis.

  5. What is the relationship between the signs and symptoms and other features of the patient history and physical and the results of conservative treatment; and what is the relationship between the type of conservative treatment and patient outcomes? Implicit in this question is whether any particular patient subgroup benefits from medical management of spinal stenosis.

  6. What is the relationship between the signs and symptoms and other features of the patient history and physical and the success or failure of surgical treatment? Implicit in this question is whether any particular patient subgroup benefits from surgical treatment of spinal stenosis and whether some patients might benefit more from surgery than from medical management.

  7. What is the relationship between the results of the imaging examination and the success or failure of surgical treatment? Implicit in this question is whether it is possible to predict that a certain patient subgroup will benefit from surgery.

  8. What is the relationship between the type of surgery received and the success or failure of surgical treatment?

  9. What costs are associated with nonsurgical and surgical treatment of spinal stenosis?

Among the diagnostic imaging methods considered in this report are myelography, computed tomography (CT), and magnetic resonance imaging (MRI). In evaluating these methods, the typical measures used to gauge test performance (sensitivity, specificity, and positive and negative predictive values) were considered. Also examined is whether these diagnostic methods can be used to predict which patients may respond to treatment. Evaluating the efficacy of diagnostic methods for spinal stenosis is difficult because stenosis is often defined by the imaging findings themselves. Imaging examinations for spinal stenosis are often performed, after medical management has proven unsuccessful, for the purpose of planning surgery. Because there is no evidence to suggest imaging is not necessary for surgical planning, we did not examine this question.

The present evidence report also considers both medical and surgical treatments of spinal stenosis, and our searches for information about both classes of treatments were comprehensive. In our analysis, we pay particular attention to patient-oriented outcomes (i.e., relief of symptoms). This is because outcomes such as surgical reduction or elimination of the stenosis are possible without accomplishing a concomitant reduction in the intensity of symptoms.

Methodology

In order to focus, refine, and arrive at the key questions addressed by this evidence report, a research team comprising five experts in technology assessment developed a preliminary evidence model. This model and written descriptions of the specific issues depicted in it were then discussed in telephone conversations with 11 experts in the field of lumbar spinal stenosis and 1 patient representative. From these conversations a final evidence model was developed. The essential questions of this evidence report were then developed from the evidence model.

To ensure that our identification of information relevant to these key questions was comprehensive, we searched the following databases for information:

The Cochrane Database of Systematic Reviews (through 2000 Issue 1)

The Cochrane Registry of Clinical Trials (through 2000 Issue 1)

The Cochrane Review Methodology Database (through 2000 Issue 1)

CRISP (through May 2000)

Cumulative Index to Nursing and Allied Health Literature (CINAHL)® (1988 through November 22, 1999)

Current Contents® -- Clinical Medicine (through May 2000)

The Database of Abstracts of Reviews of Effectiveness (Cochrane Library) (through 2000 Issue 1)

Dissertation Abstracts (through February 29, 2000)

ECRI Library Catalog (through May 2000)

EMBASE® (Excerpta Medica) (1974 through November 19, 1999)

Health Care Financing Administration (HCFA) Web site (through May 2000) including:

Medicare Provider Analysis and Review (MEDPAR)

1999 National Physician Fee Schedule Payment Amount File

Health Devices Alerts® (1977 through May 2000)

Healthcare Standards (1975 through May 2000)

Health Devices Sourcebase® (through May 2000)

Health Services Research Projects (HSRProj) (through February 29, 2000)

HealthSTAR (Health Services, Technology, Administration, and Research) (1990 through May 20, 2000)

International Health Technology Assessment (IHTA)© (1990 through May 2000)

Locatorplus (through May 2000)

MANTIS™ (through February 4, 2000)

MEDLINE® (1964 through March 8, 2000)

National Guideline Clearinghouse™ (NGC) (through May 2000)

NHS Economic Evaluation Database (NHS EED) (through May 2000)

PsycINFO® (1967 through January 17, 2000)

REHABDATA (through February 2000)

TARGET™ (through May 2000)

Search dates generally spanned each database from its inception to July 2000. These searches were supplemented by hand searches of article bibliographies, searches of the gray literature, and searches of the World Wide Web. Key words used as part of the search strategies for the following databases are listed below.

Searches in MEDLINE®, EMBASE®, CINAHL®, Current Contents®

(presented in PubMed syntax)

  • Lumbar spinal stenosis literature key words: Spinal stenosis, sciatica, backache, spinal diseases, ischialgia, compressive neuropathy, spinal claudication, neurogenic claudication, intermittent claudication, nerve root entrapment, nerve root compression, osteoarthritis, spondylosis, spondylolisthesis, cauda equina, spinal osteophytosis, stenosis (lumbar, exit zone, nerve root canal, foraminal)

  • Spinal canal measurement literature key words: Spinal canal/anatomy, histology, spine, anthropometry, spinal (column, canal, diameter, measure)

Searches in PsycINFO®

(presented in DIALOG® syntax)

  • Key words: Back pain (chronic), cauda equina (syndrome, compression), sciatica, lumbago, spondylolisthesis, spinal osteophytosis, stenosis (spinal, lumbar, foraminal, nerve root canal), intermittent neurogenic claudication

Searches were restricted to studies examining human subjects. Case reports were excluded.

Our literature searches were geared to seeking out articles on a broad range of conservative treatments. These included physical therapy, management of symptoms with drugs, rigid brace, bed rest, epidural injection of steroids and/or anesthetics, injection of calcitonin, acupuncture, trigger point treatment, electrical stimulation of nerves or muscle, facet joint injections of steroids or anesthetics, chiropractic manipulation, and multidisciplinary approaches. We also searched for studies on a broad range of surgical treatments. These included total radical laminectomy, standard wide decompressive laminectomy, standard wide decompressive laminectomy with fusion, standard wide decompressive laminectomy with fusion and instrumentation, partial laminectomy (hemilaminectomy), laminotomy, and foraminectomy.

Our searches yielded 4,788 items that were evaluated for this project.

To prevent potential biases in this evidence report, we adopted specific a priori criteria for determining whether we would retrieve any article identified by our literature searches. Separate criteria were developed for studies of conservative treatment, studies of surgical treatment, and studies of diagnostic modalities. Disputes were always resolved in favor of retrieving the full article.

Because the different key questions of this report could be answered using different kinds of data, we adopted slightly different criteria to select articles relevant to answering questions about natural history, treatment, and diagnosis. Our criteria were, in general, very broad in order to ensure that we did not overlook any relevant information.

Patient groups less than 20 years of age were excluded to ensure that degenerative lumbar spinal stenosis and not congenital lumbar spinal stenosis was being examined. We abstracted only the data on degenerative conditions when these data were reported separately.

To answer questions about the natural history of lumbar spinal stenosis, we retrieved all published studies that contained (or purported to contain) measurements of the spinal canal among patients with lumbar stenosis and all trials that contained (or purported to contain) measurements of the spinal canal in normal healthy patients or patients with back pain, regardless of cause. To be retrieved, studies relevant to the natural history of spinal stenosis had to contain more than 10 patients; or, if the study contained more than 1 patient group, it had to contain more than 10 patients per group.

Only two general criteria were applied for retrieving articles on diagnosis of spinal stenosis. First, the article had to be on a diagnostic, and second, it had to be on patients with degenerative lumbar spinal stenosis.

In general, we employed the following criteria for determining whether an article on conservative treatments or surgery would be retrieved:

  • We retrieved any controlled trial, regardless of whether the trial was randomized or concurrently controlled.

  • In addition to controlled trials, we also retrieved any clinical study of surgical treatment for lumbar spinal stenosis.

  • Only studies with at least 10 patients in each arm of the trial were retrieved.

  • Studies of patients with degenerative spondylolisthesis, also called pseudospondylolisthesis, were included.

Once an article was retrieved, we evaluated the comparability of the patients in its different groups, the reporting of data, whether results from more than one type of treatment were combined, and whether results from patients with different disorders or conditions were combined. Trials that combined data from different procedures or from patients with different conditions were not considered further.

Data from all articles that met our inclusion criteria were abstracted using electronic data abstraction forms.

Our analysis of the literature comprised questions for which we were able to combine evidence from different studies and questions with evidence that did not permit such combination. We employed quantitative methods to answer both types of questions.

For parts of questions 1 and 2, we found data that permitted us to combine evidence, and we performed meta-analyses to address them.

Our quantitative analyses of those questions with data that did not permit combination consisted of performing de novo statistical analyses of the data published in the relevant articles wherever possible. Such statistics were either computed from raw data presented in the article or from figures contained in the article. In some cases, we computed effect sizes based on the published data.

Findings

  • Patients with back pain or claudication tend to have narrower spines than asymptomatic patients.

  • Increased patient age and the presence of herniated disks may also contribute to the development of back pain and other symptoms of stenosis. The strength of these relationships and the exact ages at which patients are most likely to develop symptoms cannot be determined from the information available.

  • Some evidence suggests that disk degeneration, narrowing of the spinal canal, and degenerative changes in the spinal ligaments contribute to stenosis and that instability increases with age. However, the strength of this relationship and the age at which stenosis is most likely to occur cannot be determined from the available information.

  • Heavier patients may be more likely to develop the degenerative changes leading to stenosis. Similarly, patients with osteoarthritis of the hips, as well as patients who perform heavy labor, tend to have more disk degeneration than other patients.

  • Very little evidence exists correlating degree of narrowing of the lumbar spine with the presence or severity of the signs, symptoms, or conditions associated with stenosis. Difficulties associated with finding such correlations include the presence of large numbers of patients with spinal narrowing and no symptoms, variations in canal size throughout the population, and lack of an accepted system for quantifying the degree of narrowing.

  • Only two studies provide numerical evidence of a lack of association between severity of stenosis or spondylolisthesis and severity of back pain. There is some evidence of a relationship between degree of spinal instability and back pain. Among patients with symptomatic stenosis, those with more severe stenosis tend to have more disability.

  • Clinical signs and symptoms do not appear to predict whether the results of imaging tests will show severe stenosis.

  • Evaluation of conservative treatment trials is complicated by the lack of patient inclusion criteria restricted to lumbar spinal stenosis. Controlled trials specifically examining and reporting on patients with lumbar spinal stenosis who receive conservative treatments are rare.

  • Few studies have examined the question of the relationship of initial signs and symptoms to the final status or amount of change following conservative treatment. Studies reporting patient outcomes for conservative treatment vary in their results.

  • One well-designed randomized controlled trial (RCT) indicates that local anesthetic block provides temporary relief from neurogenic claudication for about 1 month. Conclusions about effectiveness beyond 3 months cannot be made.

  • Evidence for the efficacy of other conservative treatments in lumbar spinal stenosis patients is lacking. However, the lack of evidence for effectiveness does not prove that these treatments are not effective.

  • The few studies that did stratify outcomes by patient characteristics, especially those that examined degree of stenosis, did not find a connection between successful treatment and specific patient characteristics.

  • The lack of comparable patient groups and pretreatment data is a common problem in evaluating studies that examined both surgical and nonsurgical treatment groups.

  • One RCT provides evidence that patients with severe symptoms will benefit more from surgery than conservative therapy.

  • In general, data are lacking on the effect of conservative treatment on patients with severe stenosis since these patients seem to receive surgery shortly after diagnosis.

  • There is limited, contradictory evidence on whether patients with moderate pain benefit more from surgery or from conservative treatment.

  • No published trials provided the data necessary to determine whether the results of an imaging examination will determine the extent of success of surgical treatment.

  • We are unable to determine whether imaging results can identify patient groups that would be more or less likely to benefit from surgery.

  • The results of two RCTs seem to suggest that instrumentation in addition to fusion does not improve surgical outcomes among patients with spondylolisthesis. However, both trials likely had too few patients (and, therefore, insufficient statistical power) to render any definitive conclusion.

  • One study provides evidence that fusion is beneficial compared to decompressive surgery alone.

  • Information on the cost of surgical treatment of lumbar spinal stenosis came from several sources. Because present data did not allow us to estimate the effectiveness of any treatment or diagnostic, we were unable to perform a cost-effectiveness analysis.

Future Research

At least some of the gaps in current research on lumbar spinal stenosis seem to arise from the suboptimal designs and incomplete reporting of patient characteristics and results in studies that have been conducted to date. Controlled trials specifically are rare; our search revealed that only 4 of the 178 studies examining conservative treatments and 7 of the 147 trials examining surgical treatments for lumbar spinal stenosis were RCTs.

The absence of detailed descriptions of patients enrolled in observational studies is another gap in currently available literature. Mean age and duration of illness prior to treatment were the only patient information reported in all four conservative treatment trials; among the surgical trials, only mean age was reported in more than 80 percent of the publications. The lack of detailed reporting of patient signs and symptoms complicates any attempt at correlating them with the extent of recovery after treatment.

Lack of detailed reporting of patient characteristics can sometimes be overcome if studies employ rather restricted patient inclusion criteria. Available information suggests, however, that such criteria were only infrequently applied across studies of lumbar spinal stenosis. For example, although the mean age of patients reported in most studies was between ages 50 and 70, some studies had means outside this range. Variation in age was even greater within studies, where the age ranges often extended to 30 years on either side of the mean.

In addition, in the 15 studies of surgical treatments which reported on whether patients received (and showed no improvement after) a course of conservative treatment, the actual amount of prior conservative treatment reported ranged from as little as 2 weeks to as much as 16 years.

In order to advance patient care in this field, definitive evidence-based statements about the natural history, diagnosis, and treatment of spinal stenosis await the results of well-designed clinical trials. In particular, studies are needed to determine:

  • the value of imaging to increase the likelihood of success with either conservative or surgical treatment, and

  • the proper course of treatment, conservative or surgical, for patients with mild, moderate, or severe symptoms caused by degenerative lumbar stenosis.

Chapter 1. Introduction

Scope and Purpose of This Report

The purpose of this report is to assess, in an evidence-based fashion, the efficacy of methods for the diagnosis and treatment of degenerative lumbar spinal stenosis. Lumbar spinal stenosis is defined as a focal narrowing of the spinal canal, although the precise amount of narrowing that must occur before the canal is considered stenotic differs among individuals (Alvarez and Hardy Jr, 1998; Bueff and Van der Reis, 1996; Fast and Greenbaum, 1995; Herkowitz, Abrahan, and Fischgrund, 1998; Herno, Saari, Suomalainen et al., 1999; Postacchini, 1996; Spivak, 1998) The general term "spinal stenosis" can be applied to three root compression mechanisms alone or in combination: (1) disk protrusion or herniation, (2) osteotic overgrowth into the spinal canal or the foramina through which the roots pass laterally, and (3) vertebral slippage or spondylolisthesis. Although symptoms overlap for these three mechanisms, the second category, osteotic stenosis, is specifically termed spinal stenosis; this category is the focus of this evidence report, with spondylolisthesis also being addressed.

In extreme cases, lumbar stenosis can cause cauda equina syndrome, which is characterized by severe neuromuscular, bladder, and rectal dysfunction, and is considered to require immediate surgery to prevent permanent nerve damage (Gunzburg and Szpalski, 1999). However, because many studies excluded patients with cauda equina syndrome, virtually no evidence related to this syndrome could be evaluated, and, therefore, consideration of cauda equina syndrome is beyond the scope of this evidence report.

This evidence report, therefore, focuses on less extreme manifestations of lumbar spinal stenosis and considers the evidence surrounding all aspects of this condition. This includes evidence concerning the natural history of lumbar stenosis. This is important because it is not firmly established that this condition is progressive, nor has it been firmly established that stenosis per se is responsible for the symptoms experienced by many patients. Among these symptoms are low back pain, radiculopathy, and neurogenic claudication (Alvarez and Hardy Jr, 1998; Bueff and Van der Reis, 1996; Fast and Greenbaum, 1995; Herkowitz, Abrahan, and Fischgrund, 1998; Herno, Saari, Suomalainen et al., 1999; Postacchini, 1996; Spivak, 1998).

This report also considers the methods used to diagnose lumbar spinal stenosis. Among these methods are myelography, computed tomography (CT), and magnetic resonance imaging (MRI). In evaluating these methods, the typical quantities used to gauge test performance (sensitivity, specificity, and positive and negative predictive values) are considered. Also considered is whether these methods can be used to predict which patients may respond to treatment. Evaluating the efficacy of diagnostic methods for spinal stenosis has many difficulties because imaging is often performed after medical management has proved unsuccessful. Therefore, a common use of imaging is for planning surgery.

Finally, treatments for lumbar spinal stenosis are considered. Included are both medical and surgical treatments. Our searches for information about both classes of treatments were comprehensive. In this analysis, particular attention is paid to patient-oriented outcomes (i.e., relief of symptoms). This is because reducing or eliminating the stenosis may not provide a concomitant reduction in the intensity of symptoms.

Epidemiology of Lumbar Spinal Stenosis

Epidemiology data on lumbar spinal stenosis come from several studies. The annual incidence of spinal stenosis observed in a Swedish study that defined spinal stenosis as a canal of 11 mm or less among patients referred to orthopedic departments was approximately 5 per 100,000 inhabitants (Johnsson, 1995).

Table 1. National Low Back Pain Study Patient Characteristics and Syndromes
DescriptionProportion (%)
Mean age 45 years (SD 12.8; range 25 to 75 years)
Male46
Female54
Main Category
Root compression62
Myofacial syndrome20
Instability19
Postsurgical complications2
Other19
Root Compression
Herniated disk59
Spinal stenosis23
Lumbar spondylosis20
Osteoarthritic root compression14
Nonherniated degenerated disk10
Disk versus Osteo Root Compression
Disk69
Osteo57
Both26
Instability
Spondylolisthesis39
Facet joint arthritis26
Lumbar instability19
Spondylolysis17
Compression fracture10
Spina bifida3
Root Compression Treatment
Conservative care55
Surgery30
No treatment13
Instability Treatment
Conservative care71
Surgery15
No treatment14

Adapted from the National Low Back Pain Study. (Long, BenDebba, Torgerson et al., 1996)

In the National Low Back Pain Study (Long, BenDebba, Torgerson et al., 1996), records were examined for 2,374 patients with chronic low back pain. These patients sought help from orthopedic surgeons and neurosurgeons at eight academic medical centers across the United States from 1986 to 1991. Of these patients, 45.8 percent were male and 54.2 percent were female, with a mean age of 45.3 years (standard deviation 12.79, range 25 to 75). The proportions and categories diagnosed were 62 percent root compression, 19.6 percent myofascial syndrome, 18.7 percent instability, 2.1 percent postsurgical complications, and 19.1 percent other (see Table 1). Within the broad category of root compression, the final diagnoses were 59.2 percent herniated disk, 22.6 percent spinal stenosis, 19.7 percent lumbar spondylosis, 14.0 percent osteoarthritic root compression, and 9.8 percent nonherniated degenerated disk. In other words, 69 percent of root compressions were disk-related, and 57 percent were osteo-related (26 percent were both, so these percentages sum to 126 percent). Within the broad category of instability, the final diagnoses were 39.1 percent spondylolisthesis, 25.7 percent facet joint arthritis, 19.3 percent lumbar instability, 16.6 percent spondylolysis, 10.2 percent compression fracture, and 2.7 percent spina bifida. For patients with root compression, 54.5 percent were given conservative care, 30.4 percent were given surgery, and 13.3 percent were given no treatment. For those with instability, 70.6 percent were given conservative care, 15.3 percent were given surgery, and 14.1 percent were given no treatment.

From the above data, we calculate that of all of these patients seeking treatment for low back problems, 35 percent had osteo-related root compression and were possible candidates for bone-removing surgery (62 percent had root compression, and 57 percent of these had osteo-related compression: 23 percent spinal stenosis, 20 percent lumbar spondylosis, 14 percent osteoarthritic root compression). However, the severity of disease was not reported; thus, the proportion of these patients with disease severe enough to indicate surgery is not known.

The National Ambulatory Medical Care Survey (NAMCS) also provides data on the incidence of lumbar spinal stenosis in the US population (Hart, Deyo, and Cherkin, 1995). The NAMCS is an annual survey of 3,000 general physicians conducted by the National Center for Health Statistics and is intended to be representative of practicing, nongovernmental, office-based physicians in the United States. Over the period 1989 to 1990, the diagnostic cluster for low back pain ranked fifth in frequency among categories and accounted for 2.8 percent of patient visits. Only visits for hypertension, pregnancy, general medical exam, and acute upper respiratory infection ranked higher in frequency of reasons for visiting a physician. The survey estimated that 29,964,894 visits for mechanical back problems were made in the United States during this period. Of these visits, 56.8 percent were classified as nonspecific backache, 11.1 percent as herniated disk, and 3.9 percent as spinal stenosis. For the purposes of analyzing survey data, spinal stenosis was defined as lumbar stenosis or spondylogenic compression of the lumbar spinal cord or nerve roots.

The National Spine Network (NSN) provides another estimate of the prevalence of lumbar spinal stenosis (Fanuele, Birkmeyer, Abdu et al., 2000). Data on 17,774 patients from 25 centers that treat back and neck problems were examined in this study. The average patient age was 47.5 years (SD 15.4, range 17 to 98), 54.7 percent of patients were male, and 84.2 percent of patients were white. Among these patients, 13.1 percent were specifically diagnosed with spinal stenosis, 12.9 percent with degenerative spondylosis due to aging, and 19.2 percent with herniated disks.

Table 2. Comparison of Prevalence Rates for Lumbar Spinal Stenosis Presented in Three Separate Studies
Number of Patients in Each Diagnostic CategoryHart, Deyo and Cherkin, 1995Long, BenDebba, Torgerson et al., 1996Fanuele, Birkmeyer, Abdu et al., 2000
Number of patients with back pain29,964,894 (estimated)2,37417,774
Spinal stenosis3.9%14%13.1%
Spondylosis---12%12.9%
Herniated disk11.1%37%19.2%
Probable degenerative changes12.5%------
A comparison of the data from these three studies indicates that among patients with low back pain who see a specialist, 13 percent to 14 percent may have spinal stenosis. These surveys also indicate that among patients with low back pain who see a general physician, 3 percent to 4 percent may have spinal stenosis (see Table 2). The NAMCS estimate of 3.9 percent of backache patients having lumbar spinal stenosis is probably the more reliable because this patient base comes from office-based physicians in the United States.

Table 3. Longitudinal Study of Development of Spondylolisthesis in Older Adults
StudynSelectionPatient CharacteristicsImage and InterpretationStratificationFrequency of Lumbar Spondylo-listhesisComments
Kauppila, Eustace, Kiel et al., 1998617Surviving participants in Framingham Heart Study who had lateral lumbar radiographs35% male, 65% female; mean age at start 54 yr, mean age at followup 79 yr.Radiographs (lateral) Readers: slippage identified by one researcher blinded to demographic variables and symptoms; slippage measured by same and additional radiologist; only slippage >3 mm according to both readers was countedMales FemalesBaseline 1% (2/219) 25 yr followup 12% (25/217) Baseline 1.5% (6/400) 25 yr followup 26.5% (106/400)Spondylolisthesis criterion: degenerative slippage of >3 mm, not including spondylolytic spondylolisthesis
In regard to spondylolisthesis, in lateral radiographs taken for the longitudinal Framingham Heart Study (Kauppila, Eustace, Kiel et al., 1998), 1 percent (2/219) of men and 1.5 percent (6/400) women already had slippage at the baseline measurement at the mean age of 54 years. Over the following 25 years, 11 percent (23/217) of men and 25 percent (100/400) of women developed degenerative vertebral slippage (see Table 3).

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3660_F001.jpg.

   Figure 1. Differential Diagnosis of Lumbar Spinal Stenosis

The following figure is intended to portray the differential diagnosis presented for low back pain as described by Borenstein. (Borenstein, 1996). Additional information was obtained from several other references (Alvarez and Hardy Jr, 1998; Bueff and Van der Reis, 1996; Fast and Greenbaum, 1995; Herkowitz, Abrahan, and Fischgrund, 1998; Herno, Saari, Suomalainen et al., 1999; Postacchini, 1996; Spivak, 1998; Tan, Roux, Dunand et al., 1992).As such, this figure is not intended as a recommendation or guideline.

There are many etiologies for chronic low back pain, of which lumbar spinal stenosis is just one. Figure 1 provides a differential diagnosis that separates the typical symptoms of lumbar spinal stenosis from other potential causes of low back pain.

Natural History of Lumbar Spinal Stenosis

Shrinkage and narrowing of disks as well as osteoarthritic changes can lead to both spinal stenosis and spinal instability (Jane, Jane, Helm et al., 1996; Mirkovic, Garfin, Rydevik et al., 1992; Mirkovic, Garfin, Rydevik et al., 1992; Postacchini and Perugia, 1991; Rowe, 1969). Therefore, disk degeneration and osteoarthritis of the spine are discussed in this Natural History section as underlying conditions that can lead to spinal stenosis, even though treatment for disk herniation and rupture and osteoarthritis is outside the scope of our analysis.

Disk Degeneration

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3660_F002.jpg.

   Figure 2. Etiology of Lumbar Spinal Stenosis

Radicular pain (radiculopathy): disorder of the peripheral nerves due to compression of the nerve roots where they emerge from the spinal cord; responsible for leg pain. Patients have intense sciatica that may persist upon sitting. Nocturnal pain may become a predominant symptom.

Neurogenic intermittent claudication: radiating pain to the thighs and calves associated with compression of the caudea equina. Burning, cramping, numbness, tingling, and dull fatigue in the thighs and legs are classic clinical presentation for lumbar spinal stenosis. These symptoms appear or worsen with the onset of walking, or the severity increases with standing only; both are promptly relieved by sitting, bending, or lying down. The patient may also have urinary incontinence and sphincter dysfunction.

Ordinary wear and tear, combined with age-associated changes in the tissue composition of the spinal disks, lead to disk degeneration (see Figure 2). Type II collagen, the more hydrated form, decreases in proportion to type I collagen (Jane, Jane, Helm et al., 1996). Proteoglycan content declines (Mirkovic, Garfin, Rydevik et al., 1992). In the absence of hydrostatic proteoglycans, the water content of the disk declines. As the overall hydration of the disk decreases, the disk becomes less elastic and thus less able to evenly distribute the constantly shifting loads placed upon it (Prescher, 1998). Unevenly distributed stresses may lead to the development of tears and weaknesses in the disk. To a certain extent, this degeneration is an inevitable result of aging (Jane, Jane, Helm et al., 1996). Degeneration begins as early as the second decade in men and the third in women (Mirkovic, Garfin, Rydevik et al., 1992). By the age of 40, 80 percent of males' and 65 percent of females' disks are moderately degenerated (Mirkovic, Garfin, Rydevik et al., 1992).

In susceptible individuals, disk degeneration can lead to pain and damage to the spinal nerves. Whether joint degeneration leads to pathologic consequences depends upon the characteristics of the individual patient (Jane, Jane, Helm et al., 1996).

Osteoarthritis in the Spine

Incremental microdamage in the lower disks of susceptible patients ultimately results in venting and loss of pressure in the disk nucleus (Rowe, 1969). As the disk degenerates, it narrows, decreasing the distance between vertebrae. The distribution of forces in the joint is altered. The ligaments connecting the vertebrae become lax, destabilizing the joint. Instability and altered force distribution lead to mechanical stress, which in turn can cause osteoarthritic changes in the articular processes (Fast and Greenbaum, 1995). Consequently, the vertebral facets become enlarged, the vertebral pedicles thicken, and the ligamentum flavum thickens. Type II collagen replaces elastic tissue (Jane, Jane, Helm et al., 1996; Schrader, Grob, Rahn et al., 1999), and calcium crystals are deposited (Schrader, Grob, Rahn et al., 1999). Hyalinization of the collagen fibers and proliferation of chondrocytes also contribute to the ossification of the ligament. Facet hypertrophy, thickening of the pedicles, and ossification of the ligamentum flavum lead to narrowing of the central spinal canal. Traction spurs may develop. These spurs can also impinge on the spinal canal or the nerve roots.

Eventually, vertebral stability may be regained as scarring occurs across the nuclear compartment (Rowe, 1969). Osteoarthritic changes may also lead to increased stability or even fusion between two vertebrae (Postacchini and Perugia, 1991; Rosenberg, 1975).

In susceptible individuals, the degenerative changes in the facet joints lead to two overlapping pathological and clinical entities: central and lateral stenosis. The two conditions may not be distinguished by their symptoms (Amundsen, Weber, Lilleas et al., 1995). The extent to which the degree and location of stenosis correlates with the nature, intensity, and location of symptoms is unclear. Individuals are frequently observed to have marked stenosis and no symptoms (LaRocca and Macnab, 1969; Nagler and Bodack, 1993; Postacchini and Perugia, 1991; Splithoff, 1953). Among patients with symptoms, long periods of remission are thought, at least by some, to be common (Rosenberg, 1976). However, the incidence and duration of these periods of remission are not well studied.

Degenerative Central Lumbar Stenosis

Lumbar spinal stenosis can be separated into three broad categories, specifically central stenosis, lateral stenosis, and spondylolisthesis. We have used these categories in our analysis to organize the literature and assist in combining evidence. Specific consideration of the etiology of the stenosis is important because different conservative and/or surgical treatments may be more or less effective depending on the nature of the stenosis.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3660_F003.jpg.

   Figure 3. Top View of a Normal Lumbar Vertebra

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3660_F005.jpg.

   Figure 5. Oblique View of the Normal L3 to L5 Lumbar Vertebrae

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3660_F006.jpg.

   Figure 6. Pathology Associated with Lumbar Spinal Stenosis

Central stenosis refers to a narrowing of the spinal canal across the anteroposterior diameter, the transverse diameter, or both (Gunzburg and Szpalski, 1999; Postacchini, 1999; Woolsey, 1986). The central canal is enclosed anteriorly by the posterior portion of the vertebral body and the vertebral disk and posteriorly by the lamina and the base of the spinous process (see Figures 3, 5, and 6).

Degenerative Lateral Stenosis

Entrapment and compression of the nerve root in its pathway through the spine, referred to as the nerve root canal, is termed lateral stenosis (Gunzburg and Szpalski, 1999; Jenis and An, 2000; Postacchini, 1999; Woolsey, 1986). The nerve root canal begins where the nerve root exits the dura and ends where the nerve root leaves the intervertebral foramen. The nerve root canal is bordered by the pedicle of the vertebra above and the pedicle of the vertebra below. The anterior side of the canal is formed by the vertebral body and vertebral disk. The posterior side of the canal is formed by the facet joint structures of the vertebrae above and below (see Figures 3, 4, and 5). Lateral stenosis occurs when the spinal nerve is compressed within the nerve root canal and/or the vertebral foramina (Fritz, Delitto, Welch et al., 1998). As the disk narrows, the pedicle may move in an inferior direction, narrowing the lateral recess and pinching the spinal nerve (Jane, Jane, Helm et al., 1996; Mirkovic, Garfin, Rydevik et al., 1992). MacNab (1977) originally described this entrapment and compression of the nerve root between a diffuse lateral bulge of the disk and the pedicle above as pedicular kinking. Narrowing of the lateral recess can also be the result of facet hypertrophy or enlargement and ossification of the ligamentum flavum. Radiculopathy, or decreased function of a nerve root, is commonly observed with lateral stenosis. Impingement of the disk into the lateral recess is considered a separate condition (Amundsen, Weber, Lilleas et al., 1995) and is beyond the scope of this evidence report.

Degenerative Spondylolisthesis

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3660_F007.jpg.

   Figure 7. Spondylolisthesis Showing Entrapment of Spinal Canal and Nerve Roots

Degenerative spondylolisthesis is a slippage of one lumbar (L) vertebra over an adjacent vertebra and commonly occurs between L4-L5 (Woolsey, 1986). When L4 slips forward, the central canal and the nerve roots become entrapped between the posterior body of L5 and the inferior articular facets of lamina of L4 (see Figure 7).

Disk degeneration may lead to spinal instability. Decreased disk height and degenerative changes in the angles of the facet joints contribute to this instability (Mirkovic, Garfin, Rydevik et al., 1992; Postacchini and Perugia, 1991; Rowe, 1969). This can lead to spondylolisthesis, or displacement of the vertebrae. Lumbar vertebrae have a tendency to be displaced forward due to their normal anterior curve and the action of gravity, muscular force, and other forces (Kim and Lee, 1995). As the vertebra slips forward, the medial edge of the superior facet may encroach on the central canal (Postacchini and Perugia, 1991). Distal to the slipping vertebra, the dural sac becomes trapped between the intervertebral disk and the posterosuperior angle of the underlying vertebral body on one side and the advancing neural arch of the slipping vertebra on the other. This is most commonly observed at the L4-5 level (Herkowitz, 1995; Newman, 1976). The anatomical reasons for the greater likelihood of degeneration at L4-5 have been described by MacGibbon and Farfan (MacGibbon and Farfan, 1979). Lateral stenosis may also result from spondylolisthesis when the nerve root becomes compressed between the inferior facet of the superior vertebra and the vertebral body of the inferior vertebra (Newman, 1976).

Further remodeling of the articular processes may eventually limit the vertebral slippage (Postacchini and Perugia, 1991). The degree of slippage usually does not exceed 20 percent to 30 percent of the width of the inferior vertebra (Herkowitz, 1995; Postacchini and Perugia, 1991; Rosenberg, 1975).

Spinal Stenosis and Spondylolisthesis in the Asymptomatic Population

Table 4. Spinal Stenosis in Asymptomatic Adults
StudynSelectionPatient CharacteristicsImage and InterpretationStratificationFrequency of Lumbar StenosisComments
Healy, Healy, Wong et al., 199619Long-time highly active male athletes over age 40, currently free of pain or physical limitations (some were symptomatic with low back pain in past)All male; mean age 53 yr (range 41 to 69)MRI Readers: 2 blinded, 1 not; no significant differences in interpretation; small differences resolved by consensusNoneStenosis 5% (1/19) Neural foraminal narrowing 16% (3/19)One patient was considered to have "marked spinal stenosis"; only 16% had "normal" MRIs; 79% had abnormal disks
Jensen, Brant-Zawadzki, Obuchowski et al., 199498Volunteers recruited by advertising among hospital staff; no previous back pain for longer than 2 days51% male, 49% female; mean age 42.3 yr (range 20 to 80)MRI 2 readers: blinded by randomly adding 27 abnormal scans (22% of total); differences resolved by consensusNoneStenosis 7% Neural foramen stenosis 7%Stenosis criteria: obliteration of epidural fat and thecal sac flattening; foramen criterion: obliteration of perineural fat; 64% had abnormal disks
Parkkola, Rytokoski, and Kormano, 199360Healthy volunteers randomly selected from Finnish national insurance roll; matched by sex, age, employment, and profession with a cohort of chronic low back pain patients55% male, 45% female; "middle aged," Million index1 mean (SD): males 6 (±11), females 7 (±12)MRI 2 readers; intraobserver kappa = 0.93 and 1.0, p >0.001; interobserver kappa = 0.33 and 0.85, p <0.05NoneCentral stenosis 3% (2/60)Disk bulging in 15% of healthy patients; 40% of the 3 lowest disks were degenerated
Boden, Davis, Dina et al., 199067Volunteers recruited with newspaper ads with no history of back pain of greater than 1 day and no history of sciatica or neurogenic claudication45% male, 55% female; mean age 42 yr (range 20 to 80)MRI 3 readers: blinded by randomly adding 33 abnormal scans (33% of total); ratings were averagedTotal, n = 67 20-39 yr n = 35 40-59 yr n = 18 60-80 yr n = 14 5% 1% 0 21%Stenosis criteria: nondisk loss of signal in epidural fat with compression of neural tissues within canal; 24% had herniated disks
Wiesel et al., 198452Volunteers with no history of back pain or sciatica33% male, 67% female; mean age 40 yr (range 21 to 80)CAT 3 readers: blinded by randomly adding 6 abnormal scans (10% of total); ratings were averaged; some scans rejected as inadequateTotal n = 48 <40 yr n = 22 >40 yr n = 26 5% 0 9%Stenosis criteria: facet arthritis or bulging anulus and loss of epidural fat and compression of neural tissues

1 Million index scale: 0 = perfect back health, 100 = unable to manage daily activities due to back pain.

As mentioned above, not all patients with spinal stenosis are symptomatic. Five studies (Boden, Davis, Dina et al., 1990; Healy, Healy, Wong et al., 1996; Jensen, Brant-Zawadzki, Obuchowski et al., 1994; Parkkola, Rytokoski, and Kormano, 1993; Wiesel, Tsourmas, Feffer et al., 1984) have looked at the proportion of the asymptomatic adult population that has spinal stenosis upon imaging (see Table 4). Their results suggest that 3.3 percent to 5 percent of the adult population have central stenosis but is asymptomatic; 7 percent to 16 percent have neural foramen stenosis or narrowing. The proportion of stenotic, asymptomatic individuals was larger in older age groups (Boden, Davis, Dina et al., 1990), 21 percent (3/14) of adults age 60 and over, versus <1 percent (1/53) of adults below age 60.

Although the presence of apparent stenosis in the asymptomatic population raises a question about whether stenosis per se causes symptoms, those with more severe symptoms are more likely to have stenosis. In one study, stenosis was observed by imaging in 50 percent (5/10) of patients with severe low back pain, 11 percent (4/38) of patients with less severe low back pain, and 3.3 percent (2/60) of asymptomatic controls matched to the back pain patients by sex, age, employment, and profession (Parkkola, Rytokoski, and Kormano, 1993).

Table 5. Spondylolisthesis in Asymptomatic Older Adults
StudynSelectionPatient CharacteristicsImage and InterpretationStratificationFrequency of Degenerative Spondylo-listhesisComments
Kauppila, Eustace, Kiel et al., 1998477Surviving participants in Framingham Heart Study who had lateral lumbar radiographs and no chronic back pain or stiffness35% male, 65% female; mean age 79 yrRadiographs (lateral) Readers: slippage identified by one researcher blinded to demographic variables and symptoms; slippage measured by same and additional radiologist; only slippage >3 mm according to both readers was countedNone17.6% 1 (84/477)Spondylolisthesis criteria: degenerative slippage of >3 mm, not including spondylolytic spondylolisthesis

1 Our calculation from the reported data

There is also evidence that not all individuals with vertebral displacement are symptomatic. Based on lateral radiographs taken for the longitudinal Framingham Heart Study (Kauppila, Eustace, Kiel et al., 1998), at the mean age of 79, 19 percent (90/483) of patients without vertebral slippage had back pain or stiffness on most days compared to 32 percent (39/123) of patients with slippage (χ2 = 10.5, p = 0.001). This latter proportion implies that 68 percent of patients with slippage did not have chronic back pain. From this, we were able to calculate that 17.6 percent (84/477) of patients without chronic back pain had degenerative spondylolisthesis (see Table 5).

The presence of stenosis and slippage in spinal images of asymptomatic people indicates that treatment must be based on the convergence of symptoms and image evidence rather than on either type of evidence alone. And in spite of the relationship between image evidence and more severe symptoms, the possibility exists that, even in more severe cases, treatment based on the image and symptoms may actually address a coincidental condition without eradicating the actual cause of symptoms.

Burden of Disease

Although there is some literature on the burden of disease of low back pain in general, we could find only one study of the societal burden of disease specifically for spinal stenosis. The 1995 population study of two regions in Sweden (Johnsson, 1995) reported that with a spinal stenosis incidence of about 50 per 100,000, between 42 percent and 58 percent of these patients had claudication (leg pain or weakness upon standing or walking). From these data, the incidence was calculated as about 25 per 100,000 inhabitants for spinal-stenosis-associated claudication. More severe stenosis can result in cauda equina syndrome, which is characterized by the loss of sexual function and urinary and/or fecal incontinence. This same study reported that cauda equina syndrome had an incidence of less than 1 per 100,000.

Review articles (Alvarez and Hardy Jr, 1998; Spivak, 1998; Tatarek and Nancy Elizabeth, 2000) and textbooks (Gunzburg and Szpalski, 1999; Kirkaldy-Willis and Bernard, 1999) provide anecdotal evidence that individual patients with symptomatic spinal stenosis typically have chronic low back pain and pain and weakness in the legs that limits standing and walking to brief durations and short distances. This places limitations on the ability to carry out self-supporting daily activities as well as work, social, and recreational activities. This lack of activity may lead to obesity and general physical deterioration that may eventually result in the onset of cardiovascular and other serious health problems. These activity restrictions may also lead to depression and other psychological problems. More severe stenosis can result in cauda equina syndrome. If untreated, severe spinal stenosis is commonly believed to have the potential to result in severe symptoms that may become permanent and unresponsive to medical or surgical treatment (Dawson and Bernbeck, 1998; Hirsch, 1966; Johnsson, Uden, and Rosen, 1991; Johnsson, Uden, and Rosen, 1992; Kirkaldy-Willis and Bernard, 1999; Nagler and Bodack, 1993; Onel, Sari, and Donmez, 1993; Porchet, Vader, Larequi-Lauber et al., 1999; Postacchini, 1988; Shakil, Vaccaro, Albert et al., 1999; Shapiro S, 2000; Shapiro, Cooper, and Miller, 1993; Wiltse, 1977; Wisneski and Farmer, 1994). However, we could find no data supporting this belief except for acute onset of symptoms with herniated disks (Shapiro S, 2000; Shapiro, Cooper, and Miller, 1993) or postoperative complications (McLaren and Bailey, 1986).

Patterns of Care

The rates of lower back surgery and of surgery for spinal stenosis increased dramatically during the 1980s (Ciol, Deyo, Howell et al., 1996). Data from the National Hospital Discharge Survey showed an age-adjusted 40 percent increase in lumbar spine surgery between 1979 and 1992 from 113 per 100,000 to 132 per 100,000. The increase for older patients was even greater, from 51 per 100,000 to 188 per 100,000, a 3.7-fold increase. This latter increase was attributed mainly to an increase in surgery for spinal stenosis, from 7.8 per 100,000 to 61.4 per 100,000, an almost eight-fold increase. Most of these increases occurred in the early 1980s and appeared to be leveling off by the early 1990s (Davis, 1994); however, we found no published systematic data on back surgery rates during the late 1990s.

According to Medicare records (patients 65 and over), the rate of spinal stenosis surgery in 1989 varied among states by more than four-fold, from 30 per 100,000 in Rhode Island to 132 per 100,000 in Utah (Ciol, Deyo, Howell et al., 1996). The rates were generally highest in Pacific and Mountain states and lowest in New England, Mid-Atlantic, and Southeastern states. While some of this divergence may result from socioeconomic factors, the geographic variations suggest that the divergence is substantially caused by the limited consensus regarding surgical indications for spinal stenosis (Ciol, Deyo, Howell et al., 1996).

Reoperation is an important part of lower back surgeries. A population-based cohort study of patients receiving lower back surgery in 1988 in Washington State reported that within five years there was an 18 percent reoperation rate for patients receiving fusion surgery and a 15 percent reoperation rate for those receiving nonfusion lower back surgery (Malter, McNeney, Loeser et al., 1998).

Ciol and colleagues used the United States Health Care Financing Administration data for the cohort of Medicare beneficiaries who received lumbar spine surgery in 1985 to estimate the rate of reoperation for this type of surgery (Ciol, Deyo, Kreuter et al., 1994). All hospitalizations up to four years after the surgery were examined for repeat surgery. After excluding for malignancy, infection, fracture, trauma, and Medicare eligibility based on end-stage renal disease, 27,111 patients were identified as having lumbar spine surgery related to spinal stenosis (39.3 percent), herniated disk (41.4 percent), degenerative disk disease (11.6 percent), or possible instability (3.3 percent). The average patient age was 71.8 years (range 59 to 97), 57.1 percent were women, and 92.7 percent were white. After four years, the rate of reoperation for lumbar spinal stenosis was approximately 10 percent and the reoperation rate for herniated disks was approximately 12 percent.

Diagnostic Tests

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3660_F008.jpg.

   Figure 8. Diagnosis and Treatment of Lumbar Spinal Stenosis

The information presented in this figure was derived from conversations with experts in the field and from literature sources (Alvarez and Hardy Jr, 1998; Bueff and Van der Reis, 1996; Eisenberg and Margulis, 2000b; Fast and Greenbaum, 1995; Grossman, Katz, Santelli et al., 1994; Herkowitz, Abrahan, and Fischgrund, 1998; Herno, Saari, Suomalainen et al., 1999; Postacchini, 1996; Spivak, 1998). This figure is not intended to be a practice recommendation or guideline.

Patients with lumbar spinal stenosis usually undergo a "staged" diagnosis (see Figure 8). The first diagnostic stage is the physician visit, during which the patient receives a physical examination. Results of the physical are combined with information from the patient history in a preliminary diagnosis. Lumbar spinal stenosis is not definitively diagnosed at this stage, so the diagnostic results are described as "consistent with" spinal stenosis or not consistent with spinal stenosis.

Although clinicians consider a combination of results of the history and physical examination and imaging findings to be the most effective means of diagnosing lumbar spinal stenosis, no objective criteria for using the history and physical examination have been reported. In addition, there are no reported clinical trials of the effectiveness of such a composite diagnosis. The only quantitative evidence correlating diagnostic information with outcomes is for the imaging findings. This absence of evidence limits the analysis that can be performed for some of the research questions.

History and Physical Examination

Katz et al. (1995) examined the value of the history and physical examination in the diagnosis of degenerative lumbar spinal stenosis. In this study, 93 patients over 40 years of age with symptoms of low back pain were examined by attending physicians who were then asked the extent to which they were certain the patient had lumbar spinal stenosis. The diagnostic impressions of expert clinicians and imaging, when available, were used as a reference standard to evaluate the attending physician's diagnosis. Severe lower extremity pain, absence of pain when seated, a wide-based gait, thigh pain following 30 seconds of lumbar extension, and neuromuscular deficits were all strongly associated with patients with lumbar spinal stenosis. No pain when seated and wide-based gait had the highest specificity, 93 percent and 97 percent, respectively. The highest sensitivity came from age greater than 65 (77 percent), pain below buttocks (88 percent), and no pain with flexion (79 percent) (Katz, Dalgas, Stucki et al., 1995).

Fritz et al. (1997) have developed a treadmill test as a clinical diagnostic tool for the differentiation of neurogenic claudication due to lumbar spinal stenosis from other pathologies that may produce similar symptoms. Spinal extension and weight bearing that occur during walking narrow the spinal canal and exacerbate the symptoms of lumbar spinal stenosis. Spinal flexion or nonweight-bearing postures that occur while sitting increase the dimensions of the spinal canal and reduce symptoms. The treadmill test involves having the patient walk on a level surface and an inclined surface. The time until onset of symptoms, total walking time, and time until symptoms return to baseline are recorded for each surface. Walking on an inclined plane produces spinal flexion and may be better tolerated by patients with lumbar spinal stenosis. The treadmill test was evaluated using 45 subjects with low back pain of varying etiologies and self-reported limitations in walking. Diagnostic images with MRI or CT were used as the gold standard for diagnosis. Twenty-six of the subjects were diagnosed by imaging as being stenotic. Self-reported sitting to relieve symptoms was significantly related to diagnosis. The sensitivity of this self-reported measure was 88.5 percent (95 percent confidence interval [CI] of 76.2 to 100), but specificity was 38.9 percent (95 percent CI of 16.4 to 61.4). For the treadmill test, earlier onset of symptoms with level walking, greater total walking time during inclined walking, and prolonged recovery after level walking were significantly related to a diagnosis of lumbar spinal stenosis. The sensitivity and specificity for earlier onset of symptoms with level walking were 68.0 percent (95 percent CI of 49.7 to 86.3) and 83.3 percent (95 percent CI of 66.1 to 100), respectively; for larger total walking time during inclined walking, they were 50.0 percent (95 percent CI of 37.5 to 62.5) and 92.3 percent (95 percent CI of 77.8 to 100), respectively; and for prolonged recovery after level walking, they were 81.8 percent (95 percent CI of 5.7 to 97.9) and 68.4 percent (95 percent CI of 47.5 to 89.3), respectively. The authors concluded that a two-stage treadmill test might be more useful in the differential diagnosis of lumbar spinal stenosis compared to patients' self-reports of posture (Fritz, Erhard, Delitto et al., 1997).

Use of the treadmill-bicycle test for the differential diagnosis of neurogenic claudication was also examined by Tenhula et al. (2000). In this study, 32 patients with documented lumbar spinal stenosis were evaluated before and after surgery. Patients were found to have a significant increase in their symptoms from the start to the end of the treadmill test but fewer patients were found to have significant symptoms on bicycle testing. Two years after surgery, patients had an improvement in their walking ability on treadmill testing, but showed no improvement in their ability to bicycle. The authors believe the treadmill-bicycle test may be a useful tool for the differential diagnosis of neurogenic claudication (Tenhula, Lenke, Bridwell et al., 2000).

Imaging examinations appear to be used primarily in a second diagnostic stage. At this stage, some surgical intervention is usually under consideration, so the imaging examination is as much for surgical planning as it is for confirmation of the preliminary diagnosis. In chiropractic care, a plain x-ray image may initially be obtained to aid in chiropractic therapy even if surgery is not planned (DuPriest, 1993).

In clinical practice, the imaging results help to confirm the diagnosis of spinal stenosis after the history and physical indicate the likelihood of spinal stenosis. Clinical trials interchangeably use myelogram, CT, or MRI results for confirmation of the diagnosis. This is partly because spinal stenosis is defined in terms of the anatomy displayed in the images. Also, there is no other means of verifying the results, short of measurement of the spinal canal during surgery. As discussed in the next section, plain film radiography is not considered a definitive standard for use in diagnosing lumbar spinal stenosis (Widelec, Bacq, and Peetrons, 1999). Because no independent means is available to confirm that imaging results are right or wrong, assessment of the performance of any of the imaging modalities for diagnosis of lumbar spinal stenosis is not possible in the same way we usually assess diagnostics. This is particularly true for negative cases, in which there would not be subsequent surgery on the spine.

Diagnostic Imaging Modalities

Radiography

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3660_F004.jpg.

   Figure 4. Lateral View of a Normal Lumbar Vertebra

The first imaging modality used to diagnose and evaluate lumbar spinal stenosis was radiography: film-based x-ray imaging, colloquially known as "plain film." The typical lumbar spine examination consists of AP (anteroposterior: front to back), lateral (side to side; see Figure 4), and oblique (diagonal; see Figure 5) views (Widelec, Bacq, and Peetrons, 1999). Film radiography has excellent spatial resolution for displaying small anatomic details but has several characteristics that limit its value in diagnosing spinal stenosis. First, it is a projection method: small anatomic details may be obscured by overlapping structures. Obtaining multiple views can sometimes resolve those structures. Second, the soft-tissue contrast of radiography is relatively low. Bones are depicted very clearly on plain radiographs, so they are frequently used to rule out vertebral fracture, if it is suspected.

Plain radiographs depict the spinal canal well enough for measurement of its diameter. The lateral radiograph is also useful for diagnosing spondylolisthesis, forward/backward displacement of a vertebra (Sackett, 1994; Wood, Popp, Transfeldt et al., 1994). For measuring displacement, radiography is considered the gold standard.

Myelography

Intradural contrast-enhanced radiography of the spine is known as myelography. To obtain a myelogram, a radiopaque contrast agent is injected into the spinal canal, and x-ray images are taken. The contrast agent diffuses through the spinal canal, outlining it quite clearly on the myelogram, but if stenosis causes a complete blockage of the spinal canal, no contrast agent will flow below (inferior to) the stenosis. Modern contrast agents for myelography are water soluble and contain iodine. Some patients are sensitive to iodine and may have an allergic reaction to the contrast material. Myelography is contraindicated in these patients. Adverse side effects are common in myelography. About half of patients experience head or neck pain, and 15 percent experience nausea or dizziness. Nearly all patients find myelography uncomfortable and will not be able to resume some normal activities, like driving, until the day after the procedure (American College of Radiology. 1999; Mitchell, 2000; Ramsbacher, Schilling, Wolf et al., 1997).

Table 17. Surgical Trials Reporting Imaging Results
Authors and yearImaging results reportedSigns and symptomsClinical results
Herno, Partanen, Talaslahti et al., 1999Stenosis grade (four-point scale)NoneCorrelation not reported
Hurri, Slatis, Soini et al., 1998AP distance (<7 mm or >7 mm)Correlation not reportedSuccess
Airaksinen, Herno, Turunen et al., 1997AP distance (CT or myelogram, five-point scale)NoneOswestry score, return to work
Nishizawa and Fujimura, 1997Stenosis grade (myelogram; 3-point scale); stenosis grade (enhanced CT; - point scale)NoneChange in JOA score
Sato and Kikuchi, 1997One-level or two-level stenosisSymptomsSuccess
Kawauchi, Yone, and Sakou, 1996Myeloscope; degree of adhesive arachnoiditisNoneSuccess
Rompe, Eysel, Hopf et al., 1995AP distance (CT; 2-point scale)CharacteristisConfounded by disk disease
McCullen, Bernini, Bernstein et al., 1994Degree of scoliosis, percent of slipNoneTurner score
Silvers, Lewis, and Asch, 1993Complete block or no block (myelogram)NonePain relief (short-term and long-term)
Satomi, Hirabayashi, Toyama et al., 1992Stenosis grade (myelogram 3-point scale)SymptomsConfounded by age
Johnsson, Uden, and Rosen, 1991Complete or partial block (myelogram)SymptomsSuccess
Jalovaara, Lahde, Iikko et al., 1989Marked or moderate stenosis (myelogram)NoneSuccess
Young, Veerapen, and O'Laoire, 1988Stenosis grade (myelogram)NoneStenosis grade at surgery
Surin, Hedelin, and Smith, 1982Marked or moderate stenosis (myelogram)NoneSuccess
Johnsson, Willner, and Pettersson, 1981Complete or partial block (myelogram)SymptomsSuccess
Tajima, Fukazawa, and Ishio, 1980Exclude; method not reported
Verbiest, 1977Type of stenosis and disk involvementSymptomsSuccess
Paine, 1976AP distanceSuccess
Shenkin and Hash, 1976Specific myelogram findings, level of stenosisSymptomsCorrelation not reported
Spinal stenosis is traditionally defined by its appearance on a myelogram. Complete blockage is as described above. Partial blockage is a stenosis where the film shows a complete interruption of the column of contrast agent at the point of focal narrowing (i.e., the stenosis), but enough contrast agent gets through the stenosis to opacify the spinal canal below (inferior to) the stenosis. Stenoses of lesser degree are defined by the diameter of the contrast agent column at the stenosis. This is usually measured in the AP dimension, as viewed on the lateral film. The threshold measurement used to define stenosis varies from investigator to investigator (see Table 17 and Question 7 in Chapter 3).

The myelogram has long been considered a gold standard for imaging spinal stenosis and helping to confirm the diagnosis, but it has been supplanted by the cross-sectional modalities CT and MRI. Recent clinical handbooks now recommend MRI rather than plain films or myelography as the primary imaging modality in cases of suspected spinal stenosis. Myelography is now rarely done in routine clinical practice (Eisenberg and Margulis, 2000b; Grossman, Katz, Santelli et al., 1994; Gundry and Heithoff, 1999; Mitchell, 2000).

Computed Tomography

Because it is a three-dimensional modality, CT avoids the overlapping-structure problem of film-based radiography. But CT is still an x-ray modality, and bones will appear more distinct than soft tissue on a CT scan. CT is limited to axial images (cross sections as viewed from the patient's head to toe; see Figure 3), but all currently available CT scanners have tilting gantries that allow the imaging plane to be tilted 20° to 30° in each direction (ECRI. 1999). For many patients, this permits acquisition of images parallel to the disks, so the entire disk is in one image, although the vertebral column is not straight up and down.

CT scans can be acquired following administration of contrast agent, just like radiographs. Because the three-dimensional images resolve overlapping structures, there is less need for contrast agent in CT scans of the spine. Also, smaller quantities of the contrast agent are necessary with CT, but CT can be done following a myelogram with its larger contrast agent dose.

Magnetic Resonance Imaging

While it depicts anatomy in cross-section as CT does, MRI is based on completely different physical principles. A full explanation of how MRI works is beyond the scope of this report. Briefly, MRI derives contrast primarily from differences in the T1 and T2 relaxation times of hydrogen nuclei in the body. Those differences stem from differences in the chemical environment of water in different types of cells. Parameters for magnetic resonance (MR) image acquisition ("pulse sequences") can be varied to emphasize T1 or T2 or to base contrast on some combination of the two. Usually, MR examinations comprise several sets of images using several different pulse sequences. As with all other MRI applications, some pulse sequences are more effective than others for imaging the spine (Ramsbacher, Schilling, Wolf et al., 1997). Because technical characteristics of different MR scanners are different, the pulse sequences that are ideal for one scanner may not be ideal for another scanner.

Besides the advantage of soft-tissue contrast, MRI has the advantage of being able to acquire images in any user-selected plane. Sagittal sections (cross sections as viewed from the patient's left to right; see Figure 4) are particularly useful in spine imaging, and it is quite easy to select axial image planes parallel to the disks. Contrast-enhancement agents are also available for MRI, but instead of containing iodine or other atoms of high atomic number like x-ray contrast agents, MR contrast agents contain paramagnetic ions like gadolinium. Because MR images inherently provide good contrast of the spine and its contents, contrast agents are not usually necessary in MRI of the spine. Studies described as "MR myelography" do not necessarily involve contrast agents (Ramsbacher, Schilling, Wolf et al., 1997).

The chief disadvantage of MRI is that its spatial resolution is not as good as that of film radiographs (Ramsbacher, Schilling, Wolf et al., 1997). MRI can also be susceptible to spatial distortions that would cause errors in quantitative measurements of the vertebrae, but adherence to routine quality assurance procedures at the imaging center will minimize distortions.

Ultrasound

Ultrasound can also generate cross-sectional images, but it is not well suited to imaging of the spine and contents. Ultrasound waves do not penetrate bone, so imaging can only be done through "windows" like the ligamentum flavum (Pai, 1993). Adequate windows cannot be found in some patients, so their spinal canal cannot be visualized with ultrasound. One of the two clinical trials of spinal ultrasound reported that this problem is worse in individuals with spinal stenosis (Pai, 1993). In one-third of the measurements reported in this trial, ultrasound results differed from radiographic results (considered the reference standard) by 2 mm or more. The other published ultrasound trial did not report quantitative measurements (Tait, Charlesworth, and Lemon, 1985). Ultrasound is inexpensive and safe, but it lacks accuracy in quantitative measurement of the spinal canal.

Imaging of the Spine in Typical Practice

Recent textbooks and clinical handbooks of radiology identify an important shift in clinical practice for diagnosis of spinal stenosis, herniation of the disks, and other conditions of the lumbar spine. The cross-sectional modalities, particularly MRI, are now considered the primary imaging tools for diagnosis of spinal stenosis (Eisenberg and Margulis, 2000a; Eisenberg and Margulis, 2000b; Grossman, Katz, Santelli et al., 1994; Gundry and Heithoff, 1999; Kirkaldy-Willis and Bernard Jr, 1999; Mitchell, 2000; Spengler, 2000; Widelec, Bacq, and Peetrons, 1999; Wilmink, 2000). Reasons cited for the shift include multiplanar imaging capabilities, better ability to show bulging or fragmentation of the disks, and the risks and discomfort of myelography. These books should not be considered evidence based. They represent "conventional wisdom" among radiologists, which is shaped in part by clinical trials and other evidence. In fact, Wilmink notes the absence of conclusive evidence that MRI is the most effective modality for diagnosing spinal stenosis (Wilmink, 2000).

Official guidelines developed by professional specialty societies like the American College of Radiology (ACR) and the Canadian Association of Radiologists (CAR) are also based on a mix of scientific evidence and expert opinion. CAR states that "CT and MRI have replaced myelography as the examination of choice for the above [spinal stenosis and other myelographic] indications" (Fontaine, Lee, Maloney et al., 1996). Myelography would still be an appropriate choice if the patient cannot be imaged by CT or MRI due to unavailability of a scanner, metal implants, or patient size or noncompliance. Appropriateness criteria from the ACR do not specifically address diagnosis of spinal stenosis, but for various types of nontraumatic myelopathy, MRI is considered very appropriate (8 on a scale of 1 to 9), and CT and myelography are considered less appropriate (usually 4 on the same scale) (American College of Radiology, 1998).

Plain films still have a role in diagnosis of spinal disorders; they may be all that is necessary to diagnose vertebral fracture and/or spondylolisthesis (Crosby and Brant-Zawadzki, 1994; Grossman, Katz, Santelli et al., 1994), and they are capable of measuring the diameter of the spinal canal (Sackett, 1994). Plain radiographs are considered "appropriate" by ACR in some clinical circumstances (6 or 7 on a 9 point scale) (American College of Radiology, 1998).

All the imaging modalities are adversely affected by surgical hardware and other metal objects implanted in the body. These objects are radiopaque; so they obscure overlapping structures in plain x-rays and myelograms. The effect is heightened in CT scanning, where the metal objects cause artifacts that can affect the entire image. Metal objects also affect MR scans, although most orthopedic devices are nonmagnetic and do not completely contraindicate MRI. Metal causes susceptibility artifacts: there is no signal from the object itself, and portions of the image near the object may be distorted or lose signal. The magnitude of this effect depends on the size and placement of the object and its composition (Fredrickson, 2000). Since diagnosis among patients who already have had surgery is outside the scope of this report, the studies we examined for this technology assessment did not deal with artifacts from metal implants.

All imaging modalities are subject to variation in image quality and diagnostic effectiveness and variation in interpretation by the radiologist. We located one study that addresses how variations in image quality affect the diagnosis of spinal stenosis. Jarvik et al. (2000) examined variations in the quality of lumbar spine MR images, and attempted to correlate them with characteristics like the magnetic field strength of the scanner and the ownership and siting of the scanner. Three readers rated the quality of 69 examinations from 17 centers. Readers were blinded to identifying information on the images, including information that could identify the center or the type of scanner used. The readers graded each examination on a four-point scale for the sharpness with which each of seven clinically important structures was depicted. They also assigned an overall quality score to each examination, using the same scale. The average overall quality rating for each center ranged from 1.96 to 3.56, while the average rating of the seven structures from each center ranged from 2.25 to 3.82 (Jarvik, Robertson, Wessbecher et al., 2000).

Jarvik et al. (2000) found that image quality was significantly decreased by the following characteristics: low magnetic field strength (less than 1.0 tesla), location of the scanner outside a hospital, and for-profit ownership of the scanner. The number of radiologists interpreting images at a center and the percentages of images reviewed by a physician before the patient was dismissed had smaller but still statistically significant effects on study quality. The authors note that they were unable to measure some characteristics that would be expected to affect study quality, such as training and experience of the radiologists. However, the results of this study are consistent with the idea that the quality of diagnostic imaging examinations, especially complicated ones like MRI, should not be taken for granted.

Finally, image quality should not be mistaken for diagnostic effectiveness, although poor image quality can impede effective diagnosis. While one modality may be able to depict structures with more detail and more contrast than another, the modality of lesser "quality" may still depict the anatomy clearly enough to permit a correct diagnosis. For this reason, we base our evaluation of diagnostic imaging modalities on their diagnostic results, not on subjective or objective measurements of image characteristics.

Surgical Planning

Imaging data are also necessary for planning surgical treatment. The surgeon needs to determine the extent of decompression, identify the bony anatomy, and measure the extent of the stenosis (Gunzburg and Szpalski, 1999). Accurate location of the stenosis is essential to avoiding operation at the wrong lumbar level. Identification of vertebral defects, anatomic landmarks, and other bone anatomy is necessary to minimize complications (Bernard and Yong-Hing, 1999). Selection of the correct type, size, and placement of orthopedic hardware is aided by preoperative imaging studies (Visarius, 2000). Both plain x-rays and CT or MRI are often needed in surgical planning (Lazennec, Ramare, Arafati et al., 1999); the specific combination of modalities varies from surgeon to surgeon and from patient to patient. There is no single preoperative imaging plan that is appropriate for all spinal stenosis patients.

Assessment of this indication for spinal imaging is complicated by the absence of any clinical trials measuring the effectiveness of the various imaging modalities in surgical planning.

Summary

Diagnostic imaging data are essential in planning for surgical treatment for lumbar spinal stenosis and related conditions. No one procedure or combination of procedures is right in every case, and multiple imaging modalities are used in most cases. Images obtained at the time the diagnosis of the patient's condition is confirmed are usually sufficient for surgical planning, in part because part of the purpose of those imaging examinations is surgical planning. No clinical evidence is currently available on the effect of any particular imaging modality on surgical outcomes.

Chapter 2. Methodology

Research Questions

Focus and Refinement of Topic

In order to focus, refine, and arrive at the key questions addressed by this assessment, the research team held telephone conversations with 11 experts in the field of lumbar spinal stenosis and one patient representative. These individuals were invited to comment on a document prepared by the research team charged with preparing the present evidence report. This research team consisted of five experts in technology assessment. The document comprised a preliminary evidence model (evidence models are discussed below) and written descriptions of the specific issues depicted in this model. Conversations with the experts and the patient representative clarified that the evidence report was to cover primarily lumbar stenosis in adults and that congenital stenosis or stenosis resulting from trauma was beyond the scope of the present evidence report. Upon receipt of the comments of the experts and the patient representative, the research team met to ensure that the evidence model was refined to incorporate the comments of the experts and the patient representative.

Peer Review

Upon completion of a draft report, the document was sent to additional experts in the field of spinal stenosis. The substantive comments were addressed in a document that described the disposition of each comment, and the draft evidence report was accordingly revised.

Evidence Model

The evidence model developed for this report is a graphic that depicts the relationships between the diagnostics and treatments of spinal stenosis and the benefits and harms that result from their use. These relationships are denoted by links (represented by arrows). The important links in the model were used to generate the key questions addressed in this report. Because of the complexity of the topic, we divided the evidence model into two sections. The first section portrays the questions on the natural history of lumbar spinal stenosis, and the second section portrays the questions on the diagnosis and treatment of lumbar spinal stenosis.

Key Questions on the Natural History of Lumbar Spinal Stenosis

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3660_F009.jpg.

   Figure 9. Evidence Model Depicting the Natural History of Lumbar Spinal Stenosis

The evidence model for the natural history of lumbar spinal stenosis addresses three questions that relate patient characteristics to patient signs, symptoms, and conditions (see Figure 9). The three questions are:
  1. What is the relationship between each relevant patient characteristic and the presence and/or intensity of each of the patient signs, symptoms, and conditions of lumbar spinal stenosis?

  2. Which relevant patient characteristics are correlated with an increased likelihood of focal narrowing of the spinal canal?

  3. What is the relationship the degree of stenosis and the presence and/or intensity of each of the signs, symptoms, and patient conditions?

Key Questions on the Diagnosis and Treatment of Lumbar Spinal Stenosis

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3660_F010.jpg.

   Figure 10. Evidence Model Depicting the Diagnosis and Treatment of Lumbar Spinal Stenosis

The evidence model for the diagnosis and treatment of lumbar spinal stenosis addresses five questions that relate to the connection between the signs and symptoms of lumbar spinal stenosis and the results of conservative treatment, the imaging examination, and the results of surgical treatment (see Figure 10).
  • 4

    What is the relationship between the signs and symptoms and other features of the history and physical and the results of the imaging examination? Implicit in this question is an examination of the criteria for diagnosis of spinal stenosis.

  • 5

    What is the relationship between the signs and symptoms and other features of the history and physical and results of conservative treatment and what is the relationship between the type of conservative treatment, and patient outcomes? Implicit in this question is whether any particular patient subgroup benefits from medical management of spinal stenosis.

  • 6

    What is the relationship between the signs and symptoms and other features of the history and physical and the success or failure of surgical treatment? Implicit in this question is whether any particular patient subgroup benefits from surgical treatment of spinal stenosis and whether some patients might benefit more from surgery than from medical management.

  • 7

    What is the relationship between the results of the imaging examination and the success or failure of surgical treatment? Implicit in this question is whether it is possible to predict that a certain patient subgroup will benefit from surgery.

  • 8

    What is the relationship between the type of surgery received and patient outcomes?

Costs Associated With Lumbar Spinal Stenosis

The last question does not appear in our evidence models because it deals with medical costs associated with lumbar spinal stenosis.

  • 9

    What costs are associated with nonsurgical and surgical treatment of spinal stenosis?

Databases Searched

Our literature searches were geared towards ensuring that our retrieval of information relevant to our key questions was comprehensive. The following is a list of the databases we searched, and the dates covered by our searches:

The Cochrane Database of Systematic Reviews (through 2000 Issue 1)

The Cochrane Registry of Clinical Trials (through 2000 Issue 1)

The Cochrane Review Methodology Database (through 2000 Issue 1)

CRISP (through May 2000)

Cumulative Index to Nursing and Allied Health Literature (CINAHL)® (1988 through November 22, 1999)

Current Contents® -- Clinical Medicine (through May 2000)

The Database of Abstracts of Reviews of Effectiveness (Cochrane Library) (through 2000 Issue1)

Dissertation Abstracts (through February 29, 2000)

ECRI Library Catalog (through May 2000)

EMBASE® (Excerpta Medica) (1974 through November 19, 1999)

Health Care Financing Administration (HCFA) Web site (through May 2000) including:

Medicare Provider Analysis and Review (MEDPAR) 1999 National Physician Fee Schedule Payment Amount File

Health Devices Alerts® (1977 through May 2000)

Healthcare Standards (1975 through May 2000)

Health Devices Sourcebase® (through May 2000)

Health Services Research Projects (HSRPROJ) (through February 29, 2000)

HealthSTAR (Health Services, Technology, Administration, and Research) (1990 through May 20, 2000)

International Health Technology Assessment© (IHTA) (1990 through May 2000)

LocatorPlus (through May 2000)

MANTIS® (through February 4, 2000)

MEDLINE® (1964 through March 8, 2000)

National Guideline Clearinghouse® (NGC) (through May 2000)

NHS Economic Evaluation Database (NHS EED) (through May 2000)

PsycINFO® (1967 through January 17, 2000)

Rehabdata (through February 2000)

TARGET® (through May 2000)

Hand Searches of Journal and Nonjournal Literature

To further ensure that our searches allowed for comprehensive identification of information, we searched Current Contents© -- Clinical Medicine on a weekly basis and routinely reviewed more than 1,600 journals and supplements maintained in our collections. Nonjournal publications and conference proceedings from professional organizations, private agencies, and government agencies were also screened.

Finally, to further ensure the comprehensiveness of our identification of relevant literature, we reviewed bibliographies/reference lists from peer-reviewed and gray literature. (Gray literature includes reports and studies produced by local government agencies, private organizations, educational facilities, and corporations that do not appear in the peer-reviewed literature.)

World Wide Web Searches

Searches of the World Wide Web were also conducted using various resources and search engines, including (but not limited to) AltaVista, NorthernLight, and Google. Selected Internet resources include:

American Academy of Orthopedic Surgeons (AAOS) http://www.aaos.org/

American Back Society http://www.americanbacksoc.org/home.htm

Centre for Clinical Effectiveness http://www.med.monash.edu.au/publichealth/cce/

Development Evaluation Committee http://www.hta.nhsweb.nhs.uk/rapidhta/main.htm

GICD Online http://www.gicd.org/index.html

HCUPnet http://www.ahcpr.gov/data/hcup/hcupnet.htm

Mecqa (benchmarking data) http://www.mecqa.com/consumer/frame.htm

Medscape http://www.medscape.com

NHS Centre for Reviews and Dissemination http://www.york.ac.uk/inst/crd/welcome.htm

North American Spine Society http://www.spine.org

Spinal Disorders http://www.spinaldisorders.org/index.html

SUM Search http://sumsearch.uthscsa.edu/searchform4.htm

The Spinal Foundation http://www.spinal-foundation.org/home.htm

TRIP Database http://www.ceres.uwcm.ac.uk/section.cfm?section=trip

MESH Terms and Search Strategy

Our literature search strategies were designed to ensure broad retrieval of information. Thus, we sought to perform inclusive literature searches that had low specificity and high sensitivity. Our search strategies are described below.

MEDLINE®, EMBASE®, CINAHL®, Current Contents® (presented in PubMed syntax)

Search for Lumbar Spinal Stenosis Literature

  1. (Spinal stenosis OR sciatica OR backache OR spinal diseases[mh] OR ischialgia OR compressive neuropathy OR spinal claudication OR neurogenic claudication OR intermittent claudication OR nerve root entrapment OR nerve root compression OR osteoarthritis OR spondylosis OR spondylolisthesis OR cauda equina OR spinal osteophytosis) AND (lumbar or lumbosacral)

  2. stenosis AND (lumbar OR exit zone OR nerve root canal OR foraminal)

  3. #1 OR #2

  4. #3 AND (meta-analysis[pt] OR meta-analysis OR clinical trial[pt] OR clinical trials[mh] OR random allocation[mh] OR random* OR double-blind method[mh] OR single-blind method[mh] OR placebo* OR multicenter study[pt] OR prospective* OR historical OR controls[ab])

  5. #4 AND (human[mh] OR preMEDLINE[sb] OR publisher[sb])

  6. #5 NOT (letter[pt] OR news[pt] OR editorial[pt] OR comment[pt] OR case report[mh] OR anesth* OR anaesth* OR "idiopathic scoliosis")

Search for Spinal Canal Measurement Literature

  1. Spinal canal/anatomy AND histology[mh] OR ((spinal canal[mh] OR spine[mh]) AND anthropometry[mh])

  2. Spinal (column OR canal) AND diameter AND measure*

  3. #1 OR #2

  4. #3 AND (human[mh] OR preMEDLINE[sb] OR publisher[sb])

  5. #4 NOT (foetal OR fetal OR embryo* OR prenatal OR neonat* OR comment[pt] OR letter[pt] OR editorial[pt] OR news[pt] OR case report[mh] OR bone density)

PsycINFO® (presented in Dialog® syntax)

  1. Back pain! AND chronic

  2. Cauda ()equina () (syndrome or compression) or sciatica or lumbago or spondylolisthesis or spinal ()osteophytosis or (spinal or lumbar or foraminal or nerve ()root ()canal) ()stenosis or intermittent ()neurogenic ()claudication

  3. s1 or s2

  4. s3 and (lumbar or lumbosacral)

  5. s4 and (quality of life/de or QOL or quality (2n)life or disability evaluation/de or disability)

Range of Searches for Conservative Treatments

Our literature searches were geared to seeking out articles on a broad range of conservative treatments. These included:

  1. Physical therapy

    • Exercises: strengthening and endurance, stretching and flexibility, flexion v. extension, isometric flexion, aerobic exercise, swimming, stationary bicycle

    • Application of heat or ice, ultrasound, massage, traction.

  2. Management of symptoms with drugs

    • Nonsteroidal anti-inflammatory, muscle relaxants, analgesics, antidepressants, narcotics

  3. Rigid brace

  4. Bed rest

  5. Epidural injection of steroids and/or anesthetics

  6. Injection of calcitonin

  7. Acupuncture

  8. Trigger-point treatment: trigger points are tender nodes of degenerated muscle tissue that may cause severe pain; treatment may involve injection of anesthetics or steroids, stretch, ice massage, ultrasound, or electrical stimulation

  9. Electrical stimulation: transcutaneous nerve stimulation (TENS), electrical muscle stimulation

  10. Facet joint injections of steroids or anesthetics

  11. Chiropractic manipulation

  12. Multidisciplinary approach

Range of Searches for Surgical Treatments

We also searched for studies on a broad range of surgical treatments. The terminology for surgical procedures is inconsistent in the literature. Therefore, items in the following list of surgical procedures are general in nature. Specific modifications or refinements of these techniques appear in many publications, and these specific modifications were located by our searches.

  1. Total radical laminectomy (all of the facet joints are removed)

  2. Standard wide decompressive laminectomy

    • Single or multiple levels

  3. Standard wide decompressive laminectomy with fusion (arthrodesis)

    • Single or multiple levels

  4. Standard wide decompressive laminectomy with fusion and instrumentation (pedicle screw fixation)

    • Single or multiple levels

    • Instrumentation may be bilateral or unilateral

  5. Partial laminectomy, hemilaminectomy

  6. Laminotomy

    • Single or multiple levels

  7. Foraminectomy

    • Unilateral or bilateral

Criteria for Retrieval of Full Articles

To prevent the potential for bias in this evidence report, we adopted specific a priori criteria for determining whether we would retrieve any article identified by our literature searches. These criteria were developed by the research team charged with preparing this report and were applied by three research analysts. Separate criteria were developed for treatments of trials of conservative treatment, surgical treatment, and studies of diagnostic modalities.

The decisions of the analysts who applied the criteria were reviewed by two other research analysts, and disputes were always resolved in favor of retrieving the full article.

Once an article was retrieved, it had to meet additional criteria before we included it in our analysis. Different inclusion/exclusion criteria had to be developed for each of our key questions. These question-specific criteria are described as part of the analysis for each question.

Patient Populations Included

Few restrictions were placed on the patient populations in clinical trials of conservative or surgical treatments that were retrieved for this analysis. All patients, or a separately reported subset of patients, had to be diagnosed specifically with lumbar spinal stenosis. Groups of patients less than 20 years of age were excluded to ensure that degenerative lumbar spinal stenosis and not congenital lumbar spinal stenosis was being examined. However, several publications mixed congenital conditions with degenerative conditions. We abstracted only the data on degenerative conditions when these data were reported separately.

Years Included

For all aspects of this evidence report, publications from 1960 to the present were retrieved.

Criteria for Retrieving Articles on Conservative Treatment

We employed the following criteria for determining whether an article on conservative treatments would be retrieved:

  1. We retrieved any controlled trial, regardless of whether the trial was randomized or concurrently controlled.

  2. Only studies with at least 10 patients in each arm of the trial were retrieved.

  3. Studies of patients with degenerative spondylolisthesis, also called pseudospondylolisthesis, were retrieved.

Criteria for Retrieving Articles on Surgical Treatments

The following inclusion criteria were used to determine whether articles on surgical treatments were retrieved:

  1. Providing that the article met the two inclusion criteria listed below, we retrieved all controlled and uncontrolled trials, regardless of whether they were prospective or retrospective and, if controlled, regardless of whether the controls were historical or concurrent, and regardless of whether patients were randomly assigned to groups.

  2. Only studies with at least 10 patients in one arm of the trial were retrieved.

  3. Studies of patients with degenerative spondylolisthesis (also called pseudospondylolisthesis) were retrieved.

Criteria for Retrieving Articles on Diagnostics

Only two general criteria were applied for retrieving articles on diagnosis of spinal stenosis. First, the article had to be on a diagnostic, and second, it had to be on patients with spinal stenosis. Surgical trials included in the analysis were reviewed for image-related data.

Criteria for Retrieving Articles on the Natural History of Lumbar Spinal Stenosis

To answer questions about the natural history of lumbar spinal stenosis, we retrieved all publications that contained (or purported to contain) measurements of the spinal canal among patients with lumbar stenosis and all trials that contained (or purported to contain) measurements of the spinal canal in normal healthy patients or patients with back pain, regardless of cause. Studies reporting spinal measurements in cadavers or skeletons were retrieved, but were not used because they did not include information describing the health status of the subjects before death. To be retrieved, studies relevant to the natural history of spinal stenosis had to contain more than 10 patients, or if the study contained more than one patient group, it had to contain more than 10 patients per group.

Electronic Data Abstraction Forms

Data from all articles that met our inclusion criteria were abstracted using electronic data abstraction forms. These forms were created using Microsoft Access. Using this software, data abstraction forms were designed for entering data about basic trial design information; patient signs, symptoms, characteristics, and treatments; reporting of treatment outcomes; surgical complications; and spinal measurements. The data abstraction forms are presented in the appendix.

The abstraction form for trial information was designed to contain information on trial design, purpose, author, year of publication, general diagnosis of patient condition, a specific description of the treatment outcomes examined, inclusion and exclusion criteria, and other important information with which to judge the quality of the trial. One record containing a unique trial identification number appears for each trial entered in the database.

The abstraction form for patient characteristics and treatments was designed to contain information on each patient group within a trial. A separate record containing a unique patient group identification number appears for each patient group within a trial. This form contains entries for specific diagnosis and type of lumbar spinal stenosis, treatment given to the patient group, division of patient groups based on postsurgical outcomes, number of surgical levels involved, number of patients in the group, specific descriptions of patient treatment, and patient characteristics such as age, dropouts, signs, symptoms, and duration of symptoms prior to treatment. This form also has entries for extent of disease based on the degree of stenosis and reported comorbidity.

Abstraction forms with similar design were created to contain information on treatment outcomes. Separate abstraction forms were needed for ability to work, back pain, back pain relief, leg pain, leg pain relief, dependency and disability, mental status, activities of daily living and physical activities, quality of life, walking capacity, and global success of treatment. These forms have entries for the patient group identification number, number of patients reporting the outcomes, time in months when the outcome was measured (followup time after treatment), mean and standard deviation of the outcome score, statistics used and p values reported, and categorical outcomes scales (usually presented as some combination of Excellent, Good, Fair, Poor, and Very Poor categories). A separate record was entered for each patient group and each followup time for which an outcome was reported.

Special forms were designed for spinal measurements, surgical complications, and surgical outcomes.

Quality of the Literature

Our analysis of the quality of the literature emphasizes study design. In particular, we focus on the following aspects of study design:

  1. Whether the trial was controlled. (This does not form part of our evaluation of trials of diagnostics, inasmuch as diagnostics are rarely tested by means of controlled trials).

  2. If the trial was controlled, how patients were assigned to groups.

  3. Whether the trial contained any confounds that prevent a reader from reaching a conclusion about the efficacy of the treatment or diagnostic studied by that trial.

  4. Whether the study reported data required to reach a conclusion about our key questions. This includes (but is not limited to) whether the study reported data that would allow us to perform de novo statistical analyses to test hypotheses related to our key questions.

Our assessment of the quality of the literature takes the form of a systematic narrative review. This assessment was performed for each question and appears in our discussion of the literature that underlies each question. Also addressed in certain appropriate questions is an accounting of the number of articles identified, retrieved, and ultimately suitable for answering each question.

Approaches to Evaluating and Combining Evidence

Our analysis of the literature comprises questions for which we were able to combine evidence from different studies, and questions with evidence that did not permit such combination. We employed quantitative methods to answer both types of questions.

Parts of questions 1 and 2 had data that permitted us to combine evidence, and we performed meta-analyses to address them. These data came from observational studies and not randomized controlled trials (RCTs). Therefore, caution is needed in interpreting these data. In these two questions, we were interested in the difference between spinal canal sizes of patients with and without certain symptoms. To perform meta-analyses on such data, we converted the mean canal size difference between groups to the effect size, Hedges' d (Hedges and Olkin, 1985). This effect size is determined by expressing the difference between two groups in standard deviation units. A 95 percent confidence interval around the effect size indicates the range into which, if the experiment were repeated a large number of times, the effect size would be 95 percent of the time. If this interval does not include zero, the effect is considered statistically significant. In some cases, a very large effect size may be considered clinically significant even if it is not statistically significant, and vice versa. We employed a fixed effects model for all of our meta-analyses.

To check the validity of the summary statistic of these models, we used two tests for heterogeneity, the Q test and each study's standardized residual. A standardized residual of >1.96 was taken to indicate that a study's effect size was heterogeneous with respect to the other studies in the meta-analysis. Results of our meta-analyses are graphically displayed in Forrest plots. In this type of plot, each effect size (including the summary statistic) is depicted as a point, surrounded by bars illustrating the 95 percent confidence interval around each study's effect size and around the summary statistic.

To further assist in interpreting the results of our meta-analyses, we depicted the results of our meta-analyses in plots that show a pair of normal curves. One of these curves represents the distribution of nondiseased (or, where appropriate, asymptomatic) patients. The mean of this curve was arbitrarily set to zero. The second curve in each plot represents diseased (or, where appropriate, symptomatic) patients. The difference between the mean of this curve and the mean of the former curve represents d, the effect size. Perhaps more importantly, the curves depict the degree of overlap in the normalized distributions of the canal sizes of the two groups of patients that are being compared in the meta-analysis. We quantified the degree of this overlap using the [union or logical sum] statistics described by Cohen (Cohen, 1988). More specifically, we computed the [union or logical sum]1 statistic, which is the percentage of nonoverlap of the two normal curves. We then subtracted this number from 100 percent to obtain the percentage of overlap between the two curves.

As a final way of facilitating interpretation of the results of our meta-analyses, we expressed their results in terms of a binomial effect size display, or BESD (Rosenthal, 1991). This display depicts the results of a meta-analysis as a 2 by 2 table in which there are 100 patients in each of the two groups being compared. The BESD also artificially dichotomizes the dependent variable and sets the number of patients in each category of this variable to 100. As such, the BESD illustrates the proportion of patients in each group who do or do not have a certain characteristic.

Some minor differences exist between the results derived from the [union or logical sum] statistic and those illustrated by the BESD. These differences have not been explored in the statistical literature, but they do make it important to display the results of both statistical procedures. We stress that these differences are minor, and one would arrive at the same conclusions using either one of these methods alone.

Our quantitative analyses of those questions with data that did not permit combination consisted of performing de novo statistical analyses of the data published in the relevant articles wherever possible. Such statistics were either computed from raw data presented in the article or from data read off of figures contained in the article. In some cases, we computed effect sizes based on the published data. For computation of effect sizes derived from dichotomous outcomes, we employed the odds ratio and natural log transformation as described by Hasselblad and Hedges (Hasselblad and Hedges, 1995). We used the method of Torgerson (1958) to analyze data from rating scales that comprised more than two categories and transformed the result of this statistic into Hedges' d. We also conducted additional de novo statistical analyses using a variety of standard tests. Because these tests are quite varied, we describe each of them in the text when used (Torgerson, 1958).

Previous Meta-analyses of Treatments for Lumbar Spinal Stenosis

Prior meta-analysis has been performed on clinical studies of treatments for lumbar spinal stenosis (Niggemeyer, Strauss, and Schulitz, 1997; Turner, Ersek, Herron et al., 1992), degenerative lumbar spondylolisthesis (Mardjetko, Connolly, and Shott, 1994), and degenerative lumbar spondylosis (Gibson, Grant, and Waddell, 1999). These meta-analyses differ in approaches to the pooling of data from across studies and in the interpretation of results. Important differences exist between our approach to the meta-analysis of lumbar spinal stenosis and these other meta-analyses. None of these meta-analyses attempted to evaluate conservative therapy for lumbar spinal stenosis or spondylolisthesis.

Turner et al. (1992) looked at the literature concerning outcomes of surgery for spinal stenosis from 1966 to 1990. Studies were included if they were original studies of patients undergoing decompressive laminectomy for lumbar spinal stenosis and appeared in English-language journals. Studies were excluded when spinal stenosis patients could not be separated from other diagnoses, outcome data were not reported, sample size was less than six, and there were no data concerning patient preoperative characteristics. Seventy-four articles met the inclusion/exclusion criteria. The authors noted the wide variation in outcome measures and created their own definitions for the outcome categories of good-to-excellent, fair, and poor. For actual analysis, the outcomes were dichotomized into good-to-excellent versus fair and poor. Thirty-one studies reported outcomes that allowed the recoding according to this good-to-excellent versus fair and poor rating scale. Studies were included regardless of study design, presence of a control group, or patient characteristics that would preclude combining data across studies. Only three studies had prospective designs. A chi-square test was used to determine whether the proportion of good-to-excellent versus fair and poor outcomes varied significantly across studies. This test was highly significant (p <0.0001), indicating significant heterogeneity of outcomes across studies. The average proportion of good-to-excellent outcomes among the studies was 64 percent, but the range was 26 percent to 100 percent. No standard deviations or confidence limits were calculated. The authors used regression analysis to examine whether specific predetermined variables could explain the large variation across studies. No significant associations between outcome and patient variables such as age, gender, neurogenic claudication, number of levels of laminectomy, or prior back surgery were found. The authors concluded that the poor scientific quality of the literature, especially major deficits in study design, analysis, and reporting, prevented the construction of a proper meta-analysis. In particular, the lack of comparative trials, especially randomized controlled trials, prohibited any attempt at addressing the question of efficacy of various therapies through meta-analysis (Turner, Ersek, Herron et al., 1992).

There are difficulties in interpreting the Turner et al. (1992) analysis. The authors do not provide a detailed description of their methods and, in particular, do not state how (or whether) they calculated effect sizes for each study. Therefore, it is not at all clear that they used meta-analytic methods to perform their analysis. Failure to use such methods could severely compromise their findings. Consequently, we will not assess their findings.

Niggemeyer et al. (1997) conducted a meta-analysis of surgical treatments for lumbar spinal stenosis, again using their own definitions for good, fair, and poor outcomes. Studies published between 1975 and 1995 in English, French, or German were included if the studies reported cases of degenerative lumbar spinal stenosis with no prior back surgery and had a minimum of seven patients. Thirty articles met the inclusion criteria. As in the meta-analysis by Turner et al., studies were included regardless of study design, presence of a control group, or patient characteristics that would preclude combining data across studies. The authors calculated a pooled weighted proportion for each of the three outcome categories within each of three surgical approaches: decompression alone, decompression plus fusion, and decompression with fusion and instrumentation. Proportions for each outcome category were then compared across surgical approaches using a Z score and pooled variance. This resulted in the calculation of nine separate Z scores (3 categories by 3 surgical comparisons). None of the Z scores were significantly different from zero. The use of uncontrolled trials in this analysis compromises its validity. The study was further compromised by the lack of any test of heterogeneity. Therefore, one cannot determine whether the trials were, in fact, combinable. Consequently, we do not discuss this analysis further (Niggemeyer, Strauss, and Schulitz, 1997).

Mardjetko et al. (1994) performed a meta-analysis of studies examining decompression and fusion for treating degenerative lumbar spondylolisthesis. Studies were included if they enrolled patients with degenerative spondylolisthesis with radicular leg pain or neurogenic claudication, were published in English between 1970 and 1993, had at least four patients, and reported primary outcomes specifically for the spondylolisthesis patients. Twenty-five papers met the inclusion criteria, and of these, 21 were classified as retrospective, nonrandomized, and uncontrolled. Outcomes were reassigned into satisfactory or unsatisfactory categories, and pooled weighted proportions were calculated. Z scores were then calculated for comparisons of decompression without fusion to decompression with fusion and decompression with fusion and instrumentation. Significant Z scores were found in favor of fusion increasing patient satisfaction and instrumentation increasing fusion rates. The authors commented on the serious methodological flaws, such as poorly defined outcome measures, lack of blinded outcome assessment, failure to provide adequate patient demographics, and lack of stratification of outcomes by diagnosis, that led to a poor meta-analysis. The meta-analysis method allowed data from poorly designed studies to dilute the data from better-quality studies. Like the Niggemeyer et al. (1997) analysis, this analysis does not provide a test of heterogeneity. For these reasons, we do not consider the results of this analysis (Mardjetko, Connolly, and Shott, 1994).

In 1999, Cochrane Review published a discussion of the difficulties of conducting a meta-analysis of randomized controlled trials on surgery for degenerative lumbar spondylosis (Gibson, Grant, and Waddell, 1999). Their analysis addressed the questions of what evidence exists concerning the clinical effectiveness of lumbar spine surgery and what evidence exists concerning alternative forms and techniques of lumbar spine surgery. All randomized and quasirandomized controlled trials identified between 1966 and 1998 and pertaining to degenerative lumbar spondylosis and the associated pathologies or clinical syndromes of back pain, instability, spinal stenosis, and degenerative spondylolisthesis were included. Fourteen RCTs for degenerative lumbar spondylosis were retrieved. Many of the trials had major defects of design, such as including heterogeneous pathologies and clinical syndromes, and the subgroups were often too small to give meaningful results. The authors mentioned that clinical outcomes were mainly crude ratings on a three- or four-point scale. The reader should note that spondylosis is a large category of spinal pathology. Differences in the number of RCTs mentioned in the present analysis and the Cochrane Review are due to separating out the specific pathologies of lumbar spinal stenosis and spondylolisthesis, as well as not including studies that did not report outcomes separately for these conditions. The Cochrane Review found no RCTs dealing with the efficacy of surgical decompression for degenerative lumbar spondylosis or spinal stenosis or of surgical fusion for degenerative spondylosis and back pain. A meta-analysis of fusion rates in trials of instrumented fusion versus fusion alone was performed. This analysis suggested that solid fusion was more likely if instrumentation was used. However, the significant heterogeneity among the trials prevented the drawing of conclusions about the results of instrumented fusion for any particular pathologic condition or about any particular instrumentation system. The authors concluded that there is a serious lack of scientific evidence supporting surgical management for degenerative lumbar spondylosis. The authors go on to say that "there is no scientific evidence on the effectiveness of any form of surgical decompression or fusion for degenerative lumbar spondylosis compared with natural history, placebo, or conservative management."

A meta-analysis of the accuracy of CT, MRI, and myelography was published in 1992 (Kent, Haynor, Larson et al., 1992). The authors wished to evaluate what was known about the diagnostic accuracy of these imaging tests for the diagnosis of lumbar spinal stenosis in adults without prior surgery. Studies were included if they contained cases of canal, lateral recess, or foraminal stenosis. Cases with entirely acute-onset radiculopathy, where all cases had herniated disk, or with fewer than 10 cases of lumbar spinal stenosis were excluded. The contribution of plain film to diagnosis was not reviewed. One hundred and sixteen articles from 1986 to 1991 were read, and only 14 were retained for analysis. A series of quality criteria were developed based on the technical quality of study procedures (index test quality, reference test quality, application of reference test) and on the proper selection of patients and unbiased interpretation of information (independence of interpretations, clinical description, cohort assembly, and sample size). Based on these criteria, an overall rating from A (best) to D (worst) was assigned to each article. All studies received a C or D rating. C studies had poor clinical descriptions but had intermediate ratings in all other categories. Studies failing the C rating received a D. The authors calculated sensitivities of 0.81 to 0.97 for MRI (four studies), 0.70 to 1.0 for CT (nine studies), and 0.67 to 0.78 for myelography (four studies) and specificities of 0.72 to 1.0 for MRI (three studies), 0.80 to 0.83 for CT (four studies), and 0.7 for myelography (one study). Based on their review of the literature, the authors concluded that any estimates of sensitivity or specificity must be considered imprecise, since the estimates are based on small sample sizes and on studies with serious methodological flaws. In particular, the imaging literature for spinal stenosis failed to provide independent corroboration with a reference standard for the diagnosis. Thirty articles were excluded due to the lack of a proper reference standard. Failure to assemble a representative cohort and failure to maintain independence between image reading and reference standards were also common problems. The applicability of these cases to general practices was reduced by a selection of patients referred for surgery. These patients will tend to have the most severe disease and increase the frequency of true-positive diagnosis. The variation in study design prevented the pooling of sensitivity and specificity estimates.

Our current assessment of studies examining the diagnostic accuracy of MRI, CT, and myelography for lumbar spinal stenosis also concludes that these studies have one or more flaws in design or reporting that adversely affect the reliability or applicability of the results.

Chapter 3. Results

The sequence of events leading to spinal stenosis was described in the introduction to this report. In this chapter, we further describe the natural history of lumbar stenosis by examining the evidence pertaining to questions 1-3, which were previously presented in the evidence model. Each of these questions pertains to a specific link in the evidence model.

Question 1 ("What is the relationship between each relevant patient characteristic and the presence and/or intensity of each of the patient signs, symptoms, and conditions of lumbar spinal stenosis?") examines the overarching link from patient characteristics to symptoms of stenosis. Examination of the issues pertinent to this link is complicated by the fact that many of the symptoms of stenosis are shared by other conditions, such as herniated spinal disks or vascular claudication. For this reason, this link is broken down into two shorter links. Question 2 ("Which relevant patient characteristics are associated with an increased likelihood of focal narrowing of the spinal canal?") examines the link between patient characteristics and the development of spinal stenosis. Question 3 ("What is the relationship between degree of stenosis and the presence and/or intensity of each of the signs, symptoms, and patient conditions?") examines the link between having stenosis and developing each of the signs and symptoms of the condition.

Question 1What is the relationship between each relevant patient characteristic and the presence and/or intensity of each of the patient signs, symptoms, and conditions of lumbar spinal stenosis?

Certain patients may be more vulnerable to the effects of spinal stenosis. In this section, we first examine the available evidence concerning whether there is a relationship between canal diameter and the development of back pain, claudication, or radicular pain/sciatica. We then examine data on whether there is a relationship between patient age or sex or the presence of a herniated disk and the development of stenotic symptoms.

In answer to this first question, we consider only studies that reported some measurement of spinal diameter and that reported data on whether patients had or did not have a particular symptom. In many cases, this limits our treatment of the data to case-control studies, because they were the only studies to report symptomology. Due to a relative lack of information, parts of our answer to this question take the form of a systematic narrative review. While this review reflects the results of and critiques currently available studies, another purpose of it is to illustrate where there is a need for data.

Relationship Between Spinal Canal Diameter and the Presence of Back Pain, Claudication, or Radicular Pain/Sciatica

Although it is commonly believed that the development of the symptoms of lumbar spinal stenosis are related to the size of the spinal canal (Clark, 1969; Postacchini and Perugia, 1991), very little evidence is available to support this belief. While more severe degeneration and degree of slippage are often associated with more severe symptoms, the extent of neural deficit may depend more on the degree of neural compression (Postacchini and Perugia, 1991). If the canal is wide and the stenosis is mild, there is relatively little or no compression of the dural sac. When the spinal cord is normally sized, stenosis may lead to compression of the dural sac. If the spinal canal is smaller than normal, the sac may be severely compressed even if there is only slight narrowing or vertebral slippage. When the canal is small, a developing stenosis may tend to become symptomatic earlier and is likely to present more severe deficits than in a wider canal (Clark, 1969).

There is an important distinction between canal size and stenosis. Canal size refers to the size of the healthy, nonstenotic canal. On the other hand, stenosis refers to a focal narrowing. When canal diameter is reported for stenotic patients, the measurement generally refers to canal size at the narrowest point. When canal size is described in epidemiologic or anthropologic studies such as those described in this section of this report, it refers to canal size at predefined levels. Although the spinal canal tends to narrow with age (Twomey and Taylor, 1988), the point at which normal narrowing becomes stenosis is imprecise.

The spinal canal is usually measured in one of three ways. The most common is to measure the midsagittal (anterior-posterior) diameter (Panjabi, Duranceau, Goel et al., 1991). This is the simple diameter of the canal at the widest point of the sagittal plane, and is measured in centimeters or millimeters. The sagittal diameter is the smallest dimension of the canal (Alvarez and Hardy Jr, 1998). The interpedicular distance, which is the diameter of the canal in the frontal plane, is usually considerably wider. Midsagittal diameter averages approximately 13.4 to 20.4 mm, while interpedicular distance is normally 19 to 27 mm (Postacchini, Pezzeri, Montanaro et al., 1980a). These measurements vary by spinal level as well as by the age, race, and size of the individual patient.

In some cases, the cross sectional area of the canal is measured. This may be a more informative measure if stenosis is occurring primarily in the lateral recesses. Finally, some researchers report the canal size in a ratio with the size of the vertebral body (Jones and Thomson, 1968). This ratio takes into account the observation that differently sized people have different sizes of canal. This reflects an attempt to define a "normal" canal size in relation to the size of the subject. It is not known whether a small person with a correspondingly small canal is less susceptible to spinal stenosis than a large person with a canal of similar size.

Spinal measurements are normally expressed individually for one or more discrete spinal levels. Spinal canal measurements of patients with stenosis may convey information about the global narrowing associated with degenerative changes on the spinal canal, but may not reflect the extent of focal narrowing in the stenotic area of the canal. When an osteophyte or an enlarged ligamentum flavum impinges on a spinal canal, the canal size at this point may be considerably less than in the surrounding area of the canal. This focal narrowing may not be reflected in the published measurements of canal size. The precise area of stenosis that is responsible for the patient's symptoms may be difficult to determine from diagnostic visualization (see section on Diagnostic Tests).

To investigate the relationship between spinal canal size and symptoms of stenosis, we conducted a series of meta-analyses of studies that compared spinal canal sizes among patients with back pain, claudication, or radicular pain to those of asymptomatic age-matched controls. The methods we used for our meta-analysis are described in the section entitled "Approaches to Evaluating and Combining Evidence" (see Chapter 2).

Table 6. Spinal Measurements in Patients with Symptoms of Stenosis
StudyPatient descriptionUnitsnMeasurement
Back Pain
Hamanishi, Matukura, Fujita et al., 1994Healthy controls age 42±15Cross-sectional area of the dural tube (mm2)
L211213±48
L2/L329170±39
L316192±39
L3/L431157±39
L420170±43
L4/L536153±42
L522153±54
L5/S132129±55
S11598±53
Low back pain patients age 44±17
L24168±51
L2/L332144±31
L35189±39
L3/L437135±45
L421143±67
L4/L550130±51
L521130±66
L5/S147113±52
S115103±62
Hultman, Saraste, and Ohlsen, 1992Healthy controlsL3 sagittal diameter, mm1815.9±1.6
Patients with acute back pain3415.6±1.9
Patients with chronic back pain2115.1±1.9
Healthy controlsL3 canal area, mm218240.4±42.8
Patients with acute back pain34240.7±48.8
Patients with chronic back pain21231.4±52.3
Anderson, Adcock, Chovil et al., 1988Healthy controlsOblique parasagittal diameter at L5-S1, cm331.72±0.23
Patients who have missed work because of back pain161.60±0.25
Drinkall, Porter, Hibbert et al., 1984Controls with no history of back painSagittal diameter, cm132
L11.52±0.079
L21.49±0.077
L31.47±0.078
L41.44±0.078
L51.44±0.074
Patients with back pain193
L11.48±0.073
L21.46±0.072
L31.43±0.074
L41.40±0.075
L51.41±0.073
Macdonald, Porter, Hibbert et al., 1984Coal miners who have taken various amounts of time off work because of back painSagittal diameter, cm
No time off16
L11.55±0.08
L21.50±0.07
L31.49±0.07
L41.47±0.06
L51.50±0.08
8-16 weeks off20
L11.53±0.07
L21.50±0.09
L31.46±0.08
L41.43±0.06
L51.45±0.08
16-24 weeks off17
L11.56±0.07
L21.52±0.07
L31.46±0.09
L41.43±0.09
L51.46±0.09
26-52 weeks off17
L11.51±0.08
L21.47±0.08
L31.43±0.09
L41.39±0.09
L51.39±0.10
>52 weeks off4
L11.45±0.05
L21.44±0.04
L31.41±0.03
L41.36±0.04
L51.37±0.06
Claudication
Hamanishi, Matukura, Fujita et al., 1994Healthy controls age 62±11Cross-sectional area of the dural tube (mm2)
L26194±41
L2/L312169±38
L37174±46
L3/L413148±33
L410165±39
L4/L514144±35
L510157±57
L5/S113139±40
S17101±57
Patients with intermittent claudication age 62±12
L213119±38
L2/L338104±35
L319106±52
L3/L44179±28
L43196±41
L4/L55261±28
L52983±41
L5/S14659±34
S12147±38
Uden, Johnsson, Jonsson et al., 1985Control age 71±7Sagittal diameter (mm)11
L318.3±2.8
L415.9±4.1
L516.0±3.3
Claudication age 68±523
L315.6±7.2
L410.8±6.5
L514.0±5.6
Radicular Pain/Sciatica
Hamanishi, Matukura, Fujita et al., 1994Healthy controls age 42±15Cross-sectional area of the dural tube (mm2)
L211213±48
L2/L329170±39
L316192±39
L3/L431157±39
L420170±43
L4/L536153±42
L522153±54
L5/S132129±55
S11598±53
Patients with radicular pain age 45±16
L212193±58
L2/L335131±29
L319166±50
L3/L451128±40
L445147±62
L4/L579116±59
L551133±54
L5/S175107±65
S13591±54
Uden, Johnsson, Jonsson et al., 1985Control age 71±7Sagittal diameter (mm)11
L318.3±2.8
L415.9±4.1
L516.0±3.3
Sciatic pain age 67±633
L318.4±3.3
L414.2±4.5
L514.9±4.3

For the sake of clarity, the measurements used in the meta-analyses are indicated with boldface type.

In the meta-analyses for this first question, we compared the canal diameters of patients with and without back pain and patients with and without claudication. The data we used in this meta-analysis are provided in Table 6. Also shown in this table are data from patients with and without radicular pain/sciatica (for reasons discussed below, these data could not be meta-analytically assessed). Although all of the patients in these studies have at least one symptom associated with spinal stenosis, they do not all have stenosis.

Many of the relevant studies provided spinal canal measurements at more than one level or in more than one set of units. If a study provided more than one unit of measurement, sagittal diameter was preferentially used in our meta-analyses. Similarly, if a study provided canal size at more than one level, then we used the size at L4 in our meta-analyses. This level was reported by all studies that reported multiple levels and is between L3 and L5, each of which was the only level reported by two studies (Anderson, Adcock, Chovil et al., 1988; Hultman, Saraste, and Ohlsen, 1992). Thus, L4 is the most commonly reported level. If canal measurements were given for more than one patient group, then we used the patient group with the most severe symptoms in our meta-analyses, as long as that patient group included at least 10 patients. For the sake of clarity, the measurements used in the meta-analyses are indicated with boldface type.

Table 7. Summary of Meta-Analysis of Differences in Spinal Canal Diameter Between Patients With and Without Back Pain
AuthorYearGroup 1 1 nGroup 2 ndLower CLUpper CLp-valueStandardized Residual
Hamanishi, Matukura, Fujita et al., 199419942120−0.468−1.0890.1530.1400.243
Hultman, Saraste, and Ohlsen, 199219922118−0.443−1.0800.1940.1730.315
Drinkall, Porter, Hibbert et al., 19841984193132−0.524−0.749−0.298<0.00010.270
Anderson, Adcock, Chovil et al., 198819881633−0.499−1.1040.1060.1060.143
Macdonald, Porter, Hibbert et al., 198419841716−1.014−1.739−0.2890.006−1.321
Qp-value of QOverall dLower CLUpper CLp-value of d
1.820.769−0.541−0.726−0.357<0.0001
1

Group 1 denotes patients with back pain, and Group 2 denotes those without back pain.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3660_F011.jpg.

   Figure 11. Forrest Plot of Differences in Spinal Canal Diameter in Patients With and Without Back Pain

Five studies compared spinal canal measurements among patients with or without back pain (Anderson, Adcock, Chovil et al., 1988; Drinkall, Porter, Hibbert et al., 1984; Hamanishi, Matukura, Fujita et al., 1994; Hultman, Saraste, and Ohlsen, 1992; Macdonald, Porter, Hibbert et al., 1984). Differences in spinal canal measurements between patients with or without back pain were statistically significant in two studies (Drinkall, Porter, Hibbert et al., 1984; Macdonald, Porter, Hibbert et al., 1984). The summary effect size for the meta-analytically combined studies was statistically significant (d = −0.541, p <0.0001). Neither test for heterogeneity was significant (Q = 1.82, p = 0.769, largest standardized residual = −1.321). Effect sizes of the individual studies, the 95 percent confidence intervals around them, and the appropriate meta-analytic statistics are presented in Table 7 and illustrated graphically with a Forrest plot in Figure 11. The effect sizes are negative because spinal canals of patients with back pain tend to be smaller than those of control patients.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3660_F012.jpg.

   Figure 12. Overlap of the Distributions of Normalized Canal Diameters in Patients With and Without Back Pain

Table 8. Binomial Effect Size Display: Canal Size in Patients With and Without Back Pain
Small Canal"Normal" Canal
No Pain3763
Back Pain6337
Figure 12 shows the normalized distributions of the canal sizes among patients with and without back pain. In this figure, the mean of the distribution of canal sizes among patients with no back pain is set to zero. The mean canal size among patients with back pain is 0.541 standard deviation units less than the mean canal size of control patients without pain. While the difference between means is statistically significant, there is a 64.8 percent overlap between the distributions of the spinal canal diameters of patients with and without back pain. The overlap of the two distributions demonstrates that although there is a statistically significant relationship between spinal diameter and back pain, this relationship is not perfect. The magnitude of the relationship between canal diameter and back pain can also be illustrated by converting the summary d statistic into a correlation coefficient. This yields a moderately low correlation coefficient of −0.261. When this correlation coefficient is used to construct a binomial effect size display (BESD, Table 8), we find that approximately 1.7 times as many patients with back pain have small spinal canals than age-matched control subjects without back pain.

We caution that even the modest effect obtained in our meta-analysis may be an overestimate of the magnitude of the relationship between pain and canal size. This is because the studies in this analysis may not distinguish a congenitally narrow canal from focal narrowing of the canal. It is possible that some of the patients in these studies had focal spinal stenosis. If the spinal canal measurements in these patients were taken at the level of stenosis, this would lead to a smaller mean canal size for that patient group. If patients with back pain are more likely to have stenosis than those without back pain, this would introduce bias toward finding smaller canals in the pain patients. The relationship between stenosis and back pain is discussed in question 3 ("What is the relationship between degree of stenosis and the presence and/or intensity of each of the signs, symptoms, and patient conditions?").

The fact that patients with back pain tend to have small canals does not allow one to conclude that patients with small canals are more prone to have back pain. This is because, as with all case control studies, the authors were forced to artificially set the prevalence of pain during the enrollment of patients into these studies (this artificiality is typically required to ensure there are enough cases and controls in the study to obtain appropriate statistical power). One probable consequence of this is that the proportion of patients in these studies with back pain is probably much higher than it is in the general population. This means that calculations of predictive values or relative risks cannot be performed on these data, which, in turn, means that it is not possible to use these data to determine whether patients with small canals are prone to have back pain.

This caution also applies to the BESD; although it is in the form of a 2 by 2 table, one cannot use the data in it to compute prevalence. This caution concerning the BESD is reinforced by the fact that this display artificially sets the prevalence of pain to be 50 percent.

In our next meta-analysis, we sought to determine the relationship between the presence or absence of neurogenic claudication and spinal canal diameter. Only two studies compared spinal canal measurements among patients with claudication to measurements in healthy controls (Hamanishi, Matukura, Fujita et al., 1994; Uden, Johnsson, Jonsson et al., 1985). Therefore, the results of this meta-analysis are to be viewed as illustrating a trend in currently available data, and not as providing a definitive estimate of the magnitude of the relationship between these two variables. The raw data from the two studies that contributed data to this meta-analysis are shown in Table 6.

Table 9. Summary of Meta-Analysis of Differences in Spinal Canal Diameters of Patients With and Without Claudication
AuthorGroup 1 1 nGroup 2 ndLower CLUpper CLp-valueStandardized Residual
Hamanishi, Matukura, Fujita et al., 19943110−1.669−2.468−0.8700.00004−1.467
Uden, Johnsson, Jonsson et al., 19852311−0.850−1.597−0.1040.0261.467
Qp-value of QOverall dLower CLUpper CLp-value of d
2.150.142−1.232−1.777−0.6860.00001
1

Group 1 denotes patients with claudication, and Group 2 denotes those without claudication.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3660_F013.jpg.

   Figure 13. Forrest Plot of Differences in Spinal Canal Diameters in Patients With and Without Claudication

Differences in canal diameter between patients with and without claudication proved to be statistically significant in both studies. When data were combined, neither test of heterogeneity was statistically significant (Q = 2.15, p = 0.142, standardized residuals = ±1.467). The effect size for the combined groups was statistically significant (d = −1.23, p = 0.00001). Effect sizes and the 95 percent confidence intervals around them can be found in Table 9 and are illustrated graphically with a Forrest plot in Figure 13.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3660_F014.jpg.

   Figure 14. Overlap of the Distributions of Normalized Canal Measurements in Patients With and Without Claudication

Table 10. Binomial Effect Size Display: Canal Size in Patients With and Without Claudication
Small Canal"Normal" Canal
No Claudication2476
Claudication7624
There is a 36.8 percent overlap between groups, as illustrated in Figure 14. The binomial effect size display in Table 10 suggests that if the results of this meta-analysis can be generalized to the population at large, a person with claudication is approximately three times more likely to have a small spinal canal than a healthy person with no claudication.

Similar to the preceding analysis, data on the prevalence of neurogenic claudication are not available. Therefore, these data cannot be used to determine whether canal size predicts the development of claudication.

Next, we compared canal diameters among patients with and without sciatica or radicular pain. As in the preceding analysis, only two studies (Hamanishi, Matukura, Fujita et al., 1994; Uden, Johnsson, Jonsson et al., 1985) contributed data to this meta-analysis. The raw data from the two studies that contributed data to this meta-analysis appear in Table 6.

Examination of these two studies revealed that both had difficulties with experimental designs that would have made the results of a meta-analysis difficult to interpret. In both studies, patients with different symptoms were divided into different mutually exclusive groups. In the study by Hamanashi, 16 of 53 patients with radicular pain also had claudication (Hamanishi, Matukura, Fujita et al., 1994). Some or even many of the patients in the claudication group in the study by Uden may also have had radicular pain. By stratifying patients in this manner, patients with claudication (which, as shown above, is associated with smaller canal diameters) and radicular pain were eliminated from the sciatica groups, leading to bias in favor of the radicular pain groups having larger canals. For this reason, additional studies are necessary to quantify the relationship between canal size and radicular pain.

Relationship Between Patient Age and Symptoms of Spinal Stenosis

Next, we examined the relationship between patient age and symptoms of lumbar stenosis. Due to a lack of extensive research, our approach to determining the relationship between age and symptoms is, by necessity, a narrative review. Epidemiological surveys show that in the United States, low back pain is most prevalent between the ages of 55 and 64 (Deyo and Tsui-Wu, 1987). The proportion of this pain that is due to stenosis is not known. Initial onset of pain is most commonly reported in the second decade and remains high through the fourth decade, after which it declines (Deyo and Tsui-Wu, 1987). Lumbar instability, which correlates with back pain, is significantly higher among patients under the age of 45 (Friberg, 1989).

A 1998 report states that disability as assessed using the Oswestry index (a measure of disability from back pain, a score of 100 meaning the patient is completely disabled) correlates significantly with age among patients with stenosis (Hurri, Slatis, Soini et al., 1998). Patients between the ages of 41 and 55 years had a mean Oswestry index of 29.7, while patients aged 56 and higher had a mean score of 41.3. This index may include other disabilities that may relate to age but not to spinal stenosis.

Relationship Between Patient Sex and Symptoms of Spinal Stenosis

Although the incidence of disk degeneration in men and women is similar (Lawrence, 1969), men tend to have more severe degeneration. The same survey reports an association between extent of lumbar disk degeneration and severity of back, leg, and hip pain and incapacity. Presence of spondylolisthesis was associated with increased likelihood of back pain in women (p = 0.044), but not in men (Virta and Ronnemaa, 1993). A significant interaction was found between spondylolisthesis and sex, suggesting that the association between spondylolisthesis and back pain is sex dependent (Virta and Ronnemaa, 1993). This association is not explained by the slightly greater degree of spondylolisthetic slip found in women in this study.

Relationship Between Presence of a Herniated Disk and Symptoms of Spinal Stenosis

Spinal stenosis is frequently the result of disk degeneration. Herniated disks are also a result of disk degeneration. For this reason, it is not surprising that herniated disks are often found among patients with spinal stenosis. Of the 35 studies retrieved that reported this information, 692 out of 2,327 (29.7 percent) patients treated for lumbar spinal stenosis also had herniated or protruding disks (see Evidence Table 25*). The range of disk abnormalities among these studies was rather broad; 2.9 percent to 100 percent of stenosis patients also had disk abnormalities (studies that used the presence of herniated disks as part of their inclusion or exclusion criteria were not included in this calculation). While we can conclude that there is an association between herniated disks and spinal stenosis, current data do not permit us to provide a precise estimate of the magnitude of that association or to provide evidence that one causes the other. Narrowing of the spinal canal due to herniated or protruding disks is considered a separate entity from spinal stenosis (Amundsen, Weber, Lilleas et al., 1995) but may contribute to the overall symptomology of the condition.

Summary

Patients with back pain or claudication tend to have narrower spines than asymptomatic patients. This suggests, although it does not prove, that patients with congenitally narrow spines may be more prone to developing symptoms of stenosis, especially back pain and claudication. Increased patient age and the presence of herniated disks may also contribute to the development of back pain and other symptoms of stenosis. The strength of these relationships and the exact ages at which patients are most likely to develop symptoms cannot be determined from the information available. There is some evidence that women with spondylolisthesis are more likely to experience pain than are men.

Question 2Which relevant patient characteristics are associated with an increased likelihood of focal narrowing of the spinal canal?

Certain patients may be more likely to develop spinal stenosis. In this section, we examine the available evidence linking patient characteristics to developing a symptomatic narrowing of the spinal canal.

As with the preceding question, we consider only studies that reported some measurement of spinal diameter and that reported data on whether patients had or did not have a particular symptom. In some cases, this limits our treatment of the data to case-control studies, because they were the only studies to report symptomology. Due to a relative lack of information, parts of our answer to this question take the form of a systematic narrative review. While this review reflects the results of and critiques currently available studies, another purpose of it is to illustrate where there is a need for data.

Relationship Between Nonstenotic Canal Diameter and Development of Focal Spinal Stenosis

Table 11. Spinal Measurements of Patients With Stenosis and in Control Subjects
StudyPatient descriptionUnitsnMeasurement
Prasartritha, Suntisathaporn, Vathana et al., 1997Controls with no back painMidsagittal diameter (Range includes all spinal levels)23Mean 15.67 Range 13.4-17.55
Patients with stenosis 123Mean 13.88 Range 11.6-16.7
"Overall" midsagittal diametersignificantly different between the two groups, p<0.05
Kim and Lee, 1995Healthy controlsL4-5 sagittal diameter, mm2520.3±1.8
Isthmic apondylolisthesis5520.3±1.8
Degenerative spondylolisthesis4819.2±1.8
Yoshida, Shima, Taniguchi et al., 1992Herniated disk patients mean age 20.2Cross sectional area at the level of the L4-5 facet joint10356.7±90.8
Central stenosis23262.8±62.8
Lateral stenosis3302.9±72.4
Spondylolisthesis7223.6±71.4
Mean age for all stenosis patients 63.8
Schonstrom, Bolender, and Spengler, 1985Healthy controlsSagittal diameter of the dural sac, mm
L2-L3514.6±1.2
L3-L4713.9±1.6
L4-L51014.6±1.5
L5-S1516.4±3.1
Average2714.8±3.1
Patients with surgically confirmed central stenosis 2249.1±3.3, range 4.7-19.0
Healthy controlsTransverse area of the dural sac, mm 2
L2-L35177±31
L3-L47176±53
L4-L510171±38
L5-S15193±73
Average27178±50
Patients with surgically confirmed central stenosisb2489.6±35.1, range 35-163
1

Symptomatic stenosis was defined as a clinical entity including back pain, intermittent claudication, sensory parasthesia, and weakness in the lower extremities. It is unclear whether all patients had all symptoms. All 23 patients had surgery for stenosis. Imaging criteria, if any, was not stated.

2

Seven stenotic patients were described as stenotic when in fact they had a protruding disk. Ten additional patients were excluded because no scans were available. N indicates number of measurements at different lumbar levels, not number of patients. There were 13 control subjects and 24 patients with stenosis.

Numbers used to calculate effect sizes appear in bold type.

The results of the preceding analyses suggest that some relationship exists between symptoms and spinal canal size. Therefore, we conducted a meta-analysis to explore the relationship between focal spinal canal narrowing and symptomatic spinal stenosis. We located four studies that compared measurements of spinal canals among patients with stenosis to those of control subjects. They are listed in Table 11. One of these studies (Schonstrom, Bolender, and Spengler, 1985) provided only one measure in common for control and stenotic patients. This measure, mean transverse area of the dural sac, was calculated from the transverse area at the dural sac at four levels of the lumbar spine. However, none of the measurements at any level included data from all 13 control subjects. At two of the levels, data from only five subjects were used. Nearly half (27 of 52) of the measurements that could have been used to calculate mean transverse area were omitted. No data were provided on the number of levels measured for the stenosis patients. In addition, 7 of the 24 patients said to have stenosis in this study actually had protruding disks. Because of these issues of poor design and reporting, this study was excluded from the meta-analysis. For the remaining studies, we chose data for incorporation into the meta-analysis in the same manner as described in the section entitled "Relationship Between Spinal Canal Diameter and the Presence of Back Pain, Claudication, or Radicular Pain/Sciatica." Numbers used to calculate effect sizes appear in Table 11 in bold type.

Table 12. Summary of Meta-Analysis of Differences in Canal Diameter Between Patients With and Without Stenosis
AuthorYearGroup 1 1 nGroup 2 ndLower CLUpper CLp-valueStandardized Residual
Prasartritha, Suntisathaporn, Vathana et al., 199719972323−0.587−1.1770.0030.0510.045
Kim and Lee, 199519954825−0.605−1.098−0.1110.016−0.045
Qp-value of QOverall dLower CLUpper CLp-value of d
0.0020.964−0.597−0.976−0.2190.002
1

Group 1 denotes patients with stenosis, and Group 2 denotes those without stenosis.

The 1992 study by Yoshida (Yoshida, Shima, Taniguchi et al., 1992) incorporated two potential confounding factors that may influence interpretation of the results. First, there was an age difference between patient groups. Patients with stenosis had a mean age of 63.8 years. They were compared to a control group with a mean age of 20.2 years. Because the spinal canal tends to narrow with age (Twomey and Taylor, 1988), the younger controls may have larger canals than an age-matched control group might have had. This would have the effect of introducing bias in favor of finding a significant difference between groups. Also, the control patients all had herniated disks. Patients with disk problems may tend to have smaller spinal canals than do healthy controls (Heliovaara, Vanharanta, Korpi et al., 1986; Ramani, 1974; Ramani, 1976; Winston, Rumbaugh, and Colucci, 1984). If this is the case, this would bias the analysis against finding a significant difference between groups. The effect size from the study by Yoshida (−1.271) is larger (albeit not significantly so) than those from two other studies (Kim and Lee, 1995; Prasartritha, Suntisathaporn, Vathana et al., 1997; see Table 12). The extent to which the age differences between the stenosis patients and the control subjects contributed to this effect cannot be determined. Because of this evidence of confounding, the Yoshida study was excluded from the meta-analysis. Therefore, only two studies compared spinal canal measurements among patients with stenosis to measurements in healthy controls. Therefore, the results of this meta-analysis are to be viewed as illustrating a trend in currently available data, and not as providing a definitive estimate of the magnitude of the relationship between these two variables.

The 1995 study by Kim and Lee had two groups of patients with spondylolisthesis. Because isthmic spondylolisthesis is outside the scope of this assessment, the spinal measurements of patients with degenerative spondylolisthesis were used (Kim and Lee, 1995).

Prasartritha et al. (1997) did not provide a measure of dispersion among measurements of sagittal diameter (Prasartritha, Suntisathaporn, Vathana et al., 1997). Because of this, an exact effect size cannot be calculated. However, the report stated that the mean canal size was significantly different (p <0.05) between groups by t-test. Because the number of patients in each group was known, we were able to calculate the value of Student's t, assuming a p-value of 0.049. This translates to an effect size of at least −0.587. If the actual p-value was smaller (as it may well be), Student's t, and therefore Hedges' d, would be larger. The overall effect size calculated here is therefore a minimum value. How much further it may actually be from zero cannot be determined from the data provided.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3660_F015.jpg.

   Figure 15. Forrest Plot of Differences in Spinal Canal Between Patients With and Without Spinal Stenosis

Effect sizes (Hedges' d) were calculated as described in Chapter 2 of this evidence report. Effect sizes for each study, the summary statistic, and the 95 percent confidence intervals around them can be found in Table 12, and are illustrated graphically with a Forrest plot in Figure 15.

When data from all groups were combined, the test of heterogeneity was not statistically significant (Q = 0.002, p = 0.964, standardized residuals = ±0.045), and the summary statistic, −0.597, was statistically significant (p = 0.002).

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3660_F016.jpg.

   Figure 16. Overlap between Normalized Canal Measurements in Patients With and Without Stenosis

Table 13. Binomial Effect Size Display: Canal Size in Patients With and Without Stenosis
Small Canal"Normal" Canal
No Stenosis3664
Stenosis6436
While the difference between means is statistically significant, there is 61.97 percent overlap between groups, as illustrated in Figure 16. The correlation between canal size and the presence and absence of stenosis is significant but low (r = −0.286). The binomial effect size display shown in Table 13 suggests that if the results of this meta-analysis can be generalized to the population at large, a person with stenosis is approximately 1.78 times more likely than a healthy person with no stenosis to have a small spinal canal. Although people with symptomatic stenosis tend to have smaller spinal canals than people without stenosis, the presence of a small canal is not necessarily predictive of stenosis.

Relationship Between Nonstenotic Canal Cross-Sectional Area and Development of Focal Spinal Stenosis

Besides canal diameter, other aspects of the spinal canal may make an individual patient more prone to develop spinal stenosis. A well-documented anatomical variation known as the trefoil-shaped canal may predispose the patient to problems associated with lateral stenosis (Epstein, Epstein, and Lavine, 1962; Schatzker and Pennal, 1968). This variation is characterized by a smaller sagittal diameter and deeper lateral recesses. The association between trefoil canal and spinal stenosis may be merely coincidental (Eisenstein, 1980). A study of trefoil canal in South African skeletons found that while stenosis and trefoil canal may coexist in the same skeleton, they are often present at different vertebrae. Other attempts have been made to identify spinal characteristics that are predictive of future stenosis (MacGibbon and Farfan, 1979). No information is available quantifying the accuracy of these predictions.

Relationship Between Patient Age and Development of Focal Spinal Stenosis

Disk degeneration, spinal instability, and hypertrophy of the facet joints accompany the aging process (Prescher, 1998; Twomey and Taylor, 1988). Degeneration normally begins in the second decade (Mirkovic, Garfin, Rydevik et al., 1992), and its prevalence increases with age (Powell, Wilson, Szypryt et al., 1986). By the age of 40 years, 80 percent of male and 65 percent of female disks are moderately degenerated (Mirkovic, Garfin, Rydevik et al., 1992). Increasing age also correlates with increasing spinal osteoarthritis (Magora and Schwartz, 1978). While increased disk degeneration and spinal osteoarthritis implies increased stenosis, this has not been specifically demonstrated.

A 1988 study of cadavers found a decline in lumbar spinal canal anterior-posterior diameter with age that is significant in males (t = 3.95, df = 4, p <0.01) but not in females (Twomey and Taylor, 1988). This study looked at lumbar spines of men and women age 25 to35 and > 65. Our meta-analyses indicate an association between canal size and back pain, claudication, and stenosis. A narrower spinal canal may render the patient more vulnerable to developing symptomatic stenosis in cases of minor additional trauma or pathology (Clark, 1969; Twomey and Taylor, 1988). The narrowing of the canal with age may therefore contribute to the development of symptomatic stenosis.

There is a correlation between presence of calcium deposits and decreased elastic/collagenous fiber ratio in the ligamentum flavum and age among patients with degenerative stenosis (Schrader, Grob, Rahn et al., 1999). Patients under the age of 60 had 0.032 percent calcification, while patients between the ages of 70 and 75 had 0.336 percent calcification. This correlation was statistically significant (p <0.05) by the Mann-Whitney-Wilcoxon test. Few patients without degenerative stenosis had these changes. No correlation was reported between degree of change and degree of stenosis or symptoms.

Relationship Between Patient Weight and Development of Focal Spinal Stenosis

A 1976 survey indicates that increased weight correlates with increased incidence of spinal osteoarthritis (Magora and Schwartz, 1978). Only 53 percent of slim patients (weight in kg less than height in cm minus 110) had spinal osteoarthritis, compared to 59 percent of average patients (weight in kg equal to height in cm minus 90-110), 69 percent of heavy patients (weight in kg equal to height in cm minus 80 to 90) and 94 percent of very heavy patients (weight in kg more than height in cm minus 80). Overweight patients may also be more likely to have disk degeneration than normal patients (Parkkola, Rytokoski, and Kormano, 1993). A 1991 survey stated that body mass index is a predictor of disk degeneration according to univariate and multivariate analyses, but did not provide a description of the statistical analysis utilized to reach this conclusion (Symmons, van Hemert, Vandenbroucke et al., 1991). The clinical significance of these findings is unclear. Disk degeneration is a common precursor to stenosis, but it occurs in all patients to some extent, and does not inevitably lead to stenosis. No direct evidence in support of a link between body weight and development of stenosis is available.

Relationship Between Osteoarthritis and Development of Focal Spinal Stenosis

Among patients with severe osteoarthritis of one or both hips, there may be an increased frequency of moderate or severe degenerative changes as noted on radiographs in the lumbar spine. Among patients aged 40 to 59, 36 percent of patients with osteoarthritis of the hips had such changes, compared to only 4 percent of control patients matched for age, sex, and occupation. Among patients aged 60 to 69, 53 percent of those with osteoarthritis had moderate or severe degenerative changes of the lumbar spine, compared to 27 percent of matched control patients (Brewerton, 1983). No mention was made of presence or absence of stenosis or back pain, claudication, or other symptoms. In the seven studies giving this information, 42 out of 903 (4.7 percent) patients with lumbar spinal stenosis had osteoarthritis or other hip disease (see Evidence Table 25). Underreporting of this important comorbidity restricts our ability to determine its prevalence or clinical relevance. Whether hip and spine problems are both parts of a larger disease or whether mechanical changes resulting from hip disease lead to spinal degeneration is not known.

Relationship Between Diabetes and Development of Focal Spinal Stenosis

Although some skeletal defects may be more prevalent among patients with diabetes than in the general population, we were unable to locate any evidence on whether patients with diabetes are more prone to developing lumbar spinal stenosis. A cross-sectional epidemiological study published in 1994 found no significant difference in the prevalence of spondylolisthesis between diabetic and nondiabetic adults (Virta, Ronnemaa, and Laakso, 1994).

Relationship Between Type of Patient Employment and Development of Focal Spinal Stenosis

A 1993 report found no correlation (r= 0.07, n = 46) between work index (0-8 scale based on amount of lifting required in four categories of motion) and degree of spondylolisthesis (Virta and Ronnemaa, 1993). No numerical data were provided in support of this statement. In contrast, Lawrence found that coal miners and men doing outdoor or heavy manual work had earlier and greater degrees of disk degeneration than did business, professional or textile workers (Lawrence, 1969). Fifty percent of miners and 40 percent of outdoor workers aged 35-44 had disk degeneration of grade 2 or higher, compared to 14 percent of business and professional workers. Total incidence of disk degeneration for all age groups was significantly lower (p <0.01) in business, professional, and textile workers than in industrial, manual, trade, and outdoor workers or miners. The statistical test used was not stated. No such differences were found in women. Extent of degeneration as indicated by height of the intervertebral space (p = 0.003, n = 46) is associated with a greater degree of spondylolisthetic slip (Virta and Ronnemaa, 1993).

Summary

Patients with spinal stenosis tend to have narrower spines than asymptomatic patients. This suggests, although it does not prove, that patients with congenitally narrow spines may be more prone to developing focal stenosis. The shape of the patients' spinal canal may also contribute to the development of focal narrowing of the canal and symptomatic stenosis, but the evidence for this is weak. There is some evidence that disk degeneration, narrowing of the spinal canal, and degenerative changes in the spinal ligaments contributing to stenosis and instability increase with age. However, the strength of this relationship and the age at which stenosis is most likely to occur cannot be determined from the available information.

Heavier patients may be more likely to develop the degenerative changes leading to stenosis. Similarly, patients with osteoarthritis of the hips as well as patients who perform heavy labor tend to have more disk degeneration than other patients. While these data suggest a relationship between these characteristics and the development of stenosis, there is no direct evidence of a causal relationship. We located no evidence indicating a relationship between diabetes and spinal stenosis, and we located a single study giving evidence against the existence of such a link.

Question 3What is the relationship between degree of stenosis and the presence and/or intensity of each of the signs, symptoms, and patient conditions?

Patients who have abnormal spinal measurements, stenosis, or spondylolisthesis upon imaging but no symptoms of stenosis are frequently described in the literature (Boden, Davis, Dina et al., 1990; Churchill, Taylor, Shimizu et al., 1988; Fitzgerald and Newman, 1976; Healy, Healy, Wong et al., 1996; Karantanas, Zibis, Papaliaga et al., 1998; Nagler and Bodack, 1993; Sandhu, Lakhanpal, and Gupta, 1976; Wiesel, Tsourmas, Feffer et al., 1984). Several studies also report that there is no correlation between imaging appearance or degree of stenosis or spondylolisthesis and intensity of symptoms (Amundsen, Weber, Lilleas et al., 1995; Cauchoix, Benoist, and Chassaing, 1976; Fitzgerald and Newman, 1976; Kikuchi, Hasue, Nishiyama et al., 1984; Nagler and Bodack, 1993; Postacchini and Perugia, 1991; Rosenberg, 1976; Virta and Ronnemaa, 1993), but the studies do not present numerical data or statistical findings in support of this observation. The lack of correlation could relate to the observation that symptoms tend to fluctuate considerably over time (Amundsen, Weber, Lilleas et al., 1995), but again, there is little data to support this position. One study reports that there is no difference in symptoms between lateral and central stenosis (Amundsen, Weber, Lilleas et al., 1995), but no evidence is presented in support of this.

Due to a relative lack of information, our answer to this question takes the form of a systematic narrative review. While this review reflects the results of and critiques currently available studies, another purpose of it is to illustrate where there is a need for data.

Relationship Between Degree of Stenosis and Back Pain

Three studies note a lack of association between degree of stenosis and back pain (Amundsen, Weber, Lilleas et al., 1995; Friberg, 1989; Virta and Ronnemaa, 1993). In two cases, no data were presented in support of this observation. The third study is discussed below.

This 1993 study found that presence of spondylolisthesis was associated with increased likelihood of back pain in women (p = 0.044), but not in men (Virta and Ronnemaa, 1993). Degree of slip was associated with extent of disk degeneration, but not with pain intensity (r= 0.14 in all subjects, r = 0.20 in symptomatic subjects) (Virta and Ronnemaa, 1993).

Table 14. Relationship Between Back Pain and Spondylolisthesis
StudyPatientsnDegree of Maximal Slip (mm)Translatory Instability (mm)
Friberg, 1989Symptomatic (back pain)507.6±3.76.7±3.7
Asymptomatic298.1±5.71.9±2.0
Table 14 contains data from a study by Friberg showing no relationship between degree of maximal slip and back pain (Friberg, 1989). The report went on to state that symptomatic and asymptomatic patients with spondylolisthesis had significantly different amounts of translatory instability as measured by dynamic traction-compression radiography. We conducted unpaired t-tests on the published data and showed a statistically significant difference between groups (t = 7.48, p <0.00001). Thus, while degree of maximal slip may not be related to severity of back pain, translatory instability may correlate strongly with back pain. This may therefore be a more important measure of severity of stenosis than maximal slip.

Relationship Between Degree of Stenosis and Leg Pain

A study of clinical and radiological features of stenosis reported that bilateral leg pain is slightly more frequent among patients with central stenosis than in those with lateral stenosis (Amundsen, Weber, Lilleas et al., 1995), but the study presented no data or statistical analysis of this observation.

Relationship Between Degree of Stenosis and Neurogenic Claudication

A 1991 report observed that claudication is associated with severe stenosis of the entire canal (Postacchini and Perugia, 1991). No data were presented to support this observation. Claudication may be more common with spondylolisthesis of the L4 vertebra than the L5. Again, no data were provided in support of this observation (Postacchini and Perugia, 1991).

Relationship Between Degree of Stenosis and Reduced Physical Function and Activities of Daily Living

A 1998 report states that disability as assessed using the Oswestry index (a measure of disability from back pain; a score of 100 means the patient is completely disabled) correlates significantly with severity of stenosis (Hurri, Slatis, Soini et al., 1998). Patients with moderate stenosis had a mean Oswestry index of 28.0, while patients with severe stenosis had a mean score of 39.0.

Relationship Between Degree of Stenosis and Disability and Dependency

A 1993 study reported that use of medical facilities and likelihood of receiving a disability pension were no different for men or women with or without spondylolisthesis (Virta and Ronnemaa, 1993). Disabling back symptoms necessitating bed rest at the expense of work or hobby activity were more commonly reported in men without spondylolisthesis (70 percent) than in men with spondylolisthesis (38 percent; p = 0.026) (Virta and Ronnemaa, 1993). No information was provided on the presence of other back problems in the control subjects.

Relationship Between Multilevel Stenosis and Symptoms

A 1992 study reported that neurogenic claudication was generally associated with at least two levels of stenosis (Porter and Ward, 1992). In this study, 49 patients with neurogenic claudication were examined by myelography and CT and found to have stenosis in the central and/or root canal. Average patient age was 58.8 years (SD of 8.1). L4-5 was the level most commonly affected by central stenosis. Thirty-nine patients (80 percent) had multilevel central canal stenosis. Root canal stenosis was found in 37 patients. In all, 94 percent of patients had either a two-level or multilevel central stenosis or a one-level central stenosis and associated root canal stenosis. The authors concluded that neurogenic claudication is uncommon in the absence of multilevel stenosis.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3660_F017.jpg.

   Figure 17. Association Between Percentage of Patients Reporting Neurogenic Claudication and the Mean Number of Stenotic Levels

The data for this figure were taken from 49 separate patient groups reported in 39 clinical studies. Data on the mean number of stenotic levels are contained in Evidence Table 10, and data on the number of patients with neurogenic claudication are contained in Evidence Table 11.

We examined the possible connection between multilevel stenosis and neurogenic claudication by plotting the percentage of patients with neurogenic claudication against the mean number of stenotic levels from 49 separate patient groups reported in 39 clinical studies. Several studies provided more than one patient group for this analysis. The results are presented in Figure 17. These data show no association between the percentage of patients reporting neurogenic claudication within a patient group and the mean number of stenotic levels reported for that patient group. Nine patient groups with less than an average of 2.0 stenotic levels reported better than 70 percent of patients with neurogenic claudication, and five patient groups with more than a 2.5 average for stenotic levels reported less than 50 percent of patients with neurogenic claudication.

Summary

Very little evidence exists correlating degree of narrowing of the lumbar spine with the presence or severity of the signs, symptoms, or conditions associated with stenosis. Difficulties associated with finding such correlations include the presence of large numbers of patients with spinal narrowing and no symptoms; variations in canal size throughout the population; and lack of an accepted system for quantifying the degree of narrowing. Many asymptomatic patients have been shown to have stenosis or spondylolisthesis upon diagnostic imaging. Lumbar spinal stenosis is a condition that includes both a focally narrowed spinal canal and the associated symptoms. The extent of narrowing is also likely to change with the posture of the patient. Extension significantly decreases the canal area, whereas flexion has the opposite effect (Inufusa, An, Lim et al., 1996; Willen, Danielson, Gaulitz et al., 1997). Therefore, a static image of the canal dimension may not be predictive of a patient's symptoms. In current clinical practice, the course of treatment depends on the severity of the symptoms, not on the degree of narrowing.

Two studies have provided numerical evidence of a lack of association between severity of stenosis or spondylolisthesis and severity of back pain. There was, however, some evidence of a relationship between degree of spinal instability and back pain. Another study found that among patients with symptomatic stenosis, those with more severe stenosis tended to have more disability. However, other studies have indicated that patients with spondylolisthesis make the same use of medical resources and miss less work due to back pain than patients without spondylolisthesis. While neurogenic claudication is believed to be associated with more severe stenosis, we located no numerical evidence in support of this hypothesis.

Question 4What is the relationship between the signs and symptoms and other features of the patient history and physical and the results of the imaging examination?

Questions 4 and 7 of this evidence report pertain to results of diagnostic imaging examinations such as myelography, CT, and MRI. Assessment of diagnostic tests is different from assessment of treatments. While it is desirable to evaluate diagnostic tests based on prospective controlled trials that measure patient outcomes in groups that have received or not received a particular test, such trials are rare for any diagnostic application, and nonexistent for tests to diagnose spinal stenosis. Therefore, like most other diagnostic tests, imaging tests for diagnosis of spinal stenosis have to be assessed using an evidence base of case series data. Before addressing question 4, we first examine the relevant imaging modalities to ensure that they do, in fact, provide accurate measures of spinal stenosis. This material is also relevant to question 7 but, to avoid redundancy, we present it only in answer to this fourth question.

Quality of the Diagnostic Studies

Diagnostic Imaging Studies

Table 15. Excluded Diagnostic Imaging Studies
Authors and yearYearReason for exclusion
Berlemann, Jeszenszky, Buhler et al., 19991999No diagnostic data
Maher and Henderson, 19991999Fewer than 10 patients
Kuroki, Tajima, Hirakawa et al., 19981998No diagnostic data
Rankine, Gill, Hutchinson et al., 19981998Primarily disk disease
Zwarts, 19981998Test not used in routine practice
Carragee and Kim, 19971997No spinal stenosis patients
De Haan and Eward, 19971997Published as abstract only
Vergauwen, Parizel, van Breusegem et al., 19971997No diagnostic data
Carey and Garrett, 19961996Acute disease
Elkayam, Avrahami, and Yaron, 19961996No diagnostic data
Healy, Healy, Wong et al., 19961996Asymptomatic patients
Jarvik, Deyo, and Koepsell, 19961996Study incomplete
Khandelwal, Takhtani, Malik et al., 19961996No diagnostic data
Dullerud and Johansen, 19951995Examined only disks
Modic, Ross, Obuchowski et al., 19951995Acute disease
Videman, Battie, Gill et al., 19951995Asymptomatic patients
Simmons and Simmons, 19921994Postsurgery
Gunzburg, Servais, and Verhas, 19941994Acute disease
Lusins, Elting, Cicoria et al., 19941994No control group
Tallroth, Ylikoski, Landtman et al., 19941994No diagnostic data
Wood, Popp, Transfeldt et al., 19941994No diagnostic data
Djukic, Vahlensieck, Resendes et al., 19921993Postsurgery
Jinkins, 19931993Fewer than 10 patients
Pai, 19931993No diagnostic data
Antti-Poika, Soini, Tallroth et al., 19901990Examined only disks
Boden, Davis, Dina et al., 19901990Asymptomatic patients
Stanley, McLaren, Euinton et al., 19901990Postsurgery
Stockley, Getty, Dixon et al., 19881989Postsurgery
Deyo, Diehl, and Rosenthal, 19871987No diagnostic data
Waddell, 19871987Disk prolapse only
Downing, Schnitzlein, Clarke et al., 19861986Technically suboptimal
Morris, Di Paola, Vallance et al., 19861986Disk prolapse only
Osborn and Knight, 19861986Postsurgery
McCutcheon and Thompson, 19861985Acute disease
Tait, Charlesworth, and Lemon, 19851985No spinal imaging
Firooznia, Benjamin, Kricheff II et al., 19841984Examined only disks
Frymoyer, Newberg, Pope et al., 19841984Asymptomatic patients
Hindmarsh, Ekholm, Kido et al., 19841984No diagnostic data
Wiesel, Tsourmas, Feffer et al., 19841984Asymptomatic patients
Crouzet, Vasdev, and Chirossel, 19831983No diagnostic data
Gehweiler and Daffner, 19831983Acute disease
Omojola, Vas, and Banna, 19811983Obsolete methods
Postacchini and Pezzeri, 19811981Obsolete equipment
Postacchini, Pezzeri, Montanaro et al., 1980b1981Obsolete equipment
Scafuri and Weinstein, 19811981Obsolete equipment
Sandhu, Lakhanpal, and Gupta, 19761976No diagnostic data
Torgerson and Dotter, 19761976Acute disease

Note: Some articles could be excluded on multiple grounds; only one of those grounds for each article is listed in the table.

We begin our discussion of the validity of diagnostic imaging studies by first describing the reasons for excluding studies from our analysis. We retrieved 52 articles that were described as diagnostic imaging studies. Each of them was reviewed to see if they contained any evidence relevant to questions 4 or 7. Most of these articles (45) had to be excluded from our analysis. These excluded articles, and the reasons for their exclusion, are shown in Table 15. Note that some of the excluded articles met more than one criterion for exclusion, but only one criterion is shown in the table. Therefore, the proportion of articles excluded for a specific reason will underestimate the number of articles that actually have that particular flaw.

The most frequent reason for exclusion was a lack of diagnostic data in the article. This lack of data typically arose because it was the intention of the article to provide a description of the imaging of spinal stenosis and related conditions. Thus, while such articles might report the percentage of images that exhibited a particular feature, no relationship between these features and the patient's diagnosis would be made. These articles are as much tutorials as they are evaluations of the effectiveness of the technology. Typically, they show images that demonstrate normal and abnormal patients, show the features by which a diagnosis can be made, and show examples of how one modality can demonstrate an abnormality while another does not. These examples of cases diagnosed by one modality but not by another do not provide valid evidence on the effectiveness of either modality. There is no guarantee that the cases presented in the article are a representative sample of all cases encountered in clinical practice.

Another common reason for study exclusion was a lack of data on spinal characteristics. Several trials reported only imaging findings related to the intervertebral disks. Other studies were excluded because of the patient populations they studied: some studied asymptomatic patients, some studied patients who had already had back surgery, and some studied few (five or less) or no patients with spinal stenosis.

Table 16. Diagnostic Trials Meeting Preliminary Inclusion Criteria
Authors and yearImaging results reportedSigns and symptomsClinical results
Eberhardt, Hollenbach, Tomandl et al., 1997Degree of stenosis (myelogram and MRI)NoneNone
Muto, De Maria, Izzo et al., 1997Size of spinal canal (CT, MRI in selected cases)NoneNone
Ramsbacher, Schilling, Wolf et al., 1997No quantitative data reportedNoneNone
Wood, Popp, Transfeldt et al., 1994Amount of intervertebral motionNoneNone
Phillips, Howe, Bustin et al., 1990Abnormal or normal spinal motionPhysical exam findingsNone
Modic, Masaryk, Boumphrey et al., 1986Presence or absence of stenosis (MRI, CT, myelography)NoneSurgical findings
Bell, Rothman, Booth et al., 1984Presence or absence of stenosis (CT, myelography)NoneSurgical findings
Seven articles (see Table 16) remained from this group after the preliminary inclusion/exclusion criteria were applied, but passing these criteria did not mean that valid conclusions could be drawn from the data in those articles. The studies are discussed in detail and the data is analyzed elsewhere in the report in specific sections pertaining to validation of the imaging modalities and to clinical questions 4 (Can clinical signs and symptoms predict imaging results?) and 7 (Can imaging results predict surgical results?).

Diagnostic Imaging Data in Studies of Surgical Treatment

Diagnostic imaging data is not limited to articles described by their authors as diagnostic trials. We also reviewed all 141 surgical trials in the electronic database created for this project to see if they contained any evidence that could answer questions 4 and 7. Trials that could answer either of these questions would have to have reported the results of the preoperative imaging examination, so 81 of the trials were excluded because they did not report imaging results or because they reported only sample results from a few patients. We excluded 15 trials that reported only postoperative imaging findings, because treatment could have affected results. Also excluded were 24 trials that reported some imaging results but did not report any correlations between imaging results and initial signs and symptoms (question 4) or outcomes after surgery (question 7). One trial reported both imaging results and clinical outcomes for each patient, but all the patients had the same imaging results, so no questions about imaging efficacy could be answered by this trial.

Additional criteria were used to exclude studies that used technically inferior or obsolete imaging methods. Even though myelography no longer is in routine use (Eisenberg and Margulis, 2000b; Mitchell, 2000), we did not exclude myelography studies, because some consider myelography a standard modality for measuring the spinal canal. Studies using plain film x-rays for diagnosing spinal stenosis (Omojola, Vas, and Banna, 1981) were excluded. While plain films are still used in assessment of spinal disorders (particularly detection of fractures or metastases), they have been supplanted by myelography, then CT and MRI for diagnosis of spinal stenosis (American College of Radiology, 1998; Eisenberg and Margulis, 2000b; Grossman, Katz, Santelli et al., 1994). Studies using first- and second-generation CT scanners (Postacchini and Pezzeri, 1981; Postacchini, Pezzeri, Montanaro et al., 1980b; Scafuri and Weinstein, 1981) were excluded because their long scan times increase the risk of motion artifact. These scanners are now obsolete (ECRI. 1999). An MRI trial was excluded because the scanner was operated at only one-third of its nominal magnetic field strength (Downing, Schnitzlein, Clarke et al., 1986); MR signal intensity and image quality increases with field strength, so that study's technique was clearly suboptimal. More recent MRI studies may plausibly report better results than older studies using older MR technology, but insufficient data were available to test this hypothesis.

After the preliminary inclusion and exclusion criteria were applied, 19 trials were left for further review. They are listed in Table 17. Again, the appearance of a trial in this table does not necessarily mean its results could be used to answer questions in this evidence report. On further review, three of these studies were excluded from further analysis. Herno et al. (1999b) reported only postoperative imaging findings, so the results are not relevant to the clinical questions asked in this report (Herno, Partanen, Talaslahti et al., 1999). The study by Tajima et al. (1980) was excluded because of incomplete reporting of imaging methods (Tajima, Fukazawa, and Ishio, 1980). Kawauchi et al. (1996) studied diagnosis of adhesive arachnoiditis (a condition reported in no other trials) using myeloscopy (a test reported by no other trials) (Kawauchi, Yone, and Sakou, 1996).

Most of the surgical trials that reported imaging results reported spinal measurements such as the anterior-posterior (AP) dimension of the spinal canal. Correlation of this measurement with clinical signs and symptoms is the subject of question 3, and correlation of the measurement with surgical outcomes is the subject of question 6. Imaging findings like complete and incomplete block are included in the analysis for questions 4 and 7, although they are related to the degree of stenosis.

Validation of Imaging Results

In the preceding two sections, we discussed how we arrived at the set of studies used to evaluate whether the relevant diagnostic modalities produced accurate measurements of spinal stenosis. In this subsection, we now consider those data. Because imaging findings are usually considered a reference standard for diagnostic tests in published studies of spinal stenosis, validation of the results of available studies is difficult. This is because validation requires an independent reference standard to which one can compare the imaging results. The only reference standard that would meet this criterion would be physical confirmation of stenosis during surgery.

Six trials report just such a comparison. Ramsbacher et al. (1997) compared MRI results to surgical findings. They determined that "the results of MRM [MR myelography] were identical to those of conventional myelography and corresponded to the surgical findings." However, this report provides no explanation of which imaging and surgical findings were compared: specific measurements of the spine and spinal canal, localization of abnormal disks, or the differential diagnosis between spinal stenosis and other conditions. Furthermore, there are no data presented in the paper to allow us to confirm the authors' conclusions. Although the authors intended to measure sensitivity and specificity of MRI in this study, we cannot use this trial to assess the efficacy of MRI (Ramsbacher, Schilling, Wolf et al., 1997).

The same difficulties affect the article by Muto et al. (1997). These investigators sought to evaluate the sensitivity and specificity of both CT and MRI, and found that CT evaluated the size of the spinal canal "perfectly." However, they do not report what reference standard they used, whether it was surgery, myelography, or some other test. Therefore, we cannot use this trial to validate CT or MRI. Furthermore, this study also has a patient selection bias affecting the MRI results. Only patients with polyradiculopathy or with imaging findings (modality not specified) that disagreed with the preliminary clinical diagnosis were referred for MRI, so the MRI results reflect only difficult cases, not a representative sample of all cases. Therefore, the sensitivity and specificity of MRI would be underestimated by any calculations performed on this data. The article includes examples of selected cases that were correctly or incorrectly diagnosed by CT and MRI, but we are not certain that these cases are a representative sample (Muto, De Maria, Izzo et al., 1997).

An early study validating MRI of the lumbar spine was reported by Modic et al. (1986). The study design was not subject to the difficulties of the other studies, surgical results to use as an independent reference standard were available for most patients (note that no quantitative definition was given for spinal stenosis), but the MR scanner used is now obsolete and technically inferior to scanners in routine use today. Gradient-echo techniques, now considered standard practice, were not available when Modic et al.'s patients were studied. Therefore, results from this study should be considered minimum levels of sensitivity for MRI. Calculation of specificity from the published results is not possible because no negative results were found on surgery. In this series, MRI alone detected 23 of 30 vertebral levels with spinal stenosis (sensitivity = 77 percent), while CT alone (combining studies done with and without contrast material) detected 19 of 24 stenoses (sensitivity = 79 percent), and myelography alone detected 13 of 24 stenoses (sensitivity = 54 percent). When results of both imaging examinations were used together, MRI and CT detected 23 of 24 stenoses (sensitivity = 96 percent) (Modic, Masaryk, Boumphrey et al., 1986).

Relatively current MRI techniques were used by Eberhardt et al. (1997). All 80 patients in their study underwent surgery, but the definition of stenosis used in the surgical reference standard was not reported. Nerve root compression appears to be their diagnostic criterion. There is a discrepancy in the percentages reported by Eberhardt et al. (1997) for the sensitivity of MRI in diagnosing stenosis. For diagnosing severe stenosis, they report 95 percent sensitivity, but with 50 cases of stenosis, this percentage is not possible. This percentage is also inconsistent with the percentage obtained from subtracting results for mild stenosis from results in all stenoses. To take a conservative approach to computing the sensitivity of MRI, we will assume the lower figure (the figure we calculated by subtraction) is correct. Making this assumption, the sensitivity was 100 percent for both x-ray myelography and MRI in diagnosing mild stenosis. For severe stenosis, we found the sensitivity of MRI was 88 percent and the sensitivity of myelography was 72 percent (Eberhardt, Hollenbach, Tomandl et al., 1997).

Validation of CT and myelography was the objective of a study reported by Bell et al. (1984). Their multicenter study compared imaging results to surgical findings. Details of the CT methods were not reported, other than the fact that they did not use contrast agents, so we cannot determine whether the CT scanners used in this study (reported in 1984) are now obsolete. However, these data are nearly 20 years old, and technical improvements have been made in CT scanners, so it is reasonable to assume now that current scanners are more effective in diagnosing spinal stenosis (Bell, Rothman, Booth et al., 1984).

Rather than expressing their findings as quantitative results for stenosis measurement and differential diagnosis, Bell et al. (1984) used descriptive words, "strong," "firm," and "weak," to describe the agreement between test results and surgical findings. For purposes of this evidence report, we will only consider "strong" agreements to be correct diagnoses. This requires both a correct diagnosis of the vertebra as normal or abnormal, and a correct differential diagnosis between disk herniation and spinal stenosis (or both). Based on these data, we calculated that the CT scan made both diagnoses correctly in 55 of 76 cases of disk herniation (72 percent) and 62 of 93 cases of spinal stenosis (67 percent). The myelogram made both diagnoses correctly in 63 of 72 disk herniation cases (83 percent) and 63 of 93 spinal stenosis cases (68 percent). The myelogram correctly diagnosed the presence of disk herniation a significantly greater percentage of the time (p <0.005, McNemar test), but the two modalities were not significantly different for diagnosing spinal stenosis. Unfortunately, the categories used by Bell et al. (1984) do not allow one to distinguish between false positive and false negative results, so their data cannot be expressed as sensitivity and specificity.

While the study by Young et al. (1988) was primarily a report on a new surgical procedure, it did include data validating myelography results by comparison to surgical findings. According to the myelograms, there were 70 stenosed vertebrae in the 32 patients in this series. However, six additional levels of stenosis were found at surgery. Assuming the surgical results to be the reference standard, this means that the sensitivity of myelography was 92 percent. Only seven patients in this study were imaged by CT, and the CT results were not reported (Young, Veerapen, and O'Laoire, 1988).

Summary

All of the clinical trials that report data validating MRI, CT, or myelography for diagnosis of spinal stenosis had one or more flaws in design or reporting that adversely affected the reliability or applicability of the results. Each of the five trials that studied CT or MRI found that the sensitivity of the cross-sectional modality is equal to or better than the sensitivity of myelography. None concluded that myelography was superior to the cross-sectional modalities. None of the trials attempted to validate the quantitative measurements of the spinal canal made by any imaging modality.

Signs, Symptoms, Patient History, and Physical and the Results of Imaging Exams

In this section, we address the fourth key question of this evidence report, "What is the relationship between the signs and symptoms and other features of the patient history and physical and the results of the imaging examination?" In considering our discussion of this question, it is important to bear in mind the conclusion of the previous subsection: there is little evidence that proves how well the modalities used to image spinal stenosis perform.

In answer to this fourth key question, we assess evidence related to whether there is a relationship between patient signs and symptoms the findings of the imaging examination. If such relationships exist, it may be possible to expedite proper diagnosis and treatment of the condition. The evidence base for this question consists of seven trials that retrospectively reported the number of patients with and without a few signs and symptoms among groups of patients with moderate or severe stenosis. In all of these trials, difficulties with patient selection and confounding variables, including disk herniation, could cause apparent associations between clinical signs and symptoms and imaging results in clinical trials. Therefore, trials of better quality are needed to adequately address this question. Nevertheless, currently available data do exhibit some patterns.

Only one of these trials was a diagnostic trial. Phillips et al. reported a study of "stress x-rays" (radiographs taken with the patient bending one way or another) (Phillips, Howe, Bustin et al., 1990). Angles between vertebrae were measured, and motion of the spine during these movements was described as normal or abnormal by the person reading the images. The investigators tried to correlate these descriptions with clinical information such as age, sex, previous history of injury or surgery, participation in sports, and discomfort. No statistically or clinically significant correlation was found with these variables or with physical exam findings like range of motion and palpation results. The authors conclude that the value of these x-rays is questionable. Accordingly, no other published studies report using this technique.

The remaining trials answering the present question were surgical trials that additionally reported imaging results. Sato and Kikuchi (1997) sought to determine whether there was a relationship between the number of stenosed levels and patients' symptoms. To accomplish this, they stratified their report of patients' symptoms by the number of vertebral levels where stenosis was found by myelography. Rates of radicular pain were the same in the two groups: 82 percent (23/28) of the patients with two stenosed levels and 83 percent (44/53) of the patients with one stenosed level. Cauda equina symptoms were present in 75 percent (21/28) of patients with stenosis at two levels but were present in only 53 percent (28/53) of patients with stenosis at one level. According to tests performed by the authors, this difference was not statistically significant (Sato and Kikuchi, 1997).

Rompe et al. (1995) studied the frequency of 13 different symptoms and patient characteristics among patients found to have absolute stenosis (AP diameter of spinal canal 10.0 mm or less as measured by CT) and relative stenosis (diameter 10.1 to 12.0 mm). Statistical tests performed by the authors found that rates of most of these characteristics were not significantly different between absolute and relative groups. Only ankle reflex differed significantly, with 72 percent of the absolute stenosis group having decreased reflex response, and 13 percent of the relative stenosis group having it. The ability of reflex abnormality to predict degree of stenosis was not discussed by the authors, nor is it examined in any other clinical trial (Rompe, Eysel, Hopf et al., 1995).

A pair of studies by Johnsson et al. (1981) compared the rates of major symptoms among patients with different degrees of stenosis. Their article included five patients with normal myelogram results and divided the abnormal results into complete block (n = 7) and partial block (n = 15) of the spinal canal. Our statistical calculations found no significant difference in degree of stenosis between patients with claudication and those with back and leg pain without claudication (χ2 = 0.277, df = 2, p = 0.87) (Johnsson, Willner, and Pettersson, 1981).

The report by the same group (Johnsson, Uden, and Rosen, 1991) appears to include the patients included in the earlier report, along with subsequent surgical patients, as well as a group of patients who were not operated on. Symptoms were categorized as claudication and radiculopathy. Patients who underwent surgery were categorized as having complete stenosis (n = 14) or moderate stenosis (n = 30). The "not operated" group (n = 19) may have included some patients without stenosis, but it also included patients with stenosis who either refused surgery or were considered unacceptable surgical candidates. As in the previous study, we found no significant differences in the proportion in each stenosis group between the two symptom groups in this report (χ2 = 0.398, df = 2, p = 0.82).

Verbiest's case series (Verbiest, 1977) reported measurements of the lumbar canal made during surgery. Therefore, using this data to answer the present question requires one to assume that the presurgical imaging findings accurately determine the degree of stenosis. Verbiest categorized the spinal findings as "absolute stenosis" (spinal canals with diameters of 10 mm or less), "relative stenosis" (spinal canal diameters between 10 and 12 mm), and "mixed stenosis" (some narrowed portions 10 mm or less and some between 10 and 12 mm). To analyze these results, we pooled the "absolute" and "mixed" groups, so that all patients with any spinal measurement of 10 mm or less would be in this severe stenosis group, and all spinal measurements in the patients in the mild stenosis group would be greater than 10 mm. These findings may be confounded by disk herniation. Most of the patients in the group with mild stenosis had disk herniation (29/32: 91 percent), while less than half the patients with severe stenosis had disk herniation (33/84: 39 percent, p <0.001, Fisher's exact test, as per our calculation). Therefore, we computed the correlation between the reported symptoms and the anatomic findings. Sciatica was significantly correlated with disk herniation (r = 0.143, p = 0.027) but not correlated with stenosis (r = 0.000, both calculations performed by us). While the correlation of sciatica with disk herniation was statistically significant, the correlation coefficient was relatively low, so it may have limited clinical significance. We found that sciatica was significantly correlated with disk herniation (r = 0.219, p = 0.004), but the correlation with stenosis was not statistically significant (r = −0.076, p = 0.095). Data for the correlation between intermittent claudication and disk herniation was not reported by Verbiest, so we cannot determine whether claudication correlates more with disk herniation or with stenosis. From this analysis, we can conclude that symptoms like sciatica and lumbago are not predictive of stenosis.

Summary

Clinical signs and symptoms do not appear to predict whether the results of imaging tests will show severe stenosis. The major symptoms of radiculopathy and cauda equina do not predict degree of stenosis. Lumbago and sciatica are significantly associated with disk herniation but not with stenosis. However, no trial that was relevant to the present question was prospective in design, and results of some of the trials are difficult to interpret because some patients had disk herniation. Therefore, a conclusive answer to this question awaits the results of trials of better design.

Question 5 What is the relationship between the signs and symptoms and other features of the patient history and physical and results of conservative treatment; and what is the relationship between the type of conservative treatment and patient outcomes?

This question seeks to determine if successful conservative treatment can be predicted by certain patient characteristics. Implicit in this question is whether conservative treatments are effective. Therefore, we first discuss possible predictors of successful conservative treatment, and then turn to a discussion of this latter implicit question.

Factors That Could Potentially Predict the Success of Conservative Treatment

This part of the analysis addresses whether there are any signs or symptoms that predict the outcome of conservative treatment. Does the initial status among patients have a prognostic before receiving conservative treatment? To answer this question within a single study, there must be individual patient data on the initial signs and symptoms and the outcome of conservative treatment. Alternatively, the outcome must be stratified according to initial status. For these reasons, our inclusion criteria for this question were any trial with a conservative treatment arm that had either individual patient data for initial status and their final outcome or that stratified final outcome by initial status.

We examined every study that had a conservative treatment series, either as a control arm or as a single series. We identified one study that provided individual patient data for treadmill walking distance at the start of conservative treatment and after three months (Eskola, Pohjolainen, Alaranta et al., 1992). We also found one study that stratified patients according to outcome of conservative treatment (improved, same, worse) and reported the mean AP canal diameter for each group before treatment (Johnsson, Uden, and Rosen, 1992). We also located two recent publications that addressed this issue.

Eskola et al. (1992) provided plots portraying individual patient treadmill walking distances before conservative treatment and after three months. This trial had parallel arms, one of which received placebo, and one which received calcitonin. During the 3 month period for which individual patient data was reported, the calcitonin had a nonsignificant effect on treadmill walking distance (we calculated for t-test of difference: p = 0.23; t-test on ratio: p = 0.55). Therefore, we combined the arms to approximate a single series of 38 patients that received conservative treatment. This trial also provided individual patient data for resting pain; however, this is not a major factor in spinal stenosis. Furthermore, the calcitonin treatment had a significant effect in preventing resting pain in those who had it, and so the treatment groups could not be combined, greatly limiting the statistical power for our purposes. Therefore, we did not analyze this outcome (Eskola, Pohjolainen, Alaranta et al., 1992).

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3660_F018.jpg.

   Figure 18. Initial vs. Final Walking Distance Computed from Eskola, Pohjolainen, Alaranta et al., 1992

Table 18. Regression Analysis of Treadmill Walking Distance Before and After Three Months of Conservative Treatment
ComparisonCorrelation CoefficientP-value
Initial distance vs. final distance (Figure 18)0.92<0.001
Initial distance vs. difference (Figure 19)0.470.003
Initial distance vs. ratio (Figure 20)−0.120.48

Calculated by us from Figure 6 in Eskola, Pohjolainen, Alaranta et al., 1992.

To analyze the relationship between initial and final walking ability with conservative treatment, we extrapolated the initial and final values from the published plots and carried out a regression analysis (see Figure 18). The correlation coefficient is 0.92, p <0.001 (see Table 18), indicating a strong relationship between initial status and final status with conservative treatment. Two conclusions are suggested by this analysis. First, a substantial number of patients improved after conservative treatment. This resulted in a larger number of points above the 45-degree diagonal line (the line on which points would fall if patients did not improve) than below it. Thus, 18 of 36 points were above the diagonal, 13 of 36 points were on it, and 6 of 36 points were below the diagonal. Further, the regression line is above the diagonal. This indicates that there can be some improvement in a conservatively treated series of patients, but does not indicate whether this improvement is caused by conservative treatment or whether it would have occurred in the absence of treatment. To determine the cause of this improvement, a trial must have a control group. This includes trials in which surgical outcomes are compared to those of conservative treatments. Only the results of controlled trials can be used to ascertain whether surgical treatment provides additional benefit beyond what would occur with conservative treatment.

The second phenomenon observed in this analysis is that patients with poorer half in terms of initial status were more likely to stay the same or become worse (to be on or below the diagonal) than patients with more favorable half initially. For the half of the patients with the worst initial walking ability (below 520 m), 4 of 18 points were below the diagonal, and 8 of 18 points were on the line, for a total of 12 of 18 points. For the group with more favorable initial status, 2 of 18 points were below the line, and 4 of 18 points were on the line, for a total of 6 of 18 points. This is 67 percent the same or worse for the group with poorer initial status, and 33 percent the same or worse for the group with more favorable initial status. Our statistical analyses of these data show that this is an absolute difference of 33 percent (95 percent CI, 1 to 58) and a relative difference of 2.0 (95 percent CI, 0.96 to 4.15). One result is statistically significant, and the other is not. Thus, whether the trend observed is statistically significant in these data appears to depend on how one analyzes the data.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3660_F019.jpg.

   Figure 19. Initial vs. Difference in Walking Computed from Eskola, Pohjolainen, Alaranta et al., 1992

Using information from two additional graphs, we further searched for a relationship between patients' initial status and whether they exhibited improvement after treatment. In Figure 19, we carried out a regression analysis of the initial walking ability compared to the change measured as the difference between the initial and final walking distance. The correlation coefficient is 0.468, p = 0.003, indicating a moderate, positive correlation between the initial walking ability and the amount of improvement. Again, it is noteworthy that most of the patients with a negative change (points below the zero difference line) were in the half with the worst initial walking ability.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3660_F020.jpg.

   Figure 20. Initial vs. Ratio of Change in Walking Distance Computed from Eskola, Pohjolainen, Alaranta et al., 1992

Another way to look at the relationship of initial status to final outcome is to regress initial walking ability on the ratio of improvement, measured as the final walking distance divided by the initial walking distance (see Figure 20). Unlike the above regression, this regression does not have a positive slope. The correlation coefficient is −0.119, p = 0.482, indicating a possible slight negative (but not statistically significant) correlation between initial walking ability and the proportional amount of improvement. Even if this possible slight negative trend were statistically significant, it would be unlikely to have clinical prognostic value. As in the above regressions, most of the patients showing worse proportional change (points below the zero ratio horizontal line) were in the worse initial group.

Whether the regression on change as a difference or as a ratio is more informative is not immediately clear. The fact that the correlation goes from statistically significant and positive for regression on the difference to statistically nonsignificant and negative for regression on the ratio indicates that both types of regressions should be considered. In this regard, the proportional assessment of improvement is more like the way many surgery studies measured outcomes. Typically patients are categorized as being worse, the same, or improved following treatment. Frequently the same and improved categories are further categorized into fair (eg., those patients with 0 to 50 percent improvement), good (50 to 75 percent improvement), and excellent (>75 percent improvement), or similar categories. These categories demand proportional judgments. This expression of results obscures the influence on outcome of the initial status and assumes that, for example, a 50 percent improvement from poor initial status to mediocre status is equivalent to a 50 percent improvement from mediocre status to good status. This may be the case, but studies in this field have not acknowledged or addressed this assumption.

The regression analysis shown in Figure 20 indicates that there is more scatter in the data points from patients with the worst initial walking ability. The slightly negative slope of the regression line is caused by a few patients with poor initial status who had very large gains, while several patients with poor initial status became worse. This is in contrast to the patients with better initial status, most of whom had moderate gains. This wide dispersion prevents initial walking ability from being a useful predictor of the success of conservative treatment with individual patients.

Table 19. Mean Change in Treadmill Walking Distance before and after Three Months of Conservative Treatment Stratified by Initial Status
Initial Distance (m)Final Distance (m)Difference (m) (95% CI)Ratio
Whole Cohort662827164 (58 to 270)1.31 (1.09 to 1.53)
Worst Initial20424743 (−18 to 104)1.39 (0.97 to 1.79)
Best Initial11291433304 (122 to 486)1.27 (1.09 to 1.45)
Strata
Difference
(best initial
minus worst
initial)
261 (54 to 468) t-test p = 0.016−0.12 (−0.59 to 0.34) t-test p = 0.59

Calculated by us from Figure 6 in Eskola, Pohjolainen, Alaranta et al., 1992.

Table 20. Change Category Frequencies for Treadmill Walking Distance After Three Months of Conservative Treatment Stratified by Initial Status
Worse <0.75 Change (95% CI)Same 0.75 to 1.25 Change (95% CI)Improved >1.25 Change (95% CI)
Whole Cohort0 % (0 to 9.6%)61% (44% to 75%)39% (25% to 55%)
Worst Initial6% (1/18) (1% to 26%)61% (11/18) (39% to 80%)33% (6/18) (16% to 56%)
Best Initial0 % (0% to 18%)56% (10/18) (34% to 75%)44% (8/18) (25% to 66%)
Strata
Difference
(best initial
minus worst
initial)
−6% (−26% to 13%)−6% (−34% to 25%)11% (−19% to 39%)Somers' d −0.14 p = 0.38

Calculated by us from Figure 6 in Eskola, Pohjolainen, Alaranta et al., 1992.

In addition to regression analysis, we also looked at the relationship of initial and final walking ability by stratifying the series into the half of the patients with the most favorable status and the half with the poorest initial status. We then calculated the mean outcomes for each group (see Table 19). We also categorized the outcomes into worse, same, and better, and calculated the proportions in each category stratified by initial status (see Table 20). The mean change in walking distance in terms of absolute difference for the whole cohort was 164 m (95 percent CI ±106 m). Three hundred and four meters was the absolute difference for the group with the most favorable initial status, and 43 m was the absolute difference for the group with the poorest initial status. The difference between the mean changes (the effect size) was 261 m (95 percent CI 54 to 468 m; two-tailed t-test p = 0.016). Thus, the group with the most favorable initial status had a substantially greater mean change in walking distance in terms of the absolute distance. The mean proportional change in walking distance for the whole cohort was 1.31 (95 percent CI ±0.22). The mean proportional change in walking distance was 1.27 m for the group with the most favorable initial status and 1.39 m for the group with the poorest initial status. This means that this latter group had a somewhat greater proportional improvement over the followup period. The difference between these ratios was 0.12. This small possible difference is not statistically significant (95 percent CI −0.34 to 0.59; two-tailed t-test p = 0.59). Thus, the group with the most favorable initial status had a statistically greater absolute change in the number of meters walked. However, the proportional change was slightly greater for the group with the poorest initial status, although the difference was not statistically significant, nor likely to be clinically significant.

Looking at the proportion of patients in each group that were improved (ratio >1.25), the same (ratio between 0.75 and 1.25), or worse (ratio <0.75), 44 percent (8/18) of the group with the most favorable initial status improved, 56 percent (10/18) were the same, and none were worse. In the group with the poorest initial status, 33 percent (6/18) of patients improved, 61 percent (11/18) remained the same, and one patient was worse (Somers' d −0.114, p = 0.38). Thus, with this frequency comparison, the best initial group had slightly more patients improved, although the difference was not statistically significant, nor does it appear to have much clinical significance. This crude and arbitrary categorization did not capture the fact that both groups actually had some patients with a change for the worse (4/18 in the group with the poorest initial status, and 2/18 in the group with the most favorable initial status), as was demonstrated in the above regression analyses.

Although this was a small study with only moderate statistical power (18 patients in each group), we conclude from this analysis that there was no substantial difference between the mean proportional improvement of the groups with best and worst initial status in terms of treadmill walking distance. However, a group of the worst initial patients (67 percent or 12/18 in the regression analysis) appears to remain the same or continue to get worse with conservative treatment. But because of the wide distribution of results in the patients with the worst initial status, in this trial the initial status for treadmill walking distance had little prognostic value in terms of predicting the outcome of conservative treatment for individual patients.

Table 21. Initial Stenotic Canal Diameter Stratified by Outcome of Conservative Treatment
Visual Analog Scale ResultsnMean AP Diameter Range)
Worse44.7 mm (0 to 9 mm)
Same196.8 mm (0 to 11 mm)
Improved48.2 (7 to 10 mm)

Adapted from Johnsson, Uden, and Rosen, 1992.

Johnsson et al. (1992) also provided data on the relationship between initial status and the outcome of conservative treatment (see Table 21). This study reported that for a subjective global VAS outcome (45 months average followup) of worse, unchanged, or improved, the mean AP canal diameter (myelographic measurement at the narrowest level) for each group before conservative treatment was 4.7 mm (range 0 to 9 mm, n = 4), 6.8 mm (range 0 to 11 mm, n = 19), and 8.2 mm (range 7 to 10 mm, n = 4), respectively. Because no variance was reported, we were unable to test the statistical significance of this data. These data are consistent with a trend for smaller focal canal diameters to predict worse outcomes with conservative treatment on average. However, as above, the wide distribution (wide and overlapping ranges) prevents this variable from having useful prognostic value for individual patients (Johnsson, Uden, and Rosen, 1992).

Simotas et al. (2000) examined the effects of nonoperative treatment on 49 patients with lumbar spinal stenosis over a 3 year period. Although this study did not contain a surgical arm for comparison, this trial is mentioned here because of the long followup time and the information provided on those patients that eventually needed surgery. Conservative treatment consisted of bed rest for the first 1 to 2 weeks, followed by a physical rehabilitation program with a physical therapist. Patients performed flexion-based lumbar stabilization exercises. Patients with significant radicular symptoms received oral corticosteroids on a 7 day tapering schedule. Epidural steroid injections were used to treat patients who did not achieve adequate symptom control. Many patients had a recurrence of symptoms that required reinjection during the 3 year study period. The average age was 69 years (range 53 to 87). Radiographic severity of the stenosis was judged to be mild in 12 percent of patients, moderate in 43 percent of the patients, and severe in 45 percent of the patients. Nine patients required surgery within a mean of 13 months (range 2 to 41 months). These patients had more serious pretreatment averages for back and leg pain and a greater degree of stenosis. Among the 40 remaining nonoperated patients, only age seemed to be related to outcome variables, with older patients having worse scores for improvement in pain. Older patients had significantly greater radiographic severity, greater levels with at least moderate stenosis, and more frequently had scoliosis. If the patients with scoliosis were removed from the analysis, the relationship between increased age and worse outcomes was not significant.

Amundsen et al. (2000) divided their patients into three groups: patients with severe symptoms who were treated with surgery (n = 19), patients with pain considered too moderate to justify surgery and treated with conservative methods (n = 50), and the remaining patients whose severity of pain did not indicate surgery or conservative treatment (n = 31). The patients in this latter group were randomized to surgical (n = 13) or conservative treatment (n = 18). Over the course of the 10 year followup period, 20 percent of conservatively treated patients with mild pain needed surgery, and 56 percent of conservatively treated patients with moderate pain needed surgery. No convincing associations between any clinical feature, such as age, gender, type of work, duration of pain, and physical findings, and treatment results were observed by the authors. In addition, no convincing associations between any radiologic feature, such as degenerative changes, spondylolisthesis, type of stenosis, degree of narrowness, and measures of different dimensions of the spinal column, were observed. However, the sample size in this trial was small and the study may not have had sufficient statistical power to detect association between patient characteristics and treatment results.

Summary

Few studies have examined the question of the relationship of initial signs and symptoms to the final status or amount of change following conservative treatment. The one study, Eskola et al. (1992) which reported individual patient data indicated that with conservative treatment, some patients improve, some stay the same, and some get worse. Simotas et al. (2000) suggest that the patients with the highest degree of pain are more likely to have unsuccessful conservative treatment and go on to surgery. In Simotas et al. (2000), this was 19 percent of patients over three years. Amundsen et al. (2000) preselected those patients with the highest pain for surgery, but among the mild and moderate pain groups of conservatively treated patients, 20 percent and 56 percent, respectively, eventually required surgery over a 10 year period. This calls into question studies of conservative treatment or surgery that do not have parallel control groups. This also calls into question the usefulness of means in treatment comparisons. Reporting only the mean value for final status or amount of change can obscure the fact that substantial numbers of patients may differ greatly from the mean. The study by Eskola et al. (1992) indicated that for treadmill walking ability (a measure of neurogenic claudication), a moderate positive correlation may exist between initial status and amount of improvement measured as a difference (final status − initial status). However, no substantial correlation was found between initial status and improvement measured as a ratio (final status : initial status). The wide dispersion of results for the difference and ratio indicates that any correlations are not very useful for prognostication for individual patients. Johnsson et al., (1992) suggested some positive correlation between canal size at the point of focal narrowing and the outcome of conservative treatment. Simotas et al. (2000) also suggested that the degree of stenosis was related to the failure of conservative treatment and the need for surgery. Amundsen et al. (2000) found no association between spinal canal dimensions and treatment outcomes; however, the small sample size in this study may have prevented the detection of any significant association.

Comparison of Conservative Treatment Methods

Table 22. Disposition of Publications on Conservative Treatments
DocumentsNumberPercentage
Retrieved from literature searches4,788100%
Conservative treatment trials examined for lumbar spinal stenosis patients1783.71% of literature retrieved from searches
Conservative treatment trials rejected due to lack of control group, too few patients, failure to report outcomes separately for each spinal disorder, or failure to diagnosis specific lumbar spinal stenosis patients17498% of conservative trials
Conservative treatment trials that met inclusion criteria42% of conservative trials
Table 24. Randomized Controlled Trials of Patients Receiving Conservative Treatments
First Author and Year of PublicationPatient GroupNTreatmentOutcomes MeasuredFollowup Period
Trials examining surgical patients with central or lateral lumbar stenosis (type of stenosis unspecified or includes both types of stenosis)
Fukusaki, Kobayashi, Hara et al., 1998116Epidural salineWalking capacity (walking distance that provoked intolerable leg pain)1 week, 1 month, and 3 months
218Epidural anesthetics
319Epidural anesthetics and steroids
Eskola, Pohjolainen, Alaranta et al., 1992120Calcitonin injectionsBack pain (0-100 visual analog scale), Walkingcapacity (maximum distance on treadmill)1, 3, 4, 6, and 12 months
219Calcitonin injections
Cuckler, Bernini, Wiesel et al., 1985120Epidural anesthetics and steroidsGlobal assessment (subjective improvement of 75% or more as judged by the patient)24 hours and 13 to 30 months
217Epidural anesthetics
Patients with degenerative spondylolisthesis
Sinaki, Lutness, Ilstrup et al., 1989129Exercise/flexionAbility to work, back pain (no pain, mild, moderate, and severe), global ssessment (patients were asked if they were recovered)3 months and 3 years
219Exercise/extension
Different methods of medical management can only be compared in trials in which patients are randomly assigned to receive the different treatments. Random assignment, if properly performed, ensures that the patients in the groups being compared are highly similar. Similar patient groups decrease the likelihood that any differences in patient outcomes between the groups are due to differences in patient characteristics, and increase the likelihood that any observed differences are due to differences in treatment efficacy. For this reason, we included only randomized controlled trials in this portion of our analysis. The requirement for this criterion had a dramatic impact on the number of articles we examined. This impact is portrayed in Table 22. This table shows that of the 4,788 studies on treatment that we identified, 178 were trials on conservative treatment, and only four of these were relevant to the question at hand. These four trials are summarized in Table 24. Not shown in this table is that three of these trials had flaws that undermine the usefulness of the data.

Eskola et al. (1992) examined the use of calcitonin injections as a treatment for lumbar spinal stenosis. This trial was previously discussed with regard to factors that may predict the success of conservative treatment. Patients received calcitonin or placebo injections every other day for 4 weeks. The injection period was followed by a 2 month washout period, and then patients crossed over to the other injection. Walking distance on a treadmill was measured before treatment and 1, 3, 4, 6, and 12 months after treatment. Walking capacity at baseline, approximated from published graphs, were the following: Group 1, 710 ± 20 m; Group 2, 550 ± 20 m (means and standard deviations). The authors do not statistically compare data across treatments at each time point, but evaluated only the changes between measurement periods. In the case of walking distance, both Group 1 patients (calcitonin then placebo) and Group 2 patients (placebo then calcitonin) showed significant increases in walking capacity compared to pretreatment values 1 month and 3 months after the first treatment. The increase in walking capacity continued through the crossover period until 6 months after the first treatment. After 6 months, both groups showed a significant decline in walking capacity so that at 12 months, Group 1 patients averaged 800 m and Group 2 patients averaged 475 m. From the data presented by the authors, no conclusions can be drawn as to whether the calcitonin, the placebo treatment, or some other aspect of the treatment process was responsible for the increase in walking capacity. All patients seem to respond during the treatment period regardless of the order in which they received calcitonin or placebo. Therefore, this trial cannot be used as evidence that calcitonin reduces the symptoms associated with lumbar spinal stenosis.

Cuckler (1985) examined the effects of epidural steroid injections on patient-perceived improvement in symptoms both at 24 hours and at 13 to 30 months later. The ability to evaluate the long-term effects of the injections was compromised by a study design that allowed all patients (placebo or steroid) to receive the steroid injection 24 hours after the first injection if symptoms did not improve. Therefore, few patients, 3 of 17 (17.6 percent), were left in the control group of this trial after 24 hours. Since this trial cannot be considered a controlled trial for evaluation of long-term effects of epidural steroid injections, we attempted no further analysis of its data.

Sinaki et al. (1989) examined the effect of two exercise regimens on patients with lumbar spondylolisthesis. Patients were randomized to different physiatrists, who then assigned the exercise regimen, resulting in 29 patients prescribed flexion exercises and 19 patients prescribed extension exercises. The assignment of exercise programs was therefore not random. Consequently, the pretreatment pain classification was very different between the patient groups. All 26 patients in the flexion group were considered to have moderate or severe pain, while in the extension group 12 patients had moderate or severe pain and 6 patients had no pain or mild pain. The lack of true randomization of the patients caused us to exclude this study from further analysis.

Table 23. Analysis of Walking Capacity Data from Fukusaki, Kobayashi, Hara et al., 1998
Patient GroupWalking Distance that Provokes Neurogenic Claudication (Meters, Mean ± Standard Deviation)Effect Size (Hedges ` d )and Confidence Limits Compared to Pretreatment Value
Pre treatment1 week1 month3 months1 week1 month3 months
Saline n = 1610 ± 823 ± 1918 ± 1311 ± 80.869 0.144 − 1.594 p = 0.0190.723 0.007 − 1.438 p =0.0480.122 −0.572 − 08155 p = 0.731
Mepivacaine n = 1811 ± 992 ± 6628 ± 2413 ± 71.681 0.921 − 2.442 p = 0.0000150.917 0.230 − 1.604 p = 0.00890.243 −0.413 − 0.898 p = 0.468
Mepivacaine plus methylprednisolone n = 199 ± 787 ± 5826 ± 2310 ± 81.849 1.089 − 2.608 p = 0.0000020.979 0.306 − 1.652 p = 0.00440.130 −0.506 − 0.767 p = 0.688
Fukusaki et al. (1998) examined the effects of epidural steroid injections on the neurogenic claudication symptoms that are common and debilitating among patients with degenerative lumbar spinal stenosis. Fifty-three patients with neurogenic claudication were randomly assigned to epidural injections of saline, 1 percent mepivacaine, or 1 percent mepivacaine plus methylprednisolone. Patient blinding to treatment was not reported. Either central or lateral lumbar spinal stenosis was demonstrated in each patient by CT and MRI. The average age in all three groups was between 69 and 72 years. At 1 week, 1 month, and 3 months, each patient's walking capacity was determined by having the patient walk until intolerable leg pain developed, an indication of neurogenic claudication. The walking distance was measured by a blinded physical therapist. Table 23 contains the original publication data and our analysis of the effect sizes. The analysis reveals that all three types of injections produced relief of symptoms at 1 week and 1 month after treatment but that symptoms returned 3 months later. The evidence in this study suggests that the local anesthetic block provided by the mepivacaine can reduce symptoms on a short-term basis. Epidural steroids offered no additional benefit on a short-term basis in treating patients with lumbar spinal stenosis (Fukusaki, Kobayashi, Hara et al., 1998).

Summary

Many conservative methods have been proposed for treating patients with lumbar spinal stenosis (Fritz, Delitto, Welch et al., 1998). Our search of the literature has uncovered only one well-designed randomized controlled trial that compared a conservative treatment to placebo treatment specifically in lumbar spinal stenosis patients. That study, Fukusaki et al. (1998), indicates that local anesthetic block provides temporary relief from neurogenic claudication for about 1 month. Conclusions about effectiveness beyond 3 months cannot be made.

Evidence for the efficacy of other conservative treatments in lumbar spinal stenosis patients is lacking. However, the lack of evidence for effectiveness does not prove that these treatments are not effective. The lack of evidence is an indication of the failure to design adequate clinical trials to show effectiveness.

Question 6What is the relationship between the signs, symptoms, and other features of the patient history and physical and the success or failure of surgical treatment?

This question addresses the possible connection between surgical success and patient characteristics. Will patients with less severe symptoms or extent of stenosis benefit to a greater or lesser degree from surgery compared to patients with more severe symptoms or extent of stenosis?

Implicit in this question is whether some patients might benefit more from surgery than from medical management. We turn to this implicit question after first addressing the relationship between patient characteristics and surgical outcomes.

Relationships Between Patient Characteristics and Surgical Outcomes

We addressed this question by performing a systematic narrative review of each study. In this review, we calculated, wherever possible, each study's effect size. The following analysis is subgrouped according to the type of lumbar spinal stenosis and the patient characteristics that were correlated with the outcomes of surgical treatment. Only studies that stratified surgical treatment outcomes by patient characteristics or used a regression analysis to compare patient characteristics to outcomes or used a regression analysis to compare patient characteristics to outcomes were considered to have evidence that could be used to answer question 6.

Table 25. Controlled Trials of Patients with Central Lumbar Spinal Stenosis
Authors and Year of PublicationType of Lumbar Spinal StenosisType of Surgical or Conservative TreatmentPatient GroupNumber of PatientsOutcomes MeasuredFollowup Period
Herno, Saari, Suomalainen et al.,1999aCentral lumbar stenosis no post-surgery stenosisMixed ecompression techniques135Back pain, leg pain, walking, global, DisabilityMean of 47 months
Central lumbar stenosis post surgery stenosisMixed decompression techniques257
Herno, Partanen, Talaslahti et al., 1999bCentral lumbar stenosis post surgery stenosisStandard wide decompressive laminectomy141Walking, global, disabilityBetween 113 and 157 months
Central lumbar stenosis no post surgery stenosisStandard wide decompressive laminectomy215
Hanakita, Suwa, and Mizuno, 1999Central lumbar stenosis younger than 64 yearsStandard wide decompressive laminectomy159Back pain, leg pain, walking, globalBetween 12 and 96 months
Central lumbar stenosis older than 64 yearsStandard wide decompressive laminectomy261
Central lumbar stenosisPartial laminectomy or hemilaminectomy316
Central lumbar stenosisSWDL with fusion (arthrodesis)420
Thomas, Rea, Pikul et al., 1997Central lumbar stenosisStandard wide decompressive laminectomy112WalkingPretreatment and 24 to 62 months
Central lumbar stenosisLaminotomy214
Yone, Sakou, Kawauchi et al., 1996Central lumbar stenosisSWDL with fusion and istrumentation110Back pain, leg pain, walking, globalBetween 24 and 68 months
Central lumbar stenosisLaminotomy217
Grob, Humke, and Dvorak, 1995Central lumbar stenosisPartial laminectomy or hemilaminectomy115Back pain, back pain relief, leg pain relief, walking, globalPretreatment and 24 to 32 months
Central lumbar stenosis single levelPartial laminectomy with fusion and instrumentation215
Central lumbar stenosis multiple segmentsPartial laminectomy with fusion and instrumentation315
Johnsson, Uden, and Rosen, 1991Central lumbar stenosisConservative-not described120Back pain, walking, global, workBetween 7 and 51 months
Central lumbar stenosis moderate stenosisStandard wide decompressive laminectomy230
Central lumbar stenosis severe stenosisStandard wide decompressive laminectomy314
Ray, 1982Central lumbar stenosisStandard wide decompressive laminectomy148GlobalMean of 10 and 13 months
Central lumbar stenosisPartial laminectomy or hemilaminectomy217
Surin, Hedelin, and Smith, 1982Central lumbar stenosis marked stenosisStandard wide decompressive laminectomy115GlobalBetween 14 and 70 months
Central lumbar stenosis moderate stenosisStandard wide decompressive laminectomy27
Tajima, Fukazawa, and Ishio, 1980Central lumbar stenosisMixed decompression techniques113GlobalUnkown post operative time
Central lumbar stenosis and disk lesionMixed decompression techniques214
Herno et al. (Herno, Saari, Suomalainen et al., 1999), Herno et al. (Herno, Partanen, Talaslahti et al., 1999), Hanakita et al. (Hanakita, Suwa, and Mizuno, 1999), Johnsson et al. (Johnsson, Uden, and Rosen, 1991), Surin et al. (Surin, Hedelin, and Smith, 1982), and Tajima et al. (Tajima, Fukazawa, and Ishio, 1980) stratified the reporting of surgical treatment outcomes by patient characteristics (see Table 25).

In addition to these studies, the prospective single arm surgical trials of Katz et al. (Katz, Stucki, Lipson et al., 1999) and Jonsson et al. (Jonsson, Annertz, Sjoberg et al., 1997), and the retrospective single arm surgical trial of Thomas et al. (Thomas, Rea, Pikul et al., 1997) also examined the relationship between patient characteristics and surgical outcome.

Effect of Extent of Stenosis on Surgical Outcomes Among Patients With Central Stenosis

Table 26. Analysis of Walking Capacity in Meters from Johnsson, Uden, and Rosen, 1991
Patient GroupPretreatment Mean ± SDAt Followup Mean ± SDMean Difference at FollowupHedges' d at Followup95% Confidence Limits at Followup
Severe stenosis; n = 14120±1142,373±4,1422,2530.747−0.020 1.513
Moderate stenosis; n = 30186±2031,453±2,9351,2670.6010.080 1.120
Mean Difference between Patient Groups66920
Hedges' d between Patient Groups−0.3600.269
95% Confidence Limits between Patient Groups−0.998 0.279−0.368 0.906
Table 27. Analysis of Walking Capacity According to a Three-Level Scale from Johnsson, Uden, and Rosen, 1991
Patient GroupWalking CapacityEffect Size at FollowupConfidence Limits at Followup
WorseUnchangedImproved
Severe stenosis; n = 1430110.6360.406 0.998
Moderate stenosis; n = 307815
Johnsson et al. (1991) stratified patients receiving standard wide laminectomy according to the extent of pretreatment stenosis. A partial block at myelography indicated moderate stenosis (30 patients, mean and standard deviation of the AP diameter was 7.9 ±2.3 mm), and a total block indicated severe stenosis (14 patients, diameter = 0). The mean ages and their standard deviations were 62 ±9 years in the moderate group and 69 ±8 years in the severe group. Walking capacity was evaluated at a mean followup time of 50 ±32 months in the moderate group and 58 ±32 months in the severe group. The original walking data and our calculations of effect size are presented in Table 26. At baseline, the walking capacity was similar between groups, and both groups showed large increases in walking capacity at followup (>1,200 m). However, only the moderate stenosis group showed a statistically significant effect size when pre- and posttreatment values were compared. The large standard deviations at followup in both patient groups, approximately twice the mean, indicate a large variation in individual improvement in walking capacity. Our analysis of walking capacity based on the three level scale data reported by Johnsson et al. (1991) (worse, unchanged, improved) indicated only a small effect in favor of the severe stenosis group (Hedges' d effect size = .636, confidence levels of 0.406 and 0.998 (see Table 27). The data from this trial seem to suggest that a select group of patients benefited from surgery more than others. However, the degree of stenosis did not seem to determine who these patients would be, and insufficient information is presented in the trial to determine which patients might make up the group that benefits.

Surin et al. (1982) also stratified outcomes by degree of stenosis, but the moderate stenosis group (AP diameter between 11 and 14 mm) contained only seven patients, while the marked stenosis group (<10 mm) contained 15 patients. The sample size < 10 in the moderate stenosis group prevents a reliable comparison of the data between patient groups. The average followup examination after surgery was 29 months, but the range was 14 to 70 months. The small sample size and the large range in followup times prevent the use of data from this study in our analysis.

Herno et al. (1999a) and Herno et al. (1999b) retrospectively examined surgical outcomes based on the presence of stenosis at postsurgical followup. Herno et al. (1999a) reported on patients treated surgically for the first time between 1982 and 1984. Herno et al. (1999b) reported on patients who underwent surgery between 1985 and 1987. The stenosis was found at the original surgical site or an adjacent vertebral level. The mean patient ages were between 50 and 55 years at the time of surgery. Since these are retrospective trials, no pretreatment scores were reported for the outcome measures. Without baseline scores, the extent of patient disability before treatment cannot be judged or compared to patients in other studies. In addition, the degree of improvement over baseline conditions cannot be determined. This makes the results of these studies difficult to interpret. Baseline clinical features were reported. In Herno et al. (1999b) the nonstenosis group (15 patients) had a significantly greater baseline mean cross-sectional area of the dural sac compared to the stenosis group (41 patients) (145 mm2 vs. 95 mm2, p <0.0001, Mann-Whitney test). Laminectomy, extended laterally to decompress the nerve roots, was performed on all patients. After a mean followup time of 10.2 and 11.1 years for the nonstenosis and stenosis groups, respectively, no statistically significant differences were found in the mean Oswestry score (a measure of back pain disability) (28.7 v. 31.2, Kruskal-Wallis test) and walking capacity (15 minutes on a treadmill) (515 m v. 470 m, Kruskal-Wallis test).

Herno et al. (1999a) reported similar baseline clinical characteristics among the 35 patients with no stenosis postsurgery and the 57 patients with postsurgery stenosis. After a mean followup time of 3.5 years, no statistically significant differences were found in the mean Oswestry score (28.4 vs. 26.4, p = 0.755, Kruskal-Wallis test) and walking capacity (706 m vs. 602 m, p = 0.178, Kruskal-Wallis test). In these two studies, the presence of postsurgery stenosis was not correlated with surgical outcomes as measured by disability from back pain and walking capacity at the time of followup. The lack of pretreatment scores prevents any assessment of the actual effect of surgery on these patient groups, and the study does not provide evidence for a correlation between patient characteristics and the success or failure of surgery.

Jonsson et al. (1997) used a logistic regression analysis to correlate patient characteristics with favorable outcomes. In 105 patients with an average age of 65 years (range of 37-83 years), patients with an AP diameter of <6 mm had the most favorable outcomes at 5 years after surgery. No statistical information (p values, correlation coefficients) was reported to support these conclusions.

Effect of Age on Surgical Outcomes Among Patients With Central Lumbar Stenosis

Hanakita et al. (1999) retrospectively stratified by age (<64 years and >65 years at the time of surgery) patients who received laminectomy only, partial hemilaminectomy, or laminectomy plus fusion. A sufficient number of patients were available in the laminectomy-only group to evaluate the effect of age on surgical outcome (59 patients younger than 65 and 61 patients equal to or older than 65). However, the followup time was not specifically reported and could have been between one and eight years for any patient. During the followup period, 12 patients in the younger group were lost to followup (20 percent) and 17 patients in the older group were lost to followup (28 percent). Based on patient evaluation of surgical outcome, 77 percent of the younger group and 66 percent of the older group were considered "cured" or "better" as compared to "unchanged" or "worse" according to the rating scale used by the authors. Our computed effect sizes for this change yielded a Hedges' d of 0.054 with confidence limits of −0.155 and 0.263, which is not statistically significant. If dropouts are considered failures and put in the "worse" category, the analysis still showed no difference between the groups but shifted somewhat in favor of the younger group (Hedges' d = 0.349 with confidence limits of −0.011 and 0.710). This study suggests that surgical success based on patient evaluation of outcome ("I'm better than I was") is not affected by age. This is not the same as saying that surgery benefited both age groups to the same extent. The lack of pretreatment assessment scores describing patient condition before surgery prevents the assessment of actual surgical benefit. The older age group may regard "cured" or "better" as a small improvement, while the younger age group may require a large improvement before they consider themselves "cured" or "better." Assessment of changes (differences in pre- and posttreatment scores) in walking capacity, pain, disability, and activities of daily living are needed for an actual determination of surgical benefit in the two age groups.

Effect of Herniated Disks on Surgical Outcomes Among Patients With Central Lumbar Stenosis

Tajima et al. (1980) divided lumbar spinal stenosis patients into two groups: those with and without accompanying herniated disks. However, the two groups are not readily comparable because of differences in surgical methods with the groups. The stenosis-only group contained 11 patients who received standard wide laminectomy and two patients who received laminectomy plus fusion. The stenosis-and-disk group contained six patients who received laminectomy only, five patients who received laminectomy plus fusion, and three patients in whom "Love's method" was used but not described. The data in this study cannot be used to address the effect of concurrent herniated disk on surgical outcome in lumbar spinal stenosis patients because differences in outcomes between the groups may be due to differences in surgical methods as well as the presence of a herniated disk.

Effect of Patient Health and Comorbidity on Surgical Outcomes Among Patients With Central Lumbar Stenosis

Katz et al. (1999) looked for predictors of surgical outcome in 199 patients two years after surgery. These patients received either standard decompression (138), decompression with fusion (31), or decompression with fusion and instrumentation (30). The physical examination and radiographic variables were not associated with outcomes. The best predictors of symptom severity and walking capacity after surgery were the patients ratings of their own health and the presence of cardiovascular comorbidity. Better walking capacity, better mental health, decompression with fusion, and higher income had borderline significance with symptom severity and walking capacity outcomes. However, these variables combined accounted for only 33 percent of the variation in walking capacity outcome and 27 percent of symptom severity outcome.

Jonsson et al. (1997) used a logistic regression analysis to correlate patient characteristics with favorable outcomes. At 5 years after surgery, patients with comorbid disorders affecting walking ability fared significantly worse than those without these comorbid conditions. No statistical information (p values, correlation coefficients) was reported to support these conclusions.

Thomas et al. (1997) retrospectively examined 26 patients who received either laminectomy or laminotomy in order to determine which patient characteristics were related to patient outcomes. The average age was 68 years (range: 40 to 86). The authors noted that among the patients with poor outcomes there was a multiplicity of other diseases compared to those patients with good outcomes. No statistical analysis was presented. With only 26 patients, this study may have too few patients from which to predict outcomes based on patient characteristics.

Yone et al. (1996) and Ray (1982) both retrospectively examined patients who received different surgical procedures, but did not stratify outcomes by patient characteristics. Grob et al. (1995) randomized patients to different treatments, but did not report outcomes based on patient characteristics. Therefore, these studies cannot be used to judge if patients with less severe symptoms benefit from surgery as well as patients with more severe symptoms.

Effect of Herniated Disks on Surgical Outcomes Among Patients With Lateral Lumbar Stenosis

Table 29. Analysis of Global Success Data from Kirkaldy-Willis, Wedge, Yong-Hing et al., 1982
Patient GroupGlobal Success at FollowupHedges' d Effect SizeConfidence Intervals
Greatly ImprovedSlightly ImprovedNo ImprovementWorsening
Lateral stenosis and Lateral and central stenosis patients; n = 33149100−0.600 p = 0.006−1.028 −0.171
Lateral stenosis plus herniated disk and lateral and central stenosis plus herniated disk patients; n = 4434721

Calculation of an effect size from dichotomous data was performed using an odds ratio and the natural log transformation. To calculate the odds ratio, the number of patients in the greatly improved group were considered successes and the number of patients in the remaining three categories were combined and considered unsuccessful.

The study by Kirkaldy-Willis et al. (1982) was a retrospective case series in which patients were selected if they had one of the following four conditions: lateral stenosis, lateral stenosis with disk herniation, lateral and central stenosis, and lateral and central stenosis with disk herniation. The average age for the entire group of patients was 46 years (range of 41 to 52 years). Between 12 and 120 months after partial laminectomy, these patients were asked to assess the degree of improvement in their condition. We analyzed these categorical data to determine if disk herniation in combination with lateral stenosis was detrimental to patient recovery compared to lateral stenosis alone (see Table 29). Based on patient reporting of general improvement after surgery, the patients with herniated disks in addition to stenosis experienced greater improvement (Hedges' d = −0.600, confidence intervals of −1.028 to −0.171). Improvement after surgery is based on the patient's perception of their pain and disability before treatment. The patients with herniated disks may have experienced more pain or disability before surgery, but in this retrospective study, no pretreatment measures of pain or disability are provided to judge each group's relative condition before surgery. Therefore, this study does not help us in determining if the pretreatment condition of patients with lateral lumbar stenosis influences their response to surgery.

Effect of Patient Characteristics on Surgical Outcomes Among Patients With Degenerative Spondylolisthesis

Table 30. Controlled Trials of Patients with Degenerative Spondylolisthesis
Authors and Year of PublicationType of Lumbar Spinal Stenosis DisorderType of Surgical or Conservative Treatment Received by Patient GroupPatient GroupNumber of PatientsOutcomes MeasuredFollowup Period
Plotz and Benini, 1998Degenerative spondylolisthesisDecompressive surgery without fusion117Back pain relief, leg pain relief, global success9 to 120 months
Degenerative spondylolisthesisDecompressive surgery with fusion and translaminar screw fixation218
Degenerative spondylolisthesisDecompressive surgery with fusion and AO internal fixator371
Fischgrund, Mackay, Herkowitz et al., 1997Degenerative spondylolisthesisSWDL with fusionand instrumentation140Back pain, Leg pain, global success24 to 36 months
Degenerative spondylolisthesisSWDL with fusion (arthrodesis)235
Thomsen, Christensen, Eiskjaer et al., 1997Degenerative spondylolisthesisPartial laminectomy with fusion and instrumentation120Quality of life, mental status, Physical function and activities of daily living12 and 24 months
Degenerative spondylolisthesisPartial laminectomy and fusion221
Yuan, Garfin, Dickman et al., 1994Degenerative spondylolisthesisFusion and pedical screw fixation12177Back pain relief, leg pain relief, physical function and activities of daily living23 to 51 months
Degenerative spondylolisthesisFusion2456
Bridwell, Sedgewick, O'Brien et al., 1993Degenerative spondylolisthesisPartial laminectomy or hemilaminectomy19Walking34 to 45 months
Degenerative spondylolisthesisPartial laminectomy and fusion210
Degenerative spondylolisthesisPartial laminectomy with fusion and instrumentation324
Satomi, Hirabayashi, Toyama et al., 1992Degenerative spondylolisthesisFusion and instrumentation127Back pain, Leg pain, Walking, Global successapproximately 36 months
Degenerative spondylolisthesisMixed decompression techniques214
Herkowitz and Kurz, 1991Degenerative spondylolisthesisStandard wide decompressive laminectomy125Back pain, leg pain, global success29 to 48 months
Degenerative spondylolisthesisSWDL with fusion (arthrodesis)225
Lombardi, Wiltse, Reynolds et al., 1985Degenerative spondylolisthesisStandard wide decompressive laminectomy120Global success24 to 84 months
Degenerative spondylolisthesisSWDL with fusion (arthrodesis)221
Fitzgerald and Newman, 1976Degenerative spondylolisthesisRigid brace129Global success6 to 216 months
Degenerative spondylolisthesisMixed decompression techniques214
Rosenberg, 1976Degenerative spondylolisthesisPartial laminectomy or hemilaminectomy111Global successSoon after treatment to 120 months
Degenerative spondylolisthesisStandard wide decompressive laminectomy215
Degenerative spondylolisthesisConservative-not described3170
Four randomized controlled trials and six controlled trials examined surgical treatment of degenerative spondylolisthesis (see Table 30). The randomized controlled trial of Thomsen et al. (1997) included patients with isthmic spondylolisthesis, secondary degenerative spondylolisthesis, and primary degenerative spondylolisthesis, and reported surgical outcomes separately for each of these groups. However, when reporting on surgical outcomes according to patient characteristics, such as duration of symptoms and number of levels fused, all three types of spondylolisthesis were combined. Therefore, this study does not report evidence specific to primary degenerative spondylolisthesis to address the question of patient characteristics and benefit from surgery. The randomized controlled trials of Fischgrund et al. (1997), Bridwell et al. (1993), and Herkowitz and Kurz (1991) did not stratify or analyze data based on patient characteristics. In the controlled trials of Plotz and Benini (1998), Yuan et al. (1994), Satomi et al. (1992), Lombardi et al. (1985), Fitzgerald and Newman (1976), and Rosenberg (1976) data within the surgical treatment groups was not analyzed or reported in relation to patient characteristics. Therefore, these studies cannot be used to address the connection between patient characteristics and surgical outcome.

Summary

Poor study quality, especially the lack of pretreatment measurements of patient condition, reduce the usefulness of available data to answer the question of whether patient signs, symptoms, and other characteristics determine the success of surgery for degenerative lumbar stenosis. The few studies that do stratify outcomes by patient characteristics, especially those that examined degree of stenosis, did not find a connection between successful treatment and specific patient characteristics. The data from these trials seem to suggest that a select group of patients benefited from surgery; however, insufficient information is presented in the trials to determine which patients make up this group. Regression analysis from two studies suggests that patients in poor health due to comorbidity may have inferior outcomes after surgery compared to healthier patients.

Table 28. Controlled Trials of Patients with Lateral Lumbar Spinal Stenosis
Authors and Year of PublicationType of Lumbar Spinal Stenosis DisorderType of Surgical or Conservative Treatment Received by Patient GroupPatient GroupNumber of PatientsOutcomes MeasuredFollowup Period
Lee and deBari, 1986Lateral lumbar stenosisSWDL with fusion (arthrodesis)112Global successBetween 12 and 72 months
Lateral lumbar stenosisSWDL with fusion and instrumentation212
Kirkaldy-Willis, Wedge, Yong-Hing et al., 1982Lateral lumbar stenosisPartial laminectomy or hemilaminectomy120Leg pain, global success, ability to workBetween 12 and 120 months
Lateral lumbar stenosis with disk herniationPartial laminectomy or hemilaminectomy232
Lateral and central lumbar stenosisPartial laminectomy or hemilaminectomy321
Lateral and central lumbar stenosis with disk herniationPartial laminectomy or hemilaminectomy421
Mikhael, Ciric, Tarkington et al., 1981Lateral lumbar stenosisStandard wide decompressive laminectomy135Global successBetween 12 and 40 months
Healthy controlControl/placebo/none250
One randomized controlled trial and two controlled trials address the efficacy of surgical treatments for patients with lateral lumbar spinal stenosis (see Table 28). The randomized controlled trial by Lee and deBari (1986) and the retrospective trial by Mikhael et al. (1981) did not stratify results by patient characteristics. Lee and deBari (1986) provides very little patient data beyond that of the mean age of the patients. The study by Mikhael et al. (1981) was intended to provide a neuroradiological evaluation of patients with lateral stenosis and asymptomatic patients and therefore only has one surgery group that is not stratified by patient characteristics. Therefore, this study cannot be used to address the connection between patient characteristics and surgical outcome.

Comparison of Surgical Versus Conservative Treatment

Table 31. Trials Comparing Surgery and Conservative Therapy
Authors and Year of PublicationStudy DesignPatient SelectionMatch Conservative versus SurgeryTreatmentnComments
Amundsen T, Weber H, Nordal HJ et al., 2000Patients were diagnosed as having severe, moderate, or mild symptoms; severe patients received surgery, mild patients received conservative treatment, and moderate patients were randomized to conservative or surgical treatmentSciatic pain in the leg (s), with or without back pain; radiologic signs of stenosis and compression of the clinically afflicted nerve roots; no bulging or herniated disk or previous surgery of the back
Age:
21-40:2
41-60:9
61-70:6
>71:1
Sex:
39% M
Severity:
100%
moderate
Pain:
light:0
moderate:5
severe:13
Age:
21-40:2
41-60:7
61-70:4
>71:0
Sex:
69% M
Severity:
100%
moderate
Pain:
light:0
moderate:5
severe:13
Nonsurgery brace, rehabilitation, physiotherapy,18Only study to randomly assign patients to either conservative or surgical treatment
Laminectomy: no fusions13
Mariconda, Zanforlino, Celestino et al., 2000Unmatched observational comparison within cohort: all indicated for surgery; nonsurgery, 3 had contraindicating comorbidity, 14 refused surgery (appears to correlate with milder disease)age >40, radicular pain, central stenosis (degenerative or combined), dural area <130mm2, indicated for surgery, no previous back surgeryStraight leg-raise:Nonsurgery17Evaluator blinded for initial exam, questionnaire for followup exams; no dropouts reported Beaujon score (0 = worst, 20 = normal): neurogenic claudication, leg pain at rest, leg pain at exertion, low back pain, neurological deficit (motor or sensor), medications, QOL
47%45%Laminectomy; diskectomy if necessary; no fusions or instruments20
Neurological deficits:
47%70%
Stenosis:
78 mm268 mm2
Multiple levels:
65%70%
Initial Beaujon score:
11 (±2.4 SD)8.1 (±2.7)
Hurri, Slatis, Soini et al., 1998(1) Unmatched comparison of 57 patients. who had surgery and 18 treated conservatively for unreported reasons (2) comparison of Oswestry means adjusted (general linear model) for age, sex, body mass, severity of stenosis (radiographic)Myelography diagnosis of LSS (<11 mm diameter) in years 1978-82 (134 patients) and able to be traced and interviewed by phone (86 still alive, 75 interviewed)Age:Conservative18Oswestry Disability Index (0 = normal, 50 = worst): pain, personal care, lifting, walking, sitting, standing, sleeping, sex life, social life, traveling
Mean 47Mean 50Majority got laminectomy, some disk surgery, some foraminotomy, no fusions57
Sex:
66% M56% M
Severe (<7.0 mm):
33%46%
Sciatic (knee or below):82%
72%
Neurological deficit:67%
44%21%
Previous surgery:
22%26%
Early retirement:
44%
Swezey, 1996(1) Unmatched comparison; only the 23% (11/47) not improved by conservative treatment were treated surgically (2)Crossover with repeated measures: 11 patients who had unsatisfactory improvement with conservative therapy during first period were given surgery during second period47 consecutive patients in 1986-7, 43 with >mild LSS by CT or MRI; 18 also had spondylolisthesis; 4 had no CT or MRI but had spondylolisthesis by x-ray; all available for phone interview, and 31 examinedConservative treatment improved 56%, left 39% unchanged, 5% worseNone improved by conservative treatment; "greater preponderance" of moderate (7/11) or severe (4/11) neurogenic claudicationConservative: instruction in ergonomics and flexion exercises, analgesics, NSAIDs; 23% had traction, 28% had epidural corticosteroid injections36
Conservative then surgery: all had wide laminectomies, 1 also had foraminectomy, 1 also had fusion11
Herno, Airaksinen, Saari et al., 1996Retrospective matched-pair57 LSS patients from 1980-87 who had myelography and no surgery; from 310 not previously operated on patients with LSS and myelography, 54 were manually matchedMatched on age, sex, myelographic findings (block, subtotal block, AP diameter <10 mm, AP diameter 10-12 mm), major symptoms (claudication, leg pain, mixed), and duration of symptomsMatched on age, sex, myelographic findings (block, subtotal block, AP diameter <10 mm, AP diameter 10-12 mm), major symptoms (claudication, leg pain, mixed), and duration of symptomsConservative (not systematic)54Patients not operated on for these reasons: 69% pain was bearable, 19% refused surgery, 5.6% were retired, 9% other reasons, 4% insufficient data
Surgery: standard wide laminectomy with partial or whole facetectomy if needed54
Atlas, Deyo, Keller et al., 1996Prospective unmatched observational comparison within cohortLSS by neurogenic claudication and radiographic findings of stenosis, including herniated disks; at least 2 weeks of unsatisfactory improvement with conservative treatment; no prior lumbar spinal surgery, cauda equina syndrome, developmental deformity, fractures, infection, tumor, inflammation, pregnancy; age >17Straight leg positive:Conservative: various types58Commonly known as the Maine Lumbar Spine Study 4-year followup outcomes were recently published in Atlas, Keller, Robson et al., 2000
12%28%Surgery: 88% laminectomy, 9% diskectomy only, 4% fusion and laminectomy72
Radiographic normal or mild:
33%18%
Radiographic severe:28%
10%
Extreme low back pain:59%
39%79%
Extreme leg pain:
32%20%
Disabled, in bed:
11%
Nagler and Bodack, 1993Retrospective unmatched observational comparison within cohort; patients receiving surgery were a "select group with fewer concomitant medical problems,"(i.e., a de facto pseudo-randomization to the extent comorbidities are independent)100 patients with LSS symptoms (pain, numbness, weakness in back and legs that was worse with standing or walking and was relieved by sitting or lying) and positive CT and/or myelogram; not predominantly disk caused; 80 able to be interviewed by phone at 1 yearAge:Conservative: combinations of analgesics, electrical stimulation, ice, heat, hydrotherapy, ultrasound, muscle relaxation techniques, stretching and strengthening exercises, treadmill training41
56.4 mean (range 26 to 82)57.4 mean (range 28 to 78)Surgery: laminectomy, with or without foraminectomy39
More comorbiditiesFewer comorbidities
(Johnsson, Uden, and Rosen, 1991)Unmatched (retrospective?) observational comparison within cohort; 2 untreated patients had contraindicating cardiovascular disease; 17 refused surgeryPatients with <12 mm diameter on myelography but not total block, no previous spinal surgery, no impaired circulation in legs (i.e., any claudication was neurogenic)Age:Untreated (details not reported)19
60±9 (range 42 to 80)62±9 (range 45 to 78)Surgery: laminectomy and facetectomy; no fusion30
Symptom duration:
22±24 months (range 4 to 96 months)3±35 months (range 4 to 96 months)
Observation after myelography:
31±12 months (range 7 to 51 months)50±32 months (range 5 to 109 months)
Neurogenic Claudication:
84% (16/19)77% (23/30)
Radicular pain:
11% (2/19)17% (5/30)
Mixed symptoms:
5% (1/19)7% (2/30)
AP diameter (myelography):
8.6±1.7 SD7.9 ±2.3 SD
Walking capacity:
1,355 (±3,081 SD)186 (±203 SD)
In this part of our analysis, we turn to the question that is implicit in our primary question, "Do some patients benefit more from surgery than from medical management?" We located eight studies that examined patients with central and/or lateral stenosis who received nonsurgical or surgical treatment (see Table 31). For each of these studies, we reviewed the design and results to determine if these studies provided reliable evidence to answer this question. In this process, we calculated, wherever possible, each study's effect size. A critical requirement in any study trying to address this question is having comparable patients in both the nonsurgery and surgery groups. Only one of the studies is a randomized controlled trial, so potential bias may exist in how each study assigned patients to treatment groups. Therefore, within each study we will determine if the patient groups are comparable in terms of pretreatment signs, symptoms, and measurements used to assess posttreatment success. These trials do not provide evidence for the effectiveness of any one conservative treatment because multiple types of conservative treatments were used in each nonsurgery patient group.

Johnsson et al. (1991) compared 44 patients treated surgically for lumbar spinal stenosis with 20 patients not surgically treated. The nonsurgery patients refused surgery (19 patients) or the anesthesiologist refused to administer anesthesia because of advanced cardiovascular disease (2 patients). To obtain comparable groups for analysis, one patient in the nonsurgery group with severe stenosis was removed from the analysis, and the surgery patients were divided into severe stenosis (30 patients) and moderate stenosis (14 patients). Thus, we are concerned with the comparison of the conservatively treated moderate-stenosis group with the surgically treated moderate-stenosis group. The smallest AP diameter had a mean and standard deviation of 7.9 ±2.3 mm in the moderate-stenosis surgery group and 8.6 ±1.7 mm in the nonsurgery group. The mean ages were 60, 62, and 69 years for the nonsurgery, moderate-stenosis, and severe-stenosis groups, respectively. Neurogenic claudication was diagnosed before treatment in 84 percent, 77 percent, and 86 percent of patients in the nonsurgery, moderate-stenosis, and severe-stenosis groups, respectively. The signs and symptoms of lumbar spinal stenosis and the extent of stenosis seen by myelography are the only evidence reported that indicate that the nonsurgery and moderate-stenosis groups are comparable. Followup exams were performed, on average, 31 months after treatment in the nonsurgery group and 50 months after treatment in the moderate-stenosis surgery group.

Table 32. Analysis of Neurogenic Claudication Comparing Nonsurgical Patient to Surgical Patients from Johnsson, Uden, and Rosen, 1991
Patient GroupNumber of Patients with Neurogenic Claudication Before TreatmentNumber of Patients with Neurogenic Claudication at FollowupOdds Ratio and 95% Confidence LimitsHedges' d and 95% Confidence Limits
Moderate stenosis; n = 3023111.76 0.50 to 6.22 p = 0.8580.323 −0.395 to 1.040 p = 0.378
No surgery; n = 191610
Table 33. Analysis of Walking Capacity in Meters Comparing Nonsurgical Patient to Surgical Patients from Johnsson, Uden, and Rosen, 1991
Patient GroupPretreatment Mean ± SDAt Followup Mean ± SDMean Difference after TreatmentHedges' d95% Confidence Limits
Moderate stenosis; n = 30186 ± 2031,453 ± 2,9351,267−0.601 p = 0.023−1.119 −0.084
No surgery; n = 191,355 ± 3,0812,342 ± 4,069987−0.268 p = 0.411−0.906 0.371
Mean Difference between Patient Groups1,169889
Hedges' d−0.599 p = 0.045−0.256 p = 0.384
95% Confidence Limits−1.186 −0.012−0.833 0.321
The original data form Johnsson et al. (1991) on the occurrence of neurogenic claudication and on walking capacity, plus our analysis of these data, are contained in Table 32 and Table 33. The data on neurogenic claudication indicate that the frequency of this symptom declined to the same extent in both groups. The chances of recovery from neurogenic claudication were equal between groups (see Table 32). The analysis of the walking data is complicated by the significantly better capacity of the nonsurgery group before treatment (see Table 33). The nonsurgery group could, on average, walk 1,169 m more before treatment than the moderate-stenosis group. After surgery, the two groups had comparable walking capacity. This study seems to indicate that patients with moderate spinal stenosis can recover from some of the symptoms associated with lumbar spinal stenosis. However, this evidence is weakened by the apparent differences between patients in the nonsurgery and surgery groups in initial walking capacity. The authors have pointed out that this difference in walking capacity may indicate that symptoms were more severe in the patients treated surgically.

Nagler and Bodak (1993) retrospectively analyzed the recovery from symptoms of 41 conservatively treated patients and 39 patients who received laminectomies. The degree of improvement, based on a ratio of the original symptoms to those at one year, was similar between groups (54 percent of conservative-treaatment patients and 56 percent of surgery patients had >50 percent improvement; X2 = 0.06114, p = 0.804689, according to our calculations). The actual numbers of patients with each symptom before and after treatment were not reported. Without these data, we cannot determine the comparability of the patient groups before treatment. The authors suggest that the surgery patients represent a selected group with fewer concomitant medical problems. Therefore, the data in this study cannot be used to conclusively determine if patients with comparable levels of lumbar spinal stenosis do better with conservative treatment or surgery.

A recently published randomized controlled trial does provide a comparison of a nonsurgical control group with a surgery group with the same baseline patient characteristics (Amundsen T, Weber H, Nordal HJ et al., 2000). In this study, randomization was considered ethical only among patients whose severity of pain did not indicate to the physician that either surgery or conservative therapy was the indicated treatment. The physicians selected surgery for 19 patients with severe symptoms and conservative treatment for 50 patients with mild symptoms. The remaining 31 patients were randomized: 13 patients received surgery (laminectomy without fusion) and 18 patients received conservative therapy (a brace and rehabilitation for one month, followed by an additional 2 months with the brace, and physiotherapy).

Table 35. Examiner Assessment Data from Surgically and Conservatively Treated Patients with Lumbar Spinal Stenosis from Amundsen T, Weber H, Nordal HJ et al., 2000
Patient GroupTime Post TreatmentExaminer Assessment of Patient Condition Following Treatment
ExcellentFairUnchangedWorse
Surgical treatment Nonrandomized n = 19, 2 dropouts by the 10th year6 month7822
1 year9820
4 year11521
10 year7541
Conservative treatment Nonrandomized n = 50, 9 dropouts and 10 surgery by the 10th year6 month1817510
1 year1814612
4 year1711814
10 year158621
Surgical treatment Randomized n = 13, 2 dropouts by the 10th year6 month4810
1 year5413
4 year8311
10 year5501
Conservative treatment Randomized n = 18, 1 dropout and 9 surgery by the 10th year6 month34110
1 year33210
4 year26010
10 year35010

The numbers of patients in each category were extracted from histograms published in Amundsen et al. (2000). Any conservatively treated patient who was crossed over to surgery was considered a treatment failure and placed in the worse category for conservative treatment.

At followup times of 6 months, 1 year, 4 years, and 10 years, patients were evaluated for pain, working ability, assessment of own condition, walking ability, and physical activity. The examining physician used these findings to determine if a patient's clinical status was "excellent," "fair," "unchanged," or "worse" than at the start of the trial. These categorical data are presented in Table 35. Within 3 to 27 months of entering the study (median 3.5 months), 10 of the 18 conservatively treated moderate-symptom patients were crossed over to surgery (56 percent). Among the patients with moderate symptoms and randomized to treatment, a higher percentage who received surgery were rated good (excellent and fair category combined) at each of the followup periods than those who received conservative therapy (6 months: 92 percent v 39 percent; 12 months: 69 percent vs. 33 percent; 4 years: 85 percent vs. 44 percent; 10 years: 77 percent vs. 44 percent). This finding may be an artifact of the study design.

The trial by Amundsen et al. (2000) represents an attempt to randomize patient treatment between conservative and surgical approaches and thereby resolve an important clinical question. Will lumbar spinal stenosis patients with moderate symptoms benefit more from surgery or from conservative treatment? The data as presented by the authors suggest that surgery may be more beneficial. However, several design and reporting problems reduce the strength of this conclusion. Even the authors acknowledged that "because the situation was observational, the existence of hidden confounders and selection bias made it mandatory for the authors to be descriptive and noninferential." Our assessment of these potential hidden confounders and selection bias follows.

The criteria for assignment to the mild-, moderate-, or severe-symptom groups are not clearly stated. The authors assert that intensity of patient pain was the most important reason for selection to the severe-symptom group which received immediate surgery. However, several pieces of evidence indicate that physicians may have underrated the pain and severity of condition in some patients, resulting in these patients' inclusion in the moderate group as opposed to the severe group. First, the median time lag to crossover to surgery was 3.5 months, with a range of 3 to 27 months. This means that perhaps half of the conservatively treated moderate patients switched to severe patients within 3 months of entering the trial. If the original diagnoses had been correct, one would not have expected that half of the crossovers would have occurred by 3.5 months. Individual or subgroup data on when crossovers occurred were not reported.

Second, while 14 of 19 severe-symptom patients (74 percent) reported having severe pain at the start of the trial, 20 of 31 moderate-symptom patients (65 percent) and 24 of 50 mild-symptom patients (48 percent) also reported severe pain at the start of the trial. This indicates that severity of pain was not the only determining factor in how the physicians allocated patients to the treatment groups. Supporting this notion is that larger proportions of older patients appear in the mild-symptom group than in the other two groups. Nine of the 12 patients (75 percent) older than 71 years appear in the mild group (moderate: 8 percent, severe: 17 percent). Older patients may have been selected for conservative treatment because they would be expected to have a poorer prognosis after surgery. Although this is appropriate clinical judgment, this allocation introduces a bias into the trial that could favor surgery.

Third, there appears to be a difference between how physicians and patients rated symptoms. This difference manifests itself as an underrating by the physician. Thus, the authors report of the agreement between physician and patient yields a kappa statistic of 0.59 at 6 months and smaller kappas at later followup periods. This degree of agreement between patients, and examiner is moderate to small, with patients reporting more in the worse category than examiners (6 vs. 3). Such underrating of patient pain by medical personnel has also been observed in other studies (Choiniere, Melzack, Girard et al., 1990; Daniel, Long, Murphy/Kores et al., 1983). Because of physician underrating, patients may have been assigned to the randomized (moderate) group when they should have been assigned to the surgical (severe) group. If randomized to conservative treatment, these patients would be more likely to have unsuccessful results and be crossed over to surgery. The effect of misclassification is to artifactually reduce the reported effectiveness rate of conservative treatment, because patients who are crossed over are considered failures by intent-to-treat analysis. Misclassification increases the apparent difference in effectiveness between surgery and conservative therapy. Further, the size of this apparent difference increases as more patients are misclassified into the moderate group.

We can make one of two conclusions from our analysis of the patients in the Amundsen et al. (2000) trial. First, surgery is superior to conservative treatment for patients with moderate symptoms. As discussed above, the data are confounded and this conclusion lacks support. The second possible conclusion is that the apparent superiority of surgery in moderate patients is an artifact caused by inclusion of severe patients in the moderate-symptom group who then fail conservative therapy. In which case, this is evidence that surgery is superior to conservative treatment among patients with severe symptoms. Another trial, with more carefully designed patient selection and characterization into mild, moderate, and severe groups, will allow us to determine the actual extent to which surgery or conservative treatment benefits these groups.

Amundsen et al. (2000) provide three measures by which the efficacy of surgery compared to conservative therapy can be judged. The first outcome measure is the number of patients needing surgery after first receiving conservative treatment. In the mild-symptom group, only 20 percent of patients eventually needed surgery, indicating that conservative treatment may be justified in this group. In the moderate-symptom group, 54 percent of patients needed surgery. However, as discussed above, this number may be inflated by physician underrating of patient condition and judging severe patients as moderate. Therefore, conservative treatment is more likely to fail in these patients.

The second outcome measure is the examiner assessment of "overall treatment result." This was described as a subjective global assessment based on the following components: (1) a patient-reported scale of "better," "unchanged," or "worse" compared to their condition at the start of the trial; (2) pain; (3) working ability; (4) walking ability; (5) level of physical activity; and (6) the opinion of the examining physician. We would expect pain to be the major component of this scale because pain influences patient working, walking, and physical activity. Examiners were not blinded to treatment. As mentioned above, the agreement between examiner assessment and patient assessment is moderate to poor.

Table 36. Patient Reported Pain Data Compared to Examiner Assessment Data from Surgically and Conservatively Treated Patients with Lumbar Spinal Stenosis from Amundsen T, Weber H, Nordal HJ et al., 2000
Original Diagnosis of Patient ConditionTreatmentCrossover4-year Outcome Data10-year Outcome Data
nPatients with no painPatients rated as excellentnPatients with no painPatients rated as excellent
Mild symptoms n = 50ConservativeSurgery n = 1010341034
No crossover n = 40401317321215
Moderate symptoms n = 31Randomized to conservative n = 18Surgery n = 10912922
No crossover n = 8812823
Randomized to surgery n = 13---13581155
Severe symptoms n = 19Surgery---196111787
Wilcoxon Signed Ranks Test resultsZ = −2.23, p = 0.026Z = −1.13, p = 0.257
Table 36 shows a comparison of the third outcome measure, patient-reported pain, and overall treatment result at 4 and 10 years for each of the four original treatment groups and the two crossover groups (mild conservative to surgery and moderate conservative to surgery). We used a Wilcoxon Signed Ranks Test to test the hypothesis that the number of patients reporting no pain is equal to the number of patients rated "excellent." At four years, the hypothesis is rejected because, in each treatment group, more patients were rated as "excellent" than were reported as having no pain. This suggests that pain was not a primary determinant of the physician's assessment of treatment results. At 10 years, no significant difference was found in the number of patients rated as "excellent" and the number of patients with no pain. The trend agreed with the 4 year data, but was not statistically significant, possibly because of low power. Given the examiner tendency toward underrating pain and overrating a patient's overall condition, the most reliable measure of treatment outcome presented by Amundsen et al. (2000) is patient-reported pain.

If complete relief from pain is a goal of treatment for lumbar spinal stenosis, then few patients obtained that goal at 4 or 10 years with either conservative or surgical treatment. Patient-reported pain data are presented by Amundsen et al. (2000) for the start of the trial, at 3 months, 4 years, and 10 years. Patients were categorized as having no pain or light pain, moderate pain, or severe pain. At the start of the trial, the only patients with light pain were found in the mild-symptom, conservatively treated group. The data in Table 36 indicate that only one-third of the severe/surgery patients were pain-free at 4 years, and only half were pain-free at 10 years. In the moderate group randomized to surgery, 42 percent of patients were pain-free at 4 years, and 45 percent of patients were pain-free at 10 years. Of patients with mild symptoms who remained with conservative treatment, 34 percent and 44 percent were pain-free at 4 and 10 years, respectively. Of patients with moderate symptoms who remained with conservative treatment, 13 percent and 25 percent were pain-free at 4 and 10 years, respectively. Among the patients who were crossed over from conservative to surgical treatment, 30 percent and 43 percent of mild-symptom patients and 13 percent and 33 percent of moderate-symptom patients were pain-free at 4 and 10 years, respectively. As reported, these data are difficult to interpret. A measure of the magnitude of improvement in pain after treatment would provide a better gauge of treatment success. Although the authors measured pain on a visual analog scale, this data is not reported. Other measures of treatment success such as physical activity and walking ability were assessed but not reported.

Atlas et al. (1996) reported on o1 year outcomes of patients in the Maine Lumbar Spine Study, a prospective, observation cohort study of patients with spinal stenosis treated surgically (81 patients) or nonsurgically (67 patients). This trial has three design features that make its results more reliable than the Amundsen et al. (2000) trial. First, Atlas et al. (1996) used an objective scoring system for classifying patients' severity of disease. Therefore, misclassification of patients with severe disease into the moderate category was less likely. Second, there were no crossovers from conservative to surgical treatment. Therefore, the effect of surgery was not exaggerated. Third, study outcomes were based on patient ratings and not on physician ratings. Therefore, results were not skewed by the observer.

In this trial, laminectomy was performed in 88 percent of the surgery patients. Nonsurgery patients received mostly bed rest (29 percent), back exercises (39 percent), physical therapy (23 percent), spinal manipulation (23 percent), narcotic analgesics (21 percent), and epidural steroids (18 percent). Extensive patient information reported in the article showed that the severity of symptoms and the degree of disability because of pain were significantly greater in the surgically treated patients. These patients reported more frequent and severe leg and back pain and poorer functional status, but had greater improvement than patients treated nonsurgically. Because patient preference was the most common reason for not choosing surgery, the nonsurgery group may have selected conservative treatment because their symptoms were less severe than those patients who chose surgery. Therefore, the entire pool of data in this study cannot be used to determine if patients with comparable levels of lumbar spinal stenosis do better with conservative treatment or surgery.

However, a subgroup of 54 patients (31 surgery and 23 nonsurgery patients) reported moderate symptoms before treatment. Results from this group are useful for comparing conservative and surgical treatments. Patients receiving surgery were significantly improved compared to patients who did not receive surgery.

Table 34. Effect Sizes for One and Four Year Outcomes of Patients with Moderate Symptoms at Baseline from Atlas, Deyo, Keller et al., 1996 and Atlas, Keller, Robson et al., 2000
Patient GroupsMajor Symptom Much Better or Completely GoneOverall Results of TreatmentPatient Satisfied with Treatment
1-Year Outcome DataActual CaseWorst CaseActual CaseWorst CaseActual CaseWorst Case
Surgery; n = 3119 Yes 12 No19 Yes 23 No20 Yes 11 No20 Yes 22 No21 Yes 10 No21 Yes 21 No
No surgery; n = 233 Yes 20 No3 Yes 20 No7 Yes 16 No7 Yes 16 No5 Yes 18 No5 Yes 18 No
Effect size: Hedges' d 11.550.930.750.401.140.70
Lower limit of 95% confidence intervals0.460.180.08−0.190.320.06
Upper limit of 95% confidence intervals2.641.681.420.991.951.34
p value for effect size0.0050.0150.0280.190.0060.03
4-Year Outcome DataActual CaseWorst CaseActual CaseWorst Case
Surgery; n = 2924 Yes 5 No24 Yes 18 No21 Yes 8 No21 Yes 21 No
No surgery; n = 2212 Yes 10 No13 Yes 10 No6 Yes 16 No7 Yes 16 No
Effect size: Hedges' d0.750.011.060.45
Lower limit of 95% confidence intervals0.07−0.550.37−0.14
Upper limit of 95% confidence intervals1.460.581.741.04
p value for effect size0.040.960.0030.14
1

Hedges' d was calculated as described by Hasselblad and Hedges (1995).

Our analysis of percentage data from this group for symptom improvement, overall results of treatment, and patient satisfaction with treatment are presented in Table 34. Since these patients with moderate symptoms were not randomized to treatment, the results may be biased by unknown differences between the two groups. However, these results do provide some evidence that among patients with lumbar spinal stenosis who have moderate pain, surgery may be more beneficial than conservative treatment. Four-year outcomes for this same group of patients were recently published. (Atlas, Keller, Robson et al., 2000) The data from this report continue to show better outcomes among patients who initially had moderate pain and received surgery. Of the 68 patients originally treated nonsurgically, 15 (22 percent) underwent surgery after 3 to 48 months (median 17 months).

The validity of the results of this study is threatened by a high dropout rate and the authors' failure to report the characteristics of those patients who dropped out. The article reports that 148 patients were enrolled in the study, and those in the 25th to 75th percentiles for severity were considered "moderate." While the percentile calculation would suggest that 74 patients should be in this category, only 54 patients were reported (31 surgical, 23 nonsurgical). We presume that nine patients in the moderate-severity group had not reached the one-year followup, because the authors report a total of 130 patients who had reached followup (half of which is 65). This leaves 11 patients unaccounted for. To test whether these dropouts could have threatened the validity of the observed effect, we repeated the effect size calculation under a worst-case scenario, in which all 11 patients were arbitrarily counted as surgical failures. When the calculations were repeated in this manner, effect sizes remained significant for two of the three outcomes of interest at the 1 year followup: "major symptom much better" and patient satisfaction. In the worst-case scenario, the effect size for overall treatment results decreased to a statistically insignificant value. The calculations and effect sizes are shown in Table 34.

Atlas et al. (2000) reported on changes in patients' predominant symptom and satisfaction with treatment at 4 years. Using the data from 29 surgical patients and 22 nonsurgical patients, the authors showed that surgical patients had significantly better outcomes. Our analysis of the reported data and the results of our worst-case scenario for the selected four-year outcomes are shown in Table 34. Overall, results of treatment at four years were not reported by Atlas et al. In the worst-case scenario, the greater number of dropouts at four years (14 instead of 11) may have decreased the effect sizes for both symptomatic improvement and patient satisfaction to a statistically nonsignificant level. However, the effect sizes were still positive. Therefore, the statistical significance of the observed long-term effects is not robust to this worst-case sensitivity analysis of dropouts.

Swezey (1996) retrospectively evaluated the progress of 47 patients diagnosed with neurogenic claudication 5 years earlier. At the time of diagnosis, the patients' average age was 76 years. Forty-three of these patients had moderate to marked lumbar spinal stenosis. The authors do not report how patients were judged to be mild, moderate, or severe with regard to stenosis. No spinal canal measurements were reported. All patients were started on conservative treatment (exercise, use of a cane, analgesic and nonsteroidal anti-inflammatory drugs). Thirteen patients received epidural steroid injections when other measures did not provide relief from neurogenic claudication. During the 5-year period, 11 patients were given laminectomy to relieve symptoms. This group was considered to have a greater proportion of moderate to severe neurogenic claudication, and all were considered improved after surgery. However, the lack of data on spinal canal diameters reduces the usefulness of this data in predicting who will benefit from conservative treatment and who will need surgery. Of the other 36 patients, 20 reported improvement in symptoms, 14 reported no change, and two reported a worsening of symptoms. This is one of the few trials that follows patients from diagnosis through conservative treatment and then to surgery. Although the numbers may be too small to provide a reliable estimate of the success of conservative treatment, this study indicates that 72 percent of patients (34 of 47) that begin conservative treatment will improve or remain the same, while 23 percent of patients (11 of 47) will eventually receive surgery for relief of symptoms. Those patients that receive surgery will tend to have greater stenosis.

Herno et al. (1996) attempted to generate comparable treatment groups by retrospectively matching surgical patients to a group of 57 nonsurgical patients. Patients were matched according to sex, age, myelographic findings, major symptoms, and duration of symptoms. Fifty-four matched pairs were constructed. Total spinal block and subtotal block occurred in only one and three matches, respectively. The remainder of the matches had AP diameters of <12 mm (31 patients) or had lateral stenosis (19 patients). The followup periods were 4.3 and 4.1 years for the nonsurgery and surgery groups, respectively. At followup, measures of disability and functional status were similar between groups. The authors caution that an important shortcoming of this retrospective study is the lack of knowledge about starting pain level and disability in either group and that patients with more pain are likely to have selected surgery over conservative treatment. This precludes any reliable comparison of surgery and medical treatment.

Hurri et al. (1998) retrospectively assessed the outcomes of surgery and conservative treatment 12 years after treatment began. Among the 57 patients in the surgery group, 26 (46 percent) were considered to have severe stenosis (<7.0 mm sagittal diameter of the canal), while in the nonsurgery group, 6 of 18 patients had severe stenosis. The authors report that there were no statistically significant differences in the percentage of patients improved (63 percent surgery vs. 44 percent conservative) or worse (18 percent vs. 11 percent) after 12 years. Although long followup periods are, in general, desirable, the authors suggest that after such a long period, any current patient problems may be caused by factors other than the original stenosis and that this could obscure the efficacy of treatment. This, however, is a difficult hypothesis to test. A study with multiple followup times would seem to be needed, as well as an accounting of the length of time that symptoms are relieved and the number of patients who received relief but later had a return of symptoms. Therefore, this study was not used in our analysis.

The study by Mariconda et al. (2000) consisted of 20 patients who received surgical treatment (standard wide decompressive laminectomy) and 17 patients who did not receive surgery. This latter group consisted of 14 patients who refused surgery and three patients who were not considered for surgery due to advanced cardiopulmonary disease. The treatment given to the patients who did not receive surgery was not described. All patients were older than 40 years, and the mean for the most stenotic dural sac cross-sectional area was 68 mm2 in surgical patients and 78 mm2 in conservatively treated patients. Baseline characteristics were different between the groups. The nonsurgery group had a better functional status as determined by the overall Beaujon scoring system (mean and standard deviation of 11 ±2.4 v. 8.1 ±2.7, p <0.05, t-test), the Beaujon score for leg pain at exertion (0.8 ±0.7 v. 0.25 0.6, p <0.05, t-test), and the Beaujon score for neurological deficits (3.3 ±1.0 v. 2.3 ±1.0, p <0.05, t-test). In the Beaujon scoring system, the overall score goes from 0 (worst) to 20 (normal), the leg pain at exertion score goes from 0 (severe) to 2 (none), and the neurological deficit score goes from 0 (major or sphincter dysfunction) to 4 (none).

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3660_F021.jpg.

   Figure 21. Beaujon Score from Mariconda, Zanforlino, Celestino et al., 2000

The Beaujon Scoring System goes from a worst possible functional score of 0 to a normal functional score of 20.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3660_F022.jpg.

   Figure 22. Leg Pain at Exertion Score from Mariconda, Zanforlino, Celestino et al., 2000

The Beaujon Scoring System for leg pain exertion goes from a score of 0 (severe) to 2 (no pain).

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3660_F023.jpg.

   Figure 23. Beaujon Score for Neurological Deficit from Mariconda, Zanforlino, Celestino et al., 2000

The Beaujon score for neurological deficit goes from 0 (major or sphincter dysfunction) to 4 (none).

Our calculation of the effect sizes (Hedges' d) for these baseline characteristics were −1.1 and confidence intervals of −1.80 and −0.41 for the overall Beaujon score; −0.83 and confidence intervals of −1.50 and −0.16 for leg pain at exertion; and −0.98 and confidence intervals of −1.66 and −0.29 for neurological deficits. A comparison of outcomes and effect sizes at pretreatment, 1, and 2 years posttreatment are presented in Figures 21, 22, and 23. While baseline scores showed differences, the scores at one and two years posttreatment were nearly identical, implying that the surgical group had benefited more than the conservative group. However, this may be a case of each group reaching the maximum score possible for these patients (a ceiling effect). In Mariconda et al. (2000), patients with less severe symptoms of lumbar spinal stenosis appeared to improve with conservative treatment alone. However, the discussions we present in the section entitled "Factors That Could Potentially Predict the Success of Conservative Treatment" suggests that this conclusion is far from certain. The lack of a nonsurgical control group with baseline scores similar to the surgical group prevents a reliable evaluation of the benefits of surgery. Although scores improved after surgery, we cannot know the extent to which these scores would have improved or become worse if surgery had not been performed.

Some information on the efficacy of conservative vs. surgical treatment for patients with severe symptoms may be obtained from prospective surgical trials provided pretreatment patient characteristics are clearly stratified and reported, and patients have previously failed conservative treatment. As discussed later in Chapter 5, few surgical studies report the use of and failure of prior conservative therapy. Two single arm surgical trials (no nonsurgical patients were included in the studies) have reported patient characteristics and the failure of prior conservative treatment (Weiner, Walker, Brower et al., 1999; Kleeman, Hiscoe, and Berg, 2000). These studies provide some evidence that patients with severe symptoms will not benefit from conservative treatment but will benefit from surgical treatment. Independent evaluation of patient outcomes in both trials increases the validity of this.

Weiner et al. (1999) demonstrated that partial laminectomy improved walking ability in 30 patients with severe neurogenic claudication (average age of 68 years). Nine months after surgery, the average walking ability increased from approximately 100 m to between 300 m and 600 m. Other measures of patient outcomes also improved; 13 patients had almost complete pain relief and 13 patients had a good deal of pain relief. In a similar study with 54 patients with neurogenic claudication (average age 71 years), Kleeman et al. (2000) used partial laminectomy to relieve leg pain. Two and a half years after surgery, 65 percent of patients reported complete relief from their leg pain and 33 percent reported that their pain was better.

Single arm trials (noncontrolled) have a large potential for bias in favor of the therapy under investigation. Therefore, the studies by Weiner et al. (1999) and Kleeman et al. (2000) may be biased in favor of reporting successful results for surgical treatment. Uncontrolled, nonrandomized controlled trials, and historically controlled trials have been shown to favor new therapies over the control therapy compared to RCTs (Sacks, Chalmers, and Smith, 1982; Colditz, Miller, and Mosteller, 1989). Uncontrolled single treatment trials may therefore be expected to produce an over estimate of the benefits of the therapy.

Katz et al. (1999) prospectively evaluated the pain and walking ability of 263 preoperative patients and 199 patients at 2 years post surgery. Patients had a mean age of 69 years at enrollment (range of 50 to 92 years). The use of prior conservative therapy is not reported. Katz et al. (1999) reported that there were no significant differences between patients who dropped out (27 percent of patients by 2 years) and those who remained in the study. At enrollment, 81 percent of patients reported they were in severe pain. By 2 years after surgery, this was 31 percent. Also at 2 years, the percentage of patients who could walk two blocks had increased from 39 percent to 67 percent, and the percentage of patients who could walk 1 mile increased from 13 percent to 42 percent. Eighty percent or more of the patients in this study may be in the severe category of symptoms, but patient outcomes were not stratified according to preoperative conditions.

Other single arm prospective trials of surgical treatment also suggest that patients improve after surgery (Jonsson, Annertz, Sjoberg et al., 1997; Javid and Hadar, 1998). However, these trials have patients with a wide range of ages and symptoms that limit the use of their data for determining if mild-, moderate-, or severe-symptom patients benefit from surgery. Jonsson et al. (1997) examined 105 patients with an average age of 65 years, but a range of 37 to 83. All patients reported preoperative leg pain and dysfunction and were given partial laminectomies. Eighty-six patients were available through the 5 year followup period. At 2 years, leg pain was relieved in 67 percent of patients, and the percentage of patients with poor walking ability (<0.5 km) declined from 70 percent to 30 percent. By 5 years, only 52 percent of patients were free of leg pain and 35 percent of patients could not walk more than 0.5 km. These results for walking capacity should be contrasted with those of Weiner et al. (1999). In this study, patients improved from less than 100 meters to an average of 500 meters. Under the Jonsson et al. (1997) rating of walking ability, the Weiner et al. (1999) patients would still be considered to have poor walking ability. These differences in approach to rating walking ability and the wide ranges in walking ability complicate the comparison of study results and reduce the value of the data presented by Jonsson et al (1997). Average walking distances are not reported for each of the categories of walking ability in Jonsson et al. (1997). Therefore, we are not able to judge the degree of improvement associated with each initial category of walking ability, nor compare patients from Jonsson et al. (1997) with patients of the same condition in Weiner et al. (1999).

Javid and Hadar (1998) examined 86 patients with central stenosis (average 65 years, range of 27 to 89) and 23 patients with lateral recess stenosis (average 54 years, range of 25 to 79), all of whom received a standard wide laminectomy. Preoperative leg pain and walking difficulty was reported by 98 percent and 76 percent of patients with central stenosis, respectively, and 96 percent and 57 percent of patients with lateral recess stenosis, respectively. Data on walking ability are only reported at the last followup (range 1 to 11 years). At the last followup, patients with central stenosis judged their walking ability as much better or somewhat better (69 percent), no different (10 percent), and somewhat or much worse (21 percent). Patients with lateral recess stenosis judged their walking ability as much better or somewhat better (68 percent), no different (9 percent), and somewhat or much worse (23 percent). These walking data are of little value because 24 percent of the central stenosis patients and 43 percent of the lateral recess stenosis patients originally reported no difficulty in walking, yet their evaluation of walking ability is included in the last followup data. As in Jonsson et al. (1997), no average walking distances are reported for each of the categories of walking ability, and therefore, no comparisons to other studies are possible.

Summary

The lack of comparable patient groups and pretreatment data are a common problem in evaluating studies that examined both surgical and nonsurgical treatment groups. Typically, the two patient groups differ in the extent to which lumbar spinal stenosis affects their signs and symptoms and their pretreatment outcome measurements. This leads to treatment selection based on the extent of disease, and patients are not randomly assigned to treatment groups, except in the recent study published by Amundsen et al. (2000). Less severe cases may tend to receive conservative treatment, and more severe cases may tend to receive surgery. Data are lacking on the effect of conservative treatment on patients with severe stenosis because these patients seem to receive surgery shortly after diagnosis. In Swezey (1996), those patients who did not fare well with conservative treatment tended to have greater stenosis, but data on spinal canal diameter are not reported, and this trend cannot be verified. The trial published by Amundsen et al. (2000) offers the best available evidence that patients with severe symptoms benefit more from surgery than conservative treatment. In this trial, the apparent superiority of surgery in moderate-symptom patients may be an artifact caused by inclusion of severe-symptom patients in the moderate-symptom group. The failure of these patients to respond to conservative therapy supports the benefits of surgery for patients with severe symptoms. The studies by Herno et al. (1996) and Mariconda et al. (2000) suggest that patients with moderate stenosis may improve after receiving only conservative treatment. However, the data presented by Atlas et al. (1996, 2000) suggest that at one year after treatment, patients with moderate pain will benefit more from surgery than from conservative treatment. The statistical significance of the observed long-term effects in this study (4 years) is not robust to a worst-case sensitivity analysis of dropouts.

Question 7What is the relationship between the results of the imaging examination and the success or failure of surgical treatment?

The objective of this section of the report is to determine whether there is a relationship between the findings of the imaging examination and the outcomes of surgical treatment for spinal stenosis. If such relationships exist, it may be possible to use the imaging results for patient selection and increase the probability of success for the patients who do receive surgery.

Clinical articles that were identified by our literature searches as potentially having information to answer this question are listed in Table 16. Methods for the literature search and details of the study inclusion and exclusion criteria are reported in the section entitled "Quality of the Diagnostic Studies," which appears at the beginning of our discussion of Question 4. Not all the studies that reported both imaging findings and surgical results will be useful in answering this question. Some studies did not report the relationship between the two.

Degree of Stenosis

Table 37. Relationship between Degree of Stenosis and Surgical Results
Authors and YearModalityResults (most severe stenosis first)
Hurri, Slatis, Soini et al., 1998MyelogramMean Oswestry score Severe stenosis: 39.0 Moderate stenosis: 28.0
Airaksinen, Herno, Turunen et al., 1997Myelogram (403), CT (35)Mean Oswestry score Total block: 28 ± 17 (mean ± SD) Subtotal block: 30 ± 19 Less than 10 mm: 35 ± 19 10 to 12 mm: 36 ± 17 More than 12 mm: 37 ± 15
Nishizawa and Fujimura, 1997Myelogram, CTRecovery rate Grade 3 myelogram: 78% Grade 2 myelogram: 80% Grade 1 myelogram: 75% Grade 3 CT: 56% (p = NS) Grade 2 CT: 81% Grade 1 CT: 81%
Sato and Kikuchi, 1997MyelogramMean JOA score Two-level stenosis: 21.6 One-level stenosis: 24 (p = NS)
Silvers, Lewis, and Asch, 1993MyelogramSatisfactory pain relief Complete block: 12/15 (80%) Incomplete or no block: 70/113 (62%) 1 Resume normal activity Complete block: 12/15 (80%) Incomplete or no block: 60/113 (53%) Overall satisfaction Complete block: 14/15 (93%) Incomplete or no block: 82/113 (73%)
Johnsson, Uden, and Rosen, 1991MyelogramOverall improvement (patient opinion) Complete block: 9 improved, 5 worse Moderate stenosis: 17 improved, 7 unchanged, 6 worse Pain level Complete block: 6 good, 6 fair, 2 poor Moderate stenosis: 13 good, 14 fair, 3 poor
Jalovaara, Lahde, Iikko et al., 1989MyelogramOverall result Marked stenosis: 6 good, 12 fair, 4 unchanged, 3 worse Moderate stenosis: 3 good, 2 fair, 1 unchanged, 0 worse
Surin, Hedelin, and Smith, 1982MyelogramOverall result Marked stenosis: 3 excellent, 9 good, 3 fair or poor Moderate stenosis: 5 excellent, 2 good, 0 fair or poor Back pain Marked stenosis: 3 none, 4 mild, 2 severe Moderate stenosis: 3 none, 4 mild, 0 severe
Johnsson, Willner, and Pettersson, 1981MyelogramOverall result Complete block: 3 excellent, 2 good, 1 unchanged, 1 worse Partial block: 7 excellent, 4 good, 3 unchanged, 1 worse No compression: 0 excellent, 0 good, 1 unchanged, 4 worse
Scafuri and Weinstein, 1981CT (first generation)Excluded due to use of obsolete technology
Verbiest, 1977No imaging reportedNot relevant to this question
Paine, 1976MyelogramOverall result AP diameter 0-5 mm: 7 excellent, 5 good, 4 fair, 5 poor AP diameter 6-10 mm: 14 excellent, 13 good, 6 fair, 6 poor AP diameter 11-15 mm: 14 excellent, 8 good, 2 fair, 1 poor AP diameter > 15 mm: 2 excellent, 3 good, 1 fair, 0 poor
1

Our own calculation.

Twelve of the trials reporting imaging results reported the degree of stenosis (see Table 37). Some trials reported degree of stenosis by the absolute measurement of the spinal canal. Other studies categorized the degree of radiologically observed stenosis on scales with two to five points. Only these latter reports are analyzed in this section. No two articles reported the same system for categorizing the degree of stenosis, suggesting that there is no agreement on what degree of stenosis is considered severe and what is not.

Hurri et al. (1998) used a two-point quantitative scale to measure stenosis: whether the sagittal (anterior-posterior) diameter of the dural sac (measured by myelography) was more or less than 7 mm. Their 1998 articlereports disability rates among their patients at long-term (12 years) followup. They used the Oswestry index to measure back pain disability (100 is completely disabled). Patients with severe stenosis at the time of treatment had greater disability (mean Oswestry index: 39.0) than patients with moderate stenosis (mean Oswestry index: 28.0). The difference was statistically significant both for the unadjusted score and when adjusted for age, sex, and other factors. Hurri et al. (1998) did not distinguish between patients given surgical and nonsurgical treatment. We cannot determine from this article whether surgical treatment was more effective in one group or the other (Hurri, Slatis, Soini et al., 1998).

A Finnish study reported stenosis grade in a five-point scale: total block, subtotal block, anterior-posterior diameter less than 10 mm, 10 to 12 mm, and more than 12 mm (Airaksinen, Herno, Turunen et al., 1997). All 438 patients in this series were treated surgically, so we cannot use these results to see whether image data can be used for patient selection. As with Hurri et al.'s (1998) trial, the Finnish group found a statistically significant association between stenosis degree and Oswestry score at followup (mean 4.3 years). However, this group found the opposite relationship; more severe stenosis was associated with better, not worse, outcomes. If more data from this study had been reported, it may have been possible to resolve this apparent contradiction. Preoperative Oswestry score data could have allowed one to determine if patients with the most severe pretreatment stenosis in the Finnish trial, like those in the Hurri trial, had more severe symptoms than patients with a lesser degree of stenosis.

Nishizawa and Fujimura (1997) used a three-point scale and reported myelography and enhanced CT results separately. Surgical outcomes were reported using the Japan Orthopaedic Association score. The "recovery rate" reported as the outcome measure controlled for preoperative condition because it measured the percentage degree to which patients' scores returned to normal. The percentage of patients with successful surgical outcomes was not significantly affected by stenosis degree. However, the statistical test used by the authors was not reported, and we cannot independently verify the findings. As with the Finnish group, no nonsurgical control group existed to compare results (Nishizawa and Fujimura, 1997).

Silvers et al. (1993) reported patient satisfaction rates in a subset of their case series with complete block seen on preoperative myelography. We were able to calculate the rates among patients without complete block (Table 37) from the totals reported in the article. The patients with complete block reported better results than those without complete block. However, no preoperative data are available to allow us to determine whether a difference between groups existed before surgery or was the result of surgery.

The study by Johnsson et al. (1991) compared patients with complete block on myelography to patients with moderate stenosis. Patients with spinal canal diameter of 12 mm or more were excluded. Patients who were not operated on were used as a control group, but patients were not randomized to that group. Comparisons of the operated and nonoperated groups are analyzed as part of Question 6. The mean spinal canal measurement was slightly larger in the nonoperated group than in the moderate stenosis group, and it was not reported whether any of the nonoperated patients had complete block. Several outcome measures were reported, including patients' and clinicians' estimations of results (worse, unchanged, improved), degree of pain, and estimation of walking distance. None of the results appeared to be associated with degree of stenosis (χ2 for overall results = 4.31, df = 2, p = 0.12; χ2 for pain = 0.19, df = 2, p = 0.91) (Johnsson, Uden, and Rosen, 1991).

The threshold defining "marked" and "moderate" stenosis in the article by Jalovaara et al. (1989) was 10 mm. Patient outcomes were measured after a mean of 3.8 years of followup and were based mostly on degree of pain and return to normal activities. "Good" outcomes were attained when patients had no more than slight pain and resumed normal activity with few limitations. "Fair" outcomes were attained if patients had some relief of pain and could resume limited activities. The other two outcomes, "unchanged" and "worse," are self-explanatory. The results appear to be better for patients with moderate stenosis than for those with severe stenosis. However, as with the other trials, we do not know the patients' preoperative condition, so we cannot tell whether the improvement was the result of the surgery (Jalovaara, Lahde, Iikko et al., 1989).

Surin et al. (1982) reported on a trial with similar groups of patients. This trial was specifically intended to measure the effect of spinal canal measurement on surgical results, but the patients' preoperative condition was reported in little detail, making it difficult to distinguish treatment effects from pre-treatment differences between groups. This renders interpretation of the results difficult (Surin, Hedelin, and Smith, 1982).

Johnsson et al.'s (1981) case series included some patients with no compression of the spinal cord evident on the myelogram (spinal canal AP diameter 11 cm or more) as well as patients with complete or partial block observed. There was no improvement after surgery in any of the patients whose myelograms were negative for stenosis. Otherwise, outcomes reported in the article were not significantly different among patients with moderate stenosis than among patients with a complete block. The investigators did not explain the unusual results for patients with negative myelograms and provide little information on these cases. The types of symptoms of the patients without myelographic evidence of stenosis were not different from those of the other groups. However, the severity of symptoms and the presence or absence of other conditions that could cause these symptoms are not reported. One can question whether these patients did in fact have spinal stenosis. While stenosis is usually defined by its radiographic appearance, Johnsson et al. (1981) included patients in this report based on surgical findings, but those surgical findings do not appear to involve any evaluation of the spinal canal itself. Their definition of spinal stenosis was hypertrophy of the neural arches and especially the facet joints, as well as a lack of dura pulsation and epidural fat. Considering that all the other trials reported using the imaging findings to define spinal stenosis, interpretation of the results of Johnsson et al. (1981) is difficult (Johnsson, Willner, and Pettersson, 1981).

Measurements at surgery were also reported by Verbiest (1977), whose case series spanned the period of 1948 to 1975. No imaging findings were reported in this article, so we cannot determine whether he concurs with Johnsson et al. (1981) about the surgical results of patients with negative myelograms.

Paine reported surgical results from a large series of patients (n = 457) with disk herniation (Paine, 1976). Degenerative spinal stenosis was diagnosed in addition to disk herniation in 95 patients, and myelography results were reported for 91 of them. Details of the methods are lacking, but Paine's article includes a table correlating overall results (combining return-to-work information and patients' pain complaints) with degree of stenosis. Our chi-squared test on these data found no significant difference in results between stenosis groups (χ2 = 8.51, df = 9, p = 0.48).

Number of Stenotic Levels

Sato and Kikuchi (1997) measured the number of stenotic levels rather than the degree of stenosis. They did not find statistically significant differences in JOA scores between patients with stenosis at both L4 and L5 and patients with stenosis only at L5. Data reported in the article are insufficient to permit us to verify this finding or to determine whether the findings resulted from a lack of statistical power in the study design (Sato and Kikuchi, 1997).

Paine (1976) also found that surgical results were better among patients with stenosis of only one vertebral level, compared to patients with two or more levels of stenosis, but our reanalysis of the data found no statistically significant difference (χ2 = 8.355, df = 6, p = 0.21). Preoperative condition of the patients was not reported, so we cannot determine whether surgery was more effective for the patients with mild stenosis (Paine, 1976).

Spondylolisthesis and Scoliosis Measurement

McCullen et al. (1994) reported a regression analysis for the effect of various patient characteristics and imaging findings on surgical outcomes. Stenosis grade or spinal measurements were not included in the analysis, but scoliosis and spondylolisthesis measurements before and after surgery were included. They obtained the data from measurements of plain film x-rays. From those data, we calculated correlation coefficients and their significance (McCullen, Bernini, Bernstein et al., 1994).

Preoperative spondylolisthesis, measured as the percentage of slippage (displacement) between the vertebrae, had a significant negative association with surgical results (correlation coefficient −0.172, p = 0.03). This relationship lost statistical significance when regression results were adjusted for the patients' sex (correlation coefficient −0.148, p = 0.07). Scoliosis was not significantly associated with outcomes (sex-adjusted correlation coefficient −0.107, p = 0.24). As expected, decreases in spondylolisthesis after surgery had a highly significant association with positive surgical outcomes (sex-adjusted correlation coefficient −0.271, p = 0.004). Spondylolisthesis is a predictor of worse outcomes after surgery, but these data do not tell us whether measuring spondylolisthesis predicts whether surgery will help a particular patient.

Summary

No published trials provided the data necessary to determine whether a group of lumbar spinal stenosis patients with particular results on some diagnostic imaging test will have better results after surgery than another group. The one controlled surgical trial that reported imaging results (Hurri, Slatis, Soini et al., 1998) did not differentiate between surgical and nonsurgical patients in its published imaging results.

Nine uncontrolled trials (see Table 37) reported surgical outcomes as a function of degree of stenosis, as measured by myelography. All nine of them used different scales to categorize stenosis. Two trials (Airaksinen, Herno, Turunen et al., 1997; Johnsson, Willner, and Pettersson, 1981) reported better outcomes among patients with more severe stenosis. Three trials (Hurri, Slatis, Soini et al., 1998; Silvers, Lewis, and Asch, 1993; Surin, Hedelin, and Smith, 1982) reported worse outcomes among patients with more severe stenosis. Four trials (Jalovaara, Lahde, Iikko et al., 1989; Johnsson, Uden, and Rosen, 1991; Nishizawa and Fujimura, 1997; Paine, 1976) reported no significant difference between the groups. Most of the studies failed to report preoperative data that would allow us to determine whether the observed differences between groups resulted from surgery or were preoperative differences.

The results found in this set of clinical trials could have several different causes. Sufficient information is not available in any of the articles to permit us to prove an association between imaging findings and surgical results. At this time, we cannot use imaging results to identify patient groups that would be more or less likely to benefit from surgery. One trial (Johnsson, Willner, and Pettersson, 1981) found that patients with normal myelograms did not benefit from surgery, but there is a question as to whether those patients even had spinal stenosis.

Question 8 What is the relationship between the type of surgery received and the success or failure of surgical treatment?

This question addresses the connection between the types of surgical procedures used and surgical success. For a given patient group, which type of surgical procedure produces the greatest benefit? As with the preceding question, we performed a systematic narrative review and calculated each study's effect size for differences between related groups whenever possible.

Comparisons of Laminectomy Procedures in Patients With Central Lumbar Stenosis

One randomized controlled trial and 10 controlled trials address the efficacy of surgical treatments for patients with central lumbar spinal stenosis (see Table 25). The randomized controlled trial of Grob et al. (1995) examined the benefits of partial laminectomy with and without fusion and instrumentation (Grob, Humke, and Dvorak, 1995). Hanakita et al. (1999) compared laminectomy to laminectomy plus fusion and to partial laminectomy among patients less than 64 years old (Hanakita, Suwa, and Mizuno, 1999). Thomas et al. (1997) compared laminectomy with laminotomy, Yone et al. (1996) compared laminectomy with fusion and instrumentation to laminotomy, and Ray (1982) compared laminectomy to partial laminectomy (Thomas, Rea, Pikul et al., 1997; Yone, Sakou, Kawauchi et al., 1996; Ray, 1982).

Only three trials, Hanakita et al. (1999), Thomas et al. (1997), and Ray (1982), provided a possible comparison of results between surgical methods: laminectomy versus partial laminectomy or laminotomy. However, differences between outcomes measured and the mean ages of the patient groups being compared prevented any meaningful combination of data. Hanakita et al. (1999) compared patients all younger than 65 years, while the mean age for Thomas et al. (1997) was 64, and the mean ages for the laminectomy patients and the partial laminectomy patients in Ray were 54 and 46 years, respectively. Thomas et al. (1997) and Ray could have been combined, but each study used different outcome measures (Thomas et al. used walking, and Ray used a global assessment) as well as different followup periods (Thomas et al. used 24 to 62 months, and Ray used 10 and 13 months).

Hanakita et al. (1999) reported a retrospective study of patient evaluation of surgical outcome. Patients received laminectomy, laminectomy plus fusion, or partial laminectomy. No pretreatment data are available to determine patient status before surgery; therefore, there is no means of evaluating the success of each surgical procedure. Comparisons across surgical methods are also complicated by the fact that patients with more than a 10 mm slip distance or a slip of more than 15° received fusion, so the patient groups are not the same. Therefore, this study cannot be used to evaluate the relationship between surgical methods and successful outcomes (Hanakita, Suwa, and Mizuno, 1999).

Ray (1982) reported on patients who received laminectomy with partial laminectomy. The only patient information provided was average age and duration of pain before surgery. As in the previous study, the lack of data on pretreatment conditions prevents the use of this study to evaluate the relationship between surgical methods and successful outcomes (Ray, 1982).

Table 38. Analysis of Walking Capacity Data from Thomas, Rea, Pikul et al., 1997
Patient GroupPosttreatment Walking Distance without Pain (City Blocks)Effect Size (Hedges' d) and Confidence Limits
>31 to 3<1
Laminectomy n = 12525−0.015 −0.787 to 0.756 p = 0.969
Laminotomy n =14473
Thomas et al. (1997) compared laminectomy (12 patients) to laminotomy (14 patients). Walking distance in city blocks (the precise distance of a city block was undefined) was used to measure success. Pretreatment walking data were provided to show that walking capacity was similar between patient groups. Back pain, leg pain, and pain while walking were measured but incompletely reported. The original data and our analysis of these data are contained in Table 38. The effect size calculated from these data is very small (−0.015, which is not statistically significant), and a large increase in patient numbers is not likely to make this difference statistically significant. However, generalizing from such a small trial is problematic, and further research is needed before one can conclude that these two surgical methods are of equal efficacy. Further, it is not clear from this nonrandomized study that the patients in both groups were similar before treatment, nor is it clear whether the two surgical methods had different effects on symptoms other than walking capacity. Data are provided for pain while walking, on a 0 to 10 scale, for the laminectomy group only. A reference is made to a slightly less marked reduction in walking pain after surgery in the laminotomy group, compared to the laminectomy group with no supporting data. Additional patient data are needed to determine if surgical method influenced the extent of back pain, leg pain, and pain while walking after surgery (Thomas, Rea, Pikul et al., 1997).

The Effect of Fusion and Instrumentation in Patients With Central Lumbar Stenosis

Table 39. Comparison of Effect Sizes for Data on Pain Relief from Grob, Humke, and Dvorak, 1995
Surgical MethodPain Relief ScoreP Value of t-TestHedges' dConfidence Limits
No fusion n = 156.0
Single-level fusion n = 155.80.250.429 p = 0.245−0.294 to 1.153
Multiple-level fusion n = 155.80.920.037 p = 0.919−0.679 to 0.753
The single randomized controlled trial, Grob et al. (1995) examined 15 patients with partial laminectomy and 30 patients with fusion and instrumentation. The patients who received fusion and instrumentation were evenly divided between fusion at a single level and fusion at multiple levels. The mean age at the time of the operation was 66, 71, and 71 for no-fusion, single-fusion, and multiple-fusion groups, respectively. Walking capacity, back pain, back pain relief, leg pain relief, and a global assessment were recorded at pretreatment and 24 to 32 months later. Author evaluation of the data found statistically significant increases in walking distance and significant decreases in pain in all the surgical groups but found no differences between groups for relief of pain. Our calculation for the pain-relief data (see Table 39) yielded a nonsignificant effect size of −0.429 (Hedges' d) between no-fusion and single-fusion. Because of its small sample size, this study is underpowered. If the effect size had been 0.5, the study would have needed 65 patients in each of the two treatment groups to have a power of 0.8 (80 percent probability of finding the difference between treatments significant at p <0.05, two tailed). The actual power of this study was approximately 0.25. These power calculations are based on tables found in Rosenthal and Rosnow (1991). Given the low power, this study was only capable of detecting improvements within surgical groups and was not capable of detecting differences between surgical methods. Therefore, the data in this study cannot be used to determine whether the addition of fusion and instrumentation improves surgical outcome (Grob, Humke, and Dvorak, 1995).

The Effect of Fusion and Instrumentation in Patients With Lateral lumbar stenosis

Table 40. Comparison of Effect Sizes for Data on Global Assessment from Lee and deBari, 1986
Patient GroupPreoperative ScorePostoperative ScoreImprovement
With rods n = 1218.9 ±8.7250.50±12.3731.58±17.59
Without rods n = 1221.41±9.2353.58±19.0632.16±15.75
EffectsSize (Hedges' d)−0.269−0.185−0.033
Confidence limits−1.073 to 0.535−0.989 to 0.617−0.834 to 0.767
p value0.5120.6510.935

Scores are presented as mean and standard deviation and are based on a 100-point rating system in which patients with scores less than 30 are considered in severe pain and have severe limitations on activities and patients with scores greater than 75 are considered to have limited or no pain and normal activities.

The randomized controlled trial of Lee and deBari (1986) evaluated the addition of Knodt distraction rods to decompressive laminectomy and bilateral-lateral fusion. Each treatment group had 12 patients who were diagnosed as having multiple-level foraminal stenosis. The mean age for all patients was 45.7 years (range 28 to 65), and the followup evaluations were performed at a mean of 38 months (range 12 to 72). A 100-point scoring system based on amount of pain and degree of tolerable activities of daily living was used to evaluate patients before and after surgery. Patients with scores less than 30 were considered to have severe constant pain with severe incapacitation of activities, and patients with scores between 75 and 100 were considered to have minimal or no pain and restriction of activities. The preoperative and postoperative scores and our analysis of these data are presented in Table 40. Both patient groups had similar preoperative scores and improvement in scores, and the authors concluded that there appears to be no clinical benefit from the use of rods compared to patients without rods. Our calculations show that the effect sizes for the postoperative score and the improvement score are small. A study with several hundred more patients would be needed to show that these effect sizes indicate a true difference between treatment groups. Therefore, this study provides evidence that the addition of Knodt distraction rods to decompression and fusion does not improve surgical outcomes (Lee and deBari, 1986).

The Effect of Fusion and Instrumentation in Patients With Degenerative Spondylolisthesis

Four randomized controlled trials and six controlled trials examined surgical treatment of degenerative spondylolisthesis (see Table 30).

The randomized controlled trial of Thomsen et al. (1997) separately reported outcomes for patients with primary degenerative spondylolisthesis and patients with isthmic or secondary degenerative spondylolisthesis. Twenty-one patients with primary degenerative spondylolisthesis receiving partial laminectomy and fusion were compared to 20 patients with primary degenerative spondylolisthesis receiving partial laminectomy, fusion, and instrumentation. Functional outcome was assessed 2 years after surgery using the Dallas Pain Questionnaire. This questionnaire examines daily, work, and leisure activities; anxiety-depression; and social concerns, specifically among patients with low back pain. Mean differences of 73 and 37 between pretreatment and posttreatment scores were reported for the instrumented group and the noninstrumented group, respectively. An increase in scores indicates improvement. The authors reported that these differences in scores were not significant when using the Mann-Whitney test. The authors reported that 44 patients were needed in each treatment group to detect a clinically relevant difference in functional outcome of 15 in the Dallas Pain Questionnaire score. Sufficient patients were included in the total study, but within the primary degenerative spondylolisthesis subgroup, only 20 and 21 patients were available. Therefore, this study may have had too few primary degenerative spondylolisthesis patients to detect differences in outcomes due to treatment. Therefore, this study does not provide reliable evidence for the use of instrumentation with fusion (Thomsen, Christensen, Eiskjaer et al., 1997).

Table 41. Effect Size for Data on Global Assessment from Fischgrund, Mackay, Herkowitz et al., 1997
Patient GroupGlobal AssessmentEffect Size (Hedges' d)Confidence Limits
ExcellentGoodFairPoor
Instrumentation n = 35207440.327 p = 0.181−0.152 0.805
No instrumentation n = 33161214
The randomized controlled trial of Fischgrund et al. (1997) compared decompressive laminectomy and fusion with (40 patients) and without (35 patients) instrumentation. Back pain and leg pain were rated on a 0 (no pain) to 5 (severe pain) scale. Before surgery, the mean back pain and leg pain scores were 4, and between 24 and 36 months after surgery, the mean scores were significantly reduced to 1 or 2. A global assessment of surgical outcomes was also reported. Our analysis of the global assessment data is presented in Table 41. The Hedges' d of 0.327 was not statistically significant (confidence limits of −0.152 to 0.805, p = 0.181). Both the pain measurements and the global assessment measurements indicate that both groups responded equally well to surgery regardless of the addition of instrumentation. However, this small study likely contains too few patients to conclusively show that instrumentation does not improve surgical outcomes (Fischgrund, Mackay, Herkowitz et al., 1997).

Table 42. Effect Size for Data on Global Assessment from Herkowitz and Kurz, 1991
Patient GroupGlobal AssessmentEffect Size (Hedges' d)Confidence Limits
ExcellentGoodFairPoor
Fusion n = 251113101.437 p = 0.0000060.815 2.059
No fusion n = 2529122
Table 43. Effect Size for Data on Back and Leg Pain from Herkowitz and Kurz, 1991
Patient GroupBack Pain ScoreLeg Pain Score
Pre-Post-Pre-Post-
Fusion n = 253.31.34.31.0
No fusion n = 252.92.54.01.7
Effect Size (Hedges' d)0.571 p = 0.0480.571 p = 0.048
Confidence Limits0.006 1.1370.006 1.137

The trial reported that patients with fusion had significantly less back pain and leg pain (p <0.05, Student t test). Since a standard deviation was not reported in this study, the Hedges' d effect size was calculated from a p value of 0.049. No analysis was reported for the comparison of pretreatment scores or the comparisons of posttreatment to pretreatment scores.

The randomized controlled trial of Herkowitz and Kurz (1991) compared laminectomy (25 patients) to laminectomy plus fusion (25 patients). Back pain and leg pain were rated on a 0 (no pain) to 5 (severe pain) scale, and operative results were rated as excellent, good, fair, or poor. Followup time averaged 36 months (range of 29 to 48 months). The trial reported that patients with fusion had significantly less back pain and leg pain (p = 0.05, Student t test) and were significantly improved based on the operative results scale (p = 0.0001, Fisher exact test). Our analysis of these data is presented in Table 42 and Table 43. These data suggest that the addition of fusion to decompressive laminectomy benefited patients by reducing pain to a larger extent than decompressive laminectomy alone (Herkowitz and Kurz, 1991).

The randomized controlled trial of Bridwell et al. (1993) purposely violated the randomization procedure so that patients with pathological motion (>10° or >3 mm of slippage) always received partial laminectomy, fusion, and instrumentation (24 patients). Other patients received either partial laminectomy only (9 patients) or partial laminectomy and fusion (10 patients). Thus, patients with the most severe condition were segregated to one treatment, and any comparison to the other treatments containing patients with less severe conditions is biased. Therefore, we cannot use these data in a comparison of surgical methods (Bridwell, Sedgewick, O'Brien et al., 1993).

Plotz and Benini (1998) retrospectively reviewed 106 patients who received decompressive surgery and separated patient data into three groups according to additional surgical methods: fusion, fusion plus translaminar screw fixation, or fusion plus AO internal fixator. At the time of followup, 9 to 120 months after surgery, the fusion only group had less than 10 patients, and there were 14 and 64 patients in the other groups, respectively. Since the control group had less than 10 patients, we did not perform any further analysis (Plotz and Benini, 1998).

Table 44. Effect Size for Data on Functional Status, Back Pain, and Leg Pain from Yuan, Garfin, Dickman et al., 1994
Patient GroupFunctional StatusBack PainLeg Pain
ImprovedNo ChangeWorseImprovedNo ChangeWorseImprovedNo ChangeWorse
Noninstrumented1,927192131,941165151,9451764
Pedicle screw fixation391582377693398512
Hedges' d Effect Size0.042 p = 0.0350.086 p = 0.000070.037 p = 0.047
Confidence Limits0.003 0.0800.044 0.1290.0004 0.073

Effect sizes were calculated using the odds ratio and the natural log transformation. The no-change and worse groups were pooled to calculate the odds ratio.

Yuan et al. (1994) reported on an open, nonblinded, historical cohort study that collected patient data from surgeons. Patients received either fusion alone (456 patients) or fusion and pedicle screw fixation (2,177 patients). At followup 23 to 51 months after surgery, patient function, back pain, and leg pain were judged as improved, no change, or worse. The no-change and worse categories were combined and compared to the improved category using a chi-square test and a 2 by 2 table. Each patient group showed significant improvement in functional status (p <0.02), back pain (p <0.01), and leg pain (p <0.03). Our analysis of these data is presented in Table 44. This analysis suggests that pedicle screw fixation benefited patients with degenerative spondylolisthesis, but interpretation of these findings is not straightforward. This is due to the historical, nonblinded nature of the trial, design aspects that could introduce bias. The usefulness of these data are further reduced by the lack of pretreatment pain measures. Such measures could be used to determine the actual extent of changes in pain perception and to determine whether patients in the two groups were similar enough to permit a valid comparison of treatments (Yuan, Garfin, Dickman et al., 1994).

Satomi et al. (1992) reported on a retrospective study that divided patients into groups receiving fusion and instrumentation (no decompression) and those receiving various types of decompressive surgery. These groups also differed in mean age (51 vs. 69 years) and mean time from onset of disease (3.8 vs. 8.7 years). These differences in patient characteristics between treatment groups bias any comparison of treatment outcomes. Therefore, these data were not used in our analysis (Satomi, Hirabayashi, Toyama et al., 1992).

Table 45. Effect Size for Data on Global Assessment from Lombardi, Wiltse, Reynolds et al., 1985
Patient GroupGlobal AssessmentEffect Size (Hedges' d)Confidence Limits
ExcellentGoodFairPoorFailure
Laminectomy n = 20610130−0.586 p = 0.066−1.211 0.040
Laminectomy plus fusion n = 21118011

The number of patients in the fair, poor, and failure groups were combined into a single group in order to perform this analysis.

Lombardi et al. (1985) retrospectively evaluated patients with standard wide laminectomy with (21 patients) and without (20 patients) fusion. This study used a five-level global assessment (excellent, good, fair, poor, failure) to evaluate surgical success 24 to 84 months posttreatment. Our analysis of these data is presented in Table 45 and indicates no differences in global outcomes between treatment groups. The power of this study is about 30 percent indicating that additional patients would be needed to definitively establish that this lack of difference is, in fact, real. The retrospective design did not allow for the inclusion of pretreatment patient data concerning back or leg pain and walking capacity. The lack of these pretreatment data reduces the usefulness of this study in determining the extent to which fusion may benefit patients with spondylolisthesis (Lombardi, Wiltse, Reynolds et al., 1985).

Fitzgerald and Newman (1976) conducted a retrospective examination of patients given a rigid brace as conservative treatment (29 patients) and a variety of decompressive surgical methods. Patients with less severe symptoms (no indication of nerve root compression) were placed in the conservative treatment group. The differences in methods within the surgery group and the differences in patient characteristics between groups prevent any meaningful comparison of treatment outcomes. Therefore, this study cannot be used in our analysis of surgical procedures (Fitzgerald and Newman, 1976).

Rosenberg (1976) retrospectively reported on patients receiving partial laminectomy (11 patients) and standard wide laminectomy (15 patients). The lack of a specifically defined global outcome scale and the lack of specific pretreatment patient characteristics to judge the effect of treatment prevent any meaningful comparison of treatment outcomes. Therefore, this study cannot be used in our analysis of surgical procedures (Rosenberg, 1976).

Summary

Taken as reported, the results of the randomized controlled trials of Thomsen et al. (Thomsen, Christensen, Eiskjaer et al., 1997) and Fischgrund et al. (Fischgrund, Mackay, Herkowitz et al., 1997) would seem to suggest that instrumentation in addition to fusion does not improve surgical outcomes among patients with spondylolisthesis. However, both trials likely had too few patients (and, therefore, insufficient statistical power) to render any definitive conclusion.

Herkowitz and Kurz (1991) provide evidence that fusion is beneficial compared to decompressive surgery alone in these patients. The other RCTs had flaws in design or reporting that made their results of questionable value.

Drawing conclusions from just one or two trials is, however, problematic. This is because such conclusions have a rather high potential to be influenced by publication bias (the potential that trials with negative findings are not published) or the "file drawer" problem (the tendency of investigators to not report findings that are not statistically significant). Until data from larger and well-designed randomized controlled trials are available, reliable conclusions are not possible.

Question 9 What costs are associated with nonsurgical and surgical treatment of spinal stenosis?

Table 46. Treatment Costs for Procedures Related to Lumbar Spinal Stenosis
SourceProcedureTotal Number of DischargesTotal Hospital Charges 1
Mean ($)Median ($)
HCUP Nationwide Inpatient Sample 1997 (Agency for Healthcare Research and Quality, 2000)Lumbar spinal stenosis68,86814,27110,679
Spondylolisthesis4,71922,45620,226
Lumbosacral posterior fusion30,63827,52022,865
Contrast myelogram10,3847,8305,724
Lumbosacral spine x-ray7455,5364,085
MRI of the spinal canal18,75810,1087,233
1

Charges reflect the total hospital charges for the entire hospital stay, not the charges for the procedure alone. These charges do not include professional fees.

SourceProcedureTotal Number of DischargesTotal Charges ($)
Medical Provider Analysis and Review, short-stay hospitals 1998 (Health Care Financing Administration, 2000)Spinal procedures6,143150,605,381
Medical back problems85,634590,751,185
Combined anterior/posterior spinal fusion1,12568,849,249
SourceProcedureBenchmark ($)
MECQA, 1999 (Medical Cost and Quality Assistance, 1999)CT of the lumbar spine with contrast material302.78
MRI of the lumbar spine with contrast material549.04
Myelography, lumbosacral181.67
Laminectomy of spinal canal and/or cauda equina, 1 or 2 segments1,158.30
Laminectomy of spinal canal and/or cauda equina, more than two segments1,260.58
Laminotomy (hemilaminectomy) and/or excision of herniated disk942.32
Arthrodesis, posterior technique, single level1,511.55
Spinal procedures9,543.98
Combined anterior/posterior spinal fusion22,252.75
Posterior segmental instrumentation (pedicle fixation), 3 to 6 segments691.68
Anterior instrumentation, 3 to 6 segments625.41
SourceType of SurgeryMedicalIndemnityPermanencyTotal Cost
Based on data from a third-party payers in Minneapolis/ St. Paul (Nelson, Carpenter, Dreisinger et al., 1999)Lumbar laminectomy30,30035,90016,41482,614
Lumbar fusion62,30090,30016,132168,732
Cervical laminectomy20,75021,80017,75460,304
Cervical fusion43,10053,90015,480112,480
Source% of PatientsCost of Back-Pain-Related Care per Patient
Costs are based on 1059 patients in primary-care clinics in Washington State (Engel, von Korff, and Katon, 1996)0.6%>7,656
10.0%>1,044
21.4%>582
36.9%>353
SourceProviderNumber of EpisodesMean Inpatient CostStandard ErrorMaximum
Data on 1,020 episodes of back pain from the RAND Health Insurance Experiment, representing four United States census regions and urban and rural sites (Shekelle, Markovich, and Louie, 1995)General practitioner102,0805065,529
SourceProcedure and Number of PatientsHospital Costs for SurgeryHospital Costs for Blood BankTotal Hospital Costs
Data from Brigham and Women's Hospital 1989 to 1993 (Katz, Lipson, Lew et al., 1997)Laminectomy only n = 1195,79838712,618
Laminectomy and fusion n = 239,02479718,495
Laminectomy and fusion with instrumentation n = 2113,5541,76225,914
SourceType of Office or Outpatient VisitCosts for Visit to an Office in a Medical FacilityCosts for Visit to a Free Standing Office
Fees for Medicare office visits in the state of Alabama (Health Care Financing Administration, 1999)Office or other outpatient visit for the evaluation and management of a new patient, which requires a comprehensive history, physical examination, and medical decision making of high complexity$130.83$112.58
Office or other outpatient visit for the evaluation and management of an established patient, which requires a comprehensive history, physical examination, and medical decision making of high complexity$91.93$77.17
This question does not appear in the evidence models. Information on the cost of surgical treatment of lumbar spinal stenosis came from several sources and is presented in Table 46. Because present data did not allow us to estimate the effectiveness of any treatment or diagnostic, we were unable to perform a cost-effectiveness analysis. Therefore, in this section, we present only cost information.

Information in Table 46 is derived from several sources. The first of these is the Healthcare Cost and Utilization Project (HCUP). The HCUP is a survey of hospital patient information and charges that is developed and annually updated by AHRQ. The second entry in the table is from the Medical Provider Analysis and Review (MEDPAR) database of the Health Care Financing Administration (HCFA). This file contains information from Medicare beneficiaries using hospital inpatient services. MECQA, the third entry in the table, provides benchmarking figures. The Nelson et al. (1999) data are derived from a 2.5-year study in which third-party payers in Minneapolis were surveyed about average costs. The average followup was 16 months after hospital discharge. Presented in the table are average surgical costs. The Engel et al. (1996) data are derived from a one-year prospective cohort study of 1,059 patients with back pain. The Shekelle et al. (1995) data are derived from a prospective, community-based observational study on the costs of back pain care. In the table, we present only inpatient costs per episode of back pain care. The Katz et al. (1997) data compare the cost of laminectomy without fusion, laminectomy with fusion, and laminectomy with fusion and instrumentation from patients operated on at Brigham and Women's Hospital from 1989 to 1993. Finally, professional fees for office visits were obtained from the Medicare Physicians Fee Schedule for 1999.

Chapter 4. Conclusions

Question 1: What is the relationship between each relevant patient characteristic and the presence and/or intensity of each of the patient signs, symptoms, and conditions of spinal stenosis?

Question 2: Which relevant patient characteristics are associated with an increased likelihood of focal narrowing of the spinal canal?

Question 3: What is the relationship between degree of stenosis and the presence and/or intensity of each of the signs, symptoms, and patient conditions?

Question 4: What is the relationship between the signs and symptoms and other features of the history and physical and the results of the imaging examination?

Question 5: What is the relationship between the signs and symptoms and other features of the history and physical and results of conservative treatment, and what is the relationship between the type of conservative treatment and patient outcomes?

Question 6: What is the relationship between the signs, symptoms, and other features of the history and physical and the success or failure of surgical treatment?

Question 7: What is the relationship between the results of the imaging examination and the success or failure of surgical treatment?

Question 8: What is the relationship between the type of surgery received and the success or failure of surgical treatment?

Question 9: What costs are associated with nonsurgical and surgical treatment ofspinal stenosis?

Chapter 5. Future Research

At least some of the gaps in current research on lumbar spinal stenosis probably arise from the suboptimal designs of the studies that have been conducted to date. In addition, at least some additional gaps likely arise from the manner in which patient characteristics and results are reported, even in well-designed trials. Other gaps seem to result from the somewhat varied patient inclusion criteria used in these studies. The purpose of the present section is to provide evidence supporting these suppositions and to thereby illuminate where these gaps lie.

Study Design

Well-designed RCTs provide the highest quality of evidence for treatment effectiveness because each treatment arm is composed of comparable patient groups having the same characteristics (e.g., age, gender, extent of disease, capacity to respond to treatment). Nonrandomized trials may result in one treatment arm having patients who can appear to respond better or worse to treatment due to the characteristics of the patient group. Additionally, trials without a control group do not examine the potential for minimal treatment to cause recovery, nor do they allow one to accurately gauge the magnitude of any change that occurs after treatment.

Table 47. Trial Designs, Number of Patients, and Reporting of Prior Conservative Therapy in Surgical and Conservative Treatment Trials Related to Lumbar Spinal Stenosis
Type of TreatmentSurgery 147 StudiesConservative 4 Studies
Study Design Randomized controlled trials Controlled trials 1 Prospective trials Retrospective with consecutive patients Case series 7 47 7 28 57 4 0 0 0 0
Year of Publication 1960 1970 1980 1990 2000 3 17 38 86 3 0 0 2 2 0
Number of Patients in All Arms of Trail 10-25 26-50 51-100 >100 25 59 41 22 0 3 1 0
Months of Nonoperative Therapy Reported Not reported 15 132 --- ---
1

A controlled trial is any trial that contains more than one treatment group and is not a randomized controlled trial. These trials may have either a prospective or retrospective design

Only four RCTs of conservative treatments for lumbar spinal stenosis were found in our search of the literature. One hundred seventy-eight publications of conservative treatments may have included lumbar spinal stenosis patients. However, most of these trials did not contain a control group or did not make a specific diagnosis of lumbar spinal stenosis using imaging technology (see Table 47).

Of the 253 publications examining surgical treatments for lumbar spinal stenosis, 147 surgical trials met the inclusion criteria for analysis (see Table 47). Of these publications, only seven were RCTs. Within this group, patients with central stenosis or lateral stenosis were examined in one trial each, and degenerative spondylolisthesis was examined in four trials. The seventh trial was the only study to randomize patients with moderate symptoms to either conservative or surgical treatment.

Forty-seven trials contained more than one treatment arm and have been classified as controlled trials. In these studies, the patient groups were usually not matched and received treatment based on the surgeon's judgment of the patient's condition. Nine of these controlled trials had a nonoperated control group. However, the extent of disease in these patients was generally not as severe as that in patients who received surgery.

Ninety-two trials examined only one surgical technique. These trials were separately classified as prospective (7), retrospective with consecutive patients (28), and case series (57, retrospective but did not report examining consecutive patients).

Reporting of Patient Characteristics, Signs, and Symptoms

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3660_F024.jpg.

   Figure 24. Histogram of Number of Trials Reporting Patient Signs, Symptoms, and Other Characteristics in Randomized Controlled Trials of Conservative Treatments

Reasonable evidence-based conclusions can sometimes be reached from observational studies. Doing so, however, often requires that the studies under consideration provide detailed descriptions of the enrolled patients. Reporting of patient characteristics, signs, and symptoms is inconsistent among the published trials of both conservative and surgical treatments for lumbar spinal stenosis. Evidence Table 1 for the conservative trials, Evidence Table 2 for the surgical trials, and Evidence Table 3 for the natural history trials present a list of patient information commonly reported in clinical trials of lumbar spinal stenosis. The tables indicate with a Yes or No if the trials reported each item. These patient characteristics, signs, and symptoms may potentially influence the outcome of any treatment regime and could be used in a regression analysis of outcome modifiers. Mean age and duration of illness before treatment is the only patient information reported in all four conservative treatment trials. The presence of neurogenic claudication, back or leg pain, and extent of stenosis are reported in one publication each. Comorbidity such as hip and knee disorders are not reported in any of the four trials (see Figure 24).

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3660_F025.jpg.

   Figure 25. Histogram of Number of Trials Reporting Patient Signs, Symptoms, and Other Characteristics in Trials of Surgical Treatments

Among the surgical trials, only mean age was reported in more than 80 percent of the publications (see Figure 25). The next most frequently reported patient characteristic was the number of levels surgically involved (54 percent). Neurogenic claudication was reported in 46 percent of trials. Back pain and leg pain were reported in 40 percent of trials and duration of illness before surgery, prior back surgery, leg weakness, and sensory dysfunction were reported in 31 percent to 35 percent of trials. Extent of stenosis and the number of patients with herniated disks were reported in 18 percent of trials. The combination of mean age, number of surgical levels involved, and the presence of neurogenic claudication were reported in 40 trials ( 27 percent of all surgical trials). The addition of reporting back pain dropped the number of trials reporting all four items to 29 (20 percent). Only five trials (3.5 percent) provided information on mean age, surgical levels, neurogenic claudication, back pain, leg pain, duration of illness before surgery, prior back surgery, leg weakness, and sensory dysfunction. The lack of detailed reporting of patient signs and symptoms complicates any attempt at correlating signs and symptoms with the extent of recovery after treatment.

Patient Inclusion Criteri

Lack of detailed reporting of patient characteristics can sometimes be overcome if studies employ rather restrictive patient inclusion criteria. Available information suggests that such criteria were only infrequently applied in studies of lumbar spinal stenosis. Most studies, 125 of the 147 surgical trials (85 percent), enrolled patients of markedly different ages and typically had age ranges of 30 to 80 years.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3660_F026.jpg.

   Figure 26. Plot of Mean Ages and Age Ranges from 125 Studies of Lumbar Spinal Stenosis

Figure 26 shows that although the average age of patients in these studies was somewhat similar (most trials were of middle-aged or older patients), there was nevertheless a wide range of mean ages. Among the 125 studies that reported a mean age and age range, the means start at 34 years and go to a high of 85 years. Further, the age variation is even greater within studies. The ranges of individual studies often extended to 30 years on either side of the mean. Granted, ranges and means can be distorted by a few "outliers," but whether these ranges were the result of such outliers could not be determined. This is because published studies typically provide no estimate of the number of patients younger or older than the mean age.

Several reviews examining the diagnosis and treatment of lumbar spinal stenosis have recommended that a period of conservative therapy be administered before proceeding to surgery (Fritz, Delitto, Welch et al., 1998; Jenis and An, 2000; McCowan, Bowden, and Weisel, 1989; Nagler and Hausen, 1998; Palumbo, Lucas, and Akelman, 1995; Shakil, Vaccaro, Albert et al., 1999; Shiroky, Salanga, and Ropos, 1996; Woolsey, 1986; Zdeblick, 1995). If the conservative therapy fails to improve symptoms and function, then surgery is warranted. Given this recommendation, however, only 15 of the 147 trials of surgical treatments reported that their patients received, and showed no improvement after, a course of conservative treatment. In these 15 studies, the actual amount of prior conservative treatment ranged from as little as 2 weeks to as much as 16 years. Some trials stated that they required an "adequate" period of treatment before surgery but did not define adequate.

Appendix: Electronic Data Abstraction Forms

Input Form for Trial Information Table

Lumbar Spinal Stenosis EPC Project

Record Entry Information

Person Extracting and Entering Record:Date When Record Was Entered:Person Reviewing Record:Date When Record Was Reviewed:
Trial Design and IdentificationUnique ID Number for Trial Publication
111111
Last Name of First AuthorYear of PublicationStudy Design:
Total Number of Patients in Trial
Purpose of Trial:Diagnosis of Patient ConditionYear When Study Was Performed
Country Where Study Was Performed
Clinical Setting Where Trial Was PerformedCentersInstitution of First Author:Funding Source for Clinical Trial:
Confounding or Comments on Study Design and Patient Characteristics:
Diagnostic Imaging Method UsedElectromyography or Somatosensory Diagnostics Used

Patient Outcomes Measured

When describing the method of outcome measure include the number of levels on any scale used in the trial. Typically 3 to 5 levels are listed as Excellent, Good, Fair, Poor, and Very Poor, or the outcomes is considered Satisfactory or Unsatisfactory (2 levels).

Method of Followup to Record Patient OutcomesIndividual Patient Data is Presented
Measured Back PainPerson Reporting Back Pain:Method Used to Measure Back Pain
Measured Back Pain ReliefPerson Reporting Back Pain Relief:Method Used to Measure Back Pain Relief:
Measured Leg PainPerson Reporting Leg Pain:Method Used to Measure Leg Pain
Measured Leg Pain Relief:Person Reporting Leg Pain ReliefMethod Used to Measure Leg Pain Relief:
Measured WalkingPerson Reporting Walking:Method Used to Measure Walkingm
Measured Global SuccessPerson Reporting Global Success:Method Used to Measure Global Success
Number of Levels Used in Global Success Rating System5 or more levels4 levels3 levels2 levels
Measured Ability to WorkPerson Reporting Working:Method Used to Measure Ability to Work
Measured Quality of LifePerson Reporting Quality of Life:Method Used to Measure Quality of Life
Measured Mental StatusPerson Reporting Mental Status:Method Used to Measure Mental Status
Measured Disability/DependencePerson Reporting Disability or Dependency:Method Used to Measure Disability/Dependency
Measured Activities of Daily LivingPerson Reporting Activities of Daily Living and Physical Function:Method for Measuring Activities of Daily Living and Physical Function:

Patient Inclusion and Exclusion Criteria

Months of Nonoperative Therapy Required Prior to Surgery:

Trial Inclusion Criteria

Trial Exclusion Criteria

Jadad Score

RandomizationMethod of RandomizationHow to Determine the Jadad Score:
Jadad Score
Double BlindingMethod of Blinding Used in TrialGive one point for each Check listed above for Randomization, Double Blinding, and Describing Withdrawals and Dropouts. Give one additional point each for describing an appropriate method of randomization and double blinding. Deduct one point each for describing an inappropriate method of randomization and double blinding.
Described Withdrawals and Dropouts

Used in Meta-analysis

Trial was Excluded from Meta-analysis

Reason for Trial Exclusion from Meta-analysis

After entering the data in this form, go to the Patient Characteristics Input Form to enter Patient Characteristics for each arm of the trial. Then enter published data into the appropriate outcomes tables.

Input Form for Patient Characteristics Data for Each Disorder/Treatment Arm in a Trial

Lumbar Spinal Stenosis EPC Project

A separate record is entered for each Patient Group within a Trial (Arm of a Trial). Each Patient Group is distinguished by its Alexandria ID number (linked to Trial publication), Disorder, and Treatment, and Post Treatment Group. These four keys uniquely identifies each Patient Group within a Trial. Each Patient Group receives a unique ID Number

Patient Group Keys

The Patient Group Identification Number is composed of the Alexandria Number followed by a two digit number that uniquely identifies the patients in this arm of the trial

Unique ID NumberPatient Group Identification Number
111111111111-01
Type of Lumbar Spinal Stenosis Disorder:Type of Surgical or Conservative Treatment Received by Patient Group:

Treatment Group Divided by Post Surgical Outcome:

Treatment Description

Number of Patients with Single Level Surgery:Method of FusionName of Device Used for InstrumentationDescription of Conservative Therapy (Number of Treatments, Drug Dosage)
Number of Patients with Multiple Level Surgery:
Mean Number of Levels Surgically Removed:Description of Any Concurrent Therapy

Description of Patient Group Characteristics

Number of PatientsReported Number of Male and Female PatientsNumber of MalesNumber of Females
Mean AgeStandard Deviation of Mean AgeAge of Youngest PatientAge of Oldest PatientTotal Number of Patients to Drop OutNumber of Drop Outs Due to Death Lost to FollowupNumber of Drop Outs Recorded as
Mean or Median Number of Months For Duration of Condition Prior to TreatmentStandard Deviation of the Mean Number of months of Duration of Condition Prior to TreatmentShortest Period in Months of Duration of Condition Prior to TreatmentLongest Period in Months of Duration of Condition Prior to TreatmentNumber of Patients in Trial/Treatment Group with Prior Surgery

Patient Signs and Symptoms Prior to Treatment

Reported Back PainNumber of Patients with Back PainNumber of Patients with Unilateral Back PainNumber of Patients with Bilateral Back Pain
Reported Leg PainNumber of Patients with Leg PainNumber of Patients with Unilateral Leg PainNumber of Patients with Bilateral Leg Pain
Reported Back and Leg PainNumber of Patients with Back and Leg PainNumber of Patients with Unilateral Back and Leg PainNumber of Patients with Bilateral Back and Leg Pain
Reported Leg WeaknessNumber of Patients with Leg WeaknessNumber of Patients with Unilateral Leg WeaknessNumber of Patients with Bilateral Leg Weakness
Reported ClaudicationNumber of Patients with Neurogenic ClaudicationNumber of Patients with Unilateral Neurogenic ClaudicationNumber of Patients with Bilateral Neurogenic Claudication
Reported Sensory DysfunctionNumber of Patients with Sensory DysfunctionNumber of Patients with Unilateral Sensory DysfunctionNumber of Patients with Bilateral Sensory Dysfunction
Reported FlexionNumber of Patients in Which Pain is Relieved by Flexion
Reported IncontinenceNumber of Patients with Bowel or Bladder Incontinence

Extent of Disease in Each Patient Group Prior to Treatment

Extent of disease can be based on the diameter of the spinal canal

Reported Extent of Disease

Number of Patients with Mild Disease Canal Diameter > 12 mmNumber of Patients with Moderate Disease Canal Diameter <12Number of Patients with Severe Disease Total or subtotal (partial) block

Comorbidity in Each Patient Group

Comorbidity ReportedNumber of Patients With No Reported Comorbidity
Number of Patients with Heart or Cardiovascular DiseaseNumber of Patients with Kidney Disease
Number of Patients with Liver DiseaseNumber of Patients with Diabetes
Number of Patients with HypertensionNumber of Patients with Hip Disease
Number of Patients with Knee DiseaseNumber of Patients with Arthritis
Number of Patients with Herniated DiskNumber of Patients With Neurological Disease
Number of Patients With Pulmonary DiseaseNumber of Patients with Other Degenerative Spinal Disorders
Number of Patients with Other ComorbidityDescription of Other Comorbidity

Ability to Work Outcomes Input Form

Lumbar Spinal Stenosis EPC Project

Unique ID NumberPatient Group Identification Number
111111111111-11

Ability to Work Outcome Measurements

These Are Pretreatment/Baseline Outcomes DataIf these are pretreatment/baseline data, enter 0 in the Time Post Treatment box and make no entries in the Statistics boxes. If these are control group data, enter the appropriate number of months in the Time Post Treatment box and make no entries in the Statistics boxes.
Number of PatientsNumber of Patients Working
Time in Months Post Treatment When Outcome was Measured (Mean or Set)Standard Deviation of Mean Time When Outcome was MeasuredMinimum Time in Months Post Treatment When Outcome was Measured:Maximum Time in Months Post Treatment When Outcome was Measured:
Statistical ApproachIf the statistical comparison is reported as not significant, then enter 0.5 as the p value. If the p value is reported as less than 0.05 then enter 0.049 as the p value and so on for other less than p values.
P Value of Statistical Comparison to Baseline (Pretreatment) OutcomeP Value of Statistical Comparison to Control Group OutcomeControl Group ID Number

Statistic Used to Compare Outcomes

Ability to Work Outcome Scale Measurements

Number of Patients Rated as ExcellentNumber of Patients Given Satisfactory Outcome
Number of Patients Rated as GoodNumber of Patients Given Unsatisfactory Outcome
Number of Patients Rated as Fair
Number of Patients Rated as Poor
Number of Patients Rated as Very Poor

Back Pain Outcomes Input Form

Lumbar Spinal Stenosis EPC Project

Unique ID NumberPatient Group Identification Number
111111111111-11

Back Pain Outcome Measurements

These Are Pretreatment/Baseline Outcomes DataIf these are pretreatment/baseline data, enter 0 in the Time Post Treatment box and make no entries in the Statistics boxes. If these are control group data, enter the appropriate number of months in the Time Post Treatment box and make no entries in the Statistics boxes.
Number of PatientsMean of Outcome ScoreSD of Outcome Score
Time in Months Post Treatment When Outcome was Measured (Mean or Set)Standard Deviation of Mean Time When Outcome was MeasuredMinimum Time in Months Post Treatment When Outcome was Measured:Maximum Time in Months Post Treatment When Outcome was Measured:
Statistical ApproachIf the statistical comparison is reported as not significant, then enter 0.5 as the p value. If the p value is reported as less than 0.05 then enter 0.049 as the p value and so on for other less than p values.
P Value of Statistical Comparison to Baseline (Pretreatment) OutcomeP Value of Statistical Comparison to Control Group OutcomeControl Group ID Number

Statistic Used to Compare Outcomes

Back Pain Outcome Scale Measurements

Number of Patients Rated as ExcellentNumber of Patients Given Satisfactory Outcome
Number of Patients Rated as GoodNumber of Patients Given Unsatisfactory Outcome
Number of Patients Rated as Fair
Number of Patients Rated as Poor
Number of Patients Rated as Very Poor

Back Pain Relief Outcomes Input Form

Lumbar Spinal Stenosis EPC Project

Unique ID NumberPatient Group Identification Number
111111111111-11

Back Pain Relief Outcome Measurements

These Are Pretreatment/Baseline Outcomes DataIf these are pretreatment/baseline data, enter 0 in the Time Post Treatment box and make no entries in the Statistics boxes. If these are control group data, enter the appropriate number of months in the Time Post Treatment box and make no entries in the Statistics boxes.
Number of PatientsMean Outcome ScoreSD Outcome Score
Time in Months Post Treatment When Outcome was Measured (Mean or Set)Standard DeviationMinimum Time in Months Post Treatment When Outcome was Measured:Maximum Time in Months Post Treatment When Outcome was Measured:
Statistical ApproachIf the statistical comparison is reported as not significant, then enter 0.5 as the p value. If the p value is reported as less than 0.05 then enter 0.049 as the p value and so on for other less than p values.
P Value of Statistical Comparison to Pretreatment/Baseline OutcomeP Value of Statistical Comparison to Control Group OutcomeControl Group ID Number

Statistic Used to Compare Outcomes

Back Pain Relief Outcome Scale Measurements

Number of Patients Rated as ExcellentNumber of Patients Given Satisfactory Outcome
Number of Patients Rated as GoodNumber of Patients Given Unsatisfactory Outcome
Number of Patients Rated as Fair
Number of Patients Rated as Poor
Number of Patients Rated as Very Poor

Dependency/Disability Outcomes Input Form

Lumbar Spinal Stenosis EPC Project

Unique ID NumberPatient Group Identification Number
111111111111-11

Dependency/Disability Outcome Measurements

These Are Baseline/Pretreatment Outcome DataIf these are pretreatment/baseline data, enter 0 in the Time Post Treatment box and make no entries in the Statistics boxes. If these are control group data, enter the appropriate number of months in the Time Post Treatment box and make no entries in the Statistics boxes.
Number of PatientsMean Outcome ScoreSD Outcome Score
Time in Months Post Treatment When Outcome was Measured (Mean or Set)Standard DeviationMinimum Time in Months Post Treatment When Outcome was Measured:Maximum Time in Months Post Treatment When Outcome was Measured:
Statistical ApproachIf the statistical comparison is reported as not significant, then enter 0.5 as the p value. If the p value is reported as less than 0.05 then enter 0.049 as the p value and so on for other less than p values.
P Value of Statistical Comparison to Pretreatment/Baseline OutcomeP Value of Statistical Comparison to Control Group OutcomeControl Group ID Number

Statistic Used to Compare Outcomes

Dependency/Disability Outcome Scale Measurements

Number of Patients Rated as ExcellentNumber of Patients Given Satisfactory Outcome
Number of Patients Rated as GoodNumber of Patients Given Unsatisfactory Outcome
Number of Patients Rated as Fair
Number of Patients Rated as Poor
Number of Patients Rated as Very Poor

Global Assessment Outcomes Input Form

Lumbar Spinal Stenosis EPC Project

Unique ID NumberPatient Group Identification Number
111111111111-11

Global Assessment Outcome Measurements

These Are Baseline/Pretreatment Outcome DataIf these are pretreatment/baseline data, enter 0 in the Time Post Treatment box and make no entries in the Statistics boxes. If these are control group data, enter the appropriate number of months in the Time Post Treatment box and make no entries in the Statistics boxes.
Number of PatientsMean Outcome ScoreSD Outcome Score
Time in Months Post Treatment When Outcome was Measured (Mean or Set)Standard DeviationMinimum Time in Months Post Treatment When Outcome was Measured:Maximum Time in Months Post Treatment When Outcome was Measured:
Statistical ApproachIf the statistical comparison is reported as not significant, then enter 0.5 as the p value. If the p value is reported as less than 0.05 then enter 0.049 as the p value and so on for other less than p values.
P Value of Statistical Comparison to Baseline/Pretreatment OutcomeP Value of Statistical Comparison to Control Group OutcomeControl Group ID Number

Statistic to Compare Treatments or Pre-Post Outcomes

Global Assessment Outcome Scale Measurements

Number of Patients Rated as ExcellentNumber of Patients Given Satisfactory Outcome
Number of Patients Rated as GoodNumber of Patients Given Unsatisfactory Outcome
Number of Patients Rated as Fair
Number of Patients Rated as Poor
Number of Patients Rated as Very Poor

Leg Pain Outcomes Input Form

Lumbar Spinal Stenosis EPC Project

Unique ID NumberPatient Group Identification Number
111111111111-11

Leg Pain Outcome Measurements

These Are Baseline/Pretreatment Outcome DataIf these are pretreatment/baseline data, enter 0 in the Time Post Treatment box and make no entries in the Statistics boxes. If these are control group data, enter the appropriate number of months in the Time Post Treatment box and make no entries in the Statistics boxes.
Number of PatientsMean Outcome ScoreSD Outcome Score
Time in Months Post Treatment When Outcome was Measured (Mean or Set)Standard DeviationMinimum Time in Months Post Treatment When Outcome was Measured:Maximum Time in Months Post Treatment When Outcome was Measured:
Statistical ApproachIf the statistical comparison is reported as not significant, then enter 0.5 as the p value. If the p value is reported as less than 0.05 then enter 0.049 as the p value and so on for other less than p values.
P Value of Statistical Comparison to Pretreatment/Baseline OutcomeP Value of Statistical Comparison to Control Group OutcomeControl Group ID Number

Statistic Used to Compare Outcomes

Leg Pain Outcome Scale Measurements

Number of Patients Rated as ExcellentNumber of Patients Given Satisfactory Outcome
Number of Patients Rated as GoodNumber of Patients Given Unsatisfactory Outcome
Number of Patients Rated as Fair
Number of Patients Rated as Poor
Number of Patients Rated as Very Poor

Leg Pain Relief Outcomes Input Form

Lumbar Spinal Stenosis EPC Project

Unique ID NumberPatient Group Identification Number
111111111111-11

Leg Pain Relief Outcome Measurements

These Are Baseline/Pretreatment Outcome DataIf these are pretreatment/baseline data, enter 0 in the Time Post Treatment box and make no entries in the Statistics boxes. If these are control group data, enter the appropriate number of months in the Time Post Treatment box and make no entries in the Statistics boxes.
Number of PatientsMean Outcome ScoreSD Outcome Score
Time in Months Post Treatment When Outcome was Measured (Mean or Set)Standard DeviationMinimum Time in Months Post Treatment When Outcome was Measured:Maximum Time in Months Post Treatment When Outcome was Measured:
Statistical ApproachIf the statistical comparison is reported as not significant, then enter 0.5 as the p value. If the p value is reported as less than 0.05 then enter 0.049 as the p value and so on for other less than p values.
P Value of Statistical Comparison to Pretreatment/Baseline OutcomeP Value of Statistical Comparison to Control Group OutcomeControl Group ID Number

Statistic Used to Compare Outcomes

Leg Pain Relief Outcome Scale Measurements

Number of Patients Rated as ExcellentNumber of Patients Given Satisfactory Outcome
Number of Patients Rated as GoodNumber of Patients Given Unsatisfactory Outcome
Number of Patients Rated as Fair
Number of Patients Rated as Poor
Number of Patients Rated as Very Poor

Mental Status Outcomes Input Form

Lumbar Spinal Stenosis EPC Project

Unique ID NumberPatient Group Identification Number
111111111111-11

Mental Status Outcome Measurements

These Are Baseline/Pretreatment Outcome DataIf these are pretreatment/baseline data, enter 0 in the Time Post Treatment box and make no entries in the Statistics boxes. If these are control group data, enter the appropriate number of months in the Time Post Treatment box and make no entries in the Statistics boxes.
Number of PatientsMean Outcome ScoreSD Outcome Score
Time in Months Post Treatment When Outcome was Measured (Mean or Set)Standard DeviationMinimum Time in Months Post Treatment When Outcome was Measured:Maximum Time in Months Post Treatment When Outcome was Measured:
Statistical ApproachIf the statistical comparison is reported as not significant, then enter 0.5 as the p value. If the p value is reported as less than 0.05 then enter 0.049 as the p value and so on for other less than p values.
P Value of Statistical Comparison to Pretreatment/Baseline OutcomeP Value of Statistical Comparison to Control Group OutcomeControl Group ID Number

Statistic Used to Compare Outcomes

Mental Status Outcome Scale Measurements

Number of Patients Rated as ExcellentNumber of Patients Given Satisfactory Outcome
Number of Patients Rated as GoodNumber of Patients Given Unsatisfactory Outcome
Number of Patients Rated as Fair
Number of Patients Rated as Poor
Number of Patients Rated as Very Poor

Physical Function and Activities of Daily Living Outcomes Input Form

Lumbar Spinal Stenosis EPC Project

Unique ID NumberPatient Group Identification Number
111111111111-11

Physical Function and Activities of Daily Living Outcome Measurements

These Are Baseline/Pretreatment Outcome DataIf these are pretreatment/baseline data, enter 0 in the Time Post Treatment box and make no entries in the Statistics boxes. If these are control group data, enter the appropriate number of months in the Time Post Treatment box and make no entries in the Statistics boxes.
Number of PatientsMean Outcome ScoreSD Outcome Score
Time in Months Post Treatment When Outcome was Measured (Mean or Set)Standard DeviationMinimum Time in Months Post Treatment When Outcome was Measured:Maximum Time in Months Post Treatment When Outcome was Measured:
Statistical ApproachIf the statistical comparison is reported as not significant, then enter 0.5 as the p value. If the p value is reported as less than 0.05 then enter 0.049 as the p value and so on for other less than p values.
P Value of Statistical Comparison to Pretreatment/Baseline OutcomeP Value of Statistical Comparison to Control Group OutcomeControl Group ID Number

Statistic Used to Compare Outcomes

Physical Function and Activities of Daily Living Outcome Scale Measurements

Number of Patients Rated as ExcellentNumber of Patients Given Satisfactory Outcome
Number of Patients Rated as GoodNumber of Patients Given Unsatisfactory Outcome
Number of Patients Rated as Fair
Number of Patients Rated as Poor
Number of Patients Rated as Very Poor

Quality of Life Outcomes Input Form

Lumbar Spinal Stenosis EPC Project

Unique ID NumberPatient Group Identification Number
111111111111-11

Quality of Life Outcome Measurements

These Are Baseline/Pretreatment Outcome DataIf these are pretreatment/baseline data, enter 0 in the Time Post Treatment box and make no entries in the Statistics boxes. If these are control group data, enter the appropriate number of months in the Time Post Treatment box and make no entries in the Statistics boxes.
Number of PatientsMean Outcome ScoreSD Outcome Score
Time in Months Post Treatment When Outcome was Measured (Mean or Set)Standard DeviationMinimum Time in Months Post Treatment When Outcome was Measured:Maximum Time in Months Post Treatment When Outcome was Measured:
Statistical ApproachIf the statistical comparison is reported as not significant, then enter 0.5 as the p value. If the p value is reported as less than 0.05 then enter 0.049 as the p value and so on for other less than p values.
P Value of Statistical Comparison to Pretreatment/Baseline OutcomeP Value of Statistical Comparison to Control Group OutcomeControl Group ID Number

Statistic Used to Compare Outcomes

Quality of Life Outcome Scale Measurements

Number of Patients Rated as ExcellentNumber of Patients Given Satisfactory Outcome
Number of Patients Rated as GoodNumber of Patients Given Unsatisfactory Outcome
Number of Patients Rated as Fair
Number of Patients Rated as Poor
Number of Patients Rated as Very Poor

Walking Capacity Outcomes Input Form

Lumbar Spinal Stenosis EPC Project

Unique ID NumberPatient Group Identification Number
111111111111-11

Walking Capacity Outcome Measurements

These Are Baseline/Pretreatment Outcome DataIf these are pretreatment/baseline data, enter 0 in the Time Post Treatment box and make no entries in the Statistics boxes. If these are control group data, enter the appropriate number of months in the Time Post Treatment box and make no entries in the Statistics boxes.
Number of PatientsMean Outcome ScoreSD Outcome Score
Time in Months Post Treatment When Outcome was Measured (Mean or Set)Standard DeviationMinimum Time in Months Post Treatment When Outcome was Measured:Maximum Time in Months Post Treatment When Outcome was Measured:
Statistical ApproachIf the statistical comparison is reported as not significant, then enter 0.5 as the p value. If the p value is reported as less than 0.05 then enter 0.049 as the p value and so on for other less than p values.
P Value of Statistical Comparison to Pretreatment/Baseline OutcomeP Value of Statistical Comparison to Control Group OutcomeControl Group ID Number

Statistic Used to Compare Outcomes

Walking Capacity Outcome Scale Measurements

Number of Patients Rated as ExcellentNumber of Patients Given Satisfactory Outcome
Number of Patients Rated as GoodNumber of Patients Given Unsatisfactory Outcome
Number of Patients Rated as Fair
Number of Patients Rated as Poor
Number of Patients Rated as Very Poor

Treatment Complications Input Form

Lumbar Spinal Stenosis EPC Project

Unique ID NumberPatient Group Identification Number
111111111111-11

Complications

Number of PatientsTotal ComplicationsNumber of Patients Who Died due to Treatment
Number of Patients Reporting Nausea:Number of Patients Reporting Headache:Number of Patients Reporting Diarrhea:Number of Patients Reporting a General Allergic Reaction:

Number of Patients Reporting Allergic Reaction at the Injection Site:

Number of Patients Experiencing Other Adverse ReactionDescription of Other Adverse Reactions
Number of Patients with Post-Operative Spinal InstabilityNumber of Patients with Post Operative Collapsed Fusion:Number of Patients with Post-Operative InfectionNumber of Patients with Post-Operative Thrombosis
Number of Patients with Post-Operative Neurological DisorderDescription of Post-Operative Neurological Disorder
Number of Patients with Post-Operative Dural TearsNumber of Patients with Post-Operative Fractures
Number of Patients with Post-Operative ParaplegiaNumber of Patients with Post-Operative Incontinence

Spinal Canal Measurement Table Input Form

Lumbar Spinal Stenosis EPC Project

These are preoperative measurements

Unique ID NumberPatient Group Identification Number
111111111111-11

If these are control group data, make no entries in the Statistics boxes. If this is not control group data, enter the control group ID number for the statistical comparison group in the next box and enter the appropriate statistical data.

These Are Control Group Data

Control Group ID NumberNumber of Patients

Spinal Canal Diameter Measurements

Imaging Method Used to Obtain DiameterMean Lumbar Spinal Canal Diameter (mm) DiameterStandard Deviation of Lumbar Spinal Canal DiameterMinimum Lumbar Spinal Canal DiameterMaximum Lumbar Spinal Canal
Statistics Used to Compare Patient Groups for DiameterP Value of Statistical Comparison for Diameter

Spinal Canal Cross-section Measurements

Imaging Method Used to Obtain Cross-sectionMean Lumbar Spinal Canal Cross-section (mm2)Standard Deviation of Lumbar Spinal Canal Cross-sectionMinimum Lumbar Spinal Canal Cross-sectionMaximum Lumbar Spinal Canal Cross-section
Statistics Used to Compare Patient GroupsP Value of Statistical Comparison for Cross-section

Spinal Canal Ratio Measurements

Imaging Method Used to Obtain RatioMean Ratio of Canal Size to Vertebral Body SizeStandard Deviation of the Mean RatioMinimum Ratio of Canal Size to Vertebral Body SizeMaximum Ratio of Canal Size to Vertebral Body Size
Statistics Used to Compare Patient Groups for DiameterP Value of Statistical Comparison for Diameter

Spinal Canal Degeneration Measurements

Description of Method Used to Determine Spinal Degeneration Score

Mean Spinal Degeneration ScoreStandard Deviation of Mean Spinal Degeneration ScoreMinimum Spinal Degeneration ScoreMaximum Spinal Degeneration Score
Statistics Used to Compare Patient Groups for Spinal Degeneration ScoreP Value of Statistical Comparison for Spinal Degeneration Score

Spine Slip (Listhesis) Angle Measurements

Imaging Method Used to Obtain Slip Angle

Mean Slip AngleStandard Deviation of the Mean Slip AngleMinimum Slip AngleMaximum Slip Angle
Statistics Used to Compare Patient GroupsP Value of Statistical Comparison

Spine Slip (Listhesis) Percentage Measurements

Imaging Method Used to Obtain Slip %

Mean Slip %Standard Deviation of the Mean Slip %Minimum Slip %Maximum Slip %
Statistics Used to Compare Patient GroupsP Value of Statistical Comparison

Patients with Slip (Olisthesis)

Imaging Method Used to Obtain OlisthesisDefinition of Olisthesis
Number of Patients with OlisthesisPercentage of Patients with Olisthesis
Statistics Used to Compare Patient GroupsP Value of Statistical Comparison

Surgical Outcomes Input Form

Lumbar Spinal Stenosis EPC Project

Unique ID NumberPatient Group Identification
111111111111-11

If these are control group data, make no entries in the Statistics boxes. If this is not control group data, enter the control group ID number for the statistical comparison group in the next box and enter the appropriate statistical data.

These Are Control Group Data

Control Group ID NumberNumber of Patients
Time in Months Post Treatment When Outcome was Measured (Mean or Set)Standard Deviation of Mean Time When Outcome was MeasuredMinimum Time in Months Post Treatment When Outcome was Measured:Maximum Time in Months Post Treatment When Outcome was Measured:

Surgical Outcome Fusion Failure

Number of Patients with Post Operative Fusion FailurePercentage of Patients with Failed FusionP Value of Statistical Comparison to Control Group Fusion Outcome

Surgical Outcome Instrumentation Failure

Number of Patients with Post Operative Instrumentation FailurePercentage of Patients with Failed InstrumentationP Value of Statistical Comparison to Control Group Instrumentation Outcome

Surgical Outcome Spondylolisthesis

Number of Patients with Spondylolisthesis ProgressionRate or Percentage of Patients with Spondylolisthesis ProgressionExtent of Olisthesis in Millimeters.P Value for Comparison of Spondylolisthesis Progression
Description of Grading System for Spondylolisthesis Progression
Statistical ApproachStatistic Used to Compare Outcome

Selected Acronyms Used in This Report

ACR: American College of Radiology

ADL: Activities of daily living

AHRQ: Agency for Healthcare Research and Quality

AO: Arbeitsgemeinschaft fur Osteosynthesefragen

AP: Anteroposterior

BESD: Binomial effect size display

CAR: Canadian Association of Radiologists

CAT: Computerized axial tomography

CI: Confidence interval

CL: Confidence limit

CT: Computed tomography

df: Degree(s) of freedom

EPC: Evidence-based Practice Center

HCFA: Health Care Financing Administration (now Centers for Medicare & Medicaid Services)

HCUP: Healthcare Cost and Utilization Project

JOA: Japanese Orthopedic Association

LSS: Lumbar spinal stenosis

MECQA: Medical Cost and Quality Assistance

MRI: Magnetic resonance imaging

NAMCS: National Ambulatory Medical Care Survey

NS: Nonsignificant

NSAID: Nonsteroidal anti-inflammatory drug

NSN: National Spine Network

QOL: Quality of life

RCT: Randomized controlled trial

SD: Standard deviation

SWDL: Standard wide decompressive laminectomy

TENS: Transcutaneous electrical nerve stimulation

References
Agency for Healthcare Research and Quality, Healthcare Cost and Utilization Project. HCUPnet [dataset online]. [cited 2000 Mar 13]. Available: http://www.ahcpr.gov/data/hcup/hcupnet.htm.
Airaksinen O, Herno A, Turunen V, et al. Surgical outcome of 438 patients treated surgically for lumbar spinal stenosis Spine 1997 Oct 1. 22:(19):227882.
Alvarez JA, Hardy RH Jr. Lumbar spine stenosis: a common cause of back and leg pain Am Fam Physician 1998 Apr 15. 57:(8):182534, 1839-40.
ACR Appropriateness Criteria for Imaging and Treatment Decisions [online]. The American College of Radiology; Reston, VA. [cited 2000 May 31]. Available: http://www.acr.org/departments/appropriateness_criteria/toc.html.
ACR standard for the performance of myelography and cicternography. [article online]. American College of Radiology; 1999 May 25[cited 2000 May 04]. [4 p]. Available: http://www.acr.org.
Amundsen T, Weber H, Nordal HJ, et al. Lumbar spinal stenosis: conservative or surgical management? A prospective 10-year study Spine 2000 Jun 1. 25:(11):142435.discussion 1435-6 .
Amundsen T, Weber H, Lilleas F, et al. Lumbar spinal stenosis. Clinical and radiologic features Spine 1995 May 15. 20:(10):117886.
Anderson DJ, Adcock DF, Chovil AC, et al. Ultrasound lumbar canal measurement in hospital employees with back pain Br J Ind Med 1988 Aug. 45:(8):5525. [PubMed].
Antti-Poika I, Soini J, Tallroth K, et al. Clinical relevance of discography combined with CT scanning. A study of 100 patients J Bone Joint Surg Br 1990 May. 72:(3):4805. [PubMed].
Atlas SJ, Deyo RA, Keller RB, et al. The Maine Lumbar Spine Study, Part III. 1-year outcomes of surgical and nonsurgical management of lumbar spinal stenosis Spine 1996 Aug 1. 21:(15):178794; discussion 1794-5.
Atlas SJ, Keller RB, Robson D, et al. Surgical and nonsurgical management of lumbar spinal stenosis Spine 2000 Mar. 25:(5):55662. [PubMed].
Bell GR, Rothman RH, Booth RE, et al. A study of computer-assisted tomography. II. Comparison of metrizamide myelography and computed tomography in the diagnosis of herniated lumbar disk and spinal stenosis Spine 1984 Sep. 9:(6):5526. [PubMed].
Berlemann U, Jeszenszky DJ, Buhler DW, et al. The role of lumbar lordosis, vertebral end-plate inclination, disk height, and facet orientation in degenerative spondylolisthesis J Spinal Disord 1999 Feb. 12:(1):6873. [PubMed].
Bernard TN, Yong-Hing K. Surgical management. In: Managing low back pain. 4th ed. Philadelphia (PA): Churchill Livingstone; p. 390-419.
Boden SD, Davis DO, Dina TS, et al. Abnormal magnetic-resonance scans of the lumbar spine in asymptomatic subjects. A prospective investigation J Bone Joint Surg Am 1990 Mar. 72:(3):4038. [PubMed].
Borenstein DG. Chronic low back pain Rheum Dis Clin North Am 1996 Aug. 22:(3):43956. [PubMed].
Brewerton DA. Degenerative joint disease in the spines of patients with OA hips J Rheumatol 1983. 10:(Suppl 9):345.
Bridwell KH, Sedgewick TA, O'Brien MF, et al. The role of fusion and instrumentation in the treatment of degenerative spondylolisthesis with spinal stenosis J Spinal Disord 1993 Dec. 6:(6):46172. [PubMed].
Bueff HU, Van der Reis W. Low back pain Prim Care 1996. 23:(2):34564. [PubMed].
Carey TS, Garrett J. Patterns of ordering diagnostic tests for patients with acute low back pain. The North Carolina Back Pain Project Ann Intern Med 1996 Nov 15. 125:(10):80714.
Carragee EJ, Kim DH. A prospective analysis of magnetic resonance imaging findings in patients with sciatica and lumbar disk herniation. Correlation of outcomes with disk fragment and canal morphology Spine 1997 Jul 15. 22:(14):165060.
Cauchoix J, Benoist M, Chassaing V. Degenerative spondylolisthesis Clin Orthop 1976 Mar-Apr(115):1229.
Choiniere M, Melzack R, Girard N, et al. Comparisons between patients' and nurses' assessment of pain and medication efficacy in severe burn injuries Pain 1990 Feb. 40:(2):14352. [PubMed].
Churchill DN, Taylor DW, Shimizu AG, et al. Dialyzer re-use--a multiple crossover study with random allocation to order of treatment Nephron 1988. 50:(4):32531. [PubMed].
Ciol MA, Deyo RA, Howell E, et al. An assessment of surgery for spinal stenosis: time trends, geographic variations, complications, and reoperations J Am Geriatr Soc 1996 Mar. 44:(3):28590. [PubMed].
Ciol MA, Deyo RA, Kreuter W, et al. Characteristics in Medicare beneficiaries associated with reoperation after lumbar spine surgery Spine 1994 Jun 15. 19:(12):132934.
Clark K. Significance of the small lumbar spinal canal: cauda equina compression syndromes due to spondylosis. 2. Clinical and surgical significance J Neurosurg 1969 Nov. 31:(5):4958. [PubMed].
Cohen J. Statistical power analysis for the behavioral sciences. 2nd ed. New York: Academic Press; 1988. 567 p.
Colditz GA, Miller JN, Mosteller F. How study design affects outcomes in comparisons of therapy. I: medical Stat Med 1989. 8:(4):441454. [PubMed].
Crosby DL, Brant-Zawadzki M. Imaging of spine trauma. In: Textbook of diagnostic imaging. 2nd ed. Vol. 1. Philadelphia (PA): WB Saunders Company; 1994. p. 332-53.
Crouzet G, Vasdev A, Chirossel JP. Computerized tomography in lumbosciaticas of degenerative origin (disk hernia, lumbar arthrosis, narrow lumbar canal). Diagnostic value and radiosurgical correlations J Neuroradiol 1983. 10:(4):32544. [PubMed].
Cuckler JM, Bernini PA, Wiesel SW, et al. The use of epidural steroids in the treatment of lumbar radicular pain. A prospective, randomized, double-blind study J Bone Joint Surg Am 1985 Jan. 67:(1):636. [PubMed].
Daniel M, Long C, Murphy W, et al. Therapists' and chronic pain patients' perceptions of treatment outcome J Nerv Ment Dis 1983 Dec. 171:(12):72933. [PubMed].
Davis H. Increasing rates of cervical and lumbar spine surgery in the United States, 1979-1990 Spine 1994 May 15. 19:(10):111723; discussion 1123.
Dawson E, Bernbeck J. The surgical treatment of low back pain Phys Med Rehabil Clin N Am 1998 May. 9:(2):48995, ff. [PubMed].
De Haan JA, Eward AM. Practice variation in the use of lumbar myelograms for routine evaluation of lower back pain and sciatica. In: Issues and answers for improving health in the 21st century. Washington (DC): AHSR; 1997. p. 267.
Deyo RA, Diehl AK, Rosenthal M. Reducing roentgenography use. Can patient expectations be altered? Arch Intern Med 1987 Jan. 147:(1):1415. [PubMed].
Deyo RA, Tsui-Wu YJ. Descriptive epidemiology of low-back pain and its related medical care in the United States Spine 1987 Apr. 12:(3):2648. [PubMed].
Djukic S, Vahlensieck M, Resendes M, et al. The lumbar spine: postoperative magnetic resonance imaging Bildgebung 1992 Sep. 59:(3):13646. [PubMed].
Downing MN, Schnitzlein NH, Clarke LP, et al. Magnetic resonance imaging of the lumbar spine Rev Interam Radiol 1986. 11:(4):12732.
Drinkall JN, Porter RW, Hibbert CS, et al. Value of ultrasonic measurement of spinal canal diameter in general practice Br Med J (Clin Res Ed) 1984 Jan 14. 288:(6411):1212.
Dullerud R, Johansen JG. CT-diskography in patients with sciatica. Comparison with plain CT and MR imaging Acta Radiol 1995 Sep. 36:(5):497504. [PubMed].
DuPriest CM. Nonoperative management of lumbar spinal stenosis J Manipulative Physiol Ther 1993 Jul-Aug. 16:(6):4114. [PubMed].
Eberhardt KE, Hollenbach HP, Tomandl B, et al. Three-dimensional MR myelography of the lumbar spine: comparative case study to X-ray myelography Eur Radiol 1997. 7:(5):73742. [PubMed].
ECRI. Scanning systems, computed tomography, full-body. Plymouth Meeting (PA): ECRI; 1999 Aug. 17 p. (Healthcare product comparison system; no. 08/99).
Eisenberg RL, Margulis AR. Herniated nucleus pulposus. In: What To Order When. Pocket Guide to Diagnostic Imaging. 2nd ed. Philadelphia (PA): Lippincott Williams & Wilkins; 2000 Jan. p. 288-9.
Eisenberg RL, Margulis AR. Spinal stenosis. In: What To Order When. Pocket Guide to Diagnostic Imaging. 2nd ed. Philadelphia (PA): Lippincott Williams & Wilkins; 2000 Jan. p. 292-3.
Eisenstein S. The trefoil configuration of the lumbar vertebral canal. A study of South African skeletal material J Bone Joint Surg Br 1980 Feb. 62-B:(1):737. [PubMed].
Elkayam O, Avrahami E, Yaron M. The lack of prognostic value of computerized tomography imaging examinations in patients with chronic non progressive back pain Rheumatol Int 1996. 16:(1):1921. [PubMed].
Engel CC, von Korff M, Katon WJ. Back pain in primary care: predictors of high health-care costs Pain 1996 May-Jun. 65:(2-3):197204. [PubMed].
Epstein JA, Epstein BS, Lavine L. Nerve root compression associated with narrowing of the lumbar spinal canal. J Neurol Neurosurg Psychiatry. 1962; 25: 16576. [PubMed]
Eskola A, Pohjolainen T, Alaranta H, et al. Calcitonin treatment in lumbar spinal stenosis: a randomized, placebo- controlled, double-blind, cross-over study with one-year followup Calcif Tissue Int 1992 May. 50:(5):4003. [PubMed].
Fanuele JC, Birkmeyer NJ, Abdu WA, et al. The impact of spinal problems on the health status of patients: have We underestimated the effect? Spine 2000 Jun 15. 25:(12):150914.
Fast A, Greenbaum M. Degenerative lumbar spinal stenosis Phys Med Rehabil St Art Rev 1995 Oct. 9:(3):67382.
Firooznia H, Benjamin V, Kricheff II, et al. CT of lumbar spine disk herniation: correlation with surgical findings Am J Roentgenol 1984. 142:(3):58792. [PubMed].
Fischgrund JS, Mackay M, Herkowitz HN, et al. 1997 Volvo Award winner in clinical studies. Degenerative lumbar spondylolisthesis with spinal stenosis: a prospective, randomized study comparing decompressive laminectomy and arthrodesis with and without spinal instrumentation Spine 1997 Dec 15. 22:(24):280712.
Fitzgerald J, Newman P. Degenerative sponsylolisthesis J Bone Joint Surg Br 1976. 58:(2):18492. [PubMed].
Fontaine S, Lee D, Maloney W, et al. CAR standards and guidelines for myelography. [article online]. 1996 Jun [cited 2000 May 04]. [8 p]. Available: http://car.ca/standards/myelography.htm.
Fredrickson BE. Pedicular fixation in lumbar stenosis. In: Lumbar Spinal Stenosis. Hagerstown (MD): Lippincott Williams & Wilkins; 2000. p. 241-57.
Friberg O. Functional radiography of the lumbar spine Ann Med 1989 Oct. 21:(5):3416. [PubMed].
Fritz JM, Delitto A, Welch WC, et al. Lumbar spinal stenosis: a review of current concepts in evaluation, management, and outcome measurements Arch Phys Med Rehabil 1998 Jun. 79:(6):7008. [PubMed].
Fritz JM, Erhard RE, Delitto A, et al. Preliminary results of the use of a two-stage treadmill test as a clinical diagnostic tool in the differential diagnosis of lumbar spinal stenosis J Spinal Disord 1997 Oct. 10:(5):4106. [PubMed].
Frymoyer JW, Newberg A, Pope MH, et al. Spine radiographs in patients with low-back pain. An epidemiological study in men J Bone Joint Surg Am 1984 Sep. 66:(7):104855. [PubMed].
Fukusaki M, Kobayashi I, Hara T, et al. Symptoms of spinal stenosis do not improve after epidural steroid injection Clin J Pain 1998 Jun. 14:(2):14851. [PubMed].
Gehweiler Jr JA, Daffner RH. Low back pain: the controversy of radiologic evaluation AJR Am J Roentgenol 1983 Jan. 140:(1):10912. [PubMed].
Gibson JN, Grant IC, Waddell G. The Cochrane review of surgery for lumbar disk prolapse and degenerative lumbar spondylosis Spine 1999 Sep 1. 24:(17):182032.
Grob D, Humke T, Dvorak J. Degenerative lumbar spinal stenosis. Decompression with and without arthrodesis J Bone Joint Surg Am 1995 Jul. 77:(7):103641. [PubMed].
Cost-Effective Diagnostic Imaging: The Clinician's Guide. 3rd ed. St Louis (MO): Mosby - Year Book, Inc.; 1994 Nov; Low back pain. p. 311-9.
Gundry CR, Heithoff KB. Lumbar spine imaging. In: Managing low back pain. 4th ed. Philadelphia (PA): Churchill Livingstone; 1999. p. 176-205.
Gunzburg R, Servais F, Verhas M. Tomoscintigraphy of the lumbar spine: prospects and clinical application Eur Spine J 1994. 3:(6):30811. [PubMed].
Gunzburg R, Szpalski M, editor (s). Lumbar spinal stenosis. Hagerstown (MD): Lippincott Williams & Wilkins; 1999. 352 p.
Hamanishi C, Matukura N, Fujita M, et al. Cross-sectional area of the stenotic lumbar dural tube measured from the transverse views of magnetic resonance imaging J Spinal Disord 1994 Oct. 7:(5):38893. [PubMed].
Hanakita J, Suwa H, MIzuno M. Surgical treatment of lumbar canal stenosis in the elderly Neurol Med Chir (Tokyo) 1999 Jul. 39:(7):51922; discussion 522-3. [Free Full Text in PMC icon.Free Full text in PMC] [PubMed].
Hart LG, Deyo RA, Cherkin DC. Physician office visits for low back pain. Frequency, clinical evaluation, and treatment patterns from a U.S. national survey Spine 1995 Jan 1. 20:(1):119.
Hasselblad V, Hedges LV. Meta-analysis of screening and diagnostic tests Psychol Bull 1995 Jan. 117:(1):16778. [PubMed].
Annual physician fee schedule payment amount file. Calendar year 1999. [dataset online]. Baltimore (MD): Department of Health and Human Services; 1999 [cited 2000 Feb 29]. Available: http://www.hcfa.gov/stats/pufiles.htm#carrpuf.
Medical Provider Analysis and Review (MEDPAR) [dataset online]. [cited 2000 Mar 13]. Available: http:www.hcfa.gov/stats/medpar/medpar.htm.
Healy JF, Healy BB, Wong WH, et al. Cervical and lumbar MRI in asymptomatic older male lifelong athletes: frequency of degenerative findings J Comput Assist Tomogr 1996 Jan-Feb. 20:(1):10712. [PubMed].
Hedges LV, Olkin I. Statistical methods for meta-analysis. Boston (MA): Academic Press, Inc.; 1985. 369 p.
Heliovaara M, Vanharanta H, Korpi J, et al. Herniated lumbar disk syndrome and vertebral canals Spine 1986 Jun. 11:(5):4335. [PubMed].
Herkowitz HN. Spine update. Degenerative lumbar spondylolisthesis Spine 1995 May 1. 20:(9):108490.
Herkowitz HN, Abrahan DJ, Fischgrund JS. The diagnosis and management of degenerative lumbar spondylolisthesis. In: CBS Healthwatch [online]. Medscape, Inc.; 1998. [11 p.]. Available: http://www.healthwatch.medscape.com/medscape/p/Library/Library_print.asp?RecID=1522.
Herkowitz HN, Kurz LT. Degenerative lumbar spondylolisthesis with spinal stenosis: a prospective study comparing decompression with decompression and intertransverse process arthrodesis J Bone Joint Surg Am 1991 Jul. 73:(6):8028. [PubMed].
Herno A, Airaksinen O, Saari T, et al. Lumbar spinal stenosis: a matched-pair study of operated and non operated patients Br J Neurosurg 1996 Oct. 10:(5):4615. [PubMed].
Herno A, Partanen K, Talaslahti T, et al. Long-term clinical and magnetic resonance imaging followup assessment of patients with lumbar spinal stenosis after laminectomy Spine 1999 Aug 1. 24:(15):15337.
Herno A, Saari T, Suomalainen O, et al. The degree of decompressive relief and its relation to clinical outcome in patients undergoing surgery for lumbar spinal stenosis Spine 1999 May 15. 24:(10):10104.
Hindmarsh T, Ekholm SE, Kido DK, et al. Lumbar myelography with iohexol and metrizamide. A double-blind clinical trial Acta Radiol Diagn (Stockh) 1984. 25:(5):3658. [PubMed].
Hirsch C. Etiology and pathogenesis of low back pain Isr J Med Sci 1966 May-Jun. 2:(3):36270. [PubMed].
Hultman G, Saraste H, Ohlsen H. Anthropometry, spinal canal width, and flexibility of the spine and hamstring muscles in 45-55-year-old men with and without low back pain J Spinal Disord 1992 Sep. 5:(3):24553. [PubMed].
Hurri H, Slatis P, Soini J, et al. Lumbar spinal stenosis: assessment of long-term outcome 12 years after operative and conservative treatment J Spinal Disord 1998 Apr. 11:(2):1105. [PubMed].
Inufusa A, An HS, Lim TH, et al. Anatomic changes of the spinal canal and intervertebral foramen associated with flexion-extension movement Spine 1996 Nov 1. 21:(21):241220.
Jalovaara P, Lahde P, Iikko E, et al. The significance of residual stenosis after decompression for lumbar spinal stenosis. Ann Chir Gynaecol. 1989; 78: 3048. [PubMed]
Jane JA Sr, Jane JA Jr, Helm GA, et al. Acquired lumbar spinal stenosis. Clin Neurosurg. 1996; 43: 27599. [PubMed]
Jarvik JG, Deyo RA, Koepsell TD. Screening magnetic resonance images versus plain films for low back pain: a randomized trial of effects on patient outcomes Acad Radiol 1996 Apr. 3:Suppl 1:S2831. [PubMed].
Jarvik JG, Robertson WD, Wessbecher F, et al. Variation in the quality of lumbar spin MR images in Washington state Radiology 2000 May. 215:(2):48390. [PubMed].
Javid MJ, Hadar EJ. Long-term followup review of patients who underwent laminectomy for lumbar stenosis: a prospective study J Neurosurg 1998 Jul. 89:(1):17. [PubMed].
Jenis LG, An HS. Spine update. Lumbar foraminal stenosis Spine 2000 Feb 1. 25:(3):38994.
Jensen MC, Brant-Zawadzki MN, Obuchowski N, et al. Magnetic resonance imaging of the lumbar spine in people without back pain N Engl J Med 1994 Jul 14. 331:(2):6973.
Jinkins JR. Gd-DTPA enhanced MR of the lumbar spinal canal in patients with claudication J Comput Assist Tomogr 1993 Jul-Aug. 17:(4):55562. [PubMed].
Johnsson K. Lumbar spinal stenosis: a retrospective study of 163 cases in southern Sweden Acta Orthop Scand 1995 Oct. 66:(5):4035. [PubMed].
Johnsson KE, Uden A, Rosen I. The effect of decompression on the natural course of spinal stenosis. A comparison of surgically treated and untreated patients Spine 1991 Jun. 16:(6):6159. [PubMed].
Johnsson KE, Uden A, Rosen I. The natural course of lumbar spinal stenosis. Clin Orthop. 1992; 279: .
Johnsson KE, Willner S, Pettersson H. Analysis of operated cases with lumbar spinal stenosis Acta Orthop Scand 1981. 52:(4):42733. [PubMed].
Jones RA, Thomson JL. The narrow lumbar canal. J Bone Joint Surg Br. 1968; 50: 595605. [PubMed]
Jonsson B, Annertz M, Sjoberg C, et al. A prospective and consecutive study of surgically treated lumbar spinal stenosis. Part II: Five-year followup by an independent observer Spine 1997 Dec 15. 22:(24):293844.
Karantanas AH, Zibis AH, Papaliaga M, et al. Dimensions of the lumbar spinal canal: variations and correlations with somatometric parameters using CT Eur Radiol 1998. 8:(9):15815. [PubMed].
Katz JN, Dalgas M, Stucki G, et al. Degenerative lumbar spinal stenosis. Diagnostic value of the history and physical examination Arthritis Rheum 1995 Sep. 38:(9):123641. [PubMed].
Katz JN, Lipson SJ, Lew RA, et al. Lumbar laminectomy alone or with instrumented or noninstrumented arthrodesis in degenerative lumbar spinal stenosis. Patient selection, costs, and surgical outcomes Spine 1997 May 15. 22:(10):112331.
Katz JN, Stucki G, Lipson SJ, et al. Predictors of surgical outcome in degenertive lumbar spinal stenosis Spine 1999. 24:(21):222933. [PubMed].
Kauppila LI, Eustace S, Kiel DP, et al. Degenerative displacement of lumbar vertebrae. A 25-year followup study in Framingham Spine 1998 Sep 1. 23:(17):186873; discussion 1873.
Kawauchi Y, Yone K, Sakou T. Myeloscopic observation of adhesive arachnoiditis in patients with lumbar spinal canal stenosis Spinal Cord 1996 Jul. 34:(7):40310. [PubMed].
Kent DL, Haynor DR, Larson EB, et al. Diagnosis of lumbar spinal stenosis in adults: a metaanalysis of the accuracy of CT, MR, and myelography AJR Am J Roentgenol 1992 May. 158:(5):113544. [PubMed].
Khandelwal N, Takhtani D, Malik N, et al. CT in nonvisualised vertebral levels caudal to the complete block on lumbar myelogram Neurol India 1996. 44:(2):5962.
Kikuchi S, Hasue M, Nishiyama K, et al. Anatomic and clinical studies of radicular symptoms Spine 1984 Jan-Feb. 9:(1):2330. [PubMed].
Kim NH, Lee JW. The relationship between isthmic and degenerative spondylolisthesis and the configuration of the lamina and facet joints Eur Spine J 1995. 4:(3):13944. [PubMed].
Kirkaldy-Willis WH, Bernard TN Jr, editor (s). Managing low back pain. 4th ed. Philadelphia (PA): Churchill Livingstone; 1999. 434 p.
Kirkaldy-Willis WH, Bernard TN Jr. Making a specific diagnosis. In: Managing Low Back Pain. 4th ed. Philadelphia (PA): Churchill Livingstone; 1999. p. 206-26.
Kirkaldy-Willis WH, Wedge JH, Yong-Hing K, et al. Lumbar spinal nerve lateral entrapment Clin Orthop 1982 Sep(169):1718.
Kleeman TJ, Hiscoe AC, Berg EE. Patient outcomes after minimally destabilizing lumbar stenosis decompression Spine 2000. 25:(7):865870. [PubMed].
Kuroki H, Tajima N, Hirakawa S, et al. Comparative study of MR myelography and conventional myelography in the diagnosis of lumbar spinal diseases J Spinal Disord 1998. 11:(6):48792. [PubMed].
LaRocca H, Macnab I. Value of pre-employment radiographic assessment of the lumbar spine. Can Med Assoc J. 1969; 101: 3838.
Lawrence JS. Disk degeneration. Its frequency and relationship to symptoms Ann Rheum Dis 1969 Mar. 28:(2):12138. [PubMed] [Free Full Text in PMC icon.Free Full text in PMC].
Lazennec JY, Ramare S, Arafati N, et al. Importance of sagittal adjustment in lumbosacral fusion. In: Lumbar Spinal Stenosis. Hagerstown (MD): Lippincott Williams & Wilkins; 1999. p. 329-44.
Lee CK, deBari A. Lumbosacral spinal fusion with Knodt distraction rods Spine 1986 May. 11:(4):3735. [PubMed].
Lombardi JS, Wiltse LL, Reynolds J, et al. Treatment of degenerative spondylolisthesis Spine 1985 Nov. 10:(9):8217. [PubMed].
Long DM, BenDebba M, Torgerson WS, et al. Persistent back pain and sciatica in the United States: patient characteristics J Spinal Disord 1996 Feb. 9:(1):4058. [PubMed].
Lusins JO, Elting JJ, Cicoria AD, et al. SPECT evaluation of lumbar spondylolysis and spondylolisthesis Spine 1994 Mar 1. 19:(5):60812.
Macdonald EB, Porter R, Hibbert C, et al. The relationship between spinal canal diameter and back pain in coal miners. Ultrasonic measurement as a screening test? J Occup Med 1984 Jan. 26:(1):238. [PubMed].
MacGibbon B, Farfan HF. A radiologic survey of various configurations of the lumbar spine Spine 1979 May-Jun. 4:(3):25866. [PubMed].
MacNab I. Backache. Baltimore (MD): Williams and Wilkins; 1977. 235 p.
Magora A, Schwartz A. Relation between the low back pain and x-ray findings. Degenerative osteoarthritis. Scand J Rehabil Med. 1978; 8: 11525.
Maher CO, Henderson FC. Lateral exit-zone stenosis and lumbar radiculopathy J Neurosurg 1999 Jan. 90:(1 Suppl):528. [PubMed].
Malter AD, McNeney B, Loeser JD, et al. 5-year reoperation rates after different types of lumbar spine surgery Spine 1998 Apr 1. 23:(7):81420.
Mardjetko SM, Connolly PJ, Shott S. Degenerative lumbar spondylolisthesis. A meta-analysis of literature 1970-1993 Spine 1994 Oct 15. 19:(20 Suppl):2256S2265S.
Mariconda M, Zanforlino G, Celestino GA, et al. Factors influencing the outcome of degenerative lumbar spinal stenosis J Spinal Disord 2000 Apr. 13:(2):1317. [PubMed].
McCowan PR, Bowden SD, Weisel SW. Conservative therapy for spinal stenosis Semin Spine Surg 1989. 1:(3):15562.
McCullen GM, Bernini PM, Bernstein SH, et al. Clinical and roentgenographic results of decompression for lumbar spinal stenosis J Spinal Disord 1994 Oct. 7:(5):3807. [PubMed].
McCutcheon ME, Thompson WC 3d. CT scanning of lumbar discography. A useful diagnostic adjunct Spine 1986 Apr. 11:(3):2579. [PubMed].
McLaren AC, Bailey SI. Cauda equina syndrome: a complication of lumbar discectomy. Clin Orthop. 1986 Mar; 204: 1439. [PubMed]
Medical Cost and Quality Assistance (MECQA) [database online]. [cited 1999 Nov 13]. Available: http://www.mecqa.com.
Mikhael MA, Ciric I, Tarkington JA, et al. Neuroradiological evaluation of lateral recess syndrome Radiology 1981 Jul. 140:(1):97107. [PubMed].
Mirkovic S, Garfin SR, Rydevik B, et al. Pathophysiology of spinal stenosis. Instr Course Lect. 1992; 41: 16577. [PubMed]
Mitchell GE. (Radiology Department; Crouse Irving Memorial Hospital). Personal communication. 1 p.
Modic MT, Masaryk T, Boumphrey F, et al. Lumbar herniated disk disease and canal stenosis: prospective evaluation by surface coil MR, CT, and myelography AJR Am J Roentgenol 1986 Oct. 147:(4):75765. [PubMed].
Modic MT, Ross JS, Obuchowski NA, et al. Contrast-enhanced MR imaging in acute lumbar radiculopathy: a pilot study of the natural history Radiology 1995 May. 195:(2):42935. [PubMed].
Morris EW, Di Paola M, Vallance R, et al. Diagnosis and decision making in lumbar disk prolapse and nerve entrapment Spine 1986 Jun. 11:(5):4369. [PubMed].
Muto M, De Maria G, Izzo R, et al. Nondiscal lumbar radiculopathy: Combined diagnostic approach by CT and MR Riv Neuroradiol 1997. 10:(2):16573.
Nagler W, Bodack MP. Management options in lumbar spinal stenosis. In: Ernst E, Jayson MI, Pope MH, editor (s). Advances in idiopathic low back pain. Vienna: Blackwell-MZV; 1993. p. 292-7.
Nagler W, Hausen HS. Conservative management of lumbar spinal stenosis. Identifying patients likely to do well without surgery Postgrad Med 1998 Apr. 103:(4):6971, 76, 81-3 passim. [PubMed].
Nelson BW, Carpenter DM, Dreisinger TE, et al. Can spinal surgery be prevented by aggressive strengthening exercises? A prospective study of cervical and lumbar patients Arch Phys Med Rehabil 1999. 80:(1):205. [PubMed].
Newman PH. Stenosis of the lumbar spine in spondylolisthesis Clin Orthop 1976(115):11621.
Niggemeyer O, Strauss JM, Schulitz KP. Comparison of surgical procedures for degenerative lumbar spinal stenosis: a meta-analysis of the literature from 1975 to 1995 Eur Spine J 1997. 6:(6):4239. [PubMed].
Nishizawa T, Fujimura Y. A clinical study of anterior lumbar interbody fusion for degenerative spondylolisthesis J Orthop Surg 1997. 5:(1):217.
Omojola MF, Vas W, Banna M. Plain film assessment of spinal canal stenosis Can Assoc Radiol J 1981. 32:(2):956.
Onel D, Sari H, Donmez C. Lumbar spinal stenosis: clinical/radiologic therapeutic evaluation in 145 patients. Conservative treatment or surgical intervention? Spine 1993 Feb. 18:(2):2918. [PubMed].
Osborn RE, Knight JA. Retrospective comparison of computed tomography and myelography of the lumbar spine: a study of 37 patients with surgical correlation J Am Osteopath Assoc 1986. 86:(6):3504. [PubMed].
Pai VS. Ultrasound measurement of the lumbar spinal canal J Orthop Surg 1993. 1:(1):1036.
Paine KW. Results of decompression for lumbar spinal stenosis Clin Orthop 1976 Mar-Apr(115):96100.
Palumbo MA, Lucas P, Akelman E. Lumbar spinal stenosis: a review R I Med J 1995 Nov. 78:(11):3213.
Panjabi MM, Duranceau J, Goel V, et al. Cervical human vertebrae. Quantitative three-dimensional anatomy of the middle and lower regions Spine 1991 Aug. 16:(8):8619. [PubMed].
Parkkola R, Rytokoski U, Kormano M. Magnetic resonance imaging of the disks and trunk muscles in patients with chronic low back pain and healthy control subjects Spine 1993 Jun 1. 18:(7):8306.
Phillips RB, Howe JW, Bustin G, et al. Stress X-rays and the low back pain patient J Manipulative Physiol Ther 1990 Mar-Apr. 13:(3):12733. [PubMed].
Plotz GM, Benini A. Lumbar degenerative spondylolisthesis: Review of 106 operated cases with degenerative anterior vertebral translation as the predominant aspect of the spondylosis Neurosurg Q 1998. 8:(4):27187.
Porchet F, Vader JP, Larequi-Lauber T, et al. The assessment of appropriate indications for laminectomy J Bone Joint Surg Br 1999 Mar. 81B:(2):2349. [PubMed].
Porter RW, Ward D. Cauda equina dysfunction. The significance of two-level pathology Spine 1992 Jan. 17:(1):915. [PubMed].
Lumbar spinal stenosis. New York (NY): Springer-Verlag; 1988. 173-218 p.
Postacchini F. Management of lumbar spinal stenosis J Bone Joint Surg Br 1996 Jan. 78:(1):15464. [PubMed].
Postacchini F. Surgical management of lumbar spinal stenosis Spine 1999 May 15. 24:(10):10437.
Postacchini F, Perugia D. Degenerative lumbar spondylolisthesis. Part I: Etiology, pathogenesis, pathomorphology, and clinical features Ital J Orthop Traumatol 1991 Jun. 17:(2):16573. [PubMed].
Postacchini F, Pezzeri G. CT scanning versus myelography in the diagnosis of lumbar stenosis. A preliminary report Int Orthop 1981. 5:(3):20915. [PubMed].
Postacchini F, Pezzeri G, Montanaro A, et al. Computerized tomography in lumbar stenosis. A preliminary report J Bone Joint Surg Br 1980 Feb. 62-B:(1):7882. [PubMed].
Postacchini F, Pezzeri G, Montanaro A, et al. Computerized tomography in lumbar stenosis: a preliminary report J Bone Joint Surg Br 1980. 62:(1):7882. [PubMed].
Powell MC, Wilson M, Szypryt P, et al. Prevalence of lumbar disk degeneration observed by magnetic resonance in symptomless women Lancet 1986 Dec 13. 2:(8520):13667.
Prasartritha T, Suntisathaporn N, Vathana P, et al. The size of the vertebral canal and the significance of epidural fat in lumbar spinal stenosis J Med Assoc Thai 1997 Apr. 80:(4):24756. [PubMed].
Prescher A. Anatomy and pathology of the aging spine Eur J Radiol 1998. 27:(3):18195. [PubMed].
Ramani PS. The relationship of the size of the lumbar spinal canal to the symptomatology of prolapsed lumbar intervertebral disks J Neurosurg Sci 1974 Oct-Dec. 18:(4):2237. [PubMed].
Ramani PS. Variations in size of the bony lumbar canal in patients with prolapse of lumbar intervertebral disks Clin Radiol 1976 Jul. 27:(3):3017. [PubMed].
Ramsbacher J, Schilling AM, Wolf KJ, et al. Magnetic resonance myelography (MRM) as a spinal examination technique Acta Neurochir (Wien) 1997. 139:(11):10804. [Free Full Text in PMC icon.Free Full text in PMC] [PubMed].
Rankine JJ, Gill KP, Hutchinson CE, et al. The therapeutic impact of lumbar spine MRI on patients with low back and leg pain Clin Radiol 1998. 53:(9):68893. [PubMed].
Ray CD. New techniques for decompression of lumbar spinal stenosis Neurosurgery 1982 May. 10:(5):58792. [PubMed].
Rompe JD, Eysel P, Hopf CH, et al. Surgical management of central lumbar spinal stenosis - results with decompressive laminectomy only and with concomitant instrumented fusion with the Cotrel-Dubousset-Instrumentation Neuro-Orthopedics 1995. 19:(1):1731.
Rosenberg NJ. Degenerative spondylolisthesis. Predisposing factors J Bone Joint Surg Am 1975 Jun. 57:(4):46774. [PubMed].
Rosenberg NJ. Degenerative spondylolisthesis surgical treatment. Clin Orthop. 1976; 117: 11220. [PubMed]
Rosenthal R. Meta-analytic procedures for social research. Rev ed. Newbury Park (CA): Sage Publications; 1991. 155 p. (Applied social research methods series; no. 6).
Rosenthal R, Rosnow RL. Essentials of behavioral research: methods and data analysis. New York: McGraw-Hill; 1991. 692 p.
Rowe ML. Low back pain in industry. J Occup Med. 1969; 11: 1619. [PubMed]
Sackett JF. Degenerative change of spinal column causing neurologic deficit. In: Textbook of diagnostic imaging. 2nd ed. Vol. 1. Philadelphia (PA): WB Saunders Company; 1994. p. 322-32.
Sacks H, Chalmers TC, Smith H. Randomized versus historical controls for clinical trials Am J Med 1982. 72:(2):23340. [PubMed].
Sandhu HS, Lakhanpal VP, Gupta SC. Incidence of lumbar spinal canal stenosis in cases of low back ache beyond the age of 35 years: radiographic study Indian J Orthop 1976. 10:(2):717.
Sato K, Kikuchi S. Clinical analysis of two-level compression of the cauda equina and the nerve roots in lumbar spinal canal stenosis Spine 1997 Aug 15. 22:(16):1898903; discussion 1904.
Satomi K, Hirabayashi K, Toyama Y, et al. A clinical study of degenerative spondylolisthesis. Radiographic analysis and choice of treatment Spine 1992 Nov. 17:(11):132936. [PubMed].
Scafuri RL, Weinstein JN. Lumbar spinal stenosis: a prospective review of evaluation and treatment utilizing a new 'back analysis form' Proc Inst Med Chic 1981 Jul-Sep. 34:(3):7982. [PubMed].
Schatzker J, Pennal GF. Spinal stenosis, a cause of cauda equina compression J Bone Joint Surg Br 1968 Aug. 50:(3):60618. [PubMed].
Schonstrom NS, Bolender NF, Spengler DM. The pathomorphology of spinal stenosis as seen on CT scans of the lumbar spine Spine 1985 Nov. 10:(9):80611. [PubMed].
Schrader PK, Grob D, Rahn BA, et al. Histology of the ligamentum flavum in patients with degenerative lumbar spinal stenosis Eur Spine J 1999 Aug. 8:(4):3238. [PubMed].
Shakil MS, Vaccaro AR, Albert TJ, et al. Efficacy of conservative treatment of lumbar spinal stenosis Semin Spine Surg 1999. 11:(3):22933.
Shapiro S. Medical realities of cauda equina syndrome secondary to lumbar disk herniation Spine 2000 Feb 1. 25:(3):34851; discussion 352.
Shapiro S, Cooper PR, Miller CA. Cauda equina syndrome secondary to lumbar disk herniation Neurosurgery 1993. 32:(5):7437. [PubMed].
Shekelle PG, Markovich M, Louie R. Comparing the costs between provider types of episodes of back pain care Spine 1995 Jan 15. 20:(2):2216; discussion 227.
Shenkin HA, Hash CJ. A new approach to the surgical treatment of lumbar spondylosis J Neurosurg 1976 Feb. 44:(2):14855. [PubMed].
Shiroky JB, Salanga VD, Ropos TJ. Spinal stenosis: making the diagnosis, offering relief [corrected] J Musculoskeletal Med 1996 Aug. 13:(8):468, 51-2, 57-8.
Silvers HR, Lewis PJ, Asch HL. Decompressive lumbar laminectomy for spinal stenosis J Neurosurg 1993 May. 78:(5):695701. [PubMed].
Simmons Jr ED, Simmons EH. Spinal stenosis with scoliosis Spine 1992. 17:(6 Suppl):S11720. [PubMed].
Simotas AC, Dorey FJ, Hansraj KK, et al. Nonoperative treatment for lumbar spinal stenosis. Clinical and outcome results and a 3-year survivorship analysis Spine 2000. 25:(2):197204. [PubMed].
Sinaki M, Lutness MP, Ilstrup DM, et al. Lumbar spondylolisthesis: retrospective comparison and three-year followup of two conservative treatment programs Arch Phys Med Rehabil 1989 Aug. 70:(8):5948. [PubMed].
Spengler DM. Current trends in surgery for patients with lumbar spinal stenosis of the degenerative variety. In: Lumbar Spinal Stenosis. Hagerstown (MD): Lippincott Williams & Wilkins; 2000. p. 233-40.
Spivak JM. Degenerative lumbar spinal stenosis J Bone Joint Surg Am 1998 Jul. 80:(7):105366. [PubMed].
Splithoff CA. Lumbosacral junction. Roentgenographic comparison of patients with and without backaches. JAMA. 1953; 152: 16103.
Stanley D, McLaren MI, Euinton HA, et al. A prospective study of nerve root infiltration in the diagnosis of sciatica. A comparison with radiculography, computed tomography, and operative findings Spine 1990 Jun. 15:(6):5403. [PubMed].
Stockley I, Getty CJ, Dixon AK, et al. Lumbar lateral canal entrapment: clinical, radiculographic and computed tomographic findings Clin Radiol 1988 Mar. 39:(2):1449. [PubMed].
Surin V, Hedelin E, Smith L. Degenerative lumbar spinal stenosis: results of operative treatment. Acta Orthop Scand. 1982; 53: 7985. [PubMed]
Swezey RL. Outcomes for lumbar stenosis: a 5-year followup study J Clin Rheumatol 1996. 2:(3):12934.
Symmons DP, van Hemert AM, Vandenbroucke JP, et al. A longitudinal study of back pain and radiological changes in the lumbar spines of middle aged women. II. Radiographic findings Ann Rheum Dis 1991 Mar. 50:(3):1626. [PubMed] [Free Full Text in PMC icon.Free Full text in PMC].
Tait WF, Charlesworth D, Lemon JG. Atypical claudication Br J Surg 1985 Apr. 72:(4):3156. [PubMed].
Tajima N, Fukazawa T, Ishio M, et al. The ligamentum flavum in lumbar spinal canal stenosis Acta Med Nagasaki 1980. 25:(1-4):1628.
Tallroth K, Ylikoski M, Landtman M, et al. Reliability of radiographical measurements of spondylolisthesis and extension-flexion radiographs of the lumbar spine Eur J Radiol 1994 Aug. 18:(3):22731. [PubMed].
Tan JC, Roux EB, Dunand J, et al. Role of physical therapy in the management of common low back pain Baillieres Clin Rheumatol 1992 Oct. 6:(3):62955. [PubMed].
Tatarek, Nancy Elizabeth. Changing views of health: spinal stenosis and its impact on well-being in skeletal biology. Diss Abstr Int 60-08 (A):2999.
Tenhula J, Lenke LG, Bridwell KH, et al. Prospective functional evaluation of the surgical treatment of neurogenic claudication in patients with lumbar spinal stenosis J Spinal Disord 2000. 13:(4):27682. [PubMed].
Terenzi TJ, Beadle E, Muller D, et al. Doppler ultrasound diastolic flow analysis for the early identification of peripheral arterial disease J Manipulative Physiol Ther 1992 Jun. 15:(5):28692. [PubMed].
Thomas NW, Rea GL, Pikul BK, et al. Quantitative outcome and radiographic comparisons between laminectomy and laminotomy in the treatment of acquired lumbar stenosis Neurosurgery 1997 Sep. 41:(3):56774; discussion 574-5. [PubMed].
Thomsen K, Christensen FB, Eiskjaer SP, et al. 1997 Volvo Award winner in clinical studies. The effect of pedicle screw instrumentation on functional outcome and fusion rates in posterolateral lumbar spinal fusion: a prospective, randomized clinical study Spine 1997 Dec 15. 22:(24):281322.
Torgerson WS. Theory and methods of scaling. New York: John Wiley & Sons, Inc.; 1958. 460 p.
Torgerson WR, Dotter WE. Comparative roentgenographic study of the asymptomatic and symptomatic lumbar spine J Bone Joint Surg Am 1976 Sep. 58:(6):8503. [PubMed].
Turner JA, Ersek M, Herron L, et al. Surgery for lumbar spinal stenosis. Attempted meta-analysis of the literature Spine 1992 Jan. 17:(1):18. [PubMed].
Twomey L, Taylor J. Age changes in the lumbar spinal and intervertebral canals Paraplegia 1988 Aug. 26:(4):23849. [PubMed].
Uden A, Johnsson KE, Jonsson K, et al. Myelography in the elderly and the diagnosis of spinal stenosis Spine 1985 Mar. 10:(2):1714. [PubMed].
Verbiest H. Results of surgical treatment of idiopathic developmental stenosis of the lumbar vertebral canal: a review of twenty-seven years' experience. J Bone Joint Surg Br. 1977; 59: 1818. [PubMed]
Vergauwen S, Parizel PM, van Breusegem L, et al. Distribution and incidence of degenerative spine changes in patients with a lumbo-sacral transitional vertebra Eur Spine J 1997. 6:(3):16872. [PubMed].
Videman T, Battie MC, Gill K, et al. Magnetic resonance imaging findings and their relationships in the thoracic and lumbar spine: insights into the etiopathogenesis of spinal degeneration Spine 1995. 20:(8):92835. [PubMed].
Virta L, Ronnemaa T. The association of mild-moderate isthmic lumbar spondylolisthesis and low back pain in middle-aged patients is weak and it only occurs in women Spine 1993 Sep 1. 18:(11):1496503.
Virta L, Ronnemaa T, Laakso M. Prevalence of isthmic lumbar spondylolisthesis in nondiabetic subjects and NIDDM patients Diabetes Care 1994 Jun. 17:(6):5924. [PubMed].
Visarius H. Concept and clinical aspects of computer assisted spin surgery. In: Lumbar Spinal Stenosis. Hagerstown (MD): Lippincott Williams & Wilkins; 2000. p. 345-53.
Waddell G. Clinical diagnosis of leg pain and nerve root involvement in low back disorders Acta Orthop Belg 1987. 53:(2):1525. [PubMed].
Weiner BK, Walker M, Brower RS. Microdecompression for lumbar spinal canal stenosis Spine 1999. 24:(21):22682272. [PubMed].
Widelec J, Bacq C, Peetrons P. Conventional x-rays and computed tomographic scan in lumbar spinal stenosis. In: Lumbar Spinal Stenosis. Hagerstown (MD): Lippincott Williams & Wilkins; 1999. p. 91-6.
Wiesel SW, Tsourmas N, Feffer HL, et al. A study of computer-assisted tomography. I. The incidence of positive CAT scans in an asymptomatic group of patients Spine 1984 Sep. 9:(6):54951. [PubMed].
Willen J, Danielson B, Gaulitz A, et al. Dynamic effects on the lumbar spinal canal: Axially loaded CT- myelography and MRI in patients with sciatica and/or neurogenic claudication Spine 1997. 22:(24):296876. [PubMed].
Wilmink JT. Diagnostic imaging in lumbar spinal stenosis: dynamic myelography, computed tomographic myelography, magnetic resonance imaging, and magnetic resonance myelography. In: Lumbar spinal stenosis. Hagerstown (MD): Lippincott Williams & Wilkins; 2000. p. 97-107.
Wiltse LL. Surgery for intervertebral disk disease of the lumbar spine Clin Orthop 1977 Nov-Dec(129):2245.
Winston K, Rumbaugh C, Colucci V. The vertebral canals in lumbar disk disease Spine 1984 May-Jun. 9:(4):4147. [PubMed].
Wisneski RJ, Farmer JC. Decompressive surgery for lumbar spinal stenosis Semin Spine Surg 1994. 6:(2):11623.
Wood KB, Popp CA, Transfeldt EE, et al. Radiographic evaluation of instability in spondylolisthesis Spine 1994 Aug 1. 19:(15):1697703.
Woolsey RM. Lumbar spinal stenosis Semin Neurol 1986 Dec. 6:(4):3859. [PubMed].
Yone K, Sakou T, Kawauchi Y, et al. Indication of fusion for lumbar spinal stenosis in elderly patients and its significance Spine 1996 Jan 15. 21:(2):2428.
Yoshida M, Shima K, Taniguchi Y, et al. Hypertrophied ligamentum flavum in lumbar spinal canal stenosis. Pathogenesis and morphologic and immunohistochemical observation Spine 1992 Nov. 17:(11):135360. [PubMed].
Young S, Veerapen R, O'Laoire SA. Relief of lumbar canal stenosis using multilevel subarticular fenestrations as an alternative to wide laminectomy: preliminary report Neurosurgery 1988 Nov. 23:(5):62833. [PubMed].
Yuan HA, Garfin SR, Dickman CA, et al. A Historical Cohort Study of Pedicle Screw Fixation in Thoracic, Lumbar, and Sacral Spinal Fusions Spine 1994 Oct 15. 19:(20 Suppl):2279S2296S.
Zdeblick TA. The treatment of degenerative lumbar disorders. A critical review of the literature Spine 1995 Dec 15. 20:(24 Suppl):126S137S.
Zwarts MJ. Magnetic stimulation of the peripheral nervous system: local versus generalized disorders Electromyogr Clin Neurophysiol 1998 Jul-Aug. 38:(5):30916. [PubMed].
Footnotes
*

See Volume 2 of this report for Evidence Tables 1-25.

Help ǀ Contact Bookshelf
AHRQ Evidence Reports
(navigation arrows) Go to previous chapter Go to next chapter Go to top of this page Go to bottom of this page Go to Table of Contents