Figure 1. Osteoporosis diagnosis and monitoring analytic framework
The Agency for Healthcare Research and Quality (AHRQ), through its Evidence-Based Practice Centers (EPCs), sponsors the development of evidence reports and technology assessments to assist public- and private-sector organizations in their efforts to improve the quality of health care in the United States. The reports and assessments provide organizations with comprehensive, science-based information on common, costly medical conditions and new health care technologies. The EPCs systematically review the relevant scientific literature on topics assigned to them by AHRQ and conduct additional analyses when appropriate prior to developing their reports and assessments.
To bring the broadest range of experts into the development of evidence reports and health technology assessments, AHRQ encourages the EPCs to form partnerships and enter into collaborations with other medical and research organizations. The EPCs work with these partner organizations to ensure that the evidence reports and technology assessments they produce will become building blocks for health care quality improvement projects throughout the Nation. The reports undergo peer review prior to their release.
AHRQ expects that the EPC evidence reports and technology assessments will inform individual health plans, providers, and purchasers as well as the health care system as a whole by providing important information to help improve health care quality.
We welcome written comments on this evidence report. They may be sent to: Acting Director, Center for Practice and Technoloy Assessment, Agency for Healthcare Research and Quality, 6010 Executive Blvd., Suite 300, Rockville, MD 20852.
| John M. Eisenberg, M.D. | Director, Center for Practice and |
| Director, Agency for Healthcare | Technology Assessment |
| Research and Quality |
| The authors of this report are responsible for its content. Statements in the report should not be construed as endorsement by the Agency for Healthcare Research and Quality or the U.S. Department of Health and Human Services of a particular drug, device, test, treatment, or other clinical service. |
Robert Graham, M.D.
We thank the following members of the Oregon Health & Science University
(OHSU) Evidence-based Practice Center staff for their contributions to this
project:
Piper Hackett, B.S.
Kathryn Pyle Krages,
A.M.L.S., M.A.
Alix E. Seif, M.A.
Patty Davies, M.S.
Patty Davies, M.S.Gary Miranda, M.A.
Connie Levesque, B.S.
Cynthia Davis-O'Reilly
Benjamin Chan, M.S.
Jeani Crichlow,
B.S.
Lynne Schwabe
I'eisha Parker
Linda
Slattery
Katy Lesowski
This report examines the evidence on the effectiveness of various strategies for diagnosing and monitoring postmenopausal women with osteoporosis. Specifically, it addresses: (1) the role of risk factors in identifying high-risk women and guiding their initial treatment, (2) the advantages and disadvantages of various techniques for bone measurement in predicting risk of hip or spine fracture, (3) the effectiveness of bone measurement tests for monitoring response to treatment and for guiding treatment change, (4) the role of markers of bone turnover in diagnosis and treatment management, (5) the evaluation of patients with osteoporosis for secondary causes, and (6) the costs and cost-effectiveness of various diagnostic strategies for osteoporosis.
The authors conducted a MEDLINE search (covering the years 1966 to 2000), supplemented by searches of HealthSTAR (covering 1975 to 2000) of papers published in English, reviewed reference lists of review articles, and sought guidance from local and national experts.
The authors included abstracts relevant to one or more topic areas that had original data about postmenopausal women and osteoporosis. Two reviewers read each abstract to determine its eligibility. Articles were excluded if they did not provide sufficient information to determine the methods for selecting subjects and for analyzing data. For some topics, additional eligibility criteria were applied. For all topics combined, the authors retrieved 10,174 citations. After reviewing these citations for possible relevance, 530 articles about risk factors, 123 about bone measurement testing, 23 about monitoring, 277 about biochemical markers, and 53 about costs were selected for further review. An additional 242 studies were retrieved after reviewing the reference lists of studies and/or by suggestion of others. The search yielded no papers with data for the secondary causes topic.
From full-text published studies of fracture or bone density prediction or bone measurement methods, the authors extracted selected information about the patient population, interventions, clinical endpoints, study design, and study quality, and used this information to construct evidence tables. Additional reviews assessed the internal validity of studies of risk factors and the diagnostic performance of bone measurement tests and biochemical markers, summarized recommendations for testing for secondary causes of osteoporosis, reviewed studies about cost and cost-effectiveness, and compared diagnostic strategies.
Epidemiologic studies report clinical risk factors for osteoporosis and fractures, but few studies evaluate how to use them to identify individual women at risk for fracture, and no studies provide evidence that treatment decisions based on clinical risk factors lead to better or worse fracture outcomes than those based on bone measurement tests. Because of differences between bone measurement techniques, and because individuals have different rates of bone loss at different sites, no one test can exclude osteoporosis at the most important fracture sites -- hip, spine, and wrist. Dual-energy X-ray absorptiometry (DXA) of the femoral neck is the best validated test to predict hip fracture. Other techniques predict hip fracture less accurately or have not been evaluated in prospective studies.
Recent results from clinical trials raise questions about the value of repeated, annual densitometry tests for patients on therapy to prevent osteoporosis or bone loss; moreover, there is no evidence from clinical trials that adjusting therapy based on serial densitometry at any interval improve outcomes. Markers of bone turnover correlate poorly with bone measurement tests and are not good predictors of fractures.
Cost and cost-effectiveness studies, which are based solely on economic models, suggest targeting treatment to women with the lowest bone density and including a risk factor score or less expensive (and more widely available) technology to determine which women should receive hip DXA. The authors' supplementary analysis on cost-effectiveness favors a sequential strategy of quantitative ultrasound at the heel followed by densitometry of those identified by ultrasound as high risk over densitometry alone. In high-risk populations, ultrasound alone may also be cost-effective.
Application of the results from epidemiologic studies to diagnosis and monitoring strategies for individual patients in the clinical setting is currently based on extrapolation from models or, for most questions addressed in this review, is simply lacking. To be more useful for clinicians and patients, future research should focus on the application of these data to the clinical setting.
At an international consensus development conference, osteoporosis was defined as "a systemic skeletal disease characterized by low bone mass and microarchitectural deterioration of bone tissue, leading to enhanced bone fragility and a consequent increase in fracture risk." In 1994, a World Health Organization (WHO) working group proposed that, in epidemiologic studies, osteoporosis could be determined when bone density at the hip, spine, or forearm is 2.5 standard deviations or more below the mean for healthy, young, adult women (a value defined as the T-score), or when a history of a fracture is present in the absence of trauma. The group also proposed that osteopenia be diagnosed when the bone density was 1.0 to 2.5 standard deviations below the mean for young, healthy women.
According to the National Health and Nutrition Examination Survey (NHANES III), an estimated 14 million American women over age 50 years are affected by low bone density at the hip, and 5 million more have bone density that measures -2.5 standard deviations or more below the mean at the hip. The prevalence of osteoporosis in Mexican-American women is similar to that in white women, while rates in black women are approximately half that of the first two groups. The prevalence of osteoporosis increases with age for all sites, and -- by the WHO definition -- up to 70 percent of women over age 80 years have osteoporosis. Furthermore, age is an important factor in the relationship between bone density and the absolute risk of fracture. An increase in age of 13 years increases the risk of hip fracture by the same amount as a decrease in bone density of one standard deviation. Older women have a much higher fracture rate than younger women who have the same bone density, because of increasing risk from other factors, such as a change in bone quality and the tendency to fall.
Women with osteoporosis are more likely to experience fractures. Demographic trends for hip fracture parallel those for osteoporosis. Hip-fracture incidence in white women rises from 50 per 100,000 at age 50 years to 237 per 100,000 at age 65 years. White women are generally two to three times more likely than nonwhite women to suffer a hip fracture. Hip fractures are associated with high rates of mortality and loss of independence. Wrist fracture incidence tends to increase at earlier ages than does that of hip fractures.
Vertebral fractures have also been associated with significant morbidity. Sixteen percent of postmenopausal women have osteoporosis of the lumbar spine; furthermore, five percent of 50-year-old white women and 25 percent of 80-year-old women have had at least one vertebral fracture. Vertebral fractures can cause severe pain and are associated with more than five million days of restricted activity in those age 45 years or older.
The disease burden of osteoporosis extends beyond consequences of low bone density and fractures. For example, the act of screening, diagnosis, and subsequent treatment can also affect the quality of life. Fear of fracture itself can reduce the quality of life in women who have been diagnosed as having osteoporosis.
In 1995, the total direct medical expenditure in the United States for the treatment of osteoporotic fractures in adults older than 45 years was estimated at $13.8 billion. The majority of this total ($8.6 billion) was spent for inpatient care. Hip fracture alone accounted for $8.7 billion (63 percent) of osteoporosis-related costs, while fractures at sites other than the hip accounted for approximately 37 percent of the total expenditure (about $5.1 billion). In addition to these costs is the cost of lost productivity for women with fractures, or for their family or other caregivers. As the median age of the U.S. population increases, the costs associated with osteoporotic fractures are also likely to increase.
This evidence report describes the effectiveness of various strategies for
diagnosing and monitoring postmenopausal women with osteoporosis, as
represented by six topic areas:
Risk Factors. What is the role of clinical risk factors, in conjunction with bone measurement tests, in identifying high-risk women and guiding initial treatment decisions?
Bone Measurement Tests. What are the advantages and disadvantages of various bone measurement tests at different anatomic sites for identifying women at high risk of fracture?
Monitoring. Are bone measurement tests effective for monitoring response to treatment and for guiding decisions about changes in management?
Biochemical Markers. What is the role of markers of bone turnover for identifying women at risk of bone loss, guiding initial treatment decisions, or monitoring response to therapy?
Evaluation for Secondary Causes. What diagnostic or laboratory tests are appropriate for evaluating patients with osteoporosis for secondary causes?
Cost. Assuming consistent treatment approaches, what are the costs and cost-effectiveness of various diagnostic strategies for identifying women with osteoporosis?
These topic areas do not include the effectiveness of dietary, lifestyle, hormonal, and medical interventions for primary prevention or treatment of osteoporosis. This report is confined to diagnostic and monitoring strategies as they apply to individual women, and does not include issues regarding mass screening in the general population. Also, while most of the literature addressing these topic areas is aimed at an audience of clinical researchers who specialize in osteoporosis, we have attempted to assess the research findings from the perspectives of clinicians and patients. However, it is not the purpose of this report to propose practice recommendations.
With input from local and national experts, we developed an analytic framework and key questions in each of the six topic areas. Relevant studies were identified from multiple searches of MEDLINE (for the years 1966 to 2000) and HealthSTAR (for the years 1975 to 2000), from the reference lists of systematic review articles, and from national experts. All searches were limited to publications in the English language. The authors excluded articles if they did not provide sufficient information to determine the methods for selecting subjects and for analyzing data.
Out of 10,174 citations retrieved for all topics combined, the authors selected as possibly relevant 530 articles about risk factors, 123 about bone measurement tests, 23 about measurement tests for bone monitoring, 277 about biochemical markers, and 53 about costs. An additional 242 studies were retrieved after reviewing reference lists of studies and by suggestion of the expert panel or leading researchers in the field. The search yielded no papers with data for the secondary causes topic, but this topic was addressed by reviewing published guidelines and by a supplemental analysis of physician practice patterns for the evaluation of secondary causes of osteoporosis. For articles that were included, the authors applied criteria proposed by the U.S. Preventive Services Task Force to rate the quality of individual studies. A second supplemental analysis investigated the cost-effectiveness of various diagnostic strategies.
After a review of the relevant articles, the findings about the risk
factors that predict bone density, bone loss, and fractures are as follows:
Factors that are consistently associated with increased risks of low bone density and fractures in postmenopausal women include increasing age, white race, low weight or weight loss, nonuse of estrogen replacement, history of previous fracture, family history of fracture, history of falls, and low scores on one or more measures of physical activity or function.
Other factors are less consistent predictors across studies, but also have statistically significant associations with low bone density and fractures. These include smoking, alcohol use, caffeine use, low calcium and vitamin D intake, and use of certain drugs.
Predictors of low bone density are similar to those for fracture, except for those factors related to physical function and falls.
Some clinical risk factors, especially those related to physical function and falls, are as powerful as bone density in the prediction of hip fracture.
Women with multiple risk factors and low bone density have an especially high risk of hip fracture.
Most of the strongest risk factors are consistently related to outcomes, regardless of the racial and ethnic population, although there are few studies of nonwhite women.
A second set of findings about risk factors concerned the accuracy of
methodologies (also called instruments or tools) for assessing risk
factors in identifying women at risk of fracture. These included:
In contrast to the extensive research about determining clinical risk factors for osteoporosis and fractures, there are fewer studies available that evaluate how to use these risk factors to identify individual women at risk for fracture.
Methologies designed to assess risk of low bone density or fractures generally have low to moderate sensitivity and specificity. Those that perform well when originally tested either have performed less well in other populations or have not yet been widely tested. Some methodologies -- especially those developed in large community populations and containing variables known to be strong predictors -- may ultimately be applicable to the clinical setting once they are tested there.
A third set of findings, exploring whether risk factors are useful in
treatment decisions, included:
The authors did not identify any studies that examined whether treatment decisions based on clinical risk factors lead to better or worse health outcomes than those based on bone measurement tests or a combination of bone tests and risk factors.
Regarding bone measurement tests, the major findings related to the
capacity of different bone measurement tests at different sites to
predict fractures included:
Among different bone measurement tests at various sites, bone density measured at the femoral neck by dual energy x-ray absorptiometry (DXA) is the best predictor of hip fracture and is comparable to forearm measurements for predicting fracture at other sites.
In recent prospective studies, quantitative ultrasound (QUS) measured at the heel predicted hip fracture and all nonspine fractures as well or nearly as well as DXA measured at the femoral neck. For each of these tests, a result in the osteoporotic range is independently associated with an increased short-term probability of hip fracture. Individuals who have low scores by one of these tests, but not the other, have a higher risk of fracture than those who have higher scores by both tests, and a lower risk of fracture than those whose results on both tests are low.
While other peripheral measures may approach QUS in predicting hip fracture, there are no recent prospective studies that directly compare prediction of hip fracture of these tests with DXA of the hip. Radiographic absorptiometry (RA) or quantitative microdensitometry (QMD) of the hand can predict the risk of nonspine fractures in general, many of which are in the forearm, but there are no recent data about the ability of hand measurement to predict hip fracture.
Correlations between different bone measurement tests are generally too low to be accepted as evidence that one test will identify patients at similar risk to those identified by another test.
Major findings on identification of the factors related to bone testing
that influence diagnosis included:
The likelihood of being diagnosed with osteoporosis varies greatly, depending on the site and type of the bone measurement test, on the brand of densitometer, and on the relevance of the reference range to the local population.
The likelihood of being diagnosed as having osteoporosis also depends on the number of sites tested. Testing in the forearm, hip, spine, or heel will generally identify different groups. A physician cannot say, based only on one of these tests, that the patient "does not have osteoporosis."
The results of bone measurement tests are often inaccurately reported to patients.
Major findings related to how bone measurement test results affect
patients' and physicians' decisions and actions included:
One randomized trial suggests that women who undergo densitometry are more likely to start hormone replacement therapy than women who do not.
In a randomized trial and a large, uncontrolled case series, women who had densitometry and were told they had osteoporosis were more likely to start or continue hormone replacement therapy than women who were told they had normal bone density.
In one randomized trial, physicians found densitometry reports confusing and were not confident that their interpretations of T-scores were correct.
The major findings regarding whether bone measurement tests are effective
for monitoring response to therapy and for guiding decisions about
changes in management included:
The weight of the evidence is currently against repeating bone density tests within the first year of treatment. There is insufficient evidence to determine whether repeating bone density tests 2 years after starting therapy is useful.
There are also no studies about the effect of either monitoring responses to therapy using densitometry or the choice of test on the outcome of therapy.
Findings regarding whether biochemical markers can be used instead of
bone measurement tests in identifying women at risk for osteoporosis included:
No single marker or cluster of markers accurately predicted the results of densitometry in individuals. Densitometry measures current bone status, whereas markers measure the process of bone turnover.
Major findings as to how well biochemical markers predict fracture included:
No marker was associated with increased fracture risk consistently across all studies. One study provides evidence that using markers in conjunction with densitometry may increase predictability, but this result has not been otherwise confirmed.
In addition, major findings as to whether markers can help select
patients for treatment included:
Studies correlating marker results and bone loss indicated no clear trend. Furthermore, sensitivity and specificity of markers were too low to be useful for the purpose of selecting patients for treatment.
Some studies found better test accuracy when a combination of two or more markers and/or other risk factors was used to predict bone loss.
The report also investigated what is known about the adverse effects of
using markers to identify women at risk for osteoporosis. Major findings
for this issue included:
The primary adverse effect of biochemical markers is the potential for false-positive and false-negative results. Rates of false-positive and false-negative test results vary widely among the biochemical markers. (If markers are used to select women for treatment, a false-positive test could lead to the initiation of unnecessary treatment, while a false-negative result could lead to lack of needed treatment.)
Major findings regarding whether markers can predict a patient's response
to therapy included:
There is a small correlation between response to therapy as measured by densitometry and marker results, but no marker is accurate enough to reliably identify those individuals who will fail to respond to treatment.
Major findings from a literature review, assessment of published
guidelines, and a secondary analysis regarding which diagnostic or
laboratory tests are appropriate for evaluating patients with
osteoporosis for secondary causes included:
There is no evidence from controlled trials on which to base recommendations for a strategy of testing to determine secondary causes of osteoporosis.
Some guidelines and experts support extensive testing to rule out major concomitant disease, while others suggest a limited testing strategy based on findings in the history and physical examination. Because the diagnosis of primary osteoporosis is often seen as a diagnosis of exclusion, the pattern of diagnostic testing may continue to be costly until the diagnostic yield is fully demonstrated.
Assumptions about the probability of a secondary disease or disorder to explain the occurrence of osteoporosis vary by practice type and specialty.
Thyroid stimulating hormone measurement, chemistry battery, and complete blood count were the most frequently ordered tests to rule out secondary causes of osteoporosis cited by respondents in our supplemental analysis. These were also the most frequently recommended tests in our review of expert guidelines.
The report's findings regarding the costs and cost-effectiveness of
diagnostic strategies for identifying women with osteoporosis included:
Published economic assessments suggested that diagnosis and treatment of women at risk for osteoporosis would be more cost-effective by targeting treatment to those with the lowest bone measurement results. Inclusion of another assessment, such as a risk profile or additional bone measurement test, prior to DXA may improve the cost-effectiveness of diagnosis.
Using data from two large studies, the authors conducted cost-effectiveness analyses to estimate cost per hip fracture prevented. These analyses suggested that a sequential diagnostic approach may be more cost-effective than DXA alone. The sequential approach that the authors considered was QUS of the heel followed by DXA of the femoral neck only for those with low values of QUS/BUA. A range of QUS/BUA measures was used because there are no established cut points that separate high risk from normal risk. The authors used QUS as an example of a less expensive and more widely available diagnostic approach than DXA.
Diagnosis with DXA of the femoral neck alone prevented more fractures, in most cases, but at a higher cost per hip fracture prevented compared with the sequential approach.
All current treatment trials have inclusion criteria based on diagnosis with DXA. In sensitivity analyses, if treatment efficacy following diagnosis with QUS (using QUS/BUA cut points between 65 and 75 dB/mHz) were 5%-15% less than that following diagnosis using DXA, diagnosis with QUS alone would have higher costs to prevent fewer hip fractures than other diagnostic options.
Much of the evidence for the diagnosis and monitoring strategies for osteoporosis comes from epidemiologic studies. To be more useful for clinicians and patients, future research should focus on the application of these data to the clinical setting and include a wider diversity of patient populations. Tools for assessing risk factors should be tested in prospective studies to determine if their use can correctly stratify women by risk factors, influence treatment decisions, and ultimately reduce fracture outcomes.
Clinical trials should be conducted to determine if identifying and reducing modifiable risk factors influences fracture outcomes. Addressing some of these modifiable risk factors -- such as by supplementation with calcium and vitamin D -- has already demonstrated effects on fracture risk after intervention. Examples of additional interventions to test include smoking cessation, correcting visual loss, and improving physical function. Randomized, controlled trials of treatments for osteoporosis should be done to test the hypothesis that overall fracture risk, rather than bone measurement results alone, determines the likelihood that a patient will benefit from therapy. Selection of patients for trials should focus on groups of patients who are at high risk of fracture based on clinical risk factors and who have relatively normal bone measurement results. Trials should also address whether patients who are identified to have bone loss demonstrated by different techniques at different sites demonstrate a similar benefit to those identified solely by hip DXA measurements.
Additional research is needed to examine the quality of information provided to patients who undergo various bone measurement tests, as well as to identify other patient education and information needs.
Future research should examine the clinical utility of the WHO working
group's criterion for diagnosing osteoporosis, specifically:
To determine whether the diagnosis of osteoporosis provides benefits to patients above that provided by predicting their risk of fractures;
To assess the added value of obtaining bone measurement tests at more than one site, and the usefulness of using T-scores for different sites; and
To define the prognosis and treated course in patients who are diagnosed to have osteoporosis by a wrist or spine measure, but who are not diagnosed to have osteoporosis by a hip measure.
Studies that use bone measurement tests for monitoring response to therapy could compare fracture outcomes in a group of patients who had tailored therapy based on test results versus a group in whom changes in therapy, if any, were guided by the history alone. A study could also record how often test results in patients on therapy led to a change in therapy or improved compliance, to establish the mechanism by which monitoring leads to improved outcomes. By comparing patients, such a study could establish that monitoring changes in test results can reliably predict fracture risk in individual patients by distinguishing an inadequate response to therapy from an adequate response or poor adherence, and that monitoring changes in therapy made because of an inadequate test response can reduce the rate of fractures.
Prospective studies of biochemical markers should define, apply, and evaluate criteria for using marker results in clinical decision making.
Determining the utility of screening for secondary disorders by use of common laboratory tests requires studies of the frequency of abnormal baseline laboratory test results in large cohorts or in treatment trials of osteoporosis. Clinical follow-up of these subjects would provide data on bone measurement test results or fracture outcomes.
Studies to formally account for the adverse quality-of-life impact of treatment and treatment side effects are needed to more accurately determine the balance of risks and benefits of the therapy options for patients.
Three areas for additional cost-effectiveness research include:
Identifying a scientifically appropriate cut point for QUS/BUA that can be used in either a sequential diagnostic approach or for diagnosis with QUS alone.
Performing additional cost-effectiveness analyses using data from other large, population-based cohorts with various cut points.
Conducting a more detailed, societal-perspective, cost-effectiveness analysis that would address some of the deficiencies in the authors' analysis.
If these findings can be demonstrated in one or two other populations, and in a more complete economic evaluation, a randomized, controlled trial of diagnostic approaches would be a useful next step. Alternatively, an observational-design study with randomization of groups of patients to diagnostic or monitoring procedures could be conducted.
This evidence report describes the effectiveness of various strategies for diagnosing and monitoring postmenopausal women with osteoporosis, as specified in the following six topic areas provided to the investigators by the Agency for Healthcare Research and Quality (AHRQ):
Risk Factors. What is the role of clinical risk factors, in conjunction with bone measurement tests, in identifying high-risk women and guiding initial treatment decisions?
Bone Measurement Tests. What are the advantages and disadvantages of various bone measurement tests at different anatomic sites for identifying women at high risk of fracture?
Monitoring. Are bone measurement tests effective for monitoring response to treatment and for guiding decisions about changes in management?
Biochemical Markers. What is the role of markers of bone turnover for identifying women at risk of bone loss, guiding initial treatment decisions, or monitoring response to therapy?
Evaluation for Secondary Causes of Osteoporosis. What diagnostic or laboratory tests are appropriate for evaluating patients with osteoporosis for secondary causes?
Cost. Assuming consistent treatment approaches, what are the costs and cost-effectiveness of various diagnostic strategies for identifying women with osteoporosis?
These topic areas do not include the effectiveness of dietary, lifestyle, hormonal, and medical interventions for primary prevention or treatment of osteoporosis. This report is confined to diagnostic and monitoring strategies as they apply to individual women and does not include issues regarding mass screening in the general population. Also, while most of the literature addressing these topic areas is aimed at an audience of clinical researchers who specialize in osteoporosis, we have attempted to assess the research findings from the perspectives of clinicians and patients. However, it is not the purpose of this report to propose practice recommendations.
The term "osteoporosis" describes both a process of decreasing bone density as well as the clinical outcome of fracture. As a result, several definitions of osteoporosis have been offered. In 1991, for example, an international consensus development conference sponsored by three organizations (the National Institute of Arthritis and Musculoskeletal Disease, the National Osteoporosis Foundation, and the European Foundation for Osteoporosis and Bone Disease) defined osteoporosis as "a systemic skeletal disease characterized by low bone mass and microarchitectural deterioration of bone tissue, leading to enhanced bone fragility and a consequent increase in fracture risk."1 This definition emphasizes that, in addition to bone mass, the structure of bone also is an important factor in the mechanism of fractures. In 1993, another international consensus development conference, sponsored by the same organizations, endorsed this definition.2 That conference concluded that bone measurements should be used to diagnose osteoporosis, but it did not specify what available tests or sites should be used or how specific results could be used to make decisions for individual patients.
In 1994, a World Health Organization (WHO) working group made a distinction between the definition and the diagnosis of osteoporosis.3 The WHO working group noted that there was a wide range in estimates of the prevalence of osteoporosis (2.5 to 95 percent), depending on what value of bone density was classified as abnormal. While it endorsed the earlier definition of osteoporosis, the working group proposed that, in epidemiologic studies, osteoporosis should be diagnosed when bone mineral density (BMD) is 2.5 standard deviations below the mean for healthy young adult women at the spine, hip, or wrist, or when a history of an atraumatic fracture is present.4
The number of standard deviation units above or below the young healthy mean is called the "T-score." A "Z-score" is the number of standard deviation units above or below the mean for one's own age group. The WHO working group chose a T-score of −2.5 or less as the criterion, noting that it would classify 30 percent of postmenopausal women as having osteoporosis -- twice as many than if the diagnosis were based on low femoral bone density alone. The working group also proposed that low bone mass or "osteopenia" be diagnosed when bone density was 1.0 to 2.5 standard deviations below the young healthy mean (−2.5 standard deviations <T-score <−1.0 standard deviation). These diagnostic criteria have been incorporated into bone density reports and in the inclusion criteria for recent randomized controlled trials of therapies for osteoporosis. Although they were not intended for use as a clinical treatment threshold, they are being used as such -- much as blood pressure, glucose, and cholesterol measurements are used in diagnosing hypertension, diabetes, and hypercholesterolemia.
Some experts criticize the working group's approach to diagnosis because it ignores the quality of bone and the rate of bone loss.5, 6 Many experts suggest that the overall risk of fracture -- not the T-score -- should be used to make decisions in the individual patient,7-9 and that T-scores are confusing and may be misinterpreted by physicians and patients.10,11
To exclude osteoporosis by the WHO criterion, the T-scores for tests at the wrist, hip, and spine must be above -2.5. In practice, however, a patient may be told that she does not have osteoporosis based on a result at just one site, even though this may miss osteoporosis at other anatomic sites. For example, in a prospective study, 392 women who volunteered for a worksite osteoporosis awareness program had dual-energy X-ray absorptiometry (DXA) of the hip and spine and peripheral DXA of the wrist.12 A total of 100 women (26 percent) had osteoporosis by the WHO criterion. Nine of these 100 women had a T-score below -2.5 at all three sites, and 65 had a T-score below -2.5 at two of the three sites.
Because of the problems with the WHO definition, several participants in the WHO working group now recommend that the diagnosis of osteoporosis should be based only on the T-score obtained at the hip and measured by DXA.13 They propose that measurements at other sites and with other technologies may be useful for assessing risk for fracture, but they should not be used for diagnosis of osteoporosis.
Osteoporosis affects a large proportion of American women over the age of 50. Estimates of rates of prevalence depend on the instrument used to measure bone density and on the characteristics, including ethnicity, of the population studied.
The third National Health and Nutrition Examination Survey (NHANES III) reports prevalence rates for osteoporosis by race; these rates are adjusted for age and for census undercount estimates from 1990 and 1993. NHANES III is a 6-year cross-sectional study of noninstitutionalized civilians in the United States. All subjects over age 20 were eligible unless they had experienced fractures in both hips, and each eligible subject's left hip was scanned unless it had been fractured previously. The sample size was sufficiently large (n =3,311 women over age 50) and ethnically representative to justify extrapolation to the entire U.S. population. The young healthy mean was established by scanning 415 women age 20 to 29. All measurements were made using DXA of the proximal femur.14
| Race | Osteopenia (-2.5 <T-score <-1.0) | Osteoporosis (T-score <-2.5) | ||
|---|---|---|---|---|
| Prevalence (%)b | Millionsc | Prevalence (%)b | Millionsc | |
| All | 40 | 14 | 15 | 5 |
| NHW | 41 | 12 | 15 | 5 |
| NHB | 28 | 0.9 | 8 | 0.3 |
| MA | 38 | 0.3 | 16 | 0.1 |
Data taken from NHANESIII: Looker AC, Wahner LW, Dunn WL, et al.115
Age adjusted to 1980 U.S. Census
Undercount adjusted estimates from March 1990 and 1993.
DXA = dual-energy X-ray absorptiometry; NHW = non-Hispanic white; NHB = non-Hispanic black; MA = Mexican American; NHANESIII = National Health and Nutrition Examination Study III
| Age | Spine | Hip | Wrist | At spine, hip, or wrist |
| 50-59 | 7.6 | 3.9 | 3.7 | 14.8 |
| 60-69 | 11.8 | 8.0 | 11.8 | 21.6 |
| 70-79 | 25.0 | 24.5 | 23.1 | 38.5 |
| >80 | 32.0 | 47.5 | 50.0 | 70.0 |
| All | 16.5 | 16.2 | 17.4 | 30.3 |
Data taken from Melton LJ 3rd16
DPA = dual photon absorptiometry; SPA = single photon absorptiometry
| T-score | ||||||||
|---|---|---|---|---|---|---|---|---|
| Age | -5 | -4 | -3 | -2 | -1 | 0 | 1 | 2 |
| 50 yrs | 0.024 | 0.0095 | 0.0038 | 0.0015 | 0.0006 | 0.00023 | ||
| 60 yrs | 0.069 | 0.029 | 0.011 | 0.0047 | 0.0018 | 0.0007 | 0.00025 | |
| 70 yrs | 0.127 | 0.055 | 0.023 | 0.0096 | 0.0039 | 0.001 | ||
| 80 yrs | 0.35 | 0.2 | 0.09 | 0.042 | 0.018 | 0.007 | 0.0028 | |
| 90 yrs | 0.29 | 0.19 | 0.097 | 0.046 | 0.02 | 0.006 | ||
Data derived from decision model presented in the National Osteoporosis Foundation Report8
Women with osteoporosis are more likely to have fractures. In a population-based study conducted in Minnesota, for every one standard deviation reduction in bone density, the age-adjusted odds ratio for hip fracture was 2.40 (95 percent confidence interval [CI] 1.27-3.68). Spine and wrist sites had lower, yet significantly elevated, risks of fracture (spine 1.66 [1.18-2.33]; wrist 1.56 [1.20-2.03]).18
| Age | White | Nonwhite | RR b |
|---|---|---|---|
| 30-34 | 3.7 | 12.3 | 0.3 (0.08-0.99) |
| 35-39 | 3.2 | 9.6 | 0.3 (0.09-1.54) |
| 40-44 | 21.9 | 14.2 | 1.5 (0.54-4.70) |
| 45-49 | 33.9 | 10.6 | 3.2 (1.03-10.6) |
| 50-54 | 50.1 | 18.4 | 2.7 (1.05-6.70) |
| 55-59 | 88.9 | 31.4 | 2.8 (1.33-6.20) |
| 60-64 | 152.8 | 38.2 | 4.0 (1.27-12.4) |
| 65-69 | 237.2 | 101.5 | 2.3 (1.35-3.97) |
| 70-74 | 530.5 | 167.4 | 3.2 (1.92-4.99) |
| 75-79 | 1,018.4 | 409.0 | 2.5 (1.64-3.80) |
| 80-84 | 1,731.5 | 880.6 | 2.0 (1.35-2.81) |
Data taken from Farmer ME, White LR, Brody JA, et al.19
Relative risk comparing white women vs. nonwhite women (95% confidence interval)
| Age group (years) | 1985-94 | ||
|---|---|---|---|
| n | Rate b | 95 % CI | |
| 35-44 | 70 | 131.0 | 102.1-165.5 |
| 45-54 | 92 | 268.4 | 216.3-329.1 |
| 55-64 | 157 | 605.3 | 514.4-707.8 |
| 65-74 | 148 | 671.0 | 567.3-788.3 |
| 75-84 | 147 | 811.3 | 685.5-953.6 |
| >85 | 82 | 864.1 | 687.2-1072 |
| Subtotal c | 696 | 421.3 | 389.2-453.5 |
| Total d | 841 | 287.4 | 267.7-307.1 |
Data taken from Melton (1998)20
Incidence per 100,000 person-years
c Incidence per 100,000 person-years directly age-adjusted to the population of 1990 U.S. whites
d Incidence per 100,000 person-years directly age- and sex-adjusted to the population structure of 1990 U.S. whites
CI = confidence interval
Sixteen percent of postmenopausal women have osteoporosis of the lumbar spine.16 Five percent of 50-year-old white women and 25 percent of 80-year-old women have had at least one vertebral fracture.21 Although many vertebral fractures are only incidentally detected on X-rays, some cause severe pain that leads to 150,000 hospital admissions per year in those over age 65, 161,000 physician office visits, and more than 5 million days of restricted activity in those age 45 or older.22 The age-adjusted annual discharge rate for hospitalizations due to vertebral fracture is 17.1 per 10,000 for white females, 9.9 per 10,000 for white males, 3.7 per 10,000 for black females, and 2.5 per 10,000 for black males.23 These rates underestimate the true impact of vertebral fractures because only about 8 percent of vertebral fractures result in hospitalization.
The functional impact of vertebral fracture on quality of life can be substantial, perhaps as great as that for hip fracture.24 In the Study of Osteoporotic Fractures (SOF) -- a large prospective cohort of nearly 10,000 U.S. women age 65 years or older -- women with at least one incident of vertebral fracture were more likely to have increased back pain than those without (odds ratio [OR] 2.4; CI 1.7-3.3).25 These women also had more back disability (OR 2.5; 1.9-3.7), and at least 1 day of bed rest due to back pain per year (OR 6.7; CI 4.4-10.2), and 7 days of limited activity due to back pain per year (OR 3.8; CI 2.7-5.0).
Vertebral fractures may alter lifestyles of affected women, although only 4 percent are unable to live independently because of these fractures.26 In one study, 60 percent or more of a cohort of 100 women with vertebral fractures reported limitations with daily activities such as housework, and 82 percent reported a fear of falling.27 Difficulties with other activities, including social and leisure activities, were reported less frequently, although these activities were rated as equally important as the others.
The burden of osteoporosis extends beyond the consequences of fracture. The process of diagnosis and treatment also can affect quality of life. An osteoporosis-targeted quality of life questionnaire was developed to assess the impact of the disease in women in the community, specifically focusing on physical difficulty with activities of daily living, necessary adaptations, and fears.28 Using this questionnaire, women with osteoporosis indicated significantly more difficulties with routine daily activities compared with women with osteopenia or normal bone density.29 Also, women who had osteoporosis had significantly more fears than women who had normal bone density. It is not clear how comorbidities influenced these differences in quality of life.
The total direct medical expenditures for osteoporotic fractures in the United States in 1995 dollars were estimated at $13.8 billion.30 About 75 percent of these dollars were spent for white women, 18.4 percent for white men, 5.3 percent for nonwhite women, and 1.3 percent for nonwhite men. The majority of this total ($8.6 billion) was spent for inpatient care, about $3.9 billion was spent for nursing home care, and $1.3 billion was spent for outpatient services. Hip fracture accounted for $8.68 billion, with per-patient expenditures for hip fracture ranging from approximately $27,000 to $35,000. Fractures at other sites accounted for approximately 37 percent of the total expenditure (about $5.08 billion).
The average cost to society for treatment of a symptomatic vertebral fracture is estimated at $1,895 in 1996 Canadian dollars. Of these total dollars, 31 percent were attributed to hospitalizations, 37 percent to lost productivity, 22 percent to other outpatient costs, and 10 percent to caregiver productivity time lost.31
Distal radial fractures require hospitalization in perhaps 20 percent of cases and relatively little rehabilitation.32 Indirect costs (lost productivity) may well exceed the direct medical costs for these types of fractures.
| Test | Sites | Examination time, operator skill needed | Radiation exposure | Capital costs-technology purchase | Precision | Cost | Charges | Comments |
|---|---|---|---|---|---|---|---|---|
| minutes, skill level | $U.S. | % | $U.S. | |||||
| Single photon absorptiometry (SPA) | Wrist, heel | 5-15, low | low | $50-150 | Uses isotopes | |||
| Dual-energy photon absorptiometry (DPA) | Spine, proximal femur, whole body | 20-40 | low | $20,000 | 3-10 | inexpensive | $150-300 | Uses isotopes |
| Single X-ray absorptiometry (SXA) | Peripheral sites | 0.08-4.6 uSv | $20,000 | 0.5-2 | $50-150 | |||
| Dual-energy X-ray absorptiometry (DEXA or DXA) | Lumbar spine, proximal femur | 5-10, high | 0.08-4.6 uSv (pencil beam) or 60 uSv (fan beam) | $100,000-200,000 | 1-5 | Fairly expensive | $136 a | Influenced by osteoarthritis |
| Total body | 0.5 | Expensive | ||||||
| Peripheral dual-energy X-ray absorptiometry (pDXA) | Peripheral (wrist, heel) | 2-5 | Inexpensive | |||||
| Quantitative computed tomography (QCT) | Spine | 10-30, high | 25-360 uSv | $5,000-15,000 | 2-5 | $150-300 | Higher radiation exposure; Measures the true volumetric density | |
| Quantitative ultrasonography (QUS) | Heel, fingers, tibia, patella | 5-10, low | none | $10,000-100,000 | 0.4-4 | $35 a | Low cost, portable, no radiation | |
| Radiographic absorptiometry (RA) & Quantitative microdenistometry (QMD) | Hand | 5-10, high | 0.08-4.6 uSv b | 1-2 | $90-160 | Low cost, portable |
Average Medicare reimbursement
uses conventional CT or radiographic equipment
uSv = microSieverts
Densitometry devices that measure peripheral bone density are considerably less expensive to buy and use than axial DXA. Radiographic absorptiometry (RA) and quantitative microdensitometry (QMD) use computer software to estimate bone density from conventional radiographs of the hand. Other devices for measuring bone density in the arm or heel include single X-ray absorptiometry (SXA), peripheral dual X-ray absorptiometry (pDXA), and peripheral quantitative computed tomography (pQCT).
Quantitative ultrasound (QUS) devices report the way that bone attenuates sound waves and/or the speed with which sound travels through the bone.38 Commercial devices measure the "broadband ultrasonic attenuation" (BUA, expressed as decibels/megahertz [dB/MHz]), the speed of sound (SOS), and the "stiffness," a measure derived from the BUA and SOS. QUS does not measure bone mineral content and is categorized separately from the other technologies. While these measures are not highly correlated with measures of bone density made by DXA, some in vitro studies,39-42 but not all,43 suggest that QUS might reflect other aspects of bone structure that could be associated with fragility.
Bone measurement tests are used to predict fracture, to aid the clinician to diagnose osteoporosis, and to select patients for treatment.44 In clinical centers that specialize in bone density measurement, DXA of the hip is used as the definitive test for determining whether a patient has osteoporosis. Among bone tests at various sites, DXA of the hip is the strongest predictor of hip fracture. Moreover, most recent randomized trials of pharmacotherapy for osteoporosis have used the results of DXA of the hip as a criterion for entry, making it difficult to determine the benefit for patients identified as having osteoporosis by other tests.
For these reasons, DXA of the hip is considered to be the "gold standard" test for assessing bone density, but it is expensive and somewhat inconvenient. Alternatives such as QUS of the heel, RA of the hand, or peripheral DXA of the hand, wrist, or heel have become popular because they are more convenient and have lower capital costs and examination costs. According to some researchers, these technologies reduce the cost of testing and increase access to densitometry in primary care settings and in underserved areas.45,46 Other experts argue that these tests cause confusion because they frequently disagree with the results of DXA of the hip and they do not predict hip fracture as well.47
Is it worthwhile to make fracture risk assessment more widely available, even if the assessment is sometimes less sensitive than DXA of the hip? In part, disagreement over this question arises from a discrepancy between information in the published literature and the information needed to assess the clinical value of different bone measurement tests. There is a wealth of data about the analytical performance of various devices to measure bone, but there are almost no data about the clinical effectiveness of different strategies for identifying patients who are likely to have the greatest benefit from treatment. For example, there are no data by which to judge whether a cheaper and more convenient, but less sensitive, test would provide a net benefit. As Steven Cummings, M.D., has pointed out, "At the moment, women at the greatest risk of fracture appear to be only slightly more likely to receive treatment that reduces fracture risk than women who are at low risk. This strategy seems likely to miss the vast majority of women who account for most of the hip fractures. On the other hand, the impact of strategies to reduce rates of hip fractures in populations critically depends on the reach of the program and adherence with treatment."48
Epidemiologic studies also provide many data on the association of bone measurements with fracture risk. A population-based study provides the best information about the importance of a risk factor, but it is not necessarily the best study design for assessing the performance of a test in everyday clinical settings. Most epidemiologic studies report the adjusted relative risk for fracture per one standard deviation change in bone density or bone ultrasound measurements. By this measure, bone measurements predict hip fracture in women over the age of 60 years better than diastolic blood pressure predicts stroke, or serum cholesterol and smoking predict the risk of coronary artery disease, at any age.49
This relative risk measure is of limited value in comparing the clinical performance of different tests. Absolute risk is preferable to relative risk. However, if relative risks are to be used, they should be adjusted to population risks.13 A clinician cannot easily determine the clinical significance of differences in the relative risk per standard deviation of bone density. In a clinical study of prediction, information about the pretest probability of a fracture would be combined with bone density results. In most epidemiologic studies, the reported adjusted relative risk is an average across groups of patients with different pretest probabilities of a fracture. These studies do not directly address the ability of bone tests to predict fracture or to judge the need for treatment in an individual patient. Potentially meaningful differences in tests may become clinically insignificant if assessment of the pretest probability of fracture is imprecise, or if the pretest probability is inaccurately combined with results of bone density or QUS testing.
Because the relative risk is difficult to interpret clinically, some analytical studies estimate the probability of a fracture during a given time period, such as over 5 years, 10 years, or a lifetime. This approach requires extrapolation from prospective studies and from randomized trials of drugs for osteoporosis, most of which follow individuals for fewer than 4 years.
Another approach is to calculate the sensitivity and specificity of bone measurement tests, considering the test to be a "true positive" if a person who has a low test result has a fracture, and a "false positive" if she does not have a fracture. The values of sensitivity and specificity at different cutoff values of the test are used to construct a "receiver operating characteristic" (ROC) curve. The area under the curve is a measure of discriminant value that can be used to compare the performance of different tests.
The main limitation of this approach is that the observed sensitivity and specificity depend on the length of the observation period.50 The test is false positive only in the sense that the patient has not yet fractured even though she has osteoporosis. This approach also requires a specified time period and a method to adjust the results of studies that use a different duration of followup.
Another objection is that this terminology, which usually is applied to the ability of a test to diagnose an existing condition, should not be used to assess the ability of a test to predict events in the future. A similar objection applies to assessment of the value of other risk factors and of risk assessment tools in identifying patients who have a high risk of fracture. This confusion arises because bone measurement tests are used for prediction of fractures in the future as well as for diagnosis of osteoporosis in the present. It is reasonable to measure the sensitivity and specificity, and use ROC analysis, of one bone measurement test when another test is considered to be the "gold standard" determination of whether a patient has osteoporosis. However, diagnostic disagreement of this type does not necessarily mean that one test is better at predicting fracture than another.
| Publication | Year | Organization Producing Report | Conclusions | ||||
|---|---|---|---|---|---|---|---|
| Risk factors | Choice of test | Monitoring | Use of markers | Cost-effectiveness | |||
| Bone density measurement - a systematic review426 | 1997 | Swedish Council on Technology Assessment in Health Care | Must consider risk factors other than bone density alone to make decisions about testing or treatment. | BMD in hip or spine cannot be reliably estimated from measurements in arm or heel. | Measurements at intervals <2 years are unnecessary. | No documentation that repeated measurements of markers influence treatment in a way that improves long-term clinical outcomes. | |
| Effectiveness of bone density measurement and associated treaments for prevention of fractures427 | 1996 | International Network of Agencies for Health Technology Assessment | The precision and accuracy of all BMD tests in community settings are unknown. Accuracy of ultrasound still not proven. | BMD would require minimum followup of 1 to 1.5 years to detect bone loss of 2 to 3%. | |||
| Bone mineral density testing: does the evidence support its selective use in well women?428 | 1997 | British Columbia Office of Health Technology Assessment | Currently there are no validated risk assessment tools to select patients for BMD testing. In the general population, clinical assessment was no worse than BMD measurement in assessment of fracture risk. | Result of BMD test by any current technology is an unsuitable measure upon which to base clinical decisions. | Available economic evaluations are not adequate evidence that BMD testing is more cost-effective than universal hormone therapy or no intervention. | ||
| Osteoporosis: review of the evidence for prevention, diagnosis, and treatment and cost-effectiveness analysis8 | 1998 | National Osteoporosis Foundation | Appropriateness of measuring BMD depends on fracture risk, determined by age and other risk factors, and treatment being considered. | Given the better predictive value of hip measurements for hip fractures, hip DXA should be the primary measurement. | The longer the interval between measurements, the more precise the estimate of changes in bone mass; effect of monitoring on treatment is unknown. | Biochemical markers are promising but their role in patient management is not yet known. | |
| Quantitative ultrasound for bone density measurement429 | 1998 | Alberta Heritage Foundation for Medical Research | Quantitative calcaneal ultrasound is a promising diagnostic technology, but its role in osteoporosis diagnosis is unclear. Good evidence that ultrasound can identify increased risk of fracture in populations but not individuals. | ||||
| Selective testing with bone density measurement430 | 1999 | Alberta Heritage Foundation for Medical Research | There is potential for selective use of BMD in association with appraisal of other risk factors. Assessment protocols for such an approach have promise as a useful tool for selecting whom to test. Advice on treatment options should consider evidence of efficacy and effectiveness in terms of absolute reduction in risk of fracture, long-term compliance, and adverse effects. | Substantial uncertainty with the performance of BMD in correctly classifying an individual as osteoporotic. Ultrasound is less precise than DXA. | Minimum acceptable interval between measurements may be as long as 2 years. | ||
| Ultrasonography of the heel for diagnostic osteoporosis and selecting patients for pharmacologic treatment431 | 1999 | Blue Cross and Blue Shield Association | Use of ultrasound to direct treatment may result in a substantially smaller health outcome benefit as compared to DXA. 43 to 76% of patients benefiting from treatment would be identified by ultrasound (sensitivity); 75 to 90% of patients not benefiting by treatment would be identified by ultrasound (specificity). | ||||
| Osteoporosis: clinical guidelines for prevention and treatment432 | 1999 | Royal College of Physicians | Recommends selective testing in women with risk factors (based not on evidence but on expert opinion). | DXA at the hip is preferred because of higher predictive value for fracture risk. | Optimal use of BMD measurements in monitoring response to treatment is uncertain, recommend future research. | Until biochemical markers become more widely established and supported by evidence, their use in clinical practice will remain limited. | The cost-effectiveness of BMD measurements improves as the expense of the therapy goes up. |
| Consensus statement on prevention and treatment of osteoporosis433 | 1999 | Israel Center for Technology Assessment in Health Care/Israel Ministry of Health/Israel Medical Association | Physician's responsibility to estimate risk of osteoporosis and fractures and to consider performing additional tests; based on some risk factors, report recommends BMD every 2 years. | Recommends DXA done at facilities with quality control and following regulations on operations and interpretation of results; additional technologies to be considered if efficiency proven compared to DXA. | Calls for publicly funded BMD in women over 65 every 5 years; in postmenopausal women over 50 every 2 years if certain risk factors present; in those with disease entailing increased risk of osteoporosis, no limit on age or frequency of test. | ||
| Monitoring of bone density to assess active treatment of osteoporosis434 | 2000 | Blue Cross and Blue Shield Association | Insufficient evidence to determine if BMD monitoring in patients under treatment improves net health outcomes. | ||||
BMD = bone mineral density; DXA = dual-energy X-ray absorptiometry
An evidence report focuses attention on the strength and limits of evidence from published studies about the effectiveness of a clinical intervention. The development of an evidence report begins with a careful formulation of the problem. In this phase, a preliminary review of the literature and input from experts, stakeholders, and patients may be used to identify the patient populations, interventions, health outcomes, and harms. These parameters are summarized in an analytic framework, which is used in turn to generate a list of key questions to examine in a systematic review of the published literature.
This evidence report emphasizes the patient's perspective in the choice of outcome measures. Studies that measure health outcomes (events or conditions that the patient can feel, such as quality of life, functional status, and fractures) are emphasized over studies of intermediate outcomes (such as changes in bone density). Such a report also emphasizes measures that are easily interpreted in a clinical context. Specifically, measures of absolute risk or the probability of disease are preferred to epidemiologic measures such as relative risk or events per 1,000 women-years.
An evidence report also emphasizes the quality of the evidence, giving more weight to studies that meet high methodological standards that reduce the likelihood of biased results. In general, the results of well-done, randomized controlled trials are regarded as better evidence than results of cohort, case-control, or cross-sectional studies. These studies, in turn, are considered better evidence than uncontrolled trials or case series. An evidence report pays particular attention to the generalizability of efficacy studies performed in controlled or academic settings. Studies that reflect actual clinical effectiveness in unselected patients and community practice settings are highlighted. Finally, an evidence report considers the net benefit, after a thorough effort to assess both the benefits and the harms of a service or technology.
In the context of developing clinical guidelines, evidence reports are useful because they define the limits of the evidence, clarifying when the assertions about the value of the intervention are based on strong evidence from clinical studies. The quality of the evidence on effectiveness is a key component, but not the only component, in making decisions about clinical policies. Additional criteria include acceptability to physicians or patients, the potential for unrecognized harms, and cost-effectiveness.
An analytic framework is a schematic representation of the strategy used to organize topic areas and guide the literature search. The framework identifies the critical preventive or treatment interventions and links them to outcomes. After a preliminary review of the literature and discussion with local experts (Appendix A), we developed the analytic framework in Figure 1
What risk factors predict bone density, bone loss, and fractures?
Are tools for assessing risk factors accurate in identifying women at risk for fractures?
How well do the different bone measurement tests at different sites predict fractures?
What factors related to bone measurement testing influence diagnosis?
Can biochemical markers be used instead of bone measurement tests in identifying women at risk for osteoporosis?
How well do markers predict fractures?
Are risk factors useful in treatment decisions?
How do bone measurement test results affect patients' and physicians' decisions and actions?
Are bone measurement tests effective for monitoring response to therapy and for guiding decisions about changes in management?
Can markers help select patients for treatment?
What are the adverse effects of using biochemical markers to identify women at risk for osteoporosis?
Can markers predict response to therapy?
What diagnostic or laboratory tests are appropriate for evaluating patients with osteoporosis for secondary causes?
What are the costs and cost-effectiveness of diagnostic strategies for identifying women with osteoporosis?
Practicing clinicians often initiate an evaluation for osteoporosis with an assessment of risk factors for individual patients. Arrow 1 in the analytic framework addresses the role of risk factors in identifying women at high risk for osteoporosis. This review includes a survey of the medical literature identifying risk factors predictive of bone density, bone loss, and fractures. It also evaluates evidence of the usefulness of risk factor assessment tools in identifying women at risk for fractures.
We also examined how well the various techniques for measuring bone at different anatomic sites predict fractures, and what effect the choice of test has on the diagnosis of osteoporosis (Arrow 2). We reviewed the ability of markers of bone turnover to identify women at high risk of fractures.
Women eligible for treatment based on risk factors, bone measurement tests, or marker studies would then undergo further evaluation for secondary causes, selection and initiation of medical therapies, and subsequent monitoring if indicated. Arrow 3 in the analytic framework represents several questions relating to treatment and monitoring, including: (1) the role of risk factors and bone measurement tests in guiding initial treatment decisions, (2) the appropriateness of diagnostic or laboratory tests for evaluating patients with osteoporosis for secondary causes, (3) the role of markers of bone turnover in selecting patients for treatment, (4) the role of markers of bone turnover for monitoring response to therapy, and (5) the effectiveness of bone measurement tests for monitoring response to treatment and for guiding decisions about changes in management. Arrow 4 represents how the choice of diagnostic strategy affects the costs of care.
Search strings for each of the six topic areas are listed in Appendix B. Databases were searched three times during the course of the project, with the final search in April 2000. Retrieved abstracts were entered into an electronic database (EndNote™). Figure 2
A lead investigator was assigned for each topic. Two investigators reviewed each set of titles and abstracts to select articles for full-text review. A research assistant tracked the inclusion status and names of reviewers for each abstract reviewed. We included abstracts that had original data about postmenopausal women and were relevant to a key question in one or more topic areas. For studies of prediction using risk factors, bone measurement tests, or biochemical markers, we selected articles if they reported the relation between a specific risk factor, or a risk assessment tool (a set of risk factors), and bone measurement test results, bone loss, or fracture at any site. We excluded studies of predictors of osteoporosis and fracture in patients with steroid use because we considered this a well-known secondary cause of osteoporosis. We also excluded case reports, animal studies, cadaver studies, studies in patients with serious conditions (organ transplant recipients, inpatients, and those with other known secondary causes of osteoporosis), and studies that did not have an English language abstract.
When a disagreement between two reviewers arose, the lead investigator for the topic reexamined the abstract and determined whether the full text of the article should be retrieved. Investigators also were encouraged to flag abstracts that they felt could be relevant for other topics. Support staff maintained a database to refer these citations to the appropriate investigator if they were not already present in their topic-specific abstract database.
After this review, 530 articles about risk factors, 123 about bone measurement tests, 23 about bone measurement tests for monitoring, 277 about biochemical markers, and 53 about costs were selected as possibly relevant. An additional 242 studies were retrieved after reviewing reference lists of studies and by suggestion of the expert panel or leading researchers in the field. The full text of these 1,248 studies were retrieved from the library or ordered through interlibrary loan. Investigators read the full-text version of the retrieved papers and reapplied the initial eligibility criteria. For all topics, we excluded articles if they did not provide sufficient information to determine the methods for selecting subjects and for analyzing data. For some topics, additional criteria were applied to select studies that were systematically reviewed and included in evidence tables as follows:
Risk Factors. Articles were included if they specified methods for measuring risk factors and if they reported bone density, bone loss, or radiographically verified fractures. Secondary causes of osteoporosis, although important to consider for individual patients, were not included here as clinical risk factors. Randomized controlled trials of interventions were not included in the review of risk factors, although it was acknowledged that trials of calcium and vitamin D supplementation, for example, provide additional support for the importance of calcium and vitamin D deficiencies as risk factors.
Bone Density and Quantitative Ultrasound Tests. Prospective studies of predicting fracture risks from bone density and QUS tests were included. We excluded studies of single- and dual-photon absorptiometry, which have been widely replaced by more recent technologies. Case-control studies were examined if they provided sufficient data to construct a cross-tabulation of fracture status by densitometry or QUS results. Studies of the agreement between different densitometry tests were systematically reviewed if they provided data on how the use of different tests affected classification of patients using T-scores or Z-scores.
Markers. We included prospective studies of the diagnostic accuracy of selected markers using bone densitometry as the reference standard and studies of the use of markers to predict fracture, bone loss, or response to therapy.
Cost. We examined reports that compared two or more diagnostic approaches, specified the tests and sites considered, and reported costs associated with diagnosis and subsequent events.
The following information about the patient population, interventions, clinical endpoints, study design, and study quality was extracted from full-text, published studies of fracture or bone density prediction and used to construct evidence tables: identifying information (study name, years of observation); setting (population-based, referral clinic-based, other); study design (randomized trial, prospective, longitudinal cohort study, nested case-control study, other case-control study, cross-sectional study); predictors studied (specific risk factors, bone density and QUS tests, other imaging tests, biological marker assays); length of followup; statistical methods for handling confounders (statistical adjustment, stratification, none) and attrition; numbers of subjects recruited, included, and completing study; and characteristics of the sample (demographic variables, time since menopause, other risk factors). For studies of diagnostic agreement, we also abstracted information about the reference range or decision limits used to classify test results and about the criteria for the determination of disease. For economic evaluations, we also extracted the type of economic evaluation, the primary outcomes reported, data sources, cost unit, discount rate, and what parameters were varied in the sensitivity analyses and results.
All data were abstracted by the lead investigator for the topic. If the lead investigator encountered difficulty in finding or interpreting information in the published report, a second investigator reviewed the article and a consensus was reached. In several cases, we contacted authors to request data that were not included in published reports.
To assess the internal validity of individual studies, we applied a set of criteria developed by the Third U.S. Preventive Services Task Force (Appendix C). This includes a description of a set of minimal criteria for each study design and general definitions of three categories of quality (good, fair, and poor) based on those criteria. These ratings were used to assess cohort studies, randomized controlled trials, and case-control studies within the risk factor, bone measurement test, and marker sections of this paper. The diagnostic test category was used in the diagnostic performance of bone measurement tests and biochemical marker literature as well. We also noted test-review bias and concealment of the test result when assessing disease status (e.g., fracture). A brief description of ratings with criteria by study design follows.
A "good" rating was achieved if the study met all the following criteria: comparable groups were assembled initially and maintained throughout the study (followup at least 80 percent); reliable and valid measurement instruments were used and applied equally to the groups; interventions were spelled out clearly; important outcomes were considered; appropriate attention was given to confounders in the analysis; and intention-to-treat analysis was used in randomized controlled trials.
A study received a "fair" rating if any of the following problems were seen: generally comparable groups were assembled initially, but some question remains whether some (although not major) differences occurred in followup; measurement instruments were acceptable (although not the best) and generally applied equally; some, but not all, important outcomes were considered; some, but not all, potential confounders were accounted for; and intention-to-treat analysis was used in randomized controlled trials.
Studies were given a "poor" rating if any of the following flaws existed: groups assembled initially were not close to being comparable or were not maintained throughout the study; unreliable or invalid measurement instruments were used or instruments were not applied equally among groups (including not masking outcome assessment); key confounders were given little or no attention; and intention-to-treat analysis was lacking in randomized controlled trials.
A study received a "good" rating if the following criteria were met: one or more relevant available diagnostic tests were evaluated; a credible reference standard was used; the reference standard was interpreted independently of the diagnostic test; the reliability of the test was assessed; indeterminate results were few or handled in a reasonable manner; and a large number (more than 100) of broad-spectrum patients with and without disease was included.
A study received a "fair" rating if it evaluated one or more relevant available diagnostic tests; used a reasonable, although not the best, reference standard; interpreted the reference standard independently of the diagnostic test; and consisted of a moderate sample size (50 to 100 subjects) and a "medium" spectrum of patients.
A diagnostic accuracy study with a "poor" rating was characterized by having one or more of the following flaws: an inappropriate reference standard was used; the diagnostic test was administered improperly; a biased ascertainment of the reference standard was shown; or a very small sample size of a very narrowly selected spectrum of patients was included.
A "good" rating included the following: an appropriate ascertainment of cases and a nonbiased selection of case and control participants were obtained; exclusion criteria were applied equally to cases and controls; the response rate was equal to or greater than 80 percent; diagnostic procedures and measurements were accurate and applied equally to cases and controls; and appropriate attention was given to confounding variables.
A "fair" rating included those studies that were recent, relevant, and without major apparent selection or diagnostic workup bias; had response rates of less than 80 percent; or provided attention to some but not all important confounding variables.
A "poor" rating was given to a study in this category if it had major selection or diagnostic workup biases, response rates were less than 50 percent, or little or no attention was provided to confounding variables.
For studies of the relation between risk factors and fracture or other outcomes, we also assessed completeness and length of followup; the use of co-interventions in compared groups; the training level of the personnel conducting the intervention; the inclusion and exclusion criteria used in individual studies; and the methods used to establish the outcome measure.51
| Principle | Description |
|---|---|
| Perspective stated | Whose costs and consequences are considered? |
| Benefit described | What are noneconomic consequences of program? |
| Costs included | What are the intervention costs, morbidity or side-effect costs, averted costs, and induced costs? |
| Discounting included | Are future costs and consequences adjusted for timing? |
| Sensitivity analyzed | For values that are uncertain (e.g., assumed), are analyses performed using alternative values? |
| Incremental cost-effectiveness comparisons presented | Are alternatives compared in a way that allows decisions on prioritization to be made? |
Adapted from Udvarhelyi52
For each study of risk factors, we recorded the relative risk (RR) or odds ratio (OR) for fracture and, when available, the correlation coefficient for fracture or bone density. For prospective studies of fracture prediction using bone density and QUS test results, we calculated the prevalence of fractures in the sample and the probability of a fracture, given various bone density and QUS test results. We then calculated the "risk difference," which we defined as the difference between the probability of a fracture if a test revealed osteoporosis and the probability of a fracture otherwise. If a study reported results for different decision limits (cutoffs), we calculated a risk difference for each threshold.
For studies of the agreement between tests, we recorded or calculated the correlation coefficient, sensitivity, specificity, likelihood ratio (LR), and slope of the regression line for bone density and QUS tests to predict other tests, or of markers to predict the results of bone density or QUS testing. The LR for a positive test was calculated using the formula LR={[PV/(1-PV)]/[p*(d)/1-p*(d)]}, where PV is the probability of the disease, given a positive test result, and p*(d) is the apparent prevalence of disease, estimated as p(d)=(number of true positives+number of false negatives)/(number of patients screened).51
Although the role of clinical risk factors in diagnosing and monitoring osteoporosis is currently unresolved, much research indicates that clinical risk factors serve as important predictors of bone density and fractures. Prospective studies have demonstrated that while bone density is a major predictor of future fractures, it is only one factor in a complex interaction of many (Figure 3
Some risk factors are potentially modifiable. Identification of these factors could allow a clinical intervention, such as changing medication or improving visual acuity, to prevent falls leading to fractures. Women at risk may benefit by knowing what their risk factors are and how to modify them. Preventive measures such as exercise, increased calcium intake, cessation of smoking or excess alcohol use, and improved safety measures at home are steps women could make to potentially reduce their fracture risks.
In this section, we address three questions about risk factors, focusing on those that are most useful in the clinical setting:
What risk factors predict bone density, bone loss, and fractures?
Are tools for assessing risk factors accurate in identifying women at risk for fractures?
Are risk factors useful in treatment decisions?
Many studies have been published about risk factors for low bone density, bone loss, and fractures in postmenopausal women. These studies vary by design, population, measurement of risk factors and outcomes, and methods of analysis. No single study includes all risk factors and outcomes of interest, although longitudinal studies and well-done case-control studies that use multiple regression methods provide some of the best evidence about the independent contribution of risk factors.
These results were used to illustrate the combined influence of declining calcaneal bone density and increasing number of clinical risk factors on hip fracture risk (Figure 4
These data demonstrate that factors other than bone density significantly affect the risk of fracture. Other investigations support this observation. A good-quality, large prospective study of fall-related factors and hip fracture from the Epidemiologie de l'Osteoporose (EPIDOS) cohort of European women age 75 years and older found that independent fall-related predictors of hip fracture included slower gait speed, difficulty in performing a tandem walk, reduced visual acuity, and small calf circumference.55 After adjustment for femoral bone density, all functional measures remained significant. Another study from the same cohort reported that femoral bone density (adjusted RR [ARR] for hip fracture 1.6; CI 1.38-1.95), calcaneal BUA (ARR 1.4; CI 1.2-1.7), gait speed (ARR 1.5; CI 1.3-1.7), and age (ARR 1.5; CI 1.2-1.8) have approximately the same discriminant value to identify women at high risk of hip fractures.56
Similarly, in a prospective study of 1,177 elderly women in residential care, cognition assessed with the Clifton Assessment Procedure for the Elderly (CAPE) was a strong predictor of hip fracture compared with calcaneal QUS (Figure 5
Although most studies about risk factors and fractures have focused on hip fracture outcomes, risk factors for other types of fractures also have been reported.58-61 Other fractures, such as wrist and vertebral fractures, are more common in postmenopausal women than hip fractures and may be especially dominant in younger peri- and postmenopausal women. In a population-based prospective cohort of 3,140 Finnish women age 47-56 years (mean age 53), DXA of the lumbar spine and femoral neck and other clinical risk factors were assessed at baseline, and incident fractures were recorded over 2.7 years of followup.62 Fractures of the wrist were the most common type of fracture in this cohort (n=46), followed by fractures of the ankle (n=30) and rib (n=26); vertebral (n=7) and hip (n=1) fractures were less common. In this study, women with fractures had significantly lower bone density at lumbar and femoral sites compared with those without fractures. History of a previous fracture was a strong predictor of further fractures (OR 2.83 [CI 1.95-4.10]). Women with fractures also had higher weekly alcohol intake (100 g/week or more; OR 1.70 [CI 1.08-2.67]), and were more likely to be younger than 45 years when they had undergone bilateral oophorectomy (OR 3.64; CI 1.01-13.04). Women on hormone replacement therapy (HRT) were less likely to have a fracture (OR 0.94;0. CI 88-0.99). Smoking, type of menopause, age at menarche, parity, and lactation history did not differ between the groups.
A prospective study of incident vertebral fractures in 5,807 postmenopausal women 65 years or older from the SOF found several differences between women with fractures and those without.25 Women experiencing their first vertebral fracture were significantly older and thinner, had a decline in grip strength compared with previous values, were less likely to take estrogen, and reported more mean height loss since age 25 years than women without fractures. A case-control study of vertebral fracture risk factors in 1,012 women in the United Kingdom found that those with fractures were significantly older, shorter, had an earlier menopause, lower parity, and a greater frequency of hyperthyroidism than those without vertebral fractures.63
Risk factors for low bone density and bone loss also could be important to consider when making decisions about bone specific treatment such as bisphosphonates. If a woman's risk factors are primarily functional, such as an unsteady gait or poor cognitive function that increase her risk for falls leading to fractures, improving her bone density by prescribing medications may not be in her best interest. Several studies evaluated risk factors for low bone density and bone loss and are described in Evidence Tables 2 and 3. Risk factors associated with spine and proximal femur bone density have been reported by the SOF based on analysis of cross-sectional data.64 Weight was the most strongly associated factor. Estrogen exposure, physical activity, and calcium intake were positively associated with bone density, whereas a family history of osteoporosis was associated with reduced bone density. A similar analysis of risk factors associated with bone density at radial sites was conducted in the SOF cohort.65 This study found that a large number of factors influenced the bone density of these women, and that age, weight, muscle strength, and estrogen use were the most important. These findings are summarized in Figure 6
Numerous studies about single or related risk factors for osteoporosis and fractures also have been published. These studies were conducted in a variety of populations and present additional insights about individual factors that were not considered in studies investigating multiple predictors.
Throughout the adult lifespan, black women have higher bone density than white women at every site tested.66 They have a higher peak bone density and a slightly slower rate of adult bone loss from the femur and spine. These measures have been found to be higher in whites than in Asians67 and North American Indians.68 Hip fractures are more common in white women (139/100,000) than in Mexican-American (67/100,000) or black (55/100,000) women.69 Japanese women also have a lower fracture rate than white women.70
A good-quality prospective cohort study of elderly Japanese women found that age, low body mass index (BMI = [weight in kg]/[height in m]2), regular alcohol use, previous vertebral fractures, and having five or more children were risk factors for hip fracture.72 A case-control study in Japan also found low BMI and alcohol use related to hip fracture.73 In addition, it noted that previous residence in a rural area, sleep disturbance, stroke with hemiplegia, coffee use, and use of a Western bed were risk factors; eating fish was found to be protective.
Several studies reported a positive relationship between fracture and height.74-77 Some of these associations included current height, height at age 25, and height loss.54,74 Height loss also was associated with low bone density.78 These relationships may be explained by the differences in bone geometry that promote fracture occurrence in taller women.
Weight and BMI are consistent predictors of bone density and fractures in many studies. Increasing weight and BMI are associated with higher bone density,79-85 and both lean body mass and fat body mass appear to affect bone density in most of these studies.
Women with higher weight and BMI have lower risks for fractures,54,74,77,86-89 although this effect was more consistent for hip fractures than wrist fractures in some studies.90 Involuntary weight loss and weight variability significantly increased risks for fracture in several studies, including the SOF.91-95 Low body weight was as significant a risk factor for hip fracture for black women96 and Japanese women72 as for white women. Mechanisms for this relationship include decreased sex hormone-binding globulin levels that increase free sex steroid levels in obese women,80 weight-bearing mechanical forces on bone, cushioning of the hip against the impact of a fall, and weight loss acting as a marker of illness and fragility.
Consistently, women with a family history of osteoporosis have lower bone density than those without.97,98 In the SOF cohort, the risk of hip or wrist fracture was increased in those with a family member with a hip or wrist fracture. The relative risk of hip fracture for those with a mother's (RR 1.48; CI 1.03-2.11), sister's (RR 1.83; CI 1.20-2.80), or brother's (RR 2.26; CI 1.16-4.42) history of hip fracture was not affected by adjustment for bone density.54,99
Studies of postmenopausal women and their premenopausal daughters report that daughters of women with low bone density also had low bone density when compared with daughters of women with normal bone density,100-102 although this was not found in every study.103 Studies of twins indicate that bone density104-106 is more highly correlated in monozygotic than in dizygotic twins.
Fracture history is associated with an increased risk for future osteoporotic fractures, independent of bone density. Data from the SOF cohort showed that women with a vertebral deformity found on conventional spine X-ray were almost three times as likely as others to suffer a hip fracture over 8 years of followup.107 Women with previous vertebral fractures are at increased risk for subsequent vertebral fractures.108-111 Those with previous wrist fractures have increased risks for subsequent hip fractures,112-115 and those women with previous fractures of various types are at risk for subsequent fractures at several sites,116-119 specifically at the hip.54,120,121 Having had a previous fracture identifies women who may have any number of predisposing risk factors that could result in another fracture.
A meta-analysis of 29 published cross-sectional studies of smoking and bone density indicated that postmenopausal bone loss was greater in current smokers than in nonsmokers.122 Bone density diminished by an additional 2 percent for every 10-year increase in age for smokers, with a difference of 6 percent at age 80. A study of twins found that for every 10 pack-years of smoking, the bone density of the twin who smoked more heavily was significantly lower at the lumbar spine and femoral shaft than that of the other twin.123
A meta-analysis of 19 cohort and case-control studies of smoking and hip fracture found that in current smokers relative to nonsmokers, the risk of hip fracture was similar at age 50 but greater thereafter by an estimated 17 percent at age 60, 41 percent at 70, 71 percent at 80, and 108 percent at 90.122 Individual studies reported that the risk of hip fracture increased with the amount smoked and did not reach nonsmoker levels until 10 years after cessation.124 Other studies found that the risk for hip fracture was higher for thinner smokers than for normal or overweight smokers,125 use of estrogen did not protect against fracture in smokers as it did in nonsmokers,126 and the risk for hip fracture was highest in smokers with low intake of vitamins E and C.127
Alcohol use is an inconsistent predictor of bone mass and fractures, probably because participants of studies typically represent low to moderate users and nonusers.128-131 Excessive alcohol use is a known secondary cause of osteoporosis. Data from the Framingham Study found that alcohol intake of at least 7 ounces per week was associated with high bone density,132 and current alcohol consumption was associated with a significant increase in risk of hip fracture, particularly in women under 65 years old (relative risk [RR] 2.96; CI 1.24-7.10).133 The effects of moderate alcohol on bone may be related to the augmentation of endogenous estrogen levels by alcohol.
Caffeine intake is inconsistently associated with bone density and fractures. Some studies have found that caffeine use is associated with low bone density at several sites134 and with annual percent loss of bone density,135 while others have found no association.129,136,137 The SOF reported current caffeine intake as a risk factor for hip fracture (RR 1.2; CI 1.0-1.5).54 In other studies, the risk for hip fractures was found to be mildly increased in women who drank more than two cups of coffee per day73,138,139 and more than four cups of coffee per day.140
Randomized controlled trials indicate that supplemental calcium and vitamin D reduce the risk of hip fractures and other nonvertebral fractures in elderly women.141,142 The influence of calcium intake from dietary sources is less clear. Some studies report little or no association between dietary intake of calcium rich foods and bone density143 or fractures,144-147 including a study of NHANES I data.148 Other studies report that calcium intake through milk and milk products in childhood and adolescence was significantly lower in women with osteoporosis than in controls, although there was little difference between groups for intake later in life.149,150 In other studies, dietary calcium intake was associated with higher bone density151-153 and reduced risk of fracture.121,154-156 Vitamin D had its strongest positive effect in particularly frail, elderly women.157,158
Exposure to high levels of fluoride in drinking water has been found to cause changes in bone mineralization. Fluoride ions are incorporated into bone as fluoroapatite, influencing mineral structure and affecting bone strength. As a result, bone density may be higher than normal, but fractures are more common. A meta-analysis of studies of fluoride and bone density indicated that those with higher fluoride exposure had a reduced risk for diagnosis of osteoporosis (RR 0.39; CI 0.2-0.75).159 A similar meta-analysis of 18 studies of fluoride and fractures indicated that those with higher fluoride exposure had increased risks for any type of fracture (RR 1.12; CI 1.04-1.21).159 Several individual studies, however, did not find these relationships.160-163
Several measures of physical function have been found to be associated with bone density. Greater grip strength, usually measured by a hand-held dynamometer, had a significant positive association with bone density of the proximal femur,164,165 spine, and wrist sites,165 as well as a protective effect for hip fracture.166-168 Lower quadriceps strength,169 slower gait speed,56 and other functional measures54,58 were predictive of hip fracture. The risk of hip fracture also was increased in women with impaired vision.54,89,170,171
Several studies support the association between current exercise and higher bone density.172-175 Women with higher current activity levels have lower risk for fractures than those with less activity.144,176-183 Previous levels of exercise, such as during adolescence and young adulthood, have inconsistent associations with bone density later in life,184-186 but previous recreational activities were associated with a reduced risk of hip fracture in one study.187 Having had a previously active job in midlife also was associated with a reduced risk of hip fractures.176,188
More than 90 percent of hip fractures occur following a fall.89 For women over 70, the tendency to fall is a more important predictor than other factors in some studies.189,190 A history of falls is a significant risk factor for fractures in many studies.88,168,191,192 Risk factors for falls with resulting fractures include impaired mobility,55,193,194 environmental hazards,195,196 fear of falling, reduced knee extension strength, and poor visual acuity.197
In the SOF, women with low baseline bone density had worse baseline cognitive scores, and women with vertebral fractures had lower cognitive test scores and a greater odds of cognitive deterioration than those without fractures.198 Hip fracture was found to be associated with both Alzheimer's disease and vascular dementia in a study that found the rate of hip fracture in these women to be more than twice that of the general population.199 Another study found that hip fractures are associated with Alzheimer's disease.200 Even women with mild impairment on the Mini-Mental Status Exam had a higher risk for hip fracture than those with normal results.201
Women with depression have been found to have lower spine202 and hip bone density203 than other women. Depressed women in the SOF also were more likely to fall, and to have increased rates of vertebral and nonvertebral fractures, than nondepressed women.204
Several studies support associations between high bone density and a long reproductive period characterized by early menarche and late menopause,205 although late age at menopause and short time since menopause seem to be the most consistent factors across studies.62,206-210 One study found that the effect of menopause on fracture incidence was stronger than the effect of a 5-year increase in age.211 Late menopause also was associated with reduced risk for vertebral deformity.212 Other studies found no relationship among these factors.207,213,214 Experts believe there is an age-dependent risk associated with reproductive factors that assumes higher importance in younger cohorts.
Use of estrogen replacement has been consistently associated with higher bone density215 and decreased risks of fractures of many types, including wrist, vertebral, hip, and other nonvertebral fractures.216-222 Estrogen had a protective effect on fracture risk if used ever218 or used within 2 years of the study,218 for younger as well as for older postmenopausal women.216,221 The use of progestins has not been well evaluated in these studies.
A history of oral contraceptive use was protective against low bone density in some women,223,224 although the risk of subsequent fractures in women who had ever used oral contraceptives was greater than that of nonusers in another study.225 The effects of current doses of oral contraceptives on postmenopausal osteoporosis has yet to be determined.
Tubal ligation226 and parity213,227 were not found to be associated with fractures. Parity72,214,228 and number of previous miscarriages228 were not associated with bone density. Breastfeeding was protective for hip fracture in one study,213 but not in another,227 and was associated with higher bone density in others.214,229
Many studies have reported the effects of commonly used drugs on bone. We considered glucocorticoid use as a secondary cause for low bone density and fractures and did not evaluate it in this review.
A meta-analysis of 18 observational studies found that current thiazide users were protected against hip fracture, with an overall odds ratio of 0.82 (CI 0.73-0.91).230 In the SOF, women on thiazides for more than 10 years had significantly higher bone density than nonusers.231 Thiazides also have been associated with reduced risk of wrist fractures.232 However, no randomized controlled trials of thiazides for fracture prevention have yet been conducted. A protective effect for furosemide also has been reported.233
Thyroxine-treated women with low thyroid-stimulating hormone (TSH) levels were found to lose bone density from the spine more rapidly than women without known thyroid disease.234 Low bone density measured at several anatomical sites also was found for long-term users of thyroid hormones at thyroxine- equivalent doses of 1.6 microgram/kg or greater.235 Other studies found no relationship between thyroxine use and bone density.236-239
A recent large, population-based, nested case-control analysis of individuals over 50 years old in the United Kingdom found that the use of statin drugs (HMG-CoA reductase inhibitors) for treatment of hypercholesterolemia was associated with a decreased risk of fractures (OR for women 0.56; CI 0.43-0.74).240 A case-control study including 6,110 New Jersey residents age 65 and older (83 percent women) found similar associations between statin use and reduction in hip fractures (adjusted OR 0.05; CI 0.37-0.76).241 Randomized controlled trials have not yet been done to confirm these relationships.
Use of antidepressants -- including selective seratonin uptake inhibitors, secondary-amine tricyclics, and tertiary-amine tricyclics -- has been associated with increased risks for hip fracture.242-245 Use of lithium was not associated with decreased bone density.246
Women who used propionic acid, but not acetic acid, nonsteroidal anti-inflammatory drugs (NSAIDs) had higher bone density at the radius and lumbar spine than nonusers.247 Data from the SOF indicated that daily use of aspirin or NSAIDs was associated with increased bone density of the hip and spine, but there was not a significant change in risk for hip fracture.248
Studies of other commonly used drugs have had conflicting or inconclusive results. Use of benzodiazepines increased the risk of hip fracture in some,244,245,249-252 but not all studies.253,254 Long-term exposure to oral anticoagulation was associated with an increased risk of vertebral and rib fractures in one study,255 but not in another.256 Chronic use of H2 receptor antagonists had little effect on bone density.257 Intermittent users of nitrates had greater hip and heel bone density than nonusers; daily users had only slightly greater hip bone density than nonusers.258
Several factors are consistently associated with increased risks of low bone density and fractures in postmenopausal women, including increasing age, white race, low weight or weight loss, nonuse of estrogen replacement, history of previous fracture, family history of fracture, history of falls, and low scores on one or more measures of physical activity or function. Other factors are less consistent predictors across studies, but also have significant associations with bone density and fractures. These include smoking, alcohol use, caffeine use, low calcium and vitamin D intake, and use of certain drugs. The relative risks of several risk factors are comparable to that of a one standard deviation difference in bone density. Predictors for low bone density are similar to those for fracture except for those specifically related to falls. Most of the strongest risk factors are consistently related to outcomes in different racial and ethnic populations. Risk factors are generally similar for the various fracture sites except that fractures related to falls have additional functional risk factors.
In contrast to the extensive body of literature about determining clinical risk factors for osteoporosis and fractures, few studies evaluate how to use these risk factors to identify individual women at risk for fracture.259 Ideally, tools for assessing risk factors would aid clinicians in determining who should and should not undergo bone density testing, reduce modifiable risk factors, and consider treatment.
Variables in the models differed among studies. Most studies determined which variables to include by finding the most significant predictors within their study population and combining them to create a score. These scores were developed by assigning a certain number of points to risk factors based on the magnitude of their odds ratios, standardizing the beta coefficients of the regression results, or stratifying the results and designating a threshold for a positive score. Some models focused on certain types of variables, such as a model that included weight over 70 kilograms as the only variable,83 and another that included four cognitive and functional variables.261 Others included a broader range of variables such as age, weight, activity level, family history, and smoking status.267
Four studies evaluated hip fracture outcomes, two studies evaluated vertebral fractures, and two studies evaluated all types of fractures. These studies determined how well risk factors were associated with fractures known to have occurred already (four case-control studies), or how well they would predict fractures in the future (four prospective studies). The other nine studies evaluated bone density outcomes. These studies determined how well risk factors identified women with low bone density obtained by densitometry at the time of the study (all cross-sectional studies). Only 3 of the 17 studies validated the model in a population different from that in which it was developed.
One risk assessment tool from a good-quality study assigned points to selected risk factors for low femoral neck bone density (age, weight, race, estrogen use, presence of rheumatoid arthritis, history of fractures) to create a summary measure referred to as the Simple Calculated Osteoporosis Risk Estimation (SCORE).271 These risk factors were obtained from over 1,200 women from the community and were subsequently tested in a validation group (n=259). SCORE had an area under the ROC curve of 0.81 in the development group, and a sensitivity of 91 percent and a specificity of 40 percent in the validation group.
The performance of SCORE also was evaluated in a separate cohort from the Rancho Bernardo Study.274 These women had a mean age approximately 10 years older than women in the SCORE cohorts. A total of 1,013 postmenopausal white women age 44 to 98 years underwent assessment with SCORE protocol and DXA of the femoral neck. Using the recommended cut point of 6, the sensitivity of SCORE was 98 percent, specificity 12.5 percent, positive predictive value 69 percent, and negative predictive value 75 percent. SCORE also has been used in two studies of cost-effectiveness.12,275 These applications of SCORE are discussed in the Cost section of this report.
The best performing model for fracture outcomes reported an area under the ROC curve of 0.83.262 This model was based on a prospective study of 5,208 subjects in the Rotterdam Study whose risk factors were determined at baseline. Hip fracture outcomes were determined 3.8 years later and risk points were assigned from beta coefficients of regression models. Variables in the model included age, gender, height, use of a walking aid, current smoking, and weight. These are all easily obtainable variables that represent several of the most consistent predictors reported from other studies. Adding femoral neck bone density to the model and remodeling all of the parameters improved its performance slightly (ROC area 0.88) (Figure 7
Few studies report results of the performance of tools for assessing clinical risk factors for osteoporosis. Our evidence review suggests that specific tools to assess risk have inadequate sensitivity and specificity and have not yet been widely tested. Generalizability of these tools to the clinical setting is therefore currently limited. However, some tools, especially those developed in large community populations and containing variables known to be strong predictors, may ultimately be applicable to the clinical setting once they are tested there.
Although this approach may be useful in the general population, translating the results of population studies to the care and management of individuals requires additional considerations. Risk factors that are not common may be dominant in individual patients. Also, some risk factors may be important for different types of fractures and for different age groups, for example. It is not yet known how tools for risk assessment would perform under these varying circumstances. In addition, clinicians may find it difficult to selectively exclude low-risk women from diagnostic protocols because a substantial number of women without specific risk factors also experience fractures. Patients may have similar concerns.
The use of risk factors other than low bone density in treatment decisions is advocated by experts7,9 and included in guidelines by the National Osteoporosis Foundation.8 In one approach, treatment decisions are based on the overall risk of fracture, rather than on the bone measurement test alone. The assumption underlying this approach is that the number needed to treat is lower, and the cost-effectiveness of treatment is higher, for higher risk patients.
We found no studies in our literature search testing a risk-based treatment approach. Such studies would directly compare the benefits of treatment for women who have clinical risk factors but normal or mildly low bone density with those for women with few risk factors but lower bone density. Alternatively, we examined the entry criteria of eight trials of alendronate therapy to assess how applicable recent treatment trials are to a risk-based approach. In these trials, treatment was intended to halt or reverse the loss of bone and prevent fractures in women who had osteoporosis (secondary prevention).276-283 We focused on alendronate trials because many clinicians consider calcium and vitamin D supplementation a standard of care for older women, and because the decision to begin HRT is a complex process involving consideration of risks and benefits unrelated to osteoporosis treatment. Subjects in these trials were predominantly white postmenopausal women, generally in their mid-60s to 70s, and were often selected because of their low bone density and, in some trials, previous vertebral fractures.
One study from the Fracture Intervention Trial (FIT) included 2,027 women who had T-scores of -1.6 or less and preexisting vertebral fractures.276 The 3-year risk of hip fracture in the placebo group was 2.2 percent (1.1 percent in the alendronate group, relative hazard [RH] 0.49 [CI 0.23-0.99]), and the 3-year risk of any clinical fracture was 18.2 percent (13.6 percent in the alendronate group, RH 0.72 [CI 0.58-0.90]).
A second study from FIT included 4,432 women who also had T-scores of −1.6 or less, but did not have preexisting vertebral fractures.283 The 4-year incidences of hip fracture (1.1 percent) and any clinical fractures (14.1 percent) in the placebo group were less than those of the women in the placebo group of the FIT study that included women with preexisting vertebral fractures. In this second study, only the subgroup of treated patients (n=1,627) who had a T-score under −2.5 had a significant reduction in all clinical fractures, from 19.6 to 13.1 percent (RR 0.64; CI 0.50-0.82). There was no reduction in fractures for patients who had a T-score between −2.0 and −2.5 or for those between −1.6 and −2.5.
These results suggest that women with preexisting vertebral fractures and low bone density have a higher incidence of fractures than women with low bone density alone. Women with more risk factors for fracture, such as those with preexisting vertebral fractures or the lowest bone density, had the greatest benefit with treatment. These results do not indicate that patients with a high overall fracture risk who have a T-score above −2.5 will not benefit from therapy. In the highest risk group of the second study (those who had a T-score under −2.5), the incidence of fractures was still much lower than the estimated risk for an 80-year-old woman with a T-score of −1.5 and two or more risk factors.8 The FIT investigators did not attempt to stratify patients by overall risk, and they have not published any subgroup analyses that compare the benefit of treatment in patients who had very high overall fracture risk but relatively high bone density for their age. None of the other studies reviewed has done this, either.
Identification of important risk factors could be useful in making treatment decisions. Although models have been proposed, they have not been prospectively tested in large trials and are not used by clinicians. Clinical trials have not focused on patients who have several risk factors and a high overall risk of fracture but who do not meet the WHO definition for osteoporosis. A few risk factors -- sex, race, age, menopause, a history of previous fracture, and low bone density -- have been primary selection criteria for randomized trials of bisphosphonates and provide clues to the validity of the risk-based approach.
In this section we examine how well different tests predict fracture and how well they agree about which patients have osteoporosis. We focused on recent prospective studies of bone density measures in use today, particularly DXA of the hip and spine, QUS of the heel, and RA or DXA of the hand or forearm. We also examine how bone testing in general affects patients' and physicians' behaviors and treatment decisions.
The literature describing the performance of bone measurement tests to predict fractures is extensive. We focused our review on a meta-analysis published in 1996 that provided a summary of older studies, and on prospective cohort studies that were not included in the meta-analysis or that were published since 1996. We did not review case-control studies because previous systematic reviews found that the results of case-control studies of fracture risk vary widely49, and that at least some of the variation is related to the method used to select cases and controls.252 Case-control studies do indicate whether, on average, mean values of bone density measured by a particular technique differ for patients who have and have not had a fracture, but they do not measure how well the test discriminates between high-risk and low-risk individuals.
The meta-analysis49 that assessed 23 publications from 11 separate prospective cohort studies published before 1996 pooled studies to estimate the age-adjusted relative risk of various types of fractures for a one standard deviation decrease in bone density. Nearly all available data included in these studies were from women in their late 60s or older and all tests used densitometry techniques. Data from these studies were insufficient to estimate age-specific relative risks or to assess the relationship between length of followup and relative risk.
Results of the meta-analysis indicated that measurements made at the hip predicted hip fracture better than measurements made at other sites.44,284,285 For bone density measurements made at the femoral neck, the pooled relative risk per one standard deviation decrease in bone density was 2.6 (CI 2.0-3.5).
This estimate was based almost entirely on one high-quality report from the SOF cohort using data from 8,134 women over 65 years of age after 1.8 years of followup.44 In that study, the relative risk of hip fracture per one standard deviation decrease was 2.7 (CI 2.0-3.6) for measurements made at the proximal femur, versus 2.0 (CI 1.5-2.7) for measurements made at the heel, 1.6 (CI 1.2-2.1) for the distal radius, and 1.6 (CI 1.2-2.2) for the lumbar spine. Because of the clinical importance of hip fracture, these findings established DXA of the hip and, in particular, the femoral neck as the standard test for measuring bone density in epidemiologic studies.
For measures made in the forearm (proximal radius), the pooled relative risk for one standard deviation decrease in bone density was 2.1 (CI 1.6-2.7) for hip fractures.189,286-289 The individual studies contributing to the estimate of the relative risk of forearm bone density for hip fracture included a series of 135 nursing home residents (RR 1.9; CI 1.4-2.7), a population-based study of 1,076 postmenopausal women (RR 2.5; CI 1.3-4.8), and a report of 9,704 subjects in the SOF cohort (RR 1.4; CI 1.1-1.8).
For studies using densitometry methods, calcaneal bone density was associated with forearm (RR 1.6; CI 1.4-1.8), hip (RR 2.0; CI 1.7-2.4), and vertebral fractures49 (RR 2.4; CI 1.8-3.2). In two reports from the SOF cohort, the relative risk of heel bone density for hip fracture was 2.0 (CI 1.5-2.7) and 2.7 (CI 1.8-4.0). No other cohort study reported about prediction of hip fracture from calcaneal densitometry measurement.
For measurement of bone density at the forearm, the relative risk was 2.2 (CI 1.7-2.6) for vertebral fractures, and 1.5 (CI 1.3-1.6) for fractures at all sites. For forearm fractures, measurement at the proximal radius (RR 1.8; CI 1.5-2.1) and distal radius (RR 1.7; CI 1.4-2.0) were slightly better than measurement at the heel (RR 1.6; CI 1.4-1.8), lumbar spine (RR 1.5; CI 1.3-1.8), and hip (RR 1.4; CI 1.4-1.6). For vertebral fractures, measurement at the lumbar spine (RR 2.3; CI 1.9-2.8), heel (RR 2.4; CI 1.8-3.2), and proximal radius (RR 2.2; CI 1.7-2.6) had similar results.
| Study | Population/setting | Exclusions from population | N | Age | Proportion of original cohort in sample | Site, machine, and measure |
|---|---|---|---|---|---|---|
| Study of Osteoporotic Fractures (SOF)44,285,290,297 | Community-dwelling white women from four U.S. areas recruited from lists (e.g., jury selection, drivers' licenses, HMO memberships) | Black women | 8,134 | >65 years | 97% returned at 5 years | Heel QUS: UBA 575; BUA |
| Women unable to walk without assistance, or with bilateral hip replacements | 72% of original cohort known to be alive | Spine, Hip DXA: Hologic QDR 1000; BMD, BMC | ||||
| Fracture followup 99% complete | ||||||
| Epidemiologie de l'Osteoporose (EPIDOS)292-294,298 | Women from five cities in France recruited from voting lists and health insurance companies | Black women | 7,575 | >75 years | 2% lost to followup | Heel QUS: Lunar Achilles; BUA, SOS |
| Women with bilateral hip replacement, previous hip fracture, Paget's disease of bone, malignant bone disease, renal failure, hyperthyroidism, or treated hypothyroidism | Hip DXA: Lunar DPX Plus; BMD | |||||
| Whole body DXA: Lunar DPX Plus; BMD, BMC | ||||||
| Hawaii Osteoporosis Study (HOS)45 | Women of Japanese ancestry, recruited from a 30% random sample of wives of participants in the Honolulu Heart Program | N/A | 560 | Mean 73.7 years | Not reported | Heel QUS: Walker-Sonix; BUA |
| Hand absorptiometry: OsteoGram Phalangeal; BMD Bonalyzer Metacarpal; BMD | ||||||
| Diagnostisch Onderzoek Mammacarcinoom (DOM)300 | Ongoing screening program for breast cancer (every woman born between 1911 and 1941 living in the vicinity of Utrecht, the Netherlands) | None | 440 | Mean 57 years | 82% participation | Hand quantitative microdensitometry: machine not specified |
| 10 women for each age year from DOM were asked to participate in the osteoporosis study | ||||||
| Osteoporosis Risk Factor and Prevention Study (OSTPRE)295 | Random stratified sample of 14,220 women in Kuopio Province, Finland | Women with spinal osteophytes or deformations, hip deformations, or incorrect measures were excluded after densitometry | 3,222 | Mean 53.4 years | Of 3,222 undergoing densitometry, 97.5% followup | Hip, spine DXA: Lunar DPX, BMD |
| Aberdeen - Porter57 | Women aged >70 years in residential care within the catchment areas of Doncaster and Hull Royal Infirmaries, Aberdeen | Women not able to walk with assistance and those with bilateral hip surgery | 1,414 | >70 years | 100% | Heel QUS: Osteosonics Ultrasonic bone analyzer 1001; BUA |
| Mean 84 years | ||||||
| Aberdeen - Stewart299 | Women from 45 to 49 years old recruited from population-based register within 20 mile radius of Aberdeen | Not specified | 790 | 45-49 years | 79% of 1,000 consenting included | Heel QUS: Walker-Sonix UBA 575; BUA |
| Hip, spine DXA: Norland XR-26; BMD, BMC, or both (not reported) | ||||||
| Rotterdam291 | People >55 years living in Ommoord (Rotterdam) | None | 3,078 women | >55 years | Complete followup on 7,046 (88%); 4,268 women | Hip DXA: Lunar DPX-L; BMD |
| Complete data on 5,304 (66%); 3,078 women | ||||||
| Modena296 | Consecutive admissions over 8 months to Department of Orthopedic Surgery, Modena, Italy | Patients with diseases of bone metabolism, rheumatoid arthritis, under treatment with specified drugs, or HRT were excluded | 211 | Mean 52 years | 84% (211/250) | Hand QUS: Sonic 1200; SOS of phalanx |
BMC = bone mineral content; BMD = bone mineral density; BUA = broadband ultrasound attenuation; DPX = dual-photon X-ray; DXA = dual-energy X-ray absorptiometry; HMO = health maintenance organization; HRT = hormone replacement therapy; QDR = quantitative digital radiography; QUS = quantitative ultrasound; UBA = ultrasound bone analyzer
| Cohort | Publication | Grade | ||
|---|---|---|---|---|
| EPIDOS | ||||
| Good | Fair | Poor | ||
| Duboeuf (1997)292 | X | |||
| Schott (1998)298 | X | |||
| Garnero (1998)293 | Xa | |||
| Hans (1996)294 | Xa | |||
| SOF | ||||
| Black (1992)285 | X | |||
| Cummings (1993)44 | X | |||
| Nevitt (1994)297 | X | |||
| Bauer (1997)290 | Xb | |||
| Porter (1990)57 | X | |||
| de Laet (1998)291 | Xc | |||
| Huang (1998)45 | Xd | |||
| Kroger (1995)295 | Xd | |||
| Mele (1997)296 | Xd | |||
| Stewart (1996)299 | Xe | |||
| Vecht-Hart (1997)300 | Xe |
75% of initial cohort received all tests used in the analyses, and differences between those included and not included were not reported.
64% of initial cohort received all tests used in the analyses.
48% of initial cohort had followup measures used in the analyses.
Self-reported fractures confirmed from medical records and radiographs.
Some or all of self-reported fracture outcomes were not verified.
EPIDOS = Epidemiologie de L'Osteoporose; SOF = Study of Osteoporotic Fractures
Publications from five cohorts presented data about the ability of hip DXA measures to predict subsequent fractures in different age groups.44,285,290-295,297-299
DXA of the hip predicted hip fracture best in the SOF and Rotterdam studies (RR per one standard deviation decrease 2.6 [CI 1.9-3.8] and 2.5 [CI 1.8-3.6]). Prediction was not as good in women over 75 years of age. In the EPIDOS cohort, the overall relative risk was 1.9 (CI 1.6-2.4), and in the subgroup of women over 85 years of age, the relative risk was 1.6 (CI 1.3-2.2). In the SOF cohort, the relative risk was higher for women age 65 to 79 (RR 2.9; CI 2.2-3.9) than for those over 80 years of age (RR 2.1; CI 1.4-3.2).
Two recent studies included younger, perimenopausal women between the ages of 45 and 56, but these subjects had too few hip fractures to estimate the relative risk. The adjusted relative risk for all nonspine fractures was 1.39 (CI 1.2-1.6) in one study295 and 1.4 (CI 1.3-2.4) in the other,299 which is comparable to the relative risks for all nonspine fractures in older women.
The method used in the Rotterdam study to identify categories of risk resulted in the strongest association between the category and actual incidence of fracture.291 These investigators used a regression equation, derived from a previous study, to classify women as "high risk," "moderate risk," or "low risk" for fracture, based on age and the result of the DXA of the femoral neck.301 Over 3.8 years of followup, the probability of hip fracture was 0.7 percent in the low risk group versus 6.9 percent in the high risk group. Other cohort studies stratified by age to achieve a similar level of discrimination. The absolute differences in risk increase with the baseline probability, so these differences are larger in older subjects, even though the relative risk in these cohorts may be lower.
The least discrimination occurred in the study of the youngest cohort of women (age 45 to 49 years), which reported results for all nonspine fractures.299 The 2-year probability of fracture in the group with high bone density was 2.4 percent versus 1.7 percent in the group with low bone density. The low difference (7 per 1,000) reflects the fact that bone density testing provides less information about the short-term risk of fracture in younger, lower risk individuals.2
| QUS cut point | Ultrasound result b | DXA of the hip result c | Probability of fracture |
|---|---|---|---|
| 85 | high | high | .0220 (35/1594) |
| high | low | .0053 (19/3613) | |
| low | low | 0 (0/694) | |
| low | high | 0 (0/92) | |
| 80 | high | high | .0227 (35/1541) |
| high | low | .0055 (18/3271) | |
| low | low | .0010 (1/1036) | |
| low | high | 0 (0/145) | |
| 75 | high | high | .0233 (34/1458) |
| high | low | .0050 (14/2814) | |
| low | low | .0013 (5/1493) | |
| low | high | .0044 (1/228) |
Bauer DC435
A QUS result lower than the cut point in the first column is classified as "high risk," and a QUS result higher than the cut point is classified as "low risk."
Dual-energy X-ray absorptiometry of the femoral neck using a T-score = -2.5 as the cutoff value for "high" versus "low" risk.
DXA = Dual-energy X-ray absorptiometry; QUS = Quantitative ultrasound
These subsets of the SOF excluded women who had a fracture before QUS measurements were done. This bias is potentially important because women who had fractures early in the study may have differed in risk factors and in the predictability of their fractures from women who fractured later.
In the second study, from the EPIDOS cohort,294 the relative risk for hip fracture per standard deviation unit was 2.0 (CI 1.6-2.4) for QUS versus 1.9 (CI 1.6-2.4) for DXA of the hip. In the third study,299 the relative risks for DXA of the femoral neck and QUS for all nonspine fractures were nearly equal (1.4; CI 1.3-2.4 versus 1.43; CI 1.2-2.4). Absolute risk differences for QUS ranged from 0.007 to 0.017, compared with 0.007 to 0.025 for DXA of the hip. The number of women with low bone density who subsequently had any nonspine fracture was similar for QUS (between 2 and 8 in 100) and hip DXA (between 2 and 7 in 100), as was the number of women with normal bone density who had a fracture. Three studies45,296,299 included vertebral fractures in their evaluation of the association of QUS and fracture risk. The odds ratio for a vertebral fracture, adjusted for age and calculated for one standard deviation decrease in calcaneal BUA, was 1.50 (CI 1.05-2.16) in one study45 in which 39 patients of a sample of 560 sustained a vertebral fracture. For the other two studies, no vertebral fractures occurred in the samples.
In elderly women, lumbar spine densitometry can have false-negative results because of arthritis of the spine.302,303 In the two studies for which there were available data to perform the calculations,44,299 between 3 and 8 women in every 100 who had low test results for DXA of the spine subsequently suffered a fracture. In 100 women who had a test result above the cutoff values used in these studies, up to 5 patients suffered a fracture.
One report from the EPIDOS study evaluated the association of DXA measures of the whole body with subsequent hip fractures.298 The relative risk for hip fracture was 1.6 (CI 1.2-2.0), versus 1.9 (CI 1.5-2.3) for hip DXA. Patients who had a T-score under −2.5 had a hip fracture risk of 2.3 percent, while those below this threshold had a risk of 1.4 percent. One case series of 211 patients in an orthopedic clinic examined the ability of QUS of the hand to predict nonspine fractures.296
Among different bone measurement tests at various sites, bone density measured at the femoral neck by DXA is the best predictor of hip fracture and is comparable to forearm measurements for predicting fracture at other sites.
In recent prospective studies, QUS measured at the heel predicted hip fracture and all nonspine fractures as well or nearly as well as DXA measured at the femoral neck. For both tests, a result in the osteoporotic range is associated with an increased short-term probability of hip fracture. However, clinical trials of recent pharmacologic therapies have used a low hip DXA, rather than QUS, as a criterion for entry. Physicians who use QUS must consider whether the results of these trials are generalizable to patients identified by QUS to have a high risk of fracture.
QUS of the heel and DXA of the hip provide independent information about fracture risk. Individuals who have low scores by one of these tests, but not the other, have a greater risk of fracture than those who have higher scores by both tests, and a lower risk of fracture than those whose results on both tests are low.
Both of these tests predict hip fracture better than DXA of the lumbar spine. RA or QMD of the hand can predict the risk of nonspine fractures in general, many of which are in the forearm, but there are no recent data about the ability of hand measurement to predict hip fracture. Correlations between different bone measurement tests are generally too low to be accepted as evidence that one test will identify patients at similar risk to those identified by another test. While peripheral measures may approach hip DXA in predicting hip fracture, there are no recent prospective studies that directly compare prediction of hip fracture of these tests with DXA of the hip.
A few studies have examined how the use of different reference ranges, the use of reference ranges based on different young adult female samples, and the use of different brands or types of DXA machines affect the frequency of diagnosis of osteoporosis. When used in the same patients, DXA machines from different makers differ in the proportion of patients diagnosed to have osteoporosis by 6 percent to 15 percent.308-313
The precision of a test can affect its clinical utility for diagnosing a disease. For densitometry, results need to be precise enough to distinguish patients who have osteoporosis from those who do not. This means that variation between repeated measures should be low relative to the difference between values in osteoporotic and osteopenic or nonosteoporotic individuals.
The simplest evaluation of precision assesses agreement of repeated readings on a "phantom," which is a block of material with a known, stable amount of mineral.314,315 These repeated measures may be performed over hours, days, or years. In these in vitro studies of DXA at the hip and spine, the short-term and long-term coefficient of variation (CV) is less than 1.1 percent, meaning that the standard deviation of repeated DXA readings is 1.1 percent or less of the mean.316-320 In the NHANES III survey, three DXA machines from the same manufacturer were deployed in mobile units and used to make several thousand measurements over 3 years.321 The short-term precision for the three machines ranged from 1.4 to 2.1 percent at the femoral neck to 0.40 to 0.80 percent at the intertrochanter region, while the long-term precision was close to 0.50 percent for all sites. In similar studies, the in vitro precision of BUA of the heel was 0.8 to 1.2 percent.322
While the in vitro precision is frequently highlighted in review articles and information from manufacturers, operator skill and variability among subjects are more important than in vitro machine performance in obtaining reliable densitometry results.323 Measurement error is higher in elderly patients than in young women, and at the spine and greater trochanter than at the femoral neck.324
The expertise required to obtain reliable scans differs among bone density tests and is better characterized for older technologies and for axial and hip DXA than for peripheral sites and newer technologies. In the NHANES III study, for example, 3.5 percent of DXA scans were unreadable, mostly because patients moved during the examination. Another 33 percent of scans required reanalysis to apply quality standards concerning the location of the regions of interest.321
These reports reflect the results of using densitometry in highly controlled academic research centers, many of which have implemented quality control techniques that are more sensitive, but also more expensive and time-consuming, than those recommended by manufacturers.317,319,321 In contrast to most common laboratory and radiologic tests, the precision of densitometry tests in everyday practice (outside research protocols and major referral centers) has not been studied. Patient factors such as obesity, handedness, edema (for ultrasound),326-331 and osteoarthritis (for DXA of the spine)332 affect estimates of bone density.
The likelihood of being diagnosed with osteoporosis varies greatly, depending on the site and type of bone measurement test, the brand of densitometer, and the relevance of the reference range in the local population. The variation between techniques, along with the lack of methods to integrate bone density results with clinical predictors, makes it difficult for clinicians to provide accurate information to patients about their test results.
The likelihood of being diagnosed with osteoporosis also depends on the number of sites tested. Testing in the forearm, hip, spine, or heel will generally identify different groups of patients. A physician cannot say, based only on a forearm test, that the patient "does not have osteoporosis." Conversely, although the results of a test at any site are associated to some degree with fractures at other sites, the physician may not be able to assess whether the patient who has a low T-score on a hand or forearm test has significant bone loss at other sites.
Because of these limitations, some experts question whether there is added benefit to the patient from using T-scores to diagnose osteoporosis, as opposed to reporting test results as continuous values and assessing overall risk. Others propose that results using various techniques and sites be "calibrated" to the results of DXA of the hip and reported as a "T-score equivalent." The success of this approach may be limited by differences in precision and low correlation among different techniques.
Many clinicians believe that the results of bone measurement tests motivate patients to exercise, adhere to medication regimens, and change other behaviors to reduce their risk of fracture. Of 1,335 women from the SOF cohort who completed a questionnaire about estrogen therapy and were taking estrogen, 33.6 percent reported their primary reason was prevention or treatment of osteoporosis.352 Furthermore, test results appear to influence physicians' decisions to prescribe HRT.353 On the other hand, the psychological effect of receiving a result indicating osteoporosis could produce anxiety and perceived vulnerability354 that may be unwarranted. We reviewed four publications that address the influence of test results on women's decisions about treatment and health-related behaviors.355-358 One publication evaluated the effect of bone measurements on practice patterns of physicians and also evaluated their level of understanding the reports they receive.359
In a randomized trial of 141 women within 3 years of menopause who were referred from three private practices, those who underwent bone density testing were more likely to fill a prescription for HRT than women who received an educational message about osteoporosis (63.4 percent versus 20.0 percent, p <0.05).357
The results of the largest study, an uncontrolled case series, suggest that women make changes based upon testing, and that there is a relationship between test results and the degree and direction of changes.355 These investigators collected baseline data and presented a 12-minute educational video about bone density to 1,203 postmenopausal women over 50 years of age (mean 62 years) who were referred to an osteoporosis program at a women's health center and who received DXA measures of the hip and spine. Over 15 months of followup, HRT use decreased from 42 percent prior to the test to 13 percent in the group classified as normal (T-score above -2), from 46 to 33 percent in the group whose bone density was moderately decreased (T-score between -2 and -3), and there was no change for the severely low group (T-score less than -3; 46 percent before the test and 47 percent afterwards). For the subset of women who had never taken HRT (n=394), 5 percent, 32 percent, and 35 percent initiated this treatment for the normal, moderately low, and severely low groups, respectively. Calcium supplement use increased after densitometry in all three groups. Twenty percent of women with normal test results reported being afraid of falling, compared with 39 percent for the moderately low group and 58 percent for the severely low group.
In a similar but smaller study,356 37 postmenopausal women (average age 58.5 years) who were referred to an orthopedic clinic received an education kit and session, were tested for bone density with hip and spine DXA, and filled out a questionnaire asking under what circumstances they would initiate HRT. Women were categorized as "normal" (0.99 standard deviations below the mean average), "moderate" risk (1 to 1.99 standard deviations below average), "high" risk (2 to 2.99 standard deviations below average), and "very high" risk (3 or more standard deviations below average). Before receiving the results, 10 women said they wanted HRT, regardless of the outcome of the densitometry test. After the test results, four additional women requested HRT. A followup telephone survey conducted after 1 year (n=35) showed no difference between normal and low bone density groups in the proportion of women on HRT. However, seven women whose BMD test results were low were taking bisphosphonates at followup.
A survey of 261 women who received densitometry studied their perception of risk of fracture and degree of worry, and whether they made changes based on their test results.358 Of the 53 percent who reported that their test was below normal, virtually all of them (94 percent) reported initiating preventive measures. Remarkably, 56 percent of those who reported their test was normal also initiated prevention measures. Women who reported below normal test results were more likely to start HRT than those with normal or above results. Of the entire group, 24 percent who reported below normal results said they began limiting activities to avoid falling; for women 65 years and older, this proportion rose to 31 percent. About one-third of the sample (86 women) had a second bone measurement test. About 26 women reported losing bone at an accelerated rate; of these, 22 initiated additional prevention.
One randomized trial examined the effect of densitometry on the practice patterns of 57 primary care physicians who ordered DXA tests for their patients, and also their understanding of the reports they received.359 Physicians were randomly assigned either to a group that would receive their patients' results in the form of a short technical report, or to a group that would receive a long report written by an endocrinologist with access to clinical information. The outcome measures, obtained by telephone interview after each physician received at least two reports, were level of understanding of the reports and differences between groups in treatment decisions. Differences in understanding between the short- and long-form groups were statistically significant. For the group that received the short technical report, 3 percent understood the T- and Z-scores, 30 percent understood the role of bone density in identifying osteoporosis, 12 percent understood the relationship between bone density and risk of fracture, and 36 percent thought the report was confusing. This compares with 32 percent (p=0.005), 86 percent (p=0.001), 45 percent (p=0.014), and 1 percent (p=0.003), respectively, for the group that received the long report. Overall, the groups did not differ significantly in referrals to a specialist, additional testing, or changes in pharmacological treatments for osteoporosis. However, in a subgroup analysis, gynecologists who received the long report were more likely to prescribe changes in pharmacological treatments for osteoporosis (61 percent versus 19 percent).
One randomized trial suggests that women who undergo densitometry are more likely to start HRT than women who do not. In a randomized trial and a large uncontrolled case series, women who had densitometry and were told they had osteoporosis were more likely to start or continue HRT than women who were told they had normal bone density.
In another randomized trial, physicians found densitometry reports confusing and were not confident that their interpretations of T-scores were correct. While overall the reporting format did not influence treatment decisions, gynecologists who better understood the report were more likely to prescribe changes in HRT than those who considered the report confusing. These results suggest that the behaviors of both patients and physicians are influenced by the results of bone measurement tests.
To be effective in monitoring, a test needs to be sufficiently accurate to detect differences in bone density over time. These differences must predict fracture risk, and changes in therapy made because of these differences must reduce the risk of fracture. To determine if bone measurement tests are effective for monitoring, we reviewed studies related to:
The reliability of densitometry in detecting changes in bone density in patients on therapy for osteoporosis.
How well changes in densitometry predict fracture outcomes in patients on therapy.
How well an "adequate" response to therapy can be distinguished from an "inadequate" response.
If adjustment of therapy based on changes in bone density leads to reduction in fracture risk.
| Precision error | Estimated bone loss | Difference in measurements b | Time frame for a reliable bone mass measurement followup |
|---|---|---|---|
| CV% | % | % | Years |
| 1 | 1 | 2.77 | 2.77 |
| 1 | 3 | 2.77 | 0.92 |
| 2 | 1 | 5.54 | 5.54 |
| 2 | 3 | 5.54 | 1.85 |
| 3 | 1 | 8.32 | 8.32 |
| 3 | 3 | 8.32 | 2.77 |
| 4 | 1 | 11.08 | 11.08 |
| 4 | 3 | 11.08 | 3.70 |
| 5 | 1 | 13.30 | 13.30 |
| 5 | 3 | 13.30 | 4.43 |
| 6 | 1 | 16.63 | 16.63 |
| 6 | 3 | 16.63 | 5.54 |
Agency for Health Care Policy and Research436
Two scans (measurements) would have to differ by more than this amount to be confident that a real change had occurred with 95% confidence that the detected losses are real.
CV = Coefficient of variation
Bone density tests are not precise enough to reliably measure short-term bone loss in postmenopausal women who are not on treatment for osteoporosis. In population-based studies, postmenopausal women lose bone (femoral neck) at a rate of 0.45 to 0.6 percent per year.302,360-362 In one of these studies, which stratified by age, the rate was 0.6 percent per year in women age 60 to 69, 1.1 percent per year in women age 70 to 79, and 2.1 percent per year in women over 80 years of age.360 Higher rates of bone loss have been reported in case series and in the control groups of therapeutic trials performed in patients recruited from specialty referral clinics.363,364
In randomized trials of treatment for osteoporosis, the relationship between changes in bone density and fracture rates is inconsistent.365 For some therapies, however, such as fluoride, increases in bone density have been associated with unchanged or even increased fragility. Other treatments, such as osteocalcin, reduced fracture rates without an appreciable effect on bone density. Because of these results, there is a consensus that while reduction or reversal of bone loss is an important intermediate outcome, it is not an adequate substitute for direct evidence that a treatment prevents fractures.8
One report from the FIT showed that, on average, women who had larger increases (3 percent or more) in total hip bone density after taking alendronate for 2 years had a lower risk of vertebral fracture than women who had unchanged or lost bone density.366 The differences between patients who had a small (3 percent or less) increase and those who lost bone were not statistically significant, and the results did not change when the analysis was limited to patients with high adherence, which was measured by pill counts. These findings do not support the view that monitoring bone density changes can help detect poor compliance.
As the authors pointed out, the findings "do not necessarily suggest that measurement of change in bone density in individual patients can predict their specific risk for vertebral fractures."366 Most importantly, the analysis did not address whether the women who lost bone or had small increases in bone density benefited less from therapy than women who had a greater response. These women may have been at higher risk of fracture in the first place, and might have had even worse results had they been taking placebo. At present there is no way to know whether increasing the dosage of the medication might improve outcomes.
High-quality, indirect evidence about the potential impact of adjusting therapy based on the results of bone density tests comes from a very recent report of the FIT and Multiple Outcomes of Raloxifene Evaluation Trial.367 Patients in the treatment arms of the two trials (n=6,588) had DXA of the hip and spine at baseline and at 12 and 24 months. During the first year of alendronate therapy, a small group of patients (1.4 percent) appeared to have a decline of 4 percent or more in bone density (these were the worst responders). These patients had a 92 percent chance of gaining bone density in the second year of treatment, and their average gain (4.8 percent) was higher than that of patients who had seemed to respond to alendronate in the first year. Conversely, patients who gained the most bone density in the first year (8 percent or more) lost bone density on average in the second year of treatment. A similar pattern was observed for raloxifene. The investigators attributed this pattern to the statistical phenomena known as "regression to the mean."367
Currently, the weight of evidence is against repeating bone density tests within the first year of treatment. There is insufficient evidence to determine whether repeating bone mineral density tests 2 years after starting therapy is useful. There are no studies about the effect of monitoring responses to therapy using densitometry, or of the choice of test, on the outcome of therapy.
It is widely believed that both bone density and the rate of continuing bone loss are independent risk factors for fractures. Bone loss occurs when bone resorption outpaces bone formation during bone remodeling. Markers of bone turnover are biochemical substances found in the blood or urine that reflect this process. Although resorption and formation are coupled, in postmenopausal women, high rates of bone turnover due to estrogen depletion result in net bone loss. Broader use of markers has been advocated to identify women at high risk for osteoporosis or women who are not improving on treatment. Such identification could lead to earlier initiation or adjustment of therapy and improved outcomes. The rationale for these uses of markers is that changes in levels of biochemical markers in response to changes in hormonal milieu or antiresorptive therapy are generally seen before changes in bone density can be detected.368
| Marker | Abbreviation | Collection | Assay |
|---|---|---|---|
| Formation markers | |||
| Total alkaline phosphatase | ALP | Serum | IRMA |
| Bone-specific alkaline phosphatase | BALP | Serum | IRMA, ELISA |
| Osteocalcin, bone gla-protein | OC | Serum | RIA, ELISA |
| Carboxyterminal propeptide of type I procollagen | PICP | Serum | RIA |
| Carboxy propeptide of type I collagen | ICTP | Serum | RIA |
| Resorption markers | |||
| Hydroxyproline | HPR | Urine | |
| Calcium | CA | Urine | |
| Pyridinoline | PYR | Urine | EIA |
| Deoxypyridinoline | D-PYR | Urine | EIA, RIA |
| Type I collagen cross-linked C-telopeptide | CTX | Urine/Serum | ELISA |
| Type I collagen cross-linked N-telopeptide | NTX | Urine | ELISA |
| Tartrate-resistant acid phosphatase | TRAP | Serum | |
Markers of bone resorption include urinary hydroxyproline, urinary calcium, tartrate-resistant acid phosphatase (TRAP), the collagen cross-links urinary pyridinoline (PYR) and urinary deoxypyridinoline (D-PYR), carboxy propeptide of type I collagen (ICTP), type I collagen cross-linked C-telopeptides (CTX), and type I collagen cross-linked N-telopeptides (NTX). CTX and NTX can be measured in serum or urine, but the urine test is more widely used.370 According to Quest Diagnostics, Incorporated, the cost of marker tests ranges from about $75.00 for BALP, to $150.00 for serum NTX.
Most of these markers have come into use recently and are not yet widely used in clinical practice. In the MEDLINE search we conducted for this review, only 43 of 1,038 citations were published before 1990. Urinary D-PYR measured by immunoassay (Pyrilinks-D®), and urinary NTX immunoassay (Osteomark®) are the most widely available markers in U.S. clinical laboratories.370 Urinary CTX immunoassay (CrossLaps®), serum NTX immunoassay (Osteomark®), and urinary PYR immunoassay have the most limited availability.
Several factors should be taken into account when interpreting results of marker tests. Marker levels vary substantially over the course of a day and from day to day.371-373 For example, in one study of 14 women, the daily peak value for urinary NTX was 20% higher than the mean value over 24 hours.371 Collecting several specimens, and collecting them at specific times of the day, may give a more accurate picture of the level of bone turnover than single measurements. In addition, marker levels may take several months to stabilize after initiating treatment, and the rate of change may be an important indicator of the rate of turnover or rate of bone loss. In a study of women on alendronate treatment, osteocalcin had a smaller, gradual decrease over 12 months than NTX, which decreased about 60 percent at 6 months after start of treatment.374 Another study measured markers at 3 months, 6 months, and 12 months.375 Decreased D-PYR levels remained steady at 6 months, whereas bone density continued to increase.
The method by which markers are measured and defined also may affect results. Kits using different methods, and kits from different manufacturers, may have different analytic performance characteristics.376,377
We considered two populations of women for this review: those who have not been treated for osteoporosis, and those who are currently on treatment. Women not on treatment are those who may have bone loss but have never had their bone density measured, or who have a bone density that is known to be in the normal or osteopenic range. In theory, the use of markers in these women could lead to reduction in fractures by identifying women who are at high risk because of rapid bone loss. Treatment could be initiated to prevent osteoporosis by reversing or slowing bone loss. The second population of interest in this review is women undergoing treatment for osteoporosis. In theory, clinicians could use markers to identify women who were not responding to treatment. After identifying these "nonresponders," clinicians could decide to increase or change current treatment, potentially leading to decreased risk of fracture.
| Evidence Table | Description of patients | Condition | Definition of + Test | Definition of - Test | Definition of Sensitivity | Definition of Specificity |
|---|---|---|---|---|---|---|
| 10 | Not on treatment, with or without osteoporosis | Current osteo-porosis | Marker suggests osteoporosis (high turnover) | Marker suggests normal BMD (low or "normal" turnover) | Proportion of patients with osteoporosis who had a marker test indicating high turnover (+ test) | Proportion of patients with normal BMD who had a marker test indicating low/normal turnover (- test) |
| 12 | Not on treatment, with or without osteoporosis | Future fracture | High turnover | Low/normal turnover | Proportion of patients who fractured who had a marker test indicating high turnover (+ test) | Proportion of patients who did not fracture who had a marker test indicating low/normal turnover (- test) |
| 14 | Not on treatment, without osteoporosis | High rate of bone loss over time | High turnover | Low/normal turnover | Proportion of patients with a high rate of bone loss who had a marker test indicating high turnover (+ test) | Proportion of patients with a low rate of bone loss who had a marker test indicating low/normal turnover (- test) |
| 16 | On treatment, with osteoporosis | Poor response to treatment (high rate of bone loss over time) | High turnover | Low/normal turnover | Proportion of patients with a high rate of bone loss who had a marker test indicating high turnover (+ test) | Proportion of patients with a low rate of bone loss who had a marker test indicating low/normal turnover (- test) |
In the absence of clinical trials that compare a "marker-based" strategy to a conventional approach for selecting patients for treatment, we examined whether markers are correlated with bone loss and fractures. We also sought any evidence that marker results had been used in a systematic way to make decisions about treatment, or that these decisions lead to reduction in fracture risk. We examined the adverse effects of measuring markers and of using markers to guide treatment decisions.
Two good-quality studies380,382 measured urinary NTX and found that between 51 and 61 percent of women with osteoporosis had higher levels, and that 50 to 82 percent of those without osteoporosis had lower levels. Overall, only 22% of women with high levels of NTX had osteoporosis. High levels of BALP correctly identified 70% of patients with osteoporosis, but 42% of women without osteoporosis also had high levels, and overall only 21% of women with high levels had osteoporosis. The predictive values of osteocalcin and PICP levels (15%-23%) were similar to those for NTX and BALP, while those for PYR and D-PYR levels were worse (12%-13%).380
The other three studies were small (n= 34 to 78) and did not reflect the mix of patients seen in general clinical practice.378,379,381 Across all studies, for all markers measured, to correctly identify 80 percent of women with low bone density, fewer than 30 percent of those withnormal bone density would be classified correctly. In one small study in which women with osteoporosis were identified before enrollment, CTX was more accurate. Seventy-five percent of women with osteoporosis had an elevated CTX level, and 85 percent of normal controls had lower CTX levels.379 At this cutoff, 25 percent of women with osteoporosis would be incorrectly classified as having normal bone density. Lowering the cutoff for a positive test to result in no false-negative results would lead to 73 percent of women with normal bone density to be classified as osteoporotic.
No single marker or cluster of markers accurately identified individuals who had osteoporosis, as determined by the results of densitometry. It is not surprising that agreement between the two tests was poor. Densitometry measures current bone status, whereas markers measure the process of bone turnover and may indicate the beginning of bone deterioration that would not be detected by densitometry for several more years.
In a large, good-quality, population-based study of 305 Swedish women aged 40 to 80, those with decreased levels (1 standard deviation decrease) of ICTP were 1.9 times more likely to have a fracture of any type 5 years later, and those with decreased levels (1 standard deviation decrease) of PICP were 1.8 times more likely to fracture.384 This study sample was randomly selected from population files of one city, making it similar to a general clinical population. However, there was a 63 percent participation rate, suggesting that the sample may not be representative.
In the EPIDOS cohort,386 women with levels of CTX 2 standard deviations above the premenopausal mean were 2.2 times more likely to have a hip fracture within 22 months, and those with free D-PYR above this range were 1.9 times more likely to fracture. Other markers measured (BALP, osteocalcin, NTX) were not significant predictors of fracture.
In the Rotterdam study, increased levels of free, but not total, PYR and
D-PYR
were associated with increased risk of hip fracture.387 Women with higher levels of free D-PYR were 1.4 times more likely to
fracture at a mean follow-up of 22 months. BALP and NTX were not predictive
of fracture in this and another study.386 An increased level of ALP was associated with risk of fracture in one
of two studies.385
In the Rotterdam study, women with osteocalcin levels above the median were 3.1 times more likely to have a hip fracture at a median follow-up of 2.4 years, but this difference was not statistically significant (95 percent CI 1.0-9.2). In both reports from EPIDOS, there was no association between elevated total osteocalcin levels and the subsequent risk of fracture.384,386 However, increased levels of undercarboxylated osteocalcin, were associated with an increased risk of hip fracture (OR = 2.0, 95 percent CI = 1.2-3.2).376 There was also a strong association between increased levels of undercarboxylated osteocalcin and fracture in a group of elderly women living in a nursing facility who had vitamin D deficiency.383 Those with increased levels of undercarboxylated osteocalcin at baseline were 5.9 times more likely to sustain a hip or other non-vertebral fracture within 18 months. While the results of these two analyses need to be confirmed in other prospective studies, they have generated intense interest. Investigators are examining whether these results reflect decreased intake or deficiency of vitamin K, which mediates carboxylation of osteocalcin.
Results of well-done studies in which markers were combined with other risk factors to enhance prediction of fracture are inconsistent. One good-quality study found a greater risk of fracture in women with both a low bone density and either high CTX or high free D-PYR levels than in women with either risk factor alone.386 Other studies did not find the same increased predictive value when bone density and markers were combined. In the Rotterdam study, adjustment for bone density did not affect risk estimates for the markers, but adjustment for disability (ability to rise, walk, bend) did.387 According to a Hawaiian study, adjustment for baseline calcaneous bone density did not affect the relationship between marker levels and risk of vertebral fracture at three year follow-up.385 In the Swedish study, combining markers with bone density did not increase ability to predict fracture.384
In a nested case-control analysis of 109 elderly women (more than 74 years old) with hip fractures and 292 without fractures in the EPIDOS study, CTX and D-PYR had similar accuracy to predict hip fracture at a mean follow up period of 22 months.386 Garnero defined a "positive" test for CTX or D-PYR as one that was two standard deviations or more above the mean for premenopausal women. Of the women who had a fracture within 22 months of follow up, 36% had a CTX above this value, as did 19% of those who did not have a fracture. For D-PYR, 36% of the women who had a fracture had a "positive" test result, as did 19% of those who did not have a fracture. In the study of institutionalized women with vitamin D deficiency,383 53 percent of those who fractured had an elevated undercarboxylated osteocalcin level at baseline, and 79 percent of those without an elevated level did not fracture. Almost half of the women who fractured were not identified as at-risk by their marker result. Follow-up time was short in both of these studies (22 months in EPIDOS and 18 months in the institutionalized women with vitamin D deficiency); accuracy of the tests might have been different with a longer period of observation.
No marker was associated with increased fracture risk consistently across all studies. The EPIDOS study386 provides evidence that using markers in conjunction with densitometry may increase predictability, but this result has not been confirmed in other studies.
If markers can accurately predict which patients are at risk for rapid bone loss, they could help clinicians select patients who would benefit most from treatment, potentially leading to initiation of treatment in time to prevent further bone loss and eventual fracture. However, we identified no studies in which treatment decisions were made on the basis of marker test results.
In this multicenter study of 295 women aged 67 to 89 participating in the Study of Osteoporotic Fractures,397 osteocalcin, PYR, NTX, and CTX levels measured at baseline correlated significantly with annual percent change in bone density by hip DXA measured an average of 3.8 years later. BALP, however, was not significantly correlated with DXA.
In the OFELY study, a large, population-based study of determinants of bone loss in postmenopausal women in France,399 osteocalcin, NTX, and CTX were significantly associated with forearm bone loss. This study had the longest follow-up period, 4 years.
There was no clear trend across all the studies regarding the correlation between marker results and bone loss. It is difficult to compare these studies because of differences in populations, markers measured, and site of bone density measures. For example, one of six studies of ALP,392 one of three studies of BALP,393 and one of five studies of urinary calcium found a significant correlation with bone loss.395 Two of six studies found a significant correlation393,397 between hip DXA and OC. Six studies measured HPR, and two found a significant correlation.389,393
When a regression equation with osteocalcin, ALP, CA, and HPR was used to predict bone loss at different sites, these markers were correlated with bone density at the proximal forearm and ultradistal forearm, but not at the spine or total body.398
In the remaining studies, markers had high sensitivity and low specificity for predicting which women would lose bone rapidly,395 or had poor sensitivity.392 These studies were small and were conducted at single institutions. One study394 measured the accuracy of six markers to predict bone loss by spine and hip densitometry 4 years later in 141 volunteers participating in an ovarian screening program in the United Kingdom. Sensitivity of osteocalcin, ALP, CA, HPR, PYR, and D-PYR ranged from 33 to 58 percent, and specificity ranged from 50 to 72 percent.
Three longitudinal studies provided information that made it possible to calculate test characteristics when a combination of two or more markers and/or other risk factors was used to predict bone loss.388,390,393 Predictive value was improved when results of several markers were combined, or when markers were combined with risk factors such as initial bone density,390 body mass index,393 fat mass,388 or calcium intake.393
A cross-sectional study of 320 teachers in one Italian town400 examined the relationship of markers to other risk factors for osteoporosis. Only age and time since menopause were significantly associated with PYR; age, number of cigarettes smoked per day, and body mass index were associated with serum osteocalcin. There was no relationship between these markers and bone density measured at the distal forearm. In another study of 249 premenopausal volunteers,401 osteocalcin and PYR were not associated with height, weight, calcium intake, vitamin D supplementation, caffeine intake, or bone density measured at the lumbar spine, distal forearm, and proximal femur.
Studies correlating marker results and bone loss indicated no clear trend. Sensitivity and specificity were too low to be useful for the purpose of selecting patients for treatment. Some studies found better test accuracy when a combination of two or more markers and/or other risk factors was used to predict bone loss.
There is no direct evidence about whether the use of markers to monitor treatment in women with osteoporosis leads to a reduction in fracture risk. The best evidence for this would be a trial in which women on treatment are randomized to monitoring with markers or usual care, and then prospectively followed for incidence of fracture. We found no studies that used this design. Also, no studies measured outcomes, either fracture or bone density, after changes were made in treatment based on the results of marker tests.
Some researchers have advocated using markers as a way of monitoring compliance with treatment. We identified no studies of the use of markers to increase compliance. It is not clear whether this is a more effective method than directly asking the patient about compliance, side effects, or difficulty with following the regimen.
Evidence Tables 15 and 16 describe studies in which markers were used to monitor or predict response to therapy in women already on treatment for osteoporosis.
In four studies comparing levels of BALP to spine bone density performed 12 to 30 months later, correlation coefficients ranged from -0.06 to -0.67.282,402,405,374 The relationship was statistically significant in three studies. Hip DXA, measured in two studies, was not significantly associated with BALP, and total body and wrist bone density were not significantly associated with BALP.
One longitudinal study examined the ability of levels of markers to predict fractures in women taking nasal calcitonin. In this study, 208 women aged 68 to 72 were followed over 2 years.406 CTX and other markers were measured at baseline and 9 months. In 14 women who sustained a fracture at any site, CTX levels remained unchanged from baseline, while in 150 women who did not fracture, CTX decreased an average of 30 percent (range -75.1 to 44.7 percent, p = .002).
Although some marker results correlate significantly with densitometry, the size of this association in many cases is small, suggesting that markers have limited ability alone to identify nonresponders to treatment. The largest study (n= 794), a multicenter study covering four clinical centers, had correlation coefficients that were statistically significant, although small.374 In studies of mean group values, markers, especially osteocalcin, are more closely correlated with spine DXA than with DXA at other sites.
One potential use of markers is to rapidly identify women who are not responding to treatment, and who therefore would benefit by adjustment of therapy. A positive marker test in this case would identify a woman whose bone density is not expected to improve, that is, a "nonresponder."
In one study,374 103 (13 percent) of 794 alendronate users had no increase in spine bone density after 2 years of treatment (i.e., were nonresponders). Of these, NTX after 6 months of treatment was negative (at least 40 percent lower) in 55 patients; we defined these results as "false negatives". Among 151 patients in whom the change in NTX was "positive", 47 (32 percent) proved to be nonresponders (positive predictive value = 0.32). In other words, if the change in NTX suggested nonresponse, the probability of nonresponse increased from 13 percent to 32 percent. Positive predictive values in the other studies ranged from 0.27 to 0.92.
Three studies measured osteocalcin levels.374,402,405 In a study in which 13 of 51 women did not respond to treatment, intact osteocalcin identified 92 percent of the nonresponders and 76 percent of responders at a less than 20 percent decrease cutoff level.402 At a less than 10 percent cutoff, 77 percent of nonresponders and 84 percent of responders were identified. The other two studies, measuring total osteocalcin, identified 54 percent of nonresponders when 78 percent of responders were correctly identified,374 and 57 percent of nonresponders when 98 percent of the responders were correctly identified405 (the proportion of women who were nonresponders is not given in these studies). The false negative rates were 43 to 46 percent; about half of all women who were in fact responding to treatment would be classified as nonresponders, potentially leading to unnecessary adjustment in treatment.
A study of 63 women at one university hospital in Israel416 found a 40 percent decrease in BALP at 6 months had 56 percent sensitivity, 83 percent specificity, 95 percent positive predictive value, and 25 percent negative predictive value to predict spine bone density gain after 2 years of hormone replacement therapy. There was no association between change in hip bone density and percent of decrease in BALP.
Two recent studies also examined the ability of formation and resorption markers to predict response to hormone replacement therapy.415,417 In one,415 388 postmenopausal women participating in 2 clinical trials of transdermal estradiol were followed for changes in spine bone density at 2 years' follow-up. BALP, osteocalcin, and serum and urinary CTX measured at 3 and 6 months were correlated with bone density at 2 years. To determine the accuracy of these markers to predict therapeutic response in individuals, sensitivity was calculated using a cutoff at which no more than 10% of the nonresponders (women with a bone density increase of less than 2.26 percent) would be incorrectly classified. Eighty-nine women who had a change in bone density within the precision error of the technique were excluded from the analysis. At 3 months, osteocalcin and BALP were inaccurate, but improved at 6 months to about 30 to 40 percent sensitivity to identify responders. Performance of CTX was better at both 3 and 6 months (52 to 56 percent sensitivity). Repeated measures at 6 months did not improve accuracy of CTX substantially over levels measured at 3 months.
In another recent study,417 serum and urinary CTX also performed better than the formation markers BALP and osteocalcin in predicting 3-year response to estradiol as measured by hip bone density. When cutoff values for markers were used that kept the false-positive rates low (i.e., only one woman whose marker result decreased but did lose bone density), sensitivity was low for osteocalcin and BALP (36 to 50 percent), but better for serum and urinary CTX (56 to 67 percent).
False-positive and false-negative tests occurred at a high rate in these studies. At cutoff levels for markers giving acceptable false-positive rates, false-negatives are very high, and vice versa. A false-positive test could lead to an unnecessary increase in dosage or change in drug choice when the treatment was in fact effective. A false-negative test could delay adjustment to more effective treatment until more accurate measures indicated nonresponse to treatment. This could in turn lead to increased risk of fracture.
There is a small correlation between response to therapy as measured by densitometry and marker results, but in individuals, no marker is accurate enough to reliably identify nonresponders to treatment as defined by subsequent BMD tests. The best results, from the EPIDOS study, have not been replicated in other studies. The use of BMD at 2 years as the gold standard for measuring response to therapy has been called into question by a recent study,367 which found that BMD at 2 years is not a reliable predictor of subsequent response to therapy.
Osteoporosis can be a sign of systemic disease such as hyperparathyroidism, multiple myeloma, vitamin D deficiency, hyperthyroidism, liver disease, renal failure, and rare disorders such as Menkes syndrome, Marfan syndrome, Ehlers-Danlos syndrome, and homocystinuria due to cystathione deficiency, among others. While some factors, such as corticosteroid and excessive alcohol use, can be ascertained in a routine history and physical examination, diagnosis of many of the other secondary causes of osteoporosis requires laboratory testing.
We addressed the question of what is the appropriate evaluation for secondary causes for women with osteoporosis by conducting a search of the medical literature and a review of expert guidelines and recommendations. Because of the lack of evidence for this question, we also performed a supplemental analysis of physician practice patterns.
Our literature search found no data about the prevalence of secondary causes of osteoporosis in women found to have low bone density or an atraumatic fracture in population-based studies or in the primary care setting. There also is no evidence available to judge when a search for secondary causes is needed, which secondary causes should be sought, or the probability of a secondary cause in women diagnosed with osteoporosis.
Some literature, however, does report either bone density, measures of bone turnover, or the occurrence of osteoporosis in individuals with diseases or disorders known to be secondary causes of osteoporosis, such as hyperthyroidism, hyperparathyroidism, or Cushing's syndrome. These reports were not reviewed because they did not add information about the occurrence of the disease or disorder in women with osteoporosis, and because of methodologic limitations of these studies such as referral bias and small sample sizes. Several small case series report the occurrence of secondary osteoporosis in men referred with severe osteoporosis or vertebral fracture.418-420 The majority of men in these series (54 to 78 percent) were found to have secondary causes of osteoporosis, although it is unclear whether these diagnoses were preestablished or whether osteoporosis was diagnosed first and the secondary causes detected by a subsequent laboratory evaluation. The distribution of secondary causes attributable in women may be markedly different, however. Regardless, because these series include men from tertiary referral centers, they are unlikely to represent the larger community population with osteoporosis. Only a prospective, systematically evaluated cohort of women with newly diagnosed osteoporosis would provide data to appropriately answer this question.
Guidelines and recommendations about laboratory testing to identify secondary causes of osteoporosis were obtained from expert sources published within the last 10 years (Appendix D). We considered three sources: published practice guidelines, review articles on diagnosis and management of osteoporosis, and chapters published in textbooks likely to be consulted by primary care practitioners or endocrinologists. We attempted to delineate the practice base and audience for which the recommendations were written, the level of evidence review undertaken in preparation of the review, the secondary disorders that should be considered, and the specific laboratory tests to be used routinely or as indicated by history or examination.
Of the 10 practice guidelines we reviewed, only 3 appeared to be based on a formal evidence review; the rest were based on expert opinion. Only one of the four review articles and none of the textbook chapters were based on an evidence review. Despite this, even within an evidence-based review, many of the recommendations for laboratory testing were stated as being empirically-based rather than evidence based. This is consistent with the lack of direct data on the prevalence of secondary causes of osteoporosis or on the outcomes of diagnostic algorithms. Slightly greater weight could be given to practice guidelines, however, because these frequently represent the considered opinion of a group of experts and undergo peer review. Expert opinion is usually written by a single subspecialist whose opinion may be based on the management of complicated osteoporosis, and may not be representative of the larger population.
Without exception, all practice guidelines suggest a laboratory evaluation, at least in some patients with established osteoporosis, to exclude secondary causes. The scope of this workup, however, varies among guidelines. Because osteoporosis is often a "diagnosis of exclusion," most guidelines and opinions affirm that the diagnosis of primary osteoporosis cannot be made until coexistent disease or disorders are ruled out. In the first two practice guidelines in Appendix D from osteoporosis foundations, the statements of philosophy strongly contrast. The National Osteoporosis Foundation cautions practitioners that "limited biochemical testing may be required in some cases," whereas the Foundation for Osteoporosis Research and Education states that "the workup of secondary causes can be extensive."
The majority of the 10 practice guidelines suggest that the routine evaluation of a woman with osteoporosis would include measurement of serum calcium, phosphorus, ALP, liver enzymes, and creatinine, all part of a metabolic panel, as well as a complete blood count. Most guidelines suggest additional tests, depending on indications of individual patients. These might include serum protein electrophoresis, thyroid function tests, 24-hour urinary calcium, parathyroid hormone, and 25-OH vitamin D levels.
The majority of expert opinions cited in textbooks describe a more extensive routine evaluation: serum calcium, phosphorus, ALP, and creatinine, or, alternatively, a metabolic panel plus thyroid function tests, complete blood count, and serum or urine protein electrophoresis. If indicated by history and physical examination, additional testing would include measurement of parathyroid hormone and 25-OH vitamin D levels. At least half of the review articles would include a metabolic panel and complete blood count, as well as thyroid function tests. Additional testing, if indicated, would include protein electrophoresis, follicle-stimulating hormone, and a bone biopsy.
Review of these guidelines indicates a lack of agreement in the medical community about the appropriate evaluation for secondary causes. Without evidence of the prevalence of secondary disorders in a population tested by bone density measurement, subsequently diagnosed with osteoporosis, and then evaluated for secondary disorders, it is unlikely that this can be resolved. As a first step, identification of the prevalences of specific secondary causes of osteoporosis in a population of women identified with low bone density would be invaluable. An evaluation could then be constructed based on the probability of disease.
To obtain more information about how physicians evaluate women with osteoporosis for secondary causes, we performed a supplemental analysis about physician practice patterns based on responses to a questionnaire. Details of this analysis are presented in a subsequent section (see Supplemental Analysis 1: Practice Variation in the Evaluation of Secondary Causes). The results of this pilot study indicated that physician practice patterns vary in the community. Physician test-ordering behavior differed among primary care practitioners and specialists. Family practitioners less frequently recommended bone density tests for postmenopausal women, obtained bone density tests, and undertook evaluations for secondary causes compared with internists, endocrinologists, and obstetricians/gynecologists. Endocrinologists more frequently evaluated patients for secondary causes than primary care practitioners. TSH, chemistry battery, and complete blood count were the most frequently ordered tests cited by respondents. These were also the most frequently recommended tests in our review of expert guidelines discussed above.
We took two approaches to address the question about the cost-effectiveness of diagnostic strategies. First, we conducted a literature review and critically appraised relevant articles. Finding this literature limited, we performed a supplemental analysis by constructing our own models of cost-effectiveness. We compared various combinations of QUS measurements of the heel and DXA measurements of the hip for hip fracture outcomes using data from two large prospective cohort studies of osteoporosis. For a full description of this analysis, see Supplemental Analysis 2: Model-Based Economic Comparison of Diagnostic Alternatives.
Another study compared the costs of diagnosing osteoporosis using SCORE with different combinations of DXA of the hip, spine, and forearm radius in 392 women who were retirees or employees of a corporation in northern California.12 One hundred of these women (26 percent) had a T-score under -2.5 with DXA at one or more of the three sites (used as the gold standard in this study). The cost of performing all three tests on every patient was $235. The authors suggested a sequential strategy in which all women first underwent an evaluation using SCORE. Women determined to be at higher risk then received densitometry (e.g., DXA at the radius followed by DXA at the hip and spine for those not assessed as osteoporotic at the radius). Assuming SCORE costs $5 per patient to administer, the example sequential strategy would identify 98 of the 100 osteoporotic women, at a cost of $170 per case. With this strategy, all but 84 of the 392 women would require densitometry, and the majority of women with osteoporosis would undergo DXA at three sites (hip, spine, and radius).
Two other studies reviewed data on women age 60 to 69 years421 and women age 50 to 54 years422 to compare using QUS and/or a series of clinical questions to determine who to refer for assessment with DXA. These authors used DXA of the femoral neck and/or the lumbar spine as the gold standard for comparison. Their referral criteria (relevant to these analyses) included (1) women suspected to be osteoporotic from radiological and clinical findings, (2) women with a medical condition predisposing them to osteoporosis, (3) women on corticosteroids, (4) women who underwent menopause before age 45, and (5) women with a family history of osteoporosis.
In the older cohort (age 60 to 69 years), the sensitivity and specificity (73 percent and 81 percent, respectively) were better and the cost per osteoporotic case identified was lower for QUS compared with the referral criteria and with DXA for all women. For women age 50 to 54, when disease prevalence is considered, QUS also had the lowest cost per osteoporotic case identified. The authors used sensitivity analyses to determine that if DXA costs were less than 3.5 times those of QUS, then DXA for all women was more cost-effective. However, when both osteoporotic and osteopenic cases were targeted, the authors found the referral criteria to have lower cost per case identified than QUS, and sensitivity analyses found this pattern to be consistent across ratios of DXA to QUS costs. An additional clinical criterion -- estrogen-deficient women who would undergo or continue with treatment if found to be osteoporotic or osteopenic -- also was included with the other clinical criteria in the younger cohort. However, the cost per case detected was higher for this group than for the five-question clinical criteria group.
The other studies compared universal HRT with selective therapy based on the results of bone density tests.423,424 These analyses did not provide information about the choice of diagnostic test.
| Linkage: Step in analytic framework | Study type b | Quality of evidence (defined in Appendix C) |
|---|---|---|
| Arrow 1 | ||
| What risk factors predict bone density, bone loss, and fractures? | II-2 | Good: Several large prospective studies; many consistent findings. |
| Are tools for assessing risk factors accurate in identifying women at risk for fractures? | II-2 | Fair-Good: Few studies; tools usually are not validated or tested prospectively. |
| Arrow 2 | ||
| How well do the different bone measurement tests at different sites predict fractures? | II-2 | Fair-Good: Several large fair and good-quality cohort studies. |
| What factors related to bone testing influence diagnosis? | II-2 | Fair-Good: Studies vary because of multiple techniques, sites, and reference standards; difficult to apply to individual patients. |
| Can markers be used instead of bone measurement tests in identifying women at risk for osteoporosis? | II-2 | Poor-good: Small samples; narrow spectrum of patients; inconsistent results. |
| What are the adverse effects of using markers to identify women at risk for osteoporosis? | No studies available. | |
| How well do markers predict fractures? | II-2 | Fair-good: Small samples; results not generalizable; inconsistent results. |
| Arrow 3 | ||
| Are risk factors useful in treatment decisions? | III | There are no studies of patients who are at high risk of fracture based on clinical risk factors but who have normal bone measurement results. |
| How do the results of bone measurement tests affect patients' and physicians' decisions and actions? | I, II-3, III | Poor-Fair: Small descriptive studies; study population may not be generalizable. |
| Are bone measurement tests effective for monitoring response to therapy and for guiding decisions about changes in management? | III, II-2 | Poor-Fair: No studies of the use of monitoring in clinical decisions. |
| Can markers help select patients for treatment? | II-2 | Poor-Good: Two large population-based studies; others small and from single institutions; results inconsistent. |
| What are the adverse effects of using markers to identify women at risk for osteoporosis? | No studies available. | |
| Can markers predict response to therapy? | II-2 | Poor-Good: Several population-based or multicenter studies; results inconsistent; gold standard (2-yr change in DXA) may not be reliable. |
| What diagnostic or laboratory tests are appropriate for evaluating patients with osteoporosis for secondary causes? | III | No studies available. |
| Arrow 4 | ||
| What are the costs and cost-effectiveness of diagnostic strategies for identifying women with osteoporosis? | III | Poor: Only cost-effectiveness models are available; none are based on comparative studies of diagnostic approaches. |
U.S. Preventive Services Task Force437
Study design categories: I = randomized, controlled trials; II-1 = controlled trials without randomization; II-2 = cohort or case-control; II-3 = multiple time series; III - opinions, descriptive epidemiology.
DXA = dual-energy X-ray absorptiometry
Several factors are consistently associated with increased risks of low bone density and fractures in postmenopausal women, including increasing age, white race, low weight or weight loss, nonuse of estrogen replacement, history of previous fracture, family history of fracture, history of falls, and low scores on one or more measures of physical activity or function. Other factors are less consistent predictors across studies, but also have significant associations with bone density and fractures. These include smoking, alcohol use, caffeine use, low calcium and vitamin D intake, and use of certain drugs. Additional risk factors are important in specific studies. The relative risks of several risk factors are comparable to that of a 1 standard deviation difference in bone density. Predictors for low bone density are similar to those for fracture except for those specifically related to falls. Most of the strongest risk factors are consistently related to outcomes in different racial and ethnic populations. Risk factors are generally similar for the various fracture sites except that fractures related to falls have additional functional risk factors.
Few studies report results of the performance of tools for assessing clinical risk factors for osteoporosis. Our evidence review suggests that specific tools to assess risk have inadequate sensitivity and specificity and have not yet been widely tested. Generalizability of these tools to the clinical setting is therefore currently limited. However, some tools -- especially those developed in large community populations and containing variables known to be strong predictors -- may ultimately be applicable to the clinical setting once they are tested there.
Although this approach may be useful in the general population, translating the results of population studies to the care and management of individuals requires additional considerations. Risk factors that are not common may be dominant in individual patients. Also, some risk factors may be important for different types of fractures and for different age groups, for example. It is not yet known how tools for risk assessment would perform under these varying circumstances. In addition, clinicians may find it difficult to selectively exclude low-risk women from diagnostic protocols because a substantial number of women without specific risk factors also experience fractures. Patients may have similar concerns.
Identification of important risk factors could be useful in making treatment decisions. Although models have been proposed, they have not been prospectively tested in large trials and are not used by clinicians. Clinical trials have not focused on what should be done for patients who have several risk factors and a high overall risk of fracture but who do not meet the WHO definition for osteoporosis. A few risk factors -- sex, race, age, menopause, a history of previous fracture, and low bone density -- have been primary selection criteria for randomized trials of bisphosphonates and provide clues to the validity of the risk-based approach.
Among different bone density tests measured at various sites, bone density measured at the femoral neck by DXA is the best predictor of hip fracture and is comparable to forearm measurements for predicting fracture at other sites.
In recent prospective studies, QUS measured at the heel predicted hip fracture and all nonspine fractures as well or nearly as well as DXA measured at the femoral neck. For both tests, a result in the osteoporotic range is associated with an increased short-term probability of hip fracture. However, clinical trials of recent pharmacologic therapies have used a low hip DXA, rather than QUS, as a criterion for entry. Physicians who use QUS must consider whether the results of these trials are generalizable to patients identified by QUS to have a high risk of fracture.
QUS of the heel and DXA of the hip provide independent information about fracture risk. Individuals who have low scores by one of these tests, but not the other, have a greater risk of fracture than those who have higher scores by both tests, and a lower risk of fracture than those whose results on both tests are low.
Both of these tests predict hip fracture better than DXA of the lumbar spine. RA or QMD of the hand can predict the risk of nonspine fractures in general, many of which are in the forearm, but there are no recent data about the ability of hand measurement to predict hip fracture. Correlations between different bone measurement tests are generally too low to be accepted as evidence that one test will identify patients at similar risk to those identified by another test. While peripheral measures may approach hip DXA in predicting hip fracture, there are no recent prospective studies that directly compare prediction of hip fracture by these tests with DXA of the hip.
The likelihood of being diagnosed with osteoporosis varies greatly, depending on the site and type of bone measurement test, on the brand of densitometer, and on the relevance of the reference range in the local population. The variation between techniques, along with the lack of methods to integrate bone density results with clinical predictors, makes it difficult for clinicians to provide accurate information to patients about their test results.
The likelihood of being diagnosed with osteoporosis also depends on the number of sites tested. Testing in the forearm, hip, spine, or heel will generally identify different groups of patients. A physician cannot say, based only on a forearm test, that the patient "does not have osteoporosis." Conversely, although the results of a test at any site are associated to some degree with fractures at other sites, the physician may not be able to assess whether the patient who has a low T-score on a hand or forearm test has significant bone loss at other sites.
Because of these limitations, some experts question whether there is added benefit to the patient from using T-scores to diagnose osteoporosis, as opposed to reporting test results as continuous values and assessing overall risk. Others propose that results using various techniques and sites be "calibrated" to the results of DXA of the hip and reported as a "T-score equivalent." The success of this approach may be limited by differences in precision and low correlation among different techniques.
One randomized trial suggests that women who undergo densitometry are more likely to start HRT than women who do not. In a randomized trial and a large uncontrolled case series, women who had densitometry and were told they had osteoporosis were more likely to start or continue HRT than women who were told they had normal bone density. In one randomized trial, physicians found densitometry reports confusing and were not confident that their interpretations of T-scores were correct. While overall the reporting format did not influence treatment decisions, gynecologists who better understood the report were more likely to prescribe changes in HRT than those who considered the report confusing. These results suggest that the behaviors of both patients and physicians are influenced by the results of bone measurement tests.
Currently, the weight of evidence is against repeating bone density tests within the first year of treatment. There is insufficient evidence to determine whether repeating bone density tests 2 years after starting therapy is useful. There also are no studies about the effect of monitoring responses to therapy using densitometry, or of the choice of test, on the outcome of therapy.
No single marker or cluster of markers accurately predicted the results of densitometry in individuals. Densitometry measures current bone status, whereas markers measure the process of bone turnover.
No marker was associated with increased fracture risk consistently across all studies. The EPIDOS study provides evidence that using markers in conjunction with densitometry may increase predictability, but this result has not been confirmed in other studies.
Studies correlating marker results and bone loss indicated no clear trend. Sensitivity and specificity were too low to be useful for the purpose of selecting patients for treatment. Some studies found better test accuracy when a combination of two or more markers and/or other risk factors was used to predict bone loss.
The primary adverse effect of biochemical markers is the potential for false-positive and false-negative results. Rates of false-positive and false-negative test results vary widely. If markers were used to select women for treatment, a false-positive test could lead to the initiation of unnecessary treatment. A false-negative result could lead to more serious consequences if markers were to be used as an initial test to select women for further testing.
There is a small correlation between response to therapy as measured by densitometry and marker results, but no marker is accurate enough to reliably identify nonresponders to treatment. The best results, from the EPIDOS study, have not been replicated in other studies. The use of bone density at 2 years as the gold standard for measuring response to therapy has been called into question by a recent study that found that bone density at 2 years is not a reliable predictor of subsequent response to therapy.
There is no evidence on which to base recommendations for a strategy of testing to determine secondary causes of osteoporosis. The accumulated literature suggests only the presence of low bone density in certain disorders such as hyperparathyroidism, multiple myeloma, and hyperthyroidism, but not the prevalence of these disorders given a diagnosis of osteoporosis.
Despite this lack of evidence, expert opinion and practice guidelines provide many suggestions for diagnostic workups. Some support extensive testing to rule out major concomitant disease, while others suggest a limited testing strategy based on findings in the history and physical examination. Because the diagnosis of primary osteoporosis is often seen as a diagnosis of exclusion, the pattern of diagnostic testing may continue to be costly until the diagnostic yield is fully demonstrated.
Within the community, the pattern of testing varies widely, particularly by specialty type. Our supplemental analysis suggested that assumptions about the probability of a secondary disease or disorder to explain the occurrence of osteoporosis vary by practice type and specialty. TSH, chemistry battery, and complete blood count were the most frequently ordered tests cited by respondents. These also were the most frequently recommended tests in our review of expert guidelines.
In the literature we reviewed, economic assessments suggested that diagnosis and treatment of women at risk for osteoporosis would be more cost-effective by targeting treatment to those with the lowest bone measurement results. Inclusion of another assessment prior to the bone measurement test may improve the cost-effectiveness of diagnosis. The other assessment could be a risk profile (such as SCORE) or a less expensive, and perhaps more widely available, diagnostic test such as QUS of the heel.
We conducted cost-effectiveness analyses to estimate cost per hip fracture prevented using data from EPIDOS and SOF (see Supplemental Analysis 2). These analyses suggested that a sequential diagnostic approach was more cost-effective than using DXA of the femoral neck alone. The sequential approach we considered was QUS of the heel followed by DXA of the femoral neck only for those with low values of QUS/BUA. Diagnosis with DXA of the femoral neck alone prevented more fractures in most cases, but at an increased cost per fracture prevented compared with the sequential approach. Using SOF data, if the overall incidence of fractures were increased, that is, if the population were at a five- to tenfold higher risk of fracture than the SOF reported, the incremental cost per hip fracture prevented using diagnosis with QUS alone (compared with DXA alone) dropped. QUS alone may be a cost-effective option in high-risk populations. In senstitivity analyses, if treatment efficacy following diagnosis with QUS were 5 percent or 15 percent less than that following diagnosis with DXA, diagnosis with QUS alone would have higher costs to prevent fewer hip fractures than other diagnostic options. If reduction in fracture risk reduction is as little as 5 percent less than diagnosis with DXA, QUS alone may be dominated by other alternatives.
Much of the evidence for diagnosis and monitoring strategies for osteoporosis comes from epidemiologic studies. To be more useful for clinicians and patients, and include a wider diversity of patient populations, future research should focus on the application of these data to the clinical setting.
Tools for assessing risk factors should be tested in prospective studies to determine if their use can correctly stratify women by risk factors, influence treatment decisions, and ultimately reduce fracture outcomes.
Clinical trials should be done to determine if identifying and reducing modifiable risk factors influences fracture outcomes. Some of these modifiable risk factors already have demonstrated effects on fracture risk after intervention, such as supplementation with calcium and vitamin D, and estrogen replacement after menopause. Examples of additional interventions to test include smoking cessation, correcting visual loss, and improving physical function.
Randomized controlled trials of treatments for osteoporosis should be done to test the hypothesis that overall fracture risk, rather than bone measurement results alone, determines the likelihood that a patient will benefit from therapy. Selection of patients for trials should focus on groups of patients who are at high risk of fracture based on clinical risk factors and who have relatively normal bone measurement results. Trials also should address whether patients identified to have bone loss demonstrated by different techniques at different sites have a similar benefit to those identified solely by hip DXA measurements.
Additional research is needed to examine the quality of information provided to patients who undergo various bone measurement tests, as well as to identify other patient education and information needs.
Future research should examine the clinical utility of the WHO working group's criterion for diagnosing osteoporosis, specifically:
To determine whether the diagnosis of osteoporosis provides benefits to patients above that which is provided by predicting their risk of fractures.
To assess the added value of obtaining bone measurement tests at more than one site, and the usefulness of using T-scores for different sites.
To define the prognosis and treated course in patients who would be diagnosed to have osteoporosis by a wrist or spine measure but would not be diagnosed to have osteoporosis by a hip measure.
To determine if the use of bone measurement results and risk factors together to estimate absolute fracture risk is a valuable method to define treatment thresholds.
Studies of using bone measurement tests for monitoring response to therapy could compare fracture outcomes in a group of patients who had tailored therapy based on test results versus a group in whom changes in therapy, if any, were guided by the history alone. A study also could record how often test results in patients on therapy led to a change in therapy or improved compliance, to establish the mechanism by which monitoring led to improved outcomes. By comparing patients, such a study could establish that monitoring changes in test results can reliably predict fracture risk in individual patients by distinguishing an inadequate response to therapy from an adequate response or poor adherence, and that monitoring changes in therapy made because of an inadequate test response can reduce the rate of fractures.
Prospective studies of biochemical markers should define, apply, and evaluate criteria for using marker results in clinical decisionmaking.
Determining the utility of screening for secondary disorders using common laboratory tests requires studies of the frequency of abnormal baseline laboratory tests in large cohorts, such as the SOF, or in large treatment trials of osteoporosis. Clinical followup of these subjects would provide data on bone measurement test results or fracture outcomes.
Studies to formally account for the adverse quality-of-life impact of treatment and treatment side effects are needed to more accurately determine the balance of risks and benefits of therapy options for patients.
Three areas for additional cost-effectiveness research include:
Identifying a scientifically appropriate cut point for QUS/BUA that can be used in either a sequential diagnostic approach or for diagnosis with QUS alone.
Performing a cost-effectiveness analysis using data from EPIDOS (or other large cohort) with various cut points similar to that performed with the SOF data in our analysis to support these results.
Conducting a more detailed, societal perspective cost-effectiveness analysis perhaps incorporated into the trial described above that would address some of the deficiencies of our analysis.
If these findings can be demonstrated in one or two other populations and in a more complete economic evaluation, a randomized controlled trial of diagnostic approaches would be a useful next step. Alternatively, an observational study with randomization of groups of patients to diagnostic or monitoring procedures could be conducted.
A preliminary literature search of MEDLINE identified no studies of strategies to diagnose secondary causes of osteoporosis in postmenopausal women who have low bone density or fragility fractures. A review of published practice guidelines and expert opinions about diagnostic strategies for secondary osteoporosis confirmed this finding, because there were no publications cited as background or evidence. Based on this investigation, we determined that evidence supporting an appropriate evaluation for secondary causes in women with osteoporosis was lacking. To determine how physicians actually conduct these evaluations in clinical practice in the absence of evidence, we next performed a supplemental analysis based on responses to a questionnaire (Appendix S1-1). Support for this separate project was provided by the Outcomes Management Group, Division of Medical Informatics and Outcomes Research, Oregon Health & Science University. The principal investigator of the project was Cynthia Morris, Ph.D.
We designed a mailed, self-administered questionnaire to measure the following: (1) frequency of use of bone density testing in postmenopausal women, and characteristics influencing the decision to order it; (2) frequency of evaluation for secondary causes of osteoporosis for women with confirmed low bone density; (3) referral patterns for women diagnosed with osteoporosis; (4) principal secondary causes of osteoporosis considered in the evaluation; (5) factors influencing the decision to evaluate secondary causes; and (6) laboratory tests used routinely and selectively as indicated by history and physical examination in women with confirmed osteoporosis.
We used the American Medical Association (AMA) list of U.S. practitioners to identify physicians in active practice who are not in military service; both members and nonmembers of the AMA are contained in this list. To enable a comparison of primary care practitioners to specialists (endocrinologists), 500 physicians each in internal medicine, family practice, and obstetrics and gynecology were randomly selected using a nationwide sampling strategy. One thousand endocrinologists were similarly chosen nationwide. Questionnaires were mailed to subjects in January 2000. Because this was intended as a pilot project, additional mailings were not provided. Responses were accepted up until April 1, 2000. Analysis included descriptive frequencies only.
A total of 367 of the 2,500 mailed questionnaires (15 percent) were completed and returned. The average age of physician respondents was 47.3±10.9 years. Thirty-two percent of the sample was female and 68 percent male. Ninety-three percent of respondents were in practice full-time and 7 percent part-time. Private or group practice types predominated in the sample (76 percent) compared with 10 percent who work in an academic medical center, 5 percent in an HMO, and 6 percent in other settings. Fifty-five percent of respondents practiced in a large city (>200,000 population), 35 percent in a midsized city (25,000-200,000 population), and 10 percent in a small town or rural environment.
| Internists | Family Practitioners | Endocrinologists | Obstetricians/Gynecologists | |
|---|---|---|---|---|
| >75% | 22 (42%) | 9 (20%) | 80 (42%) | 15 (23%) |
| 25-75% | 20 (38%) | 19 (42%) | 83 (43%) | 25 (38%) |
| <25% | 8 (15%) | 16 (36%) | 27 (14%) | 26 (39%) |
| None | 2 (4%) | 1 (2%) | 2 (1%) | 0 |
*Percentages may not equal 100 because of rounding
| Internists | Family Practitioners | Endocrinologists | Obstetricians/Gynecologists | |
|---|---|---|---|---|
| Frequently (>1 patient/wk) | 31 (60%) | 15 (33%) | 128 (66%) | 35 (53%) |
| Occasionally (>1 patient/mo) | 17 (33%) | 20 (44%) | 59 (31%) | 22 (33%) |
| Rarely (>1 patient/yr) | 3 (6%) | 10 (22%) | 4 (2%) | 9 (14%) |
| Never | 1 (2%) | 0 | 2 (1%) | 0 |
*Percentages may not equal 100 because of rounding
| Internists | Family Practitioners | Endocrinologists | Obstetricians/Gynecologists | |
|---|---|---|---|---|
| Frequently (>1 patient/wk) | 10 (20%) | 4 (9%) | 104 (54%) | 12 (18%) |
| Occasionally (>1 patient/mo) | 17 (33%) | 11 (24%) | 66 (35%) | 10 (15%) |
| Rarely (>1 patient/yr) | 22 (43%) | 28 (62%) | 20 (10%) | 34 (52%) |
| Never | 2 (4%) | 2 (4%) | 1 (1%) | 9 (14%) |
*Percentages may not equal 100 because of rounding
Three factors were cited as strongly influential in the decision to evaluate a woman for secondary causes by more than 90 percent of the responding physicians: no response to 1 year of treatment for osteoporosis, very low bone density, and osteoporosis in a premenopausal woman. When asked which age group they would most likely evaluate for secondary causes, respondents most often cited the youngest group, women 45-54 years of age. Concomitant serious medical illness, history of an atraumatic fracture, and unwillingness to undergo treatment for osteoporosis were modestly influential in the decision to evaluate osteoporosis in a woman, cited at least half the time by respondents.
| Routinely | Only if indicated by history & exam | |
|---|---|---|
| TSH - Thyroid-stimulating hormone | 92% | 6% |
| Chemistry battery | 87% | 9% |
| Complete blood count | 69% | 20% |
| 24-hour urine calcium | 44% | 37% |
| Parathyroid hormone | 43% | 45% |
| Urinalysis | 42% | 33% |
| 25-OH vitamin D | 41% | 40% |
| Serum protein electrophoresis | 37% | 48% |
| Ionized calcium | 32% | 43% |
| T3/T4 - Thyroxine or T3 | 31% | 47% |
| Erythrocyte sedimentation rate | 24% | 50% |
| Bone-specific alkaline phosphatase | 18% | 52% |
| Serum or urinary collagen crosslinks | 18% | 50% |
| Urine pH | 18% | 50% |
| Cortisol | 16% | 63% |
| 1,25(OH)2D3 | 16% | 57% |
| 24-hour creatinine clearance | 15% | 52% |
| Spinal X-ray or CT scan | 10% | 60% |
| Osteocalcin | 7% | 56% |
| Blood gas | 0.6% | 62% |
| Bone biopsy | 0.3% | 64% |
The results of our analysis of physician practice pattern variation in the evaluation of secondary causes of osteoporosis in women indicated a lack of consensus. Physician test-ordering behavior differed among primary care practitioners and specialists. Family practitioners less frequently recommended bone density tests for postmenopausal women, obtained bone density tests, and underwent evaluations for secondary causes than did internists, endocrinologists, and obstetricians/gynecologists. Endocrinologists more frequently evaluated patients for secondary causes than did other practitioners. Thyroid-stimulating hormone, chemistry battery, and complete blood count were the most frequently ordered tests cited by respondents. These also were the most frequently recommended tests in our review of expert guidelines.
These data are limited by the low response rate and may not be representative of true practice variation that would be observable only within a large data set. Respondents likely represented a physician community most interested in osteoporosis, as evidenced by the higher response rate (21 percent) of endocrinologists, and may not reflect actual community practice.
[Form goes here]
Our review of the literature on bone measurement tests indicates that dual-energy X-ray absorpiometry of the femoral neck (DXN-FN) or total hip is the best predictor of hip fractures (see section on Bone Measurement Tests, p. 48, above). Clinicians use this approach for making diagnosis and treatment decisions about osteoporosis. Women with T-scores less than -2.5 are considered osteoporotic and may be offered treatment.1 Women with T-scores between -1 and -2.5 are considered ostepenic and may be monitored more closely for additional loss of bone density. Diagnosis with DXA-FN requires a relatively large and expensive instrument and a relatively high degree of operator skill to correctly position the target area. DXA is usually performed at a hospital or larger medical center and is often done on a referral basis.
Over the last few years, several newer diagnostic approaches, such as quantitative ultrasound (QUS) and peripheral DXA, have become more available. These approaches require less costly equipment and less space than DXA and are becoming more common in primary care offices and smaller clinics. As part of our evidence review, we met with a local technical expert panel (Appendix A). They suggested that a sequential diagnostic approach may become common in primary care. With this approach, a newer and more available diagnostic tool such as QUS is used in a primary care setting. If a woman is identified as being at higher risk of osteoporosis, she is then referred to DXA-FN for a definitive diagnosis. Also, as these diagnostic tools become more available in the primary care setting, clinicians may choose to use one of these newer methods alone.
We developed a simple economic model to compare both the sequential diagnostic approach and diagnosis with QUS alone to diagnosis with DXA-FN alone. This simple model is intended to evaluate potential differences in cost per fracture prevented among these three diagnostic approaches. These results may support a more detailed economic evaluation beyond the scope of this evidence report.
We chose QUS to be representative of the newer and more available diagnostic tools because there are published reports from two cohort studies2,3 that compare hip fracture rates following diagnosis with DXA and with QUS. In addition to these published reports, the investigators of the Study of Osteoporotic Fractures (SOF)4 provided data on women in the SOF cohort who received both DXA-FN and QUS and were followed for subsequent fractures.
The World Health Organization1 and other groups recommend that osteoporotic women be defined as those with DXA T-scores less than -2.5. However, there are no such guidelines for QUS broadband ultrasound attenuation (QUS/BUA). We therefore performed a series of analyses using the SOF cohort data with various QUS/BUA cut points used to define women at high risk of fractures.
The objective of this economic evaluation is to use a relatively simple economic model to predict, for alternative diagnostic approaches, the total direct medical costs (from a payer/provider perspective) and number of hip fractures in a cohort of 1,000 older women. Data for this analysis was obtained from published studies of the Epidemiologie de l'Osteoporose (EPIDOS) cohort2 and unpublished data from the SOF cohort.3 We developed this model to assess the potential for reduced cost per hip fracture prevented using either a sequential diagnostic approach (QUS followed by DXA-FN in those at higher risk of osteoporosis) or QUS alone relative to DXA alone.
The target population for these analyses is the same as that for the EPIDOS2 and SOF3 studies -- that is, older white women. The EPIDOS study consisted of 5,541 French women (out of 7,575 in the original cohort) with a mean age of 80 years. The original SOF cohort3 (U.S. women age 65 years or more) had been followed for about 5 years when the period for this study began. Of those alive at that time, 5,993 (72 percent) were included in this study, with a mean age of 76 years. In both studies, some centers lacked the QUS instruments at the time the diagnostic tests were taken; thus, not all women in the cohorts were included in these two studies. The fracture data in the reports on the EPIDOS and SOF cohorts had an average follow-up time of about 2 years. While this period is shorter than the period of most clinical trials of therapy to reduce fracture rates, we have used this 2-year time horizon for our analyses.
We used number of hip fractures as the only outcome of interest. We did not explicitly model sequelae of hip fractures (such as death or admission to a nursing home), although the costs of these sequelae were included in the cost of a hip fracture.
As the base case risk reductions for the three treatments, we used 50 percent (alendronate), 25 percent (HRT), and 10 percent (calcium and vitamin D). These values are based on the more conservative estimates of treatment effects and the ranges of uncertainty cited in a 1998 report by the National Osteoporosis Foundation.4 We then estimated the number of women in a hypothetical cohort of 1,000 who were diagnosed as osteoporotic and received treatment. The number of hip fractures was estimated for osteoporotic women using the risk reduction for the treatment used, and for nonosteoporotic women. This total number of hip fractures was subtracted from the number of fractures without any diagnosis or treatment (the number of hip fractures per 1,000 women observed in the cohort used) to estimate the number of fractures prevented by diagnosis and treatment. We allowed the total fracture rate to be increased (up to tenfold) or decreased (by up to 50 percent) to represent populations with higher or lower overall fracture risk. We also allowed fracture risk reduction in those diagnosed using QUS alone to be reduced by a percentage (by 10 percent, 20 percent, and 30 percent) compared to those diagnosed with DXA-FN alone. There are no clinical trials of therapies leading to reduction in fracture rates that have used QUS/BUA at baseline as a risk predictor.
We used three sets of direct medical costs in this economic model. The first set of costs are the costs of diagnosis. For DXA or QUS alone, the costs of diagnosis are 1,000 (the number in the cohort) times the cost of the diagnostic test. For the base case, the test cost for DXA is $133 and for QUS is $34. In sensitivity analyses, these costs range from $99 to $183 for DXA and $30 to $54 for QUS. We used Medicare reimbursement rates (median, lower, and upper quartiles) for 20006 for CPT code 76075 for DXA (hip and spine) and 76977 for QUS (any site) for these cost estimates.
The second cost is for treatment of fractures. We used cost estimates for hip fractures from the U.S. Congress Office of Technology Assessment.7 We inflated these costs to 1999 values (multiplier of 1.147) using medical inflation data from the U.S. Department of Commerce.8
The third set of costs are for the three treatment regimens -- alendronate, HRT, and calcium and vitamin D. HRT assumes treatment with both estrogen and progestin. In practice, women without an intact uterus would receive only the estrogen, lowering the cost for this regimen. We used the cost for the combination regimen for all analyses. We used the annual cost estimates from the NOF report4 inflated to 1999 dollars (using a multiplier of 1.147). Because the EPIDOS and SOF cohorts had different mean ages, the treatment costs differ slightly because of differences in mortality in the first year. Treatment costs were summed for year 1 and, after discounting and adjusting for mortality in year 1, for year 2. We assumed the cohort experienced mortality in the second year corresponding to the average mortality in females in the United States for a group with age equal to the mean cohort age.9
The three sets of costs (diagnosis, fractures, and treatment) were summed to obtain a total cost. We used base case plus or minus 30 percent for sensitivity analyses on costs of fractures and costs of treatments. From this total cost we subtracted the cost of treating fractures in the cohort, assuming no diagnosis or treatment, to obtain the final (incremental) total cost. Thus, the total cost is the incremental cost of performing a diagnosis and treating women diagnosed as osteoporotic.
Neither of the reports on the EPIDOS and SOF cohorts reported 1-year hip fracture rates, only fracture rates for the total follow-up period. Thus, we were unable to discount either fractures prevented or cost for fractures for these analyses. We discounted the second year of treatment costs by 3 percent. Given the 2-year time horizon we used, any impact of not discounting all costs and outcomes is likely to be relatively small.
We programmed two versions of this model. The first version was developed in Excel 97® for Windows 95®. The second version was programmed in SAS® Version 7.0 for Windows 95®. For a given scenario, we omit discussion of diagnostic alternatives that are either dominated or show extended dominance compared to another alternative. Economic dominance occurs in three situations: one alternative may cost more and prevent more fractures than a second; one alternative may cost as much and prevent more fractures; an alternative may cost less and prevent the same number of fractures. Extended dominance occurs when the incremental cost-effectiveness of an alternative with lower cost is greater than the incremental cost-effectiveness of a second alternative that has higher cost. For a more complete discussion of dominance and extended dominance, we refer the reader to the report by the Panel of Cost-Effectiveness in Health and Medicine.10
In all but two of the sensitivity analysis scenarios, the sequential approach had the lowest cost per hip fracture prevented, DXA alone prevented more hip fractures at a higher cost per hip fracture prevented, and QUS alone was dominated. If women diagnosed as osteoporotic are treated with HRT and if either the higher cost of DXA ($183) or the lower cost of HRT is used, QUS alone has an incremental cost-effectiveness ratio only slightly higher than that for the sequential approach. For high-cost DXA (or low-cost HRT), the incremental cost-effectiveness ratios are about $116,000 ($78,000) for the sequential approach, about $129,000 ($79,000) for QUS alone, and about $235,000 ($149,000) for DXA alone. While the magnitude of the incremental cost-effectiveness ratios varies across the other sensitivity scenarios, the sequential procedure consistently has the lower cost-effectiveness ratio. DXA alone will prevent additional hip fractures, but at an increased incremental cost per hip fracture prevented at least 43 percent higher across scenarios with alendronate treatment and at least 26 percent higher across scenarios with HRT treatment.
Appendix S2-1 summarizes results for all diagnostic alternatives, including those that are dominated. The sequential diagnostic approach has the lowest incremental cost per fracture prevented compared to no diagnosis and treatment. For QUS/BUA cut points of 70 dB/MHz or less, DXA is the next most cost-effective alternative, but the incremental cost per fracture prevented of DXA over the sequential option is substantial, at least 50 percent greater in relative terms or more than $85,500 per hip fracture prevented in absolute terms. At QUS/BUA cut points of 75 dB/MHz or greater, DXA alone is dominated. QUS alone is not a cost-effective alternative at cut points of 75 dB/MHz or greater. The cost per hip fracture prevented is around three times as great or greater as the cost per hip fracture for the sequential approach.
If calcium and vitamin D were used as a treatment, QUS alone is the most cost-effective alternative (cost per fracture prevented ranging from about $118,000 to $130,000 across the cut points). DXA alone is the only other nondominated alternative, but only for cut points of 60 dB/MHz or less, and the incremental cost per fracture prevented is about four times or more greater than for QUS alone. Results are summarized in Appendix S2-1.
This summary will focus primarily on scenarios in which the order of the diagnostic alternatives changes from the base case. For scenarios not discussed, the magnitudes of the incremental cost-effectiveness ratios may differ across alternatives, but the order is the same as the base case. In some cases, the change in magnitude of the incremental cost-effectiveness ratios indicates that an alternative may become relatively cost-effective for certain sensitivity parameters. These cases also are cited.
Sensitivity analyses for cost of QUS showed no substantive changes for the analyses with alendronate treatment. For treatment with HRT, at QUS/BUA cut points of 70 and 80 dB/MHz, in the scenarios using the higher cost for QUS ($54), DXA alone dominated the sequential procedure. At QUS/BUA cut point of 85 dB/MHz, in the scenario of low cost for QUS ($30), the sequential diagnostic approach dominated DXA alone.
Finally, we performed sensitivity analyses on reduction in the degree of risk reduction achieved by therapy in women diagnosed as osteoporotic using QUS alone. For treatment with alendronate, QUS alone was the least cost-effective alternative at QUS/BUA cut points of 65 and 70 dB/MHz. With 5 percent reduction in risk reduction, QUS alone became dominated at 65 dB/MHz, and with 15 percent reduction, it was dominated at 70 dB/MHz. At QUS/BUA cut point of 75 dB/MHz, DXA was a more cost-effective alternative than QUS diagnosis (DXA was dominated in the base case) with 10 percent or greater reduction in risk reduction. At cut points of 80 and 85 dB/MHz, there was no substantive difference between sensitivity and base case scenarios up to a reduction in risk reduction of 30 percent.
For treatment with HRT at QUS/BUA cut points of 50 and 65 dB/MHz, QUS alone -- not dominated in the corresponding base case scenarios -- is a dominated alternative with even a 5 percent risk reduction. At a cutpoint of 70 dB/MHz, QUS alone is dominated if the reduction in risk reduction is 15 percent. At a cut point of 75 dB/MHz, diagnosis with DXA alone, which is dominated in the corresponding base case scenario, is not dominated with even a 5 percent reduction in risk reduction and QUS alone is dominated with a 30 percent reduction in risk reduction. For all cut points except 85 dB/MHz, however, the sequential approach remained the most cost-effective alternative.
For treatment with calcium and vitamin D, QUS alone was the only nondominated alternative for QUS/BUA cut points of 65 dB/MHz or greater. If the reduction in risk reduction was at least 5 percent at 65 dB/MHz or at least 15 percent at 70 dB/MHz, then DXA alone is a nondominated alternative, although with a relatively high incremental cost-effectiveness ratio for most reductions in risk reduction. At QUS/BUA cut point of 75 dB/MHz, DXA alone is the only nondominated alternative for a 30 percent reduction in risk.
Sensitivity analyses for costs of calcium and vitamin D, risk reduction in response to treatment (alendronate, HRT, or calcium plus vitamin D), costs to treat fractures, and fracture rates did not show any changes in the order of the alternatives with respect to incremental cost-effectiveness ratios. The magnitude of cost-effectiveness ratios did change somewhat across these sensitivity analyses.
QUS alone in the base case is only cost-effective for treatment with calcium and vitamin D. As noted in the National Osteoporosis Foundation report,4 this is an alternative in which diagnosis may not be worth performing, because the cost and adverse experiences of both calcium and vitamin D are so low as to be relatively negligible. For treatment with alendronate, QUS alone is not dominated for cut points of 70 dB/MHz or higher. While diagnosis with QUS alone prevents the most fractures in these scenarios, the incremental cost-effectiveness ratios are all around $500,000 per fracture prevented or greater. One could ask whether, in a higher-risk population than the SOF cohort, diagnosis with QUS alone would be more cost-effective.
To address this question, we performed a series of analyses in which the risk of fractures in the population was increased by five-, seven- , and tenfold. In these scenarios, the order of alternatives is the same as in the base case. However, for higher-risk groups the incremental cost per hip fracture prevented drops for all alternatives. In particular, the incremental cost per hip fracture prevented for QUS alone is below $100,000 for several scenarios. For example, with alendronate treatment at 70 dB/MHz and a tenfold increase in fracture risk, the incremental cost per hip fracture prevented is $2,937 saved per hip fracture prevented for the sequential approach, $5,618 per hip fracture prevented for DXA alone, and $81,954 for QUS alone. QUS alone prevented 4.17 more fractures than DXA alone. Similarly, the incremental cost per hip fracture prevented following diagnosis with QUS alone at 75 dB/MHz given a 5-fold increase in fracture risk is $93,997 -- and for a 10-fold increase, $35,933. In these cases, almost 6 and more than 12 additional hip fractures are prevented, respectively, compared to the sequential approach. Similar results are observed for higher QUS/BUA cut points with alendronate and for treatment with HRT for QUS/BUA cut points of 70 dB/MHz or greater. Thus, QUS alone may be a cost-effective alternative in higher-risk populations.
These analyses have shown that the sequential diagnostic approach may be a cost-effective alternative to DXA. Further, in a higher-risk population, QUS alone may also be a cost-effective alternative. The results for diagnosis with QUS alone are sensitive to the reduction of risk reduction following diagnosis with QUS. If reduction in fracture risk reduction is as little as 5 percent less than diagnosis with DXA, QUS alone may be dominated by other alternatives.
Quality-adjusted life year (QALY), rather than hip fractures prevented, is the recommended outcome measure for economic comparisons.10 By using fractures prevented, policymakers cannot compare diagnoses with one of the approaches considered to other healthcare interventions. An analysis using QALYs would facilitate such a comparison. Our approach does allow one to make comparisons among diagnostic alternatives included within our model. Cost per QALY would also allow incremental cost-effectiveness ratios (comparisons among alternatives) to be placed into a context related to other economic healthcare decisions. Our primary purpose for this project, however, was to assess the potential for less expensive and more readily available diagnostic approaches (the sequential diagnostic approach and QUS alone) to provide a cost-effective alternative to diagnosis with DXA-FN. If the sequential approach, for example, were not cost-effective compared to DXA-FN in this simple approach, it would not be likely to be more cost-effective with a more complex model. These results support additional research to estimate differences in cost per QALY among these diagnostic alternatives.
We have made an assumption about the treatment-related reduction of fracture rates following diagnosis of osteoporosis with QUS/BUA. To date, clinical trials that have assessed the impact of reduction in risk of fracture have all used diagnosis with DXA-FN. One may question whether a woman diagnosed using either the sequential approach or QUS alone would experience the same reduction in fracture risk as a woman diagnosed with DXA. For diagnosis with the sequential approach, this should not be an issue. All women diagnosed by the sequential approach as osteoporotic will have both a QUS/BUA value less than the specified point and a DXA T-score less than -2.5. Since QUS/BUA has been shown to be a statistically significant risk factor for fracture rates in a multivariable model containing DXA-FN T-scores,3 the fracture risk in this group would be expected to be at least as high as that for women with a DXA-FN T-score less than -2.5 alone. To the extent that women identified as osteoporotic by the sequential approach have a higher fracture risk than women diagnosed with DXA alone, our analyses are conservative with respect to the sequential diagnostic approach. That is, the true fracture rates in women diagnosed as osteoporotic by the sequential approach will be higher than we have used, meaning that the corresponding incremental cost-effectiveness ratios will be lower than those reported here. For diagnosis with QUS alone, however, no clinical trials support similar reductions in hip fracture risk as following diagnosis with DXA alone. The sensitivity of our results to potential reduction in risk reduction following diagnosis with QUS alone suggests that further research is needed to determine if such as reduction in risk occurs using diagnosis with QUS alone.
Our economic analyses have proved robust to the reductions in hip fracture risk following any of the treatment regimens. That is, the order of the cost-effectiveness ratios did not change across the sensitivity analyses of treatment-related risk reductions. However, the magnitude of the incremental cost-effectiveness ratios decreases as the treatment-related risk reduction increases. Similarly, the sensitivity analyses for costs of care for fractures did not change the order of the alternatives. As fracture costs increase, the incremental cost-effectiveness ratios decrease, although changes are relatively minor over the range of fracture costs considered. Finally, when the sensitivity analyses are performed for the proportion of fractures (across all women), the order of alternatives does not change. As the rate of fractures increases, the incremental cost-effectiveness ratios decrease.
One aspect that complicates our analyses is the lack of an agreed-upon cut point for QUS/BUA. That is, the literature does not provide a recommendation for a QUS/BUA cut point that will either indicate sufficient risk to either require treatment or warrant further diagnosis using DXA. The EPIDOS report used a cut point that approximated the median of the observed distribution of QUS/BUA. Using the SOF data, if we were to define sensitivity and specificity in terms of fracture outcomes and plot a receiver operating characteristic (ROC) curve, an optimal QUS/BUA cut point would not be obvious. Thus, we have performed the economic analyses for each of a series of cut points (from 50 to 85 dB/MHz with increases of 5 dB/MHz). We have made no attempt to select one QUS/BUA cut point. As the cut point increases, so do the numbers of fractures prevented, the numbers of patients diagnosed as osteoporotic, and the total costs. Additional research to determine QUS/BUA cut points would allow economic evaluation to be more focused.
The two cohorts, EPIDOS and SOF, differ, especially in the proportion of women with DXA-FN T-scores of -2.5 or less. The EPIDOS cohort has older women with almost 50 percent diagnosed as osteoporotic using DXA-FN. Only 28 percent of the SOF cohort is osteoporotic by DXA-FN. Also, the EPIDOS results are very robust across scenarios whereas SOF results are sensitive to the QUS/BUA cut point used and to several other variables as well.
The incremental cost per fracture prevented as estimated by our model appears to be very high, with most much higher than $100,000 per hip fracture prevented for most scenarios. There are at least two reasons why these values may be inflated. First, we used a relatively short time horizon of 2 years for this evaluation. This period corresponds to the mean follow-up time in both cohorts simplifying the analyses. However, the time in which to accrue benefits is relatively short, even for women in this elderly population. If fractures over remaining lifetime were modeled, the cost per fracture prevented would likely decrease, perhaps substantially. Second, this evaluation has ignored other risk factors. Use of additional risk factors might reduce the magnitude of these cost-effectiveness ratios by better identifying women at high risk of hip fractures. Thus, a more complex economic model that included other clinical risk factors would provide more definitive comparisons among these alternatives.
Several areas of future research needs have been mentioned so far in this section. Studies are needed to choose cut points for QUS/BUA to either define women at high risk of fracture or define women who should receive further diagnostic testing with DXA. The SOF cohort may provide some guidance to address this question, but additional studies are likely to be needed also. Second, a more detailed economic evaluation should be performed. A future economic evaluation should, at a minimum, use QALY as an outcome measure, include lifetime costs and benefits, and target other age groups in addition to the elderly. Also, if such an economic evaluation can include other risk factors, the cost per hip fracture prevented for these diagnostic approaches is likely to decrease. Third, research should evaluate whether or not the treatment-related reduction in risk of hip fracture is the same in a woman diagnosed with QUS alone as in one diagnosed with DXA-FN alone. Fourth, a randomized clinical trial that compares a sequential diagnostic approach to diagnosis with DXA-FN alone would be useful. Women would be randomly allocated to one or the other of the two diagnostic approaches and treated if diagnosed to be at high risk. This study would require agreed-upon QUS/BUA cut points.
Overall, the sequential diagnostic approach may provide a cost-effective alternative to diagnosis with DXA-FN alone. Further, in a population with a much higher overall fracture risk than SOF, QUS alone may also be cost-effective. However, the results for diagnosis with QUS alone are sensitive to any reduction in efficacy following diagnosis with QUS. As diagnostic tools such as QUS become more readily available in the primary care setting, research that provides guidance to clinicians as to the best use of such tools is required. Based on the conclusion of the simple model presented here, use of QUS either for diagnosis of osteoporosis or to determine which women should receive DXA-FN may be appropriate and cost-effective.
Heidi D. Nelson, M.D., M.P.H.
Associate Professor of Medicine and Medical Informatics & Outcomes Research
Oregon Health & Science University
Portland, Oregon
Mark Helfand, M.D., M.P.H.
Evidence-based Practice Center Director
Associate Professor of Medicine and Medical Informatics & Outcomes Research
Oregon Health & Science University
Portland, Oregon
Margaret M. Nygren, M.A.
Division of Medical Informatics & Outcomes Research
Oregon Health & Science University
Portland, Oregon
Cynthia D. Morris, Ph.D., M.P.H.
Professor of Medicine and Medical Informatics & Outcomes Research
Oregon Health & Science University
Portland, Oregon
Nancy Carney, Ph.D.
Assistant Professor of Emergency Medicine
Oregon Health & Science University
Portland, Oregon
Dale F. Kraemer, Ph.D.
Assistant Professor of Medical Informatics & Outcomes Research
Oregon Health & Science University
Portland, Oregon
Susan Mahon, M.P.H.
Division of Medical Informatics & Outcomes Research
Oregon Health & Science University
Portland, Oregon
Kathryn Pyle Krages, AMLS, M.A.
Division of Medical Informatics & Outcomes Research
Oregon Health & Science University
Portland, Oregon
Piper Hackett, B.S.
Division of Medical Informatics & Outcomes Research
Oregon Health & Science University
Portland, Oregon
Patty Davies, M.S.
OHSU Library
Oregon Health & Science University
Portland, Oregon
Alix Seif, M.A.
Division of Medical Informatics & Outcomes Research
Oregon Health & Science University
Portland, Oregon
Connie Levesque, B.S.
Division of Medical Informatics & Outcomes Research
Oregon Health & Science University
Portland, Oregon
Eric Orwoll, M.D.
Professor of Medicine
Head, Bone, and Mineral Section
Division of Endocrinology
Oregon Health & Science University
Portland, Oregon
Melinda Lee, M.D.
ElderPlace in Cully
Portland, Oregon
Elizabeth Eckstrom, M.D., M.P.H.
Department of Medicine
Good Samaritan Hospital
Portland, Oregon
Peter I. Juhn, M.D., M.P.H.
Executive Director, Care Management Institute
Kaiser Permanente
Oakland, California
Louise Pyle
Tigard, Oregon
Lauren Rykert, OTR/L
Occupational Therapist
Portland, Oregon
Steven R. Cummings, M.D.
Professor of Medicine, Epidemiology, and Biostatistics
Assistant Dean for Clinical Research
Department of Medicine
University of California, San Francisco
San Francisco, California
C. Conrad Johnston, Jr., M.D.
Distinguished Professor
Indiana University School of Medicine
Indianapolis, Indiana
L. Joseph Melton III, M.D.
Michael M. Eisenberg Professor
Department of Health Services Research
Mayo Clinic and Mayo Foundation
Rochester, Minnesota
Douglas C. Bauer, M.D.
Assistant Professor of Medicine
University of California, San Francisco
San Francisco, California
Cathleen Colon-Emeric, M.D.
Duke University Medical Center
Durham, North Carolina
(Representing the American Geriatrics Society)
Latha Dulip-Singh, M.D.
Bone Unit
Department of Medicine
New Britain General Hospital
New Britain, Connecticut
(Representing the American College of Physicians)
Ken Faulkner, Ph.D.
Director of Osteoporosis Research
Synarc
Portland, Oregon
Murray J. Favus, M.D.
University of Chicago
Chicago, Illinois
Patrick Garnero, Ph.D.
Synarc
Lyon, France
Evan C. Hadley, M.D.
Associate Director (Geriatrics)
National Institute on Aging
National Institutes of Health
Bethesda, Maryland
C. Conrad Johnston, Jr., M.D.
Distinguished Professor
School of Medicine
Indiana University
Indianapolis, Indiana
John Kanis, M.D.
Center for Metabolic Bone Diseases
Department of Human Metabolism and Clinical Biochemistry
Medical School
University of Sheffield
Sheffield, United Kingdom
Kenneth Lyles, M.D.
Associate Professor of Medicine
Duke University
Durham, North Carolina
(Representing the American Geriatrics Society)
Saralyn Mark, M.D.
Senior Medical Advisor
Office on Women's Health
Department of Health and Human Services
National Aeronautics and Space Administration
Washington, DC
Joan A. McGowan, Ph.D.
Chief
Musculoskeletal Diseases Branch
National Institute of Arthritis and Musculoskeletal and Skin Diseases
National Institutes of Health
Bethesda, Maryland
L. Joseph Melton III, M.D.
Michael M. Eisenberg Professor
Department of Health & Science Research
Mayo Clinic
Rochester, Minnesota
Kenneth Mickelson, Ph.D.
Senior Staff Scientist
Bayer Diagnostics, Research and Development
Tarrytown, New York
Isaac Mizrahi, Ph.D.
Manager
Clinical and Regulatory Affairs (Skeletal Diagnostics)
Beckman Coulter, Inc.
San Diego, California
Joyce M. Paucek
Marketing Director
Norland Medical Systems, Inc.
Fort Atkinson, Wisconsin
William Sacks, Ph.D., M.D.
Medical Officer
Radiological Devices Branch
Office of Device Evaluation
Division of Reproductive, Abdominal, and Radiological Devices
Center for Devices and Radiological Health
U.S. Food and Drug Administration
Washington, DC
Ronald B. Staron, M.D.
Department of Radiology
Columbia University
New York, New York
(Representing the American College of Radiology)
Steven M. Teutsch, M.D., M.P.H.
Senior Director
Outcomes Research & Management
U.S. Human Health
Merck & Co., Inc.
West Point, Pennsylvania
Anna Tosteson, Sc.D.
Associate Professor of Medicine and Associate Professor of Community and Family Medicine
Dartmouth-Hitchcock Medical Center
Lebanon, New Hampshire
Eric von Stetten, Ph.D.
Scientific Director
Hologic, Inc.
Bedford, Massachusetts
Marie D. Zinninger
Associate Executive Director
American College of Radiology
Reston, Virginia
1 exp osteoporosis
osteoporosis, postmenopausal
2 bone density
3 1 or 2
4 exp risk
| logistic models | risk assessment |
| risk factors |
5 3 and 4
6 exp transplantation
| cell transplantation | transplantation, autologous |
| organ transplantation | transplantation, heterologous |
| replantation | transplantation, heterotopic |
| tissue transplantation | transplantation, homologous |
7 exp kidney failure
| kidney failure, acute | diabetes insipidus, nephrogenic |
| kidney failure, chronic |
8 su.fs. (Surgery as a subheading anywhere in the article)
9 athletic injuries
10 exp sports
| baseball | basketball | bicycling |
| boxing | golf | football |
| gymnastics | hockey | mountaineering |
| racquetball | running | martial arts |
| skating | skiing | track and field |
| soccer | swimming | weight lifting |
| walking | wrestling |
11 exp fractures/dt,su,th (limited to surgery and other therapies)
| femoral fractures | fractures, closed |
| fractures, comminuted | fractures, malunited |
| fractures, open | fractures, spontaneous |
| fractures, stress | fractures, ununited |
| humeral fractures | radius fractures |
| rib fractures | shoulder fractures |
| skull fractures | spinal fractures |
| tibial fractures | ulna fractures |
12 orthopedic$.mp. (As a textword anywhere)
13 6 or 7 or 8 or 9 or 10 or 11 or 12
14 5 not 13 (statements 6 through 12 were excluded from the study)
15 limit 13 to female
16 limit 14 to human
17 limit 15 to english language
18 looked at english abstracts of foreign articles
19 exp fractures or exp osteoporosis or bone density
20 exp risk (terms as in 4)
21 19 and 20
22 exp cohort studies
longitudional studies
follow-up
studies
prospective studies
23 meta-analysis
24 exp case control studies
retrospective studies
25 predictive value of tests
26 evidence-based medicine
27 22 or 23 or 24 or 25 or 26
28 21 and 27
29 limit 28 to human
30 limit 29 to english language
1 exp osteoporosis
osteoporosis, postmenopausal
2 bone density
3 1 or 2
4 densitometry, x-ray
5 exp ultrasonography
| echocardiography | ultrasonography, doppler |
| echoencephalography | ultrasonography, interventional |
| endosonography | ultrasonography, mammary |
6 calcaneous/us (us = ultrasonics)
7 (dxa or sxa or bua or qct or qus or mxa or mrx or ra or dip or sos or ubps or spa or dpa).tw.
8 exp osteoporosis/us (us = ultrasonics)
9 4 or 5 or 6 or 7 or 8
10 3 and 9
11 limit 10 to human
12 limit 11 to english language
13 looked at english abstracts of foreign articles
1 exp osteoporosis/
osteoporosis, postmenopausal
2 bone density/
3 1 or 2
4 monitoring
5 densitometry, x-ray/
6 exp ultrasonography/
| echocardiography | ultrasonography, doppler |
| echoencephalography | ultrasonography, interventional |
| endosonography | ultrasonography, mammary |
7 calcaneus/us
8 4 or 5 or 6 or 7
9 3 and 8
10 exp osteoporosis/dh,dt,th
11 9 and 10
12 limit 11 to human
1 exp osteoporosis
osteoporosis, postmenopausal
2 bone density
3 1 or 2
4 exp biological markers
genetic markers
5 marker$.tw. (markers as a word in the title or abstract)
6 (bap or ctx or dpd or ntx or pyd).tw. (marker abbrev. in title and abstract)
7 4 or 5 or 6
8 3 and 7
9 limit 8 to human
10 limit 9 to english language
11 limit 10 to female
12 looked at english abstracts of foreign articles
1 exp osteoporosis
2 ec.fs. (ec=economics)
3 exp costs and cost analysis
| cost allocation | cost-benefit analysis |
| cost control | cost of illness |
| cost sharing | health care costs |
| health expenditures |
4 exp economics
| costs and cost analysis | economic competition |
| economic value of life | economics, dental |
| economics, hospital | economics, medical |
| economics, nursing | economics, pharmaceutical |
| fees and charges | financial management |
| financial support | financing, organized |
| financing, personal | health care sector |
| inflation, economic | investments |
| medical indigency | raxes |
5 2 or 3 or 4
6 1 and 5
7 limit 6 to english language
8 looked at english abstracts of foreign articles
The Methods Work Group for the Third U.S. Preventive Services Task Force (USPSTF) developed a set of criteria by which the quality of individual studies could be evaluated. At its September 1999 quarterly meetings, the USPSTF accepted the criteria, and definitions of quality categories relating to internal validity.
Presented below are a set of minimal criteria for each study design and a general definition of three categories -- "good," "fair," and "poor." These specifications are not meant to be rigid rules but rather are intended to be general guidelines, and individual exceptions, when explicitly explained and justified, can be made. In general, a "good" study is one that meets all criteria well. A "fair" study is one that does not meet (or it is not clear that it meets) at least one criterion but has no major limitations. "Poor" studies have at least one major limitation.
Criteria:
Comprehensiveness of sources considered/search strategy used
Standard appraisal of included studies
Validity of conclusions
Recency and relevance
Good:
Recent, relevant review with comprehensive sources and search strategies; explicit and relevant selection criteria; standard appraisal of included studies; and valid conclusions.
Fair:
Recent, relevant review that is not clearly biased but lacks comprehensive sources and search strategies.
Poor:
Outdated, irrelevant, or biased review without systematic search for studies, explicit selection criteria, or standard appraisal of studies.
Criteria:
Accurate ascertainment of cases
Nonbiased selection of cases/controls with exclusion criteria applied equally to both
High response rate
Diagnostic testing procedures applied equally to each group
Measurement of exposure accurate and applied equally to each group
Appropriate attention to potential confounding variable
Good:
Appropriate ascertainment of cases and nonbiased selection of case and control participants; exclusion criteria applied equally to cases and controls; response rate equal to or greater than 80 percent; diagnostic procedures and measurements accurate and applied equally to cases and controls; and appropriate attention to confounding variables.
Fair:
Recent, relevant, without major apparent selection or diagnostic work-up bias but with response rate less than 80 percent or attention to some but not all important confounding variables.
Poor:
Major selection or diagnostic work-up biases, response rates less than 50 percent, or inattention to confounding variables.
Criteria:
Initial assembly of comparable groups
-- for RCTs:
adequate randomization, including first concealment and whether
potential confounders were distributed equally among
groups
-- for cohort studies: consideration of
potential confounders with either restriction or measurement for
adjustment in the analysis; consideration of inception
cohorts
Maintenance of comparable groups (includes attrition, crossovers, adherence, contamination)
Important differential loss to followup or overall high loss to followup
Measurements: equal, reliable, and valid (includes masking of outcome assessment)
Clear definition of interventions
Important outcomes considered
Analysis: adjustment for potential confounders for cohort studies, or intention to treat analysis for RCTs.
Good:
Meets all criteria: comparable groups are assembled initially and maintained throughout the study (followup at least 80 percent); reliable and valid measurement instruments are used and applied equally to the groups; interventions are spelled out clearly; important outcomes are considered; and appropriate attention to confounders in analysis. In addition, for RCTs, intention to treat analysis is used.
Fair:
Studies will be graded "fair" if any or all of the following problems occur, without the major limitations noted in the "poor" category below: generally comparable groups are assembled initially but some question remains whether some (although not major) differences occurred in followup; measurement instruments are acceptable (although not the best) and generally applied equally; some but not all important outcomes are considered; and some but not all potential confounders are accounted for. Intention-to-treat analysis is done for RCTS.
Poor:
Studies will be graded "poor" if any of the following major limitations exists: groups assembled initially are not close to being comparable or maintained throughout the study; unreliable or invalid measurement instruments are used or not applied at all equally among groups (including not masking outcome assessment); and key confounders are given little or no attention. For RCTs, intention-to-treat analysis is lacking.
Criteria:
Screening test relevant, available for primary care, adequately described
Study uses a credible reference standard, performed regardless of test results
Reference standard interpreted independently of screening test
Handles indeterminate results in a reasonable manner
Spectrum of patients included in study
Large sample size
Administration of reliable screening test
Good:
Evaluates relevant available screening test; uses a credible reference standard; interprets reference standard independently of screening test; reliability of test assessed; has few or handles indeterminate results in a reasonable manner; includes large number (more than 100) broad-spectrum patients with and without disease.
Fair:
Evaluates relevant available screening test; uses reasonable although not best standard; interprets reference standard independent of screening test; moderate sample size (50 to 100 subjects) and a "medium" spectrum of patients.
Poor:
Has major limitation such as: uses inappropriate reference standard; screening test improperly administered; biased ascertainment of reference standard; very small sample size of very narrow selected spectrum of patients.
| Chemistry Battery | ||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Organization, Author, or Reference | Title | Year | Audience | Development | Population | Statement of Philosophy | Secondary Disorders | Complete blood count | Erythrocyte sedimentation rate | Urinalysis | Urine pH | Standard chemistry battery | Calcium | Phosphorus | Alkaline phosphatatase | Blood urea nitro-gen | Albumin | Liver enzymes | Electrolytes | Protein | Creatinine | Protein electrophoresis | Urinary calcium | 24-hour creatinine clearance | Ionized calcium | Parathyroid hormone | 25 (OH) vitamin D | 1,25(OH)2 vitamin D | Thyroid function tests | Cortisol | Testosterone | Folicle stimulating hormone | Bone-specific alkaline phos-phatase | Urine pyridinoline crosslinks | Urine N-telopeptide | Osteocalcin | Bone biopsy | Miscellaneous |
| Practice Guidelines | ||||||||||||||||||||||||||||||||||||||
| National Osteoporosis Foundation454 | Physician's guide to prevention and treatment of osteoporosis | 1999 | Physicians reviewed and developed in collaboration with 9 professional societies | Evidence-based review | Women, primary care (?) | "Be alert to see causes of osteoporosis, which include a broad range of disease states and therapeutic drugs. Limited biochemical testing may be required in some cases." | Acromegaly, adrenal atrophy and Addison's disease, amyloidosis, chronic obstructive pulmonary disease, congenital porphyria, Cushing's syndrome, endometriosis, epidermolysis bullosa, gastrectomy, gonadal insufficiency, hemochromatosis, hemophilia, hyperparathyroidism, hypophosphatasia, idiopathic scoliosis, insulin-dependent diabetes mellitus, lymphoma and leukemia, malabsorption syndromes, mastocytosis, multiple myeloma, multiple sclerosis, nutritional disorders, osteogenesis imperfecta, parenteral nutrition, pernicious anemia, rheumatoid arthritis, sarcoidosis, severe liver disease, thalassemia, thyrotoxicosis, tumor secretion of parathyroid hormone-related peptide | Limited | Limited | Limited | Limited | Limited | Limited | |||||||||||||||||||||||||
| Foundation for Osteoporosis Research and Education456 | Section III: Guidelines of care on osteoporosis (sic) for primary care physicians | 1997 | Primary care physicians | Expert opinion | Men and women in primary care | "…the work up of secondary causes of osteoporosis can be extensive…" | Multiple myeloma, vitamin D deficiency, hyperparathyroidism, hypercalcemia, bowel disorders, renal tubular acidosis | Additional initial study | Depending on individual aspects of case | Routine | Routine | Routine | Routine | Routine | Routine | Depending on individual aspects of case | Depending on individual aspects of case | Depending on individual aspects of case | Depending on individual aspects of case | Depending on individual aspects of case | Depending on individual aspects of case | Depending on individual aspects of case | Additional initial study | Depending on individual aspects of case | Depending on individual aspects of case | Depending on individual aspects of case | Depending on individual aspects of case | Antigliadin antibodies, anti-endomysial antibodies may be necessary in some patients | ||||||||||
| American Association of Clinical Endocrinologists456 | AACE clinical practice guidelines for the prevention and treatment of postmenopausal osteoporosis | 1996 | Endocrinologists | Expert opinion | Postmenopausal women | "Many patients with osteoporosis take medications or have coexisting diseases that cause or aggravate bone loss, and primary postmenopausal osteoporosis cannot be diagnosed or appropriately treated until these possibilities are eliminated." | Hypogonadism, hyperadrenocortism, thyrotoxicosis, anorexia nervosa, hyperprolactinemia, systemic mastocytosis, porphyria, hypophosphatemia in adults, type I diabetes mellitus, thalassemia, pregnancy, hyperparathyroidism, acromegaly, malabsorption syndromes and malnutrition, chronic liver disease, gastric operations, vitamin D deficiency, calcium deficiency, alcoholism, osteogensis imperfecta, homocystinuria due to cystathione deficiency, Ehlers-Danlos syndrome, Marfan syndrome, rheumatoid arthritis, myeloma and some cancers, immobilization, renal tubular acidosis, hypercalciuria | Routine | Depending on individual aspects of case | Routine | Routine | Routine | Routine | Routine | Routine | Routine | Depending on individual aspects of case | Depending on individual aspects of case | Depending on individual aspects of case | Depending on individual aspects of case | Depending on individual aspects of case | Depending on individual aspects of case | If other causes for bone loss are suspected, additional evaluations may include acid-base studies, dexamethasone suppression, or bone marrow exam. Undecalcified iliac bone biopsy with double tetracycline labeling may be considered when there is no response to therapy or no apparent cause for osteoporosis. | |||||||||||||||
| American College of Rheumatology Task Force on Osteoporosis Guidelines457 | Recommendations for the prevention and treatment of glucocorticoid-induced osteoporosis | 1996 | Patients of rheumatologists on long-term glucocorticoid therapy | Expert opinion | Men and women receiving moderate to high dose glucocorticoids with or without osteoporotic fracture | "Laboratory tests should focus on identifying secondary causes of osteoporosis and assessing urinary calcium excretion." | Hyperparathyroidism, osteomalacia, thyroid disease, over supplementation with thyroid medication, renal osteodystrophy syndromes, multiple myeloma | Routine | Routine | Routine | Routine | Routine | Routine | Routine | Routine | Routine | Measure if total protein or serum globulin elevated | Routine | Measure only if serum calcium high, serum phosphorus low, urinary calcium low | Routine, and measure if serum calcium low, serum phosphorus low, urinary calcium low | If patient on thyroid replacement | Routine | Routinely measure leutenizing hormone in women | |||||||||||||||
| Scientific Advisory Board, Osteoporosis Society of Canada458 | Clinical practice guidelines for the diagnosis and management of osteoporosis | 1996 | Primary care | Evidence-based review, guidelines tested through focus groups | Men and women in primary care | "The laboratory investigations shown … should be used to exclude secondary causes of osteoporosis." | Hyperparathyroidism, metastatic cancer, multiple myeloma, osteomalacia, liver disease, renal impairment | Routine | Routine | Routine | Routine | Routine | ||||||||||||||||||||||||||
| European Foundation for Osteoporosis and Bone Disease459 | Guidelines for diagnosis and management of osteoporosis | 1997 | Primary care physicians | Expert opinion | Men and women in primary care | "The range of clinical and biological tests will depend on the severity of the disease, the age at presentation, and the presence of vertebral fractures." | Anorexia nervosa, malabsorption due to gastrointestinal and hepatobiliary diseases, primary hyperparathyrodism, thyrotoxicosis, primary hypogonadism, prolactinoma, hypercortisolism, osteogenesis imperfecta, rheumatoid arthritis, chronic obstructive lung disease, chronic neurological disorders, chronic renal failure, mastocytosis, Type I diabetes, post-transplantation, hyperthyroidism, Cushing's syndrome | Routine | Routine | Routine | Routine | Routine | Routine | Routine | Routine | Routine | Optional | Optional | Optional | Optional | Routine in men only | Optional | Optional | Optional | Optional | Bone marrow, cancer markers, gonadotrophins are optional. | ||||||||||||
| Advisory Group on Osteoporosis, Royal College of Physicians460 | Summary and recommendations of the report "Osteoporosis -- clinical guidelines for the prevention and treatment" | 1999 | Primary care | Evidence-based review | Men and women in primary care | "Diagnostic assessment of individuals with osteoporosis should include not only the assessment of BMD … but also the exclusion of diseases that mimic osteoporosis and the management of any associated morbidity." | Hyperparathyroidism, renal osteodystrophy, osteomalacia, alcohol abuse, multiple myeloma, hyperthyroidism, hypothyroidism | If elevated plasma electrophoresis | Routine | Routine | Routine | Routine | Routine | Routine | Routine | |||||||||||||||||||||||
| Society of Obstetricians and Gynecologists of Canada461 | Canadian consensus conference on menopause: osteoporosis | 1998 | Obstetrics/Gynecology | Expert opinion | Women | "Laboratory investigations should be carried out in every patient with established osteoporosis to exclude secondary causes of osteoporosis." | Hyperparathyroidism, metastatic cancer, multiple myeloma, osteomalacia, liver disease, renal impairment | Routine | Routine | Routine | Routine | |||||||||||||||||||||||||||
| Australian National Consensus Conference462 | The prevention and management of osteoporosis | 1997 | Primary care | Expert opinion | Men and women in primary care | "The extent to which investigations are necessary is determined by the clinical picture, but the index of suspicion for secondary causes of osteoporosis should increase in patients with severe bone loss for age (e.g. bone density lower than 2.0 standard deviations below the age-related mean)." | Vitamin D deficiency, hyperthyroidism, hypogonadism, hyperparathyroidism, alcoholism, liver disease, multiple myeloma | Routine | Routine | Routine | Routine | Depending on individual aspects of case | Routine | Depending on individual aspects of case | Depending on individual aspects of case | Depending on individual aspects of case | Depending on individual aspects of case | If hypo-gonadism is suspected | If hypogonadism is suspected | Sex steroids if hypogonadism is suspected | ||||||||||||||||||
| University of California at San Diego463 | UCSD Medical Group osteoporosis guidelines | 1998 | Uncertain | Peri- and postmenopausal women | If T score <-2.5 SD on DEXA, then conduct additional workup. | Malignancy, myeloma, Cushing's disease, hyperthyroidism, hyperparathyroidism, osteomalacia, hypogonadism | Consider | Consider | Routine if T-score <-2.5 SD on DEXA | Routine if T-score <-2.5 SD on DEXA | Routine if T-score <-2.5 SD on DEXA | Routine if T-score <-2.5 SD on DEXA | Routine if T-score <-2.5 SD on DEXA | Consider | ||||||||||||||||||||||||
| MR McClung464Osteoporosis | Nonpharmacologic management of osteoporosis | 1994 | Uncertain | Expert opinion | Men and women suspected of or referred for treatment of osteoporosis | "In patients found to have osteoporosis, correctable factors that may contribute to bone loss and skeletal fragility must be identified." | Endogenous or exogenous hyperthyroidism and cortisol excess, hyperparathyroidism, androgen deficiency, chronic liver or renal insufficiency, multiple myeloma, other hematopoietic malignancies, alterations in vitamin D metabolism and calcium balance, renal calcium wasting | Routine | Routine | "Laboratory screening for specific medical problems that may cause bone loss depends on the comfort of the clinician with the clinical recognition of these abnormalities." | ||||||||||||||||||||||||||||
| PD Miller465Adv Intern Med | Management of osteoporosis | 1999 | Primary care | Expert opinion | "A substantial number of patients with low bone mass, who on the surface appear to have postmenopausal osteoporosis, need workups for secondary causes of bone loss. The intensity of the workup should be a clinical judgement based on the individual patient's history and physical examination." | Hypogonadism, osteoporosis of pregnancy, multiple myeloma, hyperthyroidism, hyperparathyroidism, mastocytosis, chronic renal failure, homocystinuria, osteogenesis imperfecta, inflammatory bowel disorders, celiac disease | Routine | Routine | May be helpful in individual cases | May be helpful in individual cases | May be helpful in individual cases | May be helpful in individual cases | Routine | May be helpful in some cases | May be helpful in individual cases | May be helpful in individual cases | May be helpful in individual cases | The following may be helpful in individual cases: serum estradiol, and prolactin; antigliadin and aniendomesial antibodies; small intestinal biopsy; 24-hour urine phosphorus; collagen amino acid sequencing or fibroblast (skin) culture. | ||||||||||||||||||||
| A Juby466Can Fam Physician | Managing elderly people's osteoporosis | 1999 | Family practice | Evidence-based review | Elderly men and women seen by family practitioners | "Having decided who is most likely to be at risk of osteoporosis leads to the important issue of differentiating primary from secondary causes of osteoporosis." "…it is important to be aware of secondary causes of osteoporosis, particularly for patients who do not seem to be responding to therapy as predicted." | Diabetes mellitus, hypogonadism, hyperthyroidism, hyperprolactinemia, hyperparathyroidism, hypercortisolism, multiple myeloma, rheumatoid arthritis, immobilization, osteomalacia, chronic renal failure, systemic mastocytosis, hepatic disease, osteogenesis imperfecta, scurvy, malabsorption syndrome, sarcoidosis, chronic obstructive lung disease | Routine | Routine | Routine | Routine | Routine | Routine | |||||||||||||||||||||||||
| Osteoporosis: Diagnostic and Therapeutic Principles CJ Rosen,467 ed. | The diagnosis and treatment of postmenopausal osteoporosis | 1996 | Primary care | Expert opinion | Postmenopausal women | Routine | Routine | Routine | Routine | Optional | Routine | Routine | Routine | Consider | ||||||||||||||||||||||||
| Primary Care of Women468 DP Lemcke et al., eds. | Section IV: metabolic and endocrine disorders, osteoporosis DP Lemcke | 1995 | Primary care | Expert opinion | Women, primary care | "In a patient who has documented osteoporosis, a careful history and physical examination and several laboratory tests should be performed to rule out secondary causes of osteoporosis." | Endocrinopathies: hypogonadism, hyperthyroidism, hyperparathyroidism, Cushing's syndrome, hyperprolactinemia, acromegaly. Gastrointestinal diseases: malabsorption syndromes, chronic obstructive jaundice, primary biliary cirrhosis, subtotal gastrectomy | Routine | Routine | Routine | If indicated from initial lab tests | Routine | If indicated from initial lab tests | If indicated from initial lab tests | If indicated from initial lab tests | |||||||||||||||||||||||
| Primary Care of Women469 KJ Carlson et al., eds. | Section III: Endocrinology, Osteoporosis DM Slovik | 1995 | Primary care | Expert opinion | Women, primary care | "A detailed history and physical examination is necessary to identify risk factors and the secondary causes of osteoporosis." | Routine | Routine | Routine | Routine | Helpful | Helpful | If hyperpara-thyroidism is suspected | If vitamin D deficiency is suspected | Routine | |||||||||||||||||||||||
| A Clinical Guide for the Care of Older Women470 RL Byyny, L Speroff, eds. | "Osteoporosis" | 1996 | Primary care | Expert opinion | Older women | "Patients with osteoporosis should be screened for other conditions that lead to osteoporosis." | Hyperparathyroidism, chronic renal failure, multiple myeloma, leukemia, lymphoma, hyperthyroidism, alcohol abuse, metastatic cancer, hypercortosolism | Routine | Routine | Routine | Routine | Routine | Routine | Routine | Routine | Routine | Routine | When indicated by exam | ||||||||||||||||||||
| Harrison's Principles of Internal Medicine471 AS Fauci et al., eds. | Chapter 355. Metabolic bone disease SM Krane et al. | 1998 | Primary care | Expert opinion | Men and women | "Since decrease in skeletal mass is a universal feature of aging, it is difficult to evaluate asymptomatic decreased bone density, determined radiographically, in older women, especially when not accompanied by marked increase in biconcavity of vertebral bodies or fractures." | Hypogonadism, hyperadrenocorticism, hyperparathyroidism, thyrotoxicosis, malabsorption, scurvy, calcium deficiency, immobilization, systemic mastocytosis, adult hypophosphatasia, metabolic bone disease, osteogenesis imperfecta, homocystinuria due to cystathione synthase deficiency, Ehlers-Danlos syndrome, Marfan's syndrome, rheumatoid arthritis, malnutrition, alcoholism, epilepsy, primary biliary cirrhosis, chronic obstructive pulmonary disease, Menkes' syndrome | No specific recommendations for testing are made. | ||||||||||||||||||||||||||||||
| Manual of Medical Therapeutics472 GA Ewald, CR McKenzie, eds. | Chapter 23. Mineral and metabolic bone disease, S Dagogo-Jack et al. | 1995 | Primary care | Expert opinion | Men and women | "Diagnosis of primary osteoporosis requires exclusion of secondary forms of osteoporosis and other causes of osteopenia. The history, physical examination, and a few basic laboratory tests are usually diagnostic." | Cushing's syndrome, hyperthyroidism, hypogonadism in men, immobilization, osteogenesis and related disorders, primary hyperparathyroidism, osteomalacia, myeloma, mastocytosis, renal osteodystrophy | Routine | Routine | Routine | Routine | Routine | If usual features present | |||||||||||||||||||||||||
| Internal Medicine473 JH Stein, ed. | Chapter 181. Osteoporosis LG Reisz | 1998 | Primary care | Expert opinion | Men and women | "In the typical case of a postmenopausal woman with a crush fracture, a careful history and physical examination and limited laboratory studies are usually sufficient to rule out diseases that mimic or aggravate primary osteoporosis." "Routine laboratory studies in otherwise healthy patients with low bone mineral density should include a measurement of serum calcium." | Mainly primary hyperparathyroidism, osteomalacia, hyperthyroidism, multiple myeloma, and Cushing's syndrome. Gastrointestinal or renal disease predispose to osteomalacia or secondary hyperparathyroidism as well as osteoporosis | Routine | Routine | Routine | Routine | Routine | Routine | Routine if history or exam suggestive of vitamin D deficiency | Routine | Routine if signs of Cushing's syndrome present | Can be used | Can be used | Can be used | May be indicated in severe or atypical cases | ||||||||||||||||||
| Cecil Essentials of Medicine474 TE Andreoli et al., eds. | "Osteoporosis" | 1997 | Primary care | Expert opinion | Men and women in primary care | "Secondary causes of osteoporosis should be sought in patients with an established diagnosis of osteoporosis, particularly when the bone density is significantly lower than that of age- and sex-matched individuals." | Endocrine disease: hyperprolactinemia, hypothalamic amenorrhea, anorexia nervosa, premature and primary ovarian failure, primary and secondary gonadal failures, delayed puberty, hyperthyroidism, hyperparathyroidism, hypercortisolism, growth hormone deficiency; Gastrointestinal disease: subtotal gastrectomy, malabsorption syndromes, chronic obstructive jaundice, primary biliary cirrhosis, alactasia; Bone marrow disorders: multiple myeloma, lymphoma, leukemia, hemolytic anemia, systemic mastocytosis, disseminated carcinoma. Connective tissue disease: osteogenesis imperfecta, Ehlers-Danlos syndrome, Marfan's syndrome, homocystinuria. Miscellaneous: immobilization, rheumatoid arthritis | Routine | Routine | Routine | Routine | Routine | Routine | Routine | Routine | Routine | If clinical suspicion exists | Routine | In selected patients | |||||||||||||||||||
| Endocrinology475 LJ De Groot , ed. | Chapter 73. Osteoporosis RM Neer | 1995 | Primary care, endocrinology | Expert opinion | Men and women | "..when osteopenia is present, one can establish whether it is worse than expected for the age and sex of the patient. This information is invaluable to assessing the effects of endocrine and other diseases on the skeleton. It also helps guide the extent (and therefore the expense) of additional laboratory evaluations in patients with osteoporosis." | Primary ovarian failure, gonadotropin deficiency, hypothalamic pituitary dysrhythmia, androgen deficiency, glucocorticoid excess, growth hormone or IGF deficiency, thyroid hormone excess, PTH excess, vitamin D deficiency, anorexia nervosa, exercise-induced amenorrhea, delayed puberty | Specific testing not noted. | ||||||||||||||||||||||||||||||
| Williams Textbook of Endocrinology476 JD Wilson et al., eds. | "Primary osteoporosis" | 1998 | Primary care, endocrinology | Expert opinion | Men and women | "Because primary osteoporosis is a diagnosis of exclusion, secondary forms of bone loss should be excluded. Secondary forms of osteoporosis should be suspected in groups with a low prevalence of the primary form of the disease (i.e. white men, young white women, and black persons of both sexes and all ages)." | Thyrotoxicosis, multiple myeloma, cortisol excess, primary hyperparathyroidism, osteomalacia | Routine | Routine | Routine | Routine | Routine | Routine | Routine | Routine | Routine | In certain cases | |||||||||||||||||||||
| Principles and Practice of Endocrinology and Metabolism477 KL Becker, ed. | Chapter 63. Osteoporosis JF Tohme et al. | 1995 | Primary care, endocrinology | Expert opinion | Men and women | "The diagnosis of osteoporosis is a diagnosis of exclusion. An intensive search should be undertaken to exclude secondary causes of osteoporosis, particularly in patients who are neither postmenopausal women nor elderly." | Cushing's syndrome, hypogonadism, hyperthyroidism, hyperparathyroidism, diabetes mellitus, acromegaly, hyperprolactinemia, osteogenesis imperfecta, homocystinuria, Ehlers-Danlos syndrome, Marfan's syndrome, Menke steely hair disease, Riley-Day syndrome, Gaucher disease and other glycogen storage disorders, sickle-cell anemia, thalassemia, hypophosphatasia, alcohol abuse, multiple myeloma, gastrointestinal disease, chronic liver disease, pregnancy, chronic obstructive pulmonary disease, rheumatoid arthritis, malnutrition, diffuse cancer, systemic mastocytosis, amyloid disease, hemochromatosis | Routine | Routine | Routine | Routine | Routine | Routine | Routine | Routine | Routine | If hypercalcemia, hypercalciuria, or high-normal calcium levels present | In elderly patients, those on antiepileptic drugs, or with intestinal or liver disease | If adrenocortical hyperfunction is suspected | |||||||||||||||||||
| Basic and Clinical Endocrinology478 FS Greenspan, GJ Strewler, eds. | Chapter 17. Mineral metabolism and metabolic bone disease GJ Strewler | 1997 | Primary care | Expert opinion | Men and women in primary care | Hyperthyroidism, diabetes mellitus, osteogenesis imperfecta, Ehlers-Danlos syndrome, homocystinuria, Menkes syndrome, Marfan's syndrome, multiple myeloma, systemic mastocytosis, hypogonadism, hypercortisolism, hyperparthyroidism, malabsorption syndrome, sustotal gastrectomy, obstructive jaundice, biliary cirrhosis, renal tubular acidosis | Routine | Routine | Routine | Routine | Routine | Routine | Routine | Routine | Routine | |||||||||||||||||||||||
AHRQ: Agency for Healthcare Research and Quality
ALP: alkaline phosphatase
ARR: adjusted relative risk
BALP: bone-specific alkaline phosphatase
BMC: bone mineral content
BMD: bone mineral density
BMI: body mass index
BUA: broadband ultrasound attenuation
C/E: cost-effectiveness
CAPE: Clifton Assessment Procedure for the Elderly
CI: confidence interval
CTX: type I collagen cross-linked C-telopeptide
CV: coefficient of variation
D-PYR: deoxypyridinoline
DbMHz: decibels/megahertz
DOM: Diagnostisch Onderzoek Mammacarcinoom
DPA: dual photon absorptiometry
DPX: dual photon X-ray
DXA: dual-energy X-ray absorptiometry
ELISA: enzyme-linked immunosorbent assay
EPIDOS: Epidemiologie de l=Osteoporose
FIT: Fracture Intervention Trial
HMO: health maintenance organization
HOS: Hawaii Osteoporosis Study
HPR: hydroxyproline
HRT: hormone replacement therapy
ICTP: type I collagen carboxyterminal telopeptide
LR: likelihood ratio
MA: Mexican American
NHANES: National Health and Nutrition Examination Survey
NHB: non-Hispanic black
NHW: non-Hispanic white
NSAID: nonsteroidal anti-inflammatory drug
NTX: type I collagen cross-linked N-telopeptide
OFELY: l'Os des Femmes de Lyon
OR: odds ratio
pDXA: peripheral dual-energy X-ray absorptiometry
PICP: carboxyterminal propeptide of type I procollagen
PINP: aminoterminal propeptide of type I collagen
pQCT: peripheral quantitative computed tomography
PYR: pyridinoline
QCT: quantitative computed tomography
QDR: quantitative digital radiography
QMD: quantitative microdensitometry
Free Full text in PMC].
Free Full text in PMC].
Free Full text in PMC]
[PubMed]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC].
Free Full text in PMC].
Free Full text in PMC].
Free Full text in PMC]
Free Full text in PMC]
[PubMed].
Free Full text in PMC].
Free Full text in PMC]
[PubMed].
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC].
Free Full text in PMC]
[PubMed]
Free Full text in PMC].
Free Full text in PMC]
Free Full text in PMC]
[PubMed].
Free Full text in PMC].
Free Full text in PMC].
Free Full text in PMC]
Free Full text in PMC].