NCBI » Bookshelf » Health Services/Technology Assessment Text (HSTAT) » AHRQ Evidence Reports » Evaluation of Technologies for Identifying Acute Cardiac Ischemia in Emergency Departments: Evidence Report/Technology Assessment Number 26
 
hserta
AHRQ Evidence Reports
public health

Chapter  26:  Evaluation of Technologies for Identifying Acute Cardiac Ischemia in Emergency Departments: Evidence Report/Technology Assessment Number 26

A38234

Prepared for:
Agency for Healthcare Research and Quality
U.S. Department of Health and Human Services
2101 East Jefferson Street
Rockville, MD 20852


http://www.ahrq.gov/

Contract No. 290-97-0019

Prepared by:
New England Medical Center Evidence-based Practice Center
Joseph Lau, MD
Principal Investigator
John P.A. Ioannidis, MD
Ethan Balk, MD, MPH
Catherine Milch, MD
Priscilla Chew, MPH
Norma Terrin, Ph.D.
Thomas A. Lang, MA
Deeb Salem, MD
John B. Wong, MD
Investigators

AHRQ Publication No. 01-E006

May 2001

On December 6, 1999, under Public Law 106-129, the Agency for Health Care Policy and Research (AHCPR) was reauthorized and renamed the Agency for Healthcare Research and Quality (AHRQ). The law authorizes AHRQ to continue its research on the cost, quality, and outcomes of health care and expands its role to improve patient safety and address medical errors.

This report may be used, in whole or in part, as the basis for development of clinical practice guidelines and other quality enhancement tools, or a basis for reimbursement and coverage policies. AHRQ or U.S. Department of Health and Human Services endorsement of such derivative products may not be stated or implied.

Prepared for:
Agency for Healthcare Research and Quality
U.S. Department of Health and Human Services
2101 East Jefferson Street
Rockville, MD 20852


http://www.ahrq.gov/

Contract No. 290-97-0019

Prepared by:
New England Medical Center Evidence-based Practice Center
Joseph Lau, MD
Principal Investigator
John P.A. Ioannidis, MD
Ethan Balk, MD, MPH
Catherine Milch, MD
Priscilla Chew, MPH
Norma Terrin, Ph.D.
Thomas A. Lang, MA
Deeb Salem, MD
John B. Wong, MD
Investigators

AHRQ Publication No. 01-E006

May 2001

On December 6, 1999, under Public Law 106-129, the Agency for Health Care Policy and Research (AHCPR) was reauthorized and renamed the Agency for Healthcare Research and Quality (AHRQ). The law authorizes AHRQ to continue its research on the cost, quality, and outcomes of health care and expands its role to improve patient safety and address medical errors.

This report may be used, in whole or in part, as the basis for development of clinical practice guidelines and other quality enhancement tools, or a basis for reimbursement and coverage policies. AHRQ or U.S. Department of Health and Human Services endorsement of such derivative products may not be stated or implied.

Preface

The Agency for Healthcare Research and Quality (AHRQ), through its Evidence-based Practice Centers (EPCs), sponsors the development of evidence reports and technology assessments to assist public- and private-sector organizations in their efforts to improve the quality of health care in the United States. The reports and assessments provide organizations with comprehensive, science-based information on common, costly medical conditions and new health care technologies. The EPCs systematically review the relevant scientific literature on topics assigned to them by AHRQ and conduct additional analyses when appropriate prior to developing their reports and assessments.

To bring the broadest range of experts into the development of evidence reports and health technology assessments, AHRQ encourages the EPCs to form partnerships and enter into collaborations with other medical and research organizations. The EPCs work with these partner organizations to ensure that the evidence reports and technology assessments they produce will become building blocks for health care quality improvement projects throughout the Nation. The reports undergo peer review prior to their release.

AHRQ expects that the EPC evidence reports and technology assessments will inform individual health plans, providers, and purchasers as well as the health care system as a whole by providing important information to help improve health care quality.

We welcome written comments on this evidence report. They may be sent to: Director, Center for Practice and Technology Assessment, Agency for Healthcare Research and Quality, 6010 Executive Blvd., Suite 300, Rockville, MD 20852.

Douglas B. Kamerow, M.D.John M. Eisenberg, M.D.
Director, Center for PracticeDirector
and Technology AssessmentAgency for Healthcare Research
Agency for Healthcare Research and Quality and Quality
and Quality
The authors of this report are responsible for its content. Statements in the report should not be construed as endorsement by the Agency for Healthcare Research and Quality or the U.S. Department of Health and Human Services of a particular drug, device, test, treatment, or other clinical service.

Structured Abstract

Objectives

Acute cardiac ischemia (ACI) includes unstable angina pectoris (UAP) and acute myocardial infarction (AMI) and is the leading cause of death in the United States. The rapid and accurate diagnosis of ACI has substantial clinical and economic benefits. We have updated a 1997 report that evaluated the diagnostic technologies for identifying patients with ACI in the emergency department (ED), where most patients with ACI enter the health care system.

Search Strategy

We conducted a comprehensive MEDLINE search of the English-language literature published between 1966 and December 1998. Additional articles were retrieved from references cited in the 1997 report and bibliographies of retrieved articles. Search terms included those related to the diagnosis of ACI, AMI, and UAP in the ED and to the following technologies: prehospital electrocardiography (ECG), continuous/serial ECG, nonstandard leads ECG, exercise stress ECG, the ACI Time-Insensitive Predictive Instrument (ACI-TIPI), the Goldman chest pain protocol, biomarkers, sestamibi myocardial perfusion imaging, echocardiography, and computer-based decision aids.

Selection Criteria

We reviewed studies that assessed the diagnostic performance of these technologies and their impact on health care delivery. Study populations were adult patients presenting to EDs with signs and symptoms suggestive of ACI.

Data Collection and Analysis

Study test performance or clinical impact data were abstracted. Each study was also assessed for its applicability and methodological quality. Evidence tables were constructed for the technologies. Meta-analyses were conducted to summarize the test performance and the clinical impact of the technologies. A decision and cost-effectiveness analysis was performed to evaluate the tradeoff between costs associated with a technology and its triage accuracy.

Main Results

We screened 6,667 MEDLINE titles, retrieved 407 articles, and included 105 studies in the analysis. Most studies evaluated the diagnostic performance of the technologies; a few evaluated the clinical impact of routine use. In the general ED setting, only ACI-TIPI was shown to be able to identify most of the patients with ACI and to reduce unnecessary hospitalizations. Single measurement of biomarkers at presentation to the ED has poor sensitivity for AMI although most biomarkers have high specificity. Serial measurements can greatly increase the sensitivity for AMI while maintaining excellent specificity. However, biomarkers cannot identify most UAP.

Seventeen technologies and four combinations of technologies were evaluated in the cost-effectiveness analysis. Through the use of diagnostic performance data, the combination of troponin T and echocardiography was shown to have the best triage accuracy for patients with ACI and to be the most cost-effective. Through the use of clinical impact data, ACI-TIPI was shown to have the best triage accuracy for patients with ACI and to be the most cost-effective technology.

Conclusions

  • A diverse array of technologies with varying degrees of diagnostic accuracy is available for use in general or selected patient populations to diagnose ACI in the ED.

  • No single technology is able to identify all ACI patients and at the same time avoid hospitalizing many patients without ACI.

  • Most studies evaluated patients with AMI rather than ACI.

  • Research on the diagnosis of ACI in the ED is characterized by great heterogeneity among studies because of the large number of variables that can be studied. This heterogeneity has resulted in fragmented evidence that is not easily synthesized into a coherent whole.

  • Some technologies (e.g., echocardiography, sestamibi imaging, exercise ECG) remain underevaluated, and little research has been done on the value of sequential testing or on test combinations.

  • The number of studies on the clinical impact of these technologies is inadequate.

  • The quality of reporting by studies in this area needs to be improved.

This document is in the public domain and may be used and reprinted without permission except those copyrighted materials noted for which further reproduction is prohibited without the specific permission of copyright holders.

Suggested Citation

Lau J, Ioannidis J, Balk E, et al. Evaluation of Technologies for Identifying Acute Cardiac Ischemia in Emergency Departments. Evidence Report/Technology Assessment Number 26. (Prepared by The New England Medical Center Evidence-based Practice Center under Contract No. 290-97-0019) AHRQ Publication No. 01-E006, Rockville, MD: Agency for Healthcare Research and Quality. May 2001.

Summary

Introduction

Acute myocardial infarction (AMI) is the leading cause of death in the United States. Investigating the causes, progression, and treatment of AMI continues to be a national research priority. In clinical medicine, much research has focused on the early diagnosis and treatment of acute cardiac ischemia (ACI), which includes both unstable angina pectoris (UAP) and AMI. In 1991, the National Heart, Lung, and Blood Institute (NHLBI) of the National Institutes of Health initiated the National Heart Attack Alert Program (NHAAP) to study the issues related to rapid recognition and response to patients with signs and symptoms of ACI in emergency department (ED) settings, the point at which most of these patients enter the health care system. This ongoing effort brings together scientists, clinicians, and NHLBI staff with a Coordinating Committee that includes representatives of 40 professional organizations.

In 1994, the NHAAP Working Group on Evaluation of Technologies for Identifying Acute Cardiac Ischemia in the Emergency Department was formed to assess the technologies for diagnosing ACI and AMI in the ED. Members of the Working Group had expertise in the areas of cardiology, emergency medicine, general internal medicine, family practice, and nursing, as well as in the specific disciplines of meta-analysis and health services research. The Working Group reviewed all technologies for diagnosing ACI in the ED. The assessments of these technologies in actual use in EDs, and the nature, extent, and quality of the evidence on which the assessments were based, are presented in the Working Groups final 1997 report, An Evaluation of Technologies for Identifying Acute Cardiac Ischemia in the Emergency Department.

Reporting the Evidence

In 1998, the Agency for Healthcare Research and Quality (AHRQ, formerly the Agency for Health Care Policy and Research [AHCPR]), working as a partner for the NHLBI's NHAAP, contracted with the New England Medical Center's Evidence-based Practice Center (EPC) to update the 1997 NHAAP report. The EPC was charged with evaluating the evidence on these diagnostic technologies published since October 1994. As before, the purpose of the review was to assess the accuracy of technologies for diagnosing ACI in the ED and their clinical impact when used in this setting. However, the original 1997 report did not provide quantitative estimates of the test performance or clinical impact of the diagnostic technologies. To address this, we conducted meta-analyses where possible in which we reexamined all the studies reviewed in the original report, abstracted the necessary data, and combined these data with more recently published studies. We also conducted decision and cost-effectiveness (CE) analyses to investigate the interactions between technologies' diagnostic performances and costs, populations, and outcomes, and to provide an evidence-based framework on which to base recommendations. NHAAP Working Group members helped frame some of the study issues, but they were not involved in the evaluation of evidence or in the writing of this report.

Methods

We conducted a systematic and comprehensive search of the English-language literature published between 1966 and December 1998. Literature was retrieved from a computer MEDLINE search, references cited in the 1997 Working Group report, review of references of retrieved articles, and assistance from domain experts. Search terms included those related to the diagnosis of ACI, AMI, and UAP in the ED, and to the following technologies: prehospital electrocardiography (ECG), continuous/serial ECG, nonstandard leads ECG, exercise stress ECG, the ACI Time-Insensitive Predictive Instrument (ACI-TIPI), the Goldman chest pain protocol, biochemical tests and biomarkers (e.g., creatine kinase [CK] or its subunit [CK-MB], troponin T, etc.), sestamibi myocardial perfusion imaging, echocardiography, and computer-based decision aids.

Inclusion Criteria

We followed the general approach for selecting studies taken by the Working Group in its report. We considered reports if they came from work done in the ED setting; results from other settings (e.g., the cardiac care unit) were used only if little or no ED-based data were available. Data from non-ED settings were used with the understanding that they suggest potential utility but do not directly apply to the emergency setting.

We accepted prospective and retrospective studies that evaluated one or more of the technologies considered in this evidence report and included patients 18 years and older who presented to the ED with symptoms suggestive of ACI. We placed no restrictions on patients' gender or ethnicity. In general, ED testing consists of either a single test performed within the initial 4-hour period after presentation to the ED or repeated testing up to14 hours after the patient's presentation to the ED. We accepted studies with minor deviations from this standard.

Data were abstracted according to a written protocol and were summarized in evidence tables.

Grading of the Evidence

The evidence-grading scheme we used assesses four dimensions that are important for the proper interpretation of the evidence:

  • Size of the study (weight of the evidence).

  • Applicability (population category and prevalence of disease).

  • Diagnostic performance or magnitude of clinical impact.

  • Methodologic quality (internal validity).

Applicability

We grouped the populations and settings of the studies using a four-category scale to help interpret the results. We also collected data about the prevalence of ACI or AMI to assist the interpretation. The four defined population categories are:

  • Category I -- Studies that included all patients with signs and symptoms suggestive of ACI, such as chest pain, shortness of breath, jaw pain, acute pulmonary edema, and so forth. This is the most inclusive category.

  • Category II -- Studies that used chest pain as the inclusion criteria. Most studies belong to this group. Category II is a subset of category I.

  • Category III -- Studies that included patients with chest pain but excluded those with clinical or ECG findings of AMI. Many studies, especially studies of stress cardiac imaging or testing, belong to this group. Category III is a subset of category II.

  • Category IV -- Studies in which all patients were hospitalized or which used additional criteria that enrolled highly selected subpopulations. We also placed retrospective studies in this category.

Test Performance Studies

When there were sufficient data for a technology, we used three complementary methods of synthesizing data across several studies to report on its test performance: summary receiver operating characteristics (SROC) analysis, separately combined sensitivity and specificity values using a random effects model, and the summary diagnostic odds ratios using a random effects model.

We defined a three-level methodologic quality scale for test performance studies graded as follows:

  • A (least bias) -- such as a study that adheres to the traditionally held concepts of high-quality diagnostic evaluation, including clear descriptions of the population and setting; clear descriptions of the reference standard, the test under investigation, and the diagnostic criteria; masked interpretation of the reference and the test under investigation; verification of the diagnoses in all or most of the patients with negative results; and no significant reporting errors that are likely to result in substantial bias.

  • B (susceptible to some bias) -- a study that does not meet all the criteria in category A but whose deficiencies are unlikely to cause major bias.

  • C (likely to have significant bias) -- a study with significant design or reporting flaws that cannot preclude major bias. This category includes studies in which verification bias could be a major issue and studies that have significant amounts of missing information or discrepancies in their reporting.

Clinical Impact Studies

In the few instances where there are sufficient data reported by clinical impact studies, dichotomous outcomes expressed as risk ratio or continuous outcomes were combined using a random effects model.

We defined a three-level methodologic quality scale for clinical impact studies graded as follows:

  • A (least biased) -- such as prospective controlled trials.

  • B (susceptible to some bias) -- such as prospective cohort studies.

  • C (likely to have significant bias) -- other designs or studies with significant conduct or reporting problems that could lead to large bias.

Findings

General Observations

The MEDLINE literature search identified 6,667 titles, a third of which were published from 1994 onward, indicating increased research activities on this topic over the past 5 years compared with the previous 27 years. From these abstracts, 407 full articles were retrieved for review, 105 of which are included in the analysis.

A diverse array of technologies with varying degrees of diagnostic accuracy is available for use in general or selected populations to diagnose ACI in the ED. About half the studies analyzed were in population category II and about 30 percent in category III. Prevalence of AMI across studies, even within population categories and in similar settings, varied widely with little indication that similarly reported inclusion criteria among studies resulted in similar levels of AMI prevalence.

Despite this, there is some indication that overall, studies that included all patients with chest pain (population category II) have higher prevalence of AMI than either studies that included all patients with symptoms suggestive of ACI (population category I) or studies that excluded patients with diagnostic ECGs (population category III). In addition, though differences in AMI prevalence among different settings are not statistically significant, there is evidence that studies that analyzed only admitted ED patients have higher prevalence of AMI than those that included all ED patients. Thus, these two populations may truly be different.

Specific Findings

Most studies evaluated the accuracy of the technologies; only a few evaluated the clinical impact of routine use. To summarize:

  • Prehospital 12-lead ECG has moderate sensitivity (76 percent) and specificity (88 percent) for diagnosis of ACI. It has demonstrated a reduction of the mean time to thrombolysis by 33 minutes and short-term overall mortality in randomized trials.

  • In the general ED setting, only ACI-TIPI has demonstrated, in a large multicenter clinical trial, a reduction in unnecessary hospitalizations without decreasing the rate of appropriate admission for patients with ACI.

  • The Goldman chest pain protocol has good sensitivity (about 90 percent) for AMI but has not been shown to result in any differences in hospitalization rate, length of stay, or estimated costs in the single clinical impact study performed. Its applicability to patients with UAP has not been evaluated.

  • Single measurement of biomarkers at presentation to the ED has poor sensitivity for AMI, although most biomarkers have high specificity (over 90 percent). Serial measurements can greatly increase the sensitivity for AMI while maintaining their excellent specificity. Biomarkers cannot identify most patients with UAP.

  • Diagnostic technologies to evaluate ACI in selected populations, such as echocardiography, sestamibi perfusion imaging, and stress ECG, may have very good to excellent sensitivity; however, they have not been sufficiently studied.

Results of Decision and Cost-Effectiveness Analysis

Decision and cost-effectiveness analyses were performed for 17 technologies and 4 combinations of technologies that have been evaluated in the literature and this report. The cost analysis is from the payers' perspective (e.g., health insurance companies); patient outcomes are either appropriate triage or 30-day survival of patients with ACI.

As not all technologies can be applied to all patients in the ED (such as stress ECG), two different ED populations were used for the analysis: a general population model, which includes all patients in the ED, and a subgroup model, in which high-risk patients are excluded. Stress tests, sestamibi imaging, and serial and continuous ECG were evaluated only in the subgroup population.

As expected, technologies with the best diagnostic accuracy for AMI and UAP have the highest values for appropriate triage for patients with ACI. Technologies that are more effective (greater number of patients with ACI appropriately triaged) tend to have higher total costs, with the exception of ACI-TIPI. The biomarkers are least costly and have the lowest values for appropriate triage. Algorithms, combination technologies, and echocardiography are the next most effective technologies, in that order. Sestamibi imaging and exercise ECG are more expensive than other technologies but have excellent diagnostic performance for ACI.

Based on data using only the diagnostic performance data of technologies, the combination technology of troponin T and echocardiography has the best CE among all technologies applicable to the general population model. If results from clinical impact studies are incorporated, ACI-TIPI has the best CE because of its very high triage accuracy and low cost.

The incremental CE of troponin T and echocardiography is about $7,670 per additional appropriate triage for a patient with ACI compared with serial or combination biomarkers. The incremental CE of the next most effective technology, the artificial neural network, is approximately $10,560. Given the economic ramifications and the effects on the patient of a missed ACI diagnosis, this incremental CE for troponin T and echocardiography is minimal.

Because the estimates for detection of UAP are based on sparse data, we also evaluated the triage accuracy and cost-effectiveness of technologies for appropriate triage for patients with AMI only. The relative CE rankings do not change compared with the rankings for patients with ACI. There are few but important differences, however, in triage accuracy: (1) the Goldman protocol improves significantly, (2) serial CK-MB improves slightly, and (3) the combination of troponin T and echocardiography is slightly better than ACI-TIPI (a difference of one patient with AMI appropriately triaged).

The combination of troponin T and echocardiography is the most cost effective, followed by the artificial neural network. The incremental CE between these two technologies is much larger than in the general ACI model: approximately $137,000 per additional appropriately triaged patient with AMI.

In the low-risk patient subgroup model, ACI-TIPI is again the most cost-effective technology if data from clinical impact studies are incorporated. Sestamibi stress imaging has the best diagnostic performance (detects 82 percent of patients with ACI), followed by sestamibi rest scanning and exercise ECG. The costs of exercise ECG and stress sestamibi are nearly the same. The incremental CE between the two technologies is a mere $364 per appropriately triaged patient, reflecting the higher effectiveness of stress sestamibi for its cost relative to exercise ECG.

The incremental CE between stress sestamibi imaging and the next cost-effective technology, the combination of troponin T and echocardiography, is much greater: $12,757. However, given that stress sestamibi imaging results in the appropriate triage of 37 additional patients with ACI (per 1,000 ED patients) compared with troponin T and echocardiography, it appears to be a very cost-effective technology.

If data from the ACI-TIPI trial are used, the incremental CE of using ACI-TIPI compared with troponin T and echocardiography is only $1,502 per additional appropriate triage for a patient with ACI, a truly negligible increase for improved triage accuracy.

Considering only triage accuracy for patients with AMI, the combination of troponin T and echocardiography is the most cost effective. Exercise ECG and stress sestamibi imaging also have excellent triage accuracy; however, the per ED patient costs of these two technologies is about $500 more than that of troponin T and echocardiography.

Future Research

  • Most studies evaluated the performance of a technology in diagnosing AMI; future studies should also evaluate a technology's performance in diagnosing UAP.

  • Some technologies (e.g., echocardiography, sestamibi imaging, exercise ECG, serial biomarkers, and new biomarkers such as P-selectin and fatty acid binding proteins) remain underevaluated.

  • To date, most studies have evaluated the application of a single technology on patients. Research is needed to determine whether combinations of tests, such as a panel of biomarkers, or of multiple modalities, such as ECG with serial CK-MB measurements, perform better than the component tests alone.

  • Because good test performance, in isolation, does not automatically translate to appropriate utilization or desired outcomes, clinical impact studies are needed to evaluate the clinical outcomes of the actual use of the test.

  • The prevalence of ACI among the studies varies widely and may be explained only partially by differences in patient populations. The wide variation of prevalence has an unknown effect on test performance and interpretation of the results and may indicate incomplete reporting of study biases. We need to understand the reason for the heterogeneity of the prevalence among studies with seemingly similar patient populations.

  • The methodological quality and the reporting of the diagnostic performance studies on this topic varies widely and could be improved substantially.

Chapter 1. Introduction

This evidence report on the evaluation of technologies for diagnosing acute cardiac ischemia (ACI) updates an earlier report on the same topic by the National Heart Attack Alert Program (NHAAP). The first section describes why and how the original report was developed. The following sections describe the charge to update the original report and the expectations for the updated report, identify the issues encountered when the literature on these technologies is reviewed, and provide the background necessary to place the updated report in context. The last section describes the features and contents of the updated report itself.

The 1997 NHAAP Report

Background

Acute myocardial infarction (AMI) is the leading cause of death in the United States. Research into the causes, progression, and treatment of AMI continues to be a national research priority; this research continues to produce substantial progress in the areas of prevention, diagnosis, and treatment of AMI, as well as advances in understanding its molecular and cellular aspects. In clinical medicine, much research has been focused on the early diagnosis and treatment of ACI, which includes both unstable angina pectoris (UAP), which can lead to AMI, as well as AMI itself. Research has shown that early diagnosis and treatment of UAP is beneficial and may prevent AMI.

Thus, as mandated by Congress, in 1991 the National Heart, Lung, and Blood Institute (NHLBI) of the National Institutes of Health (NIH) initiated NHAAP to look for ways to reduce the morbidity and mortality from AMI in this country. Specifically, the Program focused on issues related to the rapid recognition and response to patients with symptoms and signs of ACI in emergency settings, the point at which most of these patients enter the health care system. This ongoing effort brings together scientists, clinicians, and NHLBI staff with a Coordinating Committee that includes representatives of 40 professional organizations.

With a plan to increase public awareness of the need for rapid evaluation and treatment of symptoms that might represent ACI, the NHAAP also studied the needs of emergency department (ED) physicians to be able to handle the potentially larger number of patients with chest pain and related symptoms and, in particular, their need to accurately diagnose ACI from among the large number of such ED patients. Accordingly, in 1994 the NHAAP Working Group on Evaluation of Technologies for Identifying Acute Cardiac Ischemia in the Emergency Department was formed to assess the technologies for diagnosing ACI and AMI in the ED. Members of the Working Group had expertise in the areas of cardiology, emergency medicine, general internal medicine, family practice, and nursing, as well as in the specific disciplines of meta-analysis and health services research. The Working Group reviewed all technologies for diagnosing ACI in the ED. The assessments of these technologies in actual use in EDs, and the nature, extent, and quality of the evidence on which the assessments were based, are presented in the Working Group's 1997 final report, An Evaluation of Technologies for Identifying Acute Cardiac Ischemia in the Emergency Department (Selker, Zalenski, Antman, et al., 1997).

Working Group Process and Methods

The Working Group conducted a formal review and evaluation of the literature on the technologies for diagnosing ACI in the ED. MEDLINE and related electronic database searches were supplemented with the members' knowledge of the literature and ongoing research. All relevant studies published in English on each technology were reviewed, summarized, analyzed, and reported independently by three members in a process analogous to an NIH Study Section. The quality of evidence for each technology was rated as A (high), B (moderate), C (limited), or NK (not known). Similarly, diagnostic accuracy and clinical impact were each rated as +++, ++, +, NK (not known), or not effective (NE).

These ratings were presented by reviewer members of the Working Group in both oral and written format at an initial meeting, revised and updated by the reviewers, and then compiled into a single document by the Working Group co-chairs. This document was reviewed in a second Working Group meeting, at which consensus was reached on the detailed conclusions and recommendations for each technology. The text, recommendations, and conclusions were then updated, reviewed, and approved once more before the final report was completed. The final report also underwent external review by a broad range of experts who were not members of the Working Group.

Summary of Technologies Assessed and Key Findings

Recommendations on the use of a technology were based on the strength of the evidence of its diagnostic accuracy in the ED and on its clinical impact when implemented in decisionmaking. The conclusions of the Working Group are presented below in the same order as reported in the executive summary of the report.

Although the standard electocardiograph (ECG) is a safe, readily available, and inexpensive technology with a relatively high sensitivity for AMI, it is not highly sensitive or specific for ACI. However, the ECG remains an integral part of the evaluation of patients with chest pain, and the Working Group recommended that it remain the standard of care for evaluating patients with chest pain in the ED.

The original ACI predictive instrument, developed and tested in the early 1980s, was found to have excellent diagnostic performance and substantial clinical impact in a high-quality prospective, multicenter trial for both AMI and UAP. The requirement for a programmed calculator was a drawback, but this drawback was expected to be remedied in the newer ACI Time-Insensitive Predictive Instrument (ACI-TIPI). The instrument was recommended for general use.

The ACI-TIPI, a newer version of the original ACI instrument, performed as well as the original instrument in both diagnostic performance and clinical impact. The algorithm has been incorporated into standard ECG machines and its predictions are printed on the ECG itself, making it much more useful than its predecessor. More definitive recommendations were withheld, pending the publication of a large multicenter trial of this new version's effectiveness.

Theprehospital 12-lead ECG was found to havegood diagnostic performance but limited clinical impact. Although it has promise, the Working Group believed that its best use would be in areas with long emergency medical service (EMS) transport times and perhaps in conjunction with prehospital thrombolytic therapy. Its routine use was not recommended.

The Goldman chest pain protocol was found to have excellent diagnostic accuracy for AMI, the purpose for which it was designed, but it does not test for UAP. Its greatest potential benefit was believed to be that of improving the specificity for AMI, which could reduce unnecessary cardiac care unit (CCU) admissions. However, the only trial to study this benefit showed that it had no impact on care or on resource use. Thus, its routine use was not recommended.

The use of a single creatine kinase (CK) test (or its subunit, CK-MB) was not recommended for use in ED triage, but multiple tests over several hours accurately diagnosed AMI. The test does not detect UAP, and its clinical impact has not been studied.

The overall performance of sestamibi imaging studies was encouraging, but the data available for evaluation at the time of the report were insufficient for making recommendations.

The ECG exercise stress test, an extension of the standard ECG, has only modest diagnostic accuracy for coronary disease, and few studies have examined its clinical impact in the ED. Its routine use was not recommended.

Echocardiography performed only modestly well in diagnosing ACI, and its clinical impact in the ED had not been studied. For these reasons, it was not recommended for routine ED use.

Other computer-based decision aids represent a variety of methods to predict AMI. Information about their generalizability and transportability is limited. Their clinical impact was also not known, and they were not recommended for routine use.

The data for troponin T and troponin I biomarkers were encouraging, but these biomarkers had not been adequately studied.

The importance of myoglobin as a marker of AMI was not yet clear, and its use was not recommended.

The use of nonstandard ECG leads for detecting ACI had undergone only limited testing, and their clinical impact had not been studied.

Thallium scanning, body surface mapping, and continuous 12-lead ECG had not been evaluated for ED use and so could not be recommended.

Limitations of the 1997 Report

The 1997 report had three important limitations. First, it contained no details about the individual studies reviewed. Second, it contained no quantitative estimates of diagnostic performance or clinical impact. Finally, although literature based, the report's recommendations were based on consensus, without the aid of a quantitative framework, such as decision or cost-effectiveness analyses, to help integrate the findings. All three of these limitations have been addressed in the updated report. In addition, the 1997 report evaluated studies published as of September 1994. Many studies have been published on this topic since, and they have been reviewed in the updated report.

The Charge to Update the 1997 Report

In 1997, the Agency for Healthcare Research and Quality (AHRQ, formerly the Agency for Health Care Policy and Research [AHCPR]) designated 12 institutions in the United States and Canada to serve as Evidence-based Practice Centers (EPCs). The EPCs prepare evidence reports and technology assessments on topics that are selected by the AHRQ and that focus on specific aspects of prevention, diagnosis, treatment, or management of a particular condition or on an individual procedure, treatment, or technology. The evidence reports and technology assessments themselves are based on rigorous, comprehensive, and systematic reviews of the scientific literature and on explicit, detailed, and documented methods, rationales, and assumptions. They often include meta-analyses and cost and decision analyses.

All EPCs collaborate with other medical and research organizations in developing these reports so that input is obtained from a broad range of experts. Professional associations, health plans, providers, and others that nominate topics may act as partners with EPCs, providing technical expertise and serving as peer reviewers of the final product. The partners -- not the EPCs -- are expected to translate the findings from the evidence reports and technology assessments into clinical practice guidelines or other implementation tools to improve the quality of care in their respective organizations. Thus, these evidence reports and technology assessments provide an evidence-based foundation on which public and private organizations may develop their own clinical practice guidelines, performance measures, review criteria, or other clinical quality improvement tools. In addition, they may give health plans and payers the information needed to make informed decisions about coverage policies for new and changing medical devices and procedures.

In 1998, AHRQ, working as a partner for the NHLBI NHAAP, contracted with the New England Medical Center's EPC to update the 1997 NHAAP report. The EPC was charged with evaluating the evidence on these diagnostic technologies that had been published since October 1994. As before, the purpose of the review was to assess the accuracy of technologies for diagnosing ACI in the emergency department and their clinical impact when used in this setting.

The AHRQ requested the following modifications of the original report in this update:

  • Creatine kinase was to be combined with other biochemical tests to form a category called "biochemical tests and markers." Newly developed markers not in the original report but relevant to testing, such as fatty acid binding proteins and P-selectin, were also to be reviewed. Combinations of these markers, when available, were to be examined as well. Differences in point-of-care versus central laboratory testing were also to be addressed because these differences may be relevant to the cost analysis. Evidence regarding prehospital and home testing was to be reviewed because these tests can affect early treatment.

  • The original ACI predictive instrument was to be incorporated as part of the discussion of the ACI-TIPI.

  • Body surface mapping was to be deleted, but the use of nonstandard electrocardiographic leads retained.

  • Because thallium is currently not employed in acute perfusion imaging protocols, it was to be incorporated as background and included (with sestamibi and other relevant agents) under a category of "myocardial perfusion imaging."

  • Resting, exercise, and stress assessments for both echocardiography and myocardial perfusion imaging were to be reviewed because these assessments are now used very early in the workup for ACI and are often performed before patients are discharged from the ED.

Issues in Diagnosing Acute Cardiac Ischemia in Emergency Departments

Studies of the diagnosis of ACI in EDs are characterized by great heterogeneity in design, study population, study setting (ED only or combined ED and CCU), and technologies used. This heterogeneity reflects the complexity of the task and the large number of possible and plausible combinations of factors that can be studied.

Issues in the Definition of Acute Cardiac Ischemia

Acute cardiac ischemia is a continuum of clinical states that range from reduced myocardial perfusion at rest to infarction of myocardial tissues. The charge to study the diagnosis of ACI reflects the fact that identifying only patients with AMI misses a large number of ED patients with UAP, who are also at substantial and immediate risk of cardiac events.

About 15 percent of patients with unrecognized UAP admitted to the ED experience AMI within 2 months of admission (Roberts, Califf, Harrell, et al., 1983; Gottlieb, 1987; Gottlieb, Weisfeldt, Ouyang, et al., 1987; Mulcahy, Conroy, Katz, et al., 1990). The overall mortality of untreated AMI is between 12 and 15 percent. Early recognition and treatment of AMI can reduce the amount of myocardial tissue damage, improve cardiac function and survival rates, and reduce long-term complications such as congestive heart failure; and the early recognition and treatment of UAP may prevent the morbidity and mortality associated with untreated cardiac ischemia. Indeed, the mortality rate is nearly doubled among patients with AMI or UAP inadvertently sent home (Pope, Aufderheide, Ruthazer, et al., 2000).

In addition, patients with unrecognized ACI who are inappropriately discharged from the ED often require additional and costly ED visits and diagnostic evaluations. Such discharges may also result in negative consequences for physicians (such as malpractice suits or disciplinary action). Thus, the timely recognition of both AMI and UAP can reduce the morbidity and mortality associated with ACI as well as reduce the health care costs and negative consequences related to missed diagnoses.

Issues in the Populations Studied

Clinical studies of diagnostic tests for ACI vary widely in the patient populations they study. For example, some studies included all patients with chest pain and others included only patients with chest pain in whom clear evidence of AMI was lacking. Still others included all patients presenting to the ED with various symptoms suggestive of ACI. Different inclusion criteria may result in heterogeneous study populations in whom prevalence rates of AMI and UAP differ, which could lead to different study results.

Another difficult issue is the distinction between ED and CCU settings. Studies that evaluated only patients admitted to the CCU will have selected a population at much higher risk for true ACI by having already excluded low-risk patients from the evaluation process, even though all such patients were initially seen in the ED. Differences in study populations are also apparent in the mean times between symptom onset and ED presentation.

Issues in the Use of Diagnostic Technologies

Table 1. Technologies for identifying acute cardiac ischemia in the emergency department and their diagnostic use
TechnologyAcute myocardial infarctionUnstable angina pectorisIntended use
Prehospital 12-lead ECG graphic element graphic elementEarly detection
Electrocardiography
Standard 12-lead graphic element graphic elementGeneral
5Serial/continuous ECG graphic element graphic elementSelected subgroup
Nonstandard leads graphic element graphic elementSelected subgroup
Exercise/stress ECGX graphic elementSelected subgroup
Biomarkers (single or serial)
CK graphic elementXSelected subgroup
CK-MB single graphic elementXSelected subgroup
Myoglobin graphic elementXSelected subgroup
Troponin I graphic elementXSelected subgroup
Troponin T graphic elementXSelected subgroup
P-selectin graphic elementXSelected subgroup
Fatty-acid binding protein graphic elementX
Imaging
Echocardiography graphic element graphic elementSelected subgroup
Rest Sestamibi graphic element graphic elementSelected subgroup
ACI-TIPI graphic element graphic elementGeneral
Goldman chest pain protocol graphic elementXGeneral
Algorithms, computerized decision aids graphic element±General

Intended use of the technology for this indication.

X Not intended use of the technology for this indication.

±Intended use depends on the specific technology.

Several technologies have been developed to detect some or all of the possible manifestations and types of ACI. However, some tests currently used in the ED, such as serum CK levels, are intended to diagnose only AMI and not UAP (Table 1). Other technologies, such as echocardiography, may detect either AMI or UAP, whereas still others, such as the Goldman chest pain protocol, may not be sensitive to the diversity of manifestations associated with ACI. In addition, the patient populations for which, and circumstances under which, ED physicians order individual tests vary widely.

It is important to have diagnostic technologies with high sensitivity and specificity for ACI to minimize the chances of missing patients with ACI and also to avoid unnecessary hospitalizations for those not having ACI. Good test performance in isolation, however, does not automatically translate to improved patient care when the technology is used. A test result provides only one piece of information in the complex decisionmaking process of clinicians in the ED. Therefore, in addition to high sensitivity and specificity, a diagnostic technology must also demonstrate desired clinical impact during routine use in the ED. The earlier Working Group report found very few studies that reported the clinical impact of the technologies. Most studies evaluated only test performance and did not assess impact on ED care. Studies that reported clinical impact also differed substantially in how they measured impact. Outcome measurements included 30-day survival rates, discharge rates, re-admission rates, procedure rates, ejection fractions, and so on. Other outcomes included time from symptom onset to treatment administration.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F001.jpg.

   Figure 1. Effect of threshold on diagnostic test performance

The performance of a diagnostic test is determined by both the innate accuracy of the test, as indicated by the degree of overlap in the test results of patients with and without ACI, and by the position of the threshold within the overlapping range, which defines the tradeoff between sensitivity (the proportion of true positive results) and specificity (the proportion of true positive results).

The sensitivity and specificity of a test depend not only on the intrinsic nature of the test, but also on how the users of the test define these characteristics (Figure 1). In most diagnostic tests, the test results of sick and healthy populations overlap to some extent. The less the overlap, the more accurate the test. The interpretation of these overlapping scores is governed by a threshold value placed in the overlapping range by the user. Scores on one side of the threshold are interpreted as positive; those on the other, as negative.

There is a tradeoff, however, between sensitivity (the proportion of true positive results) and specificity (the proportion of true negative results). If the threshold is set to reduce the rate of false-positive results (overdiagnosis), the rate of false-negative results (underdiagnosis) will rise (see Figure 1). The tradeoff between sensitivity and specificity is illustrated in receiver operating characteristics (ROC) curves, but such curves are often not included in studies of diagnostic tests.

The timing of tests also differs greatly among studies. The technologies studied ranged from a single test at presentation to the ED to repeated tests administered up to 14 hours after the patient's initial presentation or that depended on symptom duration (not time from presentation). For example, a single serum CK or CK-MB measurement at presentation has been used diagnostically, but additional serial measurements taken over several hours have additional incremental diagnostic value. Thus, when studies of biomarkers are compared, the timing of the test has to be taken into account. If the data are not presented separately, tests used in combinations may also be difficult to combine with the same tests used alone.

The reference test or criteria to which the index test is compared can also differ among studies. Most studies for the diagnosis of AMI use the World Health Organization (WHO) criteria to define AMI (Gillum, Fortman, Prineas, et al., 1984). However, the definition of the "cardiac enzymes" differed among the studies. Some studies further complicate the matter by using a procedural outcome, such as angioplasty, from which the diagnosis must be inferred. The diagnostic criteria for UAP are often less strict. Although some studies used Braunwald's classification for UAP, other studies defined non-AMI ACI as "evidence of atherosclerotic heart disease," or "non-AMI ischemia." Without a standard definition, we accepted the author's diagnosis of UAP, without necessarily knowing which signs and symptoms were used to make the diagnosis.

As described above, given the clinical and economic consequences, tests for diagnosing ACI need to have both high sensitivity and high specificity. A highly sensitive test is required if the adverse outcomes of missing (and therefore not treating) a diagnosis of ACI are to be avoided. In the worst case, patients with undiagnosed ACI can be discharged home without treatment and then die from a cardiac event. Thus, an underdiagnosis rate of, say, only 2 percent may not be acceptable. At the same time, a highly specific test is required to avoid unnecessary admission of patients to cardiac care units, a practice that increases costs and resource use. About half of the people admitted to coronary care units with the symptoms of ACI do not have coronary artery disease. The economic implication of this overdiagnosis and unnecessary hospitalization is in the billions of dollars each year (Fineberg, Scadden, Goldman, et al., 1984; Pozen, D'Agostino, Selker, et al., 1984).

Inadequate followup of patients discharged from the ED could lead to verification bias. Verification bias may occur when the proportion of patients with confirmed negative test results is small. Thus, studies that do not adequately follow up all the patients discharged from the ED may miss a large number of diagnoses, which can falsely increase test sensitivity and decrease test specificity.

Issues in Implementing the Technologies

A major issue in determining clinical impact and in implementing these technologies is the perspective from which they are studied. Emergency department physicians may be more concerned with avoiding the medical and legal consequences of underdiagnosing (and therefore not treating) ACI, whereas third-party payers may be more interested in reducing overdiagnosis to avoid unnecessary CCU admissions throughout their health care network. Patients are most concerned with the accurate and timely diagnosis of ACI.

The Updated Evidence Report

As in the original report, we evaluated both the diagnostic performance and the clinical impact of these technologies.

New Features in the Updated Report

For this updated report, we re-reviewed and systematically abstracted data from all the studies included in the 1997 Working Group report, as well as from all studies published from October 1994 through December 1998. The abstracted data allowed us to summarize the evidence quantitatively, which was not done in the original report.

Also new in this updated report is the inclusion of meta-analyses. When the data were sufficient and appropriate, we conducted meta-analyses to quantitatively assess test performance with summary receiver operating characteristics curves (SROC). Likewise, when possible, we conducted meta-analyses of trials studying clinical impact.

The updated report also includes decision and cost-effectiveness analyses. Because the diagnosis of ACI in the ED is complex and dependent on many different factors, such analyses are fraught with difficulties and limitations. The number of possible clinical scenarios that can be analyzed is great. Long-term clinical outcomes are difficult to model because the management of patients with ACI can vary. The long-term disposition of patients with noncardiac chest pain is also difficult to model. Thus, no single model, or set of models, can adequately reflect the range of circumstances that are encountered in the ED. As a result, these analyses were conducted not for the purpose of making specific clinical recommendations but for understanding the interactions among the variables studied.

Technologies Assessed in the Updated Report

As directed by the AHRQ, the technologies reviewed in the updated report are:

  1. Prehospital 12-lead electrocardiography

  2. Emergency department electrocardiography
    Continuous 12-lead ECGs
    Nonstandard ECG leads
    Rest, exercise, or stress ECG assessments

  3. The Acute Cardiac Ischemia Time-Insensitive Predictive Instrument

  4. The Goldman chest pain protocol

  5. Biochemical markers
    Creatine kinase (CK), single and serial measurements
    Creatine kinase subunit (CK-MB), single and serial measurements
    Troponin T
    Troponin I
    P-selectin
    Fatty acid binding protein
    Myoglobin

  6. Myocardial perfusion imaging technologies
    Echocardiography (including rest, exercise, and stress assessments)
    Sestamibi imaging (including rest, exercise, and stress assessments)

  7. Other computer-based decision aids

Summary

The current report updates the 1997 Working Group's assessment of the performance and impact on care of these diagnostic technologies. This updated report:

  • Systematically reviews the evidence for these technologies.

  • Includes a quality assessment of the studies that comprise this evidence.

  • Summarizes the diagnostic performance and clinical impact of these technologies using meta-analysis, where possible.

  • Presents decision and cost-effectiveness analyses to provide insights into the characteristics of these technologies.

Chapter 2. Methods

This evidence report on the evaluation of technologies for identifying ACI in the ED is based on a systematic review of the literature. Meetings and teleconferences of the EPC staff with technical experts representing the NHAAP Working Group were held to identify specific issues central to this report. A comprehensive search of the medical literature was conducted to identify studies addressing these technologies. Following the format of the original NHAAP report, we examined studies that assessed the diagnostic performance and the clinical impact of the technologies requested in this update.

For this evidence report, we compiled evidence tables of study features and results, appraised the methods of the studies, and summarized their results. When there was a sufficient number of studies with adequately reported data, we conducted meta-analyses to assess overall test performance and to estimate the clinical impact of the application of the technology. A decision and cost-effectiveness analysis was performed to gain insights into the tradeoffs between the clinical impact and costs of the technologies. Because of the complexity of the problem, the cost-effectiveness analysis should be viewed as a tool for decisionmaking and not as the definitive recommendation for diagnosing and managing ACI in the ED.

The NHAAP Working Group, which wrote the original report, served as the science partner of this updated report. The Working Group provided technical experts to work with the EPC staff to refine key questions and to identify important issues, helped find relevant studies, and provided critical input into the decision and cost-effectiveness analysis. Members of the NHAAP Working Group are listed in Appendix B.

Aim of the Evidence Report

The aim of this evidence report is to update the 1997 NHAAP Working Group report by examining the literature published since October 1994, to rigorously assess these technologies, to conduct meta-analyses when feasible, and to explore the application of these technologies with decision and cost-effectiveness analysis. However, the original 1997 report did not provide quantitative estimates of the test performance or clinical impact of the diagnostic technologies. To conduct meta-analyses, we re-examined all the studies reviewed in the original report, abstracted the necessary data, and combined these data with more recently published studies.

Literature Search

Studies for the literature review were identified primarily through a MEDLINE search of English language literature conducted between December 1998 and January 1999. In addition, we identified and retrieved all the studies under each of the technologies referenced in the 1997 report. We also consulted technical experts and examined references of published meta-analyses and selected review articles to identify additional studies. Several technical experts forwarded articles to us published in 1999, after our MEDLINE search was completed. Articles that met the inclusion criteria were incorporated in our evidence report.

Search Terms and Strategies

Table 2. MEDLINE search strategies and terms (used in conjunction with OVID)
Prehospital 12-lead ECG
Setting Emergency medical service communication systems/or Emergency medical services/or Transportation of patients/ Exp mobile health units/or "prehospital".mp. Exp Telecommunications/ Exp transportation of patients/ Exp emergency medical technicians/
Technology Exp electrocardiography/ Exp* electrocardiography/ (ecg or ekg or electrocardiog$).af
Disease exp myocardial ischemia/di [Diagnosis] exp chest pain/di [Diagnosis] The following groups of lines are common search terms for the following technologies: serial/continuous 12-lead ECG, all lead ECG, ECG stress test, echocardiography, CK, other biomarkers (myoglobin/troponin), sestamibi, computer aids, and Goldman chest pain protocol.
  1. exp myocardial ischemia/di [Diagnosis]
    exp chest pain/di [Diagnosis]
    (chest pain or myocardial infarction or myocardial ischem$).tw

  2. Exp Emergency service, hospital/
    Emergenc$.tw.

Serial 12-lead ECG uses the following terms plus the common terms grouped under 1 and 2 above: (continuous 12-lead or continuous twelve lead or serial 12-lead or serial twelve lead).tw (continuous ecg or continuous ekg or continuous electrocardiography or serial ecg or serial ekg or serial electrocardiography or serial twelve lead).tw
ECG stress test uses the following terms plus the common terms grouped under 1 and 2 above: Exp Exercise test/ (treadmill or exercise test$).tw. exp "sensitivity and specificity"/
Echocardiography uses the following terms plus the common terms grouped under 1 and 2 above: Exp Echocardiography/ Exp* Echocardiography/
CK uses the following terms plus the common terms grouped under 1 and 2 above: *Creatine Kinase/bl, du [Blood, Diagnostic Use] *Creatine Kinase isoenzymes/bl, du [Blood, Diagnostic Use] Exp Myocardial ischemia/en [Enzymology]
Other Biomarkers (myoglobin/troponin) uses the following terms plus the common terms grouped under 1 and 2 above: Myocardial ischemia/bl, du [Blood, Diagnostic Use] Chest pain/bl, du [Blood, Diagnostic Use] Exp Troponin/ Exp Myoglobin/ Exp enzyme tests/ Exp biological markers/ Exp immunoenyzme techniques/ Exp immunoassay/
Sestamibi uses the following terms plus the common terms grouped under 1 and 2 above: Exp technetium tc 99m sestamibi/ Exp Chest pain/di, ri [Diagnosis, Radionuclide Imaging] Exp Myocardial ischemia/di, ri [Diagnosis, Radionuclide Imaging]
Computer Aids uses the following terms plus the common terms grouped under 1 and 2 above: Exp Diagnosis, computer-assisted/ Exp "neural networks (computer)"/
Goldman Chest Pain Protocol uses the following terms plus the common terms grouped under 1 and 2 above: Exp Diagnosis, computer-assisted/
ACI-TIPI Articles for the ACI-TIPI were retrieved directly from Dr. Selker's team (See Appendix B).
The literature search was conducted to identify clinical studies published from 1966 through December 1998. The MEDLINE search terms are listed in Table 2. Separate search strategies were developed for each of the diagnostic technologies and were based on three areas: setting, technology, and disease. The text words or MeSH headings for all technologies included "chest pain," "myocardial ischemia" or "infarction," "emergency," and "emergency service." The search was limited to studies on humans and published in English.

MEDLINE search results were printed and screened. Potential studies were identified for retrieval based on setting (if given), study question, population, and disease. Articles involving cost analysis, chest pain centers, minority and gender issues, and cocaine users (no relevant articles were found) were also retrieved. Studies with no clear reference to emergency department settings and populations with special comorbidities (e.g., patients with renal disease) were excluded. After retrieval, each paper was screened to verify that the setting and disease were appropriate and that the study question focused on diagnostic test performance, clinical impact, or both. Some studies compared two or more technologies with each other (e.g., sestamibi versus two-dimensional echocardiography [Kontos, Arrowood, Jesse, et al., 1998]).

A literature search was not performed for standard 12-lead ECG or thallium-201 scanning. The 12-lead ECG was not evaluated because it is a standard of care and is part of the WHO reference standard for diagnosing AMI. Thallium-201 scanning, as noted in the earlier Working Group report, is not feasible in the ED, and a better radioisotope (technetium-99m sestamibi) has superseded this technology.

Study Selection

Table 3. Number of MEDLINE citations found for technologies used to diagnose acute cardiac ischemia in emergency departments
Number of MEDLINE citations
Technology1966 to 12/19981994 to 12/1998
Prehospital 12-lead ECG674172
Continuous/serial 12-lead ECG13437
Nonstandard lead ECG409134
Exercise stress ECG711386
Predictive instruments5025
ACI-TIPI148
Goldman protocol218
Computer aids33562
Creatine kinase757110
Other biomarkers2,019865
Echocardiography1,522492
Sestamibi imaging2120
Total citations screened6,6672,319
A MEDLINE search for the years 1966 through 1998 identified 6,667 titles (Table 3). About one-third of the titles were published from 1994 onward, indicating increased research activities in this topic over the past 5 years compared with the previous 27 years. We screened the titles and abstracts of these citations and retrieved 407 full-length articles for further examination. Reports published only as abstracts in proceedings were rejected from further consideration. Several abstracts were used in the earlier Working Group report. Subsequently published, full articles based on these abstracts are used in the current evidence report. Specific inclusion criteria are discussed below.

Patient Populations and Settings Studied

We followed the general approach for selecting the study setting taken by the Working Group in their report: "In these evaluations of the clinical data, results were considered applicable to the aims of this report only if they came from work done in the ED setting; results coming from other settings (e.g., the CCU) were used only if no ED-based data were available. Data from non-ED settings were used with the understanding that they suggest potential utility but do not directly apply to the emergency setting."

We accepted prospective and retrospective studies that evaluated one or more of the technologies considered in this evidence report that included patients 18 years and older who presented to the ED with symptoms suggestive of ACI. We placed no restrictions on the patients' gender or ethnicity. In general, ED testing consists of either a single test that occurs within the initial 4-hour period from presentation to the ED or repeated testing that occurs up to 14 hours after the patient's initial presentation to the ED. We accepted studies with minor deviations from this standard. Retrospective studies were considered in the Data from Other Clinical Studies sections in Chapter 3, Results.

Outcomes Considered in This Evidence Report

Acute cardiac ischemia is the primary outcome of interest in this evidence report. This condition includes AMI and UAP. However, because some of the diagnostic technologies (e.g., CK-MB) are specific for the detection of AMI, we also used AMI as an outcome. We used the WHO definition for AMI, which is based on the presence of two of three criteria: a history and physical examination compatible with AMI; characteristic ECG changes, such as ST-segment elevation and evolution; and characteristic rise and fall of the cardiac enzymes. Although some studies used Braunwald's classification for UAP, this diagnosis was not clearly defined in many articles. We accepted the UAP diagnosis as reported by the authors of the articles.

Some studies also reported ischemic cardiac outcomes by procedures performed, such as coronary artery bypass graft (CABG) surgery and angioplasty, or as significant coronary artery disease diagnosed by coronary angiography. Because some of the patients who underwent cardiac procedures may indeed have UAP, we included these procedures or diagnoses into a broadened category of ACI when there was little or no other evidence available using the stricter ACI definition of AMI and UAP. The use of this broadened definition is noted in specific sections of the evidence summary. We acknowledge that this categorization is not ideal, but it reflects the designs of a large number of clinical studies.

Data Abstraction

Data for evidence tables were abstracted directly onto computer spreadsheets. Information abstracted for assessment of diagnostic performance included the study population characteristics, inclusion and exclusion criteria, the descriptions and the diagnostic criteria for the reference test and the test being evaluated, potential verification bias and test limitations, as well as the main results and the conclusions of the study. For clinical impact, additional information about the clinical outcomes was abstracted. In addition, data for quality assessment of individual studies were systematically abstracted (Appendix A). Reported test performance results, such as the summary sensitivity and specificity values, were verified against the data presented as outcomes using the discharge or final diagnosis. Data were abstracted for each of the technologies in studies that evaluated several tests simultaneously and where data for each test were available independently. Data were abstracted either independently by two members of the EPC staff or by one member and then verified by a second member. Discrepancies of abstracted data between two members were resolved by the EPC director.

Reporting the Results

The evidence we found for the technologies is summarized in three complementary forms. The evidence tables provide detailed information about key features of study design and results of all the studies reviewed. A narrative and tabular summary of the strength and quality of the evidence of each study are provided for each technology. When there was a sufficient number of studies for a specific technology, meta-analysis was performed to provide a quantitative summary of the test performance or clinical impact.

Evidence Tables

For each of the diagnostic technologies, separate evidence tables were constructed for diagnostic test performance and clinical impact studies. These tables are presented under the Evidence Tables section of this evidence report. The evidence tables list the clinical studies found for each of the technologies and that met the inclusion criteria. The specific pieces of information we included in the evidence tables are described above.

Summarizing the Evidence of Individual Studies

Grading of the evidence can be useful by indicating the overall "quality" of studies for a technology. Although a simple evidence grading system using a single metric may be desirable, the "quality" of evidence is multidimensional, and a single metric cannot fully capture information needed to interpret a clinical study (Ioannidis and Lau, 1998; Juni, Witschi, Bloch, et al., 1999; Lijmer, Mol, Heisterkamp, et al., 1999; Lohr and Carey, 1999). We believe that information on individual components of a study contributes more to the evaluation of evidence by deliberating bodies than a single summary score. The evidence-grading scheme we used here assesses the following four dimensions that are important for the proper interpretation of the evidence:

  • Size of the study.

  • Applicability (patient category and prevalence of disease).

  • Diagnostic performance or the magnitude of the clinical impact.

  • Internal validity.

Study Size

The study (sample) size is used as a measure of the weight of the evidence. In general, a large study provides a more precise estimate of the treatment effect or test performance. Size alone, however, does not guarantee generalizability. A study that enrolled a large number of selected patients may be less generalizable than several smaller studies that included a broad spectrum of patient populations.

Applicability (population categories)

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F002.jpg.

   Figure 2. Population categories

Circles I, II, and III depict the relationships of the generally used inclusion criteria for patients in prospective studies in the ED examined in this evidence report. The circles are used to illustrate the relationships only, and the size of the circles is not meant to represent the actual quantitative relationship between the groups. Most of the studies examined in this evidence report belong to the settings "II" and "III." Circle IV represents a few reviewed studies that used additional selection criteria, are retrospective, or admitted all patients to the CCU.

Applicability, or generalizability or external validity, addresses the issue of whether the study population is sufficiently broad to be generalizable to the population at large. The study population is typically defined by the inclusion and exclusion criteria. We developed four categories based on the included populations in the studies (Figure 2). This categorization provides a simple way to group these diverse studies as well as a way to understand the effect of the diversity of the criteria on test performance or clinical impact.

Category I -- studies that included all patients with symptoms and signs suggestive of ACI, such as chest pain, shortness of breath, jaw pain, acute pulmonary edema, and so on. This is the most inclusive category. Few studies met category I criteria.

Category II -- studies that included possible ACI patients with a chief complaint of chest pain. Most studies belong to this group. Category II is a subset of category I.

Category III -- studies that included patients with chest pain but that excluded those with clinical or ECG findings of AMI. Many studies, especially studies of stress cardiac imaging or testing, belong to this group. The subjects in these studies were considered to be at "low risk" for AMI or ACI. Category III is a subset of category II.

Category IV -- studies in which all patients were admitted to the CCU or used additional criteria that enrolled highly selected subpopulations. Category IV may be a subset of category I, II, or III, or it may not be a subset as shown in Figure 2. Retrospective studies fall into this category.

Applicability (disease prevalence)

The prevalence of AMI or ACI is the most objective measure of the similarity of the study populations among the studies. Because different prevalence rates of the diagnoses may reflect the actual enrollment of different distributions of the disease spectrum, varying prevalence rates reported among studies may be related to the variability of reported diagnostic performance and clinical impact results. We recorded this information to assist the interpretation of the results.

Estimates of Diagnostic Test Performance

We used three complementary methods for assessing diagnostic test performance: SROC analysis, independently combined sensitivity and specificity values, and diagnostic odds ratios. Details about these methods are provided in the meta-analysis section later in this chapter.

Estimates of Clinical Impact

Several types of clinical outcomes were reported by the studies. Dichotomous data include overall mortality, number of ACI cases missed, and number of unnecessary hospital or CCU admissions avoided. Continuous data include the mean time to thrombolysis and the mean ejection fractions of intervention and control groups. As was the case for the earlier Working Group report, there is still a paucity of clinical impact studies. When clinical impact information is available, we summarized each of the clinical outcomes independently. Details of these methods are also provided in the meta-analysis section.

Quality Assessment of Internal Validity of Diagnostic Performance Studies

Internal validity refers to the design, conduct, and reporting of the clinical study. Proposals for evaluating the methodological quality of diagnostic test evaluation have been developed (Mulrow, Linn, Gual, et al., 1989; Irwig, Tosteson, Gatsonis, et al., 1994), but they have not been empirically evaluated. A recent article (Lijmer, Mol, Heisterkamp, et al., 1999) found that some of the traditional items of quality (e.g., unmasked interpretation, patient verification) did not have a large influence on the relative diagnostic odds ratio. The results of several other studies that evaluated quality scales also call into question the value of a single quality scale, including scales that may have been "validated" (Juni, Witschi, Bloch, et al., 1999; Clark, Wells, Huet, et al., 1999). Clearly, much more research is needed in quality assessment before a reliable tool becomes available. For the purpose of this evidence report, given the above caveats, we used a three-category scale to provide some indication of the methodological quality of the studies summarized:

Grade A (least bias) -- a study that mostly adheres to the traditionally held concepts of high quality diagnostic evaluation, including: clear description of the population and setting; clear description of the reference standard, the test under investigation, and the diagnostic criteria; masked interpretation of the reference test and the test under investigation; verification of the diagnoses in all or most of the patients with negative results; and no reporting errors that might hide substantial bias.

Grade B (susceptible to some bias) -- a study that does not meet all the criteria in category A. It has some deficiencies but none likely to cause major bias.

Grade C (likely to have significant bias) -- a study with significant design or reporting errors that cannot preclude major bias. This category includes studies in which verification bias could be a large issue and studies that have large amounts of missing information or discrepancies in reporting.

Quality Assessment of Internal Validity of Clinical Impact Studies

The internal validity of clinical impact studies refers to the soundness of the design, conduct, and reporting of the clinical trial. Some of the features of this dimension have been widely used in various "quality" scales, which usually include items such as concealment of random allocation, treatment masking, and the handling of dropouts. Clinical impact studies encountered in our report consist of both randomized trials and nonrandomized prospective studies. In this evidence report, we defined three categories of quality as follows:

Grade A (least bias) -- a controlled clinical trial (randomized or quasi-randomized) with only minor methodological problems and no reporting errors likely to hide substantial bias.

Grade B (susceptible to some bias) -- a well-designed and conducted prospective nonexperimental study design or a controlled trial with some methodological and reporting problems that may hide moderate bias.

Grade C (likely to have large bias) -- a study with major methodological or reporting problems that are likely to hide significant bias. This category includes studies with large amounts of missing information.

Summarizing the Evidence for Each Technology

In addition to the grading of individual studies, we summarized each diagnostic technology. Recent studies suggest that discrete information is more consistent and useful than a single summary score (Juni, Witschi, Bloch, et al., 1999). Therefore, for each technology, we summarized the following dimensions:

  • The weight of the evidence, expressed as the total number of studies and patients.

  • The applicability of the study results, as determined by the range of populations studied and by the prevalence of ACI.

  • The methodological quality (internal validity) of the individual studies.

The number of studies and the number of patients included in each study are summarized for each technology to provide a sense of the quantity of evidence available to assess a technology. The applicability of the studies to the ED setting is assessed by the range of study population categories represented and by the prevalence of ACI or AMI reported by the individual studies. When meta-analyses of diagnostic performance and clinical impact were performed, the overall estimates are reported. If meta-analyses were not possible, the range of results reported by the individual studies is provided. The composite study methodological quality is derived using the following rule: The quality score of the majority of the studies providing the evidence, taking into account the study size as well, is used to determine the overall quality of evidence for the technology examined. We acknowledge that this grading is arbitrary, but given the recent publications on the issues with quality scoring, it is unclear whether there is a more reliable method.

Presentation of Results for Each of the Technologies

In Chapter 3, the evidence for each of the technologies is presented in the format described below. We followed the structure of the NHAAP Working Group report and presented results of the prospective studies on test performance, then studies of clinical impact, and then data from other studies (such as those conducted in the CCU). When results are available, appropriate summary tables are included in each section. The summary table is preceded by a narrative description of the included studies. The tables list the qualifying studies, the number of patients, the study population category (as defined earlier), the prevalence of AMI or ACI in each study, the test performance (sensitivity and specificity) or clinical impact, and the methodological quality of the study. The overall results of test performance or clinical impact derived from meta-analyses are also shown. Data used for specific meta-analyses are described in the meta-analysis section of the evidence report. Studies conducted in other settings (e.g., the CCU) are described and summarized in the Data From Other Clinical Studies section.

Supplemental Analyses

Meta-analyses were performed to quantify the diagnostic performance and clinical impact of several diagnostic technologies where the data were sufficient. A decision and cost-effectiveness analysis was performed to compare the cost and effectiveness of each technology.

Meta-Analysis

Diagnostic Test Performance

We used three different methods to summarize the test performance of the diagnostic technologies: SROC curve analysis, separately averaged sensitivity and specificity values across studies, and the diagnostic odds ratio.

The SROC method assumes that the variability in the reported sensitivity and specificity values from different studies is due to different cutoff values being applied (Moses, Shapiro, and Littenberg, 1993). Each study provides a pair of sensitivity and specificity values to the analysis. It uses a regression method to fit a curve that best describes the data in the ROC space. We used the unweighted SROC method because it is probably less biased than the weighted regression method (Irwig, MacAskill, Glasziou, et al., 1995). If multiple thresholds are available for individual diagnostic test studies, ROC curves can be constructed and the areas under the curves can be estimated. The area under the curve provides an assessment of the overall accuracy of the test and allows comparisons with other tests. However, few studies provided results using multiple cutpoints.

The areas under different SROC curves can also be calculated and compared across technologies. However, the range of sensitivity and specificity values from studies in a meta-analysis of diagnostic tests is often limited, and extrapolation of the SROC analysis beyond the values of actual data is not reliable. For example, the specificity values reported by the CK-MB studies are typically between 90 and 100 percent. Thus, the SROC curve that can be constructed with actual data is limited to about the first 10 percent of the SROC space, and extrapolating the SROC curve to the entire SROC space to calculate the area under the curve would not be reasonable. Most of the technologies we examined have narrow reported ranges of sensitivity or specificity values. Therefore, we did not calculate the area under the SROC curve for any of the technologies.

When there is little variability in the test results -- studies appeared to be operating at similar thresholds and reported similar results -- SROC analysis provided little additional information. In this case, separately averaged sensitivity and specificity values across studies will give similarly useful summary information.

We combined the sensitivity and specificity values of the tests across studies using a random effects model to estimate the average values. A random effects model incorporates both the within-study variation (sampling error) and between-study variation (true treatment-effect differences) into the overall treatment estimate. It gives a wider confidence interval than the fixed effects model (which considers only within-study variability) when estimates are based on heterogeneous results.

When each is combined separately, sensitivity and specificity tend to underestimate the true test sensitivity and specificity. They are nonetheless useful estimates of the average test performance and provide an indication of the approximate test operating point for most of the studies. The appropriateness of this method can be verified by inspecting the location of the combined estimates and noting the distance of the estimates from the SROC curve. In our experience, the random effects-averaged sensitivity and specificity results are close to the unweighted SROC curve and well within the confidence intervals of each other. Average sensitivity and specificity results also serve as useful baseline test performance values for the decision and cost-effectiveness analysis.

The diagnostic odds ratio for a diagnostic test is defined as: [sensitivity/(1 - sensitivity)/(1 - specificity)/specificity] (Irwig, MacAskill, Glasziou, et al., 1995). A high diagnostic odds ratio typically has either a high sensitivity or a high specificity value, or both. The higher the odds ratio, in general, the greater the test accuracy. The summary diagnostic odds ratio obtained by combining the odds ratios of individual studies using a random effects model can provide a single summary value that is useful (with other summary information about the test) for comparing technologies. The diagnostic odds ratio is equivalent to the intercept coefficient of the SROC method assuming a zero slope.

Statistical analyses using the SROC curve method and combining sensitivity and specificity using the random effects model was performed using "Meta-Test" version 0.6. Summary diagnostic odds ratios were calculated using "Meta-Analyst" version 0.991. Both of these computer programs were developed by the EPC director (Dr. Lau) and are available to the public. Where necessary, statistical analysis algorithms implemented in MathCAD 7.0™ were also used. We report 95 percent confidence intervals (CIs) along with all estimates.

Clinical Impact Studies

Studies included in meta-analyses of clinical impact outcomes were combined using a random effects model (Laird and Mosteller, 1990). The risk ratio of the outcome was used to combine dichotomous outcome data, such as mortality. A random effects model was also used to combine continuous outcomes, such as differences in the mean time to thrombolysis. Summary risk ratios were calculated using "Meta-Analyst" version 0.991. Continuous outcomes were combined using the random effects model implemented in MathCAD 7.0.™ We report 95 percent CIs with all estimates.

Decision and Cost-Effectiveness Analysis

A decision and cost-effectiveness analysis was conducted to examine the tradeoff between test performance and their costs. Details of these analyses are presented in the decision analysis section in this evidence report. Again, we recognized the difficulties and limitations of such analyses for a clinical situation as complex as diagnosing ACI in the ED. These analyses should be viewed not for the purpose of making specific clinical recommendations but for understanding the interactions among the variables studied.

Chapter 3. Results

We reviewed 407 full articles; 105 were used in this evidence report. Forty-five articles were published since 1994. About a dozen citations used in the original Working Group report were rejected in our report. Several of these citations were abstracts that have now been published as full articles, and several others did not meet our inclusion criteria.

Overall, evaluated studies examined representative patient samples. In 82 studies that reported on subjects' age, the mean age ranged from 37 to 69 years; one-half of these studies reported on subjects whose average age was at least 59, and only 6 percent had subjects whose average age was lower than 50. In 81 studies that reported on subjects' gender, between 39 percent and 95 percent of subjects were male; one-half of these studies had at least 63 percent men. Only 12 percent of studies included a majority of women, whereas 20 percent of studies had at least 75 percent men. Only rarely did studies report on subjects' racial composition.

The results of our research are presented in the following sections in this chapter. The first section concerns the relationship between the study population categories and the prevalence of AMI or ACI in the studies that evaluated the diagnostic technologies. The second section is the main part of this chapter and provides the summaries of the technologies evaluated. The results of the decision and cost-effectiveness analysis are reported in the Decision and Cost-Effectiveness Analysis section.

We followed the same general reporting structure used by the Working Group in our report. Since the earlier Working Group report already provided a detailed description of the technologies, this information will not be repeated here. Each of the technologies will be presented separately and contains two sections reporting Data From Prospective Clinical Studies in the ED Setting and Data From Other Clinical Studies. The Data From Prospective Clinical Studies in the ED Setting sections contain two subsections: Studies of Test Sensitivity and Specificity and Studies of the Clinical Impact of the Test's Actual Use.

Study Population Categories and Prevalence of AMI

The numerous studies included in the current analysis used a large number of different entry criteria for including and excluding subjects into their analyses. As described in Chapter 2, we grouped studies into four population categories -- I, II, III, or IV -- by their entry criteria. It was thought that this grouping would sort the studies into coherent population categories that may explain some of the heterogeneity between studies. However, this was often not the case.

Our goal in sorting the studies was to form groups of studies with similar sample populations. However, we noted that there remained a large spread of rates of AMI prevalence within each of the population categories. We, thus, aimed to analyze and, in part, explain the heterogeneity of prevalence. However, in contrast to the rest of this report, in this analysis of the relationship between prevalence rates and the population categories, studies of hospitalized patients were not all placed in category IV, but were placed in a I, II, or III category as appropriate. Studies were subcategorized by whether they were prehospital based, ED-only based, or included hospitalized or CCU patients.

Categorization of Studies Analyzed

Table 4. Categorization of studies by population category and setting
Population categoryEDHospitalCCUPrehospitalTotal
I1302419
II27163753
III2073131
IV41005
Total6424812108 1
1

Three studies each provided data on two population categories and are thus counted twice.

Table 4 enumerates the number of studies analyzed for each of the population categories and study settings. Three studies (Goldman, Weinberg, Weisberg, et al., 1982; Kennedy, Harrison, Burton, et al., 1997; Hingorani, O'Hanlon, Halloran, et al., 1997) are included twice as they provide prevalence data and separate analyses for different categories of patients. Biomarker studies included both those included in the meta-analyses (and evidence tables) and studies that would have been included in the meta-analyses except that no information on test performance could be extracted. The other studies included only those that appeared in the appropriate evidence tables or were discussed in the main body of Chapter 3. Thus the number of studies analyzed here may not correspond exactly to other summaries of studies analyzed.

Most studies were based in the ED, including both hospitalized and discharged patients. However, the followup rates of patients discharged from the ED are often poor, with 10 to 20 percent or more of discharged patients frequently being lost to followup. Half the studies used minor variations of including all patients with chest pain as the cause for the ED visit. About 30 percent of studies excluded patients with diagnostic ECGs. This criterion became more common with the practice of providing early thrombolytic therapy to patients with AMI. All but one of the studies that excluded patients with diagnostic ECG were published since 1990; 21 of 32 studies were published since 1996. Prior to 1996, 56 percent of the studies (35 of 63) used category II criterion and only 16 percent used category III criterion. Since 1996, only 40 percent of the studies (18 of 45) used category II criterion, and 47 percent used category III criterion. The percentage of studies using category I criterion also decreased from 24 percent prior to 1996 to 9 percent since 1996.

Relationship Between AMI Prevalence and Population Category and Setting

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F003.jpg.

   Figure 3. Biomarker studies: Relationship between population categories/study setting and prevalence of acute myocardial infarction

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F004.jpg.

   Figure 4. Diagnostic protocol, predictive instrument, and decision aid studies: Relationship between population categories/study setting and prevalence of acute myocardial infarction

Points represent individual studies (except that one study is represented by two points -- one in population II, one in III, both hospital settings). Studies are organized by population category (main columns) and study setting (subcolumns). Points are offset for clarity. The total number of studies in each population category and of all biomarker studies is presented.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F005.jpg.

   Figure 5. Echocardiography and sestamibi studies: Relationship between population categories/study setting and prevalence of acute myocardial infarction

Points represent individual studies (except that one protocol study is represented by two points -- one in population II, one in III, both hospital settings; and one decision aid study is represented by two points -- one ED setting and one hospital setting, both in population category II). Studies are organized by population category (main columns) and study setting (subcolumns). Points are offset for clarity. The total number of studies in each population category and of all studies for each technology is presented.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F006.jpg.

   Figure 6. Exercise tolerance test and dobutamine stress echocardiography studies: Relationship between population categories/study setting and prevalence of acute myocardial infarction

Points represent individual studies. Studies are organized by population category (main columns) and study setting (subcolumns). Points are offset for clarity. The total number of studies in each population category and of all studies for each technology is presented.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F007.jpg.

   Figure 7. Serial ECG, nonstandard ECG, and prehospital ECG studies: Relationship between population categories/study setting and prevalence of acute myocardial infarction

Points represent individual studies. Studies are organized by population category (main columns) and study setting (subcolumns). Points are offset for clarity. The total number of studies in each population category and of all studies for each technology is presented.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F008.jpg.

   Figure 8. All studies: Relationship between population categories/study setting and prevalence of acute myocardial infarction

Points represent individual studies (except that two studies are represented by two points -- one each in population II, one in III, both hospital settings; and one is represented by two points -- one ED setting and one hospital setting, both in population category II). Studies are organized by population category (main columns) and study setting (subcolumns). Points are offset for clarity. The total number of studies in each population category and of all studies for each technology is presented.

We plotted all the studies by the prevalence of AMI. As the majority of studies (especially biomarker studies) did not report prevalence of ACI, we focused our analysis on prevalence of AMI. Each of Figures 3 to 7 displays one or more technologies on the horizontal axis with columns representing the population categories I to IV. Each column is further divided into subcolumns representing the study settings. The points were plotted with an offset to allow easier differentiation between studies. Studies that included analyses of more than one technology appeared in each of the relevant graphs. Figure 8 includes all 108 studies analyzed.

All Studies

Examining all studies (see Figure 8), it is apparent that the current groupings do not correlate to prevalence of AMI. However, as expected, the mean prevalence in category II studies (28 percent, 95 percent CI, 24-31 percent) is greater than that in category III studies which excluded a number of definite AMIs (15 percent, 95 percent CI, 9-20 percent) (p=0.001). Category I studies, which had broader inclusion criteria, also had a somewhat lower mean prevalence (20 percent, 95 percent CI, 15-25 percent) than category II studies, although this difference was not significant (p=0.3).

The setting of the study also does not appear to yield a consistent prevalence level. However, visually inspecting the settings in categories II and III studies (which have the most hospital- and CCU-restricted studies) appears to show that hospital-restricted studies have a greater prevalence than fully ED-based studies, and CCU-restricted studies have yet a greater prevalence.

However, within category II studies, differences between mean prevalence levels of different settings are all nonsignificant. ED-based studies had a mean prevalence of 24 percent (95 percent CI, 19-29 percent), whereas hospital-restricted studies had a mean prevalence of 29 percent (95 percent CI, 22-37 percent) (p=0.5, ED versus hospital) and CCU-restricted studies had a mean prevalence of 38 percent (95 percent CI, 0-92 percent) (p=0.2, ED versus CCU, and p=0.6, hospital versus CCU). Studies with prehospital settings have somewhat greater prevalence levels (32 percent, 95 percent CI, 20-45 percent) as ED-based studies (p=0.5).

A similar pattern is seen within category III studies. ED-based studies had a mean prevalence of 8.9 percent (95 percent CI, 4.1-14 percent); hospital-restricted studies had a mean prevalence of 21 percent (95 percent CI, 8.7-33 percent) (p=0.1, ED versus hospital); and CCU-restricted studies had a mean prevalence of 36 percent (95 percent CI, 0-100 percent) (p=0.006, ED versus CCU, and p=0.2, hospital versus CCU).

Biomarker Studies

Biomarker studies have a distribution of populations, settings, and prevalence levels similar to all the studies together. Although the information reported by these studies allowed them to meet the entry criteria for this report (studies of technologies tested in the ED for ACI that are not of highly selected patients), it is clear that many of these studies do, in fact, have fairly to very selected patient samples with high prevalence of AMI. In fact, 22 percent of studies showed levels of AMI prevalence lower than 10 percent; 44 percent had prevalence levels between 10 percent and 30 percent; 25 percent had prevalence levels between 30 percent and 50 percent; and 9 percent had prevalence levels above 50 percent.

Other Technologies

Among studies of protocols, predictive instruments, and decision aids, the large majority of studies were ED-based. Notably, with one exception (Gibler, Runyon, Levy, et al., 1995) which apparently excluded a large number of patients with AMI who bypassed the "Heart Emergency Room," all hospital-restricted studies reported AMI prevalence levels greater than similar ED-based studies. The two serial ECG studies, which were both ED-based and excluded patients with initial diagnostic ECGs, showed low prevalence. In contrast, the nonstandard ECG studies were all hospital- or CCU-based and generally showed greater prevalences. The prehospital 12-lead ECG studies found a wide range of prevalence levels, though those with broader inclusion criteria generally found lower prevalences of AMI.

Echocardiography and sestamibi studies reported generally lower prevalence levels. The two exceptions (Sasaki, Charuzi, Beeder, et al., 1986; Peels, Visser, Kupper, et al., 1990) excluded patients with previous AMI, coronary artery disease, or valvular or wall motion abnormalities. Studies of exercise tolerance tests, as would be expected, all found very low levels of prevalence as high-risk patients were excluded. However, the only dobutamine stress echocardiography study (Trippi, Lee, Kopp, et al., 1997), which was restricted to CCU patients, found a considerably greater prevalence of AMI.

Summary

In summary, among studies with generally similar descriptions of inclusion criteria, there remains a large variation as to which patients are included in study samples, as evidenced by the heterogeneity of AMI prevalence rates. Even though there was little evidence in the biomarker studies for a correlation between the prevalence of AMI and test performance, the large prevalence variation raises the question of unreported selection biases within the studies. It is frequently not clear when articles are reviewed why the prevalences of AMI for given studies vary. It is necessary for more articles to report the peculiarities of the populations from which patient samples are drawn and to better report additional biases in subject selection.

There is also some evidence that studies that restricted subjects to patients admitted to the hospital or, more specifically, to the CCU, do in fact report greater prevalences of AMI and, thus, may not be representative of all ED patients seen for the possibility of ACI.

Summary of Evidence for the Diagnostic Technologies

Reported in this section are the summaries of the evidence for the diagnostic technologies examined in this evidence report.

Prehospital 12-lead Electrocardiography

Evidence Tables 1a, 1b, and 1c present details about the studies considered for this technology.

Data From Prospective Clinical Studies in the ED Setting
Studies of test sensitivity and specificity

Table 5. Prehospital 12-lead ECG: Diagnostic test performance studies
Study, yearStudy sizePopulation categoryPrevalence of disease (%)Test performanceStudy quality
Sensitivity (%)Specificity (%)
Aufderheide 1990151IACI AMI40.3 15.990 54.253 99.2A
Kudenchuk 19911,189IACI AMI46.1 32.9- 66.2- 94.8A
Dalzell 199194IIACI AMI91.5 51.178 77.188 97.8B
Bertini 1991605IIACI AMI52.4 26.095 76.493 93.5B
Aufderheide1 1992a, 1992b439IACI AMI54.4 20.743 41.887 99.7B
Arntz 19921,226IIACI AMI- 34.0- 64.3- 99.5B
Otto 1994428IACI AMI60.0 23.4- 60.0- 80.8B
Foster 1994155IIACI AMI- 13.5- 81.0- 100B
Millar-Craig 1997162IIACI AMI66.0 44.4- 98.6- 41.1B
Brown 199732IIACI AMI- 21.9- 100- 80.0C
Kudenchuk 19983,027IIACI AMI53.3 38.045.92 -- 96.02A
Overall7,508I/IIACI AMI46-92 14-5176 (54-89) 3 Odds ratio 68 (59-76) 3 Odds ratio88 (67-96) 3 23.3(6.3-85) 3 97 (89-92) 3 104 (48-224) 3B
1

Data from two publications of the same study.

2

ACI criteria - ST-segment elevation. Other criteria available; see study.

3

Results from meta-analyses using random effects calculations.

A total of 11 reports qualified for inclusion in the analysis of diagnostic accuracy. It should be noted that there were overlapping reports for two studies (Table 5). Three reports were derived from the same study population (Aufderheide, Keelan, Hendley, et al., 1992a; Aufderheide, Hendley, Woo, et al., 1992b; Otto and Aufderheide, 1994); the first two offer complementary interpretation on overall accuracy, and the third is a retrospective evaluation of the diagnostic accuracy of specific ECG changes. Another two reports stemmed from the Myocardial Infarction Triage and Intervention (MITI) trial (Kudenchuk, Maynard, Cobb, et al., 1998; Kudenchuk, Ho, Weaver, et al., 1991). The latter study contained data only on early screened patients, but the two reports provided complementary data on different diagnostic criteria and outcomes. Therefore eight studies are considered in the synthesis of the results.

The common characteristic of these eight studies was that they targeted populations with chest pain who required an ECG and did not have any significant exclusion criteria. Nevertheless, in Bertini, Rostagno, Taddei, et al. (1991), 8 percent of the hospitalized and 24 percent of the nonhospitalized patients had no data available and without specific reasons given. Similarly in Aufderheide, Keelan, Hendley, et al. (1992a) and Aufderheide, Hendley, Woo, et al. (1992b), of 680 enrolled patients, 149 had unsuccessful transmission of ECG by phone, 72 were transported to nonparticipating facilities, and 20 patients had "unavailable medical records." Finally, in the study by Millar-Craig, Joy, Adamowicz, et al. (1997), two phases were planned. During phase I, paramedics were trained and the prehospital 12-lead ECG was not used for decisionmaking, whereas in phase II, only those paramedics with accuracy over 80 percent in their phase I ECG interpretations participated. In phase II, the prehospital 12-lead ECG was used to decide whether the patient should be directly admitted to the CCU.

A brief mention should be made at this point of two excluded studies that also addressed the accuracy of chest pain protocols by paramedics. One study described 25 patients fast-tracked by paramedics to the CCU on the basis of prehospital 12-lead ECG findings (Banerjee and Rhoden, 1998). The diagnosis of AMI was verified subsequently in 14 of 25 patients, but no information is given on how many patients who were not fast-tracked to the CCU had AMI; therefore, the diagnostic characteristics cannot be determined. Another study (Wuerz and Meador, 1995) addressed the diagnostic accuracy of initiation of a chest protocol by paramedics (ECG strip monitoring, intravenous [IV] access, sublingual nitroglycerin, IV morphine) and found a sensitivity of 69 percent (260/376) and specificity of 87 percent (2,386/2,746) for ischemic heart disease. This report offers useful information of the diagnostic accuracy of the paramedics' first clinical impression, but it does not pertain to the accuracy of prehospital 12-lead ECG since only single-strip ECG was obtained.

Not considered for data synthesis are studies pertaining to chest pain protocols or computerized interpretations of ECG in the prehospital setting. This includes two studies where there was a computerized interpretation of the prehospital 12-lead ECG (Grijseels, Deckers, Hoes, et al.,1996; Aufderheide, Rowlandson, Lawrence, et al., 1996) as part of chest pain algorithms. The ACI-TIPI interpretation in Aufderheide, Rowlandson, Lawrence, et al. (1996) was performed retrospectively on 439 patients. Patients with low ACI-TIPI probability (0 to 9 percent) had a low incidence of angina (2.3 percent) and no AMI or life-threatening events. Grijseels, Deckers, Hoes, et al. (1995) considered a computer interpretation of a prehospital 12-lead ECG along with other parameters testing a number of hospital-developed algorithms in the prehospital setting and trying to develop a new algorithm suitable for prehospital triage. Details about this study can be found in the computer-based decision aids section.

Data on diagnostic accuracy for AMI on the basis of the overall ECG interpretation were available in all eight qualifying studies, and data on diagnostic accuracy for ischemia (including AMI and angina) were available from five of the eight qualifying studies. Three studies also provided diagnostic accuracy data for specific observed ECG changes. To avoid duplication of patients in the summary calculations, when more than one set of diagnostic accuracy data using different ECG criteria are reported in a study, only one set is used in the summary calculations, generally the one best approaching the definition of other studies.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F009.jpg.

   Figure 9. SROC analysis of prehospital 12-lead ECG to diagnose ACI

Table 5 summarizes the results of the prehospital 12-lead ECG studies considered for diagnostic performance. Figure 9 (see meta-analyses section) displays the SROC results of five studies to assess diagnostic performance of prehospital 12-lead ECG for ACI. Compared with AMI outcome, the diagnostic accuracy is inferior. Because of different criteria in the definition of coronary ischemia and different criteria in the definition of an abnormal ECG, there was significant heterogeneity on the sensitivity and specificity estimates in the five studies.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F010.jpg.

   Figure 10. SROC analysis of prehospital 12-lead ECG to diagnose AMI

Figure 10 (see meta-analyses section) displays the SROC result of the eight studies to assess the diagnostic performance of prehospital 12-lead ECG for AMI. As it can be shown in the figure, the Millar-Craig (1997) study operated at different sensitivity and specificity values compared with the other studies because it used very soft criteria for determining the need for direct admission to the CCU rather than for diagnosing AMI. The heterogeneity in the reported sensitivity may be due to different criteria being used in the studies used to define an abnormal ECG.

Three studies addressed the diagnostic accuracy of specific ECG changes. One study (Otto and Aufderheide, 1994) suggested that consideration of reciprocal changes in addition to ST elevation may be necessary to increase the positive predictive value to acceptable range (>90 percent) for consideration of early prehospital thrombolysis. The study by Kudenchuk, Maynard, Cobb, et al. (1998) also gathered data on the improved diagnostic performance of serial prehospital/hospital ECGs (discussed in the serial ECG section). Both studies show the anticipated tradeoff between sensitivity and specificity as more stringent ECG criteria are utilized. Dalzell, Purvis, and Adgey (1991) also provide data on specific ECG changes, but this seems to be a highly selected population with a very high prevalence of AMI.

Finally, one study (Kudenchuk, Ho, Weaver, et al., 1991) compared the performances of computer-interpreted and physician-interpreted prehospital 12-lead ECG. Computer-interpreted ECG in this study had better specificity (98 percent vs. 95 percent) and worse sensitivity (52 percent vs. 66 percent) than physician-interpreted ECG, and the overall performance was acceptable. Of course, these estimates depend on the stringency of the criteria set to recognize acute injury.

Studies of the clinical impact of the test's actual use

The clinical impact of prehospital 12-lead ECG has been addressed by comparison of the initiation of thrombolysis (based on prehospital 12-lead ECG) with hospital initiation of thrombolysis using both randomized trials and prospective nonrandomized studies. Based on the availability of data reported in the literature, we analyzed the following outcomes in this report: (1) time savings, (2) early differences in left ventricular function (as expressed by the estimated ejection fraction), (3) hospital mortality, and (4) long-term mortality.

In the analysis of clinical impact, we considered separately randomized trials and prospective nonrandomized studies with prospectively enrolled controls. We excluded observational studies that did not use any controls, but instead, in most cases, calculated the observed time delays and speculated on "potential" time that could have been saved if thrombolysis was initiated in the prehospital setting. Such reports included the following: Giovas, Papadoyannis, Thomakos, et al. (1998); Grim, Feldman, and Childers (1989); BEPS Collaborative Group (1991); Aufderheide, Haselow, Hendley, et al. (1992c); Weaver, Eisenberg, Martin, et al. (1990); Linderer, Schroder, Arntz, et al. (1993); and Bossaert, Demey, Colemont, et al. (1988). They invariably show short duration of time to potential prehospital thrombolysis as well as possible gains compared with waiting for arrival at the hospital and obtaining an ECG there before administering treatment. One more early study (Fine, Weiss, Sapoznikov, et al., 1986) was excluded because although both home- and hospital-treated patients were considered, their outcomes could not be separated. Finally, the formal meta-analysis did not consider studies that looked only at the quality of transmission over the telephone via modem of prehospital 12-lead ECGs (Grim, Feldman, Martin, et al.,1987).

Retrospective studies were also excluded from the evidence synthesis, but the largest of them is worthwhile discussing here because it is the largest study conducted to date on the clinical impact of prehospital 12-lead ECG. Canto, Rogers, Bowlby, et al. (1997) evaluated the differences in timing between AMI patients who received a prehospital 12-lead ECG versus those who did not have an ECG obtained in the prehospital setting. This was a retrospective evaluation of 70,763 patients in a large registry of AMI patients, including 3,768 patients with prehospital 12-lead ECGs. Only AMI patients presenting to the hospital within less than 12 hours of the onset of symptoms were included, and in-hospital infarction patients, transferred-in referrals, and self-transported patients were excluded; 70,763 of 275,046 registered AMI patients qualified for the analysis. The median time from onset of symptoms to hospital arrival was surprisingly prolonged in patients who had a prehospital 12-lead ECG (152 vs. 91 minutes), although the median time from hospital arrival to therapy was shortened both for thrombolysis (30 vs. 40 minutes) and for primary angioplasty (92 vs. 115 minutes) in the prehospital 12-lead ECG group. The prehospital 12-lead ECG group was more likely to have reperfusion therapy by thrombolysis or angioplasty, angiography, and CABG than the control group. In-hospital mortality was 8 percent in the prehospital group vs. 12 percent in the control group (p<0.001), and the beneficial effect on survival remained present also in multivariate analyses adjusting for various predictors of mortality.

Prospective nonrandomized evidence

Prospective nonrandomized evidence was available from five different teams. The Israeli team that pioneered prehospital thrombolysis published successive publications on their experience of prehospital vs. hospital administration of thrombolysis with streptokinase. These include the following: Koren, Weiss, Hasin, et al. (1985); Weiss, Fine, Applebaum, et al. (1987); Rozenman, Gotsman, Weiss, et al. (1994, 1995); and Weiss, Leitersdorf, Gotsman, et al. (1998). For synthesis of the evidence, we used the latest update on hard endpoints, which is the 1995 report by Rozenman and colleagues. Compared with earlier reports, this 1995 report also included patients with significant systemic hypertension and patients older than 75 years. The Weiss, Leitersdorf, Gotsman, et al. (1998) report does not contain any data on the prespecified endpoints, but simply gives data on long-term evaluation of the ejection fraction in a subset of 362 of the accrued patients (prehospital n=68, hospital n=294), showing fewer symptoms of heart failure (including dyspnea, fatigue, orthopnea, nocturnal dyspnea, nocturia, peripheral edema, and episodes of pulmonary edema) over a followup of 4 years after presentation. This report, as well as the early reports, is not discussed again here.

One of the five studies we examined (Gibler, Kereiakes, Dean, et al., 1991) reported on only four patients who had received prehospital thrombolysis, and inferences are limited. Another study, the Cincinnati Heart Project (reported in Kereiakes, Weaver, Anderson, et al., 1990), addressed the impact of obtaining a prehospital 12-lead ECG on the time from reaching the hospital until initiation of thrombolytic therapy. Thrombolysis was not given in the prehospital setting. The 13 patients enrolled in the project had average delays of 36.3 (standard deviation [SD] 11.3) minutes, as compared with 62.9 (SD 14.7) minutes for 196 patients seen in the same facilities where the prehospital 12-lead ECG protocol was not applied, and 88.8 (SD 54.4) minutes in 211 historic controls from the same facilities. No data are available on time from onset of symptoms to thrombolysis.

All three nonrandomized studies that addressed the impact of prehospital thrombolysis used time from symptom onset to treatment as an endpoint and provided short-term mortality data. Two of the three (Roth, Barbash, Hod, et al., 1990; Rozenman, Gotsman, Weiss, et al., 1995) also addressed ejection fraction in the short term.

Table 6. Time to thrombolysis for prehospital 12-lead ECG: Nonrandomized studies1
Study, yearStudy sizeTime to thrombolysis (minutes [SD])Study quality
PrehospitalHospitalPrehospitalHospital
Roth 1990744494 (35)137 (45)B
Bouten 1992226220100 (56)166 (56)B
Rozenman 199511464684 (48)126 (60)B
Overall1,324Mean time difference 1 −50.1 (−67.3, −34.2)B
1

Results from meta-analysis using random effects calculations, 95 percent CI.

The results for the time from onset of symptoms to treatment are summarized in Table 6. Overall, in the over 400 patients given thrombolysis in the prehospital setting, the time gain was approximately 50 minutes compared with their respective controls receiving hospital thrombolysis.

Table 7. Ejection fraction outcome for prehospital 12-lead ECG/thrombolysis: Nonrandomized studies
Study, yearStudy sizePopulation categoryEjection fraction percent (SD)Study quality
PrehospitalHospital
Study team 1
Rozenman 1995760III58 (13)54 (15)B
Weiss 1987113III62 (8)55 (14)B
Koren 198551IIINo dataNo dataB
Kereiakes 1990209IIINo dataNo dataB
Roth 1990116III49 (17)45 (19)B
Bouten 1992446INo dataNo dataB
Overall1,695III49-62 245-55 2B
1

Study team based in Israel.

2

Range of reported values.

Table 8. Mortality outcome for prehospital 12-lead ECG/thrombolysis: Nonrandomized studies
Study, yearStudy sizePopulation categoryMortalityStudy quality
PrehospitalHospital
Study team 1
Rozenman 1995760III5/11411/645B
Weiss 1987113III3/34No dataB
Koren 1985251III-2/53B
Bouten 1992446I0/2266/220B
Kereiakes 1990209IIINo dataNo dataB
Roth 1990116III4/723/44B
Overall1,695IIIRisk ratio 3 0.84 (0.17, 4.15)B
1

Study team based in Israel.

2

Cohort study.

3

Results from meta-analysis using random effects calculations, 95 percent CI.

Ejection fraction in the short term (in 1 week after admission in the 1995 study by Rozenman and colleagues) or upon discharge in the 1990 study by Roth and colleagues was better in the prehospital groups, but reached statistical significance only in the larger 1995 study. The magnitude of the difference was identical in the smaller 1990 study (4 percent). These data are based on 616 patients in the study by Rozenman and coworkers (1995) and 108 patients in the study by Roth and coworkers (1990) (Table 7). In-hospital mortality did not differ significantly between the two groups in any of the three studies, but data were very limited and some heterogeneity may be present. Summarized clinical outcomes are shown in Table 8 along with summary estimates based on random effects estimates. Random effects may be strongly indicated for the mortality outcome, since there was significant heterogeneity among the three studies.

Randomized evidence on clinical impact

A total of 14 randomized trials pertaining to the comparison of prehospital against hospital strategies were identified. One small trial (McNeill, Cunningham, Flannery, et al., 1989) was excluded, since the comparison was between thrombolysis in the CCU vs. thrombolysis in the ED or at home. The latter arm did not include only prehospital thrombolysis (several patients were treated in the ED). The results of this study are consistent with those of the other trials and would not affect the results overall. Eleven of the 13 qualifying trials compared prehospital vs. hospital administration of thrombolysis and typically considered populations of patients with short duration of symptoms (up to anywhere between 3 and 6 hours, when stated), age under 75, and no contraindications to thrombolysis, as perceived by each trial's investigators. The other two trials evaluated whether obtaining a prehospital 12-lead ECG might increase the time spent in the field (Karagounis, Ipsen, Jessop, et al., 1990) or may affect the time to administration of thrombolysis after admission to the hospital (Kereiakes, Gibler, Martin, et al., 1992). Karagounis and coworkers (1990) found that the 34 patients who were randomized to have a prehospital 12-lead ECG had identical in-field times to the 37 patients randomized not to have a prehospital 12-lead ECG (16.4 vs. 16.1 minutes on average). Prehospital thrombolysis was administered in six patients in the first group and thrombolysis was also given to six patients in the second group. Kereiakes and coworkers (1992) found in 11 patients randomized to have a prehospital 12-lead ECG a 15-minute reduction in the in-hospital time to thrombolytic treatment compared with 9 patients who did not have a prehospital 12-lead ECG.

Table 9. Time to thrombolysis outcome for prehospital 12-lead ECG: Randomized trials
Study, yearStudy sizeTime to thrombolysis (minutes)Study quality
PrehospitalHospitalMean (SD)Median
PrehospitalHospitalPrehospitalHospital
Castaigne 19895743NDND131180A
Barbash 1990434496 (36)132 (42)NDNDA
Karagounis 19906648 (12)68 (29)NDNDA
Schofer 1990403885 (51)137 (50)NDNDA
McAleer 199243102138172NDNDA
GREAT 19921163148NDND101240A
EMIP 19932,7502,719NDND130190A
Weaver (MITI) 199317518592 (58)120 (49)77110A
Overall6,562Mean Time Difference 2 −33.2 −44.1, 22.3)A

ND=no data. GREAT=Grampian Region Early Anistreplase Trial

1

Data from three publications of the same study.

2

Results from meta-analysis using random effects calculations of four studies that provided data, 95 percent CI.

The time from onset of symptoms to thrombolysis was mentioned as an outcome in eight trials and the results are summarized in Table 9.

As is evident, although it is not possible to arrive at exact summary estimates because of the heterogeneity in the way data are reported, nevertheless all studies show clearly that a significant reduction in the time to treatment is achieved, ranging between 20 and 60 minutes. The summary estimate would probably be closer to the (by far) largest study, the European Myocardial Infarction Project (EMIP, 1993), where the difference in the median time values was 1 hour. These results are in agreement with the data from the nonrandomized studies.

Table 10. Ejection fraction outcome for prehospital 12-lead ECG/thrombolysis: Randomized trials
Study, yearStudy sizePopulation categoryEjection fraction percent (SD)Study quality
PrehospitalHospital
Castaigne 1989100III57 (ND)53 (ND)A
Barbash 199087III48 (15)48 (15)A
Karagounis 199071IIINDNDA
Schofer 199078III51 (10) 353 (14) 3A
GREAT1 1992311IIINDNDA
McAleer 1992145III57 252 2A
Kereiakes 199222IINDNDA
Weaver (MITI) 1993360III53 (12)54 (12)A
EMIP 19935,469IIINDNDA
Overall6,643III48-57 448-54 4A

ND=no data

1

Data from three publications of the same study.

2

Evaluated only in 45 percent of the enrolled patients (not stated how selected). In other trials, all patients had evaluation of ejection fraction.

3

Evaluated in 28 patients in each arm.

4

Range of reported values. Meta-analysis was not performed as it is obvious that there is no difference.

Table 11. Mortality outcome for prehospital 12-lead ECG/thrombolysis: Randomized trials
Study, yearStudy sizePopulation categoryMortalityStudy quality
PrehospitalHospital
Castaigne 1989100III3/572/43A
Barbash 199087III1/433/43A
Karagounis 199071IIINo dataNo dataA
Schofer 199078III1/402/38A
GREAT1 1992311III11/16317/148A
McAleer 1992145III1/4312/102A
Kereiakes 199222IINo dataNo dataA
Weaver (MITI) 1993360III10/1755/185A
EMIP 19935,469III266/2,750303/2,719A
Overall6,643IIIRisk ratio 2 0.84 (0.73, 0.98)A
1

Data from three publications of the same study.

2

Results of meta-analysis using random effects calculations, 95 percent CI.

Short-term effects on the left ventricular ejection fraction were reported in five trials. As shown in Table 10, in contrast to the results of the nonrandomized studies, there was no significant difference noted in any of these five trials, and a favorable trend was seen only in two trials. On the contrary, there were trends for reduced early mortality (in-hospital or up to 60 days) in six of the seven trials that addressed mortality, and the summary estimate shows a statistically significant 16-percent reduction in the risk of death with no heterogeneity between the seven trials (Table 11). The overall estimate of effect is practically identical to that obtained in the nonrandomized studies.

Long-term impact on mortality was assessed in four trial populations (Weaver, Cerqueira, Hallstrom, et al., 1993; Barbash, Roth, Hod, et al., 1990; McAleer, Ruane, Burke, et al., 1992; Grampian Region Early Anistreplase Trial [GREAT], 1992). There was substantial heterogeneity in the presented results. In the study by McAleer and coworkers, 1-year mortality rates were 6.1 percent vs. 20 percent and 2-year mortality rates were 9.1 percent and 30.6 percent, respectively, in the prehospital and hospital arms. Followup publications on the GREAT study (Rawles, 1994, 1997) showed also a survival benefit with mortality rates at 1 year of 10.4 percent vs. 21.6 percent, and at 5 years of 25 percent vs. 36 percent. On the contrary, 2-year survival rates in the much larger MITI trial were 89 percent and 91 percent for the prehospital and hospital groups, respectively (Weaver, Cerqueira, Hallstrom, et al., 1993). Three patients in each arm of the trial conducted by Barbash and coworkers (1990) died over 2 years of followup.

Data From Other Clinical Studies

No data are reported.

Continuous Electrocardiography/Serial Electrocardiography

Evidence Table 2 presents details about the studies considered for this technology.

Data From Prospective Clinical Studies in the ED Setting
Studies of test sensitivity and specificity

Table 12. Continuous/serial ECG: Diagnostic performance studies
Study, yearStudy sizePopulation categoryPrevalence of disease (%)Test performanceStudy quality
Sensitivity (%)Specificity (%)
Hedges 1992261IIIACI AMI40.2 10.725 1 39.392 1 88.4B
Gibler 19951,010IVACI AMI4.3 1.221.2 -99.4 -C
Overall1,271III/IVACI4.3-40.221-25 292-99.4 2C
Odds ratio 3.8-45
AMI1.2-10.739.3 388.4 3
Odds ratio 4.9
1

Only summary results were available for the ACI outcome.

2

Range of values reported. Meta-analysis was not possible because of inadequate data reporting.

3

Point estimate from single study.

Two studies in the population category III that examined the diagnostic performance of serial ECGs were included in this summary. Both of these studies were included in the earlier Working Group report and are summarized in Table 12. No qualifying new study was identified for this update.

Hedges, Young, Henkel, et al. (1992) tested the diagnostic performance of serial ECG testing in patients of whom 68 percent were a population of veterans. Thirty-eight percent (159/420) of the initial patients were not included because of incomplete data. It was unknown whether the interpretation of the reference standard was blinded to the results of the diagnostic test. Because of the lack of reporting of events in the ACI category, the reported summary sensitivity and specificity results could not be verified.

Gibler, Runyon, Levy, et al. (1995) reported on a retrospective study of a 9-hour protocol involving a series of diagnostic tests. The initial population of 1,010 consecutive patients completed both serial CK-MB and serial 12-lead ECG tests. The serial ECG was conducted at 20-second intervals over a 9-hour period. A computer monitor automatically compared subsequent ECGs with the initial baseline ECG. The definition of ischemia included angina as well as unstable angina. Other potential bias factors include 28 patients who were released against medical advice and 113 cocaine users, of whom 10 were admitted by serial ECG detection. Other tests in the 9-hour protocol, echocardiography and graded exercise testing, are discussed in the algorithm/computer-aided diagnostic section.

Studies of the clinical impact of the test's actual use

No studies are reported.

Data From Other Clinical Studies

No data are reported.

Nonstandard Lead ECG

Evidence Table 3 presents details about the studies considered for this technology.

Data From Prospective Clinical Studies in the ED Setting
Studies of test sensitivity and specificity

No studies are reported.

Studies of the clinical impact of the test's actual use in the ED

No studies are reported.

Data From Other Clinical Studies

Four diagnostic test performance studies met the inclusion criteria of nonstandard lead ECGs. Although the testing was conducted in the ED setting for all studies, in three studies, all patients enrolled were subsequently admitted to the hospital (one study specifically stated cardiac monitored beds). In Justis and Hession (1992), of the 188 patients who had complete data, 163 were admitted. There were no common technologies between the four studies that could be synthesized. They include 15-, 18-, 22-lead ECGs, and a 24-lead variance cardiography.

Justis and Hession (1992) tested the diagnostic performance of the 22-lead ECG within 3 hours of presentation on a prospective cohort in an urban hospital. The 22-lead ECG uses digital cardiac electrogram technology, with each lead providing QRS data. Final diagnosis for AMI was based on CK-MB levels. An index of greater than or equal to 85 was used for positive test results. Because of discrepancies in the reporting of the data in the text as well as in the tables, sensitivity and specificity could not be verified. Enrollment consisted of 212, of whom 24 had incomplete 22-lead ECG data. One hundred eighty-eight patients, not 163 as reported, were the evaluable study size. Twenty-five patients were discharged, 24 with noncardiac chest pain or at low risk of AMI and 1 returned and had AMI confirmed. All 163 patients were admitted for serum enzymes and angiography for AMI and coronary artery disease (CAD), respectively.

Table 13. Nonstandard lead ECG: Diagnostic performance studies
Study, yearStudy sizePopulation category 1Prevalence of disease (%)Test performanceStudy quality
Sensitivity (%)Specificity (%)
Justis 1992163IVAMI21.88376B
Zalenski 1993149IVAMI22.858.893.0B
Spadafore 199652IVACI AMI48.1 2596.0 -40.7 -B
Zalenski 1997b533IVAMI64.766.184.0A
Overall897IVACI4896 241 2B
Odds ratio 17 2
AMI22-6559-83 376-93 3
Odds ratio 10-19 3
1

All patients admitted.

2

Point estimate from single study.

3

Range of values reported. Meta-analysis is inappropriate because of the different technology used in each study.

Zalenski, Cooke, Rydman, et al. (1993) conducted a 15-lead ECG on a prospective cohort in a suburban, teaching, university-affiliated community hospital. The 15-lead ECG, a standard 12-lead ECG with V4R, V8, and V9, was read by a cardiologist blinded to patient clinical outcome. There were two levels of criteria for diagnosis for AMI, the first as reported in Table 13, and a second to select patients for thrombolytic therapy in which the sensitivity was 44.1 percent and specificity was 99.1 percent. The final diagnosis for AMI included CK and CK-MB levels or conclusive ECG results. The testing was conducted to test for AMI on patients admitted to rule out myocardial infarction and unstable angina. The patients were not consecutive.

Spadafore, Lieber, and Vasilenko (1996) tested the performance of 24-lead variance cardiography (VC) in a community hospital with a prospective cohort selected for hospital admission. The VC is a 12-lead ECG with 12 auxiliary leads capable of inputting computer data, calculating QRS morphologic variability. The derived index of greater than or equal to 75 was selected as positive for ACI. The VC was tested in the ED with the physicians blinded to the VC results. The final diagnosis for ACI included characteristic clinical presentation, ECG findings, plus CK and CK-MB levels. Several limitations to be considered, as noted by the authors, are: VC does not appear to be able to differentiate between AMI and UAP patients, all AMI patients were male, and patients positive on their initial 12-lead ECGS were also included in the study.

Zalenski, Rydman, Sloan, et al. (1997b) tested an 18-lead ECG in seven emergency departments of public and private hospitals, including teaching hospitals. The 18-lead ECG included 12-lead ECG plus 3 posterior leads, V7, V8, V9, and 3 right ventricular leads, V4R, V5R, V6R. The single cardiologist reading the initial 12-lead ECG for final diagnosis and 18-lead ECG was blinded to the test results and the patient's outcome. Final diagnosis included CK and CK-MB results or pathologic Q waves. Analysis between the included and excluded patients (66 of 70) showed no difference for gender and race, but the excluded patients were younger, 61.1 vs. 65.9 years.

Exercise Stress ECG

Evidence Tables 4a and 4b present details about the studies considered for this technology.

Data From Prospective Clinical Studies in the ED Setting

One new study (Kirk, Turnipseed, Lewis, et al., 1998) and two reports (Tsakonis, Shesser, Rosenthal, et al., 1991; Kerns, Shaub, and Fontanarosa, 1993) previously used by the Working Group are included in our evidence summary. However, two studies (Gibler, Runyon, Levy, et al., 1995; Zalenski, Roberts, Das, et al., 1994) mentioned in the Working Group report are not included in the current report. The study by Gibler and colleagues (1995) is a retrospective study that enrolled 1,010 patients at the outset; the protocol of this study also resulted in a highly selected patient population that received exercise stress ECG. The study by Zalenski and colleagues (1994) is an abstract that has not yet been published as a full article and therefore not used in our report.

Studies of test sensitivity and specificity

Table 14. Exercise stress ECG to diagnose ACI: Diagnostic performance studies
Study, yearStudy sizePopulation categoryPrevalence of disease (%)Test performanceStudy quality
Sensitivity (%)Specificity (%)
Kirk 1998212IIIACI AMI6.1 1.4100 -92.5 -A
Lewis 1999100III (ED/CPU)ACI 10AMI 270 -82.2 -C
Overall312IIIACI6-1070-100 182-93 1B
Odds ratio 11-? 2

CPU=chest pain unit.

1

Range of values reported.

2

Upper range cannot be estimated because of study with 100 percent sensitivity.

Kirk, Turnipseed, Lewis, et al. (1998) studied a prospective cohort to determine the safety and utility of exercise stress testing (Table 14). The intent was to include a large, heterogeneous, low-risk population. Two hundred and twelve patients were enrolled and underwent the modified Bruce protocol. Of the 28 patients testing positive, 3 had myocardial infarction and 10 had unstable angina. There were 59 nondiagnostic and 125 negative results for a specificity of 92.5 percent.

Lewis, Amsterdam, Turnipseed, et al. (1999) studied patients with known cardiac arterial disease. The modified Bruce protocol was performed on 100 patients with a sensitivity of 70 percent and specificity of 82.2 percent. Two concerns that the study design may result in bias are first, that the study sample did not consist of consecutive patients but was based on physician referral, and second, that for the diagnosis of unstable angina, the test under study was used as part of the criteria.

Studies of the clinical impact of the test's actual use

Table 15. Exercise stress ECG: Clinical impact studies
Study, yearStudy sizePopulation categoryPrevalence of disease (%)Clinical impactStudy quality
Tsakonis 199128IIIAMI ACI0 0No cardiac eventC
Kerns 199332IIIAMI ACI0 0No cardiac eventC
Kirk 1998212IIIACI AMI6.1 1.4Unknown 1B
Overall272IIIACI AMI0-6.1 0-1.4UnknownNot applicable 2
1

This study has no comparison arm.

2

Not applicable because of unknown clinical impact.

Tsakonis, Shessee, Rosenthal, et al. (1991) looked at the feasibility and safety of emergency cardiac treadmill exercise stress testing (Table 15). A convenience sample of 28 patients with normal ECG results was enrolled. They were also likely candidates for hospitalization for coronary artery disease. The patients underwent a modified Bruce protocol, of which 23 tested negative. Four positive test results were concluded to be false positives, and the one patient with a positive test refused further workup and was lost to followup. A 1- to 12-month followup found no cardiac events.

Kerns, Shaub, and Fontanarosa (1993) looked at the feasibility and safety of cardiac treadmill exercise stress testing. Compared were two groups of patients -- prospective patients presenting to the emergency department and retrospective inpatient controls. Both groups underwent the Bruce protocol stress test with normal test results. The controls had a "primary discharge diagnosis of atypical or noncardiac chest pain." The emergency patients were followed for 6 months, and no cardiac events were experienced.

In the Kirk, Turnipseed, Lewis, et al. (1998) study described in the last section, of the 212 patients, 125 (59.0 percent) tested negative and were discharged, and 59 patients (27.8 percent) had nondiagnostic tests. Of the 28 patients who tested positive, 3 had AMI and 10 had unstable angina (UA). There was no mortality or morbidity for 95 percent (205) of patients followed for 30 days.

Data From Other Clinical Studies

No data are reported.

Creatine Kinase and Creatine Kinase-MB

Evidence Tables 5, 6, 7, and 8 present detailed descriptions of the studies.

Data From Prospective Clinical Studies in the ED Setting

A total of 47 studies were assessed for the diagnostic accuracy of either CK or CK-MB in the diagnosis of AMI in the ED. Of these, 24 have been published since 1995. Twenty-one studies included all eligible ED patients and qualified for inclusion in meta-analysis. Of these 21 studies, 16 have been published since 1995. Ten included studies addressed CK as a single test upon admission to the ED; only two addressed serial testing of CK. Ten included studies addressed CK-MB as a single test upon admission to the ED; seven addressed serial testing of CK-MB. A number of studies analyzed both CK and CK-MB and/or presentation and serial testing. Fourteen additional studies included patients evaluated in the ED but excluded patients who were discharged from the ED.

The assessments of CK and CK-MB are presented in the following order:

  • Single CK upon presentation to ED to diagnose AMI or ACI.

  • Serial CK upon presentation to ED to diagnose AMI or ACI.

  • Single CK-MB upon presentation to ED to diagnose AMI or ACI.

  • Serial CK-MB upon presentation to ED to diagnose AMI or ACI.

Single CK Upon Presentation to ED to Diagnose AMI
Studies of test sensitivity and specificity

Table 16. Presentation creatine kinase to diagnose AMI: Diagnostic performance studies (ED studies)
Study, yearStudy sizePopulation categoryPrevalence of AMI (%)Test performanceStudy quality
Sensitivity (%)Specificity (%)
Eisenberg 197980II165482C
Roxin 1984305I262896C
Lee 1987639II163880C
Hedges 1987773II6.65565B
Viskin 1987252II303893C
Mair 1991a96II243589C
Mair 1991b126II404383C
Thomson 1995383II184293C
Gornall 199698III412886B
Hetland 1996133II347.193B
Laurino 19971115III2216 1NDC
Overall22,885I/II/III6.6-4136 (29-44) 388 (80-93) 3C
Odds ratio 3.7 (2.5-5.4) 3
1

Study not included in meta-analysis.

2

Laurino (1997) not included in overall summary because the specificity was not reported.

3

Results from meta-analyses using random effects calculations.

Table 17. Presentation creatine kinase to diagnose AMI: Diagnostic performance studies (all patients admitted)
Study, yearStudy sizePopulation categoryPrevalence of AMI (%)Test performanceStudy quality
Sensitivity (%)Specificity (%)
Tucker 1994133IV (Hospital admission)295182C
Tucker 1997177IV (Hospital admission)153084B
Young 19931222IV (CCU admission)1940No dataC
Overall532IV15-2930-51 182-84 1C
Odds ratio 4.7-12 1
1

Range of reported values.

A total of 18 studies pertaining to the study of CK as a single test upon presentation to the ED for the diagnosis of AMI were retrieved. One study (Mair, Artner-Dworzak, Lechleitner, et al., 1992) presented the same data set as a previously published study. Two studies (Young and Green, 1993; Laurino, Pelletier, Eadry, et al., 1997) reported only sensitivity results. These two studies are presented in Tables 16 and 17 and Evidence Tables 5a and 5b. One study (Mair, Genser, Morandell, et al., 1996) reported only sensitivity results from a population that includes patients selected for having AMI. Two studies (Katz, Irwig, Vinen, et al., 1998; Ohman, Casey, Bengston, et al., 1990) analyzed their data with logistic regression using a continuous CK variable. Thus 12 data sets were available for meta-analysis. Two studies included only patients who were admitted to the hospital after ED evaluation; thus, only 10 of these studies (published between 1979 and 1996) included patients seen in the ED, including patients both admitted to the hospital and discharged from the ED.

The study by Eisenberg, Horowitz, Busch, et al. (1979) included patients over 30 years with chest pain seen at an urban ED. Patients without AMI were underrepresented as no data were presented on 29 additional patients who met the enrollment criteria but did not return for followup evaluation. No demographic information was reported. Of note, no description of test measurement or definition of a positive CK test was given.

The study by Roxin, Cullhed, Groth, et al. (1984) included patients "referred to [an urban] hospital on suspicion of AMI." No information was given on loss to followup. Of note, of the 305 patients included, 147 were not included in the reported test performance analysis because 22 had a final diagnosis of "possible AMI" (with atypical ECG changes or elevated aspartate amino transferase [AST] levels in only one blood sample) and for 125, AMI "could not be entirely excluded" (patients had chest pain, shock, or pulmonary edema with no other cause, but no ECG changes or elevations of AST). For the purposes of meta-analysis, all patients not meeting WHO criteria (including those with possible or "cannot rule out" AMI) were classified as not having AMI.

The study by Lee, Weisberg, Cook, et al. (1987) included patients over 30 years seen in an urban ED with a chief complaint of chest pain. Patients without AMI were underrepresented, since discharged patients who did not have followup testing were excluded from analysis. In total, 416 of 1,055 patients were excluded.

The study by Hedges, Rouan, Toltzis, et al. (1987) included patients over 30 years seen in an urban ED with chest pain. The study reported that 97 of 861 patients were excluded because a CK at presentation was missing; however, 773 patients were included in the analysis. Description of followup of discharged patients implied no loss to followup.

The study by Viskin, Heller, Gheva, et al. (1987) included patients over 25 years seen in an urban ED with chest pain. Of 300 eligible patients, 48 were excluded primarily because of incomplete ED or followup data.

One study by Mair, Artner-Dworzak, Lechleitner, et al. (1991a) included patients seen in an urban ED. No data were provided as to inclusion or exclusion criteria, loss to followup, or demographics.

Another study by Mair, Artner-Dworzak, Dienstl, et al. (1991b) included patients seen in an urban ED with a chief complaint of chest pain. No data were provided for loss to followup or demographics.

The study by Thomson, Gibbons, Smars, et al. (1995) included patients over 20 years seen in an urban ED with anterior or left lateral chest pain. Patients with recent infection, steroid use, cancer, gastrointestinal bleed, surgery, hemodialysis, or cardiopulmonary resuscitation were excluded. Only 3 of 20 patients who were discharged from the ED had followup tests and were evaluated.

The study by Gornall and Roth (1996) included patients seen in a suburban ED with chest pain. Patients with ECGs diagnostic of AMI at presentation or who had noncardiac diseases diagnosed by history and physical at presentation were excluded. No data were reported on possible loss to followup.

The study by Hetland and Dickstein (1996) included patients seen in an urban ED with "chest discomfort suggestive of AMI" for fewer than 6 hours. Patients who arrived in the ED from 10 p.m. to 8 a.m. were not included. No data were reported on possible loss to followup.

In summary, few reports gave detailed descriptions of inclusion or exclusion criteria. None described the ED or hospital setting. Eight studies primarily included only patients with chest pain. The prevalence of AMI in these studies ranged from 7 percent to 34 percent. One study had more restrictive inclusion criteria (Gornall and Roth, 1996), which excluded patients who clearly did not have ACI, and had the highest AMI prevalence, at 41 percent. One study (Mair, Artner-Dworzak, Dienstl, et al., 1991a) did not report inclusion criteria, but apparently included highly selected patients, as the prevalence of AMI was relatively high at 40 percent.

In general, AMI was defined by WHO criteria or variations thereof. In most, serial CK or CK-MB was used to define AMI. Only one (Roxin, Cullhed, Groth, et al., 1984) explicitly did not use CK to define AMI, but used serial AST instead. The studies used a variety of thresholds for abnormal CK, ranging from 80 U/L to 336 U/L for men and 70 U/L to 326.5 U/L for women. One study (Eisenberg, Horowitz, Busch, et al., 1979) did not report the threshold used.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F011.jpg.

   Figure 11. SROC analysis of presentation CK to diagnose AMI (ED only studies)

Table 16 and Figure 11 (see meta-analysis section) show a summary of included articles, results of meta-analysis, and unweighted ROC analysis. Heterogeneity among studies (primarily differences in test sensitivity levels) was not explained by CK thresholds (definition of test positivity), reported differences in eligibility criteria, study setting, or prevalence of AMI. Across studies, there were insufficient data to allow full analysis of symptom duration prior to testing; however, those studies that analyzed test performance by symptom duration (Hetland and Dickstein, 1996; Laurino, Pelletier, Eadry, et al., 1997; Lee, Weisberg, Cook, et al., 1987; Mair, Artner-Dworzak, Dienstl, et al., 1991b; Viskin, Heller, Gheva, et al., 1987) all found increased sensitivity of presentation CK in patients with longer duration of symptoms.

Studies of clinical impact of the test's actual use

No studies are reported.

Data From Other Clinical Studies

Two studies included patients evaluated in the ED who were subsequently admitted to the hospital, thus excluding patients discharged from the ED.

The first study by Tucker, Collins, Anderson, et al. (1994) included patients who were seen in a large urban community ED with chest discomfort or other symptoms prompting ordering cardiac enzymes and who were subsequently admitted to the hospital. Patients were excluded if their symptoms had persisted for more than 24 hours. Of 193 patients presenting to the ED, 30 were excluded because they were discharged from the ED; 17 had more than 24 hours of symptoms; 6 had improper laboratory test collection; 6 had no consent; and 1 was transferred. No information was reported on method of diagnosing AMI, level of abnormal CK, or patient demographics.

The second study by Tucker, Collins, Anderson, et al. (1997) included patients who were seen in an urban ED with chest discomfort of less than 24 hours and who were subsequently hospitalized. Patients with initial diagnostic ECGs or who received cardiopulmonary resuscitation were excluded. No data were reported on patients not analyzed. CK was defined as abnormal if activity was greater than 170 U/L in men or 135 U/L in women.

In summary, these two studies had similar AMI prevalence levels (at 29 percent and 15 percent, respectively) as the 10 ED studies. Like most ED studies, the second study by Tucker and colleagues (1997) included about two-thirds men. No demographic data were reported in the first study by Tucker and colleagues (1994).

Tucker and coworkers (1997) used WHO criteria with serial CK and CK-MB to diagnose AMI; and Tucker and coworkers (1994) did not report method of AMI diagnosis. The definition of abnormal CK was similar to that in the ED studies by Tucker and coworkers in 1997. Again, Tucker and coworkers (1994) did not report the definition of abnormal CK.

Including these two studies in the analysis yielded 3,195 evaluable patients, addressing the diagnostic accuracy of CK as a single test done at presentation to the ED. The random effects model yielded a pooled sensitivity of 37 percent (95 percent CI 31 percent, 44 percent) and a pooled specificity of 87 percent (95 percent CI 80 percent, 91 percent). The random effects model odds ratio is 3.9 (95 percent CI 2.7, 5.7).

Single CK Upon Presentation to ED to Diagnose ACI

No studies are reported.

Serial CK Upon Presentation to ED to Diagnose AMI
Studies of test sensitivity and specificity

Four studies pertaining to the study of CK in serial testing in the ED setting for the diagnosis of AMI were retrieved. One study (Tucker, Collins, Anderson, et al., 1997) did not report sufficient data as to the numbers of patients included at each time point to allow for complete data extraction. In addition, it was not clear from the text whether the presented test performance values over time represented cumulative, serial data. One study (Katz, Irwig, Vinen, et al., 1998) analyzed the data with logistic regression using a continuous CK variable. Thus only two studies were available for meta-analysis; however, because the number of studies available was small, meta-analysis was not performed.

Table 18. Serial creatine kinase to diagnose AMI: Diagnostic performance studies (ED studies)
Study, YearStudy sizePopulation categoryPrevalence of AMI (%)Times of blood draws evaluatedTest performanceStudy quality
Sensitivity (%)Specificity (%)
Gerhardt 1982481I43Hours 10, 16 of chest pain9968C
Roxin 1984305I260, 2, 4 hours6984C
Overall786I26-43-69-99 168-84 1C
Odds ratio 12-220 1
1

Range of reported values.

The study by Gerhardt, Waldenstrom, Horder, et al. (1982) included patients seen in an ED with "symptoms indicative of their having had an AMI within the previous 24 hours" (Table 18). The test of interest was CK-B subunit, which was corrected for CK-BB. The test was considered abnormal if activity was greater than 12 U/L. Blood was drawn 10 and 16 hours after presentation. No data were reported on patients not analyzed or on demographics.

The study by Roxin, Cullhed, Groth, et al. (1984) included patients "referred to [an urban] hospital on suspicion of AMI" (Table 18). No information was given on loss to followup. CK samples were drawn at presentation and 2 and 4 hours later. Of note, of the 305 patients included, 147 were not included in the reported test performance analysis because 22 had a final diagnosis of "possible AMI" (with atypical ECG changes or elevated AST levels in only one blood sample) and for 125, AMI "could not be entirely excluded" (patients had chest pain, shock, or pulmonary edema with no other cause, but no ECG changes or elevations of AST).

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F013.jpg.

   Figure 13. Studies of serial CK to diagnose AMI1

Table 18 and Figure 13 (see meta-analysis section) show a summary of included articles. Both studies reported broad eligibility criteria; however, these two studies varied significantly in their methods. The study by Gerhardt and coworkers (1982) based timing of serial testing on symptom duration, and the study by Roxin and coworkers (1984) based timing of serial testing on time since arrival to the ED. The study by Gerhardt and colleagues (1982) also reported an AMI prevalence at 43 percent, considerably greater than the prevalence at 26 percent reported in the study by Roxin and colleagues (1984).

Studies of clinical impact of the test's actual use

No studies are reported.

Data From Other Clinical Studies

No data are reported.

Serial CK Upon Presentation to ED to Diagnose ACI

No studies are reported.

Single CK-MB Upon Presentation to ED to Diagnose AMI
Studies of test sensitivity and specificity

Table 19. Presentation creatine kinase-MB to diagnose AMI: Diagnostic performance studies (ED studies)
Study, yearStudy sizePopulation categoryPrevalence of AMI (%)Test performanceStudy quality
Sensitivity (%)Specificity (%)
Mair 1991b126II405792C
Collins 1993195II425297B
Brogan 1994189I122399B
Castaldo 1994157II372298C
Montague 199589II283698C
Thomson 1995375II185693C
Hetland 1996133II344498B
Mair 1996100II395989B
Gornall 199698III412598B
Hedges 19961,042III6.45796A
Laurino1 1997115III2220NDC
Overall22,504I/II/III6.4-4244 (35-53) 396 (94-97) 3B
Odds ratio 23 (17-32) 3
1

Study not included in meta-analysis because data were incomplete.

2

Laurino (1997) not included in overall summary.

3

Results from meta-analysis using random effects calculations.

Table 20. Presentation creatine kinase-MB to diagnose AMI: Diagnostic performance studies (all patients admitted)
Study, yearStudy sizePopulation category (setting)Prevalence of AMI (%)Test performanceStudy quality
Sensitivity (%)Specificity (%)
Gibler 198759IV (hospital admission)361497C
Gibler 1990183IV (hospital admission)175597A
Marin 1992313IV (hospital admission)224195C
Tucker 1994133IV (hospital admission)293399C
Apple 199598IV (hospital admission)6.110086B
Tucker 1997177IV (hospital admission)152697B
Kontos 1997a101IV (CCU admission)2030100B
Fesmire 1998764IV (hospital admission)204897C
Kontos 1999a2,093IV (CCU admission)8.94699C
Young 19931222IV (CCU admission)1940No dataC
Overall23,921IV6.1-3640 (33-47) 397 (94-99) 3C
Odds ratio 28 (14-54) 3
1

Study not included in meta-analysis.

2

Laurino (1997) not included in overall summary.

3

Results from meta-analysis using random effects calculations (studies including admitted patients only).

A total of 31 studies pertaining to the evaluation of CK-MB as a single test upon presentation to the ED for the diagnosis of AMI were retrieved. Three studies (Mair, Artner-Dworzak, Lechleitner, et al., 1992; Hedges, Young, Henkel, et al., 1994; Young, Gibler, Hedges, et al., 1997) presented the same data sets as previously published studies and were thus excluded. Two studies (Laurino, Pelletier, Eadry, et al., 1997; Young and Green, 1993) reported only sensitivity results. These two studies are presented in Tables 19 and 20 and Evidence Tables 7a and 7b but are not included in meta-analysis. One study (D'Costa, Fleming, and Patterson, 1997) compared CK-MB data with troponin I data and did not report test performance data for AMI. Two studies (Katz, Irwig, Vinen, et al., 1998; Ohman, Casey, Bengston, et al., 1990) analyzed their data with logistic regression using a continuous CK-MB variable. One study (Zalenski, McCarren, Roberts, et al., 1997a) reported on a highly selected population sample of low-risk patients who were hospitalized. The study was excluded because 57 percent of the patients had symptoms for more than 24 hours (40 percent for more than 48 hours) and thus were not representative of typical ED patients. One report (Mach, Lovis, Chevrolet, et al., 1995) was excluded because it studied a highly selected sample of patients in whom AMI was suspected by criteria of the Imminent Myocardial Infarction Rotterdam Study; notably, the AMI prevalence was 78 percent. Two studies (de Winter, Koster, Sturk, et al., 1995; Laurino, Bender, Kessimian, et al., 1996) reported data only by onset of symptoms in such a way that data on presentation testing could not be extracted. Therefore, 19 data sets were available for analysis of CK-MB to diagnose AMI. Of these, seven studies included only patients who were admitted to the hospital after ED evaluation. Two others included only patients admitted to the CCU after ED evaluation. Thus, 10 studies (published between 1991 and 1996) that analyzed patients seen in the ED, including patients both admitted to the hospital and discharged from the ED, were included in meta-analysis.

One study by Mair, Artner-Dworzak, Dienstl, et al. (1991b) included patients seen in an urban ED with a chief complaint of chest pain. CK-MB was considered abnormal if mass was greater than 7 ng/ml. No data were reported on subjects not analyzed or on demographics.

The study by Collins, Wright, Rinsler, et al. (1993) included patients seen in a suburban community hospital with chest pain "suggestive of AMI" and referred with acute symptoms for cardiology assessment. CK-MB was defined as abnormal if mass was greater than 7 ng/ml; however, data were available for multiple CK-MB thresholds. No data were reported on subjects not analyzed or on demographics.

The study by Brogan, Friedman, McCuskey, et al. (1994) included patients seen in a rural university-affiliated ED with a broad range of chief complaints consistent with ACI. Patients whose symptoms were of more than 12 hours duration or who had renal failure or muscular dystrophy were excluded. CK-MB was defined as abnormal if mass was greater than 9 ng/ml and the index was greater than 2 percent. No data were reported on subjects not analyzed or on gender.

The study by Castaldo, Ercolini, Forino, et al. (1994) included patients with less than 2 hours of chest pain and were thought likely to be having AMI. Although patients had measurements at 3, 6, and 9 hours after onset of chest pain (rather than specifically at presentation), as they presented within 2 hours of onset of chest pain, the 3-hour data were used as admission measurements. CK-MB was defined as abnormal if index was greater than 5 percent. No data were reported on patients not analyzed.

The study by Thomson, Gibbons, Smars, et al. (1995) included patients over 20 years old seen in an urban ED with anterior or left lateral chest pain. Patients with recent infection, steroid use, cancer, gastrointestinal bleed, surgery, hemodialysis, or cardiopulmonary resuscitation were excluded (n=87). Only 3 of 20 patients who were discharged from the ED had followup tests and were evaluated. Fourteen patients had multiple ED visits, each of which was included in the analysis. "Exclusion of the 17 repeat visits [resulted in] only a 1 percent to 2 percent increase in the sensitivity." CK-MB was defined as abnormal if mass was greater than 4.7 ng/ml.

The study by Montague and Kircher (1995) included patients having chest pain or "suspected AMI" seen in a suburban ED. Patients with renal insufficiency (n=6) were excluded. Thirty-seven eligible patients were not evaluated in any analyses because only one blood sample was drawn. CK-MB was defined as abnormal if mass was greater than 5 ng/ml.

The second study by Mair, Genser, Morandell, et al. (1996) included patients seen in an urban ED with a chief complaint of chest pain. No data were reported on subjects not analyzed; however, one patient is unaccounted for in the CK-MB analysis. CK-MB was defined as abnormal if mass was greater than 5 ng/ml.

The study by Hedges, Gibler, Young, et al. (1996) included patients seen in a university-affiliated ED with chest discomfort or clinical suspicion of AMI. Patients with diagnostic ECG, who were unstable, or who had recent cardioversion were excluded. Thirteen patients were excluded because data were incomplete. CK-MB was defined as abnormal if mass was greater than 7 ng/ml.

The study by Gornall and Roth (1996) included patients seen in a suburban ED with chest pain. Patients with ECGs diagnostic of AMI at presentation or who had noncardiac diseases diagnosed by history and physical at presentation were excluded. No data were reported on subjects not analyzed. CK-MB was defined as abnormal if mass was greater than 8 ng/ml.

The study by Hetland and Dickstein (1996) included patients seen in an urban ED with "chest discomfort suggestive of AMI" for less than 6 hours. Patients who arrived in the ED from 10 p.m. to 8 a.m. were not included. No data were reported on subjects not analyzed. CK-MB was defined as abnormal if mass was greater than 5 ng/ml.

In general, subjects were patients who presented to the ED with chest pain or discomfort. One study (Brogan, Friedman, McCuskey, et al., 1994) included all patients with symptoms consistent with ACI. One study (Hedges, Gibler, Young, et al., 1996) excluded patients with diagnostic ECG. One study (Gornall and Roth, 1996) excluded patients with diagnostic ECGs and patients with easily diagnosed noncardiac diseases. One study (Hetland and Dickstein, 1996) excluded patients with symptoms lasting more than 6 hours. One study (Castaldo, Ercolini, Forino, et al., 1994) excluded patients with chest pain lasting less than 2 hours. One study (Thomson, Gibbons, Smars, et al., 1995) excluded patients with a variety of recent noncardiac conditions, and one (Montague and Kircher, 1995) excluded patients with renal insufficiency. Very little information was available about eligible patients who were not evaluated.

In general, AMI was defined by WHO criteria or variations thereof. In all, serial CK or CK-MB was used to define AMI. Only two (Mair, Artner-Dworzak, Dienstl, et al., 1991b; Brogan, Friedman, McCuskey, et al., 1994) explicitly did not use CK-MB to define AMI (apparently using CK only). One (Thomson, Gibbons, Smars, et al., 1995) used only serial CK-MB and sudden death to define AMI. Almost all studies used mass measurements of CK-MB (ng/ml) ranging from 4.7 ng/ml to 8 ng/ml. One (Brogan, Friedman, McCuskey, et al., 1994) used a combination of mass (9 ng/ml) and index (2 percent MB). One study (Castaldo, Ercolini, Forino, et al., 1994) used only CK-MB index (5 percent MB).

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F012.jpg.

   Figure 12. SROC analysis of presentation CK-MB to diagnose AMI (ED only studies)

Table 19 and Figure 12 (see meta-analysis section) show a summary of included articles, results of meta-analysis, and unweighted ROC analysis. As with presentation CK, heterogeneity among studies could not be explained by CK-MB thresholds, reported differences in eligibility criteria, study setting, or prevalence of AMI. There was not a clear correlation between mean or median symptom duration and test sensitivity in the nine studies that reported this information. Those studies that analyzed test performance by symptom duration (Mair and coworkers, 1991b, 1996; Collins and coworkers, 1993; Brogan and coworkers, 1994; de Winter and coworkers, 1995; Hetland and Dickstein, 1996; Laurino and coworkers, 1996, 1997) all found increased sensitivity of presentation CK-MB in patients with longer duration of symptoms.

Studies of clinical impact on the test's actual use

No studies are reported.

Data From Other Clinical Studies

Nine articles included patients evaluated in the ED who were subsequently admitted to the hospital, thus excluding patients discharged from the ED.

The study by Gibler, Gibler, Weinshenker, et al. (1987) included patients seen in a community hospital ED with chest pain who were subsequently hospitalized. Fourteen subjects were not included in analysis because of missing laboratory results or lack of consent. The definition of abnormal CK-MB was not reported. No demographic data were provided.

A second study by Gibler, Lewis, Erb, et al. (1990) included patients in the ED with chest pain who were subsequently hospitalized. Three patients were not included in the analysis: one patient had incomplete data, one was already hospitalized prior to chest pain, one had an initial CK greater than 4,000 IU/L. CK-MB was defined as abnormal if mass was greater than 7.5 ng/ml.

The study by Marin and Teichman (1992) included patients seen in the ED with chest pain consistent with AMI or ischemia for less than 12 hours who were subsequently hospitalized. Patients were excluded if they were transferred to another hospital for admission; 62 patients were not included in the analysis because no final diagnosis could be made because of such events as surgery or death, 22 patients had chest pain for more than 12 hours, and 3 were missing data. CK-MB was defined as abnormal if mass was greater than 7.5 ng/ml.

The study by Tucker, Collins, Anderson, et al. (1994) included patients seen in the ED with chest discomfort for less than 24 hours that prompted a physician to order cardiac enzymes and who were subsequently hospitalized. Sixty patients were not included in the analysis: 30 were discharged from the ED, 17 had symptoms for more than 24 hours, 6 had improper laboratory test collection, 6 had no consent, and 1 was transferred to another facility. CK-MB was defined as abnormal if mass was greater than 5 ng/ml. No data were reported on demographics.

The study by Apple, Voss, Lund, et al. (1995) included patients seen in the ED with chest pain who were subsequently hospitalized. Patients had a mean duration of chest pain of 14 hours. No information was reported on patients not included in the analysis or on subject demographics. CK-MB was defined as abnormal if mass was greater than 5 ng/ml.

The second study by Tucker, Collins, Anderson, et al. (1997) included patients seen in an urban ED with chest discomfort of less than 24 hours who were subsequently hospitalized. Patients with initial diagnostic ECGs or who received cardiopulmonary resuscitation were excluded. No data were reported on patients not analyzed. CK-MB was defined as abnormal if mass was greater than 5 ng/ml.

The study by Fesmire, Percy, Bardoner, et al. (1998) included patients seen in the ED of a university teaching hospital with chest pain who were subsequently hospitalized. Patients with recent cocaine use, tachyarrhythmias, pulmonary edema, or demand pacemakers were excluded. Of 1,000 eligible patients, 236 failed to have a second CK-MB level drawn and thus were not analyzed. CK-MB was defined as abnormal if mass was greater than 6 ng/ml.

Two articles (Kontos, Anderson, Hanbury, et al., 1997a; Kontos, Anderson, Schmidt, et al., 1999a) included patients evaluated in the ED who were subsequently admitted to the coronary care unit, thus excluding patients discharged from the ED or admitted to a noncoronary care unit bed.

The 1997a study by Kontos and colleagues included patients seen in an ED with chest pain who were subsequently admitted to the CCU. No specific exclusion criteria were reported; nine patients were not included in the analysis because of missing laboratory data. CK-MB was defined as abnormal if mass was greater than either 4.7 ng/ml or 8 ng/ml, depending on the analyzer used.

The 1999a article by Kontos and colleagues included patients initially seen in an urban ED who were admitted to the coronary care unit with suspected AMI. Patients were excluded if their initial ECG was diagnostic for AMI or if they had cardiac or respiratory arrest in the ED; 241 patients with diagnostic ECGs, 44 patients with cardiac or respiratory arrest, and 406 patients with missing laboratory data were excluded. CK-MB was defined as abnormal if mass was greater than 8 ng/ml and index was greater than 4 percent.

In summary, the range of prevalence within the studies of hospitalized patients was similar to that within ED-based studies. Likewise, the symptom inclusion and exclusion criteria and patient demographics were similar overall. The range of sensitivity and specificity values reported was also within the same range as in the ED-based studies. The one exception to this was the study by Apple and coworkers (1995), which reported 100 percent sensitivity in a sample of patients whose average symptom duration was 14 hours, compared with 2 to 6 hours in other studies. These studies were published between 1987 and 1999.

Analyzing all 19 studies addressing the diagnostic accuracy of CK-MB as a single test done at presentation to the ED yielded 6,425 patients. The random effects model yielded a pooled sensitivity of 42 percent (95 percent CI=36 to 48 percent) and a pooled specificity of 97 percent (95 percent CI=95 to 98 percent). The random effects model odds ratio is 25 (95 percent CI=18 to 36 percent).

Single CK-MB Upon Presentation to ED to Diagnose ACI

Table 21. Presentation creatine kinase-MB to diagnose ACI: Diagnostic performance studies
Study, year Study sizePopulation categoryPrevalence of ACI (%)Test performanceStudy quality
Sensitivity (%)Specificity (%)
Hedges 19961,042III20.4 (AMI 6.4)2396C
Odds ratio=7.2
Only one study evaluated CK-MB as a single test upon presentation to the ED to diagnose both AMI and UAP (Table 21).

Hedges, Gibler, Young, et al. (1996) included patients seen in a university-affiliated ED with chest discomfort or clinical suspicion of AMI. Patients with diagnostic ECGs, who were unstable, or who had recent cardioversion were excluded. CK-MB was defined as abnormal if mass was greater than 7 ng/ml. AMI was defined by WHO criteria with echocardiography and cardiac catheterization used to diagnose some patients. No definition of UAP was given. Of 1,042 patients 67 (6.4 percent) were diagnosed with AMI and 146 (14 percent) were diagnosed with UAP.

Serial CK-MB Upon Presentation to ED to Diagnose AMI
Data From Prospective Clinical Studies in the ED Setting
Studies of test sensitivity and specificity

Table 22. Serial creatine kinase-MB to diagnose AMI: Diagnostic performance studies (ED studies)
Study, yearStudy sizePopulation categoryPrevalence of AMI (%)Times of blood draws evaluatedTest performanceStudy quality
Sensitivity (%)Specificity (%)
Gerhardt 1982481I43Hours 10, 16 of chest pain10097C
Hedges 1992261III110, 1, 2, 3 hours6895C
Brogan 1994189I120, 1 hours4199B
Castaldo 1994157 1II37Hours 3, 6 of chest pain6295C
Montague 199589II280, 2 hours6892C
Gibler 19951,010III1.20, 3, 6, 9 hours10098C
Hedges 19961,042III6.40, 3 hours8895A
Sayre 19982473I5.1 (34/667) 20, 3 hours 340 399.8 3B
Overall43,149I/II/III1.2-53-800 (61-91) 596 (94-98) 5C
Odds ratio 130 (40-400) 5
1

Not all subjects had two laboratory samples.

2

AMI prevalence information reported for total sample only. No extractable data reported for subset of population (n=473) included in 3-hour test performance data. Therefore, study not included in meta-analysis.

3

Includes up to 128 of 473 patients with only a single blood draw.

4

Sayre (1998) not included in overall summary.

5

Results from meta-analysis using random effects calculations.

A total of 23 studies pertaining to the evaluation of CK-MB used in serial testing in the ED for the diagnosis of AMI were retrieved. Two articles (Young, Gibler, Hedges, et al., 1997; Hedges, Young, Henkel, et al., 1994) presented the same data sets as previously published articles and were thus excluded. One article (Puleo, Meyer, Wathen, et al., 1994) was excluded because it provided data on subforms of CK-MB only. Two articles (Brogan, Vuori, Friedman, et al., 1996; Brogan, Bock, McCuskey, et al., 1997) reported data only by symptom duration, which could not be extracted for meta-analysis or serial testing. One study (Katz, Irwig, Vinen, et al., 1998) analyzed the data with logistic regression using a continuous CK variable. One study (Mach, Lovis, Chevrolet, et al., 1995) was excluded because it studied a highly selected sample of patients in whom AMI was suspected by criteria of the Imminent Myocardial Infarction Rotterdam Study; notably, the AMI prevalence was 78 percent. One study (de Winter, Koster, Sturk, et al., 1995) reported data only by onset of symptoms in such a way that data on serial testing could not be extracted. Sayre, Kaufmann, Chen, et al. (1998) reported test performance data on 473 of 667 patients who presented to the ED with symptoms suggestive of ACI and had blood drawn within 3 hours of presentation to the ED. A large, but unreported, number of these patients had only one blood sample drawn. In addition, the prevalence of AMI in this subpopulation was not reported and thus meta-analysis could not be performed. However, this study is included in Table 22 and Evidence Table 8A. Sevenof the remaining 14 articles (published between 1982 and 1996) included patients seen in the ED or a chest pain evaluation unit that included patients both admitted to the hospital and discharged from the ED.

The study by Gerhardt, Waldenstrom, Horder, et al. (1982) included patients seen in an ED with "symptoms indicative of their having had an AMI within the previous 24 hours." The test of interest was CK-B subunit, which was corrected for CK-BB. The test was considered abnormal if activity was greater than 12 U/L. Blood was drawn at 10 and 16 hours after presentation. No data were reported on patients not analyzed or on demographics.

The study by Hedges, Young, Henkel, et al. (1992) included patients with chest discomfort seen at either an urban university ED or a Veterans' Administration ED. Patients were excluded if they had ECGs diagnostic for AMI, shock, anemia, or recent cardioversion; 159 patients were not included because of incomplete data. CK-MB was defined as abnormal if mass was greater than 8 ng/ml. Blood was drawn hourly from presentation through 3 hours.

The study by Brogan, Friedman, McCuskey, et al. (1994) included patients seen in a rural university-affiliated ED with a broad range of chief complaints consistent with ACI. Patients whose symptoms were of more than 12 hours duration or who had renal failure or muscular dystrophy were excluded. CK-MB was defined as abnormal if mass was greater than 9 ng/ml and the index was greater than 2 percent. Blood was drawn at presentation and at 1 hour. No data were reported on patients not analyzed or on gender.

The study by Castaldo, Ercolini, Forino, et al. (1994) included patients with less than 2 hours of chest pain and were thought likely to be having AMI. Although patients had measurements at 3, 6, and 9 hours after onset of chest pain (rather than specifically at presentation), as they presented within 2 hours of onset of chest pain, the 3- and 6- hour data were used as serial measurements. CK-MB was defined as abnormal if index was greater than 5 percent. No data were reported on patients not analyzed.

The study by Gibler, Runyon, Levy, et al. (1995) included patients seen in a "Heart Emergency Room" with chest pain. Those with diagnostic ECG, hypotension, or history of ACI, AMI, or CAD were excluded. Notably, those with symptoms suggestive of UAP were also excluded and patients admitted directly to the CCU who were not evaluated in the Heart Emergency Room were not included. CK-MB was defined as abnormal if greater than 6 ng/ml or 7 ng/ml (depending on the analyzer used) with an index greater than 5 percent. Blood was drawn every 3 hours from presentation through 9 hours. No data were reported on patients not analyzed.

The study by Montague and Kircher (1995) included patients with chest pain or "suspected AMI" seen in a suburban ED. Patients with renal insufficiency (n=6) were excluded. Thirty-seven eligible patients were not evaluated in any analyses because only one blood sample was drawn. CK-MB was defined as abnormal if mass was greater than 5 ng/ml. Blood was drawn at presentation and hour 2.

The study by Hedges, Gibler, Young, et al. (1996) included patients with chest discomfort or clinical suspicion of AMI seen in a university-affiliated ED. Patients with diagnostic ECG, who were unstable, or who had recent cardioversion were excluded. Thirteen patients were excluded because of incomplete data. CK-MB was defined as abnormal if mass was greater than 7 ng/ml. Blood was drawn at presentation and after 3 hours.

In summary, six studies evaluated all eligible patients seen in the ED (Gerhardt, Waldenstrom, Horder, et al., 1982; Hedges, Young, Henkel, et al., 1992; Brogan, Friedman, McCuskey, et al., 1994; Castaldo, Ercolini, Forino, et al., 1994; Montague and Kircher, 1995; Hedges, Gibler, Young, et al., 1996). One study evaluated patients seen in a chest pain evaluation unit (Gibler, Runyon, Levy, et al., 1995).

The prevalence for AMI in these studies ranged from 1.2 percent to 43 percent. In general, subjects were patients who presented to the ED with chest pain. Two studies (Gerhardt, Waldenstrom, Horder, et al., 1982; Brogan, Friedman, McCuskey, et al., 1994) included all patients with symptoms consistent with ACI. Two studies also included patients with suspected AMI (Montague and Kircher, 1995; Hedges, Gibler, Young, et al., 1996). Two studies excluded patients with more than 12 hours of symptoms (Brogan, Friedman, McCuskey, et al., 1994; de Winter, Koster, Sturk, et al., 1995); one (Castaldo, Ercolini, Forino, et al., 1994) excluded those with less than 2 hours of chest pain. Three studies excluded patients who required cardioversion or were otherwise unstable (Hedges, Young, Henkel, et al., 1992; Gibler, Runyon, Levy, et al., 1995; Hedges, Gibler, Young, et al., 1996). Three trials excluded patients with diagnostic ECG (Hedges, Young, Henkel, et al., 1992; Gibler, Runyon, Levy, et al., 1995; Hedges, Gibler, Young, et al., 1996). One study also excluded patients with a history of coronary artery disease or "a clinical syndrome. . .consistent with unstable angina" (Gibler, Runyon, Levy, et al., 1995). All but one study (Gerhardt, Waldenstrom, Horder, et al., 1982) excluded patients with trauma or whose chest pain was explained by chest radiography.

In general, AMI was defined by WHO criteria; however, one study (Gibler, Runyon, Levy, et al., 1995) did not define AMI. All studies that reported which cardiac enzymes were used to define AMI used CK and/or CK-MB except for one study that used AST and lactate dehydrogenase (LDH) to define AMI, although this study also used CK-MB when diagnosis was doubtful (Gerhardt, Waldenstrom, Horder, et al., 1982). One also defined AMI by angiography (Hedges, Gibler, Young, et al., 1996).

Four studies used mass (ng/ml) measurements for CK-MB, using thresholds that ranged from 5 ng/ml to 8 ng/ml. One study used a combination of mass (ng/ml) and index (percent MB) measurements for CK-MB (Brogan, Friedman, McCuskey, et al., 1994), using a threshold of 9 ng/ml and 2 percent CK-MB. One used activity (U/L) measurements (Gerhardt, Waldenstrom, Horder, et al., 1982), using a threshold of 12 U/L. One used only an index of 5 percent (Castaldo, Ercolini, Forino, et al., 1994). Two studies (Gerhardt, Waldenstrom, Horder, et al., 1982; Castaldo, Ercolini, Forino, et al., 1994) drew laboratory samples according to symptom duration at either 10 and 16 hours after onset of symptoms or at 3, 6, and 9 hours after onset of symptoms. In the rest of the studies, samples were drawn at ED presentation and at various times from 1 to 9 hours after presentation.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F014.jpg.

   Figure 14. SROC analysis of serial CK-MB to diagnose AMI (ED only studies)1

Table 22 and Figure 14 show a summary of included articles, results of meta-analysis, and unweighted ROC analysis. Heterogeneity was best explained by timing of serial testing. Studies that performed serial testing within 2 hours of presentation had test sensitivity levels below 70; those that drew serum samples at presentation and 3 or 4 hours had sensitivities from 68 percent to 90 percent. Test sensitivity was 100 percent in the two studies that performed serial testing at 6 or more hours after presentation. Likewise, the study that performed serial testing up to 16 hours after onset of symptoms had substantially higher sensitivity than the study that performed serial testing up to 6 hours after symptom onset.

Studies of clinical impact on the test's actual use

Table 23. Serial creatine kinase-MB to diagnose ACI: clinical impact study
Study, yearStudy sizePopulationcategoryPrevalence of ACI (%)Clinical impactStudy quality
Hedges 19961,042IIIAMI 6.4 UAP 14Additional admission to hospital of ACI patients:4 (1.9%) 3 AMI, 1 UAPC
Additional discharge from ED of ACI patients:0
Additional admission to hospital of non-ACI patients:5 (0.6%)
Additional discharge from ED of non-ACI patients:27 (3.3%)
One study (Hedges, Gibler, Young, et al., 1996) evaluated CK-MB and changes in decisionmaking by ED physicians on whether patients required hospital or CCU admission (Table 23). The study was a single-armed, prospective, observational study of 1,042 patients seen in seven EDs with chest discomfort and a clinical suspicion of AMI. Patients were excluded who had diagnostic ECGs, clinical instability, recent cardioversion, chest trauma, or diagnostic chest radiographs.

The analysis pertained primarily to patient-specific rankings by the physicians of the importance attributed to CK-MB to clinical decisionmaking. However, overall in the 67 patients with AMI, the decision to admit to the hospital was changed from "no" to "yes" after the CK-MB results for 3 (4.5 percent) patients were reviewed. For no patient with AMI was the decision to admit changed to not admit after review of the CK-MB levels. Likewise, in the 67 patients with AMI, the decision to admit to the CCU was changed from "no" to "yes" for 13 (19.4 percent) patients. The decision was changed to not admit to the CCU for two (3.0 percent) patients.

Of the 146 patients with UAP, the decision to admit to the hospital was changed from "no" to "yes" for one (0.7 percent) patient. No patients with UAP were discharged from the ED. Of the 829 patients without ACI, 5 (0.6 percent) additional patients were admitted to the hospital after review of the test results; 27 (3.3 percent) additional patients were discharged.

Data From Other Clinical Studies

Five articles included patients evaluated in the ED who were subsequently admitted to the hospital, thus excluding patients discharged from the ED.

Table 24. Serial creatine kinase-MB to diagnose AMI: Diagnostic performance studies (all patients admitted)
Study, yearStudy sizePopulation categoryPrevalence of AMI (%)Times of blood draws evaluatedTest performanceStudy quality
Sensitivity (%)Specificity (%)
Gibler 198759IV (hospital admission)450, 3, 6 hours10092C
Gibler 1992577IV (hospital admission)120, 1, 2, 3 hours8094B
Marin 1992313IV (hospital admission)200, 1, 2, 3 hours9093C
Hoekstra 19945,063IV (hospital admission)7.30, 2 hours6395B
Levitt 1996190IV (hospital admission)110, 3 hours8199A
Kontos 1997a101IV (CCU admission)200, 4 hours90100B
Kontos 1999a2,093IV (CCU admission)8.90, 3 hours7899C
Overall8,396IV (hospital/CCU admission)7.3-45-81 (71-89) 197 (94-98) 1
Odds ratio 171 (52-565) 1
1

Results from meta-analysis using random effects calculations.

A study by Gibler, Gibler, Weinshenker, et al. (1987) included patients seen in a community hospital ED with chest pain who were subsequently hospitalized (Table 24). Fourteen subjects were not included in analysis because of missing laboratory results or lack of consent. The definition of abnormal CK-MB was not reported. Blood was drawn at presentation and 3 and 6 hours later. No demographic data were provided.

Another study (Gibler, Young, Hedges, et al., 1992) included patients seen in an urban ED with a chief complaint of chest pain who were subsequently hospitalized. Patients with diagnostic ECGs were excluded (52 patients). CK-MB was defined as abnormal if mass was greater than 8 ng/ml. Blood was drawn at presentation and hourly for 3 hours. No data on subject demographics were reported.

The study by Marin and Teichman (1992) included patients seen in the ED with chest pain consistent with AMI or ischemia for less than 12 hours who were subsequently hospitalized. Patients were excluded if they were transferred to another hospital for admission; 62 patients were not included in the analysis because no final diagnosis could be made due to such events as surgery or death, 22 patients had chest pain for more than 12 hours, and 3 had missing data. CK-MB was defined as abnormal if mass was greater than 7.5 ng/ml. Blood was drawn at presentation and hourly for 3 hours.

The study by Hoekstra, Hedges, Gibler, et al. (1994) included patients seen in a community hospital ED with a chief complaint of chest pain or discomfort consistent with AMI who were subsequently hospitalized. Patients with initial diagnostic ECGs were excluded. No data were reported on patients not included in the analysis. The definition of abnormal CK-MB varied depending on the analyzer used, ranging from 5.6 ng/ml to 7.5 ng/ml. Blood was drawn at presentation and at 2 hours. No data were reported on demographics.

The study by Levitt, Promes, Bullock, et al. (1996) included patients seen in an urban ED with chest pain consistent with acute cardiac ischemia who were subsequently hospitalized. Patients with diagnostic ECGs were excluded. No data were reported on patients not analyzed. CK-MB was defined as abnormal if mass was greater than 11.9 ng/ml. This level was chosen post hoc after examination of a receiver operator curve. Blood was drawn at presentation and at 3 hours.

Two articles included patients evaluated in the ED who were subsequently admitted to the coronary care unit, thus excluding patients discharged from the ED or admitted to a noncoronary care unit bed (Kontos Anderson, Hanbury, et al., 1997a; Kontos, Anderson, Schmidt, et al., 1999a). These two articles were described above. (See Single CK-MB Upon Presentation to ED to Diagnose AMI -- Data from other clinical studies.) In the 1997a study by Kontos and colleagues, blood was drawn at presentation and at 4 hours. In the 1999a study by Kontos and coworkers, blood was drawn at presentation to the ED and at 3 hours.

In summary, seven additional studies, which included only hospitalized patients, addressed the diagnostic accuracy of CK-MB as a serial test done in the ED. Two of these evaluated only patients seen in the ED subsequently admitted to the coronary care unit (Kontos, Anderson, Hanbury, et al., 1997a; Kontos, Anderson, Schmidt, et al., 1999a). These studies were published between 1987 and 1999.

The prevalence for AMI among these studies ranged from 7 percent to 45 percent. Three studies included patients who presented to the ED with chest pain (Gibler, Gibler, Weinshenker, et al., 1987; Marin and Teichman, 1992; Kontos, Anderson, Hanbury, et al., 1997a); the other four excluded patients with diagnostic ECGs. One study excluded patients with more than 12 hours of symptoms (Marin and Teichman, 1992). One study excluded patients who required cardioversion or were otherwise unstable (Kontos, Anderson, Schmidt, et al., 1999a).

In general, AMI was defined by WHO criteria or variants. One study also defined AMI by angiography (Hoekstra, Hedges, Gibler, et al., 1994). All studies that reported which cardiac enzymes were used to define AMI used CK and/or CK-MB. One study (Kontos, Anderson, Schmidt, et al., 1999a) also used troponin I in some patients.

Five studies used mass (ng/ml) measurements for CK-MB, using thresholds that ranged from 4.7 ng/ml to 11.9 ng/ml. One study used a combination of mass (ng/ml) and index (percent MB) measurements for CK-MB (Kontos, Anderson, Schmidt, et al., 1999a), using a threshold of 8 ng/ml and 4 percent CK-MB. One study did not report a method for measuring CK-MB or a threshold (Gibler, Gibler, Weinshenker, et al., 1987). Each study had a different protocol for timing serial testing, drawing samples at ED presentation and at various times from 2 to 6 hours after presentation.

Analyzing all 14 studies addressing the diagnostic accuracy of serial CK-MB done in the ED yielded 11,625 patients. The random effects model yielded a pooled sensitivity of 79 percent (95 percent CI=71 to 86 percent) and a pooled specificity of 96 percent (95 percent CI=95 to 97 percent). The random effects model odds ratio is 140 (95 percent CI=65 to 310).

Serial CK-MB Upon Presentation to ED to Diagnose ACI

Table 25. Serial creatine kinase-MB to diagnose ACI: Diagnostic performance study
Study, yearStudy sizePopulation categoryPrevalence of ACI (%)Times of blood draws evaluatedTest performanceStudy quality
Sensitivity (%)Specificity (%)
Hedges 19961,042IIIACI 20.4 (AMI 6.4)0, 3 hours3195C 1
1

No definition of UAP was reported in study.

Only one study (Hedges, Gibler, Young, et al., 1996) pertained to the evaluation of serial CK-MB in the ED to diagnose both AMI and UAP (Table 25). That study included patients seen in a university-affiliated ED with chest discomfort or clinical suspicion of AMI. Patients with diagnostic ECGs, who were unstable, or who had recent cardioversion were excluded. CK-MB was defined as abnormal if mass was greater than 7 ng/ml. AMI was defined by WHO criteria with echocardiography and cardiac catheterization used to diagnose some patients. No definition of UAP was given.

Myoglobin

Evidence Tables 12, 13a, and 13b present detailed information about the studies.

Data From Prospective Clinical Studies in the ED Setting
Studies of test sensitivity and specificity

A total of 27 reports were evaluated for the assessment of the diagnostic accuracy of myoglobin in the diagnosis of AMI and ACI. Ten articles studied presentation myoglobin in the ED setting. We are uncertain whether there is overlap between the Tucker, Collins, Anderson, et al. (1994) and Tucker, Collins, Anderson, et al. (1997) reports and between the Kontos, Anderson, Hanbury, et al. (1997a) and Kontos, Anderson, Schmidt, et al. (1999a) reports, but protocols seem to have differences, even if the investigators are the same. Sensitivity analyses including only one of the two studies yield identical results.

The included patients could have had variable duration of symptoms upon presentation, which might have affected diagnostic performance. One study (Castaldo, Ercolini, Forino, et al., 1994) provides information defined according to time from onset of symptoms. Patients had measurements at 3, 6, and 9 hours after onset of chest pain, and all patients presented within 2 hours of onset of chest pain. In the quantitative synthesis, the 3-hour data are used as admission measurements and 6-hour data are used as measurements 2 to 4 hours after admission.

Castaldo and colleagues (1994) limited enrollment to patients with less than 2 hours of symptoms; Kennedy, Harrison, Burton, et al. (1997) to less than 4 hours of symptoms; Hetland and Dickstein (1996) to less than 6 hours of symptoms; and Brogan, Friedman, McCuskey, et al. (1994) to less than 12 hours of symptoms. Brogan and coworkers (1994) give a mean of 3.2 hours from onset of symptoms to presentation; Mair, Artner-Dworzak, Lechleitner, et al. (1992) reported that the median time from onset of symptoms was 2.3 hours; and Hetland and Dickstein (1996) reported median times from onset of symptoms for patients with AMI of 2 to 4 hours and for those without AMI of 3 hours. Other reports lack this crucial information, but it may be assumed that the distribution of this measurement is likely to be typical of unselected populations presenting with chest pain in the ED. Delays of blood draw from the time of ED arrival are not mentioned typically.

Two studies (Gornall and Roth, 1996; Kennedy, Harrison, Burton, et al., 1997) restricted the analysis to patients who had nondiagnostic ECG. This may limit their generalizability. Other reports did not seem to have substantial restrictions. Exclusion criteria related to conditions that may cause spurious elevations in myoglobin (trauma, renal insufficiency, known muscular dystrophy or other muscular disease, recent intramuscular injections, and recent prehospital thrombolysis) were mentioned to various extents and combinations in the included reports -- none of these exclusions is likely to limit the study populations substantially, although exact data on screened-out patients are sparse. Lack of samples or inadequate samples seemed to be a negligible problem, when this information was recorded and reported.

Table 26. Presentation myoglobin to diagnose AMI: Diagnostic performance studies (ED studies)
Study, yearStudy sizePopulation categoryPrevalence of AMI (%)Test performanceStudy quality
Sensitivity (%)Specificity (%)
Gilkeson 197871I183893C
Roxin 1984305I267183C
Mair 1992126I404591B
Castaldo 1994157II3738100C
Brogan 1994189I125598A
Montague 199589I285681B
Mair 1996101I394689B
Gornall 199698III414398B
Hetland 1996133II345194C
Kennedy 199786III233896C
Overall1,395I/II/III12-4149 (41-57) 193 (88-96) 1B/C
Odds ratio 13 (7.9-21) 1
1

Results from meta-analysis using random effects model.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F016.jpg.

   Figure 16. SROC analysis of presentation myoglobin to diagnose AMI (ED only studies)

The included reports use variable cutoffs for the definition of an abnormal myoglobin value, but the differences are small. Cutoffs range from 70 to 100 ng/ml. Mair, Genser, Morandell, et al. (1996) and Hetland and Dickstein (1996) also provide estimates for other thresholds that could generate ROC curves (reported area under curve 0.65 in Mair and coworkers (1996) and 0.80 in Hetland and Dickstein (1996). Shown in Table 26 and Figure 16 is the unweighted SROC analysis which suggests modest diagnostic performance. There is substantial heterogeneity in the sensitivity estimates which may be explained by the inclusion of studies with different distributions of the time from onset of symptoms. Those studies that analyzed test performance by symptom duration (Brogan, Friedman, McCuskey, et al., 1994; de Winter, Koster, Sturk, et al., 1995; Hetland and Dickstein, 1996; Laurino, Pelletier, Eadry, et al., 1997; Mair, Genser, Morandell, et al., 1996) all found increased sensitivity of presentation myoglobin in patients with longer duration of symptoms.

Only Kennedy, Harrison, Burton, et al. (1997) offered some data on the diagnostic performance of myoglobin for the diagnosis of more broadly defined coronary ischemia (AMI or angina).

Serial myoglobin in ED to diagnose AMI

Table 27. Serial myoglobin to diagnose AMI: Diagnostic performance studies (ED studies)
Study, yearStudy sizePopulation categoryPrevalence of AMI (%)Test performanceStudy quality
Sensitivity (%)Specificity (%)
Simple numerical thresholds
Roxin 1984305I269883C
Brogan 1994189I127396A
Castaldo 1994157 1II3790100C
Montague 199589I2810077B
Kennedy 199791III238696 2C
Overall831I/II/III23-3790 (78-96)90 (78-96)A/B/C
Odds ratio 140 (66-300)
Simple numerical threshold or increase in level
Brogan 1994189I129196A
Gornall 199698II4193100B
Doubling of myoglobin value only
Montague 199589I286498B
1

Not all subjects had two laboratory samples.

2

Text unclear. Specificity not stated clearly for serial myoglobin. Data included here may be overestimation of specificity.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F017.jpg.

   Figure 17. SROC analysis of serial myoglobin to diagnose AMI (ED only studies)

Eleven studies provided data on myoglobin measurements obtained some time after the initial ED/hospital presentation (Table 27; Figure 17). Ten reports evaluated use of simple numerical thresholds for myoglobin. Two reports (Brogan, Friedman, McCuskey, et al., 1994; Gornall and Roth, 1996) studied the diagnostic value of using a combination of numerical thresholds and increases (of 40 ng/ml, 50 percent, or 100 percent) from the initial serum myoglobin value. One report (Montague and Kircher, 1995) also evaluated the value doubling only within 2 hours after presentation.

As with serial CK-MB, heterogeneity was best explained by timing of serial testing. Overall, as time between serial tests was increased, the sensitivity of the test also increased. However, two small studies that performed serial testing at 2 hours after presentation reported test sensitivity of 100 percent.

Studies of clinical impact of the test's actual use

No studies are reported.

Troponin I and Troponin T

Evidence Tables 9, 10, and 11 present details about the studies considered for this technology.

Data From Prospective Clinical Studies in the ED Setting
Studies of test sensitivity and specificity

Clinical trials of troponin I and troponin T include the following:

Troponin I upon presentation

Mair, Genser, Morandell, et al. (1996) selected patients for enrollment including patients with AMI to test the diagnostic performance of an immunoassay for troponin I in the ED. There were no data on patient demographics or inclusion/exclusion criteria. Final diagnosis by cardiologist was made blinded to test results. Average onset of pain to ED presentation was 2.6 hours with a range of 0.3 to 48 hours.

Table 28. Presentation troponin I to diagnose AMI: Diagnostic performance studies (ED studies)
Study, yearStudy SizePopulation categoryPrevalence of AMI (%)Test performanceStudy quality
Sensitivity (%)Specificity (%)
Mair 1996101II392395C
Hamm 1997773III66689A
Overall874II/III6-3923-66 189-95 1A
1

Range of reported values.

Hamm, Goldmann, Heeschen, et al. (1997) evaluated the diagnostic test performance of troponin I and troponin T in the emergency department on 773 consecutive patients. Serum samples were taken at presentation and repeated 4 hours later. Patients presenting under 2 hours after onset of pain had an additional sample drawn so that all patients had tests performed at least 6 hours after onset of pain. Outside laboratory results for troponin I were made blinded to patient outcome and compared with final diagnosis by CK and CK-MB levels (see sections on serial troponin I, serial troponin T, and presentation troponin T for additional sensitivity/specificity data). The results of the two studies are shown in Table 28. The limited data suggest that the test performance is similar to that of presentation troponin T.

Other troponin I clinical data

Table 29. Presentation troponin I to diagnose AMI: Diagnostic performance studies (all patients admitted)
Study, YearStudy sizePopulation categoryPrevalence of AMI (%)AMI test performanceStudy quality
Sensitivity (%)Specificity (%)
Apple 199598IV610092C
Tucker 1997177IV153.798B
D'Costa 1997316II2079NDC
Laurino 1997115I2232NDC
There were two studies evaluating the test performance of immunoassays for detecting troponin I for patient presentation to the ED, but the patients were subsequently admitted (Table 29).

Apple, Voss, Lund, et al. (1995) analyzed the diagnostic performance of troponin I from a sample drawn at presentation in the ED from 98 consecutive patients, who were then admitted to rule out AMI. There were no data on blinding of test interpretation or patient characteristics. The mean sampling time from onset of chest pain for all patients was 14 hours and 7.4 hours for patients with AMI.

Tucker, Collins, Anderson, et al. (1997) tested the diagnostic performance of troponin I in a cohort of consecutively enrolled patients with nondiagnostic ECGs. All patients were subsequently admitted with the admitting physician blinded to the test results. The mean time from onset of chest pain for all patients was 4.0 hours and the median 2.2 hours.

Two studies also reported only sensitivity data on troponin I. D'Costa, Fleming, and Patterson (1997) included patients with less than 24 hours of chest pain, excluding patients with trauma or renal insufficiency. Data on presentation troponin I are presented only for the 62 patients with AMI who had troponin I levels above or below 1.0 ng/ml. Laurino, Pelletier, Eadry, et al. (1997) analyzed patients with a chief complaint of chest pain or symptoms consistent with AMI or ischemia for less than 6 hours. Patients receiving thrombolytics were excluded. Data on presentation troponin I are presented only for the 25 patients with AMI who had troponin I levels above or below 0.6 ng/ml.

Serial troponin I to diagnose AMI

Table 30. Serial troponin I to diagnose AMI: Diagnostic performance study
Study, yearStudy sizePopulation categoryPrevalence of AMI (%)Test performanceStudy quality
Sensitivity (%)Specificity (%)
Hamm 1997773III610083A
One study evaluated the test performance of serial immunoassays for detecting troponin I for patient presentation to the ED (Table 30). Hamm, Goldmann, Heeschen, et al. (1997) evaluated the diagnostic performance of serial troponin I at presentation and at 6 hours after onset of pain for 773 consecutive patients (see section on presentation troponin I for details of study).

Table 31. Serial troponin I to diagnose AMI: Diagnostic performance study (all patients admitted)
Study, yearStudy sizePopulation categoryPrevalence of AMI (%)Test performanceStudy quality
Sensitivity (%)Specificity (%)
Kontos 1999b620IV109096B
One study (Kontos, Jesse, Anderson, et al., 1999b) reviewed the diagnostic performance of serial troponin I over an 8-hour period in the CCU with 620 patients considered at low to moderate risk for ACI (Table 31). Patients were admitted through the ED; however, laboratory samples were drawn in the CCU. The test results were made available to the treating physician, and there were no data on the blood sampling interval other than references to sampling periods similar to other studies that used chest pain evaluation protocols.

Troponin T upon presentation

Table 32. Presentation troponin T to diagnose AMI: Diagnostic performance studies (ED studies)
Study, yearStudy sizePopulation categoryPrevalence of AMI(%)Test performanceStudy quality
Sensitivity (%)Specificity (%)
Mair 1991a96II245796C
de Winter 19951309III533491B
Mair 1996101II392892C
Hetland 1996133II345386C
Hamm 1997773III65192A
Gust 199868II 2242598B
Overall1,171II/III6-3944 (32-56) 392 (88-95) 3C
Odds ratio 10 (5.9-18) 3
1

Not included in meta-analysis. No data on numbers of subjects included in analysis of troponin T. Does not include all patients presenting to the ED.

2

Serum sample drawn prior to arrival in ED.

3

Results from meta-analysis using random effects model.

There are four eligible ED studies and one study in which laboratory samples were drawn prior to arrival in the ED (Table 32). It should be noted that different studies use variable cutoffs for the definition of an abnormal troponin T value. Cutoffs range from 0.05 ng/ml to 0.5 ng/ml, but all except two studies have cutoffs that are in the range of 0.1 to 0.2 ng/ml. Mair, Genser, Morandell, et al. (1996) and Hetland and Dickstein (1996) also provide estimates for the area under the ROC curve based on a range of thresholds (estimates of the areas reported are 0.60 and 0.76, respectively).

Mair, Artner-Dworzak, Lechleitner, et al. (1991) enrolled 96 ED patients from the hospital's department of internal medicine. There were no data on enrollment criteria or time from onset of pain to presentation. The cardiologist was blinded to the test results. The cutoff for a positive test was 0.5 ng/ml.

Mair, Gener, Morandell, et al. (1996) selected patients for enrollment including patients with AMI to test the diagnostic performance of an immunoassay for troponin T. There were no data on patient demographics or inclusion/exclusion criteria. The range for onset of pain to presentation was 0.3 to 48 hours with a median delay of 2.6 hours. Final diagnosis was made blinded to the test results.

Hetland and Dickstein (1996) evaluated 133 consecutive patients with a maximum of 6 hours of pain at presentation to the ED. The median time of pain onset to ED presentation was 3 hours for non-AMI patients, 2 hours for AMI female patients, and 4 hours for AMI male patients. Final diagnosis was determined with knowledge of the test results.

Hamm, Goldmann, Heeschen, et al. (1997) evaluated the diagnostic performance of troponin T at presentation and at 6 hours after onset of pain for 773 consecutive patients (see section on presentation troponin I for details of study).

Gust, Gust, Bottiger, et al. (1998) evaluated the diagnostic performance of troponin T drawn by an emergency doctor traveling with the emergency medical service prior to arrival in the ED. The 68 patients had radiating chest pain which was not relieved by rest or sublingual nitroglycerin. Patients with recent AMI, thrombolytic treatment, or angioplasty were excluded. Patients had a mean of 4.3 hours of symptoms (range 0.5 to 10.0 hours) prior to arrival of the ambulance. A rapid bedside immunoassay was used.

One study (de Winter, Koster, Sturk, et al., 1995) did not provide sufficient information to allow complete data extraction. It was therefore not included in meta-analysis, although it is included in Table 32. de Winter and colleagues (1995) studied 309 consecutive patients considered at low risk for AMI. Sensitivity and specificity estimates according to time from onset of symptoms were reported as there were multiple blood samples taken from the onset of chest pain, hourly from 3rd to 8th, and every 4th hour thereafter to 24 hours. The prevalence of AMI appears to be relatively high at 53 percent.

The overall specificity appears excellent, although the sensitivity seems to vary substantially among studies. It should be noted that the included patients could have had variable duration of symptoms upon presentation and typically these studies offer little information on this parameter and how it might have affected diagnostic performance.

Serial troponin T to diagnose AMI

Table 33. Serial troponin T (up to 6 hours) to diagnose AMI: Diagnostic performance studies (ED studies)
Study, yearStudy sizePopulation categoryPrevalence of AMI (%)Test performanceStudy quality
Sensitivity (%)Specificity (%)
Hamm 1997773III690 180 1A
Sayre 1998546 2II565 393 3C
1

Data from 0 to 4 hours used; other measurements available.

2

546 of 667 patients had blood drawn within 3 hours of presentation to ED.

3

Includes patients with 1, 2, or 3 blood draws within 3 hours of presentation to ED. Up to 128 patients had only blood draw.

4

Range of reported values.

Two studies (Sayre, Kaufmann, Chen, et al., 1998) and (Hamm, Goldmann, Heeschen, et al., 1997) reported information on the diagnostic performance of multiple serial measurements obtained within less than 6 hours from ED presentation (Table 33). These two studies suggest that the sensitivity of the assay increases with longer symptom duration. In the study by Sayre and coworkers (1998), the sensitivity improved from 65 percent at 0 to 3 hours after admission to 79 percent at 3 to 6 hours after admission, whereas specificity remained essentially unchanged at 93 percent using the same cutoff of 0.2 ng/ml. However this analysis of "serial" data includes up to 128 patients with only a single blood draw. Based on measurements up to 6 hours after presentation, Hamm and colleagues (1997) recorded a sensitivity of 94 percent (44/47) for a specificity of 89 percent (647/726) using a cutoff of 0.18 ng/ml. These data suggest that serial measurements for 6 hours may improve the sensitivity from the 50 percent at presentation to 80 to 90 percent, at the same time maintaining the specificity at around 90 percent.

Studies of the clinical impact of the test's actual use

No studies are reported.

Data From Other Clinical Studies

Table 34. Presentation troponin T to diagnose AMI: Diagnostic performance studies (other studies)
Study, yearStudy sizePopulation categoryPrevalence of AMI (%)Test performanceStudy quality
Sensitivity (%)Specificity (%)
Mach 199532IV7840"80" 1C
Tucker 199768IV151597B
1

Reported specificity value not possible with seven patients.

Two studies evaluated the test performance of presentation troponin T in special settings (Table 34). Mach, Lovis, Chevrolet, et al. (1995) evaluated a rapid bedside whole blood immunoassay for the detection of troponin T on 32 patients in the ED in whom AMI was suspected according to the Imminent Myocardial Infarction Rotterdam Study. Thus, the prevalence of AMI was high at 78 percent. The mean time of pain onset to ED presentation was 4.13 ± 1.8 hours. The diagnoses for the seven non-AMI patients were stable angina (1), UAP (5), and myopericarditis (1). The study reported that ". . .the reading of positivity was performed by the eye and with occasional interference from hemolyzed blood." Of note, the reported specificity of 80 percent is not possible with a sample of seven patients without AMI.

Tucker, Collins, Anderson, et al. (1997) tested the diagnostic performance of troponin T in a cohort of consecutively enrolled patients with nondiagnostic ECGs. However, all patients were subsequently admitted. The mean time from onset of chest pain for all patients was 4.0, median 2.2 hours.

Table 35. Serial troponin T to diagnose AMI: Diagnostic performance studies (other studies)
Study, yearStudy sizePopulation categoryPrevalence of AMI (%)Test performanceStudy quality
Sensitivity (%)Specificity (%)
Mach 199532IV7810086C
Mohler 1998100IV69489B
Two other studies evaluated serial troponin T in special settings (Table 35). The study by Mach and coworkers (1995) is described above. Serial testing was performed at presentation and at 4 hours. Mohler, Ryan, Segar, et al. (1998) evaluated patients with "cardiac" chest pain who were subsequently admitted to the hospital. Serial testing was done at presentation, 4 hours, and later times up to 24 hours.

Only two studies offered diagnostic performance data for more broadly defined coronary outcomes. Mohler and colleagues (1998) provided data on AMI or unstable angina and showed a sensitivity of 58 percent (36/62) and specificity of 97 percent (37/38). Green, Beaudreau, Chan, et al. (1998) evaluated AMI or any adverse cardiac event as the outcome and found a sensitivity of 22 percent (10/45) and specificity of 98 percent (242/247). These data, although limited, suggest excellent specificity, but very poor sensitivity, for the diagnosis of more broadly defined cardiac coronary outcomes.

Combination Biomarker Tests

Data From Prospective Clinical Studies in the ED Setting

Four studies reported on the combination of CK-MB and myoglobin. Three of these reported data on the two tests drawn in one serum sample; two reported data on serial samples. No other combinations of biomarker tests were reported by other studies. Only one of these studies evaluated all ED patients.

Combination CK-MB and myoglobin upon presentation

Table 36. Presentation CK-MB and myoglobin to diagnose AMI: Diagnostic performance studies
Study, yearStudy sizePopulation categoryPrevalence of AMI (%)Test performanceStudy quality
Sensitivity (%)Specificity (%)
Montague 199589II28100 172 1C
Kontos 1997a101IV208580B
Kontos 1999a2,093IV96289C
Overall2,283IV9-2883 (81-96) 282 (68-90) 2B/C
Odds ratio 17 (7.6-40) 2
1

Results from sample drawn at 2 hours after presentation to ED.

2

Results from meta-analysis using random effects model.

Two studies reported data on combination CK-MB and myoglobin drawn at patient presentation to the ED (Kontos, Anderson, Hanbury, et al. 1997a; Kontos, Anderson, Schmidt, et al., 1999a) (Table 36). A third study reported only combination data from the serum sample drawn 2 hours after presentation (Montague and Kircher, 1995). Eligibility criteria and test thresholds were the same as for evaluation of the individual tests. Kontos and colleagues (1997a, 1999a) included only patients admitted to the CCU; Montague and Kircher (1995) included all eligible ED patients. A positive combination test was defined as either a positive CK-MB or a positive myoglobin.

Although the test performance of the combination test appears better than that for each of the individual tests, it should be noted that it is likely that the decision to report the combination result was made a posteriori. It is unclear if the small number of studies that actually reported the combination data is a biased sample (in that studies with better combination performance than individual test performance may be more likely to report the data). In addition, the study that reported only 2-hour data had a substantially higher test sensitivity of 100 percent.

Serial combination CK-MB and myoglobin to diagnose AMI

Table 37. Serial CK-MB and myoglobin to diagnose AMI: Diagnostic performance studies
Study, yearStudy sizePopulation CategoryPrevalence of AMI (%)Test performanceStudy Quality
Sensitivity (%)Specificity (%)
Levitt 1996190IV1110091A
Kontos 1997a101IV2010075B
Only two studies reported on combination CK-MB and myoglobin drawn in serial serum samples (Levitt, Promes, Bullock, et al., 1996; Kontos, Anderson, Hanbury, et al., 1997a) (Table 37). Eligibility criteria and test thresholds were the same as for evaluation of the individual tests. The studies included only patients admitted to either the hospital or the CCU. A positive combination test was defined as either a positive CK-MB or a positive myoglobin in either of the serum samples drawn.

With such a small sample, it is difficult to compare individual with combination serial testing, although the test performance appears to be at least similar.

Other Biomarkers (P-Selectin, Malondialdehyde Low-Density Lipoprotein)

Data from Prospective Clinical Trials in the ED Setting
Studies of test sensitivity and specificity

Table 38. Presentation P-selectin: Diagnostic performance study
Study, yearStudy sizePopulation categoryTestPrevalence of disease (%)Test performanceStudy quality
SensitivitySpecificity(%)
Hollander 1999263IISoluble P-selectinAMI 8.4 ACI 3345 (27-69) 35 (25-46)76 (70-81) 79 (72-85)B
Membrane-bound P-selectinAMI ACI32 (15-54) 30 (21-41)71 (65-77) 71 (64-78)
P-selectin index 1AMI ACI59 (41-82) 55 (45-66)54 (48-61) 57 (50-65)
1

P-selectin index required elevation in either soluble or membrane-bound P-selectin.

A single study (Hollander, Muttreja, Dalesandro, et al., 1999) studied the diagnostic test performance of P-selectin to diagnose ACI and AMI in the ED (Table 38). Consecutive patients over age 18 who presented to the ED of an urban, tertiary care center who had a chief complaint of chest pain for less than 6 hours were included. Patients whose symptoms were clearly of noncardiac origin were excluded. Approximately 10 percent of patients were not analyzed because of refusal to participate or inadequate blood samples. AMI was defined by WHO criteria; UAP was classified according to the AHCPR (now AHRQ) risk stratification scheme, in addition to requiring demonstration of CAD, exercise-induced ischemia, or CK-MB levels between 5 and 10 ng/ml. Subjects had presented to the ED with a median of 3 hours of symptoms.

Thresholds for P-selectin were chosen post hoc after ROC curve analysis. Blood was drawn at presentation and at 1 hour. To diagnose AMI, presentation soluble P-selectin was found to have a sensitivity of 45 percent and a specificity of 76 percent with a threshold of 138.8 ng/ml. Presentation membrane-bound P-selectin had a slightly lower test performance, with a sensitivity of 32 percent and a specificity of 71 percent with a threshold of 4.8 percent. In addition, a "P-selectin index" (requiring elevation in either soluble or membrane-bound P-selectin) had a sensitivity of 59 percent and a specificity of 54 percent. Serial testing results at 1 hour were not presented, although it was stated that they did not significantly affect test performance.

To diagnose ACI, presentation soluble P-selectin had a sensitivity of 35 percent and a specificity of 79 percent; membrane-bound P-selectin, a sensitivity of 30 percent and a specificity of 71 percent; and the P-selectin index, a sensitivity of 55 percent and a specificity of 57 percent. Again, serial results were not presented.

Overall, in this single study, P-selectin at presentation (and reportedly with serial testing at presentation and 1 hour) had poor to moderate sensitivity and moderate selectivity.

Studies of clinical impact on the test's actual use

No studies are reported.

Data From Other Clinical Studies

Malondialdehyde (MDA)-modified low-density lipoprotein (LDL) is a biomarker beginning to be evaluated for its role in diagnosing or categorizing ACI. It may reflect endothelial injury or plaque instability (Holvoet, Collen, and Van de Werf, 1999). This is the only study to date that investigates the clinical utility of this biomarker. The study included 104 patients with ACI (42 with UAP, 62 with AMI) who presented to the ED with chest pain and ECG changes, and 64 patients with stable, angiographically documented coronary artery disease. Blood samples were drawn for patients with ACI within 6 to 8 hours of onset of chest pain. Plasma levels of MDA-modified LDL were 2.6- to 2.9-fold higher in patients with AMI or UAP than with stable coronary artery disease. Plasma levels were similar in patients with AMI and UAP. ROC analysis also revealed that MDA-modified LDL discriminated between stable CAD and UAP. At a threshold of 0.70 mg/dl (0.02 mmol/L) MDA-modified LDL had a sensitivity of 95 percent for UAP and 95 percent for AMI. The specificity in this highly selected sample of patients with stable or unstable coronary artery disease was 95 percent.

Echocardiography (rest)

Evidence Table 14a presents details about the studies considered for this technology.

Data From Prospective Clinical Trials in the ED Setting

A total of 11 reports were considered for the assessment of the diagnostic accuracy of echocardiography (Horowitz, Morganroth, Parrotto, et al., 1982; Sasaki, Charuzi, Beeder, et al., 1986; Peels, Visser, Kupper, et al., 1990; Sabia, Afrookteh, Touchstone, et al., 1991; Levitt, Promes, Bullock, et al., 1996; Mohler, Ryan, Segar, et al., 1996, 1998; Trippi, Kopp, Lee, et al., 1996; Trippi, Lee, Kopp, et al., 1997; Kontos, Arrowood, Jesse, et al., 1998; Gibler, Runyon, Levy, et al., 1995). Eight of these reports dealt with rest echocardiograms, and two dealt with both rest and dobutamine stress echocardiography (Trippi, Kopp, Lee, et al., 1996; Trippi, Lee, Kopp, et al., 1997). Both of the articles by Trippi and colleagues pertain to the same study and the earlier one contains the first 26 patients of the later publication; however, complementary information is provided. One study compared video vs. digital recording of echocardiograms (Mohler, Ryan, Segar, et al., 1996).

Studies of test sensitivity and specificity

Table 39. Rest echocardiography to diagnose AMI: Diagnostic performance studies (ED studies)
Study, yearStudy sizePopulation categoryPrevalence of AMI (%)Test performanceStudy quality
Sensitivity (%)Specificity (%)
Peels 199043III30.292.353.3C
Sabia 1991169I16.793.157.1B
Kontos 1998185III3.210082.1B
Overall397III3-3093 (81-97) 166 (43-83) 1B
Odds ratio 20 (6.5-62) 1
1

Results from meta-analyses using random effects calculations.

Only three rest echocardiography studies met the inclusion criteria of being exclusively in the ED setting (Peels, Visser, Kupper, et al., 1990; Sabia, Afrookteh, Touchstone, et al., 1991; Kontos, Arrowood, Jesse, et al., 1998) and their details are presented in Evidence Table 14a. The sensitivity values reported by the three studies showed very little variation. Therefore, the SROC method was not particularly useful to summarize the data for rest echocardiography. The random effects model sensitivity and specificity results calculated for these three studies are shown in Table 39.

Two studies (Peels, Visser, Kupper, et al., 1990; Kontos, Arrowood, Jesse, et al., 1998) (Table 39) provided data for a broad ACI definition for diagnostic performance; the random effect diagnostic odds ratio is similar to that of AMI outcome, 16.6 (95 percent CI, 7.1 to 38.9).

Studies of clinical impact of the test's actual use

No studies are reported.

Data From Other Clinical Studies

Several studies that do not meet the strict ED only inclusion criteria are discussed here. With the exception of studies by Mohler, Ryan, Segar, et al. (1998) and Horowitz, Morganroth, Parratto, et al. (1982), all other studies required normal or nondiagnostic ECGs as an inclusion criterion. In addition, Trippi, Lee, Kopp, et al. (1997) and Gibler, Runyon, Levy, et al. (1995) also required normal enzymes. Gibler and coworkers (1995) performed echocardiograms only after serial enzymes and serial ECGs had been normal for 9 hours and not all patients underwent echocardiography. It is thus a very selected population. A past history of AMI was an exclusion criterion for Trippi and colleagues (1997), Gibler and colleagues (1995), Sasaki, Charuzi, Beeder, et al. (1986), and Horowitz and colleagues (1982), whereas at least the first three studies also excluded patients with any history of CAD or ACI -- no similar statement is clarified in Sasaki and coworkers (1986) and Horowitz and coworkers (1982). Almost all the patients in the study by Mohler and colleagues (1998) had previous echocardiograms for comparison to avoid misinterpretation of old abnormalities. The results from Levitt, Promes, Bullock, et al. (1996) are expressed as mean and standard deviation of wall motion scores, and no difference was reported in echocardiographic scores between the AMI and non-AMI groups (mean 16.9 vs. 15.3, p=0.32).

The diagnosis of AMI was based on WHO criteria or variants thereof, but several studies seemed to depend entirely on enzymes. More broadly defined coronary syndromes were studied in Sasaki and coworkers (1986) (AMI or CAD by angiography or stress test), Trippi and coworkers (1997) (AMI or CAD by catheterization or telephone survey), and Gibler and coworkers (1995) (any cardiac disease). When the ED studies and non-ED studies are combined, the random effects pooled sensitivity is 77 percent (95 percent CI=51 to 92 percent) and the specificity is 85 percent (95 percent CI=74 to 91 percent) using this broad coronary syndrome definition. The use of diverse and suboptimal reference standards may affect the estimate of diagnostic performance of echocardiography. The study by Mohler, Ryan, Segar, et al. (1996) suggests that there is excellent concordance between video and digital evaluation of echocardiograms.

Dobutamine Stress Echocardiography

Evidence Table 14b presents details about the studies considered for this technology.

Data From Prospective Clinical Trials in the ED Setting
Studies of test sensitivity and specificity

No studies are reported.

Studies of the clinical impact of the test's actual use

No studies are reported.

Data From Other Clinical studies

Table 40. Dobutamine stress echocardiography -- broaden ACI definition: Diagnostic performance study
Study, yearStudy sizePopulation categoryPrevalence of disease (%)Test performanceStudy quality
Sensitivity (%)Specificity (%)
Trippi 1997139III (ED/CCU)ACI AMI11.6-13.7 1 3.7-4.3 1- 89.5 2- 88.9 2C
1

Depends on which of the three possible numbers reported was used in the calculations.

2

Point estimate from single study.

Two articles dealt with both rest and dobutamine stress echocardiography (Trippi, Kopp, Lee, et al., 1996; Trippi, Lee, Kopp, et al., 1997). Both of these articles pertain to the same study and the earlier article reports on the first 26 patients of the later publication; however, complementary information is provided. All the patients were admitted to the CCU in the 1997 study (Table 40). It reported a sensitivity of 89.5 percent and specificity of 88.9 percent against AMI or CAD by history, catheterization, or telephone survey. This is a highly selected population who had nondiagnostic ECG, normal enzymes, and a pain score of less than 2 out of 10 at the time of the study; all the patients had negative rest echocardiography. Several discrepancies in the data were found in this study. The number of patients who actually received dobutamine stress echocardiography and the reported sensitivity and specificity results cannot be verified with data reported in the text and tables.

Technetium-99m Sestamibi Imaging

Evidence Table 15 presents details about the studies considered for this technology.

Data From Prospective Clinical Trials in the ED Setting
Studies of test sensitivity and specificity

Nine reports pertaining to the study of Tc-99m sestamibi imaging in the ED setting were retrieved. Two teams had produced three reports, each of which overlapped to a substantial extent; and another team produced two overlapping reports. First, the Virginia Commonwealth University team produced three overlapping reports. Kontos, Arrowood, Jesse, et al. (1998) presented information on 185 patients who had both Tc-99m sestamibi and echocardiography during the period of August 1994 to December 1994; and Tatum, Jesse, Kontos, et al. (1997) presented information on 438 patients (regardless of whether they had echocardiography as well or not) during the period June 1, 1994, to October 26, 1994, at the same institution based on a rule-out ACI protocol which used Tc-99m sestamibi for patients at low to moderate risk for AMI and ACI.

The overarching report by Kontos, Jesse, Schmidt, et al. (1997b) contains information on patients who had Tc-99m sestamibi imaging at the same institutions between June 1994 and August 1995. Therefore only this report, of the three, was used in the evidence synthesis.

Similarly, three reports originated from the William Beaumont Hospital. Weissman, Dickinson, Dworkin, et al. (1996) report on the original experience of 50 scanned patients. The data were used to generate cost-effectiveness estimates, but the report does not include diagnostic accuracy data which were previously presented in a 1995 abstract in the Journal of Nuclear Medicine. The Stewart, Dickinson, Weissman, et al. (1996) report contains an update with a total of 68 patients, and therefore only this report was considered in the evidence synthesis.

It was verified that the paper by Hilton, Fulmer, Abuan, et al. (1996) included the same patient population as an earlier report. Therefore, the report by Hilton, Thompson, Williams, et al. (1994) is used in the evidence synthesis.

The study by Varetto, Cantalupi, Altieri, et al. (1993) was mentioned in the original Working Group report, but it was also noted that only patients admitted to the CCU were studied. This study was not included in our analysis.

Table 41. Rest sestamibi to diagnose AMI: diagnostic performance studies
Study, yearStudy sizePopulation categoryPrevalence of disease (%)Test performanceStudy quality
Sensitivity (%)Specificity (%)
Hilton 1994102IIIACI AMI13.7 11.893.3 10079.3 77.8B
Stewart 199668IIIACI AMI8.8 1.5100 10053.2 49.3B
Kontos 1997b1532IIIACI AMI17.1 5.379.8 92.980.9 71.2A
Kontos 19981185IIIACI AMI- 3.2- 100- 84.4A
Overall702 1IIIAC9-1781 (74-87) 273 (56-85) 2B
Odds ratio 30 (10-92) 2
AMI2-1292 (78-98) 267 (52-79) 2
Odds ratio 26 (6-113) 2
1

There are some overlaps in the patient populations in these three reports. Communication with the authors of these reports revealed that 53 patients in the Kontos (1997b) study were included in the Kontos (1998) study.

2

Results from meta-analysis using random effects calculations.

With these clarifications, four studies with a total of 765 evaluable patients addressed the diagnostic accuracy of Tc-99-sestamibi scan in ED patients with chest pain (Table 41). These studies were published between 1993 and 1997 and, with one exception (Stewart, Dickinson, Weissman, et al. 1996), used only rest scan. A subgroup of patients in the Stewart, Dickinson, Weissman, et al. (1996) study (36/68 patients) also underwent stress scan, if the rest scan was unrevealing. Demographic and reference standard peculiarities are shown in the evidence tables.

Characteristically, all studies addressed populations of patients where there was a presumed low or moderate at the most risk of AMI and ACI and the ECG was normal or nondiagnostic. Also, patients with a history of AMI were excluded in three of the four studies to avoid difficulty in interpreting segmental wall motion abnormalities. The Kontos, Jesse, Schmidt, et al. (1997b) report included patients regardless of a prior history of MI, but separate data are also provided for the subgroup of patients without such a history.

The overall rate of confirmed AMI in these studies was only 7.1 percent (54/765), and the rate of more broadly defined coronary syndromes (defined with somehow different definitions across studies -- see Evidence Table 15) was 21.3 percent (163/765). In this regard, these studies generally targeted highly selected populations: In one study where data were available, only 64 of 274 consecutive patients with chest pain qualified for inclusion (Varetto, Cantalupi, Altieri, et al., 1993); in another study (Tatum, Jesse, Kontos, et al., 1997, a subset of Kontos, Jesse, Schmidt, et al., 1997b), the qualifying rate was 442/1,187; in Stewart, Dickinson, Weissman, et al. (1996), the study population represented only 7 percent of all patients evaluated for chest pain syndromes not felt to be AMI; Hilton, Thompson, Williams, et al. (1994) did not provide information on the respective percentage. Unsuccessful imaging was uncommon in the two studies that provided these data (less than 2 percent for both combined).

The overall test performance suggests excellent sensitivity, but only modest specificity for AMI. The accuracy results for more broadly defined coronary disease syndromes should be interpreted with caution, since the definitions of coronary disease were heterogeneous.

Studies of clinical impact of the test's actual use

No studies are reported.

Data from other clinical studies

No data are reported

ACI Time-Insensitive Predictive Instrument (ACI-TIPI)

Evidence Tables 16a and 16b present details about the studies considered for this technology.

Data From Prospective Clinical Trials in the ED Setting
Studies of test sensitivity and specificity

Table 42. ACI-TIPI to diagnose ACI: Diagnostic performance studies
Study, yearStudy sizePopulation categoryPrevalence of disease (%)Overall physician performanceStudy quality
Sensitivity (%)Specificity (%)
Pozen 1980856 Test 401 Control 455IACI Test Control 16.5 17.486.491.6A
Pozen 19842,320IACI Test Control 32 2994.578.1A
Davison 1990232IIACI AMI29.7 20.7No data 1 -No data 1 -A
Selker 19912,320IACI AMI34.2 17.6Not applicableA
Overall5,496IACI AM16.5-34.2 17.6-20.786-95 2 -78-92 2 -A
1

See study for SROC curves.

2

Range of values reported.

Table 43. ACI predictive instrument/ACI-TIPI to diagnose ACI: clinical impact studies
Study, yearStudy sizePopulation categoryPrevalence of disease (%)30-day mortality(%)Study quality
Pozen 1980856IACINo dataA
Test Control401 455Test Control16.5 17.4
Pozen 19842,320IACINo dataA
Test Control32 29
Corey 198756IACI - No dataNo dataC
Sarasin 1994529I 1ACI AMI59.0 31.9Test Control8 9B
Test Control294 235
Selker 199810,689IACI 2 Test Control 24 232.6 2.4A
Test Control4,738 5,951AMI Test Control 8 8
Overall14,450IACI AMI(17-59.0) (8-31.9)2.4-9A
1

Study conducted in Switzerland, where patients may have been referred by primary care physician.

2

Includes angina, unstable angina pectoris, and myocardial infarction.

Four studies met the inclusion criteria for the diagnostic performance of the original ACI predictive instrument and the ACI-TIPI (Table 42).

Pozen, D'Agostino, Mitchell, et al. (1980) tested the diagnostic performance of the original ACI predictive instrument in a single, urban, teaching hospital. The 10-month study consisted of alternating periods of experimental versus control months. During the experimental months, the instrument's probability of diagnosing ACI was made available to the ED physician who had discretion whether to utilize the additional information for patient diagnosis. Sensitivity dropped from 89.9 percent during the control period to 86.4 percent during the experimental period, whereas the specificity improved from 80.1 percent to 91.6 percent, respectively. The followup rate was 92 percent, and variables revealing differences between the two groups were adjusted for. The monthly rotation schedule of the ED clinicians effectively controlled for bias or eliminated the maturation effect. The generalizability of the results from this single, urban setting is uncertain.

Pozen, D'Agostino, Selker, et al. (1984) tested the diagnostic performance of the ACI predictive instrument with slight modifications in the number of variables used compared with the number in the 1980 study by Pozen and coworkers. Six hospitals were involved, consisting of two hospitals per three different teaching-status levels with two study designs employed: (1) an alternating 11-month schedule of an experimental versus a control month, and (2) a time-series consisting of 6 experimental months followed by 5 control months. The study was basically conducted in the same manner as the previous study in terms of diagnostic and inclusion criteria, retrospective review and blinding, and the ED physician autonomy to incorporate the instrument's probability in the diagnosis. The difference in sensitivity from 95.3 percent during the control period to 94.5 percent during the experimental period was nonsignificant, whereas the specificity improved from 73.2 percent to 78.1 percent (p=0.002), respectively. That the incidence of disease was greater in the second paper than in the first paper could be a reflection of a wider and larger general population, since the first paper was set in a single, urban setting with a relatively large minority representation of 45 percent to 51 percent. Followup rate was 89 percent. Reported prevalence and sensitivity/specificity data could not be verified because of the lack of reporting of actual numbers.

Davison, Suchman, and Goldstein (1990) tested three decision aids in a small study of 235 patients. Specific variables to each of the decision aids were incorporated into a data collection form filled out by the ED residents who were also blinded to the prediction of the aids and to the purpose of the study. It is not known whether the final diagnosis was made blinded to the predictions of the decision aids. There were minor inconsistencies between the number of enrolled patients and patients analyzed. SROC was reported using nine cut points to distinguish normal from abnormal rather than a continuous curve as intended by the design of the instrument. For the evaluation patients for AMI, none of the protocols could reduce the false-positive rate without reduction of the true-positive rate.

Selker, Griffith, and D'Agostino (1991) tested the ACI Time-Insensitive Predictive Instrument at the same time the 1984 study by Pozen and colleagues was being conducted, utilizing the same data on the same population at the same six hospitals. The diagnostic performance of the probability scale and the risk stratification groupings was tested. The TIPI SROC was compared to the original predictive instrument's SROC, 0.88 to 0.89, respectively. The mean TIPI probability was 59 percent for patients with ischemia and 21 percent for patients without ischemia (p=0.0001), regardless of the triage decision. In the test performance of risk stratification, of the four risk probability groups, 1.6 percent had ACI (0.7 percent AMI) in the low-risk group, 12.0 percent (4.4 percent AMI) had ACI in the lower moderate risk group, 36.7 percent (12.3 percent AMI) had ACI in the higher moderate risk group, and 81.6 percent had ACI (53.3 AMI) in the high-risk group. Physician reviewers assigning diagnosis were blinded to the study's purpose and instrument components.

Studies of clinical impact of the test's actual use

Five studies met the inclusion criteria for studies evaluating the clinical impact of the original ACI predictive instrument and the ACI Time-Insensitive Predictive Instrument. The design of two studies by Pozen and coworkers (1980, 1984) are described in the above section under Studies of Test Sensitivity and Specificity.

Pozen, D'Agostino, Mitchell, et al. (1980) tested the clinical impact of the original ACI predictive instrument. CCU admissions dropped during the experimental months compared with the control months (26 percent to 14 percent), respectively; inappropriate discharges remained unchanged at 3 percent. Overall diagnostic accuracy improved from 81.8 percent to 90.7 percent.

Pozen, D'Agostino, Selker, et al. (1984) tested the clinical impact of the ACI predictive instrument. As expected from the improved specificity in the diagnostic performance, the number of CCU admissions decreased for nonischemic patients from 24 percent during the control period to 17 percent during the experimental period, whereas the discharge rates increased from 44 percent to 51 percent. There was no change for ischemic patients in terms of triage outcomes between the two periods.

Corey and Merenstein (1987) conducted a randomized study in a private, community hospital using the predictive instrument designed by Pozen and coworkers (1980, 1984). The emergency physicians and nurses incorporated data onto a worksheet for both control and experimental patients. The risk for ACI was calculated by the physician for the experimental patient only. Differences in false-positive rates between the two groups were evaluated, and there was a reported decrease from 71 percent to 0.0 percent, but this could not be verified with the data given. There were no data on patient outcomes.

Sarasin, Reymond, Griffith, et al. (1994) evaluated the clinical impact of ACI-TIPI on speed of triage from patient presentation to ED discharge or admission and on physicians with different levels of clinical experience. With novice physicians, ED length of stay for ischemic patients decreased from 3.8 hours during the control periods to 3.4 hours during the experimental periods, whereas nonischemic patients' ED length of stay increased from 3.7 hours during the control periods to 4.5 hours. With experienced clinicians, the exact opposite was true; ED length of stay for ischemic patients increased from 4.2 hours during the control periods to 4.4 hours during the experimental periods and decreased for the nonischemic patients, with 4.3 to 3.6 hours, respectively. Including 30-day death rates, none of the results was statistically significant. Twenty-one percent of the enrolled patients were dropped from the final analysis. Other limitations, as noted in the paper, include the relative small sample size along with the use of a single hospital. In addition, the higher rate of disease may be the result of the difference in health care practice in Switzerland compared with that in the United States. In Switzerland, patients have access to their primary care physicians on a "semi-emergency basis"; the physicians are then able to screen their patients prior to ED referral.

Selker, Beshansky, Griffith, et al. (1998) evaluated the clinical impact of ACI-TIPI with a multisite study involving 10 hospitals, including private, public, community, and tertiary hospitals. As with the previous studies, the study design was case control with alternating months of experimental versus control months for a total of 7 months. The inclusion criteria were similar to those in the previous studies with the added inclusion of cocaine users 18 years or older. During the intervention months, ACI-TIPI probability was automatically printed onto an ECG record to be utilized as a supplemental diagnostic aid to be used at the physician's discretion. For nonischemic patients at sites with high CCU capacity relative to low-capacity telemetry units, CCU admissions decreased 15 percent to 12 percent, a change of 16 percent (95 percent CI=−30 percent to 0 percent). At the same time, ED discharges increased 49 percent to 52 percent, a 6 percent change (CI=0 percent to 14 percent); overall p=0.09. Across all sites, unsupervised residents reduced CCU admissions 14 percent to 10 percent, a −32 percent change (CI=−55 percent to 3 percent), and telemetry unit admissions were also reduced from 39 percent to 31 percent, a change of −20 percent (CI=−34 percent to −2 percent). ED discharge increased from 45 percent to 56 percent, a 25 percent change (CI=8 percent to 45 percent); overall p=0.008. The use of ACI-TIPI by supervised residents and attending physicians did not affect care for patients without cardiac ischemia. For patients with ischemia, the 96 percent rate of admission remained unchanged at sites with high-capacity CCUs or at telemetry units for all physician types. Low CCU capacity influenced physicians to selectively reduce CCU admissions for patients without ischemia. Mortality was nonsignificant between the intervention and control groups at 30 days for all sites combined. Followup data to confirm "true" diagnosis was 99 percent. Results could not be verified with limited data reported.

Data From Other Clinical Studies

No data are reported.

Goldman Chest Pain Protocol

Evidence Tables 17a and 17b present details about the studies considered for this technology.

Data From Prospective Clinical Trials in the ED Setting
Studies of test sensitivity and specificity

Table 44. Goldman chest pain protocol to diagnose AMI: diagnostic performance studies
Study, yearStudy sizePopulation categoryPrevalence of disease (%)Test performanceStudy quality
Sensitivity (%)Specificity (%)
Goldman 1982357IIACI AMI- 15.4- 90.9- 69.9A
Goldman 19884,770IIACI AMI26.6 12.1- 88.0- 74.2A
Davison 1990232IIACI AMI29.7 20.7ND 1 -ND 1 -A
Overall5,359IIACI AMI26.6-29.7 12.1-20.7- (88.0-90.9)- (69.9-4.2)A
1

See study for SROC curves.

Three studies met the inclusion criteria for the diagnostic performance of the Goldman chest pain protocol (Table 44). Goldman, Weinberg, Weisberg, et al. (1982) tested a computer-derived model using recursive partitioning analysis to predict myocardial infarction in patients with chest pain. Two different patient populations and settings were included. The ED set consisted of 357 patients with varying chest pain unexplained by trauma or radiographs. It was not clear what the inclusion criteria were for the admitted set of 111 patients. The admitted set had a higher prevalence of illness, 27.0 percent to the ED rate of 15.4 percent. Although the diagnostic criteria for unstable angina were well defined, there were no further diagnostic results for this condition reported in the study.

Goldman, Cook, Brand, et al. (1988) tested a modified version of the original Goldman protocol with an added provision to accommodate patients with and without history of cardiac ischemia. The study was evaluated at two university hospitals and four community hospitals. One hundred twenty-eight patients who enrolled were not included in analysis because followup was insufficient for final diagnosis, but in addition, 595 patients were not included because of incomplete forms or insufficient clinical data to complete assignment into subgrouping.

In a small study of 235 patients, Davison, Suchman, and Goldstein (1990) tested three decision aids, as described in Goldman, Weinberg, Weisberg, et al. (1982); Pozen, D'Agostino, Selker, et al. (1984); and Brush, Brand, Acampora, et al. (1985). Specific variables to each of the decision aids were incorporated into a data collection form filled out by the ED residents who were also blinded to the prediction of the aids and to the purpose of the study. It is not known whether the final diagnosis was made blinded to the predictions of the decision aids. There were minor inconsistencies in the number of enrolled patients and patients analyzed. SROC was reported. For the evaluation patients for AMI, none of the protocols could reduce the false-positive rate without reduction of the true-positive rate.

Studies of clinical impact of the test's actual use

Table 45. Goldman chest pain protocol to diagnose AMI: Clinical impact study
Study, yearStudy sizePopulation categoryPrevalence of AMI (%)MortalityStudy quality
Lee 19951,921IIAMI6.6No dataA
Intervention924Intervention6.9
Control997Control6.2
One study evaluated the clinical impact of the Goldman protocol (Table 45). Lee, Pearson, Johnson, et al. (1995) evaluated the clinical impact with "a low-intensity, nonintrusive intervention" study at a teaching hospital ED. The time-series design consisted of 6 cycles of 14 weeks each: 5-week intervention followed by 5-week control period separated by 2-week washout periods. The risk estimates and triage recommendations were made available with no human contact or influence by flowcharts and stickers to physicians. There were no differences between the two groups for outcomes of hospitalization rates, length of stay, and estimated costs. Of the intervention group, 50.5 percent (467/924) were hospitalized with 10 percent (92/924) admitted to the CCU. Correspondingly, the control group had 52.2 percent (520/997) hospitalized with 9.5 percent admitted. A low-risk group, defined as up to 7 percent risk for having AMI, included 1,160 patients. This low-risk population had a 1.6-percent CCU admission rate. Mean length of stay for the intervention group was 4.9 total, 3.5 in the CCU; and for the control group, 4.9 total, 4.1 in the CCU. Mean estimated costs for the intervention group were $7,822 vs. $7,955 for the control group. The protocol was amended after the first cycle to include triage recommendations at the request of physicians for guidance in interpreting quantitative risk predictions. Because outcome data were not reported by cycle, it is difficult to detect whether there may have been an effect because of the change in the protocol after commencement. Noted in the study were the differences in decisionmaking at teaching vs. community hospitals and how the intervention intended for a single decisionmaker, as in a community hospital, may not have an effect in a different setting where several physicians are involved in the decisionmaking. Although the cycles did not correlate with the rotation schedules of the physicians, there was no information as to structure or number of the rotations.

Data From Other Clinical Studies

No data are reported.

Clinical Algorithms

Evidence Tables 18a and 18b present details about the studies considered for this technology.

Data From Prospective Clinical Trials in the ED Setting
Studies of test sensitivity and specificity

No studies are reported.

Studies of clinical impact of the test's actual use

Table 46. Algorithm/protocol: Clinical impact studies
Study, yearStudy sizePopulation categoryPrevalence of disease (%)MortalityStudy quality
Gomez 1996100III (CPU)ACI AMI6.0 2.00/100 at 30 daysA
Mikhail 1997502III (CPU)ACI AMI8.8 2.02/374 at 150 daysB
Overall1,502III (CPU)ACI AMI6.0-8.8 2.0B

CPU - chest pain unit

Two studies evaluated the clinical impact of algorithms (Table 46). Gomez, Anderson, Karagounis, et al. (1996) evaluated a rapid rule-out protocol based in the ED chest pain evaluation unit. The patient population included was at low risk for acute myocardial infarction as defined by the Goldman method. One hundred patients were randomized into two groups, the test group to undergo the rapid rule-out protocol and the control group who were assigned to standard care of hospital admission and management as required by their physicians. The length of stay and hospital charges were the primary outcomes of study. Some additional endpoints included missed diagnosis and frequency of final diagnosis of ischemia overall and by the study group. By intention-to-treat analysis of 100 patients, the median length of stay and hospital charges were lower for the rule-out protocol group compared with the control group: median 12.1 versus 22.3 hours, and median $895 vs. $1,488, p=0.0001. The analysis of 92 patients with ischemia ruled out was similar in shorter length of stay and charges: median 11.9 vs. 22.8 hours, and median $893 vs. $1,349, p=0.0001, respectively. There were six patients diagnosed with ischemia, one patient in each group for acute myocardial infarction and four patients for unstable angina in the control group. There were no missed diagnoses at 30-day followup. Because of the study population's low event rate, there was inadequate power to show a difference between the two groups. Though blinding was not appropriate, it may be a factor in influencing the control group physicians' choices concerning triage, evaluations, and length of stay.

Mikhail, Smith, Gray, et al. (1997) evaluated a 23-hour chest pain protocol in the chest pain center in a community hospital. All patients underwent a mandatory stress test unless they had end-stage heart disease, in which case they needed cardiac enzyme tests to rule out AMI. Rule-out tests included CK/CK-MB, myoglobin, and continuous ECG. Of the 502 patients transferred from the emergency department, 420 had stress testing consisting of 247 standard-graded stress tests, 161 stress echocardiography, 9 dobutamine echocardiography, and 3 thallium stress tests. A total of 67 patients (13.3 percent) from the 502 enrolled were admitted from the chest pain center. Final diagnosis included 44 with ischemic heart disease, 10 of whom had acute myocardial infarction. Of the 32 patients admitted with positive stress results, 24 had a final diagnosis of ischemia. There was no mortality or AMI for patients discharged with negative findings from the chest pain center at 14 days. At 150 days, 354 (86.0 percent) of the 435 discharged were followed. Two deaths and one AMI were reported. Of two patients with known CAD and initially lost to followup, one had cardiac arrest 2 weeks after chest pain unit (CPU) discharge, and one had AMI 6 days after discharge. One patient with end-stage heart disease and admitted for rule-out protocol only had cardiac arrest. This population was defined as at low to moderate risk for ischemic heart disease. There were discrepancies with numbers reported.

Data From Other Clinical Studies

Table 47. Algorithm/protocol: Diagnostic performance studies
Study, yearStudy sizePopulation category (setting)Prevalence of disease (%)Test performanceStudy quality
Sensitivity (%)Specificity (%)
Lee 19912,684IV (CCU)ACI AMI58.2 30.0Not applicableA
957Low-risk patients:
ACI AMI34.6 7.0
Gibler 19951,010IVACI AMI4.3 1.2Serial ECG (n=1,010)A
21.299.4
Echocardiography (n=901)
47.499.0
Exercise Stress (n=791)
28.699.4
Serial CK-MB (n=1,010)
10098.3
Zalenski 1997a317IVACI9.590.050.5A
Overall4,011IVACI AMI4.3-58.2 1.2-30.0A
Three studies met the inclusion criteria for the diagnostic performance of the algorithms (Table 47). Lee, Juarez, Cook, et al. (1991) used the Goldman algorithm to identify low-risk patients presenting in the ED to validate a 12-hour protocol. Of the 957 admitted patients who qualified for the observational study, 771 did not have abnormal enzyme levels or recurrent ischemic chest pains and potentially could have been transferred from a monitored setting to a lower level of care for further testing. Four of 771 patients, or 0.5 percent had AMI. The rate of AMI for the low-risk population was 7 percent and 34.6 percent overall for ACI. As noted, the rate of unstable angina was consistent between the high and low risk groups at approximately 28 percent. All patients were admitted, and the 12-hour protocol to rule out myocardial infarction was validated in a hospital setting, not in the ED.

The Gibler, Runyon, Levy, et al. (1995) article is a retrospective review of 1,010 consecutive patients at an urban, tertiary hospital evaluating a 9-hour heart ED program. The protocol included a series of tests including serial CK-MB at presentation, and at 3, 6, and 9 hours and serial ECG at 20-second intervals for 9 hours. At the end of the 9-hour protocol, if there was no indication of evolving ischemia or necrosis, a history and physical examination was conducted by a cardiologist. A two-dimensional echocardiography at rest was conducted by the cardiologist in the ED. This was followed by a graded exercise (maximal Bruce protocol) test. If the exercise test was negative, the patient was discharged and instructed to return in 24 to 48 hours for followup tests. One hundred fifty-three patients were admitted, of whom 52 were diagnosed with cardiac conditions. Twelve of the admitted patients had acute myocardial infarction, and 31 had angina or unstable angina. From the data reported, it would appear that all 1,010 patients completed the serial CK-MB for a sensitivity of 100 percent, 12 of 29 for AMI, specificity 98.3 percent, and serial ECG for a sensitivity of 21.2 percent, specificity 99.4 percent. In the ED, 901 patients underwent echocardiography for a sensitivity of 47.4 percent (9 of 19 true positive results) and a specificity of 99.0 percent. Exercise stress test was conducted for 791 for a sensitivity of 28.6 percent (4 of 14 true positive results) and a specificity of 99.4 percent. The population was selected for being at "low- to moderate-risk" for a rate of ischemia of 4.3 percent.

In the Zalenski, McCarren, Roberts, et al. (1997) study, a 12-hour protocol to diagnose ACI was executed in the ED setting for 359 patients who were at low risk for AMI as defined by the Goldman protocol, but who were also in need of hospital admission. The tests were conducted as follows: serial CK-MB, serial ECG, and clinical assessments concurrently for 12 hours. Exercise stress ECG was conducted if the previous tests and examinations were negative. All patients were admitted to establish reference diagnoses. The inclusion/exclusion criteria were modified after the first year of the 29-month enrollment period by lowering the age from 30 years to 20 years and eliminating the restriction on duration of pain from less than 30 seconds to 72 hours. It should be noted that 43.5 percent of the 317 evaluable patients had chest pain lasting fewer than 24 hours, 16.6 percent had pain from 24 to 48 hours, and 39.9 percent had pain for more than 48 hours. Thirty-one patients had incomplete enzyme determinations, and three withdrew from the study. Another eight patients had noncardiac diagnoses.

Computer-Based Decision Aids

Evidence Tables 19a and 19b present details about the studies considered for this technology.

Data From Prospective Clinical Trials in the ED Setting
Studies of test sensitivity and specificity

Table 48. Computer-based decision aids to diagnose AMI: Diagnostic performance studies
Study, yearStudy sizePopulation categoryPrevalence of AMI (%)Test performanceStudy quality
Sensitivity (%)Specificity (%)
Tierney 1985540II11.58186C
Baxt 1991331II10.697.296.3A
Dilger 1992122I/II32.88986A
Jonsbu 19931,252II41.798.358.9A
Baxt 19961,070II7.096.096.0A
Kennedy 1997 Center 1: Center 2: 200 91 II (admitted) 33.8 1 91.2 90.2 A
III (admitted)23.152.480.0A
Overall3,606I/II/III7-4252-9859-96A
1

200 patients validation data set plus 90 patients used in training of artificial neural network at center 1.

There were six diagnostic performance studies that met the inclusion criteria for computer-based decision aids in the ED setting (Table 48).

Tierney, Roth, Psaty, et al. (1985) validated the diagnostic performance of a predictive model on 540 ED patients from an urban hospital. The logistic regression model uses four dichotomous variables obtained by a questionnaire completed by the physician. CK/CK-MB, LDH isoenzyme-1, or ECG results were used for the diagnosis of AMI. The sensitivity and specificity could not be verified with the data reported. Of the 655 enrolled in the study, 115 were lost to followup and omitted from analyses. Of the 540 evaluable patients, 284 were used to derive the model.

Baxt (1991) prospectively validated the diagnostic performance of an artificial neural network on physician-completed questionnaires of 331 patients presenting with anterior chest pain. The information on the questionnaire was based on patient history, symptoms, and ECG results. The final diagnosis of AMI was based on CK/CK-MB, LDH isoenzyme-1, or ECG results or sestamibi imaging results. Twenty-four patients of 355 enrolled could not be followed and were therefore omitted from the study. The followup data inputted were also blinded to the initial data entry.

Dilger, Pietsch-Breitfeld, Stein, et al. (1992) prospectively validated a predictive model based on six variables obtained by physician-completed questionnaires. A population of 122 was evaluated for AMI with the final diagnosis made by one of the investigators based on "anamnestic, electrocardiographic, and enzymatic infarction criteria without knowledge of predictive result by the model." Included as one of the six variables in the predictive model is CK level at presentation, which is also used as part of the final diagnosis. The authors note that, for final diagnoses, the time course of 22 hours of total CK and CK-MB levels were utilized. The sensitivity and specificity could not be verified with the data reported.

Jonsbu, Aase, Rollag, et al. (1993) applied a computer-derived decision support system, which was based on case history data, to identify, from ED patients presenting with chest pain, those in need of CCU admissions and patients who may have AMI. Final diagnoses from ECG results or isoenzymes levels were made blinded to the test under investigation. The sensitivity and specificity could not be verified with the data reported.

Baxt and Skora (1996) tested the diagnostic performance of the artificial neural network as reported in Baxt (1991) on a larger patient population of 1,070 at a major, urban teaching hospital. This was conducted in a similar fashion as the preliminary study, with the individuals involved in the reference and the test diagnoses blinded to the other's results. The reference criteria were also based on CK and CK-MB levels or ECG results. The network does not base its diagnosis of AMI on ECG changes, as 33 of the 75 patients had no such changes and 72 AMI patients were correctly identified by the network. Thirty patients who were diagnosed negative by the network were lost to followup.

Kennedy, Harrison, Burton, et al. (1997) used two centers, the first center to train and prospectively validate an artificial neural network and the second center to test its application and its portability. As noted by the authors, the two patient populations were very different. The patient inclusion/exclusion criteria for the training center were minimal, "non-traumatic chest pain." But this population was also from the department of medicine admissions and may have been at high risk for AMI. The authors considered the test group at the second center to be a difficult group to diagnose because of the relatively strict inclusion/exclusion criteria of chest pain under 4 hours upon presentation and nondiagnostic ECGs. It also appears that the second group was admitted after data were collected in the ED. The artificial neural network is a computer program consisting of a set of processing units with data entered directly into the computer with the aid of data entry screens. The final diagnosis for AMI was based on two of three criteria: clinical history, ECG, or biochemical results. The reviewers were blinded to the results of the network. The differences in sensitivity and specificity between the two groups may be indicative of differences in population risk for AMI. The sensitivity and specificity also could not be verified with the data reported.

Studies of clinical impact of the test's actual use

Table 49. Computer-based decision aids in prehospital setting: Clinical impact study
Study, yearStudy sizePopulation categoryPrevalence of disease (%)Mortality (%)Study quality
Grijseels 1996977III 1 (Prehospital)ACI AM47.8 29.92.6 at 30 daysA
1

Prehospital subjects preselected by their general practitioner.

Grijseels, Deckers, Hoes, et al. (1996) implemented a decision rule for 1,020 prehospital patients enrolled by their general practitioners in the Netherlands (Table 49). The decision rule recommended 750 for hospitalization, of whom 427 had ACI. Of the 227 not recommended for hospitalization, general practitioners admitted 128, of whom 32 had ACI (25 percent). Ultimately, 8 patients with ACI were missed, and there were 25 deaths total. The general practitioners decided not to admit 19 patients, 3 of whom developed AMI.

Data From Other Clinical Studies

Table 50. Computer-based decision aids in prehospital setting: Diagnostic performance study
Study, yearStudy sizePopulation categoryPrevalence of disease (%)AMI test performanceStudy quality
Sensitivity (%)Specificity(%)
Grijseels 1996977III 1 (Prehospital)ACI AMI47.8 29.991.4 -36.7 -B
1

Prehospital subjects preselected by their general practitioner.

One study provided diagnostic performance in the prehospital setting (Table 50). Grijseels, Deckers, Hoes, et al. (1996) implemented a decision rule to improve prehospital triage by general practitioners in The Netherlands. The decision rule is based on clinical indicators and computerized ECG results. The final discharge diagnosis for AMI was based on standard history, ECG, and enzyme criteria. Unstable angina was defined as new onset angina or history of angina with increasing frequency or severity of symptoms. Of the 1,020 consecutive patients with symptoms, 43 were not included, nor were data reported on them.

Chapter 4. Conclusions

The results presented in Chapter 3 are based on the screening of 6,667 MEDLINE titles and 407 full articles, 105 of which are analyzed in this report. About one-third of the 6,667 articles were published since 1994, and 45 of the 105 articles we used were published between 1994 and 1998. These numbers indicate a large increase in research activities on this topic during the past 5 years compared with the previous 27 years.

A diverse array of technologies with varying degrees of diagnostic accuracy and costs is available for use in general or selected patient populations to diagnose ACI in the ED. These technologies include ECG-based tests, blood tests of biomarkers, imaging studies, stress testing, and computer-based decision aids; the technologies detect various facets and manifestation of ACI. The direct costs of these technologies range from being almost free (ACI-TIPI and the Goldman chest pain protocol) to $1,000 or more (sestamibi perfusion imaging). The evidence we reviewed indicates that none of the current technologies distinguish all patients with ACI from those without. Although the optimal diagnostic approach may be some combination of these tests that has yet to be evaluated, several technologies come close to offering the desired tradeoff between costs and effectiveness.

Table 51. Summary of test performance studies of diagnostic technologies for acute cardiac ischemia in emergency departments
TechnologyCondition studiedNumber of studies (subjects)Population category of studies 1Studies' prevalence range (%)Sensitivity 2 (95% CI) (%)Specificity 2 (95% CI) (%)Diagnostic odds ratio 2 (95% CI)Overall quality of evidence
Prehospital 12-lead ECGACI AMI5 (4,311) 10 (4,481)I/II I/II46-92 14-5176 (54-89) 68 (59-76)88 (67-96) 97 (89-92)23 (6.3-85) 104 (48-224)B B
Continuous/serial ECGACI AMI2 (1,271) 1 (261)III/IV III/IV4-40 1121-25 3 39 392-99 3 88 33.8-45 3 4.9 3C B
Nonstandard lead ECGACI AMI1 (52) 4 (897)IV IV48 22-6596 3 59-83 341 3 76-93 317 3 10-19 3B B
Exercise stress ECGACI2 (312)III6-1070-100 382-93 311 -- 4B
CK (presentation)AMI10 (2,885)I/II/III7-4136 (29-44)88 (80-93)4.0 (2.6-6.2)C
CK (serial)AMI2 (786)I26-4369-99 368-84 312-222 3C
CK-MB (presentation)ACI AMI1 (1,042) 10 (2,504)III I/II/III20 6-4223 3 44 (35-53)96 3 96 (94-97)7.2 3 23 (17-32)C B
CK-MB (serial)ACI AMI1 (1,042) 7 (3,381)III I/II/III20 1-5331 3 87 (67-95)95 3 96 (94-97)8.5 3 171 (58-505)C B
Myoglobin (presentation)AMI10 (1,395)I/II/III12-41349 (41-57)93 (88-96)13 (7.9-21)B
Myoglobin (serial)AMI5 (831)I/II/III23-3790 (78-96)90 (78-96)140 (66-300)C
Troponin I (presentation)AMI2 (874)II/III6-3923-66 389-95 35.7-14 3A
Troponin I (serial)AMI1 (773)III6100 383 3- 4A
Troponin T (presentation)AMI5 (1,171)II/III6-3944 (32-56)92 (88-95)10 (5.9-18)B
Troponin T (serial)AMI2 (1,440)II/III5-680-93 365-90 335-120 3A/C
P-selectinACI AMI1 (263) same studyII33 8.435 3 45 379 3 76 32.0 3 2.6 3B
Rest echo- cardiographyACI 5 AMI2 (228) 3 (397)III I/III3-30 3-3070 (43-88) 93 (81-91)87(72-94) 66 (43-83)20 (9-48) 20 (7-62)C B
Stress echo-cardiographyAMI1 (139)III490 389 368 3C
Sestamibi (rest)ACI AMI3 (702) same studiesIII9-17 2-1281 (74-87) 92 (78-98)73 (56-85) 67 (52-79)18 (11-29) 26 (6-113)B
ACI-TIPIACI4 (5,496)I17-3486-953,578-923,561-693,5A
Goldman chest pain protocolAMI3 (5,359)I(ACI 27-30) 12-2188-91 370-74 320-23 3A
Algorithm/protocolsNo data from prospective studies.
Computer-based decision aidsAMI6 (3,606)I/II/III7-4252-98 358-96 34.4-904A
1

Population categories: I: all patients with symptoms suggestive of ACI; II: chest pain; III: chest pain with nondiagnostic ECG; IV: selected subpopulation.

2

Results from meta-analysis of several studies using random effects calculations unless otherwise indicated.

3

Point estimate from single study or a range of reported values; meta-analysis not performed.

4

Upper range cannot be estimated because of a study with 100 percent sensitivity.

5

ACI-TIPI is not intended to provide sensitivity and specificity. The results reported incorporate physicians' triage decisions and are not reflective of diagnostic test performance only.

Table 52. Summary of clinical impact studies of diagnostic technologies for acute cardiac ischemia in emergency departments
TechnologiesCondition studiedNumber of studies (subjects)Population category 1Prevalence (%)Clinical outcomes studiedClinical impactQuality of evidence
Prehospital 12-lead ECGACI AMI~10 studies 2 ~8,000 pts 2I/II46-100 15-51Time to thrombolysis, ejection fraction, mortality++A
Continuous/serial ECGNo study--Not known-
Nonstandard lead ECGNo study--Not known-
Exercise stress ECGACI3 (272)III0-6Feasibility and safetyNot knownC
CK (single/serial)No study--Not known-
CK-MB (single)No study--Not known-
CK-MB (serial)ACI1 (1,042)III20Additional admissions or discharges of ACI and non-ACI patients++C
Myoglobin (single/serial)No study--Not known-
Troponin I or TNo study--Not known-
Other biomarkersNo study--Not known-
Rest echo-cardiographyNo study--Not known-
Stress echo-cardiographyNo study--Not known-
SestamibiNo study--Not known-
ACI-TIPIACI5 (14,450)I17-59CCU admission rate, inappropriate discharge+++A
Goldman chest pain protocolAMI1 (1,921)III6.6Hospitalization rate, length of stay, estimated costs+A
Algorithm/protocolsACI2 (602)III6-9Length of stay, hospital charges, 30-day and 150-day mortalityNot knownB
Computer-based decision aidsACI1 (977)III48 (AMI 30)30-day mortalityNot knownA
1

Population categories: I: all patients with symptoms suggestive of ACI; II: chest pain; III: chest pain with nondiagnostic ECG; IV: selected subpopulation.

2

Different outcomes analyzed involved different number of studies and patients.

Table 51 summarizes the results of all the diagnostic performance studies evaluated, and Table 52 summarizes the clinical impact studies.

General Observations on the Studies Analyzed

In addition to the conclusions described in this chapter, we believe that the data support the following observations:

  • Research on the diagnosis of ACI in the ED is characterized by great heterogeneity in the studies as a result of the large number of variables that can be studied. This heterogeneity has resulted in fragmented evidence that is not easily synthesized into a coherent whole.

  • There are a limited number of studies, both on technologies' diagnostic performance and in their clinical impact, on patients with ACI. Most studies evaluated only patients with AMI.

  • The methodological quality of the diagnostic performance studies on this topic varies widely but, in general, needs to be greatly improved. Most of the evidence for diagnostic performance is based on studies that were given a "B" or "C" methodological quality rating.

  • The number of new studies published in the past 5 years on the clinical impact of these technologies is disappointingly small.

  • Some technologies (e.g., echocardiography, sestamibi perfusion imaging, exercise ECG) remain underevaluated.

  • The prevalence of ACI varies widely among the studies, even for apparently similar patient populations defined by the inclusion criteria. This heterogeneity raises the question of the applicability of the study results to other settings.

  • Little work has been done on the value of sequential testing or on test combinations.

We also found several instances of multiple reports of the same study. Sometimes these reports were updates of preliminary reports and contained data on additional patients; however, this overlap made it difficult to distinguish new information from old, thus introducing bias of either overcounting or undercounting depending on the decision to include or exclude the overlapping studies. Sometimes these studies did provide useful, complementary information.

Observations on the Prevalence of Acute Cardiac Ischemia

To help interpret the results, we examined the prevalence of ACI and AMI in the four population categories. About one-half of the studies analyzed patients in population category II, and about one-third of the studies analyzed patients in category III. The prevalence of AMI across studies of patients in the same population categories and in similar settings varied widely, and there was little indication that similar inclusion and exclusion criteria resulted in similar ACI and AMI prevalence. The lack of association between the population categories defined by the inclusion criteria and the prevalence of ACI raises the question of whether the inclusion criteria used in current studies are sufficiently refined to allow their results to be applied in other settings.

Despite this lack of association, overall, studies that included all patients with chest pain (population category II) did how a higher prevalence of AMI than did studies that included all patients with any symptom suggestive of ACI (population category I) or studies that excluded patients with diagnostic ECGs (population category III). In addition, although differences in AMI prevalence in studies in different settings were not statistically significant, studies that analyzed only ED patients admitted to either the hospital or to the CCU did show a higher prevalence of AMI than those that included all ED patients and therefore may have truly represented different populations.

Conclusions About the Diagnostic Technologies

Prehospital 12-Lead Electrocardiography

The diagnostic accuracy of prehospital 12-lead ECG for AMI and ACI, as expected, was similar to that of the standard 12-lead ECG, which is the standard of care in the management of patients suspected of having ACI. The accumulated evidence is substantial in both the total sample size and quality, and the data were gathered from patient populations with few exclusion criteria.

The evidence shows that obtaining a prehospital 12-lead ECG does not prolong time in the field or delay transport to the ED. In addition, prehospital ECG-guided thrombolytic therapy can be administered 45 minutes to 1 hour earlier than hospital-based thrombolysis. Prehospital thrombolysis has a modest but significant impact on early mortality. Approximately 60 patients would need to be administered prehospital thrombolysis to save one additional life, in the short term, compared with hospital thrombolysis. Short-term, beneficial effects of thrombolysis on the left ventricular ejection fraction have not been reported in randomized trials. The long-term survival benefits of prehospital thrombolysis remain uncertain.

Continuous/Serial 12-Lead ECG

Two studies evaluated the test performance of continuous/serial 12-lead ECG in the ED, but there was no clinical impact study. The two studies were quite dissimilar. One by Gibler, Runyon, Levy, et al. (1995) included a large retrospective population of 1,010 participating in a 9-hour protocol. The "serial ECG" consisted of a 20-second interval between readings. The second study by Hedges, Young, Henkel, et al. (1992) included patients from a veterans' hospital in which two ECGs were taken 4 hours apart. The prevalences of ACI in these studies were very different (4 and 40 percent, respectively) given the low-risk populations. The sensitivity for ACI was low (21 and 25 percent, respectively), and the specificity was high (92 and 99 percent, respectively). With the limitations and the varied source of the data, a conclusion about the utility of this technology cannot be drawn.

Nonstandard Lead ECG

The data on the diagnostic performance of nonstandard lead ECG from the four studies reported varied too much to draw any conclusion. The studies used 15, 18, 22, and 24 leads and were conducted with selected patients for admission. The prevalence was reflective of this selective population: It ranged between 22 and 65 percent for AMI. There were no clinical impact studies on nonstandard ECGs.

Exercise Stress ECG

The data on the diagnostic performance of exercise stress testing to detect ACI in the ED were limited to only two studies, both published after the earlier NHAAP report. The overall data included a small sample size of a low-risk population. Although the diagnostic performance was encouraging, it would be premature to make conclusions regarding this technology until additional high-quality studies are conducted.

There were also limited data on the clinical impact of exercise stress testing for ACI, with one of the three studies published since the report by the Working Group. The first two studies, Tsakonis, Roth, Psaty, et al. (1991) and Kerns, Shaub, and Fontanarosa (1993), had no cardiac events and included very small sample sizes, 28 and 35, respectively. Because these studies reported on a total of only 272 subjects and were of low methodological quality, the clinical impact of this technology is unclear.

Biomarkers

Creatine Kinase (CK), Single and Serial Measurements

The amount of evidence on CK as a single test administered at presentation to patients in the ED is large. As shown in Table 51, the evidence suggests that the sensitivity of a single CK reading for AMI is low (36 percent), and the specificity is modest (88 percent). Limited evidence suggests that the sensitivity of the test depends on the duration of the patient's symptoms; sensitivity increases with longer symptom duration. Test performance across studies did not appear to vary by type of hospital, inclusion criteria, AMI prevalence, or test threshold.

Only two studies have evaluated serial CK testing. Both used broad inclusion criteria but enrolled populations in which the prevalence of AMI was moderate to high. Test sensitivity was high (95 to 99 percent) in serial tests performed over about 15 hours after presentation to the ED (or from the onset of symptoms), but was only modest (69 percent) in the one study that drew serial samples for 4 hours. Test specificity was modest in both studies (68 and 84 percent).

As a single test, CK is insensitive and only modestly specific for AMI. Serial testing appears to have a higher sensitivity, although the specificity remains modest. However, the evidence is insufficient to evaluate serial CK measurements over a short time. Because high serum CK levels represent infarcted myocardium, CK has not been evaluated for diagnosing ACI in the ED. There were no clinical impact studies for CK.

Creatine Kinase Subunit (CK-MB), Single and Serial Measurements

As is the case with CK, the total sample size and number of studies on a single CK-MB measurement at presentation to the ED are large. The evidence suggests that the sensitivity of single CK-MB for AMI is low (44 percent), although specificity is high (96 percent). Studies reported a broad range of sensitivity for diagnosing AMI. Again, as is the case for CK, limited evidence suggested that the sensitivity of CK-MB depends on the duration of the patient's symptoms; sensitivity increases with longer symptom duration. In general, studies reported a narrow range (92 percent to 99 percent) of test specificity. Test performance across studies did not appear to vary by type of hospital, inclusion criteria, AMI prevalence, or test threshold.

The total sample size and number of studies of serial tests for CK-MB in the ED setting are large. Overall, serial testing had a modest sensitivity (87 percent) and a high specificity (96 percent) for AMI. However, test sensitivity was strongly related to the timing of serial testing. All studies that performed serial testing for at least 4 hours after presentation to the ED (or until at least 8 hours after symptom onset) found test sensitivity to be greater than 90 percent. Conversely, all studies that performed serial testing to at most 3 hours found test sensitivity to be lower than 90 percent. The pooled sensitivity for serial testing to at least 4 hours was 96 percent; the pooled sensitivity for serial testing until 3 hours was only 81 percent. In general, test specificity was in a narrow range across studies and was above 90 percent.

CK-MB as a single test was only modestly sensitive and specific for AMI; however, serial testing performed over 4 to 9 hours was highly sensitive and highly specific. Because serum CK-MB levels represent infarcted myocardium, CK-MB has not been tested for diagnosing ACI in the ED. There were no clinical impact studies for CK-MB.

Troponin T and Troponin I

The evidence for the diagnostic performance of troponin T is substantial for diagnosing AMI but rather limited for diagnosing ACI. Data for troponin I are limited, but its performance was similar to that of troponin T. The sensitivity of presentation troponin T for diagnosing AMI in the ED was poor, but it improved substantially if serial measurements were obtained for up to 6 hours after ED presentation. Most likely, the sensitivity was better for patients who have had symptoms for longer periods of time. The specificity of troponin T for AMI was in the range of 90 percent.

Myoglobin

The diagnostic performance of myoglobin has been well studied for diagnosing AMI, but not for diagnosing ACI. The performance of myoglobin was similar to that of CK-MB, except that serial testing may have provided adequately high sensitivity earlier. The sensitivity of myoglobin for diagnosing AMI in the ED was poor (49 percent) when a single initial measurement was obtained, but sensitivity improved greatly if a second measurement was obtained 2 to 4 hours after the first one (>86 percent). However, the sensitivity for patients only recently symptomatic was poor, and a second measurement in 2 to 4 hours may still not have been sufficiently sensitive to be useful. Specificity was very good, but not excellent. Its deviation from excellence in the various reports probably depended on the extent to which other reasons for elevated myoglobin were excluded a priori. There is limited evidence to suggest that the failure of myoglobin level to increase over 1 to 2 hours was excellent at excluding AMI (specificity >94 percent).

Other Biomarkers

Studies on P-selectin and malondialdehyde-modified low-density lipoprotein are just beginning to appear. There was only one ED study of P-selectin that reported low sensitivity and low specificity for AMI.

Combination CK-MB and Myoglobin

The only combination of biomarkers to diagnose AMI that has been reported is CK-MB and myoglobin, and this in only three studies. In addition, it appears that the decision to analyze this combination was made post hoc, and thus the findings may have overestimated true test performance (as studies with poor test performance may have been less likely to report their finding). That said, combination CK-MB and myoglobin at ED presentation (where a positive test was defined as either CK-MB or myoglobin being elevated) had good test performance, with both a high sensitivity (83 percent), though poorer specificity (82 percent), as compared with presentation CK-MB or myoglobin alone. Serial combination CK-MB and myoglobin (where a positive test was defined as any of the four serum samples being elevated) performed similarly to the individual serial biomarkers, although sensitivity may be higher (100 percent).

Echocardiography

The total sample size and the number of the studies evaluating echocardiography for the diagnosis of ACI are small. Limited evidence suggests that resting echocardiography has high sensitivity (93 percent) although only modest specificity (66 percent) for AMI. The availability of previous echocardiograms for comparison may improve the specificity (Mohler, Ryan, Segar, et al., 1998). But even if this improved specificity is verified with additional studies, the need for previous echocardiography would limit its applicability in the general ED setting. In addition, the data pertain mostly to patients with normal or nondiagnostic ECGs. The data for stress dobutamine echocardiography are even more limited, but the one study suggests that it may be the next diagnostic step for patients with a negative resting echocardiogram, normal ECGs, and normal enzyme levels. There was no clinical impact study for this technology.

Technetium-99m Sestamibi Myocardial Perfusion Imaging

Data on the diagnostic accuracy of resting Tc-99m sestamibi imaging in the ED are limited, and there are still no data on its clinical impact. The test has been used in selected patient populations that generally have a low-to-moderate risk of ACI, no history of myocardial infarction, and a presenting ECG nondiagnostic for ACI. Thus, the generalizability of the current evidence is limited, and the test should be reserved for these circumscribed populations. In these patients, the test had excellent sensitivity for AMI, and very good, but not perfect, sensitivity for coronary disease in general. Specificity was modest for AMI, and although it may be a little better for ACI, it was still far from excellent.

Acute Cardiac Ischemia Time-Insensitive Predictive Instrument (ACI-TIPI)

The ACI-TIPI computes a 0-100 percent probability that a given patient has ACI (i.e., either acute myocardial infarction or unstable angina pectoris). Applicable to any ED patient presenting with any symptom suggestive of ACI, it is based on a logistic regression equation that uses presenting symptoms and ECG variables. Originally in a handheld calculator form, it is now incorporated into conventional electrocardiographs so that the patient's ACI-TIPI probability is printed with the standard ECG header text. In large controlled interventional trials in a wide range of hospitals, its use by ED physicians has been shown to reduce unnecessary admissions of patients without ACI and patients with stable (as opposed to unstable) angina, while not reducing appropriate hospitalization for patients with ACI. It has also been shown to help the triage speed and accuracy of less-trained and less-supervised residents. The wider dissemination and use of ACI-TIPI could result in significant positive impact on the triage of ACI patients in the ED.

Goldman Chest Pain Protocol

The Goldman chest pain protocol is based on a computer-derived model using recursive partitioning analysis to predict myocardial infarction in patients with chest pain. It has good sensitivity (about 90 percent) for AMI, but it was not developed to detect UAP as well. In a clinical impact study of "low-intensity, nonintrusive intervention" performed at a teaching hospital ED, no differences in hospitalization rate, length of stay, or estimated costs were demonstrated between the experimental group, which used the protocol, and the control group.

Other Computer-Based Decision Aids

Several investigators reported various computer-based decision aids to diagnose AMI. The artificial neural network (Baxt and Skora, 1996) was found to have high sensitivity and high specificity for AMI in a prospective study, but the clinical impact was not demonstrated.

Decision and Cost-Effectiveness Analysis

Some technologies for diagnosing ACI in the ED are more accurate and more expensive than others. We developed a cost-effectiveness model to evaluate the tradeoff between the costs of using a technology for the diagnosis of ED patients with ACI and its accuracy.

Decision and cost-effectiveness analyses were performed for 17 technologies and 4 combinations of technologies that were evaluated in the literature and this report. The cost analysis is from the payers' perspective (e.g., health insurance companies); patient outcomes are either appropriate triage or 30-day survival of patients with ACI.

As not all technologies can be applied to all patients in the ED (such as stress ECG), two different ED populations were used for the analysis: a general population model (population category I) with an AMI prevalence of 8 percent, and a subgroup model, in which high-risk patients are excluded (AMI prevalence of 6 percent). Stress tests, sestamibi imaging, and serial and continuous ECG were evaluated only in the subgroup population.

As expected, technologies with the best diagnostic accuracy for AMI and UAP had the highest values for appropriate triage for patients with ACI. Technologies that are more effective (greater number of patients with ACI appropriately triaged) tended to have higher total costs, with the exception of ACI-TIPI. The biomarkers were least costly and had the lowest values for appropriate triage. Algorithms, combination technologies, and echocardiography were the next most effective technologies, in that order. Sestamibi imaging and exercise ECG were more expensive than other technologies, but had excellent diagnostic performance for patients with ACI. Although the diagnostic performance of ACI-TIPI is hard to assess, its use did not decrease appropriate triage for patients with ACI; a large clinical trial showed that 97 percent of patients with ACI were appropriately triaged when the instrument was made available to ED physicians.

Based on data using only the diagnostic performance data of technologies, the combination technology of troponin T-echocardiography was the most cost-effective of all technologies applicable to the general population model. If results from clinical impact studies are incorporated, ACI-TIPI was the most cost-effective because of its very high triage accuracy and low cost.

The incremental cost-effectiveness of troponin T-echocardiography is about $7,670 per additional appropriate triage for a patient with ACI compared with the cost of serial or combination biomarkers.

The incremental cost-effectiveness of troponin T-echocardiography compared with that of the artificial neural network was approximately $10,568. The greater incremental cost-effectiveness reflected the smaller difference in triage accuracy between the artificial neural network and troponin T-echocardiography compared with that between the biomarkers and troponin T-echocardiography. Given the economic ramifications and the effects on the patient of a missed ACI diagnosis, this incremental cost-effectiveness for troponin T-echocardiography is minimal.

Because the estimates for UAP detection were based on sparse data, we evaluated the triage accuracy and cost-effectiveness of technologies for appropriate triage for patients with AMI only. In general, the relative cost-effective rankings for AMI triage accuracy were similar to those for patients with ACI. There were a few but important differences in triage accuracy for patients with AMI: (1) the Goldman protocol improved significantly, (2) serial CK-MB improved slightly, and (3) the combination troponin T-echocardiography was slightly better than ACI-TIPI (a difference of one patient with AMI appropriately triaged).

The combination troponin T-echocardiography was the most cost-effective, followed by the artificial neural network. The incremental cost-effectiveness between these two technologies was much larger than in the general ACI model: approximately $137,000 per additional appropriately triaged patient with AMI.

Sensitivity analyses, in which the prevalence of ACI was increased from 10 to 90 percent, does not change the relative cost-effectiveness of the technologies. As prevalence increased, the differences in cost-effectiveness among the technologies became smaller because costs increased less steeply than effectiveness values. At a prevalence as great as 90 percent, ACI-TIPI retained its dominant cost-effectiveness, followed by troponin T-echocardiography and artificial neural network.

In the subgroup model, ACI-TIPI was again the most cost-effective technology if data from clinical impact studies were incorporated. Sestamibi stress imaging had the best diagnostic performance (detected 82 percent of patients with ACI), followed by sestamibi rest scanning and exercise ECG. The per ED patient costs for these technologies, at over $2,700, were around $400 more than those for ACI-TIPI. The incremental cost-effectiveness between stress sestamibi imaging and the next cost-effective technology, the combination troponin T-echocardiography, was $12,757. As stress sestamibi imaging may result in the appropriate triage of 37 additional patients with ACI (per 1,000 ED patients) compared with troponin T-echocardiography, it appears to be a very cost-effective technology.

If data from the ACI-TIPI trial were used, the incremental cost-effectiveness of using ACI-TIPI compared with troponin T-echocardiography was only $1,502 per additional appropriate triage for a patient with ACI, a truly negligible increase for improved triage accuracy.

In the AMI only triage model, troponin T-echocardiography was the most cost-effective: it was less expensive than all the imaging technologies as well as ACI-TIPI and appropriately triaged nearly 99 percent of patients with AMI. Exercise ECG and stress sestamibi imaging also had 99 percent sensitivity for patients with AMI; however, the per ED patient costs for these two technologies is about $500 more than those for troponin T-echocardiography. The triage accuracy and cost for ACI-TIPI was nearly identical to those for troponin T-echocardiography. When the analysis was performed with the cost of ACI-TIPI at $0, the cost-effectiveness of ACI-TIPI and the combination was essentially equivalent.

The results of the decision and cost-effectiveness analysis should not be used as a definitive analysis of technology triage accuracy, as data on the actual effect on triage are lacking for most of the technologies. Furthermore, the values for sensitivity for UAP were estimates based on sparse data, which added to the uncertainty of the model. The decision analysis is also not meant to be used for clinical recommendations for individual patients, as pretest likelihoods are not explicitly modeled. Rather, these results should be used as an aid in decisionmaking and in understanding the factors that are involved in triage of patients with ACI in the ED. Prospective trials on the effect of technologies on actual ED patient triage are required before definitive conclusions can be made.

Chapter 5. Future Research

Many of the recommendations made here were also made in the 1997 Working Group report. Despite some progress in refining these technologies, the overall research effort in this area needs to be stimulated by proposing a research agenda to address these issues and by making research support available to complete the agenda.

Identifying patients with ACI in the ED is a complex problem. There is a diversity of patient populations, settings, availability of technologies, and timing of patients' presentation to the ED. Also, ACI is a clinical syndrome for which no one test may be the "best." All of these factors affect the selection of diagnostic technologies in these patients. Based on the review of the literature, we need additional research in the areas discussed in the sections that follow.

Diagnostic Performance Studies

Although more than 45 relevant studies have been published since 1994 on this topic, many of the diagnostic technologies for ACI remain underevaluated. The number of studies that have evaluated the diagnostic performance of echocardiography, sestamibi myocardial perfusion imaging, and exercise stress ECG remains small and more studies are needed. Although there are many studies of single presentation measurement of biomarkers (CK, CK-MB, myoglobin, or troponin T), the number of studies of serial measurements remains small. The few studies of serial (2 to 4 hours) myoglobin show promise of having high sensitivity and specificity for AMI; this finding should be verified. Studies of newer biomarkers such as P-selectin and fatty acid binding proteins are needed.

Combinations of Diagnostic Technologies

To date, most studies have evaluated a single technology. For example, single measurements of a biomarker at presentation to the ED, in general, have good specificity but limited sensitivity to detect AMI. Several of the current biomarkers (e.g., CK-MB, myoglobin, troponin I, or troponin T), when measured repeatedly over the course of several hours, can achieve excellent sensitivity for AMI while maintaining high specificity. But the several hours it takes to reach the correct diagnosis would delay the administering of reperfusion therapy for some of the patients and thus lessen the potential clinical benefit of this treatment. Even with improved accuracy for AMI when serial measurements are taken, currently available biomarkers do not identify most of the patients with UAP.

Given the morbidity and mortality associated with ACI, a diagnostic test or test combination with very high sensitivity and specificity is required. Also, since UAP may be a continuum of conditions, test combinations that optimize the strengths of the constituent tests may prove valuable. Research is needed to determine whether combinations of multiple tests, such as a panel of biomarkers, or of multiple modalities, such as ECG with serial CK-MB measurements, perform better than the component tests alone.

Clinical Impact Studies

The 1997 Working Group report also noted the lack of health services research on the clinical impact of these technologies. Attention has clearly been focused on diagnostic accuracy, especially for AMI. It is clearly important to have technologies with high sensitivity and specificity values to diagnose ACI in order to minimize missed ACI and avoid unnecessary hospitalization. Good test performance, however, does not automatically translate to appropriate utilization of the technology or desired clinical outcome. Therefore, in addition to high sensitivity and specificity, a diagnostic technology must also demonstrate desired clinical impact during routine use in the ED setting. Clinical impact studies need to assess the clinical implications of the actual use of the technologies and the economic implications of both correct and incorrect diagnoses. The high cost of some of these diagnostic technologies, especially when applied to selected low risk subpopulations, mandates that cost-effectiveness analyses should be part of the clinical impact studies.

ACI-TIPI was demonstrated in a large multicenter clinical trial to have high accuracy of identifying patients with ACI and reduce unnecessary hospitalizations. This finding needs to be repeated at other ED settings and by other investigators.

Multiple but Standardized Research Variables

The heterogeneity of inclusion and exclusion criteria, tests, outcome measures, and circumstances of testing found in the literature makes synthesizing the research results difficult. Reducing this heterogeneity by implementing a set of standard research variables would greatly assist in comparing studies. Such standard variables should accurately reflect the most common circumstances in which these technologies are used in the ED.

The characteristics of patients enrolled in studies need to be clearly defined. This is critical to ensure internal validity and to allow for study comparisons and data analyses and in attempting to apply the results to clinical practice. In our research, we found there was little correlation between the prevalence of ACI/AMI and the definitions of the study population. Studies with similarly defined populations often reported a wide range of ACI or AMI prevalence, and differently defined populations often have similar overlapping ranges of prevalence of ACI/AMI. Since the prevalence of the ACI/AMI outcomes is the most objective measure of the study's population similarity, the lack of correlation between the criteria and the prevalence raises the question of whether the inclusion criteria used in current studies are sufficiently refined to assist the interpretation of the results.

Standardization of research variables would also aid in identifying the best strategies for detection of patients with AMI and UAP. Because of the numerous possible combinations of technologies, it may not be possible to conduct all the clinical trials to assess the clinical impact of combination strategies. By standardizing protocols (of how diagnostic technologies are used), registries of patient data can be developed to provide complementary evidence of how best to diagnose patients with either AMI or UAP.

Higher Quality Studies and Better Reporting

The methodological quality of diagnostic test evaluation studies in general is lower than that of the randomized controlled trials used to assess efficacy of therapeutic interventions. This is also the case among the studies we examined. The AMI outcome is more often clearly defined than UAP. Some authors reported procedural diagnoses, such as angioplasty, or diagnoses based on invasive or noninvasive procedures, such as significant coronary artery disease, based on angiography or imaging rather than a symptom-defined diagnosis of UAP (using Braunwald's criteria, for example). Even though the procedural diagnoses could be reported, it would be desirable if studies would report UAP as clinically defined (or agreed on) to ensure consistency across studies.

Our literature review was complicated by several instances of multiple publications of overlapping data and sometimes of the same data. To avoid the bias resulting from overcounting, we had to eliminate some studies from our analyses. Several studies by the same group of authors published in adjacent years appeared to be based on the same data, but we were informed they were indeed independent studies when the investigators were contacted. To avoid under- or overusing published data, authors need to alert readers when there is overlapping of patient data in their reports.

Another major problem that we frequently encountered is the lack of clarity and accuracy of the reporting of data. Diagnostic studies are not consistently reported in a uniform manner, thus making it difficult to identify and verify relevant pieces of data needed for evidence synthesis. In addition, some studies reported data that contradicted their summary results, and some studies reported different values for the same outcomes in different locations in the report. There is much room for improvement in the conduct and reporting of diagnostic test evaluation studies. A document similar to the CONSORT statement that was published to improve the reporting of randomized controlled trials would be very useful to improve the reporting of diagnostic test evaluation studies.

Evidence Tables

Supplemental Analyses

Meta-Analyses

Table 53. Meta-analysis of diagnostic performance of prehospital 12-lead ECG for ACI
Study Characteristics and Pooled Results
StudyTP/FNFP/TNSens95% CISpec95% CI1/Var.
Kudenchuk 1998740/87356/13580.460.43-0.480.960.95-0.9748.1050
Bertini 1991298/1619/2720.950.92-0.970.930.90-0.968.6049
Aufderheide 1992b102/13726/1740.430.36-0.490.870.81-0.9116.6691
Aufderheide 199055/642/480.900.79-0.960.530.43-0.644.8193
Dalzell 199163/181/70.780.67-0.860.880.47-0.991.2950
Total (range)2,3082,0030.43-0.950.53-0.96
REM pooled0.760.54-0.890.880.67-0.96
Odds ratio23.3(6.3-85.4)

Note: The following acronyms are used in the tables: TP-true positive; FN-false negative; FP-false positive; TN-true negative; Sens-sensitivity; CI-confidence interval; Spec-specificity; Var-variance.

Table 54. Meta-analysis of diagnostic performance of prehospital 12-lead ECG for AMI
Study Characteristics and Pooled Results
StudyTP/FNFP/TNSens95% CISpec95% CI1/Var.
Kudenchuk 1998259/13241/7570.660.61-0.710.950.93-0.9627.3566
Bertini 1991120/3729/4190.760.69-0.830.940.91-0.9614.1654
Aufderheide 1992b38/531/3470.420.32-0.531.000.98-1.001.8216
Aufderheide 199013/111/1260.540.33-0.740.990.95-1.001.4857
Foster 199417/40/1340.810.58-0.941.000.97-1.000.7796
Millar-Craig 199771/153/370.990.92-1.000.410.31-0.521.7657
Dalzell 199137/111/450.770.62-0.870.980.87-1.001.5504
Arntz 1992268/1494/8050.640.59-0.691.000.99-1.004.7192
Total (range)1,2212,8000.42-0.990.41-1.00
REM pooled0.680.59-0.760.970.90-0.99
Odds ratio104 (48-224)
Table 55. Meta-analysis of diagnostic performance of single CK at presentation to diagnose AMI (ED studies)
Study Characteristics and Pooled Results
StudyTP/FNFP/TNSens95% CISpec95% CI1/Var.
Thomson 199529/4022/2920.420.30-0.540.930.89-0.959.4696
Eisenberg 19797/612/550.540.26-0.790.820.70-0.902.5997
Mair 1991b22/2913/620.430.30-0.580.830.72-0.905.9958
Viskin 198729/4713/1630.380.27-0.500.930.87-0.967.5275
Lee 198740/64108/4270.380.29-0.490.800.76-0.8319.3463
Gornall 199611/298/500.280.15-0.440.860.74-0.933.9595
Hetland 19963/426/820.070.02-0.190.930.85-0.972.2936
Mair 1991a8/158/650.350.17-0.570.890.79-0.953.2347
Roxin 198422/5810/2150.280.18-0.390.960.92-0.986.3635
Hedges 198728/23253/4690.550.40-0.690.650.61-0.6811.9452
Total (range)5522,3330.07-0.550.65-0.96
REM pooled0.360.29-0.440.880.80-0.93
Odds ratio4.0 (2.6-6.2)
Table 56. Meta-analysis of diagnostic performance of single CK-MB at presentation to diagnose AMI (ED studies)
Study Characteristics and Pooled Results
StudyTP/FNFP/TNSens95% CISpec95% CI1/Var.
Montague 19959/161/630.360.19-0.570.980.91-1.001.4517
Mair 199623/167/540.590.42-0.740.890.77-0.954.0226
Hetland 199620/252/860.440.30-0.600.980.91-1.002.2897
Mair 1991b29/226/690.570.42-0.700.920.83-0.974.2095
Gornall 199610/301/570.250.13-0.410.980.90-1.001.5328
Thomson 199538/3020/2870.560.43-0.680.930.90-0.969.0902
Hedges 199638/2935/9400.570.44-0.690.960.95-0.9711.2708
Brogan 19945/171/1660.230.09-0.460.990.96-1.001.3489
Collins 199342/393/1110.520.41-0.630.970.92-0.993.2253
Castaldo 199413/452/970.220.13-0.360.980.92-1.002.2636
Total (range)4962,0080.22-0.590.89-0.99
REM pooled0.440.35-0.530.960.94-0.97
Odds ratio23 (17-32)
Table 57. Meta-analysis of diagnostic performance of serial CK-MB to diagnose AMI (ED studies)
Study Characteristics and Pooled Results
StudyTP/FNFP/TNSens95% CISpec95% CI1/Var.
Gerhardt 1982206/19/2651.000.97-1.000.970.94-0.981.6356
Montague 199517/85/590.680.47-0.840.920.82-0.972.7814
Gibler 199512/017/9811.000.74-1.000.980.97-0.990.8175
Hedges 199659/849/9260.880.77-0.940.950.93-0.966.6319
Hedges 199219/911/2220.680.48-0.830.950.91-0.974.1059
Brogan 19949/131/1660.410.22-0.630.990.96-1.001.4530
Castaldo 199436/225/940.620.48-0.740.950.88-0.983.9888
Total (range)4192,8100.41-1.000.92-0.99
REM pooled0.800.61-0.910.960.94-0.98
Odds ratio171 (58-505)
Table 58. Meta-analysis of diagnostic performance of presentation myoglobin to diagnose AMI (ED studies)
Study Characteristics and Pooled Results
StudyTP/FNFP/TNSens95% CISpec95% CI1/Var.
Brogan 199412/103/1640.550.33-0.750.980.94-1.002.3092
Roxin 198457/2338/1870.710.60-0.810.830.77-0.8811.0202
Gilkeson 19785/84/540.380.15-0.680.930.83-0.981.9191
Mair 199223/287/680.450.31-0.600.910.81-0.964.5663
Mair 199618/217/550.460.30-0.630.890.78-0.954.0690
Hetland 199623/225/830.510.36-0.660.940.87-0.983.7364
Gornall 199617/231/570.430.27-0.590.980.90-1.001.5979
Montague 199516/1317/780.550.36-0.730.820.73-0.894.9098
Kennedy 19978/133/670.380.19-0.610.960.87-0.992.1752
Castaldo 199422/360/990.380.26-0.521.000.96-1.000.9158
Total (range)3989970.38-0.710.82-1.00
REM pooled0.490.41-0.570.930.88-0.96
Odds ratio13 (7.9-21)
Table 59. Meta-analysis of diagnostic performance of serial myoglobin (2 to 4 hours) to diagnose AMI (ED studies)
Study Characteristics and Pooled Results
StudyTP/FNFP/TNSens95% CISpec95% CI1/Var.
Roxin 198478/238/1870.980.90-1.000.830.77-0.882.6218
Montague 199525/015/491.000.86-1.000.770.64-0.860.8593
Brogan 199416/66/1610.730.50-0.880.960.92-0.992.7718
Castaldo 199452/60/990.900.78-0.961.000.96-1.000.8441
Kennedy 199718/33/670.860.63-0.960.960.87-0.991.7079
Total (range)2066250.73-1.000.77-1.00
REM pooled0.900.78-0.960.920.82-0.97
Odds ratio140 (66-300)
Table 60. Meta-analysis of diagnostic performance of presentation troponin T to diagnose AMI (ED studies)
Study Characteristics and Pooled Results
StudyTP/FNFP/TNSens95% CISpec95% CI1/Var.
Mair 199113/103/700.570.35-0.760.960.88-0.992.2878
Mair 199611/285/570.280.16-0.450.920.82-0.973.2498
Hetland 199624/2112/760.530.38-0.680.860.77-0.925.6090
Gust 19984/121/510.250.08-0.520.980.88-1.001.2144
Hamm 199724/2355/6710.510.36-0.660.920.90-0.949.7299
Total (range)1701,0010.25-0.570.86-0.98
REM pooled0.440.32-0.560.920.88-0.95
Odds ratio10 (5.9-18)
Table 61. Meta-analysis of diagnostic performance of rest echocardiography to diagnose ACI (broadened definition of ACI used)
Study Characteristics and Pooled Results
StudyTP/FNFP/TNSens95% CISpec95% CI1/Var.
Kontos 199815/622/1420.710.48-0.880.870.80-0.913.7606
Mohler 199827/280/370.490.36-0.631.000.91-1.000.8885
Peels 199022/34/140.880.68-0.970.780.52-0.931.7045
Total (range)1012190.49-0.880.78-1.00
REM pooled0.700.43-0.880.870.72-0.94
Odds ratio20 (9-48)
Table 62. Meta-analysis of diagnostic performance of rest echocardiography to diagnose AMI
Study Characteristics and Pooled Results
StudyTP/FNFP/TNSens95% CISpec95% CI1/Var.
Kontos 19986/031/1481.000.54-1.000.830.76-0.880.7291
Peels 199012/114/160.920.62-1.000.530.35-0.711.3316
Sabia 199127/260/800.930.76-0.990.570.49-0.652.4348
Total (range)483490.92-1.000.53-0.83
REM pooled0.930.81-0.970.660.43-0.83
Odds ratio20 (6.5-62)
Table 63. Meta-analysis of diagnostic performance of sestamibi imaging to diagnose ACI
Study Characteristics and Pooled Results
StudyTP/FNFP/TNSens95% CISpec95% CI1/Var.
Kontos 1997b91/2380/3380.800.71-0.870.810.77-0.8414.6350
Stewart 19966/028/341.000.54-1.000.550.42-0.670.7156
Hilton 199414/118/690.930.66-1.000.790.69-0.871.4878
Total (range)1355670.80-1.000.55-0.81
REM pooled0.810.74-0.870.730.56-0.85
Odds ratio18 (11-29)

Sestambi scan- broader definition of coronary disease syndrome

Table 64. Meta-analysis of diagnostic performance of sestamibi imaging to diagnose AMI
Study Characteristics and Pooled Results
StudyTP/FNFP/TNSens95% CISpec95% CI1/Var.
Kontos 1997b26/2145/3590.930.75-0.990.710.67-0.752.5429
Stewart 19961/034/331.000.03-1.000.490.37-0.620.3269
Hilton 199412/020/701.000.74-1.000.780.68-0.860.8136
Total (range)416610.93-1.000.49-0.78
REM pooled0.920.78-0.980.670.52-0.79
Odds ratio26 (6-113)
An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F015.jpg.

   Figure 15. SROC analysis of serial CK-MB to diagnose AMI (ED, hospital, and CCU studies combined)1

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F018.jpg.

   Figure 18. SROC analysis of presentation troponin T to diagnose AMI (ED only studies)

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F019.jpg.

   Figure 19. SROC analysis of rest echocardiography to diagnose ACI (broadened definition of ACI used - see text)

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F020.jpg.

   Figure 20. Studies of rest echocardiography to diagnose AMI1

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F021.jpg.

   Figure 21. SROC analysis of sestamibi perfusion imaging to diagnose ACI

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F022.jpg.

   Figure 22. Studies of sestamibi perfusion imaging to diagnose AMI1

Data used for meta-analyses and the results are presented in this chapter (Tables 53 to 64; Figures 921 to 22). The methodologies for performing meta-analyses were described in Chapter 2. Meta-analyses were performed for technologies where there are an adequate number of studies.

Meta-analysis results of test performance are shown here for prehospital 12-lead ECG, single CK, single and serial CK-MB, myoglobin, troponin T, rest echocardiography, and rest sestamibi.

Explanations for Summary Receiver Characteristics Analysis Figures

Plotted in each of the SROC graphs are individual studies depicted as ellipses. The x- and y- dimensions of the ellipses are proportional to the square root of the number of patients available to study the specificity and sensitivity, respectively, within the analysis. In some of the figures, the ellipses are labeled with the name and publication year of the study, or the values of a variable (e.g., duration of symptoms in hours for CK enzymes), to provide a reference when the individual studies are mentioned in the discussions about the analyses. In some of the figures, the ellipses may be shaded with more than one density to depict various groupings of the studies. The unweighted SROC curve is displayed only within the range where data are available. The cross (X) represents the random effects average of sensitivity and specificity values of the studies. The x- and y- dimensions of the shaded rectangle represent the 95 percent confidence intervals of the combined specificity and sensitivity, respectively. Additional explanations are provided in the figures as needed.

Decision and Cost-Effectiveness Analysis

Introduction

Little information exists to aid ED physicians in the decision as to which test best identifies a patient with ACI among several alternatives or to aid policymakers in the decision as to which tests represent the best value. As some technologies for the diagnosis of ACI are more accurate and also more expensive than others, the tradeoff between cost and effectiveness may give valuable information about decisions on use of specific technologies in the ED.

We developed a decision analytic model to assess outcomes and costs that result from applying different diagnostic strategies to patients presenting to the ED with signs and symptoms suggestive of ACI. The model applies to all adult patients who present to the ED in whom the ED physician suspects UAP or AMI. We evaluated the cost-effectiveness of 17 different strategies and 4 combinations of technologies applied in all ED patients, as well as in a subgroup of low-risk ED patients with possible ACI. The technologies evaluated are those analyzed in this report (see Chapter 3); the combinations of technologies are those that have been reported in the literature.

Assessment of the cost-effectiveness of diagnostic strategies is complex because of the various definitions of effectiveness and the multiple factors that affect it. We attempted to be as inclusive as possible by taking into account both the patient's and payers' perspectives in the overall analysis. For the patient's perspective, we used two definitions of effectiveness: appropriate triage and 30-day survival of patients with ACI. Quality-of-life measures are not included because the model evaluates triage for, and not management of, ACI.

The cost analysis is from the payers' perspective (e.g., health maintenance organizations, commercial insurance companies, and the government). Direct costs associated with the use of the technologies and the triage of ED patients are considered. Indirect costs, such as lost work days, are not included.

The model demonstrates how the diagnostic performance of a technology affects (1) costs, (2) appropriate triage for ACI, and (3) 30-day survival; the model also evaluates the cost-effectiveness and marginal cost-effectiveness of each technology. The cost-effectiveness of a technology is the cost of applying the technology to an ED patient with possible ACI per appropriate triage (hospitalization) for a patient who truly has ACI.

Methods

Overview

The decision analysis applies to the population of ED patients who have signs and symptoms of ACI. The technologies evaluated in the decision analysis are:

  1. Serum biomarkers: single and serial CK-MB; single and serial troponin T; troponin I; single and doubling of myoglobin.

  2. ECG-based technologies and algorithms: continuous and serial ECG, nonstandard ECG leads, ACI-TIPI, artificial neural network, Goldman chest pain protocol, and stress ECG.

  3. Imaging studies: rest and stress echocardiography; rest and stress sestamibi.

  4. Combinations of two technologies: single CK-MB and single myoglobin; single CK-MB and serial ECG; single myoglobin and artificial neural network; single troponin T and echocardiography.

The cost-effectiveness of a specific technology is the sum of all the costs incurred by using the technology on a patient presenting to the ED divided by its effectiveness. Effectiveness is defined as either accurate triage for ACI (i.e., hospitalization) or 30-day survival. Although the results of diagnostic testing are rarely used alone to guide triage decisions, the lack of data on how most technologies affect triage of ED patients led us to use diagnostic performance as a proxy for triage accuracy.

Data on diagnostic performance for each technology were obtained from the meta-analyses and pooled results in this report or estimated from values reported in the literature. Sources for cost data included published Medicare reimbursements for patient services and median nationwide physicians fees for technologies. We used published mortality, hospitalization, and prevalence rates from a large study which included all ED patients evaluated for ACI.

Data TreeAge 3.5 software (Williamstown, MA) was used to construct the decision model and perform the cost-effectiveness analyses. Sensitivity analyses on relevant variables were performed to determine the effect of the specific values used and to assess the stability of the results from the base analysis.

Some of the technologies may not apply to all patients, such as patients at high risk for AMI. These patients include those with acute chest pain and significant ST-segment elevation or depression on initial ECG. It is unlikely that such patients would undergo stress ECG or imaging studies. Indeed, the studies that evaluated the diagnostic performance of such technologies explicitly excluded patients with significant ST-segment elevation on initial ECG from study protocols. Furthermore, it is unlikely that stress testing or sestamibi imaging would add significantly to diagnostic decisionmaking in patients with obvious AMI on presenting ECG.

To account for this, we used two models in the decision analysis. The General Population Model, which includes all ED patients, does not include any technologies that could not be applied to all ED patients. Such technologies include all stress testing, sestamibi imaging, continuous and serial ECGs, and technology combinations which use these technologies. We evaluated these technologies in the Subgroup Model which includes ED patients who do not have ST elevation on initial ECG that would preclude them from stress testing and imaging. This population has a lower prevalence of AMI than that of the General Population Model.

The structure, outcomes, and patient dispositions of the two models, the General Population Model and the Subgroup Model, are the same. The models differ in their AMI and ACI prevalences and in the technologies they evaluate. All technologies are evaluated in the Subgroup Model; a subset of technologies is evaluated in the General Population Model. Both models estimate the costs of the outcomes from applying technologies to their respective patient populations.

Patient Population

The General Population Model applies to all patients who present to the ED with signs and symptoms suggestive of ACI (population category I, described in Chapter 2). The Subgroup Model applies to a subgroup of the General Population Model: lower risk patients who do not have significant ST elevation on initial ECG (population category III).

The values for the prevalence of ACI, 23 percent, and AMI, approximately 8 percent, used in the General Population Model were obtained from the ACI-TIPI trial which evaluated all patients who presented to the ED with possible ACI (Pope, Ruthazer, Beshansky, et al., 1998; Pope, Aufderheide, Ruthazer, et al., 2000). The prevalence of ACI and AMI in the Subgroup Model was estimated by reducing the proportion of AMI patients by the percentage of AMI patients who have significant ST elevation on initial ECG. As approximately 25 percent of AMI patients may have significant ST elevation on initial ECG (Pope, Ruthazer, Beshansky, et al., 1998), we estimated the prevalence of AMI in the Subgroup Model to be 6 percent. The prevalence of UAP remains the same; thus the prevalence of ACI in the Subgroup Model is 21 percent.

Technology Performance Characteristics

Clinical test results are usually evaluated in the context of the patient's signs and symptoms, physical examination, and the ED physician's suspicion of ACI. Thus, triage decisions by ED physicians incorporate not only test results but clinical assessments as well. Sometimes test results may be ignored because of overwhelming clinical signs and symptoms and/or physicians' intuitions.

However, data on the effect of technologies on ED physicians' triage decisions of patients with possible ACI are very sparse. The ACI-TIPI trial, in which patient triage outcomes with the use of the predictive instrument were compared with triage outcomes without the instrument, is one of the few studies to provide data on the actual impact of a technology on ED patient triage.

As a result, we used the reported sensitivity and specificity of a technology for AMI and ACI as proxies for a test's "clinical impact," i.e., what we considered "triage accuracy." The diagnostic performance of a test may even be thought to represent the technology's optimal performance: how well a technology detects ACI without the added decisionmaking input (correct or incorrect) from an ED physician. The probabilities for appropriate ACI and non-ACI triage for each technology are thus based on the technology's reported diagnostic performance in ED patients, with the notable exception for ACI-TIPI, in which actual patient triage outcomes are available. The diagnostic performance for the four combinations of technologies were obtained from values reported in studies in which the combinations were evaluated.

Assessment of technology diagnostic performance for detection of ACI was determined from studies that applied the technologies to all eligible ED patients. Estimates for technology diagnostic performance for ACI were obtained from the meta-analyses or pooled results found in this report. When data on a technology's diagnostic performance in the ED were not available, values based on reported results in other settings (e.g., admitted ED patients) or on estimates were used.

Evaluation of the diagnostic performance of technologies for AMI is relatively straightforward because the reference standard for AMI in the majority of studies was the WHO criteria (with minor variations). In contrast, data on the diagnostic performance of technologies for the detection of UAP were lacking or particularly difficult to assess because definition and identification of UAP were not reported. In some cases, we used diagnostic performance for the detection of non-AMI "severe" coronary artery disease or the need for revascularization as proxies for unstable angina. Thus, the majority of sensitivity values for UAP are estimates.

Table 65. Sensitivity values for serum biomarkers' diagnostic performance used in the decision analysis
TechnologyAcute myocardial infarctionUnstable angina 1Nonacute cardiac ischemia
Base valueReferenceBase valueReferenceBase valueReference
CK-MB single0.44Supplemental Analyses: Meta-Analyses0.02Estimate based on: Hedges, 1987, 1996; Hamm, 1997; Laurino, 19960.95Estimate from Laurino, 1996; Levitt, 1996; Mohler, 1998
CK-MB serial0.88Supplemental Analyses: Meta-Analyses0.05Above0.95Above and Hedges, 1992
Myoglobin initial0.5Supplemental Analyses: Meta-Analyses0.05Kennedy, 19970.9Levitt, 1996; Kennedy, 1997; Gilkeson, 1978; Laurino, 1996
Myoglobin doubling0.87Estimate from pooled analysis (Chapter 3)0.2Estimate based on troponin data0.9Estimate based on above
Troponin T initial0.53Meta-Analysis0.2Mohler, 1998; Hamm, 1992, 19970.98Mohler, 1998
Troponin T serial0.88Sayre, 1998; Hamm, 19970.2Mohler, 1998; Hamm, 19970.98Estimate based on Green, 1998; Mohler, 1998
Troponin I0.35Mair, 1996; Tucker, 1997; Brogan, 1997; Hamm, 19970.2Mair, 1996; Hamm, 19970.98Hamm, 1997; Kontos, 1999b
1

Some values estimated from data on diagnostic performance for detection of coronary artery disease or similar endpoint because of sparse data on diagnostic performance for the detection of UAP specifically.

Table 66. Sensitivity values for imaging studies' diagnostic performance used in the decision analysis
TechnologyAcute myocardial ischemiaUnstable anginaNonacute cardiac ischemia
Base valueReferenceBase valueReferenceBase valueReference
ECHO rest0.93Supplemental Analyses: Meta-Analyses0.35Sasaki, 1986; Mohler, 19980.66Estimate from meta-analysis
ECHO stress0.93Estimated from Trippi, 19960.5Estimate from Trippi, 19970.89Trippi, 1996
Sestamibi rest0.93Supplemental Analyses: Meta-Analyses0.75High estimate based on Hilton, 1994; Kontos, 1999b; Varetto, 19930.77Meta-analysis
Sestamibi stress0.99Estimate based on Stewart, 19960.75Above0.99Estimate based on Hilton, 1994; Kontos, 1999b; Varetto, 1993
Table 67. Sensitivity values of diagnostic performance of ECG-based technologies and algorithms used in the decision analysis
TechnologyAcute myocardial ischemiaUnstable anginaNonacute cardiac ischemia
Base valueReferenceBase valueReferenceBase valueReference
Exercise ECG0.99Estimate based on Kirk, 1998; Lewis, 19990.6Lewis, 1999; Garber, 19990.77Kirk, 1998; Lewis, 1999; Tsakonis, 1991; Garber, 1999
ACI-TIPI0.98Selker, 19980.97Selker, 19980.43Selker, 1998
Goldman chest pain protocol0.89Goldman, 1982, 1988; Poretsky, 19850Not intended for detection of UAP0.7Estimate based on Goldman, 1982, 1988; Poretsky, 1985
Continuous/serial ECG0.4Estimate based on: Hedges, 1992; Fesmire, 19980.2Estimate based on Gibler, 19950.95Fesmire, 1998; Gibler, 1995
Nonstandard ECG leads0.7Estimate based on Justis, 1992; Zalenski, 1993, 1997b0.2Estimate 10.85Estimate based on Justis, 1992; Zalenski, 1993, 1997b
Artificial neural network0.95 and 0.5 2Baxt, 1991, 1996; Kennedy, 19970.23Kennedy, 19970.83Kennedy, 1997
1

Estimated from data on diagnostic performance for detection of coronary artery disease or similar endpoint because of sparse data on diagnostic performance for the detection of UAP specifically.

2

Two different values used: for General Population Model, sensitivity for AMI is 0.95 (Baxt, 1991, 1996); for Subgroup Model, sensitivity for AMI is 0.5 (Kennedy, 1997).

Table 68. Sensitivity values of combination technologies' diagnostic performance used in the decision analysis
TechnologyAcute myocardial infarctionUnstable anginaNonacute cardiac ischemia
Base valueReferenceBase valueReferenceBase valueReference
CK-MB (single) & myoglobin (single)0.7Estimate based on Levitt, 1996; Montague, 1995; Kontos, 1997a, 1999a0.3Estimate 10.9Levitt, 1996
Troponin T (single) & ECHO0.99Mohler, 19980.5Estimate based on Mohler, 19980.9Estimate based on Mohler, 1998
CK-MB (single) & serial ECG0.79Hedges, 19920.5Estimate 10.85Hedges, 1992
Myoglobin (single) & artificial neural network0.8Estimate based on Kennedy, 19970.4Estimate based on Kennedy, 19970.56Estimate based on Kennedy, 1997
1

Some values estimated from data on diagnostic performance for detection of coronary artery disease or similar endpoint because of sparse data on diagnostic performance for the detection of UAP specifically.

Values or estimates for each technology's sensitivity for AMI, UAP and non-ACI (specificity) used in the decision analysis, and the sources from which values or estimates are derived, are shown in Tables 65 to 68.

Outcomes

The model evaluated cost-effectiveness separately for each of two outcomes: triage accuracy and 30-day survival of patients with ACI. Quality-adjusted life-years were not used, since the focus of the analysis was appropriate triage for ACI and not long-term management of ACI.

Triage accuracy of the technologies for patients with ACI was defined as the number or proportion of patients with ACI appropriately admitted to the hospital from the ED. Admission was considered appropriate and ED discharge was considered inappropriate triage for patients with ACI.

Thirty-day survival was similarly defined: the number of patients with ACI who survived to 30 days. The probability of 30-day survival is calculated from mortality data for patients with ACI admitted to the hospital or discharged from the ED from the large ACI-TIPI trial (Pope, Aufderheide, Ruthazer, et al., 2000).

Table 69. Transition probabilities
VariableBase valueReference
Prevalence of ACIGeneral Model: 23% Subgroup Model: 21%Maynard, 1996; Pope, 1998
Prevalence of UAPBoth models:15%Pope, 1998
Prevalence of AMIGeneral Model: 8.4% Subgroup Model: 6%Maynard, 1996; Pope, 1998
Subsequent hospitalization rate within 30 days for missed AMI72%Pope, 2000
Subsequent hospitalization rate within 30 days for missed UAP42%Pope, 2000
30-day survival rate for hospitalized patients with AMI89%Pope, 2000
30-day survival rate for hospitalized patients with UAP98%Pope, 2000
30-day survival rate for patients with AMI discharged from the ED89%Pope, 2000
30-day survival rate for patients with UAP discharged from the ED96%Pope, 2000
Percentage of patients with untreated UAP who develop AMI within 30 days5%Estimate based on expert opinion and National Cooperative Study Group. 1981, and Krauss, 1972
Subsequent hospitalization rate for patient without ACI discharged from the ED10%Analysis of ACI-TIPI data
Value of appropriate triage or survival for patients with ACI1Decision analysis
Value of inappropriate triage or death for patients with ACI0Decision analysis
1

From the Physicians' Fee Reference, 1999; all values are the median nationwide fees.

2

PFR data not available; cost for test at New England Medical Center, Boston.

3

No separate billing code for nonstandard ECG; cost is that of a standard ECG.

4

This is the cost for an ECG as there is no additional cost for an ECG with the ACI-TIPI prediction output. Unless otherwise noted in the text, the cost for ACI-TIPI is that of an ECG.

Appropriate triage and survival of patients with ACI were assigned a utility value of 1. Inappropriate triage and death of patients with ACI were assigned a value of 0. Variable values for the probabilities used in the decision analysis are shown in Table 69.

Patient Disposition

The disposition of a patient who enters the ED is determined by the technology applied, the result of the test, the subsequent triage and followup evaluation, and the probabilities associated with survival or death from appropriately or inappropriately triaged ACI. This is detailed in the Description of the Decision Analysis section below. Values for probabilities affecting patient disposition, such as mortality and hospital admission rates, were obtained from the large ACI-TIPI trial or based on expert opinion and are shown in Table 69.

We did not include complications to patients that may arise from use of some of the technologies, such as stress testing, because of the extremely low rate of clinically significant complications and because we used a model to account for the subgroup of patients at higher risk of complications.

The rate of complications from stress testing, for example, is extremely low: As reported in the previous NHAAP report, in a national survey of ECG exercise testing, Stuart and Ellestad (1980) reported fewer than 1 death per 10,000 patients tested and overall morbidity (including clinically significant and insignificant arrhythmias and hypotension, some of which may have occurred regardless of stress testing) of 0.05 percent. None of the studies of stress sestamibi, ECG, or echocardiography stress testing evaluated in this report reported deaths or significant morbidity (complications that altered the care given to a patient).

We also accounted for the likely possibility of higher rates of complications from stress testing in a certain subgroup of patients with obvious ECG signs of ACI, such as T-wave inversion or ST elevation, or patients with severe clinical symptoms. Thus, these patients are only included in the General Population Model which does not include stress testing.

Costs

All costs used in the model are those that result from applying the technology to an ED patient and are incurred by the payer. The values for costs reflect the reimbursement to the hospital or outpatient clinic for patient services. The costs of the technology and subsequent management of a patient contribute to the total cost for each diagnostic strategy. The costs of subsequent management depend on a patient's disposition (discussed below) and may include costs of admission to the hospital, outpatient followup, return visits to the ED, and death from "missed" AMI or UAP. Costs associated with death of a patient with AMI or UAP inappropriately discharged from the ED range from a minimum of $600 for a return ED visit and resuscitation attempt to $2 million for a malpractice settlement case.

The cost of a technology represents a component of the total cost incurred by using the technology to determine triage for a patient with suspected ACI in the ED. Of note, ACI-TIPI was evaluated at two costs, $0 and $68, because ACI-TIPI may not be incorporated into all ECG machines in an ED. If the initial ECG performed in the ED has the ACI-TIPI printout, then there is no additional cost for the technology (the initial ECG is not included as an extra cost in the decision analysis). If a second ECG is required because the initial ECG does not provide the predictive instrument printout, then the cost for this ECG is considered the cost for ACI-TIPI.

Unless noted otherwise, all analyses were performed using the more conservative cost, the cost for an ECG ($68), for ACI-TIPI. Sensitivity analyses on the cost of ACI-TIPI were also performed and are described below.

Total costs do not include the reimbursement for the initial ED visit unless the patient is discharged from the ED, as the reimbursement for the ED visit for an admitted patient is usually included in the reimbursement for the admission. As the standard of care is to obtain an ECG on all ED patients evaluated for ACI, the cost of an initial ECG was not included in the analysis. Professional fees for administering the tests and ED and outpatient visits are included in total costs. Discounting was unnecessary because of the short time horizon of the analysis.

Table 70. Costs for technologies1
Technology and combinationsCosts ($)
CK-MB single45
CK-MB serial90
Myoglobin at time of ED presentation55
Myoglobin doubling110
Troponin T at ED presentation56 2
Troponin T serial112
Continuous/serial ECG297
Nonstandard ECG leads68 3
ACI-TIPI68 4
Goldman criteria0
Artificial neural network0
Exercise ECG296
ECHO rest379
ECHO stress929 (rest and stressed bundled)
Sestamibi rest834
Sestamibi stress1,130
Combination: CK-MB and myoglobin100
Combination: CK-MB and serial ECG342
Combination: Myoglobin and artificial neural network55
Combination: Troponin T and ECHO rest435
1

From the Physicians' Fee Reference, 1999; all values are the median nationwide fees.

2

PFR data not available; cost for test at New England Medical Center, Boston.

3

No separate billing code for nonstandard ECG; cost is that of a standard ECG.

4

This is the cost for an ECG as there is no additional cost for an ECG with the ACI-TIPI prediction output. Unless otherwise noted in the text, the cost for ACI-TIPI is that of an ECG.

Reimbursement rates for the technologies are taken from the Physicians' Fee Reference (1999), a compendium of nationwide fees for procedures and interventions charged by physicians and hospitals. We used the median nationwide fee charged for each technology because many payers reimburse only up to the 60th percentile. All costs include the physician report and interpretation. For strategies that used a combination of two technologies, the costs for each technology were added. The costs of the technologies used in the analysis are shown in Table 70.

Reimbursement rates for hospital admission were based on the average national payments for the diagnosis-related group (DRG) codes. The reimbursement rate for an ED visit for a patient with suspected ACI was obtained from actual ED patient visits to the New England Medical Center between October 1998 and May 1999. Outpatient visit reimbursements were calculated from median fees for tests performed as part of the outpatient workup and the average national payment for the professional component of the outpatient visit.

Table 71. Factors involved in calculating total costs1
Patient dispositionFactors involved in total cost calculationCost ($)
Patient with ACI appropriately triaged (admitted)Costs of: technology + admissionVaries
Patient with ACI inappropriately discharged, diesCosts of: technology + ED visit + return ED visit (resuscitation attempt) + missed ACI 2Varies
Patient with AMI inappropriately discharged, survivesCost of: technology + ED visit + subsequent admission or outpatient workupVaries
Patient with UAP inappropriately discharged, develops AMI, survivesCosts of: technology + ED visit + subsequent hospitalization for AMIVaries
Patient with UAP, inappropriately discharged, survivesCosts of: technology + ED visit + subsequent outpatient workup or subsequent admissionVaries
Non-ACI patient admitted for ACI treatmentCosts of: technology + 23-hr observation (ruled out for AMI)Varies
Non-ACI patient discharged from EDCosts of: technology + ED visit + outpatient workup or subsequent admissionVaries
Non-ACI admission (23-hr observation)2,158
Admission for ACIAverage of an average admission for an uncomplicated AMI ($4,627) and admission for UAP with angiography ($4,155)4,400
Initial and return ED visitsIncludes all services provided by the ED, including resuscitation attempts if necessary600
Outpatient visitIncludes professional fee and cost of exercise ECG430
Outpatient visit for patients with negative exercise ECG in EDIncludes professional fee and cost of stress sestamibi scan1,260
Outpatient visit for patients with negative stress sestamibi scan in EDProfessional fee only130
1

Values for patients' services obtained from average Medicare reimbursements based on DRG codes.

2

Costs associated with a missed ACI death include: cost for malpractice settlement or cost of a trial.

The costs and calculations for all total costs for each patient disposition are shown in Table 71.

Methods of Comparison

The diagnostic strategies were ranked by order of overall cost. The strategies that were both more expensive and less effective than another strategy were eliminated by strict dominance. The marginal cost-effectiveness of a strategy, or the additional cost per additional effectiveness, was compared with its less costly alternative among the remaining strategies. The analyses included evaluation of the total costs associated with 30-day survival and appropriate triage for patients with ACI.

Description of the Decision Analysis

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F023.jpg.

   Figure 23 Decision Tree

Shown is one decison node branch of the tree. Each technology represents a decision node, or option. The branches emanating from a chance node, depicted by a circle, represent the possible outcomes that could occur. The possible outcomes are determined by the probabilities assigned to each branch of a chance node. The number of patients at each of the terminal nodes, from a cohort of 1000 ED patients, is determined by the probabilities of all choices that occurred along the way.

ED D/C: EDdischargeOPD: Out-patient

* Determined by prevalence of AMI and UAP in population of ED patients

The decision analysis represents the possible dispositions and triage outcomes that may occur for a patient with signs and symptoms of ACI evaluated in the ED. The possible patient dispositions and triage outcomes are shown in Figure 23, which represents the basic decision model used in the analysis. All outcomes and dispositions are those that occur within 30 days of initial evaluation in the ED. All strategies, a total of 21, are evaluated in the Subgroup Model. All biomarkers, algorithms, nonstandard ECGs, ACI-TIPIs, echocardiography, and the combinations of CK-MB-myoglobin, and troponin T-echocardiography are evaluated in the General Population Model.

The following are the possible patient dispositions that may occur in the decision analysis:

  1. A patient with signs and symptoms of ACI is admitted to the ED at which time the ED physician may apply 1 of the 21 diagnostic strategies evaluated in this analysis. This is the only decision point in the tree. All subsequent patient dispositions and outcomes are determined by the prevalence of ACI in the ED population, the triage accuracy of the technology applied, and the probabilities of survival from appropriately or inappropriately triaged UAP and AMI.

  2. The probability that a patient has ACI is determined by the prevalence of AMI and UAP.

  3. A patient with ACI may either be admitted for management of ACI ("appropriate" triage) or may be sent home from the ED ("inappropriate" triage) based on the results of the diagnostic technology the ED physicians decide to use. A positive test result leads to admission; a negative test result leads to discharge from the ED.

  4. A patient with UAP and AMI who is admitted will either survive or die. Although this outcome does not depend on the technology used to detect the ACI, it is included in the model to show the difference in death rates in those with ACI appropriately admitted and those inappropriately discharged from the ED.

  5. A patient with AMI who is discharged from the ED may either survive or die. Those who survive will return for either hospital admission or outpatient evaluation within 30 days.

  6. A patient with UAP who is discharged from the ED may either survive or die. Those who survive may: (a) develop an AMI and survive or die, or (b) might not develop an AMI. A patient who develops an AMI and survives will subsequently return for hospital admission. A patient who does not develop an AMI (and does not die from UAP) will either return for hospital admission or have an outpatient workup.

  7. A patient without ACI may be either admitted for ACI management (because of a false positive test result) or discharged appropriately (true negative test result). Some patients without ACI, however, continue to have symptoms even after appropriate discharge from the ED. A proportion of these patients will continue to seek care and may be admitted later (during the 30-day period) to the hospital for evaluation or will have outpatient followup. All patients without ACI, admitted or discharged from the ED, survive the 30-day period.

  8. A patient without ACI who is admitted appropriately for a recognized non-ACI condition is not included in the analyses because of the great uncertainty in modeling the costs and outcomes of this disposition.

The 15 possible patient dispositions are summarized below:

  • AMI detected and patient admitted: patient survives.

  • AMI detected and patient admitted: patient dies.

  • AMI missed: patient dies.

  • AMI missed, patient survives: returns for admission.

  • AMI missed, patient survives: outpatient workup.

  • UAP detected and patient admitted: patient survives.

  • UAP detected and patient admitted: patient dies.

  • UAP missed: patient dies.

  • UAP missed: patient develops AMI and dies.

  • UAP missed: patient develops AMI and survives and returns for admission.

  • UAP missed: UAP continues and patient returns for admission.

  • UAP missed: UAP continues and patient returns for outpatient workup.

  • Non-ACI detected: patient discharged from ED and has outpatient workup.

  • Non-ACI detected: patient discharged from ED and returns for admission.

  • Non-ACI incorrectly diagnosed as ACI: patient admitted for ACI management.

Model Assumptions

Patient Population

  • General Population Model: patients have signs and symptoms suggestive of unstable angina or myocardial infarction; no patients are excluded.

  • Subgroup Model: same population as General Population Model excluding patients who have ST elevation on initial ECG; this represents a lower risk population.

  • Both models include patients who may present without chest pain.

  • The model does not apply to patients who are in cardiac arrest or in whom life-saving measures need to be applied.

  • Patients who have stable angina are considered not to have ACI; they are included in the non-ACI group.

  • All patients are able to undergo any of the technologies evaluated in the decision analysis.

Costs

  • All patients are given the same treatment in the ED, other than the diagnostic test, so the cost for an ED visit is the same for all patients.

  • The cost for hospitalization of a patient with UAP is the same as for a patient with AMI to prevent penalizing appropriate admission for AMI and to provide for the possibility of cardiac angiography for a patient with UAP. This assumption was also based on the similar hospital reimbursements for uncomplicated AMI ($4,627) and UAP with cardiac angiography ($4,155) admission.

  • The cost for ACI-TIPI is assumed to be the cost for an ECG ($68) unless otherwise noted (discussed above).

  • The cost for death of a patient with AMI or UAP discharged form the ED is the cost of a return ED visit and resuscitation attempt ($600) in the base analysis and increases to the cost of a typical malpractice settlement ($2 million) in the sensitivity analyses.

Diagnostic Technologies

  • All test results are either positive or negative. A nondiagnostic test is considered negative.

  • A positive test result for combination technologies requires that either test be positive.

  • A negative test result for combination technologies requires that both tests be negative.

  • There are no complications from biomarker, algorithm, ECG-based, or rest-imaging technologies, as no study reported complications from these technologies. There are no complications from the stress technologies (exercise ECG, stress imaging) because these technologies are applied only to the subgroup of ED patients who have AMI excluded on initial ECG and because no studies reported significant complications from these technologies.

  • Only patients in the Subgroup Model undergo sestamibi imaging or stress or serial/continuous ECG tests.

  • Stress testing may be physiological (exercise), or pharmacological (dobutamine).

  • The model does not account for patients who are unable to undergo exercise ECG. ED physicians may choose which of the remaining strategies to apply to these patients.

  • All patients who return for an outpatient workup because of a negative test result undergo exercise stress testing for evaluation of their symptoms (expert opinion).

  • Patients who have a negative exercise test in the ED and return for outpatient followup undergo sestamibi stress testing.

  • Patients who have a negative stress sestamibi scan in the ED and return for outpatient evaluation do not undergo further testing because the negative predictive value of stress sestamibi scan is very high (expert opinion).

Effectiveness

  • The effectiveness of a technology is determined by the proportion of patients who are given the appropriate triage.

  • Appropriate triage is hospital admission for ACI and discharge home or non-ACI treatment for non-ACI patients.

  • Appropriate triage for UAP is hospital admission because early therapy during a high-risk period may prevent complications and/or progression to AMI.

  • Inappropriate triage is discharge from the ED for ACI patients or admission to the hospital for ACI treatment for non-ACI patients.

  • Triage of ED patients is determined by the results of the test.

  • Patients who have a positive test result, or a test result that implies a high likelihood of ACI, are admitted to the hospital for ACI treatment. These patients may have either a true positive or a false positive test result.

  • Patients who have a negative test result, or a test result that indicates a very low likelihood of ACI, are not admitted to the hospital for ACI treatment. These patients may have either a true negative or a false negative test result.

Patient Disposition

  • No patients die in the ED during the initial visit.

  • Patients with ACI who are admitted are considered to have definitive evaluation for their condition during hospitalization.

  • All patients with ACI who are inappropriately discharged from the ED may: (1) be admitted subsequently to the hospital for evaluation, (2) undergo outpatient diagnostic testing, or (3) die.

  • Admitted patients who do not have ACI are admitted for a "rule-out MI" protocol (a 23-hour observation hospitalization).

  • Patients with non-ACI who are admitted are considered to have a definitive evaluation of their condition and therefore do not return for further followup.

  • Patients who do not have ACI and who are admitted for treatment of a non-ACI condition have appropriate triage and no additional costs.

Results

The results are presented in the following order:

  1. Description of costs of technologies and total costs.

  2. General Population Model: triage accuracy, results of cost-effectiveness analysis, sensitivity analysis.

  3. Subgroup Model: triage accuracy, results of cost-effectiveness analysis, sensitivity analysis.

  4. Survival outcome results.

  5. Sensitivity analysis on cost of ACI-TIPI.

Technology Costs and Total Costs

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F024.jpg.

   Figure 24. Cost of technology and ratio of total costs to technology cost

A: ACI-TIPI; AN: artificial neural network; CK: CK-MB, single; CE: continuous/serial ECG; CS: CK-MB, serial; ER: resting ECHO; ES: stress ECHO; ET: exercise ECG; G: Goldman chest pain protocol; I: troponin I; M: Myoglobin initial; MD: Myoglobin doubling; NS: nonstandard ECG; SES: stress sestamibi; SR: resting sestamibi; T: troponin T, initial; TS: troponin T serial; 1: CK-MB and Myoglobin; 2: myoglobin and artificial neural network; 3: CK-MB and serial ECG; 4: troponin T and ECHO.

The cost for a technology makes up a component of the total costs incurred by applying the technology to a population of ED patients. The ratio of total cost associated with a technology to the cost of the technology itself decreases exponentially as the cost of a technology increases (Figure 24). This pattern indicates that the proportion of total costs from using a less costly technology (such as a serum biomarker) is much smaller compared with using a more costly technology (such as an imaging study) in which nearly half the total cost is the cost of the technology itself. The relationship between total costs and the effectiveness of technologies is discussed in detail below.

General Population Model

This model estimates the total costs associated with the use of technologies that can be applied to any patient in the ED with signs and symptoms of ACI. The prevalence of ACI in this population is 23 percent; 8 percent of patients have AMI. A test with perfect sensitivity results in appropriate triage of 230 patients with ACI per 1,000 ED patients evaluated for ACI and has an effectiveness value of 0.23.

Triage Accuracy

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F025.jpg.

   Figure 25. General population model: Technology rankings by percent inappropriate triage (ED discharge) of patients with ACI at ACI prevalence of 23 percent

ANN: Artificial neural network applied in general ED population.

Figure 25 shows the rankings of technologies by percentage of patients with ACI inappropriately discharged from the ED. The biomarkers perform poorly, primarily because of their inability to detect UAP. The range of missed ACI is 83 percent for a single CK-MB value to 56 percent for serial troponin T. Serial testing or combination of biomarkers improves sensitivity. Of the biomarkers, troponin T appears to have the best triage accuracy for ACI. Troponin T and myoglobin perform better than CK-MB, primarily because their estimates for UAP detection are higher than that of CK-MB; however, estimates for detection of UAP are based on few studies (see Sensitivity Analysis, below).

Nonstandard ECG is better than the single biomarkers, but not as effective as serial troponin T or myoglobin doubling. The artificial neural network performs better than any of the biomarkers; however, prospective trials of its actual performance in the general ED population have not been performed. Although the Goldman chest pain protocol has very good sensitivity for AMI, it is not designed to identify patients with UAP; thus, its triage accuracy for patients with both AMI and UAP is much lower than that for patients with AMI. The combination of troponin T-echocardiography, the technology with the best diagnostic performance, leads to appropriate triage of 68 percent of patients with ACI.

If data from clinical impact studies are used, ACI-TIPI has the best triage accuracy: In a large clinical trial, use of ACI-TIPI did not change the 97-percent triage accuracy for patients with ACI of ED physicians (whose decisions may have incorporated the results of various and multiple technologies).

Cost-Effectiveness Analysis

Table 72. General population model: Cost-effectiveness of technologies for appropriate triage for patients with ACI
TechnologyCost 1ACI triage 2CE 3Marginal CE 4No. ACI triage/1,000 ED patients 5ACI triage(%) 6
CPK-MB single$1,7280.039$43,8773917
Troponin I$1,7540.058$30,016$1,3585825
Troponin T initial$1,7690.073$24,129$1,0497332
Myoglobin initial$1,7870.049$36,652(Dominated)4921
CK-MB serial$1,8180.080$22,666$7,1298035
Troponin T serial$1,8560.102$18,138$1,68610244
Neural network$1,8680.113$16,597$1,16211349
Nonstandard ECG$1,8920.087$21,643(Dominated)8738
Goldman$1,9010.074$25,793(Dominated)7432
Myoglobin doubling$1,9120.101$18,838(Dominated)10144
CK-MB & myoglobin$1,9120.102$18,720(Dominated)10244
Troponin T & ECHO$2,3220.156$14,923$10,55815668
ACI-TIPI (at cost $0)$2,3490.224$10,489$39722497
ECHO rest$2,4000.129$18,670(Dominated)12956
ACI-TIPI (at cost $68)$2,4170.224$10,790$1,39722497
1

Total cost of applying technology to one ED patient with possible ACI (includes costs of technology, hospitalization, and/or return ED visits and/or outpatient followup).

2

"Effectiveness" value: probability of appropriate triage for ACI for an ED patient at ACI prevalence of 23 percent.

3

Cost-effectiveness ratio: total cost of applying technology to one ED patient with possible ACI divided by probability of appropriate triage for ACI. Calculated values may differ because of rounding.

4

Marginal cost-effectiveness: ratio of difference in costs to difference in effectiveness between two adjacent nondominated strategies.

5

Number of patients with ACI appropriately triaged per 1,000 ED patients.

6

Percent appropriate triage for patients with ACI.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F026.jpg.

   Figure 26. General population model: Cost of applying technology to 1,000 ED patients and number of patients appropriately triaged for ACI at 23 percent prevalence of ACI (reference line)

A: ACI-TIPI at a cost of $0; A68: ACI-TIPI at a cost of $68; AN: artificial neural network applied in general ED population; CK: CK-MB, single; CS: CK-MB, serial; ER: resting ECHO; G: Goldman protocol; I: Troponin I; M: Myoglobin initial; MD: myoglobin doubling; NS: nonstandard ECG; T: troponin T, initial; TS: troponin T serial; 1: CK-MB and myoglobin; 4: troponin T and ECHO.

The cost-effectiveness of a technology, which represents the total costs associated with the technology divided by its effectiveness (appropriate triage for ACI) reflects the tradeoff between costs and effectiveness. The results of the cost-effectiveness analysis for appropriate triage for patients with ACI are shown in Table 72. As expected, technologies with the best diagnostic accuracy for both AMI and UAP have the highest values for appropriate triage for patients with ACI (see ACI Triage [%] column). Figure 26 shows the number of patients with ACI appropriately triaged and the costs associated with applying a technology to 1,000 ED patients.

Technologies that are more effective (greater number of patients with ACI appropriately triaged) tend to have higher total costs, with the exception of ACI-TIPI. The biomarkers are least costly and have the lowest values for appropriate triage. The CE of troponin T-echocardiography is lower than that of the biomarkers because, although it has higher total costs, its better triage accuracy "outweighs" the higher costs.

Based on data using the diagnostic performance of technologies, the combination technology troponin T-echocardiography has the best CE of all technologies in the General Population Model. If results from clinical impact studies are incorporated, ACI-TIPI has the best CE because of its very high triage accuracy and low cost.

The combination of troponin T-echocardiography results in appropriate triage for about 55 more patients with ACI (per 1,000 ED patients) than serial troponin T, myoglobin doubling, or the combination of single CK-MB and single myoglobin. The total per ED patient cost for troponin T-echocardiography is $2,322 compared with about $1,900 for serial or combination biomarkers. Thus, troponin T-echocardiography costs approximately $422 more per ED patient but results in appropriate triage for 55 more patients with ACI (per 1,000 ED patients) compared with serial or combination biomarkers. The incremental CE between troponin T-echocardiography and serial or combination biomarkers is about $7,670: it costs about $7,670 more per additional appropriate triage for a patient with ACI using the combination of troponin T-echocardiography compared with the serial or combination biomarkers.

The next most effective technology is the artificial neural network applied in a general ED population. The per ED patient cost of applying the artificial neural network is $1,868, about $455 less than the combination troponin T and echocardiography; however, 43 fewer patients with ACI (per 1,000 ED patients) are appropriately triaged. The incremental cost-effectiveness between troponin T-echocardiography and the artificial neural network is approximately $10,568. This incremental CE is greater than that between the serial biomarkers and troponin T-echocardiography because the difference in triage accuracy between the artificial neural network and troponin T-echocardiography is smaller than the difference between the serial biomarkers and troponin T-echocardiography. Given the economic ramifications and the effects on the patient of a missed ACI diagnosis, the incremental CE for troponin T-echocardiography is minimal.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F027.jpg.

   Figure 27. General population model: Total cost of applying technology to 1,000 ED patients per appropriate triage for one patient with ACI at 23 percent prevalence of ACI

Numbers in bars represent the cost-effectiveness ratio.

* Artificial neural network applied in general ED population.

ACI-TIPI: at cost of an ECG ($68); ACI-TIPI $0: ACI-TIPI at cost of $0.

Cost-effectiveness rankings of the technologies are shown in Figure 27.

Sensitivity Analysis
Triage accuracy and cost-effectiveness for AMI only

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F028.jpg.

   Figure 28. General population model: Technology rankings by percent inappropriate triage (ED discharge) of patients with AMI at AMI prevalence of 8 percent

ANN: Artificial neural network applied in general ED population.

Because the estimates for technology sensitivity for UAP are based on sparse data, we evaluated the triage accuracy and cost-effectiveness of technologies for appropriate triage for patients with AMI only. Figure 28 shows the triage accuracy rankings of technologies for patients with AMI. There are few but important differences in triage accuracy for AMI: (1) the Goldman chest pain protocol improves significantly, (2) serial CK-MB improves slightly, and (3) the combination of troponin T-echocardiography is slightly more accurate than ACI-TIPI (a difference of one patient with AMI appropriately triaged).

Table 73. General population model: Cost-effectiveness of technologies for appropriate triage for patients with AMI
TechnologyCost 1Marginal costAMI triage 2Marginal effectivenessCE 3Marginal CE 4AMI triage (%) 5
CK-MB single$1,7280.03640$47,42344
Troponin I$1,754$260.029000.0075$60,509(Dominated)35
Troponin T initial$1,769$420.043900.0075$40,315$5,56853
Myoglobin initial$1,787$180.041400.0025$43,168(Dominated)50
CK-MB serial$1,818$490.072900.0290$24,956$1,69788
Troponin T serial$1,856$370.072900.0000$25,467(Dominated)88
Neural network$1,869$500.078700.0058$23,754$8,64195
Nonstandard ECG$1,892$230.058000.0207$32,636(Dominated)70
Goldman$1,903$340.073700.0050$25,823(Dominated)89
Myoglobin doubling$1,912$430.072000.0066$26,536(Dominated)87
CK-MB & myoglobin$1,912$430.058000.0207$32,982(Dominated)70
Troponin T & ECHO$2,322$4540.082000.0033$28,326$136,92599
ECHO rest$2,402$800.077000.0050$31,198(Dominated)93
ACI-TIPI$2,417$950.081100.0008$29,782(Dominated)98
1

Total cost of applying technology to one ED patient with possible ACI (includes costs of technology, hospital admission, and/or return ED visits and/or outpatient followup).

2

"Effectiveness" value: the probability of appropriate triage for AMI for an ED patient at AMI prevalence of 8 percent.

3

Cost-effectiveness ratio: total cost of applying technology to an ED patient with possible ACI divided by the probability of appropriate triage for a patient with AMI. Calculated values may be different due to rounding.

4

Difference in costs divided by difference in effectiveness between a strategy and the next nondominated most effective strategy.

5

Percent appropriate triage for patients with ACI.

Table 73 shows the cost-effectiveness and marginal cost-effectiveness for the technologies. The combination of troponin T-echocardiography is the most cost-effective, followed by the artificial neural network and serial CK-MB. The incremental CE between the two most cost-effective technologies is much larger than that in the general ACI model: approximately $137,000 per additional appropriately triaged patient with AMI. The incremental CE between troponin T-echocardiography and serial CK-MB, the third most cost-effective strategy, is approximately $56,000 per additional appropriately triaged patient with AMI, less than half that between troponin T-echocardiography and artificial neural network. Troponin T-echocardiography results in nine additional appropriate triages for patients with AMI compared with CK-MB, but only three additional appropriate triages for patients with AMI compared with the neural network.

Prevalence of ACI

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F029.jpg.

   Figure 29. General population model: Cost per appropriate triage for patients with ACI as prevalence of ACI changes

Shown are the five most effective technologies. Reference lines show range of ACI prevalence most commonly encountered in ED (17-25 percent).

To determine the effect of varying ACI prevalence, sensitivity analysis on the prevalence of ACI was performed. As the prevalence of ACI increases, the total cost per patient of applying a technology increases, as expected. The value associated with effectiveness (appropriate triage for ACI) also increases because as prevalence increases, the proportion of patients with ACI increases, increasing the positive predictive value of a test. The CE of the various strategies decreases in an exponential fashion as shown in Figure 29. As prevalence increases, the differences in cost effectiveness of the technologies become smaller because costs do not increase as much as triage accuracy. However, there is little change in relative cost effectiveness of the technologies: At a prevalence as high as 90 percent, ACI-TIPI retains its dominant cost-effectiveness, followed by troponin T-echocardiography and the artificial neural network.

Subgroup Model

This model estimates the costs and effectiveness of applying technologies to a population of ED patients with signs and symptoms of ACI who have a nondiagnostic initial ECG. The prevalence of ACI in this population is 21 percent; 6 percent of patients have AMI. All technologies can be applied to this population, and thus all imaging studies, stress testing, and serial ECG are included.

Triage Accuracy

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F030.jpg.

   Figure 30. Subgroup model: Technology rankings by percent inappropriate triage (ED discharge) of patients with ACI at ACI prevalence of 21 percent

ANN: artificial neural network applied to patients with AMI excluded by ECG.

Figure 30 shows the triage accuracy of the technologies applied in the Subgroup Model. The biomarkers have poor triage accuracy. Even serial testing fails to identify over half the patients with ACI. The Goldman protocol, artificial neural network, and continuous and serial ECG all have similar triage accuracy for ACI, between that of the single and serial or combination biomarkers. The combination technologies have better triage accuracy than the biomarkers and ECG-based technologies, with the exception of stress ECG. Of the combinations, troponin T-echocardiography has the best triage accuracy for ACI.

Sestamibi imaging (both resting and stress) has the best diagnostic performance among the technologies. Sestamibi stress imaging has the best diagnostic performance (detects 82 percent of patients with ACI), followed by sestamibi rest scanning and exercise ECG. If data from clinical impact studies are included, ACI-TIPI has the best triage accuracy (97 percent of patients with ACI appropriately triaged).

Cost-Effectiveness Analysis

Table 74. Subgroup model: Cost-effectiveness of technologies for appropriate triage of patients with ACI
TechnologyCost 1Marginal costACI triage 2Marginal ACI triageCE 3Marginal CE 4ACI triage (%) 5
CK-MB single$1,685.10.03$56,58914
Troponin I$1,713.9$290.0510.021$33,517$1,34924
Troponin T initial$1,722.8$90.0620.011$27,743$80730
Myoglobin initial$1,743.3$210.038−0.024$45,990Dominated18
CK-MB serial$1,759.2$370.061−0.001$28,817(Dominated)29
Neural network$1,786.0$630.0650.003$27,585$23,88331
Troponin T serial$1,796.0$100.0830.019$21,531$53640
Nonstandard ECG$1,841.2$450.072−0.011$25,414(Dominated)34
Goldman$1,845.9$500.054−0.029$34,056(Dominated)26
Myoglobin doubling$1,853.8$580.083−0.001$22,389(Dominated)40
CK-MB & myoglobin$1,860.4$640.0870.004$21,296$16,32441
Cont/serial ECG$1,980.0$1200.054−0.033$36,545(Dominated)26
Myoglobin & ANN 6$2,103.3$2430.110.023$19,088$10,64052
CK-MB & serial ECG$2,194.4$910.1230.012$17,890$7,30559
Troponin T & ECHO$2,259.5$650.1350.012$16,757$5,34564
ECHO rest$2,344.3$850.109−0.026$21,542(Dominated)52
ACI-TIPI$2,363.9$1040.2040.069$11,570$1,50297
Stress ECG$2,724.3$3600.15−0.055$18,192(Dominated)71
Sestamibi stress$2,731.8$3680.172−0.032$15,872(Dominated)82
ECHO stress$2,758.1$3940.131−0.073$21,024(Dominated)62
Sestamibi rest$2,816.0$4520.168−0.036$16,716(Dominated)80
1

Total cost of applying technology to one ED patient with possible ACI (includes cost of technology, admission and/or return ED visits and/or outpatient followup).

2

"Effectiveness" value: the probability of appropriate triage for ACI for an ED patient at ACI prevalence of 21 percent.

3

Cost-effectiveness ratio: total cost of applying technology to an ED patient with possible ACI divided by the probability of appropriate triage for a patient with ACI. Calculated values may be different due to rounding.

4

Marginal cost-effectiveness: ratio of the difference in costs to difference in effectiveness between two adjacent nondominated technologies.

5

Percent appropriate triage for patients with ACI.

6

Artificial neural network applied in subgroup ED patient population.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F031.jpg.

   Figure 31. Subgroup model: Cost of applying technology to 1,000 ED patients and number of patients appropriately triaged for ACI at 21 percent prevalence of ACI (reference line)

A: ACI-TIPI; AN: artificial neural network; CK: CK-MB, single; CE: continuous/serial ECG; CS: CK-MB, serial; ER: resting ECHO; ES: stress ECHO; ET: exercise ECG; G: Goldman chest pain protocol; I: troponin I; M: myoglobin initial; MD: myoglobin doubling; NS: nonstandard ECG; SES: stress sestamibi; SR: resting sestamibi; T: troponin T, initial; TS: troponin T serial; 1: CPK-MB and myoglobin; 2: myoglobin and artificial neural network; 3: CK-MB and serial ECG; 4: troponin T and ECHO.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F032.jpg.

   Figure 32. Subgroup model: Total cost of applying technology to 1,000 ED patients per appropriate triage for one patient with ACI

ANN: artificial neural network applied in ED patients with AMI excluded by ECG.

The results of the cost-effectiveness analysis for the Subgroup Model are shown in Table 74. The costs and effectiveness of each technology are shown in Figure 31. The CE rankings are similar to those in the General Population Model and are shown in Figure 32.

In the Subgroup Model, ACI-TIPI is still the most cost-effective technology if clinical impact data are included in the decision analysis: It has the greatest triage accuracy for ACI at a cost that is lower than the next most effective technology, stress sestamibi imaging. If only diagnostic performance data are used, stress sestamibi imaging is the most cost-effective, followed by troponin T-echocardiography. Stress sestamibi imaging costs about $470 more per ED patient but leads to appropriate triage for an additional 37 patients with ACI compared with troponin T-echocardiography. The incremental CE is $12,757, similar to the incremental CE between the two most cost-effective strategies in the General Population Model.

The per ED patient cost of sestamibi imaging is over $2,700, approximately $400 more than that of ACI-TIPI. The incremental CE between ACI-TIPI and the combination of troponin T-echocardiography is only $1,502 per additional appropriate triage for a patient with ACI, a negligible increase for improved triage accuracy.

Sensitivity Analysis
Triage accuracy and cost-effectiveness for AMI only

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F033.jpg.

   Figure 33. Subgroup model: Technology rankings by percent inappropriate triage (ED discharge) of patients with AMI at AMI prevalence of 6 percent

The triage accuracy of technologies for patients with AMI is shown in Figure 33. The rankings for the biomarkers are similar to those in the General Population Model. The biomarkers do not perform as well as the Goldman protocol or the imaging studies. The triage accuracy of the Goldman protocol for patients with AMI is significantly better than that for patients with ACI. Exercise ECG, stress imaging, and the combination of troponin T-echocardiography have the highest triage accuracy (99 percent). ACI-TIPI also has excellent triage accuracy (98 percent).

Table 75. Subgroup model: Cost-effectiveness of technologies for appropriate triage for patients with AMI
TechnologyCost 1Marginal costAMI triage 2Marginal effectivenessCE 3Marginal CE 4AMI triage (%) 5
CK-MB single$1,6850.0268$62,88644
Troponin I$1,714$290.0213−0.00550$80,408(Dominated)35
Troponin T initial$1,723$380.03230.00550$53,374$6,87153
Myoglobin initial$1,743$210.0304−0.00180$57,250(Dominated)50
CK-MB serial$1,759$370.05360.02130$32,826$1,71088
Neural network$1,786$270.0304−0.02310$58,652(Dominated)50
Troponin T serial$1,796$370.05360.00000$33,512(Dominated)88
Nonstandard ECG$1,841$820.0426−0.01100$43,191(Dominated)70
Goldman$1,846$870.05420.00060$34,056$142,32989
Myoglobin doubling$1,854$80.0530−0.00120$34,989(Dominated)87
CK-MB & myoglobin$1,860$150.0426−0.01160$43,641(Dominated)70
Cont/serial ECG$1,980$1340.0244−0.02980$81,281(Dominated)40
Myoglobin & ANN$2,103$2570.0505−0.00370$41,611(Dominated)83
CK-MB & serial ECG$2,194$3490.0481−0.00610$45,612(Dominated)79
Troponin T & ECHO$2,260$4140.06030.00610$37,477$67,92299
ECHO rest$2,344$850.0566−0.00370$41,392(Dominated)93
ACI-TIPI$2,364$1040.0597−0.00060$39,608(Dominated)98
Stress ECG$2,724$4650.06030.00000$45,185(Dominated)99
Sestamibi stress$2,732$4720.06030.00000$45,310(Dominated)99
ECHO stress$2,758$4990.0566−0.00370$48,698(Dominated)93
Sestamibi rest$2,816$5570.0566−0.00370$49,720(Dominated)93
1

Total cost of applying technology to one ED patient with possible ACI (includes cost of technology, hospital admission and/or return ED visits and/or outpatient followup).

2

"Effectiveness" value: the probability of appropriate triage for AMI for an ED patient at AMI prevalence of 6 percent.

3

Cost-effectiveness ratio: total cost of applying technology to an ED patient with possible ACI divided by the probability of appropriate triage for a patient with AMI. Calculated values may be different due to rounding.

4

Difference in costs divided by difference in effectiveness between a strategy and the next nondominated most effective strategy.

5

Percent appropriate triage for patients with AMI.

Results of the cost-effectiveness for triage of AMI patients only are shown in Table 75. The combination of troponin T-echocardiography is the most cost-effective: At a cost of $2,260 per ED patient, it is less expensive than all the imaging technologies as well as ACI-TIPI and appropriately triages nearly 99 percent of patients with AMI. Exercise ECG and stress sestamibi imaging also have 99 percent sensitivity for patients with AMI; however, the per ED patient costs of these two technologies is about $500 more than that of troponin T-echocardiography. Although the triage accuracy of ACI-TIPI is nearly identical to that of troponin T-echocardiography (98 percent), the per ED patient cost is about $100 more and its CE is $2,000 more per accurate triage for a patient with ACI. When the analysis was performed with cost of ACI-TIPI at $0, the cost-effectiveness of ACI-TIPI and the combination of troponin T-echocardiography are essentially equivalent as their total costs and effectiveness values are nearly identical.

The next most cost-effective strategy, the Goldman protocol, costs $414 less per ED patient but leads to appropriate triage of six fewer patients with AMI compared with troponin T-echocardiography. This yields an incremental cost-effectiveness of nearly $68,000, half that of the two most cost-effective strategies in the General Population Model.

Prevalence of ACI

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F034.jpg.

   Figure 34. Subgroup model: Cost per appropriate triage for patients with ACI as prevalence of ACI changes

Dominated strategies excluded (except for resting sestamibi imaging which is shown for comparison with stress sestamibi imaging. ACI-TIPI is shown in the General Population Model).

Reference lines show range of ACI prevalence most commonly encountered in the ED (17-25 percent).

As in the General Population Model, as prevalence of ACI increases, the cost-effectiveness of the technologies decreases. There is little change in the relative CE among the technologies. Sestamibi stress and sestamibi rest imaging become equally cost-effective at an ACI prevalence greater than 35 percent (Figure 34).

30-Day Survival Analysis

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F035.jpg.

   Figure 35. Subgroup model: Percent survival of patients with ACI

Reference line shows optimal survival rate if all patients with ACI appropriately triaged (none inappropriately discharged from the ED).

* Includes admitted patients who died.

The decision analysis was also performed using 30-day survival as an outcome. The effectiveness values are the number of patients with ACI who survived for 30 days; total costs are for 1,000 ED patients evaluated in the ED. There is little change in the rankings of the technologies using 30-day survival as an outcome, shown in Figure 35, compared with triage accuracy rankings in both the General Population Model and Subgroup Model. The figure gives the percentage of survival for a patient with ACI for each technology. The optimal survival rate of those with ACI would be the survival rate of admitted patients (because all patients with ACI would be appropriately hospitalized), which is 95.4 percent given the 89-percent survival rate of those with AMI and 98-percent survival rate for those with UAP.

In the Subgroup Model, ACI-TIPI has the highest 30-day survival rate, followed by sestamibi imaging (both rest and stress), stress ECG, and then three strategies with the same survival rate: stress echocardiography, and the combinations CK-MB -- serial ECG and troponin T-echocardiography. Differences in survival rates are very small: a difference of one patient with ACI surviving to 30 days between each subsequent ranking among all the technologies.

The most cost-effective strategy in both the General Population Model and the Subgroup Model is ACI-TIPI. In the General Population Model, the marginal cost-effectiveness between ACI-TIPI and the next most effective strategy, the combination of troponin T-echocardiography, is $53,700 per survival of an additional patient with ACI. In the Subgroup Model, the marginal cost-effectiveness between ACI-TIPI and the next most cost-effective strategy, the combination of CK-MB and serial ECG, is $125,563 per survival of an additional patient with ACI. The marginal CE between the next more effective and less costly strategy, the combination of myoglobin-artificial neural network, is $241,765.

When the analysis was repeated at a cost of $0 for ACI-TIPI, it dominated all other strategies.

Sensitivity analysis was also performed on cost of death of a patient inappropriately discharged from the ED. As the cost of death from "missed" ACI increased from a low of $600 for a return ED visit and resuscitation attempt to the cost of a malpractice settlement ($2 million), ACI-TIPI retained its cost-effectiveness. At very high "cost-of-death" values, it dominated all other strategies.

Cost of ACI-TIPI

If clinical impact data are included in the decision analyses, ACI-TIPI is the most effective and most cost-effective strategy for diagnosing patients with ACI in both population models. It also has a small incremental CE (about $1,500) compared with the next most cost-effective technology. These analyses were performed using a "conservative" cost for ACI-TIPI: that of an ECG ($68). Reducing the cost of ACI-TIPI to $0 only increases its cost-effectiveness and lowers its incremental cost-effectiveness even more (to about $400 in the General Population Model).

Because not every ED has ECG machines in which ACI-TIPI has been incorporated, hospitals may have to purchase an ECG with the predictive instrument. (Hewlett Packard has incorporated ACI-TIPI into all ECG machines beginning in 1998 [Joni Beshansky, personal communication]. GE Marquette will release ECG machines with ACI-TIPI beginning in January 2000 [Paul Elko, GE Marquette, personal communication]). We therefore performed analyses to determine how much more ACI-TIPI would have to cost for it to be no longer cost-effective.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f3663_F036.jpg.

   Figure 36. Total cost of applying technology to 1,000 ED patients per appropriate ACI triage and cost of ACI-TIPI

* Artificial neural network and combination of troponin T and ECHO applied in General Population Model.

Figure 36 shows the relative costs per appropriate ACI triage as ACI-TIPI increases in cost. For ACI-TIPI to no longer be the most cost-effective strategy, it would have to cost approximately $1,000 per patient use. The addition of the predictive instrument adds $800 to $1,000 to the total retail cost of an ECG machine (Paul Elko, GE Marquette, personal communication). Thus, the per patient cost of using an ECG machine that has the predictive instrument would be considerably lower than $1,000, and it is inconceivable that the cost per patient use of ACI-TIPI would ever come close to $1,000.

Discussion

The results of the decision analysis indicate that the biomarkers have the lowest triage accuracy and cost-effectiveness for appropriate triage of patients with ACI in the ED, primarily because their diagnostic performance in patients with UAP is poor. The ECG-based technologies, algorithms, and combinations of technologies perform better. Sestamibi imaging and stress tests have excellent diagnostic accuracy for ACI but are more costly than the other technologies and cannot be performed on all ED patients. Although sestamibi imaging, exercise ECG, and the combination of troponin T-echocardiography are effective for detecting ACI, and very effective for detection of AMI, their effects on the triage of patients with ACI have not been prospectively studied.

If data from clinical impact studies are included in the cost-effectiveness analysis, ACI-TIPI has the best triage accuracy for ACI, is the most cost-effective, and has a lower marginal CE than other technologies in both models. Furthermore, the predictive instrument's effect on ED patient triage in patients with ACI has been evaluated in a large clinical trial.

ACI-TIPI is also the most cost-effective strategy in the 30-day survival analysis. Interestingly, technologies that detect UAP at a much lower rate than that for AMI have lower rates for survival than would be expected given their AMI triage accuracy. This is likely because of the large difference in death rates between UAP and AMI. The survival analysis needs to be refined so that technologies that detect AMI are not penalized.

The prevalence of UAP used in the decision analysis is double that of AMI, making the model more sensitive to diagnostic performance in patients with UAP. Unfortunately the data on the diagnostic performance of technologies for UAP are very sparse. As ACI-TIPI has very high triage accuracy for unstable angina, we attempted to be fairly generous in our estimates of diagnostic performance of technologies for patients with UAP.

We did attempt to mitigate the effect of unknown values for UAP by analyzing the triage accuracy and cost-effectiveness for patients with AMI only, as the values for technology diagnostic performance for patients with AMI are based on a greater number of studies than those for patients with UAP. The most cost-effective strategy for AMI triage in both models is the combination troponin T-echocardiography; its marginal cost-effectiveness, however, is considerably greater than that of ACI-TIPI for ACI triage ($68,000 for the Subgroup Model and $137,000 for General Population Model vs. about $1,500 for ACI-TIPI). The artificial neural network in the General Population Model and the Goldman protocol in the Subgroup Model are the next most cost-effective strategies. Although evaluating only AMI patients is an artificial construct, since the ED physician does not know a priori which patients have AMI or UAP, this analysis does not rely on uncertain and unverifiable estimates for detection of UAP.

The results of this decision analysis do not explicitly incorporate the positive predictive value of a technology into the triage outcome for a patient. The predictive value of a test result may be a better estimate of triage accuracy and outcomes than test diagnostic performance, since ED physicians rarely base their decision to admit a patient solely on the results of a test. However, attempting to estimate the positive predictive value adds still another layer of uncertainty to the model estimates. Furthermore, by allowing for changes in the ED prevalence levels for ACI, the decision model does incorporate effects of pretest likelihood of ACI. For example, a 17-percent prevalence may indicate a 45-year-old man with no symptoms but some risk factors, whereas a 90-percent prevalence may indicate a 45-year-old man with typical angina (Patterson, Eng, Horowitz, et al., 1984).

The model also does not take into account the lower severity of disease in patients with ACI who have false negative test results. Certain technologies, especially the imaging studies, may be able to pick up the "sickest" ACI patients, and this may be reflected in different death rates for "missed" ACI among the technologies. An attempt to perform this analysis with our model, however, did not capture the negative effects of inappropriate triage of patients with ACI, since the death rate of admitted patients with AMI (presumably the sickest) is the same as that for patients inappropriately discharged from the ED. This is because, although the patients with false negative tests have less severe ACI than patients with true positive tests, their death rates are disproportionately higher than if they had been admitted. The analysis may need to be done separately for AMI and UAP in order to incorporate the different death rates and reflect the benefit of appropriate triage for ACI patients.

The results of the decision and cost-effectiveness analyses should not be used as a definitive analysis of technology triage accuracy as data on actual effect on triage are lacking for most of the technologies. Furthermore, the values for sensitivity of technologies for patients with UAP are estimates based on sparse data, which adds to the uncertainty of the model. The decision analysis is also not meant to be used for clinical recommendations for individual patients as pretest likelihoods are not explicitly modeled. Rather, these results should be used as an aid in decisionmaking and in understanding the factors that are involved in triage of patients with ACI in the ED. Prospective trials on the effect of technologies on actual ED patient triage are required before definitive conclusions can be made.

Appendices

Appendix A: Data Abstraction Forms

Diagnostic Tests for ACI in the Emergency Department Initial Article Screen Form

Instructions: Circle or fill-in where appropriate. Only one answer except: * - multiple answers possible, graphic element - answer only if required

Date of Screen: _________________

Author: _____________________________________________________________ UI: _________________

Journal: _____________________________________________________________ Volume _____Year _______

Data Extractor:PC JPI JL _____

1.Acute Cardiac Ischemia (meets inclusion criteria)?YesNoUnclear(Stop if No)
2.Excluded (by exclusion criteria)?eXcludednOt Excluded(Stop if Excluded)
4.Article Type:Patient Outcome StudyreViewOther_______________
*3.Study Question:
DiagnosisClinical ImpactDA/CEAOther__________________(Stop if Other)
graphic element5.If Clinical Impact study-Study Type: RCTProsp CohortreTrosp CohortCase-controlCase-Series
graphic element6.If Diagnosis study:Comparative StudyOther ____________________________________
*7.Setting:
PreHospitalEDCP Eval UnitCCUHospital(not CCU)No DataOther_____________________
*8a. Technology Type:
  1. Standard ECG

  2. Continuous/Serial 12-lead ECG

  3. Prehospital ECG

  4. Nonstandard ECG: # leads ______

  5. ECG stress test

  6. Echocardiogram

  7. Creatine Kinase

  8. Myoglobin

  9. Troponin-T and troponin-I

  10. Sestamibi

  11. Computer-based decision aids

  12. Original ACI predictive instrument

  13. TIPI

  14. Goldman chest pain protocol

  15. Other_____________________________

* 8b. Compared with:
  1. Standard ECG

  2. Continuous/Serial 12-lead ECG

  3. Prehospital ECG

  4. Nonstandard ECG: # leads ______

  5. ECG stress test

  6. Echocardiogram

  7. Creatine Kinase

  8. Myoglobin

  9. Troponin-T and troponin-I

  10. Sestamibi

  11. Computer-based decision aids

  12. Original ACI predictive instrument

  13. TIPI

  14. Goldman chest pain protocol

  15. Other_____________________________

9.Treatment Study: Intervention: _____________________________

Diagnostic Tests for ACI in the Emergency Department

Instructions: Circle or fill-in where appropriate. Only one answer except where indicated by an*

Reviewer: EB PC JI CM _________
Key: n.A.-not applicable n.D.-not described *-one or more answers TEST-test under investigation
Review date: ____/_____/____
UI:____________________
First author:_______________________

Study Characteristic

Number of Study Arms:_____

*Country: US UK Can Eur Asia Other _____________n.D.

Number of Centers: _______ n.D.

*Hospital type(s):

UrbanSuburbanRural
CommunityTertiary
Other_____________________

*Funding source:

Government Pharmaceutical pRivate Unfunded n.D.

*Study Setting:

Pre-Hospital/pre-EDEDCP Eval Unit
CCUHospital (Not CCU)n.D.
oTher_____________________________________

Study Patients Characteristics

*Age Group (Overall recruitment)

Adult(18-65) Elderly (>65) Other ________________

Mean Age: ______

Age Range: _____ - _____

% Male: _____

Race:

Dates of Study: _____________________________________

*Comments: _______________________________________
__________________________________________________
__________________________________________________

Verification of Acute Cardiac Ischemia

*Inclusion criteria presenting:

Signs/Symptoms:

Chest pain

_________________________________

_________________________________

_________________________________

_________________________________

_________________________________

_________________________________

Past history of ACI allowed?YesNon.D.
Past history of AMI allowed?YesNon.D.
Past history of known CAD allowed?YesNon.D.
Other inclusion criteria: _________________________________ _____________________________________________ _____________________________________________

*What were protocol's exclusion criteria:

ECG Diagnosis of AMI

Other: _______________________________________
_____________________________________________
_____________________________________________
_____________________________________________

*Comments: _______________________________________
__________________________________________________
__________________________________________________

DIAGNOSIS Tests/Characteristics

Changes in protocol during study: _______________________ ___________________________________________________ ___________________________________________________ __________________________________________________Comments on diagnostic methods: ____________________ ________________________________________________ ________________________________________________ ________________________________________________

ROC data reported? Y es No

AUC: _________

Subgroup data available:Y es No

If YES, list subgroups:

___________________ ___________________ ___________________ ___________________ ______________________________________ ___________________ ___________________ ___________________ ___________________

Total number of patients per group: Reference Group: _________ TEST Group: _________Number of withdrawals by group: Reference Group: _________ TEST Group: _________

Reason for withdrawal:
Reference Group: ___________________________________________________________________________
TEST Group: _______________________________________________________________________________

Codes for Diagnostic Tests:

A Standard ECG

B Continuous/Serial 12-lead ECG

C Pre-hospital ECG

D Non-standard ECG: # leads __________________

E ECG stress test

F Echocardiogram

G Creatine Kinase

H Myoglobin

I Troponin-T and/or Troponin-I

J Sestamibi

K Computer-based decision aids

L Original ACI predictive instrument

M TIPI

N Goldman chest pain protocol

O Other: _______________________

Test Results

Use above codes to fill in 2 x 2 table. Use multiple tables for multiple TEST cut-off values, if applicable.

Test for (Circle one): AMI ACIReference Test PositiveReference Test Negative
TEST Positive(TP)(FP)
TEST Negative(FN)(TN)

Comment: _________________________________
_____________________________________________
_____________________________________________
_____________________________________________

Test for (Circle one): AMI ACIReference Test PositiveReference Test Negative
TEST Positive(TP)(FP)
TEST Negative(FN)(TN)

Comment: _________________________________
_____________________________________________
_____________________________________________
_____________________________________________

Test for (Circle one): AMI ACIReference Test PositiveReference Test Negative
TEST Positive(TP)(FP)
TEST Negative(FN)(TN)

Comment: _________________________________
_____________________________________________
_____________________________________________
_____________________________________________

Test for (Circle one): AMI ACIReference Test PositiveReference Test Negative
TEST Positive(TP)(FP)
TEST Negative(FN)(TN)

Comment: _________________________________
_____________________________________________
_____________________________________________
_____________________________________________

Test for (Circle one): AMI ACIReference Test PositiveReference Test Negative
TEST Positive(TP)(FP)
TEST Negative(FN)(TN)

Comment: _________________________________
_____________________________________________
_____________________________________________
_____________________________________________

Test for (Circle one): AMI ACIReference Test PositiveReference Test Negative
TEST Positive(TP)(FP)
TEST Negative(FN)(TN)

Comment: _________________________________
_____________________________________________
_____________________________________________
_____________________________________________

Quality Assessment of Evaluations of Diagnostic Test Performance for ACI/AMI

Reviewer: EB PC JI CM ________ Review date: ______________ UI:____________________

First author:_______________________ Title:____________________________________________

Journal: ________________________________________________________________ Year: ________

TestYesNo
1.Were the technicalities of the TEST adequately described?43210
2.Were the TEST criteria of normality defined (positive/negative)?43210
Description of TEST ____________________________________________________________________
Definition of TEST graphic element: ___________________________________________________________________
Reference Standard
3.Was the Reference Standard stated?YN
4.Description of Reference Stnd? ________________________________________________________
Definition of Reference Stnd graphic element: ________________________________________________________
(Was the appropriate Reference Standard used?43210
5.Were the Reference Standard criteria of normality defined?43210
Study Population
6.Was the study population appropriate for evaluating ED use?43210
7.Were individuals with and without disease included in the evaluation?43210
8.Were the inclusion/exclusion criteria stated?43210
9.Was the patient selection process described?43210
10.Was a wide spectrum of diseased patients included?43210
11.Were the characteristics of the patients described?43210
12.Was the source of the patients described?43210
13.Were positive and negative test results verified equally?43210
14.If not, was there random sampling of individuals with negative tests for verification?43210
Performance of tests and Interpretation of test results
15.Were tests being directly compared (TEST vs. Reference Standard)?YN
16.Was the TEST repeated blindly and randomly?YN
17.If appropriate, was order of tests (TEST/Ref. Standard) randomized?YNn.a.
18.Regarding the interpretation of the TEST:
a. Was the interpreter blinded to the results of the Reference Standard?YN
b. Was the interpreter blinded to clinical information?YN
c. Was blinded interpretation of TEST performed in duplicate?YN
19.Regarding the interpretation of Reference Standard:
a. Was the interpreter blinded to the result of the TEST?YN
b. Was the interpreter blinded to clinical information?YN
c. Was blinded interpretation of Ref. Standard performed in duplicate?YN
Statistical Analysis and Reporting
20.Were the data reported in sufficient detail so that the reported results may be replicated (and meta-analysis performed)?43210
21.Adequate description of patients excluded from all analyses (drop-outs, protocol non-compliance, etc)?43210
22.Were the sensitivity and specificity of the TEST reported?YN
23.Was the TEST precision (confidence interval, SE) reported?YN
24.If appropriate, were different criteria used to establish an ROC curve?YN
Comments:

Quality Assessment of RCTs on Clinical Impact of Diagnostic Test on care for ACI/AMI

Reviewer: EB PC JI CM _____________ Review date: ___________________

First author: ____________________________ Title:___________________________________________

Journal: ________________________________ Year:__________ UI:___________________________

Study PopulationYesNo
1.Was the study population appropriate for evaluating the ED use?43210
2.Were individuals with and without disease included in the evaluation?43210
3.Were the inclusion/exclusion criteria stated?43210
4.Was the patient selection process adequately described?43210
5.Was a wide spectrum of diseased patients included?43210
6.Were the characteristics of the patients adeqiately described?43210
7.Adequate description of eligible but not enrolled patients?43210
Randomization
8.Method of the randomization? Central Envelope Other Unknown
9.Adequate blinding of randomization?43210
10.Was randomization achieved (balanced groups)?43210
Tests compared
11.Were the tests being compared adequately described?43210
12.a. Could the test assignment be blinded to the patient?43210
b. If blindable, were the test assignments blinded?43210
13.a. Could the test assignment be blinded to the observer?43210
b. If blindable, were the test assignments blinded?43210
14.Did both tests receive the same clinical information in interpretation the results?43210
15.If test assignments were unblinded, were patients treated equivalently for the same test diagnosis?43210
16.Was a test for differences in treatment pattern between test group performed?43210
Outcomes
17.Were all relevant outcomes reported?43210
18.Were the outcome measures clearly defined?43210
19.Was the observer blinded to the interim results?43210
Statistical Analysis and Reporting
20.Was there a priori estimation of sample size?YN
21.Was the analysis based on intention-to-treat?YN
22.Was there adequate reporting of compliance?43210
23.Was a stopping rule defined?YN
24.Were statistical analyses appropriate?4320
25.Were exact P values or confidence intervals reported?YN
26.Were post hoc power calculations or confidence intervals reported for non-significant results?YN
27.Were the data reported in sufficient details so that the results may be replicated (and meta-analysis performed)?43210
28.Adequate description of patients excluded from all analyses (drop-outs, protocol non-compliance, etc)?43210
Comments: _______________________________________________________________________________ _________________________________________________________________________________________

Appendix B: Acknowledgments

New England Medical Center EPC Project Staff

  • EPC/Project Director

  • Joseph Lau, MD

  • Assistant Project Director

  • John Ioannidis, MD

  • Primary Technical Expert

  • Deeb Salem, MD

  • Decision Analysis Consultant

  • John B. Wong, MD

  • NRSA Research Fellows

  • Ethan Balk, MD, MPH; Catherine Milch, MD

  • Project Manager

  • Deirdre DeVine, M Litt

  • Statistician

  • Norma Terrin, PhD

  • Research Associate

  • Priscilla Chew, MPH

  • Technical Writer

  • Thomas A. Lang, MA

  • Library Assistant

  • David Liu, BA

NHAAP Technology Working Group Technical Experts

  • Harry Selker, MD, MSPH

  • Co-chair, NHAAP TWG

  • Division of Clinical Care Research

  • New England Medical Center

  • Boston, Massachusetts

  • Robert Zalenski, MD

  • Co-chair, NHAAP TWG

  • Emergency Medicine and Internal Medicine

  • Wayne State University

  • Detroit, Michigan

  • Tom Aufderheide, MD

  • Department of Emergency Medicine

  • Medical College of Wisconsin

  • Milwaukee, Wisconsin

  • Michael Hagen, MD, FAAFP

  • Department of Family Practice

  • University of Kentucky

  • Lexington, Kentucky

  • Robert McNutt, MD

  • Department of Medicine

  • Cook County Hospital

  • Chicago, Illinois

  • Joseph Ornato, MD

  • Department of Emergency Medicine

  • Medical College of Virginia

  • Virginia Commonwealth University

  • Richmond, Virginia

Peer Reviewers

The partner organization, NHAAP, for the evidence report nominated individuals to participate in the peer review of the evidence report. Additional individuals with appropriate methodologic and clinical expertise were identified by the EPC. Review of the evidence report by these individuals does not represent endorsement of the report. We are grateful to the peer reviewers for generously offering their time and knowledge.

NHAAP Coordinating Committee-Science Base Subcommittee
  • Costas T. Lambrew, MD

  • Division of Cardiology

  • Maine Medical Center

  • Portland, Maine

NHAAP Coordinating Committee
  • James M. Atkins, MD, FACC

  • Emergency Medicine Education

  • University of Texas Southwestern Medical Center

  • Dallas, Texas

  • Representing: American College of Cardiology

  • Gottlieb Friesinger, MD

  • Division of Cardiology

  • Vanderbilt University Medical Center

  • Nashville, Tennessee

  • Representing: American College of Physicians

  • Lee Green, MD, MPH

  • Department of Family Practice

  • University of Michigan

  • Ann Arbor, Michigan

  • Representing: American Academy of Family Physicians

External Reviewers
  • Harold Sox, MD

  • Department of Medicine

  • Dartmouth Hitchcock Medical Center

  • Lebanon, New Hampshire

Appendix C: Acronyms

ACI: acute cardiac ischemia

ACI-TIPI: Acute Cardiac Ischemia Time-Insensitive Predictive Instrument

AHCPR: Agency for Health Care Policy and Research

AHRQ: Agency for Healthcare Research and Quality

AMI: acute myocardial infarction

AST: aspartate aminotransferase

CABG: coronary artery bypass graft

CAD: cardiac arterial disease

CCU: cardiac care unit

CE: cost-effectiveness

CI: confidence interval

CK: creatine kinase

CPOU: chest pain observation unit

CPU: chest pain unit

DRG: diagnosis-related group

ECG: electrocardiograph(y)

ED: emergency department

EMIP: European Myocardial Infarction Project

EMS: emergency medical service

EPC: Evidence-based Center

GREAT: Grampian Region Early Anistreplase Trial

IV: intravenous

LDH: lactate dehydrogenase

LDL: lipoprotein

MDA: malondialdehyde

MI: myocardial infarction

MITI: Myocardial Infarction Triage and Intervention

NHAAP: National Heart Attack Alert Program

NHLBI: National Heart, Lung, and Blood Institute

NIH: National Institutes of Health

NRSA: National Research Service Award

PFR: Physicians=Fee Reference

ROC: receiver operating characteristics

SD: standard deviation

SROC: summary receiver operating characteristics

Tc: technetium

UA: unstable angina

UAP: unstable angina pectoris

WHO: World Health Organization

References
Aase O, Jonsbu J, Liestol K, et al. Decision support by computer analysis of selected case history variables in the emergency room among patients with acute chest pain Eur Heart J 1993 Apr. 14:(4):43340. [PubMed].
Apple FS, Voss E, Lund L, et al. Cardiac troponin, CK-MB and myoglobin for the early detection of acute myocardial infarction and monitoring of reperfusion following thrombolytic therapy Clin Chim Acta 1995 Jun 15. 237:(1-2):5966.
Arntz HR, Stern R, Linderer T, et al. Efficiency of a physician-operated mobile intensive care unit for prehospital thrombolysis in acute myocardial infarction Am J Cardiol 1992 Aug 15. 70:(4):41720.
Aufderheide TP, Haselow WC, Hendley GE, et al. Feasibility of prehospital r-TPA therapy in chest pain patients Ann Emerg Med 1992c Apr. 21:(4):37983. [PubMed].
Aufderheide TP, Hendley GE, Thakur RK, et al. The diagnostic impact of prehospital 12-lead electrocardiography Ann Emerg Med 1990 Nov. 19:(11):12807. [PubMed].
Aufderheide TP, Hendley GE, Woo J, et al. A prospective evaluation of prehospital 12-lead ECG application in chest pain patients J Electrocardiol 1992b. 24:Suppl:813. [PubMed].
Aufderheide TP, Keelan MH, Hendley GE, et al. Milwaukee Prehospital Chest Pain Project -- phase I: Feasibility and accuracy of prehospital thrombolytic candidate selection Am J Cardiol 1992a Apr 15. 69:(12):9916.
Aufderheide TP, Rowlandson I, Lawrence SW, et al. Test of the acute cardiac ischemia time-insensitive predictive instrument (ACI-TIPI) for prehospital use Ann Emerg Med 1996 Feb. 27:(2):1938. [PubMed].
Banerjee S, Rhoden WE. Fast-tracking of myocardial infarction by paramedics J R Coll Physicians Lond 1998 Jan-Feb. 32:(1):368. [PubMed].
Barbash GI, Roth A, Hod H, et al. Improved survival but not left ventricular function with early and prehospital treatment with tissue plasminogen activator in acute myocardial infarction Am J Cardiol 1990 Aug 1. 66:(3):2616.
Baxt WG. Use of an artificial neural network for the diagnosis of myocardial infarction Ann Intern Med 1991 Dec 1. 115:(11):8438.
Baxt WG, Skora J. Prospective validation of artificial neural network trained to identify acute myocardial infarction Lancet 1996 Jan 6. 347:(8993):125.
BEPS Collaborative Group. Prehospital thrombolysis in acute myocardial infarction: The Belgian eminase prehospital study (BEPS) Eur Heart J 1991 Sep. 12:(9):9657. [PubMed].
Bertini G, Rostagno C, Taddei T, et al. Evaluation of a mobile coronary care unit protocol in patients with acute onset chest pain J Emerg Med 1991. 9:Suppl 1:5763. [PubMed].
Bossaert LL, Demey HE, Colemont LJ, et al. Prehospital thrombolytic treatment of acute myocardial infarction with anisoylated plasminogen streptokinase activator complex Crit Care Med 1988 Sep. 16:(9):82330. [PubMed].
Bouten MJ, Simoons ML, Hartman JA, et al. Prehospital thrombolysis with alteplase (rt-PA) in acute myocardial infarction Eur Heart J 1992 Jul. 13:(7):92531. [PubMed].
Brogan GX Jr, Bock JL, McCuskey CF, et al. Evaluation of cardiac STATus CK-MB/myoglobin device for rapidly ruling out acute myocardial infarction Clin Lab Med 1997 Dec. 17:(4):65568. [PubMed].
Brogan GX Jr, Friedman S, McCuskey C, et al. Evaluation of a new rapid quantitative immunoassay for serum myoglobin versus CK-MB for ruling out acute myocardial infarction in the emergency department Ann Emerg Med 1994 Oct. 24:(4):66571. [PubMed].
Brogan GX Jr, Vuori J, Friedman S, et al. Improved specificity of myoglobin plus carbonic anhydrase assay versus that of creatine kinase-MB for early diagnosis of acute myocardial infarction Ann Emerg Med 1996 Jan. 27:(1):228. [PubMed].
Brouwer MA, Martin JS, Maynard C, et al. Influence of early prehospital thrombolysis on mortality and event-free survival (the Myocardial Infarction Triage and Intervention [MITI] Randomized Trial). MITI Project Investigators Am J Cardiol 1996 Sep 1. 78:(5):497502.
Brown JL Jr. An eight-month evaluation of prehospital 12-lead electrocardiogram monitoring in Baltimore County Md Med J 1997Suppl:646.
Brush JE, Brand DA, Acampora D, et al. Use of the initial electrocardiogram to predict in-hospital complications of acute myocardial infarction N Engl J Med 1985 May 2. 312:(18):113741.
Canto JG, Rogers WJ, Bowlby LJ, et al. The prehospital electrocardiogram in acute myocardial infarction: is its full potential being realized? National Registry of Myocardial Infarction 2 Investigators J Am Coll Cardiol 1997 Mar 1. 29:(3):498505.
Castaigne AD, Herve C, Duval-Moulin AM, et al. Prehospital use of APSAC: results of a placebo-controlled study Am J Cardiol