Feedback Box
ALL technology introduces new errors, even when its sole purpose is to prevent errors.1
Information technology has been cited as a key to improving the safety of health care delivery. In its 1999 report, To Err Is Human: Building a Safer Health System, the Institute of Medicine (IOM) emphasized the need for technologies specifically engineered to prevent medical errors. Such technologies include automated order entry systems, drug-drug interaction software, and decision support systems.1 Leveraging clinical data from electronic health record systems (EHR), such technologies hold the promise of reducing errors in medical decision making that are due to inadequate information at the point of care.
| Reasons why introducing a decision support system for drug recommendations may introduce new errors into the clinical workflow: |
| • Missing data leading to recommendation of a contraindicated drug. |
| • Potential interaction of the recommended drug with another drug prescribed for the patient. |
| • Inaccuracies in program logic which could lead to erroneous recommendations. |
| • Potential harm due to rearranging clinician priorities with required use of the decision support system (e.g., invoking a decision support system for hypertension when hypertension is not a clinical priority for that visit). |
| • The clinician-user has knowledge gaps that are directly relevant to the decision support system recommendations (e.g., promoting the use of a guideline- recommended drug without simultaneously providing information about dose limits). |
| • Generation of false expectations on the part of the clinician-user that the system will alert them to all problems. |
| • Potential for data overload. |
| • Potential for incorrect recommendations on cases that the system was not designed to handle. |
An analogous challenge is faced with the approval of prescription drugs for the general public. Despite rigorous testing to establish safety and efficacy prior to drug approval, some problems are discovered only after widespread use of the drug. Phase III clinical trials are typically conducted over a short time period and may involve too few subjects to detect all adverse outcomes, especially when the events are rare. Also, the target population for use of the product expands after U.S. Food and Drug Administration approval.7 In recognition of these problems, the U.S. Food and Drug Administration Center for Drug Evaluation and Research (CDER) has a Post-Marketing Surveillance (PMS) system designed specifically to monitor and report adverse events data. CDER created MEDWatch, an Internet-based resource that includes an online submission form for health care professionals to report adverse events observed during use of medical products.8
Clinical decision support systems may undergo predeployment testing prior to introduction into the clinical workflow. However, rarely occurring problems in data sources, for example, may not be detected in predeployment testing. Miller and Gardner recognized the need for post-fielding surveillance of clinical software systems.9 Summarizing the findings of a large consortium of professional organizations, including the American Medical Informatics Association, Medical Library Association, and the Computer-based Patient Record Institute, they noted that software products may work well in isolation but face challenges when integrated into complex systems involving multiple clinical software systems and multiple vendors. Also, “raw complaints” from individual users were identified as potentially useful in monitoring and evaluating clinical software systems for potential sources of error.9 Gould and Lewis also noted the need for “design, test, measure, and redesign,” that is, the power of iterative user testing to discover and repair system problems.10
In this paper, researchers describe the method of maintaining quality assurance for a hypertension guideline system, Assessment and Treatment of Hypertension: Evidence-Based Automation Decision Support System (ATHENA DSS). Based on widely accepted national guidelines for hypertension (Joint National Committee on Prevention, Detection, Evaluation, and Treatment of High Blood Pressure [JNC] 6 and the Department of Veterans Affairs [VA]),11, 12 ATHENA DSS delivers treatment advisories to clinicians at the point of care. Built with EON technology for guideline-based decision support,13 ATHENA DSS consists of two main components: a hypertension knowledge base modeled in Protégé14 and a guideline interpreter that applies the information in the knowledge base to the clinical information retrieved from the computerized patient record system (CPRS) to create patient-specific recommendations for a patient encounter, on a visit-by-visit basis.15–17 ATHENA DSS displays advisories via an interface to the VA CPRS, a uniform EHR in patient care delivery settings. Recommendations generated by ATHENA DSS were displayed to primary care physicians during clinic visits with hypertensive patients (Figure 1
ATHENA DSS included two features to maintain quality assurance after deployment. First, the user interface of ATHENA DSS, in addition to delivering drug recommendations, included a feedback box for clinician-users to enter free text comments during point-of-care system use. Impressions of clinician-users were captured in real-time, linking comments to the specific patient scenario in which the error was observed and reducing the potential for recall biases. Second, the actions of the ATHENA DSS program, including unanticipated error conditions known as “exceptions,” were logged during program execution. A scripting program scanned these log files for exceptions and automatically e-mailed messages to the ATHENA DSS programmers for investigation of potential sources of error in program function. To monitor for any unanticipated problems not detected in predeployment testing of ATHENA DSS, researchers monitored both the real-time feedback provided by clinicians and program exceptions during point-of-care use of the system.
An example of ATHENA DSS displays an advisory pop-up window on top of the VA electronic medical record. In keeping with national guidelines for management of hypertension, the ATHENA DSS encourages use of thiazide diuretics; however, it also monitors for potential problems in their use and, for example, alerts clinicians to hypokalemia. Other parts of the system, not shown here, provide information about thiazide dosing to avoid hypokalemia. The feedback box at the bottom of the window allows clinician-users to enter free text feedback comments to the ATHENA DSS knowledge management team.
As part of a randomized trial to assess the overall effect of ATHENA DSS on choice of drug therapy and blood pressure control, recommendations were generated on a daily basis for 15 months at nine geographically dispersed clinical sites within the VA Durham, Palo Alto, and San Francisco Health Care Systems. Ninety-one primary care providers in the ATHENA arm of the study received the ATHENA Hypertension Advisory.
Clinicians were encouraged to comment about their interactions with the ATHENA DSS. Clinicians could phone or e-mail study staff with any concerns or questions about the program. Clinicians were also given the option to enter feedback during the viewing of the ATHENA Hypertension Advisory. Feedback comments submitted by clinician-users were initially logged into a Microsoft SQL database. These comments were subsequently imported into a Microsoft Access database to facilitate data analysis and monitoring by the ATHENA DSS team. Typically within 1 week of receipt of feedback, a member of the research staff reviewed the comments to identify any indication of possible error in the program.
The feedback comments were later analyzed using a qualitative research approach. An ATHENA team member met with the project Principal Investigator to review all comments and classify the principal idea(s) expressed. New categories were created as needed. After all comments had been reviewed, the entire set of comments was reviewed again to classify them into the final set of categories. At least two members of the research team reviewed each comment. Any classification questions were resolved by consensus.
Text log files of ATHENA DSS program actions were also monitored for potential errors. During the generation of ATHENA DSS recommendations, messages describing the actions of the program were recorded in text files, including any program exceptions that occurred. A Perl language text searching script extracted and compiled exception messages into a file that was sent to the ATHENA DSS team for daily analysis.
ATHENA DSS displayed advisories for 19,859 clinic visits and 10,806 distinct patients during the study period. A total of 835 free text feedback comments were submitted by 44 of the 91 (48.4 percent) study clinicians via the feedback window. A median of 8.5 comments per clinician-respondent were received during the study period (range 1–140).
Free text comments were investigated as potential sources of error in the program or its data sources. In most cases, clinician reports were false positives for error. For example, a diagnosis questioned by the clinician as “inaccurate” could be found in the EHR, although apparently unknown to the clinician. Similarly, “missing” adverse drug reactions noted by clinicians were confirmed by chart review as not documented in the EHR.
| Problem | Number of comments | Cause of problem |
|---|---|---|
| Active medications omitted from the prescription list | 19 | Method of updating a VISTA* medication status led to labeling active medications as inactive. |
| Incorrect adverse drug reaction (ADR) notification | 1 | Error in the data conversion, leading to the incorrect labeling of an ADR. |
| Omission of diagnoses (ICD-9 codes) | 2 | Programming stop code introduced into the extraction routine led to dropping of ICD-9 codes. |
| Expired medications appeared as active in the prescription list | 1 | VISTA* medication status flag led to inaccurate labeling of expired medications as active. |
Veterans Health Information Systems and Technology Architecture
Clinician feedback also identified areas where recommendations from ATHENA DSS benefited from additional clarification. For example, one provider commented that a display about specific drug contraindications was more beneficial than a general alert about the presence of a drug contraindication, so this change was implemented to improve the system.
Clinicians rarely contacted study staff by e-mail or phone about program problems. In the few cases in which they did, it was to report a technical malfunction (such as failure of any recommendations to appear, as would happen if the network connection was lost).
Tracking of ATHENA DSS program exceptions provided additional insight. In general, such tracking revealed very few program exceptions in ATHENA DSS function. Almost all of the exceptions reported a technical glitch not related to ATHENA DSS. Changes in drug formulary were forecasted and therefore a tracking system was implemented to alert the ATHENA team to new antihypertensives in the VA pharmacy formulary. As an example, the angiotensin-receptor blocker irbesartan was added to the formulary during the course of the trial. Thus, tracking of program exceptions facilitated maintenance of drug databases necessary for ATHENA DSS to make proper drug recommendations.
There were no additional programming, logic, or treatment errors identified by study providers. In all of the above examples, ongoing surveillance of system performance ensured the continued delivery of accurate decision support recommendations.
Failure to maintain a commitment to quality assurance has historically led to some devastating results. A presidential commission on the NASA space shuttle Challenger accident concluded that “a disproportionate reduction may have occurred in the safety, reliability, and quality assurance staff at NASA…. The decreases may have limited the ability of those offices to perform their review functions.”18 An authority on software development has argued that successful software development demands a similar commitment to quality assurance.19 It is essential to recognize that all human work is susceptible to error, and that even those working on systems aimed at lowering error rates must plan for anticipated errors in their own systems.
Clinical software systems such as decision support systems implemented to reduce medical errors pose similar challenges to maintaining quality assurance after deployment. Scaling up from a limited set of pilot users to a larger, more widespread audience and a larger number of patient record extracts may reveal unanticipated consequences, especially over a distributed network of clinicians. Clinicians may attempt to bypass time-consuming physician order entry systems by asking allied health professionals to write verbal or handwritten medication orders.2 This may not be observed or anticipated in testing with clinician early adopters.
One must also consider the inevitable changes to a clinical decision support system or the integrated network of software to which it is connected. A clinical decision support system may evolve to incorporate new evidence regarding best treatment practices, as with upgrading ATHENA DSS for JNC 7. The software to which the clinical decision support system is connected may be upgraded or updated over time. An analysis of the Health Evaluation through Logic Processing (HELP) system at Latter Day Saints Hospital and its interconnected referral centers in Salt Lake City, UT, revealed 1024 possible software configurations.9 Implementing any changes after deployment, either to the decision support system itself or other software dependencies, can lead to new unanticipated errors.
Our approach to these challenges was to create several methods to monitor system accuracy. The free-text window on the recommendations screen promoted direct interaction between ATHENA DSS and its clinician-users. Despite extremely busy clinical workloads and limited time per patient visit, study participants interacted with the ATHENA DSS advisory in some fashion for 63 percent of the patients.20 Nearly 50 percent of the study providers entered feedback comments during point-of-care use of ATHENA DSS. This level of participation is substantial, given other published reports of clinician interactions with decision support systems. One tertiary academic medical center reported that clinicians chose to interact with a guideline-based decision support system for hyperlipidemia in only 20 of 2,610 visits (0.8 percent).21 Clinician interaction with ATHENA DSS proved to be crucial to our study. Surveillance of feedback data provided a sensitive method of detecting potentially important problems in data extraction and conversion.
Given the complex milieu of data sources on which a decision support system relies, it is not surprising that rarely occurring errors may not be detected despite rigorous predeployment testing. Clinician feedback submitted via the ATHENA DSS user interface and program execution tracking provided important facilities for monitoring quality assurance after deployment of the system. With such data, a small team of investigators monitored the interactions of 91 ATHENA DSS clinician-users distributed over three geographically dispersed sites for a 15-month study period. More important, this method provided an efficient means of detecting and subsequently correcting inaccurate recommendations based on rarely occurring problems. New technologies for improving patient safety should include mechanisms and funding for post-fielding surveillance such as point-of-care feedback and other monitoring of the system to prevent introduction of new errors into the clinical workflow.
The ATHENA project was supported by VA Health Services Research and Development Service grant CPI 99275. The EON project was supported by National Library of Medicine grant LM 05708. Dr Chan was supported by the National Library of Medicine grant LM 07033. Views expressed are those of the authors and not necessarily those of the Department of Veterans Affairs or other affiliated institutions.
VA Palo Alto Health Care System, Palo Alto, CA (ASC, SBM, RWC, MKG). Stanford Medical Informatics, Stanford, CA (ASC, SWT, MAM, MKG). Durham VA Medical Center, Durham, NC (HBB, EZO). Boston VA Medical Center (BBH). Center for Primary Care Outcomes Research, Stanford, CA (MKG). Harvard Medical School, Boston (BBH), San Francisco VA Medical Center (MGS). University of California, San Francisco Department of Family and Community Medicine (ASC) and Department of Medicine (MGS).
Address correspondence to: Mary K. Goldstein, M.D, M.S.; VA Palo Alto Health Care System; GRECC 182B; 3801 Miranda Avenue; Palo Alto, CA 94304. Phone: 650-858-3933; e-mail: Mary.goldstein@med.va.gov.
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]
Free Full text in PMC]