pmc logo image
Logo of procamiaJournal URL: http://www.amia.org/meetings/archives.asp

Formats:

AMIA Annu Symp Proc. 2003; 2003: 215–219.
PMCID: PMC1480215
Telephone Triage: A Timely Data Source for Surveillance of Influenza-like Diseases
Jeremy U. Espino, M.D., William R. Hogan, M.D., M.S., and Michael M. Wagner, M.D., Ph.D.
RODS Laboratory, Center for Biomedical Informatics, University of Pittsburgh, Pennsylvania
Abstract
We evaluated telephone triage (TT) data for public health early warning systems. TT data is electronically available and contains coded elements that include the demographics and description of a caller’s medical complaints. In the study, we obtained emergency room TT data and after hours TT data from a commercial TT software and service company. We compared the timeliness of the TT data with influenza surveillance data from the Centers for Disease Control using the cross correlation function. Emergency room TT calls are one to five weeks ahead of surveillance data collected by the CDC.
The threat of bioterrorism attacks and emerging infectious disease has generated interest in developing public health early warning systems capable of extreme timeliness of detection.1 These approaches rely in part on the collection and analysis of data that are inherently earlier than conventional public health surveillance data. Examples of this type of data include absenteeism records,2, 3 over the counter drug sales,4, 5 patient chief complaints,6, 7 real-time electronic laboratory reporting8, 9 and 911 calls.10
Telephone triage (TT) is another data source of considerable interest. 11 Hospitals and healthcare systems use TT to direct patients to the appropriate medical resources.12 For example, a parent whose child has a persistent cough could be directed to go to a pediatric walk-in clinic instead of an emergency department. Appropriate allocation of resources can lead to a significant reduction in costs13, 14 and there is often a high degree of patient satisfaction with TT.15 For these reasons, TT is becoming ubiquitous in today’s managed care organizations.16
TT has several characteristics that make it a good candidate for early detection of disease outbreaks. TT occurs before patients visit an emergency room or a doctor’s office, and thus TT data are likely to precede clinical data that arise from such clinical encounters. Also, TT can account for a large percentage of patient encounters—as much as 25% in internal medicine and even more in pediatrics.15 Finally, modern TT call centers incorporate computer systems that track callers along with a coded description of their medical problem. Thus, TT can provide data electronically for surveillance and there is the capability for these TT systems to provide data in real time.
In this paper, we show how TT data can be used in public health early warning systems and evaluate the timeliness of two TT datasets when compared to conventional influenza surveillance data.
TT Datasets
We obtained two TT datasets from a healthcare call center services and software company. One dataset came from an emergency room telephone triage facility (we refer to that dataset as ER-TT) that serves 10 hospitals in a major city and the other dataset came from an after-hours telephone triage facility for a group of physician offices in a single state (referred to as AH-TT). The two datasets were from two geographically distant areas. Each dataset contained 13 months of data, from September 1, 2001 to August 31, 2002, The ER-TT dataset contained 21,304 records and the AH-TT dataset contained 18,135 records. Each record in the datasets consisted of 10 elements:
  • Start date/time of initial call
  • Acuity: acute or non-acute
  • Inclination of the caller: There were 12 unique reasons why a call could be made. Examples are call physician, research, and 911.
  • Disposition assigned to the caller by the nurse: There were 23 unique dispositions. Examples are home care, see MD within 24 hours, call PCP within 24 hours, and ER immediately.
  • Call outcome: This element uses the same 23 possible values used in the disposition.
  • Primary symptoms in free-text: Examples are “stomach bug-6 loose bowel movements throughout the day” and “feels hot and some pain in throat.”
  • Five digit home zip code
  • Age
  • Gender
  • Treatment guideline used: There were 380 unique guidelines used in the datasets. A single guideline chosen by the triage nurse is recorded for each call. Examples include diarrhea, earache, breathing difficulty, vomiting, and asthma attack.
In the ER-TT dataset callers were encouraged to call TT before going to the emergency room. In the AH-TT data-set callers were routed to TT when physician offices were closed. When a person contacts the call center, a nurse records the basic demographics of the patient as well as the symptoms of the patient into the call center computer system. The nurse determines why the person is calling (the inclination) and if the caller has an acute medical emergency. Then, the nurse uses a standard guideline or protocol to interview the person and determine what the person should do (the disposition). Finally, the nurse may make a followup-call to determine if the person followed the nurse’s advice (the call outcome).
Guideline-to-Syndrome Mapping
It is usually necessary in public health surveillance to define syndromic categories as a group of health-related codes that are likely to be used for a patient presenting with that particular syndrome.6, 17, 18 In the case of TT guidelines, there is likely to be considerable variation in the way that callers with a particular disease present themselves to a TT nurse and the selection of a guideline by the nurse For this reason, we mapped guidelines into syndromes.
We categorized the 380 unique guidelines found in the datasets into a set of mutually exclusive categories that corresponded to syndromes of public health interest. We included a guideline in a syndrome category if the guideline could be used for a patient presenting with symptoms of interest to public health officials. We based the syndromes on syndrome definitions previously developed by the US Department of Defense, the CDC and the Utah Department of Health.
Gold Standard Determination of Influenza Outbreak
We chose the 2001–2002 outbreak of the disease influenza to evaluate the timeliness of TT data relative to conventional surveillance data. Influenza surveillance data is readily available from the Centers for Disease Control (CDC) and because of antigenic drifts or shifts in the influenza virus there are usually yearly outbreaks available for study. We obtained the following influenza surveillance data from the CDC website at http://www.cdc.gov/ncidod/diseases/flu/:
  • Weekly state epidemiologist reports of estimated influenza activity. (CDC-SI)
  • The weekly regional number of influenza-like-illness1 (ILI) cases from the U.S. Influenza Sentinel Physicians Surveillance Network. (CDC-RI)
  • The weekly regional number of positive influenza tests (CDC-RC)
The datasets we obtained were specific to the geographic coverage of each TT dataset.
We wanted to use state as well as regional influenza surveillance data. While the number of ILI cases is available regionally, only estimated levels of influenza activity (item number one in the preceding list) are available. Subjective assessments of influenza activity each week are reported as one of four values:
  • No Activity: No cases of influenza or ILI reported.
  • Sporadic: Cases of influenza or ILI are reported, but reports of outbreaks in places such as schools, nursing homes, and other institutional settings have not been received.
  • Regional: Outbreaks of influenza or ILI are occurring in geographic areas containing less that 50% of the state’s population. A geographic area could be a city, county, or district.
  • Widespread: Outbreaks are occurring in geographic areas representing more than 50% of the state’s population.
In order to include state influenza activity data in our quantitative analyses, we assigned an ordinal value to each of the estimated flu activity levels: “No activity” was assigned a value of 0, “sporadic” was assigned a value of 1, “regional” was assigned a value of 2, and “widespread” was assigned a value of 3. We accounted for missing flu activity data by interpolation, using the average of the two nearest weeks.
Classifying TT Calls as Influenza Calls
We defined an influenza call as a TT call coded with a guideline that we had categorized as being either respiratory or constitutional. We aggregated calls into weekly counts to match the weekly aggregation of influenza surveillance data.
Cross Correlation Metric
We used the cross-correlation function (CCF)19 to measure the time latency between TT and conventional influenza surveillance. In particular, we computed the cross-correlation function for the weekly counts of influenza calls from the ER-TT dataset and the AH-TT datasets with the state (CDC-SI) and regional (CDC-RI and CDC-RC) influenza surveillance data. Prior to computing the cross correlation function, we normalized each of the time-series to the number of standard deviations from the mean to satisfy the assumption of cross correlation analysis that the time series being compared are normalized to the same scale.
This type of analysis produces a correlation number over a range of possible time latencies between time series. We used the time latency (a.k.a. lag) at which the correlation number was maximized as our measure of timeliness, a method first used by Tsui et al.19
A lag of zero would indicate that TT data are no more or less timely that conventional surveillance data. A lag less than zero would indicate that TT data are more timely that conventional surveillance data. A lag greater than zero would indicate that TT data are less timely than conventional surveillance data.
Guideline-to-Syndrome Mapping
Of the 380 guidelines used in the ER-TT and AH-TT datasets, we assigned 69 guidelines to eight syndromic categories—gastrointestinal, respiratory, constitutional, hemorrhagic, rash, lymphadenopathy, botulinic, and neurologic. We further subdivided these primary categories into smaller categories to distinguish syndromes with and without fever, the disease location, and individual symptoms like cough that are important public health indicators (although we did not use these subdivisions in the measurements of timeliness with the correlation function, we provide them in Figure 1Figure 1 for other researchers). The final influenza guideline set, comprised eighteen respiratory guidelines and four constitutional guidelines.
Figure 1
Figure 1
Figure 1
Syndromic categories for TT data. The number of guidelines in each category is given in parentheses. The guideline descriptions for the respiratory and constitutional categories are shown in italics.
TT vs. Regional Influenza Surveillance Data
On inspection, the TT and regional influenza surveillance data (CDC-RI and CDC-RC) have similar temporal patterns (See Figure 2Figure 2 and Figure 3Figure 3). Cross correlation analysis (See Figure 4Figure 4 and Table 1) shows that ER-TT occurs earlier than CDC-RI (has a maximal correlation at a negative time lag) and AH-TT occurs later than regional influenza surveillance data.
Figure 2
Figure 2
Figure 2
Emergency room TT and CDC influenza surveillance. Each time-series plots weekly counts. ER-TT=emergency room TT influenza calls; CDC-SI=state influenza activity from the CDC; CDC-RI=regional influenza like illness cases from the (more ...)
Figure 3
Figure 3
Figure 3
After hours TT and CDC influenza surveillance data. Each time-series plots weekly counts. AH-TT=after hours TT influenza calls. CDC data label abbreviations are the same as in Figure 2Figure 2.
Figure 4
Figure 4
Figure 4
Cross correlation analysis of TT data and CDC influenza surveillance data. Data label abbreviations are the same as in Figures 2Figure 2 and and33Figure 3.
Table 1
Table 1
Time lag in weeks at maximum correlation between TT data and CDC influenza data. Data label abbreviations are the same as in Figures 2Figure 2 and and3.3Figure 3. Correlation is shown in parentheses (maximum possible correlation is 32.00).
TT vs. State Influenza Surveillance Data
ER-TT and CDC-SI have very similar temporal patterns. AH-TT and CDC-SI are moderately similar. Cross correlation analysis shows that ER-TT occurs earlier than CDC-SI and AH-TT occurs later than CDC-SI.
In this paper we attempt to show how TT data can be used in public health early warning systems and test the hypothesis that TT data are more timely than current influenza surveillance data. We used the 2001–2002 influenza outbreak and cross correlation analysis to test this hypothesis.
Is TT a timely data source?
Our analysis shows that ER-TT can be a timely data source for monitoring influenza outbreaks. This study shows the limitations in using cross correlation analysis alone to evaluate the timeliness of secondary data (data collected for other reasons) in public health surveillance. According to the cross correlation analysis, the after hours TT data comes after the conventional influenza surveillance data. The graphs of the data, however, show that there is a gradual increase in AH-TT calls several weeks before there is a steep increase but the regional influenza surveillance data do not show this characteristic.
One possible reason for the late peak in influenza calls to AH-TT is that patients might be more likely to wait until the doctor’s office is open instead of talking to the after hours triage service. This would lead to a large percentage of patients with influenza that are not captured. Only when patients are experiencing severe influenza symptoms at the height of the influenza outbreak do they call the AH-TT center.
There are also limitations in the using state influenza activity in this analysis. The state influenza activity level is based on estimates of influenza activity in the state. Is the appearance of multiple peaks of influenza activity in the AH-TT state due to poor reporting to the state health department or a real phenomenom?
How can public health use TT data?
TT data are collected primarily to maintain a record of the interaction between a caller and a nurse, and are not collected for public health surveillance purposes. The TT datasets we obtained nevertheless contain two types of information useful for public health—the demographics of the person requiring medical assistance and descriptions of that person’s symptoms. The descriptions were available in coded and free text versions. Although natural language processing tools are available to interpret free-text medical data, they often require a large amount of training data in order to be effective. Therefore, we specifically focused on the coded guideline of the call as a proxy of the caller’s symptoms for this initial study.
In order to use the TT data to monitor an outbreak like influenza, we defined a set of syndromes. The syndromes are mutually exclusive to eliminate double counting of records when leaf categories are combined into higher-level categories (recall that each phone call is associated with exactly one guidleline). The syndrome categories are hierarchal and sufficiently granular to allow for a high degree of modeling flexibility. For example, if an epidemiologist wanted to compute the incidence of all TT calls associated with fever, she could “build” a syndromic definition with the leaves of hierarchy pertaining to fever. Although we only used the respiratory and constitutional syndrome categories in this evaluation, we show the entire set for completeness.
In this evaluation we focused on a single element of TT data—the coded guideline. We note that the data contain other elements—acuity of the call, inclination, age, and gender—that are also potentially useful to public health. For example, public health might want to know the weekly incidence of acute fever in callers between 10 and 20 years old who are male.
TT Limitations and Future Work
One limitation of TT for public health early warning systems is that TT is not ubiquitous. Although market projections anticipate steady growth of TT services14 they have not achieved widespread use except in areas with large managed care organizations. Also, many of the treatment guideline sets are proprietary in nature and are distributed by multiple TT software vendors. This makes it more difficult for TT to be universally incorporated into public health early warning systems unless methods to map the free-text primary symptoms of the caller to syndromes are used (i.e., natural language processing).
This paper suggests that ER-TT data are more timely than currently collected influenza surveillance data. Future work must confirm or refute this pilot observation using multiple years of TT data so that confident estimates of the sensitivity and specificity of influenza detection using TT data can be made. Data from other known disease outbreaks should be obtained to evaluate the potential of TT data to providing early warning of disease outbreaks other than influenza. Finally, work should be done to evaluate how the other elements in the data, particularly demographic and spatial information, can be used to identify smaller outbreaks of disease.
CONCLUSION
For detection of influenza-like illnesses, TT data is a promising data source for use in public health surveillance. TT has several favorable characteristics including electronic availability, coded data elements, and timeliness.
Acknowledgments
This work was supported by Pennsylvania Department of Health Award number ME-01-737, grant F 30602-01-2-0550 from the Defense Advanced Research Projects Agency and grant T15 LM/DE07059 from the National Library of Medicine.
Footnotes
1The CDC defines influenza like illness as fever (temperature of >100°F) plus either a cough or a sore throat.
1. Wagner MM, Tsui FC, Espino JU, Dato VM, Sittig DF, Caruana RA, et al. The emerging science of very early detection of disease outbreaks. J Public Health Manag Pract. 2001;7(6):51–9. [PubMed]
2. Neuzil KM, Hohlbein C, Zhu Y. Illness among schoolchildren during influenza season: effect on school absenteeism, parental absenteeism from work, and secondary illness in families. Archives of Pediatrics & Adolescent Medicine. 2002;156(10):986–91. [PubMed]
3. O’Reilly FW, Stevens AB. Sickness absence due to influenza. Occupational Medicine (Oxford). 2002;52(5):265–9.
4. Mikol YB, Miller JR, Seeley A, Ashendorff A, Waterbourne Diseases Risk Assessment Program 1997 Annual Report, In:New York City Department of Environmental Protection. 1998. http://www.nyc.gov/html/dep/pdf/wdrap97.pdf.
5. Goldenberg A, Shmueli G, Caruana RA, Fienberg SE. Early statistical detection of anthrax outbreaks by tracking over-the-counter medication sales. Proc Natl Acad Sci U S A. 2002;99(8):5237–40. [PubMed]
6. Espino JU, Wagner MM. Accuracy of ICD-9-coded chief complaints and diagnoses for the detection of acute respiratory illness. Proc AMIA Symp. 2001:164–8. [PubMed]
7. Ivanov O, Wagner MM, Chapman WW, Olszewski RT. Accuracy of three classifiers of acute gastrointestinal syndrome for syndromic surveillance. Proc AMIA Symp. 2002:345–9. [PubMed]
8. Panackal AA, M’Ikanatha NM, Tsui FC, McMahon J, Wagner MM, Dixon BW, et al. Automatic electronic laboratory-based reporting of notifiable infectious diseases at a large health system. Emerg Infect Dis. 2002;8(7):685–91. [PubMed]
9. Effler P, Ching-Lee M, Bogard A, Ieong MC, Nekomoto T, Jernigan D. Statewide system of electronic notifiable disease reporting from clinical laboratories: comparing automated reporting with conventional methods. Jama. 1999;282(19):1845–50. [PubMed]
10. Lee E, ALERT #28: The New York City Department of Health and Mental Hygiene requests that health-care providers maintain a heightened awareness for unusual manifestations or clusters of illnesses during upcoming high profile events in New York City, In:New York City Department of Health & Mental Hygiene. http://www.nyc.gov/html/doh/html/cd/02md28.html (March 3 2003).
11. Rodman JS, Frost F, Jakubowski W. Using nurse hot line calls for disease surveillance. Emerg Infect Dis. 1998;4(2):329–32. [PubMed]
12. Simonsen-Anderson S. Safe and sound. Nursing Management. 2002;33(6):41–3. [PubMed]
13. Sabin M. Telephone triage improves demand management effectiveness. Healthc Financ Manage. 1998;52(8):49–51. [PubMed]
14. Unknown. Telephone Triage: Products, Services, & Successful Systems. Jacksonville: Feedback Research Services; 2002 December 2002. Report No.: TT120.
15. Delichatsios H, Callahan M, Charlson M. Outcomes of telephone medical care. J Gen Intern Med. 1998;13(9):579–85. [PubMed]
16. Galimi J. The Market for Medical Call Center Applications: Gartner Group; 1999.
17. Brinsfield KH, Gunn JE, Barry MA, McKenna V, Dyer KS, Sulis C. Using Volume-based Surveillance for an Outbreak Early Warning System. Acad Emerg Med. 2001;8(5):492.
18. Unknown, ESSENCE: Electronic Surveillance System for the Early Notification of Community-based Epidemics, In:U.S. Department of Defense, Global Emerging Infections Surveillance and Response System. 2003. http://www.geis.ha.osd.mil/GEIS/SurveillanceActivities/ESSENCE/ESSENCE.asp (February 25 2003).
19. Tsui FC, Wagner MM, Dato V, Chang CC. Value of ICD-9 coded chief complaints for detection of epidemics. J Am Med Inform Assoc. 2002;9:S41–S47.

See more articles cited in this paragraph
See more articles cited in this paragraph
See more articles cited in this paragraph
See more articles cited in this paragraph