Skip to main content
Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
AMIA Annu Symp Proc. 2012; 2012: 726–733.
Published online 2012 Nov 3.
PMCID: PMC3540476
PMID: 23304346

Utility of a Clinical Support Tool for Outpatient Evaluation of Pediatric Chest Pain

Abstract

This study evaluates a clinical pathway currently being employed at a large single-center pediatric cardiology practice. The dataset includes 1,997 pediatric patients with the primary complaint of chest pain. A logistic regression model was developed to predict cardiac disease and identify strong indicators of cardiac pathology. The area under the ROC curve was 0.73 and the Matthews correlation coefficient was 0.23. Given the low incidence of pathology disease, this study was unable to identify strong predictors of major cardiac pathology. The analysis did support syncope, palpitations and the onset of chest pain in the past 2–7 days as predictors of minor cardiac disease. However, the model indicated exertional chest pain is negatively associated with cardiac disease. This data should be evaluated with caution as some of the results are contrary to most clinical cardiologists’ views. The majority of the results support the cardiac disease predictors in the clinical pathway.

Introduction

Appropriate resource utilization is a key element with respect to successful reform of the US healthcare delivery system1,2. While appropriate use criteria exists for many medical disciplines, this is not necessarily the case in pediatric cardiology. Clinical pathways have been developed to streamline care and limit variability in physician practices3,4, with some focusing on common outpatient pediatric cardiology complaints such as chest pain5. It is important to note that the vast majority of children seen with chest pain have no underlying cardiac pathology, yet the possibility of serious and life-threatening pathology often leads to expensive testing (i.e., echocardiography). The purpose of this study was to use data mining techniques to evaluate the clinical pathway currently being employed at a large single-center pediatric cardiology practice.

Electronic health records (EHRs) are intended to improve the quality of patient care by providing accurate and complete patient data across healthcare organizations. However EHRs also act as a rich data source for healthcare researchers. Many studies utilize EHRs to identify predicting variables, such as predictors of asthma exacerbations6 or immunization delay7. This study analyzes EHRs to confirm or reject the clinical pathway’s assumed key predictors of cardiac disease. We supplement this analysis by developing and evaluating a diagnostic aid based on significant predictors.

Method

This study presents a three stage analysis. Stage-one performs univariate analysis to identify subgroups of patients which are more likely to have cardiac disease. Stage-two analysis uses logistic regression to predict whether a patient has cardiac disease while stage-three analysis develops a diagnostic aid to help physicians identify patients potentially at-risk for cardiac disease.

During stage-two analysis, developing a logistic regression model, variables with p-values less than 0.35 in stage-one analysis were included as initial variables in the model. The model was trained using 80% records, and tested with the remaining 20%. This dataset exhibits class imbalance issues which can cause logistic regression to perform poorly. However, oversampling the minority class can considerably improve performance and obtains comparable results to more complex solutions8. The minority class was oversampled until the number of majority and minority records were the same. The logistic regression model was trained on the oversampled data. The resulting parameters were used to identify significant predictors in the final model (i.e. factors with p-values greater than 0.05 were removed).

In order to avoid overfitting the data with too few outcome events per predictor variable, a general rule is there should be at least 10 outcome events per independent variable in the prediction rule9. As such, if the events-to-predictor ratio was less than 10, the m (where m = # outcome events/10) most significant predictors were kept while the remaining were removed.

Stage-three analysis develops a risk model by assigning a risk score to each significant predictor from the logistic regression model. This value was calculated by dividing the β-coefficient by one-half the sum of the two smallest (absolute sense) coefficients in the model and rounding to the nearest integer10. The overall risk score was calculated by summing the risk scores from each predictor.

The discrimination of our models was assessed by several metrics. We employed the metrics area under the receiver operating characteristic curve (AUC) and Matthews correlation coefficient (MCC). AUC assesses a classifier’s ability to balance the true positive and false positive rates. An AUC of 1.00 corresponds to perfect discriminatory power while an AUC of 0.50 indicates no ability to discriminate. New research indicates the AUC is a noisy classification measure11. As a result we also included MCC as it considers both true and false positive and negative rates. MCC is considered a more balanced measure than AUC and can also be used when classes are imbalanced. A MCC of 1.00 indicates perfect discrimination while a MCC 0.00 indicates no better than random classification.

Dataset

The dataset contained anonymous records of 1,997 pediatric patients seen in one of 17 outpatient clinics (11 permanent and 6 outreach). The permanent clinics are typically accessible 5 days per week and mainly serve metropolitan areas. The outreach clinics accessibility varies from one day every other week to an average of 1.5 days per week, and typically serve more rural areas in a single state. The clinical protocol was developed within the pediatric cardiology practice utilizing available best practices, local clinical expertise and group consensus. The clinical evaluation consists of the following three components: a) patient questionnaire and family history completed prior to seeing physician and again reviewed with physician, b) comprehensive physical examination, and c) 12-lead electrocardiogram (EKG). If no ”red flags” are raised or noted with a, b, and c, then the evaluation is complete and the patient is diagnosed with non-cardiac chest pain. If red flags are raised the patient is referred for further testing at the discretion of the treating cardiologist. Any deviations from the pathway must be documented and explained by the treating physician.

As a result of this procedure each record is composed of five components: 1) clinic and attending physician identifiers, 2) results of the physical exam, 3) responses to the patient and family history questionnaires, 4) outcomes of diagnostic testing, and 5) ICD-9 codes for diagnoses.

The physical exam records the following values 1) height, 2) weight, 3) BMI, 4) respiratory rate, 5) heart rate, 6) blood pressure, and 7) physical exam outcome. The outcome of the physical exam is a free text field written by the physician. The patient and family questionnaires comprise the majority of the patient record. The patient questionnaire consists of over 10 questions regarding 1) where the site of pain occurs, 2) how long the pains persists, 3) how often the pain occurs, 4) when the patient first noticed the pain, 5) how intense is the pain is, 6) what symptoms best describes the pain, and 7) what actions increase the pain. Each question, except concerning the site of pain, provides a set of fixed responses to which the patient indicates which responses most accurately describe his/her condition. The family history questionnaire consists of 22 questions regarding family history of cardiac disease, seizure, sudden death, etc. All questions require a yes/no response, with the exception of a single field provided for additional relevant information.

The ”red flags” on the patient questionnaire are as follows: 1) chest pain described as ”pressure”, 2) duration of chest pain is less than 48 hours or 2–7 days, 3) chest pain occurring during exertion/exercise, 4) chest pain made worse with exercise or lying down, 5) associated palpitations, lightheadedness, syncope or nausea, and 6) past history of rheumatic fever or Kawasaki disease. The ”red flags” on the clinical and family history are as follows: 1) ”has your child passed out during or after exercise, emotion, or startle?”, 2) ”has your child ever had extreme shortness of breath during exercise?”, 3)”has your child had extreme fatigue associated with exercise (different from other children)?”, 4) ”has your child ever had discomfort, pain or pressure in his chest during exercise?”, 5) ”are there any family members who died suddenly of heart problems before age 50?”, 6) ”are there family members who have had unexplained fainting or seizures?”, and 7) ”are there any relatives with an enlarged heart (hypertrophic or dilated cardiomyopathy)?”. These ”red flags” do not necessarily mean the child has cardiac chest pain, rather signify that these patients might require a more intensive evaluation to rule-out cardiac disease. From the patient questionnaire most experts would agree that exertional chest pain and associated syncope (passing out) and/or palpitations (potential heart rhythm abnormality) could signal significant cardiac disease.

The questionnaires were provided to patients in paper format and were later transcribed as EHRs. Each question was coded in a systematic method with the exception of free response fields. We assume the errors induced during transcription are minimal as free response fields were limited. However, the accuracy of free response questions may be questionable; especially concerning the site of maximal pain which asks patients to indicate on a diagram where the pain occurs. Each diagnostic test performed included its CPT code and a text field describing the outcome. Diagnoses were encoded as both an ICD-9 code and a text value. The vast majority (86.7%) of patients were assigned at least one of over 80 unique diagnoses in the dataset.

In addition to the patient records, we were provided each physician’s age, each clinic’s status (either permanent or outreach), and the diagnosis severity (no, minor, or major cardiac disease). The diagnosis severity was determined by a panel composed of physicians from this practice. Each physician independently reviewed the diagnoses and assigned them one of the three possible outcomes. If the reviewers’ selections were unanimous, the action was considered valid. However if the reviewers were not unanimous in their selection, discussion followed and a selection was made as a panel. The panel identified 20 diagnoses as major cardiac disease, 14 as minor, and the remainder as no cardiac disease.

Before developing a support tool, the data set was cleaned. Patients who had not completed the questionnaires were removed from the data set, leaving 1,888 remaining records. The question ”Does chest pain occur during exercise, rest, or both?” had 43 missing values while the question ”what is the severity of the pain (1–10)?” had 88 missing values. The regression method presented above was used to estimate the missing values.

Results

Before stage one analysis was performed missing entries in the data were estimated. Our regression models included the variables from the patient and family history questionnaires and the outcomes of the physical exam and EKG. These variables compose the results of the standard clinical procedure. The regression model to predict if pain occurred during exercise, rest, or both achieved AUCs of 0.871, 0.864, and 0.771 respectively. These results indicated the classifiers had good discriminatory power and were used to predict the 43 missing values. However, the linear regression model to predict the severity of pain had poor performance (RMSE = 2.542). As such, the question was removed from the dataset. The question concerning the site of pain was also removed as there was no systematic method to interpret the response.

For this analysis, positive cardiac disease is identified as both major and minor cardiac disease. The univariate analysis of the questionnaires identified 16 significant (p-value ≤ 0.05) questions. The set of questions concerning what symptoms co-occur with chest pain was the only question set with all the questions identified as significant. The set of questions concerning family history was the only question set with no significant differences. The univariate analysis of the patient symptom questionnaire is found in Table 3 while the analysis of the family history questionnaire is found in Table 4. A positive physical exam had an odds ratio 0.84 (0.66–1.07 95% CI) and a p-value of 0.15. A positive EKG had an odds ratio of 1.82 (1.40–2.34 95% CI) and a p-value < 0.01.

Table 3:

Univariate analysis of patient symptoms. (+) indicates major or minor cardiac disease.

Symptom(+) n = 359(−) n = 1529Odds Ratio95% CIp-value
Pain is described as
  Burning11.7%15.6%0.720.51–1.020.06
  Dull ache17.3%17.8%0.970.71–1.310.82
  Fluttering20.6%9.1%2.601.91–3.54<0.01
  Itching3.1%1.7%1.830.89–3.730.09
  Loss of breath44.3%38.9%1.250.99–1.570.06
  Pressure44.6%41.7%1.120.89–1.420.33
  Sharp52.6%51.9%1.030.82–1.300.81
  Sticking7.8%10.4%0.730.48–1.110.14
  Other19.5%17.1%1.180.88–1.580.28
Has noticed pain
  48 hours or less6.1%5.2%1.200.74–1.950.47
  2 - 7 days11.1%8.2%1.400.96–2.030.08
  1 week - 1 month26.5%24.6%1.100.85–1.430.46
  1 - 6 months31.5%29.8%1.080.85–1.390.52
  More than 6 months17.3%22.3%0.730.54–0.980.04
  Other8.4%10.1%0.810.54–1.230.32
How often pain occurs
  Several times daily26.2%22.4%1.230.95–1.600.12
  Once daily12.5%9.3%1.400.98–2.000.06
  Several times weekly18.1%16.0%1.160.86–1.570.34
  Weekly13.6%14.9%0.900.65–1.260.54
  Less often than weekly22.6%29.8%0.690.53–0.900.01
Pain usually lasts for
  Seconds19.8%26.0%0.700.53–0.930.01
  Minutes63.8%60.0%1.180.93–1.500.18
  Hours21.4%16.4%1.391.04–1.850.02
Pain occurs during
  Exercise15.6%23.9%0.5870.43–0.80<0.01
  Rest27.3%24.3%1.1680.90–1.510.24
  Both57.7%52.3%1.2440.99–1.570.06
Pain increases from
  Exercise31.2%39.6%0.690.54–0.88<0.01
  Sitting13.4%11.1%1.240.88–1.750.22
  Standing10.9%9.0%1.230.84–1.790.28
  Lying down16.4%14.2%1.190.87–1.630.28
  Coughing12.5%13.7%0.910.64–1.280.57
  Pushing on chest11.4%12.9%0.870.61–1.250.45
  Eating4.2%7.1%0.570.33–0.980.04
  Any type of movement14.5%15.6%0.910.66–1.270.59
  Taking a deep breath31.2%32.4%0.950.74–1.210.67
During chest pain, also suffers from
  Palpitations53.5%26.5%3.192.52–4.04<0.01
  Lightheadedness38.7%29.0%1.541.22–1.96<0.01
  Syncope11.1%2.2%5.683.53–9.15<0.01
  Nausea18.9%12.2%1.691.24–2.29<0.01
  Difficulty breathing50.1%44.4%1.261.00–1.590.05

Table 4:

Univariate Analysis of patient and family history. (+) indicates major or minor cardiac disease.

(+) n = 359(−) n = 1529Odds Ratio95% CIp-value
Patient has had
  Recent chest injury1.9%2.9%0.670.30–1.500.33
  Recent cough or cold symptoms22.0%20.6%1.090.82–1.440.56
  Recent fever5.6%4.6%1.230.74–2.050.43
  History of asthma25.9%22.7%1.190.91–1.550.20
  Kawasaki disease0.6%0.5%1.220.25–5.890.81
Pain wakes from sleep32.3%25.2%1.421.11–1.820.01
Passed out during exercise, emotion or startle10.3%5.2%2.081.38–3.13<0.01
Passed out after exercise0.3%0.2%1.420.15–13.700.76
Extreme SOB during exercise39.6%35.8%1.180.93–1.480.18
Extreme fatigue w/exercise27.9%21.4%1.421.09–1.840.01
Discomfort, pain, pressure in chest during exercise63.8%60.1%1.170.92–1.480.20
Family Hx
  Sudden death before 50 years18.9%16.6%1.170.87–1.580.29
  Unexplained fainting or seizures20.3%17.2%1.230.92–1.640.16
  Enlarged Heart: HCM9.2%8.1%1.150.77–1.720.50
  Enlarged Heart: DCM2.8%2.1%1.340.65–2.750.42

During stage-two analysis, developing a logistic regression model for cardiac disease (major or minor), we included all variables in stage-one analysis with p-values less than 0.35 as initial explanatory variables. At the end of the stepwise logistic regression model, there were 15 variables with p-values less than 0.05. These variables, along with their corresponding coefficients and odds ratios, are included in Table 5. The most significant predictors within the model were syncope (odds ratio = 2.93) and palpitations (odds ratio= 3.52). The model achieved an AUC = 0.726 indicating moderate discriminatory power (Table 6).

Table 5:

Predictors of cardiac disease (major or minor)

CharacteristicCoef.SEOdds RatioRisk Score
Pain Described as
  Burning−0.340.120.71−1
  Fluttering0.640.131.873
  Sticking−0.570.160.57−2
Pain has been present
  2–7 Days0.560.151.762
  Other−0.520.160.60−2
Pain occurs daily0.351.420.732
Pain usually persists for hours0.231.270.761
Activities that increase pain
  Exercise−0.510.090.60−2
  Sitting0.320.131.381
Eating−0.950.210.39−4
Palpitations1.080.092.935
Syncope1.260.213.526
Difficulty Breathing0.220.091.231
History of asthma0.220.101.251
Positive EKG0.620.101.863

Table 6:

Performance metrics for predicting cardiac disease (major or minor). (a) logistic regression performance. (b) diagnostic aid performance.

(a)

MetricEstimate

AUC0.73
MCC0.23
Sensitivity0.64
Specificity0.67
Likelihood ratio positive1.75
Likelihood ratio negative0.57
(b)

RangePatientsPatients w/ disease

12 to 193562.8%
7 to 1125641.0%
2 to 673521.6%
−8 to 28627.08%

As part of stage-three analysis, each significant predictor from stage two was assigned a risk score by dividing its β-coefficient by one-half the sum of the two smallest (absolute sense) coefficients in the model and rounding to the nearest integer10. These risk scores formulate a diagnostic aid by assigning a patient a cumulative risk score, the sum of the risk scores from each predictor. The diagnostic aid had a sensitivity of 0.50 and specificity of 0.65. The cut-off point was selected where the sum of the sensitivity and specificity was the largest. The likelihood of cardiac disease increased with the cumulative risk score as shown in Table 6.

When comparing the ”red flags” in the clinical protocol with the univariate analysis of cardiac disease (major or minor), nine of the 20 ”red flags” are considered significant. Seven of these significant variables agree with the ”red flag” interpretation. However two significant variables, 1) pain occurs with exercise and 2) exercise increases pain, contradict the ”red flag” interpretation. They indicate pain with exercising is correlated with no cardiac disease. The regression model agrees with the ”red flags”, onset of pain within the past 2–7 days, syncope, and palpitations.

Stage one through three analysis was repeated, however cardiac disease was defined as only major cardiac disease. The univariate analysis for major cardiac disease is not reported in this study. However stage-two results are found in Table 7. Ten variables were significant predictors of major cardiac disease with the strongest predictor being a positive EKG. However, this model had an AUC = 0.60 (Table 8) indicating very poor discriminatory power. Risk scores were also assigned to each predictor in a similar manner. The major cardiac disease diagnostic aid had a sensitivity of 0.87 but only 0.39 for specificity (Table 8). This model has sufficiently poor performance that extreme caution should be taken when interpreting its results.

Table 7:

Predictors of major cardiac disease

CharacteristicCoef.SEOdds RatioRisk Score
Pain Described as
  Burning−0.610.120.54−2
  Pressure0.740.082.112
  Other−0.700.120.49−2
Pain occurs less than weekly−0.450.100.63−1
Activities that increase pain
  Standing0.760.122.152
Light-headedness−0.420.100.65−1
Syncope0.760.182.142
Extreme fatigue associated with exercise0.620.091.862
Positive physical exam0.260.081.291
Positive EKG0.990.092.683

Table 8:

Performance metrics for predicting major cardiac disease. (a) logistic regression performance. (b) diagnostic aid performance.

(a)

MetricEstimate

AUC0.60
MCC0.04
Sensitivity0.45
Specificity0.65
Likelihood ratio positive1.31
Likelihood ratio negative0.83
(b)

RangePatientsPatients w/ disease

7 to 11586.90%
3 to 67245.25%
0 to 28621.97%
−6 to −12440.41%

Discussion

Evaluating physician and pathway performance is essential to quality assurance and has long been a central issue in medicine. While many studies have focused on analyzing physician performance12,13,14 and the impact of continuing medical education15, our study focuses on evaluating the performance of a clinical pathway in a large single-center pediatric cardiology practice. In addition to evaluating the pathway, we developed a diagnostic aid for cardiac pathology. However, the regression model for major cardiac disease had sufficiently poor performance that no confidence should be taken in interpreting its results.

Given the low incidence of major cardiac disease in this population, it is not surprising that our analysis did not identify a single predictor of major cardiac pathology with the current clinical pathway. However, there were a number of clinical and historical data elements that favored a presence or absence of minor cardiac pathology. This data should be evaluated with caution as some of the results were contrary to what most clinical cardiologists would view as significant/pertinent positives. For example, this analysis revealed a negative correlation of exertional chest pain with minor cardiac disease, while most would agree that these symptoms are often the most concerning. One explanation for this anomaly would be that children are not often reliable historians, thus relying too much on their description of the events might lead one to an incorrect conclusion. It is also possible that exertional chest pain is negatively associated with minor cardiac disease but positively associated with major cardiac disease. These results may be affected by a low incidence of cardiac pathology in this subset which makes interpretation difficult at this time. Although the panel labelled diagnoses as major or minor cardiac disease, they did feel that the minor diagnoses were not necessarily related to contributing to the chest pain. Our hope is that the current model can be applied to an increasing number of patients and provide more robust results/analysis.

Despite these limitations, our results did confirm a number of ”red flags” utilized in the clinical pathway, such as syncope, palpitations, and having noticed pain for 2–7 days. This study also raises potential ”red flags”, such as pain occurring daily, which should be further studied. This study highlights the difficulty in drawing significant conclusions primarily from patient and family history. However, empirical analysis of patient symptoms can lead to improved clinical pathways and patient outcomes.

Table 1:

Basic patient statistics

Statistic
Total patients1997
Patient age12.7 ± 3.9
Patients w/ cardiac disease19.1%
Patients w/ major cardiac disease3.1%
Patients w/ positive physical exam36.5%
Patients w/ positive EKG20.7%

Table 2:

Examples of diagnosis severity

ICD-9Minor Cardiac DiseaseICD-9Major Cardiac Disease
427.61Supraventricular premature beats426.7Anomalous atrioventricular excitation
429.5Rupture of chordae tendineae745.54Anomalies of cardiac septal closure
756.83Ehlers-Danlos syndrome746.02Stenosis of Pulmonary Valve, congenital
785.1Palpitations746.85Coronary artery anomaly
785.0Tachycardia, unspecified747.31Pulmonary artery coarctation and atresia

References

1. Memtsoudis SG, Sun X, Chiu YL, et al. Utilization of critical care services among patients undergoing total hip and knee arthroplasty: epidemiology and risk factors. Anesthesiology. 2012 Jul;117(1):107–116. [PMC free article] [PubMed] [Google Scholar]
2. Bodenheimer T, Lo B, Casalino L. Primary care physicians should be coordinators, not gatekeepers. JAMA. 1999;281(21):2045–2049. [PubMed] [Google Scholar]
3. Basse L, Jakobsen DH, Billesbolle P, Werner M, Kehlet H. A clinical pathway to accelerate recovery after colonic resection. Ann Surg. 2000 Jul;232(1):5157. [PMC free article] [PubMed] [Google Scholar]
4. Biffl WL, Smith WR, Moore EE, et al. Evolution of a multidisciplinary clinical pathway for the management of unstable patients with pelvic fractures. Ann Surg. 2001 Jun;233(6):843850. [PMC free article] [PubMed] [Google Scholar]
5. Rathod RH, Farias M, Friedman KG, et al. A novel approach to gathering and acting on relevant clinical information: scamp. Congenit Heart Dis. 2010;5:343–353. [PMC free article] [PubMed] [Google Scholar]
6. Himes BE, Kohane IS, Ramoni MF, Weiss ST. Characterization of patients who suffer asthma exacerbations using data extracted from electronic medical records. AMIA Annual Symposium Proceedings; 2008. [PMC free article] [PubMed] [Google Scholar]
7. Fiks AG, Alessandrini EA, Luberti AA, Ostapenko S, Zhang X, Silber JH. Identifying factors predicting immunization delay for children followed in an urban primary care network using an electronic health record. Pediatrics. 2006 Dec 1;118(6):1680–1686. [PubMed] [Google Scholar]
8. Japkowicz N. The class imbalance problem: significance and strategies. Proceedings of the 2000 International Conference on Artificial Intelligence; 2000. [Google Scholar]
9. Laupacis A, Sekar N, Stiell IG. Clinical prediction rules: a review and suggested modifications of methodological standards. JAMA. 1997 Feb 12;277(6):488–494. [PubMed] [Google Scholar]
10. Konno S, Hayashino Y, Fukuhara S, et al. Development of a clinical diagnosis support tool to identify patients with lumbar spinal stenosis. Eur Sprine. 2007 Nov;16(11):1951–1957. [PMC free article] [PubMed] [Google Scholar]
11. Hanczar B, Hua J, Sima C, Weinstein J, Bittner M, Dougherty ER. Small-sample precision of roc-related estimates. Bioinformatics. 2010;26(6):822–830. [PubMed] [Google Scholar]
12. Hartz AJ, Kuhn FM, Pulido J. Prestige of training programs and experience of bypass surgeons as factors in adjusted patient mortality rates. Medical Care. 1999;37(1):93–103. [PubMed] [Google Scholar]
13. Rhee SO. Factors determining the quality of physician performance in patient care. Medical Care. 1976 Sep;14(9):733–750. [PubMed] [Google Scholar]
14. Choudhry NK, Fletcher RH, Soumerai S. Systematic review: the relationship between clinical experience and quality of health care. Annals of Internal Medicine. 2005 Feb 15;142(4):260–73. [PubMed] [Google Scholar]
15. Bloom BS. Effects of continuing medical education on improving physician clinical care and patient health: a review of systematic reviews. International Journal of Technology Assessment in Health Care. 2005;21:380–385. [PubMed] [Google Scholar]

Articles from AMIA Annual Symposium Proceedings are provided here courtesy of American Medical Informatics Association