U.S. flag

An official website of the United States government

NCBI Bookshelf. A service of the National Library of Medicine, National Institutes of Health.

Balion C, Santaguida PL, Hill S, et al. Testing for BNP and NT-proBNP in the Diagnosis and Prognosis of Heart Failure. Rockville (MD): Agency for Healthcare Research and Quality (US); 2006 Sep. (Evidence Reports/Technology Assessments, No. 142.)

  • This publication is provided for historical reference only and the information may be out of date.

This publication is provided for historical reference only and the information may be out of date.

Cover of Testing for BNP and NT-proBNP in the Diagnosis and Prognosis of Heart Failure

Testing for BNP and NT-proBNP in the Diagnosis and Prognosis of Heart Failure.

Show details

3Results

The search yielded 4338 citations in total. From these 1733 citations proceeded to full text screening. Criteria for each specific research question were applied to these 1733 citations that yielded 4 subsets of papers to be further screened: one for each of the research questions (Figure 2). A total of 264 citations (6 percent) were eliminated because of non-English language of publication (6 percent) at the title and abstract phase. The final number of eligible papers varied as a function of the specific research question. A total of 30 studies were eligible for Question 2; the results of the review of reviews for Question 2b are detailed later in the results. For question 3, a total of 150 citations were eligible, and from these 110 are evaluated for this report. Forty of the citations for Question 3 reflected very specialized populations that did not necessarily reflect cardiac dysfunction. Finally for Question 4, a total of 18 studies were abstracted and evaluated.

Figure 2. Flow diagram showing the numbers of articles processed at each level.

Figure

Figure 2. Flow diagram showing the numbers of articles processed at each level.

The results of the systematic review are presented in this chapter according to the four research questions: determinants, diagnostic performance, prognosis and monitoring of treatment.

Question 1: What Are the Determinants of Both BNP and NT-proBNP?

Study Characteristics

There were 144 studies included for all the clinical questions in this systematic review (Appendix C - Reference List of Included Articles). Of these, 72 studies showed a relationship between B-type natriuretic peptides and a biological determinant. In general, most determinants showed a positive association with B-type natriuretic peptides in this review. The determinants were categorized according to type of measurement (i.e., demographic, biochemical or physiological), disease and treatment. The determinant was considered to show a significant effect on B-type natriuretic peptide levels if the p-value was less than 0.05. Table 3 lists the details of the associations found and is presented according to the determinant category, effect (increase, none, and decrease) and test type (BNP and NT-proBNP).

Table 3. The effect and association of various biological determinants on BNP and NT-proBNP levels.*.

Table 3

The effect and association of various biological determinants on BNP and NT-proBNP levels.*.

Demographic Characteristics

Age was the most frequently reported determinant and in 13 of 15 studies was positively correlated with both BNP and NT-proBNP.3– 15 There were two studies that did not show a relationship with age,31, 34 but these studies had the smallest number of patients (n = 21 and 36, respectively) as compared to the other studies (range = 85 to 6809). One study reported no difference between African Americans and Caucasians.49 The association of B-type natriuretic peptides with gender was examined in 11 studies, with an almost equal number reporting either a higher level (n=5), or no difference in males (n = 6). There were no obvious similarities among studies with respect to observed association and patient population. However, larger studies were more likely than smaller studies to report a higher B-type natriuretic peptide level in females compared to males. Two studies looked at current smoking and reported no association.8, 85

Cardiac Disease

In general, all cardiac diseases (n = 21) were associated with an increase in the B-type natriuretic peptides. These included diastolic dysfunction,5, 11, 16, 33, 34, 65 cardiac decompensation,26 acute right HF,57 and cardiac pulmonary edema (CPE).57 Acute right HF without cardiac pulmonary decompensation was not related to BNP concentration. Cardiac decompensation, however, was related to an increase in NT-proBNP. Patients with CPE had higher levels of BNP than patients with obstructive lung disease. Patients with diastolic dysfunction had elevated B-type natriuretic peptide levels but not as elevated as patients with systolic dysfunction.11, 16, 172

There were differences among diseases within the broad category of cardiac ischemia. Patients with acute coronary syndrome (ACS) had elevated NT-proBNP levels,85 but there was no difference between patients with and without ischemic heart disease unless the patients had cardiovascular risk factors.4, 116 Acute myocardial infarction (MI)4, 8, 29 or historical MI14, 85 were associated with increased levels of B-type natriuretic peptides. Stable angina was not associated with a difference in B-type natriuretic peptides in one study4 that included hypertensive patients, but was positively associated in patients with Non ST-elevation myocardial infarction (NSTEMI) ACS.57 Patients with left anterior descending (LAD) coronary artery lesions had elevated BNP and those with proximal lesions had higher levels than those with mid-lesions.95 Multi-vessel disease was associated with higher NT-proBNP levels.85 Also NT-proBNP levels were positively associated with patients who had previous revascularization.8 There was no difference between patients with dilated cardiomyopathy and old MI.124 Arrhythmia65 was associated with elevated levels of B-type natriuretic peptides; however, there was no difference between atrial fibrillation and sinus rhythm115 valvular disease65 and all severities of aortic stenosis6 were positively associated with B-type natriuretic peptides levels.

Non-cardiac Diseases

The effect of non-cardiac diseases (n = 11) on B-type natriuretic peptide levels was mixed. Non-cardiac causes of dyspnea,16– 18 diabetic nephropathy,15 and stroke8 were all associated with higher levels of B-type natriuretic peptides. Lung disease compared to HF,56 or HF plus lung disease,65 had lower BNP and NT-proBNP levels respectively. Diabetic retinopathy15 and cerebrovascular disease (including stroke and transient ischemic attack)4, 8 did not show association with B-type natriuretic peptide levels. For diabetes, one study showed a positive association with NT-proBNP8 but in three studies4, 14, 85 there was no association. Four of five studies that evaluated hypertension8, 10, 34, 89 showed a positive association with B-type natriuretic peptides. The one study85 that did not show a difference used a statistical test for the difference in medians whereas the other studies used mean difference tests or regression analysis. Duration of hypertension was also not associated with BNP levels.10 There was no difference in NT-proBNP levels between patients with peripheral vascular disease as compared to patients without risk factors for cardiovascular disease (CVD).4 Two studies reported hyperlipidemia as a determinant. One of these studies showed an inverse relationship with NT-proBNP levels85 using the Wilcoxon rank sum test, whereas the other study showed no relationship8 using multiple linear regression analysis.

Biochemical and Hematological Markers

There were 29 biochemical and hematological markers where an association with the B-type natriuretic peptides was made. Markers of myocardial damage, including Tn-I,3, 19, 20 Tn-T,8, 14, 16, 21– 26 myoglobin,21 and CK-MB,21, 27– 29 were mostly positively associated with B-type natriuretic peptide levels. One study did not show a statistically significant correlation with CK-MB.27 This study included only ST-segment elevation myocardial infarction (STEMI) patients, whereas the other studies excluded STEMI patients, or included MI patients admitted to the coronary care unit. No significant association was found in one study with Tn-I.20 This study included only NSTEMI and unstable angina patients in contrast to the other studies that included STEMI and ACS patients. Total creatine kinase showed no significant association with BNP but this may be because no patients in this study had elevated levels of this marker.26 Furthermore, the cardiac hormones ANP,12, 13, 25, 84 NT-proANP,13, 20, 41, 74, 84, 144, 144 and second messenger cGMP,12, 84 were positively associated with B-type natriuretic peptide levels. However, relaxin, also a cardiac hormone, showed no association with NT-proBNP.136 Several markers of inflammation including C-reactive protein,8, 14, 21 interleukin-6,22 the ST2 receptor protein90 and osteoprotegerin98 were positively associated with B-type natriuretic peptide levels. The association with lymphocytes was patient group specific.105 There was no statistically significant association between BNP and lymphocytes observed in the patient group with hypertensive heart disease, mitral stenosis, atrial fibrillation and hypertrophic cardiomyopathy, but a negative association was observed in a group composed of patients with ischemic heart disease, dilated cardiomyopathy aortic stenosis, aortic regurgitation and mitral regurgitation. There was a mixed association with markers of the renin-angiotensin-aldosterone system (RAAS). Plasma renin activity106 was inversely associated with BNP, whereas andromedullin84 and aldosterone106 showed no significant relationship with NT-proBNP and BNP, respectively. The ACE genotype DD,81 endothelin-1,106 big endothelin-1,41, 106 epinephrine84 and norepinephrine12, 25, 39, 41, 84, 106 were all positively associated with the B-type natriuretic peptides. Creatinine, an indirect marker of renal function, increased in five of eight studies with increasing levels of B-type natriuretic peptides.8, 9, 13, 29, 34, 36, 76, 100 The reason why two studies29, 34 did not show a correlation with creatinine is unknown; however, these two studies had the smallest sample size (n = 64 and 36, respectively) compared to the other studies (n = 84 to 6809). There was also no significant relationship observed between total protein9 and BNP. Fasting glucose15 and HbA1c9, 15 tests for diabetes showed no significant relationship with B-type natriuretic peptides, but random glucose135 was positively associated with BNP. Cholesterol,9, 15 a marker of HF, showed no significant relationship with BNP or NT-proBNP. However, hemoglobin,15 a marker of anemia, was negatively associated with NT-proBNP.

Functional and Physiologic Measure

Two measures of renal function, glomerular filtration rate15 and creatinine clearance,122 showed an inverse relationship with B-type natriuretic peptides. Weight,8 but not BMI,9, 10 showed a negative relationship with B-type natriuretic peptides. Exercise testing also showed that a decrease in physical endurance was related to higher B-type natriuretic peptide levels.116 Two studies which evaluated BMI as a determinant had no,9 or very few,10 patients who were obese.

Hemodynamic, Electrocardiographic and Echocardiographic Measures

There were 23 measures from 14 studies reported for heart function.4, 8– 12, 14, 15, 29– 34 Most of the hemodynamic, electrocardiographic and echocardiographic measures were compared to BNP and a few were compared to NT-proBNP. Nine were positively associated with the B-type natriuretic peptides whereas eight showed no association. I-123 — metaiodobenzylquanidine (MIBG) activity,30 was negatively associated with B-type natriuretic peptides. Deceleration time of early mitral inflow was also negatively associated with BNP (the lower the deceleration time, the higher the plasma BNP).32 Heart rate and systolic blood pressure were the only two measurements that showed discrepant effects on B-type natriuretic levels. Heart rate was associated with both an increase8, 12, 31 and no change9, 34 in B-type natriuretic peptides. Of the two studies that did not show an association, one included only hypertensive patients34 and in the other, the association was in elderly subjects (> 80 years).9 Systolic blood pressure was either positively4, 15 associated with NT-proBNP, or showed no association with BNP.10, 31 In one of these studies31 the association changed to positive after the patients were treated with a beta blocker.

Drug Treatment

There were 14 studies, including nine different drug treatments, with data on the effect of drug treatment.31, 35– 47 The effect of these drugs was a decrease or no effect on B-type natriuretic peptide levels. Studies involving therapy with amiodarone,37 atenolol,41 enalapril40, 42 and valsartan39, 43, 45, 46 all showed a decrease in B-type natriuretic peptides. Studies that assessed B-type natriuretic peptide levels after therapy with perindopril47 or metoprolol44 showed no difference compared to baseline. There was no dose dependent change in B-type natriuretic peptide levels with lisinopril36 or furosemide.36 The effect of carvedilol therapy on B-type natriuretic peptide levels, compared to baseline or a placebo group, showed either a decrease,31, 36, 47 or no change.35, 44 There were two studies38, 44 that treated patients with beta blockers but did not differentiate between the two drugs (carvedilol or metoprolol). One study reported a decrease in NT-proBNP concentration38 whereas the other study reported no change in BNP concentration44 after treatment.

Non-drug Treatment

There was only one study in this group of papers that reported a non-drug therapy. In this study the concentration of BNP decreased after implantation of a left ventricular device.150

Question 2a: What Are the Clinical Performance Characteristics of Both BNP and NT-proBNP Measurement in Patients with Symptoms Suggestive of HF or with Known HF?

Question 2ai: Emergency Department

Sample and Design Characteristics of Studies

Fourteen articles met all of the inclusion criteria and were selected for data abstraction.7, 16– 18, 48– 57 Of the 14 selected studies, four were from the Breathing Not Properly Multinational Study.18 The data from the sub-studies49, 51, 54 were excluded from the meta- analyses. This study and seven others examined only BNP.17, 48, 50, 52, 55– 57 Two other studies examined only NT-proBNP, 16, 53 and one study examined variations of both BNP and NT-proBNP.7 The included studies were published over a period of four years (2001 - 2004) with the majority published in 2002 and 2004 (Table 4). The patients enrolled in all studies presented to emergency departments with shortness of breath and were over 18 years of age. One study57 limited enrolment to patients over 65 years of age while another16 limited enrolment to patients between 44 and 88 years of age.

Table 4. Diagnostic properties of studies that evaluated BNP and NT-proBNP in patients with symptoms suggestive of HF in emergency or urgent care settings.

Table 4

Diagnostic properties of studies that evaluated BNP and NT-proBNP in patients with symptoms suggestive of HF in emergency or urgent care settings.

Diagnosis of HF in studies. All studies except two52, 53 selected for data abstraction employed a cohort design and a reference standard agreed upon by consensus of at least two physicians (mostly cardiologists). Two studies based the diagnosis on the opinion of a single cardiologist48, 52 and the third only stated that the definitive diagnosis was based on the Framingham criteria and echocardiography results.53 The adjudicating physicians each arrived at a diagnosis of HF based on their interpretation of all available clinical data, often including echocardiography results. The Boston Criteria were employed in the diagnosis in one study52 and the Framingham criteria in three studies,18, 53, 55 one of which also applied the National Health and Nutrition Examination Survey (NHANES).18

Diagnostic properties. Table 4 presents the results to answer the question “What are the clinical performance characteristics of both BNP and NT-proBNP measurement in patients with symptoms suggestive of HF or with known HF presenting to an emergency department?”

The 12 studies evaluating BNP utilized several cut point values ranging from 50 to 400 pg/mL and reported sensitivities from 60 percent to 100 percent, specificities from 27 to 99 percent, and areas under the curve (AUC) of 0.67 to 0.99.7, 17, 18, 48– 52, 54– 57 In addition, the reported positive likelihood ratio (LR+) ranged from 0.69 to 70 and the negative likelihood ratio (LR-) ranged from 0.00 to 0.44 (Table 4). The three studies evaluating NT-proBNP utilized several values ranging from 254 to 4567 pg/mL and reported sensitivities from 74 percent to 98.6 percent, specificities from 47 to 93 percent, and AUC values of 0.89 to 0.96. It was possible to do meta-analysis on eight studies for BNP7, 17, 18, 48, 52, 55– 57 and three studies for NT-proBNP.7, 16, 53 To maximize sensitivity the lowest cut point was used if multiple cut point data were given. The data are summarized in Table 7 and Figures 3 and 4. The sensitivities of the BNP studies were similar with a summary sensitivity of 97 percent and a CI of 96 to 98 percent. In contrast, the specificity data was very heterogeneous with a summary estimate of 70 percent and a CI ranging from 56 to 85 percent (see Appendix C, Table 1315 for results of tests for heterogeneity with regards to setting). The corresponding likelihood ratios (LRs) showed that the LR- (0.06, 95 percent CI: 0.03 to 0.10) was better than the LR+ (3.63, 95 percent CI: 2.49 to 5.31) in terms of diagnostic value. The diagnostic odds ratio (DOR) for BNP ranged from 13 to 1635 with a summary estimate of 81 (95 percent CI: 29 to 219). With the exception of the pooled sensitivity estimates (Figure 3a) for BNP in the ED, all other combined estimates (for specificity, LR+, LR-, and DOR) had positive tests for heterogeneity; as such our confidence in these pooled estimates is decreased.

Table 7. Diagnostic odds ratios for studies that evaluated BNP and NT-proBNP in patients with symptoms suggestive of or with HF across all settings.

Table 7

Diagnostic odds ratios for studies that evaluated BNP and NT-proBNP in patients with symptoms suggestive of or with HF across all settings.

Figure 3. Forest plots for BNP in all settings using the lowest cut point provided in each study: a) sensitivity, b) specificity, c) LR+, d) LR-, e) DOR.

Figure

Figure 3. Forest plots for BNP in all settings using the lowest cut point provided in each study: a) sensitivity, b) specificity, c) LR+, d) LR-, e) DOR.

Figure 4. Forest plots for NT-proBNP in all settings using the lowest cut point provided in each study: a) sensitivity, b) specificity, c) LR+, d) LR-, e) DOR.

Figure

Figure 4. Forest plots for NT-proBNP in all settings using the lowest cut point provided in each study: a) sensitivity, b) specificity, c) LR+, d) LR-, e) DOR.

For NT-proBNP the summary estimates were similar to BNP in that the sensitivity (95 percent, 95 percent CI: 90 to 101) was much higher than the specificity (72 percent, 95 percent CI: 53 to 90). The LR- (0.07, 95 percent CI: 0.02 to 0.27) was also better than the LR+ (3.35, 95 percent CI: 1.75 to 6.41). The DOR from these three studies assessing NT-proBNP ranged from 17 to 291 with a summary estimate of 60 (94 percent CI: 9 to 407). All pooled estimates of NT-proBNP diagnostic accuracy measures were significant for heterogeneity for ED studies.

There were six studies that provided diagnostic information at a BNP cut point of 100 (±5) pg/mL.7, 17, 18, 55– 57 The meta-analysis on these studies shows a similar pattern to that described for varying cut points (Figure 5); with the exception of LR-, all other diagnostic pooled estimates were positive for heterogeneity. The sensitivity summary estimation is 95 percent (95 percent CI: 91 to 96) with a lower and broader specificity summary estimation (67 percent, 95 percent CI: 53 to 80). The LR+ was 3.4 (95 percent CI: 2.14 to 5.42) and the LR- was 0.11 (95 percent CI: 0.08 to 0.15), which is higher than the lowest cut point summary estimate. The overall DOR for this group of studies was reduced to 38 but the 95 percent CI was tighter (17 to 85) compared to the lowest cut point summary estimate.

Figure 5. Forest plots for BNP in the ED using a cut point of 100 (±5) pg/mL: a) sensitivity, b) specificity, c) LR+, d) LR-, e) DOR.

Figure

Figure 5. Forest plots for BNP in the ED using a cut point of 100 (±5) pg/mL: a) sensitivity, b) specificity, c) LR+, d) LR-, e) DOR.

Quality assessment of studies. Results from the application of the QUADAS Question 14, quality assessment tool are as follows (see also Appendix C Evidence Figures, Figure 1): one (9.1 percent) of the studies clearly addressed the issue of disease progression bias (QUADAS Question 4); the reference standard was independent of the index test result (QUADAS Question 7) in 10 (90.9 percent) of the studies; the reference standard was described in sufficient detail (QUADAS Question 9) in seven (63.6 percent) of the studies; interpretation of the peptide marker (BNP or NT-proBNP) measurement was clearly without knowledge of the reference test results (QUADAS Question 10) in one (9.1 percent) of the studies; interpretation of the reference test results was clearly without knowledge of the B-type natriuretic peptide marker results (QUADAS Question 11) in seven (63.6 percent) of the studies; none (100 percent) of the studies stated whether or not the clinical data was available when the B-type natriuretic peptide test results were interpreted as would be the case when the test is used in practice (QUADAS Question 12); one (9.1 percent) of the studies reported uninterpretable or intermediate test results (QUADAS Question 13) and; withdrawals were not explained in two (18.2 percent) of the studies (QUADAS Question 14). Overall, the quality of these studies was good.

Question 2aii: Specialized Clinic or Outpatient Setting

Sample and Design Characteristics of Studies

There were a total of six papers eligible for review published between 1997 and 2004.11, 58– 62 All studies evaluated BNP with the exception of two58, 60 which compared both BNP and NT-proBNP. The studies were conducted in Austria, Japan, Portugal and USA. Two studies were based on patients referred to a HF clinic11, 59 and the remaining, to outpatient settings.58, 60– 62 All studies provided evaluation on BNP and two compared BNP and NT-proBNP.58, 60 Three of these papers provided data on sensitivity, specificity, and ROC curves for BNP or NT-proBNP 11, 58, 60. Three of these papers did not provide ROC characteristics or sensitivity or specificity data; instead, only correlation data for BNP or NT-proBNP with different variables of cardiac structure, function and symptoms were provided.

Diagnosis of HF in studies. Three studies58, 60, 62 used echocardiography as the reference standard, one study used echocardiography plus clinical criteria11 and two studies59, 61 used the NYHA classification.

Diagnostic properties. Hammerer-Lercher et al.60 directly compared the diagnostic values of NT-proBNP with BNP in 57 patients with stable chronic HF. In the analysis of normal (echocardiographic ejection fraction (EF) ≤ to 48 percent or radionuclide angiographic EF ≤ to 55 percent) versus impaired (echocardiographic EF < 48 percent or radionuclide angiographic EF < 55 percent) LVEF the AUC for BNP was 0.75 (SE ± 0.06), and for NT-proBNP was 0.67 (SE ± 0.07) (Table 5). In the analysis of LVEF less than 40 percent versus greater than or equal to 40 percent, the AUC for BNP was 0.83 (SE ± 0.06), and for NT-proBNP was 0.79 (SE ± 0.07). Positive and negative LRs were 3.17 and 0.35, respectively for BNP and 2.59 and 0.41, respectively for NT-proBNP. NT-proBNP did not differ significantly from BNP in either of the analyses. The optimal discriminator values were 142 pg/mL for BNP, and 4127 pg/mL for NT-proBNP for the detection of LVEF less than 40 percent compared to greater than or equal to 40 percent. For these discriminators the sensitivity was 73 percent for BNP, and 70 percent for NT-proBNP. The specificities were 77 percent for BNP, and 73 percent for NT-proBNP.

Table 5. Diagnostic properties of studies that evaluated BNP and NT-proBNP in patients with symptoms suggestive of or with HF in outpatient or specialty clinic settings.

Table 5

Diagnostic properties of studies that evaluated BNP and NT-proBNP in patients with symptoms suggestive of or with HF in outpatient or specialty clinic settings.

Bettencourt et al.11 studied 100 patients with symptoms suggestive of HF referred to a HF clinic. These patients had suspected or not previously investigated HF. Since healthy controls were included in this study, this suggested the potential for spectrum bias, although it was not clear if the control data was used in the estimates of diagnostic accuracy. For a cut point value of 39.7 pg/mL, the positive predictive value was 95.5 percent. For the diagnosis of HF regardless of LVEF, the AUC was 0.92 (95 percent CI: 0.86 to 0.99; p < 0.0001). The accuracy of BNP for the detection of systolic dysfunction was slightly less with an AUC of 0.78 (95 percent CI: 0.69 to 0.88; p < 0.0001). The BNP performance for detection of diastolic dysfunction expressed by the AUC was 0.89 (95 percent CI: 0.78 to1.00; p < 0.0001) for the patients without systolic dysfunction. Multiple regression analysis demonstrated that age, left ventricular mass index, and LVEF were independently associated with BNP.

Seino et al.58 compared BNP and NT-proBNP relative to LVEF less than 40 percent and less than 50 percent in patients with HF. Their data indicate that detection of LVEF less than 50 percent was slightly greater for NT-proBNP than BNP (AUC 0.820 and 0.794, respectively). The reverse was true for LVEF less than 40 percent, with BNP having slightly greater AUC (0.770) compared to NT-proBNP (0.754). The optimum cut point values were determined to be 135 pg/mL for BNP and 695 pg/mL for NT-proBNP. There were four papers 11,59,61,62 that examined the relationship between BNP only and other HF variables but did not provide any data about the sensitivity, specificity or accuracy of these measurements. In general, it was demonstrated that BNP was related to cardiac function measured either as LVEF62 or left ventricular end diastolic pressure.61 One study59 examined 41 HF patients and found BNP was related to the NYHA class.

There was only one study58 that contained sufficient information to conduct meta-analysis by clinic setting alone. Therefore, no overall estimates for the individual clinic setting are possible. However, this one study was used to conduct meta-analysis for all sites combined (Figure 3 and 4). The results of this analysis are described elsewhere.

Quality assessment of studies. Results from the application of the QUADAS Question 14 quality assessment tool are as follows (see also Appendix C Evidence Figures - Figure 2): three (50.0 percent) of the studies clearly addressed the issue of spectrum bias(QUADAS Question 1); the selection criteria were only described in four (66.7 percent) of the studies (QUADAS Question 2) and in remaining two (33.3 percent) of the studies it was difficult to assess the selection criteria; four (66.7 percent) of the studies described if the reference standard was likely to correctly classify the HF (QUADAS Question 3); three (50 percent) of the studies clearly described the issue of disease progression (QUADAS Question 4); the reference standard was described in sufficient detail (QUADAS Question 9) in five (83.3 percent) of the studies; interpretation of the peptide marker (BNP or NT-proBNP) measurement was clearly made without knowledge of the reference test results (QUADAS Question 10) in five (83.3 percent) of the studies; interpretation of the reference test results was clearly made without knowledge of the B-type natriuretic peptide marker results (QUADAS Question 11) in all of the studies; five (83.3 percent) of the studies stated whether or not the clinical data was available when the B-type natriuretic peptide test results were interpreted as would be the case when the test is used in practice (QUADAS Question 12); one (16.7 percent) of the studies reported uninterpretable or intermediate test results (QUADAS Question 13); and withdrawals were explained in five (83.3 percent) of the studies (QUADAS Question 14). Overall, the quality of these studies was good.

Question 2aiii: Primary Care

Sample and Design Characteristics of Studies

There were seven papers eligible for review that selected patients from a primary care setting. Five of these studies were cross-sectional in design34, 65– 68 and one was a RCT64. There was one study that selected patients randomly and identified a high risk cohort group.63

Two of the studies restricted their recruitment by age; one to 40 years of age and above,64 and one to more than 45 years of age.63 One study presented data stratified by gender.65 The RCT64 examined the effect of BNP measurement on diagnostic accuracy in primary care. All patients had BNP measured, but the groups were randomized as to whether or not the primary care physician received the results. Nevertheless, this paper is useful because the BNP concentrations can be compared against the reference standard of HF (expert diagnosis) in both arms of the study.

Two studies34, 67 either did not provide estimates of the diagnostic performance of the BNP test, or presented the data in a manner such that these diagnostic characteristics could not be calculated.

Diagnosis of HF in studies. Four studies used evaluation of left ventricular systolic function by echocardiogram as the reference standard for HF.63, 65, 67, 68 Three used LVEF of less than or equal to 40 percent,63, 67, 68 one used LVEF less than or equal to 45 percent,65 and one34 did not state the reference cut point. Another study used x-ray or echocardiogram with evidence of pulmonary edema or cardiomegaly as the reference cut point.66 The European Society of Cardiology criteria were used as the reference standard for the RCT study.64

Diagnostic properties. Table 6 presents the results to answer the question, “What are the clinical performance characteristics of both BNP and NT-proBNP measurement in patients with symptoms suggestive of HF or with known HF presenting to a primary care physician?” Three papers evaluated only BNP,34, 66, 67 three evaluated only NT-proBNP,64, 65, 68 and one evaluated both.63 Two of seven failed to indicate the cut point for BNP or NT-proBNP used. Two studies presented data for more than one cut point.65, 66 The range in cut points were 10 to 115 pg/mL for BNP and 67 to 338 pg/mL for NT-proBNP. Where possible, sensitivities, specificities, and LRs, either reported or calculated, are presented. Area under the ROC curve is presented when reported. Sensitivity ranged from 66 to 92 percent for BNP and 80 to 100 percent for NT-proBNP. Specificity ranged from 18 to 88 percent for BNP and 18 to 84 percent for NT-proBNP. For BNP, LR+ ranged from 1.12 to 6.71 and LR- ranged from 0.022 to 0.75. For NT-proBNP the LR+ ranged from 1.22 to 5.7 and the LR- ranged from 0 to 0.27.

Table 6. Diagnostic properties of studies that evaluated BNP and NT-proBNP in patients with symptoms suggestive of HF in primary care settings.

Table 6

Diagnostic properties of studies that evaluated BNP and NT-proBNP in patients with symptoms suggestive of HF in primary care settings.

Meta-analysis was done on two studies63, 66 for BNP and three studies for NT-proBNP.63– 65 To maximize sensitivity the lowest cut point was used if data for multiple cut points were given. The data for the meta-analysis are summarized in Table 7 and the results presented in Figures 3 and 4. Since there were few studies available, the accompanying pooled summary statistics must be interpreted with caution. However, looking at the three studies for NT-proBNP the DOR summary estimate was 17 (95 percent CI: 9 to 32) whereas it was only 2 (95 percent CI: 1 to 6) for BNP. Furthermore, in the Hobbs study63where both BNP and NT-proBNP were measured, the DOR was about eight times higher (2 and 17 for BNP and NT-proBNP, respectively). Tests for heterogeneity were not significant for either of the B-type natriuretic peptides for the pooled LR- or DOR.

Quality assessment of studies. Results from the application of the QUADAS Question 14 quality assessment tool are as follows (see also Appendix C Evidence Figures - Figure 3): six (85.7 percent) of the studies clearly addressed the issue of spectrum bias(QUADAS Question 1); the selection criteria were only described in six (85.7 percent) of the studies (QUADAS Question 2); all of the studies described if the reference standard was likely to classify the HF properly (QUADAS Question 3); five (71.4 percent) of the studies clearly described the issue of disease progression (QUADAS Question 4); the reference standard was described in sufficient detail (QUADAS Question 9) in six (85.7 percent) of the studies; interpretation of the peptide marker (BNP or NT-proBNP) measurement was clearly without knowledge of the reference test results (QUADAS Question 10) in all of the studies; interpretation of the reference test results was clearly without knowledge of the B-type natriuretic peptide marker results (QUADAS Question 11) in all of the studies; six (85.7 percent) of the studies stated whether or not the clinical data was available when the B-type natriuretic peptide test results were interpreted as would be the case when the test is used in practice (QUADAS Question 12); five (71.4 percent) studies reported uninterpretable or intermediate test results (QUADAS Question 13) and; withdrawals were explained in five (71.4 percent) studies (QUADAS Question 14). Overall, the quality of these studies was good.

Question 2aiv: Long Term Care Setting

No papers were identified in the screening process that examined the question “What are the clinical performance characteristics of both BNP and NT-proBNP measurement in patients with symptoms suggestive of HF or with known HF presenting in long term care settings?”

Question 2av: All Settings Combined

Meta-Analysis

We chose studies for meta-analysis from Questions 2ai, 2aii and 2aiii where sufficient information was presented to allow calculation of sensitivity, specificity, LRs and DOR for as many diagnostic cut points as were presented (Table 7). Using this information, we developed summary estimates of these parameters (Figures 3, 4 and 5) as well as summary receiver operator characteristic (SROC) curves (Figure 6). In the pooling these data, we observed significant heterogeneity. As a result, we tried to explore the sources of the heterogeneity using meta-regressions and stratifications. We evaluated potential sources of heterogeneity for B-type natriuretics by stratifying groups according to the following factors: a) study setting (clinic, emergency department, and primary care), b) study design (cross-sectional, prospective cohort, randomized trials, and diagnostic types), c) sample size (greater than or equal to 500 and less than 500), d) comparison to reference standard (LVEF, compared to other signs and symptoms, and HF defined by clinical criteria), and e) cut points (exactly 100 pg/mL, greater than 100 pg/mL, and less than 100 pg/mL). Across the 5 different metrics of diagnostic accuracy (sensitivity, specificity, LR+, LR-, DOR), many of these were observed to be positive for heterogeneity, suggesting that no single factor helped to explain the variation between studies (Appendix C, Tables 1326 detail the results of the heterogeneity tests by factors). The small number of studies within each of the various categories was also a limiting factor in exploring the relative contribution of these covariates to the observed heterogeneity.

Figure 6. Summary ROC curves for a) BNP and b) NT-proBNP from all settings using the lowest cut point provided in each study.

Figure

Figure 6. Summary ROC curves for a) BNP and b) NT-proBNP from all settings using the lowest cut point provided in each study.

We also used the Moses-Littenberg regression model to develop a summary ROC curve and test for the presence of a threshold effect. Using both weighted and unweighted regressions, the slope parameter was small and not statistically significant (BNP p = 0.4183, NT-proBNP p = 0.3430), thus indicating the lack of a threshold effect. These data show that despite the various cut points and patient cohorts studied there was fairly high concordance among the studies.

Figure 6 presents the summary ROC curves for BNP and NT-proBNP. In both cases the curve tends strongly towards the upper left hand corner. The cut points ranged from 10 to 200 pg/mL (mean = 95 pg/mL) for BNP and 125 to 1691 pg/mL (mean = 642 pg/mL) for NT-proBNP. Sensitivities for BNP and NT-proBNP ranged from 50 to 99 percent and 83 to 99 percent, respectively. Specificities for BNP and NT-proBNP ranged from 19 to 97 percent and 46 to 89 percent, respectively. The areas under the curves, however, are 0.86 for both BNP and NT-proBNP, suggesting that regardless of the clinical setting, the cut point chosen, or the test used, measurement of B-type natriuretic peptides are useful in the diagnosis of HF. The standard error (SE) for the BNP AUC was slightly higher than for the NT-proBNP AUC (0.068 compared to 0.034, respectively). There are two noted outliers in the BNP SROC with respect to sensitivity that can account for this. One is from the clinic setting58 and the other is from the primary care setting.63 Although there were two studies from the primary care setting in the SROC, the study that appeared as an outlier selected patients who were at high risk for HF (prevalence = 7.5 percent) compared to the other study which selected patients suspected of having HF (prevalence = 32 percent).66

Question 2b: Does Measurement of BNP or NT-proBNP Add Independent Diagnostic Information to the Traditional Diagnostic Measures of HF in Patients with Suggestive HF?

Study Characteristics

To address this question, data were abstracted from studies included in Question 2a. These included evaluation of multivariate analysis to quantify the independent contribution of the B-type natriuretic peptides for the diagnosis of HF.

For the review of reviews a total of 145 reviews were evaluated for relevance by examining the titles and abstracts and 13 reviews were retrieved for full text screening.151, 163, 173– 183 One additional review was obtained by the local expert panel,184 bringing the total to fourteen. However, only nine reviews met our inclusion criteria for data abstraction.

Multivariate Analyses

It is recognized that clinicians request more than a single test, which are typically not independent of each other. Thus, methods that adjust for multiple tests such as, multivariate regression analysis, may assist in evaluating the independence of all the tests used within the same study. These analyses also provide estimates of the independent ability to “predict” the probability of the disease of interest while controlling for other tests. Limitations of multivariate analyses include sample sizes and the number of variables included in the model.185

Studies from Questions 2ai, 2aii and 2iii that performed multiple linear regression or multiple logistic regression to assess the value of B-type natriuretic peptides for the diagnosis of HF were brought forward into Question 2b. Nine papers from 2ai (emergency department) met this requirement. Eight of these studies used BNP17, 18, 49, 51, 54– 57 and one used NT-proBNP53 for HF diagnosis. Four of the BNP studies were from the Breathing Not Properly Cohort.18, 49, 51, 54 There were no studies from 2aii (specialized or outpatient clinic) or 2aiii (primary care) with multivariate analysis data.

The diagnostic measures considered in this section included: clinical signs and symptoms (dyspnea, edema, rales, orthopnea, increased jugular venous pressure (JVP), S3 heart sound and murmurs); other objective diagnostic measures (chest X-ray echocardiography, myocardial radionuclide angiogram, cardiac catheterization, MRI, CT scan); and composite scoring systems (NHANES score, Framingham score, NYHA class, and clinical judgment). In eight studies the NHANES and Framingham composite scoring systems and LVEF less than 40 percent were used as the reference standard to establish the diagnosis of HF.17, 18, 51, 53– 57

Two papers49, 54 used clinical judgment as a composite measure, five18, 51, 53, 56, 57 used edema, four17, 18, 53, 55 used increased JVP, four18, 53, 55, 57 used rales, three17, 53, 56 used orthopnea, and two used gallops or murmurs53, 56 as variables in the regression analysis. X-ray measures included four papers on pulmonary venous hypertension,18, 55, 56, 186 three papers on cardiomegaly,51, 53, 55 and two on x-ray edema.17, 51 Seven of the papers report the results of multiple logistic regression as odds ratios (ORs) or exponential β, two use chi square, and one used diagnostic accuracy.

Table 8 presents the results of the data abstraction for this section. In cases where the results are expressed as ORs with 95 percent CI, BNP appears to add significant information to the diagnosis of HF that is independent of other diagnostic measures. The ORs associated with BNP ranged from 12.3 (95 percent CI: 7.4 to 20.4) to 221 (95 percent CI: 24.6 to 1983.1), and were usually larger than the other diagnostic measures in the study. The single paper reporting the results of NT-proBNP53 gave an OR of 8.9 (95 percent CI: 3.9 to 20.5)

Table 8. Studies that performed multivariate analyses to compare the independent contribution of BNP and NT-proBNP with other diagnostic tests.

Table 8

Studies that performed multivariate analyses to compare the independent contribution of BNP and NT-proBNP with other diagnostic tests.

Those publications that reported the results as chi-square or diagnostic accuracy, also suggest that BNP measurement adds significant information to the diagnosis. This suggests that BNP and NT-proBNP measurement can add independent diagnostic information beyond that which is available from the traditional diagnostic measures.

Comparison of Estimates of DOR and SROC for Different Tests for HF Based on the Review of Reviews

Review characteristics. A total of 14 reviews evaluated some aspect of tests used for HF. Table 9 describes the characteristics of these reviews. Of these 14, nine151, 173, 174, 177, 179, 181– 184 contained information that was useful and pertinent to this review. Five studies were excluded from further analysis. Three163, 176, 178 were systematic reviews, but did not examine the diagnosis of heart failure. The remaining two175, 180 were not systematic reviews.

Table 9. Characteristics of the systematic reviews of diagnostic tests for HF that were eligible for this review.

Table 9

Characteristics of the systematic reviews of diagnostic tests for HF that were eligible for this review.

Two reviews174, 184 considered patients in all settings, one182 considered only emergency department patients, one177 primary care only, one183 considered both primary care and emergency department patients, and three151, 173, 179 did not specifically state a clinical setting. One review181 selected studies “on the basis of quality and relevance to primary care”. Three reviews182– 184 have examined the value of BNP and NT-proBNP measurement in the diagnosis of HF compared to other diagnostic measures. One review 174 examined BNP alone, and one151 examined BNP and “related peptides” alone. One review177 examined the 12-lead electrocardiogram (ECG) only, and one181examined the clinical exam, x-ray and ECG. Neither of these studies provided a comparison to B-type natriuretic peptides, but both present useful supporting evidence for the discussion.

The National Health Service Quality Improvement Scotland Technology Assessment Report #6183 examined the role of B-type natriuretic peptide measurement and ECG in primary care. They concluded that BNP or NT-proBNP is superior to machine-read ECGs, but equivalent to an accurate physician-interpreted ECG in deciding which patients to refer to echocardiography. A systematic review of the 12-lead ECG for the evaluation of suspected HF177 concludes that this is an inadequate tool to screen for those patients that require echocardiography.

Doust et al.184 prepared a systematic review for the National Institute for Clinical Studies in Melbourne Australia. The results of this review are difficult to interpret because no pooled estimates of the data are presented. Nevertheless, they conclude that most signs and symptoms lack both the sensitivity and specificity required for the diagnosis of HF. Tachycardia at rest, elevated JVP, displaced apex beat, and added heart sounds are the most specific. A normal ECG will rule out HF, but may require specialist interpretation. An abnormal chest x-ray is useful only when accompanied by an abnormal ECG. They further conclude that B-type natriuretic peptide measurement is the most valuable tool in ruling out HF, because of its high negative predictive value.

Wang et al.182 reviewed papers that assessed the diagnosis of HF in patients presenting to the ED with dyspnea. The features that increased the probability of HF were S3 gallop, chest x-ray showing pulmonary venous congestion and an ECG showing atrial fibrillation. Those that decreased the probability were an absence of rales, a normal response to the Valsalva maneuver, absence of cardiomegaly or edema on x-ray and a normal ECG. A serum BNP less than 100 pg/mL proved to be the most useful tool in ruling out HF (LR-.011, 95 percent CI: 0.07 to 0.16).

Table 10 outlines the diagnostic tests examined and the performance characteristics in each of the reviews. To compare the performance of diagnostic tests between reviews, we chose to use the DOR. This performance characteristic is the single most useful measure of diagnostic performance and the most easily comparable between studies and reviews, partly because it is relatively insensitive to the decision threshold chosen in each study. In cases where the DOR was not presented, we estimated the DOR from the sensitivity and specificity or positive and negative LR.

Table 10. Diagnostic performance estimates of BNP and NT-proBNP compared to other diagnostic tests based on previous systematic reviews.

Table 10

Diagnostic performance estimates of BNP and NT-proBNP compared to other diagnostic tests based on previous systematic reviews.

Three reviews151, 173, 174 considered only BNP or NT-proBNP without comparison to other tests. In these studies the DOR or the estimated DOR for BNP ranged from 31 to 569. The single review that examined NT-proBNP has an estimated DOR of 230. BNP and NT-proBNP were compared to other diagnostic tests in three other reviews.182– 184 In these reviews the DOR for BNP ranged from 10 to 498, and the single review comparing NT-proBNP to other tests had a DOR of 14.

B-type Natriuretic Peptides Compared to Other Diagnostic Measures

In our assessment of the primary research studies of the diagnostic properties of BNP and NT-proBNP, we looked for papers that compared the B-type natriuretic peptides against a number of reference measures. Therefore, in our examination of reviews, we also looked for studies that measured the independent contribution of these peptides against these reference measures. No reviews compared the diagnostic performance characteristics of BNP or NT-proBNP against myocardial radionuclide angiography, cardiac catheterization, MRI, or CT scan.

Three reviews182– 184 compared BNP to abnormal ECG and one177 examined ECG abnormalities alone. The estimated DOR ranged from 3 to 223 whereas the DOR for BNP in the same studies ranged from 10 to 498. The DOR for an abnormal ECG exceeded the DOR for BNP in only one case183 (12.4 vs. 10.4), but the 95 percent CIs were overlapping. Similarly the DOR for NT-proBNP was similar to that for an abnormal ECG (14.9 vs. 12.4). One systematic review181 describes an abnormal ECG as having a high sensitivity but poor specificity and useful for confirmation of diagnosis only but no numerical values were provided.

Two parameters of the chest x-ray (evidence of pulmonary venous hypertension and evidence of cardiomegaly), had diagnostic importance and were examined in three studies.181, 182, 184 The estimated DOR for these two tests were 3 to 28 for pulmonary venous hypertension and 2 to 10 for cardiomegaly. Again this is less than the DOR of BNP in the same studies.

Any abnormality in the clinical exam, history of paroxysmal nocturnal dyspnea, S3 gallop, and increased JVP were the components of the clinical exam that had diagnostic usefulness. Three reviews182– 184 compared these parameters to BNP. In all cases the estimated DOR for these tests was less than the estimated DOR for BNP.

Quality Assessment of Reviews

The quality of the reviews was assessed by the STEP Questionnaire.169 The quality of reviews varied from good to poor. Three reviews173, 182, 183 were obviously of higher quality than the rest. These reviews clearly stated the main question, the clinical population and the main comparators. Inclusion and exclusion criteria were plainly stated and it is unlikely that relevant studies were missed. Assessments were made by at least two reviewers and the results were presented clearly. Meta-analysis was performed where appropriate. Two reviews177, 184 were of good quality, but the results were presented in a less clear manner. Five further reviews151, 174, 175, 180, 181 were of lesser quality.

Question 3a: Do BNP or NT-proBNP Levels Predict Cardiac Events in Populations at Risk of CAD, with Diagnosed CAD and HF?

A total of 150 studies were eligible for evaluating the prognostic ability of BNP or NT-proBNP levels in HF patients to predict cardiac events of interest for this review. For the purposes of this review we have limited the findings to four major groups and these include, people at risk of CAD, those diagnosed CAD or HF, and general populations for screening. A small group (n = 40) of studies included populations in ICU, with pulmonary embolism, stroke, or renal failure; the data from these specialized groups will not be presented here.

Question 3ai: At risk of CAD

Design and Sample Characteristics of Studies

The prognostic value of BNP or NT-proBNP was examined in 12 studies of people with risk factors for CAD, ten were prospective cohorts,9, 10, 15, 24, 69– 74 one was a RCT,4 and one was cross-sectional.6 Sample sizes ranged from 1119 to 3346.74 Most study participants were between 50 and 75 years of age, although eight studies included patients over 50 years.6, 10, 15, 24, 69, 71, 73, 74 Four studies included participants under 75 years of age.9, 10, 24, 71 The widest age range, (19 to 105 years) was from an Irish study71 in an emergency room. In one study,70 only the mean age of participants was reported so it was not possible to assess the age range of the sample. The percentage of males in the studies ranged from 21 percent9 to 96 percent.69 Follow up averaged 8 to 9 years in two studies15, 70 and 8 to 12 days in another.71 Excluding the cross-sectional study,6 and another study for which the follow up time was reported as “until discharge,”72 follow up time ranged from approximately 1 to 5 years.4, 9, 10, 24, 69, 73, 74

CAD Risk Factors

Study participants were recruited in an emergency room or when admitted to a cardiac care unit. CAD risk factors included diabetes,15, 69, 70, 74 suspicion of cardiac dysfunction,69 ACS or chest pain,24 suspected heart disease,9, 71, 74 cardiac arrest with cardiac cause,72 hypertension,4, 10, 74 prior MI,74 aortic stenosis or aortic valve replacement,6 left atrial enlargement or left ventricular hypertrophy,74 significant heart disease upon admission, or a cardiac event within 90 days of admission.73

CAD risk factors were assessed using a mixture of electrocardiography, chest x-ray, clinical examination, LVEF, or NYHA classification criteria. Diabetes was assessed by clinical criteria such as persistent macroalbuminuria (> 300 mg/24 hours) in at least two out of three consecutive 24-hour urine collections, with the presence of diabetic retinopathy and the absence of other kidney or urinary tract disease. Hypertension was defined as systolic blood pressure greater than or equal to 140 mmHg or diastolic blood pressure greater than or equal to 90 mmHg.

BNP and NT-proBNP Tests and Threshold Values

BNP was measured using the Triage BNP method in two studies,69, 71 and the Shionoria-IRMA method in four studies.9, 10, 72, 74 NT-proBNP was measured using the Elecsys method in six studies.4, 6, 15, 24, 70, 73

BNP or NT-proBNP cut point values were not uniform; six studies6, 9, 24, 69, 71, 73 reported multiple and six6, 24, 70, 71, 73, 74 reported single cut points. Cut points were chosen based on median or percentile levels of fasting plasma NT-proBNP4, 70 or plasma BNP,9, 74 mean BNP,10 ROC curves,6, 72, 73 information from the test package insert,71 or miscellaneous external sources.15 In two studies, the selection of cut points was arbitrary or unexplained.24, 69

Definition of Outcomes

Death was a primary outcome in 10 studies,4, 9, 10, 15, 24, 69– 71, 73, 74 although in four 4, 10, 70, 73 it was part of a composite outcome including other cardiovascular events (e.g., non-fatal MI). Death was limited to cardiovascular events in four studies4, 10, 70, 73 and included all-cause mortality in six studies.9, 15, 24, 69, 71, 74 Deaths were ascertained using public records (e.g., death certificates) in four studies,15, 24, 69, 73 while in three there was mention of “clinical assessment” of cardiovascular outcomes (including death).4, 10, 74 Assessment of death was not described in three studies.9, 24, 71

The two studies with non-death outcomes were the only studies that enrolled patients with prior surgeries. In the first study,72 27 to 40 percent of patients had an intra-aortic balloon counterpulsation or a coronary revascularization. The outcomes ranged from survival to hospital discharge. In the second study,6 12 percent of patients had coronary artery bypass graft (CABG) and 9 percent had percutaneous coronary intervention. The outcome was the severity of aortic stenosis. Outcomes other than death were measured by clinical tests such as the mean transvalvular pressure gradient6 or LVEF.73

Four studies had secondary outcomes, including non-cardiac causes of death69 and cardiovascular mortality plus hospitalization for HF.70 One study74 had three secondary outcomes: MI, heart disease, and atrial fibrillation. The fourth study72 had four secondary outcomes: return of spontaneous circulation, hospital admission, 24-hour survival, and favorable neurologic outcome after discharge.

Adjusted Results — Multiple Regression Analysis

Eleven of the 12 studies (Neilson et al.73 excepted) featured regression analysis to examine the association between levels of BNP or NT-proBNP and the outcome of interest. In two of the 11 studies, one BNP69 and one NT-proBNP,6 the reported regression results consisted only of p-values or chi-square test statistics. See Tables 11 and 12 for summary results for all 12 studies.

Table 11. Summary of studies in patients with risk of CAD: BNP.

Table 11

Summary of studies in patients with risk of CAD: BNP.

Table 12. Summary of studies in patients with risk of CAD: NT-proBNP.

Table 12

Summary of studies in patients with risk of CAD: NT-proBNP.

BNP was treated as a categorical variable in four studies,9, 71, 72, 74 with the categories based on the cut points discussed above. Higher levels of BNP were consistently found to be positively associated with all-cause mortality or the occurrence of cardiac events (e.g., HF).9, 71, 74 The adjusted measures of association (OR or hazard ratio (HR)) ranged from 1.10 to 4.26 and did not appear to differ by outcome in two studies (point estimates < 2.00 in both studies).9, 74 In one mortality study,71 though, the estimated OR for BNP (≥ 700 pg/mL versus < 700 pg/mL) was found to be much larger at 4.26.

In the final study where BNP was treated as categorical,72 the outcome was survival to hospital discharge. For the study participants, some of whom had prior surgery, all of the adjusted, estimated ORs were less than 1.00. Since the ORs decreased as the level of BNP increased, higher levels of BNP were negatively associated with survival.

The results of the study10 where BNP was treated as a continuous variable showed that higher values of BNP were positively associated with the occurrence of cardiovascular events (adjusted risk ratio (RR) = 1.02; 95 percent CI: 1.01 to 1.02).

NT-proBNP was also treated as a categorical variable in four studies.4, 15, 24, 70 Again, the categories were based on the cut points discussed above. Higher levels of NT-proBNP were consistently found to be positively associated with composite endpoints that included both cardiovascular mortality and other cardiac events such as non-fatal MI.4, 70 Positive associations were also found between levels of NT-proBNP and all-cause mortality.15, 24 The adjusted measures of association (RR or HR) ranged from 1.85 to 5.40, with a concentration between 2.8 and 3.6.

In 10 of the studies, all of the adjusted measures of association (i.e., OR, HR, RR) were statistically significant at the 5 percent level. In the other two studies,9, 24 all of the measures except for three were also statistically significant at the 5 percent level.

Several variables were included in many of the regression models as covariates, including age, gender, blood pressure, cholesterol level, left ventricular mass or function, and smoking.

Quality Assessment of Studies

Selection and information bias are two common threats to the internal validity of observational studies. One method of minimizing selection bias is to evaluate all patients who present at the research site between a certain set of fixed dates. Patients who meet the inclusion criteria are then entered into the study. This method of enrollment — often called ‘consecutive enrollment’ — helps to prevent a conscious or unconscious bias from affecting the selection of patients into the study. The bias would occur if, for example, patients with multiple risk factors for CAD, or with higher risk factors for CAD, were the only persons selected for the study. In the 12 studies of persons with risk factors for CAD, the authors of only four studies15, 24, 71, 73 explicitly wrote that patients were enrolled consecutively. The other eight studies contained no mention of consecutive enrollment. The authors of future studies should be explicit about the order of patient enrollment so as to allow readers to assess study quality.

Information bias occurs when study subjects are misclassified on their exposure or disease status. Misclassification can occur due to random chance (e.g., inaccurate measures of BNP or NT-proBNP), or because the persons who take study measurements have knowledge of a subject's exposure and disease status. For example, knowing that subjects have very high levels of BNP or NT-proBNP could trigger additional clinical investigations that lead to what would have otherwise been unmade diagnoses or treatments. This could inflate the association between BNP or NT-proBNP and the outcome of interest. Blinding is one way to avoid the problem. In the 12 studies involving persons at risk for CAD, the authors of five publications4, 6, 70, 73, 74 reported that blinding had occurred and the authors of seven studies did not mention anything about blinding. Again, authors should be explicit about what they do (blinding, no blinding) so as to facilitate the assessment of study quality.

The absence of information about consecutive enrollment or blinding makes the presence of bias impossible to rule out for a majority of the studies. The same absence of information prevents the extent of any bias from being ascertained.

Question 3aii: With diagnosed CAD

Design and Sample Characteristics of Studies

Thirty-eight studies examined the association between BNP or NT-proBNP and outcomes such as mortality or re-infarction in persons with CAD. Twenty-eight of the studies were prospective cohorts,3, 13, 19– 22, 29, 33, 75– 94 nine were RCTs,8, 14, 27, 28, 95– 99 and one was cross-sectional.100 Sample sizes ranged from a low of 1494 to a high of 7800.8, 97 The mean age of study participants was clustered around 55 and 65 years, with a range of approximately 40 to 70 years. The widest age range spanned 54 years and was observed in two studies (21 to 75 years,27 and 26 to 80 years80). The percentage of males in 31 of the studies ranged from a low of 45 percent3 to a high of 100 percent.91 The percentage was not reported in five studies.20, 22, 78, 83, 94 Lengths of follow up varied greatly between the studies. The mean, median, or maximum follow up was up to and including 6 months in 12 studies,3, 21, 27, 28, 76, 86, 87, 90, 92, 93, 96, 99 7 to 12 months in eight studies,8, 33, 77, 78, 83, 91, 95, 97 13 to 24 months in eight studies,14, 20, 29, 75, 84, 85, 89, 94 and more than 24 months in eight studies.13, 22, 79– 82, 88, 98 Follow up time was not reported in two studies.19, 100 The shortest follow up time was 72 hours99 and the longest was 4.9 years.88

CAD Diagnosis

Study participants were recruited after admission to hospital for a CAD related event. CAD related events included MI (16 studies),3, 13, 27, 29, 33, 77, 78, 80, 82, 84, 85, 88, 91, 92, 95, 100 angina (10 studies),8, 20, 76, 77, 82, 85, 92, 95– 97 ischemia (five studies),22, 81, 90, 96, 100 chest pain (three studies),21, 77, 86 stenosis (three studies),19, 79, 85 LVD (one study),98 ACSs (one study),3 cardiac arrest (one study),99 and hypertension (one study).89 More than one CAD event was involved in several studies.

CAD was diagnosed with a plethora of clinical tests such as electrocardiography, ST elevation, development of left bundle blockade, rises in creatinine kinase levels, T-wave inversion, and blood pressure. In some studies,14, 28, 83, 87 persons were enrolled on the basis of whether test results exceeded a certain threshold value (e.g., ST elevation ≥ 1 mm). In other studies, persons were enrolled if they were undergoing percutaneous transluminal coronary angioplasty (PTCA)75 or CABG.91, 93, 94, 99

BNP and NT-proBNP Tests and Threshold Values

BNP was measured using the Triage method in nine studies,3, 19, 28, 29, 77, 78, 83, 95, 96 the Shionoria-IRMA method in seven studies,13, 33, 79, 88, 90, 93, 94 and the ADVIA Centaur method in one study.27 NT-proBNP was measured using either the Elecsys system in 13 studies8, 14, 21, 22, 75, 76, 85– 87, 92, 97, 99, 100 or by a variety of other methods in seven studies.20, 80– 82, 84, 89, 98

BNP or NT-proBNP cut points were not uniform. Twenty-four of the studies3, 13, 19– 21, 27– 29, 75, 77– 79, 81– 83, 87– 90, 93, 95, 96, 98, 100 reported a single cut point and six8, 76, 86, 91, 92, 97 reported multiple cut points. Another six studies reported a single cut point, but the analyses were stratified by disease,33, 84, 85 gender,22 BNP versus NT-proBNP,80 or time period during follow up.14 The cut points were based on the medians or quartiles of measured BNP or NT-proBNP in the study participants,8, 13, 14, 19, 22, 75, 76, 78, 80– 82, 85– 87, 89, 95, 97, 98, 100 ROC curves,3, 21, 27, 29, 33, 79, 84, 88, 91, 92 previously published literature,20, 28, 77, 83, 90, 96 or regression analysis93Cut points were not reported in two studies.94, 99

Definition of Outcomes

Primary outcomes were death in 32 studies,3, 8, 13, 14, 19– 22, 27– 29, 75– 77, 80– 90, 92, 94– 98, 100 non-fatal MI in 15 studies,8, 14, 21, 22, 75, 77, 80, 83, 85, 86, 92, 95– 98 HF or cardiogenic shock in 10 studies,28, 33, 78, 80, 83, 87, 89, 90, 92, 100 re-infarction in four studies,19, 28, 29, 100 repeat hospitalizations for ACS in five studies,28, 80, 91, 95, 96 angina in three studies,29, 79, 94 ischemia in two studies,33, 87 miscellaneous cardiovascular complications (e.g., arrhythmia, cardiogenic shock) in two studies,91, 92 BNP or NT-proBNP levels in two studies,93, 99 and LVD in one study.33 Twenty-five studies8, 14, 19, 21, 22, 28, 29, 33, 75, 77, 80, 83, 85– 87, 89– 92, 94– 98, 100 included more than one outcome or had a composite outcome that was formed by aggregating two or more single outcomes. The outcomes were ascertained using clinical definitions (e.g., LVEF < 35 percent) in 20 studies.3, 8, 21, 29, 33, 75, 77– 79, 83, 84, 87, 89– 92, 97– 100 In the other 18 studies,13, 14, 19, 20, 22, 27, 28, 76, 80– 82, 85, 86, 88, 93– 96 the authors simply named the outcomes without specifying how they were assessed. This lack of specification was often the case with mortality: authors did not describe their method (e.g., review of death certificates) of determining whether and why a participant died.

Nine studies had secondary outcomes, including death,33, 88, 97 coronary HF or cardiogenic shock,78 recurrent MI,90 poor myocardial perfusion or failed ST segment resolution,27 recurrent ischemic events and severe HF,76 stroke,98 and thrombosis in MI.3

Adjusted Results — Multiple Regression Analysis

Thirty-three of the studies used regression analysis to examine the association between levels of BNP or NT-proBNP and the outcome of interest. Multiple regression was used in 31 of the studies3, 8, 13, 14, 19– 22, 27, 28, 33, 75– 78, 80– 90, 93, 96– 98, 100 and simple regression was used in two of the studies.79, 94 Logistic regression was the sole approach in 17 studies.3, 8, 19– 21, 27, 28, 33, 76– 78, 84, 86, 87, 90, 96, 100 Logistic regression was also employed with Cox regression in two studies75, 97 and with linear regression in two studies.14, 89 Cox regression was utilized alone in ten studies.13, 22, 79– 83, 85, 88, 98 Regression analysis was not used in five studies.29, 91, 92, 95, 99 The studies were stratified according to type of B-type natriuretic peptide (BNP or NT-proBNP). Further stratification was done according to whether patients received prior cardiac related surgery (yes/no) and whether the outcome was mortality or non-fatal event (e.g., MI). For BNP, measures of association (OR, HR, etc.) ranged from 1.60 to 16.30 in studies of prior surgery patients and mortality3, 28, 77, 96 (Table 13). The measures of association were concentrated in the range of 1.60 to 2.96 and they were statistically significant at the 5 percent level in three of the four studies.3, 28, 77 In two studies of prior surgery patients and non-fatal outcomes, point estimates of the measures of association were 3.928 and 41.1279 (Table 14). Both estimates were statistically significant at the 5 percent level.

Table 13. Summary of studies in patients with CAD with surgery: BNP.

Table 13

Summary of studies in patients with CAD with surgery: BNP.

Table 14. Summary of studies in patients with CAD, no surgery: BNP.

Table 14

Summary of studies in patients with CAD, no surgery: BNP.

The predictive ability of BNP was found to be 2.53 (HR)13 or 7.20 (OR)27 in two mortality studies of patients with no prior cardiac related surgery. Three studies33, 78, 83 were conducted to examine non-fatal outcomes in patients with no prior surgery and the measures of association ranged from 1.01 to 3.03 (Table 14). The measures of association in all five studies were statistically significant at the 5 percent level.

Turning to NT-proBNP, the measures of association spanned from 1.33 to 6.63 in six mortality studies of patients with prior cardiac related surgeries, with most measures concentrated in the range of 1.33 to 3.42.8, 21, 75, 76, 80, 82 All of the measures were statistically significant at the 5 percent level in five studies. In the sixth study,76 only one of three ORs was significant at 5 percent. Studies8, 21, 80 of non-fatal outcomes in patients with prior surgery yielded measures of association that ranged from 1.01 to 3.51. However, the measures were statistically significant at the 5 percent level in only one study (Table 15).80

Table 15. Summary of studies in patients with CAD not surgery: NT-proBNP.

Table 15

Summary of studies in patients with CAD not surgery: NT-proBNP.

In studies of NT-proBNP as a predictor of mortality in persons who did not have a prior cardiac related surgery, the measures of association ranged from 1.01 to 19.70, with a concentration in the range of 1.50 to 4.80. The measures were all statistically significant at the 5 percent level in four studies,20, 22, 81, 84 two of the three measures were significant in one study,97 and none were significant in two studies.87, 98 In the case of non-fatal outcomes in persons who did not have a prior surgery, measures of association ranged from 0.64 to 5.90 and the concentration was between 1.30 and 5.50. The measures were all significant in two studies,84, 86 three measures of 13 were significant in three studies,85, 87, 97 and none were significant in two studies (Table 16).22, 98

Table 16. Summary of studies in patients with CAD no surgery: NT-proBNP.

Table 16

Summary of studies in patients with CAD no surgery: NT-proBNP.

In one89 of the two studies where linear regression was used as an analytic tool, NT-proBNP was found to be a predictor (p = 0.03) of left ventricular systolic volume after MI. However, the authors did not quantify the relationship by providing an estimated regression coefficient. In the other study, several variables were found to predict baseline levels of, and rates of change in, NT-proBNP.14 These variables were age, gender, diabetes, previous MI, cTnT level greater than or equal to 0.01 μg/L, calculated creatinine clearance less than 73 ml/min, C-reactive protein greater than 10 mg/l, ST-segment depression, and use of diuretics or nitrates on admission to hospital (Table 17).

Table 17. Summary of studies in patients with CAD no regression analyses.

Table 17

Summary of studies in patients with CAD no regression analyses.

Table 17 shows the studies for which no regression results were reported (even though the authors claimed to have conducted regression analyses) as well as the studies for which no regression analyses were performed.

Several variables were included in many of the regression models as covariates, including age, gender, biological markers (e.g., heart rate, blood pressure, baseline creatine kinase, LVEF), and disease history (e.g., history of MI, hypertension, diabetes).

Quality Assessment of Studies

The authors of 11 studies3, 20, 21, 33, 75, 77, 79, 82, 84, 91, 92 indicated that participants were consecutively enrolled in their research. The authors of the remaining 28 studies did not report whether or not enrolment was consecutive. For blinding, reporting was only slightly better as the authors of 14 studies3, 8, 13, 19, 21, 28, 29, 78, 81, 87, 92, 95, 97, 99 reported that outcomes were assessed in a blinded fashion, while the authors of the remaining 25 studies did not mention blinding in their published manuscripts. The lack of reporting in both areas meant that it was impossible to rule out the presence of selection or information bias in a majority of the studies.

Question 3aiii: With diagnosed HF

Thirty-eight publications evaluated BNP levels and 14 evaluated NT-proBNP and the association with cardiac events in patients with HF. The predictive value of BNP and NT-proBNP are presented separately with respect to the ability to predict future outcomes of interest. Six publications evaluated both BNP and NT-proBNP.41, 103, 112, 125, 126, 128 The findings from these studies are presented in the BNP section only.

Prognosis Studies Using BNP Levels

Design and Sample Characteristics of Studies. All the studies evaluating BNP levels were prospective cohort designs with the exception of three publications based on the ValHeFT106, 110 including a sub-study41which were randomized trials. The evaluation of the VaLHeFT cohort included both arms of the trial in the evaluations. In addition to the ValHeFT publications, an additional two publications reported on the same study cohort125, 128 with different follow up periods. Twelve studies12, 23, 104, 105, 107, 111, 113, 118– 120, 124, 129 recruited patients that were admitted as hospital inpatients for acute episodes; three studies contained patients recruited from both emergency and inpatients,102 and outpatient clinics and inpatients combined.25, 117 Twenty studies30, 36, 41, 101, 103, 106, 108– 110, 112, 115, 116, 121– 123, 125– 128, 130 indicated that patients were recruited from primary care, general population, specialty clinics and outpatient settings (see Tables 18 and 19). Three studies recruited patients with HF from emergency departments.32, 48, 114

Table 18. Summary of studies in patients with HF and mortality outcomes: BNP.

Table 18

Summary of studies in patients with HF and mortality outcomes: BNP.

Table 19. Summary of studies in patients with HF and mixed outcomes: BNP.

Table 19

Summary of studies in patients with HF and mixed outcomes: BNP.

Sample sizes ranged from a low of 3332 subjects to a high of 4300 subjects.106, 110 The mean sample size for all 36 studies was 327 (± 829) and the median was 102. The mean age of study participants was clustered around 65 years, with the widest range of 68 years113 ranging from 17 to 85 years. The percentage of males in 32 of the studies ranged from a low of 44 percent111 to a high of 100 percent.48, 119 The percentage was not reported in four studies.101, 106, 110, 122 One cohort102 based on the REDHOT study had a high proportion of African Americans (63.4 percent).

Lengths of follow up varied greatly between the studies, with the shortest time being 30 days119 and the longest being 92 months.126 For studies with short term follow up, the period varied from 30 days119 to 90 days32, 48, 102 and six months.107, 109, 111, 114, 124 For the remaining studies the mean or median follow up was 7 to 12 months,118, 130 13 to 24 months,12, 23, 30, 101, 103, 105, 115, 116, 125, 129 25 to 48 months,25, 36, 41, 104, 113, 117, 120, 121, 123 and greater than 48 months.126, 127 Several other studies did not report median or mean follow up times but rather end points of 12 months,112 18 months,108 36 months,110, 128 and 60 months122 or did not specify.106

HF Diagnosis and Severity. The diagnosis of HF was established in a number of ways, but predominately confirmed using echocardiography, carried out as part of the study, or obtained from previous medical records. The exceptions were three studies that used ventriculography41, 121, 187 and eight that used clinical evaluation (signs and symptoms).102, 103, 105, 108, 114, 122, 126, 188 All but two studies48, 107 reported some rating of the NYHA classification for all the subjects or a subgroup.

The majority of studies included subjects across all levels of the NYHA classification I-IV. The exceptions were eight studies23, 32, 108, 109, 119, 120, 123, 124 that restricted subjects to levels III-IV and one that possibly restricted to level IV.107 A single study121 enrolled subjects with level I and II; three other studies36, 127, 130 had predominately level I and II with less than 10 percent of subjects with level III, and none at level IV. Three studies48, 114, 122 did not specify the NYHA classification of their subjects.

LVEF was not reported in seven studies102, 104, 105, 111, 114, 122, 127. Mean LVEF percentages were reported in 22 studies and varied between 50 percent,115 40 to 49 percent,32, 129 30 to 39 percent,12, 23, 36, 107, 113, 118, 119, 123, 123, 126, 130 and 20 to 29 percent.25, 108, 109, 112, 117, 120, 125, 128 Seven studies reported a threshold LVEF of less than 45 percent,48, 101, 106, 110, 121 less than 40 percent,124 and less than 30 percent.30

BNP and Cut Points. BNP was measured using the Triage method in 11 studies48, 107– 109, 111, 112, 114, 119, 125, 128, 189 and by Shionoria-IRMA method in the remaining 25 studies (see Tables 18 and 19). Four studies112, 125, 126, 128 also measured NT-proBNP using the Biomedica method. Results are described in the NT-proBNP section of this report.

The rationale for selecting the BNP cut points differed and as such, the values varied significantly (Table 18 and 19). Sixteen studies12, 30, 32, 36, 101, 106, 110, 117– 121, 123, 124, 126, 130 selected the mean or median values as the cut point for categorizing high and low BNP groups. Eight studies23, 108, 114, 120, 125, 128, 129, 190 selected values based on ROC. Five studies categorized the sample BNP values into two levels,127 three levels,107, 109 four levels,119 and five levels.122 Six studies104, 105, 111– 113, 115 did not specify a threshold as they used the BNP values as a continuous variable in their statistical analyses. Three studies used other rationale for threshold selection including: previous literature reference,48 75th percentile value,116 and an unspecified internal analysis.102

Definition of Outcomes. All studies with the exception of one,48 had a primary outcome of mortality or a composite endpoint. These endpoints typically included death, other cardiac events, readmission or worsening HF. There were 21 studies12, 23, 30, 32, 36, 102, 104, 106, 109, 112, 114, 117, 119– 123, 125– 127, 130 that evaluated either all-cause mortality, cardiac related mortality or both. There were 25 studies23, 25, 101– 103, 105– 108, 110– 120, 123, 124, 126, 128, 129 that evaluated composite endpoints that included a mixture of fatal and non-fatal cardiac events. One study116 evaluated the performance of BNP relative to the Heart Failure Survival Score. Most authors did not specify how the outcomes of death were verified or subsequent events (such as other events or re-admission to hospital) were evaluated.

Adjusted Results — Multiple Regression Analysis. Twenty-nine studies undertook Cox proportional hazards regression analyses and five studies undertook logistic regression analyses to evaluate the relationship between BNP levels and various outcomes (Table 18 and 19). For the studies using Cox regression analyses, three studies32, 111, 115 presented only univariate analyses, 10 studies101, 105, 110, 112, 113, 122– 124, 126, 127 presented only the results from multivariate analyses, and the remaining 16 studies12, 23, 30, 36, 104, 106– 109, 117, 118, 120, 121, 125, 129, 130 undertook both univariate and multivariate computations. Four studies undertook multivariate logistic regression25, 48, 102, 116 and one study119 univariate logistic regression, to evaluate the strength of the association between levels of BNP and the outcomes of interest. A single study114 reported unadjusted RRs and another study121 undertook an unspecified type of linear regression.

BNP Studies with Mortality Outcomes. Table 18 details the 21 studies with the outcomes of all-cause mortality, cardiac mortality and sudden death. The results are expressed as both univariate and multivariate HR,12, 23, 30, 36, 104, 106, 109, 113, 117, 122, 123, 127 Chi square statistic and probabilities values,12, 32, 102, 120, 125, 126, 130 beta values,102, 130 and unadjusted RR.114 One study measured, but did not report, findings specific to mortality.119 Some studies102, 123, 125reported the estimates of risk based on the log of the BNP levels, which makes interpretation of the magnitude somewhat limited. In general, there were 11 studies12, 23, 30, 36, 106, 109, 113, 123, 125, 127, 130 that found baseline BNP levels to be significant predictors for mortality related outcomes after adjustment in multivariate models. For those studies that presented adjusted HR, the risk estimates varied from 2.48 (95 percent CI: 2.13 to 2.88)106 to 7.2 (95 percent CI: 1.6 to 32.1),30 with the majority point estimates clustering around 2.5 to 3.0. It should be noted that despite the differing cut points, (> 97 pg/mL,106 > 172 pg/mL,30 > 230 pg/mL,114 > 260.4 pg/mL,36 > 700 pg/mL,32 1000 pg/mL,109 > 346 pg/mL,127), the multivariate models found baseline BNP to be a significant predictor of mortality outcomes. All these regression models used a variety of variables within their models, which makes comparisons across studies challenging.

Six of the studies evaluating mortality recruited decompensated HF patients,12, 23, 102, 112 emergency department patients,32, 114 or mixed patients.114 Five of these studies reported 46 to 100 percent of patients in NYHA class III and IV suggesting relatively severe HF patients. Although all studies reported that BNP was a significant predictor of mortality, only one study23provided an estimate of the HR; the multivariate model included a variable of troponin T and baseline BNP. The log BNP had a HR = 3.11 (95 percent CI: 1.61 to 6.01, p = 0.0005).

One study reported unadjusted RR for HF death for baseline BNP greater than 230 pg/mL vs. less than 230 pg/mL: RR = 24.1 (95 percent CI: 6.3.5 to 491.1); for the outcome of cardiac death unadjusted baseline BNP greater than 230 pg/mL vs. less than 230 pg/mL: RR = 37.9 (95 percent CI: 5.7.5 to 755.8). The widely varying CI suggests instability with the estimate; therefore, results must be interpreted with caution.

There were six studies that found BNP levels to be not significant predictors of mortality based on univariate analyses alone,32, 126 or on multivariate analyses;30, 104, 117, 122 one study123 found sudden unexpected death to be significant but not HF death. One of these studies32 had a small sample size (n = 33) and 73 percent of patients were NYHA level IV. No clear trend from these studies can account for the non-significance.

BNP Studies with Composite Outcomes. Table 19 details the 27 studies with composite outcomes which included death in all but four publications.111, 113, 118, 128 The results are expressed as both univariate and multivariate HR,23, 25, 106, 110, 115, 117 chi square and probabilities values,113, 119, 121, 124, 128, 129, 191 and unadjusted RR.114 Some studies23, 102, 118, 119, 123, 128, 129 reported the estimates of risk based on log BNP levels, which makes comparison with studies not using log values difficult.

In general, there were 9 studies12, 30, 36, 106, 109, 123, 125, 127, 130 that found baseline BNP levels to be significant predictors for composite outcomes after adjustment in multivariate models. Two studies105, 111 with only univariate analyses showed baseline BNP levels to be not significant predictors and three studies (multivariate estimates106, 117 and univariate estimate115) showed only marginal significance.

For those studies that presented adjusted risk estimates for baseline BNP, the values varied from HR = 1.66 (95 percent CI: 1.36 to 2.04, p less than 0.0001)110 to RR = 3.23 (95 percent CI: 1.32 to 7.94, p = 0.01)25, with the majority point estimates clustering around 2.0. Three studies23, 108, 123 included estimates of HR that combined levels of troponin I, troponin T and LVEF with baseline BNP. One study114 reported unadjusted RR for HF events that varied from baseline BNP greater than 230 pg/mL vs. less than 230 pg/mL: RR = 15.5 (95 percent CI: 6.2 to 43.7) to BNP greater than 480 pg/mL vs. less than 230 pg/mL: RR = 8.2 (95 percent CI: 4.7 to 14.3).

Comparison of NT-proBNP and BNP. Six studies41, 103, 112, 125, 126, 192 evaluated both BNP and NT-proBNP levels within the same group of subjects. Van Beneden et al.126 compared a small number of subjects who had mild to moderate HF (NYHA I-II) with severe HF patients (NYHA level IIII-IV). Their univariate analysis showed that NT-proBNP was a significant predictor of mortality. However, no association between BNP and mortality was observed using either assay method. Two studies evaluated both B-type natriuretic peptides in predominately NYHA level II patients (approximately 78 percent). One study103 found log BNP levels significant in predicting worsening HF in both univariate and multivariate analyses while NT-proBNP was significant only in the univariate model. In contrast, a second study,41 with mild to moderate HF subjects, found that BNP and NT-proBNP levels differed in their ability to predict 4-year mortality with respect to whether baseline levels, or measurement taken at last follow up, were considered. In this study, BNP was significant in the univariate analysis for baseline and significant for last follow up in the multivariate analysis; NT-proBNP was not significant at baseline in the univariate analysis but significant for the multivariate analysis that included last follow up NT-proBNP level. The findings of this study41 would suggest that measures of either BNP or NT-proBNP may be independent predictors of mortality, but the sample size for this study was relatively small (n = 100).

Two other studies (based on three publications)112, 125, 128 from the same research team evaluated BNP and NT-proBNP. With respect to predicting event free survival at 1 year in an ambulatory group of patients, one study112 found both BNP and NT-proBNP were significant predictors at 1 year in univariate analysis. The subsequent multivariate analysis found four of the nine variables to be significant and included BNP, N-ANP, RAAS antagonists, and Living with Heart Failure questionnaire. From the same research group, but in a different study cohort, Berger et al.,125 using univariate analysis, found the log transformation of both B-type natriuretic peptides to be significant predictors of pump failure death; however, only the log of NT-proBNP was significant in multivariate analyses. A sub-analysis, which excluded patients with atrial fibrillation, showed that BNP was the best independent predictor of sudden death. Their findings suggest that the magnitude of the prognostic prediction is dependent on the specific mode of death and the specific form of the natriuretic peptide. A re-analysis of this same cohort of subjects was undertaken in a third study.128 The sample was classified according to severity levels based on LVEF and NYHA classification; a multivariate analysis was undertaken to see which factors predicted combined death and urgent heart transplant (for each of the 3 years of follow up). For Group A (NYHA I-II, LVEF ≤ 20 percent) log BNP was significant in both year 2 and 3. For Group B (NYHA I-II, LVEF < 20 percent, or, NYHA III-IV, LVEF ≤ 20 percent) only the log BNP was a significant predictor for year 1, but only log NT-proBNP was significant for years 2 and 3. For Group C (NYHA III-IV, LVEF < 20 percent) both log BNP and log NT-proBNP were significant for year 1; only log NT-proBNP was a significant predictor for years 2 and 3. These results would suggest that the strength of prediction for the B-type natriuretic peptides is also dependent on the year of follow up and less so on the severity of the HF.

Overall, of the six studies evaluating both BNP and NT-proBNP, only two studies found both BNP and NT-proBNP to be independent predictors of mortality. In one study128 year of follow up and group stratification, and in a second study41 the timing of the B-type natriuretic peptide measurement, similarly altered the predictive ability. No clear pattern emerges to suggest superiority of one type of B-type natiruetic peptide relative to the other in these head to head studies.

Comparison of Studies That Evaluated Baseline and Predischarge Measures.

All studies with the exception of seven publications36, 41, 107, 111, 118, 119, 124 evaluated only baseline BNP levels. One of these studies36 evaluated outpatients and measured BNP levels at two time points with an interval of 8 to 12 months. Another study evaluated patients BNP and NT-proBNP levels at successive follow up intervals over a 4-year period.41 Three107, 119, 124 of these studies had severe HF patients (NYHA III-IV) and approximately LVEF less than 40 percent; a fourth study111 had 88 percent of subjects at NYHA level III and IV (LVEF not reported). Logeart et al.,107 which had the largest sample size of all these seven studies, found the univariate HR for each 100 pg/mL increase of BNP to be slightly larger for predischarge BNP levels (HR = 1.22, 95 percent CI: 1.15 to 1.30, p = 0.0001) than baseline BNP levels (HR = 1.06, 95 percent CI: 1.03 to 1.10, p = 0.0001) for the composite outcome (death and readmission). In addition, they demonstrated a gradient of increasing risk from the first quartile (0 to 130) to the last quartile (660 to 1725) with the latter having the largest HR risk estimate (HR = 13.77). Similarly, the adjusted HR in this study showed increasing risk for poor outcome with increasing predischarge ranges, with the highest threshold (> 700 pg/mL) having a HR = 15.2, (95 percent CI: 8.5 to 27). Hamada et al.124 found predischarge BNP levels to be the only significant (p = 0.0086) predictor of re-hospitalization within 1 year in a multivariate analysis that included baseline admission BNP levels. Cheng et al.119 undertook only univariate logistic regression and found admission BNP and discharge BNP to both be significant for the composite endpoint and 30 day readmission; however, the small sample in this study size did not permit a true multivariate analysis. Given the likely strong correlation between admission and discharge BNP levels, it would be important to use multivariate analyses to adjust for strong correlations between these two measures of BNP. Similarly, Bettencourt et al.111 in a univariate analysis did not find median levels of either admission BNP (> 541 pg/mL) or discharge BNP (> 321 pg/mL) to be significant predictors of their composite endpoint (death or readmission).

Cheng et al.119 and a second study118 specifically recruited for new onset or first episode of HF. However, Tamura et al.118 measured BNP at discharge only and found the discharge log BNP to have the largest risk (HR = 2.656, p = 0.015) relative to the other variables significant in the model (NYHA class, LVEF, and left ventricular mass index) for predicting cardiac events.

A single study36 evaluated patients in an outpatient clinic (sample was 93 percent NYHA level I-II) and measured BNP levels at initial visit and second time-point (8 to12 months later), as well as the change in BNP levels (per 100 pg/mL) and any BNP level increase (versus decrease) during follow up. However, they did not include discharge BNP as a unique variable in subsequent univariate or multivariate models. In this study, the change in BNP (HR = 1.34, 95 percent CI: 1.10 to 1.63) and change in NYHA class (HR = 6.68, 95 percent CI: 2.23 to 19.12) were the only significant variables in the multivariate model.

Kaplan Meier Survival Analyses. Twenty-seven studies12, 23, 30, 32, 36, 41, 101, 103– 105, 108– 118, 120– 123, 129, 130 reported results from Kaplan Meier survival analyses using various cut points that were based primarily on median/mean or best values from ROC curves. All studies that undertook Kaplan Meier analysis, regardless of the outcome or cut point, found significant differences between the two groups.

Quality Assessment of Studies. For this research question, which evaluated prognosis, the eligible studies were based predominately on prospective cohort designs. As such, selection biases attributed to non-consecutive enrolment, and misclassification bias attributed to blinding, were chosen as the main criteria for methodological quality evaluation. Thirteen studies30, 104, 105, 107, 108, 112– 114, 118, 120, 121, 129, 130 selected patients in a consecutive manner. The remaining studies did not specify, and likely did not employ, this strategy to minimize bias as convenience samples were generally selected.

Attempts were made to evaluate the potential for misclassification bias through lack of blinding of clinicians who evaluated subjects or those who ascertained the endpoints. Blinding of the clinicians to the BNP level was undertaken in only four studies23, 36, 102, 115 and this minimized the potential for clinicians to systematically provide differential treatments or request additional tests. Blinding to NT-proBNP levels was not undertaken in any study; however, three studies113, 120, 193 indicated that the outcome was judged by researchers external to the clinical setting and had some potential to minimize ascertainment bias.

Prognosis Studies Using NT-proBNP Levels

Study Design and Sample Characteristics of Studies. For those studies evaluating NT-proBNP, four were RCTs41, 136, 138, 140 and the remaining 14 were prospective cohort studies.26, 35, 103, 112, 125, 126, 128, 131– 135, 137, 139 Six of these studies evaluated both NT-proBNP and BNP41, 103, 112, 125, 126, 128. Two publications were based on the same COPERNICUS study35, 140 and an additional two publications reported on the same study cohort125, 128 with different follow up periods.

Only four studies26, 135, 136, 139 recruited patients that were admitted to hospital for acute episodes. The remaining 14 studies35, 41, 103, 112, 125, 126, 128, 131– 134, 137, 138, 140 indicated that patients were recruited from primary care or specialty clinics and outpatient settings (see Tables 20 and 21).

Table 20. Summary of studies in patients with HF and mortality outcomes: NT-proBNP.

Table 20

Summary of studies in patients with HF and mortality outcomes: NT-proBNP.

Table 21. Summary of studies in patients with HF and mixed outcomes: NT-proBNP.

Table 21

Summary of studies in patients with HF and mixed outcomes: NT-proBNP.

Sample sizes ranged from a low of 48131 to a high of 2320 subjects.133 The mean sample size for all 18 studies was 378 (± 596) and the median was 121. The mean age of study participants clustered around 65 years but varied from 51 to 78 years; the widest age range spanned 64 years (40 to 104 years).133

Lengths of follow up varied greatly between the studies and the shortest time was 6 months139 while the longest time was 92 months126. The mean or median follow up time varied from 7 to 12 months,135, 136 13 to 24 months,103, 134 25 to 48 months,35, 128, 131, 194 and greater than 48 months.126 Several studies specified only endpoints of 6 months,139 12 months,112, 133, 137 17/18 months,26, 138 24 months,140 and 48 months41 (Tables 19 and 20).

HF Diagnosis and Severity. The diagnosis of HF was established in a number of ways, and these included echocardiography (carried out as part of the study or obtained from previous medical records) in six studies,35, 112, 125, 132, 133, 140 ventriculography in four studies,41, 134, 137, 138 clinical evaluation (signs and symptoms or NYHA classification) in six studies,103, 126, 131, 135, 136, 139 and other methods in one study.26 All studies used the NYHA classification, with the exception of one study135 which used the Killip method, and three studies35, 133, 140 which did not report any rating.

With respect to severity of HF, the majority of studies included subjects across all levels of the NYHA classification. The exception to this was one study139 that restricted subjects to levels III-IV. A single study112 enrolled subjects with predominately level II. Three studies35, 133, 140 did not specify the NYHA classification of their subjects. A single study135 used the Killup method of classification, and these patients varied from levels II to IV.

LVEF was not reported in four studies on admission.26, 135, 136, 139 Four studies35, 132, 137, 138 reported a threshold LVEF of less than 45 percent, five studies112, 125, 126, 128, 134 less than 35 percent, and five studies41, 103, 131, 133, 140 less than 25 percent.

NT-proBNP and Cut Points. NT-proBNP was measured in nine studies using the Elecsys method, six studies used the Biomedica method,41, 103, 112, 125, 126, 128 two used the Roche ELISA method, 135, 137 and one study used the Christchurch method138 (Tables 20 and 21). Six of these studies evaluated both NT-proBNP and BNP.41, 103, 112, 125, 126, 128

From the two publications based on the COPERNICUS study, one of the reports35 stated that a newly developed NT-proBNP Roche ELISA method was used and that samples would subsequently be retested using the Elecsys method. The re-analysis was published the same year but does not reference the previous report. Since this is the same cohort, we assume that the assay met our eligibility criteria but was not adequately reported.

Definition of Outcomes. Ten of these reports examined mortality as a discrete end point.35, 41, 125, 126, 131, 133, 134, 136, 138, 140 Nine studies103, 112, 128, 134– 137, 139, 140 reported composite end point of death or worsening HF. The remaining studies evaluated the ability to predict recommendation of heart transplantation,132 worsening HF alone,138 and in one case no estimate of risk was provided.26

Adjusted results — Multiple Regression Analysis. One26 study examined stratification with troponin level and presented no relative measure of risk such as a HR. However, it did make the limited statement that there were fewer events when NT-proBNP levels were below rather than above the median admission level (1357 pg/mL, p < 0.01).

Eleven studies undertook Cox proportional hazard regressions analyses and six studies undertook logistic regression analyses to evaluate the relationship between BNP levels and the relationship to the various outcomes. For the studies using Cox proportional hazard regression analyses, two studies112, 138 presented only multivariate estimates and the remaining studies undertook both univariate and multivariate computations. Six studies undertook multivariate logistic regression126, 132, 133, 135– 137 to evaluate the strength of the association between levels of BNP and the outcomes of interest. A single study26 did not report estimates of risk.

NT-proBNP Studies With Mortality Outcomes. Eleven studies35, 41, 112, 125, 126, 131, 133, 134, 136, 138, 140 evaluated mortality outcomes and these are detailed in Table 20. The results are expressed as both univariate and multivariate HR,35, 131, 140 chi square statistic,41, 112, 126, 134 OR,133, 134, 136or not specified.138 Some studies41, 126, 131, 133 reported the estimates of risk based on the log NT-proBNP levels. In general, all studies that undertook univariate or both univariate and multivariate analyses found NT-proBNP to be a significant predictor of mortality. One study135 found multivariate estimate on a subsample with HF to be significant (OR = 5.30, 95 percent CI: 1.4 to 168.9, p = 0.026) but the CI was wide. For those studies that presented adjusted HR, the risk estimates varied from HR = 2.17 (NT-proBNP > 1767 pg/mL) (95 percent CI: 1.33 to 3.54, p < 0.02)140 to HR = 9.35 (log NT-proBNP) (95 percent CI: 2.42 to 36.10, p = 0.001). These estimates encompass baseline, discharge and change of NT-proBNP levels.

NT-proBNP Studies With Composite Outcomes. Table 21 details the ten studies with the composite end points, which included death as a component of the outcome. The results are expressed as both univariate and multivariate HR,139, 140 chi square statistic,103, 112, 128, 134, 137 OR,134– 136 or not specified.195 One study128 reported the estimates of risk based on the log NT-proBNP levels.

In general, there were seven studies112, 128, 134– 137, 140 that found baseline, discharge or change levels of NT-proBNP levels to be significant predictors for composite outcomes after adjustment in multivariate models. Two studies103, 139 showed marginal or no statistical significance. All these regression models used a variety of variables within their models, which makes comparisons across studies challenging.

For those studies that presented adjusted risk estimates for NT-proBNP, the values varied from HR = 2.11 (95 percent CI: 1.54 to 2.90, p < 0.0001) for NT-proBNP greater than 1767 pg/mL,140 to HR = 5.96 (95 percent CI: 3.23 to 11.01) for change in NT-proBNP vs. decrease greater than 30 percent or increase greater than 30 percent.

Quality Assessment of Studies. As with the BNP studies, potential for selection bias (attributed to non-consecutive enrolment) and misclassification bias (attributed to blinding) were chosen as the main criteria for methodological quality evaluation. Eight studies26, 112, 132– 135, 137, 139 selected patients in a consecutive manner. The remaining studies did not specify and likely did not employ this strategy to minimize bias, as convenience samples were generally selected. Blinding of the clinicians or the investigators to the NT-proBNP levels with respect to the outcomes was undertaken in only three studies132, 136, 139 suggesting some potential for ascertainment bias for of the outcome.

Question 3b: What Are the Screening Performance Characteristics of BNP or NT-proBNP in General Asymptomatic Populations?

Definition of Screening

A screening test can be used in defined populations who need not believe that they are at risk of a disease or that they are already affected by it or by its complications. It may also be used in clinical practice in individuals who do not have established or overt disease, but who may have one or more risk factors for the disease. In this review, a screening test was defined as being used to detect preclinical cardiac dysfunction, systolic or diastolic, in general asymptomatic population.

Design and Sample Characteristics

There were eight studies5, 74, 122, 141– 145 identified in populations apparently without established or overt disease (or heart failure). Two studies had no sensitivity or specificity data.74, 122 Six studies had relevant data,5, 141– 145 two were cross-sectional141, 143 and four were prospective cohort studies.5, 142, 144, 145 The age of the population included in these six studies ranged between 40 and 84. All but one study145 evaluated BNP.

Study Outcomes

Although, these studies using BNP and NT-proBNP for screening focused primarily on LVD, there were differences in the specific outcomes with respect to the type and level of severity. One study5 evaluated preclinical ventricular dysfunction, both diastolic and systolic components, (EF at < 40 percent); similarly, another study141 evaluated left ventricular systolic dysfunction alone (EF at < 40 percent). One study143 evaluated asymptomatic systolic (EF < 55 percent) and diastolic dysfunction (diastolic dominant pulmonary vein flow with EF of > 55 percent).

Other studies broadened the types of dysfunction. One study142 evaluated cardiac dysfunction (defined as left ventricular systolic and diastolic, unknown LVD, and valvular dysfunction), while another study144 used three outcomes that included left ventricular mass, EF of less than 50 percent and moderate to severe LVSD (EF < 40 percent). The sole study145 evaluating NT-proBNP evaluated the outcomes of LVSD, mortality, chronic heart failure admissions, and other cardiac admissions.

Screening Properties

To ensure comparability of the test characteristics, the sensitivity, specificity, LR+ and LR-, and AUC are included (Table 22), either as listed in the original publications or calculated141, 142, 145 from the available data. Comments on the performance of the index test are based on a LR+ greater than 10 or LR- less than 0.1 and AUC greater than 0.8 providing convincing evidence for accuracy. Moderate to strong evidence for accuracy is provided by a LR+ greater than 5 or LR- less than 0.2 and AUC 0.70 to 0.80. Poor evidence for accuracy comes from LR+ less than 5, LR- greater than 0.2 and AUC less than 0.70.168

Table 22. Summary of studies evaluating BNP and NT-proBNP in the general population.

Table 22

Summary of studies evaluating BNP and NT-proBNP in the general population.

The study by Redfield et al.5 showed the AUC for detection of LVEF less than 50 percent or mild diastolic dysfunction were consistently less than 0.70, so the authors then confined the use of BNP to detecting an EF less than 40 percent or moderate to severe diastolic dysfunction. This study explored a variety of factors with respect to screening properties including systolic versus diastolic dysfunction within these categories: age over 65 years, gender, and high-risk groups. For the diastolic category alone, the subgroup of moderate to severe dysfunction had some or all diagnostic properties assessed. The prevalence rates varied significantly within these sub groupings, with the lowest values in the general population and women with systolic dysfunction, and the greatest rates for the population with diastolic dysfunction of moderate to severe levels (Table 22). The corresponding diagnostic estimates of accuracy reflected moderate strength at best for LR+ (7.4) for two of the thirty subgroups they evaluated; similarly, the LR- reflected poor accuracy. With the exception of systolic dysfunction in high-risk men, the AUC was generally in the moderate range. In general, within the systolic and diastolic groups there were no differences due to gender and age. Overall this would suggest poor detection characteristics for BNP as a screening test. The low prevalence of preclinical systolic dysfunction and the observed specificity would result in a large number of screened people requiring an echocardiogram and nearly all of them would be negative. Using a higher cut point, for example one based on the upper normal value, would result in fewer confirmatory echocardiograms but would miss 30 percent or more of people with preclinical systolic dysfunction.

These results are similar to those of Vasan et al.144 which showed the test characteristics of BNP are limited at each of the three discriminatory levels; this study only reported AUC as a metric of accuracy. Overall, the AUC for BNP was less than 0.75 for elevated LSVD and left ventricular mass in both genders. This estimate exceeded 0.80 in high-risk women, but this was based on a very small sample size and the confidence interval was wide suggesting caution in interpretation. When maximizing the sums of sensitivity and specificity the cut point for women is BNP 27 pg/mL for LV mass greater than 90th percentile, it is 34 pg/mL for any LVSD and is also 34 pg/mL for moderate to severe LVSD (< 40 percent). For LV mass this gives a sensitivity of 26 percent, specificity 86 percent, LR+ 1.82, LR- 0.86. For any LVSD it gives a sensitivity of 26 percent, specificity 89 percent, LR+ 2.49, LR- 0.82, and for moderate to severe LVSD (< 40 percent) the sensitivity is 80 percent, specificity 90 percent, LR+ 7.67, LR- 0.22 (Table 22). The performance of the test improved in women but not in men when select high-risk subgroups were targeted. Discriminatory limits that were based on a high specificity (95 percent) give better positive predictive values and LR, yet identified less than one third of the participants who had elevated LV mass or LVSD. Both these studies5, 144 highlight the need for different BNP levels for women.

In the study by Hedberg et al.141 a BNP greater than 28 pg/mL gave the highest specificity at a sensitivity greater than 95 percent in detecting LVSD, but the highest accuracy was found with a concentration greater than 73 pg/mL in a random sample of subjects 75 years of age. Both of these values produce negative predictive values of 98 to 99 percent but the routine predictive value for BNP greater than 28 pg/mL is 13 percent (95 percent CI: 9 to 19) and for BNP greater than 73 pg/mL it is 34 percent (95 percent CI: 24 to 47), and the latter cut point will miss more individuals with preclinical systolic dysfunction. The combination of ECG and BNP greater than 28 pg/mL and BNP less than 28 pg/mL found LVSD in less than 1 percent of the study population irrespective of the BNP concentrations, leading to the conclusion that the BNP had screening value in addition to the ECG, but only in those with abnormal ECG's. Overall, the AUC for either cut point of greater than 731 or greater than 28 show the same AUC (0.88).

The screening test characteristics for BNP for systolic and/or diastolic dysfunction are also poor in one study142 in which a BNP value of 20 pg/mL was pre-selected. The overall negative predictive value was 57 percent (95 percent CI: 48 to 65) and the accuracy of BNP did not change with higher cut points but these produced decreasing sensitivity and negative predictive value. The AUC were not estimated, but the LRs indicate poor evidence for accuracy. Another study143 of patients with stable CAD and a pre-selected BNP cut point of 100 pg/mL had all of the participants undergo an extensive examination and an exercise stress treadmill test to ensure no overt symptoms of HF. The test characteristics for BNP were poor for systolic dysfunction with a 38 percent sensitivity, 80 percent specificity, LR+ 1.9 (95 percent CI: 1.2 to 2.9), LR- 0.8 (95 percent CI: 0.60 to 1.00) and AUC 0.59 (95 percent CI: 0.49 to 0.69). They were also poor for diastolic dysfunction with a 55 percent sensitivity, 85 percent specificity, LR+ 3.8 (95 percent CI: 2.4 to 5.9), LR- 0.8 (95 percent CI: 0.4 to 0.8 and AUC 0.79 (95 percent CI: 0.71 to 0.87).

Only one study145 used NT-proBNP as the index test and accuracy was evaluated with respect to LVEF levels, European Society Cardiology (ESC) criteria for HF and further stratified by age over 70 years, and high risk medical history. The subgroup classified by ESC criteria, LVEF less than 40 percent, and age over 70 years, showed very strong evidence for accuracy (LR+ 10.71, LR- 0.10, AUC 94); values for a similar group (without the age restriction) showed moderately high values (Table 22). This suggests that for these groups there is some potential benefit for screening for LVSD. Using the Cox proportional hazard model, Log NT-proBNP (HR = 5.70), and male gender (HR = 3.10) were shown to be significant independent predictors of mortality in patients that were followed up for a median of 805 days. Log NT-proBNP (HR = 13.83), male gender (HR = 2.71) and dyspnea (HR = 1.45) were significant independent predictors of admission for heart failure. Finally, log NT-proBNP (HR = 3.69), abnormal ECG (HR = 2.56) and history of ischemic heart disease (HR = 1.9) were independent predictors of other cardiac admissions, eliminating LVEF from all prognostic models.

Quality Assessment of Studies

BNP was the index test in five5, 141– 144 of the accepted studies and NT-proBNP in the sixth one.145 In all six echocardiography was a reference test. The subjects were either randomly selected from the community,5, 141 were part of another prospective community cohort,144, 145 or cross-sectional study.141, 143 In the selected studies subjects may have had risk factors for HF, such as stable CAD, but none had overt or symptomatic HF. The patient samples were consecutively or randomly selected, the index and reference tests were clearly described, the index test was not available to those making the clinical diagnosis, the study populations were not classified by disease state, and appropriate descriptions were given as to the steps taken to ensure that the subjects did not have overt cardiac dysfunction.

Question 4: Can BNP or NT-proBNP Measurement Be Used To Monitor Response to Therapy?

Sample and Design Characteristics

There were 18 studies meeting the eligibility criteria to be included in this section.31, 37– 47, 110, 146– 150 In brief, the included studies enrolled chronic HF patients with at least three B-type natriuretic peptide measurements over time. The LVEF was reported as less than 25 percent,41, 150 ≤ 35 percent,149 ≤ 40 percent,39, 42, 43, 43– 47, 110, 147, 148 less than 45 percent,37, 38 less than 50 percent;31 it was not reported in two studies.40, 146 A total of nine of these papers reported the change in BNP or NT-proBNP and related the change to other outcomes including cardiac function, exercise capacity, symptoms or clinical events.31, 37, 38, 41, 44, 110, 148– 150 The other nine studies reported changes in BNP or NT-proBNP and also may have reported the changes in other variables; however, there was no determination in these studies of the relationship between change in the B-type natriuretic peptide and change in these other variables.39, 43, 45– 47, 146, 147, 196, 197 Five39, 43, 45, 46, 110 of the 18 papers that reported findings, examined in different ways, data from a recently published large randomized clinical trial (Val-HeFT). Two other studies also used the same database147, 148, but one of these was a comparison of two different NT-proBNP methods.147 Although all these studies described collection of at least three B-type natriuretic peptide measurements, there were only ten studies that provided values for each of the time points.31, 37– 41, 44, 146, 147, 149

Response to Therapy

In the studies where change in BNP or NT-proBNP was related to other clinical findings, it was found that the B-type natriuretic peptide was related to at least one other variable in seven of the nine studies (Table 23). A study by Murdoch et al. was the first to evaluate whether plasma BNP would be useful to tailor therapy in patients with chronic stable HF (BNP guided group).149 They randomized 20 patients to receive optimization ACE inhibitor therapy based on serial plasma BNP measures or clinical assessment with up-titration of ACE inhibitor to the target levels used in clinical trials, over the 8 week course of the study. They found only the BNP driven approach was associated with greater reductions in plasma BNP concentration throughout the course of the study and that there was a significantly greater suppression when compared to the clinical assessment group after 4 weeks of therapy (p = 0.03), but not at 8 weeks (p = 0.73) (Figure 7D). Although there was a decrease in BNP observed, there were no significant changes observed in haemodynamics within either group; however, heart rate was significantly different between groups (p = 0.02).

Table 23. Evidence table for studies using BNP or NT-proBNP to monitor treatment.

Table 23

Evidence table for studies using BNP or NT-proBNP to monitor treatment.

Figure 7. Change in BNP or NT-proBNP concentration after treatments.

Figure

Figure 7. Change in BNP or NT-proBNP concentration after treatments.

Troughton et al. examined whether titration of treatment to reduce plasma NT-proBNP concentrations in systolic HF patients (NT-proBNP guided group) would be superior to clinically based treatment.148 There were 69 NYHA class II-IV HF patients with LVEF less than 40 percent recruited into the study, with a median follow up of 9.5 months. Although the mean NT-proBNP concentrations decreased to 668 pg/mL below baseline in the NT-proBNP guided group, compared with 25 pg/mL in the clinical group, this difference was not significant (p = 0.16) (Figure 7D). The BNP guided group had fewer cardiovascular events (death, hospital admission, or HF decompensation) compared to the clinical group (19 versus 54; p = 0.02). Changes in left ventricular function, quality-of-life score, 6 minute walk test, blood pressure, renal function, and adverse events were similar in both groups.

Three additional studies have demonstrated that changes in BNP or NT-proBNP concentrations relate to changes in mortality and morbidity.37, 41, 110 Two of these three papers had study sizes of less than 100 patients.37, 41 The third study110 was part of a large clinical trial and the BNP was measured at baseline in 4305 NYHA class II-IV HF patients (Val-HeFT). Follow up measurements were made at 4, 12, and 24 months after randomization. They found the baseline BNP predicted all-cause mortality and first morbid events. The study results demonstrated at study end that the group taking valsartan (the study drug) had a decline in BNP (decreased by 21 ± 5 pg/mL) compared to the placebo group (increased by 23 ±5 pg/mL). Patients with the greatest percent decrease in BNP from baseline to 4 months had the lowest, whereas patients with the greatest percent increase in BNP had the highest all-cause mortality and first morbid events.

In six of the papers, change in NT-proBNP or BNP was related to change in cardiac function, functional capacity or quality-of-life.31, 37, 38, 44, 148, 149 In three of these papers, despite changes in BNP or NT-proBNP, there was no relationship to changes in these other variables.37, 148, 149 The two exceptions were blood pressure (p = 0.015)148 and heart rate (p = 0.02).149 One of these studies demonstrated changes in left ventricular end diastolic dimension and end systolic dimension, but no change in BNP concentrations.44 This study also found a significant difference in BNP change in patients with HF of non-ischemic etiology in both early and late phases (p < 0.05), but not in those of ischemic etiology. In this study all the patients were receiving beta blocker therapy.

There were two studies31, 38 demonstrating that the changes in BNP or NT-proBNP were related to changes in cardiac function. Patients with ischemic heart disease treated with metoprolol showed significant differences at 12 weeks and 1 year from baseline for LVEF (32 percent and 38 percent, respectively, p < 0.01), symptom questionnaire score (3.9 and 3.6, p < 0.01) and 6 minute walk test (1310 and 1269 feet respectively, p < 0.05).38 In the other study, patients with idiopathic dilated cardiomyopathy treated with carvedilol demonstrated significant differences in all parameters measured.31 These parameters included NYHA (r = 0.50, p < 0.0001), systolic blood pressure (r = 0.31, p = 0.014), heart rate (r = 0.43, p = 0.0007), LVEDD (r = 0.84, p < 0.0001), LVESD (r = 0.84, p < 0.0001), LVEF (r = -0.60, p < 0.0001), and LV mass index (r = 0.66, p < 0.0001).

There were nine papers that examined the response of BNP or NT-proBNP to different types of HF therapy.39, 40, 42, 43, 45– 47, 146, 150 In four of these studies BNP or NT-proBNP changes were related to changes in HF therapy.40, 42, 146, 150 These studies demonstrated that the B-type natriuretic peptide concentration varied in response to the intensity of drug therapy40, 42, 146, 150 or the use of various types of left ventricular assist devices.150 The other studies 39, 45– 47 demonstrated that HF patients receiving active therapy had a greater reduction in B-type natriuretic peptide concentration than those not taking the therapy. However, none of these studies examined whether BNP or NT-proBNP were related to change in drug dose.

Figure 7 is a compilation of all studies with abstractable data showing percent change from baseline in BNP31, 37, 39, 40, 44, 149 or NT-proBNP38, 41, 147 concentration with time and drug therapy. There were four studies, which treated patients with beta blockers,31, 38, 41, 44 and all showed a decrease in BNP or NT-proBNP with time. The metoprolol and carvedilol treated patients31, 38, 44 were very similar in their changes to BNP or NT-proBNP levels over time in contrast to the atenolol treated patients.41 The one study with the antiarrhythmic amiodarone showed a rate of change similar to the beta blocker group.37 Valsartan treated patients had lower BNP values from baseline at 4 months after treatment but exhibited a slight increase over time.39 The ACE inhibitor enalapril showed a greater rate of change and greatest overall change from baseline compared to all treatments.40 The high dose treatment group showed a similar change in BNP as those in the low dose treatment group up to 12 weeks but there was a large departure at 24 weeks (75 percent versus 54 percent, respectively). Of these nine studies, only four included a placebo group.31, 37, 39, 41 In two of these studies, the placebo groups37, 41 showed no significant change from baseline at any time point.

In the valsartan therapy study,39 all time points levels in the placebo group were higher than the treated group and increased with time. The carvedilol therapy study with a placebo group31 provided data at baseline and end of study only. Both the placebo and carvedilol treated groups had significantly lower BNP levels at six months compared to baseline and were not significantly different from each other over time (p = 0.18).

There were two studies which assessed change in BNP149 and NT-proBNP147 in patients guided by the B-type natriuretic peptide level compared to patients following a clinically driven protocol (Table 23). Both studies demonstrated that the B-type natriuretic peptide guided therapy groups sustained faster and lower concentrations. The study that did not use any beta blocker therapy showed the smallest change overall, particularly in the clinically guided treatment group.147 Overall, Figure 7 shows that drug treatment decreases B-type natriuretic peptide levels in a time-dependent mode indicating that BNP or NT-proBNP might be reasonably good markers for monitoring the effect of treatment of chronic HF patients.

Quality Assessment of Studies

Of the 18 studies, 12 were RCTs. The quality of these 12 studies were evaluated using the Jadad scale167. Two studies147, 148 scored 4, one study41 scored 3. The remaining studies39, 40, 43– 47, 110, 149 scored less than 3, indicating poor quality.

Views

  • PubReader
  • Print View
  • Cite this Page

Recent Activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...