Skip to main content
Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Stat Med. Author manuscript; available in PMC 2016 May 10.
Published in final edited form as:
Stat Med. 2015 May 10; 34(10): 1747–1760.
Published online 2015 Jan 28. doi: 10.1002/sim.6436
PMCID: PMC4390486
NIHMSID: NIHMS662830
PMID: 25628249

Surrogates of Protection in Repeated Low-Dose Challenge Experiments

Abstract

A critical step toward developing a successful vaccine to control the human immunodeficiency virus (HIV) pandemic entails evaluation of vaccine candidates in non-human primates (NHPs). Historically, these studies have usually entailed challenges (i.e., exposures) with very high doses of a simian version of HIV, resulting in infection of all NHPs in the experiment after a single challenge. More recently, researchers have begun to conduct repeated low-dose challenge (RLC) studies in NHPs that are believed to more closely mimic typical exposure in natural human transmission settings. One objective of RLC studies is to assess whether measured immune responses to vaccination can serve as surrogate endpoints for the primary endpoint of interest, namely infection. In this paper, different designs of RLC studies for assessing a binary surrogate of protection are considered.

Keywords: Causal Inference, HIV, Immunogenicity, Infectious Diseases, Transmission Probability, Vaccine

1. Introduction

Experiments in non-human primates (NHPs) play an essential role in the development, screening, and evaluation of preventive human immunodeficiency virus (HIV) vaccines [1, 2]. Historically, these studies have usually entailed challenges (exposures) with very high doses of a hybrid simian HIV, resulting in infection of all NHPs in the experiment. The risk of simian HIV transmission per exposure in such studies is much higher than in human-to-human HIV transmission, so that the failure of a vaccine candidate to protect from infection in this setting is not necessarily indicative of a vaccine without utility in humans [3]. Thus researchers have recently begun to conduct repeated low-dose challenge (RLC) studies in NHPs (e.g., [48]) that may more closely mimic typical exposure in natural human transmission settings. It has been shown that these experiments can be adequately powered to detect vaccine efficacy against infection with a feasible numbers of NHPs [911]. Because investigators control exposure, RLC experiments can assess vaccine effects of interest that may be difficult to observe in typical human efficacy trials. For example, infection status can be determined after each exposure via viral detection assays, facilitating estimation of the per-exposure or “per-contact” effect of vaccination. Likewise, challenge studies allow precise characterization of immunological and virological parameters very early after infection, which are practically impossible to evaluate in humans (except possibly in neonates).

In addition to assessing vaccine effects on infection and post-infection endpoints, evaluation of immunological surrogates of protection (SoPs) are an important objective of NHP challenge experiments, providing points of reference to judge new vaccine candidates and for retrospective analyses of candidates evaluated previously in NHP experiments [9]. In general, a SoP is defined to be an immunological variable S (e.g., a T-cell or antibody measurement) such that a vaccine effect on S is predictive of a vaccine effect on the risk of infection or disease [12]. The utility of such a SoP includes guiding vaccine development, providing guidance for regulatory and immunization policy decisions, and bridging efficacy of a vaccine observed in a trial to a new setting. For RLC studies, knowledge of an immunological surrogate can inform comparisons of vaccine candidates in NHPs and support predictions of vaccine efficacy in humans.

Despite the importance of finding immune SoPs, methods for their quantitative assessment are quite limited. Moreover, there exists considerable confusion about what constitutes an immune correlate or surrogate of protection and how it should be appropriately evaluated. Recently Qin et al. [13] and Gilbert et al. [14] proposed a hierarchical three-tier framework for evaluating immune correlates: correlate of risk, specific SoP, and general SoP. A correlate of risk is an immunological measurement that correlates with the risk of a clinical endpoint (such as infection or disease) in a defined population. A specific SoP is a correlate of risk that is predictive of vaccine efficacy in a particular setting. A general SoP is a specific SoP that is also predictive of vaccine efficacy across different settings (e.g., across populations or across vaccine formulations). Meta-analysis of multiple vaccine studies is required for evaluating a general SoP whereas one study may be sufficient for evaluating a specific SoP. In this paper attention is restricted to the evaluation of a specific SoP (hereafter simply referred to as a ‘SoP’) from a single RLC study of a candidate HIV vaccine.

Traditionally, identification of potential SoPs has relied on solely assessing whether a immune response was a correlate of risk, i.e., associated with risk of infection or disease. For example, in the first phase III trial of an HIV vaccine, a significant negative association was found between risk of HIV infection and antibody response to the vaccine [15]. Unfortunately, this association-based analysis provides no information to distinguish between two possibilities: (i) a greater vaccine effect on the immune response predicted a greater vaccine effect on infection risk, or (ii) the immune response simply marked an innate ability to escape infection but did not predict vaccine efficacy. In other words, it was difficult based on the analysis results to conclude whether antibody response to the vaccine was a SoP or just a correlate of risk. A similar example is given by Ellenberger et al. [6], who found an association between vaccine induced ELISpot Gag responses and risk of simian HIV infection in a RLC study of a candidate HIV vaccine.

Recently, novel experimental designs and corresponding statistical methodology have been proposed for evaluating potential SoPs in the context of human efficacy trials [12, 16]. The central premise behind these designs is to attempt to infer the immune response control NHPs would have had if (counter to fact) they had been vaccinated. The first design, referred to as the baseline immunogenicity predictor (BIP) design, entails measuring a baseline covariate(s) W that is correlated with the immune response that NHPs would have to the HIV vaccine being evaluated. For example, W might be an immune response to a non-HIV vaccine. The missing HIV vaccine immune response for NHPs in the control arm can then be predicted or inferred (perhaps implicitly) from their W and a prediction model based on observed data from the vaccine group. In turn, the association between the vaccine effect on the immune endpoint and the vaccine effect on the infection endpoint can be assessed.

The second study design proposed for evaluation of SoPs in human efficacy trials is the close-out placebo vaccination design, where placebo recipients who are uninfected at the end of the trial are administered the HIV vaccine and their immune response is measured. In the RLC study setting, this design may not be feasible since most, if not all, control NHPs are often infected after repeated challenges, e.g., see [6, 7]. Therefore, in this paper we propose a modification of close-out placebo vaccination wherein after each challenge uninfected control NHPs are randomly assigned to receive vaccine according to a pre-specified crossover probability. This will be referred to as the crossover vaccination (CrV) design.

Simulation studies of large (e.g., Phase III) randomized human efficacy trials have demonstrated that the additional information provided by employing a BIP or close-out placebo vaccination design can enable assessing the extent to which a correlate of risk is a SoP [12,16]. Similarly, [10] demonstrated the BIP design also holds promise in identifying SoPs in the RLC study setting; however, [10] did not consider a close-out placebo or CrV design. The feasibility of the BIP design relies on the existence of a baseline covariate W that is correlated with the HIV vaccine immune response. Whether such a covariate will be available may not be evident a priori, especially when evaluating new vaccine candidates. The CrV design, on the other hand, does not require a BIP. In order to further compare these two designs, in this paper methods are proposed for assessing potential binary immunological SoPs using either the BIP or CrV design. The operating characteristics of the different designs are then evaluated by simulating RLC studies.

The outline of the rest of this paper is as follows. In Section 2 the methods employed for analysis of RLC studies to determine SoPs using the BIP and CrV design are presented. The finite sample performance of the BIP and CrV designs under various scenarios are assessed in a simulation study in Section 3. The different approaches are applied to a recent RLC study in Section 4. A brief discussion is given in Section 5.

2. Methods

2.1. Notation, Assumptions, and Estimands

Following [10], consider the usual two-arm RLC study design that entails the following three sequential steps: (i) uninfected NHPs are randomized to receive vaccine or control, (ii) immune responses are measured, and (iii) low-dose simian HIV challenges are repeatedly administered until infection or some maximum number of challenges is reached. Let Zi be the treatment assignment NHP i was randomized to, where Zi = 0 for control and Zi = 1 for vaccine. Using the Neyman potential outcome model, let Si(z) denote the potential HIV-specific immune response when assigned treatment z for z = 0, 1, where Si(z) = 0 indicates the absence of the immune response and Si(z) = 1 indicates the presence of the immune response. Note the immune response is being measured in step (ii), among uninfected NHPs that have yet to challenged with simian HIV. Therefore we assume Si(0) = 0 for all i because vaccine antigens (absent in the control) are necessary to induce an HIV-specific immune response in an uninfected, unexposed NHP. Let Ti(z) be the potential survival time under assignment z = 0, 1. Assume the survival time is measured by the number of exposures (i.e., challenges) until infection with simian HIV; thus Ti(z) is a positive integer.

The models and likelihoods developed below rely on several key assumptions. First, assume no interference between NHPs and no variations in the type of vaccine such that the stable unit treatment value assumption holds [17]. The lack of interference between NHPs should hold in this setting since investigators can prevent transmission of simian HIV between NHPs. Second, assume ignorable treatment assignment, i.e., Zi ⊥ (Si(0), Si(1), Ti(0), Ti(1)), which is insured by the use of randomization in assigning NHPs to receive vaccine or control. Third, assume the per-challenge probability of infection is independent of the prior number of challenges.

Let p(Zi, Si(1)) denote the probability of infection from a single exposure (i.e., the transmission probability) under assignment to treatment Zi for NHPs with immune response Si(1) when vaccinated, i.e., p(Zi, Si(1)) = Pr[Ti(Zi) = 1|Si(1)]. Following [10, 16], consider the following probit model of the transmission probability

p(ZiSi(1); β) = Φ{β1β2Ziβ3Si(1) + β4ZSi(1)}, 
(1)

where Φ is the standard normal CDF and β = (β1, β2, β3, β4). For NHPs for which the vaccine has no effect on the immune response, i.e., Si(1) = 0, the value of β1 determines the per-challenge risk of infection when not vaccinated and the parameter β2 describes the vaccine effect on the per-challenge risk of infection. The parameter β3 corresponds to differences in the per-challenge infection risk when not vaccinated between NHPs with an immune response to vaccine (i.e., Si(1) = 1) and NHPs without an immune response to vaccine (i.e., Si(1) = 0). A key parameter of model (1) is β4 where β4 ≠ 0 indicates that the vaccine effect on the per-challenge probability of infection differs between NHPs with Si(1) = 1 and NHPs with Si(1) = 0. The parameter β4 is particularly useful in describing the surrogate value of Si(1) as discussed below.

Consider the principal stratification [18] of NHPs according to the potential immune response if vaccinated Si(1). Define the average causal effect of the vaccine on survival (i.e., infection) within principal stratum Si(1) = s to be h(E{Ti(0)|Si(1) = s}, E{Ti(1)|Si(1) = s}) for s = 0, 1, where h is some contrast function such that h(x, y) = 0 iff x = y, and E{Ti(z)|Si(1) = s} is the expected time to infection under treatment z in principal stratum Si(1) = s. Assuming the transmission probability is independent of the previous number of challenges, Ti(z)|Si(1) = s is a geometric random variable with mean E{Ti(z)|Si(1) = s} = 1/p(z, s; β). Following [10], choosing h1(x, y) = Φ−1(1/x) − Φ−1(1/y) yields a simple expression for the principal stratum causal effect, namely −β2β4s. Alternatively, for contrast function h2(x, y) = 1/x − 1/y the principal stratum causal effect equals the difference in risk of infection per challenge, which under model (1) is given by

p(0, sβ) - p(1, sβ) = Φ{β1β3s} - Φ{β1β2 + (β3β4)s}.

Another possible contrast function h3(x, y) = xy yields the difference in expected survival times within principal strata Si(1) = s, which under model (1) equals

E{Ti(0)∣Si(1) = s} - E{Ti(1)∣Si(1) = s} = Φ{β1+β3s}-1 - Φ{β1+β2+(β3+β4)s}-1.

The proportion associative effect (PAE) statistic has been suggested [10, 12] as a summary measure of the surrogate value of Si. PAE describes the proportion of the total effect of treatment (e.g., vaccine) represented by the expected associative effect (EAE) compared to the expected dissociative effect (EDE). Specifically, if EAE is the vaccine effect on infection within the stratum where Si(1) = 1 and EDE is the vaccine effect on infection in the stratum where Si(1) = 0, then PAE = |EAE|/(|EAE| + |EDE|). For h1 defined above EAE = −β2β4 and EDE = −β2, implying

PAE = ∣β2β4∣/{∣β2β4∣ + ∣β2∣}, 
(2)

where in general we adopt the convention |0|/(|0| + |0|) = 0.5, which for (2) implies PAE = 0.5 if β2 = β4 = 0. For contrast function h2 the expression for PAE has a slightly more complicated form

PAE=Φ{β1+β3}-Φ{β1+β2+β3+β4}Φ{β1+β3}-Φ{β1+β2+β3+β4}+Φ{β1}-Φ{β1+β2}
(3)

The PAE for h3 can be expressed similarly. Note the dose in RLC is typically selected by investigators such that p(Zi, Si(1)) ∈ [0.1, 0.5], suggesting for z, s = 0, 1 that (β1 + β2z + β3s + β4zs) ∈ [Φ−1(0.1), 0]. Over this domain Φ(·) is approximately linear (by Taylor series expansion Φ(x)1/2+x/2π, suggesting that (2) and (3) will tend to be similar.

Regardless of the choice of contrast function, PAE = 0.5 if and only if EAE and EDE are equal in absolute value, i.e., the magnitude of the vaccine effect on time until infection T is the same for NHPs who would have an immune response to the vaccine compared to those who would not have an immune response. In this case, S has no surrogate value. On the other hand, PAE = 1 if and only if EDE = 0, indicating there is a vaccine effect on infection if and only if there is a vaccine effect on S; in this case S would be a perfect surrogate for T. Similarly, PAE = 0 if and only if EAE = 0, indicating there is a vaccine effect on infection if and only if there is no vaccine effect on S; in this case S would be a perfect surrogate for T as well, although in most contexts this scenario might be considered unrealistic. Biomarkers with some surrogate value will have PAE ≠ 0.5, with the surrogate value increasing as PAE nears 0 or 1.

For simplicity in the sequel we will focus on (2); similar although slightly more complicated considerations apply if a contrast function other than h1 is chosen by the analyst. Note for (2) that β4 = 0 if and only if PAE = 0.5. Therefore we will focus on testing the null H0 : β4 = 0 versus the alternative H0 : β4 ≠ 0, or equivalently H0 : PAE = 0.5 versus HA : PAE ≠ 0.5. The likelihood ratio test can be employed to test H0 versus HA based on the likelihoods described below.

2.2. Standard RLC Design

The likelihood for the standard RLC design can be constructed as follows. For NHP i, let Ti = min{Ti(0)(1 − Zi) + Ti(1)Zi, cmax} = min{Ti(Zi), cmax} denote the observed number of exposures during the experiment where cmax denotes the right censoring time, i.e., the maximum allowable number of exposures. In general cmax may differ from NHP to NHP, but here for simplicity it is assumed cmax is the same for all NHPs. Assume cmax is a known constant and that cmax > 1, i.e., each NHP is challenged more than once provided infection does not occur after the first challenge. Let δi equal 1 (0) if NHP i is infected (uninfected) by the end of the study, i.e., δi = I(Ti(Zi) ≤ cmax) where I(·) is the usual indicator function. Let Si = Si(0)(1 − Zi) + Si(1)Zi denote the observed immune response. Suppose n iid copies of (Zi, Ti, δi, Si) are observed. Letting G denote the marginal distribution function of Si(1), the likelihood is proportional to

Lrlc(β,G)=i=1n{φ(1,Si,Ti,δi;β)dG(Si)}Zi{φ(0,s,Ti,δi;β)dG(s)}1-Zi,
(4)

where φ(Z, S, T, δ; β) = {1 − p(Z, S; β)}Tδp(Z, S; β)δ. Because S is assumed binary in this paper, the integral in (4) could be written instead as a sum over s = 0, 1; a similar comment applies to the integrals given below as well. The more general form as in (4) is applicable for discrete or continuous S.

Note, because (1) is saturated, the likelihood (4) could equivalently be parameterized directly in terms of the geometric success probability parameters p(z, s) instead of β. For the vaccine arm, it is straightforward to show that the maximum likelihood estimate of p(1, s) is p̂(1, s) = Σi:Zi=1,Si=sδii:Zi=1, Si=sTi, i.e., the total number of infections divided by the total number of challenges in vaccinated NHPs with immune response s. Thus maximizing the likelihood is more straightforward using this parameterization, although the maximum likelihood estimator of p(0, s) does not appear to have a closed form. The β parameterization has the advantage of summarizing the surrogate value of Si(1) with a single parameter, namely β4.

In the standard RLC study design, Si(1) is only observed when Zi = 1. Thus, intuitively, one might not expect β to be fully identifiable without additional information. To the contrary, under the assumptions stated above, β is in fact identifiable (provided Pr[Si(1) = 0] ≠ Pr[Si(1) = 1]) provided cmax > 2. A proof is given in the Appendix. We note the proof does not require that (1) be a probit model; any invertible function from the real numbers ℝ to [0, 1] could be used in place of Φ and identifiability would still hold. The proof in the Appendix also shows that β is not identifiable for a single dose challenge design (or for a RLC with at most two challenges cmax = 2). Identifiability of β for the RLC design in turn implies identifiability of PAE. This has two important implications. First, the surprising result that β and PAE are identifiable explains why in the simulation study presented below that the standard RLC study design performs relatively well compared to the BIP and CrV designs. Second, that β and PAE are identifiable suggests that the surrogate value of a binary S may be assessed using standard RLC study designs, i.e., that the BIP and CrV designs are not necessary (although they may improve power to detect surrogate value).

Intuition regarding why β is identifiable for the RLC design but not the single dose challenge design can be gained by considering two coins, one red and one blue. Suppose the probability of landing heads upon a single toss is p(0, 0; β) = Φ(β1) for the red coin and p(0, 1; β) = Φ(β1 + β3) for the blue coin. Consider the following simple experiment. One of the coins is selected randomly, with 2/3 probability of selecting the red coin (and thus 1/3 probability of selecting the blue coin). The selected coin is repeatedly tossed until the coin lands on heads or 10 tosses occur, which ever happens first. The number of tosses is recorded; the color of the selected coin is not recorded. The process of randomly selecting one of the coins and repeatedly tossing until heads or 10 tosses occur is then repeated a large number of times; each time only the number of tosses is recorded. Now consider the scenario where the two coins have different probabilities of landing on heads; moreover, suppose the red coin is very likely to land on heads (p(0, 0; β) ≈ 1) and the blue coin is very unlikely to land on heads (p(0, 1; β) ≈ 0). Then the empirical distribution of the number of tosses until heads or 10 tosses will be bimodal, with 2/3 of the mass near 1 (corresponding to the red coin with p(0, 0; β) ≈ 1) and the other 1/3 of the mass near 10 (corresponding to the blue coin with p(0, 1; β) ≈ 0). On the other hand, consider the scenario where the two coins have the same probability of landing on heads and p(0, 0; β) = p(0, 1; β) ≈ 1/2. In that case, the empirical distribution of the number of tosses will have a unimodal distribution with most of the mass near 1. Thus despite the recorded data not including information about the color of the coin being flipped during each iteration of the experiment, it is possible to distinguish scenarios where the two probabilities are different versus where they are the same. The two coins are analogous to the two strata of unvaccinated NHPs defined by Si(1) = 0 and Si(1) = 1; although we cannot determine strata membership on the basis of the observed data, the RLC experiment affords data that do provide information about the strata specific probabilities of infection per challenge. By the same token, if we imagine repeating the coin experiment described above, but are now only permitted to flip each selected coin a single time, clearly there is no hope of assessing from the recorded data alone whether or not the two coins have different probabilities of landing on heads. Likewise, in a single dose challenge study the data provide no information about the relative probability infection per challenge for the two strata of NHPs.

2.3. Baseline Immunogenicity Predictor Design

Let Wi denote a baseline covariate that might be correlated with Si(1) and now suppose n iid copies of (Zi, Ti, δi, Si, Wi) are observed. Letting Gw denote the conditional distribution of Si(1) given Wi, the likelihood is proportional to

Lbip(β,Gw)=i=1n{φ(1,Si,Ti,δi;β)dGw(SiWi)}Zi{φ(0,s,Ti,δi;β)dGw(sWi)}1-Zi.
(5)

Note if Si(1) is assumed independent of Wi, then Gw = G and (4) and (5) are equal. Also note the likelihood (5) relies on the assumption that the per-challenge risk of infection is independent of Wi conditional on Si(1) and Zi. BIP models that relax this assumption are considered in [16], [12], and [10].

2.4. Crossover Vaccination Design

Next consider the likelihood for the CrV design. Recall in this design that after each challenge uninfected control NHPs are randomly assigned to receive vaccine according to a pre-specified crossover probability. For NHPs initially assigned to the control arm that cross-over, immune response to the vaccine Si(1) is measured after vaccination but prior to subsequent challenges. Denote the cross-over probability by cp. Let Ti0 denote the observed number of challenges when NHP i was unvaccinated and δi0 be an indicator which equals one if challenge Ti0 resulted in infection. Define Ti1 and δi1 similarly for when NHP i was vaccinated. For NHPs randomly assigned to receive vaccine at the start of the experiment (i.e., Zi = 1), let Ti0=δi0=0. Note if a NHP is initially assigned vaccine (Zi = 1) and becomes infected ( δi1=1), then Ti1=Ti(1). For NHPs randomly assigned to control at the start of the experiment (i.e., Zi = 0) who become infected prior to crossover or reach the end of the experiment prior to crossover, let Ti1=δi1=0. Note if a NHP is initially assigned control (Zi = 0) and becomes infected prior to crossover, then Ti0=Ti(0). Let Z̃i indicate vaccination at some point during the study, i.e., Zi=Zi+(1-Zi)I[Ti1>0]. Suppose n iid copies of ( Zi,Ti0,δi0,Ti1,δi1,Si(1)I[Zi=1]) are observed. Note it is being assumed the following two immune responses are equal: (i) the immune response observed when an unvaccinated NHP that has escaped infection from one or more challenges crosses over and is vaccinated, and (ii) the immune response that NHP would have had if, counter to fact, they had been vaccinated at baseline. The likelihood for the CrV design is proportional to

Lcrv(β,G)=i=1n{(φ(0,Si(1),Ti0,δi0;β)φ(1,Si(1),Ti1,δi1;β)dG(Si(1))}Zi{φ(0,s,Ti0,δi0;β)dG(s)}1-Zi
(6)

Note if the cross-over probability is set to zero (i.e., cp = 0), then Zi = Z̃i and Lcrv simplifies to the likelihood for a standard RLC design Lrlc given in (4). If the CrV and BIP designs are combined such that Wi is also observed for i = 1, …, n, then the likelihood is the same as Lcrv except that dG(·) is replaced by dGw(·|Wi) in (6).

3. Simulation study

3.1. Standard RLC Design

A simulation study was conducted to assess whether standardly designed RLC studies of typical moderate sample size can provide adequate power to detect immune responses with high surrogate value. Data were generated assuming n/2 NHPs were randomized to each arm (vaccine or control), a maximum number of exposures per NHP of cmax = 10, and model (1). Simulations were conducted under different scenarios reflecting the degree of association between a binary BIP Wi and immune response Si(1). In particular, for each NHP, Wi was randomly generated from a Bernoulli distribution with mean 0.5. The immune response under vaccine Si(1) was then generated under three different scenarios. Under the first scenario, Si(1) was generated from a Bernoulli distribution with mean 0.3 and independent of Wi such that OR = odds(Si(1) = 1|Wi = 1)/odds(Si(1) = 1|Wi = 0) = 1. Under the second scenario, Si(1) was generated from a Bernoulli with mean 0.2 + 0.2Wi such that OR = 2.67. Under the third scenario, Si(1) was generated from a Bernoulli with mean 0.1 + 0.4Wi such that OR = 9. Note in all three scenarios the marginal distribution of Si(1) was Bernoulli with mean 0.3. Values of β were selected such that the probability of infection per challenge for controls was 0.5 regardless of Si(1), i.e., β1 = β3 = 0. To reflect immune responses with varying surrogate values, simulated data sets were generated under three scenarios corresponding to PAE ∈ {0.5, 0.75, 1} for contrast function h1. Specifically, for the first scenario β = (0, −1, 0, 0) such that PAE = 0.5; for the second scenario β = (0, −1, 0, −2) such that PAE = 0.75; and for the third scenario β = (0, 0, 0, −3) such that PAE = 1. RLC studies were simulated with total sample sizes n = 30, 40, 50. For each combination of OR, PAE, and n, 5000 simulated data sets were generated. For each data set the likelihood ratio test was used to assess H0 : β4 = 0 versus HA : β4 ≠ 0 at the α = 0.05 significance level.

Simulation results are summarized in Tables 12. Empirical type I error and power for a standard RLC design (i.e., without or ignoring Wi) are given in Table 1. For Si(1) with no surrogate value, i.e., β4 = 0 and PAE = 0.50, the type I error of the likelihood ratio test did not exceed the nominal level α = 0.05 (results were similar under the null for all simulations discussed below). For Si(1) with moderate surrogate value, i.e., β4 = −2 and PAE = 0.75, there was low power (17–49%), even if the total sample size was n = 50. For Si(1) a perfect surrogate, i.e., β4 = −3 and PAE = 1, there was decent power (71–82%) provided the total sample size was at least n = 40.

Table 1

Empirical type I error and power for standard RLC study design based on simulation study described in Section 3. Each table entry is based on 5000 simulated RLC trials with n NHPs total, of which n/2 were randomized to each study arm. Infection outcomes were simulated according to model (1) and PAE is given by (2). Table entries equal the empirical type I error or power for the likelihood ratio test of H0 : β4 = 0 versus HA : β4 ≠ 0 at significance level α = 0.05.

β4PAEn = 30n = 40n = 50
00.500.010.010.01
−20.750.170.320.49
−31.000.530.710.82

Table 2

Empirical type I error and power of likelihood ratio test of H0 : β4 = 0 for standard RLC study design with a binary BIP where OR = odds(Si(1) = 1|Wi = 1)/odds(Si(1) = 1|Wi = 0).

β4PAEn = 30
n = 40
n = 50
OR=1OR=2.67OR=9OR=1OR=2.67OR=9OR=1OR=2.67OR=9
00.500.010.020.020.010.010.020.010.020.02
−20.750.180.200.290.300.330.430.420.450.54
−31.000.500.530.580.650.670.730.760.770.82

Empirical power for a standard RLC design with a (supposed) BIP is given in Table 2. For Wi with a strong association with Si(1), i.e., OR = 9, the presence of the BIP led to an increase in power, although the increase was often modest. For example, for n = 50 and PAE = 0.75 power was 0.49 without the BIP compared to 0.54 with the BIP. For Wi with a moderate association with Si(1), i.e., OR = 2.67, the presence of a BIP generally did not increase power and in fact, perhaps surprisingly, sometimes led to a decrease in power. For example, for n = 40 and PAE = 1 power was 0.71 without a BIP compared to 0.67 with the BIP. Presumably this decrease in power can be attributed to the BIP likelihood (5) requiring estimation of the two conditional distributions Gw for Wi = 0, 1 whereas the standard RLC likelihood (4) requires only estimating the marginal distribution G. The cost of estimating the additional parameters associated with Gw is readily apparent by comparing the power when PAE = 1 in Table 1 with Table 2 when OR = 1; in this case Wi provides no information about Si(1) and therefore it is better to ignore the putative BIP.

3.2. CrV Design

Simulations studies were also conducted to evaluate the utility of the CrV design as a means to potentially increase the power of RLC studies to detect immune responses with high surrogate value. Different values of the crossover probability cp were investigated to ascertain the effect of cp on the power to detect surrogate value. Results for cp ∈ {0.1, 0.25, 0.5} are given in Table 3. In comparison to Table 1, CrV resulted in an increase in power for cp = 0.1. For example, for n = 30 and PAE = 0.75, power was 0.17 for the standard RLC design compared to 0.32 for the CrV design with cp = 0.1, i.e., the CrV design had almost twice the power. Results for cp = 0.25 and cp = 0.5 were mixed, with generally an increase in power for PAE = 0.75 and a decrease in power for PAE = 1 relative to the standard RLC design.

Table 3

Empirical type I error and power of likelihood ratio test of H0 : β4 = 0 for the CrV study design without a BIP. After each exposure, uninfected NHPs randomized to the control arm randomly crossed-over to the vaccine arm with probability cp.

β4PAEn = 30
n = 40
n = 50
cp = .1cp = .25cp = .5cp = .1cp = .25cp = .5cp = .1cp = .25cp = .5
00.500.030.040.060.030.050.060.030.050.06
−20.750.320.360.280.500.490.380.610.610.45
−31.000.610.560.370.790.700.470.890.800.56

Additional simulation studies were undertaken to examine usefulness of the CrV design in the presence of a BIP Wi having strong association with Si(1), i.e., OR = 9. These results are given in Table 4. Comparison with Table 2 suggests that CrV is of modest utility in the presence of a strong BIP. For example, for n = 30 and PAE = 1, the empirical power was 0.58 without CrV compared to 0.65 for CrV with cp = 0.1. Similarly, comparing Table 3 and and44 indicates a BIP having strong association with Si(1) does not meaningfully increase power in the CrV design. For instance, for n = 30, PAE = 1, and cp = 0.1, the empirical power was 0.61 in the absence of a BIP compared to 0.65 when OR = 9.

Table 4

Empirical type I error and power of likelihood ratio test of H0 : β4 = 0 for the CrV study design in the presence of a binary BIP with cross-over probability cp and OR = odds(Si(1) = 1|Wi = 1)/odds(Si(1) = 1|Wi = 0) = 9.

β4PAEn = 30
n = 40
n = 50
cp = .1cp = .25cp = .5cp = .1cp = .25cp = .5cp = .1cp = .25cp = .5
00.500.040.040.060.040.050.050.040.040.05
−20.750.360.350.300.500.480.380.620.570.46
−31.000.650.570.410.790.700.510.860.780.59

The results in Tables 1, ,3,3, and and44 suggest power for the CrV design is a non-monotonic function of cp. For the simulations presented here cp = 0.1 yielded the greatest power. However, for other parameter configurations a different cross-over probability may be optimal. Therefore it is important that investigators planning a CrV RLC study conduct additional simulation studies for parameter values tailored to the particular setting where the RLC study will take place in order to determine the optimal value of cp.

One potential drawback of the CrV design is that there are fewer challenges of unvaccinated NHPs, potentially diminishing the power to detect a vaccine effect on the primary endpoint, namely infection. Therefore simulation studies were also conducted to assess how cp affects the power to detect a vaccine effect on the risk of infection. Let VE = 1 − p(1)/p(0) where p(z) denotes the (marginal) risk of infection per challenge if a NHP is assigned treatment z = 0, 1. In words, VE is the relative reduction in the per challenge risk of infection due to vaccination. RLC studies were simulated under different scenarios to assess the power of the likelihood ratio test to reject the null hypothesis of no vaccine effect on the risk of infection, i.e., H0 : VE = 0. Simulations were performed for VE ∈ {0, 0.25, 0.50, 0.75} and cp ∈ {0, 0.1, 0.25, 0.5}. Results are shown in Table 5. For each scenario the average number of NHPs that crossed-over was in good agreement with the expected number of cross-overs, which in general equals (n/2)j=1cmax-1{1-p(0)}j(1-cp)j-1cp. The empirical power of the likelihood ratio test presented in Table 5 demonstrates that the CrV design may result in modest diminution of power to detect a vaccine effect on infection relative to the standard RLC study design. For example, for VE = 0.50 the empirical power was 0.86 when cp = 0 compared to 0.83 when cp = 0.25. These results suggest the CrV design could be implemented without adversely affecting power for the primary endpoint (infection) while potentially increasing power to detect a SoP.

Table 5

Empirical type I error and power to reject H0 : VE = 0 where VE = 1 − p(1)/p(0) is the relative reduction in the per-challenge risk of infection due to vaccination. Simulation results shown are based on p(0) = 0.5 and n = 50 NHPs total, with 25 randomized to vaccine initially. The average number of control NHPs that crossed-over are given in parentheses. The parenthetical numbers in the column headers denote the (theoretical) expected number of NHPs to crossover.

Crossover probability cp
VE0(0)0.1(2.3)0.25(5.0)0.5(8.3)
00.05(0)0.05(2.3)0.05(5.0)0.05(8.4)
0.250.26(0)0.28(2.3)0.25(4.9)0.22(8.3)
0.500.86(0)0.85(2.2)0.83(5.0)0.78(8.3)
0.751.00(0)1.00(2.3)1.00(5.0)1.00(8.3)
Expected crossovers
Average simulated number of NHPs that crossed-over

3.3. Additional Simulations

In the simulations described above β1 = β3 = 0 such that the probability of infection per challenge for unvaccinated NHPs was 0.5 regardless of Si(1), i.e., p(0, 1; β) = p(0, 0; β) = 0.5. Additional simulation studies were conducted to assess the power of RLC studies to detect an immune response with surrogate value when the per challenge infection probability differs between unvaccinated NHPs with Si(1) = 1 versus Si(1) = 0, i.e., when β3 ≠ 0. In particular, the simulation study described above for a standard RLC design without a BIP (the results of which are given in Table 1) was repeated with three different combinations of values for β1 and β3 such that p(0, 1; β) < p(0, 0; β) but the marginal probability of infection per challenge when unvaccinated remained 0.5 as in Table 1. For simulations where PAE = 0.5 (i.e., β4 = 0), this correspond to the scenario where the immune response Si(1) is a correlate of infection risk but not predictive of the vaccine effect on risk of infection. The results for these additional simulations are given in Table 6 and are comparable to those in Table 1, with only a slight reduction in power as the difference in p(0, 1; β) and p(0, 0; β) increases.

Table 6

Empirical type I error and power of likelihood ratio test of H0 : β4 = 0 for standard RLC study design as in Table 1 for different values of the unvaccinated per-infection transmission probabilities p(0, 1; β) and p(0, 0; β).

β4PAEp(0, 1; β) = 0.45
p(0, 0; β) = 0.52
p(0, 1; β) = 0.40
p(0, 0; β) = 0.54
p(0, 1; β) = 0.35
p(0, 0; β) = 0.56
n = 30n = 40n = 50n = 30n = 40n = 50n = 30n = 40n = 50
00.500.010.010.010.010.010.010.010.020.02
−20.750.200.370.500.220.390.520.240.400.52
−31.000.540.710.830.520.700.790.490.670.76

4. Application

Ellenberger et al. [6] reported results from a RLC experiment with 16 vaccinated NHPs and 14 control NHPs. All animals were challenged weekly with simian HIV and evidence of infection was assessed after each challenge. Previous analyses of these data indicate the vaccine caused a significant reduction in the per challenge probability of infection [6,10]. Analysis of various immune response measurements from vaccinated NHPs suggested possible correlation between the number of challenges until infection and immune responses to Gag peptide pools as measured by the ELISpot assay (see Figure 3 from [6]). For the analysis here, we let Si = 1 if the total ELISpot response to Gag was at least 1000 per 106 PBMC and Si = 0 otherwise. The observed data (Zi, Ti, δi, Si) are presented in Table 7.

Table 7

Data based on Ellenberger et al. [6] RLC experiment, where Zi indicates vaccination (1) or not (0), Ti denotes the observed number of challenges, δi indicates infection (1) or not (0) by the last challenge, Si denotes immune response to the vaccine, and Wi is a baseline immunogenicity predictor (BIP).

Control
Vaccine
ZiTiδiWiZiTiδiSiWi
011112111
011012100
021012110
021115100
021015100
021015111
021015100
021116100
031016100
0411110100
0810118100
01010118000
01110118011
01400119101
119010
126011

Fitting model (1) to these data based on (4), the likelihood ratio test of H0 : β4 = 0 yields a two-sided p-value p = 0.43, providing no indication that Si has surrogate value. Suppose, however, contrary to fact, that a BIP Wi had been collected for all NHPs that correlated well (but not perfectly) with Si in the vaccine arm of the study; such a Wi is given in Table 7. In this case fitting model (1) based on (5), the likelihood ratio test of H0 : β4 = 0 yields p = 0.04 and the maximum likelihood estimate of PAE is PAE^=0.87, indicating that Si has significant surrogate value. While finding a BIP Wi that correlates well with Si(1) may not always be feasible in practice, this example illustrates the potential utility of a BIP in discerning the surrogate value of Si(1). Similarly, suppose contrary to fact that the CrV design had been employed in this experiment such that four of the NHPs initially assigned control were randomly crossed-over after escaping infection from at least one challenge prior to vaccination. One possible observed data set from such an experiment is given in Table 8. For these data, fitting model (1) based on (6), the likelihood ratio test of H0 : β4 = 0 yields p = 0.04 and PAE^=0.87, indicating that Si has significant surrogate value. This illustrates the potential utility of the CrV design in the absence of a BIP.

Table 8

Data based on Ellenberger et al. [6] RLC experiment with hypothetical CrV, where Zi denotes initial randomization to vaccine (1) or not (0), Z̃i indicates ever vaccinated (1) or not (0), Ti0 denotes number of challenges while not vaccinated, δi0 denotes infected (1) or not (0) after Ti0 challenges, Ti1 denotes number of challenges while vaccinated, δi1 denotes infected (1) or not (0) after challenges while vaccinated, and Si(1) indicates potential immune response (1) or not (0) to vaccine. Missing values are denoted by *.

Control (Zi = 0)
Vaccine (Zi = 1)
Zi Ti0 δi0 Ti1 δi1Si(1)Zi Ti0 δi0 Ti1 δi1Si(1)
01100*100211
01100*100210
02100*100211
02100*100510
02100*100510
02100*100511
02100*100510
02100*100610
03100*100610
04100*1001010
1602101001810
1802101001800
15015001001801
12018011001910
1001901
1002601

It is important to keep in mind that while in this example the presence of a BIP or employing the CrV design has a substantial effect on the resulting inference drawn about the surrogate value of Si(1), for other data sets this may not be the case. Moreover, the simulation results from the previous section indicate gains in power due to a BIP or the CrV design tend not to be substantial.

5. Discussion

A challenge in determining the surrogate value of an immune response in a standard RLC study is that Si(1) is missing for unvaccinated NHPs. Following previous work on human vaccine trials (e.g., [1214, 16]), in this paper we considered evaluating surrogate value in a standard RLC study design with and without a BIP. We also considered the CrV design, an adaptation of the close-out placebo vaccination design [16], that reveals Si(1) for some NHPs that initially were unvaccinated. There are four main findings from this paper. First, perhaps surprisingly, it is possible to detect an immune response with surrogate value in a standard RLC study without a BIP; this is not true of a single challenge study design. Second, the presence of a BIP does not appreciably increase power, even if the correlation with the immune response is high. Third, CrV can lead to an increase in power, although this gain is sometimes modest and dependent on the cross-over probability selected. Fourth, regardless of whether a BIP is present or CrV is employed, in order to have adequate power to detect a surrogate of protection (i) the vaccine effect on the immune response must be strongly associated with the vaccine effect on risk of infection (PAE ≈ 1) and (ii) the sample size must be relatively large (e.g., n = 50).

There are several areas of possible future research related to evaluating immunological surrogates of protection in RLC studies. First, development of exact-type methods for this setting might be of utility given the relatively small sample sizes typical of RLC studies. While the maximum likelihood approach used in this paper performed well in simulation studies (e.g., the nominal type I error was preserved), it might be preferred to use methods that do not rely on large sample approximations. Indeed the likelihood ratio test performed conservatively in some of the simulation studies presented (e.g., in Tables 1 and and22 the empirical type I error was less than the nominal level), suggesting more powerful tests might be developed. Some possibilities would be to consider higher-order approximations, such as a Bartlett correction [19], to the likelihood ratio test or resampling techniques like the bootstrap as in [10]. Alternatively, a Bayesian approach (e.g., [20, 21]) might be considered if one was willing to entertain the attendant assumptions. A second area of possible future research would be to examine continuous or multivariate S. In this paper we consider only a binary S, which simplifies matters (such as proving identifiability or lack thereof) and is also of practical relevance as binary immune responses (e.g., seroconversion) are often used operationally in vaccine studies as surrogates of protection. If the immune response outcome of interest is continuous (or ordinal), the methods developed in this paper readily apply by choosing different cut-points or positivity thresholds to create a binary S.

The methods in this paper could also be extended to relax the assumption that the transmission probability is independent of the number of previous challenges. One approach in this direction would be to develop a beta-geometric model; Hudgens and Gilbert [10] applied such a model to the Ellenberger et al. RLC experiment and found the time homogeneous transmission probability model fit the data better. Similarly, within the context of the CrV design, methods could be developed to relax the time constancy assumption that (i) the immune response measured after escaping infection from one or more challenges and then being vaccinated is the same as (ii) the immune response that would have been measured if vaccinated prior to any challenges. This assumption would be violated if being challenged but not infected causes the immune response to vaccine to be altered. A lack of association between the observed number of challenges when not vaccinated Ti0 and observed immune response when vaccinated Si(1) would be supportive of this assumption. However, presence of such an association does not necessarily indicate the time constancy assumption is invalid because NHPs with missing Si(1) in the CrV design are not necessarily a random subset of NHPs in the experiment. In the context of human efficacy trials, Follmann [16] suggests some additional approaches to assessing this time constancy assumption which rely on invoking other additional assumptions. Until methods which formally relax this assumption are developed within the context of RLC studies, analysts should rely on subject-matter experts as to the feasibility of this assumption. If the time constancy assumption of the CrV design is not considered feasible, then inferences drawn using the methods described in this paper may be erroneous and should therefore be interpreted with caution.

Finally we note that as high dimensional immune response data becomes more readily available on each NHP, a natural next area to explore would be the application of supervised learning methods for this “large p, small n” setting in identifying immune SoPs.

Acknowledgments

This work was supported in part by National Institutes of Health grants U54-GM104942, P30-AI50410, and R37-AI054165 and Centers for Disease Control and Prevention grant grant U48-DP-001921. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health or the Centers for Disease Control and Prevention. The authors thank Peter Gilbert for helpful discussions and comments.

Contract/grant sponsor: NIH grants R37 AI054165, P30 AI50410, and U54-GM104942.

Appendix

Identifiability of β in standard RLC study deisgn

Here we show β = (β1, β2, β3, β4) is identifiable in the standard RLC study design (i.e., without CrV or BIP) for Si(1) binary. Because treatment assignment is random and Si(1) is observed in all NHPs randomized to Zi = 1, G is identifiable and can be regarded as fixed and known. Assume β1,…, β4 are all finite, so that there is a positive probability of observing (or not) an infection under either treatment assignment. Also assume G is such that 0 < Pr[Si(1) = s] < 1 for s = 0, 1. Let κ= Pr[Si(1) = 1].

Under these assumptions, it is straightforward to show that β1 + β2 and β3 + β4 are identifiable. To see this, suppose there are two parameterizations β = (β1, β2, β3, β4) and β̃ = (β̃1, β̃2, β̃3, β̃4) such that

Pr[Ti=t,δi=d,Si=sZi=z;β]=Pr[Ti=t,δi=d,Si=sZi=z;β]
(7)

for all possibly observed combinations of t, d, s, z. Then for t = 1, d = 1, s = 0, z = 1, (7) implies Φ{β1 + β2} = Φ{β̃1 + β̃2}. Because Φ is invertible, this implies β1 + β2 = β̃1 + β̃2. Similarly it can be shown β3 + β4 = β̃3 + β̃4. That is, β1 + β2 and β3 + β4 are identifiable. The more interesting issue germane to assessing the surrogate value of S is whether β2 and β4 individually are identifiable from the observable data from a RLC study. To prove this, it will be shown below that β1 and β3 are identifiable provided Pr[Si(1) = 1] ≠ 1/2 and cmax > 2.

As a brief aside, first consider a single dose challenge study (i.e., cmax = 1). Then for t = 1, d = 1, z = 0, (7) implies

Φ(β1+β3s)dG(s)=Φ(β1+β3s)dG(s)
(8)

The only other possible pattern of observed data under Zi = 0 would be t = 1, d = 0, z = 0, in which case (7) implies

{1-Φ(β1+β3s)}dG(s)={1-Φ(β1+β3s)}dG(s)
(9)

Suppose without loss of generality that β̃3≠ 0, and let β3 = 0 and β1 = Φ−1{ Φ(β̃1 + β̃3s)dG(s)}. Then (8) and (9) hold, yet β3β̃3. That is, β1 and β3 are not identifiable in a single dose challenge study.

One can also show β1 and β3 are not identifiable for a RLC study where cmax = 2. To prove this, let c1 = (1 − κ)Φ(β1) + κΦ(β1 + β3) and c2 = (1 − κ)Φ(β1)2 + κΦ(β1 + β3)2. Note Pr[Ti = 1, δi = 1, Si = 0|Zi = 0] = c1, Pr[Ti = 2, δi = 1, Si = 0|Zi = 0] = c1c2, and Pr[Ti = 2, δi = 0, Si = 0|Zi = 0] = 1 − 2c1 + c2. It is straight-forward to show that Φ(β1) equals either c1+{κ(c2-c12)/(1-κ)}1/2 or c1-{κ(c2-c12)/(1-κ)}1/2. Without loss of generality, assume Φ(β1) equals the former and let β̃1 be such that Φ(β̃1) equals the latter. Let β̃3 be such that (1 − κ)Φ(β̃1) + κΦ(β̃1 + β̃3) = c1. Then one can show (1 − κ)Φ(β̃1)2 + κΦ(β̃1 + β̃3)2 = c2, implying (7) holds. Yet c2>c12, implying β1β̃1.

Now consider an RLC studies where cmax > 2. Below it is shown that if κ ≠1/2, then β1 and β3 are identifiable. On the other hand, if κ = 1/2, then β1 and β3 are not identifiable. To begin, note for z = 0 that (7) can be equivalently expressed as

(1 - κ)At-d(1-A)dκBt-d(1-B)d = (1 - κ)Ct-d(1-C)dκDt-d(1-D)d
(10)

for all t, d where A = 1 − Φ(β1), B = 1 −Φ (β1 + β3), C = 1 − Φ(β̃1), D = 1 − Φ(β̃1 + β̃3). If A = C, then (10) implies B = D and identifiability is proved. So assume by way of contradiction AC, and thus DB. For d = 0 and t = 1, 2, 3, (10) implies

(1 - κ)(AC) = κ(DB)
(11)
(1 - κ)(A2C2) = κ(D2B2)
(12)
(1 - κ)(A3C3) = κ(D3B3)
(13)

Dividing (12) and (13) by (11) yields

ACDB
(14)
A2ACC2D2DBB2
(15)

Squaring (14) and subtracting the result by (15) yields

ACBD
(16)

Squaring (14) and then subtracting the result by four times (16) on both sides gives

(A-C)2 = (D-B)2
(17)

Assume without loss of generality that A > C. Then (11) implies D > B which by (17) in turn implies

ACDB.
(18)

If κ ≠ 1/2, then (11) and (18) cannot simultaneously be true if AC and BD, and therefore we have a contradiction. Thus if κ ≠ 1/2, then β1 and β3 are identifiable, and likewise β2 and β4.

Finally, suppose κ= 1/2. Note equation (7) for z = 0 is equivalent to

At-d(1-A)dBt-d(1-B)dCt-d(1-C)dDt-d(1-D)d
(19)

for all t, d. Suppose without loss of generality that β̃3 ≠ 0 and let β1 = β̃1 + β̃3 and β3 =−β̃3 such that A = D and B = C. Then (19) holds for all t, d even though β1β̃1 and β3β̃3.

References

1. Morgan C, Marthas M, Miller C, Duerr A, Cheng-Mayer C, Desrosiers R, Flores J, Haigwood N, Hu S, Johnson R, et al. The use of nonhuman primate models in HIV vaccine development. PLoS Medicine. 2008;5(8):e173. [PMC free article] [PubMed] [Google Scholar]
2. Fauci A, Johnston M, Dieffenbach C, Burton D, Hammer S, Hoxie J, Martin M, Overbaugh J, Watkins D, Mahmoud A, Greene W. HIV vaccine research: the way forward. Science. 2008;321:530–532. [PubMed] [Google Scholar]
3. Feinberg MB, Moore JP. AIDS vaccine models: challenging challenge viruses. Nature Medicine. 2002;8:207–210. [PubMed] [Google Scholar]
4. Roederer M, Keele BF, Schmidt SD, Mason RD, Welles HC, Fischer W, Labranche C, et al. Immunological and virological mechanisms of vaccine-mediated protection against SIV and HIV. Nature. 2013;505:502–508. [PMC free article] [PubMed] [Google Scholar]
5. García-Lerma J, Otten R, Qari S, Jackson E, Cong M, Masciotra S, Luo W, Kim C, Adams D, Monsour M, et al. Prevention of rectal SHIV transmission in macaques by daily or intermittent prophylaxis with emtricitabine and tenofovir. PLoS Med. 2008;5:e28. [PMC free article] [PubMed] [Google Scholar]
6. Ellenberger D, Otten RA, Li B, Rodriguez V, Sariol CA, Martinez M, Monsour M, Wyatt L, et al. HIV-1 DNA/MVA vaccination reduces the per exposure probability of infection during repeated mucosal SHIV challenges. Virology. 2006;352:216–225. [PubMed] [Google Scholar]
7. Subbarao S, Otten R, Ramos A, Jackson E, Monsour M, Bashirian S, Kim C, Johnson J, Soriano V, et al. Chemoprophylaxis with tenofovir disoproxil fumarate provided partial protection against simian human immunodeficiency virus infection in macaques given multiple virus challenges. Journal of Infectious Diseases. 2006;194:904–11. [PubMed] [Google Scholar]
8. Otten R, Adams D, Kim C, Jackson E, Pullium J, Lee K, Grohskopf L, Monsour M, Butera S, Folks T. Multiple vaginal exposures to low doses of R5 simian-human immunodeficiency virus: strategy to study HIV preclinical interventions in nonhuman primates. Journal of Infectious Diseases. 2005;191:164–173. [PubMed] [Google Scholar]
9. Regoes RR, Longini IM, Feinberg MB, Staprans SI. Preclinical assessment of HIV vaccines and microbicides by repeated low-dose virus challenges. PLoS Medicine. 2005;2(8):e249. [PMC free article] [PubMed] [Google Scholar]
10. Hudgens MG, Gilbert PB. Assessing vaccine effects in repeated low-dose challenge experiments. Biometrics. 2009;65:1223–1232. [PMC free article] [PubMed] [Google Scholar]
11. Hudgens M, Gilbert P, Mascola J, Wu C, Barouch D, Self S. Power to detect HIV vaccine effects in repeated low-dose challenge experiments. Journal of Infectious Diseases. 2009;200(4):609–613. [PMC free article] [PubMed] [Google Scholar]
12. Gilbert PB, Hudgens MG. Evaluating candidate principal surrogate endpoints. Biometrics. 2008;64:1146–1154. [PMC free article] [PubMed] [Google Scholar]
13. Qin L, Gilbert P, Corey L, McElrath M, Self S. A framework for assessing immunological correlates of protection in vaccine trials. Journal of Infectious Diseases. 2007;196:1304–1312. [PubMed] [Google Scholar]
14. Gilbert PB, Qin L, Self SG. Evaluating a surrogate endpoint at three levels, with application to vaccine development. Statistics in Medicine. 2008;27:4758–4778. [PMC free article] [PubMed] [Google Scholar]
15. Gilbert PB, Peterson ML, Follmann D, Hudgens MG, Francis DP, Gurwith M, Heyward WL, Jobes DV, Popovic V, Self SG, et al. Correlation between immunologic responses to a recombinant glycoprotein 120 vaccine and incidence of HIV-1 infection in a phase 3 HIV-1 preventive vaccine trial. Journal of Infectious Diseases. 2005;191:666–77. [PubMed] [Google Scholar]
16. Follmann D. Augmented designs to assess immune response in vaccine trials. Biometrics. 2006;62:1161–1169. [PMC free article] [PubMed] [Google Scholar]
17. Rubin DB. Discussion of “Randomization analysis of experimental data in the Fisher randomization test” by D. Basu. Journal of the American Statistical Association. 1980;75:591–593. [Google Scholar]
18. Frangakis CE, Rubin DB. Principal stratification in causal inference. Biometrics. 2002;58:21–29. [PMC free article] [PubMed] [Google Scholar]
19. Cordeiro GM, Cribari-Neto F. An Introduction to Bartlett Correction and Bias Reduction. Springer; 2014. [Google Scholar]
20. Zigler CM, Belin TR. A Bayesian approach to improved estimation of causal effect predictiveness for a principal surrogate endpoint. Biometrics. 2012;68 (3):922–932. [PMC free article] [PubMed] [Google Scholar]
21. Li Y, Taylor JM, Elliott MR. A Bayesian approach to surrogacy assessment using principal stratification in clinical trials. Biometrics. 2010;66(2):523–531. [PMC free article] [PubMed] [Google Scholar]