nach oben

BMC Medical Research Methodology

Erschienen in:

Open Access 01.12.2013 | Research article

Clinical trial simulation to evaluate power to compare the antiviral effectiveness of two hepatitis C protease inhibitors using nonlinear mixed effect models: a viral kinetic approach

verfasst von: Cédric Laouénan, Jeremie Guedj, France Mentré

Erschienen in: BMC Medical Research Methodology | Ausgabe 1/2013

Abstract

Background

Models of hepatitis C virus (HCV) kinetics are increasingly used to estimate and to compare in vivo drug’s antiviral effectiveness of new potent anti-HCV agents. Viral kinetic parameters can be estimated using non-linear mixed effect models (NLMEM). Here we aimed to evaluate the performance of this approach to precisely estimate the parameters and to evaluate the type I errors and the power of the Wald test to compare the antiviral effectiveness between two treatment groups when data are sparse and/or a large proportion of viral load (VL) are below the limit of detection (BLD).

Methods

We performed a clinical trial simulation assuming two treatment groups with different levels of antiviral effectiveness. We evaluated the precision and the accuracy of parameter estimates obtained on 500 replication of this trial using the stochastic approximation expectation-approximation algorithm which appropriately handles BLD data. Next we evaluated the type I error and the power of the Wald test to assess a difference of antiviral effectiveness between the two groups. Standard error of the parameters and Wald test property were evaluated according to the number of patients, the number of samples per patient and the expected difference in antiviral effectiveness.

Results

NLMEM provided precise and accurate estimates for both the fixed effects and the inter-individual variance parameters even with sparse data and large proportion of BLD data. However Wald test with small number of patients and lack of information due to BLD resulted in an inflation of the type I error as compared to the results obtained when no limit of detection of VL was considered. The corrected power of the test was very high and largely outperformed what can be obtained with empirical comparison of the mean VL decline using Wilcoxon test.

Conclusion

This simulation study shows the benefit of viral kinetic models analyzed with NLMEM over empirical approaches used in most clinical studies. When designing a viral kinetic study, our results indicate that the enrollment of a large number of patients is to be preferred to small population sample with frequent assessments of VL.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2288-13-60) contains supplementary material, which is available to authorized users.

Competing interests

All authors declare that they have no competing interests.

Authors’ contributions

CL, JG and FM designed the simulation study and the MODCUPIC trial. CL carried out the simulations and drafted the manuscript. JG participated to the work of estimation (with CL). CL, JG and FM participated in the statistical analysis and helped to draft the manuscript. All authors read and approved the final manuscript.

Background

Chronic infection with Hepatitis C Virus (HCV) affects 130–200 million people worldwide [1]. It is the leading cause of cirrhosis, liver cancer and liver transplants which result in 350,000 deaths worldwide [2]. HCV is divided into 6 genotypes, with genotype 1 being the hardest to treat and the most prevalent in Western countries. The goal of treatment is to achieve a sustained virologic response (SVR), marker of viral eradication, assessed by a viral load HCV RNA (VL) below the limit of detection (LOD) six months after cessation of therapy. Until 2011, the only available treatment was based on weekly injections of pegylated interferon (peg-IFN) and daily oral ribavirin (RBV) during 48 weeks, with SVR rate lower than 50% in treatment-naïve HCV genotype 1 patients [3].

In 2011, the approval of two protease inhibitors (PI), telaprevir and boceprevir, in combination with peg-IFN/RBV (triple therapy), marked a milestone for anti-HCV therapy with SVR rates larger than 70% in treatment-naïve HCV genotype 1 patients [4, 5]. Dozens of compounds targeting different viral proteins are currently in different stages of clinical trials, raising the expectation that several IFN-free regimens might be available in the coming years.

Viral kinetic modeling aims at characterizing the main mechanisms that govern the virologic response to treatment using mathematical models. Following the recommendations of the Food and Drug Administration [6], this approach has been increasingly used in phase 1/2 of clinical development to estimate viral kinetic parameters and to evaluate drug antiviral effectiveness in vivo[7, 8]. Parameter estimation is often achieved using non-linear mixed effect models (NLMEM) [9]. The popularity of this approach is due to the fact that it optimizes the information available by borrowing strength from the whole sample to provide precise estimation of the parameters, including covariate effects [10‐12]. Moreover it naturally accounts for the information brought by VL data below the limit of detection (BLD) and reduces the bias in parameter estimation as compared to empirical approaches where BLD data are ignored or assigned to half the LOD [10, 13, 14].

So far, viral kinetic models and NLMEM have mostly been used in phase 1/2 clinical trials with large number of patients and/or frequent assessment of VL data within each patient. However in most clinical trials, in particular when they are not sponsored by the industry, it is not possible to hospitalize patients and to get frequent viral load samples. In this challenging context, the capacity of NLMEM to precisely estimate viral kinetic parameters is not known. In particular the performance of tests used to assess the effect of a covariate which have good asymptotic properties (Wald test, likelihood ratio test or score test) is not warranted when one is far from the asymptotic conditions. For instance an inflation of the type I error has been reported in another clinical context where data were sparse [15]. With the new potent triple therapies against HCV the amount of information available may also be limited by the fact that a large proportion of VL data are below LOD.

Here our goal was to evaluate the capacity of NLMEM to precisely estimate the parameters of viral kinetic models when there is a large proportion of BLD data and a limited number of data per patient. In particular we aimed to evaluate by simulation the type I errors and the power of the Wald test to compare the antiviral effectiveness of two groups receiving different triple therapies (noted PI-A and PI-B in the following). Parameter estimation and Wald test property were evaluated according to the number of patients, the number of samples per patient and the expected difference in antiviral effectiveness between the two treatment groups.

Methods

Viral kinetic model

We used the standard biphasic model of HCV kinetics defined by the following set of differential equations [16]:

\{\begin{cases} \frac{dI}{dt} = bVT - δI \\ \frac{dV}{dt} = p (1 - ϵ) I - cV \end{cases}

(1)

where T represents the density of target cells that can be infected by virus measured as HCV RNA (V), with rate constant b. In the model, infected cells (I) die or lose their infected state with rate constant δ and produce virions at constant rate p per cell. Virions are assumed to be cleared with rate constant c. As it is done usually when considering short term VL data we assumed that the target cell level is constant throughout the study period and remains at its pre-treatment steady state value T ₀ = cδ/pb[17, 18].

Treatment is assumed to reduce the average rate of viral production per cell from p to p(1–ϵ), where ϵ represents the constant drug effectiveness, i.e., ϵ = 0.990 implying the drug is 99% effective in blocking viral production. If all parameters including treatment effectiveness are constant over time this model predicts that VL will fall in a biphasic manner [16], with a rapid first phase of viral decline with rate approximately equal to c lasting for a couple of days and with the magnitude viral decline depending on ϵ, and a second slower but persistent second phase of viral decline with rate ϵδ. Hence, for potent therapies for which ϵ is close to 1, the second-phase slope will be approximately δ.

Lastly mathematical analysis shows that if ϵ is constant, p and b do not intervene in the VL equation and thus where ignored in the following without loss of generality [19].

Statistical model

We assumed an additive error (σ) on the log₁₀ of the VL observations, i.e., the observed data y _ij for patient i at time t _ij is given by:

y_{ij} = f (ϕ_{i}, t_{ij}) + e_{ij},

(2)

e_{i} \sim N (0, σ^{2}),

(3)

h (ϕ_{i}) = μ + β T_{i} + η_{i} with η_{i} \sim N (0, Ω),

(4)

where:

f is the non-linear model,
Φ _i is the vector of individual parameters of length p where p is the number of parameters,
e _ij is the residual error assumed to follow a normal distribution with mean 0 and variance σ ²,
h is the transformation of the vector of parameters that make them normally distributed,
μ is the vector of fixed effects,
β is the vector of coefficient of the only covariate studied i.e. the difference of effectiveness between PI-A and PI-B (with T _i = 0 if treatment is PI-A and T _i = 1 if treatment is PI-B),
η _i is the vector of random effects independent of e _i, and are supposed to be independent, with diagonal variance-covariance matrix $Ω = diag (ω_{1}^{2}, \dots, ω_{p}^{2})$ .

It is assumed that h is the logarithm transformation for V ₀, δ and c and the logit transformation for ϵ.

Parameter values

Mean parameter values, inter-individual standard deviations (ω) and standard deviation of residual error, σ, for patients treated with PI-A were assumed to be similar to those found in phase 1 of clinical trials with telaprevir at steady state [7] (Table 1). We assumed that PI-A immediately reached its steady state level of effectiveness, with mean ϵ ^A = 0.999. Similar parameter distribution was assumed in patients treated with PI-B, except for the mean antiviral effectiveness of PI-B, ϵ ^B. We considered several values for ϵ ^B equal to 0.999, 0.998, 0.995 and 0.990, corresponding to a similar, 2-fold, 5-fold and 10-fold lower levels in the blocking of viral production than PI-A, respectively. The LOD was fixed to 12 IU/mL [20].

Table 1

Value, distribution and inter-individual standard deviation of population parameters from data with patients treated by monotherapy of PI-A [7]

	V ₀(IU/mL)	c(day^-1)	δ(day^-1)	ϵ	σ (log₁₀IU/mL)
Fixed effect	2.68 10⁶	13.4	0.58	0.999	0.19
Transformation	lognormal	lognormal	lognormal	logistic-normal	-
Inter-individual standard deviation (ω)	1.09	0.25	0.25	0.61	-

Clinical trial simulation

We considered different designs in real-life setting, i.e., with a limited number of VL measurements per patient. Two schedules for the VL assessments were considered, called “7 VL” and “5 VL” in the following. “7 VL” had seven VL measurements at days 0, 0.33, 1, 2, 3, 7 and 14 whereas “5 VL” was sparser and did not have the early measurements at days 0.33 and 1 that are often difficult to obtain in clinical practice. Then different scenarios were considered according to the number of VL measurements (n) and the number of patients per group of treatment (N). In order to have designs that could be easily compared, we considered different designs with 5 VL or 7 VL but constant total numbers of observations per group n_tot = N×n. We considered small sample size with: n_tot = 50 (N = 10 and n = 5) and n_tot = 70 (N = 10 and n = 7 or N = 14 and n = 5), middle sample size with: n_tot = 100 (N = 20 and n = 5) and n_tot = 140 (N = 20 and n = 7 or N = 28 and n = 5) and larger sample size with: n_tot = 150 (N = 30 and n = 5) and n_tot = 210 (N = 30 and n = 7 or N = 42 and n = 5). For each scenario, K = 500 dataset were generated using R software version 2.15.0 (R foundation for Statistical Computing, Vienna, Austria). Examples of simulated dataset with the design N = 30 and n = 7 and the different levels of antiviral effectiveness considered with the percentage of patients below the LOD at day 3, 7 and 14 is shown in Figure 1.

Parameter estimation

Data of each simulated trial were analyzed using MONOLIX version 4.2 (http://www.lixoft.eu/monolix/product-monolix-overview/) [21], a software devoted to maximum likelihood estimation of parameters in NLMEM using an extension of the stochastic approximation expectation-approximation (SAEM) algorithm [22, 23]. Of note one advantage of maximum likelihood estimation (noted “ML” in the following) is that it takes into account the information brought by BLD data [10]. We compared accuracy and precision of parameter estimations obtained with those that would be obtained if all data were observed with no BLD data at all (referred as “all data”).

For each scenario, relative estimation errors REE

({\hat{θ}}_{k}), k = 1, \dots, K

were computed as shown in equation (5), where

{\hat{θ}}_{k}

is the parameter estimated for the k^th replicate and θ ^* the true parameter value used to generate the data.

REE ({\hat{θ}}_{k}) = \frac{{\hat{θ}}_{k} - θ^{*}}{θ^{*}} \times 100

(5)

Each REE_k was expressed in percent. We plotted the boxplot of the REE with the 10% and 90% percentiles. Then, from the REE, the relative bias (RB) and the relative root mean square error (RRMSE) were computed as shown in equation (6) and (7).

RB ({\hat{θ}}_{k}) = \frac{1}{k} \sum_{k = 1}^{K} REE ({\hat{θ}}_{k})

(6)

RRMSE ({\hat{θ}}_{k}) = \sqrt{\frac{1}{k} \sum_{k = 1}^{K} REE {({\hat{θ}}_{k})}^{2}}

(7)

The accuracy and the precision of parameter estimates were evaluated using RB and RRMSE, respectively.

Detection of a difference in antiviral effectiveness

We analyzed for each scenario the ability to detect a difference of antiviral effectiveness between the two PIs. Given that treatment antiviral effectiveness was estimated on a logistic scale we defined accordingly the difference of effectiveness between the two PIs, β, as β = logit(ϵ ^A) − logit(ϵ ^B). For each simulated dataset, the estimated value of

β, \hat{β}

, was obtained along with its (estimated) standard error,

S E_{\hat{β}}

. Then the Wald test statistics given by

\frac{\hat{β}}{S E_{\hat{β}}}

was calculated and the null hypothesis “H₀: β = 0” was rejected at the level of 5% if

|\frac{\hat{β}}{S E_{\hat{β}}}| > 1.96

. Thus in scenarios where ϵ ^A = ϵ ^B = 0.999 and therefore β = 0, the type I error was given by the proportion of dataset among the 500 simulated that led to reject H₀. Similarly the power to detect a difference in treatment antiviral effectiveness was calculated in scenarios where β ≠ 0 and was given by the proportion of datasets among the 500 simulated that led to reject H_0. With ϵ ^A = 0.999 and with values of ϵ ^B equal to 0.998, 0.995 and 0.990, β were equal to 0.7, 1.6 and 2.3, respectively. Type I error and power were evaluated with all designs describe above and consistent with previous analysis [15, 24], we expect an inflation of the type I error with the Wald test. To ensure a type I error of 5%, we define for each design a correction threshold as the 5^th percentile of the distribution of the p-values of the test under H₀ for the 500 simulated dataset. Then we used that corrected threshold as a limit of significance in the evaluation of the tests under H₁ to compute the corrected power [15, 25]. Furthermore we evaluated type I error with 2 larger sample size to approach asymptotic conditions with: n_tot = 350 (N = 50 and n = 7) and n_tot = 700 (N = 100 and n = 7). Lastly, the power to detect a difference in treatment effectiveness between the two PIs was compared with the one obtained by standard empirical approaches where the difference in viral decline at day 14 between two treatments is tested by a non parametric two-sided Wilcoxon test.

Results

Parameter estimation

First we evaluated the impact of having a large proportion of BLD data on the precision of parameter estimates. Proportions of BLD data were equal to 19.3% and 88.3% at days 7 and 14 with ϵ = 0.999, 10.2% and 80.7% with ϵ = 0.998, 3.9% and 66.7% with ϵ = 0.995, 1.5% and 54.3% with ϵ = 0.990, respectively (Figure 1). For that purpose we compared the parameters estimation with all data or ML with the design N = 30 and n = 7 and assuming a lower effectiveness for PI-B than PI-A (ϵ ^A = 0.999 vs ϵ ^B = 0.990).

Assuming all data, i.e., all data can be observed and there is no LOD, all the parameter estimates had a very small RB lower than 1% and 11% for the fixed effects and the inter-individual variance parameters, respectively. Similarly, the RRMSE were lower than 10% and 33% for the fixed effects and the inter-individual variance parameters, respectively (Table 2). Yet, the precision of both the fixed effect and inter-individual variance parameters were very close to those found with the ML uncensored data (Table 2), showing the relevance of maximum likelihood in the handling of censored data. Similar results were obtained when comparing the distribution of the REE from the 500 simulated datasets (Figure 2). Equally good performance was obtained when considering sparser sampling design with n = 5 VL measurements per patient except for the viral clearance rate, c (Table 2 and Figure 2) [11].

Table 2

Relative bias (RB) (%) and relative root mean square error (RRMSE) (%) of the estimated parameters evaluated from 500 simulated datasets

	All data (n = 7 VL)		ML (n = 7 VL)		ML (n = 5 VL)
	RB (%)	RRMSE (%)	RB (%)	RRMSE (%)	RB (%)	RRMSE (%)
log₁₀(V ₀) (IU/mL)	0.1	1.0	0.2	1.0	0.1	1.0
c (day^-1)	1.0	4.3	0.6	4.1	34.1	78.8
δ (day^-1)	0.2	3.2	0.8	3.7	0.6	3.7
-log₁₀(1-ϵ)	0.1	3.2	0.4	3.1	0.3	3.6
β	0.4	8.5	−0.5	8.5	0.4	9.9
_ω ² _vo	−0.4	18.9	−1.0	19.0	−0.5	19.3
_ω ² _c	−4.6	31.5	−10.9	32.9	236.6	358.3
_ω ² _δ	−3.0	19.8	−2.3	24.5	−2.4	24.9
_ω ² _ϵ	−2.6	32.2	−4.5	32.0	−9.9	36.3
σ	−0.03	5.3	−0.7	6.2	−1.3	8.0

Parameters were simulated with N = 30 patients per group assuming ϵ ^A = 0.999 and ϵ ^B = 0.990, no limit of detection (“All data”) or a limit of detection at 12 IU/mL (“ML”), and with n = 7 or 5 viral load (VL) measurements per patient.

Type I error of the Wald test

Next we evaluated the type I error of the Wald test according to different designs and assuming ϵ ^A = ϵ ^B = 0.999 (Figure 3). A type I error of 14.4% was found with ML data, n = 5 VL and N = 10 showing that the asymptotic conditions under which the Wald test is valid were not met with this design. In fact a minimal number of observations per group n_tot = 140 was necessary in order to achieve a type I error less than 10% and n_tot = 700 (N = 100 and n = 7) were needed for type I error to be in the 95% prediction interval around 5% [3.1%; 6.9%] with ML data. Of note, for a given value of N, the number of VL measurements did not substantially change the type I error (Figure 3) and increasing the number of patients N was more beneficial than having more frequent VL assessments within each patient. For instance the type I error was lower with the design N = 14 and n = 5 than with the design N = 10 and n = 7 (Figure 3) although the total number of observation per patient n_tot was the same and equal to 70. In addition to the influence of the number of patients, the type I error was also deteriorated by the presence of a large proportion of BLD data, even if they were taken into account appropriately. Indeed type I errors were consistently smaller when assuming that there was no LOD (“all data”). As shown in Figure 3, the type I error was equal to 10.4% with all data, N = 10 and n = 5 compared to 14.4% with ML data.

Power to detect a difference in antiviral effectiveness

Next, we evaluated the power to detect a difference of effectiveness between the two PIs assuming ϵ ^A = 0.999 and lower values of ϵ ^B equal to 0.998, 0.995 and 0.990 with different designs. Except when both effectiveness were close (i.e., ϵ ^B = 0.998), the power of the Wald test, corrected or not (see methods), was larger than 95%, regardless of the number of sampling VL measurements and the number of patients per PI (Table 3). As found for the type I error, increasing the number of patients N was more beneficial than having more frequent VL assessments within each patient. For instance power was higher with the design N = 28 and n = 5 than with the design N = 20 and n = 7 (Table 3) although the total number of observation per patient n_tot was the same and equal to 140.

Table 3

Power (%) to detect a difference of effectiveness between PI-A and PI-B according to the study design and the effectiveness of PI-B ( ϵ ^B ), assuming ϵ ^A = 0.999 and a limit of detection (“ML data”)

	ϵ ^B	0.998	0.995	0.990	0.998	0.995	0.990	0.998	0.995	0.990
Small sample size	Design*	N = 10 and n = 7			N = 14 and n = 5			N = 10 and n = 5
	Design*	n_tot = 70			n_tot = 70			n_tot =50
	Wald test (uncorrected)	62.2	99.8	100	61.8	100	100	55.2	98.8	100
	Wald test (corrected)	44.2	98.4	100	50.4	100	100	35.8	95.8	100
	Wilcoxon test	6.6	11.2	26.8	4.4	15.6	39.0	6.6	11.2	26.8
	Design*	N = 20 and n = 7			N = 28 and n = 5			N = 20 and n = 5
	Design*	n_tot = 140			n_tot = 140			n_tot = 100
Middle sample size	Wald test (uncorrected)	83.4	100	100	86.8	100	100	77.8	100	100
	Wald test (corrected)	69.0	100	100	78.0	100	100	58.8	100	100
	Wilcoxon test	7.0	23.0	50.4	6.8	30.4	64.6	7.0	23.0	50.4
Large sample size	Design*	N = 30 and n = 7			N = 42 and n = 5			N = 30 and n = 5
	Design*	n_tot = 210			n_tot = 210			n_tot = 150
	Wald test (uncorrected)	94.0	100	100	86.8	100	100	89.4	100	100
	Wald test (corrected)	89.2	100	100	82.6	100	100	82.6	100	100
	Wilcoxon test	7.4	31.0	67.0	9.2	43.8	85.0	7.4	31.0	67.0

* N: number of patients per group of treatment; n: number of viral load measurements per patient; n_tot: total numbers of observations per group of treatment.

Lastly, we compared these results with the power achieved by comparing the mean viral decline at day 14 between both groups using a Wilcoxon test. The power of Wilcoxon test was on average lower of 44% as compared to the one achieved with viral kinetic model and NLMEM with ϵ ^B = 0.995 or 0.998 (Table 3). Even when PI-B’s and PI-A’s antiviral effectivenesses were assumed to be 0.990 and 0.999, respectively, corresponding to a 10-fold difference in the viral production under treatment, the power of the Wilcoxon test was only equal to 67% with N = 30, compare to 100% with the Wald test (Table 3). It should be noted that the type I error of the Wilcoxon test, which is a non parametric test, was not inflated (5.2% with the design N = 10 for example).

Discussion

The goal of this study was to evaluate the capacity of NLMEM to provide precise and accurate estimates of viral kinetic parameters when only sparse data with a large proportion of BLD data are available. In particular we aimed to evaluate the ability of this approach to correctly reject or not the null hypothesis of equal treatment effectiveness when two groups with different antiviral strategies are compared.

Our results showed that NLMEM provide very precise and accurate estimates for both the fixed effects and the inter-individual variance parameters, even when only 5 data points (at days 0, 2, 3, 7 and 14) were available within each patient. This allowed circumventing the need for intensive VL sampling measurements at treatment initiation, which are difficult to obtain in current clinical practice. Of note the viral clearance rate, c and its associated variability ω_c, were poorly estimated in this sparse initial sampling. However this parameter is mostly involved in the initial rate of viral decline and thus a poor estimation of c did not substantially deteriorate the estimation of the other parameters (Table 2).

By comparing the results obtained with and without a LOD for VL, we demonstrated that maximum likelihood appropriately handle BLD, consistent with results found previously [10]. The conclusion was somewhat different when considering the outcome of Wald test for comparing antiviral effectiveness. In this case the lack of information due to BLD contributed to an inflation of the type I error as compared to the results obtained with no LOD of VL, suggesting that the development of real-time PCR assays with lower LOD may improve the estimation of viral kinetic parameters. Interestingly, even when there was no LOD of VL, we still found that the type I error was inflated when the number of observations n_tot was lower than 140. This suggests that the outcome of Wald test should be taken with caution when the number of patients is low and in that case we suggested to use a threshold correction for the Wald test to limit the impact of this inflation. Here we used an empirical threshold correction but other corrections exist such as the Galland correction or the permutation test [15]. On the other hand the power of the Wald test (corrected or not) was found to be very high, especially when compared with that obtained using a Wilcoxon test on the mean viral decline at day 14. This result clearly shows the benefit of viral kinetic analyzed with NLMEM over empirical approaches done in most clinical studies. Although better results may be obtained by comparing the viral decline at earlier time points (such as day 2 or 7) the power of the Wilcoxon test remained lower than those achieved by modeling approach (not shown). Consistent with results found elsewhere, the power increases when the number of observations per patient increases and was much less sensitive to the number of measurements within each patient [26]. From a clinical standpoint this finding indicates that the enrollment of a large population of study is to be preferred to small population sample with frequent assessments of VL.

We focused here on the properties of the Wald test and further studies would be needed to study how these results apply to other tests that require more computation time, such as likelihood ratio tests (LRT) or score test. Interestingly previous simulation studies using the SAEM algorithm in MONOLIX showed that the outcomes of these tests were largely comparable [15]. Of note this result may not hold when other estimation methods are used and for instance the outcomes of Wald test and LRT were found to be different when using the FOCE-I algorithm in NONMEM version 7 [15]. Indeed the Wald test had a lower power than LRT with FOCE-I, which was probably due to the poor estimation of the standard error of the covariate effect [25]. The advantage of the Wald test is that results are immediately obtained and do not require to compute the likelihood or its derivatives, as done for the LRT and the score test. Computation time needed by simulations could be largely reduced by using information theory and approximations to derive Fisher information matrix. For instance the software PFIM uses a first order approximation of the likelihood and, under this approximation, an analytical form of the Fisher matrix can be obtained [27]. Thus the expected variance of viral kinetic parameters could be obtained without the intensive simulations done here. Although such approximations worked well even with limited number of patients [11], it does not take into account BLD data and hence could underestimate the standard error when a large proportion of data are BLD. It should be noted that optimal design theory predicts that an increase of variances in random effect may deteriorate the precision of parameter estimates and the power of the Wald test. However this possibility was not investigated in this study where the inter-individual variance parameters were fixed.

Here we focused on the comparison of treatment antiviral effectiveness in the first two weeks of treatment. On this short time scale the standard biphasic model of viral kinetics has been shown to provide a good fit to the data [7, 16]. However more complex models may be needed to fit long-term VL data, such as models that relax the assumption of constant target cells and/or account for the emergence of treatment resistant viruses [28, 29]. Moreover viral decline during PI therapy is faster than what is observed with IFN-based therapy [30]. This feature is captured in the standard biphasic model by assuming that PIs lead to an enhancement of the treatment effectiveness, ϵ, and of the clearance rate of infected cells, δ[7, 29, 31]. Consistent with this observation we set here large mean values for both ϵ and δ, equal to 0.999 and 0.58 day^-1 as compared to 0.92 and 0.14 day^-1 with IFN-based therapy, respectively [30]. However this dual mode of action of PIs may be integrated in a more physiological way by using new multiscale viral kinetic models that explicitly integrate the effect of PIs on the intra-cellular viral dynamics [9].

Although the use of NLMEM has been shown to provide very precise and accurate estimates of the parameters even in presence of sparse designs, it should be acknowledged that these estimates are done on the population parameters, i.e., the mean and the variance of parameters in the population. How NLMEM also allow precise and accurate estimation of the individual parameters for individualized treatment duration remains to be evaluated.

Conclusion

Compared with standard approach (with Wilcoxon test), modeling approach (with Wald test) provides very precise and accurate estimates of viral kinetic parameters and a more powerful tool to detect a difference in early viral kinetic profile of two PIs with different antiviral efficacy, even with sparse initial sampling or small number of patients. When designing a viral kinetic study, our results indicate that the enrollment of a larger number of patients is to be preferred to smaller sample size with more frequent assessments of viral load. We showed that a threshold correction is needed for the Wald test with small samples especially if there are many BLD data.

Acknowledgements

The authors thank IFR02 and Hervé Le Nagard for the use of the Centre de Biomodélisation.

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Competing interests

All authors declare that they have no competing interests.

Authors’ contributions

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Lavanchy D: Evolving epidemiology of hepatitis C virus. Clin Microbiol Infect. 2011, 17: 107-115. 10.1111/j.1469-0691.2010.03432.x.CrossRefPubMed

Shepard CW, Finelli L, Alter MJ: Global epidemiology of hepatitis C virus infection. Lancet Infect Dis. 2005, 5: 558-567. 10.1016/S1473-3099(05)70216-4.CrossRefPubMed

European Association for the Study of the Liver: EASL Clinical Practice Guidelines: management of hepatitis C virus infection. J Hepatol. 2011, 55 (2): 245-264.CrossRef

McHutchison JG, Everson GT, Gordon SC, Jacobson IM, Sulkowski M, Kauffman R, McNair L, Alam J, Muir AJ: Telaprevir with peginterferon and ribavirin for chronic HCV genotype 1 infection. N Engl J Med. 2009, 360: 1827-1838. 10.1056/NEJMoa0806104.CrossRefPubMed

Poordad F, McCone J, Bacon BR, Bruno S, Manns MP, Sulkowski MS, Jacobson IM, Reddy KR, Goodman ZD, Boparai N, DiNubile MJ, Sniukiene V, Brass CA, Albrecht JK, Bronowicki J-P: Boceprevir for untreated chronic HCV genotype 1 infection. N Engl J Med. 2011, 364: 1195-1206. 10.1056/NEJMoa1010494.CrossRefPubMedPubMedCentral

Guidance for industry: Chronic hepatitis C virus infection: developing direct-acting antiviral agents for treatment. Rockville, MD: Food and Drug Administration, September 2010 (draft). http://www.fda.gov/downloads/Drugs/GuidanceComplianceRegulatoryInformation/Guidances/UCM225333.pdf

Guedj J, Perelson AS: Second-phase hepatitis C virus RNA decline during telaprevir-based therapy increases with drug effectiveness: implications for treatment duration. Hepatology. 2011, 53: 1801-1808. 10.1002/hep.24272.CrossRefPubMedPubMedCentral

Lenz O, de Bruijne J, Vijgen L, Verbinnen T, Weegink C, Van Marck H, Vandenbroucke I, Peeters M, Simmen K, Fanning G, Verloes R, Picchio G, Reesink H: Efficacy of re-treatment with TMC435 as combination therapy in hepatitis C virus-infected patients following TMC435 monotherapy. Gastroenterology. 2012, 143: 1176-1178. 10.1053/j.gastro.2012.07.117. e1–6CrossRefPubMed

Guedj J, Dahari H, Rong L, Sansone ND, Nettles RE, Cotler SJ, Layden TJ, Uprichard SL, Perelson AS: Modeling shows that the NS5A inhibitor daclatasvir has two modes of action and yields a shorter estimate of the hepatitis C virus half-life. Proc Natl Acad Sci U S A. 2013, 110 (10): 3991-3996. 10.1073/pnas.1203110110.CrossRefPubMedPubMedCentral

10.

Thiébaut R, Guedj J, Jacqmin-Gadda H, Chêne G, Trimoulet P, Neau D, Commenges D: Estimation of dynamical model parameters taking into account undetectable marker values. BMC Med Res Methodol. 2006, 6: 38-10.1186/1471-2288-6-38.CrossRefPubMedPubMedCentral

11.

Guedj J, Bazzoli C, Neumann AU, Mentré F: Design evaluation and optimization for models of hepatitis C viral dynamics. Stat Med. 2011, 30: 1045-1056. 10.1002/sim.4191.CrossRefPubMed

12.

Wu H, Ding AA, De Gruttola V: Estimation of HIV dynamic parameters. Stat Med. 1998, 17: 2463-2485. 10.1002/(SICI)1097-0258(19981115)17:21<2463::AID-SIM939>3.0.CO;2-A.CrossRefPubMed

13.

Hing JP, Woolfrey SG, Greenslade D, Wright PM: Analysis of toxicokinetic data using NONMEM: impact of quantification limit and replacement strategies for censored data. J Pharmacokinet Pharmacodyn. 2001, 28: 465-479. 10.1023/A:1012247131190.CrossRefPubMed

14.

Duval V, Karlsson MO: Impact of omission or replacement of data below the limit of quantification on parameter estimates in a two-compartment model. Pharm Res. 2002, 19: 1835-1840. 10.1023/A:1021441407898.CrossRefPubMed

15.

Bertrand J, Comets E, Chenel M, Mentré F: Some alternatives to asymptotic tests for the analysis of pharmacogenetic data using nonlinear mixed effects models. Biometrics. 2012, 68: 146-155. 10.1111/j.1541-0420.2011.01665.x.CrossRefPubMed

16.

Neumann AU, Lam NP, Dahari H, Gretch DR, Wiley TE, Layden TJ, Perelson AS: Hepatitis C viral dynamics in vivo and the antiviral efficacy of interferon-alpha therapy. Science. 1998, 282: 103-107.CrossRefPubMed

17.

Guedj J, Dahari H, Shudo E, Smith P, Perelson AS: Hepatitis C viral kinetics with the nucleoside polymerase inhibitor mericitabine (RG7128). Hepatology. 2012, 55: 1030-1037. 10.1002/hep.24788.CrossRefPubMedPubMedCentral

18.

Guedj J, Dahari H, Pohl RT, Ferenci P, Perelson AS: Understanding silibinin’s modes of action against HCV using viral kinetic modeling. J Hepatol. 2012, 56: 1019-1024. 10.1016/j.jhep.2011.12.012.CrossRefPubMedPubMedCentral

19.

Shudo E, Ribeiro RM, Talal AH, Perelson AS: A hepatitis C viral kinetic model that allows for time-varying drug effectiveness. Antivir Ther. 2008, 13: 919-926.PubMed

20.

Matsuura K, Tanaka Y, Hasegawa I, Ohno T, Tokuda H, Kurbanov F, Sugauchi F, Nojiri S, Joh T, Mizokami M: Abbott RealTime hepatitis C virus (HCV) and Roche Cobas AmpliPrep/Cobas TaqMan HCV assays for prediction of sustained virological response to pegylated interferon and ribavirin in chronic hepatitis C patients. J Clin Microbiol. 2009, 47: 385-389. 10.1128/JCM.01753-08.CrossRefPubMed

21.

Kuhn E, Lavielle M: Maximum likelihood estimation in nonlinear mixed effects models. Comput Stat Data Anal. 2005, 49: 1020-1038. 10.1016/j.csda.2004.07.002.CrossRef

22.

Beal SL: Ways to fit a PK model with some data below the quantification limit. J Pharmacokinet Pharmacodyn. 2001, 28: 481-504. 10.1023/A:1012299115260.CrossRefPubMed

23.

Samson A, Lavielle M, Mentré F: Extension of the SAEM algorithm to left censored data in nonlinear mixed-effects model: application to HIV dynamics model. Comput Statist Data Anal. 2006, 51: 1562-1574. 10.1016/j.csda.2006.05.007.CrossRef

24.

Dubois A, Lavielle M, Gsteiger S, Pigeolet E, Mentré F: Model-based analyses of bioequivalence crossover trials using the stochastic approximation expectation maximisation algorithm. Stat Med. 2011, 30: 2582-2600.PubMed

25.

Bertrand J, Comets E, Laffont CM, Chenel M, Mentré F: Pharmacogenetics and population pharmacokinetics: impact of the design on three tests using the SAEM algorithm. J Pharmacokinet Pharmacodyn. 2009, 36: 317-339. 10.1007/s10928-009-9124-x.CrossRefPubMedPubMedCentral

26.

Retout S, Comets E, Samson A, Mentré F: Design in nonlinear mixed effects models: optimization using the Fedorov-Wynn algorithm and power of the Wald test for binary covariates. Stat Med. 2007, 26: 5162-5179. 10.1002/sim.2910.CrossRefPubMed

27.

Retout S, Mentré F: Optimization of individual and population designs using Splus. J Pharmacokinet Pharmacodyn. 2003, 30: 417-443.CrossRefPubMed

28.

Snoeck E, Chanu P, Lavielle M, Jacqmin P, Jonsson EN, Jorga K, Goggin T, Grippo J, Jumbe NL, Frey N: A comprehensive hepatitis C viral kinetic model explaining cure. Clin Pharmacol Ther. 2010, 87: 706-713. 10.1038/clpt.2010.35.CrossRefPubMed

29.

Rong L, Dahari H, Ribeiro RM, Perelson AS: Rapid emergence of protease inhibitor resistance in hepatitis C virus. Sci Transl Med. 2010, 2: 30ra32-10.1126/scitranslmed.3000544.CrossRefPubMedPubMedCentral

30.

Chatterjee A, Guedj J, Perelson AS: Mathematical modelling of HCV infection: what can it teach us in the era of direct-acting antiviral agents?. Antivir Ther. 2012, 17: 1171-1182. 10.3851/IMP2428.CrossRefPubMedPubMedCentral

31.

Adiwijaya BS, Hare B, Caron PR, Randle JC, Neumann AU, Reesink HW, Zeuzem S, Herrmann E: Rapid decrease of wild-type hepatitis C virus on telaprevir treatment. Antivir Ther. 2009, 14: 591-595.PubMed

The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1471-2288/13/60/prepub

Titel: Clinical trial simulation to evaluate power to compare the antiviral effectiveness of two hepatitis C protease inhibitors using nonlinear mixed effect models: a viral kinetic approach
verfasst von: Cédric Laouénan
Jeremie Guedj
France Mentré
Publikationsdatum: 01.12.2013
Verlag: BioMed Central
Erschienen in: BMC Medical Research Methodology / Ausgabe 1/2013
Elektronische ISSN: 1471-2288
DOI: https://doi.org/10.1186/1471-2288-13-60

Live-Webinar: Aktuelle Leitlinien bei Herz-Kreislauf-Erkrankungen

Springer Medizin

Clinical trial simulation to evaluate power to compare the antiviral effectiveness of two hepatitis C protease inhibitors using nonlinear mixed effect models: a viral kinetic approach

Abstract

Background

Methods

Results

Conclusion

Electronic supplementary material

Competing interests

Authors’ contributions

Background

Methods

Viral kinetic model

Statistical model

Parameter values

Clinical trial simulation

Parameter estimation

Detection of a difference in antiviral effectiveness

Results

Parameter estimation

Type I error of the Wald test

Power to detect a difference in antiviral effectiveness

Discussion

Conclusion

Acknowledgements

Competing interests

Authors’ contributions

Authors’ original submitted files for images

Live-Webinar: Aktuelle Leitlinien bei Herz-Kreislauf-Erkrankungen

Springer Medizin

Abstract

Background

Methods

Results

Conclusion

Electronic supplementary material

Competing interests

Authors’ contributions

Background

Methods

Viral kinetic model

Statistical model

Parameter values

Clinical trial simulation

Parameter estimation

Detection of a difference in antiviral effectiveness

Results

Parameter estimation

Type I error of the Wald test

Power to detect a difference in antiviral effectiveness

Discussion

Conclusion

Acknowledgements

Competing interests

Authors’ contributions

Authors’ original submitted files for images

Weitere Artikel der Ausgabe 1/2013

Empirical study of correlated survival times for recurrent events with proportional hazards margins and the effect of correlation and censoring

Testing the proportional hazards assumption in case-cohort analysis

Empirical comparison of methods for analyzing multiple time-to-event outcomes in a non-inferiority trial: a breast cancer study

External validation of a Cox prognostic model: principles and methods

Derivation and assessment of risk prediction models using case-cohort data

Shape information from glucose curves: Functional data analysis compared with traditional summary measures