Power Calculation for Interaction Term Binary and Continuous Variable Cox Regression

Appendix 1

A1.1. Exponential distribution of event times (covariate effect is a linear effect of time)

If event times follow an exponential distribution, the baseline hazard function is equal to $λ_{0} (t) = λ$ . Then the cumulative hazard function is equal to $\begin{aligned} H (t | x, z) & = \int_{0}^{t} λ_{0} (s) e^{γ x + β (s) z} d s = \int_{0}^{t} λ e^{γ x} e^{β (s) z} d s = λ e^{γ x} \int_{0}^{t} e^{β (s) z} d s \\ = λ e^{γ x} \int_{0}^{t} e^{(β_{0} + β_{1} s) z} d s = λ e^{γ x} \int_{0}^{t} e^{β_{0} z} e^{β_{1} s z} d s = λ e^{γ x} e^{β_{0} z} \int_{0}^{t} e^{β_{1} s z} d s \\ = \frac{λ e^{γ x} e^{β_{0} z}}{β_{1} z} [e^{β_{1} s z}]_{0}^{t} = \frac{λ e^{γ x} e^{β_{0} z}}{β_{1} z} [e^{β_{1} t z} - 1] (provided β_{1} z \neq 0) . \end{aligned}$ The distribution function of the event times is equal to $F (t | x, z) = 1 - exp (- H (t | x, z) = 1 - exp (- \frac{λ e^{γ x} e^{β_{0} z}}{β_{1} z} [e^{β_{1} t z} - 1]) .$ Therefore, $u = exp (- \frac{λ e^{γ x} e^{β_{0} z}}{β_{1} z} [e^{β_{1} t z} - 1]) \sim U [0, 1] .$ Solving for t gives: $log (u) = - \frac{λ e^{γ x} e^{β_{0} z}}{β_{1} z} (e^{β_{1} t z} - 1),$ thus $\frac{- β_{1} z log (u)}{λ e^{γ x} e^{β_{0} z}} = e^{β_{1} t z} - 1,$ thus $1 - \frac{β_{1} z log (u)}{λ e^{γ x} e^{β_{0} z}} = e^{β_{1} t z},$ thus $log (1 - \frac{β_{1} z log (u)}{λ e^{γ x} e^{β_{0} z}}) = β_{1} z t,$ resulting in $t = \frac{1}{β_{1} z} log (1 - \frac{β_{1} z log (u)}{λ e^{γ x} e^{β_{0} z}}), provided that β_{1} z \neq 0.$

If β ₁ z = 0, then the effect of the covariate is time-invariant and methods based on those described by Bender et al. can be used to generate an event time.

A1.2. Weibull distribution of event times (covariate effect is a linear effect of time)

The baseline hazard function is equal to $λ_{0} (t) = λ υ t^{υ - 1}$ . Then the cumulative hazard function is $\begin{aligned} H (t | x, z) & = \int_{0}^{t} λ_{0} (s) e^{γ x + β (s) z} d s = \int_{0}^{t} λ υ s^{υ - 1} e^{γ x} e^{β (s) z} d s = λ υ e^{γ x} \int_{0}^{t} s^{υ - 1} e^{(β_{0} + β_{1} s) z} d s \\ = λ υ e^{γ x} e^{β_{0} z} \int_{0}^{t} s^{υ - 1} e^{β_{1} s z} d s = λ υ e^{γ x} e^{β_{0} z} [t^{υ} (- β_{1} t z)^{- υ} (Γ (υ) - Γ (υ, - β_{1} t z))] . \end{aligned}$ The cumulative hazard function involves a difference of a Gamma function and an Incomplete Gamma function. Then, the cumulative distribution of event times is $F (t | x, z) = 1 - exp [- H (t | x, z),$ and $u = exp [- λ υ e^{γ x} e^{β_{0} z} (t^{υ} (- β_{1} t z)^{- υ} (Γ (υ) - Γ (υ, - β_{1} t z)))] \sim U (0, 1) .$

An event time can be generated by solving the above expression for t and evaluating the expression at $u \sim U (0, 1)$ . However, a closed-form expression for the inverse of the above expression cannot be obtained. As an alternative approach, one could use numerical methods to find an approximate inverse to the above expression for a given value of u.

A1.3. Gompertz distribution of event times (covariate effect is a linear effect of time)

The baseline hazard function is equal to $λ_{0} (t) = λ e^{α t}$ . Then the cumulative hazard function is $\begin{aligned} H (t | x, z) & = \int_{0}^{t} λ_{0} (s) e^{γ x + β (s) z} d s = \int_{0}^{t} λ e^{α s} e^{γ x} e^{(β_{0} + β_{1} s) z} d s = λ e^{γ x} \int_{0}^{t} e^{α s} e^{β_{0} z} e^{β_{1} z s} d s \\ = λ e^{γ x} e^{β_{0} z} \int_{0}^{t} e^{α s} e^{β_{1} z s} d s = λ e^{γ x} e^{β_{0} z} \int_{0}^{t} e^{(α + β_{1} z) s} d s \\ = \frac{λ e^{γ x} e^{β_{0} z}}{α + β_{1} z} [e^{(α + β_{1} z) s}]_{0}^{t} = \frac{λ e^{γ x} e^{β_{0} z}}{α + β_{1} z} [e^{(α + β_{1} z) t} - 1] provided that α + β_{1} z \neq 0. \end{aligned}$ Then the cumulative distribution of event times is $F (t | x, z) = 1 - exp (- H (t | x, z))$ and $u = exp (- H (t | x, z)) \sim U (0, 1) .$ To invert this function, we have that $u = exp (- \frac{λ e^{γ x} e^{β_{0} z}}{α + β_{1} z} [e^{(α + β_{1} z) t} - 1]),$ thus $log (u) = - \frac{λ e^{γ x} e^{β_{0} z}}{α + β_{1} z} [e^{(α + β_{1} z) t} - 1],$ $\frac{- (α + β_{1} z)}{λ e^{γ x} e^{β_{0} z}} log (u) = e^{(α + β_{1} z) t} - 1,$ $1 - \frac{(α + β_{1} z)}{λ e^{γ x} e^{β_{0} z}} log (u) = e^{(α + β_{1} z) t},$ so that $log (1 - \frac{(α + β_{1} z)}{λ e^{γ x} e^{β_{0} z}} log (u)) = (α + β_{1} z) t,$ so that $t = \frac{1}{α + β_{1} z} log (1 - \frac{(α + β_{1} z)}{λ e^{γ x} e^{β_{0} z}} log (u)), provided that α + β_{1} z \neq 0.$ If $α + β_{1} z = 0$ then the calculation of the cumulative hazard function must be modified as follows: $H (t | x, z) = λ e^{γ x} e^{β_{0} z} \int_{0}^{t} e^{0} d s = λ e^{γ x} e^{β_{0} z} \int_{0}^{t} d s = λ e^{γ x} e^{β_{0} z} t .$ Then, we have that $u = exp (- H (t | x, z)) = exp (- λ e^{γ x} e^{β_{0} z} t)$ . To invert this expression, we have that $log (u) = - λ e^{γ x} e^{β_{0} z} t$ and hence that $t = \frac{- log (u)}{λ e^{γ x} e^{β_{0} z}} .$

Appendix 2

A2.1. R and SAS code for simulating event time data with an exponential distribution when the log-hazard ratio of the binary variable changes linear with time

# The following software code is provided for illustrative purposes only and comes with absolutely no warranty.

# Simulate data from a Cox-exponential model when the effect of

# treatment

# varies as a linear function of time.

N <- 10000

# Number of subjects in each simulated dataset.

# Parameter of the exponential distribution.

lambda <- 0.01

g1 <- log(1.5)

# Log-hazard ratio for the continuous covariate.

b0 <- log(1.1)

# Log-hazard ratio for the binary covariate at t = 0.

hr <- 1.005

b1 <- log(hr)

# Effect of time on the log-hazard ratio for the binary covariate.

prev <- 0.25

# Prevalence of the binary covariate.

set.seed(1)

# Set random number seed for reproducibility.

x <- rnorm(N)

z <- rbinom(N,size=1,prob=prev)

u <- runif(N,0,1)

event.time <- ifelse(b1*z==0,

-log(u)/(lambda*exp(g1*x)),

(1/b1*z) * log(1-b1*z*log(u)/(lambda*exp(g1*x + b0*z))))

simdata <- cbind(x,z,event.time)

write(t(simdata),ncol=3,file="exp.dat",append=T)

%macro dgp_exp(N=,lambda=,hr_cont=,b0=,b1=,prev=0,ranseed=);

*** N is the size of the simulated dataset;

*** lambda is the parameter for the exponential distribution of

event times;

*** hr_cont is the hazard ratio for the continuous covariate;

*** b0 is the hazard ratio for the binary variable at time t = 0;

*** b1 is the relative change in the effect of binary covariate

as a function;

** of time;

*** prev is the prevalence of the binary covariate;

*** ranseed is a seed for random number generation to ensure ;

*** reproducibility of the results.;

data randata;

call streaminit(&ranseed);

do i = 1 to &N;

/* Random uniform variable for generating event times */

u = rand("Uniform");

/* The binary covariate with a time-varying covariate effect */

z = rand("binomial",&prev,1);

/* The continuous covariate for which we are adjusting */

x = rand("normal",0,1);

/* Parameter for the exponential distribution */

lambda = λ

gamma = log(&hr_cont);

/* A one-standard deviation increase in x increases the hazard

of the outcome by this amount */

b0 = log(&b0);

b1 = log(&b1);

if (b1*z=0) then

event_time = -log(u)/(lambda*exp(gamma*x));

else

event_time = (1/b1*z) * log(1 - b1*z*log(u)/(lambda*

exp(gamma*x + b0*z)));

event = 1;

output;

end;

run;

%mend dgp_exp;

%dgp_exp(N=10000,lambda=0.01,hr_cont=1.5,b0=1.1,b1=1.005,

ranseed=1,prev=0.25);

A2.2. R code for simulating event time data with a Weibull distribution when the log-hazard ratio of the binary variable changes linear with time

# The following software code is provided for illustrative purposes only and comes with absolutely no warranty.

# Simulate data from a Cox-Weibull model when the effect of

# treatment varies as a linear function of time.

# There is not a closed-form expression for the inverse of the

# survival function (S).

# Numerical methods are used to invert the survival function.

# We use Brent's method for root finding to invert the function.

library(pracma)

# pracma package has a function for Brent's method for finding

# roots and a function for the incomplete Gamma function.

N <- 10000

# Number of subjects in each simulated dataset.

# Parameters of the Weibull distribution.

lambda <- 0.001

nu <- 1

g1 <- log(1.5)

# Log-hazard ratio for the continuous covariate.

b0 <- log(1.1)

# Log-hazard ratio for the binary covariate at t=0.

hr <- 1.001

b1 <- log(hr)

# Effect of time on the log-hazard ratio for the binary covariate.

set.seed(1)

# Set random number seed for reproducibility.

simdata.matrix <- NULL

# Define the survival function

S2 <- function(t){

exp(-lambda*nu*exp(g1*x + b0*z) * ((t^nu) * (-b1*t*z)^(-nu) *

(gamma(nu) - gammainc(-b1*t*z,nu)[2]) ) ) - u

}

for (i in 1:N){

u <- runif(1,0,1)

x <- rnorm(1)

z <- rbinom(1,size=1,prob=0.25)

if (z==0){

event.time <- (-log(u)/(lambda*exp(g1*x)))^(1/nu)

} else {

brent.sol <- brent(S2,0.001,250000)

event.time <- brent.sol$root

}

simdata <- c(x,z,event.time)

simdata.matrix <- rbind(simdata.matrix,simdata)

}

write(t(simdata.matrix),ncol=3,file="weibull.dat",append=T)

A2.3. R and SAS code for simulating event time data with a Gompertz distribution when the log-hazard ratio of the binary variable changes linear with time

# The following software code is provided for illustrative purposes only and comes with absolutely no warranty.

# Simulate data from a Cox-Gompertz model when the effect of

# treatment varies as a linear function of time.

N <- 10000

# Number of subjects in each simulated dataset.

# Parameters of the Gompertz distribution.

lambda <- 0.0001

alpha <- 0.025

g1 <- log(1.5)

# Log-hazard ratio for the continuous covariate.

b0 <- log(1.1)

# Log-hazard ratio for the binary covariate at t=0.

hr <- 1.005

b1 <- log(hr)

# Effect of time on the log-hazard ratio for the binary covariate.

prev <- 0.25

# Prevalence of the binary covariate.

set.seed(1)

# Set random number seed for reproducibility.

x <- rnorm(N)

z <- rbinom(N,size=1,prob=prev)

u <- runif(N,0,1)

event.time <- ifelse(alpha+b1*z==0,

-log(u)/(lambda*exp(g1*x + b0*z)),

(1/(alpha + b1*z)) * log(1 - (alpha + b1*z)*log(u)

/(lambda*exp(g1*x + b0*z))))

simdata <- cbind(x,z,event.time)

write(t(simdata),ncol=3,file="gompertz.dat",append=T)

%macro dgp_gompertz(N=,lambda=,alpha=,hr_cont=,b0=,b1=,

prev=0,ranseed=);

*** N is the size of the simulated dataset;

*** lambda and alpha are the parameters of the Gompertz

distribution;

*** hr_cont is the hazard ratio for the continuous covariate;

*** b0 is the hazard ratio for the binary variable at time t = 0;

*** b1 is the relative change in the effect of binary covariate

as a function;

** of time;

*** prev is the prevalence of the binary covariate;

*** ranseed is a seed for random number generation to ensure;

*** reproducibility of the results.;

data randata;

call streaminit(&ranseed);

do i = 1 to &N;

/* Random uniform variable for generating event times */

u = rand("Uniform");

/* The binary covariate with a time-varying covariate effect */

z = rand("binomial",&prev,1);

/* The continuous covariate for which we are adjusting */

x = rand("normal",0,1);

/* Parameters for the Gompertz distribution */

lambda = λ

alpha = α

gamma = log(&hr_cont);

b0 = log(&b0);

b1 = log(&b1);

if alpha + b1*z = 0 then

event_time = -log(u)/(lambda * exp(gamma*x + b0*z));

else

event_time = (1/(alpha + b1*z)) * log(1 - (alpha + b1*z)

*log(u)/(lambda*exp(gamma*x + b0*z)));

event = 1;

output;

end;

run;

%mend dgp_gompertz;

%dgp_gompertz(N=10000,lambda=0.0001,alpha=0.025,hr_cont=1.5,

b0=1.1,b1=1.005,prev=0.25,ranseed=1);

savageprommented.blogspot.com

Source: https://www.tandfonline.com/doi/full/10.1080/00949655.2017.1397151

Power Calculation for Interaction Term Binary and Continuous Variable Cox Regression

Appendix 1

A1.1. Exponential distribution of event times (covariate effect is a linear effect of time)

A1.2. Weibull distribution of event times (covariate effect is a linear effect of time)

A1.3. Gompertz distribution of event times (covariate effect is a linear effect of time)

Appendix 2

A2.1. R and SAS code for simulating event time data with an exponential distribution when the log-hazard ratio of the binary variable changes linear with time

A2.2. R code for simulating event time data with a Weibull distribution when the log-hazard ratio of the binary variable changes linear with time

A2.3. R and SAS code for simulating event time data with a Gompertz distribution when the log-hazard ratio of the binary variable changes linear with time

0 Response to "Power Calculation for Interaction Term Binary and Continuous Variable Cox Regression"

Post a Comment

Iklan Atas Artikel

Iklan Tengah Artikel 1

Iklan Tengah Artikel 2

Iklan Bawah Artikel