This vignette defines the models and historical borrowing metrics
supported in the historicalborrowlong
package.
Models
Definitions
Unless otherwise specified, Greek letters refer to parameters
estimated by the model, and Roman letters refer to fixed hyperparameters
and other constants set by the user in advance.
- “Study”: a clinical trial. Could be a borrowed clinical trial from
the past or a current clinical trial under analysis.
- “Group”: a study arm, such as a treatment group or placebo
group.
- “Rep”: a repeated measure in the context of repeated measures /
longitudinal modeling. Could be a time point such as a study visit.
-
:
study index.
-
:
index of the current study.
-
:
vector of patient-by-rep clinical responses to a continuous outcome
variable for study
.
-
:
the row of matrix
corresponding to study
.
-
:
Vector of control group mean parameters, one for each rep for the pooled
model, one for each study and rep in the hierarchical and independent
models. The first elements are for the historical studies, and the last
one is for the current study.
-
:
Vector of study-specific treatment mean parameters. There is one for
each combination of study, non-control treatment group, and rep. An
optional constraint may be added to pool all arms at baseline within
each study, which reduces the number of elements of
.
-
:
integer index for the elements of
.
-
:
integer index for the elements of
.
-
:
index of a rep (e.g. time point of a repeated measure such as a patient
visit in a clinical trial.)
-
:
Vector of study-specific baseline covariate parameters.
-
:
model matrix for the control group mean parameters
.
It has indicator columns to select the appropriate element of
for each element of
.
-
:
model matrix for the treatment mean parameters
.
It has indicator columns to select the appropriate element of
for each element of
.
-
:
model matrix for the baseline covariate fixed effect parameters
.
It has indicator columns to select the appropriate element of
for each element of
.
-
:
Vector of rep-specific residual standard deviations for study
.
-
:
lower-triangular Cholesky factor of the by-rep residual correlation
matrix for study
.
-
:
number of patients in study
.
-
:
by-rep residual covariance matrix of study
.
-
:
number of repeated measures per subject.
-
:
identity matrix with rows and columns equal to the number of repeated
measures per subject.
-
:
indicator function
-
:
an AR(1) correlation matrix with
rows and correlation parameter
.
-
index to indicate the type of residual covariance of study
:
1 for unstructured / fully parameterized, 2 for AR(1), and 3 for
diagonal.
Baseline covariates
The baseline covariates model matrix
adjusts for baseline covariates. It may contain a continuous column for
baseline and binary indicator columns for the levels of user-defined
covariates. All these columns are included if possible, but the method
automatically drops baseline covariate columns to ensure that the
combine model matrix
is full rank, where
denotes the rows of matrix
corresponding to study
,
with additional rows dropped if the corresponding elements of
are missing. The choice of columns to drop from
is determined by the rank and pivoting strategy of the QR decomposition
of
using the Householder algorithm with pivoting (base::qr()
,
LINPACK routine DQRDC).
Separately within each study, each column of
is centered to have mean 0, and if possible, scaled to have variance 1.
Scaling ensures that the priors on parameters
remain relatively diffuse relative to the input data. Study-level
centering ensures that the
parameters truly act as unconditional study-specific control
group means (as opposed to conditional on the subset of patients at the
reference level of
),
and it ensures that borrowing across
components fully presents as control group borrowing.
Model matrices
Each primary model is parameterized thus:
Above,
,
,
and
are fixed matrices for study
.
is a conventional model matrix for the baseline covariates
,
and the details are explained in the “Baseline covariates” section
below.
is a matrix of zeroes and ones. It is constructed such that each scalar
component of
is the mean response of the control group in a particular study at a
given time point. Likewise,
is a matrix of zeroes and ones such that each scalar component of
is the mean response of a non-control treatment group in a particular
study at a given time point.
To illustrate, let
be patient
in treatment group
(where
is the control group) of study
at time point
,
and let
be the corresponding scalar element of the vector
.
Then,
In addition, if the constraint in the parameterization is activated
(i.e. hbl_mcmc_hierarchical(constraint = TRUE)
) then the
control and treatment patients are pooled at time point
within each study
:
This parameterization is represented in the more compact expression
in the model definitions in this vignette.
Post-processing
The hbl_summary()
function post-processes the results
from the model. It accepts MCMC samples of parameters and returns
estimated marginal means of the response and treatment effect. To
estimate marginal means of the response, hbl_summary()
takes group-level averages of posterior samples of fitted values while
dropping covariate adjustment terms from the model
(i.e. ).
Because the columns of
are centered at their means, this choice is mathematically equivalent to
emmeans::emmeans()
with the
weights = "proportional"
(Lenth
(2016)).
Hierarchical model
Functions:
The hierarchical model analyzes the data from all studies and shrinks
the control study-by-rep means
(one scalar parameter for each unique combination of study and rep)
towards a common normal distribution with mean
and variance
.
For each study in the data (both current and historical), the covariance
is user-defined. Options include:
- Fully parameterized (“unstructured”) with a separation strategy with
the LKJ prior to model within-subject correlations among residuals.
- AR(1) variances
and correlation
.
- Diagonal with variances
.
The prior
on
is critically important because:
- It controls the prior amount of borrowing, and
- The prior has a large influence if there are few historical studies
in the data.
can either be a flexible half-Student-t distribution with
degrees of freedom and scale parameter
:
or a uniform distribution with lower
bound 0 and upper bound
:
Following the recommendation of Gelman
(2006), please use half-Student-t if the number of historical
studies is small and consider uniform for large numbers of historical
studies.
For the half-Student-t distribution, the role of the
parameter is equivalent to the
parameter from the Student-t
parameterization in the Stan user manual.
Independent model
Functions:
The independent model is the same as the hierarchical model, but with
independent control group parameters
.
We use it as a no-borrowing benchmark to quantify the borrowing strength
of the hierarchical model.
Pooled model
Functions:
The pooled model is the same as the independent model, but with
rep-specific control means pooled across studies. In other words
loses the
subscript, and we use a smaller matrix
instead of
.
has fewer columns (rep-specific rather than study-by-rep-specific). Like
the independent model, we use it as a no-borrowing benchmark to quantify
the borrowing strength of the hierarchical model.
Borrowing metrics
The package supports the following metrics to quantify borrowing.
Various functions in historicalborrowlong
compute each of
the following metrics independently for each discrete time point
(“rep”).
Effective sample size (ESS)
See the hbl_ess()
function for an implementation.
Neuenschwander et al. (2006) posit a
prior effective sample size metric for meta-analytic predictive (MAP)
priors. In the original paper, the underlying hierarchical model only
uses historical controls, and the hypothetical new study is the current
study of interest. In historicalborrow
, we adapt this
metric to a hierarchical model which also includes both control and
treatment data from the current study. We still define
below to be the number of (non-missing) historical control patients so
we can still interpret ESS on the same scale as in the paper.
For the pooled model, define
to be the posterior predictive variance of the control mean
of a hypothetical new unobserved study. According to Neuenschwander et al. (2006), it can be derived
as an average of study-specific variances. In practice, we estimate
using the average of MCMC samples of
.
For the hierarchical model, we define the analogous posterior
predictive variance
using the prior distribution
The above integral implies a straightforward method of estimating
using MCMC samples:
- For each MCMC sample
from the hierarchical model, identify samples
and
of
and
,
respectively.
- Draw
from a
Normal(,
)
distribution.
- Estimate
as the variance of the collection
from (2).
Next, define
as the number of non-missing control patients from the historical
studies only. Given
,
,
and
,
define the effective sample size as:
is a weight which quantifies the fraction of historical information that
the hierarchical model leverages for borrowing. Notably, the weight
should be 1 if the hierarchical and pooled model exhibit the same
strength of borrowing. Multiplied by
,
the quantity becomes a heuristic for the strength of borrowing of the
hierarchical model, measured in terms of the number of historical
patients.
Precision ratio (hierarchical model only)
The precision ratio is an experimental ad hoc metric and should be
used with caution. It is implemented in the hbl_summary()
function for the hierarchical model.
The precision ratio compares the prior precision of a control mean
response (an
component, numerator) to the analogous precision of the full conditional
distribution (denominator). The former is
,
and the latter is
.
Here,
is the number of non-missing patients in the current study,
is the residual variance, and
is the variance of study-specific control means (components of
).
The full precision ratio is:
The precision ratio comes from the conditional distribution of
in the hierarchical model given the other parameters and the data. More
precisely, in this conditional distribution, the mean is a weighted
average between the prior mean and data mean, and the precision ratio is
the weight on the prior mean. This can be seen in a simpler case with a
Bayesian model with a normal data model, a normal prior on the mean, and
known constant variance. For details, see Chapter 2 of Gelman et al. (2020).
Variance shift ratio
The variance shift ratio is an experimental ad hoc metric and should
be used with caution. It is implemented in the legacy
hbl_metrics()
function.
Let
be the estimated posterior variance of
(current study control group response mean) estimated by model
.
The variance shift ratio is:
where
is a historical borrowing model like the mixture model or hierarchical
model.
Mean shift ratio (legacy)
The mean shift ratio is not recommended to measure the strength of
borrowing. Rather, it is an informal ad hoc measure of the lack of
commensurability between the current and historical data sources. It is
implemented in the legacy hbl_metrics()
function.
To define the mean shift ratio, let
be the posterior mean control group response estimated by model
.
The mean shift ratio is:
where
is a historical borrowing model like the mixture model or hierarchical
model.