This guide provides an introduction to applying the
multicate
package in health data analyses. The explanations
below are based on the original paper that is based on for development
of this package: Comparison of methods that combine multiple
randomized trials to estimate heterogeneous treatment effects.
Statistics in medicine (Brantner et al., 2024).1 The detailed github
code and readme are in this link(https://github.com/dobengjhu/multicate).
Randomized controlled trials (RCTs) are considered the gold standard for unbiased estimation of treatment effects. However, they often have limited power to detect heterogeneous treatment effects (HTEs) due to sample size constraints and may not reflect broader populations. To address this limitation, researchers have developed various methods to combine information from multiple studies to improve treatment effect estimation, including meta-analysis or hierarchical models. However, these approaches often do not explicitly target conditional average treatment effects and typically rely on aggregate-level data, making it challenging to estimate treatment effects conditional on individual-level characteristics.
This package multicate
is to fill this gap when
combining RCTs using Individual Patient Data (IPD). The
estimand in this package is the conditional average treatment
effect (CATE), defined under Rubin’s potential outcomes framework. 2 When \(A\) is a binary treatment indicator, \(S\) is a categorical variable for the trial
in which the individual participated (from 1 to \(K\), with a total of \(K\) RCTs), \(X\) the covariates, and \(Y\) a continuous outcome. Let \(Y(0)\) and \(Y(1)\) be the potential outcomes under
control and treatment, respectively. The probability of receiving
treatment given covariates and trial membership (propensity score) can
be denoted as \(\pi_{S}(X) = P(A=1|X,
S=s)\). With the continuous outcome, the CATE is \(\tau=E(Y(1)|X)-E(Y(0)|X)\). The target
estimand of our package is this ‘universal’ CATE, based on potential
outcomes not dependent on study membership. If the parater is set to
study-specific
in the package, then the estimand becomes
the study-specific CATE as, \(\tau_{S}(X) =
E(Y(1)|X, S=s)-E(Y(0)|X, S=s)\).
The multicate
package enables researchers to integrate
multiple RCTs using various aggregation and estimation methods, while
effectively handling heterogeneity in the data. This approach not only
increases statistical power but also extends findings from a single
study to multiple studies, providing a more robust foundation for
personalized decision-making in diverse populations.
Goal: Guide treatment decision-making in a health care or other practical setting by estimating heterogeneous treatment effects
Setting: Have multiple RCTs and a target population at baseline that is not from those RCTs
Key features:
Compared to other packages that more focus primarily on
meta-analysis, our package, mulicate
is unique in that it
is specially designed to estimate Conditional Average Treatment Effects
(CATE) by combining multiple RCTs and predict CATEs in a target
population of interest. The aim of this package is to use this package
to make informed decisions on which treatment may be preferable for the
given covariate profile. To our knowledge, no other R package currently
supports both estimation and prediction of CATEs specifically when
combining RCTs with machine learning methods. A more detailed
explanation of the methods, their contributions and limitations is
provided in Brantner et al. (2024).3
The standard causal inference assumptions, including the Stable Unit Treatment Value Assumption (SUTVA) are maintained within each RCT, but for the assumption 4, which requires that any covariate \(X\) is possible to be observed in all studies, may be relaxed depending on the method used.
While Meta-analysis is commonly applied within a parametric framework, real-world clinical data often involve complex structures that violate parametric assumptions. Unlike parametric models that require pre-specification of effect moderators and distributional assumptions, nonparametric methods offer greater flexibility, especially in modeling nonlinear relationships between covariates and treatment effects.
The S-learner is a ‘meta-learner’ in that it combines base learners of different forms in a special way, and use this learner to estimate a conditional outcome mean function given observed covariates and assigned treatment as \(\mu(X,A) = E(Y|X,A)\). Then by plugging in 0 and 1 for \(A\), we can obtain predicted outcomes under treatment and control for each individual and calculate \(\hat{\tau} = \hat{\mu} (X,1) - \hat{\mu} (X,0)\). In this package, we applied ‘random forest’ for this base learner.
Causal forest is similar to a traditional random forest, but the primary estimand is the treatment effect itself, not the outcome mean function. It recursively partitions covariates to best split based on treatment effect heterogeneity. The treatment effect is estimated as the difference in average outcomes between the treatment and control units ‘within each leaf’. In other words, the causal forest is the weighted aggregation of many causal trees.
This package is especially well-suited for combining multiple RCTs with various aggregation methods. One of the most common aggregation methods is the ‘federated learning setting’, where individual-level data cannot be shared across study sites, and only aggregate results or models are shared. However, our package also supports settings where individual-level data can be shared across trials.
When trials are too heterogeneous to justify combining information across studies, it may be preferable to estimate effects separately for each trial. In this case, fitting models within each study independently would be most appropriate. Note that this is not technically an ‘aggregation approach’ since each study is analyzed independently, and no cross-study information is used. You can specify this setting with `aggregation_method = “studyspecific”.
A ‘complete pooling approach’ - combining all data and treating it as a single study - requires strong assumptions. To relax these, our package implements ‘pooling with a trial indicator’. Basically, all of the individual data from all RCTs is combined into one comprehensive dataset, but a categorical study variable is included. This allows researchers to apply single-study approaches while accounting for full covariates including membership indicator. This will yield trial-specific CATE estimates. You can use aggregation_method = “studyindicator” to apply this method.
This method is based on Tan and colleagues’ methods4 for federated learning, devised for scenarios in which individual data cannot be shared across trial sites. First, it builds localized models for CATE within each trial, and apply these models to all individual in all RCTs to get each individual trial-specific CATE estimates. Then, an ensemble model is trained using these estimates as the response variable, with individual covariates and trial indicators as predictors. This method can be selected with aggregation_method = “ensembleforest”.
A unique feature of the multicate
package is its ability
to predict CATEs for a target population — such as a new group of
patients at baseline in a healthcare system — using models trained on
multiple RCTs. This is particularly useful when models trained on RCTs
need to be applied to individuals outside of the original study samples,
such as patients from electronic health records (EHR) or historical data
from clinical systems. When multiple RCTs involve the same treatment
options, you can use this package to apply fitted models to these
external individuals and guide real-world treatment decisions.
Brantner, C. L., Nguyen, T. Q., Tang, T., Zhao, C., Hong, H., & Stuart, E. A. (2024). Comparison of methods that combine multiple randomized trials to estimate heterogeneous treatment effects. Statistics in medicine, 43(7), 1291-1314. https://onlinelibrary.wiley.com/doi/abs/10.1002/sim.9955↩︎
Rubin, D. B. (1974). Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of educational Psychology, 66(5), 688. https://psycnet.apa.org/record/1975-06502-001↩︎
Brantner, C. L., Nguyen, T. Q., Tang, T., Zhao, C., Hong, H., & Stuart, E. A. (2024). Comparison of methods that combine multiple randomized trials to estimate heterogeneous treatment effects. Statistics in medicine, 43(7), 1291-1314. https://onlinelibrary.wiley.com/doi/abs/10.1002/sim.9955↩︎
Tan, X., Chang, C. C. H., Zhou, L., & Tang, L. (2022, June). A tree-based model averaging approach for personalized treatment effect estimation from heterogeneous data sources. In International Conference on Machine Learning (pp. 21013-21036). PMLR. https://proceedings.mlr.press/v162/tan22a.html↩︎