| xfr_surrogate | R Documentation |
A function for estimating the proportion of treatment effect explained using repeated cross-fitting.
xfr_surrogate( ds, x = NULL, s, y, a, splits = 50, K = 5, outcome_learners = NULL, ps_learners = NULL, interaction_model = TRUE, trim_at = 0.05, outcome_family = gaussian(), mthd = "superlearner", n_ptb = 0, ... )
ds |
a |
x |
names of all covariates in |
s |
names of surrogates in |
y |
name of the outcome in |
a |
treatment variable name (eg. groups). Expect a binary variable made of |
splits |
number of data splits to perform. |
K |
number of folds for cross-fitting. Default is |
outcome_learners |
string vector indicating learners to be used for estimation of the outcome function (e.g., |
ps_learners |
string vector indicating learners to be used for estimation of the propensity score function (e.g., |
interaction_model |
logical indicating whether outcome functions for treated and control should be estimated separately. Default is |
trim_at |
threshold at which to trim propensity scores. Default is |
outcome_family |
default is |
mthd |
selected regression method. Default is |
n_ptb |
Number of perturbations. Default is |
... |
additional parameters (in particular for super_learner) |
a tibble with columns:
Rm: estimate of the proportion of treatment effect explained, computed as the median over the repeated splits.
R_se0 standard error for the PTE, accounting for the variability due to splitting.
R_cil0 lower confidence interval value for the PTE.
R_cih0 upper confidence interval value for the PTE.
Dm: estimate of the overall treatment effect, computed as the median over the repeated splits.
D_se0 standard error for the overall treatment effect, accounting for the variability due to splitting.
D_cil0 lower confidence interval value for the overall treatment effect.
D_cih0 upper confidence interval value for the overall treatment effect.
Dsm: estimate of the residual treatment effect, computed as the median over the repeated splits.
Ds_se0 standard error for the residual treatment effect, accounting for the variability due to splitting.
Ds_cil0 lower confidence interval value for the residual treatment effect.
Ds_cih0 upper confidence interval value for the residual treatment effect.
n <- 100
p <- 20
q <- 2
wds <- sim_data(n = n, p = p)
if(interactive()){
lasso_est <- xfr_surrogate(ds = wds,
x = paste('x.', 1:q, sep =''),
s = paste('s.', 1:p, sep =''),
a = 'a',
y = 'y',
splits = 2,
K = 2,
trim_at = 0.01,
mthd = 'lasso',
ncores = 1)
}