Identifies variable sets for shift interventions by fitting a flexible estimator using a discrete Super Learner. This function non-parametrically performs variable importance of variable sets, such as identifying two mixture components or mixture components and baseline covariates that explain the outcome given an ANOVA-like decomposition. An F-statistic is used to threshold the importance of variable sets, and these variable sets are treated as the data- adaptive parameter.

fit_basis_estimators(
  at,
  a_names,
  z_names,
  w_names,
  outcome,
  outcome_type,
  mediator_type,
  quantile_thresh,
  zeta_learner,
  fold,
  seed
)

Arguments

at

Training dataframe

a_names

Names of treatment variables

z_names

Names of mediator variables

w_names

Names of baseline covariate variables

outcome

Variable name for the outcome

outcome_type

Variable type of the outcome

mediator_type

Variable type of the mediator

quantile_thresh

Quantile level to set the F-statistic threshold for determining basis functions for estimation

zeta_learner

Stack of algorithms made in SL3 used in ensemble machine learning to fit Y|A,W

fold

Current fold in the cross-validation

seed

Seed number for consistent results

Value

A list of results from fitting the best b-spline model to the data this list includes the selected learner model, the name of the learner, the ANOVA fit on the model matrix of the basis functions, the basis used, and the residual metrics