FISTA_backtracking — FISTA_backtracking • MRTpostInfLASSO

This fuction applies the FISTA algorithm with backtracking to solve randomized LASSO problem. Reference paper is "A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems" by Amir Beck and Marc Teboulle.

Usage

FISTA_backtracking(
  data,
  ID,
  moderator_formula,
  lam = NULL,
  noise_scale = NULL,
  splitrat = 0.8,
  max_ite = 10^(5),
  tol = 10^(-4),
  beta = NULL
)

Arguments

data: the output of pseudo_outcomecal function
ID: the name of column where participants' ID are stored
moderator_formula: determines the formula for the f(St)T*beta function
lam: the value of penalty term of randomized LASSO. If it is not provided, the default value will be used. Default is \(\sqrt{2n* logp} \rho sd(y)\) where \(\rho\) is the split rate and \(n\) is the number of rows.
noise_scale: Standard deviation of Gaussian noise added to objective. Default is \(\sqrt{\frac{1-\rho}{\rho}\, n}\,\mathrm{sd}(y)\) where \(\rho\) is the split rate and \(n\) is the number of rows. The random noises, \(\omega\), are iid drawn from normal distribution with standard deviation noise_scale
splitrat: this value is corresponding to the data splitting rate. Details can read "Exact Selective Inference with Randomization" page 15 equation (10). This value will be used only when user doesn't provide the noise_scale or lam.
max_ite: the maximum iteration for searching predictors
tol: when the improvement of residual sum of square less than this number, stop searching
beta: the true coefficient value (if simulation is conducted)

Value

formula: the raw model before selection
E: Selected variables
NE: Not selected variables
n: the number of participants in the study
soln: Estimated coefficients of selected variables from the penalized randomized regression.
Z: Subgradient of unselected variables.
OMEGA: The scaled variance for the added random noise. This value is scaled by 4.
lam: Scale orginal regularization parameter by -2 for later inference purpose.
ori_lam: Regularization parameter (lambda) used in the penalization.
perturb: Random noise added for each variables. This value is scaled by -2 for later inference purpose.
nonzero: Boolean vector indicating which variables were selected.
postbeta: True beta projected on space spinned by post-selection predictors
sign_soln: Signs of the estimated coefficients for selected variables

Examples


sim_data = generate_dataset(N = 1000, T = 40, P = 50, sigma_residual = 1.5, sigma_randint = 1.5, main_rand = 3, rho = 0.7,
  beta_logit = c(-1, 1.6 * rep(1/50, 50)), model = ~ state1 + state2 + state3 + state4,
  beta = matrix(c(-1, 1.7, 1.5, -1.3, -1),ncol = 1),
  theta1 = 0.8)

Ht = unlist(lapply(1:50, FUN = function(X) paste0("state",X)))
St = unlist(lapply(1:25, FUN = function(X) paste0("state",X)))

pseudo_outcome_CVlasso = pseudo_outcome_generator_CVlasso(fold = 5,ID = "id", sim_data, Ht,
  St, "action", "outcome",core_num = 5)
#> Error in pseudo_outcome_generator_CVlasso(fold = 5, ID = "id", sim_data,     Ht, St, "action", "outcome", core_num = 5): argument "outcome" is missing, with no default

my_formula = as.formula(paste("yDR ~ ", paste(St, collapse = " + ")))

FISTA_backtracking(data = sim_data, moderator_formula = my_formula, lam = NULL, noise_scale = NULL,
  splitrat = 0.7, beta = matrix(c(-1, 1.7, 1.5, -1.3, -1, rep(0,21)), ncol = 1))
#> Error in `[.data.frame`(data, , "ptSt"): undefined columns selected