When to Act and When to Ask: Policy Learning With Deferral Under Hidden Confounding
A method for learning policies from observational data under hidden confounding, where the policy model can either predict a treatment assignment or defer to an expert.
We train a CARED policy on observational training and validation sets ds_train and ds_valid, and predict the policy on a test set ds_test.
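Before training the policy, per-arm CAPO bounds under a chosen confounding degree must be estimated. The snippet below is only a hedged placeholder for that step: the BLearner constructor, its fit and predict_bounds methods, and the attributes ds_train.x / ds_train.y are illustrative assumptions rather than the repository's documented API.
# Confounding degree for the sensitivity model; gamma = 1 typically corresponds
# to assuming no hidden confounding, larger values admit stronger confounding.
gamma = 1.5  # example value
# Hypothetical bound-estimation step (names and signatures are assumptions):
# blearner = BLearner(gamma=gamma)
# blearner.fit(ds_train.x, ds_train.t, ds_train.y)
# CAPO_bounds = blearner.predict_bounds(ds_train.x)  # lower/upper CAPO bounds per treatment arm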
Given the CAPO bounds (CAPO_bounds) obtained from the BLearner with a specified confounding degree gamma, we then train the policy as follows:
import torch
import torch.nn as nn
from models.lce_policy.lce_policy import LCE_Policy
# Logistic policy model: one output per treatment plus one output for deferring to the expert
policy_model = nn.Sequential(
    nn.Linear(features_num, treatments_num + 1),
    nn.Sigmoid())
# Policy learner that optimizes the $L_{CE}$ objective
lce_policy = LCE_Policy(tau_hat=CAPO_bounds,
                        policy_model=policy_model,
                        use_rho=True,
                        gamma=gamma,
                        higher_better=True)
# Train the policy model
lce_policy.fit(ds_train=ds_train, ds_valid=ds_valid, devices=devices)
# Predict on the test set
lce_pi_with_deferral = lce_policy.predict(ds_test=ds_test)
# Replace all deferrals with the expert's actions to get the final treatment assignment
# (update_expert_preds is assumed to be imported from the repository's utilities)
lce_pi, deferral_count, deferral_rate = update_expert_preds(
    preds=lce_pi_with_deferral,
    expert_labels=torch.Tensor(ds_test.t).type(torch.LongTensor))
The following commands will replicate the figures from the paper.
- For Figure 1, run synthetic_experiment.py
- For Figure 2, run ihdp_experiment.py