In this analysis of Meta-Learner methodologies, we'll explore the recently published B-Learner paper. B-Learner postulates the degree of hidden confounding factors and derives precise boundaries alongside CATE predictions. (The level of confounding factors requires supplementation with domain expertise.)
A precise boundary represents the minimal bound that accounts for the effects of observed data and hidden confounders (domain knowledge). A less stringent bound is termed a valid bound, which encompasses additional values beyond the true causal effect. Consequently, caution is necessary when making decisions based on valid bounds.
We'll employ the Neyman-Rubin outcome framework. The unobservable distribution
Given these premises, we can calculate CATE (Conditional Average Treatment Effect) from the difference in outcomes:
Furthermore, in the absence of confounding factors, we can derive CATE from the difference in expected values of latent outcomes:
However, some confounding exists that cannot be fully explained by the observed covariates
When
The sensitivity parameter
Let
$$\Lambda^{-1 } \leq \frac{Q(A=1|X=x,U=u)}{Q(A=0|X=x,U=u)} / \frac{e^(x)}{1-e^(x)} \leq \Lambda$$
Define the upper limit of outcome and CATE in the situation where
The lower limit can be obtained by converting sup to inf. The superscript + denotes the upper limit, while - indicates the lower limit.
Estimates satisfying
Conversely,
Estimates satisfying
We formalise the sharp bound of CATE by the observed data distribution
First, we introduce a pseudo outcome corresponding to CVaR (Conditional Value at Risk) and unobserved outcome boundaries.
Even if the effect of a measure is positive overall (ATE), it may be negative in terms of individual effects (ITE). CVaR formalises this negative impact as a risk.
If
Here, if we use the sharp bound
$$Y^+(x,1) = e^(x)\mu^(x,1) + (1-e^(x))\rho^{+}(x,1)$$
$$Y^-(x,0) = (1-e^(x))\mu^(x,0) + e^(x)\rho^{-}(x,0)$$
From the above, we can now express the upper limit of CATE's sharp bound as
However, the counterfactual
We've obtained a valid/sharp bound for
Up to this point, we've formalised CATE's sharp bound. From here, we'll propose B-Learner by further improving the accuracy of the sharp bound.
Plug-in estimation of
The influence function is a system of functions used to learn conditional DTE (CDTE), and has a form similar to double-robust:
Given estimated nuisance
The third term on the right side of
Pseudo outcome can be considered a statistical estimator of observed data distribution
B-Learner is a two-step estimation method:
- In the first stage, we estimate the nuisances (outcome, propensity score, CVaR) with k-fold cross-fitting and construct a pseudo-outcome estimator.
- In the second step, we use the estimated pseudo-outcome as a covariate. Regress on
$X$ and obtain the CATE bound.
The propensity score $e^(x)$ or quantile $q^{\pm}$ is derived using standard classifiers or regression models.
Also, for the outcome $\rho^{\pm*}(x,a) = \Lambda^{-1}\mu^(x,a) + (1-\Lambda)^{-1}CVaR^{\pm}(x,a)$, it's possible to derive this by separately predicting $\mu^(x,a)$ and
In the first stage alone, the estimation error of the sharp bound bias will be
Also, if the quantile estimates are inconsistent,
In the paper, three types of verification were performed: simulation data, IHDP, and 401(k) eligibility. Here, we'll only discuss the results for 401(k) eligibility.
401(k) Eligibility is a dataset about 401(k) eligibility and its impact on financial assets. This dataset is known to be unconfounded, but assuming there is confounding, we verify B-Learner by changing
Left figure: For
Right figure: When changing
Let us verify whether the lower and upper bounds can be estimated using simulation data.
The simulation data generation process follows this format (
make help
make all
make run
make clean
The analysis of the B-Learner simulation results reveals several notable patterns.
Firstly, the average true Conditional Average Treatment Effect (CATE) remains constant at 2.0661 across all levels of log_gamma
, indicating a consistent positive effect of the treatment. As log_gamma
increases from 0 to 1, a widening of the estimated bounds is observed, with both the average lower and upper bounds diverging symmetrically. This expansion of the bounds reflects increasing uncertainty about the treatment effect as the model accounts for potential unobserved confounding. The coverage, representing the proportion of true CATEs falling within the estimated bounds, steadily improves from 10.65% at log_gamma = 0
to 34.43% at log_gamma = 1
, suggesting more conservative, wider bounds are more likely to contain the true effect.
Interestingly, the percentage of negative lower bounds increases from 60.11% to 87.16% as log_gamma
rises, indicating a growing possibility of negative treatment effects for some individuals despite the positive average effect. The initial negative lower bounds across all log_gamma
values suggest the method cannot rule out the possibility of negative treatment effects when accounting for potential confounding.
Finally, the gradual improvement in coverage as log_gamma
increases demonstrates the method's ability to capture the true effect more reliably as it allows for greater confounding, albeit at the cost of wider, less precise bounds.
The results of the 401k simulation data reveal significant implications for understanding the impact of 401k eligibility on financial assets. The consistent positive average true CATE of 2.0661 suggests a generally beneficial effect of 401k eligibility. However, the increasing percentage of negative lower bounds, reaching 87.16% at log_gamma = 1
, indicates substantial heterogeneity in individual treatment effects. This heterogeneity implies 401k eligibility might adversely affect some individuals' financial assets, despite the overall positive average effect. The widening bounds and improving coverage as log_gamma
increases reflect the model's growing uncertainty when accounting for potential unobserved confounders. These findings accentuate the complexity of 401k eligibility's impact on wealth distribution and highlight the importance of considering individual variations and potential hidden confounders when evaluating retirement savings policies.
This project is licensed under the GNU General Public License v3.0.
@misc{ble401kv2024,
author = {Oketunji, A.F.},
title = {B-Learner Experiment},
year = 2024,
version = {0.0.1},
publisher = {Zenodo},
doi = {10.5281/zenodo.13294344},
url = {https://doi.org/10.5281/zenodo.13294344}
}
(c) 2024 Finbarrs Oketunji. All Rights Reserved.