An envelope script for Causal Inference on Synthetic Data
The script H:/MR/Projects/Shared/CausalEffects/CausalEffectScripts/run_process.py is an envelope for generating synthetic data for causal-inference, applying several methods, and analyzing. At the moment, the script should be executed on node-05. The scripts parmeters are:
-
config is the only required parameter. An example of a configuration file is -
Note that all keys are required, and no sapces allowed around the assignment sign. All keys are used for generating the synthetic data, and ** is also used for all further steps -
** tells the script to list all stages and stop:
-
** and ** allow running only a subset of the script's stages
- slow stages are - IPW.NN and Quasi.Full_NN The stages of the script perform the following tasks-
Generate | Generate the synthetic data |
Stats | Some analyses on synthetic data, including generation of true validation ITEs file |
Naive.LGBM → IPW.SHAP | check_toy_model using various methods |
Performance | Check performance of various methods using correlation to true ITE |
A part of the Stats stage is running the script H:/MR/Projects/Shared/CausalEffects/CausalEffectScripts/analyze_risk_matrix.py * *that generates a PDF file () with various graphs. However, due to Python issues we currently do not run it within the envelope script and it should be executed seperately The output of the envelope script is a file **/Summary which include both config file information, as well as the performance evaluation information: