因果机器学习调研综述(引用上)

323 阅读41分钟

目录与链接

因果机器学习调研综述(上)

因果机器学习调研综述(中)

因果机器学习调研综述(下)

因果机器学习调研综述(引用上)

因果机器学习调研综述(引用下)

引用目录

[1] A. D'Amour, K. Heller, D. Moldovan, B. Adlam, B. Alipanahi, A. Beutel, C. Chen, J. Deaton, J. Eisenstein, M. D. Hoffman et al., "Underspecification presents challenges for credibility in modern machine learning," arXiv preprint arXiv:2011.03395, 2020.

[2] A. Jahanian*, L. Chai*, and P. Isola, "On the "steerability" of generative adversarial networks," in International Conference on Learning Representations, 2020. [Online]. Available: openreview.net/forum?id=Hy…

[3] N. Mehrabi, F. Morstatter, N. Saxena, K. Lerman, and A. Galstyan, "A survey on bias and fairness in machine learning," ACM Computing Surveys (CSUR), vol. 54, no. 6, pp. 1–35, 2021.

[4] E. M. Bender, T. Gebru, A. McMillan-Major, and S. Shmitchell, "On the dangers of stochastic parrots: Can language models be too big?" in Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency, 2021, pp. 610–623.

[5] Z. C. Lipton, "The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery." Queue, vol. 16, no. 3, pp. 31–57, 2018.

[6] G. Dulac-Arnold, D. Mankowitz, and T. Hester, "Challenges of real-world reinforcement learning," arXiv preprint arXiv:1904.12901, 2019.

[7] J. Peters, D. Janzing, and B. Schölkopf, Elements of Causal Inference - Foundations and Learning Algorithms, ser. Adaptive Computation and Machine Learning Series. Cambridge, MA, USA: The MIT Press, 2017.

[8] A. Goyal and Y. Bengio, "Inductive biases for deep learning of higher-level cognition," arXiv preprint arXiv:2011.15091, 2020.

[9] J. Pearl, "The seven tools of causal inference, with reflections on machine learning," Communications of the ACM, vol. 62, no. 3, pp. 54–60, 2019.

[10] O. Ahmed, F. Träuble, A. Goyal, A. Neitz, Y. Bengio, B. Schölkopf, M. Wüthrich, and S. Bauer, "Causalworld: A robotic manipulation benchmark for causal structure and transfer learning," arXiv preprint arXiv:2010.04296, 2020.

[11] B. Schölkopf, F. Locatello, S. Bauer, N. R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio, "Towards causal representation learning," arXiv preprint arXiv:2102.11107, 2021.

[12] J. Pearl, Causality. Cambridge university press, 2009.

[13] B. Neal, "Introduction to causal inference," 2020.

[14] B. Schölkopf and J. von Kügelgen, "From statistical to causal learning," 2022. [Online]. Available: arxiv.org/abs/2204.00…

[15] A. Van Oord, N. Kalchbrenner, and K. Kavukcuoglu, "Pixel recurrent neural networks," in International conference on machine learning. PMLR, 2016, pp. 1747–1756.

[16] A. van den Oord, N. Kalchbrenner, L. Espeholt, K. Kavukcuoglu, O. Vinyals, and A. Graves, "Conditional image generation with pixelcnn decoders," inAdvances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, December 5-10, 2016, Barcelona, Spain, D. D. Lee, M. Sugiyama, U. von Luxburg, I. Guyon, and R. Garnett, Eds., 2016, pp. 4790–4798. [Online]. Available: proceedings.neurips. cc/paper/2016/hash/b1301141feffabac455e1f90a7de2054-Abstract.html

[17] Y. Yu, X. Si, C. Hu, and J. Zhang, "A review of recurrent neural networks: Lstm cells and network architectures," Neural computation, vol. 31, no. 7, pp. 1235–1270, 2019.

[18] J. Kaddour, Y. Zhu, Q. Liu, M. Kusner, and R. Silva, "Causal effect inference for structured treatments," in Advances in Neural Information Processing Systems, A. Beygelzimer, Y. Dauphin, P. Liang, and J. W. Vaughan, Eds., 2021. [Online]. Available: openreview.net/forum?id=0v…

[19] F. Huszár, "Ml beyond curve fitting: An intro to causal inference and do-calculus," 2018. [Online]. Available: www.inference.vc/untitled/

[20] M. Glymour, J. Pearl, and N. P. Jewell, Causal inference in statistics: A primer. John Wiley & Sons, 2016.

[21] V. Papyan, X. Han, and D. L. Donoho, "Prevalence of neural collapse during the terminal phase of deep learning training," Proceedings of the National Academy of Sciences, vol. 117, no. 40, pp. 24 652–24 663, 2020.

[22] R. Geirhos, J.-H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F. A. Wichmann, "Shortcut learning in deep neural networks," Nature Machine Intelligence, vol. 2, no. 11, pp. 665–673, 2020.

[23] Y. Wang and M. I. Jordan, "Desiderata for representation learning: A causal perspective," arXiv preprint arXiv:2109.03795, 2021.

[24] J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei, "Imagenet: A large-scale hierarchical image database," in 2009 IEEE conference on computer vision and pattern recognition. Ieee, 2009, pp. 248–255.

[25] S. Singla and S. Feizi, "Salient imagenet: How to discover spurious features in deep learning?" in International Conference on Learning Representations, 2022. [Online]. Available: openreview.net/forum?id=XV…

[26] D. Janzing, D. Balduzzi, M. Grosse-Wentrup, and B. Schölkopf, "Quantifying causal influences," The Annals of Statistics, vol. 41, no. 5, pp. 2324–2358, 2013.

[27] M. Arjovsky, L. Bottou, I. Gulrajani, and D. Lopez-Paz, "Invariant risk minimization," CoRR, vol. abs/1907.02893, 2019. [Online]. Available: arxiv.org/abs/1907.02…

[28] Z. Shen, J. Liu, Y. He, X. Zhang, R. Xu, H. Yu, and P. Cui, "Towards out-ofdistribution generalization: A survey," arXiv preprint arXiv:2108.13624, 2021.

[29] S. Beery, G. Van Horn, and P. Perona, "Recognition in terra incognita," inProceedings of the European conference on computer vision (ECCV), 2018, pp. 456–473.

[30] J. Mitrovic, B. McWilliams, J. C. Walker, L. H. Buesing, and C. Blundell, "Representation learning via invariant causal mechanisms," in International Conference on Learning Representations, 2021.

[31] M. Ilse, J. M. Tomczak, and P. Forré, "Selecting data augmentation for simulating interventions," in Proceedings of the 38th International Conference on Machine Learning, ICML 2021, 18-24 July 2021, Virtual Event, ser. Proceedings of Machine Learning Research, M. Meila and T. Zhang, Eds., vol. 139. PMLR, 2021, pp. 4555–4562. [Online]. Available: proceedings.mlr.press/v139/ilse21…

[32] J. V. Kügelgen, Y. Sharma, L. Gresele, W. Brendel, B. Schölkopf, M. Besserve, and F. Locatello, "Self-supervised learning with data augmentations provably isolates content from style," in Advances in Neural Information Processing Systems, A. Beygelzimer, Y. Dauphin, P. Liang, and J. W. Vaughan, Eds., 2021. [Online]. Available: openreview.net/forum?id=4p…

[33] D. Kaushik, A. Setlur, E. H. Hovy, and Z. C. Lipton, "Explaining the efficacy of counterfactually augmented data," in 9th International Conference on Learning Representations, ICLR 2021, Virtual Event, Austria, May 3-7, 2021. OpenReview.net, 2021. [Online]. Available: openreview.net/forum?id=HH…

[34] D. Teney, E. Abbasnejad, and A. van den Hengel, "Learning what makes a difference from counterfactual examples and gradient supervision," inComputer Vision - ECCV 2020 - 16th European Conference, Glasgow, UK, August 23-28, 2020, Proceedings, Part X, ser. Lecture Notes in Computer Science, A. Vedaldi, H. Bischof, T. Brox, and J. Frahm, Eds., vol. 12355. Springer, 2020, pp. 580–599. [Online]. Available: doi.org/10.1007/978…

[46] L. Chen, P. Bentley, K. Mori, K. Misawa, M. Fujiwara, and D. Rueckert, "Selfsupervised learning for medical image analysis using image context restoration," Medical image analysis, vol. 58, p. 101539, 2019.

[47] H. Wang, J. Kaddour, S. Liu, J. Tang, M. Kusner, J. Lasenby, and Q. Liu, "Evaluating self-supervised learning for molecular graph embeddings," 2022. [Online]. Available: arxiv.org/abs/2206.08…

[48] K. He, X. Chen, S. Xie, Y. Li, P. Dollár, and R. B. Girshick, "Masked autoencoders are scalable vision learners," arXiv:2111.06377, 2021.

[49] Z. Wu, Y. Xiong, S. Yu, and D. Lin, "Unsupervised feature learning via non-parametric instance-level discrimination," 2018. [Online]. Available: arxiv.org/abs/1805.01…

[50] J.-B. Grill, F. Strub, F. Altché, C. Tallec, P. H. Richemond, E. Buchatskaya, C. Doersch, B. A. Pires, Z. D. Guo, M. G. Azar et al., "Bootstrap your own latent: A new approach to self-supervised learning," in NeurIPS, 2020.

[51] Y. Tian, D. Krishnan, and P. Isola, "Contrastive multiview coding," in European conference on computer vision. Springer, 2020, pp. 776–794.

[52] N. Tomasev, I. Bica, B. McWilliams, L. Buesing, R. Pascanu, C. Blundell, and J. Mitrovic, "Pushing the limits of self-supervised resnets: Can we outperform supervised learning without labels on imagenet?" 2022. [Online]. Available: arxiv.org/abs/2201.05…

[53] G. Chen, J. Li, J. Lu, and J. Zhou, "Human trajectory prediction via counterfactual analysis," 2021. [Online]. Available: arxiv.org/abs/ 2107.14202

[54] Y. Niu, K. Tang, H. Zhang, Z. Lu, X.-S. Hua, and J.-R. Wen, "Counterfactual vqa: A cause-effect look at language bias," in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), June 2021, pp. 12 700–12 710.

[55] Y. Rao, G. Chen, J. Lu, and J. Zhou, "Counterfactual attention learning for fine-grained visual categorization and re-identification," in Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021, pp. 1025– 1034.

[56] P. W. Koh, S. Sagawa, H. Marklund, S. M. Xie, M. Zhang, A. Balsubramani, W. Hu, M. Yasunaga, R. L. Phillips, I. Gao et al., "Wilds: A benchmark of in-the-wild distribution shifts," in International Conference on Machine Learning. PMLR, 2021, pp. 5637–5664.

[57] J. Peters, P. Bühlmann, and N. Meinshausen, "Causal inference by using invariant prediction: identification and confidence intervals," Journal of the Royal Statistical Society: Series B (Statistical Methodology), vol. 78, no. 5, pp. 947–1012, 2016.

[58] E. Rosenfeld, P. K. Ravikumar, and A. Risteski, "The risks of invariant risk minimization," in International Conference on Learning Representations, 2021. [Online]. Available: openreview.net/forum?id=Bb…

[59] P. Kamath, A. Tangella, D. Sutherland, and N. Srebro, "Does invariant risk minimization capture invariance?" in International Conference on Artificial Intelligence and Statistics. PMLR, 2021, pp. 4069–4077.

[60] K. Ahuja, K. Shanmugam, K. Varshney, and A. Dhurandhar, "Invariant risk minimization games," in International Conference on Machine Learning. PMLR, 2020, pp. 145–155.

[61] K. Ahuja, E. Caballero, D. Zhang, J.-C. Gagnon-Audet, Y. Bengio, I. Mitliagkas, and I. Rish, "Invariance principle meets information bottleneck for out-of-distribution generalization," Advances in Neural Information Processing Systems, vol. 34, 2021.

[62] N. Tishby, F. C. Pereira, and W. Bialek, "The information bottleneck method," arXiv preprint physics/0004057, 2000.

[63] D. Krueger, E. Caballero, J. Jacobsen, A. Zhang, J. Binas, D. Zhang, R. L. Priol, and A. C. Courville, "Out-of-distribution generalization via risk extrapolation (rex)," in Proceedings of the 38th International Conference on Machine Learning, ICML 2021, 18-24 July 2021, Virtual Event, ser. Proceedings of Machine Learning Research, M. Meila and T. Zhang, Eds., vol. 139. PMLR, 2021, pp. 5815–5826. [Online]. Available: proceedings.mlr.press/v139/kruege…

[64] D. Mahajan, S. Tople, and A. Sharma, "Domain generalization using causal matching," in Proceedings of the 38th International Conference on Machine Learning, ser. Proceedings of Machine Learning Research, M. Meila and T. Zhang, Eds., vol. 139. PMLR, 18–24 Jul 2021, pp. 7313–7324. [Online]. Available: proceedings.mlr.press/v139/mahaja…

[65] X. Sun, B. Wu, X. Zheng, C. Liu, W. Chen, T. Qin, and T.-Y. Liu, "Recovering latent causal factor for generalization to distributional shifts," in Advances in Neural Information Processing Systems, A. Beygelzimer, Y. Dauphin, P. Liang, and J. W. Vaughan, Eds., 2021. [Online]. Available: openreview.net/forum?id=go…

[66] C. Liu, X. Sun, J. Wang, T. Li, T. Qin, W. Chen, and T.-Y. Liu, "Learning causal semantic representation for out-of-distribution prediction," 11 2020.

[67] C. Lu, Y. Wu, J. M. Hernández-Lobato, and B. Schölkopf, "Invariant causal representation learning for out-of-distribution generalization," inInternational Conference on Learning Representations, 2022. [Online]. Available: openreview.net/forum?id=-e…

[68] P. Spirtes, C. N. Glymour, R. Scheines, and D. Heckerman, Causation, prediction, and search. MIT press, 2000.

[69] Y. Atzmon, F. Kreuk, U. Shalit, and G. Chechik, "A causal view of compositional zero-shot recognition," arXiv:2006.14610 [cs], Nov. 2020, arXiv: 2006.14610. [Online]. Available: arxiv.org/abs/2006.14…

[70] J. Johnson, B. Hariharan, L. Van Der Maaten, L. Fei-Fei, C. Lawrence Zitnick, and R. Girshick, "Clevr: A diagnostic dataset for compositional language and elementary visual reasoning," in Proceedings of the IEEE conference on computer vision and pattern recognition, 2017, pp. 2901–2910.

[71] G. Parascandolo, N. Kilbertus, M. Rojas-Carulla, and B. Schölkopf, "Learning independent causal mechanisms," in Proceedings of the 35th International Conference on Machine Learning, ser. Proceedings of Machine Learning Research, J. Dy and A. Krause, Eds., vol. 80. PMLR, 10–15 Jul 2018, pp. 4036–4044. [Online]. Available: https: //proceedings.mlr.press/v80/parascandolo18a.html

[72] A. Goyal, A. Lamb, J. Hoffmann, S. Sodhani, S. Levine, Y. Bengio, and B. Schölkopf, "Recurrent independent mechanisms," in International Conference on Learning Representations, 2021. [Online]. Available: https: //openreview.net/forum?id=mLcmdlEUxy

[73] S. Hochreiter and J. Schmidhuber, "Long short-term memory," Neural Computation, vol. 9, no. 8, pp. 1735–1780, 1997.

[74] A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, L. Kaiser, and I. Polosukhin, "Attention is all you need," 2017. [Online]. Available: arxiv.org/abs/1706.03…

[75] K. Madan, N. R. Ke, A. Goyal, B. Schölkopf, and Y. Bengio, "Fast and slow learning of recurrent independent mechanisms," 2021. [Online]. Available: openreview.net/forum?id=Lc…

[76] Z. Yue, Q. Sun, X.-S. Hua, and H. Zhang, "Transporting causal mechanisms for unsupervised domain adaptation," 2021. [Online]. Available: arxiv.org/abs/2107.11…

[77] J. Pearl and E. Bareinboim, "External validity: From do-calculus to transportability across populations," Statistical Science, vol. 29, no. 4, nov 2014.

[78] T. Teshima, I. Sato, and M. Sugiyama, "Few-shot domain adaptation by causal mechanism transfer," in International Conference on Machine Learning. PMLR, 2020, pp. 9458–9469.

[79] Z. Liu, P. Luo, X. Wang, and X. Tang, "Deep learning face attributes in the wild," in Proceedings of International Conference on Computer Vision (ICCV), December 2015.

[80] A. Subbaswamy, R. Adams, and S. Saria, "Evaluating model robustness and stability to dataset shift," in The 24th International Conference on Artificial Intelligence and Statistics, AISTATS 2021, April 13-15, 2021, Virtual Event, ser. Proceedings of Machine Learning Research, A. Banerjee and K. Fukumizu, Eds., vol. 130. PMLR, 2021, pp. 2611–2619. [Online]. Available: proceedings.mlr.press/v130/subbas…

[81] E. Grant, C. Finn, S. Levine, T. Darrell, and T. L. Griffiths, "Recasting gradient-based meta-learning as hierarchical bayes," in 6th International Conference on Learning Representations, ICLR 2018, Vancouver, BC, Canada, April 30 - May 3, 2018, Conference Track Proceedings. OpenReview.net, 2018. [Online]. Available: openreview.net/forum?id=BJ…

[82] V. Mikulik, G. Delétang, T. McGrath, T. Genewein, M. Martic, S. Legg, and P. Ortega, "Meta-trained agents implement bayes-optimal agents," Advances in neural information processing systems, vol. 33, pp. 18 691–18 703, 2020.

[83] J. Kaddour, S. Sæmundsson, and M. P. Deisenroth, "Probabilistic active metalearning," in Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual, H. Larochelle, M. Ranzato, R. Hadsell, M. Balcan, and H. Lin, Eds., 2020. [Online]. Available: proceedings.neurips.cc/paper/2020/…

[84] K. Sohn, H. Lee, and X. Yan, "Learning structured output representation using deep conditional generative models," in NIPS, 2015.

[85] Y. Song, J. Sohl-Dickstein, D. P. Kingma, A. Kumar, S. Ermon, and B. Poole, "Score-based generative modeling through stochastic differential equations,"arXiv preprint arXiv:2011.13456, 2020.

[86] N. Pawlowski, D. Coelho de Castro, and B. Glocker, "Deep structural causal models for tractable counterfactual inference," Advances in Neural Information Processing Systems, vol. 33, 2020.

[87] A. Sauer and A. Geiger, "Counterfactual generative networks," inInternational Conference on Learning Representations, 2021. [Online]. Available: openreview.net/forum?id=BX…

[88] F. Locatello, S. Bauer, M. Lucic, G. Raetsch, S. Gelly, B. Schölkopf, and O. Bachem, "Challenging common assumptions in the unsupervised learning of disentangled representations," in international conference on machine learning. PMLR, 2019, pp. 4114–4124.

[89] I. Khemakhem, D. Kingma, R. Monti, and A. Hyvarinen, "Variational autoencoders and nonlinear ica: A unifying framework," in International Conference on Artificial Intelligence and Statistics. PMLR, 2020, pp. 2207–2217.

[90] I. Kobyzev, S. J. Prince, and M. A. Brubaker, "Normalizing flows: An introduction and review of current methods," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 43, no. 11, pp. 3964–3979, nov 2021.

[91] D. P. Kingma and M. Welling, "Auto-encoding variational bayes," 2013. [Online]. Available: arxiv.org/abs/1312.61…

[92] P. Sanchez-Martin, M. Rateike, and I. Valera, "Vaca: Design of variational graph autoencoders for interventional and counterfactual queries." arXiv, 2021. [Online]. Available: arxiv.org/abs/2110.14…

[93] H. Kim, S. Shin, J. Jang, K. Song, W. Joo, W. Kang, and I.-C. Moon, "Counterfactual fairness with disentangled causal effect variational autoencoder," 2020. [Online]. Available: arxiv.org/abs/2011.11…

[94] J. Ho, A. Jain, and P. Abbeel, "Denoising diffusion probabilistic models,"Advances in Neural Information Processing Systems, vol. 33, pp. 6840–6851, 2020.

[95] P. Dhariwal and A. Nichol, "Diffusion models beat gans on image synthesis,"Advances in Neural Information Processing Systems, vol. 34, 2021.

[96] P. Sanchez and S. A. Tsaftaris, "Diffusion causal models for counterfactual estimation," arXiv preprint arXiv:2202.10166, 2022.

[97] A. Brock, J. Donahue, and K. Simonyan, "Large scale gan training for high fidelity natural image synthesis," arXiv preprint arXiv:1809.11096, 2018.

[98] M. Yang, F. Liu, Z. Chen, X. Shen, J. Hao, and J. Wang, "Causalvae: Disentangled representation learning via neural structural causal models," in IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2021, virtual, June 19-25, 2021. Computer Vision Foundation / IEEE, 2021, pp. 9593–9602. [Online]. Available: openaccess.thecvf.com/content/CVP… Yang_CausalVAE_Disentangled_Representation_Learning_via_Neural_Structural_Causal_Models_CVPR_2021_paper.html

[99] F. Locatello, B. Poole, G. Rätsch, B. Schölkopf, O. Bachem, and M. Tschannen, "Weakly-supervised disentanglement without compromises," in International Conference on Machine Learning. PMLR, 2020, pp. 6348–6359.

[100] J. Brehmer, P. De Haan, P. Lippe, and T. Cohen, "Weakly supervised causal representation learning," arXiv preprint arXiv:2203.16437, 2022.

[101] L. Gresele, P. K. Rubenstein, A. Mehrjou, F. Locatello, and B. Schölkopf, "The incomplete rosetta stone problem: Identifiability results for multi-view nonlinear ica," in Uncertainty in Artificial Intelligence. PMLR, 2020, pp. 217–227.

[102] K. Sachs, O. Perez, D. Pe'er, D. A. Lauffenburger, and G. P. Nolan, "Causal protein-signaling networks derived from multiparameter single-cell data," Science, vol. 308, no. 5721, pp. 523–529, 2005.

[103] N. Guelzim, S. Bottani, P. Bourgine, and F. Képès, "Topological and causal structure of the yeast transcriptional regulatory network," Nature genetics, vol. 31, no. 1, pp. 60–63, 2002.

[104] C. Shorten and T. M. Khoshgoftaar, "A survey on image data augmentation for deep learning," Journal of big data, vol. 6, no. 1, pp. 1–48, 2019.

[105] A. Hernández-García and P. König, "Data augmentation instead of explicit regularization," 2018. [Online]. Available: arxiv.org/abs/1806.03…

[106] X. Chen, C.-J. Hsieh, and B. Gong, "When vision transformers outperform resnets without pre-training or strong data augmentations," 2021. [Online]. Available: arxiv.org/abs/2106.01…

[107] J. Kaddour, L. Liu, R. Silva, and M. J. Kusner, "A fair comparison of two popular flat minima optimizers: Stochastic weight averaging vs. sharpness-aware minimization," 2022. [Online]. Available: arxiv.org/abs/2202.00…

[108] E. D. Cubuk, B. Zoph, D. Mane, V. Vasudevan, and Q. V. Le, "Autoaugment: Learning augmentation policies from data," arXiv preprint arXiv:1805.09501, 2018.

[109] E. D. Cubuk, B. Zoph, J. Shlens, and Q. V. Le, "Randaugment: Practical automated data augmentation with a reduced search space," in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops, 2020, pp. 702–703.

[110] T. Miller, "Explanation in artificial intelligence: Insights from the social sciences," Artificial intelligence, vol. 267, pp. 1–38, 2019.

[111] C. Molnar, Interpretable Machine Learning, 2nd ed., 2022. [Online]. Available:christophm.github.io/interpretable-ml-book/

[112] N. Burkart and M. F. Huber, "A survey on the explainability of supervised machine learning," Journal of Artificial Intelligence Research, vol. 70, pp. 245– 317, 2021.

[113] F. K. Došilović, M. Brčić, and N. Hlupić, "Explainable artificial intelligence: A survey," in 2018 41st International convention on information and communication technology, electronics and microelectronics (MIPRO). IEEE, 2018, pp. 0210–0215.

[114] S. Verma, J. P. Dickerson, and K. Hines, "Counterfactual explanations for machine learning: A review," CoRR, vol. abs/2010.10596, 2020. [Online]. Available: arxiv.org/abs/2010.10…

[115] A. Karimi, G. Barthe, B. Schölkopf, and I. Valera, "A survey of algorithmic recourse: definitions, formulations, solutions, and prospects," CoRR, vol. abs/2010.04050, 2020. [Online]. Available: arxiv.org/abs/2010.04…

[116] K. Simonyan, A. Vedaldi, and A. Zisserman, "Deep inside convolutional networks: Visualising image classification models and saliency maps," arXiv preprint arXiv:1312.6034, 2013.

[117] J. Adebayo, J. Gilmer, M. Muelly, I. Goodfellow, M. Hardt, and B. Kim, "Sanity checks for saliency maps," Advances in neural information processing systems, vol. 31, 2018.

[118] P. Schwab and W. Karlen, "Cxplain: Causal explanations for model interpretation under uncertainty," in Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, NeurIPS 2019, December 8-14, 2019, Vancouver, BC, Canada, H. M. Wallach, H. Larochelle, A. Beygelzimer, F. d'Alché-Buc, E. B. Fox, and R. Garnett, Eds., 2019, pp. 10 220–10 230. [Online]. Available: proceedings.neurips.cc/paper/2019/…

[119] M. R. O'Shaughnessy, G. Canal, M. Connor, C. Rozell, and M. A. Davenport, "Generative causal explanations of black-box classifiers," in Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual, H. Larochelle, M. Ranzato, R. Hadsell, M. Balcan, and H. Lin, Eds., 2020. [Online]. Available: proceedings.neurips.cc/paper/2020/…

[120] S. M. Lundberg and S.-I. Lee, "A unified approach to interpreting model predictions," Advances in neural information processing systems, vol. 30, 2017.

[121] C. Frye, C. Rowat, and I. Feige, "Asymmetric shapley values: incorporating causal knowledge into model-agnostic explainability," in Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual, H. Larochelle, M. Ranzato, R. Hadsell, M. Balcan, and H. Lin, Eds., 2020. [Online]. Available: proceedings.neurips.cc/paper/2020/…

[122] D. Janzing, L. Minorics, and P. Blöbaum, "Feature relevance quantification in explainable AI: A causal problem," in The 23rd International Conference on Artificial Intelligence and Statistics, AISTATS 2020, 26-28 August 2020, Online [Palermo, Sicily, Italy], ser. Proceedings of Machine Learning Research, S. Chiappa and R. Calandra, Eds., vol. 108. PMLR, 2020, pp. 2907–2916. [Online]. Available: proceedings.mlr.press/v108/janzin…

[123] T. Heskes, E. Sijben, I. G. Bucur, and T. Claassen, "Causal shapley values: Exploiting causal knowledge to explain individual predictions of complex models," in Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual, H. Larochelle, M. Ranzato, R. Hadsell, M. Balcan, and H. Lin, Eds., 2020. [Online]. Available: proceedings.neurips.cc/paper/2020/…

[124] H. Chen, J. D. Janizek, S. Lundberg, and S.-I. Lee, "True to the model or true to the data?" 2020. [Online]. Available: arxiv.org/abs/2006.16…

[125] P. Lipton, "Contrastive explanation," Royal Institute of Philosophy Supplements, vol. 27, pp. 247–266, 1990.

[126] A. Jacovi, S. Swayamdipta, S. Ravfogel, Y. Elazar, Y. Choi, and Y. Goldberg, "Contrastive explanations for model interpretability," in Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, EMNLP 2021, Virtual Event / Punta Cana, Dominican Republic, 7-11 November, 2021, M. Moens, X. Huang, L. Specia, and S. W. Yih, Eds. Association for Computational Linguistics, 2021, pp. 1597–1611. [Online]. Available: doi.org/10.18653/v1…

[127] S. Verma, J. Dickerson, and K. Hines, "Counterfactual explanations for machine learning: Challenges revisited," arXiv preprint arXiv:2106.07756, 2021.

[128] A. Karimi, B. Schölkopf, and I. Valera, "Algorithmic recourse: from counterfactual explanations to interventions," in FAccT '21: 2021 ACM Conference on Fairness, Accountability, and Transparency, Virtual Event / Toronto, Canada, March 3-10, 2021, M. C. Elish, W. Isaac, and R. S. Zemel, Eds. ACM, 2021, pp. 353–362. [Online]. Available: doi.org/10.1145/344…

[129] M. T. Ribeiro, S. Singh, and C. Guestrin, "" why should i trust you?" explaining the predictions of any classifier," in Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, 2016, pp. 1135–1144.

[130] S. Wachter, B. D. Mittelstadt, and C. Russell, "Counterfactual explanations without opening the black box: Automated decisions and the GDPR," CoRR, vol. abs/1711.00399, 2017. [Online]. Available: arxiv.org/abs/1711.00…

[131] A. Karimi, B. J. von Kügelgen, B. Schölkopf, and I. Valera, "Algorithmic recourse under imperfect causal knowledge: a probabilistic approach," inAdvances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual, H. Larochelle, M. Ranzato, R. Hadsell, M. Balcan, and H. Lin, Eds., 2020. [Online]. Available: proceedings.neurips.cc/paper/2020/…

[132] S. Joshi, O. Koyejo, W. Vijitbenjaronk, B. Kim, and J. Ghosh, "Towards realistic individual recourse and actionable explanations in black-box decision making systems," CoRR, vol. abs/1907.09615, 2019. [Online]. Available: arxiv.org/abs/1907.09…

[133] L. Schut, O. Key, R. McGrath, L. Costabello, B. Sacaleanu, M. Corcoran, and Y. Gal, "Generating interpretable counterfactual explanations by implicit minimisation of epistemic and aleatoric uncertainties," in The 24th International Conference on Artificial Intelligence and Statistics, AISTATS 2021, April 13-15, 2021, Virtual Event, ser. Proceedings of Machine Learning Research, A. Banerjee and K. Fukumizu, Eds., vol. 130. PMLR, 2021, pp. 1756–1764. [Online]. Available: proceedings.mlr.press/v130/schut2…. html

[134] A. Abid, M. Yuksekgonul, and J. Zou, "Meaningfully explaining model mistakes using conceptual counterfactuals," 2021. [Online]. Available: arxiv.org/abs/2106.12…

[135] D. Mahajan, C. Tan, and A. Sharma, "Preserving causal constraints in counterfactual explanations for machine learning classifiers," CoRR, vol. abs/1912.03277, 2019. [Online]. Available: arxiv.org/abs/1912.03…

[136] B. Ustun, A. Spangher, and Y. Liu, "Actionable recourse in linear classification," in Proceedings of the Conference on Fairness, Accountability, and Transparency, ser. FAT* '19. New York, NY, USA: Association for Computing Machinery, 2019, p. 10–19. [Online]. Available: https: //doi.org/10.1145/3287560.3287566

[137] R. Kommiya Mothilal, D. Mahajan, C. Tan, and A. Sharma, "Towards unifying feature attribution and counterfactual explanations: Different means to the same end," in Proceedings of the 2021 AAAI/ACM Conference on AI, Ethics, and Society, 2021, pp. 652–663.

[138] S. Galhotra, R. Pradhan, and B. Salimi, "Feature attribution and recourse via probabilistic contrastive counterfactuals," 2021.

[139] E. Albini, J. Long, D. Dervovic, and D. Magazzeni, "Counterfactual shapley additive explanations," arXiv preprint arXiv:2110.14270, 2021.

[140] H. Fokkema, R. de Heide, and T. van Erven, "Attribution-based explanations that provide recourse cannot be robust," 2022. [Online]. Available: arxiv.org/abs/2205.15…

[141] A. Artelt and B. Hammer, "On the computation of counterfactual explanations–a survey," arXiv preprint arXiv:1911.07749, 2019.

[142] D. Mahajan, C. Tan, and A. Sharma, "Preserving causal constraints in counterfactual explanations for machine learning classifiers," arXiv preprint arXiv:1912.03277, 2019.

[143] M. Pawelczyk, T. Datta, J. van-den Heuvel, G. Kasneci, and H. Lakkaraju, "Algorithmic recourse in the face of noisy human responses," 2022. [Online]. Available: arxiv.org/abs/2203.06…

[144] K. Rawal, E. Kamar, and H. Lakkaraju, "Algorithmic recourse in the wild: Understanding the impact of data and model shifts," arXiv preprint arXiv:2012.11788, 2020.

[145] A. Ghorbani, M. Kim, and J. Zou, "A distributional framework for data valuation," in International Conference on Machine Learning. PMLR, 2020, pp. 3535–3544.

[146] U. Aïvodji, A. Bolot, and S. Gambs, "Model extraction from counterfactual explanations," arXiv preprint arXiv:2009.01884, 2020.

[147] R. Shokri, M. Strobel, and Y. Zick, "On the privacy risks of model explanations," in Proceedings of the 2021 AAAI/ACM Conference on AI, Ethics, and Society, 2021, pp. 231–241.

[148] Y.-B. He and Z. Geng, "Active learning of causal networks with intervention experiments and optimal designs," Journal of Machine Learning Research, vol. 9, no. Nov, pp. 2523–2547, 2008.

[149] D. Slack, A. Hilgard, H. Lakkaraju, and S. Singh, "Counterfactual explanations can be manipulated," Advances in Neural Information Processing Systems, vol. 34, 2021.

[150] R. Naidu, A. Priyanshu, A. Kumar, S. Kotti, H. Wang, and F. Mireshghallah, "When differential privacy meets interpretability: A case study," arXiv preprint arXiv:2106.13203, 2021.

[151] K. Makhlouf, S. Zhioua, and C. Palamidessi, "Survey on causal-based machine learning fairness notions," arXiv preprint arXiv:2010.09553, 2020.

[][152] S. Barocas, M. Hardt, and A. Narayanan, Fairness and Machine Learning. fairmlbook.org, 2019, www.fairmlbook.org.

[153] M. J. Kusner, J. R. Loftus, C. Russell, and R. Silva, "Counterfactual fairness," in Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, December 4-9, 2017, Long Beach, CA, USA, I. Guyon, U. von Luxburg, S. Bengio, H. M. Wallach, R. Fergus, S. V. N. Vishwanathan, and R. Garnett, Eds., 2017, pp. 4066–4076. [Online]. Available: proceedings.neurips.cc/paper/2017/…

[154] R. Nabi and I. Shpitser, "Fair inference on outcomes," in Proceedings of the AAAI Conference on Artificial Intelligence, vol. 32, no. 1, 2018.

[155] S. Chiappa, "Path-specific counterfactual fairness," in Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, no. 01, 2019, pp. 7801–7808.

[156] J. Zhang and E. Bareinboim, "Fairness in decision-making—the causal explanation formula," in Proceedings of the AAAI Conference on Artificial Intelligence, vol. 32, no. 1, 2018.

[157] T. J. VanderWeele and W. R. Robinson, "On causal interpretation of race in regressions adjusting for confounding and mediating variables," Epidemiology (Cambridge, Mass.), vol. 25, no. 4, p. 473, 2014.

[158] L. Hu and I. Kohler-Hausmann, "What's sex got to do with machine learning?" in Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency, ser. FAT* '20. New York, NY, USA: Association for Computing Machinery, 2020, p. 513. [Online]. Available: https: //doi.org/10.1145/3351095.3375674

[159] N. Kilbertus, "Beyond traditional assumptions in fair machine learning," Ph.D. dissertation, University of Cambridge, UK, Sep. 2020, (CambridgeTübingen-Fellowship).

[160] N. Kilbertus, M. Rojas Carulla, G. Parascandolo, M. Hardt, D. Janzing, and B. Schölkopf, "Avoiding discrimination through causal reasoning," Advances in neural information processing systems, vol. 30, 2017.

[161] B. Salimi, L. Rodriguez, B. Howe, and D. Suciu, "Interventional fairness: Causal database repair for algorithmic fairness," in Proceedings of the 2019 International Conference on Management of Data, 2019, pp. 793–810.

[162] H. Singh, R. Singh, V. Mhasawade, and R. Chunara, "Fairness violations and mitigation under covariate shift," in Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency, 2021, pp. 3–13.

[163] J. Schrouff, N. Harris, O. Koyejo, I. Alabdulmohsin, E. Schnider, K. OpsahlOng, A. Brown, S. Roy, D. Mincu, C. Chen et al., "Maintaining fairness across distribution shift: do we have viable solutions for real-world applications?"arXiv preprint arXiv:2202.01034, 2022.

[164] D. Slack, S. A. Friedler, and E. Givental, "Fairness warnings and fair-maml: learning fairly with minimal data," in FAT* '20: Conference on Fairness, Accountability, and Transparency, Barcelona, Spain, January 27-30, 2020, M. Hildebrandt, C. Castillo, L. E. Celis, S. Ruggieri, L. Taylor, and G. Zanfir-Fortuna, Eds. ACM, 2020, pp. 200–209. [Online]. Available: doi.org/10.1145/335…

[165] M. E. Guy and S. A. McCandless, "Social equity: Its legacy, its promise,"Public Administration Review, vol. 72, no. s1, pp. S5–S13, 2012.

[166] J. McGinnis Johnson, "Race and social equity: A nervous area of government,"Equality, Diversity and Inclusion: An International Journal, vol. 34, pp. 262– 264, 03 2015.

[167] J. G. Richens, R. Beard, and D. H. Thompson, "First do no harm: counterfactual objective functions for safe & ethical ai," 2022. [Online]. Available: arxiv.org/abs/2204.12…

[168] V. Gupta, P. Nokhiz, C. D. Roy, and S. Venkatasubramanian, "Equalizing recourse across groups," 2019. [Online]. Available: arxiv.org/abs/1909.03…

[169] J. von Kügelgen, A.-H. Karimi, U. Bhatt, I. Valera, A. Weller, and B. Schölkopf, "On the fairness of causal algorithmic recourse," arXiv preprint arXiv:2010.06529, 2020.

[170] W. Huan, Y. Wu, L. Zhang, and X. Wu, "Fairness through equality of effort," in Companion Proceedings of the Web Conference 2020, 2020, pp. 743–751.

[171] Y. Wu, L. Zhang, and X. Wu, "Counterfactual fairness: Unidentification, bound and algorithm," in Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019.

[172] Y. Wu, L. Zhang, X. Wu, and H. Tong, "Pc-fairness: A unified framework for measuring causality-based fairness," Advances in Neural Information Processing Systems, vol. 32, 2019.

[173] I. Kohler-Hausmann, "Eddie murphy and the dangers of counterfactual causal thinking about detecting racial discrimination," Nw. UL Rev., vol. 113, p. 1163, 2018.

[174] L. Hu, "Causation in the social world," Ph.D. dissertation, Harvard University, 2022.

[175] L. Hu and I. Kohler-Hausmann, "What's sex got to do with machine learning?" in Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency, ser. FAT* '20. New York, NY, USA: Association for Computing Machinery, 2020, p. 513. [Online]. Available: doi.org/10.1145/335…

[176] A. Kasirzadeh and A. Smart, "The use and misuse of counterfactuals in ethical machine learning," in Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency, 2021, pp. 228–236.

[177] R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction. Cambridge, MA, USA: A Bradford Book, 2018.

[178] S. Zhu, I. Ng, and Z. Chen, "Causal discovery with reinforcement learning,"arXiv preprint arXiv:1906.04477, 2019.

[179] E. Bareinboim, A. Forney, and J. Pearl, "Bandits with unobserved confounders: A causal approach," Advances in Neural Information Processing Systems, vol. 28, 2015. [180] S. J. Gershman, "Reinforcement learning and causal models," The Oxford handbook of causal reasoning, vol. 1, p. 295, 2017.

[181] C. Szepesvari, "Causality from the perspective of reinforcement learning," 2018, machine Learning for Causal Inference, Counterfactual Prediction, and Autonomous Action (CausalML). [Online]. Available: sites.google.com/site/faim18…

[182] J. Bannon, B. Windsor, W. Song, and T. Li, "Causality and batch reinforcement learning: Complementary approaches to planning in unknown domains,"arXiv preprint arXiv:2006.02579, 2020.

[183] E. Bareinboim, "Causal reinforcement learning," 2020, iCML. [Online]. Available: crl.causalai.net/

[184] S. Weichwald, S. W. Mogensen, T. E. Lee, D. Baumann, O. Kroemer, I. Guyon, S. Trimpe, J. Peters, and N. Pfister, "Learning by doing: Controlling a dynamical system using causality, control, and reinforcement learning," arXiv preprint arXiv:2202.06052, 2022.

[185] D. Silver, T. Hubert, J. Schrittwieser, I. Antonoglou, M. Lai, A. Guez, M. Lanctot, L. Sifre, D. Kumaran, T. Graepel et al., "Mastering chess and shogi by self-play with a general reinforcement learning algorithm," arXiv preprint arXiv:1712.01815, 2017.

[186] R. S. Sutton, "The bitter lesson," 2019. [Online]. Available: http: //www.incompleteideas.net/IncIdeas/Bi…

[187] V. Chernozhukov, D. Chetverikov, M. Demirer, E. Duflo, C. Hansen, W. Newey, and J. Robins, "Double/debiased machine learning for treatment and structural parameters," The Econometrics Journal, vol. 21, no. 1, pp. C1–C68, 01 2018.

[188] S. Levine, A. Kumar, G. Tucker, and J. Fu, "Offline reinforcement learning: Tutorial, review, and perspectives on open problems," arXiv preprint arXiv:2005.01643, 2020.

[189] T. Lattimore and C. Szepesvári, Bandit algorithms. Cambridge University Press, 2020.

[190] F. Lattimore, T. Lattimore, and M. D. Reid, "Causal bandits: Learning good interventions via causal inference," in Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, December 5-10, 2016, Barcelona, Spain, D. D. Lee, M. Sugiyama, U. von Luxburg, I. Guyon, and R. Garnett, Eds., 2016, pp. 1181–1189. [Online]. Available: proceedings.neurips.cc/paper/2016/…

[191] S. Bubeck, R. Munos, and G. Stoltz, "Pure exploration in multi-armed bandits problems," in International conference on Algorithmic learning theory. Springer, 2009, pp. 23–37.

[192] V. Gabillon, M. Ghavamzadeh, and A. Lazaric, "Best arm identification: A unified approach to fixed budget and fixed confidence," Advances in Neural Information Processing Systems, vol. 25, 2012.

[193] J.-Y. Audibert, S. Bubeck, and R. Munos, "Best arm identification in multiarmed bandits." in COLT. Citeseer, 2010, pp. 41–53.

[194] S. Lee and E. Bareinboim, "Structural causal bandits: Where to intervene?" in Advances in Neural Information Processing Systems, S. Bengio, H. Wallach, H. Larochelle, K. Grauman, N. Cesa-Bianchi, and R. Garnett, Eds., vol. 31. Curran Associates, Inc., 2018. [Online]. Available: proceedings.neurips.cc/paper/2018/…

[195] Y. Lu, A. Meisami, and A. Tewari, "Causal bandits with unknown graph structure," in Advances in Neural Information Processing Systems, A. Beygelzimer, Y. Dauphin, P. Liang, and J. W. Vaughan, Eds., 2021. [Online]. Available: openreview.net/forum?id=9-…

[196] R. Silva, "Observational-interventional priors for dose-response learning," inAdvances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, December 5-10, 2016, Barcelona, Spain, D. D. Lee, M. Sugiyama, U. von Luxburg, I. Guyon, and R. Garnett, Eds., 2016, pp. 1561–1569. [Online]. Available: proceedings.neurips.cc/paper/2016/…

[197] V. Aglietti, X. Lu, A. Paleyes, and J. González, "Causal bayesian optimization," in The 23rd International Conference on Artificial Intelligence and Statistics, AISTATS 2020, 26-28 August 2020, Online [Palermo, Sicily, Italy], ser. Proceedings of Machine Learning Research, S. Chiappa and R. Calandra, Eds., vol. 108. PMLR, 2020, pp. 3155–3164. [Online]. Available: proceedings.mlr.press/v108/agliet…

[198] D. R. Jones, M. Schonlau, and W. J. Welch, "Efficient global optimization of expensive black-box functions," Journal of Global optimization, vol. 13, no. 4, pp. 455–492, 1998.

[199] S. Saengkyongam, N. Thams, J. Peters, and N. Pfister, "Invariant policy learning: A causal perspective," CoRR, vol. abs/2106.00808, 2021. [Online]. Available: arxiv.org/abs/2106.00…

[200] Y. Lu, A. Meisami, and A. Tewari, "Efficient reinforcement learning with prior causal knowledge," in First Conference on Causal Learning and Reasoning, 2022. [Online]. Available: openreview.net/forum?id=xz…

[201] T. M. Moerland, J. Broekens, and C. M. Jonker, "Model-based reinforcement learning: A survey," arXiv preprint arXiv:2006.16712, 2020.

[202] D. Ha and J. Schmidhuber, "World models," arXiv preprint arXiv:1803.10122, 2018.

[203] J. Schrittwieser, I. Antonoglou, T. Hubert, K. Simonyan, L. Sifre, S. Schmitt, A. Guez, E. Lockhart, D. Hassabis, T. Graepel et al., "Mastering atari, go, chess and shogi by planning with a learned model," Nature, vol. 588, no. 7839, pp. 604–609, 2020.

[194] S. Lee and E. Bareinboim, "Structural causal bandits: Where to intervene?" in Advances in Neural Information Processing Systems, S. Bengio, H. Wallach, H. Larochelle, K. Grauman, N. Cesa-Bianchi, and R. Garnett, Eds., vol. 31. Curran Associates, Inc., 2018. [Online]. Available: proceedings.neurips.cc/paper/2018/…

[195] Y. Lu, A. Meisami, and A. Tewari, "Causal bandits with unknown graph structure," in Advances in Neural Information Processing Systems, A. Beygelzimer, Y. Dauphin, P. Liang, and J. W. Vaughan, Eds., 2021. [Online]. Available: openreview.net/forum?id=9-…

[196] R. Silva, "Observational-interventional priors for dose-response learning," inAdvances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, December 5-10, 2016, Barcelona, Spain, D. D. Lee, M. Sugiyama, U. von Luxburg, I. Guyon, and R. Garnett, Eds., 2016, pp. 1561–1569. [Online]. Available: proceedings.neurips.cc/paper/2016/…

[197] V. Aglietti, X. Lu, A. Paleyes, and J. González, "Causal bayesian optimization," in The 23rd International Conference on Artificial Intelligence and Statistics, AISTATS 2020, 26-28 August 2020, Online [Palermo, Sicily, Italy], ser. Proceedings of Machine Learning Research, S. Chiappa and R. Calandra, Eds., vol. 108. PMLR, 2020, pp. 3155–3164. [Online]. Available: proceedings.mlr.press/v108/agliet…

[198] D. R. Jones, M. Schonlau, and W. J. Welch, "Efficient global optimization of expensive black-box functions," Journal of Global optimization, vol. 13, no. 4, pp. 455–492, 1998.

[199] S. Saengkyongam, N. Thams, J. Peters, and N. Pfister, "Invariant policy learning: A causal perspective," CoRR, vol. abs/2106.00808, 2021. [Online]. Available: arxiv.org/abs/2106.00…

[200] Y. Lu, A. Meisami, and A. Tewari, "Efficient reinforcement learning with prior causal knowledge," in First Conference on Causal Learning and Reasoning, 2022. [Online]. Available: openreview.net/forum?id=xz…

[201] T. M. Moerland, J. Broekens, and C. M. Jonker, "Model-based reinforcement learning: A survey," arXiv preprint arXiv:2006.16712, 2020.

[202] D. Ha and J. Schmidhuber, "World models," arXiv preprint arXiv:1803.10122, 2018.

[203] J. Schrittwieser, I. Antonoglou, T. Hubert, K. Simonyan, L. Sifre, S. Schmitt, A. Guez, E. Lockhart, D. Hassabis, T. Graepel et al., "Mastering atari, go, chess and shogi by planning with a learned model," Nature, vol. 588, no. 7839, pp. 604–609, 2020.

[204] D. J. Rezende, I. Danihelka, G. Papamakarios, N. R. Ke, R. Jiang, T. Weber, K. Gregor, H. Merzic, F. Viola, J. Wang et al., "Causally correct partial models for reinforcement learning," arXiv preprint arXiv:2002.02836, 2020.

[205] M. Li, M. Yang, F. Liu, X. Chen, Z. Chen, and J. Wang, "Causal world models by unsupervised deconfounding of physical dynamics," arXiv preprint arXiv:2012.14228, 2020.

[206] Z. Wang, X. Xiao, Z. Xu, Y. Zhu, and P. Stone, "Causal dynamics learning for task-independent state abstraction," 2022. [Online]. Available: arxiv.org/abs/2206.13…

[207] D. Chapman and L. P. Kaelbling, "Input generalization in delayed reinforcement learning: An algorithm and performance comparisons." in Ijcai, vol. 91. Citeseer, 1991, pp. 726–731.

[208] A. K. McCallum, Reinforcement learning with selective perception and hidden state. University of Rochester, 1996.

[209] A. A. Mastakouri, B. Schölkopf, and D. Janzing, "Necessary and sufficient conditions for causal feature selection in time series with latent common causes," in International Conference on Machine Learning. PMLR, 2021, pp. 7502– 7511.

[210] L. Weng, "Meta reinforcement learning," lilianweng.github.io, 2019. [Online]. Available: lilianweng.github.io/posts/2019-…

[211] S. A. Sontakke, A. Mehrjou, L. Itti, and B. Schölkopf, "Causal curiosity: RL agents discovering self-supervised experiments for causal representation learning," in Proceedings of the 38th International Conference on Machine Learning, ICML 2021, 18-24 July 2021, Virtual Event, ser. Proceedings of Machine Learning Research, M. Meila and T. Zhang, Eds., vol. 139. PMLR, 2021, pp. 9848–9858. [Online]. Available: proceedings.mlr.press/v139/sontak…

[212] F. Doshi-Velez and G. Konidaris, "Hidden parameter markov decision processes: A semiparametric regression approach for discovering latent task parametrizations," in IJCAI: proceedings of the conference, vol. 2016. NIH Public Access, 2016, p. 1432.

[213] K. Rakelly, A. Zhou, C. Finn, S. Levine, and D. Quillen, "Efficient off-policy meta-reinforcement learning via probabilistic context variables," in Proceedings of the 36th International Conference on Machine Learning, ICML 2019, 9-15 June 2019, Long Beach, California, USA, ser. Proceedings of Machine Learning Research, K. Chaudhuri and R. Salakhutdinov, Eds., vol. 97. PMLR, 2019, pp. 5331–5340. [Online]. Available: proceedings.mlr.press/v97/rakelly…

[214] A. Zhang, C. Lyle, S. Sodhani, A. Filos, M. Kwiatkowska, J. Pineau, Y. Gal, and D. Precup, "Invariant causal prediction for block mdps," in International Conference on Machine Learning. PMLR, 2020, pp. 11 214–11 224.

[215] M. Tomar, A. Zhang, R. Calandra, M. E. Taylor, and J. Pineau, "Modelinvariant state abstractions for model-based reinforcement learning," 2021.

[216] K. Kansky, T. Silver, D. A. Mély, M. Eldawy, M. Lázaro-Gredilla, X. Lou, N. Dorfman, S. Sidor, S. Phoenix, and D. George, "Schema networks: Zeroshot transfer with a generative causal model of intuitive physics," 2017.

[217] V. Mnih, K. Kavukcuoglu, D. Silver, A. A. Rusu, J. Veness, M. G. Bellemare, A. Graves, M. Riedmiller, A. K. Fidjeland, G. Ostrovski et al., "Human-level control through deep reinforcement learning," nature, vol. 518, no. 7540, pp. 529–533, 2015.

[218] M. Mutti, R. De Santi, E. Rossi, J. F. Calderon, M. Bronstein, and M. Restelli, "Provably efficient causal model-based reinforcement learning for systematic generalization," arXiv preprint arXiv:2202.06545, 2022.

[219] S. Parbhoo, S. Joshi, and F. Doshi-Velez, "Generalizing off-policy evaluation from a causal perspective for sequential decision-making," 2022.

[220] L. Buesing, T. Weber, Y. Zwols, N. Heess, S. Racaniere, A. Guez, and J.-B. Lespiau, "Woulda, coulda, shoulda: Counterfactually-guided policy search," in International Conference on Learning Representations, 2019. [Online]. Available: openreview.net/forum?id=BJ…

[221] M. Oberst and D. A. Sontag, "Counterfactual off-policy evaluation with gumbel-max structural causal models," in Proceedings of the 36th International Conference on Machine Learning, ICML 2019, 9-15 June 2019, Long Beach, California, USA, ser. Proceedings of Machine Learning Research, K. Chaudhuri and R. Salakhutdinov, Eds., vol. 97. PMLR, 2019, pp. 4881– 4890. [Online]. Available: proceedings.mlr.press/v97/oberst1…

[222] N. Kallus and A. Zhou, "Confounding-robust policy improvement," inAdvances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, NeurIPS 2018, December 3-8, 2018, Montréal, Canada, S. Bengio, H. M. Wallach, H. Larochelle, K. Grauman, N. Cesa-Bianchi, and R. Garnett, Eds., 2018, pp. 9289– 9299. [Online]. Available: proceedings.neurips.cc/paper/2018/…

[223] ——, "Confounding-robust policy evaluation in infinite-horizon reinforcement learning," in Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual, H. Larochelle, M. Ranzato, R. Hadsell, M. Balcan, and H. Lin, Eds., 2020. [Online]. Available: proceedings.neurips.cc/paper/2020/…

[224] A. Bennett, N. Kallus, L. Li, and A. Mousavi, "Off-policy evaluation in infinitehorizon reinforcement learning with latent confounders," in International Conference on Artificial Intelligence and Statistics. PMLR, 2021, pp. 1999–2007.

[225] T. P. Lillicrap, J. J. Hunt, A. Pritzel, N. Heess, T. Erez, Y. Tassa, D. Silver, and D. Wierstra, "Continuous control with deep reinforcement learning," arXiv preprint arXiv:1509.02971, 2015.

[226] A. Charpentier, R. Elie, and C. Remlinger, "Reinforcement learning in economics and finance," Computational Economics, pp. 1–38, 2021.

[227] H. Namkoong, R. Keramati, S. Yadlowsky, and E. Brunskill, "Off-policy policy evaluation for sequential decisions under unobserved confounding," Advances in Neural Information Processing Systems, vol. 33, pp. 18 819–18 831, 2020.

[228] G. Tennenholtz, U. Shalit, and S. Mannor, "Off-policy evaluation in partially observable environments," in The Thirty-Fourth AAAI Conference on Artificial Intelligence, AAAI 2020, The Thirty-Second Innovative Applications of Artificial Intelligence Conference, IAAI 2020, The Tenth AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2020, New York, NY, USA, February 7-12, 2020. AAAI Press, 2020, pp. 10 276–10 283. [Online]. Available: ojs.aaai.org/index.php/A…

[229] A. Bennett and N. Kallus, "Proximal reinforcement learning: Efficient offpolicy evaluation in partially observed markov decision processes," CoRR, vol. abs/2110.15332, 2021. [Online]. Available: arxiv.org/abs/2110.15…

[230] M. Gasse, D. Grasset, G. Gaudron, and P.-Y. Oudeyer, "Causal reinforcement learning using observational and interventional data," arXiv preprint arXiv:2106.14421, 2021.

[231] L. Wang, Z. Yang, and Z. Wang, "Provably efficient causal reinforcement learning with confounded observational data," in Advances in Neural Information Processing Systems, A. Beygelzimer, Y. Dauphin, P. Liang, and J. W. Vaughan, Eds., 2021. [Online]. Available: openreview.net/forum?id=tU…

[232] Y. Chen, L. Xu, C. Gulcehre, T. L. Paine, A. Gretton, N. de Freitas, and A. Doucet, "On instrumental variable regression for deep offline policy evaluation," 2021. [Online]. Available: arxiv.org/abs/2105.10…

[233] A. Hussein, M. M. Gaber, E. Elyan, and C. Jayne, "Imitation learning: A survey of learning methods," ACM Computing Surveys (CSUR), vol. 50, no. 2, pp. 1–35, 2017.

[234] P. de Haan, D. Jayaraman, and S. Levine, "Causal confusion in imitation learning," in Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, NeurIPS 2019, December 8-14, 2019, Vancouver, BC, Canada, H. M. Wallach, H. Larochelle,A. Beygelzimer, F. d'Alché-Buc, E. B. Fox, and R. Garnett, Eds., 2019, pp. 11 693–11 704. [Online]. Available: proceedings.neurips.cc/paper/2019/…

[235] J. Tien, J. Z.-Y. He, Z. Erickson, A. D. Dragan, and D. Brown, "A study of causal confusion in preference-based reward learning," arXiv preprint arXiv:2204.06601, 2022.

[236] P. A. Ortega, M. Kunesch, G. Delétang, T. Genewein, J. Grau-Moya, J. Veness, J. Buchli, J. Degrave, B. Piot, J. Perolat, T. Everitt, C. Tallec, E. Parisotto, T. Erez, Y. Chen, S. Reed, M. Hutter, N. de Freitas, and S. Legg, "Shaking the foundations: delusions in sequence models for interaction and control," 2021.

[237] J. Zhang, D. Kumor, and E. Bareinboim, "Causal imitation learning with unobserved confounders," in Advances in Neural Information Processing Systems, H. Larochelle, M. Ranzato, R. Hadsell, M. F. Balcan, and H. Lin, Eds., vol. 33. Curran Associates, Inc., 2020, pp. 12 263– 12 274. [Online]. Available: proceedings.neurips.cc/paper/2020/…

[238] D. Kumor, J. Zhang, and E. Bareinboim, "Sequential causal imitation learning with unobserved confounders," in Advances in Neural Information Processing Systems, A. Beygelzimer, Y. Dauphin, P. Liang, and J. W. Vaughan, Eds., 2021. [Online]. Available: openreview.net/forum?id=o6…

[239] I. Bica, D. Jarrett, and M. van der Schaar, "Invariant causal imitation learning for generalizable policies," in Advances in Neural Information Processing Systems, A. Beygelzimer, Y. Dauphin, P. Liang, and J. W. Vaughan, Eds., 2021. [Online]. Available: openreview.net/forum?id=71…

[240] G. Swamy, S. Choudhury, J. A. Bagnell, and Z. S. Wu, "Causal imitation learning under temporally correlated noise," 2022. [Online]. Available: arxiv.org/abs/2202.01…

[241] A. Hefny, C. Downey, and G. J. Gordon, "Supervised learning for dynamical system learning," Advances in neural information processing systems, vol. 28, 2015.

[242] A. Y. Ng, H. J. Kim, M. I. Jordan, S. Sastry, and S. Ballianda, "Autonomous helicopter flight via reinforcement learning." in NIPS, vol. 16. Citeseer, 2003.

[243] P. G. Wright, Tariff on animal and vegetable oils. Macmillan Company, New York, 1928.

[244] T. Desautels, R. Das, J. Calvert, M. Trivedi, C. Summers, D. J. Wales, and A. Ercole, "Prediction of early unplanned intensive care unit readmission in a uk tertiary care hospital: a cross-sectional machine learning approach," BMJ open, vol. 7, no. 9, p. e017199, 2017.

[245] M. Minsky, "Steps toward artificial intelligence," Proceedings of the IRE, vol. 49, no. 1, pp. 8–30, 1961.

[246] T. Mesnard, T. Weber, F. Viola, S. Thakoor, A. Saade, A. Harutyunyan, W. Dabney, T. S. Stepleton, N. Heess, A. Guez, E. Moulines, M. Hutter, L. Buesing, and R. Munos, "Counterfactual credit assignment in modelfree reinforcement learning," in Proceedings of the 38th International Conference on Machine Learning, ICML 2021, 18-24 July 2021, Virtual Event, ser. Proceedings of Machine Learning Research, M. Meila and T. Zhang, Eds., vol. 139. PMLR, 2021, pp. 7654–7664. [Online]. Available: proceedings.mlr.press/v139/mesnar…

[247] M. Seitzer, B. Schölkopf, and G. Martius, "Causal influence detection for improving efficiency in reinforcement learning," Advances in Neural Information Processing Systems, vol. 34, 2021.

[248] G. Weiss, Multiagent systems: a modern approach to distributed artificial intelligence. MIT press, 1999.

[249] S. Gronauer and K. Diepold, "Multi-agent deep reinforcement learning: a survey," Artificial Intelligence Review, vol. 55, no. 2, pp. 895–943, 2022.

[250] Y.-H. Chang, T. Ho, and L. Kaelbling, "All learning is local: Multi-agent learning in global reward games," Advances in neural information processing systems, vol. 16, 2003.

[251] J. N. Foerster, G. Farquhar, T. Afouras, N. Nardelli, and S. Whiteson, "Counterfactual multi-agent policy gradients," in Proceedings of the ThirtySecond AAAI Conference on Artificial Intelligence, (AAAI-18), the 30th innovative Applications of Artificial Intelligence (IAAI-18), and the 8th AAAI Symposium on Educational Advances in Artificial Intelligence (EAAI-18), New Orleans, Louisiana, USA, February 2-7, 2018, S. A. McIlraith and K. Q. Weinberger, Eds. AAAI Press, 2018, pp. 2974–2982. [Online]. Available: www.aaai.org/ocs/index.p…

[252] N. Jaques, A. Lazaridou, E. Hughes, Ç. Gülçehre, P. A. Ortega, D. Strouse, J. Z. Leibo, and N. de Freitas, "Social influence as intrinsic motivation for multi-agent deep reinforcement learning," in Proceedings of the 36th International Conference on Machine Learning, ICML 2019, 9-15 June 2019, Long Beach, California, USA, ser. Proceedings of Machine Learning Research, K. Chaudhuri and R. Salakhutdinov, Eds., vol. 97. PMLR, 2019, pp. 3040– 3049. [Online]. Available: proceedings.mlr.press/v97/jaques1…

[253] S. Pitis, E. Creager, and A. Garg, "Counterfactual data augmentation using locally factored dynamics," in Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual, H. Larochelle, M. Ranzato, R. Hadsell, M. Balcan, and H. Lin, Eds.,2020. [Online]. Available: proceedings.neurips.cc/paper/2020/…

[254] L. P. Kaelbling, "Learning to achieve goals," in IN PROC. OF IJCAI-93. Morgan Kaufmann, 1993, pp. 1094–1098.

[255] M. Andrychowicz, F. Wolski, A. Ray, J. Schneider, R. Fong, P. Welinder, B. McGrew, J. Tobin, O. Pieter Abbeel, and W. Zaremba, "Hindsight experience replay," in Advances in Neural Information Processing Systems, I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett, Eds., vol. 30. Curran Associates, Inc., 2017. [Online]. Available: proceedings.neurips.cc/paper/2017/…

[256] M. Andrychowicz, A. Raichuk, P. Stańczyk, M. Orsini, S. Girgin, R. Marinier, L. Hussenot, M. Geist, O. Pietquin, M. Michalski et al., "What matters in onpolicy reinforcement learning? a large-scale empirical study," arXiv preprint arXiv:2006.05990, 2020.

[257] M. Laskin, K. Lee, A. Stooke, L. Pinto, P. Abbeel, and A. Srinivas, "Reinforcement learning with augmented data," in Advances in Neural Information Processing Systems, H. Larochelle, M. Ranzato, R. Hadsell, M. Balcan, and H. Lin, Eds., vol. 33. Curran Associates, Inc., 2020, pp. 19 884–19 895. [Online]. Available: proceedings.neurips.cc/paper/2020/…

[258] R. S. Sutton, "Dyna, an integrated architecture for learning, planning, and reacting," ACM Sigart Bulletin, vol. 2, no. 4, pp. 160–163, 1991.

[259] M. Janner, J. Fu, M. Zhang, and S. Levine, "When to trust your model: Model-based policy optimization," in Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, NeurIPS 2019, December 8-14, 2019, Vancouver, BC, Canada, H. M. Wallach, H. Larochelle, A. Beygelzimer, F. d'Alché-Buc, E. B. Fox, and R. Garnett, Eds., 2019, pp. 12 498– 12 509. [Online]. Available: proceedings.neurips.cc/paper/2019/…

[260] C. Lu, B. Huang, K. Wang, J. M. Hernández-Lobato, K. Zhang, and B. Schölkopf, "Sample-efficient reinforcement learning via counterfactualbased data augmentation," in Offline Reinforcement Learning - Workshop at the 34th Conference on Neural Information Processing Systems (NeurIPS), 2020. [Online]. Available: offline-rl-neurips.github.io/pdf/34.pdf

[261] A. Jaiswal, W. AbdAlmageed, Y. Wu, and P. Natarajan, "Bidirectional conditional generative adversarial networks," in Asian Conference on Computer Vision. Springer, 2018, pp. 216–232.

[262] Wikipedia contributors, "Reinventing the wheel," 2022, [Online; accessed 10May-2022]. [Online]. Available: en.wikipedia.org/wiki/Reinve…

[263] J. Zhang and E. Bareinboim, "Markov decision processes with unobserved confounders: A causal approach," Technical report, Technical Report R-23, Purdue AI Lab, Tech. Rep., 2016.

[264] T. Wang, C. Zhou, Q. Sun, and H. Zhang, "Causal attention for unbiased visual recognition," in Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021.

[265] A. Hallak, D. Di Castro, and S. Mannor, "Contextual markov decision processes," arXiv preprint arXiv:1502.02259, 2015.

[266] A. Guez, D. Silver, and P. Dayan, "Efficient bayes-adaptive reinforcement learning using sample-based search," Advances in neural information processing systems, vol. 25, 2012.

[267] J. Zhang and E. Bareinboim, "Can humans be out of the loop?" in First Conference on Causal Learning and Reasoning, 2022. [Online]. Available: openreview.net/forum?id=P0…

[268] Z. Yue, H. Zhang, Q. Sun, and X. Hua, "Interventional few-shot learning," inAdvances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual, H. Larochelle, M. Ranzato, R. Hadsell, M. Balcan, and H. Lin, Eds., 2020. [Online]. Available: proceedings.neurips.cc/paper/2020/…

[269] Y. Wang, Q. Yao, J. T. Kwok, and L. M. Ni, "Generalizing from a few examples: A survey on few-shot learning," ACM computing surveys (csur), vol. 53, no. 3, pp. 1–34, 2020.

[270] K. P. Murphy, Probabilistic Machine Learning: An introduction. MIT Press, 2022. [Online]. Available: probml.ai

[271] X. Li, Z. Zhang, G. Wei, C. Lan, W. Zeng, X. Jin, and Z. Chen, "Confounder identification-free causal visual feature learning," CoRR, vol. abs/2111.13420, 2021. [Online]. Available: arxiv.org/abs/2111.13…

[272] C. Finn, P. Abbeel, and S. Levine, "Model-agnostic meta-learning for fast adaptation of deep networks," in International conference on machine learning. PMLR, 2017, pp. 1126–1135.

[273] I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville, and Y. Bengio, "Generative adversarial nets," Advances in neural information processing systems, vol. 27, 2014.

[274] Y. Liu, R. Cadei, J. Schweizer, S. Bahmani, and A. Alahi, "Towards robust and adaptive motion forecasting: A causal representation perspective," 2021. [Online]. Available: arxiv.org/abs/2111.14…

[275] D. Zhang, H. Zhang, J. Tang, X. Hua, and Q. Sun, "Causal intervention for weakly-supervised semantic segmentation," in Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual, H. Larochelle, M. Ranzato, R. Hadsell, M. Balcan, and H. Lin, Eds., 2020. [Online]. Available: proceedings.neurips.cc/paper/2020/…

[276] J. Huang, Y. Qin, J. Qi, Q. Sun, and H. Zhang, "Deconfounded visual grounding," 2021. [Online]. Available: arxiv.org/abs/2112.15…

[277] Y. Wang and D. M. Blei, "The blessings of multiple causes," 2018. [Online]. Available: arxiv.org/abs/1805.06…

[278] E. L. Ogburn, I. Shpitser, and E. J. T. Tchetgen, "Comment on "blessings of multiple causes"," Journal of the American Statistical Association, vol. 114, no. 528, pp. 1611–1615, 2019.

[279] A. D'Amour, "On multi-cause approaches to causal inference with unobserved counfounding: Two cautionary failure cases and a promising alternative," inThe 22nd International Conference on Artificial Intelligence and Statistics. PMLR, 2019, pp. 3478–3486.

[280] M. Otani, Y. Nakashima, E. Rahtu, and J. Heikkilä, "Uncovering hidden challenges in query-based video moment retrieval," 2020. [Online]. Available: arxiv.org/abs/2009.00…

[281] G. Nan, R. Qiao, Y. Xiao, J. Liu, S. Leng, H. Zhang, and W. Lu, "Interventional video grounding with dual contrastive learning," in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021, pp. 2765–2775.

[282] X. Yang, F. Feng, W. Ji, M. Wang, and T.-S. Chua, "Deconfounded video moment retrieval with causal intervention," in Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021, pp. 1–10.

[283] Y. Goyal, Z. Wu, J. Ernst, D. Batra, D. Parikh, and S. Lee, "Counterfactual visual explanations," in Proceedings of the 36th International Conference on Machine Learning, ser. Proceedings of Machine Learning Research, K. Chaudhuri and R. Salakhutdinov, Eds., vol. 97. PMLR, 09–15 Jun 2019, pp. 2376–2384. [Online]. Available: proceedings.mlr.press/v97/goyal19…

[284] L. A. Hendricks, R. Hu, T. Darrell, and Z. Akata, "Generating counterfactual explanations with natural language," CoRR, vol. abs/1806.09809, 2018. [Online]. Available: arxiv.org/abs/1806.09…

[285] Z. Yue, T. Wang, H. Zhang, Q. Sun, and X.-S. Hua, "Counterfactual Zero-Shot and Open-Set Visual Recognition," arXiv:2103.00887 [cs], Mar. 2021, arXiv: 2103.00887. [Online]. Available: arxiv.org/abs/2103.00…

[286] J. Pearl, "Brief report: On the consistency rule in causal inference: "axiom, definition, assumption, or theorem?"," Epidemiology, vol. 21, no. 6, pp. 872–875, 2010. [Online]. Available: www.jstor.org/stable/2078…

[287] X. Zhang, Y. Wong, X. Wu, J. Lu, M. Kankanhalli, X. Li, and W. Geng, "Learning causal representation for training cross-domain pose estimator via generative interventions," in Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021, pp. 11 270–11 280.

[288] Y. Li, A. Torralba, A. Anandkumar, D. Fox, and A. Garg, "Causal discovery in physical systems from videos," Advances in Neural Information Processing Systems, vol. 33, pp. 9180–9192, 2020.

[289] T. D. Kulkarni, A. Gupta, C. Ionescu, S. Borgeaud, M. Reynolds, A. Zisserman, and V. Mnih, "Unsupervised learning of object keypoints for perception and control," Advances in neural information processing systems, vol. 32, 2019.

[290] S. Janny, F. Baradel, N. Neverova, M. Nadri, G. Mori, and C. Wolf, "Filteredcophy: Unsupervised learning of counterfactual physics in pixel space," arXiv preprint arXiv:2202.00368, 2022.

[291] D. Maliniak, R. Powers, and B. F. Walter, "The gender citation gap in international relations," International Organization, vol. 67, no. 4, pp. 889–922, 2013.

[292] A. Feder, K. A. Keith, E. Manzoor, R. Pryzant, D. Sridhar, Z. Wood-Doughty, J. Eisenstein, J. Grimmer, R. Reichart, M. E. Roberts, B. M. Stewart, V. Veitch, and D. Yang, "Causal inference in natural language processing: Estimation, prediction, interpretation and beyond," CoRR, vol. abs/2109.00725, 2021.

[293] C. Fong and J. Grimmer, "Discovery of treatments from text corpora," inProceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2016, pp. 1600–1609.

[294] R. Pryzant, K. Shen, D. Jurafsky, and S. Wagner, "Deconfounded lexicon induction for interpretable social science," in Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), 2018, pp. 1615–1625.

[295] M. T. Ribeiro, T. Wu, C. Guestrin, and S. Singh, "Beyond accuracy: Behavioral testing of NLP models with checklist," CoRR, vol. abs/2005.04118, 2020.

[296] A. Ross, T. Wu, H. Peng, M. E. Peters, and M. Gardner, "Tailor: Generating and perturbing text with semantic controls," CoRR, vol. abs/2107.07150, 2021.

[297] R. Adragna, E. Creager, D. Madras, and R. S. Zemel, "Fairness and robustness in invariant learning: A case study in toxicity classification," CoRR, vol. abs/2011.06485, 2020.

[298] S. Garg, V. Perot, N. Limtiaco, A. Taly, E. H. Chi, and A. Beutel, "Counterfactual fairness in text classification through robustness," CoRR, vol. abs/1809.10610, 2018.

[299] Q. Liu, M. J. Kusner, and P. Blunsom, "Counterfactual data augmentation for neural machine translation," in NAACL-HLT. Association for Computational Linguistics, 2021, pp. 187–197.

[300] D. Kiela, H. Firooz, A. Mohan, V. Goswami, A. Singh, P. Ringshia, and D. Testuggine, "The hateful memes challenge: Detecting hate speech in multimodal memes," 2020. [Online]. Available: arxiv.org/abs/2005.04…