Causality for NLP Reading List

This repository lists papers on causality for natural language processing (NLP).

Contributor: Zhijing Jin. Welcome to be a collaborator, -- you can make an issue/pull request, and I can add you :).

Contents (Actively Updating)

1. Causality Basics
2. Causality Applied to General NLP
3. Causality for Various Applications
4. More Resources
Contributions
How to Cite This Repo

1. Causality Basics

1.1 Talks, Tutorial, etc

Talks and Tutorials:

(CausalNLP) Bernhard's Talk on Towards Causal NLP. [Video@EMNLP 2021 Workshop]
(Vivid, beginner-friendly) Yoshua Bengio's Primer on the Future of Causality&NLP. [Video@ELLIS NLP Workshop]

Seminars:

(Global, weekly reading group) Online Causal Inference Seminar. Organized by Stanford, ETH, etc. [speakers] [past recordings]

Every Tuesdays at 8:30 am PT (11:30 am ET / 4:30 pm London / 5:30 pm Berlin).
(Current) Causal Seminar@Harvard Data Science Institute. [talks]
(2021, Recordings available) Beyond i.i.d. learning: Causality, dynamics, and interactions. ETH Seminar by Prof Michael Muehlebach, Bernhard Schölkopf, Andreas Krause. [past recordings]

Motivational Materials:

(2002 AI Magazine) Reasoning with Cause and Effect. Judea Pearl. [pdf]
(Blog) ML beyond Curve Fitting: An Intro to Causal Inference and do-Calculus. Ferenc Huszár. [blog]
(Blog) Causal Analysis in Theory and Practice. Judea Pearl's [blog]
(Videos course to introduce a series of concepts) Introduction to Causal Inference [Video]

1.2 Overview Papers

(2021 IEEE, Overview by Schoelkopf+) Towards Causal Representation Learning. Bernhard Schölkopf, Francesco Locatello, Stefan Bauer, Nan Rosemary Ke, Nal Kalchbrenner, Anirudh Goyal, Yoshua Bengio. [pdf]
(2021 Survey) Causal Inference in Natural Language Processing: Estimation, Prediction, Interpretation and Beyond. Amir Feder, Katherine A. Keith, Emaad Manzoor, Reid Pryzant, Dhanya Sridhar, Zach Wood-Doughty, Jacob Eisenstein, Justin Grimmer, Roi Reichart, Margaret E. Roberts, Brandon M. Stewart, Victor Veitch, Diyi Yang. [pdf]
(2019 Overview, Schoelkopf) Causality for Machine Learning. Bernhard Schölkopf. [pdf]
(2018 ACM CSUR) A Survey of Learning Causality with Data: Problems and Methods. Ruocheng Guo, Lu Cheng, Jundong Li, P. Richard Hahn, Huan Liu. [pdf]
(2015 Political Analysis) The Statistics of Causal Inference: A View from Political Methodology. Luke Keele. [pdf]

1.3 Toolboxes

Causal Discovery

(2021) causal-learn (Python package for causal discovery). Carnegie Mellon University. [GitHub] [documentation]
(2019) Causal Discovery Toolbox in Python. [GitHub] [pdf]
Causal discovery tools. University of Pittsburgh/Carnegie Mellon University Center for Causal Discovery. [link]
e.g., Tretrad, py-causal

Causal Effect Estimation

(2020) DoWhy: An End-to-End Library for Causal Inference. Amit Sharma, Emre Kiciman. [GitHub] [pdf]
(2023) Ananke: A module for causal inference. Jaron J. R. Lee, Rohit Bhattacharya, Razieh Nabi, Ilya Shpitser. [link] [pdf]
(2020) CausalML: Python Package for Causal Machine Learning. [GitHub] [pdf]

Papers that give a taxonomy of methods

(2021, Survey on continuous optimization for causal discovery) D’ya like DAGs? A Survey on Structure Learning and Causal Discovery. Matthew J. Vowels, Necati Cihan Camgoz, Richard Bowden. [pdf]
(2018 ACM CSUR) A Survey of Learning Causality with Data: Problems and Methods. Ruocheng Guo, Lu Cheng, Jundong Li, P. Richard Hahn, Huan Liu. [pdf]
(2018, National Science Review) Learning causality and causality-related learning: some recent progress. Kun Zhang, Bernhard Schölkopf, Peter Spirtes, Clark Glymour. [pdf]
(2016) Causal discovery and inference: Concepts and recent methodological advances. [pdf]
(2019 Front. Genet.) Review of Causal Discovery Methods Based on Graphical Models. Clark Glymour, Kun Zhang, Peter Spirtes. [pdf]

2. Causality Applied to General NLP

2.1 Overview

(2024 arXiv) Large Language Models and Causal Inference in Collaboration: A Comprehensive Survey. Xiaoyu Liu, Paiheng Xu, Junda Wu, Jiaxin Yuan, Yifan Yang, Yuhang Zhou, Fuxiao Liu, Tianrui Guan, Haoliang Wang, Tong Yu, Julian McAuley, Wei Ai, Furong Huang. [pdf]
(2022 TACL) Causal Inference in Natural Language Processing: Estimation, Prediction, Interpretation and Beyond. Amir Feder, Katherine A. Keith, Emaad Manzoor, Reid Pryzant, Dhanya Sridhar, Zach Wood-Doughty, Jacob Eisenstein, Justin Grimmer, Roi Reichart, Margaret E. Roberts, Brandon M. Stewart, Victor Veitch, Diyi Yang. [pdf]
(2024 arXiv) The Odyssey of Commonsense Causality: From Foundational Benchmarks to Cutting-Edge Reasoning. Shaobo Cui, Zhijing Jin, Bernhard Schölkopf, Boi Faltings. [pdf]

2.2 Causal Reasoning

2.2.1 Formal Causal Reasoning

Whether LLMs can do causal reasoning with established solutions

(2023 NeurIPS) CLadder: Assessing Causal Reasoning in Language Models. Zhijing Jin*, Yuen Chen*, Felix Leeb*, Luigi Gresele*, Ojasv Kamal, Zhiheng LYU, Kevin Blin, Fernando Gonzalez Adauto, Max Kleiman-Weiner, Mrinmaya Sachan, Bernhard Schölkopf. [pdf]
(2024 ICLR) Can Large Language Models Infer Causation from Correlation? Zhijing Jin*, Jiarui Liu*, Zhiheng Lyu, Spencer Poff, Mrinmaya Sachan, Rada Mihalcea, Mona Diab†, Bernhard Schölkopf†. [pdf]
(2023 TMLR) Causal Parrots: Large Language Models May Talk Causality But Are Not Causal. Matej Zečević, Moritz Willig, Devendra Singh Dhami, Kristian Kersting. [pdf]
(2024 TMLR) Causal Reasoning and Large Language Models: Opening a New Frontier for Causality. Emre Kıcıman, Robert Ness, Amit Sharma, Chenhao Tan. [pdf]
(2024 ACL Findings) Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data. Xiao Liu, Zirui Wu, Xueqing Wu, Pan Lu, Kai-Wei Chang, Yansong Feng. [pdf]
(2024 arXiv) LLMs Are Prone to Fallacies in Causal Inference.

Nitish Joshi, Abulhair Saparov, Yixin Wang, He He.

[pdf]
(2024 arXiv) CLEAR: Can Language Models Really Understand Causal Graphs? Sirui Chen, Mengying Xu, Kun Wang, Xingyu Zeng, Rui Zhao, Shengjie Zhao, Chaochao Lu. [pdf]
(2024 arXiv) Evaluating Interventional Reasoning Capabilities of Large Language Models. Tejas Kasetty, Divyat Mahajan, Gintare Karolina Dziugaite, Alexandre Drouin, Dhanya Sridhar. [pdf]

LLMs to Help Causal Effect Estimation

(2024 arXiv) End-To-End Causal Effect Estimation from Unstructured Natural Language Data. Nikita Dhawan, Leonardo Cotta, Karen Ullrich, Rahul G. Krishnan, Chris J. Maddison. [pdf]

LLMs to Help Causal Relation Discovery

(2023 SPIGM Workshop at ICML) Causal Discovery with Language Models as Imperfect Experts. Stephanie Long, Alexandre Piché, Valentina Zantedeschi, Tibor Schuster, Alexandre Drouin. [pdf]
(2024 arXiv) Efficient Causal Graph Discovery Using Large Language Models. Thomas Jiralerspong, Xiaoyin Chen, Yash More, Vedant Shah, Yoshua Bengio. [pdf]
(2023 arxiv) From Query Tools to Causal Architects: Harnessing Large Language Models for Advanced Causal Discovery from Data. Taiyu Ban, Lyvzhou Chen, Xiangyu Wang, Huanhuan Chen. [pdf]
(2024 arviv) ALCM: Autonomous LLM-Augmented Causal Discovery Framework. Elahe Khatibi, Mahyar Abbasian, Zhongqi Yang, Iman Azimi, Amir M. Rahmani. [pdf]
(2024 arxiv) Large Language Models are Effective Priors for Causal Graph Discovery. Victor-Alexandru Darvariu, Stephen Hailes, Mirco Musolesi. [pdf]
(2024 arxiv) Multi-Agent Causal Discovery Using Large Language Models. Hao Duong Le, Xin Xia, Zhang Chen. [pdf]
(2024 arxiv) Bridging Causal Discovery and Large Language Models: A Comprehensive Survey of Integrative Approaches and Future Directions. Guangya Wan, Yuqi Wu, Mengxuan Hu, Zhixuan Chu, Sheng Li. [pdf]
(2024 arxiv) Large Language Models for Constrained-Based Causal Discovery. Kai-Hendrik Cohrs, Gherardo Varando, Emiliano Diaz, Vasileios Sitokonstantinou, Gustau Camps-Valls. [pdf]
(2024 arxiv) Causal Graph Discovery with Retrieval-Augmented Generation based Large Language Models. Yuzhe Zhang, Yipeng Zhang, Yidong Gan, Lina Yao, Chen Wang. [pdf]
(2024 SSRN) Towards Automating Causal Discovery in Financial Markets and Beyond. Alik Sokolov, Fabrizzio Sabelli, Behzad Azadie faraz, Wuding Li, Luis Seco. [pdf]
(2024 COLM) LLM4Causal: Democratized Causal Tools for Everyone via Large Language Model. Haitao Jiang, Lin Ge, Yuhe Gao, Jianian Wang, Rui Song. [pdf]
(2024 arxiv) Knowledge Graph Structure as Prompt: Improving Small Language Models Capabilities for Knowledge-based Causal Discovery. Yuni Susanti, Michael Färber. [pdf]
(2024 ACL) Open Event Causality Extraction by the Assistance of LLM in Task Annotation, Dataset, and Method. Kun Luo, Tong Zhou, Yubo Chen, Jun Zhao, Kang Liu. [pdf]
(2023 arxiv) Causal Discovery with Language Models as Imperfect Experts. Stephanie Long, Alexandre Piché, Valentina Zantedeschi, Tibor Schuster, Alexandre Drouin. [pdf]
(2024 ACL) Identifying while Learning for Document Event Causality Identification. Cheng Liu, Wei Xiang, Bang Wang. [pdf]

2.2.2 Commonsense/Empirical Causal Reasoning

(2024 arXiv) The Odyssey of Commonsense Causality: From Foundational Benchmarks to Cutting-Edge Reasoning. Shaobo Cui, Zhijing Jin, Bernhard Schölkopf, Boi Faltings. [pdf]
(2022 arXiv) Causal Inference Principles for Reasoning about Commonsense Causality. Jiayao Zhang, Hongming Zhang, Dan Roth, Weijie J. Su. [pdf]
(2021 ACL Findings) Empowering Language Understanding with Counterfactual Reasoning. Fuli Feng, Jizhi Zhang, Xiangnan He, Hanwang Zhang, Tat-Seng Chua. [pdf]
(2021 ARR) e-CARE: a New Dataset for Exploring Explainable Causal Reasoning. Anonymous. [pdf]
(2020 EMNLP) GLUCOSE: GeneraLized and COntextualized Story Explanations. Nasrin Mostafazadeh, Aditya Kalyanpur, Lori Moon, David Buchanan, Lauren Berkowitz, Or Biran, Jennifer Chu-Carroll. [pdf]
(2019 EMNLP) Counterfactual Story Reasoning and Generation. Lianhui Qin, Antoine Bosselut, Ari Holtzman, Chandra Bhagavatula, Elizabeth Clark, Yejin Choi. [pdf]
(2020 IJCAI) Guided Generation of Cause and Effect. Zhongyang Li, Xiao Ding, Ting Liu, J. Edward Hu, Benjamin Van Durme. [pdf] [video]
[Summary] 314 million automatically extracted cause and effects.

2.2.3 Both Types of Causal Reasoning

(2024 AAAI Workshop) A Critical Review of Causal Reasoning Benchmarks for Large Language Models.

Linying Yang, Vik Shirvaikar, Oscar Clivio, Fabian Falck.

[pdf]
(2024 arXiv) C2P: Featuring Large Language Models with Causal Reasoning.

Abdolmahdi Bagheri, Matin Alinejad, Kevin Bello, Alireza Akhondi-Asl.

[pdf]

2.3 Interpretability by Causal Methods

(2023 ACL) A Causal Framework to Quantify the Robustness of Mathematical Reasoning in Language Models. Alessandro Stolfo*, Zhijing Jin*, Kumar Shridhar, Bernhard Schoelkopf, Mrinmaya Sachan. [pdf]
(2022 ACL Findings) Interpreting the Robustness of Neural NLP Models to Textual Perturbations. Yunxiang Zhang, Liangming Pan, Samson Tan, Min-Yen Kan. [pdf]
(2020 NeurIPS Spotlight) Causal Mediation Analysis for Interpreting Neural NLP: The Case of Gender Bias. Jesse Vig, Sebastian Gehrmann, Yonatan Belinkov, Sharon Qian, Daniel Nevo, Simas Sakenis, Jason Huang, Yaron Singer, Stuart Shieber. [pdf]
[Summary] Cause: input text, mediator: some neurons, effect: output prediction
(2021 NeurIPS) Causal Abstractions of Neural Networks. Atticus Geiger, Hanson Lu, Thomas Icard, Christopher Potts. [pdf]
(2021 arXiv) Probing Classifiers: Promises, Shortcomings, and Advances. Yonatan Belinkov. [pdf]
(2021 ACL) Causal Analysis of Syntactic Agreement Mechanisms in Neural Language Models. Matthew Finlayson, Aaron Mueller, Sebastian Gehrmann, Stuart Shieber, Tal Linzen, Yonatan Belinkov. [pdf]
(2021 CoNLL) Counterfactual Interventions Reveal the Causal Effect of Relative Clause Representations on Agreement Prediction. Shauli Ravfogel, Grusha Prasad, Tal Linzen, Yoav Goldberg. [pdf]
(2021 arXiv) Causal Distillation for Language Models. Zhengxuan Wu, Atticus Geiger, Josh Rozner, Elisa Kreiss, Hanson Lu, Thomas Icard, Christopher Potts, Noah D. Goodman. [pdf]
(2021 arXiv) Inducing Causal Structure for Interpretable Neural Networks. Atticus Geiger, Zhengxuan Wu, Hanson Lu, Josh Rozner, Elisa Kreiss, Thomas Icard, Noah D. Goodman, Christopher Potts. [pdf] [slides]
(2020 CL) CausaLM: Causal Model Explanation Through Counterfactual Language Models. Amir Feder, Nadav Oved, Uri Shalit, Roi Reichart. [pdf]
(2020 TACL) Amnesic Probing: Behavioral Explanation with Amnesic Counterfactuals. Yanai Elazar, Shauli Ravfogel, Alon Jacovi, Yoav Goldberg. [pdf]
(2020 ICLR) Learning the Difference that Makes a Difference with Counterfactually-Augmented Data. Divyansh Kaushik, Eduard Hovy, Zachary C. Lipton. [pdf]

2.4 Text Features in Causal Graphs (for Social Science, Psychology, etc.)

(2021 EMNLP Findings) Mining the Cause of Political Decision-Making from Social Media: A Case Study of COVID-19 Policies across the US States Zhijing Jin, Zeyu Peng, Tejas Vaidhya, Bernhard Schoelkopf, Rada Mihalcea. [pdf] [talk]
(2021 arXiv) Generating Synthetic Text Data to Evaluate Causal Inference Methods. Zach Wood-Doughty, Ilya Shpitser, Mark Dredze. [pdf]
(2020 ACL) Text and Causal Inference: A Review of Using Text to Remove Confounding from Causal Estimates. Katherine A. Keith, David Jensen, and Brendan O'Connor. [pdf]
(2020 UAI) Adapting Text Embeddings for Causal Inference. Victor Veitch, Dhanya Sridhar, David M. Blei. [pdf]
(2020 AJPS) Adjusting for confounding with text matching. Margaret E Roberts, Brandon M Stewart, and Richard A Nielsen. [pdf]
(2020 arXiv) Adjusting for Confounders with Text: Challenges and an Empirical Evaluation Framework for Causal Inference. Galen Weld, Peter West, Maria Glenski, David Arbour, Ryan Rossi, Tim Althoff. [pdf]
(2020 CSCW) Quantifying the Causal Effects of Conversational Tendencies. Justine Zhang, Sendhil Mullainathan, Cristian Danescu-Niculescu-Mizil. [pdf]
(2020 arXiv) Causal Effects of Linguistic Properties. Reid Pryzant, Dallas Card, Dan Jurafsky, Victor Veitch, Dhanya Sridhar. [pdf] [blog and github]
[Summary] Cause: binary writer intent, confounder: other linguistic habits of the writer, mediator: text by the writer, effect: reader's response time
(2020 arXiv) Decoupling entrainment from consistency using deep neural networks. [pdf]
[Note] Entrainment = speakers adapting to conversation partners so as to become more similar
(2018 EMNLP) Challenges of Using Text Classifiers for Causal Inference. Zach Wood-Doughty, Ilya Shpitser, Mark Dredze. [pdf]
(2018 Political Analysis) Matching with text data: An experimental evaluation of methods for matching documents and of measuring match quality. Reagan Mozer, Luke Miratrix, Aaron Russell Kaufman, L Jason Anastasopoulos. [pdf]
(2018 arXiv) How to Make Causal Inferences Using Texts. Naoki Egami, Christian J. Fong, Justin Grimmer, Margaret E. Roberts, Brandon M. Stewart. [pdf]
(2017 EMNLP) Detecting and Explaining Causes From Text For a Time Series Event. Dongyeop Kang, Varun Gangal, Ang Lu, Zheng Chen, Eduard Hovy. [pdf]
[Summary] Finding causes for the stock price time series.
(2016 ACL) Discovery of Treatments from Text Corpora. Christian Fong, Justin Grimmer. [pdf]
(2016 JMLR) Learning representations for counterfactual inference. Fredrik Johansson, Uri Shalit, David Sontag. [pdf]

2.5 Causality to Bring Insights to NLP Modeling (for Robustness, Domain Adaptation, etc)

(2023 ICML) Towards Trustworthy Explanation: On Causal Rationalization. Wenbo Zhang, Tong Wu, Yunlong Wang, Yong Cai, Hengrui Cai.[pdf]
(2023) Towards Trustworthy and Aligned Machine Learning: A Data-centric Survey with Causality Perspectives. Haoyang Liu, Maheep Chaudhary, Haohan Wang. [pdf]
(2021 EMNLP Oral) Causal Direction of Data Collection Matters: Implications of Causal and Anticausal Learning for NLP. Zhijing Jin*, Julius von Kügelgen*, Jingwei Ni, Tejas Vaidhya, Ayush Kaushal, Mrinmaya Sachan, Bernhard Schölkopf. [pdf] [talk]
(2021 arXiv) Counterfactual Invariance to Spurious Correlations: Why and How to Pass Stress Tests. Victor Veitch, Alexander D'Amour, Steve Yadlowsky, Jacob Eisenstein. [pdf]
(2021 CVPR) Counterfactual VQA: A Cause-Effect Look at Language Bias. Yulei Niu, Kaihua Tang, Hanwang Zhang, Zhiwu Lu, Xian-Sheng Hua, Ji-Rong Wen. [pdf]
(2021 ICLR Workshop) A Causal Lens for Controllable Text Generation. Zhiting Hu, Li Erran Li. [pdf]
(2021 EMNLP) Uncovering Main Causalities for Long-tailed Information Extraction. Guoshun Nan, Jiaqi Zeng, Rui Qiao, Zhijiang Guo, Wei Lu. [pdf]
(2021 ACL findings) Discovering Topics in Long-tailed Corpora with Causal Intervention. Xiaobao Wu, Chunping Li, Yishu Miao. [pdf]
(2020 EMNLP) Unsupervised Discovery of Implicit Gender Bias. Anjalie Field, Yulia Tsvetkov. [pdf]
[Summary] Method: propensity matching and adversarial learning.
(2020 EMNLP) Counterfactual Generator: A Weakly-Supervised Method for Named Entity Recognition. Xiangji Zeng, Yunliang Li, Yuchen Zhai, Yin Zhang. [pdf]
(2020 EMNLP) De-Biased Court’s View Generation with Causality. Yiquan Wu, Kun Kuang, Yating Zhang, Xiaozhong Liu, Changlong Sun, Jun Xiao, Yueting Zhuang, Luo Si, Fei Wu. [pdf]
(2020 EMNLP Findings) Identifying Spurious Correlations for Robust Text Classification. Zhao Wang, Aron Culotta. [pdf]
(2020 EMNLP) Counterfactual Off-Policy Training for Neural Dialogue Generation. Qingfu Zhu, Weinan Zhang, Ting Liu, William Yang Wang. [pdf]
(2019 EMNLP) Topics to Avoid: Demoting Latent Confounds in Text Classification. Sachin Kumar, Shuly Wintner, Noah A. Smith, Yulia Tsvetkov. [pdf]
[Summary] Cause: native language, confounder: topic, effect: text
(2018 NAACL, Stanford) Deconfounded lexicon induction for interpretable social science. Reid Pryzant, Kelly Shen, Dan Jurafsky, Stefan Wagner. [pdf]
[Summary] Cause: some keywords, effect: output prediction

Related NLP Papers

Data augmentation

(2021 NAACL) Counterfactual Data Augmentation for Neural Machine Translation. Qi Liu, Matt Kusner, Phil Blunsom. [pdf]
[Summary] First do phrase alignment between source and target sentences, and then only change some phrases in the source sentence, expecting the target sentence also only changes by that key phrase. Not much usage of causality.
(2019 ACL) Counterfactual Data Augmentation for Mitigating Gender Stereotypes in Languages with Rich Morphology. Ran Zmigrod, Sabrina J. Mielke, Hanna Wallach, Ryan Cotterell. [pdf]
[Summary] Change female words to male words in languages with rich morphology and inflections. Not much usage of causality.

Compositionality and Neuro-Symbolic Approaches

(2022 arXiv) Compositionality as Lexical Symmetry. Ekin Akyürek, Jacob Andreas. [pdf]
(2021 NeurIPS) Improving Coherence and Consistency in Neural Sequence Models with Dual-System, Neuro-Symbolic Reasoning. Maxwell Nye, Michael Henry Tessler, Joshua B. Tenenbaum, Brenden M. Lake. [pdf]

Related Non-NLP Papers

(2021 arXiv) Desiderata for Representation Learning: A Causal Perspective. Yixin Wang, Michael I. Jordan. [pdf]
[Summary] The causal predictors of a task should be both necessary and sufficient factors.

Causality tools that can be applied to deconfound

(2017 NIPS, MPI, discover causal graphs behind data) Avoiding Discrimination through Causal Reasoning. Niki Kilbertus, Mateo Rojas-Carulla, Giambattista Parascandolo, Moritz Hardt, Dominik Janzing, Bernhard Schölkopf. [pdf]

Related CV papers on counterfactual generation:

(2021 ICLR) Counterfactual Generative Networks. Axel Sauer, Andreas Geiger. [pdf]

2.6 Causal Relation Extraction

Surveys and Reviews

CREST: A Causal Relation Schema for Text (A repo containing datasets for causal/counterfactual relation extraction) Pedram Hosseini. [GitHub]
[Summary] CausalRE datasets: SemEval 2007 Task 4 (114 causal sentences), SemEval 2021 Task 8 (1,331), EventCausality (485), Causal-TimeBank (318), EventStoryLine v1.5 (2,608), CaTeRS (308), BECauSE v2.1 (554), Choice of Plausible Alternatives (COPA) (1,000), The Penn Discourse Treebank (PDTB) 3.0 (7,991).
(2020 COLING) A Review of Dataset and Labeling Methods for Causality Extraction. Jinghang Xu, Wanli Zuo, Shining Liang, Xianglin Zuo. [pdf]
[Summary] Very reader-friendly survey, including ideas such as Causal connectives (verb: cause, result; conjunction: because, so; preposition: for, because of; adverb: consequently; Verb Phrase: result in, lead to; Prepositional phrase: as a result of; Clause: that's why, so that); Causal concepts (sufficient, necessary, temporal); etc.
(2016 arXiv) Automatic Extraction of Causal Relations from Natural Language Texts: A Comprehensive Survey. Nabiha Asghar. [GitHub]
[Summary] Automatic discovery of linguistic patterns expressing causal relations, such as 1995's work using patterns like "because [cause], [effect]", 2002's work using patterns like "NP1-CausativeVerb-NP2", later work using a Naive Bayes classifier, and many more.
(2018 CMU Thesis) Annotating and Automatically Tagging Constructions of Causal Language. Jesse Dunietz. [pdf]
(2021 arXiv) A Survey on Extraction of Causal Relations from Natural Language Text. Jie Yang, Soyeon Caren Han, Josiah Poon. [pdf]
Richer Event Description (RED) Annotation Guidelines. Martha Palmer, Will Styler, Kevin Crooks, Tim O'Gorman. [GitHub]
(2020 Thesis) Narrative Generation to Support Causal Exploration of Directed Graphs. Arjun Choudhry. [pdf]

Method or Dataset Papers

Causal relation extraction from web data

(2019 EMNLP) Weakly Supervised Multilingual Causality Extraction from Wikipedia. Chikara Hashimoto. [pdf]
[Summary] Cross-verification between Wikipedia and Wikidata, e.g., "Protectionism causes Trade war"
(2016 ACL) Identifying Causal Relations Using Parallel Wikipedia Articles. Christopher Hidey, Kathy McKeown. [pdf]
[Summary] Find linguistic markers like "because".
(2014 ACL) Toward Future Scenario Generation: Extracting Event Causality Exploiting Semantic Relation, Context, and Association Features Chikara Hashimoto, Kentaro Torisawa, Julien Kloetzer, Motoki Sano, István Varga, Jong-Hoon Oh, Yutaka Kidawara. [pdf]
(2012 EMNLP) Excitatory or Inhibitory: A New Semantic Orientation Extracts Contradiction and Causality from the Web. Chikara Hashimoto, Kentaro Torisawa, Stijn De Saeger, Jong-Hoon Oh, Jun’ichi Kazama. [pdf]
[Summary] Extracted one million contradiction pairs and 500,000 causality pairs.
(1998 Literary and Linguistic Computing) Automatic extraction of cause-effect information from newspaper text without knowledge-based inferencing. Christopher Khoo, Jaklin Kornfilt, Sung Hyon Myaeng, Robert Oddy. [pdf]
(1991 Knowledge Acquisition) Knowledge-based acquisition of causal relationships in text. Randy M. Kaplan, Genevieve Berry-Rogghe.

Causal relation extraction from curated datasets (relatively small):

(2021 Neurocomputing) Causality Extraction based on Self-attentive BiLSTM-CRF with Transferred Embeddings.Zhaoning Li, Qi Li, Xiaotian Zou, Jiangtao Ren. [GitHub]
(2021 NAACL) Everything Has a Cause: Leveraging Causal Inference in Legal Text Analysis Xiao Liu, Da Yin, Yansong Feng, Yuting Wu, Dongyan Zhao. [pdf]
(2021 NAACL) Graph Convolutional Networks for Event Causality Identification with Rich Document-level Structures. Minh Tran Phu and Thien Huu Nguyen [pdf]
(2020 EMNLP) XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning. Edoardo Maria Ponti, Goran Glavaš, Olga Majewska, Qianchu Liu, Ivan Vulić, Anna Korhonen. [pdf]
(2020 EMNLP) Causal Inference of Script Knowledge. Noah Weber, Rachel Rudinger, Benjamin Van Durme. [pdf]
(2019 NAACL) Modeling Document-level Causal Structures for Event Causal Relation Identification. Lei Gao, Prafulla Kumar Choubey, Ruihong Huang. [pdf]
[Summary] EventStoryLine corpus; method: Integer Linear Programming (ILP)
(2018 ACL) Joint Reasoning for Temporal and Causal Relations. Qiang Ning, Zhili Feng, Hao Wu, Dan Roth. [pdf]
[Summary] Method: constrained conditional models (CCMs), n integer linear programming (ILP).
(2018 SIGdial) Automatic Extraction of Causal Relations from Text using Linguistically Informed Deep Neural Networks. Tirthankar Dasgupta, Rupsa Saha, Lipika Dey, Abir Naskar. [pdf]
(2017 ACL Workshop) The Event StoryLine Corpus: A New Benchmark for Causal and Temporal Relation Extraction. Tommaso Caselli, Piek Vossen. [pdf]
(2017 ACL) The BECauSE Corpus 2.0: Annotating Causality and Overlapping Relations. Jesse Dunietz, Lori Levin, Jaime Carbonell. [pdf]
(2016 ACL Workshop) CaTeRS: Causal and Temporal Relation Scheme for Semantic Annotation of Event Structures. Nasrin Mostafazadeh, Alyson Grealish, Nathanael Chambers, James Allen, Lucy Vanderwende. [pdf]
(2016 KR) Commonsense Causal Reasoning between Short Texts. Zhiyi Luo, Yuchen Sha, Kenny Q. Zhu, Seung-Won Hwang, Zhongyuan Wang. [pdf]
[Summary] Contains a list of causal cue verbs.
(2015 ACL Workshop) Annotating Causal Language Using Corpus Lexicography of Constructions. Jesse Dunietz, Lori Levin, Jaime Carbonell. [pdf]
[Summary] Quite important. This paper teaches the ontology for causality annotation, e.g., Degrees of Causation (cause, enable, prevent), types of causation (consequence, motivation, purpose), instance (causal connective, cause, effect), linguistic type (causality with no lexical trigger, causality with connectives, temporal language).
(2014 EACL) Annotating Causality in the TempEval-3 Corpus. Paramita Mirza, Rachele Sprugnoli, Sara Tonelli, Manuela Speranza. [pdf]
[Summary] Containing some illustrative examples on page 3.
(2011 ACL) Minimally Supervised Event Causality Identification. Quang Do, Yee Seng Chan, Dan Roth. [pdf]
[Summary] Uses PMI, IDF, etc.
(2009 SemEval) SemEval-2010 Task 8: Multi-Way Classification of Semantic Relations between Pairs of Nominals Iris Hendrickx, Su Nam Kim, Zornitsa Kozareva, Preslav Nakov, Diarmuid Ó Séaghdha, Sebastian Padó, Marco Pennacchiotti, Lorenza Romano, Stan Szpakowicz. [pdf]
(2008 LREC) Building a Corpus of Temporal-Causal Structure. Steven Bethard, William Corvey, Sara Klingenstein, James H. Martin. [pdf]
[Summary] Combining causal and temporal relation extraction, such as "Fuel tanks had [EVENT leaked] and [EVENT contaminated] the soil." <EVENT_leaked, causes, EVENT_contaminated>. Small dataset with 271 causal relations, and 329 BEFORE/AFTER relations.
(2008 LREC) Causal Relation Extraction. Eduardo Blanco, Nuria Castell, Dan Moldovan. [pdf]
(2007 SemEval) SemEval-2007 Task 04: Classification of Semantic Relations between Nominals. Roxana Girju, Preslav Nakov, Vivi Nastase, Stan Szpakowicz, Peter Turney, Deniz Yuret. [pdf]
(2006 IPM) Incremental cue phrase learning and bootstrapping method for causality extraction using cue phrase and word pair probabilities. Du-Seong Chang, Key-Sun Choi. [pdf]
(2004 ICNLP) Causal Relation Extraction Using Cue Phrase and Lexical Pair Probabilities. Du-Seong Chang, Key-Sun Choi. [pdf]
(2003 ACL Workshop) Automatic detection of causal relations for question answering. Roxana Girju. [pdf]
(2002 AAAI) Text Mining for Causal Relations. Roxana Girju, Dan Moldovan. [pdf]
(2001 PAKDD) Semantic Expectation-Based Causation Knowledge Extraction: A Study on Hong Kong Stock Movement Analysis. Boon-Toh Low, Ki Chan, Lei-Lei Choi, Man-Yee Chin, Sin-Ling Lay. [link]
(2000 ACL) Extracting Causal Knowledge from a Medical Database Using Graphical Patterns. Christopher S. G. Khoo, Syin Chan, Yun Niu. [pdf]
(1997) Coatis, an nlp system to locate expressions of actions connected by causality links. Daniela Garcia. [link]
(1995 PhD Thesis) Automatic identification of causal relations in text and their use for improving precision in information retrieval. Christopher Khoo. [pdf]

Some useful tools:

Helping to identify causal verbs, and connectives:

(2005 PhD Thesis) VerbNet: A broad-coverage, comprehensive verb lexicon. Karin Kipper Schuler. [pdf]
(2009 ACL) Using Syntax to Disambiguate Explicit Discourse Connectives in Text. Emily Pitler, Ani Nenkova. [pdf]

Helping to annotate data efficiently and affordably:

(2017 SIGMOD) Snorkel: Fast Training Set Generation for Information Extraction. Alexander J. Ratner, Stephen H. Bach, Henry R. Ehrenberg, Chris Ré. [pdf]

Helping to analyze the semantics of causal events:

(1998 COLING) The Berkeley FrameNet Project. Collin F. Baker, Charles J. Fillmore, John B. Lowe. [pdf]
AMR parsing
(2016 ACL Workshop) The Storyline Annotation and Representation Scheme (StaR): A Proposal. Tommaso Caselli, Piek Vossen. [pdf]
(2014 Report) Guidelines for ECB+ Annotation of Events and their Coreference. Agata Cybulska, Piek Vossen. [pdf]

3. Causality for Various Applications

3.1 Persuasion

(2020 arXiv) Influence via Ethos: On the Persuasive Power of Reputation in Deliberation Online. Emaad Manzoor, George H. Chen, Dokyun Lee, Michael D. Smith. [pdf]
[Summary] Cause: reputation, effect: persuation in debates.
Estimating Causal Effects of Tone in Online Debates. Dhanya Sridhar and Lise Getoor. [pdf]

3.2 Psychology and Behavior

The effect of wording on message propagation: Topic- and author-controlled natural experiments on Twitter. Chenhao Tan, Lillian Lee, Bo Pang. [pdf]
(CHI 2016) Discovering shifts to suicidal ideation from mental health content in social media. Munmun De Choudhury, Emre Kiciman, Mark Dredze, Glen Coppersmith, Mrinal Kumar. [pdf]
[Summary] Method: propensity score matching. Cause: linguistic and social interaction based measures on Reddit text, effect: suicidal attempt.
(2017 CWSM) The Language of Social Support in Social Media and its Effect on Suicidal Ideation Risk. Munmun De Choudhury, Emre Kıcıman. [pdf]
[Summary] Cause: linguistic clues of Reddit comments, effect: suicidal attempt.
(Political Behavior, 2017) Tweetment Effects on the Tweeted: Experimentally Reducing Racist Harassment. Kevin Munger. [pdf]
(2017 ICWSM) Estimating the Effect Of Exercising On Users’ Online Behavior. Seyed Amin Mirlohi Falavarjani, Hawre Hosseini, Zeinab Noorian, Ebrahim Bagheri. [pdf]
[Summary] Cause: offline activities from Foursquare posts (e.g., check-ins at a gym, effect: user interests from topics of their Twitter posts. Discovery: shift in interest reduces significantly after users start exercising.
(2017 CSCW) Distilling the outcomes of personal experiences: A propensity-scored analysis of social media. Alexandra Olteanu, Onur Varol, Emre Kiciman. [pdf]
(2018 ICWSM) Using longitudinal social media analysis to understand the effects of early college alcohol use. Emre Kiciman, Scott Counts, Melissa Gasser. [pdf]
(2018 ICML) Estimating causal effects of exercise from mood logging data. Dhanya Sridhar, Aaron Springer, Victoria Hollis, Steve Whittaker, Lise Getoor.
[Summary] Cause: daily activities, effect: wellness markers (e.g., mood) on EmotiCal, confounder: Text of mood triggers. Confounding adjustment method: Propensity score matching
(2019 ICWSM) A social media study on the effects of psychiatric medication use. Koustuv Saha, Benjamin Sugar, John Torous, Bruno Abrahao, Emre Kıcıman, Munmun De Choudhury. [pdf]
[Summary] Cause: psychiatric drugs, confounder: previous Twitter posts, effect: psychopathology (incl. mood, cognition, depression, anxiety, psychosis, and suicidal ideation). Method: stratified propensity score matching.
(Psychological Science 2019) Increasing vegetable intake by emphasizing tasty and enjoyable attributes: A randomized controlled multisite intervention for taste-focused labeling. _ Bradley Turnwald, Jaclyn Bertoldo, Margaret Perry, Peggy Policastro, Maureen Timmons, Christopher Bosso, Priscilla Connors, Robert Valgenti, Lindsey Pine, Ghislaine Challamel, Christopher Gardner, Alia Crum_. pdf
[Summary] Cause: taste-focused lables, or health-focused labels, effect: vegetable intake. Method: RCT.
(2021 ICWSM) The Effect of Moderation on Online Mental Health Conversations. _ David Wadden, Tal August, Qisheng Li, Tim Althoff_. pdf
[Summary] Cause: moderation in online mental health conversations, effect: psychological improvements. Method: natural experiment (comparing the data right before moderators are introduced to a platform and right after).

3.3 Economics

(2017 arXiv) A deep causal inference approach to measuring the effects of forming group loans in online non-profit microfinance platform. Thai T Pham and Yuanyuan Shen. [pdf]
(2020 Journal of Economic Surveys) Econometrics Meets Sentiment: An Overview of Methodology and Applications. Andres Algaba, David Ardia, Keven Bluteau, Samuel Borms, and Kris Boudt.
[Summary] Use sentiment as a parameter or a variable to model econometric variables.

3.4 Healthcare

Measuring semantic similarity of clinical trial outcomes using deep pre-trained language representations. Anna Koroleva, Sanjay Kamath, Patrick Paroubek. [pdf]

3.5 Judicial Decision

(SSRN 2015) How Judicial Identity Changes the Text of Legal Rulings Michael Gill and Andrew Hall
[Summary] Cause: male or female/white of PoC judge. Method: RCT. [pdf]

3.6 Marketing strategies and sales prediction

Interpretable Neural Architectures for Attributing an Ad’s Performance to its Writing Style. Reid Pryzant, Sugato Basu, Kazoo Sone. [pdf]
(eCOM@SIGIR 2017) Predicting Sales from the Language of Product Descriptions. Reid Pryzant, Young-Joo Chung, Dan Jurafsky [pdf]
[Summary] Cause: product description (e.g., writing styles and word usages), confounder: brand loyalty and price strategies, effect: sales. Method: adversarial training.

4. More Resources

4.1 Causality Papers from Schoelkopf's Lab, MPI

4.1.0 Overview

(2018 ICLR) Learning Causal Mechanisms (ICLR invited talk). [talk]
(2021 Overview) Towards Causal Representation Learning. [pdf]
(2019 Overview) Causality for Machine Learning. [pdf]

4.1.1 Learning Causal "Units" and Mechanisms (i.e., Causal Representation Learning)

(2022 ICLR) The Role of Pretrained Representations for the OOD Generalization of RL Agents. Frederik Träuble, Andrea Dittadi, Manuel Wuthrich, Felix Widmaier, Peter Vincent Gehler, Ole Winther, Francesco Locatello, Olivier Bachem, Bernhard Schölkopf, Stefan Bauer. [pdf]
(2021 ICML) On disentangled representations learned from correlated data. Frederik Träuble, Elliot Creager, Niki Kilbertus, Francesco Locatello, Andrea Dittadi, Anirudh Goyal, Bernhard Schölkopf, Stefan Bauer. [pdf]
(2021 NeurIPS) Self-supervised learning with data augmentations provably isolates content from style. Julius Von Kügelgen, Yash Sharma, Luigi Gresele, Wieland Brendel, Bernhard Schölkopf, Michel Besserve, Francesco Locatello. [pdf]
(2021 ICML) Causal curiosity: Rl agents discovering self-supervised experiments for causal representation learning. Sumedh A Sontakke, Arash Mehrjou, Laurent Itti, Bernhard Schölkopf. [pdf]
(2021 ICLR) Invariant Causal Representation Learning for Out-of-Distribution Generalization. Chaochao Lu, Yuhuai Wu, José Miguel Hernández-Lobato, Bernhard Schölkopf. [pdf]
(2021 ICLR) Learning explanations that are hard to vary. Giambattista Parascandolo, Alexander Neitz, Antonio Orvieto, Luigi Gresele, Bernhard Schölkopf. [pdf]
(2021 AAAI) A theory of independent mechanisms for extrapolation in generative models. Michel Besserve, Rémy Sun, Dominik Janzing, Bernhard Schölkopf. [pdf]
(2020 NeurIPS) Object-Centric Learning with Slot Attention. Francesco Locatello, Dirk Weissenborn, Thomas Unterthiner, Aravindh Mahendran, Georg Heigold, Jakob Uszkoreit, Alexey Dosovitskiy, Thomas Kipf. [pdf]
(2020 ICML) Weakly-supervised disentanglement without compromises. Francesco Locatello, Ben Poole, Gunnar Rätsch, Bernhard Schölkopf, Olivier Bachem, Michael Tschannen. [pdf]
(2019 ICML, Best Paper) Challenging common assumptions in the unsupervised learning of disentangled representations. Francesco Locatello, Stefan Bauer, Mario Lucic, Gunnar Rätsch, Sylvain Gelly, Bernhard Schölkopf, Olivier Bachem. [pdf]
(2019 arXiv) Disentangling factors of variation using few labels. Francesco Locatello, Michael Tschannen, Stefan Bauer, Gunnar Rätsch, Bernhard Schölkopf, Olivier Bachem. [pdf]
(2019 arXiv) Recurrent independent mechanisms. Anirudh Goyal, Alex Lamb, Jordan Hoffmann, Shagun Sodhani, Sergey Levine, Yoshua Bengio, Bernhard Schölkopf. [pdf]
(2018 ICML) Learning independent causal mechanisms. Giambattista Parascandolo, Niki Kilbertus, Mateo Rojas-Carulla, Bernhard Schölkopf. [pdf]
(2018 NeurIPS) Adaptive skip intervals: Temporal abstraction for recurrent dynamical models. Alexander Neitz, Giambattista Parascandolo, Stefan Bauer, Bernhard Schölkopf. [pdf]
(2018 arXiv) Counterfactuals uncover the modular structure of deep generative models. Michel Besserve, Arash Mehrjou, Rémy Sun, Bernhard Schölkopf. [pdf]
(2017 CVPR) Discovering causal signals in images. David Lopez-Paz, Robert Nishihara, Soumith Chintala, Bernhard Scholkopf, Léon Bottou. [pdf]
(2017 NeurIPS) Avoiding Discrimination through Causal Reasoning. Niki Kilbertus, Mateo Rojas-Carulla, Giambattista Parascandolo, Moritz Hardt, Dominik Janzing, Bernhard Schölkopf. [pdf]

4.1.2 Robustness and Invariance (incl. Semi-Supervised Learning, Covariate Shift, Transfer Learning)

(2021 ICLR) Source-free adaptation to measurement shift via bottom-up feature restoration. Cian Eastwood, Ian Mason, Christopher KI Williams, Bernhard Schölkopf. [pdf]
(2016 ICML) Domain adaptation with conditional transferable components. Mingming Gong, Kun Zhang, Tongliang Liu, Dacheng Tao, Clark Glymour, Bernhard Schölkopf. [pdf]
(2021 EMNLP oral) Causal Direction of Data Collection Matters: Implications of Causal and Anticausal Learning for NLP. Zhijing Jin, Julius von Kügelgen, Jingwei Ni, Tejas Vaidhya, Ayush Kaushal, Mrinmaya Sachan, Bernhard Schölkopf. [pdf]
(2020 UAI) Semi-supervised learning, causality, and the conditional cluster assumption. Julius Kügelgen, Alexander Mey, Marco Loog, Bernhard Schölkopf. [pdf]
(2018 JMLR) Invariant models for causal transfer learning. Mateo Rojas-Carulla, Bernhard Schölkopf, Richard Turner, Jonas Peters. [pdf]
(2018 NeurIPS workshop) Generalization in anti-causal learning. Niki Kilbertus, Giambattista Parascandolo, Bernhard Schölkopf. [pdf]
(2012 ICML) On Causal and Anticausal Learning. Bernhard Schölkopf, Dominik Janzing, Jonas Peters, Eleni Sgouritsa, Kun Zhang, Joris Mooij. [pdf]

4.1.3 Causal Discovery

(2021 NeurIPS) DiBS: Differentiable Bayesian Structure Learning. Lars Lorch, Jonas Rothfuss, Bernhard Schölkopf, Andreas Krause. [pdf]
(2021 ICML) Necessary and sufficient conditions for causal feature selection in time series with latent common causes. Atalanti A Mastakouri, Bernhard Schölkopf, Dominik Janzing. [pdf]
(2020 JMLR) Causal Discovery from Heterogeneous/Nonstationary Data.. Biwei Huang, Kun Zhang, Jiji Zhang, Joseph D Ramsey, Ruben Sanchez-Romero, Clark Glymour, Bernhard Schölkopf. [pdf]
(2019 Nature communications) Inferring causation from time series in Earth system sciences. Jakob Runge, Sebastian Bathiany, Erik Bollt, Gustau Camps-Valls, Dim Coumou, Ethan Deyle, Clark Glymour, Marlene Kretschmer, Miguel D Mahecha, Jordi Muñoz-Marí, Egbert H van Nes, Jonas Peters, Rick Quax, Markus Reichstein, Marten Scheffer, Bernhard Schölkopf, Peter Spirtes, George Sugihara, Jie Sun, Kun Zhang, Jakob Zscheischler. [pdf]
(2017 UAI) Causal discovery from temporally aggregated time series. Mingming Gong, Kun Zhang, Bernhard Schölkopf, Clark Glymour, Dacheng Tao. [pdf]
(2016 JMLR) Distinguishing cause from effect using observational data: methods and benchmarks. Joris M Mooij, Jonas Peters, Dominik Janzing, Jakob Zscheischler, Bernhard Schölkopf. [pdf]
(2016 UAI) On the Identifiability and Estimation of Functional Causal Models in the Presence of Outcome-Dependent Selection.. Kun Zhang, Jiji Zhang, Biwei Huang, Bernhard Schölkopf, Clark Glymour. [pdf]
(2016 PNAS) Methods for causal inference from gene perturbation experiments and validation. Nicolai Meinshausen, Alain Hauser, Joris M. Mooij, Jonas Peters, Philip Versteeg, and Peter Bühlmann. [pdf]
(2016 ICML) The arrow of time in multivariate time series. Stefan Bauer, Bernhard Schölkopf, Jonas Peters. [pdf]
(2014 UAI) Inferring latent structures via information inequalities. Rafael Chaves, Lukas Luft, Thiago O Maciel, David Gross, Dominik Janzing, Bernhard Schölkopf. [pdf]
(2015 Journal of the Royal Statistical Society) Causal inference using invariant prediction: identification and confidence intervals. Jonas Peters, Peter Bühlmann, Nicolai Meinshausen. [pdf]
(2008IEEE) Causal inference using the algorithmic Markov condition. Dominik Janzing, Bernhard Schölkopf. [pdf]

4.1.4 Causal Effect Estimation

(2016 PASA) **A causal, data-driven approach to modeling the Kepler d.**ata Dun Wang, David W Hogg, Daniel Foreman-Mackey, Bernhard Schölkopf. [pdf]
(2016 PNAS) Modeling confounding by half-sibling regression. Bernhard Schölkopf, David W Hogg, Dun Wang, Daniel Foreman-Mackey, Dominik Janzing, Carl-Johann Simon-Gabriel, Jonas Peters. [pdf]
(2021 ICML) **Conditional distributional treatment effect with kernel conditional mean embeddings and U-statistic regression Junhyung Park, Uri Shalit, Bernhard Schölkopf, Krikamol Muandet http://proceedings.mlr.press/v139/park21c/park21c.pdf
(2020 NeurIPS) Dual Instrumental Variable Regression. Krikamol Muandet, Arash Mehrjou, Si Kai Lee, Anant Raj. [pdf]
(2021 ICML Spotlight) Proximal Causal Learning with Kernels: Two-Stage Estimation and Moment Restriction. Afsaneh Mastouri, Yuchen Zhu, Limor Gultchin, Anna Korba, Ricardo Silva, Matt J. Kusner, Arthur Gretton, Krikamol Muandet. [pdf]
(2021 JMLR) Counterfactual Mean Embeddings. Krikamol Muandet, Motonobu Kanagawa, Sorawit Saengkyongam, Sanparith Marukatat. [pdf]

4.1.5 Foundational work (theory, ICA, etc.)

Topics: thermodynamic arrow of time, modeling hierarchy (ODEs, macro variables, temporal abstractions), links to ICA.

(2021 NeurIPS) Independent mechanism analysis, a new concept? Luigi Gresele, Julius Von Kügelgen, Vincent Stimper, Bernhard Schölkopf, Michel Besserve. [pdf]
(2020 UAI) The incomplete rosetta stone problem: Identifiability results for multi-view nonlinear ICA Luigi Gresele, Paul K Rubenstein, Arash Mehrjou, Francesco Locatello, Bernhard Schölkopf. [pdf]
(2017 arXiv) Causal consistency of structural equation models Paul K Rubenstein, Sebastian Weichwald, Stephan Bongers, Joris M Mooij, Dominik Janzing, Moritz Grosse-Wentrup, Bernhard Schölkopf. [pdf]
(2016 arXiv) From deterministic ODEs to dynamic structural causal models Paul K Rubenstein, Stephan Bongers, Bernhard Schölkopf, Joris M Mooij. [pdf]
(2016 New Journal of Physics) Algorithmic independence of initial condition and dynamical law in thermodynamics and causal inference Dominik Janzing, Rafael Chaves, Bernhard Schölkopf. [pdf]
(2017 Book by MIT Press) Elements of causal inference: foundations and learning algorithms Jonas Peters, Dominik Janzing, Bernhard Schölkopf. [pdf]

4.2 Causality Papers from Bengio's Lab, MILA

(Summary) Yoshua Bengio's Summary Talk. [Video@ELLIS NLP Workshop]

Motivational Position Papers

(2020 Position Paper, arXiv) Inductive Biases for Deep Learning of Higher-Level Cognition. Anirudh Goyal, Yoshua Bengio. [pdf]
[Summary] We need inductive bias for real understanding and generalization => such as decomposing world knowledge into right causal pieces
(Motivational, position paper, arXiv 2017) The Consciousness Prior. Yoshua Bengio. [pdf]

Applying Causality Knowledge for RL Interaction Design

(2020 arXiv) Visual Concept Reasoning Networks. Taesup Kim, Sungwoong Kim, Yoshua Bengio. [pdf]
[Summary] Modularized visual concept reasoning by split-transform-attend-interact-modulate-merge modules
(RLDM 2017) Independently Controllable Features. Emmanuel Bengio, Valentin Thomas, Joelle Pineau, Doina Precup, Yoshua Bengio. [pdf]
(2017 arXiv) Independently Controllable Factors. Valentin Thomas, Jules Pondard, Emmanuel Bengio, Marc Sarfati, Philippe Beaudoin, Marie-Jean Meurs, Joelle Pineau, Doina Precup, Yoshua Bengio. [pdf]
[Summary] The representations of (1) abstract action, and (2) abstract variables, should be learned together, because the action is about controlling variables. A way to disentangle abstractions from images & videos when we have an agent which can interact with the environment. The right abstractions have the right factors. I.e., there exists a policy and a learnable feature for each such aspect of the environment.

Applying causality to model design

Quote from a blog: "Causality is the idea that you can model something with an abstraction of the real-world process that generated it. When you use compositionality and learning-to-learn to break down objects into their parts and relations, modeling the object as a creation of these parts can be seen as modeling them in a causal way."

Recurrent Independent Mechanisms. Anirudh Goyal, Alex Lamb, Jordan Hoffmann, Shagun Sodhani, Sergey Levine, Yoshua Bengio, Bernhard Schölkopf. [pdf]
[Summary] Divides the overall model into k small subsystems (or modules), Each small module is recurrent
Object Files and Schemata: Factorizing Declarative and Procedural Knowledge in Dynamical Systems. Anirudh Goyal, Alex Lamb, Phanideep Gampa, Philippe Beaudoin, Sergey Levine, Charles Blundell, Yoshua Bengio, Michael Mozer. [pdf]
[Summary] Modularity when designing the model

Causal induction from interventional data

[Background] Changes in distribution is caused by intervention on few causes/mechanisms (in extension of Independent Mechanisms by [Schoelkopf+ 2012])

(2019 arXiv) Learning Neural Causal Models from unknown Interventions. Nan Rosemary Ke, Olexa Bilaniuk, Anirudh Goyal, Stefan Bauer, Hugo Larochelle, Bernhard Schölkopf, Michael C. Mozer, Chris Pal, Yoshua Bengio. [pdf]
[Summary] Aim: make use of the combination of observational and interventional data, Method: continuous optimization and neural networks, Extended setting: when the identity of the intervened upon variable is unknown
(NeurIPS 2020 Spotlight) Differentiable Causal Discovery from Interventional Data. Philippe Brouillard, Sébastien Lachapelle, Alexandre Lacoste, Simon Lacoste-Julien, Alexandre Drouin. [pdf] [review]
[Summary] Closely related to Ke+ 2019, Method: continuous-constrained framework + normalizing flows
(2019 ICLR) Learning Dynamics Model in Reinforcement Learning by Incorporating the Long Term Future. Nan Rosemary Ke, Amanpreet Singh, Ahmed Touati, Anirudh Goyal, Yoshua Bengio, Devi Parikh, Dhruv Batra. [pdf]
(2020 ICLR) A Meta-Transfer Objective for Learning to Disentangle Causal Mechanisms. Yoshua Bengio, Tristan Deleu, Nasim Rahaman, Rosemary Ke, Sébastien Lachapelle, Olexa Bilaniuk, Anirudh Goyal, Christopher Pal. [pdf]

Grounded AI

(2021 ICLR) CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and Transfer Learning. Ossama Ahmed, Frederik Träuble, Anirudh Goyal, Alexander Neitz, Yoshua Bengio, Bernhard Schölkopf, Manuel Wüthrich, Stefan Bauer. [pdf]
[Summary] An RL dataset to model do-interventions and discover causality
(2019 ICLR) BabyAI: A Platform to Study the Sample Efficiency of Grounded Language Learning. Maxime Chevalier-Boisvert, Dzmitry Bahdanau, Salem Lahlou, Lucas Willems, Chitwan Saharia, Thien Huu Nguyen, Yoshua Bengio. [pdf]
[Summary] (not yet causal) synthetic language acquisition, A simulation platform, to do language instructions with a simulated human in the loop, Allows curriculum learning for 19 levels

4.3 Other Causality Papers (Potentially Applicable to NLP)

Repos

[Awesome-Causality-in-CV]

Invariance-Based Causal Discovery (for Robustness)

(2016, Journal of the Royal Statistical Society) Causal inference using invariant prediction: identification and confidence intervals. Jonas Peters, Peter Bühlmann, Nicolai Meinshausen. [pdf]
(2018 Journal of Causal Inference) Invariant Causal Prediction for Nonlinear Models. Christina Heinze-Deml*, Jonas Peters, and Nicolai Meinshausen. [pdf]
(2019) Invariant Risk Minimization. Martin Arjovsky, Léon Bottou, Ishaan Gulrajani, David Lopez-Paz. [pdf]
(2020) Nonlinear Invariant Risk Minimization: A Causal Approach. Chaochao Lu, Yuhuai Wu, Jośe Miguel Hernández-Lobato, Bernhard Schölkopf. [pdf]

Interventional Robustness

(2018 IEEE Data Science Workshop) Causality from a Distributional Robustness Point of View. Nicolai Meinshausen. [pdf]
(2018) Anchor regression: heterogeneous data meet causality. Dominik Rothenhäusler, Nicolai Meinshausen, Peter Bühlmann and Jonas Peters. [pdf]

Causality from Cognitive Science Point of View

(2002 NIPS) Theory-Based Causal Inference. Joshua B. Tenenbaum & Thomas L. Griffiths. [pdf]
[Summary] Modeling human's learning of causality as Bayesian computations over a hypothesis space of causal graphical model.

Others

(PSB 2020 Oral) Robustly Extracting Medical Knowledge from EHRs: A Case Study of Learning a Health Knowledge Graph. Irene Y. Chen, Monica Agrawal, Steven Horng, David Sontag. [pdf]
(2021 AAAI; AI Safety, RL with causality) Agent Incentives: A Causal Perspective. Tom Everitt, Ryan Carey, Eric Langlois, Pedro A Ortega, Shane Legg. [pdf]
(2019 IJCAI AI Safety Workshop) Modeling AGI Safety Frameworks with Causal Influence Diagrams. Tom Everitt, Ramana Kumar, Victoria Krakovna, Shane Legg (Causal Incentives Working Group). [pdf]
(2018 IEEE workshop; robustness under domain shifts) Causality from a Distributional Robustness Point of View. Nicolai Meinshausen. [pdf]

Some tools that might be useful for disentanglement

(2009) Tensor Decompositions and Applications. Tamara Kolda, Brett Bader. [pdf]

4.4 Books (for Systematic Learning)

(For ML Audience) Elements of Causal Inference. Jonas Peters, Dominik Janzing and Bernhard Schölkopf. [Book, 2017]
(Quick Primer by Judea Pearl) Causal Inference in Statistics: A Primer. Judea Pearl. [Book, 2016]
(Focus on SCM) Causality: Models, Reasoning, and Inference. Judea Pearl. [Book, 2009]
(For statistics people on causal inference) Causation, Prediction and Search. Peter Spirtes. [Book, 2001]
More book recommendations: See Brady Neal's blog, and this pointer to causality books for beginner/intermediate/advanced.
Causal Inference for The Brave and True. [website]
A long paper&book list including multiple categories of causality papers: [spreadsheet]

4.5 Online Courses

Graphical Models and Causality. Isabelle Guyon (ETH). [course website]
Applied Causality (Spring 2019). David Blei (Columbia University). [course reading list]
A Week Long Course on Causal Modeling and Discovery. (Center for Causal Discovery, 2016) [videos]
(Reading group) Causality Reading Club@Amsterdam ML Lab. Prof. Joris Mooij. [past readings]

4.6 People Directory

(Credits to Causal Resources.)

Judea Pearl (UCLA), US.
Bernhard Schölkopf (MPI Tübingen), Tübingen, Germany. [group intro]
Dominik Janzing (Amazon Tübingen; former: MPI Tübingen), Tübingen, Germany.
Joris Mooij (University of Amsterdam; former: MPI Tübingen), Netherlands. [home page]
Jonas Peters (Copenhagen University; former: MPI Tübingen), Denmark. [home page]
Peter Bühlmann (ETH), Switzerland.
Marloes Maathuis (ETH), Switzerland. [video]
Nicolai Meinshausen (ETH), Switzerland. E.g., Anchor regression.
Negar Kiyavash (EPFL), Switzerland. E.g., causal structure learning.
Kun Zhang (CMU; former: MPI Tübingen), US.
Peter Spirtes (CMU Philosophy), US.
Cosma Shalizi (CMU), US.
David Sontag (MIT), US.
Caroline Uhler (MIT), US.
Victor Chernozhukov (MIT), US.
Elias Bareinboim (Columbia), US. [home page]
Andrew Gelman (Columbia; former: UCLA with Judea Pearl), US. [video]
Ilya Shpitser (JHU; former: UCLA with Judea Pearl), US. [home page]
Kosuke Imai (Harvard; former: Princeton), US.
James Robins (Harvard), US.
Ferederick Eberhardt (Caltech; former: CMU), US.
David Heckerman (Amazon), US.
Leon Bottou (Facebook AI), US.
Thomas Richardson (UW), US.
Stephan Hartmann (LMU), Munich, Germany.
Cheng Soon Ong (Data61; former: MPI Tübingen), Canberra, Australia.
You can also track the organizers, area chairs, and advisory board of the CLeaR conference, as well as attendees of causal inference seminars/workshops such as 2021 Frontiers of Causal Inference in Data Science.

4.7 Workshops

Please see this Google Spreadsheet for a list of causality workshops (2016 - now).

4.8 Others

Job postings for causality researchers: [website]
Causality notes by Tailin Wu

Contributions

All types of contributions to this paper list is welcome. Feel free to open a Pull Request.

Contact: Zhijing Jin, PhD of Bernhard Schoelkopf at Max Planck Institute for Intelligent Systems, working on NLP & Causality.

How to Cite This Repo

@misc{causality2021jin,
  author = {Zhijing Jin},
  title = {Causality for NLP Reading List},
  year = {2021},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/zhijing-jin/Causality4NLP_Papers}}
}

Files

README.md

Latest commit

History