项目简介

本项目是关于因果推断教材 Causal Inference: What If 的编译和解读。

Motivation

Judea Pearl 是 Causal AI 的奠基人,Bernhard Scholkopf 推进了 Causality for Machine Learning,Yoshua Bengio 提出了 System 2 deep learning 作为 Causal AI 的一个范式。沉醉于 life and Intelligence 之美,众多工具中(包括数学,计算机,物理,复杂系统等等),偏好用信息论视角研究如何教会机器因果思维,希望创造具备 free will 的 AI,使之成为我们的良师益友,一起探索解密生命和智能的终极奥秘。

该书对因果推理的概念和方法进行了全面的介绍。是我们 Causal AI 中组成内容。

DoWhy | 让因果推断容易

因果推断并不容易,我们希望结合 DoWhy 和本书的理论让 Causal Inference 更加容易。

要构建 Benigo 的 System 2 deep learning for Causal AI, 我们需要 Pyro 这个强大的工具。

内容概要

本项目的主要内容是:

  • 翻译教材中的部分内容,按照易懂的方式重新组织,然后加入一些理解和解读。

  • 使用 DoWhy 来构建本书的代码工作,用 DoWhy 的视角来理解该教材内容。

注:For simplicity, in this book we do not distinguish between vector and scalar parameters. This is an abuse of notation, but we believe it does not create any confusion

Preface of the Book

Introduction: Towards Less Causal Causal Inferences

Causal Inference is an admittedly pretentious(过于宏大的) title for a book. 因果推理是一项复杂的科学任务 that relies on triangulating evidence from multiple sources and on the application of a variety of methodological approaches. No book can possibly provide a comprehensive description of methodologies for causal inference across the sciences. 任何因果推理书籍的作者都必须选择他们要强调的因果推理的哪些方面。

本引言的标题就已经反映了我们自己的选择: a book that helps scientists–especially health and social scientists–generate and analyze data to make causal inferences that are explicit about both the causal question and the assumptions underlying the data analysis.(为什么作出这样一个选择?) Unfortunately, the scientific literature is plagued by studies in which the causal question is not explicitly stated and the investigators’ unverifiable assumptions are not declared. This casual attitude towards causal inference has led to a great deal of confusion. For example, it is not uncommon to find studies in which the effect estimates are hard to interpret because the data analysis methods cannot appropriately answer the causal question (were it explicitly stated) under the investigators’ assumptions (were they declared).

在这本书中,we stress the need to take the causal question seriously enough to articulate it, and to delineate the separate roles of data and assumptions for causal inference. Once these foundations are in place, causal inferences become necessarily less casual, which helps prevent confusion. The book describes various data analysis approaches that can be used to estimate the causal effect of interest under a particular set of assumptions when data are collected on each individual in a population. 这本书的一个关键信息是,因果推理不能简化为用于数据分析的配方的集合。具体来说,就是像 DoWhy 的设计理念一样,把因果推断分成四个步骤:

  1. Model a causal inference problem using assumptions.

  2. Identify an expression for the causal effect under these assumptions (“causal estimand”).

  3. Estimate the expression using statistical methods such as matching or instrumental variables.

  4. Finally, verify the validity of the estimate using a variety of robustness checks.

(本书的组织结构)The book is divided in three parts of increasing difficulty: Part I is about causal inference without models (i.e., nonparametric identification of causal ef- fects), Part II is about causal inference with models (i.e., estimation of causal effects with parametric models), and Part III is about causal inference from complex longitudinal data (i.e., estimation of causal effects of time-varying treatments). (我们全书使用 Fine Points 和 Technical points 来附加解释相关内容)Throughout the text, we have interspersed Fine Points and Technical points that elaborate on certain topics mentioned in the main text. Fine Points are designed to be accessible to all readers while Technical Points are designed for readers with intermediate training in statistics. The book provides a cohesive presentation of concepts of, and methods for, causal inference that are currently scattered across journals in several disciplines. (我们的目标读者是:)We expect that the book will be of interest to anyone interested in causal inference, e.g., epidemiologists, statisticians, psychologists, economists, sociologists, political scientists, computer scientists…

(我们考虑哲学上因果有关内容)Importantly, this is not a philosophy book. We remain agnostic about metaphysical concepts like causality and cause. Rather, we focus on the identification and estimation of causal effects in populations, that is, numerical quantities that measure changes in the distribution of an outcome under different interventions.

哲学上,mental causation, consciousness prior by Bengio, free will 是重要因果概念。我们强调信息的观点研究因果,两个理由: - (1) the brain is an information processing machine, and - (2) the way we talk about the mental to physical relationship seems similar to the relationship between the abstract content of information and its physical realization.

信息传递可以物理世界,但是不需要遵从能量守恒定律。封闭物理系统一定会遵从能量守恒,那么可以反推出信息的本质是开放物理系统(Agents,典型生命就是与外界交换能量和物质的典型)和其所在环境的交互。如果我们把心智和信息内容对应,那么通过信息传递,心智可以对物理世界产生影响。

What is causality as information transfer.

Collier himself defines it as the transfer of a particular quantity of information from one state of a system to another. Physical causation is a special case in which physical information instances are transferred from one state of physical system to another. (Collier, 1999, p. 215).

更多哲学上因果相关内容请参考 Free will, Causality and Neuroscience

Resources

这里给出相关的资料。