Causal Inference on Discrete Data using Additive Noise Models

Inferring the causal structure of a set of random variables from a finite sample of the joint distribution is an important problem in science. The case of two random variables is particularly challenging since no (conditional) independences can be exploited. Recent methods that are based on additive noise models suggest the following principle: Whenever the joint distribution {\bf P}^{(X,Y)} admits such a model in one direction, e.g., Y=f(X)+N, N \perp\kern-6pt \perp X, but does not admit the reversed model X=g(Y)+\tilde{N}, \tilde{N} \perp\kern-6pt \perp Y, one infers the former direction to be causal (i.e., X\rightarrow Y). Up to now, these approaches only dealt with continuous variables. In many situations, however, the variables of interest are discrete or even have only finitely many states. In this work, we extend the notion of additive noise models to these cases. We prove that it almost never occurs that additive noise models can be fit in both directions. We further propose an efficient algorithm that is able to perform this way of causal inference on finite samples of discrete variables. We show that the algorithm works on both synthetic and real data sets.
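
The decision rule described in the abstract can be illustrated with a small sketch. This is not the algorithm proposed in the paper: it assumes one candidate function per direction (the conditional mode), and it uses empirical mutual information as a stand-in dependence score instead of a statistical independence test. All function names, the toy data, and the tolerance `tol` are hypothetical choices made for illustration.

```python
from collections import Counter
from math import log


def fit_mode_function(cause, effect):
    """Map each cause value to its most frequent effect value (ties -> smallest)."""
    by_cause = {}
    for c, e in zip(cause, effect):
        by_cause.setdefault(c, Counter())[e] += 1
    return {c: min(cnt, key=lambda v: (-cnt[v], v)) for c, cnt in by_cause.items()}


def mutual_information(a, b):
    """Empirical mutual information (in nats) between two discrete samples."""
    n = len(a)
    joint = Counter(zip(a, b))
    count_a, count_b = Counter(a), Counter(b)
    # Integer counts keep the ratio exactly 1.0 under exact independence.
    return sum(c / n * log(c * n / (count_a[u] * count_b[v]))
               for (u, v), c in joint.items())


def residual_dependence(cause, effect):
    """Dependence between the residuals effect - f(cause) and the cause."""
    f = fit_mode_function(cause, effect)
    residuals = [e - f[c] for c, e in zip(cause, effect)]
    return mutual_information(residuals, cause)


def infer_direction(x, y, tol=1e-9):
    """Prefer the direction whose residuals are independent of the input."""
    forward = residual_dependence(x, y)   # candidate model Y = f(X) + N
    backward = residual_dependence(y, x)  # candidate model X = g(Y) + N~
    if forward <= tol < backward:
        return "X->Y"
    if backward <= tol < forward:
        return "Y->X"
    return "undecided"


# Toy data (assumed): Y = f(X) + N with a non-injective f and N independent of X.
f_true = {0: 0, 1: 0, 2: 4}
noise = [-1, 0, 0, 1]
x = [xv for xv in f_true for _ in noise]
y = [f_true[xv] + n for xv in f_true for n in noise]

print(infer_direction(x, y))
```

On this toy sample the forward residuals reproduce the noise exactly, so their dependence score is zero, while the backward residuals still vary with Y. A practical procedure would search over candidate functions and replace the fixed tolerance with an independence test at a chosen significance level, since on finite noisy samples the score is rarely exactly zero.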

Author(s): Peters, J. and Janzing, D. and Schölkopf, B.
Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence
Volume: 33
Number (issue): 12
Pages: 2436-2450
Year: 2011
Month: December
Department(s): Empirical Inference
Bibtex Type: Article (article)
DOI: 10.1109/TPAMI.2011.71

BibTeX:

@article{PetersJS2011,
  title = {Causal Inference on Discrete Data using Additive Noise Models},
  author = {Peters, J. and Janzing, D. and Sch{\"o}lkopf, B.},
  journal = {IEEE Transactions on Pattern Analysis and Machine Intelligence},
  volume = {33},
  number = {12},
  pages = {2436-2450},
  month = dec,
  year = {2011},
  month_numeric = {12}
}