TY - GEN
T1 - Behind Distribution Shift: Mining Driving Forces of Changes and Causal Arrows
AU - HUANG, Biwei
AU - ZHANG, Kun
AU - ZHANG, Jiji
AU - SANCHEZ-ROMERO, Ruben
AU - GLYMOUR, Clark
AU - SCHÖKOPF, Bernhard
PY - 2017
Y1 - 2017
N2 - We address two important issues in causal discovery from nonstationary or heterogeneous data, where parameters associated with a causal structure may change over time or across data sets. First, we investigate how to efficiently estimate the "driving force" of the nonstationarity of a causal mechanism. That is, given a causal mechanism that varies over time or across data sets and whose qualitative structure is known, we aim to extract from data a low-dimensional and interpretable representation of the main components of the changes. For this purpose we develop a novel kernel embedding of nonstationary conditional distributions that does not rely on sliding windows. Second, the embedding also leads to a measure of dependence between the changes of causal modules that can be used to determine the directions of many causal arrows. We demonstrate the power of our methods with experiments on both synthetic and real data.
AB - We address two important issues in causal discovery from nonstationary or heterogeneous data, where parameters associated with a causal structure may change over time or across data sets. First, we investigate how to efficiently estimate the "driving force" of the nonstationarity of a causal mechanism. That is, given a causal mechanism that varies over time or across data sets and whose qualitative structure is known, we aim to extract from data a low-dimensional and interpretable representation of the main components of the changes. For this purpose we develop a novel kernel embedding of nonstationary conditional distributions that does not rely on sliding windows. Second, the embedding also leads to a measure of dependence between the changes of causal modules that can be used to determine the directions of many causal arrows. We demonstrate the power of our methods with experiments on both synthetic and real data.
U2 - 10.1109/ICDM.2017.114
DO - 10.1109/ICDM.2017.114
M3 - Conference paper (refereed)
SP - 913
EP - 918
BT - 2017 IEEE International Conference on Data Mining (ICDM)
PB - IEEE
T2 - IEEE International Conference on Data Mining
Y2 - 18 November 2017 through 21 November 2017
ER -