• A
  • A
  • A
  • ABC
  • ABC
  • ABC
  • А
  • А
  • А
  • А
  • А
Regular version of the site

Book chapter

Using a Cascade of Supervised Machine Learning Models to Discover Causality in Pairs of Variables.

P. 187-205.

Discovering causality between two variables is challenging, especially in the nonlinear case. Many causation coefficients exist to identify cause and effect, but most only distinguish two types of relationships,  and , treating all other cases as unidentifiable. Additionally, these models often rely on assumptions (e.g., linearity, non-Gaussian noise) that limit their practical applicability. To address these limitations, many authors adopt a supervised approach, using causation coefficients as features for machine learning models. We propose a cascade of models, each designed to identify only one type of causality. This approach allows us to focus on extracting the most informative features for different causal types. We introduce two new features: (1) the fraction of the variation explained by the first principal component, and (2) the ratio of the skewness of Xand Y. These features improve the detection of confounded pairs and causal directions. Numerical experiments demonstrate that the proposed method outperforms existing supervised and unsupervised algorithms for continuous variables.

In book

Edited by: L. Sokolinsky, M. Zymbler, V. Voevodin et al. Vol. 2891. Springer, 2026.