2. Causal reasoning
In causal reasoning, we are looking how or why something happened and the relationship between the causes and their effects. In causal reasoning, we examine the relationship between causes and effects, and we try to understand how or why something happened. To calculate the causes of an event, current causal models use Individual or Average Treatment Effect (I-ATE). For instance, Pearl [
11] computes Average Causal Effect by subtracting the means of the treatment and control groups. Pearl uses Directed Acyclic Graphs (DAGs) to visualize and compute the associations or causal relationships between a set of elements. He also uses do operators, which are interventions on the nodes of DAGs, as well as probability theory, the Markov assumptions, and other concepts/methods/tools [
11].
However, in [
12] the authors showed that ATE describes the linear relationship between variables and in some examples cannot reflect the causality. Also, Pearl’s approach to causation does not allow reasoning in cases of degrees of uncertainty [
9]. Since
do operators cut the relation between two nodes, Pearl’s approach cannot answer gradient questions such as: given that you smoke a little, what is the probability that you have cancer to a certain degree? To solve Pearl’s do operator problem the authors in [
9,
13], used fuzzy logic rules which implement human language nuances such as “
small” instead of mere zero or one. Furthermore, Janzing [
14] showed that at a macro level, Pearl’s causal model works well with situations that are rare, such as rare medical conditions but, at a micro level, fails with bidirectional nodes. Authors in [
12] showed that Janzing’s model [
15] works well with bidirectional nodes, but fails with situations that are rare [
12].
In [
9] which uses fuzzy logic as the fundamental part of their causal model, there are two types of rules which we call association and causal rules. An association rule can be of type A
B. A causal rule can be ~A & B. That is, tell me what happens to B when A is missing. In other words, using fuzzy logic rules, we can estimate more than fourteen values in the presence and the absence of each element (in the dataset context this becomes columns or variables).
In [
9], instead of cutting the relationships between the treatment and its confounding parents, the model using fuzzification method assigns fuzzy membership values, such as very low, low, medium, and high, to the Treatment. To automatically assign membership values, the authors used fuzzy c-mean algorithm [
16] which is widely used for clustering the datasets using fuzzy logic.
Once, the membership assignments step is done, the model then applies causal fuzzy
interventional rules from [
9] as following
. Doing so, it can calculates more than fourteen different possible membership degree based fuzzy
counterfactual values for each variable using different causal fuzzy rules such as
, where
and
are the highest membership degrees of
and
among all considered fuzzy attributes for
and
, respectively [
9]. We would like to put more emphasis that using the following causal rules, we are applying interventions and estimate fuzzy counterfactuals at the same time.
Here are some of our fuzzy interventional rules.
|
, |
|
, |
|
, |
We note that based on the model introduced in [
9], each of the above rules has a meaning based on a subjective random selection. For instance,
could be seen as the probability of subjectively selecting
as
and
as
, where
and
are the fuzzy attributes that
and
come from, respectively. The model then applies defuzzification to the dataset in order to obtain the outputs. The big changes in the fuzzy output values can indicate possible causes.