Correlation analysis of numerical data in Data Mining
A | B |
3 | 1 |
4 | 6 |
1 | 2 |
Step 1: Find all the initial values
A | B | AB | A^{2}=C | B^{2}=D |
3 | 1 | 3 | 9 | 1 |
4 | 6 | 24 | 16 | 36 |
1 | 2 | 2 | 1 | 4 |
The total number of values (n) is 3.
[quads id=1]
The other values we need are:
ΣA =3 + 4 + 1 = 8
ΣB = 1 + 6 + 2 = 9
ΣAB = 3 + 24 + 2 = 29
ΣC = 9 + 16 + 1 = 26
ΣD= 1 + 36 + 4 = 41
Step 2: Input the Values
(r) =[ nΣAB – (ΣA)(ΣB) / Sqrt([nΣC– (ΣA)^{2}] [nΣD – (ΣB)^{2}])]
r = [3(29) – (8)(9) / Sqrt ([3(26) – (8) ^{2} ] [3(41)-(9) ^{2} ])]
r= [87-72 / Sqrt ([78-64] [123 -81])]
r= [15 / Sqrt ([14] [42])]
r=[15 / Sqrt (588)]
r= 15 / 24.24
r= 0.61