Correlation analysis of numerical data in Data Mining

Correlation analysis of numerical data in Data Mining

A

B

3

1

4

6

1

2

Step 1: Find all the initial values

A

B

AB

A2=C

B2=D

3

1

3

9

1

4

6

24

16

36

1

2

2

1

4

The total number of values (n) is 3.

[quads id=1]

The other values we need are:

ΣA =3 + 4 + 1 = 8

ΣB = 1 + 6 + 2 = 9

ΣAB = 3 + 24 + 2 = 29

ΣC = 9 + 16 + 1 = 26

ΣD= 1 + 36 + 4 = 41

Step 2: Input the Values

(r) =[ nΣAB – (ΣA)(ΣB) / Sqrt([nΣC– (ΣA)2]  [nΣD – (ΣB)2])]

r = [3(29) – (8)(9) / Sqrt ([3(26) – (8) 2 ]  [3(41)-(9) 2  ])]

r= [87-72 / Sqrt ([78-64]  [123 -81])]

r= [15 / Sqrt ([14] [42])]

r=[15 / Sqrt (588)]

r= 15 24.24

r= 0.61