How to read a
confusion matrix
suppose you have 95 records available for prediction. The
record has a target to predict whether the person has a particular disease or
not.
the matrix displays
as
Confirmed
|
Not confirmed
|
|
Confirmed
|
35
|
20
|
Not confirmed
|
25
|
15
|
What do you understand?
I have given
summation of the rows and columns
Predicted values
|
||||
Confirmed
|
Not confirmed
|
total
|
||
Actual values
|
Confirmed
|
35
|
20
|
55
|
Not confirmed
|
25
|
15
|
40
|
|
total
|
60
|
35
|
95
|
Can you get some information?
The green cells are the actuals, while the yellows are the
predicted values and the pink are the errors.
Actual values are the one which are tested physically and
the results are available on hand. Which means in our case 55 people had the
disease and 40 don’t have.
What did the algorithm do ? It has predicted as 60 people who
have disease and 35 who do not have.
How to interpret? Out of the 55 people, the algorithm has
made a wrong prediction of classifying 20 people as they don’t have a
disease.
Secondly, out of 45 people who do not have disease it has predicted
25 people that they may have a disease, which is a very serious issue.