Briefing Cloudera Knowledge

Consider the following sample from a distribution that contains a continuous variable X and a label Y
that is either A or B:

Which is the best cut point for X if you want to discretize these values into two buckets in a way
that minimizes the sum of chi-square values?

A. X8

B. X6

C. X5

D. X4

E. X2
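
One way to reason about a question like this is to score every candidate cut point with a chi-square statistic and compare the scores. The sketch below is only an illustration under stated assumptions: the values in `xs` and `ys` are hypothetical (the question's sample is not reproduced here), the helper name `chi_square_for_cut` is made up for this example, and the "sum of chi-square values" is read as the Pearson statistic summed over the cells of the 2x2 bucket-by-label table. Other readings of the question's criterion are possible.

```python
from collections import Counter

# Hypothetical data, NOT the sample from the question.
xs = [1, 2, 3, 4, 5, 6, 7, 8]
ys = ["A", "A", "A", "A", "B", "B", "B", "B"]


def chi_square_for_cut(xs, ys, cut):
    """Pearson chi-square of the 2x2 table (bucket x label).

    Bucket 1 holds rows with x <= cut, bucket 2 holds rows with x > cut.
    The statistic is the sum over all cells of (observed - expected)^2 / expected.
    """
    n = len(ys)
    label_totals = Counter(ys)
    buckets = [
        [y for x, y in zip(xs, ys) if x <= cut],
        [y for x, y in zip(xs, ys) if x > cut],
    ]
    chi2 = 0.0
    for bucket in buckets:
        observed = Counter(bucket)
        for label, total in label_totals.items():
            # Expected count if label were independent of bucket.
            expected = len(bucket) * total / n
            if expected > 0:
                chi2 += (observed[label] - expected) ** 2 / expected
    return chi2


# Score every cut that leaves at least one value in each bucket.
candidate_cuts = sorted(set(xs))[:-1]
scores = {cut: chi_square_for_cut(xs, ys, cut) for cut in candidate_cuts}
for cut, score in scores.items():
    print(f"cut at x <= {cut}: chi-square = {score:.3f}")
```

Whether the preferred cut is the one with the smallest or the largest score depends on how the criterion is defined, so the sketch reports the scores rather than picking a winner.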