Consider the following sample from a distribution that contains a continuous X and label Y that is
either A or B:
Which is the best cut point for X if you want to discretize these values into two buckets in a way
that minimizes the sum of chi-square values?
A.
X8
B.
X6
C.
X5
D.
X4
E.
X2