When optimizing a function using stochastic gradient descent, how frequently should you update
your estimate of the gradient?

A.
Once after every pass through the data set
B.
Once per observation
C.
For each observation with a probability that you choose ahead of time
D.
After a random number of observations
E.
Once every N observations, where you decide N ahead of time