BayesianMachineLearningMachineLearningStatistics
Given a data distribution
- Estimation error: How close a particular hypothesis can get to the best-in-class error
- Approximation error: How close the hypothesis class can get to the true minimum achievable error (considering noise in measurements)
respectively:
In regression, where , ad the loss is the squared error, the error rate (noise) can be taken to be the expected conditional variance: where the third equality follows by the definition of conditional expectation and the last by the tower property of conditional expectation. When is not random this is clearly zero. This is the mean-squared error of the conditional expectation.