 
  
  
  
  
Given a hypothesis space $H$, a hypothesis $h \in H$ is said
to overfit the training data if there exists some alternative
hypothesis $h' \in H$, such that $h$ has a smaller error
than $h'$ over the training examples, but $h'$ has
a smaller error than $h$ over the entire distribution of instances.
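
The definition can be checked mechanically on a toy problem. The following is a minimal sketch, not taken from the source: the target concept (`x >= 50`), the five training examples (one with a deliberately noisy label), the two hypotheses `h` and `h_alt`, and the helper names `error` and `overfits` are all assumptions made for illustration. The "entire distribution of instances" is assumed to be uniform over the integers 0 to 99 with noise-free labels.

```python
def error(hypothesis, examples):
    """Fraction of (x, label) pairs that the hypothesis misclassifies."""
    examples = list(examples)
    return sum(hypothesis(x) != y for x, y in examples) / len(examples)


def overfits(h, h_alt, train, population):
    """True if h has smaller error than h_alt on the training examples
    while h_alt has smaller error than h on the whole population."""
    return (error(h, train) < error(h_alt, train)
            and error(h_alt, population) < error(h, population))


# Assumed target concept: x is positive iff x >= 50 (illustrative only).
population = [(x, x >= 50) for x in range(100)]

# Small training sample; the label for x = 40 is noise (it should be False).
train = [(10, False), (30, False), (40, True), (60, True), (90, True)]

_memory = dict(train)


def h(x):
    """Memorizes the training examples; predicts False for anything unseen."""
    return _memory.get(x, False)


def h_alt(x):
    """Simple threshold rule that ignores the noisy training label."""
    return x >= 50


print("training error:   h =", error(h, train), " h_alt =", error(h_alt, train))
print("population error: h =", error(h, population), " h_alt =", error(h_alt, population))
print("h overfits relative to h_alt:", overfits(h, h_alt, train, population))
```

Here `h` fits the training examples perfectly, including the noisy one, yet generalizes worse than the simpler `h_alt`, which is exactly the situation the definition describes. In practice the error over the entire distribution is not observable and is usually estimated on held-out data.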