Distribution Shift

The main use of machine learning is for prediction, i.e. predicting outcomes in scenarios which were not seen in the data which was used to train the model. However in many real-world scenarios, the distribution of the data used in training is different to the distribution of the data in production. This phenomena is called distribution shift.

Examples

  • Diabetic retinopathy prediction, see here.

Emacs 29.4 (Org mode 9.6.15)