Model Drift Explained: What It Is and How It Works
Model drift occurs when a production model's accuracy degrades because real-world data has shifted away from the data it was trained on. Data drift means the input feature distributions change; concept drift means the relationship between inputs and outputs changes. Both require monitoring and retraining; neither resolves itself.
Data Drift vs Concept Drift
Drift types and what changes:

```
DATA DRIFT (covariate shift):
  Training:   P(X) = age: 35±5,  income: 60k±15k
  Production: P(X) = age: 28±4,  income: 45k±12k   ← shifted
  Effect: model sees ages/incomes it never trained on

CONCEPT DRIFT:
  Training:   P(Y|X) = high_income → low churn risk
  Production: P(Y|X) = high_income → high churn risk   ← shifted
  Effect: the world changed; features same, labels differ

PREDICTION DRIFT (early-warning proxy):
  Training avg score:   0.23
  Production avg score: 0.61   ← something changed
```

Types of Drift
Data Drift
Feature distributions shift
The statistical distribution of input features changes. User demographics shift, market conditions change, or data collection processes change. The model still works — but on inputs it was not trained for.
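As a quick illustration, a two-sample Kolmogorov-Smirnov test (via scipy) flags this kind of shift directly from raw feature samples. The distributions below are synthetic, chosen to match the age numbers in the diagram above:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(42)

# Hypothetical feature: age at training time vs. in production
train_age = rng.normal(35, 5, size=5_000)   # training: 35 ± 5
prod_age = rng.normal(28, 4, size=5_000)    # production: 28 ± 4 (shifted)

# Two-sample KS test: a small p-value means the distributions differ
stat, p_value = stats.ks_2samp(train_age, prod_age)
drifted = p_value < 0.05
print(f"KS statistic={stat:.3f}, p={p_value:.2e}, drift={drifted}")
```

With a shift this large, the KS statistic is well above 0.3 and the p-value is effectively zero; on two samples drawn from the same distribution it would typically stay above 0.05.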
Concept Drift
Input-output relationship changes
The relationship between features and labels changes. A fraud model trained pre-pandemic fails post-pandemic even if transaction features look the same — behavior patterns changed.
Label Drift
Target distribution changes
The proportion of target classes shifts — e.g. churn rate drops from 15% to 5% after a product improvement. A model calibrated for 15% churn becomes poorly calibrated.
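When only the base rate moves, predicted probabilities can be recalibrated by rescaling the model's odds by the ratio of new to old prior odds (the standard prior-shift correction). A minimal sketch using the 15% → 5% churn numbers above; `adjust_for_prior_shift` is an illustrative helper, not a library function:

```python
def adjust_for_prior_shift(p: float, old_prior: float, new_prior: float) -> float:
    """Recalibrate a predicted probability when only the class base rate changed.

    Multiplies the predicted odds by the ratio of new-to-old prior odds,
    which is the Bayes-correct adjustment under a pure label (prior) shift.
    """
    odds = p / (1 - p)
    prior_ratio = (new_prior / (1 - new_prior)) / (old_prior / (1 - old_prior))
    adjusted_odds = odds * prior_ratio
    return adjusted_odds / (1 + adjusted_odds)

# A model trained at 15% churn predicts 0.30 for a user;
# at a 5% base rate the calibrated estimate is roughly 0.11.
p_adj = adjust_for_prior_shift(0.30, old_prior=0.15, new_prior=0.05)
```

This only fixes calibration, not ranking; if the features-to-label relationship also changed (concept drift), retraining is still required.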
Detection Methods
| Test | Best for | Threshold |
|---|---|---|
| Population Stability Index (PSI) | Categorical + binary features | PSI > 0.2 = drift |
| Kolmogorov-Smirnov (KS) test | Continuous feature distributions | p-value < 0.05 |
| Chi-square test | Categorical feature frequency | p-value < 0.05 |
| Jensen-Shannon divergence | Prediction distribution shift | JS > 0.1 = alert |
| CUSUM / Page-Hinkley | Gradual concept drift detection | Custom threshold |
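PSI, the first test in the table, is easy to compute directly. A minimal sketch, assuming the common convention of taking bin edges from the reference sample's quantiles; `psi` is an illustrative helper, and the income distributions are synthetic:

```python
import numpy as np

def psi(reference: np.ndarray, current: np.ndarray, bins: int = 10) -> float:
    """Population Stability Index between a reference and a current sample.

    Bin edges come from the reference sample's quantiles; outer edges are
    widened to cover the current sample, and tiny fractions are clipped so
    empty bins never produce log(0).
    """
    edges = np.quantile(reference, np.linspace(0, 1, bins + 1))
    edges[0] = min(edges[0], current.min())
    edges[-1] = max(edges[-1], current.max())
    ref_frac = np.histogram(reference, bins=edges)[0] / len(reference)
    cur_frac = np.histogram(current, bins=edges)[0] / len(current)
    eps = 1e-6
    ref_frac = np.clip(ref_frac, eps, None)
    cur_frac = np.clip(cur_frac, eps, None)
    return float(np.sum((cur_frac - ref_frac) * np.log(cur_frac / ref_frac)))

rng = np.random.default_rng(0)
# Same income distribution twice -> PSI near zero
stable = psi(rng.normal(60_000, 15_000, 10_000), rng.normal(60_000, 15_000, 10_000))
# Shifted income distribution -> PSI far above the 0.2 alert threshold
shifted = psi(rng.normal(60_000, 15_000, 10_000), rng.normal(45_000, 12_000, 10_000))
```

A common rule of thumb: PSI < 0.1 stable, 0.1 to 0.2 moderate shift worth watching, above 0.2 significant drift.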
Detecting Drift with Evidently AI
Evidently AI runs statistical tests across all features and produces a drift report. Integrate into an Airflow DAG to run nightly.
```python
# Evidently legacy Report API (v0.4.x); module paths changed across releases,
# so check the version you have installed.
from evidently.report import Report
from evidently.metrics import (
    DatasetDriftMetric,   # overall drift yes/no
    DataDriftTable,       # per-feature drift scores
)

report = Report(metrics=[
    DatasetDriftMetric(),
    DataDriftTable(),
])
report.run(
    reference_data=training_df,    # sample from training time
    current_data=production_df,    # e.g. last 7 days of production data
)
report.save_html('drift_report.html')

result = report.as_dict()
print(f'Drift detected: {result["metrics"][0]["result"]["dataset_drift"]}')
```

Common Mistakes
Only monitoring infrastructure metrics
A model can serve 99.9% uptime while its predictions are completely wrong. Monitoring latency and error rates tells you nothing about model accuracy. Add prediction distribution and feature drift metrics.
Alerting on every individual feature drift
With 50 features, a 5% significance threshold means ~2.5 false-positive alerts per monitoring run by chance alone. Use dataset-level drift metrics (PSI across all features) and require a minimum share of drifted features before alerting.
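That policy can be implemented as a dataset-level gate: run the per-feature test, but alert only when the drifted share crosses a floor. A sketch with a hypothetical `should_alert` helper; the per-feature KS test and the 30% share threshold are illustrative choices:

```python
import numpy as np
from scipy import stats

def should_alert(reference, current, alpha=0.05, min_drift_share=0.3):
    """Alert only when a meaningful share of features drift, not on any one.

    reference/current: dicts mapping feature name -> 1-D sample array.
    Returns (alert, list_of_drifted_feature_names).
    """
    drifted = [
        name for name in reference
        if stats.ks_2samp(reference[name], current[name]).pvalue < alpha
    ]
    share = len(drifted) / len(reference)
    return share >= min_drift_share, drifted

rng = np.random.default_rng(1)
ref = {f"f{i}": rng.normal(0, 1, 2_000) for i in range(50)}
# Only 3 of 50 features genuinely shift; a few more will trip the 5% test
# by chance, but the share stays well under 30% -> no alert
cur = {name: rng.normal(0.8 if name in ("f0", "f1", "f2") else 0.0, 1, 2_000)
       for name in ref}
alert, drifted = should_alert(ref, cur)
```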
Waiting for labels before detecting drift
Production labels are often delayed by days or weeks. Detect data drift (input distribution shifts) and prediction drift (output distribution shifts) as early-warning proxies — don't wait for labeled ground truth.
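A prediction-drift check needs no labels at all: histogram the scores and compare the two distributions. A sketch using scipy's `jensenshannon` (note it returns the JS *distance*, so it is squared here to get the divergence); the score distributions below are synthetic, matching the 0.23 vs 0.61 averages from the diagram earlier:

```python
import numpy as np
from scipy.spatial.distance import jensenshannon

def prediction_drift(ref_scores, cur_scores, bins=20):
    """Jensen-Shannon divergence between two prediction-score distributions.

    Scores are histogrammed on [0, 1]; a tiny epsilon keeps empty bins from
    breaking the log terms. scipy normalizes the histograms internally.
    """
    edges = np.linspace(0, 1, bins + 1)
    p = np.histogram(ref_scores, bins=edges)[0] + 1e-9
    q = np.histogram(cur_scores, bins=edges)[0] + 1e-9
    return jensenshannon(p, q, base=2) ** 2

rng = np.random.default_rng(7)
train_scores = np.clip(rng.normal(0.23, 0.10, 10_000), 0, 1)  # avg ~0.23
prod_scores = np.clip(rng.normal(0.61, 0.10, 10_000), 0, 1)   # avg ~0.61
js = prediction_drift(train_scores, prod_scores)               # well above 0.1
```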
Fixed retraining schedules instead of drift-triggered
Retraining every Monday wastes compute when data is stable and misses rapid drift between runs. Trigger retraining on drift signals. If drift is rare, scheduled retraining wastes money; if it's frequent, fixed schedules miss it.
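The trigger logic itself can be a small decision function, with a staleness backstop so a quiet model still gets refreshed eventually; the thresholds here are illustrative:

```python
def should_retrain(psi_score: float, days_since_last: int,
                   psi_threshold: float = 0.2,
                   max_staleness_days: int = 90) -> str:
    """Retrain on drift signals, with a staleness backstop instead of a fixed schedule."""
    if psi_score > psi_threshold:
        return "retrain: drift detected"
    if days_since_last > max_staleness_days:
        return "retrain: staleness backstop"
    return "skip"

print(should_retrain(0.35, days_since_last=10))   # drift -> retrain now
print(should_retrain(0.05, days_since_last=120))  # stale -> retrain anyway
print(should_retrain(0.05, days_since_last=10))   # stable and fresh -> skip
```

In an orchestrator, this function would run after the nightly drift report and gate the retraining job, rather than the job running on a calendar.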
FAQ
- What is model drift?
- Model drift is when a production model's accuracy degrades because real-world data has shifted from training data. It's the primary reason models need ongoing monitoring and retraining.
- What is the difference between data drift and concept drift?
- Data drift: input feature distributions change (users skew younger, income changes). Concept drift: the relationship between inputs and outputs changes (same features, different correct labels). Both degrade accuracy but require different fixes.
- How do you detect model drift?
- Use PSI for categorical features, KS test for continuous features, Chi-square for categorical frequency. Tools: Evidently AI, Whylogs, NannyML. Monitor prediction distribution as an early proxy when labels are delayed.
- How do you fix model drift?
- Retrain on recent data. For data drift, include new distribution in training window. For concept drift, re-examine feature engineering and labeling. Use drift-triggered retraining pipelines for automation.