This question evaluates a candidate's understanding of imbalanced classification and regression concepts, including ROC/PR curves and AUC, prevalence effects on metrics, loss functions (MSE vs MAE) and their gradients, and neural network training strategies for calibration and recall.
You are evaluating a binary classifier and a regression head in a machine learning take-home. Answer all parts concisely but show your steps where calculations are requested.
Given scores for 5 positives and 5 negatives, sweep the decision threshold from +∞ down to −∞. At equal scores (if any), break ties by ranking positives above negatives.
Tasks:
With prevalence = 1% (positives are rare):
For a regression head:
On the same 1% prevalence task, propose two concrete architecture/training changes (e.g., focal loss with typical γ, α; class weighting; positive down/up-sampling; thresholding strategy). For each, discuss likely effects on calibration and on recall.
Login required