Derive and regularize logistic regression

Q: Derive and regularize logistic regression

This question evaluates proficiency in logistic regression theory and regularization, GLM versus OLS assumptions, class imbalance handling and calibration, temporal validation to avoid leakage, correlated-feature penalty effects, and business-threshold decisioning for expected value, within the Machine Learning domain for a Data Scientist role.

Q: How do I approach Machine Learning interview questions?

Machine Learning questions require understanding of core concepts and practice. PracHub provides solutions with explanations to help you master machine learning interviews.

Question

Loading...

Churn Propensity with Logistic Regression: Theory, Validation, and Decisions

Context: You are building a churn propensity model (y ∈ {0,1}) using logistic regression for a subscription business. Positives (churners) are 3% of samples. Answer each part precisely and concisely.

1) Logistic regression likelihood and regularization

Starting from a Bernoulli likelihood, derive the logistic regression log-likelihood and its gradient with respect to β.
Show how L2 and L1 regularization change the objective from MLE to MAP. Write the new objective and gradients/subgradients (note: intercept typically unpenalized).

2) OLS assumptions vs. GLMs

List the OLS assumptions. For each assumption that is relevant to GLMs (e.g., multicollinearity, omitted variables, measurement error, non‑IID), explain how violations manifest in logistic regression and how regularization, feature engineering, or robust inference address them.

3) Class imbalance (3% positives)

Compare class weighting vs. focal loss vs. threshold moving. How does each affect calibration?
Describe a calibration check and a recalibration method (Platt vs. isotonic), and when you’d prefer each.

4) Temporal validation without leakage

Define a temporal validation scheme that avoids leakage. Include: feature freeze date, out‑of‑time test window, and a k‑fold strategy compatible with time. Specify the exact splits on a 6‑month dataset.

5) Correlated features and penalties

With correlated features, contrast L1 vs. L2 on sparsity, stability, and interpretability. Propose a workflow that yields a sparse, stable model with confidence intervals for odds ratios.

6) Business decision rule and value

Give one business‑aligned decision rule for choosing the score threshold using asymmetric costs, and show how to compute the expected value uplift over a "message all" policy.

Derive and regularize logistic regression

Churn Propensity with Logistic Regression: Theory, Validation, and Decisions

1) Logistic regression likelihood and regularization

2) OLS assumptions vs. GLMs

3) Class imbalance (3% positives)

4) Temporal validation without leakage

5) Correlated features and penalties

6) Business decision rule and value

Solution

Comments (0)

Derive and regularize logistic regression

Overview

Churn Propensity with Logistic Regression: Theory, Validation, and Decisions

1) Logistic regression likelihood and regularization

2) OLS assumptions vs. GLMs

3) Class imbalance (3% positives)

4) Temporal validation without leakage

5) Correlated features and penalties

6) Business decision rule and value

Solution

Comments (0)