This question evaluates understanding of logistic regression and related competencies including derivation of the Bernoulli log-likelihood, gradient and Hessian for L2-regularized models, convexity reasoning, probability calibration and decision thresholds under asymmetric costs, numerically stable log-sigmoid expressions, and analytic treatment of class imbalance effects. It is categorized under Statistics & Math for data scientist roles and is commonly asked because it combines conceptual theoretical derivations with practical application skills, testing both mathematical reasoning (derivations and convexity proofs) and applied understanding of thresholds, calibration, numerical stability, and regularization.
Assume a binary classification setting with observations {(x_i, y_i)} for i=1..n, where x_i ∈ R^p (with an intercept term) and y_i ∈ {0,1}. Let η_i = x_i^T β and σ(z) be the logistic (sigmoid) function.
Login required