This question evaluates probabilistic modeling and statistical inference skills for censored time-to-event performance data, focusing on estimating player-specific win probabilities and predicting expected future wins from observed "wins before first loss" records.
You are given a DataFrame df where each row summarizes a player’s performance until their first loss.
df columns:
player_id
(string/int)
wins_before_first_loss
(int,
): number of consecutive wins observed before the first loss occurred
Interpretation: for each player, you observed a sequence of games that ended with a loss, e.g., WWW...WL, and wins_before_first_loss = k.
wins_before_first_loss
and explain how you’d estimate parameters.