Implement percentage RMSE and bootstrap its CI

Q: Implement percentage RMSE and bootstrap its CI

This question evaluates implementation and statistical reasoning skills, focusing on robust numerical computation of percentage RMSE (including handling zeros/negatives, optional country weights, and numerical stability) and nonparametric bootstrap methods for confidence intervals.

Q: How do I practice coding and algorithm questions?

Use PracHub's coding console to write, test, and debug your solutions in Python or JavaScript. View hints, test against sample inputs, and compare with official solutions.

Q: What difficulty level is this coding question?

This is a Medium difficulty Coding & Algorithms question, commonly asked during Technical Screen rounds at Google.

Q: What role is this question designed for?

This question is commonly asked for Data Scientist candidates at Google during technical interviews.

Question

Given a CSV with columns [country, actual_revenue, predicted_revenue], define percentage RMSE as pRMSE = sqrt(mean_i((pred_i/actual_i − 1)^2)). a) Implement pRMSE carefully: handle zeros/negatives, optional country weights, and numerical stability. b) Implement a nonparametric n‑of‑n bootstrap to obtain a 95% CI for pRMSE; justify the choice of resample size and discuss when a stratified or cluster bootstrap is preferable. c) What is the exact probability that one bootstrap resample equals the original sample in exactly the same order? Provide the formula in terms of n and explain why it is typically tiny. d) Discuss limitations of bootstrapping this metric (heavy tails, dependence across countries) and mitigations.

PracHub · Accepted Answer

def solution(rows, weights=None): import math if weights is None: weights = {} weighted_sq_errors = [] weight_values = [] for country, actual, predicted in rows: w = float(weights.get(country, 1.0)) actual = float(actual) predicted = float(predicted) if actual <= 0.0 or predicted < 0.0 or w <= 0.0: continue rel = (predicted - actual) / actual weighted_sq_errors.append(w * rel * rel) weight_values.append(w) total_weight = math.fsum(weight_values) if total_weight == 0.0: return None mean_sq = math.fsum(weighted_sq_errors) / total_weight return round(math.sqrt(mean_sq), 12) def solution(rows, B, seed, weights=None): import math if weights is None: weights = {} valid = [] for country, actual, predicted in rows: w = float(weights.get(country, 1.0)) actual = float(actual) predicted = float(predicted) if actual > 0.0 and predicted >= 0.0 and w > 0.0: valid.append((country, actual, predicted, w)) n = len(valid) if n == 0 or B <= 0: return None def prmse(sample): weighted_sq_errors = [] weight_values = [] for _, actual, predicted, w in sample: rel = (predicted - actual) / actual weighted_sq_errors.append(w * rel * rel) weight_values.append(w) total_weight = math.fsum(weight_values) return math.sqrt(math.fsum(weighted_sq_errors) / total_weight) estimate = prmse(valid) state = seed % 97 boot = [] for _ in range(B): sample = [] for _ in range(n): state = (17 * state + 43) % 97 idx = state % n sample.append(valid[idx]) boot.append(prmse(sample)) boot.sort() low_idx = max(0, int(math.floor(0.025 * B))) high_idx = min(B - 1, int(math.ceil(0.975 * B)) - 1) return (round(estimate, 6), round(boot[low_idx], 6), round(boot[high_idx], 6)) def solution(n): if n < 0: return None if n == 0: return (1, 1) return (1, pow(n, n)) def solution(rows, min_valid=5, tail_ratio=25.0, imbalance_threshold=0.6): valid = [] for country, region, actual, predicted in rows: actual = float(actual) predicted = float(predicted) if actual > 0.0 and predicted >= 0.0: valid.append((country, region, actual, predicted)) if not valid: return ['insufficient_data'] recommendations = set() n = len(valid) if n < min_valid: recommendations.add('collect_more_data') errors = [] region_counts = {} region_countries = {} for country, region, actual, predicted in valid: rel = (predicted - actual) / actual errors.append(rel * rel) region_counts[region] = region_counts.get(region, 0) + 1 region_countries.setdefault(region, set()).add(country) errors.sort() m = len(errors) if m % 2 == 1: median = errors[m // 2] else: median = (errors[m // 2 - 1] + errors[m // 2]) / 2.0 max_error = errors[-1] if (median == 0.0 and max_error > 0.0) or (median > 0.0 and max_error / median >= tail_ratio): recommendations.add('winsorize_or_log_transform') recommendations.add('report_robust_metric') for region, countries in region_countries.items(): if region_counts[region] >= 2 and len(countries) >= 2: recommendations.add('cluster_bootstrap_by_region') break if len(region_counts) >= 2: max_share = max(region_counts.values()) / n if max_share > imbalance_threshold: recommendations.add('stratified_bootstrap_by_region') return sorted(recommendations)

Quick Overview

Solution

Hints

Part 2: Bootstrap a 95% CI for pRMSE with deterministic resampling

Constraints

Examples

Solution

Hints

Part 3: Exact probability that a bootstrap resample matches the original sample in order

Constraints

Examples

Solution

Hints

Part 4: Detect bootstrap risks for pRMSE and recommend mitigations

Constraints

Examples

Solution

Hints

Quick Overview