Databricks Data Scientist Interview Questions
Practice the exact questions companies are asking right now.
Relate coefficients under linear feature transformation
Answer both parts. Part 1: Coefficients under feature transformation You have original predictors \(x_1, x_2\) and define transformed predictors: - \(...
Find top-5 most similar rows across datasets
You can solve this in SQL or Python. You are given two datasets with the same feature columns: Tables target_rows (rows you want to match) - target_id...
Compare ROC-AUC vs PR-AUC
Explain the difference between ROC-AUC and PR-AUC (Average Precision / area under the Precision–Recall curve). In your answer, cover: 1. What each cur...
Count weekly customers with ≥$1000 YTD spend
You are given a transaction-level table and need to compute a weekly time series. Table transactions - date (DATE) — transaction date (assume UTC and ...
Test if coin is fair from 560 tails
You flip a coin n = 1000 times and observe 560 tails. At significance level α = 0.05, test whether the coin is fair. Requirements: - State the null an...
Calculate Second-Degree Followers for Each YouTuber
following +----------+----------+ | YouTuber | follower | +----------+----------+ | A | B | | A | C | | B | D ...
Explain Linear Regression Feature Transformation Equivalence
Linear Regression Feature Representations and High-Dimensional Modelling Context You are evaluating two linear regression specifications that use diff...