This question evaluates a candidate's competence in statistical inference and experimental design, focusing on distributional comparison, hypothesis testing, and sampling methodology within the Statistics & Math domain for a Data Scientist role.
You have two datasets, collected either from two versions of a system or from two cities. The variable of interest may be continuous (possibly skewed or heavy-tailed) or categorical, and the two sample sizes may differ.
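To make the setup concrete, here is a minimal sketch of one distribution-free way to compare two such samples: a permutation test on the difference in medians, which does not assume normality or equal sample sizes. The data, the names `city_a` and `city_b`, the helper `permutation_test`, and the choice of the median as the statistic are all illustrative assumptions, not part of the question.

```python
import random
import statistics


def permutation_test(a, b, stat=statistics.median, n_perm=2000, seed=0):
    """Two-sided permutation p-value for stat(a) - stat(b).

    Hypothetical helper for illustration; works for unequal sample sizes.
    """
    rng = random.Random(seed)
    observed = stat(a) - stat(b)
    pooled = list(a) + list(b)
    n_a = len(a)
    extreme = 0
    for _ in range(n_perm):
        # Reassign group labels at random by shuffling the pooled sample.
        rng.shuffle(pooled)
        diff = stat(pooled[:n_a]) - stat(pooled[n_a:])
        if abs(diff) >= abs(observed):
            extreme += 1
    # Add-one correction keeps the estimated p-value strictly positive.
    return observed, (extreme + 1) / (n_perm + 1)


# Assumed synthetic data: skewed (log-normal) samples of unequal size.
data_rng = random.Random(42)
city_a = [data_rng.lognormvariate(0.0, 1.0) for _ in range(300)]
city_b = [data_rng.lognormvariate(0.5, 1.0) for _ in range(120)]

observed, p_value = permutation_test(city_a, city_b)
print(f"median difference = {observed:.3f}, p = {p_value:.4f}")
```

The statistic is pluggable: passing `stat=statistics.mean` tests a difference in means instead. For a categorical variable, a different statistic (for example, a difference in category proportions) or a chi-square test of homogeneity would be the usual substitute.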