This question evaluates competency in statistical inference and multivariate analysis, covering hypothesis formulation about equal distributions, preprocessing and normalization considerations, choice of statistical tests or modeling approaches, and interpretation of significance and effect size.
You are given two groups of users:
Each user has a vector of continuous features (e.g., session duration, click-through rate, purchase conversion rate).
Describe how you would determine whether the two groups differ significantly in their overall multivariate distribution.
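One possible approach, sketched here as an illustration rather than a prescribed answer, is a permutation test on the energy distance between the two groups. The null hypothesis is that both groups are drawn from the same multivariate distribution. The function names (`energy_statistic`, `permutation_energy_test`), the pooled z-score standardization, and the choice of energy distance are all assumptions made for this example; Hotelling's T² (for mean differences under approximate normality), a kernel MMD test, or a classifier two-sample test would be reasonable alternatives.

```python
import numpy as np


def energy_statistic(dist, idx_a, idx_b):
    """Energy distance between two groups, given a precomputed
    pairwise distance matrix over the pooled sample."""
    d_ab = dist[np.ix_(idx_a, idx_b)].mean()
    d_aa = dist[np.ix_(idx_a, idx_a)].mean()
    d_bb = dist[np.ix_(idx_b, idx_b)].mean()
    return 2.0 * d_ab - d_aa - d_bb


def permutation_energy_test(x_a, x_b, n_permutations=999, seed=0):
    """Permutation test of H0: both groups share one multivariate distribution.

    x_a, x_b: arrays of shape (n_users, n_features).
    Returns (observed energy distance, permutation p-value).
    """
    x_a = np.asarray(x_a, dtype=float)
    x_b = np.asarray(x_b, dtype=float)
    pooled = np.vstack([x_a, x_b])

    # Assumption: standardize with pooled statistics so that no single
    # feature dominates the Euclidean distance because of its scale.
    std = pooled.std(axis=0, ddof=1)
    std[std == 0] = 1.0  # guard against constant features
    z = (pooled - pooled.mean(axis=0)) / std

    # Pairwise Euclidean distances, computed once and reused for every
    # permutation. This uses O(n^2 * n_features) memory, so it only
    # suits moderate sample sizes.
    dist = np.linalg.norm(z[:, None, :] - z[None, :, :], axis=-1)

    n_a, n = len(x_a), len(pooled)
    idx = np.arange(n)
    observed = energy_statistic(dist, idx[:n_a], idx[n_a:])

    # Shuffle group labels to build the null distribution of the statistic.
    rng = np.random.default_rng(seed)
    exceed = 0
    for _ in range(n_permutations):
        perm = rng.permutation(n)
        if energy_statistic(dist, perm[:n_a], perm[n_a:]) >= observed:
            exceed += 1

    # The +1 terms count the observed labeling as one permutation,
    # which keeps the p-value strictly positive.
    p_value = (exceed + 1) / (n_permutations + 1)
    return observed, p_value
```

Calling `permutation_energy_test(features_a, features_b)` on two hypothetical `(n_users, n_features)` arrays returns the observed statistic, which serves as a rough effect size, together with a p-value. The test is sensitive to any difference in distribution, not only to a shift in means, so a significant result would normally be followed by per-feature comparisons with a multiple-testing correction to locate where the groups differ.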
Your answer should cover: