This question evaluates a data scientist's understanding of identity management, windowed distinct-count metrics (MAU), and the impact of a one-time ID remapping on measurement bias and data modeling.
A product tracks activity using user_id from login events, and computes MAU as:
d
= number of
distinct user_id
with at least one login in the window
[d-29, d]
(inclusive).
On a single day T, the company performs a one-time rehash of all user IDs:
user_id_old
.
user_id_new
.
user_id
?
Login required