Explain train-test generalization gap | Bytedance Interview Question