Test classifier difference with McNemar's test | Microsoft Interview Question