This question evaluates a data scientist's competency in building and evaluating binary classifiers for imbalanced datasets, focusing on model design, evaluation metric selection, and performance interpretation.
You have a dataset of 1,000 URLs labeled as good (alive) or bad (dead). The classes are likely imbalanced (e.g., far fewer dead links than good ones).
Hint: A strong baseline is logistic regression with class-imbalance-aware metrics.
Login required