This question evaluates a candidate's competence in machine-learning data preprocessing, scikit-learn-style transformer API design, unit testing, and code quality assessment including PEP-8 and maintainability concerns.

You are given a Python module that implements one OutlierHandler class and three Imputer classes for preprocessing tabular data. The classes appear to be intended for use in machine-learning pipelines (e.g., scikit-learn style), but the code's style is inconsistent and its test coverage is uneven.
Assumptions to make the question self-contained:
from numpy import *
).
Hints: Relate your answers to the scikit-learn transformer API, unit-testing best practices, and PEP-8.
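For orientation, here is a minimal sketch of what the hints refer to. It is not the module under review: MedianImputer and its test are hypothetical names chosen for illustration, and the class only imitates scikit-learn's conventions (fit returns self, learned attributes end in an underscore, transform leaves its input unmodified) without inheriting from any scikit-learn base class. It uses an explicit `import numpy as np`, as PEP-8 discourages wildcard imports.

```python
import numpy as np


class MedianImputer:
    """Hypothetical imputer that imitates scikit-learn's fit/transform conventions."""

    def fit(self, X, y=None):
        X = np.asarray(X, dtype=float)
        # Learned state gets a trailing underscore, per scikit-learn convention.
        self.medians_ = np.nanmedian(X, axis=0)
        # Returning self allows chaining: imputer.fit(X).transform(X)
        return self

    def transform(self, X):
        # np.array copies by default, so the caller's data is not mutated.
        X = np.array(X, dtype=float)
        rows, cols = np.where(np.isnan(X))
        X[rows, cols] = self.medians_[cols]
        return X

    def fit_transform(self, X, y=None):
        return self.fit(X, y).transform(X)


def test_median_imputer_fills_nan_with_column_median():
    X = np.array([[1.0, np.nan], [3.0, 4.0], [np.nan, 6.0]])
    out = MedianImputer().fit_transform(X)
    assert not np.isnan(out).any()
    assert out[2, 0] == 2.0  # median of the observed values 1 and 3
    assert out[0, 1] == 5.0  # median of the observed values 4 and 6
    assert np.isnan(X[0, 1])  # the input array is left untouched
```

The test function is written so that a runner such as pytest could collect it. It checks one behavior per assertion, including the easily missed property that transform does not modify its argument.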