This question evaluates debugging and practical implementation skills for transformer-based text classification, covering model correctness, training loop integrity, data preprocessing, and evaluation metrics within the Machine Learning / Natural Language Processing domain.
You inherit a small codebase for a transformer-based text classifier. There are four failing unit tests: two correspond to previously documented ("known") issues; two are unexpected ("novel"). Your task is to make the model train and evaluate correctly, and to demonstrate a robust training/evaluation pipeline on a labeled dataset.
Assumptions (to make the task self-contained):
text
(string) and
label
(int), single-label classification with K classes.
Login required