Debug a Broken Transformer
Company: OpenAI
Role: Machine Learning Engineer
Category: Machine Learning
Difficulty: Medium
Interview Round: Technical Screen
Quick Answer: This question evaluates practical debugging and model-engineering skills for Transformer implementations. It covers tokenization and labels, tensor shapes, attention masking, positional encodings, train versus eval modes, loss and gradient flow, optimizer configuration, and adapting a model for classification, all within the deep learning (NLP) domain. It is commonly asked to assess end-to-end troubleshooting and implementation-level reasoning. The emphasis is on practical application, with some conceptual understanding of model internals required.
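Two of the listed areas, labels and attention masking, are frequent sources of silent bugs, so a small sanity check can help make them concrete. The following is a minimal sketch in plain Python, not taken from the original question: it assumes a decoder-style next-token objective, and the names `PAD_ID`, `IGNORE_INDEX`, `shift_labels`, and `causal_padding_mask` are hypothetical and chosen only for illustration.

```python
IGNORE_INDEX = -100  # assumption: label value the loss is configured to skip
PAD_ID = 0           # assumption: hypothetical padding token id


def shift_labels(input_ids, pad_id=PAD_ID, ignore_index=IGNORE_INDEX):
    """Build next-token labels: position t is trained to predict token t+1."""
    # Shift left by one; the final position has no target, so pad it.
    labels = input_ids[1:] + [pad_id]
    # Padding targets must not contribute to the loss.
    return [ignore_index if tok == pad_id else tok for tok in labels]


def causal_padding_mask(input_ids, pad_id=PAD_ID):
    """mask[q][k] is True when query position q may attend to key position k."""
    n = len(input_ids)
    return [
        # Allow only current-or-earlier keys that are not padding.
        [k <= q and input_ids[k] != pad_id for k in range(n)]
        for q in range(n)
    ]


ids = [5, 7, 9, 0, 0]            # three real tokens followed by two pads
labels = shift_labels(ids)       # [7, 9, -100, -100, -100]
mask = causal_padding_mask(ids)

# Every query row should keep at least one visible key; a fully masked
# row typically turns into NaNs after softmax.
assert all(any(row) for row in mask)
```

Checking these two structures by hand on a tiny sequence is often a quick way to catch off-by-one label shifts or inverted masks before looking at shapes, modes, and optimizer settings. Note that this sketch treats any token equal to `PAD_ID` as padding, which only holds if the padding id is never used for a real token.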