OpenAI Machine Learning Interview Questions
Debug and fix a PyTorch Transformer training loop
Minimal Causal LM Debugging and Optimization. Context: You are given a tiny causal decoder-only language model implemented in PyTorch. It appears to "tr...
Implement and Debug Backprop in NumPy
Two-Layer Neural Network: Backpropagation and Gradient Check (NumPy). Context: You are implementing a fully connected two-layer neural network for multi...
Debug a transformer training pipeline
Diagnose a Diverging PyTorch Transformer Training Run. You are given a PyTorch Transformer training pipeline whose loss diverges and validation accurac...
Train a classifier and analyze dataset
End-to-End Binary Classifier Workflow (EDA → Modeling → Fairness → Report). You are given a labeled tabular dataset and asked to implement a reproducib...
Diagnose Transformer training and inference bugs
Debugging a Transformer That Intermittently Throws Shape/Type Errors and Fails to Converge. You are given a Transformer-based sequence model that: - In...
Debug a Machine Learning Pipeline
Debugging a Sudden Accuracy Drop in a Deployed ML Pipeline. Context: You are on call for a production machine learning service. Monitoring alerts show t...
Debug transformer and train classifier
Debug and Fix a Transformer Text Classifier, Then Train and Evaluate It. Context: You inherit a small codebase for a transformer-based text classifier. ...
Debug a transformer training pipeline
Debugging Plan: PyTorch Transformer Text Model with Mask Errors, Metric Plateau, AMP Crashes, and Nondeterminism. Context: You are training a Transforme...
Debug a failing ML classifier
Debugging a Churn Prediction Pipeline With Poor Generalization. Context: You are evaluating a binary churn prediction system with: - Training ROC AUC: 0...
Build and troubleshoot image classification and backprop
CIFAR-like Noisy Dataset: Baseline, Data Quality Plan, and First-Principles Backprop. Context: You have a CIFAR-like dataset of 32×32 RGB images, 10–20...