Openai Machine Learning Engineer Interview Questions
Master your tech interview with our curated database of real questions from top companies.
Find earliest supporting version under constraints
You are given version strings formatted as {major}.{minor}.{patch}, e.g., "103.003.03". Each version either supports a feature or not. You may call is...
Debug a transformer training pipeline
Diagnose a Diverging PyTorch Transformer Training Run You are given a PyTorch Transformer training pipeline whose loss diverges and validation accurac...
Train a classifier and analyze dataset
End-to-End Binary Classifier Workflow (EDA → Modeling → Fairness → Report) You are given a labeled tabular dataset and asked to implement a reproducib...
Design an enterprise RAG system
System Design Task: Retrieval-Augmented Generation (RAG) for Enterprise Users You are designing a multi-tenant enterprise RAG system that answers user...
Design an ML search system
Design an ML‑Powered Enterprise Document Search System Context You are designing a multi‑tenant enterprise search system that indexes documents from m...
Diagnose Transformer training and inference bugs
Debugging a Transformer That Intermittently Throws Shape/Type Errors and Fails to Converge You are given a Transformer-based sequence model that: - In...
Derive MLE and Bayesian posterior for Bernoulli
Bernoulli/Binomial Inference Task You observe n independent Bernoulli trials with unknown success probability p, and you record k successes (so K ~ Bi...
Implement vectorized NumPy ops and explain broadcasting
Implement vectorized NumPy code for: (a) computing pairwise cosine similarity between two real-valued matrices X (shape n×d) and Y (shape m×d) without...
Design a production RAG system
Design a Production RAG System for Enterprise Document QA Context You are designing a Retrieval-Augmented Generation (RAG) system to answer questions ...
Design enterprise RAG search system
Design an End-to-End Enterprise RAG Search System Background You are tasked with designing a Retrieval-Augmented Generation (RAG) search system for en...
Find earliest supporting dependency version
Question Given a list of dependency versions (e.g. [103.003.02, 103.003.03, 203.003.02]) and a black-box API isSupported(v), design an algorithm to fi...
Debug transformer and train classifier
Debug and Fix a Transformer Text Classifier, Then Train and Evaluate It Context You inherit a small codebase for a transformer-based text classifier. ...
Design an enterprise RAG system
System Design: Retrieval-Augmented Generation (RAG) for Enterprise Context Design a production-grade, multi-tenant RAG platform for enterprise users. ...
Train and analyze a classifier
Given a labeled dataset for binary classification, implement an end-to-end Python solution to train and analyze a classifier. Tasks: ( 1) perform EDA ...
Debug a transformer training pipeline
Debugging Plan: PyTorch Transformer Text Model with Mask Errors, Metric Plateau, AMP Crashes, and Nondeterminism Context You are training a Transforme...
Design a low-latency RAG system
System Design: Production-Grade RAG for Customer Support (p99 ≤ 1.5 s) Goal Design a production-ready retrieval-augmented generation (RAG) system for ...
Build and troubleshoot image classification and backprop
CIFAR-like Noisy Dataset: Baseline, Data Quality Plan, and First-Principles Backprop Context: You have a CIFAR-like dataset of 32×32 RGB images, 10–20...
Describe handling pressure and present your work
Behavioral Prompt: Delivering Under Severe Time Pressure You are interviewing for a technical role where speed, rigor, and communication matter. Descr...
Design an in-memory database
System Design: In-Memory Key–Value Database for Ultra–Low Latency Context You are designing an in-memory, per-node key–value database optimized for ul...
Design an ML search system with RAG
System Design: ML-Powered Enterprise Search with RAG Design an ML-powered enterprise search system using Retrieval-Augmented Generation (RAG) under th...