LLM & Generative AI Interview Questions

Q: What is RAG and how does it differ from fine-tuning?

RAG (Retrieval-Augmented Generation) retrieves relevant documents at inference time and provides them as context to the LLM. Fine-tuning modifies the model weights on your data. RAG is better for frequently changing knowledge; fine-tuning is better for teaching the model new skills or styles.

Q: What transformer concepts should I know for interviews?

Understand self-attention, multi-head attention, positional encoding, and the encoder-decoder architecture. Know why attention scales better than RNNs for long sequences. Be able to explain how the key-query-value mechanism works intuitively.

Question 1

LLM & Generative AI Interview Questions

LLM & Generative AI Interview Questions

Common LLM interview patterns

LLM interview questions

Improve classifier with noisy multi-annotator labels

Debug and fix a PyTorch Transformer training loop

Debug transformer and train classifier

Implement 1NN with NumPy

Explain Core ML Interview Concepts

Implement and Debug Backprop in NumPy

Explain Logistic Regression, Backprop, and Adam

Debug a broken Transformer implementation

Filter Bad Human Annotations

Debug a GRPO training loop and explain ratios

Design a Double Descent Experiment

Diagnose Transformer training and inference bugs

Design robber detection from surveillance video

Train a classifier and analyze dataset

Debug a transformer training pipeline

Design sequence decoding with greedy and beam search

Implement and visualize in-place augmentations

Debug a Broken Transformer

Implement and derive backprop from scratch

Evaluate Promotions for Uber Eats Users

Common mistakes in LLM interviews

How LLM questions are evaluated

Related ML concepts

LLM & Generative AI Interview FAQs

What is RAG and how does it differ from fine-tuning?

What transformer concepts should I know for interviews?