Explain LLM training, RL, and evaluation
Company: Cohere
Role: Machine Learning Engineer
Category: Machine Learning
Difficulty: medium
Interview Round: Onsite
Quick Answer: This question evaluates understanding of the full large language model lifecycle: pre-training, supervised fine-tuning, preference optimization, and reinforcement learning–based post-training, along with reward design, optimization stability, common failure modes, and evaluation metrics.
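
Illustrative sketch: the stages named above can be made concrete with a few small formulas. The block below is a minimal, hedged sketch and not a reference implementation. It assumes per-token log-probabilities are already available as plain floats, and it uses Direct Preference Optimization (DPO) as one common example of preference optimization, which is an assumption since the question does not name a specific method. All function names and the `beta` default are hypothetical choices for illustration.

```python
import math


def token_nll(token_logprobs):
    """Mean negative log-likelihood per token.

    This is the next-token cross-entropy objective used in pre-training
    and (on curated demonstrations) in supervised fine-tuning.
    """
    return -sum(token_logprobs) / len(token_logprobs)


def perplexity(token_logprobs):
    """Perplexity = exp(mean NLL); a standard intrinsic evaluation metric."""
    return math.exp(token_nll(token_logprobs))


def dpo_loss(pi_chosen, pi_rejected, ref_chosen, ref_rejected, beta=0.1):
    """DPO loss for a single (chosen, rejected) preference pair.

    Arguments are sequence-level log-probabilities under the trained
    policy (pi_*) and a frozen reference model (ref_*). `beta` scales
    how strongly the policy is pushed away from the reference; 0.1 is
    an assumed illustrative value.
    """
    margin = beta * ((pi_chosen - ref_chosen) - (pi_rejected - ref_rejected))
    # -log(sigmoid(margin)) written as a numerically stable softplus(-margin).
    return max(-margin, 0.0) + math.log1p(math.exp(-abs(margin)))
```

When the policy and the reference assign identical log-probabilities, the margin is zero and the DPO loss equals log 2; the loss drops below that as the policy favors the chosen response more than the reference does. In an interview answer, the same three pieces map onto "what is optimized in pre-training/SFT", "how preferences enter the objective", and "how a model is scored".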