Explain deployment, retrieval, regularization, and RLHF post-training
Company: Bytedance
Role: Data Scientist
Category: Machine Learning
Difficulty: hard
Interview Round: Onsite
You are interviewing for a machine-learning role at a large-scale short-video platform. Answer the following conceptual questions.
1. Under tight GPU compute and VRAM constraints, how would you deploy a multimodal model for tasks such as video retrieval or ranking? Discuss model architecture choices, compression, batching, caching, and how you would trade off quality, p99 latency, throughput, and serving cost.
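A core compression lever under tight VRAM budgets is low-bit weight quantization. The following is a minimal sketch of symmetric per-tensor int8 quantization (the function names and the 4-element weight list are illustrative, not from any particular library): weights are mapped to integers in [-127, 127] via a single scale, cutting memory roughly 4x versus fp32 at the cost of bounded rounding error.

```python
def quantize_int8(weights):
    """Symmetric per-tensor int8 quantization: w ~= scale * q, q in [-127, 127]."""
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    q = [max(-127, min(127, round(w / scale))) for w in weights]
    return q, scale

def dequantize(q, scale):
    """Recover approximate fp values from int8 codes."""
    return [qi * scale for qi in q]

# Toy weight tensor; max rounding error is bounded by half a quantization step.
weights = [0.5, -1.27, 0.03, 1.0]
q, scale = quantize_int8(weights)
recovered = dequantize(q, scale)
max_err = max(abs(w - r) for w, r in zip(weights, recovered))
```

In practice per-channel scales, calibration data, and activation quantization matter too; this only shows the core scale/round/clamp step.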
2. Suppose captions and video embeddings have already been precomputed and stored. How would you accelerate online video retrieval? Discuss indexing, approximate nearest neighbor search, hybrid text-plus-vector retrieval, reranking, memory footprint, and freshness.
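To make the indexing discussion concrete, here is a toy inverted-file (IVF) index in pure Python, the same coarse-quantization idea used by production ANN libraries: each vector is routed to its nearest centroid, and a query scans only the lists of the `nprobe` closest centroids instead of the full corpus. The class name, the fixed 2-D centroids, and the random corpus are assumptions for illustration only.

```python
import heapq
import math
import random

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def normalize(v):
    n = math.sqrt(dot(v, v)) or 1.0
    return [x / n for x in v]

class IVFIndex:
    """Toy IVF index: assign vectors to their nearest centroid's posting
    list; at query time search only the top-nprobe lists exhaustively."""

    def __init__(self, centroids):
        self.centroids = [normalize(c) for c in centroids]
        self.lists = [[] for _ in centroids]  # one posting list per centroid

    def add(self, vid, vec):
        v = normalize(vec)
        c = max(range(len(self.centroids)),
                key=lambda i: dot(v, self.centroids[i]))
        self.lists[c].append((vid, v))

    def search(self, query, k=3, nprobe=1):
        q = normalize(query)
        probe = heapq.nlargest(nprobe, range(len(self.centroids)),
                               key=lambda i: dot(q, self.centroids[i]))
        cands = [(dot(q, v), vid) for i in probe for vid, v in self.lists[i]]
        return heapq.nlargest(k, cands)  # (cosine score, video id) pairs

random.seed(0)
index = IVFIndex(centroids=[[1, 0], [0, 1]])
for vid in range(100):
    index.add(vid, [random.random(), random.random()])
hits = index.search([1.0, 0.1], k=3, nprobe=1)
```

Raising `nprobe` trades latency for recall; a reranker over the returned candidates (e.g. a cross-encoder on full embeddings) would then refine the top-k.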
3. What is overfitting, how would you detect it, and how would you mitigate it in deep learning systems?
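The standard detection signal for overfitting is a widening train-validation gap, usually acted on via early stopping. A minimal sketch (the function name, `patience`, and the toy loss curve are illustrative assumptions): stop once validation loss has failed to improve by `min_delta` for `patience` consecutive epochs, and report the best epoch.

```python
def early_stop(val_losses, patience=3, min_delta=1e-4):
    """Return the best epoch: halt once validation loss has not improved
    by min_delta for `patience` consecutive epochs (overfitting onset)."""
    best, best_epoch, bad = float("inf"), 0, 0
    for epoch, loss in enumerate(val_losses):
        if loss < best - min_delta:
            best, best_epoch, bad = loss, epoch, 0
        else:
            bad += 1
            if bad >= patience:
                return best_epoch
    return best_epoch

# Validation loss bottoms out at epoch 3, then climbs: the classic signature.
val = [1.0, 0.8, 0.7, 0.65, 0.66, 0.7, 0.75, 0.8]
stop = early_stop(val)
```

Mitigations beyond early stopping include more data or augmentation, weight decay, dropout, and reducing model capacity.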
4. Explain the intuition and mathematics of Dropout, including why inverted Dropout keeps the expected activation scale consistent between training and inference.
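The key identity behind inverted Dropout is that scaling surviving activations by 1/(1-p) at training time makes the expectation match the unscaled inference path: E[x_i / (1-p) * Bernoulli(1-p)] = x_i. A small numerical sketch (function name and the all-ones input are illustrative):

```python
import random

def inverted_dropout(x, p, rng):
    """Zero each activation with probability p; scale survivors by 1/(1-p)
    so E[output] equals the input, and inference needs no rescaling."""
    keep = 1.0 - p
    return [(xi / keep) if rng.random() < keep else 0.0 for xi in x]

rng = random.Random(0)
x = [1.0] * 100_000
p = 0.3
y = inverted_dropout(x, p, rng)
mean_train = sum(y) / len(y)  # should hover near 1.0, the inference value
```

With classic (non-inverted) Dropout the scaling by (1-p) is instead applied at inference, which couples serving code to the training hyperparameter; inverted Dropout avoids that.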
5. Compare common normalization methods such as BatchNorm, LayerNorm, GroupNorm, and RMSNorm. When is each appropriate, and how are statistics handled at inference time?
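A minimal numerical sketch of the difference between LayerNorm and RMSNorm (gain/bias parameters omitted for brevity): LayerNorm subtracts the per-example mean and divides by the standard deviation over the feature dimension, while RMSNorm skips mean-centering and divides by the root-mean-square only, which is cheaper and common in modern LLMs. Neither keeps running statistics, which is why, unlike BatchNorm, they behave identically at training and inference.

```python
import math

def layer_norm(x, eps=1e-5):
    """Normalize by mean and variance over the feature dimension."""
    mu = sum(x) / len(x)
    var = sum((xi - mu) ** 2 for xi in x) / len(x)
    return [(xi - mu) / math.sqrt(var + eps) for xi in x]

def rms_norm(x, eps=1e-5):
    """RMSNorm: no mean-centering, divide by root-mean-square only."""
    rms = math.sqrt(sum(xi * xi for xi in x) / len(x) + eps)
    return [xi / rms for xi in x]

x = [2.0, 4.0, 6.0, 8.0]
ln = layer_norm(x)   # zero mean, unit variance
rn = rms_norm(x)     # unit mean-square, mean not removed
```

BatchNorm, by contrast, normalizes over the batch dimension and must freeze running mean/variance estimates at inference; GroupNorm normalizes over channel groups per example, like LayerNorm requiring no running statistics.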
6. Explain how reinforcement learning is used in LLM post-training, especially RLHF. Describe the roles of supervised fine-tuning, preference data, reward modeling, policy optimization, KL regularization, and common failure modes.
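One concrete piece of the RLHF pipeline that a worked example clarifies is the KL-regularized reward used in PPO-style policy optimization: each token receives a penalty proportional to the policy/reference log-probability gap, and the scalar reward-model score is added at the final token. The function name, `beta`, and the three-token log-probabilities below are illustrative assumptions, not a specific library's API.

```python
def rlhf_token_rewards(logp_policy, logp_ref, reward_model_score, beta=0.1):
    """Per-token reward for PPO-style RLHF: -beta * (log pi - log pi_ref)
    at every token, plus the reward-model score on the last token."""
    kl_terms = [lp - lr for lp, lr in zip(logp_policy, logp_ref)]
    rewards = [-beta * kl for kl in kl_terms]
    rewards[-1] += reward_model_score
    return rewards, sum(kl_terms)  # sequence-level KL estimate

# Toy 3-token completion: policy drifts from the reference on tokens 1 and 3.
logp_policy = [-1.0, -0.5, -2.0]
logp_ref    = [-1.2, -0.5, -1.0]
rewards, kl = rlhf_token_rewards(logp_policy, logp_ref, 1.5, beta=0.1)
```

The KL term is what keeps the policy anchored to the SFT reference; with `beta` too small the policy reward-hacks and degenerates, with `beta` too large it barely moves, which connects directly to the "common failure modes" part of the question.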
Quick Answer: These questions evaluate competencies in machine-learning systems engineering, covering multimodal model deployment under GPU/VRAM constraints, scalable video retrieval and indexing, regularization and normalization methods, and reinforcement learning–based post-training for language models.