This question evaluates proficiency in ML system design, MLOps, and audio data engineering by focusing on inference and labeling pipeline components such as feature extraction, ASR integration, score calibration and thresholding, data contracts, labeling workflows, drift detection, and model/version management for near‑real‑time and batch processing of multilingual noisy audio. Commonly asked in ML System Design interviews, it gauges practical application skills for specifying end-to-end production pipelines while requiring conceptual understanding of operational concerns like latency, class imbalance handling, calibration, and retraining workflows.
Design the machine learning inference and data pipeline for an audio detection system that flags policy-relevant speech and keywords. Assume:
Describe:
Login required