This question evaluates a candidate's competence in designing scalable, reliable batch inference pipelines for machine learning, covering model artifact management, feature and input versioning, job scheduling and parallelization, output delivery, and operational concerns such as retries, idempotency, backfills, and monitoring.
You need to generate predictions for a very large offline dataset, such as all users or all products, once per day using an already trained machine learning model. Explain how you would design a batch inference pipeline.
Your answer should cover: