Optimize Job Routing in Parallel Machine Scheduling

Q: Optimize Job Routing in Parallel Machine Scheduling

This question tests online scheduling and algorithmic decision-making: routing jobs to parallel machines so that average flow time is minimized.

Q: What difficulty level is this coding question?

This is a Medium difficulty Coding & Algorithms question, commonly asked during Onsite rounds at TikTok.

Q: What role is this question designed for?

This question is commonly asked for Data Scientist candidates at TikTok during technical interviews.

Question

##### Scenario

In the Production Factory game, jobs with varying processing times arrive, and each must be routed to one of two parallel machines so that total completion time is minimized.

##### Question

Design an algorithm that decides on-the-fly which machine each incoming job should occupy to keep average flow-time minimal.

##### Hints

Think greedy (shortest-processing-time first) and priority queues; discuss time-complexity.

PracHub · Accepted Answer

from typing import List
import heapq

def minimal_total_completion_time(times: List[int]) -> int:
    """Return the minimum sum of completion times on two identical machines.

    Assumes every job in `times` is available at time 0.
    """
    if not times:
        return 0
    # Min-heap of machine next-available times (both machines start idle at 0)
    avail = [0, 0]
    total = 0
    # Shortest processing time first
    for p in sorted(times):
        t = heapq.heappop(avail)  # machine that frees up earliest
        finish = t + p
        total += finish
        heapq.heappush(avail, finish)
    return total
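
As a quick sanity check, the same loop can be traced on a small input. The sketch below is illustrative only: `spt_trace` and its `machines` parameter are hypothetical names not in the accepted answer, and the input `[4, 2, 7, 1]` is made up. It records which machine each job lands on, assuming all jobs are available at time 0.

```python
import heapq
from typing import List, Tuple

def spt_trace(times: List[int], machines: int = 2) -> Tuple[int, List[Tuple[int, int, int]]]:
    """Return (total completion time, [(processing_time, machine, finish), ...]).

    Hypothetical helper: same SPT loop as the accepted answer, but it also
    records the machine index chosen for each job.
    """
    # Heap entries are (next_free_time, machine_index); ties go to the lower index.
    avail = [(0, m) for m in range(machines)]
    heapq.heapify(avail)
    total = 0
    trace = []
    for p in sorted(times):
        free_at, m = heapq.heappop(avail)
        finish = free_at + p
        total += finish
        trace.append((p, m, finish))
        heapq.heappush(avail, (finish, m))
    return total, trace

total, trace = spt_trace([4, 2, 7, 1])
for p, m, finish in trace:
    print(f"job p={p} -> machine {m}, finishes at t={finish}")
print("total completion time:", total)  # 1 + 2 + 5 + 9 = 17
```

Machine 0 runs the jobs of length 1 and 4, machine 1 runs the jobs of length 2 and 7, and the average completion time is 17 / 4 = 4.25.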

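The solution treats the full job list as known up front, with every job available at time 0. Under that assumption each job's flow time equals its completion time, so minimizing the sum of completion times also minimizes average flow time. Sorting by processing time (SPT) and always assigning the next shortest job to the machine that becomes available earliest yields an optimal schedule for identical machines under this objective. We simulate this with a min-heap of machine availability times: for each job p in ascending order, pop the earliest available time t, set the job's completion time to t + p, add it to the total, and push t + p back. With two machines each heap operation is O(log 2) = O(1), so sorting dominates and the overall cost is O(n log n) time and O(n) extra space for the sorted copy.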
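
The question asks for an on-the-fly decision, whereas sorting needs the whole job list in advance. If jobs are instead revealed one at a time, one possible greedy adaptation is to send each arriving job to the machine that frees up earliest. The sketch below is a heuristic under stated assumptions, not the accepted answer: `route_online` and the `(arrival_time, processing_time)` input format are hypothetical, jobs are non-preemptive, each machine serves its jobs in arrival order, and no optimality guarantee is claimed.

```python
import heapq
from typing import Iterable, List, Tuple

def route_online(jobs: Iterable[Tuple[int, int]], machines: int = 2) -> Tuple[List[int], float]:
    """Greedy online routing (hypothetical sketch).

    jobs: (arrival_time, processing_time) pairs in non-decreasing arrival order.
    Returns (machine index chosen for each job, average flow time).
    """
    # Heap entries are (next_free_time, machine_index).
    avail = [(0, m) for m in range(machines)]
    heapq.heapify(avail)
    choices: List[int] = []
    total_flow = 0
    for arrival, p in jobs:
        free_at, m = heapq.heappop(avail)   # machine that frees up earliest
        start = max(arrival, free_at)       # wait if that machine is still busy
        finish = start + p
        total_flow += finish - arrival      # flow time = completion - arrival
        choices.append(m)
        heapq.heappush(avail, (finish, m))
    avg_flow = total_flow / len(choices) if choices else 0.0
    return choices, avg_flow

choices, avg_flow = route_online([(0, 5), (1, 2), (2, 1), (6, 3)])
print(choices, avg_flow)  # [0, 1, 1, 1] 3.0
```

Each routing decision costs O(log m) for m machines, which is O(1) for the two-machine case. Because each assignment is made when the job arrives and is never revisited, a long job that arrives early can still delay shorter jobs queued behind it on the same machine.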
