Find first error and propagate failures

Q: Find first error and propagate failures

This question evaluates proficiency with algorithmic invariants and search techniques, specifically binary search on ordered logs, graph traversal (BFS/DFS) for reachability and longest-path reasoning, cycle handling, and formal analysis of correctness and time/space complexity within the Coding & Algorithms domain.

Q: How do I practice coding and algorithm questions?

Use PracHub's coding console to write, test, and debug your solutions in Python or JavaScript. View hints, test against sample inputs, and compare with official solutions.

Q: What difficulty level is this coding question?

This is a Medium difficulty Coding & Algorithms question, commonly asked during Technical Screen rounds at Snowflake.

Q: What role is this question designed for?

This question is commonly asked for Software Engineer candidates at Snowflake during technical interviews.

Question

You are given a chronological log history where each entry is a string prefixed with one of [Info], [Warn], or [Error]. Two invariants hold:
(
1) once the first [Error] appears, all subsequent entries are [Error];
(
2) the entry immediately before the first [Error] is [Warn]. Task 1 (Binary Search): Return the 0-based index of the first [Error] using O(log n) comparisons; explain the decision rule at mid and prove correctness and time/space complexity. Follow-up 1 (Graph BFS/DFS): You are also given a directed service call graph where an edge A -> B means service A calls (depends on) service B, and a single service s that first starts erroring. If failures propagate upstream (when B fails, every caller of B eventually fails), output the set of all services that will end up in error; design an algorithm using BFS or DFS, specify input/output formats (e.g., adjacency list), and describe how you handle cycles or repeated visits. Follow-up 2 (Graph DFS for longest chain): On the same graph and start service s, output one of the longest error propagation chains starting at s (a longest simple path starting from s). Use DFS to maintain the current path and best-so-far path; describe how you avoid revisiting nodes, and analyze complexity and assumptions needed for tractability.

PracHub · Accepted Answer

from collections import defaultdict, deque from functools import lru_cache import sys def analyze_incidents(logs: list[str], edges: list[tuple[str, str]], s: str) -> tuple[int, list[str], list[str]]: # Task 1: First [Error] index via binary search (lower_bound of predicate is_error) n = len(logs) lo, hi = 0, n # [lo, hi) def is_error(i: int) -> bool: return logs[i].startswith('[Error]') while lo < hi: mid = (lo + hi) // 2 if is_error(mid): hi = mid else: lo = mid + 1 first_error_idx = lo if lo < n and is_error(lo) else -1 # Task 2: Build reverse graph and collect all failing services via BFS from s rev = defaultdict(set) for u, v in edges: rev[v].add(u) # reverse edge: failure propagates from v to u failing = set() dq = deque([s]) while dq: u = dq.popleft() if u in failing: continue failing.add(u) for w in rev.get(u, ()): # callers of u if w not in failing: dq.append(w) failing_sorted = sorted(failing) # Task 3: Longest chain from s in reverse graph (assume DAG). Use DP for length and next-pointer. # Restrict adjacency to reachable (failing) nodes for efficiency/determinism. adj = {u: sorted([v for v in rev.get(u, ()) if v in failing]) for u in failing} # Recursion depth can be as large as longest chain; raise limit conservatively. sys.setrecursionlimit(max(1000000, len(failing) * 2 + 10)) @lru_cache(None) def dp(u: str) -> tuple[int, str | None]: # Returns (best_length_from_u_inclusive, best_next_node) best_len = 1 best_next = None for v in adj.get(u, []): l_v, _ = dp(v) cand_len = 1 + l_v if cand_len > best_len: best_len = cand_len best_next = v elif cand_len == best_len: if best_next is None or v < best_next: best_next = v return best_len, best_next # Reconstruct chain from s using next-pointers; guard against unexpected cycles by halting on repeats. chain = [] seen = set() cur = s while cur is not None and cur not in seen: chain.append(cur) seen.add(cur) _, nxt = dp(cur) cur = nxt return first_error_idx, failing_sorted, chain The log prefix predicate "starts with '[Error]'" is monotone due to the invariants, so a lower_bound binary search yields the first error index in O(log n) time and O(1) space. For failure propagation, reversing each dependency edge (u->v becomes v->u) transforms the problem into reachability from s; BFS or DFS visits every caller reachable upstream in O(V+E). For the longest chain (a longest simple path starting at s) on a DAG, use DFS with memoization to compute the longest path length from each node. To ensure deterministic output among multiple longest chains, choose the next node with the smallest name among those achieving the maximum length. Store only the best length and a next-pointer per node, then reconstruct the path by following next-pointers from s. This yields O(V+E) time and O(V+E) space on the DAG assumption.

Quick Overview

Quick Overview

Quick Overview