Solve Data Structure Challenges with Python Algorithms

Q: Solve Data Structure Challenges with Python Algorithms

This question evaluates proficiency in fundamental data structure manipulation and string-processing algorithms, specifically preserving insertion order during list deduplication and performing iterative overlap-based word merging.

Q: How do I practice coding and algorithm questions?

Use PracHub's coding console to write, test, and debug your solutions in Python or JavaScript. View hints, test against sample inputs, and compare with official solutions.

Q: What difficulty level is this coding question?

This is a Medium difficulty Coding & Algorithms question, commonly asked during Technical Screen rounds at Yahoo.

Q: What role is this question designed for?

This question is commonly asked for Data Scientist candidates at Yahoo during technical interviews.

Question

##### Scenario

You are given coding challenges in Python to manipulate simple data structures.

##### Question

Remove duplicates from a list of integers while preserving the original order. 2. Given a list of lowercase words, iteratively merge them into one long word by overlapping the last character of the current word with the first character of the next word when they are the same. Example: ['abc','rgn','ctr'] ➜ 'abctrgn'. Handle edge cases such as no overlap or multiple possible overlaps.

##### Hints

For Q1, collections.OrderedDict keeps insertion order; for Q2, simulate building the string and carefully update indices.

PracHub · Accepted Answer

from collections import deque

def merge_words(words):
    n = len(words)
    if n == 0:
        return ""

# Map from starting character to queue of indices in original order
    char_to_indices = {chr(c): deque() for c in range(ord('a'), ord('z') + 1)}
    for i, w in enumerate(words):
        # words[i] are non-empty by constraints
        char_to_indices[w[0]].append(i)

used = [False] * n
    res = words[0]
    used[0] = True
    used_count = 1

# Pointer to earliest unused index for fallback
    p = 0
    while p < n and used[p]:
        p += 1

last_char = res[-1]

while used_count < n:
        dq = char_to_indices.get(last_char)
        chosen = None
        if dq is not None:
            # Discard used indices lazily
            while dq and used[dq[0]]:
                dq.popleft()
            if dq:
                chosen = dq.popleft()
                used[chosen] = True
                used_count += 1
                # Overlap by 1 character
                res += words[chosen][1:]
                last_char = words[chosen][-1]
                continue

# No match: take earliest unused by index
        while p < n and used[p]:
            p += 1
        if p >= n:
            break
        chosen = p
        used[chosen] = True
        used_count += 1
        res += words[chosen]
        last_char = words[chosen][-1]

return res

We simulate the construction greedily while ensuring determinism via the earliest-index tie-breaker. For O(1) amortized selection of the next candidate, we maintain for each letter a queue (deque) of word indices that start with that letter in original order. At each step, we first try to pick the earliest unused index from the queue for the current last character; if none exists, we fall back to the earliest unused index overall (tracked with a moving pointer). We append only word[1:] when overlapping to avoid duplicating the shared character. Lazy removal of used indices from the deques keeps the total work linear.

Quick Overview

Quick Overview

Quick Overview