What difficulty level is this coding question?

This is a medium difficulty Coding & Algorithms question, commonly asked during Technical Screen rounds at Google.

What role is this question designed for?

This question is commonly asked for Software Engineer candidates at Google during technical interviews.

Find top/bottom-k words in list or stream

Company: Google

Role: Software Engineer

Category: Coding & Algorithms

Difficulty: medium

Interview Round: Technical Screen

You are given words (strings) either as a finite list or as an unbounded stream.

1) **List version**: Given an array `words` and an integer `k`, return:
   - the `k` most frequent distinct words, and
   - the `k` least frequent distinct words.

   Specify how you break ties (e.g., lexicographically, or any order).

2) **Stream version (online)**: Design a data structure that supports:
   - `ingest(word)`: process the next word in the stream
   - `topK(k)`: return the `k` most frequent distinct words seen so far
   - `bottomK(k)`: return the `k` least frequent distinct words seen so far

   Discuss time/space complexity and how you would implement these operations efficiently.

3) **Variant (discuss/design)**: Extend the stream design to support returning the **first `k`** and/or **last `k`** words that are currently **non-repeating** (i.e., words with frequency exactly 1) according to stream arrival order.

Quick Answer: This question evaluates skills in frequency counting, top-k and bottom-k selection, streaming algorithms, and data-structure design for maintaining order, tie-breaking, and dynamic updates.

Part 1: Top and Bottom K Frequent Words in a List

Given an array of strings words and a non-negative integer k, return two lists: the k most frequent distinct words and the k least frequent distinct words. Use deterministic tie-breaking: if two words have the same frequency, the lexicographically smaller word comes first. If k is larger than the number of distinct words, return all distinct words.

Constraints

0 <= len(words) <= 2 * 10^5
0 <= k <= 2 * 10^5
Each word is a non-empty string
Comparison is case-sensitive unless otherwise stated

Examples

Input: (['i', 'love', 'leetcode', 'i', 'love', 'coding'], 2)

Expected Output: [['i', 'love'], ['coding', 'leetcode']]

Explanation: `i` and `love` both appear twice, so lexicographical order gives `['i', 'love']`. `coding` and `leetcode` both appear once, so lexicographical order gives `['coding', 'leetcode']`.

Input: (['apple', 'banana', 'apple', 'cherry', 'banana', 'date'], 3)

Expected Output: [['apple', 'banana', 'cherry'], ['cherry', 'date', 'apple']]

Explanation: Frequencies are: apple=2, banana=2, cherry=1, date=1. Top 3 are ordered by frequency descending, then lexicographically: `apple`, `banana`, `cherry`. Bottom 3 are ordered by frequency ascending, then lexicographically: `cherry`, `date`, `apple`.

Hints

Count how many times each distinct word appears before deciding the answer.
You can sort the distinct words twice: once by (-frequency, word) and once by (frequency, word).
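
The two hints above can be sketched in Python as follows. This is a minimal illustration, not an official solution: the function name `top_bottom_k` is an assumption made for this example, and it simply counts with `collections.Counter` and sorts the distinct words twice.

```python
from collections import Counter


def top_bottom_k(words, k):
    """Return [top_k, bottom_k]; ties are broken lexicographically ascending."""
    freq = Counter(words)  # distinct word -> number of occurrences
    # Most frequent first; equal counts fall back to ascending word order.
    top = sorted(freq, key=lambda w: (-freq[w], w))[:k]
    # Least frequent first; same lexicographic tie-break.
    bottom = sorted(freq, key=lambda w: (freq[w], w))[:k]
    # Slicing with [:k] returns every distinct word when k exceeds their number.
    return [top, bottom]
```

With n input words and d distinct words, this sketch runs in O(n + d log d) time and O(d) extra space. When k is much smaller than d, `heapq.nsmallest` with the same sort keys is one possible way to avoid the full sorts.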

Part 2: Online Top/Bottom K Words from a Stream

You are given a sequence of stream operations. Each operation is a 2-element list of strings: ['ingest', word], ['topK', str(k)], or ['bottomK', str(k)]. Process the operations in order. For each query, return the k most frequent or k least frequent distinct words seen so far. Ties are broken lexicographically ascending. If k is larger than the number of distinct words seen so far, return all distinct words.

Constraints

1 <= len(operations) <= 2 * 10^5
Each operation is one of ['ingest', word], ['topK', str(k)], or ['bottomK', str(k)]
0 <= k <= 2 * 10^5
Queries may appear before any word is ingested

Examples

Input: [['ingest', 'apple'], ['ingest', 'banana'], ['ingest', 'apple'], ['topK', '2'], ['bottomK', '2']]

Expected Output: [['apple', 'banana'], ['banana', 'apple']]

Explanation: After the ingests, counts are apple=2 and banana=1. topK(2) returns the most frequent first, while bottomK(2) returns the least frequent first.

Input: [['topK', '3'], ['ingest', 'dog'], ['ingest', 'cat'], ['ingest', 'cat'], ['topK', '2'], ['bottomK', '2'], ['topK', '5']]

Expected Output: [[], ['cat', 'dog'], ['dog', 'cat'], ['cat', 'dog']]

Explanation: The first query happens before any words are seen, so it returns []. Later, cat has frequency 2 and dog has frequency 1.

Hints

Keep an exact frequency map for all words seen so far.
A max-heap and a min-heap with lazy deletion (outdated entries are discarded only when they are popped) let you support online updates without fully re-sorting all words on every query.
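
One way the two-heap idea might look in Python is sketched below. The class name `WordStream`, its method names, and the `run_stream` driver are assumptions made for this illustration. Every ingest pushes a fresh entry onto both heaps, and an entry is treated as stale when its recorded count no longer matches the frequency map.

```python
import heapq
from collections import defaultdict


class WordStream:
    def __init__(self):
        self.freq = defaultdict(int)  # exact count per word
        self.max_heap = []            # entries are (-count, word)
        self.min_heap = []            # entries are (count, word)

    def ingest(self, word):
        self.freq[word] += 1
        count = self.freq[word]
        # Older entries for this word stay in the heaps and become stale.
        heapq.heappush(self.max_heap, (-count, word))
        heapq.heappush(self.min_heap, (count, word))

    def _take(self, heap, k, sign):
        live = []
        while heap and len(live) < k:
            key, word = heapq.heappop(heap)
            # Lazy deletion: keep the entry only if it matches the current count.
            if sign * key == self.freq[word]:
                live.append((key, word))
        for entry in live:
            heapq.heappush(heap, entry)  # restore live entries for later queries
        return [word for _, word in live]

    def top_k(self, k):
        return self._take(self.max_heap, k, -1)

    def bottom_k(self, k):
        return self._take(self.min_heap, k, 1)


def run_stream(operations):
    """Process ['ingest', word] / ['topK', str(k)] / ['bottomK', str(k)] operations."""
    stream, results = WordStream(), []
    for op, arg in operations:
        if op == 'ingest':
            stream.ingest(arg)
        elif op == 'topK':
            results.append(stream.top_k(int(arg)))
        else:
            results.append(stream.bottom_k(int(arg)))
    return results
```

In this sketch, tuple comparison supplies the lexicographic tie-break, `ingest` costs O(log m) for a heap of m entries, and a query costs O((k + s) log m), where s is the number of stale entries it discards. The heaps can grow to one entry per ingest, so space is proportional to the number of ingests rather than the number of distinct words; a balanced tree or frequency-bucket structure is an alternative if that matters.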

Part 3: First/Last K Non-Repeating Words in a Stream

You are given a sequence of stream operations. Each operation is a 2-element list of strings: ['ingest', word], ['firstK', str(k)], or ['lastK', str(k)]. A word is non-repeating if its frequency is exactly 1 among all words seen so far. For each query, return the first k or last k currently non-repeating words according to stream arrival order. For lastK, return the selected words in arrival order, not reversed. If fewer than k words are currently non-repeating, return all of them.

Constraints

1 <= len(operations) <= 2 * 10^5
Each operation is one of ['ingest', word], ['firstK', str(k)], or ['lastK', str(k)]
0 <= k <= 2 * 10^5
Queries may appear before any word is ingested

Examples

Input: [['ingest', 'a'], ['ingest', 'b'], ['ingest', 'a'], ['ingest', 'b'], ['ingest', 'c'], ['ingest', 'd'], ['firstK', '2'], ['ingest', 'c'], ['ingest', 'e'], ['lastK', '2']]

Expected Output: [['c', 'd'], ['d', 'e']]

Explanation: After the first six ingests, only 'c' and 'd' are non-repeating, so firstK 2 returns ['c', 'd']. After ingesting another 'c' and then 'e', the non-repeating words are ['d', 'e'], so lastK 2 returns them in arrival order.

Input: [['firstK', '3'], ['ingest', 'x'], ['ingest', 'x'], ['firstK', '1'], ['lastK', '2']]

Expected Output: [[], [], []]

Explanation: The first query happens before any ingest, so it returns []. After 'x' is ingested twice, it is repeating, so there are no non-repeating words for the remaining queries.

Hints

When a word appears for the first time, it becomes non-repeating. When it appears for the second time, it must be removed from the non-repeating order.
A doubly linked list plus a hash map from word to node supports O(1) insertion and removal while preserving arrival order.
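
A compact Python sketch of this idea follows. Instead of a hand-written doubly linked list, it uses `collections.OrderedDict`, which keeps insertion order and supports O(1) insertion and deletion by key; that substitution, along with the names `NonRepeatingTracker` and `run_non_repeating`, is an assumption made for this illustration.

```python
from collections import OrderedDict
from itertools import islice


class NonRepeatingTracker:
    def __init__(self):
        self.count = {}              # word -> occurrences so far
        self.unique = OrderedDict()  # words with count exactly 1, in arrival order

    def ingest(self, word):
        seen = self.count.get(word, 0) + 1
        self.count[word] = seen
        if seen == 1:
            self.unique[word] = None  # first appearance: append at the tail
        elif seen == 2:
            del self.unique[word]     # second appearance: unlink in O(1)
        # Third and later appearances change nothing in the unique order.

    def first_k(self, k):
        return list(islice(self.unique, k))

    def last_k(self, k):
        tail = list(islice(reversed(self.unique), k))
        tail.reverse()  # report in arrival order, not reversed
        return tail


def run_non_repeating(operations):
    """Process ['ingest', word] / ['firstK', str(k)] / ['lastK', str(k)] operations."""
    tracker, results = NonRepeatingTracker(), []
    for op, arg in operations:
        if op == 'ingest':
            tracker.ingest(arg)
        elif op == 'firstK':
            results.append(tracker.first_k(int(arg)))
        else:
            results.append(tracker.last_k(int(arg)))
    return results
```

Under these assumptions, `ingest` is O(1) on average and each query is O(k), since it walks at most k entries from one end of the ordered structure.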
