How do I practice coding and algorithm questions?

Use PracHub's coding console to write, test, and debug your solutions in Python or JavaScript. View hints, test against sample inputs, and compare with official solutions.

What difficulty level is this coding question?

This is a hard difficulty Coding & Algorithms question, commonly asked during Technical Screen rounds at Anthropic.

What role is this question designed for?

This question is commonly asked for Software Engineer candidates at Anthropic during technical interviews.

Find duplicate files and apply image operations

Quick Overview

This question evaluates skills in string parsing and content-based grouping for duplicate file detection, along with matrix manipulation for image operations like horizontal flips and box blur, covering competencies in algorithm design, data-structure selection, and numerical array transformations.

Company: Anthropic

Role: Software Engineer

Category: Coding & Algorithms

Difficulty: hard

Interview Round: Technical Screen

## Part A — Find duplicate files by content You are given a list of directory records. Each record is a string describing a directory path followed by one or more files in that directory, where each file is described as `name(content)`. Example record: - `"root/a 1.txt(abcd) 2.txt(efgh)"` ### Task Return all groups of **duplicate files**, where two files are duplicates if they have **exactly the same content**. Each group should contain the **full paths** of all files that share that content (only include groups with at least 2 files). ### Input - `paths`: an array of strings, each formatted as: - `dir file1(content1) file2(content2) ...` ### Output - A list of groups (each group is a list of strings), where each string is a full file path like `"root/a/1.txt"`. - Order of groups and order within a group do not matter. ### Constraints (reasonable interview defaults) - `1 <= paths.length <= 2*10^4` - Total number of files across all records can be large; aim for near-linear time in total input size. --- ## Part B — Image processing operations (flip & blur) You are given a grayscale image represented as a 2D matrix `img` of integers (e.g., `0..255`). ### Task Implement the following operations: 1. **Horizontal flip**: reverse each row. 2. **Box blur** with radius 1: each pixel becomes the average of itself and all valid neighbors in the 3×3 window centered at that pixel (use only in-bounds pixels). Use integer division/floor for the average. ### Input - `img`: `H x W` integer matrix - An operation sequence (e.g., `["FLIP", "BLUR"]`) indicating the order to apply operations. ### Output - The resulting image matrix after applying all operations in order. ### Constraints (reasonable interview defaults) - `1 <= H, W <= 2000` - Discuss time and space tradeoffs; avoid unnecessary extra full-size copies when possible.

Quick Answer: This question evaluates skills in string parsing and content-based grouping for duplicate file detection, along with matrix manipulation for image operations like horizontal flips and box blur, covering competencies in algorithm design, data-structure selection, and numerical array transformations.

Part 1: Find Duplicate Files by Content

You are given a list of directory records. Each record starts with a directory path, followed by one or more file descriptions in the form `name(content)`. Two files are considered duplicates if their contents are exactly the same. Return every group of duplicate files as full file paths. For deterministic output in this problem, sort each group lexicographically, then sort the list of groups lexicographically.

Constraints

`1 <= len(paths) <= 2 * 10^4`
Each record contains a directory followed by one or more file descriptions.
File names and file contents contain no spaces.
Aim for near-linear processing in the total size of the input.

Examples

Input: (['root/a 1.txt(abcd) 2.txt(efgh)', 'root/c 3.txt(abcd)', 'root/c/d 4.txt(efgh)', 'root 4.txt(efgh)'],)

Expected Output: [['root/4.txt', 'root/a/2.txt', 'root/c/d/4.txt'], ['root/a/1.txt', 'root/c/3.txt']]

Explanation: There are two duplicated contents: `efgh` and `abcd`.

Input: (['root/a 1.txt(abcd)'],)

Expected Output: []

Explanation: Edge case: only one file, so there are no duplicates.

Input: (['dir a.txt(x) b.txt(x) c.txt(y)', 'dir2 d.txt(y)'],)

Expected Output: [['dir/a.txt', 'dir/b.txt'], ['dir/c.txt', 'dir2/d.txt']]

Explanation: Two different contents each produce one duplicate group.

Input: (['root a.txt() b.txt()', 'other c.txt(z)'],)

Expected Output: [['root/a.txt', 'root/b.txt']]

Explanation: Edge case: empty content is still valid content, so the two files are duplicates.

Hints

Use a hash map from file content to a list of full paths.
For each file token, split at the first `(` to get the file name, and remove the final `)` to get the content.

Part 2: Apply Flip and Blur Operations to a Grayscale Image

You are given a grayscale image as a 2D integer matrix `img` and a sequence of operations. Support two operations: `FLIP`, which reverses every row horizontally, and `BLUR`, which replaces each pixel with the floor of the average of all valid cells in its 3x3 neighborhood centered at that pixel. Apply the operations in the given order and return the final image.

Constraints

`1 <= H, W <= 2000`
`0 <= len(operations) <= 100`
Each pixel value is an integer.
Each operation is either `FLIP` or `BLUR`.

Examples

Input: ([[1, 2, 3], [4, 5, 6]], ['FLIP'])

Expected Output: [[3, 2, 1], [6, 5, 4]]

Explanation: Each row is reversed.

Input: ([[1, 2, 3], [4, 5, 6], [7, 8, 9]], ['BLUR'])

Expected Output: [[3, 3, 4], [4, 5, 5], [6, 6, 7]]

Explanation: Each cell becomes the floor of the average of itself and all valid neighbors.

Input: ([[10, 20, 30], [40, 50, 60]], ['FLIP', 'BLUR'])

Expected Output: [[40, 35, 30], [40, 35, 30]]

Explanation: First reverse each row, then blur the transformed image.

Input: ([[7]], ['BLUR', 'FLIP', 'BLUR'])

Expected Output: [[7]]

Explanation: Edge case: a 1x1 image is unchanged by both operations.

Input: ([[0, 255], [128, 64]], [])

Expected Output: [[0, 255], [128, 64]]

Explanation: Edge case: no operations means the image stays the same.

Hints

A horizontal flip can be done in-place by reversing each row.
For `BLUR`, a 2D prefix-sum table lets you compute each 3x3 neighborhood sum in O(1), making each blur pass O(H * W).

Quick Overview