What does the Harvey Software Engineer interview process look like?

Based on candidate reports compiled in this guide, the Harvey Software Engineer loop typically includes 2 stages: Technical Screen, Onsite. Each stage covers a distinct set of topics walked through in detail above.

What topics does Harvey focus on in Software Engineer interviews?

Harvey Software Engineer interviews cover Coding & Algorithms, System Design. The guide above breaks each topic down into core concepts, worked examples, and the real questions candidates were asked.

How many real Harvey Software Engineer interview questions are in this guide?

This guide is anchored to 18 real Harvey Software Engineer interview questions sourced from candidate reports, each linked to a full practice page with starter code, solution discussion, and community comments.

Harvey Software Engineer Interview Prep Guide

Everything Harvey actually asks Software Engineer candidates — concept walkthroughs, worked examples, and the real interview questions, drawn from candidate reports. Free to read.

Harvey Software Engineer Interview Cheatsheet cover

Technical Screen

Coding & Algorithms

In-Memory File System

Clean node-diagram of an in-memory filesystem trie: root → directories → file node, showing Node fields (children map, isFile, content, size), highlighted traversal to /a/b/file.txt, operation labels and complexity callouts.

What's being tested

This tests hierarchical data modeling with a trie/tree of directories and files, plus robust path parsing and name-collision handling. Interviewers are looking for clean object design, edge-case discipline, and ability to reason about `mkdir`, `ls`, `addContentToFile`, `readContentFromFile`, capacity limits, and duplicate names.

Patterns & templates

Trie/tree node model — each `Node` stores `children: Map[str, Node]`, `isFile`, optional `content`, metadata, and size; operations are O(path components).
Path normalization — split on /, ignore empty components, handle root /; avoid bugs from trailing slashes like /a/b/.
Directory traversal helper — implement `getNode(path, create=False)` to centralize validation, auto-creation, and missing-path behavior.
Lexicographic listing — `ls(path)` returns [filename] for files or sorted child names for directories; sorting costs O(k log k).
Content accounting — for constrained systems, track total bytes and per-file size deltas before appending; reject writes that exceed capacity.
Duplicate naming policy — OS-style collisions often require generating `name(1)`, `name(2)`; maintain counters or scan siblings carefully.
Complexity explanation — use d = path depth, k = directory children, c = content length; traversal O(d), append/read affected by c.

Common pitfalls

Pitfall: Treating files and directories as separate maps makes traversal and collision handling harder than a unified `Node` abstraction.

Pitfall: Forgetting that `ls("/a/file.txt")` should usually return only ["file.txt"], not file content or children.

Pitfall: Capacity checks must account for append delta, not total rewritten content unless the API replaces file contents.

Practice these

The practice cards below cover the canonical variants — solve all of them and time yourself.

Practice questions

Harvey AI

Medium

Software Engineer

Design an in-memory file system with limits

Evaluates the design and implement hierarchical in-memory data structures, manage capacity constraints and OS-style duplicate naming, and reason about...

Harvey Software Engineer Interview Prep Guide

Technical Screen

Coding & Algorithms

In-Memory File System

What's being tested

Patterns & templates

Common pitfalls

Practice these

Design an in-memory file system with limits

Design a constrained in-memory file system

Handle multi-source string matching and tagging

Exact Substring Matching And Highlighting

What's being tested

Patterns & templates

Common pitfalls

Practice these

Highlight Exact Source Matches

System Design

Cloud File Storage Service

What's being tested

Core knowledge

Worked example

A second angle

Common pitfalls

Connections

Further reading

Design a Cloud File Storage Service

Design a production file storage service

Design Cloud File Storage

Hashing-Based File Identity

What's being tested

Core knowledge

Worked example

A second angle

Common pitfalls

Connections

Further reading

Determine identical files ignoring metadata

Determine if two files are identical

Design a Cloud File Storage Service

Onsite

Coding & Algorithms

Spreadsheet Formula Engine

What's being tested

Patterns & templates

Common pitfalls

Practice these

Design Spreadsheet Formula Cells

Implement a Formula Spreadsheet

Implement retrieval and evaluation for a simple RAG

Expression Parsing

What's being tested

Patterns & templates

Common pitfalls

Practice these

Evaluate Symbol Expressions

Implement a Cursor-Based Text Editor

Implement a Database Connection Pool

Dependency Graphs

What's being tested

Patterns & templates

Common pitfalls

Practice these

Implement Spreadsheet Cell Updates

Implement a Hierarchical File System

Frequently asked questions

Related interview prep guides