Nvidia Software Engineer Interview Questions
Practice the exact questions companies are asking right now.
Find Top K Frequent Elements
Question: LeetCode 347, Top K Frequent Elements. Given an integer array nums, return the k most frequent elements. Initial requirement: return the top...
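A minimal sketch of one common approach to this problem, assuming ties may be returned in any order: count occurrences, then pick the k largest counts with a heap. The function name `top_k_frequent` is illustrative and not taken from the question.

```python
from collections import Counter
import heapq


def top_k_frequent(nums, k):
    """Return the k most frequent values in nums, most frequent first."""
    counts = Counter(nums)  # value -> number of occurrences
    # nlargest keeps a heap of size k, so this is O(n log k) overall.
    return heapq.nlargest(k, counts, key=counts.get)
```

A bucket-sort variant (index by frequency) brings this down to O(n) if the interviewer asks for better than O(n log k).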
Solve anagram grouping and in-place allocator
You are asked to solve two coding tasks. 1) Group anagrams: given an array of strings strs, group the strings that are anagrams of each other. - Two st...
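For the first task only, a minimal sketch that keys each string by its sorted characters, so all anagrams land in the same bucket. The name `group_anagrams` is illustrative; the second task (the in-place allocator) is not covered here.

```python
from collections import defaultdict


def group_anagrams(strs):
    """Group strings that are anagrams of each other."""
    groups = defaultdict(list)
    for s in strs:
        # Anagrams share the same multiset of characters, so sorting
        # the characters yields an identical key for each of them.
        key = "".join(sorted(s))
        groups[key].append(s)
    return list(groups.values())
```

Sorting costs O(L log L) per string of length L. A 26-slot character count used as the key would avoid the sort when the input is limited to lowercase ASCII, which is an assumption the question would need to confirm.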
Optimize a small-string C++ class
You are implementing a high-performance C/C++ string type that uses a small-string optimization: short strings are stored inline in a fixed buffer, an...
Implement simple VM manager with CRUD operations
You are asked to implement a simple in-memory virtual machine (VM) manager that can: - List all VMs - Add (create) a new VM - Modify (update) an exist...
Find all unique triplets summing to zero
Problem: Given an integer array nums, return all unique triplets [nums[i], nums[j], nums[k]] such that: - i, j, and k are distinct indices (i != j != k...
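One widely used way to approach this is sorting plus a two-pointer scan, sketched below. The name `three_sum` is illustrative, and the sketch assumes the order of the returned triplets does not matter.

```python
def three_sum(nums):
    """Return all unique triplets from nums that sum to zero."""
    nums = sorted(nums)
    n = len(nums)
    result = []
    for i in range(n - 2):
        if nums[i] > 0:
            break  # smallest remaining value is positive: no zero sum left
        if i > 0 and nums[i] == nums[i - 1]:
            continue  # skip duplicate anchor values
        lo, hi = i + 1, n - 1
        while lo < hi:
            total = nums[i] + nums[lo] + nums[hi]
            if total < 0:
                lo += 1
            elif total > 0:
                hi -= 1
            else:
                result.append([nums[i], nums[lo], nums[hi]])
                lo += 1
                hi -= 1
                # Step past repeated values so each triplet appears once.
                while lo < hi and nums[lo] == nums[lo - 1]:
                    lo += 1
                while lo < hi and nums[hi] == nums[hi + 1]:
                    hi -= 1
    return result
```

The sort is O(n log n) and the scan is O(n^2), with no extra space beyond the sorted copy and the output.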
Decide on competitive comparison policy
Comparative Benchmarking Blog Post: Go/No-Go and Ethical Positioning. You are a software engineer asked whether the company should publish a blog post ...
Design a distributed multi-user counter
Design a Horizontally Scalable Distributed Counter Service. Context: You are designing a distributed counter service used concurrently by many clients. ...
Explain optimization and tensor vs pipeline parallelism
Task: Deep Learning Optimization and Parallelism. You are asked to explain optimization techniques commonly used to improve deep learning training and ...
Design cross-power/clock-domain signal interface
Cross-Power/Clock-Domain Design for a 1‑bit Control S1 (A → B → A). Context: You are designing an SoC with two power domains (voltage islands), A and B. A...
Explain ML compilation optimizations and hardware fit
ML Compiler Optimizations and Platform Targeting. Context: You are designing a compiler/runtime stack for deep learning workloads that must run efficien...
Define developer-centric usability metrics
Usability and Product Metrics Beyond Latency and Accuracy. Context: In a technical screen focused on analytics and experimentation, propose how you wou...
Compare ML frameworks and trends
ML Framework Trends and PyTorch vs. JAX Differences. Context: You are in a technical screen for a software engineer (machine learning systems) role. Ans...
Implement 2D convolution using NumPy slicing
Implement 2D convolution on a 4×4 input and a 3×3 filter with stride 1 using NumPy. Avoid explicit Python loops by relying on array slicing, broadcast...
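A minimal sketch for the 4×4 input, 3×3 filter, stride 1 case, which yields a 2×2 output. It gathers the four 3×3 windows by slicing and multiplies them against the filter by broadcasting, with no Python loops. The name `conv2d_4x4` is illustrative. The sketch assumes the deep-learning convention (cross-correlation, no kernel flip); for a strict mathematical convolution, pass `kernel[::-1, ::-1]` instead.

```python
import numpy as np


def conv2d_4x4(x, kernel):
    """Valid, stride-1 cross-correlation of a 4x4 input with a 3x3 filter."""
    x = np.asarray(x, dtype=float)
    kernel = np.asarray(kernel, dtype=float)
    assert x.shape == (4, 4) and kernel.shape == (3, 3)
    # The four 3x3 windows, listed in row-major order of the output.
    windows = np.stack([
        x[0:3, 0:3], x[0:3, 1:4],
        x[1:4, 0:3], x[1:4, 1:4],
    ]).reshape(2, 2, 3, 3)
    # Broadcasting multiplies every window by the filter at once;
    # summing the last two axes reduces each window to one output value.
    return (windows * kernel).sum(axis=(2, 3))
```

For larger inputs, `numpy.lib.stride_tricks.sliding_window_view` (NumPy 1.20 and later) builds the same window array without listing slices by hand.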
Identify impactful blog content pillars
Content Pillars for a Developer-Facing Software Product Blog (Beyond Performance). Context: You are planning the editorial strategy for a developer-focu...
Plan discovery and adoption strategy
New ML Technique: Discovery Questions and Developer Adoption Plan. Context: You are a software engineer preparing to introduce a new ML technique to de...
Design an artifact store on K8s and Cassandra
System Design: Exactly-Once Creation by Name on Cassandra, Deletes, and Read API Design. Context: You run a Java web API on Kubernetes backed by a Cassa...
Design signals across power and clock domains
Interface Design for A → B Signal Across Power and Clock Domains. Context: You are designing an SoC with two power domains, A and B. A signal (signal_1...
Explain a shader compiler pipeline
Describe the architecture of a shader compiler that ingests GLSL/HLSL and outputs SPIR-V or ISA. Cover lexical analysis, parsing to AST, conversion to...
Implement core graph algorithms for graphics
Given a scene or dependency graph, implement topological sort, BFS/DFS, and shortest path (Dijkstra). Discuss time/space complexity, memory layouts (C...
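A hedged sketch of two of the algorithms named above, topological sort (Kahn's algorithm) and Dijkstra's shortest path. The adjacency-dict representation and the names `topological_sort` and `dijkstra` are assumptions made for illustration; the question does not prescribe a graph layout.

```python
from collections import deque
import heapq


def topological_sort(graph):
    """Kahn's algorithm. graph maps node -> iterable of successor nodes."""
    indegree = {}
    for node, successors in graph.items():
        indegree.setdefault(node, 0)
        for nxt in successors:
            indegree[nxt] = indegree.get(nxt, 0) + 1
    ready = deque(n for n, d in indegree.items() if d == 0)
    order = []
    while ready:
        node = ready.popleft()
        order.append(node)
        for nxt in graph.get(node, ()):
            indegree[nxt] -= 1
            if indegree[nxt] == 0:
                ready.append(nxt)
    if len(order) != len(indegree):
        raise ValueError("graph contains a cycle")
    return order


def dijkstra(weighted, source):
    """weighted maps node -> iterable of (successor, non-negative weight)."""
    dist = {source: 0}
    heap = [(0, source)]
    while heap:
        d, node = heapq.heappop(heap)
        if d > dist.get(node, float("inf")):
            continue  # stale heap entry; a shorter path was already found
        for nxt, w in weighted.get(node, ()):
            nd = d + w
            if nd < dist.get(nxt, float("inf")):
                dist[nxt] = nd
                heapq.heappush(heap, (nd, nxt))
    return dist
```

Kahn's algorithm runs in O(V + E) time. This binary-heap Dijkstra runs in O((V + E) log V) and requires non-negative edge weights.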
Design algorithms for test scheduling
You have tens of thousands of graphics test cases with inter-test dependencies and hardware/driver constraints. Model this as a graph and design algor...