PracHub

Derive MLP shapes and explain PyTorch broadcasting

Last updated: Mar 29, 2026

Quick Overview

This question evaluates understanding of tensor shapes, linear-layer forward computation, and broadcasting semantics in deep-learning frameworks. It is commonly asked to confirm that a candidate can translate mathematical layer definitions into concrete tensor shapes and reason about broadcasting behavior; it belongs to the Machine Learning domain and tests practical tensor algebra grounded in conceptual understanding.


Company: NVIDIA

Role: Software Engineer

Category: Machine Learning

Difficulty: medium

Interview Round: Technical Screen



Related Interview Questions

  • Explain bias-variance, calibration, and model drift - NVIDIA (medium)
  • Diagnose overfitting, DenseNet, preprocessing, CV - NVIDIA (hard)
  • Analyze overfitting, DenseNet, preprocessing, and cross-validation - NVIDIA (hard)
  • Explain optimization and tensor vs pipeline parallelism - NVIDIA (hard)
  • Compare ML frameworks and trends - NVIDIA (medium)
Asked: Jan 14, 2026

You are given a standard MLP layer (fully connected layer) used in deep learning.

  1. Write the forward computation for a linear layer with bias.
  2. Given input tensor shapes, determine output shapes and explain how bias broadcasting works in PyTorch.

Assume:

  • Input x has shape (B, Din) (batch size B).
  • The layer has output dimension Dout.

Answer the following:

  • What are the shapes of weight and bias in torch.nn.Linear(Din, Dout)?
  • What is the output shape?
  • How does this generalize if x has shape (B, T, Din) (e.g., sequence length T)?
  • What broadcasting rule makes bias add correctly?
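As a sanity check on the shapes the question asks for, here is a minimal sketch in NumPy, which follows the same right-aligned broadcasting rules as PyTorch. It assumes illustrative dimensions (B=4, T=7, Din=16, Dout=32) and mirrors the convention that torch.nn.Linear(Din, Dout) stores weight as (Dout, Din) and bias as (Dout,), computing y = x @ W.T + b:

```python
import numpy as np

# Illustrative dimensions (assumed for this sketch).
B, T, Din, Dout = 4, 7, 16, 32

rng = np.random.default_rng(0)
# nn.Linear(Din, Dout) convention: weight (Dout, Din), bias (Dout,).
W = rng.standard_normal((Dout, Din))
b = rng.standard_normal(Dout)

# 2-D input: (B, Din) @ (Din, Dout) -> (B, Dout). The bias (Dout,)
# is right-aligned against (B, Dout), so it broadcasts over B.
x2 = rng.standard_normal((B, Din))
y2 = x2 @ W.T + b
print(y2.shape)  # (4, 32)

# 3-D input: matmul acts on the last two axes, so
# (B, T, Din) @ (Din, Dout) -> (B, T, Dout). The trailing axis of
# bias matches Dout, so it broadcasts over both B and T.
x3 = rng.standard_normal((B, T, Din))
y3 = x3 @ W.T + b
print(y3.shape)  # (4, 7, 32)
```

The broadcasting rule at work: shapes are aligned from the rightmost axis, and an absent leading axis is treated as size 1, so (Dout,) stretches across every batch (and time) position.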


© 2026 PracHub. All rights reserved.