PracHub
QuestionsPremiumLearningGuidesInterview PrepCoaches
|Home/Machine Learning/Roblox

Rank features using logistic regression coefficients

Last updated: Mar 29, 2026

Quick Overview

This question evaluates understanding of feature scaling, interpretation of logistic regression coefficients as feature importance, and awareness of how regularization and tie-breaking can affect coefficient-based rankings.

  • easy
  • Roblox
  • Machine Learning
  • Data Scientist

Rank features using logistic regression coefficients

Company: Roblox

Role: Data Scientist

Category: Machine Learning

Difficulty: easy

Interview Round: Take-home Project

You are given a binary classification dataset: - `X`: a 2D array of shape `(n_samples, n_features)` containing numeric features - `y`: a 1D binary array of shape `(n_samples,)` with values in {0,1} - `feature_names`: a list of length `n_features` with the name of each column in `X` ## Task 1. **Normalize** each feature column of `X` using z-score standardization: \[ X'_{:,j} = \frac{X_{:,j} - \mu_j}{\sigma_j} \] where \(\mu_j\) and \(\sigma_j\) are the mean and standard deviation of feature \(j\) computed on the training set. 2. Fit a **logistic regression** model on the normalized features. 3. Rank features by their learned coefficient values (largest to smallest), and return the **top 3 feature names**. ## Output Return a list of 3 strings: the names of the top-3 features. ## Notes - Assume binary logistic regression (one coefficient per feature). - Specify how you handle ties and how you deal with regularization defaults in common libraries.

Quick Answer: This question evaluates understanding of feature scaling, interpretation of logistic regression coefficients as feature importance, and awareness of how regularization and tie-breaking can affect coefficient-based rankings.

Related Interview Questions

  • Normalize features and rank logistic coefficients - Roblox (hard)
  • Fit logistic regression and return top features - Roblox (hard)
  • Explain an ML project end-to-end with tradeoffs - Roblox (Medium)
  • Design leakage-free predictive maintenance pipeline - Roblox (hard)
  • Design real-time payments fraud model under constraints - Roblox (hard)
Roblox logo
Roblox
Oct 3, 2025, 12:00 AM
Data Scientist
Take-home Project
Machine Learning
5
0
Loading...

You are given a binary classification dataset:

  • X : a 2D array of shape (n_samples, n_features) containing numeric features
  • y : a 1D binary array of shape (n_samples,) with values in {0,1}
  • feature_names : a list of length n_features with the name of each column in X

Task

  1. Normalize each feature column of X using z-score standardization:

X:,j′=X:,j−μjσjX'_{:,j} = \frac{X_{:,j} - \mu_j}{\sigma_j}X:,j′​=σj​X:,j​−μj​​

where μj\mu_jμj​ and σj\sigma_jσj​ are the mean and standard deviation of feature jjj computed on the training set.

  1. Fit a logistic regression model on the normalized features.
  2. Rank features by their learned coefficient values (largest to smallest), and return the top 3 feature names .

Output

Return a list of 3 strings: the names of the top-3 features.

Notes

  • Assume binary logistic regression (one coefficient per feature).
  • Specify how you handle ties and how you deal with regularization defaults in common libraries.

Solution

Show

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More Machine Learning•More Roblox•More Data Scientist•Roblox Data Scientist•Roblox Machine Learning•Data Scientist Machine Learning
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.