PracHub
QuestionsPremiumCoachesLearningGuidesInterview Prep
|Home/Machine Learning/Meta

What features and feature selection would you use?

Last updated: Mar 29, 2026

Quick Overview

This question evaluates feature engineering, feature selection, and production-aware machine learning system design skills for ranking shop ads, within the Machine Learning domain for a Data Scientist role.

  • Medium
  • Meta
  • Machine Learning
  • Data Scientist

What features and feature selection would you use?

Company: Meta

Role: Data Scientist

Category: Machine Learning

Difficulty: Medium

Interview Round: Technical Screen

## Context You are building an ML system to rank/promote **shop ads** in an e-commerce feed/search page. At serving time, the system may score candidate shop ads for a given user and context. Assume you have access to: - User events (impressions, clicks, purchases, shop follows) - Shop metadata (category, price bands, inventory signals) - Query/context (search query, time, device) - Ad/auction signals (bid, budget pacing) ## Questions 1. If you were to build the **shop-ads ranking model**, what feature families would you use? (Give examples.) 2. You have “a ton” of candidate features. How would you identify which ones are **useful**? - Include at least one **offline** approach and one **online**/production-safe approach. 3. If you were **not allowed to use a model-based importance method** (e.g., no SHAP/GBDT gain/permutation importance), how would you still find the key useful features? 4. Call out common pitfalls: leakage, feedback loops, cold start, and feature drift.

Quick Answer: This question evaluates feature engineering, feature selection, and production-aware machine learning system design skills for ranking shop ads, within the Machine Learning domain for a Data Scientist role.

Related Interview Questions

  • Implement 1NN Embeddings and Forward Pass - Meta (hard)
  • Design and evaluate an ads ranking algorithm - Meta (easy)
  • How would you design a Shop Ads ranking algorithm? - Meta (easy)
  • Derive Linear Regression Solution - Meta (medium)
  • Explain key ML metrics and techniques - Meta (medium)
Meta logo
Meta
Aug 10, 2025, 12:00 AM
Data Scientist
Technical Screen
Machine Learning
1
0

Context

You are building an ML system to rank/promote shop ads in an e-commerce feed/search page. At serving time, the system may score candidate shop ads for a given user and context.

Assume you have access to:

  • User events (impressions, clicks, purchases, shop follows)
  • Shop metadata (category, price bands, inventory signals)
  • Query/context (search query, time, device)
  • Ad/auction signals (bid, budget pacing)

Questions

  1. If you were to build the shop-ads ranking model , what feature families would you use? (Give examples.)
  2. You have “a ton” of candidate features. How would you identify which ones are useful ?
    • Include at least one offline approach and one online /production-safe approach.
  3. If you were not allowed to use a model-based importance method (e.g., no SHAP/GBDT gain/permutation importance), how would you still find the key useful features?
  4. Call out common pitfalls: leakage, feedback loops, cold start, and feature drift.

Solution

Show

Submit Your Answer to Earn 20XP

Sign in to leave a comment

Loading comments...

Browse More Questions

More Machine Learning•More Meta•More Data Scientist•Meta Data Scientist•Meta Machine Learning•Data Scientist Machine Learning
PracHub

Master your tech interviews with 8,000+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.