Meta Data Manipulation (SQL/Python) Interview Questions
Meta Data Manipulation (SQL/Python) interview questions are a central part of Meta’s hiring for data scientist, data engineer, and analytics roles and usually emphasize practical, product-focused problem solving over abstract algorithm puzzles. What’s distinctive is the scale and product context: interview problems mirror real-world analytics tasks with messy data, session/event tables, and metrics design. Interviewers evaluate accuracy, clarity, and maintainability of your SQL or pandas code, your handling of edge cases (NULLs, deduplication, sampling), and your ability to explain trade-offs between readability and performance using CTEs, window functions, joins, and vectorized Python operations. For interview preparation, expect a timed technical screen (often using a shared editor) with SQL and Python data-manipulation tasks, followed by deeper loop rounds combining coding, product-metrics reasoning, and behavioral questions. Practice end-to-end problems: translate a product question into concrete metrics, write and optimize queries or pandas pipelines, narrate assumptions, and validate results. Work timed problems in CoderPad-like environments, rehearse clarifying questions, and review common pitfalls such as filter vs HAVING, NULL behavior, and inefficient joins. Regular mock interviews and focused drills on window functions, groupings, merges, and missing-data strategies will give the confidence and fluency Meta typically looks for.

"I got asked a hardcore MCM DP question and I saw it on PracHub as well. Solved that question in 5 minutes. Without PracHub I doubt I could solve it in 5 hours. Though somehow didn't get hired, perhaps I guess I solved it too fast? /s"

"Believe me i'm a student here jn US. Recently interviewed for MSFT. They asked me exact question from PracHub. I saw it the night before and ignored it cause why waste time on random sites. I legit wanna go back and redo this whole thing if I had chance. Not saying will work for everyone but there is certainly some merit to that website. And i'm gonna use it in future prep from now on like lc tagged"

"10 years of experience but never worked at a top company. PracHub's senior-level questions helped me break into FAANG at 35. Age is just a number."

"I was skeptical about the 'real questions' claim, so I put it to the test. I searched for the exact question I got grilled on at my last Meta onsite... and it was right there. Word for word."

"Got a Google recruiter call on Monday, interview on Friday. Crammed PracHub for 4 days. Passed every round. This platform is a miracle worker."

"I've used LC, Glassdoor, and random Discords. Nothing comes close to the accuracy here. The questions are actually current — that's what got me. Felt like I had a cheat sheet during the interview."

"The solution quality is insane. It covers approach, edge cases, time complexity, follow-ups. Nothing else comes close."

"Legit the only resource you need. TC went from 180k -> 350k. Just memorize the top 50 for your target company and you're golden."

"PracHub Premium for one month cost me the price of two coffees a week. It landed me a $280K+ starting offer."

"Literally just signed a $600k offer. I only had 2 weeks to prep, so I focused entirely on the company-tagged lists here. If you're targeting L5+, don't overthink it."

"Coaches and bootcamp prep courses cost around $200-300 but PracHub Premium is actually less than a Netflix subscription. And it landed me a $178K offer."

"I honestly don't know how you guys gather so many real interview questions. It's almost scary. I walked into my Amazon loop and recognized 3 out of 4 problems from your database."

"Discovered PracHub 10 days before my interview. By day 5, I stopped being nervous. By interview day, I was actually excited to show what I knew."

"I recently cleared Uber interviews (strong hire in the design round) and all the questions were present in prachub."
"The search is what sold me. I typed in a really niche DP problem I got asked last year and it actually came up, full breakdown and everything. These guys are clearly updating it constantly."
Write SQL for CTR and Revenue
You are given the following tables: ads( ad_id BIGINT, advertiser_id BIGINT, ad_type VARCHAR, -- values include direct and brand ad...
Write SQL for CTR and revenue
Write SQL for the following two tasks. Problem 1: CTR during peak vs. non-peak hours You are given three tables: - ads(ad_id BIGINT, advertiser_id BIG...
Compute ad revenue metrics by geography in SQL
You work on a marketplace app that shows shop ads. You are given the following tables. Assumptions - All timestamps are stored in UTC. - “Revenue” is ...
Compute cohort GMV and payer rate with edge cases
You are given the following schema (timestamps are UTC): users(user_id INT, country STRING, created_at TIMESTAMP) events(user_id INT, event_ts TIMESTA...
Write SQL for call pickup and usage metrics
You have two tables about 1:1 calls. Table 1: calls Each row is a call attempt. - sender_id (BIGINT) — user who initiated the call - receiver_id (BIGI...
Tackle Python tasks under time pressure
In a 15-minute coding round, implement a small Python function or class to solve a well-scoped problem within about 5 minutes of coding. 1) State 1–2 ...
Analyze spend and creation-source shifts
You are working with ads data. Assume the following tables, with all timestamps interpreted in UTC. - advertisers(advertiser_id BIGINT, advertiser_cat...
Compute reply-based user metrics in 7 days
You are analyzing discussions on a social platform. Tables all_post - post_id (BIGINT, PK) - post_author_id (BIGINT, FK → user.user_id) - post_creatio...
Write SQL for Pixel Signal Metrics
You are working on Meta Ads Pixel analytics. Assume all timestamps are stored in UTC, and analyze the last 30 complete calendar days. Tables 1. advert...
Write SQL/pandas for KPI anomaly
Write SQL (and outline equivalent pandas) for a KPI anomaly investigation. Assume today = '2025-09-01'. Schema: Users(user_id INT, country TEXT, signu...
Compute feed ad frequency and retention in SQL
Assume today is 2025-09-01. Schema and tiny samples: feed_impressions(impression_id, user_id, impression_time, content_type, feed_position, session_id...
Compute view prevalence from views and labels
Given the tables below, write SQL to compute view prevalence of violating content. Use “today” = 2025-09-01 and report the last 7 days (2025-08-26 to ...
Compute percent of active users with 50+ calls
Problem You work on a Messenger-like app. You want to measure how many active users in Great Britain (GB) today have been heavy callers recently. Tabl...
Compute invalid event percentage by pixel
Context You work on an ads pixel instrumentation platform. Each pixel emits events throughout the day; some events are missing (not observed) and some...
Count unconnected posts and reactions
You are analyzing a newly launched feed feature intended to improve engagement by showing more unconnected content. Assume the following tables: - pos...
Write Call Analytics SQL Queries
Assume you are given two tables. Table: calls - call_id BIGINT - sender_id BIGINT - receiver_id BIGINT - call_time TIMESTAMP - pickup VARCHAR -- value...
Calculate CTR and ad revenue
This interview included two SQL tasks. Task 1: Compare CTR during peak and non-peak hours You are given three tables: - ads(ad_id BIGINT, advertiser_i...
Find multi-account buckets and unread rate
You are analyzing a product in which one user can own multiple accounts. Use the following schema: Table: accounts - account_id BIGINT - user_id BIGIN...
Compute survey rates and bias-correct ratings
Today is 2025-09-01. Use the schema and sample data below to answer A and B with SQL (standard SQL; you may use CTEs and window functions). Assume tim...
Label new vs old users over time in SQL
Define users as “new” during the first 30 days inclusive after their signup_date, and “old” thereafter. Produce per-user, per-day labels over a window...