Meta Data Manipulation (SQL/Python) Interview Questions
Meta Data Manipulation (SQL/Python) interview questions are a central part of Meta’s hiring for data scientist, data engineer, and analytics roles and usually emphasize practical, product-focused problem solving over abstract algorithm puzzles. What’s distinctive is the scale and product context: interview problems mirror real-world analytics tasks with messy data, session/event tables, and metrics design. Interviewers evaluate accuracy, clarity, and maintainability of your SQL or pandas code, your handling of edge cases (NULLs, deduplication, sampling), and your ability to explain trade-offs between readability and performance using CTEs, window functions, joins, and vectorized Python operations. For interview preparation, expect a timed technical screen (often using a shared editor) with SQL and Python data-manipulation tasks, followed by deeper loop rounds combining coding, product-metrics reasoning, and behavioral questions. Practice end-to-end problems: translate a product question into concrete metrics, write and optimize queries or pandas pipelines, narrate assumptions, and validate results. Work timed problems in CoderPad-like environments, rehearse clarifying questions, and review common pitfalls such as filter vs HAVING, NULL behavior, and inefficient joins. Regular mock interviews and focused drills on window functions, groupings, merges, and missing-data strategies will give the confidence and fluency Meta typically looks for.
Compute ads revenue by geography in SQL
You have ad delivery logs for a shop-ads system. Tables ad_impressions - impression_id STRING (PK) - ts TIMESTAMP (UTC) - user_id STRING - shop_id STR...
Compute reply-based user metrics in 7 days
You are analyzing discussions on a social platform. Tables all_post - post_id (BIGINT, PK) - post_author_id (BIGINT, FK → user.user_id) - post_creatio...
Write SQL for video-call recipients and FR activity
Given the schema and samples below, write ANSI‑SQL to answer both questions. Assume dates are stored in UTC. Today is 2025-09-01, so “yesterday” is 20...
Calculate Response Rate and Compare User Survey Ratings
USERS user_id | signup_date 10 | 2024-03-20 11 | 2024-04-01 12 | 2024-04-05 SURVEYS survey_id | user_id | sent_at 1 | 10 ...
Compute active ad revenue by creation source
You work on an ads platform and need to report active ad revenue broken down by the ad’s creation source. Tables ads - ad_id BIGINT PK - advertiser_id...
Write SQL to compare social-only vs game-only engagement
You are given two tables capturing Oculus app usage. Define an 'active day' as a UTC date on which a user generates at least one event. Consider only ...
Compute percent of active users with 50+ calls
Problem You work on a Messenger-like app. You want to measure how many active users in Great Britain (GB) today have been heavy callers recently. Tabl...
Calculate Average Session Length and Compare App Performance
user_sessions +---------+------------+------------+---------------------+---------------------+ | user_id | session_id | app | session_start ...
Compute video-call SQL metrics with edge cases
Use 'today' = 2025-09-01. Assume UTC timestamps. Write SQL to answer both parts below and call out how your queries handle edge cases (duplicates, fai...
Calculate Total Revenue in USD Using SQL Query
ads_revenue +---------+------------+---------+----------+ | ad_id | country | revenue | currency | +---------+------------+---------+----------+ ...
Analyze Conversation Engagement and Reaction Usage Effectively
messages +-----------+--------+----------+--------------+---------------------+ | messageid | sender | receiver | has_reaction | timestamp |...
Recommend two-hop follows in Python
Given a directed "follows" graph as a Python dict[str, list[str]], implement recommend_two_hop(graph, user) that returns the set (or a sorted list) of...
Write SQL/pandas for KPI anomaly
Write SQL (and outline equivalent pandas) for a KPI anomaly investigation. Assume today = '2025-09-01'. Schema: Users(user_id INT, country TEXT, signu...
Tackle Python tasks under time pressure
In a 15-minute coding round, implement a small Python function or class to solve a well-scoped problem within about 5 minutes of coding. 1) State 1–2 ...
Analyze Group Call Adoption Using SQL Queries
CALL_LOGS | call_id | user_id | call_start | call_end | is_group_call | participant_cnt | | 101 | 12 | 2023-08-01 10:00...
Write SQL for retention, conversion, and churn
Assume today is 2025-09-01 (use the user's local day boundaries based on users.tz). Given the following schema and sample data, write SQL to: (a) Comp...
Write SQL for library analytics
Given a library database, write SQL to answer the following: 1) Count the number of books that are currently not returned (i.e., still checked out) an...
Determine Product Buyer Count and Interaction Percentage
interactions +-----------+----------+------------+----+------------+ | seller_id | buyer_id | product_id | li | create_date| +-----------+----------+-...
Calculate Engagement Metrics for Info-Stream Content Analysis
info_stream_views +----------+-----------+--------------+----------+------------+ | post_id | viewer_id | relationship | duration | ds | +---...
Analyze Recent Calling Behavior in France Using SQL
CALLS +---------+---------+---------------------+-------------------+----------+ | call_id | user_id | call_start_time | participant_cnt | is_vi...