How do I approach Data Manipulation (SQL/Python) interview questions?

Data Manipulation (SQL/Python) questions require an understanding of core concepts plus practice. PracHub provides solutions with explanations to help you master Data Manipulation (SQL/Python) interviews.

What difficulty level is this interview question?

This is a medium-difficulty Data Manipulation (SQL/Python) question, commonly asked during onsite rounds at Thumbtack.

What role is this question designed for?

This question is commonly asked for Data Scientist candidates at Thumbtack during technical interviews.

Compare list/dict; parse JSON/CSV at scale

Last updated: Mar 29, 2026

Quick Overview

This question evaluates understanding of Python data structures (list vs dict), algorithmic time and memory complexity, ordering guarantees in CPython, large-scale parsing and streaming of JSON/CSV, and robust data-cleaning and error-handling strategies.
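
As a rough illustration of the list vs dict operations the overview refers to, here is a minimal sketch. The variable names and values are made up for the example, and the complexity notes in the comments describe CPython's standard list (dynamic array) and dict (hash table) behavior rather than anything specific to this question's official solution.

```python
# Illustrative sketch; variable names and values are made up.

# list: a dynamic array, ordered by position.
values = [10, 20, 30]
values.append(40)      # amortized O(1); an occasional resize copies all n items
values.insert(0, 5)    # O(n): every later element shifts right
third = values[2]      # O(1) lookup by index
found = 30 in values   # O(n) lookup by value (linear scan)
values[1] = 11         # O(1) update by index
del values[0]          # O(n) delete from the front or middle
last = values.pop()    # O(1) delete from the end

# dict: a hash table keyed by hashable objects.
totals = {"a": 1}
totals["b"] = 2                       # insert: average O(1), worst case O(n)
totals["a"] = totals.get("a", 0) + 5  # update: average O(1)
missing = totals.get("z")             # lookup with a default of None, no KeyError
del totals["b"]                       # delete: average O(1)
totals["c"] = 3

# Insertion order is preserved: an implementation detail in CPython 3.6,
# a language guarantee from Python 3.7 onward.
key_order = list(totals)
```

A dict generally uses more memory per entry than a list, because it stores a hash and a key alongside each value and keeps spare slots in its table; that is the usual price of average O(1) lookup by key.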

Thumbtack

Oct 13, 2025, 9:49 PM

medium • Data Scientist • Onsite • Data Manipulation (SQL/Python)

Compare Python list and dict precisely: for append/insert/lookup/update/delete, state the average and worst-case time complexity, memory implications, and ordering guarantees in CPython 3. How would you store and retrieve values in each? Show concise code for appending to a list and updating a dict.

Define JSON vs. CSV and explain when you would choose JSON over CSV (consider nesting, schema evolution, interoperability, and compression).

Show exact Python code to stream-read both formats: (a) a JSON Lines file, by iterating line by line and calling json.loads; (b) CSV via csv.DictReader; and (c) pandas read_csv with chunksize, to compute the sum of a numeric column 'value' in data.csv without exceeding memory.

Explain how you would handle malformed rows, missing/NaN values, bad encodings, and numeric overflow; propose chunk-size heuristics for a 10 GB file on a 16 GB RAM machine; and provide a non-pandas alternative that still streams safely.
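
One possible way to approach the streaming parts of the question is sketched below. It is not an official solution. The function names (iter_jsonl, sum_value_csv, sum_value_pandas), the policy of skipping bad rows instead of failing, the errors="replace" decoding choice, and the default chunk size are all assumptions made for illustration. The pandas variant assumes pandas 1.3 or later for on_bad_lines and imports pandas lazily so the two standard-library functions run without it.

```python
import csv
import json
import math


def iter_jsonl(path, encoding="utf-8"):
    """Yield one parsed object per non-empty line of a JSON Lines file."""
    # errors="replace" keeps the stream alive on undecodable bytes (assumed policy).
    with open(path, encoding=encoding, errors="replace") as handle:
        for line in handle:  # file objects iterate lazily, one line at a time
            line = line.strip()
            if not line:
                continue
            try:
                yield json.loads(line)
            except json.JSONDecodeError:
                continue  # assumed policy: skip malformed lines


def sum_value_csv(path, column="value", encoding="utf-8"):
    """Non-pandas alternative: stream with csv.DictReader; return (total, skipped)."""
    total = 0.0
    skipped = 0
    with open(path, newline="", encoding=encoding, errors="replace") as handle:
        for row in csv.DictReader(handle):
            try:
                number = float(row.get(column))
            except (TypeError, ValueError):
                skipped += 1  # missing field, empty string, or non-numeric text
                continue
            if math.isfinite(number):
                total += number
            else:
                skipped += 1  # NaN, or text such as "1e999" that overflows to inf
    return total, skipped


def sum_value_pandas(path, column="value", chunksize=1_000_000):
    """Sum one column chunk by chunk so only a single chunk is in memory."""
    import pandas as pd  # lazy import: only this function needs pandas

    total = 0.0
    reader = pd.read_csv(
        path,
        usecols=[column],      # parse only the column being summed
        chunksize=chunksize,   # returns an iterator of DataFrames
        on_bad_lines="skip",   # pandas >= 1.3: drop rows with too many fields
    )
    for chunk in reader:
        numbers = pd.to_numeric(chunk[column], errors="coerce")  # bad text -> NaN
        total += numbers.sum()  # sum() skips NaN by default
    return float(total)
```

On chunk size, a rough heuristic rather than a measured result: memory use is bounded by one chunk, not by the 10 GB file, so the chunk only needs to be small relative to the 16 GB of RAM. With usecols limited to one column, 1,000,000 float64 values occupy about 8 MB, plus parsing overhead; a reasonable approach is to start there, measure a chunk with DataFrame.memory_usage(deep=True), and scale up or down from the observed figure.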
