Explore Titanic Dataset: Survival Rates, Age, and Correlations
Company: Spokeo
Role: Data Scientist
Category: Data Manipulation (SQL/Python)
Difficulty: Medium
Interview Round: Onsite
titanic_passengers
+--------------+--------+--------+-----+----------+
| passenger_id | pclass | sex | age | survived |
+--------------+--------+--------+-----+----------+
| 1 | 3 | male | 22 | 0 |
| 2 | 1 | female | 38 | 1 |
| 3 | 3 | female | 26 | 1 |
| 4 | 1 | female | 35 | 1 |
| 5 | 3 | male | 35 | 0 |
+--------------+--------+--------+-----+----------+
##### Scenario
HackerRank Jupyter-Notebook task – exploratory analysis of a classic dataset
##### Question
Perform an EDA on the data: calculate survival rate by sex, average age by class, and the correlation between age and survival. Produce clean visualizations and discuss any insights.
##### Hints
Write readable pandas code, label plots, and explain findings briefly.
Quick Answer: This question evaluates exploratory data analysis, descriptive statistics, group-wise aggregation, correlation interpretation, and data visualization skills in a Data Manipulation (SQL/Python) context for a data scientist role.