This question evaluates proficiency in applied machine learning workflows, including data inspection and cleaning, exploratory data analysis, feature engineering, baseline binary classification modeling, and evaluation using Python libraries like pandas and seaborn.
You are given a tabular dataset for predicting whether a patient has heart disease. The dataset contains a binary target column such as has_heart_disease and several features, for example age, height, weight, blood pressure, cholesterol, smoking status, and other clinical measurements.
Using Python, pandas, and seaborn, walk through how you would:
You may assume standard Python ML libraries are available.