You are given a take-home assignment for a mechanistic interpretability or machine learning interview.
Design an experiment that demonstrates sample-to-feature-ratio double descent in a supervised learning problem. Your submission should be possible to complete in roughly four hours and summarized in a short slide deck.
Your task is to:
-
Choose a concrete learning setup and dataset generation process.
-
Vary the number of training samples relative to the feature dimension so that the test error curve shows double descent, especially near the interpolation threshold.
-
Provide a theoretical explanation for why the double descent behavior appears.
-
Propose at least one method to mitigate the phenomenon and explain why it should help.
-
Present the experimental setup, plots, theory, and mitigation clearly in slides.
A simple high-dimensional linear regression setting is acceptable if it cleanly exhibits the effect.