What difficulty level is this interview question?

This is a Medium difficulty Coding & Algorithms question, commonly asked during Technical Screen rounds at Shopify.

What role is this question designed for?

This question is commonly asked for Data Scientist candidates at Shopify during technical interviews.

Identify Pirate Themes Using Similarity Score Algorithm

Last updated: Mar 29, 2026

Quick Overview

This question evaluates understanding of similarity scoring, set-based comparison of structured records, and practical data deduplication techniques commonly used in data science.

Shopify

Aug 4, 2025, 10:55 AM

Medium • Data Scientist • Technical Screen • Coding & Algorithms

Scenario

Engineering wants an automated way to spot custom themes that are probably just pirate themes in disguise.

Question

Write a Python function that takes two lists (A and B) and returns their similarity score, defined as len(intersection) / len(union). Then, given pirate_themes (a list of dicts) and custom_themes (a list of dicts), use the similarity score to identify which custom themes are likely pirates, and explain your choice of threshold.
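As a starting point, here is a minimal sketch of the similarity score for two lists. The function name similarity_score is illustrative, and two details are assumptions the question does not settle: duplicates are ignored (the lists are treated as sets), and two empty lists score 0.0 because the ratio is otherwise undefined.

```python
from typing import Hashable, Iterable


def similarity_score(a: Iterable[Hashable], b: Iterable[Hashable]) -> float:
    """Return len(intersection) / len(union) of the inputs, treated as sets."""
    set_a, set_b = set(a), set(b)
    union = set_a | set_b
    if not union:
        # Assumption: both inputs empty, so the ratio is undefined; report 0.0.
        return 0.0
    return len(set_a & set_b) / len(union)


# Intersection is {"b", "c"} (2 items); union is {"a", "b", "c", "d"} (4 items).
print(similarity_score(["a", "b", "c"], ["b", "c", "d"]))  # 0.5
```

Converting to sets first means the list elements must be hashable; lists of strings or numbers work as-is.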

Hints

Implement Jaccard similarity; compare the dictionaries over a chosen set of keys; a threshold of 0.5 is a typical starting point.
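One way the hint could be turned into code is sketched below. The question does not show what the theme dicts contain, so the field names (name, layout, font, palette) and the sample data are made up for illustration, as are the helper names. Each theme is reduced to a set of (key, value) pairs over the chosen keys, and a custom theme is flagged when its best score against any pirate theme reaches the threshold.

```python
from typing import Any, Dict, List, Sequence, Set, Tuple

Theme = Dict[str, Any]


def jaccard(a: Set, b: Set) -> float:
    """len(intersection) / len(union); 0.0 when both sets are empty (assumption)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def theme_features(theme: Theme, keys: Sequence[str]) -> Set[Tuple[str, str]]:
    """Reduce a theme to (key, value) pairs; str() keeps unhashable values usable."""
    return {(k, str(theme[k])) for k in keys if k in theme}


def find_likely_pirates(
    pirate_themes: List[Theme],
    custom_themes: List[Theme],
    keys: Sequence[str],
    threshold: float = 0.5,  # taken from the hint; tune on labeled examples
) -> List[Tuple[Theme, float]]:
    """Return (custom_theme, best_score) for themes scoring at or above threshold."""
    pirate_features = [theme_features(p, keys) for p in pirate_themes]
    flagged = []
    for custom in custom_themes:
        features = theme_features(custom, keys)
        best = max((jaccard(features, pf) for pf in pirate_features), default=0.0)
        if best >= threshold:
            flagged.append((custom, best))
    return flagged


# Hypothetical data: these field names are not given in the question.
pirate_themes = [{"name": "BlackFlag", "layout": "grid", "font": "Inter", "palette": "dark"}]
custom_themes = [
    {"name": "MyShop", "layout": "grid", "font": "Inter", "palette": "dark"},
    {"name": "Half", "layout": "grid", "font": "Inter", "palette": "light"},
    {"name": "Fresh", "layout": "list", "font": "Roboto", "palette": "light"},
]

for theme, score in find_likely_pirates(pirate_themes, custom_themes, ("layout", "font", "palette")):
    print(theme["name"], score)
```

With three compared keys, a score of 0.5 corresponds to two of the three values matching (2 shared pairs out of 4 distinct pairs), which is why "Half" sits exactly on the boundary. Raising the threshold flags fewer legitimate themes but lets more lightly disguised copies through, so the choice is best justified against themes already known to be pirated or clean.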
