PracHub
QuestionsPremiumCoachesLearningGuidesInterview Prep
|Home/Machine Learning/C3 AI

Explain key ML/stats concepts

Last updated: Mar 29, 2026

Quick Overview

This question evaluates a candidate's understanding of core machine learning and statistical concepts, covering neural network input structures and model selection (CNN vs RNN), impurity measures and split criteria (Gini and entropy), multicollinearity diagnostics and mitigation in regression, and Pearson correlation with its assumptions and limits. It is commonly asked in Data Science and Machine Learning interviews to probe model-selection intuition, diagnostic and interpretive skills, and statistical reasoning, and it primarily targets conceptual understanding with some simple quantitative application.

  • Medium
  • C3 AI
  • Machine Learning
  • Data Scientist

Explain key ML/stats concepts

Company: C3 AI

Role: Data Scientist

Category: Machine Learning

Difficulty: Medium

Interview Round: Onsite

You are taking an ML/Stats screening with conceptual multiple-choice questions. Answer the following: 1. **CNN vs. RNN** - What kinds of input structure does each model assume? - Give 1–2 examples of tasks where CNNs are typically preferred and tasks where RNNs (or sequence models) are preferred. 2. **Gini impurity (decision trees)** - Define Gini impurity for a node with class probabilities \(p_1,\dots,p_K\). - Compute the Gini impurity for a binary node with \(p=0.8\) and \(1-p=0.2\). 3. **Entropy vs. Gini for split criteria** - Define entropy for a node and compare it to Gini impurity. - Explain how they differ in sensitivity and whether they usually produce meaningfully different trees in practice. 4. **Multicollinearity (linear/logistic regression)** - What is multicollinearity and why is it a problem? - Name at least two ways to detect it and two ways to mitigate it. 5. **Pearson correlation** - Define Pearson correlation and list key assumptions/limitations. - Explain why correlation \(\neq\) causation and give one example of confounding.

Quick Answer: This question evaluates a candidate's understanding of core machine learning and statistical concepts, covering neural network input structures and model selection (CNN vs RNN), impurity measures and split criteria (Gini and entropy), multicollinearity diagnostics and mitigation in regression, and Pearson correlation with its assumptions and limits. It is commonly asked in Data Science and Machine Learning interviews to probe model-selection intuition, diagnostic and interpretive skills, and statistical reasoning, and it primarily targets conceptual understanding with some simple quantitative application.

Related Interview Questions

  • Identify a classic bagging algorithm - C3 AI (Easy)
C3 AI logo
C3 AI
Jul 25, 2025, 12:00 AM
Data Scientist
Onsite
Machine Learning
1
0

You are taking an ML/Stats screening with conceptual multiple-choice questions. Answer the following:

  1. CNN vs. RNN
    • What kinds of input structure does each model assume?
    • Give 1–2 examples of tasks where CNNs are typically preferred and tasks where RNNs (or sequence models) are preferred.
  2. Gini impurity (decision trees)
    • Define Gini impurity for a node with class probabilities p1,…,pKp_1,\dots,p_Kp1​,…,pK​ .
    • Compute the Gini impurity for a binary node with p=0.8p=0.8p=0.8 and 1−p=0.21-p=0.21−p=0.2 .
  3. Entropy vs. Gini for split criteria
    • Define entropy for a node and compare it to Gini impurity.
    • Explain how they differ in sensitivity and whether they usually produce meaningfully different trees in practice.
  4. Multicollinearity (linear/logistic regression)
    • What is multicollinearity and why is it a problem?
    • Name at least two ways to detect it and two ways to mitigate it.
  5. Pearson correlation
    • Define Pearson correlation and list key assumptions/limitations.
    • Explain why correlation ≠\neq= causation and give one example of confounding.

Solution

Show

Submit Your Answer to Earn 20XP

Sign in to leave a comment

Loading comments...

Browse More Questions

More Machine Learning•More C3 AI•More Data Scientist•C3 AI Data Scientist•C3 AI Machine Learning•Data Scientist Machine Learning
PracHub

Master your tech interviews with 8,000+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.