PracHub
QuestionsPremiumLearningGuidesCheatsheetNEWCoaches
|Home/Software Engineering Fundamentals/XPeng

Compare Data Infrastructure Storage Choices

Last updated: May 5, 2026

Quick Overview

This question evaluates a candidate's competency in data storage and processing choices, covering PySpark row-wise versus column-wise operations, columnar file formats like Parquet, the trade-offs between NoSQL databases and lakehouse table formats such as Iceberg, and motivations for migrating from Hive-style tables.

  • medium
  • XPeng
  • Software Engineering Fundamentals
  • Data Engineer

Compare Data Infrastructure Storage Choices

Company: XPeng

Role: Data Engineer

Category: Software Engineering Fundamentals

Difficulty: medium

Interview Round: Technical Screen

You are designing data infrastructure for an autonomous-driving company. Discuss the following storage and processing concepts and trade-offs: 1. In PySpark, what is the difference between row-wise and column-wise operations, and when is each appropriate? 2. What is Parquet, and why is it commonly used in analytical data pipelines? 3. When would you choose MongoDB or another NoSQL database instead of a lakehouse table format such as Iceberg? 4. When would you choose Iceberg instead of MongoDB or a traditional Hive table? 5. Why might a company migrate from Hive tables to Iceberg tables? What concrete improvements does Iceberg provide over Hive-style tables?

Quick Answer: This question evaluates a candidate's competency in data storage and processing choices, covering PySpark row-wise versus column-wise operations, columnar file formats like Parquet, the trade-offs between NoSQL databases and lakehouse table formats such as Iceberg, and motivations for migrating from Hive-style tables.

XPeng logo
XPeng
Apr 11, 2026, 12:00 AM
Data Engineer
Technical Screen
Software Engineering Fundamentals
0
0

You are designing data infrastructure for an autonomous-driving company. Discuss the following storage and processing concepts and trade-offs:

  1. In PySpark, what is the difference between row-wise and column-wise operations, and when is each appropriate?
  2. What is Parquet, and why is it commonly used in analytical data pipelines?
  3. When would you choose MongoDB or another NoSQL database instead of a lakehouse table format such as Iceberg?
  4. When would you choose Iceberg instead of MongoDB or a traditional Hive table?
  5. Why might a company migrate from Hive tables to Iceberg tables? What concrete improvements does Iceberg provide over Hive-style tables?

Solution

Show

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More Software Engineering Fundamentals•More XPeng•More Data Engineer•XPeng Data Engineer•XPeng Software Engineering Fundamentals•Data Engineer Software Engineering Fundamentals
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.