
Design scenario-based speaking feature

Last updated: Mar 29, 2026

Quick Overview

This question evaluates system-design competency in building scalable, real-time, multilingual audio-driven learning features, including product modeling, session orchestration, audio pipelines (ASR/TTS), personalization, localization, telemetry, and privacy/security considerations.


Company: Speak

Role: Software Engineer

Category: System Design

Difficulty: hard

Interview Round: Onsite

Design a scenario-based speaking practice feature for a multilingual learning app that lets users pick any scenario (e.g., ordering food, job interview) and practice spoken dialogs. Describe the user flows, content modeling (scenarios, roles, prompts), session orchestration, and how you would handle audio capture, latency, and offline fallback. Explain personalization (level selection, adaptive difficulty), progress tracking, and content localization for multiple languages. Detail the service architecture, storage choices, APIs, rate limiting, and how you'd scale to millions of sessions while keeping cost low. Identify key privacy/security considerations and how you'd experiment and roll out new scenarios.
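One way to make the rate-limiting part of the prompt concrete is a per-user token bucket placed in front of the expensive speech calls. The sketch below is illustrative only: the class name `TokenBucket`, its parameters, and the idea of charging one token per request are assumptions, not something the question prescribes.

```python
import time


class TokenBucket:
    """Hypothetical per-user limiter: holds up to `capacity` tokens,
    refilled continuously at `refill_per_sec`."""

    def __init__(self, capacity, refill_per_sec, now=None):
        self.capacity = float(capacity)
        self.refill_per_sec = float(refill_per_sec)
        self.tokens = float(capacity)  # start full so short bursts are allowed
        self.updated_at = time.monotonic() if now is None else now

    def allow(self, cost=1.0, now=None):
        """Return True and spend `cost` tokens if the request fits the budget."""
        now = time.monotonic() if now is None else now
        elapsed = max(0.0, now - self.updated_at)
        # Refill for the elapsed time, never exceeding capacity.
        self.tokens = min(self.capacity, self.tokens + elapsed * self.refill_per_sec)
        self.updated_at = now
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False
```

The `now` argument exists only so the clock can be injected in tests. In a multi-node deployment the bucket state would have to live in a shared store rather than in process memory.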

Sep 6, 2025, 12:00 AM

System Design: Scenario-Based Speaking Practice for a Multilingual Learning App

Goal

Design a scenario-based speaking practice feature where users select real-life scenarios (e.g., ordering food, job interviews) and practice spoken dialogs. Address both product and engineering aspects for a production-scale system.

Requirements

  1. User flows: scenario discovery, selection, in-session experience, feedback, and review.
  2. Content modeling: scenarios, roles (user/agent), dialog turns, prompts, evaluation rubrics, and metadata.
  3. Session orchestration: turn-taking, real-time feedback, error recovery, and session state.
  4. Audio pipeline: capture, streaming, ASR/NLU, TTS/LLM, latency targets, barge-in, and offline fallback.
  5. Personalization: level selection, adaptive difficulty, and content recommendations.
  6. Progress tracking: learning metrics, mastery signals, and reporting.
  7. Content localization: multi-language support, regional variants, and voice selection.
  8. Architecture: services, storage choices, APIs, rate limiting, and cost-aware scaling to millions of sessions.
  9. Privacy and security: data handling, compliance, abuse prevention.
  10. Experimentation and rollout: adding new scenarios safely and measuring impact.
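As a starting point for requirements 2, 3, and 7, the content model and turn-taking can be sketched with a few plain data types. Everything here is an assumption made for illustration: the names `Scenario`, `Turn`, `Session`, the `level` label, and the choice to store localized strings as a locale-to-key table with an English fallback. Audio, ASR, and TTS are left out; the user's utterance is assumed to arrive as an already-transcribed string.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from enum import Enum
from typing import Optional


class Role(Enum):
    USER = "user"
    AGENT = "agent"


class SessionState(Enum):
    AGENT_SPEAKING = "agent_speaking"
    USER_SPEAKING = "user_speaking"
    DONE = "done"


@dataclass(frozen=True)
class Turn:
    role: Role
    prompt_key: str  # localization key, resolved per locale at render time


@dataclass
class Scenario:
    scenario_id: str
    level: str  # hypothetical difficulty label, e.g. "beginner"
    turns: tuple[Turn, ...]
    strings: dict[str, dict[str, str]]  # locale -> prompt_key -> text

    def text(self, turn: Turn, locale: str, fallback: str = "en") -> str:
        """Look up a prompt, falling back to the default locale if missing."""
        table = self.strings.get(locale) or self.strings[fallback]
        return table.get(turn.prompt_key, self.strings[fallback][turn.prompt_key])


@dataclass
class Session:
    scenario: Scenario
    locale: str
    index: int = 0
    transcript: list[tuple[Role, str]] = field(default_factory=list)

    @property
    def state(self) -> SessionState:
        if self.index >= len(self.scenario.turns):
            return SessionState.DONE
        role = self.scenario.turns[self.index].role
        return (SessionState.AGENT_SPEAKING if role is Role.AGENT
                else SessionState.USER_SPEAKING)

    def advance(self, utterance: Optional[str] = None) -> SessionState:
        """Consume the current turn and return the new state.

        Agent turns use the scripted, localized text; user turns require
        `utterance`, assumed here to be an ASR transcript.
        """
        if self.state is SessionState.DONE:
            raise RuntimeError("session already finished")
        turn = self.scenario.turns[self.index]
        if turn.role is Role.AGENT:
            spoken = self.scenario.text(turn, self.locale)
        else:
            if not utterance:
                raise ValueError("user turn requires a transcript")
            spoken = utterance
        self.transcript.append((turn.role, spoken))
        self.index += 1
        return self.state
```

Keeping the session as a small explicit state machine makes error recovery simpler to reason about: a dropped connection can resume from `index`, and the stored `transcript` is what feedback and progress tracking would read from.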
