This question evaluates applied machine learning competencies including LLM safety and hallucination mitigation, retrieval-augmented generation and token-cost trade-offs, imbalanced classification handling, and precision-versus-recall decision-making within the Machine Learning domain for a Data Scientist role.
Answer the following applied ML/LLM questions.
You are building a chatbot over an internal knowledge base.
Discuss concrete design choices (e.g., chunking, retrieval quality, prompt construction), evaluation ideas, and failure modes.
You are building a binary classifier where the positive class is rare.
Explain your reasoning and mention practical checks/pitfalls (e.g., calibration, dataset shift).
Login required