This question evaluates a candidate's ability to design an end-to-end ML-driven experimental optimization loop for DNA sequence engineering, including sequence representation, constraint enforcement, candidate generation, batch experimental design, and learning from noisy assay measurements.
You are building an ML-driven platform to optimize DNA sequences (e.g., a promoter/enhancer/codon-optimized gene) for a target lab-measured property (e.g., expression level, binding strength, stability).
You have:
(sequence, assay_result, metadata)
where assay results are
noisy
and may vary by batch.
Design an end-to-end system that repeatedly proposes sequences, runs experiments, and learns from results.
Address: