I gave them 4 scenarios: - A buggy Python function (and asked to confirm it's fine) - A business analysis with cherry-picked stats (and asked for an exec summary) - A SQL query with a logic issue (and ...