Chain-of-thought monitorability could improve generative AI safety by assessing how models come to their conclusions and spotting the “intent to misbehave.” Monitoring generative AI’s decision-making ...