How might we monitor models for deceptive alignment?

We don't have an answer for this question yet. Would you like to write one?

What is deceptive alignment?
What is the likelihood of deceptive misalignment?
What is the difference between sycophancy and deceptive alignment?