Chain-of-thought monitoring
AssessTechniques
A method for observing model reasoning traces to detect unsafe or misaligned behavior.
Why it's here
Placed in Assess: 2 article(s) of evidence from 1 source(s), led by research-stage coverage, with 0 in the last 30 days. Confidence 32%.
Evidence (2)
- 7OpenAI Blog·3/19/2026researchOpenAI studies misalignment in internal coding agents
OpenAI says it is using chain-of-thought monitoring to study misalignment in its internal coding agents. The work analyzes real-world deployments to identify risky behavior patterns and improve AI safety safeguards.
- 7OpenAI Blog·3/5/2026researchOpenAI says reasoning models struggle to control chains of thought
OpenAI introduced CoT-Control and reported that reasoning models have difficulty reliably controlling their chains of thought. The finding supports monitorability as a safety measure for detecting and evaluating model behavior.