Chain-of-thought monitoring

Assess

Techniques

A method for observing model reasoning traces to detect unsafe or misaligned behavior.

Why it's here

Placed in Assess: 2 article(s) of evidence from 1 source(s), led by research-stage coverage, with 0 in the last 30 days. Confidence 32%.

7OpenAI Blog·3/19/2026research
OpenAI studies misalignment in internal coding agents
OpenAI says it is using chain-of-thought monitoring to study misalignment in its internal coding agents. The work analyzes real-world deployments to identify risky behavior patterns and improve AI safety safeguards.
7OpenAI Blog·3/5/2026research
OpenAI says reasoning models struggle to control chains of thought
OpenAI introduced CoT-Control and reported that reasoning models have difficulty reliably controlling their chains of thought. The finding supports monitorability as a safety measure for detecting and evaluating model behavior.