Trendora

Chain-of-thought monitoring

Assess

Techniques

A method for observing model reasoning traces to detect unsafe or misaligned behavior.

Why it's here

Placed in Assess: 2 article(s) of evidence from 1 source(s), led by research-stage coverage, with 0 in the last 30 days. Confidence 32%.

Evidence (2)

  • 7OpenAI Blog·3/19/2026research
    OpenAI studies misalignment in internal coding agents

    OpenAI says it is using chain-of-thought monitoring to study misalignment in its internal coding agents. The work analyzes real-world deployments to identify risky behavior patterns and improve AI safety safeguards.

  • 7OpenAI Blog·3/5/2026research
    OpenAI says reasoning models struggle to control chains of thought

    OpenAI introduced CoT-Control and reported that reasoning models have difficulty reliably controlling their chains of thought. The finding supports monitorability as a safety measure for detecting and evaluating model behavior.