Accidental CoT Grading Analysis

May 9, 2026

Summary

In 📖 Scripture & Skills 🎮 this announcement summarizes OpenAIs analysis of limited accidental Chain-of-Thought grading during RL. It explains fixes to affected reward pathways, reports no clear evidence of degraded monitorability, and links to the full report so the community can understand implications for AI safety and transparency.

@OpenAI Announcements Chain of thought monitors are a key layer of defense against AI agent misalignment. To preserve monitorability, we avoid penalizing misaligned reasoning during RL.

We found a limited amount of accidental CoT grading which affected released models, and are sharing our analysis.

https://alignment.openai.com/accidental-cot-grading/

Investigating the consequences of accidentally grading CoT during RL

We found limited accidental CoT grading in some released models, fixed the affected reward pathways, and found no clear evidence that monitorability degraded.

scripture & skills christian discord community 📖 scripture & skills - christian discord community accidental cot grading chain of thought monitorability openai alignment rl reward pathways ai safety update ai model monitorability

Previous Update

Accidental CoT Grading Analysis

Summary

Investigating the consequences of accidentally grading CoT during RL

The latest from 📖 Scripture & Skills 🎮

GPT-Realtime-2 Arrives in OpenAI API

Accidental CoT Grading Analysis

Summary

Investigating the consequences of accidentally grading CoT during RL

The latest from 📖 Scripture & Skills 🎮

GPT-Realtime-2 Arrives in OpenAI API

Help Center

Docs

Discord

Telegram DM 👑

Premium Feature

Unlock Premium Support

What's New

UPCOMING FEATURE

UPCOMING FEATURE

Recent Updates

Join the Hype Engine Waitlist

Upcoming: Hype Engine V2

Upcoming: Role Analytics

Recent Updates

Join Spark Tickets Waitlist

Join Spark with Tickets Waitlist

Join X Integration Waitlist

Discord SEO v2

Are you currently using our bot?

Join Role Analytics Waitlist