Scale AI logo
Products
Leaderboards
Enterprise
Government
Customers
Resources
Book a Demo
→
Log In
←
Blog
Research
Beyond the Black Box: Teaching Models to Verbalize Reward Hacking
by
Matthew Siegel
and
Miles Turpin
Published
July 9, 2025
Reading Time
6 min read
Copy Link