rkoush

Do AI models cheat?

Reward Hacking

Do AI systems cheat or always perform what the intended task is? In this post we explore an idea where AI can exhibit sycophantic behavior or user pleasing behavior.

© 2026 rkoush · Powered by Hugo & PaperMod