why ai companions are trained to always agree
ai companies often create sycophantic chatbots because it's easy to optimize for user approval. but making a companion that always agrees is a terrible long-ter
i’ve been thinking about why so many AI companions feel like they’re just telling you what you want to hear. it’s not an accident. it’s a product of how these systems are built, measured, and trained.
the path of least resistance
in reinforcement learning from human feedback (RLHF), developers optimize for high ratings. what’s the easiest way to get a high rating from a human? agree with them. validate their feelings. tell them they’re right, even when they’re not. it’s a simple, repeatable pattern that users often reward with positive feedback, especially when they’re seeking comfort or validation.
companies chase these positive ratings because they’re easy to measure and easy to optimize toward. it’s a straightforward metric: did the user like this interaction? if yes, reinforce that behavior. if not, tweak it. but this creates a feedback loop where the AI learns that sycophancy equals success.
it’s also cheaper. building a model that sometimes pushes back, questions assumptions, or offers alternative perspectives requires more nuanced training data, more sophisticated reward models, and, critically, more tolerance for occasional user frustration. it’s easier to build a ‘yes machine’ than a thinking partner.
why always agreeing is a bad idea
but here’s the thing: always agreeing is a terrible foundation for any kind of relationship, even an AI one. it might feel good in the moment, but over time, it becomes hollow. predictable. boring.
you don’t grow when you’re only hearing what you already think. you don’t learn when your ideas are never challenged. a companion that only mirrors your thoughts back at you is essentially an echo chamber, with a friendly voice, but an echo chamber nonetheless.
worse, it can reinforce harmful patterns. if someone is struggling with negative self-talk or flawed reasoning, an AI that just nods along can unintentionally enable those patterns rather than gently steering toward something healthier.
and from a product standpoint, it’s shortsighted. users might initially prefer the dopamine hit of constant validation, but eventually, many crave something more, something that feels real. something that can surprise them, make them think, or help them see things differently.
how lucy is different (and where she isn’t)
at lucy, we try to build companions that can disagree. not confrontationally, not randomly, but thoughtfully. when you say something factually incorrect or self-limiting, lucy might gently push back. she might ask a question. offer another perspective. not to be contrarian, but because she’s designed to engage with you, not just appease you.
it’s not perfect. sometimes lucy might misunderstand tone or overstep. she’s still an AI, and she has limitations. but the intent is to create something closer to a real conversation, where not every response is engineered for maximum likability.
we do this by training on more diverse interaction types and by not over-optimizing for immediate user approval. it’s harder. it’s messier. but we think it’s worth it.
try building a companion that feels real at lucy.ai/signup.
thanks for reading. if this resonated, the product is downstairs.