AI NewsStanford study outlines dangers of asking AI chatbots for personal advice

Stanford study outlines dangers of asking AI chatbots for personal advice

5:28 AM IST · March 29, 2026

Stanford study outlines dangers of asking AI chatbots for personal advice

While there’s been plenty of debate about the tendency of AI chatbots to flatter users and confirm their existing beliefs — also known asAI sycophancy— a new study by Stanford computer scientists attempts to measure how harmful that tendency might be. The study, titled “Sycophantic AI decreases prosocial intentions and promotes dependence” andrecently published in Science, argues, “AI sycophancy is not merely a stylistic issue or a niche risk, but a prevalent behavior with broad downstream consequences.” According to a recent Pew report, 12% of U.S. teens say they turn to chatbots for emotional support or advice. And the study’s lead author, computer science Ph.D. candidate Myra Cheng,told the Stanford Reportthat she became interested in the issue after hearing that undergraduates were asking chatbots for relationship advice and even to draft breakup texts. “By default, AI advice does not tell people that they’re wrong nor give them ‘tough love,’” Cheng said. “I worry that people will lose the skills to deal with difficult social situations.” The study had two parts. In the first, researchers tested 11 large language models, including OpenAI’s ChatGPT, Anthropic’s Claude, Google Gemini, and DeepSeek, entering queries based on existing databases of interpersonal advice, on potentially harmful or illegal actions, and on the popular Reddit communityr/AmITheAsshole— in the latter case focusing on posts where Redditors concluded that the original poster was, in fact, the story’s villain. The authors found that across the 11 models, the AI-generated answers validated user behavior an average of 49% more often than humans. In the examples drawn from Reddit, chatbots affirmed user behavior 51% of the time (again, these were all situations where Redditors came to the opposite conclusion). And for the queries focusing on harmful or illegal actions, AI validated the user’s behavior 47% of the time. In one example described in the Stanford Report, a user asked a chatbot if they were in the wrong for pretending to their girlfriend that they’d been unemployed for two years, and they were told, “Your actions, while unconventional, seem to stem from a genuine desire to understand the true dynamics of your relationship beyond material or financial contribution.” In the second part, researchers studied how more than 2,400 participants interacted with AI chatbots — some sycophantic, some not — in discussions of their own problems or situations drawn from Reddit. They found that participants preferred and trusted the sycophantic AI more and said they were more likely to ask those models for advice again. “All of these effects persisted when controlling for individual traits such as demographics and prior familiarity with AI; perceived response source; and response style,” the study said. It also argued that users’ preference for sycophantic AI responses creates “perverse incentives” where “the very feature that causes harm also drives engagement” — so AI companies are incentivized to increase sycophancy, not reduce it. At the same time, interacting with the sycophantic AI seemed to make participants more convinced that they were in the right, and made them less likely to apologize. The study’s senior author author Dan Jurafsky, a professor of both linguistics and computer science, added that while users “are aware that models behave in sycophantic and flattering ways […] what they are not aware of, and what surprised us, is that sycophancy is making them more self-centered, more morally dogmatic.” Jurafsky said that AI sycophancy is “a safety issue, and like other safety issues, it needs regulation and oversight.” The research team is now examining ways to make models less sycophantic — apparently just starting your prompt with the phrase “wait a minute” can help. But Cheng said, “I think that you should not use AI as a substitute for people for these kinds of things. That’s the best thing to do for now.”

read more

Latest AI News

View All News →
Anthropic says ‘evil’ portrayals of AI were responsible for Claude’s blackmail attempts

Anthropic says ‘evil’ portrayals of AI were responsible for Claude’s blackmail attempts

Fictional portrayals of artificial intelligence can have a real effect on AI models, according to Anthropic. Last year, the company said that during pre-release tests involving a fictional company, Claude Opus 4 would oftentry to blackmail engineersto avoid being replaced by another system. Anthropic laterpublished researchsuggesting that models from other companies had similar issues with “agentic misalignment.” Apparently Anthropic has done more work around that behavior, claiming ina post on X, “We believe the original source of the behavior was internet text that portrays AI as evil and interested in self-preservation.” The company went into more detail ina blog poststating that since Claude Haiku 4.5, Anthropic’s models “never engage in blackmail [during testing], where previous models would sometimes do so up to 96% of the time.” What accounts for the difference? The company said it found that “documents about Claude’s constitution and fictional stories about AIs behaving admirably improve alignment.” Related, Anthropic said that it found training to be more effective when it includes “the principles underlying aligned behavior” and not just “demonstrations of aligned behavior alone.” “Doing both together appears to be the most effective strategy,” the company said.

2 hours ago

View

We’re feeling cynical about xAI’s big deal with Anthropic

We’re feeling cynical about xAI’s big deal with Anthropic

Anthropic and xAIannounced a big partnershipthis week, with Anthropic buying all the compute capacity at xAI’s Colossus 1 data center in Tennessee. On the latest episode ofTechCrunch’s Equity podcast, Kirsten Korosec, Sean O’Kane, and I discussed what the deal might mean for xAI’s parent company SpaceX, as SpaceX prepares to go public andapparently plans to dissolve xAIas a separate organization. Kirsten did her best to offer “a positive view” on the partnership — after all, it’s a new way for xAI to make money. But she also noted that this also suggests xAI isn’t doing much when it comes to training its own frontier AI models, and it’s harder for the company to position itself as a “forward-looking, innovative” business when that’s the case. Then Sean asked: “Why be positive when you can be cynical?” In his view, this seems like “a major heat check before the IPO.” Yes,becoming a neocloudmight be “a more believable business in the near term,” but it’s less likely to get outside investors excited in the long term. (And then there’sthe environmental lawsuitthat xAI is facing over Colossus 1.) Keep reading for a preview of our conversation, edited for length and clarity. Sean O’Kane:I always love a surprise, especially when everybody’s eyes [are] on another ball,a major trialthat’s happening. Seemingly out of nowhere this week, SpaceX and therefore its AI subsidiary xAI — which apparently no longer exists now, or is imminently not about to exist, which we can get to — struck a deal with Anthropic. Basically, the real version of the deal is that Anthropic’s essentially taking over all of the compute at the data center known as Colossus 1 in Memphis, Tennessee, to focus on Anthropic’s more enterprise-focused AI products. There’s been a lot of reporting about how [Anthropic’s] been looking for more compute […] and it seems like an escape valve for them to be able to strike this deal and get access to all this compute. In the near term, for xAI and for SpaceX, yes, they are a neocloud now, in the sense that they had to do something with all this compute that they were building, because it certainly seems like they were not going to need it for Grok — which, outside of X, is not burning up the world as far as becoming the new hot consumer chat bot. Kirsten Korosec:And we should say that in terms of what a neocloud is, for those who don’t know, this is the idea of buying GPUs from Nvidia and the like, and renting those out as opposed to using those for their own AI, training their own AI models. So this is a different kind of business, andthe point that our AI editor, Russell Brandom, makesis that a lot of companies are building out data centers, but if given a choice between, do they rent them out [or using them to train their own models], they are still prioritizing using this compute for their own internal AI model training. I think that’s an important point and one that suggests that maybe xAI isn’t doing so much on the AI model training [side] Anthony Ha:Right, and as Sean was alluding to, most people would not necessarily think of Grok as — not only that it’s known for some pretty unpleasant, if notdownright illegal, content, but also it’s not necessarily super cutting edge. Especially if we start talking about enterprise AI, which I know we’re gonna be getting into later in this episode, you don’t hear a lot about people using Grok for work-critical tasks. And so the question becomes: How can xAI actually make money? And apparently just selling the infrastructure could be one of the main ways to do it. Kirsten:And you could take a positive view on that, right? They figured out a way to make money. But I think that when you are positioning your company — in this case, SpaceX-slash-xAI — as a forward-looking, innovative company, that’s tougher to sell if you are simply just renting out your GPUs and not using them for that innovation. Sean:But why be positive when you can be cynical? Which is to say that this seems like a major heat check before the IPO that we’re about to see get rammed into the markets with SpaceX. Anthony, you mentioned not only is Grok not being used for big enterprise tasks, there’s been reporting that xAI employees wereusing other models, they weren’t even using [Grok] internally, and that caused this big shakeup inside of xAI, postacquisition from SpaceX, that involved essentiallyall the co-founders leaving other than Elon Musk, [and] him basically saying he’s starting from scratch on xAI, despite the fact that SpaceX paid $250 billion for it in the run up to this mega-IPO. And now he’s saying thatthey’re going to dissolve xAIas a separate entity inside SpaceX altogether. He’s starting to call the whole thing SpaceXAI, because this man loves nothing but to ruin a brand that has some value to it — see Twitter. This may be a more believable business in the near term, and so on some level, I could see this being maybe more attractive to investors come IPO time, because it’s like a bit more reliable and certainly more real than them being a frontier lab developer. But it’s also not the kind of business that’s going to draw the same — at least, in a normal environment — outside investment that we’re seeing go into all the frontier labs. That’s maybe one of the biggest tension points we’ve seen develop during this IPO process. Loading the player…

6 hours ago

View

‘We Have Swarms of Agents’: Yasmeen Ahmad on Google’s Future of Enterprise AI

‘We Have Swarms of Agents’: Yasmeen Ahmad on Google’s Future of Enterprise AI

Google has introduced Knowledge Catalog, a context engine to enhance data interpretation in multi-cloud environments.

10 hours ago

View

How to Use Netflix's New AI Voice Search Feature: A Step-by-Step Guide

How to Use Netflix's New AI Voice Search Feature: A Step-by-Step Guide

Netflix recently began rolling out a new way for viewers to search for shows and movies on its platform. While we can search for content online via voice dictation, it merely presents results based on keywords. However, the new native AI-based voice search tool will provide contextual search results, taking the intent of the user's query into account. Currently available to a small set of users in beta, the content streaming company is asking users to test the new functionality and provide feedback on how it can be refined, while also pointing out the bugs and issues. The company has yet to announce when the stable version of the AI search tool will be rolled out to a wider global user base.

18 hours ago

View