Want to get featured here? Explore premium visibility opportunities.

Contact us

AI NewsGreat news for xAI: Grok is now pretty good at answering questions about Baldur’s Gate

Great news for xAI: Grok is now pretty good at answering questions about Baldur’s Gate

12:21 AM IST · February 21, 2026

Great news for xAI: Grok is now pretty good at answering questions about Baldur’s Gate

Different AI labs have different priorities. OpenAI has traditionally focused on consumer users, for instance, while its rival Anthropic tends to target enterprises. Elon Musk’s xAI, we discovered recently, has been placing particular emphasis on video-game walkthroughs. On Friday, Business Insider’s Grace Kay publisheda detailed and far-reaching report about xAI, the AI startup recentlyacquired by SpaceX, with particular emphasis on how Musk is making life difficult for employees. But this particular anecdote stood out: In one instance last year, a model release was delayed for several days because Musk was dissatisfied with how the chatbot answered detailed questions about the video game “Baldur’s Gate,” according to people familiar with the matter. High-level engineers were pulled from other projects to improve the responses before launch, they said. Of course, you can imagine the frustration of any respected and experienced engineer who shows up to work thinking he’ll be tackling fundamental problems of knowledge and machine intelligence, only to be sidetracked into helping a 54-year-old man beat his video game. But the anecdote raises an even more pressing question: Did Musk end up getting the gaming skills he wanted? To answer that question, our resident RPG-enthusiastRam Iyerput together a set of five general questions about Baldur’s Gate, which we ran against xAI and the three major models in a kind of quasi-benchmark that I’ve decided to callBaldurBench. In the interest of journalistic transparency, I’ve made all the chat transcripts public, so you can see them here:Grok,ChatGPT,Claude, andGemini. First, the good news: Grok actually gives pretty good information. Its responses were a bit dense with gamer jargon — “save-scumming” instead of saving and “DPS” instead of damage — but the answers were both useful and well-informed, provided you knew what it was talking about. Grok also really loves tables andtheorycraft, which is about what you would expect. There are lots of Baldur’s Gate guides out there and the models were generally drawing from the same ones, so the biggest differences were stylistic. ChatGPT prefers bulleted lists and sentence fragments, while Gemini loves toboldimportant words. The biggest surprise was Claude, which was particularly concerned about giving me information that would spoil my experience of the game. When I asked about good party compositions, it closed the guidance by saying “don’t stress too much and just play what sounds fun to you.” Thanks, Claude! It’s important to bear in mind, this is a subject area we know (thanks toBusiness Insider’s reporting) that xAI has specifically focused on reaching parity. So we shouldn’t read too much into the fact that, after the reported sprint, Grok’s advice turned out about the same as the other models. Still, it’s nice to know xAI can make it work if it tries. Loading the player…

read more

Latest AI News

View All News →
Adaption aims big with AutoScientist, an AI tool that helps models train themselves

Adaption aims big with AutoScientist, an AI tool that helps models train themselves

For years, AI researchers have anticipated the moment when AI systems will be able to improve themselves better than humans could. With investors pouring money into a new generation of research-driven AI labs, there are more resources than ever available to pursue the goal. Now, one of those neolabs has taken a major step towards making it real. On Wednesday,Adaptionintroduced a new product calledAutoScientistthat helps models learn specific capabilities quickly by using an automated approach to conventional fine-tuning. The techniques are applicable to a wide range of fields, but the Adaptation team is particularly focused on the potential for speeding up and easing the process of training and fine-tuning a frontier-level AI model. According to co-founder and CEO Sara Hooker, who previously worked as VP of AI research at Cohere, AutoScientist represents a new way to approach the AI training process. “What’s super exciting about it is that it co-optimizes both the data and the model, and learns the best way to basically learn any capability,” Hooker told TechCrunch. “It suggests we can finally allow for successful frontier AI trainings outside of these labs” AutoScientist builds on the company’s existing data offering,Adaptive Data, which aims to make it easier to build high-quality datasets over time. AutoScientist, meanwhile, is designed to turn those continuously improving datasets into continuously improving AI models. “Our view at Adaption is that the whole stack should be completely adaptable, and should basically optimize on the fly to whatever task you have,” Hooker says. Of course, that approach will only be as good as the results. In its launch materials, Adaption boasts that AutoScientist has more than doubled win-rates across different models — impressive numbers, but difficult to put into context. Since the system is built to adapt models to specific tasks, conventional benchmarks like SWE-Bench or ARC-AGI aren’t applicable. Still, Adaption is confident that users will see the difference once they try AutoScientist out — so confident that the lab is making the tool free to use for the first 30 days after its release. “The same way that code generation unlocked a lot of tasks, this is going to unlock a lot of innovation at the frontier of different fields,” Hooker says.

2 hours ago

View

Zoho Commits ₹70 Crore to ONDC to Empower MSMEs with Accessible Sovereign Tech

Zoho Commits ₹70 Crore to ONDC to Empower MSMEs with Accessible Sovereign Tech

With this investment, Zoho seeks to support MSMEs in their digital transformation and contribute to India's economic growth.

2 hours ago

View

 Proximal Cloud, NxtGen Partner to Enable Sovereign AI Deployments in India

Proximal Cloud, NxtGen Partner to Enable Sovereign AI Deployments in India

The partnership targets regulated sectors with compliant, private AI infrastructure and local data control.

2 hours ago

View

AIM Launches ‘Best Firm for GCC Talent’ Certification

AIM Launches ‘Best Firm for GCC Talent’ Certification

The new certification programme focuses on culture, learning, and retention across India’s various Global Capability Centres.

2 hours ago

View