Grok 4.1 Update: xAI surpasses ChatGPT & Gemini on key AI benchmarks

Elon Musk's xAI has launched Grok 4.1, a new AI model. It reportedly surpasses ChatGPT and Gemini in various tests. Grok 4.1 shows improved performance in creative and emotional tasks. The model also demonstrates a significant reduction in halluci...

Grok 4.1 Update: xAI surpasses ChatGPT & Gemini on key AI benchmarks

Elon Musk’s xAI is stepping up its challenge to the major AI players with the launch of Grok 4.1. The company has begun rolling out the new model to all users, and early benchmarks show it outperforming rivals like GPT-5.1 and Gemini 2.5 Pro. The update arrives after a quiet rollout that lasted nearly two weeks, and it’s already making waves.

The new version is designed to compete directly with leading models such as GPT-5.1 and Gemini 2.5 Pro, and according to xAI, it delivers significant gains in creativity, emotional intelligence, and conversational coherence. The company says Grok 4.1 is “exceptionally capable in creative, emotional and collaborative interactions” and is more perceptive, engaging, and consistent in personality than the previous version.

ALSO READ: Gemini 3 release imminent - here's what to expect from the Google's latest release


The model’s rollout began at the start of November and was fully deployed by November 14 across the Grok website, X, and Grok’s mobile apps. What followed was a strong showing across competitive AI evaluations, including a major shift in leaderboard rankings, as quoted in a report.

How did Grok 4.1 outperform ChatGPT and Gemini?



For the first time since its launch, Google’s Gemini 2.5 Pro lost its top position on the LMArena leaderboard for text-related tasks. Grok 4.1 (Thinking) and Grok 4.1 took the number one and number two spots, pushing Gemini out of the lead. The new model also surpassed other high-profile systems including Anthropic’s Claude and OpenAI’s latest iterations of ChatGPT, as quoted in a report.
ADVERTISEMENT

ALSO READ: Spotify not working? Users report widespread outage, as they "can't even open it"


On EQ Bench, a benchmark assessing emotional intelligence, empathy, and interpersonal skills, Grok 4.1 (thinking) secured first place, followed by Grok 4.1. Kimi K2 came in third, while Gemini 2.5 Pro and GPT 5 ranked fifth and sixth. Grok’s strong showing continued on the Creative Writing v3 benchmark, where the models placed second and third. An early version of OpenAI’s GPT 5.1 took the top slot, with OpenAI’s o3 coming in fourth, as quoted in a report.


What improvements does the Grok 4.1 update bring?


xAI says Grok 4.1 has made major strides in reducing hallucinations. In tests comparing real-world information-seeking queries, Grok 4.1 recorded a hallucination rate of 4.22%, a sharp decline from the 12.09% rate of Grok 4.0. On FactScore, a benchmark with 500 biography-based questions, the new model scored 2.97%, compared to 9.89% for its predecessor, as quoted in a report.
ADVERTISEMENT

These improvements translate into a noticeably different user experience, according to xAI. The company says that users would notice that Grok 4.1 is much nicer to talk to, more understanding and more helpful than its predecessor.


ADVERTISEMENT
ALSO READ: New poll delivers big blow to Trump as approval rating takes sharp dive

The update arrives during a wave of AI releases across the industry. OpenAI released GPT 5.1 only days earlier, and Google is widely expected to introduce Gemini 3.0 soon, as quoted in a report.

When will users see the next major Grok release?

Elon Musk recently confirmed that Grok 5, previously expected by the end of 2025, has been pushed to early 2026. The billionaire described the upcoming model as “crushingly good” but now says it will arrive within the first three months of 2026.

ALSO READ: What does 67 mean, who made the 67 meme and why is it so popular?


FAQs


What makes Grok 4.1 different?
It shows major improvements in creativity, emotional intelligence, and reduced hallucinations compared to Grok 4.0.

How does Grok 4.1 compare to ChatGPT and Gemini?

Grok 4.1 currently ranks above both on several benchmarks, including LMArena and EQ Bench.
Download
The Economic Times Business News App
for the Latest News in Business, Sensex, Stock Market Updates & More.
Download
The Economic Times News App
for Quarterly Results, Latest News in ITR, Business, Share Market, Live Sensex News & More.
READ MORE
ADVERTISEMENT

READ MORE:

LOGIN & CLAIM

50 TIMESPOINTS

More from our Partners

Loading next story
Business News › News › International › US News › Grok 4.1 Update: xAI surpasses ChatGPT & Gemini on key AI benchmarks
Text Size:AAA
Success
This article has been saved

*

+