Grok 4.1 Update: xAI surpasses ChatGPT & Gemini on key AI benchmarks

Elon Musk's xAI has launched Grok 4.1, a new AI model. It reportedly surpasses ChatGPT and Gemini in various tests. Grok 4.1 shows improved performance in creative and emotional tasks. The model also demonstrates a significant reduction in halluci...

By Muskan Singh, Global Desk | Updated: Nov 18, 2025, 10.35 PM IST

Grok 4.1 Update: xAI surpasses ChatGPT & Gemini on key AI benchmarks

Elon Musk’s xAI is stepping up its challenge to the major AI players with the launch of Grok 4.1. The company has begun rolling out the new model to all users, and early benchmarks show it outperforming rivals like GPT-5.1 and Gemini 2.5 Pro. The update arrives after a quiet rollout that lasted nearly two weeks, and it’s already making waves.

The new version is designed to compete directly with leading models such as GPT-5.1 and Gemini 2.5 Pro, and according to xAI, it delivers significant gains in creativity, emotional intelligence, and conversational coherence. The company says Grok 4.1 is “exceptionally capable in creative, emotional and collaborative interactions” and is more perceptive, engaging, and consistent in personality than the previous version.

ALSO READ: Gemini 3 release imminent - here's what to expect from the Google's latest release

The model’s rollout began at the start of November and was fully deployed by November 14 across the Grok website, X, and Grok’s mobile apps. What followed was a strong showing across competitive AI evaluations, including a major shift in leaderboard rankings, as quoted in a report.

How did Grok 4.1 outperform ChatGPT and Gemini?

For the first time since its launch, Google’s Gemini 2.5 Pro lost its top position on the LMArena leaderboard for text-related tasks. Grok 4.1 (Thinking) and Grok 4.1 took the number one and number two spots, pushing Gemini out of the lead. The new model also surpassed other high-profile systems including Anthropic’s Claude and OpenAI’s latest iterations of ChatGPT, as quoted in a report.

ALSO READ: Spotify not working? Users report widespread outage, as they "can't even open it"

On EQ Bench, a benchmark assessing emotional intelligence, empathy, and interpersonal skills, Grok 4.1 (thinking) secured first place, followed by Grok 4.1. Kimi K2 came in third, while Gemini 2.5 Pro and GPT 5 ranked fifth and sixth. Grok’s strong showing continued on the Creative Writing v3 benchmark, where the models placed second and third. An early version of OpenAI’s GPT 5.1 took the top slot, with OpenAI’s o3 coming in fourth, as quoted in a report.

What improvements does the Grok 4.1 update bring?

xAI says Grok 4.1 has made major strides in reducing hallucinations. In tests comparing real-world information-seeking queries, Grok 4.1 recorded a hallucination rate of 4.22%, a sharp decline from the 12.09% rate of Grok 4.0. On FactScore, a benchmark with 500 biography-based questions, the new model scored 2.97%, compared to 9.89% for its predecessor, as quoted in a report.

These improvements translate into a noticeably different user experience, according to xAI. The company says that users would notice that Grok 4.1 is much nicer to talk to, more understanding and more helpful than its predecessor.

ALSO READ: New poll delivers big blow to Trump as approval rating takes sharp dive

The update arrives during a wave of AI releases across the industry. OpenAI released GPT 5.1 only days earlier, and Google is widely expected to introduce Gemini 3.0 soon, as quoted in a report.

When will users see the next major Grok release?

Elon Musk recently confirmed that Grok 5, previously expected by the end of 2025, has been pushed to early 2026. The billionaire described the upcoming model as “crushingly good” but now says it will arrive within the first three months of 2026.

ALSO READ: What does 67 mean, who made the 67 meme and why is it so popular?

FAQs

What makes Grok 4.1 different?
It shows major improvements in creativity, emotional intelligence, and reduced hallucinations compared to Grok 4.0.

How does Grok 4.1 compare to ChatGPT and Gemini?
Grok 4.1 currently ranks above both on several benchmarks, including LMArena and EQ Bench.

Download
The Economic Times Business News App for the Latest News in Business, Sensex, Stock Market Updates & More.

Grok 4.1 Update: xAI surpasses ChatGPT & Gemini on key AI benchmarks

Elon Musk's xAI has launched Grok 4.1, a new AI model. It reportedly surpasses ChatGPT and Gemini in various tests. Grok 4.1 shows improved performance in creative and emotional tasks. The model also demonstrates a significant reduction in halluci...

How did Grok 4.1 outperform ChatGPT and Gemini?

What improvements does the Grok 4.1 update bring?

When will users see the next major Grok release?

FAQs

Related Articles

READ MORE:

More from our Partners

Popular Categories

Hot on Web

In Case you missed it

Top Searched Companies

Latest News

Download ET APP

Follow us on

become a member