Searched for

OSWORLD TEST

Claude Opus 4.7 hits 92% honesty rate— are we closer than ever to human-like AI with less hallucination? Here’s what Anthropic’s new AI model is capable of
Anthropic says its latest AI model, Claude Opus 4.7, reaches a 92% honesty rate. That is a strong data point. It signals a push toward more...
18 Apr, 2026, 06.53 AM IST
OpenAI launches GPT‑5.4 Thinking and Pro, its ‘most factual and efficient’ model yet
OpenAI has introduced GPT-5.4 Thinking and GPT-5.4 Pro, the newest upgrades to its GPT-5 AI models. The company says the model is more fact...
06 Mar, 2026, 12.36 PM IST
Anthropic launches Claude Sonnet 4.6
This comes after the AI startup introduced Claude Sonnet 4.5 in September last year, claiming it could handle longer coding sessions, and p...
18 Feb, 2026, 07.31 PM IST

Hot on Web