Perplexity Launches Model Council With Multi-AI Voting
AI search company introduces multi-model consensus system that queries multiple leading language models in parallel for enhanced accuracy.
In-depth coverage, analysis, and updates on GPT-5 in AI and tech. 26 articles on AI Pulse.
AI search company introduces multi-model consensus system that queries multiple leading language models in parallel for enhanced accuracy.
OpenAI's latest model achieves perfect accuracy on complex legal scenarios where human judges disagree half the time
The HyperWrite CEO's 5,000-word warning reached 42 million people in 24 hours. Here's why it struck a nerve — and why skeptics aren't buying it.
GPT-5 scores in the 99th percentile on graduate-level reasoning tests and introduces real-time collaboration features that blur the line between AI and colleague.
The most powerful language model ever released is now available at no cost, but the business model behind it reveals where AI is really headed.
We tested GPT-5 Turbo, Claude Opus 4.5, and Gemini 2 Ultra across 50 real-world tasks. Here's which AI wins for coding, writing, analysis, and creative work.
New benchmark data shows GPT-5 leads with 8% hallucination rate, but the gaps are narrowing. Here's what each model gets wrong.
The optimized model makes GPT-5-level reasoning affordable for startups. The API waitlist has 200,000 developers.
Meta's open-weights model outperforms OpenAI's flagship on HumanEval and MATH benchmarks. Anyone can run it locally.
Listeners correctly identified AI only 48% of the time—worse than a coin flip. Voice acting may be the next profession to fall.
GPT-5 and Claude are generating training data that makes them better. The loop is closing.
New interpretability tools show how Claude and GPT-5 'think.' The process looks nothing like human reasoning.
We surveyed 5,000 developers about which models they actually use. Claude leads for complex tasks, GPT-5 for speed.
SOC 2 compliance, data residency options, and guaranteed uptime. The boring stuff that makes enterprise adoption possible.
The new model scores higher than PhD-level humans on medical, legal, and scientific reasoning tests. Sam Altman warns the next version will be 'qualitatively different.'
A surprise pricing update makes GPT-5 Turbo cheaper than Claude and Gemini. Competitors have hours to respond.
Internal documents show Llama 4 outperforming all competitors. Meta plans to release it open-source in March.
The model that dominated 2024 is being retired on February 13. GPT-5.2 has completely taken over.
China's MIT-licensed model with 685 billion parameters matches frontier performance. OpenAI responded by open-sourcing their own models.
Agentic coding comes to macOS. Free users get access. And there's a new model: GPT-5.2-Codex, optimized for long-horizon work and cybersecurity.
The Big Three have all shipped major updates. Here's how they stack up and what it means for your workflows.
The new model tops GPT-5 on 14 out of 15 benchmarks. Researchers say benchmarks are broken anyway.
90% cost reduction from GPT-5. Developers are migrating overnight. The API pricing war just escalated.
GPT-5 Pro scores 18.3% on the new benchmark. The previous version? 70.2%. Francois Chollet's test exposes what AI still can't do — and it's not what you'd expect.
Sam Altman announces the most significant capability jump since GPT-4.
An in-depth comparison of the three leading AI models across benchmarks, capabilities, and real-world use cases