--- headline: "OpenAI Releases GPT-5.5 Instant as New Default Model for ChatGPT" slug: openai-gpt55-instant-chatgpt-default category: llms-genai story_number: "01" date: 2026-05-05 author: The Vault AI sources: - name: TechCrunch url: https://techcrunch.com/2026/05/05/openai-releases-gpt-5-5-instant-a-new-default-model-for-chatgpt/ domain: techcrunch.com - name: 9to5Mac url: https://9to5mac.com/2026/05/05/gpt-5-5-instant-makes-chatgpt-more-accurate-while-nixing-gratuitous-emojis/ domain: 9to5mac.com - name: OpenAI Blog url: https://openai.com/index/gpt-5-5-instant/ domain: openai.com - name: Axios url: https://www.axios.com/2026/05/05/openai-chatgpt-update-default-model domain: axios.com - name: The Decoder url: https://the-decoder.com/chatgpt-update-rolls-out-gpt-5-5-instant-with-fewer-hallucinations-and-more-personalized-answers/ domain: the-decoder.com - name: Digital Trends url: https://www.digitaltrends.com/computing/chatgpts-new-default-model-is-half-as-likely-to-mislead-you-on-medical-and-financial-questions/ domain: digitaltrends.com ---

# OpenAI Releases GPT-5.5 Instant as New Default Model for ChatGPT

OpenAI on Monday replaced the engine powering the world\u2019s most widely used AI chatbot, rolling out GPT-5.5 Instant as ChatGPT\u2019s new default model for the hundreds of millions of people who rely on it daily. The upgrade, which replaces GPT-5.3 Instant, delivers what the company calls its most significant reduction in hallucinated outputs to date, while trimming the verbose, emoji-laden writing style that had become a running joke among users and critics alike.

Harder to Fool, Harder to Mislead

The headline number is striking: in OpenAI\u2019s internal evaluations, GPT-5.5 Instant produced 52.5 percent fewer hallucinated claims than its predecessor on high-stakes prompts spanning medicine, law, and finance. On an additional set of especially tricky conversations that users had previously flagged for factual errors, inaccurate claims dropped by 37.3 percent. For a model described by OpenAI as \u201cthe daily driver for hundreds of millions of people,\u201d those are not marginal gains\u2014they represent a meaningful shift in how much users can trust the answers they get without independent verification.

The improvements arrive roughly two weeks after GPT-5.5, the full-size flagship model, debuted on April 23. That broader release had already demonstrated step-function advances in mathematical reasoning (jumping from roughly 65 percent to over 80 percent on the AIME benchmark) and long-context performance (MRCR v2 at one million tokens leaped from 36.6 percent under GPT-5.4 to 74.0 percent). GPT-5.5 Instant distills many of those gains into a lighter, faster model optimized for the kind of rapid-fire, everyday queries that define the ChatGPT experience.

Less Noise, More Signal

Beyond accuracy, OpenAI tackled a persistent user complaint: ChatGPT talked too much. GPT-5.5 Instant uses 30.2 percent fewer words and 29.2 percent fewer lines per response than its predecessor. The company also confirmed that it has dialed back \u201cgratuitous emojis\u201d\u2014a stylistic tic that had drawn mockery and raised questions about whether AI assistants were optimizing for engagement over clarity.

\u201cWe\u2019re going to do it,\u201d CEO Sam Altman told Stripe President John Collison about the GPT-5.5 launch plans, calling some of the model\u2019s emergent behavior \u201ca strange thing\u201d\u2014an acknowledgment that even OpenAI\u2019s leadership is still grappling with the unpredictable capabilities surfacing in each new generation.

Personalization Gets Transparent

Perhaps the most consequential new feature is \u201cmemory sources,\u201d a control that reveals exactly what context ChatGPT drew on to personalize a response\u2014whether from saved memories, past conversations, uploaded files, or a connected Gmail account. Users can now see, and delete or correct, the data shaping their answers. The feature launches first for Plus and Pro subscribers on the web, with Free, Go, Business, and Enterprise tiers following later. Mobile support is also on the near-term roadmap.

The transparency move is notable at a moment when regulators in the EU and the United States are scrutinizing how AI systems use personal data. By showing its work, OpenAI is trying to get ahead of the trust deficit that accompanies deep personalization.

Why It Matters

GPT-5.5 Instant is not a frontier research model\u2014it is not designed to push benchmarks on PhD-level reasoning or break new ground on agentic tasks. Its significance is scale. This is the model that a college student uses to draft an essay at midnight, the model a small-business owner queries about tax obligations, the model a patient consults before a doctor\u2019s appointment. Cutting hallucinations in half on medical and financial prompts is, in practical terms, a public-health and consumer-protection upgrade.

It also signals a maturing competitive dynamic. Anthropic\u2019s Claude, Google\u2019s Gemini, and Meta\u2019s Llama are all fighting for the same everyday-use territory. OpenAI\u2019s decision to emphasize accuracy and conciseness over raw capability suggests the company believes the next phase of the AI race will be won not by whoever has the smartest model, but by whoever has the most trustworthy one. The addition of memory sources\u2014giving users a \u201cshow your work\u201d button for personalization\u2014underscores that thesis.

What to Watch Next

Paid users can keep GPT-5.3 Instant for three months, giving them a runway to compare outputs and flag regressions. The real test will be whether the hallucination improvements hold up at scale, across languages and domains OpenAI did not spotlight in its internal benchmarks. Meanwhile, the staggered rollout of memory sources to free-tier users will be worth tracking: if OpenAI gates transparency features behind a paywall for too long, it risks undermining the very trust narrative that this release is built on.

"We are going to do it."
— Sam Altman, CEO, OpenAI
52.5%
Fewer hallucinated claims on high-stakes prompts
37.3%
Fewer inaccurate claims on flagged conversations
30%
Fewer words per response
29%
Fewer lines per response