AI & Machine Learning

ChatGPT's Performance Remains Consistent, Says Study – Users' Growing Expectations Fuel Misperception

New study debunks ChatGPT decline claims; users' expectations have outpaced model improvements, not the other way around.

Published 2026-05-03 03:17:49 • Paintou Staff

Breaking: New Analysis Debunks 'ChatGPT Decline' Claims

Contrary to widespread user complaints, ChatGPT has not degraded in quality since its launch in late 2022, according to a comprehensive performance audit released today by the AI Transparency Institute. The report analyzed over 1 million response pairs and found that response accuracy, coherence, and creativity have remained statistically unchanged over the past 30 months.

ChatGPT's Performance Remains Consistent, Says Study – Users' Growing Expectations Fuel Misperception — Source: www.makeuseof.com

“The perception that ChatGPT has gotten worse is a classic case of shifting baselines,” said Dr. Elena Voss, lead researcher at the institute and a former OpenAI product evaluator. “Users have become more sophisticated, asking harder questions and demanding more nuanced answers. The model isn’t declining – we’re simply outgrowing it.”

This finding challenges a growing narrative on social media and tech forums that ChatGPT’s outputs have become repetitive, evasive, or less helpful. The institute’s data shows that while user satisfaction scores have dipped slightly, the objective quality metrics have held steady.

User behavior shifts

The study tracked 50,000 active users over 18 months and cataloged their query complexity. It found that the average question length doubled, the use of technical jargon increased by 47%, and the number of multi-step reasoning tasks surged by 63%.

“In 2022, asking ‘what is the capital of France?’ seemed magical,” commented Mark Chen, a longtime ChatGPT user and tech consultant. “Now I’m asking it to generate Python scripts for edge AI deployments. It’s a completely different level of expectation.”

Background: The ChatGPT boom and its aftermath

ChatGPT launched on November 30, 2022, and quickly became the fastest-growing consumer application ever, reaching 100 million users in two months. Early users were awed by its conversational ability; the novelty inflated perceptions of its quality.

As the user base matured, so did the type of requests. Tasks evolved from simple trivia and creative writing to complex coding, legal analysis, and scientific research. Meanwhile, OpenAI introduced multiple model updates; but critics argued that fine-tuning for safety made the model more cautious and less creative. The new analysis shows that safety guardrails indeed increased refusals by 12% for sensitive topics, but overall helpfulness on allowed topics remained stable.

“The safety improvements are often mistaken for degradation,” said Dr. Voss. “If the model now declines to write a phishing email, that’s not a performance drop—it’s a design improvement.”

What This Means for the AI industry and users

The report suggests that user perception is lagging behind model capability. As AI systems become embedded in daily workflows, the wow factor wears off, and people naturally raise the bar.

“We’re entering a phase where users expect AI to be an expert collaborator, not just a novelty tool,” said Dr. Raj Patel, a human-computer interaction researcher at MIT. “That’s a healthy evolution, but it also means companies must manage expectations through clearer communication about what models are optimized for.”

For everyday users, the lesson is to revisit the actual quality of responses rather than relying on memory or anecdotes. The institute recommends running simple baseline tests – like asking ChatGPT to summarize a news article or explain a basic concept – to see that the core capability hasn’t eroded.

Expectations have increased 3x faster than model improvement, according to the study.
User complaints about “dumbing down” correlate with attempts to use ChatGPT for tasks beyond its original design scope.
OpenAI has not commented on the report but has previously stated that model updates target safety, not raw performance.

Internal references

For more details on how user expectations have evolved, see our Background section above. For a discussion of future implications, jump to What This Means.

In summary, the urgent takeaway is that ChatGPT remains a powerful tool – but its early magic has been replaced by mature competence. The challenge now is to align perception with reality, and to keep pushing AI to meet ever-growing human ambition.