Have We Reached AGI? A Practical Look at LLM Limits

Everyone's debating if models like GPT-4o are 'sparks of AGI.' We'll cut through the hype and show you exactly what they can do, and the critical things they still can't.

May 20, 2026 · 4 min read · SuperThinking team

A glowing circuit board designed in the shape of a human brain.

The answer is no.

Okay, article over. But seriously, the debate around Artificial General Intelligence (AGI) has gotten out of hand. Every new model release is met with breathless claims of seeing 'sparks of AGI.' It feels intelligent, it sounds intelligent, so it must be, right?

Wrong. What we have are incredibly powerful, general-purpose prediction engines. They are phenomenal tools that mimic reasoning, but they don’t actually reason. And the difference is everything.

Let’s get concrete. I'm not here to debate philosophy. I'm here to look at what you can and can't do with these tools today and why that tells us AGI is still firmly in the realm of science fiction.

The 'Sparks' Are Real, But They're Not Fire

First, let's give the models their due. If you showed someone GPT-4o five years ago, they'd think you were a wizard. It can look at a screenshot of a website and write the code for it. It can listen to you talk and respond with a different personality. It can plan a trip, write a marketing email, and debug your Python script in the same conversation.

This multimodality and task-switching is where the AGI illusion comes from. You can throw almost any white-collar, screen-based task at it and get a plausible first draft. That feels general. It feels like intelligence.

For example, you can give it this prompt:

Here's a photo of my whiteboard sketch for a new app UI. And here's a raw dump of user feedback from a Google Form. Based on the feedback, what are two critical changes I should make to my UI sketch? Explain your reasoning.

Getting a coherent answer to that requires vision, language understanding, and a form of emergent reasoning to connect abstract feedback to a visual design. It's a stunning technical achievement. But it's still a parlor trick—a very, very sophisticated one.

A close-up of a robot's cracked face, showing complex wiring underneath.

Where It All Falls Apart

The illusion of intelligence is fragile. It shatters the moment you poke at its fundamental limitations. These aren't just small bugs to be ironed out; they are deep, architectural shortcomings that separate mimicry from true cognition.

Here’s what today’s best models can’t do:

Learn from experience. If you correct a model, it doesn't learn. It just incorporates your correction into the context of the current conversation. Start a new chat, and it will make the exact same mistake. It has no persistent memory or mechanism for updating its own weights based on new information. A human learns not to touch a hot stove once. An LLM will touch it a billion times.

Understand the physical world. Models have no grounding in reality. They don't know what it's like to be in a body, to feel gravity, or that you can't pull on a rope that isn't there. They can recite physics formulas but fail at basic spatial reasoning puzzles that a toddler could solve. This is why self-driving cars are still a hard problem; the real world is messy and requires embodied understanding, not just pattern recognition on text and pixels.

Be reliable. Hallucinations are a feature, not a bug. The model's job is to generate the most probable next word. Sometimes, the most probable thing is a complete fabrication. It will invent API functions, cite non-existent legal precedents, and create fake URLs because they look like plausible text sequences. This makes it impossible to trust for mission-critical tasks without a human expert in the loop.

Have goals or self-awareness. An LLM has no intentions, desires, or consciousness. It is a tool that serves the goals of its user. It doesn't 'want' to help you; it is simply completing a text sequence based on your prompt. It has no internal model of itself or the world, which is a prerequisite for the kind of adaptable, self-directed intelligence we associate with AGI.

A neat arrangement of specialized hand tools on a wooden workbench.

A Better Frame: Agentic, Not General

So if it's not AGI, what is it? It's a general-purpose tool on its way to becoming a useful agent.

Chasing the romantic notion of AGI is a distraction. The real work is in harnessing the power of these models to build reliable, specialized systems that automate complex tasks. Think of it less like building a synthetic human and more like building the world's most capable intern.

An agentic system uses an LLM as its reasoning engine but wraps it in a larger architecture. This system might have access to specific tools (like APIs or a database), a memory module (like a vector database), and a clear, narrow objective. It's not 'general intelligence'; it's directed, tool-augmented intelligence.

For example, you can build an agent that processes customer support tickets. It uses the LLM to understand the user's intent, a tool to look up their order history in Shopify, and another tool to draft a response based on company policy. The LLM is just one piece of the puzzle.

This is a much more productive way to think. Stop asking 'Is it AGI?' and start asking, 'What reliable, autonomous task can I build with this?' You'll find the answers are much more practical and, frankly, more valuable.