Is AGI Here? A Reality Check on GPT-4 and Claude 3
Everyone's claiming AGI is just around the corner, citing amazing demos. We'll show you why today's AI, despite its power, fails key tests of true intelligence.
June 12, 2026 · 4 min read · SuperThinking team
Every few weeks, a new demo drops that makes people scream “AGI!” Whether it’s an AI model holding a freakishly human-like conversation or acing a specialized exam, the hype cycle spins up again. People who should know better start talking about “sparks of AGI.”
Let’s be clear: it’s not AGI. Not even close.
What we have are incredibly powerful systems that are masters of pattern recognition and text generation. They can simulate understanding with terrifying accuracy. But they fail at the core things that constitute actual, general intelligence. And mistaking one for the other is a good way to build the wrong things.
The "Sparks" Are Real, But They Don't Start a Fire
There's no denying the magic. Ask Claude 3 Opus to find a single sentence about pizza toppings hidden inside a 200,000-word digital copy of Moby Dick, and it will. Ask GPT-4o to be your real-time verbal translator for a Spanish conversation, and it will perform flawlessly.
These are stunning feats of engineering. They demonstrate a capacity for context processing and multi-modal interaction that was science fiction just a few years ago. The models can write code, explain complex scientific concepts, and even generate a decent sonnet. This is the stuff that fuels the AGI hype. It feels intelligent because it produces artifacts that we associate with intelligence.
But here's the trick: the model isn't thinking. It's predicting. It's running a statistical analysis across its vast training data to generate the most probable sequence of tokens (words, pixels, sounds) that satisfies your prompt. It’s a simulator, not a thinker. It has no internal world model, no persistent memory, and no goals of its own.
Think of it like the most advanced autocomplete in history. When it gives you a correct answer to a complex physics problem, it's not reasoning from first principles. It's seen thousands of similar problems and their solutions during training, and it's reassembling a new solution based on those patterns. This is an incredibly useful capability, but it’s not the same as understanding.
Where It All Falls Apart: Planning and Self-Correction
The gap between simulation and true intelligence becomes a chasm when you ask the AI to do any multi-step task that requires planning, execution, and adaptation.
Give a model a seemingly simple goal:
“Plan a 3-day trip to Portland for me next month. Find the best flight options from SFO, book a reasonably priced hotel near downtown, and create a morning-by-morning itinerary. Execute the bookings and send me the confirmations.”
The AI will give you a beautiful document. It will generate a plausible-sounding itinerary. It will even invent fake flight numbers and hotel confirmation codes that look real. But it cannot take a single action. It can’t browse a live website, compare real-time prices, or interact with a booking API on its own.
Agentic frameworks like Auto-GPT or CrewAI try to solve this by wrapping the LLM in loops and giving it tools. But watch one of these agents run and you’ll see the problem. It quickly gets stuck in loops, forgets its original goal, and makes logical errors a human would never make. It might decide to search for flights, then get distracted by the airline’s Wikipedia page, and then forget it was booking a trip at all.
This is because the model has no persistent memory or state. Each API call is a fresh start. It has no concept of progress and can't self-correct when a strategy isn't working. True intelligence isn't just about having knowledge; it's about applying it sequentially and dynamically to achieve a goal in a messy, unpredictable world.
The Ghost in the Machine Has No Body
There's another, more fundamental piece missing: embodiment. Intelligence, at least the only kind we know, evolved in organisms that had to navigate the physical world. Our thinking is deeply connected to our senses, our bodies, and our interaction with physical objects.
An LLM has none of this. It has never felt the heat of a stove, lifted a heavy box, or learned that dropping a glass causes it to shatter. Its entire universe consists of the text and images it was trained on. It can describe the physics of a falling object perfectly, but it has no intuitive, grounded understanding of gravity.
Multi-modal models that can process images and video are a step forward, but they are still passive observers. They are disembodied brains in a digital jar, analyzing pixels without any of the feedback that comes from trying to move, touch, and manipulate the world.
Without this physical grounding, an AI's “understanding” will always be brittle and abstract. It’s the difference between reading a book about swimming and actually being in the water. Until AI can learn from direct interaction with the world, its intelligence will remain fundamentally alien and incomplete.
So, What Are We Building?
If it’s not AGI, what is it? We're building phenomenal tools. Universal simulators. Systems that can manipulate symbols and data at a scale and speed we can barely comprehend. These tools will change everything about how we work and create.
We should focus on that. Build better co-pilots, smarter assistants, and more powerful creative partners. Let’s stop chasing the romantic notion of a conscious machine and get to work on practical systems that augment our own intelligence.
The AGI conversation is a distraction. The real work is in figuring out how to integrate these powerful, but flawed, systems into our lives in a way that is useful, safe, and reliable. That's a hard enough problem without worrying about whether your chatbot has a soul.