Is AGI Here? A Sober Look at Today's AI Models

Everyone's debating if the latest models are 'AGI'. The truth is, it's the wrong question. We'll look at what GPT-4o and Claude 3 can actually do—and where they still fail spectacularly.

May 29, 2026 · 4 min read · SuperThinking team

A humanoid robot staring quizzically at a complex espresso machine, unsure how to use it.

The debate is boring. Is GPT-4o AGI? Is Claude 3 sentient? Is Gemini showing 'sparks' of general intelligence? This is navel-gazing. It's like asking if a Formula 1 car can fly. The interesting question isn't whether it meets a poorly defined sci-fi benchmark, but what its actual capabilities and limits are, right now.

Asking "Is it AGI?" treats intelligence as a binary switch. It's not. It's a messy, multi-dimensional spectrum of skills. A model can be a world-class programmer and a world-class idiot in the same conversation. So let's skip the philosophy and look at the engineering reality.

The 'Wow, Maybe It's AGI' Moments

There's no denying the moments that feel like magic. When GPT-4o translates a conversation between two people speaking different languages in real-time, it feels like a universal translator. When Claude 3 Opus finds a single, out-of-place sentence buried in a million tokens of text (the 'needle in a haystack' test), that's a superpower.

These models demonstrate incredible abilities that were unthinkable just a few years ago. Look at what's on the table today:

Cross-Modal Reasoning: You can show GPT-4o a live video feed of a math problem you're working on, and it can talk you through the solution. It connects vision, audio, and language in a fluid way that feels deeply intuitive.
Complex Code Generation: These models can scaffold entire applications. You can describe a web app, its database schema, its API endpoints, and its front-end framework, and it will generate thousands of lines of high-quality boilerplate code. It's not perfect, but it's a massive accelerator.
Expert-Level Knowledge Synthesis: Ask it to explain quantum computing in the style of a 1940s detective novel. It will do it. Ask it to compare the supply chain logistics of Amazon versus Walmart, citing specific operational differences. It will do that too. The ability to synthesize and re-contextualize vast amounts of information is staggering.

When you're in the flow with one of these systems, using it as a creative partner or a coding assistant, the AGI question feels plausible. The model understands context, remembers previous parts of the conversation, and generates novel, useful output. It's an incredible tool.

An intricate circuit board with glowing pathways, symbolizing the power of modern AI.

And Then, The Utter Dumbness

But just when you're convinced you're talking to a new form of life, the illusion shatters. The model will make a mistake so basic, so fundamentally nonsensical, that it reminds you you're dealing with a very sophisticated text-prediction engine, not a thinking entity.

These aren't just edge-case failures. They're fundamental gaps in understanding.

Physical World Amnesia: An AI can describe the physics of a bouncing ball in perfect detail. But ask it a simple riddle like, "I have a box of five apples. I take out three. How many apples do I have?" and it might get it right. But then ask, "How many apples are in the box?" and it can get confused. It doesn't have an intuitive model of object permanence.
Brittleness and Inconsistency: You can get a brilliant answer to a complex question. Five minutes later, you can ask the exact same question and get a completely wrong or nonsensical answer. There's no persistent, stable reasoning process. It's re-rolling the dice every single time.
Lack of True Self-Correction: A model might 'apologize' for a mistake, but it hasn't learned from it. It's just generating a response that looks like an apology. You can point out a logical flaw in its reasoning, and it will agree with you, but then repeat the same flaw in its next response. It lacks the metacognition to truly update its own model of the world.

I recently asked a top-tier model to plan a road trip. It laid out a perfect itinerary. Then I said, "Okay, now reverse the trip." It got completely lost, suggesting impossible driving times and putting cities in the wrong order. The conceptual understanding of 'reversing a sequence' just wasn't there.

A chaotic jumble of multi-colored wires and cables, representing the unpredictable failures of AI.

A Better Way to Think About It

So, is it AGI? Who cares. It's the wrong framing. A better way to think about this is in terms of autonomy and reliability. Can I give this system a high-level goal and trust it to execute the necessary steps reliably, without hallucinating or getting stuck in a nonsensical loop?

For most non-trivial tasks, the answer is still no. You can't tell an AI, "Start a successful SaaS company for me." You can tell it, "Write a Python script to analyze this CSV file and generate a report."

We're moving from narrow AI (great at one thing) to something you could call 'broadly capable AI' (pretty good at many things). But it's not 'general intelligence' in the human sense. It's a different kind of intelligence altogether—alien, powerful, and deeply weird.

Instead of waiting for some mythical AGI arrival, focus on the practical. What tasks can you reliably automate today? Where does the human need to stay in the loop? That's where the real work, and the real value, is.