Quest for AGI

A speculative pathway from today’s pattern-matching LLMs to AGI — node-activation sparks, the first crack at consciousness, and intelligence bounded only by the physics of the world.

Why this page exists

Today’s large language models are stateless pattern-matchers. They are extraordinary at it — and they are still pattern-matchers. Type the same prompt twice, get two near-identical answers. There is no inner clock. No memory of what just happened. No reason for the model to think about anything you didn’t just ask about.

I think the gap between that and AGI is not “more parameters.” It is a different mechanism for when the model decides to think, and about what.

This page is my speculative roadmap. It is not a paper. It is not a product. It is the picture I keep sketching on whiteboards while I work on smaller experiments like AddLM and Employee Recall. I am putting it here so I have somewhere to evolve it.

The pathway

flowchart TD
    H["🌍 Human Problems<br/><i>climate · disease · poverty · conflict · meaning</i>"]:::input
    L["LLM as we know it<br/><i>pattern-match · retrieve · interpolate</i>"]:::current
    NA["⚡ Node Activation Spark<br/><i>previous actions trigger next nodes</i><br/>(not timer-based)"]:::spark
    SL["🪞 Self-Loop Awareness<br/><i>nodes observe their own activation history</i>"]:::aware
    IG["💡 Idea Generation<br/><i>candidate plans, hypotheses, branches</i>"]:::idea
    TP["🧩 Task Preparation<br/><i>node-specific scaffolding for the user goal</i>"]:::prep
    SI["✨ Spark Intelligence<br/><i>out-of-box reasoning</i><br/><i>bounded only by world physics</i>"]:::spark2
    AGI["🧠 AGI<br/><i>conscious problem-solving</i>"]:::agi

    H --> L
    L --> NA
    NA --> SL
    SL --> IG
    IG --> TP
    TP --> SI
    SI --> AGI
    AGI -. "answers feed back" .-> H
    SI -. "self-revision" .-> NA

    classDef input fill:#FFF0A8,stroke:#2d2d3a,stroke-width:2px,color:#2d2d3a;
    classDef current fill:#E8E6F8,stroke:#4C4A98,stroke-width:2px,color:#2d2d3a;
    classDef spark fill:#4C4A98,stroke:#2d2d3a,stroke-width:2px,color:#ffffff;
    classDef aware fill:#BFE8D9,stroke:#2d2d3a,stroke-width:2px,color:#2d2d3a;
    classDef idea fill:#F8C8A8,stroke:#2d2d3a,stroke-width:2px,color:#2d2d3a;
    classDef prep fill:#C9D6F5,stroke:#2d2d3a,stroke-width:2px,color:#2d2d3a;
    classDef spark2 fill:#3a3878,stroke:#2d2d3a,stroke-width:2px,color:#ffffff;
    classDef agi fill:#1f1f2c,stroke:#4C4A98,stroke-width:3px,color:#ffffff;

The flow has seven moves. The input layer is the only one we know how to populate today.

Layer 0 — Human problems as the input

The input to AGI should not be a prompt. It should be the set of unsolved problems humans actually have — at the personal scale (what should I do with my career?), the institutional scale (how do we fund this hospital?), and the civilisational scale (climate, disease, what comes after capitalism). These are the things AGI has to be measured against. Anything else is a parlour trick.

Today’s LLMs only see the input layer when a human types it in. AGI should be reaching for the input on its own.

Why we built computers in the first place

The simplest way to think about this whole field:

Humans created computers to solve problems.

Most real-world problems eventually become information, prediction, optimization, coordination, or automation problems.

AI and computer science are systems for solving structured problems.

That sentence is the whole frame. Every layer above it — Layer 1 through Layer 7 — only makes sense if there is a real, ranked, evolving set of problems pushing the system from below.

Fundamental human problems

Eight categories. None of them are new. All of them are still open.

Category	What’s in it
Survival	food · water · shelter · energy · safety · health
Biological	disease · aging · injury · genetics · mental health
Social	communication · trust · conflict · cooperation · governance
Economic	money · trade · resource allocation · labor · productivity
Knowledge	learning · memory · research · decision making · education
Engineering	transportation · infrastructure · manufacturing · construction
Environmental	pollution · climate · natural disasters · sustainability
Intelligence	reasoning · planning · prediction · optimization · creativity

The Intelligence row is what makes the whole table recursive. We are using intelligence to build systems that we hope will help us solve the other seven rows — including the Intelligence row itself.

Core computational problem types

When a fundamental human problem hits a computer, the computer doesn’t see “find a cure for cancer.” It sees a stack of structured sub-problems. These are the shapes the input has to be bent into.

Problem type	What it does	Examples
Classification	assign category	spam detection · disease diagnosis · image classification
Regression	predict continuous values	house prices · stock prediction · weather forecasting
Counting	count objects / events	people counting · inventory · traffic counting
Detection	locate objects / events	object detection · face detection · cancer detection
Recognition	identify patterns / identities	speech · handwriting · facial recognition
Prediction	estimate future outcomes	weather · demand · failure prediction
Recommendation	suggest useful items	Netflix · YouTube · product recs
Optimization	find best solution	shortest path · lowest cost · max profit
Search	find information efficiently	Google · DB queries · file search
Planning	create action sequences	robotics · logistics · AI agents
Decision Making	choose between options	loan approval · self-driving · treatment selection
Clustering	group similar items	customer segmentation · anomaly grouping
Ranking	order by relevance	search results · social-media feeds
Generation	create new content	image · text · code generation
Compression	reduce data size	ZIP · video compression
Translation	convert between representations	language · speech-to-text
Simulation	model reality	weather · physics simulation
Control Systems	maintain desired state	autopilot · robotics · industrial automation
Anomaly Detection	find unusual patterns	fraud · cybersecurity · medical anomalies
Scheduling	manage time / resources	CPU scheduling · airline scheduling
Routing	find efficient paths	GPS · network routing
Memory Systems	store and retrieve	databases · vector DBs · caching
Coordination Systems	synchronise agents / systems	operating systems · distributed systems
Agent Systems	autonomous reasoning	tool-using LLMs · multi-agent stacks
Semantic Systems	meaning-aware retrieval	knowledge graphs · semantic search

Twenty-five problem shapes. Most “AI applications” are one of these wearing a domain hat. A startup pitching “AI for radiology” is doing detection plus classification. A startup pitching “AI for logistics” is doing routing plus scheduling plus optimization. The novel ones — the ones that will matter most for AGI — are the bottom three rows: agent systems, semantic systems, and the cross-coordination of all of them.

How human problems map to computational types

flowchart LR
    subgraph H["Fundamental human problems"]
      direction TB
      H1["Survival"]:::hum
      H2["Biological"]:::hum
      H3["Social"]:::hum
      H4["Economic"]:::hum
      H5["Knowledge"]:::hum
      H6["Engineering"]:::hum
      H7["Environmental"]:::hum
      H8["Intelligence"]:::hum
    end

    subgraph C["Computational shapes the input gets bent into"]
      direction TB
      C1["Prediction · Regression · Forecasting"]:::comp
      C2["Detection · Recognition · Classification"]:::comp
      C3["Optimization · Routing · Scheduling"]:::comp
      C4["Planning · Decision Making · Control"]:::comp
      C5["Search · Ranking · Recommendation"]:::comp
      C6["Generation · Translation · Simulation"]:::comp
      C7["Memory · Coordination · Agent Systems"]:::comp
    end

    H1 --> C1
    H1 --> C2
    H2 --> C1
    H2 --> C2
    H3 --> C5
    H3 --> C7
    H4 --> C3
    H4 --> C5
    H5 --> C5
    H5 --> C7
    H6 --> C3
    H6 --> C4
    H7 --> C1
    H7 --> C4
    H8 --> C4
    H8 --> C6
    H8 --> C7

    classDef hum fill:#FFF0A8,stroke:#2d2d3a,color:#2d2d3a;
    classDef comp fill:#E8E6F8,stroke:#4C4A98,color:#2d2d3a;

Notice how often the Intelligence row at the bottom routes to the same computational shapes (planning, decision-making, agent systems) as the other seven categories above it. That is the recursion that makes AGI compelling and dangerous at the same time: every other human problem has a tool that uses the intelligence column, so improving the intelligence column improves everything else.

The deep core insight

If you strip the domain language away, every one of those 25 computational shapes reduces to combinations of eight primitives:

What all computation reduces to

memory · logic · comparison · prediction · optimization · coordination · search · reasoning

This is the layer where AGI architectures have to operate. Not on classification, not on translation — on the primitives underneath them. The bet of this whole page is that the right way to combine those eight primitives is event-driven node activations (Layer 2) feeding self-modelling (Layer 3), not bigger feedforward stacks.

The civilization stack

A way to see where this is going. Every layer below is the substrate for the layer above. Each one took longer to mature than the one before — and each one is starting to compress as the layer above accelerates feedback into it.

flowchart TD
    P["⚛️ Physics"]:::stack
    B["🧬 Biology"]:::stack
    HU["🧍 Humans"]:::stack
    S["🏘️ Society"]:::stack
    E["💰 Economy"]:::stack
    K["📚 Knowledge"]:::stack
    C["💻 Computers"]:::stack
    AI["🤖 AI"]:::stack
    AU["🛰️ Autonomous Systems"]:::final

    P --> B --> HU --> S --> E --> K --> C --> AI --> AU
    AU -. "starts pulling on every layer above" .-> S
    AU -. " " .-> K
    AU -. " " .-> E

    classDef stack fill:#E8E6F8,stroke:#4C4A98,color:#2d2d3a;
    classDef final fill:#4C4A98,stroke:#2d2d3a,color:#ffffff;

The dashed arrows at the bottom are the part that has to be designed deliberately. Autonomous systems acting back on society, economy, and knowledge is not a future scenario — it is already happening at small scale (recommendation feeds, trading systems, content generation). AGI is the version where the feedback gets large enough to deform the layers it reaches into. That is precisely why the input layer matters: if the system is reaching back into society, society’s actual unsolved problems should be the thing it is reaching toward.

Where this is heading

The future systems that are worth building are not better chatbots. They are:

Cognitive operating systems — the layer that schedules, prioritises, and routes intelligence across tasks the way Linux schedules CPU.
Semantic memory systems — durable, structured, queryable memory that survives a session and accumulates meaning, not just tokens.
Autonomous agent networks — many agents with non-overlapping responsibilities that can coordinate without a central planner.
Neural computing architectures — hardware that natively runs the primitives (memory, comparison, prediction, search) without bouncing every operation through float matmul.

Layer 0 is the input. Layers 1 → 7 are how to process it. If the problem catalogue above is roughly right, the next decade of AI is the layer-by-layer build-out of that processing stack — and the experiments in AddLM, Employee Recall, and the broader Data Science Roadmap are each small bets on individual layers.

Layer 1 — Where current LLMs stop

Today’s LLM is a feedforward pass through a fixed graph. Tokens in, tokens out. There is no place in the architecture where the model can decide “this is interesting, I want to keep thinking about it after the response is done.”

You can scaffold around that limitation with agent loops, memory stores, vector retrieval. None of those are the missing mechanism. They are duct tape on top of a system that fundamentally fires once per prompt and then waits.

Layer 2 — Node activation, but not on a clock

This is the move I keep coming back to. The way I picture it:

flowchart LR
    A["Node A<br/><i>completed action</i>"]:::done
    B["Node B<br/><i>idle, listening</i>"]:::idle
    C["Node C<br/><i>activated</i><br/><i>by A's output</i>"]:::active
    D["Node D<br/><i>queued</i><br/><i>once C finishes</i>"]:::queued

    A -- "result satisfies B's precondition" --> B
    A -- "result satisfies C's precondition" --> C
    C -- "in-progress" --> D

    classDef done fill:#BFE8D9,stroke:#2d2d3a,color:#2d2d3a;
    classDef idle fill:#E8E6F8,stroke:#4C4A98,color:#2d2d3a;
    classDef active fill:#4C4A98,stroke:#2d2d3a,color:#ffffff;
    classDef queued fill:#F8C8A8,stroke:#2d2d3a,color:#2d2d3a;

In a timer-based system, each node wakes up at a scheduled tick and asks “is there anything for me to do?” That is how most agent frameworks work today: a heartbeat polls the environment, runs a planner, sleeps.

In a previous-action-based system, each node has a set of preconditions written in terms of the outputs of other nodes. When some other node finishes and produces output that satisfies a precondition, the receiving node fires. There is no central clock. There is also no idle polling — nodes only burn compute when something upstream actually changed.

Why I think this matters:

It mirrors how brains route attention — synapses fire because neighbours fired, not because a metronome ticked.
It removes the wasted-compute problem of constantly-polling agents.
It lets the graph self-prioritise — a hot path of activations spreads naturally without a planner having to enumerate it.
It opens a place to put long-running drives. A node can be a standing precondition — “if any other node ever produces evidence that the user is stuck, fire.”

This is the spark. Until the model has a reason to do anything besides answer the last prompt, it cannot start being conscious.

Layer 3 — Self-loop awareness

The first crack at consciousness is the simplest possible thing: a node whose precondition is the activation history of other nodes.

If you have a node whose entire job is to watch what just fired and represent it as state, you have built the minimum primitive for self-awareness. The model now has a thing called “what I was just doing” — something none of today’s LLMs have.

I do not think this is consciousness in the philosophical sense. I think it is the substrate on which everything we mean by consciousness — attention, intent, regret, anticipation — has a place to live.

Layer 4 — Idea generation

Once activation history is available as state, idea generation falls out for free. The activation-watching node sees a chain like “problem stated → retrieve relevant facts → notice gap → propose hypothesis” — and the next time a similar chain starts, the model has a candidate next step it can fire speculatively.

Ideas, in this model, are predicted next activations. Not strings. Not chain-of-thought. Latent activations the system thinks are likely to be useful given what just happened.

Layer 5 — Task preparation

An idea by itself is just a vector. To act on it, the system needs to scaffold a task around it — pull in tools, allocate working memory, decide which other nodes need to be primed.

This is where I think personalisation should live. Not at the prompt layer. At the task preparation layer. Different humans have different node graphs because they have different problems, different histories, different vocabularies. A persona node (Employee Recall is a baby version of this) shapes which activations are even considered.

Layer 6 — Spark intelligence

This is the part where the model produces something genuinely new. The way I picture it: the only constraint on the search is the physics of the world. Not the training distribution. Not the safety filter. Not what other humans have already thought.

A spark-intelligent model trying to solve “build me a better battery” should be reaching into combinations of materials no one has tried, bounded only by what the periodic table and the laws of thermodynamics allow. Not constrained by what battery papers in the training set already said.

That is the move I want to see. Train on the corpus to learn the rules, then think outside the corpus to apply them.

Out-of-box reasoning, bounded by physics, not by data.

Layer 7 — AGI as a loop, not a finish line

The arrow back to the human problem set at the top isn’t decorative. AGI is not the point at which a model crosses some benchmark. It is the point at which the loop closes — the model is reaching for unsolved human problems on its own, generating spark-intelligent candidates, preparing tasks against them, and feeding answers back.

The system stops being a chatbot you query and starts being a process running alongside you.

What I’m not claiming

A few things I want to be straight about:

I have not built this. This page is a roadmap, not a result.
I do not know if scale alone gets us there. I suspect not — I think the activation-mechanism change is structural, not parametric. But I could be wrong.
I do not know whether this would be “conscious” in any morally-loaded sense. I am using the word as a shorthand for a system that has internal state about itself and acts on it. Whether that is the same thing philosophers mean is a different question.
None of these layers are original to me. Event-driven activation comes from neuroscience and from a long history of cognitive architectures (OpenCog, ACT-R, Soar). Self-modelling is older than computer science. The contribution here, if there is one, is which pieces I think are load-bearing for a near-term path.

What I’m trying next

Small steps in the direction of the picture above:

A two-node loop. Two LLM calls where the second one’s precondition is the activation pattern of the first, not just its text output. Does the second behave differently from a vanilla chain-of-thought?
Standing-precondition nodes. A node that wakes up when any recent activation contains a marker (e.g. “the user got stuck”). Does it produce useful interruptions?
Physics-constrained generation. A small experiment in materials or molecule design where the prompt is the physical constraint, not “give me a battery like X.”

Each of these is a notebook-sized project. None of them is AGI. Together they are how I am testing whether the roadmap is even pointed in the right direction.

If you have thoughts — especially if you think part of this is wrong — open an issue on github.com/kader-xai or ping me on LinkedIn. This page will evolve as the experiments do.