Research Note

Teaching AI together: a plain guide to federated learning

A non-technical overview of the field, its progress, and the direction Lupus is exploring. Work in progress

Most powerful AI models today are trained on enormous clusters of specialized chips, housed in a handful of data centers, owned by a handful of companies. The cost runs into the tens or hundreds of millions of dollars. That price tag decides who gets to build foundational AI — and, just as importantly, who gets to own it.

There is another way to gather computing power, and it is hiding in plain sight: the billions of laptops, desktops, and phones that sit idle for most of the day. The research question Lupus is built around is simple to state and hard to answer — can many small, ordinary devices, working together, train a model that normally needs a supercomputer?

This page is written for a general audience. It explains the field and our direction without the technical specifics of our approach, which are still under active development.

What is federated learning?

Federated learning is a way of training a shared model across many separate devices without gathering everyone's data into one place. Instead of sending raw data to a central server, each device trains on what it has locally and sends back only what it learned — a small mathematical summary of its progress. A coordinator combines these summaries into an improved shared model, then sends it back out. The cycle repeats.

Think of it like a study group where everyone reads a different chapter at home, then meets to pool their notes. No one has to hand over their book; they just share the insights. The group's collective understanding improves with each meeting.

Learn locally

Each device improves the model a little, using only its own slice of work.

Share the lesson

Devices send back a compact summary of their progress — not raw data.

Combine & repeat

A coordinator merges the lessons into a better shared model, then sends it out again.

How the research has progressed

Federated learning began as a way to train modest models across phones while keeping personal data private. Over the past decade, a series of advances has pushed the idea toward something far more ambitious: training large models across slow, scattered, and unreliable connections. A few milestones tell the story.

What makes Lupus different

The research above mostly assumes trusted machines in controlled settings. Opening the doors to the public — anyone, on any device — raises three new questions that define our work:

🚪

Zero friction

Contributing should be as easy as opening a web page. No downloads, no setup.

🛡️

Trust without trusting

When contributors are strangers, the system must confirm that the work submitted is real — without slowing everyone down.

⚖️

Fair reward

People who lend their computing power earn a stake in what they help create.


System architecture

At a high level, Lupus connects three kinds of participants in a continuous loop. The diagram below shows the shape of the system — the inner workings of each piece are part of our ongoing research.

THE TRAINING LOOP Volunteer Devices Browsers · laptops · phones Device A learning locally Device B learning locally Device C may join / leave …thousands more earn a stake ◆ Coordinator Organizes the work Hands out tasks Checks the work Combines lessons verification keeps it honest Shared Model Grows a little each round tasks → ← lessons update next round ↻ Contributions are recorded fairly, and contributors share in what the finished model creates. Verification and reward details are part of ongoing research.
High-level view of the Lupus training loop. Specific algorithms, verification methods, and reward mechanics are intentionally omitted.

The loop, step by step

  1. Tasks go out. The coordinator splits the work and sends a small piece to each volunteer device, along with the current shared model.
  2. Devices learn. Each device improves the model on its piece, entirely on its own, for a while.
  3. Lessons come back. Devices return a compact summary of what they learned — never raw data.
  4. The work is checked. The system confirms contributions are genuine before accepting them.
  5. Lessons combine. Verified summaries are merged into a better shared model.
  6. Repeat. The improved model goes back out, and the cycle continues — thousands of times.
Two ingredients make the open version possible: methods that let devices communicate very little, and a way to confirm that anonymous contributors did real work. Both build directly on the research timeline above.

Where this is headed

The near-term goal is a working demonstration — a small but real model trained end-to-end by volunteers in their browsers. From there, the path scales up in stages, with each step gated on results, safety review, and community input.

The longer aim is straightforward: if a community can sustain free encyclopedias and open-source software, it can train a foundation model that belongs to all of its contributors. That is the bet Lupus is exploring.

← Back to home