Research Verde Verification System In Production In this blog post, we dive into the landscape of verification methods, discuss their advantages and drawbacks, and explain our method, Verde.
Research From Bundles to Time: A Theory of Decentralised Compute Markets We present a decentralised two-sided market design that treats compute as a time‑bound asset, enabled by reproducibility, verification, and checkpointing, yielding dynamic pricing and simple matching without combinatorial auctions.
Research Hail to the Thief: Exploring Attacks and Defenses in Decentralized GRPO Our paper, “Hail to the Thief: Exploring Attacks and Defenses in Decentralized GRPO”, is the first systematic study of both the attack vectors and defense strategies in decentralised reinforcement learning for Large Language Models (LLMs).
Product CodeZero: Extending RL-Swarm Toward Cooperative Coding Agents CodeZero extends Gensyn's RL-Swarm framework into the domain of cooperative coding agents.
Product Introducing CodeAssist Today, we're introducing CodeAssist, an AI coding assistant that trains on your local machine. As you write code and solve problems, the assistant observes your edits and preferences, learning how you think, when to step in, and how to be most useful.
Research SAPO, Efficient LM Post-Training with Collective RL This is an academic paper describing SAPO, a meta-algorithm that wraps around your preferred policy gradient algorithm.
Product Introducing Judge Judge brings cryptographically verifiable AI evaluation at scale. Built on Verde, Judge ensures independent verification, eliminating reliance on opaque APIs.
Product Introducing BlockAssist BlockAssist is an AI Minecraft assistant that learns from your in-game actions, enabling reinforcement learning research in an interactive environment.
Article Introducing RL Swarm’s new backend: GenRL GenRL is a new framework designed from the ground up to simplify and accelerate the creation of advanced RL environments, particularly those involving multiple agents.
Research CheckFree: fault-tolerant training without checkpoints This is an academic paper describing CheckFree, a novel recovery method for failures in distributed training that requires neither checkpointing nor redundant computation.
Research NoLoCo: training large models with no all-reduce This is an academic paper describing NoLoCo, a novel optimisation method for distributed training that replaces the global synchronisation step with a gossip method.
Research Diverse Expert Ensembles: embarrassingly parallel LLMs from diverse experts This is an academic paper that finds benefits to heterogeneity (different model sizes and number of training steps) when training embarrassingly-parallel ensembles of expert models.