Research SAPO, Efficient LM Post-Training with Collective RL This is an academic paper describing SAPO, a meta-algorithm that wraps around your preferred policy gradient algorithm.
Product Introducing Judge Judge brings cryptographically verifiable AI evaluation to scale. Built on Verde, Judge ensures independent verification - eliminating opaque APIs.
Product Introducing BlockAssist BlockAssist is an AI Minecraft assistant that learns from your in-game actions, enabling reinforcement learning research in an interactive environment.
Article Introducing RL Swarm’s new backend: GenRL GenRL is a new framework designed from the ground up to simplify and accelerate the creation of advanced RL environments, particularly those involving multiple agents.
Research CheckFree: fault tolerant training without checkpoints This is an academic paper describing CheckFree, a novel recovery method for failures in distributed training that does not require checkpointing or redundant computation.
Research NoLoCo: training large models with no all-reduce This is an academic paper describing NoLoCo, a novel optimisation method for distributed training that replaces the global synchronisation step with a gossip method.
Research Diverse Expert Ensembles: embarrassingly parallel LLMs from diverse experts This is an academic paper that finds benefits to heterogeneity (different model sizes and number of training steps) when training embarrassingly-parallel ensembles of expert models.
Product RL Swarm: a framework for collaborative RL This is open source code (MIT Licence) for peer-to-peer nodes that perform collaborative reinforcement learning over the internet, accessible by anyone on consumer or datacentre hardware.
Research SkipPipe: a communication efficient method for decentralised training This is an academic paper for efficient communication in pipeline parallel training. It introduces an optimal scheduling algorithm that maximises performance and fault tolerance whilst minimising convergence impact from layer skips.
Research Verde: a verification system for machine learning over untrusted nodes This is an academic paper describing Verde, a verification protocol for machine learning programs, as well as the underlying Reproducible Operators (RepOps) system that enables it.
Article GPT@home: Why the Future of Training is Decentralized AI training costs are hitting $100B per run. Gensyn's decentralized infrastructure enables efficient training across edge devices at massive scale—making model development collaborative and accessible.