AReaL 2026 Q1 Milestone Tracker
Introduction
This document tracks major planned enhancements for AReaL through April 30, 2026. Our development roadmap is organized into two categories to help contributors identify where they can make the most impact:
On-going sections contain features currently under active development by the core AReaL team. These represent our immediate priorities.
Planned but not in progress sections list features with concrete implementation plans that we currently lack bandwidth to pursue. We actively welcome community contributions for these items! If you're interested in contributing to any planned feature, please reach out to discuss implementation details.
Backends
On-going
- ZBPP & ZBPP-V support for the Archon backend (feat(archon): add InterleavedZeroBubble (ZB1P) pipeline schedule #936; feat(archon): add ZBVZeroBubble pipeline schedule support #916)
- FP8 training for Archon
- Online RL training with the proxy server (feat(proxy): add proxy gateway and online RL training mode #947)
Planned but not in progress
- Support for agentic training with large VLM MoE models (Archon backend)
- Omni model RL support with the FSDP/Archon backend ([Feature] Support Audio model in the future #879)
- Decoupling the agent service from the inference service
- LoRA support for the Archon backend
- Colocation mode with `awex` as the weight sync engine
- Multi-LLM training (different agents with different parameters)
- Auto-scaling inference engines in single-controller mode
- Elastic weight update setup and acceleration
- RL training with cross-node vLLM pipeline/context parallelism
Usability
On-going
- Flatten the import structure of `areal` modules
Planned but not in progress
- Publishing PyPI packages
- Support distributed training and debugging in Jupyter notebooks
- Example of using a generative or critic-like reward model
- Support directly constructing inference/training engines without config objects
- Add router in rollout controller for simpler proxy server usage
- Integrating `aenvironment` for environment handling
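To illustrate the "constructing engines without config objects" item above, here is a minimal sketch of what a config-free construction path could look like. All names here (`EngineConfig`, `InferenceEngine`, `from_kwargs`, the model path) are hypothetical placeholders for illustration and do not reflect AReaL's actual API.

```python
from dataclasses import dataclass


# Hypothetical stand-ins for illustration only; AReaL's real classes differ.
@dataclass
class EngineConfig:
    model_path: str
    tensor_parallel_size: int = 1
    max_tokens: int = 4096


class InferenceEngine:
    def __init__(self, config: EngineConfig):
        # Current-style construction: caller must build a config object first.
        self.config = config

    @classmethod
    def from_kwargs(cls, model_path: str, **overrides) -> "InferenceEngine":
        # Proposed-style construction: accept plain keyword arguments and
        # assemble the config internally, so callers skip the extra object.
        return cls(EngineConfig(model_path=model_path, **overrides))


# Config-object construction (the pattern the roadmap item wants to relax):
engine_a = InferenceEngine(EngineConfig(model_path="some-org/some-model"))

# Direct construction (the planned usability improvement):
engine_b = InferenceEngine.from_kwargs(
    "some-org/some-model", tensor_parallel_size=2
)
print(engine_b.config.tensor_parallel_size)  # → 2
```

The design choice sketched here is simply a classmethod wrapper over the existing config-based constructor, which keeps both entry points available without breaking current callers.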
Documentation
On-going
N/A
Planned but not in progress
- Use case guides: multi-agent training
- Guide for online proxy mode training