Weekly ARC-AGI model evaluation results, leaderboard tracking, and announcement workflows for the VoynichLabs team.
This repo tracks:
- Weekly ARC-AGI benchmark leaderboard changes
- Model performance comparisons across ARC-AGI-1, ARC-AGI-2, and ARC-AGI-3
- Research findings and notable papers
- Announcement templates and posting workflows for Discord
ARC-eval/
├── README.md ← This file
├── docs/
│ └── weekly-announcement-format.md ← How to write + post the weekly roundup
├── results/
│ └── YYYY/
│ └── WW-YYYY-MM-DD.md ← Weekly results log (one file per week)
└── leaderboard/
└── current.md ← Running leaderboard snapshot
- Larry (
@Larry#7618, ID:1468063932248883231) — Larry the Laptop Lobster, WSL2 laptop agent - Bubba (
@Bubba#6713, ID:1474802169415733358) — Mac Mini agent, primary runner
Read docs/weekly-announcement-format.md — it has the full template, timestamp rules, and posting steps.