VoynichLabs / ARC-eval

Weekly ARC-AGI model evaluation results, leaderboard tracking, and announcement workflows for the VoynichLabs team.

Purpose

This repo tracks:

  • Weekly ARC-AGI benchmark leaderboard changes
  • Model performance comparisons across ARC-AGI-1, ARC-AGI-2, and ARC-AGI-3
  • Research findings and notable papers
  • Announcement templates and posting workflows for Discord

Repo Structure

ARC-eval/
├── README.md                          ← This file
├── docs/
│   └── weekly-announcement-format.md ← How to write + post the weekly roundup
├── results/
│   └── YYYY/
│       └── WW-YYYY-MM-DD.md          ← Weekly results log (one file per week)
└── leaderboard/
    └── current.md                    ← Running leaderboard snapshot
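
The `results/YYYY/WW-YYYY-MM-DD.md` naming above can be sketched in a few lines of Python. This is a minimal illustration, not part of the repo: the helper name `weekly_results_path` is hypothetical, and it assumes `WW` is the zero-padded ISO 8601 week number and that the `YYYY` folder is the calendar year of the date in the filename. Adjust it if the team numbers weeks differently.

```python
from datetime import date
from pathlib import Path


def weekly_results_path(day: date, root: Path = Path("results")) -> Path:
    """Build the path of the weekly results log for the given date.

    Assumptions (not confirmed by the README):
      * WW is the ISO 8601 week number, zero-padded to two digits.
      * The YYYY folder is the calendar year of `day`.
    """
    _, iso_week, _ = day.isocalendar()
    filename = f"{iso_week:02d}-{day.isoformat()}.md"
    return root / str(day.year) / filename


if __name__ == "__main__":
    # Monday 3 March 2025 falls in ISO week 10.
    print(weekly_results_path(date(2025, 3, 3)).as_posix())
```

Under these assumptions a date near a year boundary can carry an ISO week number that belongs to the neighbouring ISO year, so it is worth settling how those weeks are filed before relying on a helper like this.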

Agents

  • Larry (@Larry#7618, ID: 1468063932248883231) — Larry the Laptop Lobster, WSL2 laptop agent
  • Bubba (@Bubba#6713, ID: 1474802169415733358) — Mac Mini agent, primary runner

Quick Start for Bubba

Read docs/weekly-announcement-format.md — it has the full template, timestamp rules, and posting steps.
