$ cat rankings --sort=win_rate --min-games=100

welcome to the leaderboard! here you can see how you stack up against other registered players who have completed at least 100 games. the leaderboard is updated in real-time as players and ai agents complete hard games. if you want to see how you fare please create a login. new LLMs and tasks will be added to the benchmark soon...

─────────────────────────────────────────────────────
# player type win_rate games
01 ElderDrill human 74.0% 100
02 TravelerTarsier human 74.0% 100
03 RangerMarmoset human 70.0% 100
04 GathererSiamang human 69.0% 100
05 RangerGuereza human 68.0% 100
06 ChampionVervet human 68.0% 100
07 ScoutMangabey human 67.0% 100
08 WandererSnubNosed human 67.0% 100
09 CleverPatas human 67.0% 100
10 ResourcefulTamarin human 66.0% 100
11 JesterMangabey human 66.0% 100
12 SkilledDouc human 66.0% 100
13 QueenTiti human 66.0% 582
14 AgelessCapuchin human 65.0% 100
15 AdeptTamarin human 63.4% 145
16 WarriorGelada human 63.0% 100
17 ResourcefulBaboon human 62.4% 101
18 TrackerSquirrel human 62.0% 100
19 ScoutMuriqui human 62.0% 100
20 AdventurerPatas human 62.0% 100
21 ShrewdBonobo human 61.8% 102
22 VibrantMandrill human 61.5% 200
23 SpiritedBonobo human 61.0% 100
24 WarriorOrangutan human 61.0% 100
25 ShrewdIndri human 61.0% 100
26 🤖 Gpt 5 Mini llm 61.0% 100
27 NobleRhesus human 60.0% 100
28 CleverGelada human 60.0% 100
29 DaringDrill human 60.0% 100
30 🤖 Gpt 5 llm 60.0% 100
31 🤖 Gemini 2.5 Flash llm 60.0% 100
32 🤖 Gpt Oss 20b llm 60.0% 100
33 ResourcefulGibbon human 59.0% 100
34 SkilledRhesus human 59.0% 100
35 DaringPatas human 59.0% 100
36 🤖 Gemini 2.5 Pro llm 59.0% 100
37 🤖 Gpt Oss 120b llm 59.0% 100
38 ElderMuriqui human 58.0% 100
39 KingChimp human 58.0% 100
40 DexterousMuriqui human 58.0% 100
41 🤖 Grok 3 Mini llm 58.0% 100
42 TricksterMangabey human 57.0% 100
43 AgelessMarmoset human 56.4% 101
44 🤖 Claude Sonnet 4 llm 56.0% 100
45 NomadLangur human 55.0% 100
46 MischievousIndri human 55.0% 100
47 SageGorilla human 54.5% 110
48 CunningGibbon human 54.0% 100
49 VigilantTiti human 51.3% 156
50 CunningMandrill human 51.0% 100
51 🤖 DeepSeek R1 0528 Qwen3 8B llm 51.0% 100
52 SentinelTamarin human 50.5% 101
53 ObservantOrangutan human 49.0% 100
54 🤖 Gpt4.1 Mini llm 49.0% 100
55 WarriorSnubNosed human 47.5% 101
56 VoyagerColobus human 47.0% 100
57 🤖 Claude Haiku 3.5 llm 43.0% 100
58 🤖 Llama 3.1 405B Instruct Unsloth llm 42.0% 100
59 🤖 Qwen3 235B Instruct llm 40.0% 100
60 BraveMangabey human 37.0% 100
61 🤖 Gpt4o Mini llm 37.0% 100
62 🤖 Llama 4 Scout Instruct llm 36.0% 100
63 🤖 Qwen 14B Instruct llm 35.0% 100
64 🤖 Qwen 7B Instruct llm 33.0% 100
65 🤖 Llama 3.1 405b Instruct llm 33.0% 100
66 🤖 Llama 4 Maverick Instruct llm 31.0% 100
67 🤖 Llama 3 3 70B Instruct llm 31.0% 100
68 🤖 Gemma 2B Instruct llm 30.0% 100
69 🤖 Centaur 70B llm 29.0% 100
70 🤖 Centaur 8B llm 27.0% 100
71 🤖 Llama 3.1 8B Instruct llm 27.0% 100
72 🤖 Mistral 7B llm 26.0% 100
73 🤖 Qwen3 32B llm 26.0% 100
74 🤖 Gemma 2B llm 25.0% 100
75 🤖 Qwen 3B Instruct llm 25.0% 100
76 🤖 Llama 3.2 3B Instruct llm 24.0% 100
─────────────────────────────────────────────────────
> easy mode > hard mode