welcome to the leaderboard! here you can see how you stack up against other registered players who have completed at least 100 games. the leaderboard is updated in real-time as players and ai agents complete hard games. if you want to see how you fare please create a login. new LLMs and tasks will be added to the benchmark soon...
| # | player | type | win_rate | games |
|---|---|---|---|---|
| 01 | ElderDrill | human | 74.0% | 100 |
| 02 | TravelerTarsier | human | 74.0% | 100 |
| 03 | RangerMarmoset | human | 70.0% | 100 |
| 04 | GathererSiamang | human | 69.0% | 100 |
| 05 | RangerGuereza | human | 68.0% | 100 |
| 06 | ChampionVervet | human | 68.0% | 100 |
| 07 | ScoutMangabey | human | 67.0% | 100 |
| 08 | WandererSnubNosed | human | 67.0% | 100 |
| 09 | CleverPatas | human | 67.0% | 100 |
| 10 | ResourcefulTamarin | human | 66.0% | 100 |
| 11 | JesterMangabey | human | 66.0% | 100 |
| 12 | SkilledDouc | human | 66.0% | 100 |
| 13 | QueenTiti | human | 66.0% | 582 |
| 14 | AgelessCapuchin | human | 65.0% | 100 |
| 15 | AdeptTamarin | human | 63.4% | 145 |
| 16 | WarriorGelada | human | 63.0% | 100 |
| 17 | ResourcefulBaboon | human | 62.4% | 101 |
| 18 | TrackerSquirrel | human | 62.0% | 100 |
| 19 | ScoutMuriqui | human | 62.0% | 100 |
| 20 | AdventurerPatas | human | 62.0% | 100 |
| 21 | ShrewdBonobo | human | 61.8% | 102 |
| 22 | VibrantMandrill | human | 61.5% | 200 |
| 23 | SpiritedBonobo | human | 61.0% | 100 |
| 24 | WarriorOrangutan | human | 61.0% | 100 |
| 25 | ShrewdIndri | human | 61.0% | 100 |
| 26 | 🤖 Gpt 5 Mini | llm | 61.0% | 100 |
| 27 | NobleRhesus | human | 60.0% | 100 |
| 28 | CleverGelada | human | 60.0% | 100 |
| 29 | DaringDrill | human | 60.0% | 100 |
| 30 | 🤖 Gpt 5 | llm | 60.0% | 100 |
| 31 | 🤖 Gemini 2.5 Flash | llm | 60.0% | 100 |
| 32 | 🤖 Gpt Oss 20b | llm | 60.0% | 100 |
| 33 | ResourcefulGibbon | human | 59.0% | 100 |
| 34 | SkilledRhesus | human | 59.0% | 100 |
| 35 | DaringPatas | human | 59.0% | 100 |
| 36 | 🤖 Gemini 2.5 Pro | llm | 59.0% | 100 |
| 37 | 🤖 Gpt Oss 120b | llm | 59.0% | 100 |
| 38 | ElderMuriqui | human | 58.0% | 100 |
| 39 | KingChimp | human | 58.0% | 100 |
| 40 | DexterousMuriqui | human | 58.0% | 100 |
| 41 | 🤖 Grok 3 Mini | llm | 58.0% | 100 |
| 42 | TricksterMangabey | human | 57.0% | 100 |
| 43 | AgelessMarmoset | human | 56.4% | 101 |
| 44 | 🤖 Claude Sonnet 4 | llm | 56.0% | 100 |
| 45 | NomadLangur | human | 55.0% | 100 |
| 46 | MischievousIndri | human | 55.0% | 100 |
| 47 | SageGorilla | human | 54.5% | 110 |
| 48 | CunningGibbon | human | 54.0% | 100 |
| 49 | VigilantTiti | human | 51.3% | 156 |
| 50 | CunningMandrill | human | 51.0% | 100 |
| 51 | 🤖 DeepSeek R1 0528 Qwen3 8B | llm | 51.0% | 100 |
| 52 | SentinelTamarin | human | 50.5% | 101 |
| 53 | ObservantOrangutan | human | 49.0% | 100 |
| 54 | 🤖 Gpt4.1 Mini | llm | 49.0% | 100 |
| 55 | WarriorSnubNosed | human | 47.5% | 101 |
| 56 | VoyagerColobus | human | 47.0% | 100 |
| 57 | 🤖 Claude Haiku 3.5 | llm | 43.0% | 100 |
| 58 | 🤖 Llama 3.1 405B Instruct Unsloth | llm | 42.0% | 100 |
| 59 | 🤖 Qwen3 235B Instruct | llm | 40.0% | 100 |
| 60 | BraveMangabey | human | 37.0% | 100 |
| 61 | 🤖 Gpt4o Mini | llm | 37.0% | 100 |
| 62 | 🤖 Llama 4 Scout Instruct | llm | 36.0% | 100 |
| 63 | 🤖 Qwen 14B Instruct | llm | 35.0% | 100 |
| 64 | 🤖 Qwen 7B Instruct | llm | 33.0% | 100 |
| 65 | 🤖 Llama 3.1 405b Instruct | llm | 33.0% | 100 |
| 66 | 🤖 Llama 4 Maverick Instruct | llm | 31.0% | 100 |
| 67 | 🤖 Llama 3 3 70B Instruct | llm | 31.0% | 100 |
| 68 | 🤖 Gemma 2B Instruct | llm | 30.0% | 100 |
| 69 | 🤖 Centaur 70B | llm | 29.0% | 100 |
| 70 | 🤖 Centaur 8B | llm | 27.0% | 100 |
| 71 | 🤖 Llama 3.1 8B Instruct | llm | 27.0% | 100 |
| 72 | 🤖 Mistral 7B | llm | 26.0% | 100 |
| 73 | 🤖 Qwen3 32B | llm | 26.0% | 100 |
| 74 | 🤖 Gemma 2B | llm | 25.0% | 100 |
| 75 | 🤖 Qwen 3B Instruct | llm | 25.0% | 100 |
| 76 | 🤖 Llama 3.2 3B Instruct | llm | 24.0% | 100 |