Benchmark that evaluates LLMs using 759 NYT Connections puzzles extended with extra trick words
-
Updated
Nov 21, 2025 - Python
Benchmark that evaluates LLMs using 759 NYT Connections puzzles extended with extra trick words
Build an Autonomous Web3 AI Trading Agent (BASE + Uniswap V4 example)
GPT-5-powered multi-model Discord bot to try with GPT-5, Gemini 3.0 Flash and other models from OpenRouter, Anthropic Claude 4.5 Sonnet, Kimi K2, Grok 4 Fast, GLM 4.5, and More. in Discord! Try below or host your own
ZYPHERON CLI Powerful command-line interface for automated security testing. Integrate ZYPHERON into your DevSecOps pipeline. Get CLI
Test and compare different large language models on various tasks.
Add a description, image, and links to the grok4 topic page so that developers can more easily learn about it.
To associate your repository with the grok4 topic, visit your repo's landing page and select "manage topics."