Pinned
- MMLU_benchmark (Public): An easy-to-use and standardised framework for evaluating Large Language Models (LLMs) on the Massive Multitask Language Understanding (MMLU) dataset. Currently supported: Hugging Face transformer m…
  Python 1
- oeis-sequences-benchmark (Public): A Python toolkit and benchmark dataset for predicting the next term in OEIS integer sequences, designed to evaluate AI models.
  Python 1
- AI-linguistic-signatures (Public): This repository contains a toolkit to analyse the stylistic fingerprints that distinguish AI-generated text from human writing. It fetches articles from Wikipedia, generates counterparts using loca…
  Python 1
- connect-4-game-engine (Public): A lightweight Connect 4 engine in pure Python using minimax with alpha-beta pruning. It includes a reusable package and CLI for analysing games or playing against agents, with all parsing and evalu…