A collection of benchmark experiments for evaluating different aspects of Large Language Models (LLMs).
Evaluates code generation capabilities by prompting models to create Python raytracers. Includes visual comparisons and consistency tests across multiple models.
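A minimal sketch of the prompting loop, assuming an OpenAI-compatible API; the model names, prompt wording, and sample count are illustrative placeholders, not the exact settings used in the experiment.

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

PROMPT = "Write a self-contained Python raytracer that renders a simple scene to a PNG."
MODELS = ["gpt-4o", "gpt-4o-mini"]  # placeholder model list

def generate_raytracer(model: str) -> str:
    """Request one raytracer implementation and return the raw code text."""
    response = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": PROMPT}],
        temperature=1.0,  # non-zero temperature so repeated runs can differ
    )
    return response.choices[0].message.content

# Collect several samples per model; consistency is then judged by running each
# sample and visually comparing the rendered images.
samples = {m: [generate_raytracer(m) for _ in range(3)] for m in MODELS}
```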
Analyzes the reasoning patterns and chain-of-thought processes of various LLMs by examining statistical patterns in their thinking traces.
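A minimal sketch of the kind of surface statistics that can be computed over thinking traces; the on-disk layout (plain-text files grouped per model) is an assumption for illustration only.

```python
import re
from collections import Counter
from pathlib import Path

def trace_stats(text: str) -> dict:
    """Compute simple surface statistics for one chain-of-thought trace."""
    words = re.findall(r"[A-Za-z']+", text.lower())
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    counts = Counter(words)
    return {
        "word_count": len(words),
        "sentence_count": len(sentences),
        "avg_sentence_len": len(words) / max(len(sentences), 1),
        "type_token_ratio": len(counts) / max(len(words), 1),
        "top_words": counts.most_common(10),
    }

# Aggregate per model, assuming traces are stored as traces/<model>/<n>.txt
stats_by_model = {
    model_dir.name: [trace_stats(p.read_text()) for p in model_dir.glob("*.txt")]
    for model_dir in Path("traces").iterdir() if model_dir.is_dir()
}
```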
Attempts to test LLM consistency and uniqueness across various topic domains by analyzing generation statistics for prompts that expect a single-word response. Helps identify model-specific response patterns that can serve as "fingerprints" for different LLMs.
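A minimal sketch of the fingerprinting idea, assuming the single-word responses per model have already been collected; the example responses and model names below are placeholders, not real data.

```python
from collections import Counter

# Example collected data: model -> single-word responses to the same prompt set.
responses = {
    "model_a": ["blue", "seven", "cat", "blue", "mercury"],
    "model_b": ["azure", "seven", "dog", "blue", "mars"],
}

def fingerprint(answers: list[str]) -> Counter:
    """Normalise answers and count them; the distribution acts as a fingerprint."""
    return Counter(a.strip().lower() for a in answers)

def overlap(a: Counter, b: Counter) -> float:
    """Shared mass between two response distributions (0 = disjoint, 1 = identical)."""
    shared = sum(min(a[k], b[k]) for k in a.keys() | b.keys())
    return shared / max(sum(a.values()), sum(b.values()), 1)

fingerprints = {model: fingerprint(r) for model, r in responses.items()}
print(overlap(fingerprints["model_a"], fingerprints["model_b"]))
```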
A vibe coding project to test the capabilities of the mystery model "Optimus Alpha". It implements real-time path tracing techniques for realistic lighting and shadows.
This project is licensed under CC0 1.0 Universal.