Here is a list of the 75+ LLM evaluation methods, GitHub repos, tools, and blogs I could find (as of Nov 2023). The order is random.
- Blog by Lilian Weng
- EdinburghNLP repo with lots and lots of material on evaluation
- Rajiv Shah's Repo on LLM Evaluation
- Harpreet's Repo using LangChain to Evaluate Models: Session 7
- RAGAS
- Giskard - Test LLMs
- Auto Evaluator
- ReLM
- TruLens
- Guardrails
- NeMo Guardrails
- DeepEval
- PromptFoo - Prompt Evals
- Thumb - Prompt Testing
- Prompt Injection Protection
- PromptBench
- Fact Checker
- LangTest
- Evaluation Harness
- Outlines
- Lakera (Not fully open sourced)
- SmartLLMChain
- LLMCheckerChain
- LLMInformationExtraction Notebook
- Chain of Thought Prompting - Material 1
- Chain of Thought Prompting - Material 2
- Tree of Thought Prompting by Princeton NLP
- Tree of Thought Prompting Material 2
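The chain-of-thought entries above are prompting techniques rather than tools, so a tiny sketch may help. The prompt wording below is illustrative, not taken from the linked papers; the helper names are my own:

```python
# Minimal sketch of zero-shot and few-shot chain-of-thought (CoT) prompting.
# Prompt phrasing is illustrative; only the overall pattern follows the papers.

def zero_shot_cot(question: str) -> str:
    """Append the classic zero-shot CoT trigger phrase to a question."""
    return f"Q: {question}\nA: Let's think step by step."

def few_shot_cot(question: str, exemplars: list[tuple[str, str]]) -> str:
    """Prepend worked examples whose answers spell out intermediate reasoning."""
    parts = [f"Q: {q}\nA: {reasoning}" for q, reasoning in exemplars]
    parts.append(f"Q: {question}\nA:")
    return "\n\n".join(parts)

exemplars = [
    ("If I have 3 apples and buy 2 more, how many do I have?",
     "Start with 3 apples. Buying 2 more gives 3 + 2 = 5. The answer is 5."),
]
prompt = few_shot_cot("A train has 4 cars with 10 seats each. How many seats?",
                      exemplars)
print(prompt)
```

Tree-of-thought generalizes this by branching into several candidate reasoning steps and searching over them instead of committing to one linear chain.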
- LLM - Eval Survey
- LLM Eval Comprehensive survey paper (111 pages 🙂)
- Verify CoT
- LLM-Augmenter
- LangChain Different Criteria
- Check your facts and try again
- Researching and Revising What Language Models Say
- Fact-Checking Complex Claims with Program-Guided Reasoning
- Repo + Paper -> SAC³: Reliable Hallucination Detection in Black-Box Language Models via Semantic-aware Cross-check Consistency
- Hallucination detection: Robustly discerning reliable answers in Large Language Models
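The two hallucination-detection entries above both build on consistency checking: sample several answers to the same question and flag low mutual agreement. A minimal sketch of that idea, where the Jaccard similarity is a stand-in for a real semantic-equivalence check (e.g., an NLI model) and the threshold is an assumption:

```python
# Consistency-based hallucination detection sketch: low agreement across
# sampled answers suggests the model may be fabricating.
from itertools import combinations

def jaccard(a: str, b: str) -> float:
    """Toy lexical similarity; a real system would use semantic comparison."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    return len(ta & tb) / len(ta | tb) if ta | tb else 1.0

def consistency_score(answers: list[str]) -> float:
    """Mean pairwise similarity across sampled answers (1.0 = fully consistent)."""
    pairs = list(combinations(answers, 2))
    if not pairs:
        return 1.0
    return sum(jaccard(a, b) for a, b in pairs) / len(pairs)

def likely_hallucinated(answers: list[str], threshold: float = 0.5) -> bool:
    return consistency_score(answers) < threshold

samples = ["Paris is the capital of France.",
           "Paris is the capital of France."]
print(consistency_score(samples))  # 1.0
```

Cross-check variants (as in SAC³) additionally perturb the question or swap in a second model before comparing answers, which catches consistently wrong answers that self-sampling alone would miss.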
- Victor Dibia
- Evaluation, Measurements and Some Solutions
- Kellton - Techniques
- FLARE
- Seminar by Galileo and DeepLearning.AI
- Galileo Blog on framework to detect and reduce hallucinations
- Fixing Hallucinations
- Chain of Verification for Detecting Hallucinations
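Chain-of-Verification (CoVe) is a four-step loop: draft an answer, plan verification questions, answer them independently, then revise. A hedged sketch of that control flow, where `llm` is a placeholder for any text-completion function and the prompts are illustrative, not the paper's exact wording:

```python
# Sketch of the Chain-of-Verification (CoVe) loop. The `llm` callable and all
# prompt strings are assumptions used to show the structure of the method.
from typing import Callable

def chain_of_verification(question: str, llm: Callable[[str], str]) -> str:
    draft = llm(f"Answer the question.\nQ: {question}")
    plan = llm(f"List verification questions that fact-check this answer:\n{draft}")
    # Answer each verification question in isolation from the draft.
    checks = [llm(f"Answer concisely: {q}") for q in plan.splitlines() if q.strip()]
    return llm(
        "Revise the draft so it is consistent with the verification answers.\n"
        f"Draft: {draft}\nVerification answers: {checks}\nQ: {question}"
    )

# Toy stand-in LLM so the sketch runs end to end without an API.
def fake_llm(prompt: str) -> str:
    if prompt.startswith("List verification"):
        return "Is the claim supported?"
    if prompt.startswith("Revise"):
        return "revised answer"
    return "draft answer"

print(chain_of_verification("Who wrote Hamlet?", fake_llm))  # revised answer
```

Answering the verification questions without showing the model its own draft is the key design choice: it keeps the checks from simply repeating the original hallucination.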
- MLflow Blog
- DeepChecks (Paid + BETA)
- LlamaIndex + TruLens
- Scale
- Medium: Testing Large Language Models Like We Test Software
- V7 Blog
- Microsoft Blog
- LLM Eval