-
OmniDocBench Public
Forked from opendatalab/OmniDocBenchA Comprehensive Benchmark for Document Parsing and Evaluation
Python Apache License 2.0 UpdatedMar 6, 2025 -
smallpond Public
Forked from deepseek-ai/smallpondA lightweight data processing framework built on DuckDB and 3FS.
Python UpdatedMar 5, 2025 -
lagent Public
Forked from InternLM/lagentA lightweight framework for building LLM-based agents
Python Apache License 2.0 UpdatedNov 21, 2024 -
Freebase-Setup Public
Forked from dki-lab/Freebase-SetupThe last data dump of Freebase with introductory explanation of its schema
Python Creative Commons Zero v1.0 Universal UpdatedOct 23, 2024 -
AgentBench Public
Forked from THUDM/AgentBenchA Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Python UpdatedOct 10, 2024 -
CRUD_RAG Public
Forked from IAAR-Shanghai/CRUD_RAGCRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models
Python UpdatedOct 10, 2024 -
robustqa-acl23 Public
Forked from awslabs/robustqa-acl23Python Apache License 2.0 UpdatedOct 10, 2024 -
RefChecker Public
Forked from amazon-science/RefCheckerRefChecker provides automatic checking pipeline and benchmark dataset for detecting fine-grained hallucinations generated by Large Language Models.
Python Apache License 2.0 UpdatedOct 10, 2024 -
RAGChecker Public
Forked from amazon-science/RAGCheckerRAGChecker: A Fine-grained Framework For Diagnosing RAG
Python Apache License 2.0 UpdatedOct 10, 2024 -
-
MindSearch Public
Forked from InternLM/MindSearchLLM-based Multi-agent Framework of AI Search Engine
Python Apache License 2.0 UpdatedAug 30, 2024 -
lmdeploy Public
Forked from InternLM/lmdeployLMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Python Apache License 2.0 UpdatedApr 22, 2024