codebase-indexer is the context module of the ZGSM (ZhuGe Smart Mind) AI Programming Assistant and runs on the backend. It provides codebase indexing to support semantic search and code call graph retrieval for RAG (Retrieval-Augmented Generation) systems.
- 🔍 Semantic code search with embeddings
- 📊 Code call graph analysis and retrieval
- 🌐 Multi-language support
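
To illustrate what embedding-based semantic search generally involves, here is a minimal sketch in Go that ranks code snippets by cosine similarity to a query vector. It is not taken from this project's source: the `Snippet` type, the `CosineSimilarity` and `TopK` functions, and the three-dimensional toy vectors are all hypothetical, and a real deployment would obtain embeddings from a model and search them in a vector store.

```go
package main

import (
	"fmt"
	"math"
	"sort"
)

// Snippet is a hypothetical indexed code chunk with a precomputed embedding.
type Snippet struct {
	Path      string
	Embedding []float64
}

// CosineSimilarity returns the cosine of the angle between a and b.
// It returns 0 when the lengths differ or either vector has zero norm.
func CosineSimilarity(a, b []float64) float64 {
	if len(a) != len(b) || len(a) == 0 {
		return 0
	}
	var dot, normA, normB float64
	for i := range a {
		dot += a[i] * b[i]
		normA += a[i] * a[i]
		normB += b[i] * b[i]
	}
	if normA == 0 || normB == 0 {
		return 0
	}
	return dot / (math.Sqrt(normA) * math.Sqrt(normB))
}

// TopK returns the paths of the k snippets most similar to the query vector.
func TopK(query []float64, snippets []Snippet, k int) []string {
	// Sort a copy so the caller's slice keeps its order.
	ranked := make([]Snippet, len(snippets))
	copy(ranked, snippets)
	sort.SliceStable(ranked, func(i, j int) bool {
		return CosineSimilarity(query, ranked[i].Embedding) >
			CosineSimilarity(query, ranked[j].Embedding)
	})
	if k < 0 {
		k = 0
	}
	if k > len(ranked) {
		k = len(ranked)
	}
	paths := make([]string, 0, k)
	for _, s := range ranked[:k] {
		paths = append(paths, s.Path)
	}
	return paths
}

func main() {
	// Toy embeddings; real ones would come from an embedding model.
	snippets := []Snippet{
		{Path: "auth/login.go", Embedding: []float64{1, 0, 0}},
		{Path: "db/query.go", Embedding: []float64{0, 1, 0}},
		{Path: "auth/token.go", Embedding: []float64{0.9, 0.1, 0}},
	}
	fmt.Println(TopK([]float64{1, 0, 0}, snippets, 2))
}
```

The sketch does a brute-force scan, which is only reasonable for small collections; larger indexes usually rely on an approximate nearest-neighbor index.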
- Go 1.24.3 or higher
- Docker
- PostgreSQL
- Redis
# Clone the repository
git clone https://github.com/zgsm-ai/codebase-indexer.git
cd codebase-indexer
# Install dependencies
go mod tidy
- Set up PostgreSQL, Redis, a vector store, and any other required services.
vim etc/config.yaml
- Update the configuration with your database and Redis credentials
# Build the project
make build
The system consists of several key components:
- Parser: Code parsing and AST generation
- Embedding: Code semantic vector generation
- CodeGraph: Code relationship graph construction
- Store: Data storage and indexing
- API: RESTful service interface
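
One way to picture how the components listed above could fit together is the following Go sketch. It is an illustration under stated assumptions, not the project's actual design: the `Parser` and `Embedder` interfaces, the `Symbol` and `Store` types, the `Index` function, and the toy `lineParser` and `lengthEmbedder` implementations are all hypothetical. The toy parser reads lines of the form `caller: callee1 callee2` instead of building a real AST.

```go
package main

import (
	"fmt"
	"strings"
)

// Symbol is a hypothetical parsed definition and the names it calls.
type Symbol struct {
	Name  string
	Calls []string
}

// Parser turns source text into symbols (stands in for the Parser component).
type Parser interface {
	Parse(src string) []Symbol
}

// Embedder maps text to a vector (stands in for the Embedding component).
type Embedder interface {
	Embed(text string) []float64
}

// Store is an in-memory stand-in for the Store component.
type Store struct {
	Vectors map[string][]float64 // symbol name -> embedding
	Callers map[string][]string  // callee -> callers (reverse call graph)
}

// lineParser is a toy parser for lines like "caller: callee1 callee2".
type lineParser struct{}

func (lineParser) Parse(src string) []Symbol {
	var symbols []Symbol
	for _, line := range strings.Split(src, "\n") {
		name, rest, ok := strings.Cut(line, ":")
		if !ok {
			continue // skip lines without a "caller:" prefix
		}
		symbols = append(symbols, Symbol{
			Name:  strings.TrimSpace(name),
			Calls: strings.Fields(rest),
		})
	}
	return symbols
}

// lengthEmbedder is a placeholder that embeds text as its length.
type lengthEmbedder struct{}

func (lengthEmbedder) Embed(text string) []float64 {
	return []float64{float64(len(text))}
}

// Index parses the source, embeds each symbol, and records call edges.
func Index(p Parser, e Embedder, src string) *Store {
	store := &Store{
		Vectors: map[string][]float64{},
		Callers: map[string][]string{},
	}
	for _, sym := range p.Parse(src) {
		store.Vectors[sym.Name] = e.Embed(sym.Name)
		for _, callee := range sym.Calls {
			store.Callers[callee] = append(store.Callers[callee], sym.Name)
		}
	}
	return store
}

func main() {
	src := "main: load save\nload: open\nsave: open"
	store := Index(lineParser{}, lengthEmbedder{}, src)
	fmt.Println(store.Callers["open"]) // prints "[load save]"
}
```

Keeping the parser and embedder behind interfaces lets each be swapped independently, for example a Tree-sitter-backed parser per language, without touching the indexing loop.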
This project is licensed under the Apache 2.0 License.
This project builds upon the excellent work of:
- Sourcegraph - For their pioneering work in code intelligence
- Tree-sitter - For providing robust parsing capabilities