codebase-indexer is the context module of ZGSM (ZhuGe Smart Mind) AI Programming Assistant which running on backend. It provides powerful codebase indexing capabilities to support semantic search for RAG (Retrieval-Augmented Generation) systems.
- 🔍 Semantic code search with embeddings
- 🌐 Multi-language support
- Go 1.24.3 or higher
- Docker
- PostgreSQL
- Redis
- Weavaite
# Clone the repository
git clone https://github.com/zgsm-ai/codebase-embedder.git
cd codebase-embedder
# Install dependencies
go mod tidy
- Set up PostgreSQL 、 Redis、vector, etc.
vim etc/config.yaml
- Update the configuration with your database and Redis credentials
# Build the project
make build
The system consists of several key components:
- Parser: Code parsing and AST generation
- Embedding: Code semantic vector generation
- Store: Data storage and indexing
- API: RESTful service interface
This project is licensed under the Apache 2.0 License.
This project builds upon the excellent work of:
- Tree-sitter - For providing robust parsing capabilities