feat: Add API indexer and enhance embedding management

- Add new API indexer script for document processing
- Update embedding manager with improved functionality
- Remove old cache files and update dependencies
- Modify workflow execute documentation

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
This commit is contained in:
catlog22
2025-09-23 19:40:22 +08:00
parent 984fa3a4f3
commit 410d0efd7b
8 changed files with 506 additions and 337 deletions

View File

@@ -2,14 +2,18 @@
numpy>=1.21.0
scikit-learn>=1.0.0
# Sentence Transformers for advanced embeddings
sentence-transformers>=2.2.0
# Sentence Transformers for advanced embeddings (CodeSage V2 compatible)
sentence-transformers>=3.0.0
transformers>=4.40.0
# Optional: For better performance and additional models
torch>=1.9.0
# PyTorch for model execution (required for CodeSage V2)
torch>=2.0.0
# Development and testing
pytest>=6.0.0
# Data handling
pandas>=1.3.0
# Additional dependencies for CodeSage V2
accelerate>=0.26.0