Customer-service-assistant rag-service
Tenant-specific RAG service for company knowledge in the voice assistant platform.
install
source · Clone the upstream repo
git clone https://github.com/papdawin/customer-service-assistant
manifest: rag/skill.md
RAG Service
Purpose
Enable each company to supply its own information (e.g., location, opening hours, contact and reachability details) so the voice assistant can search and answer accurately. Each tenant has its own index; the LLM is shared across tenants.
Interfaces
- HTTP GET `/health` for index and model status.
- HTTP POST `/query` for question answering and timing data.
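To illustrate the two endpoints, the sketch below models them as plain Python functions with no web framework, so it runs on its own. The `ServiceState` holder and every payload field name (`status`, `tenant`, `index_loaded`, `answer`, `timings_ms`) are hypothetical. The source only says that `/health` reports index and model status and that `/query` returns an answer with timing data.

```python
import time
from dataclasses import dataclass


@dataclass
class ServiceState:
    """Hypothetical holder for what the service loads at startup."""
    tenant_id: str
    embed_model: str
    index_loaded: bool = False


def health(state: ServiceState) -> dict:
    """Possible shape of a GET /health response (field names are assumptions)."""
    return {
        "status": "ok" if state.index_loaded else "degraded",
        "tenant": state.tenant_id,
        "index_loaded": state.index_loaded,
        "embed_model": state.embed_model,
    }


def query(state: ServiceState, question: str) -> dict:
    """Possible shape of a POST /query response (field names are assumptions)."""
    started = time.perf_counter()
    # Placeholder for retrieval plus the LLM call; a real service would do both here.
    answer = f"(stub answer for: {question})"
    total_ms = (time.perf_counter() - started) * 1000.0
    return {"answer": answer, "timings_ms": {"total": total_ms}}


state = ServiceState(tenant_id="acme", embed_model="example-embedder", index_loaded=True)
print(health(state))
print(query(state, "What are your opening hours?")["answer"])
```

In a FastAPI app, functions like these would typically be registered with `@app.get("/health")` and `@app.post("/query")`.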
Models
- Embedding model from `EMBED_MODEL`.
- Shared LLM client `llm` from the `llm` module.
Libraries
- `fastapi`
- stdlib: `time`, `contextlib`
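One plausible way the two stdlib modules fit together is a small timing helper built from `contextlib.contextmanager` and `time.perf_counter`. The helper name `timed` and the stage names are assumptions made for illustration, not taken from the service's code.

```python
import time
from contextlib import contextmanager


@contextmanager
def timed(timings: dict, stage: str):
    """Record the wall-clock duration of a `with` block, in milliseconds."""
    started = time.perf_counter()
    try:
        yield
    finally:
        # Runs even if the block raises, so failed stages are still timed.
        timings[stage] = (time.perf_counter() - started) * 1000.0


timings: dict = {}
with timed(timings, "retrieval"):
    time.sleep(0.01)  # stand-in for a vector search
with timed(timings, "llm"):
    time.sleep(0.02)  # stand-in for the LLM call

print({stage: round(ms, 1) for stage, ms in timings.items()})
```

A dictionary of per-stage durations like this could be returned alongside the answer as the timing data.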
Runtime Config
- `TENANT_ID`, `EMBED_MODEL`, `TOP_K`, `CHUNK_SIZE`, `CHUNK_OVERLAP`, `CONTEXT_TOKENS_BUDGET`
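A minimal sketch of loading these settings, assuming they are supplied as environment variables (the source does not say how they are set). The `RagConfig` class, the `load_config` function, and all default values are hypothetical.

```python
import os
from dataclasses import dataclass


@dataclass(frozen=True)
class RagConfig:
    tenant_id: str
    embed_model: str
    top_k: int
    chunk_size: int
    chunk_overlap: int
    context_tokens_budget: int


def load_config(env=None) -> RagConfig:
    """Build the config from a mapping of settings; defaults are illustrative only."""
    env = os.environ if env is None else env
    cfg = RagConfig(
        tenant_id=env.get("TENANT_ID", "default"),
        embed_model=env.get("EMBED_MODEL", "example-embedding-model"),
        top_k=int(env.get("TOP_K", "4")),
        chunk_size=int(env.get("CHUNK_SIZE", "500")),
        chunk_overlap=int(env.get("CHUNK_OVERLAP", "50")),
        context_tokens_budget=int(env.get("CONTEXT_TOKENS_BUDGET", "1500")),
    )
    # An overlap as large as the chunk itself would never advance through the text.
    if cfg.chunk_overlap >= cfg.chunk_size:
        raise ValueError("CHUNK_OVERLAP must be smaller than CHUNK_SIZE")
    return cfg


cfg = load_config({"TENANT_ID": "acme", "TOP_K": "3"})
print(cfg)
```

Passing a plain dictionary instead of `os.environ` keeps the loader easy to exercise in tests.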
Main Components
- Lifespan startup loads FAISS index and retriever for the tenant.
- Retrieval via `retrieve_documents` and context formatting.
- LLM prompt assembly and invocation with timing.
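The components above can be sketched end to end with the standard library alone. This is an illustration under stated assumptions, not the service's implementation: a keyword-overlap ranking stands in for the FAISS search, `fake_llm` stands in for the shared LLM client, the token budget is approximated by counting whitespace-separated words, and the signature given to `retrieve_documents` is a guess. The helper names `format_context`, `build_prompt`, and `tokens` are hypothetical.

```python
import asyncio
import re
import time
from contextlib import asynccontextmanager

# Stand-in for one tenant's indexed chunks (a real service would load FAISS here).
CHUNKS = [
    "Opening hours: Monday to Friday, 9:00 to 17:00.",
    "The office is located at 1 Example Street.",
    "Contact support by phone during opening hours.",
]
state: dict = {}


@asynccontextmanager
async def lifespan(app):
    """Load the tenant's index once at startup and release it at shutdown."""
    state["chunks"] = list(CHUNKS)
    yield
    state.clear()


def tokens(text: str) -> set:
    return set(re.findall(r"\w+", text.lower()))


def retrieve_documents(chunks: list, question: str, top_k: int) -> list:
    """Keyword-overlap ranking, standing in for a vector search."""
    q = tokens(question)
    ranked = sorted(chunks, key=lambda c: len(q & tokens(c)), reverse=True)
    return ranked[:top_k]


def format_context(docs: list, budget: int) -> str:
    """Keep whole chunks, in rank order, until the word budget is spent."""
    kept, used = [], 0
    for doc in docs:
        cost = len(doc.split())  # crude token estimate: whitespace words
        if used + cost > budget:
            break
        kept.append(doc)
        used += cost
    return "\n".join(f"[{i}] {doc}" for i, doc in enumerate(kept, start=1))


def build_prompt(context: str, question: str) -> str:
    return f"Answer using only the context.\n\nContext:\n{context}\n\nQuestion: {question}"


def fake_llm(prompt: str) -> str:
    """Placeholder for the shared LLM client."""
    return "stub answer"


async def main() -> dict:
    async with lifespan(app=None):
        question = "What are your opening hours?"
        docs = retrieve_documents(state["chunks"], question, top_k=2)
        prompt = build_prompt(format_context(docs, budget=10), question)
        started = time.perf_counter()
        answer = fake_llm(prompt)
        llm_ms = (time.perf_counter() - started) * 1000.0
        return {"answer": answer, "prompt": prompt, "llm_ms": llm_ms}


result = asyncio.run(main())
print(result["prompt"])
```

With FastAPI, an async context manager of this shape is passed as `FastAPI(lifespan=lifespan)`, so the index is loaded once per process rather than once per request.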