Customer-service-assistant stt-service
Shared speech-to-text microservice for the multi-tenant voice assistant platform.
install
source · Clone the upstream repo
git clone https://github.com/papdawin/customer-service-assistant
manifest: stt/skill.md
Source content
STT Service (Whisper)
Purpose
Provide speech-to-text for a multi-tenant voice assistant deployed at different companies. This service is shared across tenants.
Interfaces
- HTTP GET /health for readiness and model/device info.
- HTTP POST /transcribe for audio transcription with optional language and beam size.
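As a rough illustration of how a client might call the transcription endpoint, the sketch below builds (but does not send) a multipart POST request using only the standard library. The base URL, the form field names (file, language, beam_size), and the upload encoding are assumptions for illustration; the source does not specify them, so check the service code before relying on any of them.

```python
import uuid
import urllib.request
from typing import Optional

# Hypothetical host and port; the source does not say where the service listens.
BASE_URL = "http://localhost:8000"


def build_transcribe_request(
    audio: bytes,
    filename: str = "audio.wav",
    language: Optional[str] = None,
    beam_size: Optional[int] = None,
) -> urllib.request.Request:
    """Build (but do not send) a multipart POST /transcribe request."""
    boundary = uuid.uuid4().hex
    parts = []

    # Optional text fields. The field names are assumptions.
    fields = {"language": language, "beam_size": beam_size}
    for name, value in fields.items():
        if value is None:
            continue  # omit options the caller did not set
        parts.append(
            f"--{boundary}\r\n"
            f'Content-Disposition: form-data; name="{name}"\r\n\r\n'
            f"{value}\r\n".encode()
        )

    # The audio payload. "file" is an assumed field name.
    parts.append(
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        f"Content-Type: application/octet-stream\r\n\r\n".encode()
        + audio
        + b"\r\n"
    )
    parts.append(f"--{boundary}--\r\n".encode())

    return urllib.request.Request(
        f"{BASE_URL}/transcribe",
        data=b"".join(parts),
        headers={"Content-Type": f"multipart/form-data; boundary={boundary}"},
        method="POST",
    )
```

Passing the returned object to urllib.request.urlopen would send it. A readiness check would be a plain GET against the /health path on the same base URL.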
Models
- Whisper model from STT_MODEL_ID (default large-v3).
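A minimal sketch of how the model might be loaded with GPU/CPU selection, assuming the whisper library listed here is the openai-whisper package and that torch is used only to detect CUDA. The helper names pick_device and load_model are hypothetical and not taken from the service code.

```python
def pick_device() -> str:
    """Return "cuda" when PyTorch reports a usable GPU, otherwise "cpu"."""
    try:
        import torch
    except ImportError:
        # Without torch there is no GPU path, so fall back to CPU.
        return "cpu"
    return "cuda" if torch.cuda.is_available() else "cpu"


def load_model(model_id: str = "large-v3"):
    """Load a Whisper model by name onto the selected device.

    Assumes the openai-whisper package; imported lazily so that
    pick_device() stays usable on machines without it.
    """
    import whisper

    return whisper.load_model(model_id, device=pick_device())
```

Loading once at startup and reusing the returned model for every request avoids paying the load cost per transcription, which matters for a model the size of large-v3.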
Libraries
- fastapi
- torch
- whisper
- stdlib: os, tempfile, time
Runtime Config
- STT_MODEL_ID (model name)
- STT_LANGUAGE (default language)
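The two variables could be read at startup along these lines. This is a sketch: the SttConfig class and load_config function are hypothetical names, and treating an unset or empty STT_LANGUAGE as "no default language" is an assumption, since the source gives no default for it.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class SttConfig:
    model_id: str
    language: Optional[str]


def load_config(env: Mapping[str, str] = os.environ) -> SttConfig:
    """Read the service settings from environment-style variables."""
    # "large-v3" is the default named in the Models section.
    model_id = env.get("STT_MODEL_ID") or "large-v3"
    # Assumption: unset or empty STT_LANGUAGE means no default language.
    language = env.get("STT_LANGUAGE") or None
    return SttConfig(model_id=model_id, language=language)
```

Accepting the environment as a parameter keeps the function easy to test, for example load_config({"STT_MODEL_ID": "small"}).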
Main Components
- Model load on startup with GPU/CPU selection.
- Temp file handling for uploads.
- Transcription timing and Server-Timing header.
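The temp file handling and timing steps might fit together roughly as follows. This is a stdlib-only sketch, not the service's actual code: transcribe_upload is a hypothetical helper, the transcribe callable stands in for the real Whisper call, and the metric name "transcribe" in the header is an assumption.

```python
import os
import tempfile
import time
from typing import Callable, Dict, Tuple


def transcribe_upload(
    audio: bytes,
    transcribe: Callable[[str], str],
    suffix: str = ".wav",
) -> Tuple[str, Dict[str, str]]:
    """Write the upload to a temp file, time the transcription, clean up."""
    # delete=False so the file can be reopened by path on every platform.
    tmp = tempfile.NamedTemporaryFile(suffix=suffix, delete=False)
    try:
        tmp.write(audio)
        tmp.close()  # flush to disk before handing the path to the model

        start = time.perf_counter()
        text = transcribe(tmp.name)
        elapsed_ms = (time.perf_counter() - start) * 1000.0
    finally:
        tmp.close()  # no-op if already closed
        os.unlink(tmp.name)  # remove the file even if transcription failed

    # Server-Timing syntax: <metric-name>;dur=<milliseconds>
    headers = {"Server-Timing": f"transcribe;dur={elapsed_ms:.1f}"}
    return text, headers
```

In a FastAPI handler, the returned headers could be attached to the response so that clients and browser dev tools can see how long the transcription took.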