Comfy-agent workflow_ep33_one_speaker_tts_workflow
name: workflow_ep33_one_speaker_tts_workflow
git clone https://github.com/steliosot/comfy-agent
skills/workflows/txt2img/workflow_ep33_one_speaker_tts_workflow/skill.yaml

name: workflow_ep33_one_speaker_tts_workflow
description: Workflow wrapper for EP33 One Speaker - TTS Workflow.json
inputs:
  prompt:
    type: string
    required: false
  negative_prompt:
    type: string
    required: false
  width:
    type: integer
    required: false
  height:
    type: integer
    required: false
  seed:
    type: integer
    required: false
  steps:
    type: integer
    required: false
  cfg:
    type: number
    required: false
  sampler_name:
    type: string
    required: false
  scheduler:
    type: string
    required: false
  denoise:
    type: number
    required: false
  server:
    type: string
    required: false
  headers:
    type: object
    required: false
  api_prefix:
    type: string
    required: false
outputs:
  status:
    type: string
  prompt_id:
    type: string
  output_images:
    type: array
requirements:
  models: []
  custom_nodes: []
  links: []
input_modalities:
  - text_prompt
output_modalities:
  - audio/wav
model_families:
  - other
node_count: 3
node_types:
  - KokoroGenerator
  - KokoroSpeaker
  - SaveAudio
selection_metadata:
  family: txt2img
  resource_profile: low
  complexity_score: 2
  estimated_runtime: fast (usually under 30s on modern GPU)
  warnings:
    - Audio generation may take longer on CPU-only or low-VRAM servers.
  max_width: null
  max_height: null
  max_steps: null
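
The manifest declares every input as optional with a simple type (string, integer, number, or object). As a minimal sketch of how a caller might check arguments against those declared types before submitting a request, the Python below copies the input names and types from skill.yaml into a dictionary. The names `INPUT_TYPES`, `_matches`, and `validate_inputs` are hypothetical and are not part of comfy-agent; how the project itself validates inputs is not shown in the source.

```python
# Hypothetical helper, not part of comfy-agent.
# Input names and types are copied from the skill.yaml "inputs" section.
INPUT_TYPES = {
    "prompt": "string",
    "negative_prompt": "string",
    "width": "integer",
    "height": "integer",
    "seed": "integer",
    "steps": "integer",
    "cfg": "number",
    "sampler_name": "string",
    "scheduler": "string",
    "denoise": "number",
    "server": "string",
    "headers": "object",
    "api_prefix": "string",
}


def _matches(value, type_name):
    """Return True if value fits the declared manifest type."""
    if isinstance(value, bool):
        # bool is a subclass of int in Python; no input here is boolean.
        return False
    if type_name == "string":
        return isinstance(value, str)
    if type_name == "integer":
        return isinstance(value, int)
    if type_name == "number":
        return isinstance(value, (int, float))
    if type_name == "object":
        return isinstance(value, dict)
    return False


def validate_inputs(args):
    """Return a list of error strings; an empty list means args look valid."""
    errors = []
    for key, value in args.items():
        expected = INPUT_TYPES.get(key)
        if expected is None:
            errors.append(f"unknown input: {key}")
        elif not _matches(value, expected):
            errors.append(
                f"{key}: expected {expected}, got {type(value).__name__}"
            )
    return errors
```

Because all inputs are marked `required: false`, the sketch only checks the keys that are actually supplied and does not report missing ones.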