Comfy-agent workflow_ep33_two_speakers_combined_tts_workflow

name: workflow_ep33_two_speakers_combined_tts_workflow

install
source · Clone the upstream repo
git clone https://github.com/steliosot/comfy-agent
manifest: skills/workflows/txt2img/workflow_ep33_two_speakers_combined_tts_workflow/skill.yaml
source content

name: workflow_ep33_two_speakers_combined_tts_workflow
description: Workflow wrapper for EP33 Two Speakers Combined - TTS Workflow.json
inputs:
  prompt:
    type: string
    required: false
  negative_prompt:
    type: string
    required: false
  width:
    type: integer
    required: false
  height:
    type: integer
    required: false
  seed:
    type: integer
    required: false
  steps:
    type: integer
    required: false
  cfg:
    type: number
    required: false
  sampler_name:
    type: string
    required: false
  scheduler:
    type: string
    required: false
  denoise:
    type: number
    required: false
  server:
    type: string
    required: false
  headers:
    type: object
    required: false
  api_prefix:
    type: string
    required: false
outputs:
  status:
    type: string
  prompt_id:
    type: string
  output_images:
    type: array
requirements:
  models: []
  custom_nodes: []
  links: []
input_modalities:
  - text_prompt
output_modalities:
  - audio/wav
model_families:
  - other
node_count: 5
node_types:
  - KokoroGenerator
  - KokoroSpeaker
  - KokoroSpeakerCombiner
  - SaveAudio
selection_metadata:
  family: txt2img
  resource_profile: low
  complexity_score: 2
  estimated_runtime: fast (usually under 30s on modern GPU)
  warnings:
    - Audio generation may take longer on CPU-only or low-VRAM servers.
  max_width: null
  max_height: null
  max_steps: null
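
The inputs block of the manifest declares a type for each optional argument (string, integer, number, object). As a rough illustration of how a caller might check arguments against those declarations before submitting the workflow, here is a minimal Python sketch. The helper names `validate_inputs` and `_matches` are hypothetical and are not part of comfy-agent; the mapping of manifest types to Python types is also an assumption, and the input names and types are copied from the manifest above.

```python
# Hypothetical helper, not part of comfy-agent: checks caller arguments
# against the input types declared in the manifest's `inputs` block.

# Input names and declared types, copied from the manifest.
INPUT_TYPES = {
    "prompt": "string",
    "negative_prompt": "string",
    "width": "integer",
    "height": "integer",
    "seed": "integer",
    "steps": "integer",
    "cfg": "number",
    "sampler_name": "string",
    "scheduler": "string",
    "denoise": "number",
    "server": "string",
    "headers": "object",
    "api_prefix": "string",
}


def _matches(value, type_name):
    """Return True if value fits the declared manifest type (assumed mapping)."""
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(value, bool):
        return False
    if type_name == "string":
        return isinstance(value, str)
    if type_name == "integer":
        return isinstance(value, int)
    if type_name == "number":
        return isinstance(value, (int, float))
    if type_name == "object":
        return isinstance(value, dict)
    return False


def validate_inputs(args):
    """Return a list of error strings; an empty list means args look acceptable.

    Every input is declared `required: false`, so missing keys are not errors.
    """
    errors = []
    for key, value in args.items():
        declared = INPUT_TYPES.get(key)
        if declared is None:
            errors.append(f"unknown input: {key}")
        elif not _matches(value, declared):
            errors.append(
                f"{key}: expected {declared}, got {type(value).__name__}"
            )
    return errors
```

Because every input is marked required: false, an empty argument dictionary passes the check. Only unknown keys and type mismatches are reported, which keeps the sketch independent of however the workflow itself interprets each field.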