Comfy-agent curated_vibevoice_q8_12gb_vram
install
source · Clone the upstream repo
git clone https://github.com/steliosot/comfy-agent
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/steliosot/comfy-agent "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/workflows/audio/curated_vibevoice_q8_12gb_vram" ~/.claude/skills/steliosot-comfy-agent-curated-vibevoice-q8-12gb-vram && rm -rf "$T"
manifest: skills/workflows/audio/curated_vibevoice_q8_12gb_vram/SKILL.md
source content
curated_vibevoice_q8_12gb_vram
Curated workflow skill generated from VibeVoice Q8 - 12GB VRAM.json.
Capability Family
audio
Inputs
- Optional runtime overrides supported by run(...): prompt, negative_prompt, width, height, seed, steps, cfg, sampler_name, scheduler, denoise, server, headers, api_prefix
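The override names listed above can be checked before a call. The following is a minimal sketch, assuming the overrides are passed to run(...) as keyword arguments; the helper build_overrides is hypothetical and not part of comfy-agent, and the example values (including the server URL) are illustrative only.

```python
# Override names copied from the Inputs list of this skill.
SUPPORTED_OVERRIDES = frozenset({
    "prompt", "negative_prompt", "width", "height", "seed", "steps", "cfg",
    "sampler_name", "scheduler", "denoise", "server", "headers", "api_prefix",
})


def build_overrides(**overrides):
    """Hypothetical helper: reject names the skill does not document.

    Overrides whose value is None are dropped so the workflow defaults apply.
    """
    unknown = sorted(set(overrides) - SUPPORTED_OVERRIDES)
    if unknown:
        raise ValueError(f"unsupported override(s): {', '.join(unknown)}")
    return {name: value for name, value in overrides.items() if value is not None}


# Illustrative values only; the server URL is an assumption.
kwargs = build_overrides(seed=42, server="http://127.0.0.1:8188", steps=None)
# kwargs could then be forwarded as run(**kwargs), assuming keyword arguments.
```

Rejecting unknown names early keeps a typo such as sampler instead of sampler_name from being silently ignored.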
Outputs
- Returns JSON with: status, prompt_id, output_images (includes image/video entries reported by Comfy history)
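As a rough illustration of consuming that result, the sketch below reads the three documented fields from a JSON string. The helper summarize_result is hypothetical, the sample payload and its status value are made up, and the entries of output_images are treated as opaque because their shape is not documented here.

```python
import json


def summarize_result(raw):
    """Hypothetical helper: pull the documented fields out of the returned JSON.

    Assumes `raw` is a JSON object serialized as a string.
    """
    result = json.loads(raw)
    return {
        "status": result.get("status"),
        "prompt_id": result.get("prompt_id"),
        # Count entries without assuming anything about their structure.
        "output_count": len(result.get("output_images") or []),
    }


# Made-up payload for illustration; real values come from the Comfy server.
example = '{"status": "completed", "prompt_id": "abc123", "output_images": []}'
summary = summarize_result(example)
```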
Model Requirements
- None detected from loader nodes.
Custom Node Requirements
- vibevoice-comfyui
Links Extracted From Workflow Notes
- https://discord.com/invite/gggpkVgBf3
- https://github.com/Enemyx-net/VibeVoice-ComfyUI
- https://huggingface.co/FabioSarracino/VibeVoice-Large-Q8/tree/main
- https://huggingface.co/Qwen/Qwen2.5-1.5B/tree/main
- https://www.youtube.com/@pixaroma
Source
- Original: comfy-data/workflows/VibeVoice Q8 - 12GB VRAM.json
Routing Metadata
- Family: audio
- Input modalities: audio
- Output modalities: audio/wav
- Model families: other
- Node count: 4
- Complexity score: 3
- Resource profile: medium
- Estimated runtime: moderate (about 30-120s depending on server)
- Max latent resolution hint: None x None
- Max sampler steps hint: None
Detected Models
- None detected.
Detected Custom Nodes
- vibevoice-comfyui
Runtime Warnings
- Audio generation may take longer on CPU-only or low-VRAM servers.
- Uses custom nodes; missing nodes can cause validation/runtime failures.
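The missing-node warning above can be checked ahead of a run. This is a minimal sketch, assuming the workflow is available in ComfyUI's API format (a mapping of node id to an object with a class_type key) and that a set of node class names known to the server has already been obtained, for example from the keys of its /object_info response. The helper name and the node class names in the example are placeholders; the real VibeVoice node class names are not listed in this document.

```python
def missing_node_types(workflow, available):
    """Return the sorted class_type values used by `workflow` but absent from `available`."""
    used = {
        node["class_type"]
        for node in workflow.values()
        if isinstance(node, dict) and "class_type" in node
    }
    return sorted(used - set(available))


# Placeholder node names, for illustration only.
workflow = {
    "1": {"class_type": "ExampleVibeVoiceNode", "inputs": {}},
    "2": {"class_type": "ExampleSaveNode", "inputs": {}},
}
available = {"ExampleSaveNode"}
missing = missing_node_types(workflow, available)
```

A non-empty result suggests installing the custom node package (here vibevoice-comfyui) on the server before submitting the workflow.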