Comfy-agent workflow_ep49_wan2_2_1_vace_gguf_text_to_video
install
source · Clone the upstream repo
git clone https://github.com/steliosot/comfy-agent
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/steliosot/comfy-agent "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/workflows/video_t2v_i2v_avatar/workflow_ep49_wan2_2_1_vace_gguf_text_to_video" ~/.claude/skills/steliosot-comfy-agent-workflow-ep49-wan2-2-1-vace-gguf-text-to-video && rm -rf "$T"
manifest: skills/workflows/video_t2v_i2v_avatar/workflow_ep49_wan2_2_1_vace_gguf_text_to_video/SKILL.md
workflow_ep49_wan2_2_1_vace_gguf_text_to_video
Imported workflow skill generated from Ep49 Wan2 2.1 Vace GGUF Text To Video.json.
Family
video_t2v_i2v_avatar
Inputs
- Optional runtime overrides supported by run(...): prompt, negative_prompt, width, height, seed, steps, cfg, sampler_name, scheduler, denoise, server, headers, api_prefix
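
A minimal sketch of how these overrides might be assembled before a call. Only the override names come from the list above; the `build_overrides` helper and the example values are hypothetical, and the actual `run(...)` signature should be checked in the skill's source.

```python
# Override names as listed under Inputs; everything else is illustrative.
ALLOWED_OVERRIDES = {
    "prompt", "negative_prompt", "width", "height", "seed", "steps", "cfg",
    "sampler_name", "scheduler", "denoise", "server", "headers", "api_prefix",
}


def build_overrides(**kwargs):
    """Hypothetical helper: reject unknown names and drop unset (None) values."""
    unknown = sorted(set(kwargs) - ALLOWED_OVERRIDES)
    if unknown:
        raise ValueError(f"Unsupported override(s): {', '.join(unknown)}")
    return {name: value for name, value in kwargs.items() if value is not None}


# Example values are assumptions, not defaults taken from the workflow.
overrides = build_overrides(
    prompt="a red fox running through snow",
    negative_prompt=None,  # left unset, so it is dropped
    width=832,
    height=480,
    seed=42,
    steps=20,
)
print(overrides)
```

The resulting dictionary could then be passed on as keyword arguments, for example `run(**overrides)`, assuming `run` accepts these names directly.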
Outputs
- Returns JSON with: status, prompt_id, output_images
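
As an illustration, the returned JSON could be unpacked as follows. Only the three key names come from the description above; the sample payload, the `"success"` status value, the shape of the `output_images` entries, and the `summarize_result` helper are assumptions.

```python
import json

# Hypothetical payload: only the key names are taken from the skill description.
raw = (
    '{"status": "success", "prompt_id": "abc123", '
    '"output_images": ["video_00001.mp4"]}'
)

EXPECTED_KEYS = ("status", "prompt_id", "output_images")


def summarize_result(raw_json):
    """Parse the result and fail loudly if a documented key is absent."""
    result = json.loads(raw_json)
    missing = [key for key in EXPECTED_KEYS if key not in result]
    if missing:
        raise KeyError(f"Missing expected key(s): {missing}")
    return result["status"], result["prompt_id"], list(result["output_images"])


status, prompt_id, outputs = summarize_result(raw)
print(status, prompt_id, outputs)
```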
Model Requirements
- diffusion_model: Wan2.1-VACE-14B-Q4_K_M.gguf -> models/diffusion_models
- clip: umt5_xxl_fp8_e4m3fn_scaled.safetensors -> models/clip
- vae: wan_2.1_vae.safetensors -> models/vae
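
Before running the workflow it can help to confirm that each file sits in the expected folder. The sketch below assumes the folders are relative to a ComfyUI installation root; the root path `ComfyUI` and the `missing_models` helper are hypothetical.

```python
from pathlib import Path

# Role -> (folder, filename), as listed under Model Requirements.
REQUIRED_MODELS = {
    "diffusion_model": ("models/diffusion_models", "Wan2.1-VACE-14B-Q4_K_M.gguf"),
    "clip": ("models/clip", "umt5_xxl_fp8_e4m3fn_scaled.safetensors"),
    "vae": ("models/vae", "wan_2.1_vae.safetensors"),
}


def missing_models(comfy_root):
    """Return a description of every required model file that is not on disk."""
    root = Path(comfy_root)
    missing = []
    for role, (folder, filename) in REQUIRED_MODELS.items():
        if not (root / folder / filename).is_file():
            missing.append(f"{role}: {folder}/{filename}")
    return missing


# "ComfyUI" is a placeholder; point this at the actual installation directory.
for entry in missing_models("ComfyUI"):
    print("missing", entry)
```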
Custom Node Requirements
comfyui-gguf
Links
- https://discord.com/invite/gggpkVgBf3
- https://docs.comfy.org/tutorials/video/wan/vace
- https://github.com/ali-vilab/VACE
- https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/resolve/main/split_files/text_encoders/umt5_xxl_fp16.safetensors?download=true
- https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/resolve/main/split_files/text_encoders/umt5_xxl_fp8_e4m3fn_scaled.safetensors?download=true
- https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/resolve/main/split_files/vae/wan_2.1_vae.safetensors?download=true
- https://huggingface.co/QuantStack/Wan2.1-VACE-14B-GGUF/resolve/main/Wan2.1-VACE-14B-Q4_K_M.gguf?download=true
- https://huggingface.co/QuantStack/Wan2.1-VACE-14B-GGUF/tree/main
- https://www.youtube.com/@pixaroma
Routing Metadata
- Family: video_t2v_i2v_avatar
- Input modalities: text_prompt
- Output modalities: video/mp4
- Model families: sd3, wan
- Node count: 14
- Complexity score: 8
- Resource profile: high
- Estimated runtime: slow (often 2-6 min depending on model/server load)
- Max latent resolution hint: None x None
- Max sampler steps hint: 20
Detected Models
- diffusion_model: Wan2.1-VACE-14B-Q4_K_M.gguf -> models/diffusion_models
- clip: umt5_xxl_fp8_e4m3fn_scaled.safetensors -> models/clip
- vae: wan_2.1_vae.safetensors -> models/vae
Detected Custom Nodes
comfyui-gguf
Runtime Warnings
- Large model(s) detected; ensure enough VRAM and disk space.
- Uses custom nodes; missing nodes can cause validation/runtime failures.
- Video workflow: usually slower and more VRAM-intensive than still-image workflows.
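
Two of these warnings can be partly checked ahead of time. The sketch below looks for the custom node folder and for free disk space; it does not check VRAM. The `preflight` helper, the `custom_nodes` folder layout, the case-insensitive name match, and the 30 GB threshold are all assumptions, not values taken from the skill.

```python
import shutil
from pathlib import Path


def preflight(comfy_root, node_name="comfyui-gguf", min_free_gb=30.0):
    """Return warning strings; an empty list means both checks passed.

    min_free_gb is a placeholder threshold, not a measured requirement.
    """
    root = Path(comfy_root)
    warnings = []

    # Assumes custom nodes live in <root>/custom_nodes/<name>, matched case-insensitively.
    nodes_dir = root / "custom_nodes"
    installed = set()
    if nodes_dir.is_dir():
        installed = {p.name.lower() for p in nodes_dir.iterdir() if p.is_dir()}
    if node_name.lower() not in installed:
        warnings.append(f"custom node '{node_name}' not found under {nodes_dir}")

    # disk_usage needs an existing path, so skip the check if the root is absent.
    if root.exists():
        free_gb = shutil.disk_usage(root).free / 1e9
        if free_gb < min_free_gb:
            warnings.append(f"only {free_gb:.1f} GB free, below {min_free_gb} GB")

    return warnings


# "ComfyUI" is a placeholder for the actual installation directory.
for message in preflight("ComfyUI"):
    print("warning:", message)
```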