Comfy-agent curated_stability_ai_text_to_audio
install
source · Clone the upstream repo
git clone https://github.com/steliosot/comfy-agent
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/steliosot/comfy-agent "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/workflows/audio/curated_stability_ai_text_to_audio" ~/.claude/skills/steliosot-comfy-agent-curated-stability-ai-text-to-audio-336665 && rm -rf "$T"
manifest:
skills/workflows/audio/curated_stability_ai_text_to_audio/SKILL.mdsource content
curated_stability_ai_text_to_audio
Curated workflow skill generated from
Stability AI Text To Audio.json.
Capability Family
audio
Inputs
- Optional runtime overrides supported by
:run(...)promptnegative_prompt
,widthheight
,seed
,stepscfg
,sampler_name
,schedulerdenoise
,server
,headersapi_prefix
Outputs
- Returns JSON with:
statusprompt_id
(includes image/video entries reported by Comfy history)output_images
Model Requirements
- None detected from loader nodes.
Custom Node Requirements
- None detected.
Links Extracted From Workflow Notes
- https://discord.com/invite/gggpkVgBf3
- https://docs.comfy.org/tutorials/api-nodes/pricing
- https://www.youtube.com/@pixaroma
Source
- Original:
comfy-data/workflows/Stability AI Text To Audio.json
Routing Metadata
- Family:
audio - Input modalities:
text_prompt - Output modalities:
audio/wav - Model families:
other - Node count:
4 - Complexity score:
2 - Resource profile:
low - Estimated runtime:
fast (usually under 30s on modern GPU) - Max latent resolution hint:
xNoneNone - Max sampler steps hint:
None
Detected Models
- None detected.
Detected Custom Nodes
- None detected.
Runtime Warnings
- Audio generation may take longer on CPU-only or low-VRAM servers.