Comfy-agent curated_stability_ai_text_to_audio

install

source · Clone the upstream repo

git clone https://github.com/steliosot/comfy-agent

Claude Code · Install into ~/.claude/skills/

T=$(mktemp -d) && git clone --depth=1 https://github.com/steliosot/comfy-agent "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/workflows/audio/curated_stability_ai_text_to_audio" ~/.claude/skills/steliosot-comfy-agent-curated-stability-ai-text-to-audio-336665 && rm -rf "$T"

manifest: skills/workflows/audio/curated_stability_ai_text_to_audio/SKILL.md

source content

curated_stability_ai_text_to_audio

Curated workflow skill generated from

Stability AI Text To Audio.json

Capability Family

```
audio
```

Inputs

Optional runtime overrides supported by

run(...)

```
prompt
```
```
negative_prompt
```
```
width
```
,
```
height
```
```
seed
```
,
```
steps
```
,
```
cfg
```
```
sampler_name
```
,
```
scheduler
```
,
```
denoise
```
```
server
```
,
```
headers
```
,
```
api_prefix
```

Outputs

Returns JSON with:
- ```
status
```
- ```
prompt_id
```
- ```
output_images
```
  (includes image/video entries reported by Comfy history)

Model Requirements

None detected from loader nodes.

Custom Node Requirements

None detected.

Links Extracted From Workflow Notes

Source

Original:

comfy-data/workflows/Stability AI Text To Audio.json

Routing Metadata

Family:
```
audio
```
Input modalities:
```
text_prompt
```
Output modalities:
```
audio/wav
```
Model families:
```
other
```
Node count:
```
4
```
Complexity score:
```
2
```
Resource profile:
```
low
```
Estimated runtime:
```
fast (usually under 30s on modern GPU)
```
Max latent resolution hint:
```
None
```
x
```
None
```
Max sampler steps hint:
```
None
```

Detected Models

None detected.

Detected Custom Nodes

None detected.

Runtime Warnings

Audio generation may take longer on CPU-only or low-VRAM servers.