Expanso-skills text-to-speech

Skill: text-to-speech

install
source · Clone the upstream repo
git clone https://github.com/expanso-io/skills.expanso.io
manifest: skills/ai/text-to-speech/skill.yaml
source content

Skill: text-to-speech

Version: 1.0.0

Convert text to speech using OpenAI TTS.

Outputs audio in various formats (mp3, wav, opus).

name: text-to-speech version: 1.0.0 description: Convert text to speech using AI (mp3, wav, opus output)

Credentials

credentials:

  • name: OPENAI_API_KEY required: true description: OpenAI API key for TTS

Skill inputs

inputs:

  • name: text type: string required: true description: Text to convert to speech (max 4096 chars)
  • name: voice type: string default: alloy enum: [alloy, echo, fable, onyx, nova, shimmer] description: Voice to use
  • name: format type: string default: mp3 enum: [mp3, opus, aac, flac, wav, pcm] description: Audio output format

Skill outputs

outputs:

  • name: audio type: binary description: Audio file content (base64 encoded)
  • name: metadata type: object description: Generation metadata

Backend options

backends:

  • name: openai type: remote requires: [OPENAI_API_KEY] description: OpenAI TTS-1 (~$0.015 per 1K chars)

Expanso components used

components: inputs: - stdin - http_server processors: - mapping - openai_speech - log outputs: - stdout - sync_response