Openclaw-master-skills ai-task-hub

AI task hub for image analysis, background removal, speech-to-text, text-to-speech, markdown conversion, and async execute/poll/presentation orchestration. Use when users need hosted AI outcomes while host runtime manages identity, credits, payment, and risk control.

install
source · Clone the upstream repo
git clone https://github.com/LeoYeAI/openclaw-master-skills
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/LeoYeAI/openclaw-master-skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/ai-task-hub" ~/.claude/skills/leoyeai-openclaw-master-skills-ai-task-hub && rm -rf "$T"
OpenClaw · Install into ~/.openclaw/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/LeoYeAI/openclaw-master-skills "$T" && mkdir -p ~/.openclaw/skills && cp -r "$T/skills/ai-task-hub" ~/.openclaw/skills/leoyeai-openclaw-master-skills-ai-task-hub && rm -rf "$T"
manifest: skills/ai-task-hub/SKILL.md
safety · automated scan (low risk)
This is a pattern-based risk scan, not a security review. Our crawler flagged:
  • references API keys
Always read a skill's source content before installing. Patterns alone don't mean the skill is malicious — but they warrant attention.
source content

AI Task Hub

Formerly

skill-hub-gateway
.

Public package boundary:

  • Only orchestrates
    portal.skill.execute
    ,
    portal.skill.poll
    , and
    portal.skill.presentation
    .
  • Does not exchange
    api_key
    or
    userToken
    inside this package.
  • Does not handle recharge or payment flows inside this package.
  • Assumes host runtime injects short-lived task tokens and attachment URLs.

Chinese documentation:

SKILL.zh-CN.md

When to Use This Skill

Use this skill when the user asks to:

  • detect people, faces, hands, keypoints, or tags from images
  • remove backgrounds or generate cutout/matting results for products or portraits
  • transcribe uploaded audio into text (
    speech to text
    ,
    audio transcription
    )
  • generate speech from text input (
    text to speech
    ,
    voice generation
    )
  • convert uploaded files into markdown (
    document to markdown
    )
  • start async jobs and check status later (
    poll
    ,
    check job status
    )
  • fetch rendered visual outputs such as
    overlay
    ,
    mask
    , and
    cutout
  • run embedding or reranking tasks for retrieval workflows

Common Requests

Example requests that should trigger this skill:

  • "Detect faces in this image and return bounding boxes."
  • "Tag this image and summarize the main objects."
  • "Remove the background from this product photo."
  • "Create a clean cutout from this portrait image."
  • "Transcribe this meeting audio into text."
  • "Generate speech from this paragraph."
  • "Convert this PDF file into markdown."
  • "Start this job now and let me poll the run status later."
  • "Fetch overlay and mask files for run_456."
  • "Generate embeddings for this text list and rerank the candidates."

Search-Friendly Capability Aliases

  • vision
    aliases: face detection, human detection, person detection, image tagging
  • background
    aliases: remove background, background removal, cutout, matting, product-cutout
  • asr
    aliases: speech to text, audio transcription, transcribe audio
  • tts
    aliases: text to speech, voice generation, speech synthesis
  • markdown_convert
    aliases: document to markdown, file to markdown, markdown conversion
  • poll
    aliases: check job status, poll long-running task, async run status
  • presentation
    aliases: rendered output, overlay, mask, cutout files
  • embeddings/reranker
    aliases: vectorization, semantic vectors, relevance reranking

Runtime Contract

Default API base URL:

https://gateway-api.binaryworks.app

Action to endpoint mapping:

  • portal.skill.execute
    ->
    POST /agent/skill/execute
  • portal.skill.poll
    ->
    GET /agent/skill/runs/:run_id
  • portal.skill.presentation
    ->
    GET /agent/skill/runs/:run_id/presentation

Auth Contract (Host-Managed)

Every request must include:

  • X-Agent-Task-Token: <jwt_or_paseto>

Recommended token claims:

  • sub
    (user_id)
  • agent_uid
  • conversation_id
  • scope
    (
    execute|poll|presentation
    )
  • exp
  • jti

CLI argument order for

scripts/skill.mjs
:

  • [agent_task_token] <action> <payload_json> [base_url]
  • If token arg is omitted, script reads
    AGENT_TASK_TOKEN
    from environment.
  • Host runtime should refresh and inject short-lived
    AGENT_TASK_TOKEN
    automatically to avoid user-facing auth friction.

Payload Contract

  • portal.skill.execute
    : payload requires
    capability
    and
    input
    .
  • payload.request_id
    is optional and passed through.
  • portal.skill.poll
    and
    portal.skill.presentation
    : payload requires
    run_id
    .
  • portal.skill.presentation
    supports
    include_files
    (defaults to
    true
    ).

Attachment normalization:

  • Prefer explicit
    image_url
    /
    audio_url
    /
    file_url
    .
  • attachment.url
    is mapped to target media field by capability.
  • Local
    file_path
    is disabled in the published package.
  • Host must upload chat attachments first, then pass URL fields.
  • Example host upload endpoint:
    /api/blob/upload-file
    .

Error Contract

  • Preserve gateway envelope:
    request_id
    ,
    data
    ,
    error
    .
  • Preserve
    POINTS_INSUFFICIENT
    and pass through
    error.details.recharge_url
    .

Bundled Files

  • scripts/skill.mjs
  • scripts/agent-task-auth.mjs
  • scripts/base-url.mjs
  • scripts/attachment-normalize.mjs
  • scripts/telemetry.mjs
    (compatibility shim)
  • references/capabilities.json
  • references/openapi.json
  • SKILL.zh-CN.md