Hermes-agent inference-sh-cli
Run 150+ AI apps via inference.sh CLI (infsh) — image generation, video creation, LLMs, search, 3D, social automation. Uses the terminal tool. Triggers: inference.sh, infsh, ai apps, flux, veo, image generation, video generation, seedream, seedance, tavily
install
source · Clone the upstream repo
git clone https://github.com/NousResearch/hermes-agent
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/NousResearch/hermes-agent "$T" && mkdir -p ~/.claude/skills && cp -r "$T/optional-skills/devops/cli" ~/.claude/skills/nousresearch-hermes-agent-inference-sh-cli-696c22 && rm -rf "$T"
manifest:
optional-skills/devops/cli/SKILL.mdsource content
inference.sh CLI
Run 150+ AI apps in the cloud with a simple CLI. No GPU required.
All commands use the terminal tool to run
infsh commands.
When to Use
- User asks to generate images (FLUX, Reve, Seedream, Grok, Gemini image)
- User asks to generate video (Veo, Wan, Seedance, OmniHuman)
- User asks about inference.sh or infsh
- User wants to run AI apps without managing individual provider APIs
- User asks for AI-powered search (Tavily, Exa)
- User needs avatar/lipsync generation
Prerequisites
The
infsh CLI must be installed and authenticated. Check with:
infsh me
If not installed:
curl -fsSL https://cli.inference.sh | sh infsh login
See
references/authentication.md for full setup details.
Workflow
1. Always Search First
Never guess app names — always search to find the correct app ID:
infsh app list --search flux infsh app list --search video infsh app list --search image
2. Run an App
Use the exact app ID from the search results. Always use
--json for machine-readable output:
infsh app run <app-id> --input '{"prompt": "your prompt here"}' --json
3. Parse the Output
The JSON output contains URLs to generated media. Present these to the user with
MEDIA:<url> for inline display.
Common Commands
Image Generation
# Search for image apps infsh app list --search image # FLUX Dev with LoRA infsh app run falai/flux-dev-lora --input '{"prompt": "sunset over mountains", "num_images": 1}' --json # Gemini image generation infsh app run google/gemini-2-5-flash-image --input '{"prompt": "futuristic city", "num_images": 1}' --json # Seedream (ByteDance) infsh app run bytedance/seedream-5-lite --input '{"prompt": "nature scene"}' --json # Grok Imagine (xAI) infsh app run xai/grok-imagine-image --input '{"prompt": "abstract art"}' --json
Video Generation
# Search for video apps infsh app list --search video # Veo 3.1 (Google) infsh app run google/veo-3-1-fast --input '{"prompt": "drone shot of coastline"}' --json # Seedance (ByteDance) infsh app run bytedance/seedance-1-5-pro --input '{"prompt": "dancing figure", "resolution": "1080p"}' --json # Wan 2.5 infsh app run falai/wan-2-5 --input '{"prompt": "person walking through city"}' --json
Local File Uploads
The CLI automatically uploads local files when you provide a path:
# Upscale a local image infsh app run falai/topaz-image-upscaler --input '{"image": "/path/to/photo.jpg", "upscale_factor": 2}' --json # Image-to-video from local file infsh app run falai/wan-2-5-i2v --input '{"image": "/path/to/image.png", "prompt": "make it move"}' --json # Avatar with audio infsh app run bytedance/omnihuman-1-5 --input '{"audio": "/path/to/audio.mp3", "image": "/path/to/face.jpg"}' --json
Search & Research
infsh app list --search search infsh app run tavily/tavily-search --input '{"query": "latest AI news"}' --json infsh app run exa/exa-search --input '{"query": "machine learning papers"}' --json
Other Categories
# 3D generation infsh app list --search 3d # Audio / TTS infsh app list --search tts # Twitter/X automation infsh app list --search twitter
Pitfalls
- Never guess app IDs — always run
first. App IDs change and new apps are added frequently.infsh app list --search <term> - Always use
— raw output is hard to parse. The--json
flag gives structured output with URLs.--json - Check authentication — if commands fail with auth errors, run
or verifyinfsh login
is set.INFSH_API_KEY - Long-running apps — video generation can take 30-120 seconds. The terminal tool timeout should be sufficient, but warn the user it may take a moment.
- Input format — the
flag takes a JSON string. Make sure to properly escape quotes.--input
Reference Docs
— Setup, login, API keysreferences/authentication.md
— Searching and browsing the app catalogreferences/app-discovery.md
— Running apps, input formats, output handlingreferences/running-apps.md
— Complete CLI command referencereferences/cli-reference.md