Higgsfield-ai-prompt-skill higgsfield-models

install
source · Clone the upstream repo
git clone https://github.com/OSideMedia/higgsfield-ai-prompt-skill
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/OSideMedia/higgsfield-ai-prompt-skill "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/higgsfield-models" ~/.claude/skills/osidemedia-higgsfield-ai-prompt-skill-higgsfield-models && rm -rf "$T"
manifest: skills/higgsfield-models/SKILL.md
source content

Higgsfield Model Selection Guide

Choosing the right model is the single biggest factor in output quality after the prompt. This file handles most selection questions. For deep per-model documentation (prompting specifics, parameters, edge cases, API details) → read

MODELS-DEEP-REFERENCE.md
.


Quick Decision Flowchart

Fast lookup — for detailed comparisons see the full tables below.

NeedRecommended ModelTier
Top-tier cinematic video + audioKling 3.0Premium
Epic scale / spectacleSora 2Premium
Nature / landscapes + ref imagesVeo 3.1Premium
Artistic / stylized videoWan 2.6Mid
Fast video iterationSeedance 2.0 ProMid
VFX / fluid motionMinimax Hailuo 2.3Mid
Budget-friendly videoKling 2.5 Turbo / Higgsfield DoP LiteFree–Low
Fashion / aesthetic imagesSoul 2.0Free
Photorealistic sharp imagesNano Banana ProLow
AI actor generationSoul CastLow
Native 4K imagesKling Image 3.0Mid
Photo style transformationPhotodump (29 presets)Low

Pricing tiers: Free (Soul 2.0, DoP Lite) · Low (0.1–2 credits) · Mid (2–10 credits) · Premium (10+ credits). See the Credit Cost Reference below for exact per-model costs.


Video Models — Comparison

ModelRealismCharacterMotionStyleDurationAudioBest for
Kling 3.0★★★★★★★★★★★★★★★★★★★☆3–15sCinematic, long, audio, multi-shot
Kling 3.0 Omni★★★★★★★★★★★★★★★★★★★☆3–15sVideo clone, storyboard control
Kling 3.0 Omni Edit★★★★★★★★★★★★★★☆3–10sEdit footage at 3.0 quality
Kling O1 Video (legacy)★★★★★★★★★★★★★★☆★★★☆☆5–10sMulti-ref (7), start/end frame
Kling O1 Video Edit (legacy)★★★★☆★★★★★★★★★★3–10sRelight, restyle, swap, remove
Kling 3.0 Motion Control★★★★★★★★★☆★★★★★★★★☆☆3–30sOptionalMotion transfer from reference video
Kling 2.6 (legacy)★★★★★★★★★★★★★★☆★★★☆☆5–10sCharacter drama, realism (no audio)
Kling 2.5 Turbo★★★★☆★★★★☆★★★★☆★★★☆☆5–10sFast Kling iteration
Sora 2★★★★☆★★★☆☆★★★★★★★★★☆Epic scale, physics, action
Wan 2.7★★★★★★★★★☆★★★★★★★★★★2–15s60fps, T2V/I2V/R2V/edit, first+last frame
Wan 2.6★★★★☆★★★☆☆★★★★☆★★★★★5–15sArtistic, stylized, improved physics
Wan 2.5★★★★☆★★★☆☆★★★★☆★★★★★5–10sNative audio, artistic, fantasy
Seedance 2.0★★★★★★★★★★★★★★★★★★★☆10s12-asset multimodal, complex motion
Seedance 1.5 Pro★★★★☆★★★★☆★★★★☆★★★★☆10sBest lip-sync, multilingual audio
Seedance Pro★★★☆☆★★★☆☆★★★☆☆★★★☆☆10sFast iteration, no audio needed
Veo 3.1★★★★★★★★★☆★★★★☆★★★★☆4–8sRef images, first/last frame, 4K
Veo 3.1 Lite★★★★☆★★★★☆★★★★☆★★★★☆4–8sBudget 3.1 quality, 1080p, I2V, volume
Veo 3★★★★☆★★★☆☆★★★★☆★★★☆☆4–8sNature, environment, stable model
Grok Imagine Video★★★★☆★★★☆☆★★★★☆★★★★☆1–15sVideo editing, animate images
Minimax Hailuo 2.3★★★★★★★★★☆★★★★★★★★★☆6–10sVFX, fluid motion, anime, physics
Minimax Hailuo 02★★★★☆★★★☆☆★★★★★★★★☆☆6–10sDance, sports, fluid motion
Higgsfield DoP (Lite/Standard/Turbo)★★★☆☆★★★☆☆★★★★☆★★★☆☆3–5sI2V specialist, 50+ presets, optical physics

Decision Flowchart

Is this image or video?
├── IMAGE
│   ├── Person / portrait? → Soul 2.0
│   ├── Cinematic keyframe for I2V pipeline? → Soul Cinema Preview
│   ├── Native 4K / image series / storyboarding? → Kling Image 3.0
│   ├── Maximum sharpness / 4K? → Nano Banana Pro
│   ├── Fast pro-quality / text rendering? → Nano Banana 2
│   ├── Reference consistency or dense text? → Seedream 4.5
│   ├── Complex layout / multi-panel? → Seedream 5.0 Lite
│   ├── Text/logo in image? → GPT Image 1.5
│   └── Edit an existing image? → Flux Kontext
│
└── VIDEO
    ├── EDIT existing footage?
    │   ├── Relight, restyle, swap, remove → Kling O1 Video Edit
    │   └── Higher quality 3.0 edit → Kling 3.0 Omni Edit
    │
    ├── Is a human character the focus?
    │   ├── Need audio, long clip (15s), multi-shot → Kling 3.0
    │   ├── Need to clone from reference video → Kling 3.0 Omni
    │   ├── Best lip-sync + multilingual → Seedance 1.5 Pro
    │   ├── No audio needed, great character → Kling 2.6
    │   └── Fast iteration → Kling 2.5 Turbo
    │
    ├── Need motion transfer from reference video?
    │   └── → Kling 3.0 Motion Control
    │
    ├── Animate a still image with cinematic camera?
    │   └── → Higgsfield DoP (Lite/Standard/Turbo)
    │
    ├── Is the environment/phenomenon the hero?
    │   ├── Nature, documentary, stable → Veo 3
    │   ├── Need ref image consistency → Veo 3.1
    │   ├── Budget Veo 3.1 quality / volume → Veo 3.1 Lite
    │   ├── 60fps, first+last frame, ref images → Wan 2.7
    │   └── Artistic, painterly, fantasy → Wan 2.5/2.6
    │
    ├── Is it action/spectacle?
    │   ├── Epic scale, crowds, physics → Sora 2
    │   ├── VFX, anime, fluid motion → Minimax Hailuo 2.3
    │   └── Dance, sports, budget motion → Minimax Hailuo 02
    │
    ├── Need maximum reference control?
    │   ├── Up to 12 assets (images+video+audio) → Seedance 2.0
    │   ├── Up to 7 image refs → Kling O1 Video
    │   └── Up to 3 asset refs → Veo 3.1
    │
    └── Speed/cost priority?
        ├── Fastest Kling → Kling 2.5 Turbo
        ├── Fastest Seedance → Seedance Pro
        └── Fastest Veo → Veo 3.1 Fast or Veo 3 Fast

Image Models — Quick Selection

NeedModelCredits
Fashion / cultural portraitSoul 2.0Free
Cinematic keyframe for I2VSoul Cinema PreviewLow
Cheapest generationZ-Image0.15
Low-cost portraitHiggsfield Soul0.5
Low-cost 2K squareKling O10.5
Native 4K / image seriesKling Image 3.0
4K + advanced editingKling Image 3.0 Omni
Fast versatile 2KSeedream 5.0 Lite1
4K versatileSeedream 4.51
Sketch-to-image (Draw)Nano Banana1
Artistic / stylizedWan 2.21
Blend multiple referencesMulti Reference1.5
Fast pro-quality + text renderingNano Banana 21.5
Complex prompts / text in imageGPT Image 1.52
Max fidelity / Thinking mode / 14 refsNano Banana Pro2
Image editing / inpaintingFlux Kontextvaries
Photo style transformation (29 cartoon/illustration presets)PhotodumpLow

Full image model specs + UI controls →

../../image-models.md
Full Photodump preset library (29 named styles) →
../../photodump-presets.md


Budget Tiers

Image models — by credit cost:

  • Free / near-free: Soul 2.0 (5K gens) · Z-Image (0.15) · Face Swap (2 free)
  • Budget (0.5–1): Higgsfield Soul · Kling O1 · Seedream family · Nano Banana · Wan 2.2 · Reve
  • Mid (1.5–2): Nano Banana 2 · Multi Reference · FLUX.2 Pro · Flux Kontext Max · GPT Image · NB Pro · Character Swap
  • Premium (5–6): FLUX.2 Flex · FLUX.2 Max

General pricing tiers (video + image, approximate):

  • Free: Soul 2.0 · DoP Lite (limited)
  • Low: 0.1–2 credits per generation
  • Mid: 2–10 credits per generation
  • Premium: 10+ credits per generation

For exact per-model video costs see the Credit Cost Reference in

../../model-guide.md
.


Unique Feature Matrix

FeatureAvailable on
Native audio (dialogue, SFX, ambient)Kling 3.0/Omni · Seedance 1.5 Pro/2.0 · Veo 3/3.1 · Grok Video
Soul ID character slotSoul 2.0 · GPT Image · Higgsfield Soul
@ Elements syntaxSeedream 4.5/5.0 Lite · Nano Banana Pro · Cinema Studio
Draw (sketch-to-image)Nano Banana · Nano Banana Pro
Video editing (relight/restyle/swap)Kling O1 Video Edit · Kling 3.0 Omni Edit · Grok Video
Multi-image reference blendMulti Reference · Nano Banana Pro (14 refs) · Kling O1 Video (7 refs)
Start/end frame controlKling O1 Video · Veo 3.1 · Wan 2.7
Video extension (up to 148s)Veo 3.1
Performance cloning from videoKling 3.0 Omni
Up to 30s camera/motion transferKling 3.0 Motion Control
Soul Cast AI actorsCinema Studio 2.5
Soul Cast AI actors (General 2K / Character 4K / Location 4K)Cinema Studio 3.0 (Business/Team)
Built-in color gradingCinema Studio 2.5
Native dual-channel stereo audioCinema Studio 3.0 (Business/Team) · Kling 3.0/Omni · Seedance 2.0/1.5 Pro · Veo 3/3.1 · Wan 2.5/2.7
Soul HEX color matchingSoul 2.0 · Soul Cinema Preview · Cinema Studio 2.5
Native 4K image seriesKling Image 3.0
Style presets + Color TransferSoul 2.0
Google Search groundingNano Banana Pro
Negative prompts supportedVeo 3/3.1 only
Smart auto-camera planningCinema Studio 3.0 (Business/Team)

Key Model Notes

Kling 3.0 vs 2.6: 3.0 is the current top model — longer clips (15s vs 10s), native audio, multi-shot AI direction, physics engine, 4K HDR, stylized output engine. 2.6 is now legacy — use 3.0 for all new work unless cost is the primary constraint.

Kling V3 vs O3: Use V3 for prompt-driven cinematic work (text-to-video, image-to-video). Use O3 when you have reference media (video or image+audio) to anchor character identity — O3's reference-based consistency is its defining advantage.

Kling 3.0 Motion Control: Upload a 3–30s reference clip to transfer full-body motion, hand gestures, facial expressions. Image Orientation for camera/talking head; Video Orientation for complex motions (dancing, action, full-body movement).

Seedance 2.0: Rule of 12 (up to 12 assets per generation). Real person face uploads blocked — use synthetic character references. Best practices for Seedance 2.0 prompting are integrated into the sub-skills (see higgsfield-prompt, higgsfield-camera, higgsfield-motion).

Veo 3.1 vs 3.1 Lite vs 3: 3.1 adds reference images (up to 3), first/last frame, video extension, 4K. 3.1 Lite is budget-priced 3.1 quality at 1080p — supports T2V and I2V, costs less than half of 3.1 Fast. 3 is stable and proven. Use 3.1 for subject consistency, 3.1 Lite for volume, 3 for pure environment/nature.

Wan 2.7: Major upgrade — native 60fps (vs 24fps in 2.6), up to 15s duration, first+last frame anchoring, up to 5 reference images, 4-model suite (T2V/I2V/R2V/video edit), Flow-Matching architecture. 40% better physics consistency over 2.6.

Wan 2.5: First Wan version with native audio — joint text/audio/video generation. Supports audio-driven video (upload audio to drive visuals). 1080p, 5–10s.

Minimax Hailuo 2.3 vs 02: 2.3 is a major upgrade — improved physics, anime/illustration styles, facial micro-expressions, better prompt adherence. Fast variant now at 1080p (02 Fast was 512p). 02 remains available for budget motion work.

Grok Imagine: Aurora architecture (autoregressive, not diffusion) — excels at text/logo rendering and multi-image compositing. Image editing supports multi-turn iterative chains.

For deep documentation on any specific model → read

MODELS-DEEP-REFERENCE.md


Cinema Studio 3.0 (Business/Team Plan)

Cinema Studio 3.0 is a separate generation engine available on Business and Team plans. Version toggle in the upper-right corner of the Cinema Studio UI switches between 2.5 and 3.0.

FeatureCinema Studio 2.5Cinema Studio 3.0 (Business/Team)
Video ResolutionUp to 1080pUp to 720p (may increase)
Image ResolutionUp to 4KUp to 4K (Character/Location) · Up to 2K (General)
Max Duration12s15s
Aspect Ratios6 options7 options (+ 21:9 ultrawide)
AudioOn/OffOn/Off (native dual-channel stereo)
Shot ControlManual multi-shotSmart (auto) + Custom multi-shot
Generation CostVaries48 credits

For full Cinema Studio 3.0 documentation → see

higgsfield-cinema


Related skills

  • higgsfield-prompt
    — MCSLA formula, prompt structure
  • higgsfield-cinema
    — Cinema Studio model selection
  • higgsfield-assist
    — Credit optimization and plan selection
  • higgsfield-audio
    — Audio-capable model details
  • templates/
    — Annotated templates with per-genre model recommendations