Clawfu-skills minimalist-image-director

Minimalist Image Director

install
source · Clone the upstream repo
git clone https://github.com/guia-matthieu/clawfu-skills
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/guia-matthieu/clawfu-skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/ai-design/minimalist-image-director" ~/.claude/skills/guia-matthieu-clawfu-skills-minimalist-image-director && rm -rf "$T"
manifest: skills/ai-design/minimalist-image-director/SKILL.md
source content

Minimalist Image Director

Art direction framework for generating warm minimalist photography via AI image generators (Flux, Midjourney, DALL-E). Separates compositional minimalism from emotional minimalism to avoid the "beautiful but sad" trap.

When to Use This Skill

  • Generating hero images, card images, or blog illustrations for a website
  • Creating a cohesive visual identity across 10+ AI-generated images
  • Briefing AI image generators (Replicate/Flux, Midjourney, DALL-E) with emotional precision
  • When previous minimalist attempts came back "too cold" or "too sad"
  • Building a visual style guide for a brand's AI-generated photography

Methodology Foundation

Sources:

  • Editorial photography principles (Annie Leibovitz, minimal lifestyle photography trend 2024-2026)
  • Emotional Design (Don Norman, 2004) — visceral, behavioral, reflective processing
  • Color psychology research — warm tones (2700-3000K) activate approach behaviors, cool tones trigger avoidance
  • Neuroscience of visual-thermal perception — 80% of experiments show visual environment manipulation affects thermal perception (red-orange = warmth, green-blue = cold)
  • Black Forest Labs official prompting guides (Flux 1.1 Pro, Flux 2)
  • Kodak Portra 400 color science — the gold standard for warm skin tones in AI photography

Core Principle: Minimalism is about what you KEEP, not what you REMOVE. The fewer elements in a frame, the more each one must carry emotional weight. Empty space amplifies — it amplifies warmth just as easily as coldness.

Why This Matters: AI image generators default to "aesthetic minimalism" which reads as cold, clinical, lonely. The skill teaches how to direct warmth INTO minimal compositions, getting the clean look without the emotional void.

The Neuroscience: Warm colors trigger approach behaviors and lower cognitive vigilance — the viewer feels safe. Cool colors trigger alertness and avoidance. This is not aesthetic preference; it's how photoreceptors and neural pathways process visual information.


What Claude Does vs What You Decide

"Claude handles the prompt engineering. You bring the emotional truth."

Claude handlesYou provide
Translating emotional intent into Flux/MJ prompt syntaxThe emotion each image must convey
Applying the 4-layer prompt architecture consistentlyBrand palette and visual identity
Flagging prompt anti-patterns that produce sad/cold imagesValidation — does this FEEL right?
Generating batch-consistent style prefixesSubject matter and context for each image
Optimizing aspect ratios and technical parametersFinal selection between generated options

Remember: AI can generate technically perfect minimalist images that feel completely wrong. Your gut reaction to the emotion is the quality gate, not the composition.


What This Skill Does

  1. Emotional Calibration - Defines the target emotion BEFORE writing any prompt
  2. 4-Layer Prompt Architecture - Style + Subject + Emotion + Anti-patterns in every prompt
  3. Batch Consistency - Creates a shared style prefix for visual cohesion across sets
  4. Anti-Pattern Detection - Flags words/directions that trigger cold/sad/clinical outputs
  5. Brand Alignment - Maps brand voice to visual language (warm brand = warm photos)

How to Use

Generate images for website cards

I need 3 card images for a child development psychologist website.
Brand palette: cream, coral, warm earth tones.
Cards: Motor Development, Emotional Development, Cognitive Development.
Target emotion: hopeful, warm, possibility.
Generator: Replicate Flux 1.1 Pro, 3:4 aspect ratio.

Create a cohesive blog image set

Generate prompts for 13 blog articles about parenting and child psychology.
All images must feel like they're from the same photo shoot.
Brand: warm, approachable, Latin American families.
Avoid: clinical, sad, isolated figures, stock photo poses.

Fix images that came back too cold

These minimalist images came back sad/cold. Here's the original prompt: [prompt].
Keep the minimalist composition but make it emotionally warm.
The image should make a parent feel "I want to be that parent" not "that's beautiful but lonely."

Instructions

When generating minimalist image prompts, follow this methodology precisely:

Step 1: Define the Emotional Target

Before writing ANY prompt, answer:

## Emotional Brief

**This image should make the viewer feel:** ________________
**The viewer should want to:** ________________
**This is NOT about:** ________________

**Emotional quadrant:**
        WARM
         |
ACTIVE --+-- CALM
         |
        COLD

Target: [e.g., Warm + Calm = nurturing serenity]

Key principle: If you can't name the emotion in 2 words, the image will be vague.

Emotional vocabulary for warm minimalism:

Warm + ActiveWarm + Calm
Delight, play, discoverySerenity, connection, trust
Courage, determination, pridePresence, intimacy, safety
Freedom, possibility, wonderPatience, tenderness, focus
Cold + Active (AVOID)Cold + Calm (AVOID)
Anxiety, urgency, pressureLoneliness, melancholy, void
Frustration, anger, defeatIsolation, clinical, sterile

Color psychology for emotional targeting:

Color rangeEmotional effectUse when...
Cream/ivory (#FAF8F5)Soft, approachable, comfortable baseEvery warm minimalist image (background)
Terracotta (#C2704F)Earthy warmth, trustworthiness, permanenceBrands in family, wellness, coaching
Warm pink (#FFC0CB)Nurturing, gentleness, calmingChild development, early childhood
Golden/yellow (2700K)Happiness, energy, sunlight, cozyGolden hour shots, living room scenes
Orange tonesFriendly, fights depression, invitingSocial/community-oriented images
Sage/olive (muted green)Natural, grounded, trustworthyEarthy brand palettes alongside terracotta

Step 2: Build the 4-Layer Prompt

Every prompt has exactly 4 layers:

## Prompt Architecture

[LAYER 1: STYLE] Technical photography direction
[LAYER 2: SUBJECT] Who/what is in the frame
[LAYER 3: EMOTION] Specific emotional cues
[LAYER 4: ANTI-PATTERNS] What to explicitly exclude

Layer 1 — Style Prefix (reuse across batch):

Warm minimalist photography. Soft natural light, shallow depth of field,
[BRAND PALETTE TONES]. Candid moment, not posed. [DEMOGRAPHIC].
Shot on 85mm f/1.8 lens, Kodak Portra 400 film look, natural skin texture.
No text, no logos, no watermarks. Warm color temperature.

Film stock trick: Adding "Kodak Portra 400" or "Kodak Portra 800" instantly introduces organic warmth, fine grain, and natural skin tones. This single phrase fights AI's default plastic/clinical rendering better than any other modifier.

HEX color precision (Flux 2+): Associate HEX codes with specific objects —

"The wall is #FAF8F5 cream"
works better than
"use #FAF8F5 in the image"
. Always pair HEX with a color name.

Key style levers:

LeverWarm directionCold direction (avoid)
LightSoft natural, golden hour, window lightStudio flash, overhead fluorescent
BackgroundCream, warm wood, sunlit roomWhite void, concrete, gray
Depth of fieldShallow (f/1.8) — intimacyDeep (f/11) — documentary
Color tempWarm (2700-3000K golden, 3200-4500K daylight)Cool (6500K+)
FramingClose, eye-level, inclusiveWide, above, distant
Film stockKodak Portra 400, Fujifilm Pro 400HNo film reference (digital default)
Texture"natural skin texture, pores, freckles""smooth skin, flawless" (= plastic)

Layer 2 — Subject:

A [age] [demographic] child [action verb + specific detail].
[Body language cue]. [One environmental detail].

Rules:

  • One action verb, one detail (not a paragraph)
  • Body language > facial expression for Flux
  • One environmental detail grounds the scene (wooden floor, sunlit garden)
  • "Mid-action" > "posing" (hands placing a block > holding a block)
  • Always specify demographics — Flux has training biases and will default if unspecified

Body language science — warm vs cold signals:

Warm signals (USE)Cold signals (AVOID)
Duchenne smile (eyes squeezing + mouth)Fake smile (mouth only, no eye engagement)
Direct eye contact, maintained gazeEyes turned to side or downward
Open posture, arms uncrossedArms crossed over chest (barrier)
Relaxed, self-assured stanceRigid posture, head tilted back
Physical proximity or gentle touchDistance between subjects
Leaning in, at eye levelLeaning away, looking from above

Layer 3 — Emotion Injection:

[Mood word]. [Light descriptor that reinforces mood].

Proven emotion-to-prompt mappings:

Target emotionPrompt language
Joy/delight"pure delight", "laughing", "arms wide"
Connection"eye contact", "faces close", "at eye level"
Curiosity"deeply focused", "hands mid-action", "slight smile"
Safety"gentle touch", "both at ease", "calm conversation"
Pride"standing tall", "determination", "just accomplished"
Possibility"looking up/ahead", "about to", "the moment before"

Layer 4 — Anti-Pattern Blockers:

Words that trigger cold/sad in AI generators:

NEVER useUse instead
alone
,
solitary
,
quiet room
single subject, clean background
studio lighting
,
white background
soft natural light, warm background
looking at camera
,
posing
candid moment
,
mid-action
dark
,
moody
,
dramatic
warm
,
soft
,
gentle
black and white
,
monochrome
warm tones
,
earth tones
empty
,
vast
,
sparse
minimal
,
clean
,
uncluttered
pensive
,
thoughtful
(alone)
focused
,
curious
,
engaged
sitting alone
sitting with [object/activity]
perfect
,
flawless
,
symmetry
natural
,
authentic
,
organic
smooth skin
,
airbrushed
natural skin texture
,
pores
,
subtle imperfections
3D render
,
CGI
,
hyperrealistic
photography
,
candid
,
film look

Negative prompt suffix (append to every prompt for Flux):

--no plastic skin, glossy surfaces, artificial lighting, airbrushed,
sterile, clinical, 3D render, CGI, harsh shadows, cool tones

Step 3: Validate Before Generating

Before sending to the API, run this checklist:

## Pre-Generation Checklist

- [ ] Can I name the target emotion in 2 words?
- [ ] Does the subject have an ACTION (not just a state)?
- [ ] Is there at least one warmth signal (light, touch, smile, color)?
- [ ] Are there zero isolation signals (alone, empty, quiet)?
- [ ] Is the demographic consistent with the brand?
- [ ] Does the style prefix match the batch?

Step 4: Evaluate Generated Images

Rate each generated image:

## Image Evaluation

**Emotional hit?** [Yes/No] — Does it trigger the target emotion within 2 seconds?
**Warmth level:** [1-5] — 1=clinical, 3=neutral, 5=cozy
**Brand fit:** [Yes/No] — Does it feel like it belongs on the brand's site?
**Minimalism quality:** [Clean/Busy] — Is the composition uncluttered?
**Stock photo test:** [Pass/Fail] — Would you mistake this for generic stock?

If emotional hit = No → rewrite Layer 3 (emotion) first
If warmth < 3 → add warm lighting/color cues to Layer 1
If stock photo test = Fail → make Layer 2 more specific (exact age, exact action)

Step 5: Iterate on Failures

Common failure patterns and fixes:

ProblemRoot causeFix
Image is beautiful but sadIsolation signals in promptAdd connection (person+person or person+activity)
Image is warm but genericSubject too vagueAdd one hyper-specific detail ("wooden blocks" not "toys")
Image feels like stock"Looking at camera" or "smiling"Switch to candid mid-action
Inconsistent batch styleStyle prefix variesCopy-paste exact same Layer 1
Wrong age/demographicGenerator defaultsBe explicit: "4-year-old", "Latin American"

Platform-Specific Guide: Flux 1.1 Pro

Flux is the primary recommended generator for warm minimalist photography. These rules are Flux-specific.

Syntax Rules

  • Write like you're talking to a photographer, not typing keywords into a search engine
  • Flux uses natural language — full sentences, NOT comma-separated keyword lists
  • 30-80 words is the sweet spot — under 200 tokens for optimal quality
  • NO prompt weights
    (text:1.5)
    is ignored. Use "with emphasis on X" instead
  • Specify demographics explicitly — Flux has training biases and will default to Western/white/young if unspecified

Recommended Structure

[Subject + specific details] performing [action] in [environment],
[lighting: direction + quality + time of day],
shot on [camera/lens], [film stock reference],
[composition note], [emotional tone],
natural skin texture, candid moment
--no [anti-patterns]

Camera Filename Trick (Raw Mode)

For hyper-realistic results, prefix the prompt with a camera filename:

  • IMG_2847.HEIC
    — iPhone snapshot realism
  • IMG_5847.CR2
    — Canon DSLR raw aesthetic
  • DSC03291.ARW
    — Sony camera look

Example:

IMG_2847.HEIC A mother and child laughing together at a café table, soft window light, slightly grainy, Kodak Portra 400 warmth, authentic moment

Raw Mode

Flux Raw Mode adds natural imperfections: skin texture, realistic lighting, natural asymmetry. Best for portraits and lifestyle. Use minimal prompts — let the realism engine do its work.

Aspect Ratios

  • 3:4 — Portrait cards, service pages (recommended for website cards)
  • 4:5 — Instagram, vertical lifestyle
  • 16:9 — Blog headers, hero images
  • 1:1 — Social media, profile images

Examples

Example 1: Child Development Website Cards

Context: 3 service area cards for a child psychologist's website. Brand is warm, approachable, cream/coral palette. Target: parents who want to understand their child's development.

Input:

Generate 3 card images for Desarrollo Motor, Emocional, Cognitivo. Warm minimalist, Flux 1.1 Pro, 3:4 ratio.

Output:

Style prefix (shared):

Warm minimalist photography. Soft natural light, shallow depth of field,
warm cream and beige tones. Candid moment, not posed. Latin American family.
Shot on 85mm f/1.8 lens. No text, no logos, no watermarks.
Warm color temperature, hopeful joyful mood.

Card 1 — Desarrollo Motor (Emotion: joyful possibility)

[STYLE PREFIX] A 4-year-old child mid-jump on warm wooden floor, arms wide
open, barefoot. Pure delight and concentration on face. Soft motion blur on
hands. Single subject, clean warm background. The moment right before
landing — suspended possibility.

Card 2 — Desarrollo Emocional (Emotion: intimate connection)

[STYLE PREFIX] A mother crouching at eye level with her 5-year-old daughter,
faces close, gentle smiles, eye contact. Golden hour light. Intimate
connection between adult and child. Minimal warm background, soft cream tones.

Card 3 — Desarrollo Cognitivo (Emotion: serene curiosity)

[STYLE PREFIX] A 6-year-old child sitting cross-legged on the floor, deeply
focused building a tall wooden block tower. Hands mid-action placing a block.
Calm concentration on face, slight smile. Soft overhead natural light,
earth tones.

Why this works: Each image has one clear emotion, one specific action, warm lighting, and no isolation signals. The shared style prefix ensures visual cohesion across the set.


Example 2: Fixing a "Beautiful but Sad" Image

Context: A minimalist illustration of a woman crouching alone was generated for a "limits without yelling" blog post. The art direction was "minimalist" but the result felt melancholy.

Input:

The line art minimalist image came back too sad. It's a woman crouching alone in black and white. Fix it.

Analysis of failure:

  • alone
    → isolation signal
  • black and white
    → removes warmth
  • crouching
    with no context → reads as defeated
  • No other person or activity → loneliness

Fixed prompt:

Warm minimalist photography. Soft natural light, shallow depth of field,
warm cream and beige tones. Candid moment, not posed.
Shot on 85mm f/1.8 lens. No text, no logos, no watermarks.
Warm color temperature.

A mother and 4-year-old child sitting face to face on a couch, mother
holding both of child's hands gently, calm conversation. Both at ease.
Warm living room light filtering through curtains.

What changed:

  • Solo → pair (connection defeats loneliness)
  • B&W → warm tones (color = life)
  • Crouching → sitting face to face (equals, not defeated)
  • Added environmental warmth (couch, living room light)

Skill Boundaries (Frontier Recognition)

This skill excels for:

  • Generating cohesive sets of 3-20+ images with consistent style
  • Warm/approachable brands (family, wellness, education, coaching)
  • Photorealistic AI generators (Flux, Midjourney v6+, DALL-E 3)

This skill is NOT ideal for:

  • Brands that WANT cold/clinical aesthetics (tech, luxury, medical) → Adjust Layer 1 accordingly
  • Abstract/conceptual images (infographics, diagrams) → Use
    data-visualizer
    skill instead
  • Product photography → Requires different prompt architecture
  • Illustration styles (watercolor, vector, line art) → Adapt Layer 1 for illustration-specific generators

Quality Checkpoints

Before accepting the output, verify:

  • 2-second gut check: does the image make you feel the target emotion?
  • Warmth score >= 4 out of 5
  • No accidental isolation signals in the composition
  • Consistent with the rest of the batch (same light, same tones)
  • Would NOT be mistaken for generic stock photography

Iteration Guide

"The first output is a starting point, not a destination."

Recommended Iteration Pattern

PassFocusQuestions to Ask
1stEmotion"Does this FEEL right within 2 seconds?"
2ndSpecificity"Is this too generic? What one detail would make it unique?"
3rdConsistency"Does this match the rest of the set?"
4thBrand"Would the client recognize this as THEIR brand?"

Useful Follow-up Prompts

  • "The image is warm but feels generic. Add one hyper-specific detail to the subject."
  • "The emotion is too [intense/subtle]. Dial it [down/up] by adjusting the body language."
  • "The background is too busy. Simplify to [one element] and increase the bokeh."
  • "This looks like stock. Make the child's action more specific — what exactly are their hands doing?"

Checklists & Templates

Batch Brief Template

## Image Batch Brief

**Brand:** ________________
**Palette:** ________________
**Demographic:** ________________
**Generator:** Flux 1.1 Pro / Midjourney v6 / DALL-E 3
**Aspect ratio:** ________________
**Number of images:** ________________

### Style Prefix (copy-paste for ALL prompts)
[Write once, use everywhere]

### Per-Image Briefs
| # | Subject | Target emotion (2 words) | Specific action |
|---|---------|--------------------------|-----------------|
| 1 | | | |
| 2 | | | |
| 3 | | | |

Red Flags Checklist

## Warning Signs in Your Prompts

- [ ] Any word from the "NEVER use" list (alone, empty, dark, moody, studio)
- [ ] Subject has no action verb (just standing/sitting with no activity)
- [ ] No warmth signal (no mention of light quality, color temperature, or human connection)
- [ ] Demographic not specified (generator will default to its biases)
- [ ] More than 3 adjectives in a row (over-direction = generic output)
- [ ] Prompt longer than 80 words (Flux sweet spot is 30-80 words, degrades past 200 tokens)

References

Core Methodology

  • Norman, Don. "Emotional Design" (2004) - Three levels of design processing (visceral, behavioral, reflective)
  • Annie Leibovitz. Masterclass on Portrait Photography - Light as emotion
  • Kittl x Savee. "2026 Design Trends Report" - Warm minimalism as dominant trend

Flux & AI Image Generation

Color Psychology & Neuroscience

Photography Technique

Warm Minimalism Trend

Art Direction Methodology

Related Skills


Skill Metadata

name: minimalist-image-director
category: ai-design
subcategory: art-direction
version: 2.0
author: GUIA
source_expert: Editorial Photography + Don Norman (Emotional Design) + Color Psychology + Neuroscience of Visual Perception + Black Forest Labs (Flux)
source_work: null
difficulty: intermediate
mode: centaur
estimated_value: Art director day rate (~500-800 EUR/day)
tags: [image-generation, art-direction, minimalism, flux, replicate, midjourney, brand-photography, emotional-design, color-psychology, warm-minimalism, kodak-portra]
created: 2026-02-12
updated: 2026-02-12

This skill is part of the GUIA Premium Marketing Skills Library — the 201 layer that bridges AI basics and technical implementation.