Claude-skill-registry epub-visual-auditor
Run and interpret visual QA pipeline for EPUB XHTML files. Use when user asks about layout, screenshots, visual issues, or rendering problems in REBRANDED_OUTPUT.
install
source · Clone the upstream repo
git clone https://github.com/majiayu000/claude-skill-registry
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/majiayu000/claude-skill-registry "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/data/epub-visual-auditor" ~/.claude/skills/majiayu000-claude-skill-registry-epub-visual-auditor && rm -rf "$T"
manifest:
skills/data/epub-visual-auditor/SKILL.mdsource content
EPUB Visual Auditor Skill
Purpose
Run the comprehensive visual QA pipeline to detect layout, typography, and rendering issues across all 44 chapter XHTML files in
REBRANDED_OUTPUT/. Generate screenshots, analyze computed styles, and produce actionable reports.
When to Invoke
- User asks about visual layout or rendering issues
- User requests screenshot comparison across viewports
- User wants to verify chapter formatting consistency
- User needs a visual audit before publication
- User mentions "how does chapter X look" or "check the layout"
Workflow
Step 1: Discover Targets
python3 scripts/find_44_targets.py \ --opf REBRANDED_OUTPUT/content.opf \ --out docs/REBRANDED_VISUAL_AUDIT.json
What it does:
- Parses
spine to extract exactly 44 chapter files in reading ordercontent.opf - Creates initial JSON report with file paths and metadata
- Records selection strategy and any ambiguities
Expected output:
with 44 entriesdocs/REBRANDED_VISUAL_AUDIT.json
Step 2: Visual Review
python3 scripts/visual_review.py \ --root REBRANDED_OUTPUT \ --targets docs/REBRANDED_VISUAL_AUDIT.json \ --screenshots-dir docs/screenshots \ --gallery docs/gallery/index.html
What it does:
- Launches headless Chromium via Playwright
- Renders each XHTML at two viewports:
and768x10241080x1440 - Captures screenshots:
- Title/top-of-document view
- Worksheets/interactive content
- Complex layouts (tables, figures, multi-column)
- Computes style samples for h1–h3, paragraphs, lists, images
- Detects layout issues:
- Missing fonts, suspicious fallbacks
- Text/image overflow or clipping
- Images exceeding viewport width
- Heading hierarchy violations
- Accessibility issues (alt text, contrast, ARIA)
Expected output:
(and 1080x1440 variant)docs/screenshots/{basename}/title_768x1024.png
(if applicable)docs/screenshots/{basename}/worksheets_768x1024_p1.png
(if applicable)docs/screenshots/{basename}/complex_768x1024_p1.png- Updated JSON with
,computed_style_samples
,issues
arraysscreenshots
with 44-row summary tabledocs/REBRANDED_VISUAL_AUDIT.md
Step 3: Interpret Results
After visual review completes:
-
Read the markdown summary:
cat docs/REBRANDED_VISUAL_AUDIT.md -
Check for failures:
- Look for chapters with "FAIL" verdict
- Review key issues column for recurring problems
-
Inspect JSON for details:
jq '.files[] | select(.issues | length > 0)' docs/REBRANDED_VISUAL_AUDIT.json -
View screenshots in gallery:
- Open
in browserdocs/gallery/index.html - Use keyboard navigation to compare viewports
- Open
-
Summarize findings for user:
- Report total PASS vs FAIL count
- Highlight top 3-5 recurring issues
- Recommend specific CSS or XHTML fixes
- Link to affected chapter line numbers when possible
Example Invocation
User: "Can you check if the chapter layouts look good on tablets?"
Agent response:
- Run Step 1 (discover targets)
- Run Step 2 (visual review)
- Wait for completion
- Read results and provide summary:
Visual QA Complete: 44 chapters analyzed - PASS: 41 chapters - FAIL: 3 chapters (issues detected) Common issues found: 1. Chapter IX: Heading hierarchy skip (h1 → h3 without h2) - Line 47 2. Chapter XV: Image overflow on 768px viewport - Line 203 3. Multiple chapters: Missing alt text on decorative images Recommended fixes: - Insert missing h2 in Chapter IX - Add max-width: 100% to image styles - Add alt="" role="presentation" to decorative images View full report: docs/REBRANDED_VISUAL_AUDIT.md Gallery: docs/gallery/index.html
Capabilities
- ✅ Render XHTML with local CSS (no network requests)
- ✅ Capture screenshots at 2 viewport sizes
- ✅ Detect overflow, clipping, font issues
- ✅ Validate heading hierarchy
- ✅ Check image alt text presence
- ✅ Compute style samples for typography analysis
- ✅ Generate interactive screenshot gallery
- ❌ Does NOT modify XHTML or CSS files automatically
- ❌ Does NOT fix issues (only reports them)
Integration with Other Skills
Pair with:
- After visual audit, analyze CSS coverage for unused rulescss-diagnostics
- Verify XHTML rendering matches POD PDFspdf-parity-checker
- Run EPUBCheck after fixing visual issuesepub-publication-validator
Notes
- First run may take 2-3 minutes (Playwright downloads Chromium if needed)
- Subsequent runs are faster (~30 seconds for 44 files)
- Screenshots are cached; delete
to regeneratedocs/screenshots/ - Gallery requires modern browser (Chrome, Firefox, Safari)