Aegiscare website-cloning

Clone any website as a pixel-perfect React + Vite app using Playwright extraction.

install

source · Clone the upstream repo

git clone https://github.com/KartikSharma4448/aegiscare

Claude Code · Install into ~/.claude/skills/

T=$(mktemp -d) && git clone --depth=1 https://github.com/KartikSharma4448/aegiscare "$T" && mkdir -p ~/.claude/skills && cp -r "$T/frontend/.local/secondary_skills/website-cloning" ~/.claude/skills/kartiksharma4448-aegiscare-website-cloning && rm -rf "$T"

manifest: frontend/.local/secondary_skills/website-cloning/SKILL.md

source content

Clone Website — Pixel-Perfect Methodology

Reverse-engineer and rebuild a target website as an exact replica React + Vite clone. Every font, color, icon, image, section, background, transition, and interaction must match the original. Zero guessing, zero placeholders.

Cardinal Rules

Raw HTML is the source of truth. Before building ANY section, read the corresponding portion of
```
raw.html
```
. Never build from memory, screenshots alone, or guessed structure.
One component per visual pattern. Never reuse a component designed for one layout (e.g., product cards with prices) for a structurally different layout (e.g., category cards with just names). If two sections look different, they get different components or distinct CSS classes.
All assets downloaded before building starts. Every image, font, SVG, and video must be local in
```
public/
```
before any component code is written. No mid-build downloads.
Build all sections, then verify the full page. Building and screenshotting section-by-section is too slow. Build all components from raw HTML, assemble in App.tsx, then take a full-page screenshot and fix any discrepancies. This is 3-5x faster than per-section verification loops.
No fabricated content. Every heading, subtitle, button label, badge, price, and link must come from the source HTML. Never invent text that doesn't exist on the original page.
Replace the scaffolded CSS entirely. The
```
createArtifact
```
scaffold includes Tailwind/shadcn boilerplate. Replace
```
index.css
```
completely with plain CSS — a Google Font import, CSS reset, CSS variables for design tokens, and nothing else. Clone pages don't use component libraries.

Anti-Patterns (Common Mistakes to Avoid)

| Mistake | Correct Approach |

|---------|-----------------|

| Outlined/bordered buttons when original uses filled/solid | Check

raw.html

for button classes and extract

background-color

, not

border

| Center-aligned text when original is left-aligned | Extract

text-align

from computed styles |

| Adding badges/labels that don't exist in original | Only add elements that exist in

raw.html

| Skipping sections or changing their order | Follow the section inventory checklist exactly |

| Using placeholder images | Download all images in Phase 1 before building |

| Reusing

ProductCard

for category grids | Each visually distinct card type gets its own component |

| Guessing font sizes, colors, spacing | Extract exact computed values; never approximate |

| Building from screenshot interpretation alone | Always cross-reference

raw.html

for structure and content |

| Using an SVG

<text>

element for the logo | Extract the real SVG logo paths from

raw.html

| Running per-section screenshot QA loops | Build all sections, then do one full-page verify pass |

| Keeping Tailwind/shadcn/Radix in a clone | Replace index.css with plain CSS; remove unused deps |

| Translating/anglicizing text from a non-English page | Clone must use the EXACT language shown on the target page |

| Guessing the announcement bar color | Extract computed

background-color

from the banner element |

| Centering the logo when it's left-aligned | Take a header screenshot and compare logo position |

| Omitting the account/rewards bar text | Extract ALL header elements including loyalty/rewards UI |

Legitimate Use Policy

Before cloning, confirm the user's intent is legitimate. Ask:

"Is this your own website or your client's website?"
"What is this clone for?"

Acceptable: rebuilding your own site, design reference/learning, staging copy, platform migration.

Refuse if: impersonation, phishing, traffic theft, trademark infringement, or deception.

For non-owned sites (design inspiration), remind the user to replace logos, brand names, trademarks, product data, and contact info with their own.

Prerequisites


pip install playwright

CHROMIUM_PATH=$(find /nix/store -maxdepth 4 -name "chromium" -type f 2>/dev/null | head -1)

echo "Chromium at: $CHROMIUM_PATH"

Critical Playwright settings

Always use
```
--no-sandbox
```
args

Use

wait_until="domcontentloaded"

(not

"networkidle"

)

Add
```
page.wait_for_timeout(5000)
```
after navigation
Set
```
timeout=60000
```
on all
```
page.goto()
```
calls

Phase 1: Reconnaissance & Extraction

All extraction happens before any building. See

extraction.md

for complete Python scripts.

1.1 Save Raw HTML (THE SOURCE OF TRUTH)


raw_html = page.content()

with open(f"{OUT_DIR}/raw.html", "w", encoding="utf-8") as f:

f.write(raw_html)

This file is the authoritative reference for ALL section structure, content, class names, element ordering, and text content. Computed styles supplement it but never replace it.

1.2 Language & Locale Detection (CRITICAL)

If the target URL contains a locale path (e.g.,

/es-do

/fr

/de

/ja

), the clone MUST be in that language. However, server-side rendering may return English even for locale URLs — the localization often happens via client-side JavaScript after page load.

Detection steps

After
```
page.wait_for_timeout(8000)
```
(extra wait for JS locale loading), extract all visible text from key areas:


locale_info = page.evaluate("""

() => ({

bannerText: document.querySelector('[class*="banner"], [class*="announcement"]')?.innerText?.trim(),

navLinks: [...document.querySelectorAll('nav a, .main-nav a')].map(a => a.innerText.trim()).filter(t => t).slice(0, 8),

loyaltyText: document.querySelector('[class*="loyalty"], [class*="rewards"]')?.innerText?.trim(),

headerText: document.querySelector('header')?.innerText?.trim()?.slice(0, 500),

htmlLang: document.documentElement.lang,

url: window.location.href

})

""")

If the URL locale doesn't match the extracted text language, the page probably needs more time for JS to run, or the locale is cookie-based.
When in doubt, use the language implied by the URL locale. If
```
/es-do
```
shows English text in the raw HTML, translate all user-facing text to Spanish when building. The URL locale is the user's intent.

Brand terms stay in the original language. Product names (e.g., "ALO Runner"), color names (e.g., "SUNSHINE"), brand names (e.g., "ALO Wellness Club") should NOT be translated — the real site keeps these in English even on localized pages.

1.3 Screenshots (Desktop only for initial build)

Take a full-page screenshot at 1440px. This becomes the primary visual reference. Tablet and mobile screenshots are only needed if the user specifically requests responsive behavior.

Take a separate header-only screenshot at this stage — crop to just the top 150px. This will be your reference for logo placement, nav layout, banner color, and account/rewards UI. Header issues are the most common mistakes.

1.4 Section Inventory

Parse the raw HTML to produce a complete ordered checklist. For each section, record:

Section index and DOM selector (tag, id, classes)
Exact heading text and subheading text
Button labels
Image count
Background color (if non-transparent)

Save as

clone-data/inventory.json

. This becomes the build checklist.

1.5 Design Tokens

Extract CSS custom properties, body font-family, heading font-family, primary colors. Save to

clone-data/tokens.json

1.6 Font Handling

Priority order

Download actual font files — Check
```
@font-face
```
rules for
```
.woff2
```
/
```
.woff
```
URLs. Download to
```
public/fonts/
```
and declare
```
@font-face
```
in
```
index.css
```
.
Use Google Fonts if available — If the site uses Google Fonts, add the
```
@import
```
or
```
<link>
```
tag.
Map to closest equivalent — Only as a last resort:

| Proprietary Font | Google Fonts Equivalent |

|-----------------|----------------------|

| Proxima Nova | DM Sans |

| Geograph | DM Sans |

| Self Modern | DM Serif Text |

| Graphik | Inter |

| Circular | DM Sans |

| GT Walsheim | Plus Jakarta Sans |

| Tiempos | Playfair Display |

| Apercu | Source Sans Pro |

| Founders Grotesk | Space Grotesk |

| National | DM Sans |

| Futura | Jost |

| Avenir | Nunito Sans |

| Gotham | Montserrat |

| Brandon Grotesque | Raleway |

1.7 SVG Logo Extraction (CRITICAL)

The site's logo is almost always an inline SVG in the

raw.html

, NOT just text. Search for it:


# Search raw HTML for SVG near logo references

python3 -c "

with open('clone-data/raw.html') as f:

html = f.read()

# Search around 'logo' class references

import re

for m in re.finditer(r'logo', html[:15000], re.IGNORECASE):

idx = m.start()

# Look for SVG nearby

svg_start = html.find('<svg', max(0, idx-200))

if svg_start != -1 and svg_start < idx + 500:

svg_end = html.find('</svg>', svg_start) + 6

print(html[svg_start:svg_end])

break

"

Never use an SVG

<text>

element as a logo substitute. Extract the real SVG

<path>

elements from the source HTML. The logo is the most recognizable element on the page — getting it wrong immediately signals "fake."

1.8 Asset Download (ALL assets, ALL at once)

Download every image, video, SVG, background image, and font file before building starts. See

extraction.md

for the complete download script.

CDN URL upscaling (increase resolution before downloading):

Shopify
```
_small
```
suffix:
```
_small.jpg
```
→
```
_1200x.jpg
```
(very common pattern)
Shopify query params:
```
?width=X
```
→
```
?width=1200
```
Sanity:
```
?w=X
```
→
```
?w=1200
```
Cloudinary:
```
w_X
```
→
```
w_1200
```
Contentful:
```
?w=X
```
→
```
?w=1200
```

Verification: After downloading, verify every file exists and is >100 bytes. The download script includes automatic retry with fallback User-Agent strings.

1.9 Header Deep Extraction (CRITICAL)

The header is the most error-prone section. Extract detailed information beyond the basic inventory:


header_info = page.evaluate("""

() => {

const header = document.querySelector('header');

if (!header) return null;

// Banner/announcement bar

const banner = document.querySelector('[class*="banner"], [class*="announcement"], [class*="uni-banner"]');

const bannerBg = banner ? getComputedStyle(banner.querySelector('[class*="col"], div') || banner).backgroundColor : null;

// Logo position

const logo = header.querySelector('svg, [class*="logo"] img, [class*="logo"] svg');

const logoRect = logo?.getBoundingClientRect();

const headerRect = header.getBoundingClientRect();

// Nav links

const navLinks = [...header.querySelectorAll('nav a, [class*="nav"] a')].map(a => a.innerText.trim()).filter(t => t && t.length < 30);

// Right-side elements (account, rewards, search, cart, wishlist)

const rightElements = [...header.querySelectorAll('[class*="loyalty"], [class*="rewards"], [class*="account"], [class*="cart"], [class*="wishlist"]')];

return {

bannerText: banner?.innerText?.trim(),

bannerBgColor: bannerBg,

bannerTextColor: banner ? getComputedStyle(banner).color : null,

logoPosition: logoRect ? (logoRect.left < headerRect.width / 3 ? 'left' : logoRect.left < headerRect.width * 2/3 ? 'center' : 'right') : 'unknown',

navLinks: navLinks.slice(0, 10),

rightSideText: rightElements.map(el => el.innerText?.trim()).filter(t => t),

rightSideHTML: rightElements.map(el => el.innerHTML?.slice(0, 300)),

};

}

""")

This prevents the three most common header mistakes: wrong banner color, wrong logo position, missing account/rewards text.

1.10 Footer Link Extraction

Extract all footer links separately — they're needed for the footer component:


footer_data = page.evaluate("""

() => {

const footer = document.querySelector('footer');

if (!footer) return null;

return {

text: footer.innerText,

bgColor: getComputedStyle(footer).backgroundColor,

links: [...footer.querySelectorAll('a')].map(a => ({

text: a.innerText.trim(), href: a.getAttribute('href')

})).filter(l => l.text)

};

}

""")

Phase 2: Foundation Build

Sequential — do this yourself, not delegated.

Create artifact via
```
createArtifact()
```
with type
```
react-vite
```
Replace
index.css
entirely — Remove ALL Tailwind/shadcn boilerplate. Write plain CSS:

Google Fonts
```
@import
```
(or
```
@font-face
```
for self-hosted)

Universal reset (

*, *::before, *::after { margin: 0; padding: 0; box-sizing: border-box; }

)

CSS variables for design tokens (font families, colors)
Basic body styles (font-family, color, background, antialiasing)
Reset styles for
```
a
```
,
```
button
```
,
```
img
```
,
```
ul/ol
```

Replace
App.tsx
— Remove all router/query/toast boilerplate. A clone is a single static page.
Organize assets in
```
public/images/
```

Phase 3: Build All Sections

Build all components from the section inventory, referencing

raw.html

for exact content. Use inline styles or CSS modules — not Tailwind.

For EACH section in the inventory

Step 1: Read the raw HTML for that section's exact structure, text, and element hierarchy.

Step 2: Build the component using:

Exact text content from raw HTML (copy-paste headings, button labels)
Local image paths from
```
public/images/
```
Real SVG paths for logos/icons extracted from raw HTML
Inline styles for layout (position, display, flex, grid, padding, colors, fonts)
Hover interactions via
```
onMouseEnter
```
/
```
onMouseLeave
```
inline handlers

For carousels:

useRef

scrollBy

with

overflow-x: auto; scrollbar-width: none

Step 3: Use

import.meta.env.BASE_URL

prefix for all image

src

attributes so they resolve correctly under the artifact's preview path.

Build tips

Build ALL sections before verifying. Don't stop to screenshot after each one.
Use inline styles — Simpler than CSS files for clones, and avoids naming/scoping issues.
Reusable components are okay when the visual pattern is truly identical (e.g., two hero banners that differ only in image/button text can share a
```
HeroBanner
```
component with props).
```
href="#"
```
is fine — For a visual clone, real link targets are a nice-to-have, not a requirement.
Remove unused scaffolded dependencies — The
```
package.json
```
from
```
createArtifact
```
includes 40+ shadcn/Radix packages. These are dead weight for a clone.

Phase 4: Page Assembly & Verification

Import all components into
```
App.tsx
```
in exact DOM order from the inventory
Start the dev server and take a full-page screenshot at 1280px
Compare against the original screenshot from Phase 1
Fix discrepancies section by section
Run e2e test to verify all sections render (use
```
runTest()
```
)

Verification checklist

Component Specification Format

For complex sections dispatched to subagents, write specs at

docs/research/components/<name>.md


# <ComponentName> Specification

## Overview

- Target file: `src/components/<ComponentName>.tsx`
- Interaction model: <static | click | scroll | time>

## DOM Structure (from raw.html)

<Exact element hierarchy with tag names, classes, nesting>

## Computed Styles (exact values)

### Container

- display: flex; flex-direction: row; gap: 24px; padding: 60px 80px;

### Heading

- font-size: 48px; font-weight: 400; color: \#230d0d;

### Button

- background-color: \#f195a7; border-radius: 999px; padding: 12px 32px;

## Text Content (verbatim from raw.html)

<Every heading, paragraph, button label — copy-pasted exactly>

## Assets (local paths)

- /images/products/charm-1.webp

## States & Behaviors

### Hover on card

- transform: none → scale(1.02)
- transition: transform 0.3s ease

Quick Reference: Full Workflow


1. pip install playwright; find Chromium path
2. Navigate to target URL with Playwright

3. Save raw.html (page.content()) — THIS IS THE SOURCE OF TRUTH
4. Detect locale/language from URL path (e.g., /es-do = Spanish)

5. Take full-page desktop screenshot + header-only screenshot (top 150px)
6. Build section inventory from raw.html → clone-data/inventory.json

7. Extract design tokens → clone-data/tokens.json
8. Extract SVG logo from raw.html (search for <svg near 'logo' classes)

9. Extract header details: banner color, logo position, nav links, rewards/loyalty text
10. Extract fonts (download .woff2 files or map to Google Fonts)

11. Download ALL images/videos/SVGs to public/images/ (batch with retry)
12. Extract footer links → clone-data/footer.json

13. createArtifact("react-vite", ...)
14. Replace index.css (plain CSS reset + design tokens — NO Tailwind/shadcn)

15. Replace App.tsx (remove router/query boilerplate — single page)
16. Build ALL section components (referencing raw.html, using correct language)

17. Assemble page in App.tsx (exact DOM order from inventory)
18. Start dev server, take full-page screenshot, compare vs original

19. Fix discrepancies (check header first — most common mistake area)
20. Run e2e test to verify all sections render

21. Present artifact

Reference Files

```
extraction.md
```
— Complete Python extraction scripts (Playwright)
```
pitfalls.md
```
— Detailed common pitfalls and solutions