Aegiscare website-cloning
Clone any website as a pixel-perfect React + Vite app using Playwright extraction.
git clone https://github.com/KartikSharma4448/aegiscare
T=$(mktemp -d) && git clone --depth=1 https://github.com/KartikSharma4448/aegiscare "$T" && mkdir -p ~/.claude/skills && cp -r "$T/frontend/.local/secondary_skills/website-cloning" ~/.claude/skills/kartiksharma4448-aegiscare-website-cloning && rm -rf "$T"
frontend/.local/secondary_skills/website-cloning/SKILL.mdClone Website — Pixel-Perfect Methodology
Reverse-engineer and rebuild a target website as an exact replica React + Vite clone. Every font, color, icon, image, section, background, transition, and interaction must match the original. Zero guessing, zero placeholders.
Cardinal Rules
-
Raw HTML is the source of truth. Before building ANY section, read the corresponding portion of
. Never build from memory, screenshots alone, or guessed structure.raw.html -
One component per visual pattern. Never reuse a component designed for one layout (e.g., product cards with prices) for a structurally different layout (e.g., category cards with just names). If two sections look different, they get different components or distinct CSS classes.
-
All assets downloaded before building starts. Every image, font, SVG, and video must be local in
before any component code is written. No mid-build downloads.public/ -
Build all sections, then verify the full page. Building and screenshotting section-by-section is too slow. Build all components from raw HTML, assemble in App.tsx, then take a full-page screenshot and fix any discrepancies. This is 3-5x faster than per-section verification loops.
-
No fabricated content. Every heading, subtitle, button label, badge, price, and link must come from the source HTML. Never invent text that doesn't exist on the original page.
-
Replace the scaffolded CSS entirely. The
scaffold includes Tailwind/shadcn boilerplate. ReplacecreateArtifact
completely with plain CSS — a Google Font import, CSS reset, CSS variables for design tokens, and nothing else. Clone pages don't use component libraries.index.css
Anti-Patterns (Common Mistakes to Avoid)
| Mistake | Correct Approach |
|---------|-----------------|
| Outlined/bordered buttons when original uses filled/solid | Check
raw.htmlfor button classes and extractbackground-color, notborder |
| Center-aligned text when original is left-aligned | Extract
text-align from computed styles |
| Adding badges/labels that don't exist in original | Only add elements that exist in
raw.html |
| Skipping sections or changing their order | Follow the section inventory checklist exactly |
| Using placeholder images | Download all images in Phase 1 before building |
| Reusing
ProductCard for category grids | Each visually distinct card type gets its own component |
| Guessing font sizes, colors, spacing | Extract exact computed values; never approximate |
| Building from screenshot interpretation alone | Always cross-reference
raw.html for structure and content |
| Using an SVG
<text>element for the logo | Extract the real SVG logo paths fromraw.html |
| Running per-section screenshot QA loops | Build all sections, then do one full-page verify pass |
| Keeping Tailwind/shadcn/Radix in a clone | Replace index.css with plain CSS; remove unused deps |
| Translating/anglicizing text from a non-English page | Clone must use the EXACT language shown on the target page |
| Guessing the announcement bar color | Extract computed
background-color from the banner element |
| Centering the logo when it's left-aligned | Take a header screenshot and compare logo position |
| Omitting the account/rewards bar text | Extract ALL header elements including loyalty/rewards UI |
Legitimate Use Policy
Before cloning, confirm the user's intent is legitimate. Ask:
- "Is this your own website or your client's website?"
- "What is this clone for?"
Acceptable: rebuilding your own site, design reference/learning, staging copy, platform migration.
Refuse if: impersonation, phishing, traffic theft, trademark infringement, or deception.
For non-owned sites (design inspiration), remind the user to replace logos, brand names, trademarks, product data, and contact info with their own.
Prerequisites
pip install playwright CHROMIUM_PATH=$(find /nix/store -maxdepth 4 -name "chromium" -type f 2>/dev/null | head -1) echo "Chromium at: $CHROMIUM_PATH"
Critical Playwright settings
-
Always use
args--no-sandbox -
Use
(notwait_until="domcontentloaded"
)"networkidle" -
Add
after navigationpage.wait_for_timeout(5000) -
Set
on alltimeout=60000
callspage.goto()
Phase 1: Reconnaissance & Extraction
All extraction happens before any building. See
extraction.md for complete Python scripts.
1.1 Save Raw HTML (THE SOURCE OF TRUTH)
raw_html = page.content() with open(f"{OUT_DIR}/raw.html", "w", encoding="utf-8") as f: f.write(raw_html)
This file is the authoritative reference for ALL section structure, content, class names, element ordering, and text content. Computed styles supplement it but never replace it.
1.2 Language & Locale Detection (CRITICAL)
If the target URL contains a locale path (e.g.,
/es-do,/fr,/de,/ja), the clone MUST be in that language. However, server-side rendering may return English even for locale URLs — the localization often happens via client-side JavaScript after page load.
Detection steps
- After
(extra wait for JS locale loading), extract all visible text from key areas:page.wait_for_timeout(8000)
locale_info = page.evaluate(""" () => ({ bannerText: document.querySelector('[class*="banner"], [class*="announcement"]')?.innerText?.trim(), navLinks: [...document.querySelectorAll('nav a, .main-nav a')].map(a => a.innerText.trim()).filter(t => t).slice(0, 8), loyaltyText: document.querySelector('[class*="loyalty"], [class*="rewards"]')?.innerText?.trim(), headerText: document.querySelector('header')?.innerText?.trim()?.slice(0, 500), htmlLang: document.documentElement.lang, url: window.location.href }) """)
- If the URL locale doesn't match the extracted text language, the page probably needs more time for JS to run, or the locale is cookie-based.
- When in doubt, use the language implied by the URL locale. If
shows English text in the raw HTML, translate all user-facing text to Spanish when building. The URL locale is the user's intent./es-do
Brand terms stay in the original language. Product names (e.g., "ALO Runner"), color names (e.g., "SUNSHINE"), brand names (e.g., "ALO Wellness Club") should NOT be translated — the real site keeps these in English even on localized pages.
1.3 Screenshots (Desktop only for initial build)
Take a full-page screenshot at 1440px. This becomes the primary visual reference. Tablet and mobile screenshots are only needed if the user specifically requests responsive behavior.
Take a separate header-only screenshot at this stage — crop to just the top 150px. This will be your reference for logo placement, nav layout, banner color, and account/rewards UI. Header issues are the most common mistakes.
1.4 Section Inventory
Parse the raw HTML to produce a complete ordered checklist. For each section, record:
-
Section index and DOM selector (tag, id, classes)
-
Exact heading text and subheading text
-
Button labels
-
Image count
-
Background color (if non-transparent)
Save as
clone-data/inventory.json. This becomes the build checklist.
1.5 Design Tokens
Extract CSS custom properties, body font-family, heading font-family, primary colors. Save to
clone-data/tokens.json.
1.6 Font Handling
Priority order
-
Download actual font files — Check
rules for@font-face
/.woff2
URLs. Download to.woff
and declarepublic/fonts/
in@font-face
.index.css -
Use Google Fonts if available — If the site uses Google Fonts, add the
or@import
tag.<link> -
Map to closest equivalent — Only as a last resort:
| Proprietary Font | Google Fonts Equivalent |
|-----------------|----------------------|
| Proxima Nova | DM Sans |
| Geograph | DM Sans |
| Self Modern | DM Serif Text |
| Graphik | Inter |
| Circular | DM Sans |
| GT Walsheim | Plus Jakarta Sans |
| Tiempos | Playfair Display |
| Apercu | Source Sans Pro |
| Founders Grotesk | Space Grotesk |
| National | DM Sans |
| Futura | Jost |
| Avenir | Nunito Sans |
| Gotham | Montserrat |
| Brandon Grotesque | Raleway |
1.7 SVG Logo Extraction (CRITICAL)
The site's logo is almost always an inline SVG in the
raw.html, NOT just text. Search for it:
# Search raw HTML for SVG near logo references python3 -c " with open('clone-data/raw.html') as f: html = f.read() # Search around 'logo' class references import re for m in re.finditer(r'logo', html[:15000], re.IGNORECASE): idx = m.start() # Look for SVG nearby svg_start = html.find('<svg', max(0, idx-200)) if svg_start != -1 and svg_start < idx + 500: svg_end = html.find('</svg>', svg_start) + 6 print(html[svg_start:svg_end]) break "
Never use an SVG
element as a logo substitute. Extract the real SVG<text>
<path> elements from the source HTML. The logo is the most recognizable element on the page — getting it wrong immediately signals "fake."
1.8 Asset Download (ALL assets, ALL at once)
Download every image, video, SVG, background image, and font file before building starts. See
extraction.md for the complete download script.
CDN URL upscaling (increase resolution before downloading):
-
Shopify
suffix:_small
→_small.jpg
(very common pattern)_1200x.jpg -
Shopify query params:
→?width=X?width=1200 -
Sanity:
→?w=X?w=1200 -
Cloudinary:
→w_Xw_1200 -
Contentful:
→?w=X?w=1200
Verification: After downloading, verify every file exists and is >100 bytes. The download script includes automatic retry with fallback User-Agent strings.
1.9 Header Deep Extraction (CRITICAL)
The header is the most error-prone section. Extract detailed information beyond the basic inventory:
header_info = page.evaluate(""" () => { const header = document.querySelector('header'); if (!header) return null; // Banner/announcement bar const banner = document.querySelector('[class*="banner"], [class*="announcement"], [class*="uni-banner"]'); const bannerBg = banner ? getComputedStyle(banner.querySelector('[class*="col"], div') || banner).backgroundColor : null; // Logo position const logo = header.querySelector('svg, [class*="logo"] img, [class*="logo"] svg'); const logoRect = logo?.getBoundingClientRect(); const headerRect = header.getBoundingClientRect(); // Nav links const navLinks = [...header.querySelectorAll('nav a, [class*="nav"] a')].map(a => a.innerText.trim()).filter(t => t && t.length < 30); // Right-side elements (account, rewards, search, cart, wishlist) const rightElements = [...header.querySelectorAll('[class*="loyalty"], [class*="rewards"], [class*="account"], [class*="cart"], [class*="wishlist"]')]; return { bannerText: banner?.innerText?.trim(), bannerBgColor: bannerBg, bannerTextColor: banner ? getComputedStyle(banner).color : null, logoPosition: logoRect ? (logoRect.left < headerRect.width / 3 ? 'left' : logoRect.left < headerRect.width * 2/3 ? 'center' : 'right') : 'unknown', navLinks: navLinks.slice(0, 10), rightSideText: rightElements.map(el => el.innerText?.trim()).filter(t => t), rightSideHTML: rightElements.map(el => el.innerHTML?.slice(0, 300)), }; } """)
This prevents the three most common header mistakes: wrong banner color, wrong logo position, missing account/rewards text.
1.10 Footer Link Extraction
Extract all footer links separately — they're needed for the footer component:
footer_data = page.evaluate(""" () => { const footer = document.querySelector('footer'); if (!footer) return null; return { text: footer.innerText, bgColor: getComputedStyle(footer).backgroundColor, links: [...footer.querySelectorAll('a')].map(a => ({ text: a.innerText.trim(), href: a.getAttribute('href') })).filter(l => l.text) }; } """)
Phase 2: Foundation Build
Sequential — do this yourself, not delegated.
- Create artifact via
with typecreateArtifact()react-vite - Replace
entirely — Remove ALL Tailwind/shadcn boilerplate. Write plain CSS:index.css
-
Google Fonts
(or@import
for self-hosted)@font-face -
Universal reset (
)*, *::before, *::after { margin: 0; padding: 0; box-sizing: border-box; } -
CSS variables for design tokens (font families, colors)
-
Basic body styles (font-family, color, background, antialiasing)
-
Reset styles for
,a
,button
,imgul/ol
- Replace
— Remove all router/query/toast boilerplate. A clone is a single static page.App.tsx - Organize assets in
public/images/
Phase 3: Build All Sections
Build all components from the section inventory, referencing
raw.html for exact content. Use inline styles or CSS modules — not Tailwind.
For EACH section in the inventory
Step 1: Read the raw HTML for that section's exact structure, text, and element hierarchy.
Step 2: Build the component using:
-
Exact text content from raw HTML (copy-paste headings, button labels)
-
Local image paths from
public/images/ -
Real SVG paths for logos/icons extracted from raw HTML
-
Inline styles for layout (position, display, flex, grid, padding, colors, fonts)
-
Hover interactions via
/onMouseEnter
inline handlersonMouseLeave -
For carousels:
+useRef
withscrollByoverflow-x: auto; scrollbar-width: none
Step 3: Use
prefix for all imageimport.meta.env.BASE_URL
src attributes so they resolve correctly under the artifact's preview path.
Build tips
-
Build ALL sections before verifying. Don't stop to screenshot after each one.
-
Use inline styles — Simpler than CSS files for clones, and avoids naming/scoping issues.
-
Reusable components are okay when the visual pattern is truly identical (e.g., two hero banners that differ only in image/button text can share a
component with props).HeroBanner -
is fine — For a visual clone, real link targets are a nice-to-have, not a requirement.href="#" -
Remove unused scaffolded dependencies — The
frompackage.json
includes 40+ shadcn/Radix packages. These are dead weight for a clone.createArtifact
Phase 4: Page Assembly & Verification
-
Import all components into
in exact DOM order from the inventoryApp.tsx -
Start the dev server and take a full-page screenshot at 1280px
-
Compare against the original screenshot from Phase 1
-
Fix discrepancies section by section
-
Run e2e test to verify all sections render (use
)runTest()
Verification checklist
-
All sections present in correct order
-
Logo is the real SVG (not text substitute)
-
Logo position matches (left/center/right)
-
All images load (no broken images in console)
-
Heading text matches exactly
-
All text is in the correct language (match URL locale)
-
Button styles match (filled vs outlined, correct colors)
-
Background colors match for sections with colored backgrounds
-
Announcement bar has correct background color AND text
-
Account/rewards/loyalty text is present in header (if original has it)
-
Carousels scroll properly
-
Hover states work on interactive elements
-
Footer has correct columns and content
Component Specification Format
For complex sections dispatched to subagents, write specs at
docs/research/components/<name>.md:
# <ComponentName> Specification ## Overview - Target file: `src/components/<ComponentName>.tsx` - Interaction model: <static | click | scroll | time> ## DOM Structure (from raw.html) <Exact element hierarchy with tag names, classes, nesting> ## Computed Styles (exact values) ### Container - display: flex; flex-direction: row; gap: 24px; padding: 60px 80px; ### Heading - font-size: 48px; font-weight: 400; color: \#230d0d; ### Button - background-color: \#f195a7; border-radius: 999px; padding: 12px 32px; ## Text Content (verbatim from raw.html) <Every heading, paragraph, button label — copy-pasted exactly> ## Assets (local paths) - /images/products/charm-1.webp ## States & Behaviors ### Hover on card - transform: none → scale(1.02) - transition: transform 0.3s ease
Quick Reference: Full Workflow
1. pip install playwright; find Chromium path 2. Navigate to target URL with Playwright 3. Save raw.html (page.content()) — THIS IS THE SOURCE OF TRUTH 4. Detect locale/language from URL path (e.g., /es-do = Spanish) 5. Take full-page desktop screenshot + header-only screenshot (top 150px) 6. Build section inventory from raw.html → clone-data/inventory.json 7. Extract design tokens → clone-data/tokens.json 8. Extract SVG logo from raw.html (search for <svg near 'logo' classes) 9. Extract header details: banner color, logo position, nav links, rewards/loyalty text 10. Extract fonts (download .woff2 files or map to Google Fonts) 11. Download ALL images/videos/SVGs to public/images/ (batch with retry) 12. Extract footer links → clone-data/footer.json 13. createArtifact("react-vite", ...) 14. Replace index.css (plain CSS reset + design tokens — NO Tailwind/shadcn) 15. Replace App.tsx (remove router/query boilerplate — single page) 16. Build ALL section components (referencing raw.html, using correct language) 17. Assemble page in App.tsx (exact DOM order from inventory) 18. Start dev server, take full-page screenshot, compare vs original 19. Fix discrepancies (check header first — most common mistake area) 20. Run e2e test to verify all sections render 21. Present artifact
Reference Files
— Complete Python extraction scripts (Playwright)extraction.md
— Detailed common pitfalls and solutionspitfalls.md