Geo-seo-claude geo-schema
Schema.org structured data audit and generation optimized for AI discoverability — detect, validate, and generate JSON-LD markup
git clone https://github.com/zubair-trabzada/geo-seo-claude
T=$(mktemp -d) && git clone --depth=1 https://github.com/zubair-trabzada/geo-seo-claude "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/geo-schema" ~/.claude/skills/zubair-trabzada-geo-seo-claude-geo-schema && rm -rf "$T"
skills/geo-schema/SKILL.mdGEO Schema & Structured Data
Purpose
Structured data is the primary machine-readable signal that tells AI systems what an entity IS, what it does, and how it connects to other entities. While schema markup has traditionally been about earning Google rich results, its role in GEO is fundamentally different: structured data is how AI models understand and trust your entity. A complete entity graph in structured data dramatically increases citation probability across all AI search platforms.
How to Use This Skill
- Fetch the target page HTML using
(see note below)fetch_page.py - Detect all existing structured data (JSON-LD, Microdata, RDFa)
- Validate detected schemas against Schema.org specifications
- Identify missing recommended schemas based on business type
- Generate ready-to-use JSON-LD code blocks
- Output GEO-SCHEMA-REPORT.md
Step 1: Detection
IMPORTANT: WebFetch converts HTML to markdown and strips
<head> content, which removes JSON-LD blocks. Use fetch_page.py instead:
python3 ~/.claude/skills/geo/scripts/fetch_page.py <url> page
The output includes a
structured_data array with all parsed JSON-LD blocks from the page.
Scan for JSON-LD
Look for
<script type="application/ld+json"> blocks in the HTML. Parse each block as JSON. A page may contain multiple JSON-LD blocks — collect all of them.
Scan for Microdata
Look for elements with
itemscope, itemtype, and itemprop attributes. Map the hierarchy of nested items. Note: Microdata is harder for AI crawlers to parse than JSON-LD. Flag a recommendation to migrate to JSON-LD if Microdata is the only format found.
Scan for RDFa
Look for elements with
typeof, property, and vocab attributes. Similar to Microdata — recommend migration to JSON-LD.
Priority Order
JSON-LD is the strongly recommended format for GEO. Google, Bing, and AI platforms all process JSON-LD most reliably. If the site uses Microdata or RDFa exclusively, flag this as a high-priority migration.
Step 2: Validation
For each detected schema block, validate:
- Valid JSON: Is the JSON-LD syntactically valid? Check for trailing commas, unquoted keys, malformed strings.
- Valid @type: Does the
match a recognized Schema.org type? Check against https://schema.org/docs/full.html.@type - Required Properties: Does the schema include all required properties for its type? (See per-type requirements below.)
- Recommended Properties: Does the schema include recommended properties that increase AI discoverability?
- sameAs Links: Does the schema include
properties linking to other platform presences?sameAs - URL Validity: Do all URLs in the schema resolve (not 404)?
- Nesting: Is the schema properly nested (e.g., author inside Article, address inside Organization)?
- Rendering Method: Is the JSON-LD in the server-rendered HTML or injected via JavaScript? Per Google's December 2025 guidance, JavaScript-injected structured data may face delayed processing. Flag any schema that requires JS execution.
Step 3: Schema Types for GEO
Organization (CRITICAL — every business site)
Essential for entity recognition across all AI platforms. This is how AI models identify WHAT the business is.
Required properties:
: "Organization" (or subtype: Corporation, LocalBusiness, etc.)@type
: Official business namename
: Official website URLurl
: URL to logo image (ImageObject preferred)logo
Recommended properties for GEO:
: Array of ALL platform URLs (see sameAs strategy below)sameAs
: 1-2 sentence description of the organizationdescription
: ISO 8601 datefoundingDate
: Person schemafounder
: PostalAddress schemaaddress
: ContactPoint with telephone, email, contactTypecontactPoint
: Geographic areaareaServed
: QuantitativeValuenumberOfEmployees
: Text or DefinedTermindustry
: Array of awards receivedaward
: Array of topics the organization is expert in (strong GEO signal)knowsAbout
LocalBusiness (for businesses with physical locations)
Extends Organization. Critical for local AI search results and Google Gemini.
Additional required properties:
: Full PostalAddressaddress
: Phone numbertelephone
: Operating hoursopeningHoursSpecification
Recommended for GEO:
: GeoCoordinates (latitude, longitude)geo
: Price indicatorpriceRange
: AggregateRating schemaaggregateRating
: Array of Review schemasreview
: URL to Google MapshasMap
Article + Author (CRITICAL for publishers)
The Author schema is one of the strongest E-E-A-T signals for AI platforms.
Article required:
: "Article" (or NewsArticle, BlogPosting, TechArticle)@type
: Article titleheadline
: ISO 8601datePublished
: ISO 8601 (critical for freshness signals)dateModified
: Person or Organization schemaauthor
: Organization schema with logopublisher
: Representative imageimage
Author (Person) required for GEO:
: Full namename
: Author page URL on the siteurl
: LinkedIn, Twitter, personal site, Google Scholar, ORCIDsameAs
: Professional titlejobTitle
: Organization schemaworksFor
: Array of expertise areasknowsAbout
: Educational institutionsalumniOf
: Professional awardsaward
Product (for e-commerce)
Required:
,name
,descriptionimage
: Offer with price, priceCurrency, availabilityoffers
: Brand schemabrand
orsku
/gtinmpn
Recommended for GEO:
: AggregateRatingaggregateRating
: Array of individual reviewsreview
: Product categorycategory
,material
,weight
,width
(where applicable)height
FAQPage
Status as of 2024: Google restricts FAQ rich results to government and health sites. However, the FAQPage schema still serves GEO purposes — AI platforms parse FAQ structured data for question-answer extraction. Implement it for AI readability even though rich results may not appear.
Structure:
: "FAQPage"@type
: Array of Question schemas, each withmainEntity
containing an Answer schemaacceptedAnswer
SoftwareApplication (for SaaS)
Required:
,namedescription
: e.g., "BusinessApplication"applicationCategory
: Supported platformsoperatingSystem
: Pricingoffers
Recommended for GEO:
: User ratingsaggregateRating
: Array of features (strong citation signal)featureList
: Screenshotsscreenshot
: Current versionsoftwareVersion
: Link to changelogreleaseNotes
WebSite + SearchAction (for sitelinks search box)
Structure:
{ "@type": "WebSite", "name": "Site Name", "url": "https://example.com", "potentialAction": { "@type": "SearchAction", "target": { "@type": "EntryPoint", "urlTemplate": "https://example.com/search?q={search_term_string}" }, "query-input": "required name=search_term_string" } }
Person (standalone — for personal brands, authors, thought leaders)
Use as a standalone schema on About/Bio pages. This builds the entity graph for individual expertise.
Required:
name, url
Recommended for GEO: sameAs, jobTitle, worksFor, knowsAbout, alumniOf, award, description, image
speakable Property (for voice/AI assistants)
The
speakable property marks specific sections of content as particularly suitable for voice and AI assistant consumption. Add to Article or WebPage schemas.
{ "@type": "Article", "speakable": { "@type": "SpeakableSpecification", "cssSelector": [".article-summary", ".key-takeaway"] } }
This signals to AI assistants which passages are the best candidates for citation or reading aloud.
Step 4: Deprecated/Changed Schemas to Flag
| Schema | Status | Note |
|---|---|---|
| HowTo | Rich results deprecated Aug 2023 | Still useful for AI parsing, but do not promise rich results |
| FAQPage | Restricted to govt/health Aug 2023 | Still useful for AI parsing (see above) |
| SpecialAnnouncement | Deprecated 2023 | Was for COVID; remove if still present |
| CourseInfo | Replaced by Course updates 2024 | Use updated Course schema properties |
VideoObject | Changed behavior 2024 | Must point to actual video file, not page URL |
| Review snippet | Stricter enforcement 2024 | Self-serving reviews on product pages may not display |
Flag any deprecated schemas found and recommend replacements.
Step 5: sameAs Strategy (CRITICAL for Entity Recognition)
The
sameAs property is the single most important structured data property for GEO. It tells AI systems: "This entity on my website is the SAME entity as these profiles elsewhere." This creates the entity graph that AI platforms use to verify, trust, and cite sources.
Recommended sameAs Links (in priority order)
- Wikipedia article — highest authority entity link
- Wikidata item — machine-readable entity identifier (e.g.,
)https://www.wikidata.org/wiki/Q12345 - LinkedIn — company page or personal profile
- YouTube — channel URL
- Twitter/X — profile URL
- Facebook — page URL
- Crunchbase — company profile (for startups/tech)
- GitHub — organization or personal profile (for tech)
- Google Scholar — author profile (for researchers/academics)
- ORCID — researcher identifier (for academics)
- Instagram — profile URL
- Apple App Store / Google Play — app listings (for software)
- BBB — Better Business Bureau listing (for US businesses)
- Industry directories — relevant vertical directories
sameAs Audit Process
- Collect all known web presences for the entity
- Check that each URL resolves (not 404 or redirected)
- Verify the Organization/Person schema includes ALL of them
- Check that the information on each platform is consistent (name, description, founding date, etc.)
- Flag any platforms where the entity should have a presence but does not
Step 6: JSON-LD Generation
Based on the detected business type, generate ready-to-paste JSON-LD blocks. Always generate:
- Organization or Person (depending on entity type) — always
- WebSite with SearchAction — always for the homepage
- Business-type-specific — Article for publishers, Product for e-commerce, LocalBusiness for local, SoftwareApplication for SaaS
- BreadcrumbList — for any page deeper than homepage
Generation Rules
- Use the
pattern to include multiple schemas in one JSON-LD block@graph - All URLs must be absolute (not relative)
- Include
properties for cross-referencing between schemas@id - Use ISO 8601 for all dates
- Include
on Article schemas with CSS selectors pointing to key content sectionsspeakable - Place JSON-LD in
section — NOT injected via JavaScript<head>
Template: Organization with Full GEO Signals
{ "@context": "https://schema.org", "@type": "Organization", "@id": "https://example.com/#organization", "name": "Company Name", "url": "https://example.com", "logo": { "@type": "ImageObject", "url": "https://example.com/logo.png", "width": 600, "height": 60 }, "description": "Concise description of what the company does.", "foundingDate": "2020-01-15", "founder": { "@type": "Person", "name": "Founder Name", "sameAs": "https://www.linkedin.com/in/founder" }, "address": { "@type": "PostalAddress", "streetAddress": "123 Main St", "addressLocality": "City", "addressRegion": "State", "postalCode": "12345", "addressCountry": "US" }, "contactPoint": { "@type": "ContactPoint", "telephone": "+1-555-555-5555", "contactType": "customer service", "email": "support@example.com" }, "sameAs": [ "https://en.wikipedia.org/wiki/Company_Name", "https://www.wikidata.org/wiki/Q12345", "https://www.linkedin.com/company/company-name", "https://www.youtube.com/@companyname", "https://twitter.com/companyname", "https://github.com/companyname", "https://www.crunchbase.com/organization/company-name" ], "knowsAbout": [ "Topic 1", "Topic 2", "Topic 3" ] }
Scoring Rubric (0-100)
| Criterion | Points | How to Score |
|---|---|---|
| Organization/Person schema present and complete | 15 | 15 if full, 10 if basic, 0 if none |
| sameAs links (5+ platforms) | 15 | 3 per valid sameAs link, max 15 |
| Article schema with author details | 10 | 10 if full author schema, 5 if name only, 0 if none |
| Business-type-specific schema present | 10 | 10 if complete, 5 if partial, 0 if missing |
| WebSite + SearchAction | 5 | 5 if present, 0 if not |
| BreadcrumbList on inner pages | 5 | 5 if present, 0 if not |
| JSON-LD format (not Microdata/RDFa) | 5 | 5 if JSON-LD, 3 if mixed, 0 if only Microdata/RDFa |
| Server-rendered (not JS-injected) | 10 | 10 if in HTML source, 5 if JS but in head, 0 if dynamic JS |
| speakable property on articles | 5 | 5 if present, 0 if not |
| Valid JSON + valid Schema.org types | 10 | 10 if no errors, 5 if minor issues, 0 if major errors |
| knowsAbout property on Organization/Person | 5 | 5 if present with 3+ topics, 0 if missing |
| No deprecated schemas present | 5 | 5 if clean, 0 if deprecated schemas found |
Output Format
Generate GEO-SCHEMA-REPORT.md with:
# GEO Schema & Structured Data Report — [Domain] Date: [Date] ## Schema Score: XX/100 ## Detected Schemas | Page | Schema Type | Format | Status | Issues | |---|---|---|---|---| | / | Organization | JSON-LD | Valid | Missing sameAs | | /blog/post-1 | Article | JSON-LD | Valid | No author schema | ## Validation Results [List each schema with pass/fail per property] ## Missing Recommended Schemas [List schemas that should be present based on business type but are not] ## sameAs Audit | Platform | URL | Status | |---|---|---| | Wikipedia | [URL or "Not found"] | Present/Missing | | LinkedIn | [URL or "Not found"] | Present/Missing | [Continue for all recommended platforms] ## Generated JSON-LD Code [Ready-to-paste JSON-LD blocks for each missing or incomplete schema] ## Implementation Notes - Where to place each JSON-LD block - Server-rendering requirements - Testing with Google Rich Results Test and Schema.org Validator