Aiwg corpus-health

Report on research corpus health, completeness, and integrity

install
source · Clone the upstream repo
git clone https://github.com/jmagly/aiwg
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/jmagly/aiwg "$T" && mkdir -p ~/.claude/skills && cp -r "$T/agentic/code/frameworks/sdlc-complete/skills/corpus-health" ~/.claude/skills/jmagly-aiwg-corpus-health-82aabf && rm -rf "$T"
manifest: agentic/code/frameworks/sdlc-complete/skills/corpus-health/SKILL.md
source content

Corpus Health Command

Assess the health and completeness of the research corpus at

.aiwg/research/
.

Kernel Delegation

As of ADR-021,

corpus-health
delegates structural lint to the semantic memory kernel.

Delegation pattern:

  1. corpus-health
    retains its research-specific health-check UX
  2. Delegates to
    memory-lint --consumer research-complete --severity warning
  3. Research-specific layers remain in this wrapper:
    • GRADE coverage check
    • Citation completeness validation
    • Research corpus-specific metrics

Backward compatibility: No UX changes.

@agentic/code/addons/semantic-memory/skills/memory-lint/SKILL.md

Instructions

When invoked, analyze the research corpus and report on its health:

  1. Scan Corpus Structure

    • Count sources in
      .aiwg/research/sources/
    • Count findings in
      .aiwg/research/findings/
    • Count quality assessments in
      .aiwg/research/quality-assessments/
    • Count provenance records in
      .aiwg/research/provenance/records/
  2. Frontmatter Completeness

    • Check each source for required frontmatter per @$AIWG_ROOT/agentic/code/frameworks/sdlc-complete/schemas/research/frontmatter-schema.yaml
    • Report sources missing: ref_id, title, authors, year, DOI, key_findings
    • Calculate completeness percentage
  3. PDF Integrity

    • Check
      .aiwg/research/pdfs/
      for downloaded papers
    • Verify SHA-256 checksums against frontmatter
      pdf_hash
      values
    • Report missing PDFs and checksum mismatches
  4. Cross-Reference Integrity

    • Verify all REF-XXX in findings reference valid sources
    • Check for orphaned findings (no source reference)
    • Check for orphaned sources (never cited in any artifact)
  5. Staleness Check

    • Flag sources with
      last_verified
      > 90 days old
    • Flag DOIs that haven't been re-verified
    • Report sources needing refresh
  6. Evidence Gaps

    • Load
      .aiwg/research/TODO.md
      for planned research
    • Count unresolved evidence gaps
    • Report gap severity distribution
  7. Generate Health Report

    • Summary dashboard with pass/warn/fail indicators
    • Detailed findings per category
    • Action items prioritized by severity

Arguments

  • --brief
    - Show summary only
  • --fix
    - Attempt to fix frontmatter gaps and regenerate checksums
  • --report
    - Save report to
    .aiwg/reports/corpus-health.md

References

  • @$AIWG_ROOT/agentic/code/frameworks/sdlc-complete/schemas/research/frontmatter-schema.yaml - Frontmatter requirements
  • @$AIWG_ROOT/agentic/code/frameworks/sdlc-complete/schemas/research/fixity-manifest.yaml - Fixity verification
  • @$AIWG_ROOT/agentic/code/frameworks/sdlc-complete/rules/research-metadata.md - Research metadata rules
  • @$AIWG_ROOT/agentic/code/frameworks/sdlc-complete/agents/citation-verifier.md - Citation Verifier agent