OpenSpace spreadsheet-proof-gate

Edit Excel workbooks only after anchored preflight audit, then require deterministic Python/openpyxl post-write proof that every requested criterion is satisfied or explicitly reconciled before finalizing.

install

source · Clone the upstream repo

git clone https://github.com/HKUDS/OpenSpace

Claude Code · Install into ~/.claude/skills/

T=$(mktemp -d) && git clone --depth=1 https://github.com/HKUDS/OpenSpace "$T" && mkdir -p ~/.claude/skills && cp -r "$T/gdpval_bench/skills/spreadsheet-audit-edit-validate-merged" ~/.claude/skills/hkuds-openspace-spreadsheet-proof-gate && rm -rf "$T"

manifest: gdpval_bench/skills/spreadsheet-audit-edit-validate-merged/SKILL.md

source content

Spreadsheet Proof Gate

Use this skill when you must modify, create, or validate an Excel workbook and the result must be proven against a user specification rather than merely described.

This skill combines:

workbook discovery and identity confirmation
pre-edit structure auditing
safe workbook editing
deterministic post-write verification from disk
a machine-checkable pass/fail checklist for every requested criterion
a hard finalization gate that rejects completion when any required criterion is unmet or unreconciled

This is a workflow skill. It does not require a specific editor, but it does require a direct Python verification pass with

openpyxl

before you report success.

When to use

Use this skill when any of the following are true:

the workbook path must be discovered
multiple similar Excel files may exist
the user requires exact sheet names, columns, counts, or samples
the task edits an existing workbook
a delegated agent may perform spreadsheet work
prior summaries are inconsistent or not trustworthy
exact post-edit workbook state matters
you must prove that mandatory criteria were satisfied

Core rule

Never finalize from narrative summary alone.

A workbook task is only complete when:

the target workbook was identified from the anchored workspace path
pre-edit structure matched the intended operation
the edit was performed on the correct file
the saved workbook was directly re-read from disk
every requested criterion was checked in a deterministic verification report
each criterion is marked as one of:
- ```
PASS
```
- ```
FAIL
```
- ```
UNAVAILABLE-IN-SOURCE
```
any non-pass result is explicitly reconciled before finalizing

If any mandatory criterion is still

FAIL

, do not declare success.

If a criterion is

UNAVAILABLE-IN-SOURCE

, only finalize if:

the source workbook truly lacks the needed input,
you state that clearly,
and the user request did not require inventing unsupported data.

Workspace anchoring rule

All discovery, reads, writes, and verification must be anchored to the exact workspace path provided for the task.

Rules:

treat the provided workspace path as the authoritative root
resolve the workbook to an exact path under that root
do not silently switch to a similarly named file elsewhere
do not trust a delegated tool's reported path without independently confirming it
if the anchored file cannot be found, stop rather than guessing

Outcome contract

Your output to yourself and to the user should be based on proof, not inference.

For every task, maintain three artifacts internally:

```
Candidate inventory
```
```
Pre-edit audit checklist
```
```
Post-write proof checklist
```

The post-write proof checklist is decisive.

Workflow

Phase 1: Discover candidate workbooks

Search under the exact workspace root for plausible Excel files:

```
.xlsx
```
```
.xlsm
```
```
.xls
```

Prefer files that:

match names mentioned by the user
live in likely data/output/project folders
have relevant modification times
contain expected sheet names
contain plausible row/column structure

If several files are plausible, inspect all plausible candidates before choosing.

Minimum evidence before selecting a workbook

Before choosing the workbook to edit, confirm at least:

exact anchored path
file existence
workbook readability
sheet names
approximate row counts in relevant sheets
relevant headers in target sheets

Do not select a workbook based on filename similarity alone.

Phase 2: Convert the user request into explicit criteria

Before editing, extract every requirement you can verify.

Possible criteria include:

required workbook path or output path
required sheet names
sheets to preserve
sheets to create
target sheet to edit
required columns
required formulas or summaries
minimum data rows
exact row counts
sample size
marker/flag count
selection rules
category coverage requirements
preservation of macros/formatting
preservation of untouched sheets

Turn these into an explicit checklist.

Criteria format

Represent each requested requirement as a discrete checkable item.

Good examples:

```
required-sheet: Summary exists
```
```
required-sheet: Sample exists
```

required-column: Sheet Sample has column "Selected"

sample-count: exactly 25 marked rows in Sample

coverage: at least 1 marked row where Product = Trade Finance

preservation: source sheet RawData still exists

formula: column H contains formulas for all populated rows

Bad examples:

```
workbook looks right
```
```
sampling seems okay
```
```
most tabs present
```

Phase 3: Pre-edit audit

Before any write, inspect the workbook structure against the extracted criteria.

Required pre-edit checks

For each relevant workbook:

confirm exact anchored path
list all sheet names
identify target sheets
inspect headers
count relevant rows
note formulas, merged cells, tables, filters, protections, or macros if relevant
confirm whether required columns exist already or must be created
confirm whether the planned edit is structurally safe

Identity consistency check

If you inspect the workbook multiple times or with multiple tools, the inspections must agree on:

exact path
sheet names
relevant row counts
used range or dimensions when available
relevant headers

If they disagree, treat workbook identity as unconfirmed.

Conflict reconciliation rule

If two inspections disagree on sheet names, row counts, dimensions, or headers, do not edit until you can produce all of the following:

an independent direct read from the exact anchored path
explicit evidence that this is the intended workbook
a reconciled explanation of which earlier inspection was wrong and why
a fresh verified inventory of sheets, dimensions, row counts, and headers from the anchored file

If you cannot reconcile the conflict, the decision is no-go.

Phase 4: Pre-edit go/no-go gate

Proceed only if all are true:

the workbook identity is confirmed
the target sheet is unambiguous
required columns exist or can be added safely
row counts are plausible for the requested operation
no unresolved inspection conflicts remain
the requested edit is possible from the available source data

Otherwise stop and report the mismatch.

Phase 5: Perform the edit

Only after the workbook passes pre-edit audit:

edit only the intended workbook
preserve untouched sheets unless instructed otherwise
preserve names unless renaming was requested
preserve formatting/macros/formulas when required
keep a clear mapping from user requirements to written cells/rows

Record at minimum:

workbook path edited
output path written
sheets modified
columns added or populated
rows added or updated
formulas inserted
rows marked/selected
any assumptions made

Phase 6: Deterministic post-write proof

This phase is mandatory.

After saving, directly inspect the saved workbook from disk with Python and

openpyxl

. Do not finalize from memory, delegated prose, or a generic success message.

The verification pass must produce a compact, machine-checkable checklist covering every extracted criterion.

Verification standard

The verification must be:

direct: reads the workbook from disk
deterministic: script-based, not conversational
anchored: uses exact workspace paths
criterion-based: checks each requirement explicitly
decisive: any unresolved failure blocks completion

Required proof outputs

At minimum, the verification must report:

inspected file path
whether the file exists
workbook sheet names
per-sheet headers for relevant sheets
per-sheet non-empty row counts
marker/selected-row counts if relevant
formula presence if relevant
preservation of required source sheets
existence of requested output files
criterion checklist with status for each item

Status vocabulary

Each criterion must end in exactly one status:

```
PASS
```
— requirement satisfied by direct inspection
```
FAIL
```
— requirement not satisfied
```
UNAVAILABLE-IN-SOURCE
```
— requirement could not be satisfied because required source information was absent from the workbook or inputs

Do not replace these with softer wording like:

looks okay
appears complete
probably satisfied
seems unavailable

Finalization gate

You may finalize only when every required criterion is either:

```
PASS
```
, or
```
UNAVAILABLE-IN-SOURCE
```
with explicit reconciliation

You must not finalize when:

any mandatory criterion remains
```
FAIL
```
a claimed output file was not verified from disk
the workbook path is uncertain
post-write proof was not run
you only have delegated summary evidence
requested sample counts and actual marked counts differ without reconciliation
required coverage criteria are missing and not repaired
required sheet names or headers are missing

If direct proof says the workbook is incomplete, that proof is authoritative.

Recommended proof procedure

Step 1: Build the criteria list

Create a numbered list of all checkable requirements from the user request.

Example:

output workbook exists at
```
<path>
```
sheet
```
Sample
```
exists
sheet
```
Summary
```
exists
```
Sample
```
has column
```
Selected
```
exactly 25 rows are marked selected
at least one selected row has
```
Product = Trade Finance
```
source sheet
```
RawData
```
still exists

Step 2: Run a direct Python inspection

Use a standalone script or inline Python.

openpyxl

is the default.

Verification script requirements

Your script should print, in a compact format:

file path
existence
sheet list
relevant headers
relevant row counts
relevant marked-row counts
any criterion-specific counts
one line per criterion with a status

Recommended verification script template

Adapt this template to the task.

from pathlib import Path
from openpyxl import load_workbook

WORKBOOK = Path("TARGET.xlsx")

TRUTHY = {"x", "yes", "true", "1", "y"}
MARKER_CANDIDATES = {"mark", "marked", "flag", "selected", "include", "status"}

def norm(v):
    if v is None:
        return ""
    return str(v).strip()

def lower(v):
    return norm(v).lower()

def first_nonempty_row(ws, max_scan=20):
    for r in ws.iter_rows(min_row=1, max_row=min(ws.max_row, max_scan), values_only=True):
        vals = list(r)
        if any(norm(v) != "" for v in vals):
            return vals
    return []

def data_rows(ws, header_row_idx=1):
    count = 0
    for row in ws.iter_rows(min_row=header_row_idx + 1, values_only=True):
        if any(norm(v) != "" for v in row):
            count += 1
    return count

def marker_index(headers):
    normalized = [lower(h) for h in headers]
    for i, h in enumerate(normalized):
        if h in MARKER_CANDIDATES:
            return i
    return None

def is_truthy(v):
    return lower(v) in TRUTHY

def marked_rows(ws, header_row_idx=1, idx=None):
    if idx is None:
        return 0
    count = 0
    for row in ws.iter_rows(min_row=header_row_idx + 1, values_only=True):
        if not any(norm(v) != "" for v in row):
            continue
        value = row[idx] if idx < len(row) else None
        if is_truthy(value):
            count += 1
    return count

criteria = []

print(f"WORKBOOK: {WORKBOOK}")
print(f"EXISTS: {WORKBOOK.exists()}")

if not WORKBOOK.exists():
    print("CRITERION|output-exists|FAIL|Workbook file missing")
    raise SystemExit(0)

wb = load_workbook(WORKBOOK, data_only=False)
print(f"SHEETS: {wb.sheetnames}")

for ws in wb.worksheets:
    headers = first_nonempty_row(ws)
    idx = marker_index(headers) if headers else None
    print(f"SHEET|{ws.title}|HEADERS|{headers}")
    print(f"SHEET|{ws.title}|DATA_ROWS|{data_rows(ws, header_row_idx=1)}")
    if idx is not None:
        print(f"SHEET|{ws.title}|MARKER_COLUMN|{headers[idx]}")
        print(f"SHEET|{ws.title}|MARKED_ROWS|{marked_rows(ws, header_row_idx=1, idx=idx)}")
    else:
        print(f"SHEET|{ws.title}|MARKER_COLUMN|<not found>")

# Add explicit criteria below. Examples:
required_sheets = ["Sample", "Summary"]
for s in required_sheets:
    status = "PASS" if s in wb.sheetnames else "FAIL"
    reason = "present" if status == "PASS" else "missing"
    print(f"CRITERION|required-sheet:{s}|{status}|{reason}")

Step 3: Add task-specific criteria to the script

The generic script is not enough by itself. Extend it to test the actual request, such as:

exact sample count
marker count by sheet
presence of category coverage
formulas in a target column
exact column names
output file existence
preservation of source sheets
expected row count delta
values meeting business rules

If the task depends on category coverage, test category coverage explicitly rather than assuming it from total counts.

Example criteria additions:

```
exactly 40 marked rows
```

at least one marked row for each required region

column "Review Note" populated for all marked rows

sheet "Summary" cell B2 contains numeric total

sheet "RawData" still present after save

Step 4: Reconcile non-pass results

For each non-pass criterion:

If status is

FAIL

Do one of:

repair the workbook and re-run proof
report failure clearly
ask for clarification if the request was ambiguous

Do not declare completion while a mandatory

FAIL

remains.

If status is

UNAVAILABLE-IN-SOURCE

State explicitly:

what source input was missing
which criterion could not be satisfied from source data
whether the task allowed this limitation
whether the workbook was left unchanged or partially completed because of it

Use

UNAVAILABLE-IN-SOURCE

only for true source limitations, not for editing mistakes.

Required machine-checkable checklist

Every post-write verification must include a checklist in this style:

```
CRITERION|<name>|PASS|<reason>
```
```
CRITERION|<name>|FAIL|<reason>
```

CRITERION|<name>|UNAVAILABLE-IN-SOURCE|<reason>

This format is intentionally rigid so you can reason from concrete output.

Checklist design rules

one criterion per line
criterion names should be specific
reasons should be short and factual
do not collapse multiple requirements into one broad criterion
if a requirement is critical, give it its own line

Decision rules

Hard no-go conditions before editing

Do not edit if:

workbook path is unconfirmed
candidate workbooks remain ambiguous
required target sheet cannot be identified
repeated inspections conflict and are unreconciled
the workbook cannot be read from the anchored path

Hard fail conditions after editing

Do not finalize if:

a required criterion is
```
FAIL
```
sample count differs from request and is unexplained
required category or rule-based coverage is missing
direct verification cannot read the saved workbook
only delegated-agent summary evidence exists
required output path was not verified
required source sheet disappeared unexpectedly

When delegated summaries disagree with direct proof

Trust direct proof from the deterministic script.

If a delegated agent claimed success but your direct verification shows missing sheets, wrong counts, missing categories, or missing output files, the task is not complete.

Practical checklist

Before editing

anchor to the exact workspace root
locate plausible Excel files
inspect all plausible candidates
record exact target path
list sheet names
inspect relevant headers
count relevant rows
detect inspection conflicts
reconcile any conflict by direct anchored read
convert the user request into explicit criteria
decide go/no-go

After editing

save the workbook to the intended path
re-open it from disk
run deterministic Python/openpyxl verification
print workbook facts
print one criterion line per requirement
repair any
```
FAIL
```
if safe
re-run verification
finalize only if every mandatory item is
```
PASS
```
or explicitly reconciled
```
UNAVAILABLE-IN-SOURCE
```

Final response pattern

When reporting to the user, base your summary only on verified facts from the direct proof output.

Include:

exact workbook path verified
whether the output file exists
sheet names present
relevant row counts
marked-row counts if relevant
the outcome of mandatory criteria
any remaining limitation explicitly

Preferred phrasing:

"I verified the saved workbook directly from disk with Python/openpyxl."
"The file exists at
```
...
```
."
"The workbook contains sheets
```
...
```
."
"The verification checklist found
```
N
```
pass items,
```
M
```
fail items, and
```
K
```
unavailable-in-source items."
"Because one mandatory criterion failed, I am not declaring completion."
"Because all mandatory criteria passed, the workbook is ready."

Anti-patterns to avoid

Do not:

finalize because an agent said it wrote the file
rely only on workbook-edit success messages
treat category coverage as satisfied without checking categories directly
assume sheet names from memory
quote counts without direct disk inspection
ignore a failed verification line
downgrade a real failure into vague prose
call a task complete when proof shows mandatory gaps

Why this skill exists

Spreadsheet tasks often fail not during editing but during final judgment. A workbook can be modified successfully and still be wrong.

This skill prevents false completion by forcing:

anchored workbook identity checks
deterministic post-write inspection
criterion-by-criterion proof
a hard stop when mandatory requirements remain unsatisfied

When workbook correctness matters, proof beats summary.