Awesome-Agent-Skills-for-Empirical-Research draft-paper

Draft sections of the sewage-house-prices academic paper. Handles section drafting for the Overleaf LaTeX manuscript, notation protocol, anti-hedging, and humanizer pass. This skill should be used when asked to "draft the paper", "write up the results", "write the intro", or draft any section of the manuscript.

install

source · Clone the upstream repo

git clone https://github.com/brycewang-stanford/Awesome-Agent-Skills-for-Empirical-Research

Claude Code · Install into ~/.claude/skills/

T=$(mktemp -d) && git clone --depth=1 https://github.com/brycewang-stanford/Awesome-Agent-Skills-for-Empirical-Research "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/41-sticerd-eee-sewage-econometrics-check/skills/draft-paper" ~/.claude/skills/brycewang-stanford-awesome-agent-skills-for-empirical-research-draft-paper && rm -rf "$T"

manifest: skills/41-sticerd-eee-sewage-econometrics-check/skills/draft-paper/SKILL.md

source content

Draft Paper

Draft a section (or full draft) of the "Sewage in Our Waters" academic paper on the causal impact of sewage spills on house prices and rents in England.

Input:

$ARGUMENTS

— section name optionally followed by specific instructions or notes.

Project-Specific Context

Paper Structure

The manuscript lives in

docs/overleaf/

with this structure:

```
_main.tex
```
— Master document (KOMA-Script
```
scrartcl
```
, APA biblatex,
```
libertinus
```
font)
```
01_introduction.tex
```
— Introduction
```
02_background_context.tex
```
— UK sewage policy and institutional background
```
03_motivating_evidence.tex
```
— Descriptive evidence and stylised facts
```
04_hydraulics_instrument.tex
```
— Hydraulic capacity instrument
```
05_research_question.tex
```
— Identification strategy and empirical specification
```
100_appendix_descriptives.tex
```
— Appendix: descriptive statistics
```
101_appendix_results.tex
```
— Appendix: supplementary results
```
102_appendix_dry_spills.tex
```
— Appendix: dry spill analysis
```
103_appendix_data.tex
```
— Appendix: data documentation
```
glossary.tex
```
— Glossary of terms
```
refs.bib
```
— Bibliography (APA style via
```
biblatex
```
)

Generated Output

```
output/tables/*.tex
```
— Regression tables (modelsummary → LaTeX with tabularray)
```
output/figures/
```
— Figures (maps, event studies, scatter plots)
```
book/
```
— Quarto website with exploratory analysis (
```
.qmd
```
files)

Econometric Approaches

The paper uses multiple identification strategies:

Hedonic regressions — Cross-sectional: spill count/hours (continuous + bins) × prior/full period, with LSOA FE
Repeat sales — Within-property variation (Palmquist 1982 approach)
Long difference — 250m grid-level changes, weighted/unweighted × all/exposed grids
News/media DiD — Google Trends and LexisNexis media coverage as treatment
Upstream/downstream — Directional spillover via river network topology
Dry spills — Spills occurring without rainfall as identification variation

Key Variables and Notation

Outcome:
```
log(price)
```
for sales,
```
log(rent)
```
for rentals

Treatment:

spill_count

spill_hrs

spill_count_daily_avg

spill_hrs_daily_avg

Geography:
```
n_spill_sites
```
within radius,
```
min_dist
```
/
```
mean_dist
```
to nearest site
Radii: 250m, 500m, 1000m, 2000m, 5000m, 10000m
Fixed effects: LSOA (
```
lsoa
```
), MSOA (
```
msoa
```
), year-quarter
Standard errors: heteroskedasticity-robust (
```
vcov = "hetero"
```
) via
```
fixest::feols()
```

LaTeX Conventions

Citations:
```
\textcite{}
```
for textual,
```
\parencite{}
```
for parenthetical (APA biblatex)
Tables:
```
tabularray
```
format with
```
booktabs
```
,
```
[H]
```
float placement
Equations: numbered with
```
\label{eq:...}
```

Workflow

Step 1: Context Gathering

Before drafting:

Read the existing section file in
```
docs/overleaf/
```
(if it exists)
Read
```
_main.tex
```
to understand document structure and preamble
Scan
```
output/tables/
```
for available regression output
Scan
```
output/figures/
```
for available figures
Check
```
refs.bib
```
for available citations
Read relevant
```
.qmd
```
files in
```
book/
```
for analysis context and results interpretation
Read relevant analysis scripts in
```
scripts/R/09_analysis/
```
for methodology details

Step 2: Section Routing

Based on

$ARGUMENTS

intro
: Draft
```
01_introduction.tex
```
background
: Draft
```
02_background_context.tex
```

motivating-evidence
: Draft

03_motivating_evidence.tex

instrument
: Draft
```
04_hydraulics_instrument.tex
```

research-question
: Draft

05_research_question.tex

results
: Draft results section(s) from regression output
conclusion
: Draft conclusion
abstract
: Draft abstract (requires other sections to exist)
appendix
: Draft or extend appendix sections
full
: Draft all sections in sequence, pausing between for user feedback
No argument: Ask which section to draft

Step 3: Drafting Standards

Introduction (~1,000-1,500 words)

Hook with UK sewage crisis context → research question → methodology overview → key findings → contribution → road map
Contribution paragraph names specific papers being advanced (environmental disamenity capitalisation, hedonic pricing literature)
Effect sizes with magnitudes and units (percentage impact on house prices per additional spill)

Background & Context (~800-1,200 words)

UK privatised water industry structure
EDM monitoring requirements and data availability (2021+)
Scale of the sewage spill problem
Policy and regulatory responses

Empirical Strategy (~800-1,200 words)

Identification assumption stated formally
Estimating equation displayed and numbered
Each approach (hedonic, repeat sales, long diff) with clear specification
Threats to identification addressed (sorting, omitted variables, measurement error)

Results (~800-1,500 words per approach)

Main specification with economic interpretation
Effect sizes in meaningful terms (£ impact, percentage change)
Robustness across radii, time periods, and specifications
Heterogeneity results

Conclusion (~500-700 words)

Restate headline effect sizes
Policy implications for water regulation
Limitations and future work (rental market, long-run effects)

Step 4: Humanizer Pass

Apply automatically as final pass:

Strip AI writing patterns (see
```
/humanizer
```
skill for full 24-pattern checklist)
Preserve formal academic structure where genuinely needed
Maintain citation density and technical vocabulary

Step 5: Quality Self-Check

Before presenting the draft:

Every displayed equation is numbered (
```
\label{eq:...}
```
)
All
```
\textcite{}
```
/
```
\parencite{}
```
keys exist in
```
refs.bib
```
Effect sizes stated with units and magnitudes
No banned hedging phrases ("interestingly", "it is worth noting")
Notation consistent with project conventions (LSOA, MSOA, radius distances)
All referenced tables/figures actually exist in
```
output/
```
LaTeX compiles cleanly with the KOMA-Script setup

Step 6: Present to User

Present each section for feedback. Flag:

TBD: Where empirical results are needed but not yet available
VERIFY: Citations that need user confirmation
PLACEHOLDER: Effect sizes awaiting final estimates

Principles

This is the authors' paper (Balboni & Dhingra), not Claude's. Match their voice and style from existing sections.
Never fabricate results. Use TBD placeholders. Cross-check numbers against
```
output/tables/
```
.
Citations must be verifiable. Only cite papers confirmed in
```
refs.bib
```
or the literature review in
```
docs/lit_review/
```
.
Humanizer is automatic. Every draft gets de-AI-ified before presentation.
Match existing conventions. Read existing
```
.tex
```
files first to match style, formatting, and level of formality.