Awesome-Agent-Skills-for-Empirical-Research draft-paper
Draft sections of the sewage-house-prices academic paper. Handles section drafting for the Overleaf LaTeX manuscript, notation protocol, anti-hedging, and humanizer pass. This skill should be used when asked to "draft the paper", "write up the results", "write the intro", or draft any section of the manuscript.
install
source · Clone the upstream repo
git clone https://github.com/brycewang-stanford/Awesome-Agent-Skills-for-Empirical-Research
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/brycewang-stanford/Awesome-Agent-Skills-for-Empirical-Research "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/41-sticerd-eee-sewage-econometrics-check/skills/draft-paper" ~/.claude/skills/brycewang-stanford-awesome-agent-skills-for-empirical-research-draft-paper && rm -rf "$T"
manifest:
skills/41-sticerd-eee-sewage-econometrics-check/skills/draft-paper/SKILL.mdsource content
Draft Paper
Draft a section (or full draft) of the "Sewage in Our Waters" academic paper on the causal impact of sewage spills on house prices and rents in England.
Input:
$ARGUMENTS — section name optionally followed by specific instructions or notes.
Project-Specific Context
Paper Structure
The manuscript lives in
docs/overleaf/ with this structure:
— Master document (KOMA-Script_main.tex
, APA biblatex,scrartcl
font)libertinus
— Introduction01_introduction.tex
— UK sewage policy and institutional background02_background_context.tex
— Descriptive evidence and stylised facts03_motivating_evidence.tex
— Hydraulic capacity instrument04_hydraulics_instrument.tex
— Identification strategy and empirical specification05_research_question.tex
— Appendix: descriptive statistics100_appendix_descriptives.tex
— Appendix: supplementary results101_appendix_results.tex
— Appendix: dry spill analysis102_appendix_dry_spills.tex
— Appendix: data documentation103_appendix_data.tex
— Glossary of termsglossary.tex
— Bibliography (APA style viarefs.bib
)biblatex
Generated Output
— Regression tables (modelsummary → LaTeX with tabularray)output/tables/*.tex
— Figures (maps, event studies, scatter plots)output/figures/
— Quarto website with exploratory analysis (book/
files).qmd
Econometric Approaches
The paper uses multiple identification strategies:
- Hedonic regressions — Cross-sectional: spill count/hours (continuous + bins) × prior/full period, with LSOA FE
- Repeat sales — Within-property variation (Palmquist 1982 approach)
- Long difference — 250m grid-level changes, weighted/unweighted × all/exposed grids
- News/media DiD — Google Trends and LexisNexis media coverage as treatment
- Upstream/downstream — Directional spillover via river network topology
- Dry spills — Spills occurring without rainfall as identification variation
Key Variables and Notation
- Outcome:
for sales,log(price)
for rentalslog(rent) - Treatment:
,spill_count
,spill_hrs
,spill_count_daily_avgspill_hrs_daily_avg - Geography:
within radius,n_spill_sites
/min_dist
to nearest sitemean_dist - Radii: 250m, 500m, 1000m, 2000m, 5000m, 10000m
- Fixed effects: LSOA (
), MSOA (lsoa
), year-quartermsoa - Standard errors: heteroskedasticity-robust (
) viavcov = "hetero"fixest::feols()
LaTeX Conventions
- Citations:
for textual,\textcite{}
for parenthetical (APA biblatex)\parencite{} - Tables:
format withtabularray
,booktabs
float placement[H] - Equations: numbered with
\label{eq:...}
Workflow
Step 1: Context Gathering
Before drafting:
- Read the existing section file in
(if it exists)docs/overleaf/ - Read
to understand document structure and preamble_main.tex - Scan
for available regression outputoutput/tables/ - Scan
for available figuresoutput/figures/ - Check
for available citationsrefs.bib - Read relevant
files in.qmd
for analysis context and results interpretationbook/ - Read relevant analysis scripts in
for methodology detailsscripts/R/09_analysis/
Step 2: Section Routing
Based on
$ARGUMENTS:
: Draftintro01_introduction.tex
: Draftbackground02_background_context.tex
: Draftmotivating-evidence03_motivating_evidence.tex
: Draftinstrument04_hydraulics_instrument.tex
: Draftresearch-question05_research_question.tex
: Draft results section(s) from regression outputresults
: Draft conclusionconclusion
: Draft abstract (requires other sections to exist)abstract
: Draft or extend appendix sectionsappendix
: Draft all sections in sequence, pausing between for user feedbackfull- No argument: Ask which section to draft
Step 3: Drafting Standards
Introduction (~1,000-1,500 words)
- Hook with UK sewage crisis context → research question → methodology overview → key findings → contribution → road map
- Contribution paragraph names specific papers being advanced (environmental disamenity capitalisation, hedonic pricing literature)
- Effect sizes with magnitudes and units (percentage impact on house prices per additional spill)
Background & Context (~800-1,200 words)
- UK privatised water industry structure
- EDM monitoring requirements and data availability (2021+)
- Scale of the sewage spill problem
- Policy and regulatory responses
Empirical Strategy (~800-1,200 words)
- Identification assumption stated formally
- Estimating equation displayed and numbered
- Each approach (hedonic, repeat sales, long diff) with clear specification
- Threats to identification addressed (sorting, omitted variables, measurement error)
Results (~800-1,500 words per approach)
- Main specification with economic interpretation
- Effect sizes in meaningful terms (£ impact, percentage change)
- Robustness across radii, time periods, and specifications
- Heterogeneity results
Conclusion (~500-700 words)
- Restate headline effect sizes
- Policy implications for water regulation
- Limitations and future work (rental market, long-run effects)
Step 4: Humanizer Pass
Apply automatically as final pass:
- Strip AI writing patterns (see
skill for full 24-pattern checklist)/humanizer - Preserve formal academic structure where genuinely needed
- Maintain citation density and technical vocabulary
Step 5: Quality Self-Check
Before presenting the draft:
- Every displayed equation is numbered (
)\label{eq:...} - All
/\textcite{}
keys exist in\parencite{}refs.bib - Effect sizes stated with units and magnitudes
- No banned hedging phrases ("interestingly", "it is worth noting")
- Notation consistent with project conventions (LSOA, MSOA, radius distances)
- All referenced tables/figures actually exist in
output/ - LaTeX compiles cleanly with the KOMA-Script setup
Step 6: Present to User
Present each section for feedback. Flag:
- TBD: Where empirical results are needed but not yet available
- VERIFY: Citations that need user confirmation
- PLACEHOLDER: Effect sizes awaiting final estimates
Principles
- This is the authors' paper (Balboni & Dhingra), not Claude's. Match their voice and style from existing sections.
- Never fabricate results. Use TBD placeholders. Cross-check numbers against
.output/tables/ - Citations must be verifiable. Only cite papers confirmed in
or the literature review inrefs.bib
.docs/lit_review/ - Humanizer is automatic. Every draft gets de-AI-ified before presentation.
- Match existing conventions. Read existing
files first to match style, formatting, and level of formality..tex