Awesome-Agent-Skills-for-Empirical-Research event-study
git clone https://github.com/brycewang-stanford/Awesome-Agent-Skills-for-Empirical-Research
skills/27-dariia-m-my_claude_skills/event-studies/skill.mdEvent Study Skill
This skill covers everything needed to conduct event studies in R - from the traditional two-way fixed effects (TWFE) approach through modern robust estimators that handle staggered treatment timing and heterogeneous treatment effects.
When to Use This Skill
Use this skill when the user wants to:
- Create event study plots (coefficient plots around a treatment event)
- Test for parallel pre-trends in a difference-in-differences design
- Estimate dynamic treatment effects
- Work with staggered treatment adoption (units treated at different times)
- Implement any of the modern DiD estimators (Callaway & Sant'Anna, Sun & Abraham, etc.)
- Diagnose problems with TWFE under treatment effect heterogeneity
Quick Decision Tree
Before writing any code, determine the user's situation:
-
Is treatment timing the same for all treated units?
- Yes → Traditional TWFE event study is fine. See
traditional-event-study.md - No (staggered) → Go to step 2.
- Yes → Traditional TWFE event study is fine. See
-
Is there reason to expect heterogeneous treatment effects across cohorts or over time?
- No strong reason → Traditional TWFE may still be OK, but consider robust alternatives.
- Yes or unsure → Use a robust estimator. See
modern-extensions.md
-
What is the user's goal?
- Quick visualization of pre-trends → Traditional approach, fast and familiar.
- Publication-quality estimation → Recommend a robust estimator + traditional as comparison.
- Diagnostic check → Bacon decomposition to understand TWFE weights.
Core Concepts (Brief)
An event study plot displays estimated coefficients for leads (pre-treatment periods) and lags (post-treatment periods) relative to a baseline period (typically one period before treatment). The key elements are:
- Relative time: Time reindexed so that treatment occurs at period 0 for each unit.
- Baseline/reference period: Usually t = -1 (omitted from regression, normalized to zero).
- Pre-trend coefficients: Periods before treatment. If these are close to zero, it supports the parallel trends assumption.
- Post-treatment coefficients: These capture the dynamic treatment effect over time.
The parallel trends assumption states that, absent treatment, treated and control units would have followed the same trajectory. Pre-trend coefficients near zero are necessary (but not sufficient) evidence for this.
Workflow
For any event study request, follow this general workflow:
- Understand the data: Identify the panel structure (unit ID, time variable, treatment indicator, outcome). Ask the user if not clear.
- Determine treatment timing: Is it uniform or staggered?
- Choose the estimator: Use the decision tree above.
- Estimate the model: Follow the relevant reference file.
- Create the plot: Always produce a clean, publication-ready event study plot.
- Interpret results: Discuss pre-trends, post-treatment dynamics, and any concerns.
Reference Files
Read these as needed based on the user's situation:
-
- Traditional TWFE event study withtraditional-event-study.md
. Read this for any event study request. It covers the baseline approach, plotting, and the assumptions behind it.fixest -
- Modern robust estimators for staggered designs. Read this when treatment timing varies across units or when the user asks about heterogeneous treatment effects, or any of the newer DiD methods.modern-extensions.md -
- Pre-trend testing, placebo checks, Bacon decomposition, sensitivity analysis. Read this when the user wants to validate their design or when you spot potential issues.diagnostics-and-testing.md
Key R Packages
| Package | Purpose | When to use |
|---|---|---|
| TWFE event studies, Sun & Abraham | Default starting point |
| Callaway & Sant'Anna estimator | Staggered treatment, heterogeneous effects |
| Gardner (2022) two-stage DiD | Staggered treatment, clean decomposition |
| de Chaisemartin & D'Haultfoeuille | Staggered, robust to heterogeneity |
| Bacon decomposition | Diagnosing TWFE problems |
| Sensitivity analysis for pre-trends | Robustness checks on parallel trends |
| Plotting | Always |
| Regression tables | When user needs tables |
Plotting Standards
All event study plots should follow these defaults (user can override):
- Use
withggplot2
or a clean custom themetheme_minimal() - Include a vertical dashed line at t = 0 (treatment onset)
- Include a horizontal dashed line at y = 0 (null effect)
- IMPORTANT: Use discrete point estimates with vertical error bars (
+geom_point
), NOT connected lines with shaded ribbons (geom_errorbar
+geom_line
). Each periodgeom_ribbon
should show an individual point with its own error bar. This is the standard format in economics journals.t - Show 95% confidence intervals as error bars (vertical lines at each point)
- Label axes clearly: "Periods Relative to Treatment" (x) and the outcome name (y)
- Normalize the reference period coefficient to 0 explicitly
- Use colorblind-friendly palettes when comparing multiple groups
- If comparing estimators, use facets or distinct colors/shapes with a legend
Common Pitfalls to Watch For
- Forgetting to set the reference period: Always drop one pre-treatment period.
- Binning endpoint periods: With limited pre/post periods, bin the endpoints (e.g., "-5+" and "5+") to avoid small-sample noise.
- Not-yet-treated as controls: In staggered settings, TWFE uses not-yet-treated units as controls, which can introduce bias if treatment effects are heterogeneous.
- Interpreting pre-trends as proof of parallel trends: Pre-trends being zero is necessary but not sufficient. Absence of evidence is not evidence of absence.
- Unbalanced panels: Missing observations can distort event study estimates, especially at the endpoints.