Awesome-omni-skill wandb-weave
Query and analyze W&B experiment data and Weave LLM traces using Python scripts. Use when working with Weights & Biases data, including (1) querying ML experiment runs, metrics, and hyperparameters, (2) analyzing LLM traces and evaluations, (3) creating W&B reports, (4) listing projects and entities.
install
source · Clone the upstream repo
git clone https://github.com/diegosouzapw/awesome-omni-skill
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/diegosouzapw/awesome-omni-skill "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/data-ai/wandb-weave" ~/.claude/skills/diegosouzapw-awesome-omni-skill-wandb-weave && rm -rf "$T"
manifest:
skills/data-ai/wandb-weave/SKILL.mdsource content
W&B & Weave Data Tools
Python scripts to query W&B experiment data and Weave LLM traces.
Prerequisites
pip install wandb weave export WANDB_API_KEY="your-api-key"
Workflow Decision Tree
What do you want to do? │ ├─ Query ML experiments (runs, metrics, sweeps) │ └─ Run: scripts/query_runs.py │ ├─ Analyze LLM traces │ ├─ Need trace data? → scripts/query_traces.py │ └─ Just need count? → scripts/query_traces.py --count-only │ ├─ Create a report │ └─ Run: scripts/create_report.py │ └─ List projects └─ Run: scripts/list_projects.py
Scripts
query_runs.py
Query W&B experiment runs with filtering and sorting.
# List recent runs python scripts/query_runs.py <entity> <project> --limit 10 # Filter by state python scripts/query_runs.py my-team my-project --state finished # Sort by metric (best first) python scripts/query_runs.py my-team my-project --sort "-summary_metrics.accuracy" # Custom filter python scripts/query_runs.py my-team my-project --filter '{"config.model": "gpt-4"}'
| Option | Description |
|---|---|
| Max results (default: 20) |
| Filter: running, finished, crashed, failed |
| Sort field (prefix for desc) |
| JSON filter dict |
| json or table |
query_traces.py
Query Weave LLM traces with filtering.
# List recent traces python scripts/query_traces.py <entity> <project> --limit 50 # Filter by status python scripts/query_traces.py my-team my-project --status success # Filter by model python scripts/query_traces.py my-team my-project --model gpt-4o # Find slow traces python scripts/query_traces.py my-team my-project --min-latency 5000 # Count only python scripts/query_traces.py my-team my-project --count-only
| Option | Description |
|---|---|
| Max results (default: 50) |
| Filter: success, error, running |
| Filter by model name |
| Min latency in ms |
| Only root traces |
| Return count, not data |
| Custom JSON filter (advanced) |
For advanced filter syntax (when
--status, --model, --min-latency are not enough), see references/weave_filters.md.
list_projects.py
List entities and projects.
# List all entities and projects python scripts/list_projects.py # List projects for specific entity python scripts/list_projects.py my-team # List entities only python scripts/list_projects.py --entities-only
create_report.py
Create W&B reports programmatically.
# Create with inline content python scripts/create_report.py my-team my-project "Weekly Summary" \ --content "## Results\n\n- Accuracy: 95%\n- Loss: 0.05" # Create from markdown file python scripts/create_report.py my-team my-project "Analysis" --file report.md # With description python scripts/create_report.py my-team my-project "Q4 Report" \ --content "..." --description "Quarterly analysis"
Common Workflows
Analyze Experiment Performance
# 1. Find your project python scripts/list_projects.py my-team # 2. Query best runs python scripts/query_runs.py my-team my-project \ --state finished \ --sort "-summary_metrics.accuracy" \ --limit 10 # 3. Create summary report python scripts/create_report.py my-team my-project "Best Runs" \ --content "## Top 10 Runs by Accuracy\n\n..."
Debug LLM Application
# 1. Count errors python scripts/query_traces.py my-team my-project --status error --count-only # 2. Get error details python scripts/query_traces.py my-team my-project --status error --limit 20 # 3. Find slow traces python scripts/query_traces.py my-team my-project --min-latency 5000
Resources
- Advanced trace filters: Load references/weave_filters.md when
option is needed for complex queries not covered by built-in options--filter