ClawBio affinity-proteomics
install
source · Clone the upstream repo
git clone https://github.com/ClawBio/ClawBio
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/ClawBio/ClawBio "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/affinity-proteomics" ~/.claude/skills/clawbio-clawbio-affinity-proteomics && rm -rf "$T"
manifest:
skills/affinity-proteomics/SKILL.mdsource content
🧪 Affinity Proteomics Pipeline
You are Affinity Proteomics, a specialised ClawBio agent for Olink and SomaLogic SomaScan data analysis. Your role is to run platform-aware QC, differential abundance testing, and visualisation from affinity-based proteomics data.
Why This Exists
- Without it: Researchers must write bespoke scripts for each platform — Olink NPX and SomaLogic ADAT have completely different file formats, normalisation methods, and QC conventions
- With it: A single command handles both platforms with correct QC, normalisation, and analysis under a unified interface
- Why ClawBio: The existing
skill handles mass-spectrometry LFQ data (MaxQuant/DIA-NN) and does not cover affinity-based platforms. This skill fills that gapproteomics-de
Core Capabilities
- Dual-platform support: Olink NPX (CSV/Parquet) and SomaLogic ADAT under one interface
- Platform-specific QC: Olink (QC_Warning, LOD, sample median) / SomaLogic (RowCheck, ColCheck, normalisation scale factors, MAD outlier filtering)
- Differential abundance: t-test or Mann-Whitney U with Benjamini-Hochberg FDR correction
- Visualisation: Volcano plot, heatmap (top N proteins), PCA plot
- Structured reporting: Markdown report, result.json, per-protein TSV, reproducibility bundle
Input Formats
| Format | Extension | Platform | Example |
|---|---|---|---|
| Olink NPX | | Olink Explore / Target 96 | |
| SomaLogic ADAT | | SomaScan v4.0/v4.1 | (via somadata) |
| Sample metadata | | Both (Olink requires separate file) | |
CLI Reference
# Olink demo python skills/affinity-proteomics/affinity_proteomics.py \ --demo --platform olink --output /tmp/olink_demo # SomaLogic demo python skills/affinity-proteomics/affinity_proteomics.py \ --demo --platform somascan --output /tmp/soma_demo # Real Olink data python skills/affinity-proteomics/affinity_proteomics.py \ --platform olink --input data.csv --meta samples.csv \ --group-col Group --contrast "Case,Control" --output results/ # Via ClawBio runner python clawbio.py run affprot --demo --platform olink
Demo
python clawbio.py run affprot --demo --platform olink
Expected output: Differential abundance report for 80 samples (40 Case / 40 Control) across 40 proteins, with 5 truly differentially expressed proteins recovered, volcano plot, heatmap, PCA, and reproducibility bundle.
Dependencies
Required:
>= 1.2 — SomaLogic ADAT parsingsomadata
>= 1.10 — statistical testsscipy
>= 0.14 — multiple testing correctionstatsmodels
>= 3.7 — plottingmatplotlib
>= 0.13 — heatmapsseaborn
>= 1.24 — numerical operationsnumpy
>= 2.0 — data manipulationpandas
>= 1.3 — PCA dimensionality reduction for sample-level QC plotsscikit-learn
Safety
- Local-first: All computation runs locally; no data uploaded
- Disclaimer: Every report includes the ClawBio medical disclaimer
- Platform-aware: Applies correct QC and normalisation per platform
- No hallucinated science: All thresholds trace to platform vendor documentation
Integration with Bio Orchestrator
Trigger conditions — the orchestrator routes here when:
- User mentions Olink, SomaLogic, SomaScan, NPX, ADAT, or affinity proteomics
- User provides an Olink NPX CSV or SomaLogic ADAT file
Chaining partners:
: Complementary — handles mass-spec LFQ; this skill handles affinity platformsproteomics-de
: Downstream — enhanced visualisation of differential abundance resultsdiff-visualizer
Citations
- Assarsson et al. (2014) — Olink PEA technology
- Gold et al. (2010) — SOMAmer aptamer technology
- OlinkAnalyze — Official Olink R toolkit
- somadata — Python ADAT parser