ClawBio rnaseq-de
Differential expression analysis for bulk RNA-seq and pseudo-bulk count matrices with QC, PCA, and contrast testing.
install
source · Clone the upstream repo
git clone https://github.com/ClawBio/ClawBio
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/ClawBio/ClawBio "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/rnaseq-de" ~/.claude/skills/clawbio-clawbio-rnaseq-de && rm -rf "$T"
manifest:
skills/rnaseq-de/SKILL.mdsource content
🧬 RNA-seq Differential Expression
This skill performs differential expression on bulk RNA-seq or pseudo-bulk count matrices.
Core Capabilities
- Input validation for count matrix and sample metadata
- Pre-DE QC (library size, detected genes, low-count filtering)
- PCA visualisation on normalized expression
- Differential expression from formula + contrast
- Volcano and MA plots
- Markdown report with reproducibility files
Input Contract
- Count matrix (
or.csv
): rows are genes, columns are samples, first column is gene identifier.tsv - Metadata table (
or.csv
): one row per sample, must include.tsvsample_id - Formula: e.g.
or~ condition~ batch + condition - Contrast:
(e.g.factor,numerator,denominator
)condition,treated,control
Output Structure
rnaseq_de_report/ ├── report.md ├── figures/ │ ├── pca.png │ ├── volcano.png │ └── ma_plot.png ├── tables/ │ ├── qc_summary.csv │ ├── normalized_counts.csv │ └── de_results.csv └── reproducibility/ ├── commands.sh ├── environment.yml └── checksums.sha256
Usage
python rnaseq_de.py \ --counts counts.csv \ --metadata metadata.csv \ --formula "~ batch + condition" \ --contrast "condition,treated,control" \ --output report_dir
Safety
- Local-only processing
- Warn before overwriting existing output
- Report-level disclaimer required