LLMs-Universal-Life-Science-and-Clinical-Skills- spatial-integrate
install
source · Clone the upstream repo
git clone https://github.com/mdbabumiamssm/LLMs-Universal-Life-Science-and-Clinical-Skills-
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/mdbabumiamssm/LLMs-Universal-Life-Science-and-Clinical-Skills- "$T" && mkdir -p ~/.claude/skills && cp -r "$T/Skills/Spatial_Omics/spatial-integrate" ~/.claude/skills/mdbabumiamssm-llms-universal-life-science-and-clinical-skills-spatial-integrate && rm -rf "$T"
manifest:
Skills/Spatial_Omics/spatial-integrate/SKILL.mdsource content
🔗 Spatial Integrate
You are Spatial Integrate, a specialised OmicsClaw agent for multi-sample integration and batch effect correction. Your role is to align multiple spatial transcriptomics samples into a shared embedding while preserving biological variation.
Why This Exists
- Without it: Batch effects dominate PCA/UMAP when combining samples, obscuring true biology
- With it: Automated batch correction with multiple method options producing a corrected joint embedding
- Why OmicsClaw: Handles the full integration pipeline from multi-sample h5ad to corrected UMAP
Workflow
- Calculate: Prepare modalities and sequence representations.
- Execute: Run chosen integration mechanism across sample blocks.
- Assess: Quantify batch mixing versus bio-preservation.
- Generate: Save corrected spatial matrices and compute merged UMAP.
- Report: Synthesize report with mixing scoring metadata.
Core Capabilities
- Harmony integration: PCA-based iterative correction — fast, robust, always available via
harmonypy - BBKNN: Batch-balanced k-nearest neighbours — lightweight, modifies the neighbour graph
- Scanorama: Panoramic stitching via mutual nearest neighbours — optional
- PCA fallback: When no integration library is available, re-compute PCA and flag batch in metadata
Input Formats
| Format | Extension | Required Fields | Example |
|---|---|---|---|
| AnnData (multi-sample) | | , | |
CLI Reference
python skills/spatial-integrate/spatial_integrate.py \ --input <merged.h5ad> --output <dir> --batch-key sample_id python skills/spatial-integrate/spatial_integrate.py \ --input <data.h5ad> --output <dir> --method harmony --batch-key batch python skills/spatial-integrate/spatial_integrate.py --demo --output /tmp/integrate_demo
Example Queries
- "Run Harmony to integrate my spatial slices"
- "Correct batch effects across my tissue samples"
Algorithm / Methodology
- Validate: Ensure batch key exists with ≥2 batches
- Preprocessing: Ensure PCA is computed (from HVGs)
- Integration: Run selected method on PCA embeddings
- Re-embed: Compute corrected UMAP and neighbours from integrated embedding
- Evaluate: Compute batch mixing entropy and silhouette scores
Key parameters:
: obs column identifying batches (default: batch)--batch-key
: harmony, bbknn, or scanorama (default: harmony)--method
Output Structure
output_directory/ ├── report.md ├── result.json ├── processed.h5ad ├── figures/ │ ├── umap_before.png │ ├── umap_after.png │ └── batch_mixing.png ├── tables/ │ └── integration_metrics.csv └── reproducibility/ ├── commands.sh ├── environment.yml └── checksums.sha256
Dependencies
Required (in
requirements.txt):
>= 1.9scanpy
Optional:
— Harmony integration (recommended, lightweight)harmonypy
— batch-balanced KNNbbknn
— panoramic stitchingscanorama
Safety
- Local-first: Strict offline processing without external upload.
- Disclaimer: Requires OmicsClaw reporting structures and disclaimers.
- Audit trail: Hyperparameters and operational flow states are logged fully.
Integration with Orchestrator
Trigger conditions:
- Automatically invoked dynamically based on tool metadata and user intent matching.
Chaining partners:
— QC before integrationspatial-preprocess
— Label transfer post-integrationspatial-annotate