OpenClaw-Medical-Skills simo-multiomics-integration-agent

<!--

install
source · Clone the upstream repo
git clone https://github.com/FreedomIntelligence/OpenClaw-Medical-Skills
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/FreedomIntelligence/OpenClaw-Medical-Skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/simo-multiomics-integration-agent" ~/.claude/skills/freedomintelligence-openclaw-medical-skills-simo-multiomics-integration-agent && rm -rf "$T"
OpenClaw · Install into ~/.openclaw/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/FreedomIntelligence/OpenClaw-Medical-Skills "$T" && mkdir -p ~/.openclaw/skills && cp -r "$T/skills/simo-multiomics-integration-agent" ~/.openclaw/skills/freedomintelligence-openclaw-medical-skills-simo-multiomics-integration-agent && rm -rf "$T"
manifest: skills/simo-multiomics-integration-agent/SKILL.md
source content
<!-- # COPYRIGHT NOTICE # This file is part of the "Universal Biomedical Skills" project. # Copyright (c) 2026 MD BABU MIA, PhD <md.babu.mia@mssm.edu> # All Rights Reserved. # # This code is proprietary and confidential. # Unauthorized copying of this file, via any medium is strictly prohibited. # # Provenance: Authenticated by MD BABU MIA -->

name: 'simo-multiomics-integration-agent' description: 'AI-powered spatial integration of multi-omics datasets using probabilistic alignment for comprehensive tissue atlas construction and cellular state mapping.' measurable_outcome: Execute skill workflow successfully with valid output within 15 minutes. allowed-tools:

  • read_file
  • run_shell_command

SIMO Multiomics Integration Agent

The SIMO Multiomics Integration Agent performs spatial integration of multi-omics datasets through probabilistic alignment. Unlike previous tools limited to transcriptomics, SIMO integrates spatial transcriptomics with single-cell RNA-seq and expands to chromatin accessibility, DNA methylation, and proteomics data.

When to Use This Skill

  • When integrating spatial transcriptomics with single-cell multi-omics data.
  • For constructing comprehensive tissue atlases with spatial context.
  • To map epigenomic states (ATAC-seq, methylation) onto spatial coordinates.
  • When analyzing multi-modal cellular phenotypes in tissue architecture.
  • For spatial deconvolution combining multiple modalities.

Core Capabilities

  1. Spatial-scRNA Integration: Probabilistically align single-cell RNA-seq to spatial coordinates.

  2. Chromatin Accessibility Mapping: Project scATAC-seq profiles onto spatial tissue locations.

  3. DNA Methylation Spatial Mapping: Integrate single-cell methylation data with spatial context.

  4. Multi-Modal Fusion: Combine transcriptomic, epigenomic, and proteomic layers.

  5. Probabilistic Cell-Type Assignment: Assign cell types to spatial spots with uncertainty quantification.

  6. Spatial Niche Identification: Discover cellular niches defined by multi-omic signatures.

Supported Modalities

ModalityInput FormatSpatial Reference
scRNA-seqAnnData, SeuratVisium, MERFISH, Xenium
scATAC-seqSnapATAC2, ArchRVisium, Slide-seq
scMethylBismark, allcoolsAny spatial modality
CITE-seq (protein)AnnDataSpatial proteomics
Multi-ome (RNA+ATAC)Muon, SnapATAC2All platforms

Integration Algorithm

StepMethodPurpose
Feature SelectionHVG + marker genesReduce dimensionality
EmbeddingVariational autoencoderShared latent space
AlignmentOptimal transportProbabilistic matching
Spatial MappingGaussian processesSmooth spatial predictions
UncertaintyPosterior samplingConfidence intervals

Workflow

  1. Input: Spatial transcriptomics (Visium/MERFISH/Xenium), reference single-cell multi-omics.

  2. Preprocessing: Normalize, select features, QC both datasets.

  3. Embedding: Learn joint latent representation across modalities.

  4. Probabilistic Alignment: Compute cell-to-spot assignment probabilities.

  5. Spatial Imputation: Transfer modalities to spatial coordinates.

  6. Niche Analysis: Identify spatial domains by multi-omic signatures.

  7. Output: Integrated spatial multi-omics object, niche assignments, visualizations.

Example Usage

User: "Integrate our scRNA-seq and scATAC-seq data with the spatial transcriptomics to understand chromatin states in different tissue regions."

Agent Action:

python3 Skills/Genomics/SIMO_Multiomics_Integration_Agent/simo_integration.py \
    --spatial_data visium_data.h5ad \
    --scrna_ref scrna_atlas.h5ad \
    --scatac_ref scatac_atlas.h5ad \
    --modalities rna,atac \
    --n_spots_per_cell 5 \
    --uncertainty_quantification true \
    --output integrated_spatial_multiome.h5ad

Output Components

OutputDescriptionFormat
Integrated ObjectMulti-modal spatial dataAnnData/Muon
Cell Type MapSpatial cell type assignmentsGeoTIFF, CSV
Chromatin Accessibility MapSpatial ATAC patternsBigWig, CSV
Niche AssignmentsSpatial domain labelsCSV, Zarr
Uncertainty MapsPer-spot confidenceGeoTIFF
Gene Activity ScoresATAC-derived gene activityAnnData layer

Spatial Platforms Supported

PlatformResolutionSpots/CellsGenes
10x Visium55 μm~5,000Whole transcriptome
10x Visium HD8 μm~300,000Whole transcriptome
10x XeniumSubcellular>100,000300-5,000 panel
MERFISHSubcellular>1M100-10,000 panel
Slide-seq10 μm~60,000Whole transcriptome
CosMxSubcellular>1M1,000-6,000 panel

AI/ML Components

Variational Integration:

  • Multi-modal VAE for joint embeddings
  • Contrastive learning for modality alignment
  • Batch correction across datasets

Probabilistic Mapping:

  • Optimal transport with entropic regularization
  • Gaussian process spatial smoothing
  • Bayesian uncertainty estimation

Niche Discovery:

  • Multi-view clustering
  • Spatial autocorrelation (Moran's I)
  • Graph neural networks for niche boundaries

Prerequisites

  • Python 3.10+
  • Scanpy, Squidpy, Muon
  • scvi-tools, SnapATAC2
  • POT (Python Optimal Transport)
  • PyTorch, GPyTorch

Related Skills

  • scGPT_Agent - For foundation model embeddings
  • Spatial_Epigenomics_Agent - For spatial epigenomics analysis
  • Cell_Cell_Communication - For ligand-receptor analysis
  • Nicheformer_Spatial_Agent - For spatial niche modeling

Special Considerations

  1. Batch Effects: Pre-align datasets from different protocols
  2. Spot Deconvolution: Lower resolution platforms need deconvolution
  3. Sparsity: scATAC data requires aggregation strategies
  4. Compute: Multi-modal integration is memory-intensive
  5. Validation: Verify spatial patterns with known marker distributions

Applications

ApplicationUse Case
Tumor MicroenvironmentMap chromatin states of immune infiltrates
DevelopmentTrack lineage chromatin dynamics spatially
NeurodegenerationSpatial mapping of epigenetic changes
FibrosisUnderstand spatial activation programs

Author

AI Group - Biomedical AI Platform

<!-- AUTHOR_SIGNATURE: 9a7f3c2e-MD-BABU-MIA-2026-MSSM-SECURE -->