OpenClaw-Medical-Skills rfdiffusion

install
source · Clone the upstream repo
git clone https://github.com/FreedomIntelligence/OpenClaw-Medical-Skills
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/FreedomIntelligence/OpenClaw-Medical-Skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/rfdiffusion" ~/.claude/skills/freedomintelligence-openclaw-medical-skills-rfdiffusion && rm -rf "$T"
OpenClaw · Install into ~/.openclaw/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/FreedomIntelligence/OpenClaw-Medical-Skills "$T" && mkdir -p ~/.openclaw/skills && cp -r "$T/skills/rfdiffusion" ~/.openclaw/skills/freedomintelligence-openclaw-medical-skills-rfdiffusion && rm -rf "$T"
manifest: skills/rfdiffusion/SKILL.md
safety · automated scan (medium risk)
This is a pattern-based risk scan, not a security review. Our crawler flagged:
  • pip install
  • downloads files (wget)
Always read a skill's source content before installing. Patterns alone don't mean the skill is malicious — but they warrant attention.
source content

RFdiffusion Backbone Generation

Prerequisites

RequirementMinimumRecommended
Python3.9+3.10
CUDA11.7+12.0+
GPU VRAM16GB24GB (A10G)
RAM16GB32GB

How to run

First time? See Installation Guide to set up Modal and biomodals.

Option 1: Modal (recommended)

# Clone biomodals
git clone https://github.com/hgbrian/biomodals && cd biomodals

# Basic binder design
modal run modal_rfdiffusion.py \
  --pdb target.pdb \
  --contigs "A1-150/0 70-100" \
  --hotspot "A45,A67,A89" \
  --num-designs 100

# With custom GPU/timeout
GPU=A100 TIMEOUT=60 modal run modal_rfdiffusion.py \
  --pdb target.pdb \
  --contigs "A1-150/0 70-100" \
  --num-designs 100

GPU: A10G (24GB) | Timeout: 30min default

Option 2: Local installation

# Clone and install
git clone https://github.com/RosettaCommons/RFdiffusion.git
cd RFdiffusion && pip install -e .

# Download weights
wget http://files.ipd.uw.edu/pub/RFdiffusion/models/Complex_base_ckpt.pt

# Run inference
python run_inference.py \
  inference.input_pdb=target.pdb \
  contigmap.contigs=[A1-150/0 70-100] \
  ppi.hotspot_res=[A45,A67,A89] \
  inference.num_designs=100

Config Schema (Hydra)

Contigmap Syntax

# De novo single chain (50-100 residues)
contigmap.contigs=[50-100]

# Binder + target (A = target chain, fixed with /0)
contigmap.contigs=[A1-150/0 70-100]

# Motif scaffolding (preserve residues, /0 = fixed)
contigmap.contigs=[20-40/0 A10-30/0 20-40]

# Multi-chain binder
contigmap.contigs=[A1-100/0 B1-100/0 60-80]

# Variable length ranges
contigmap.contigs=[A1-150/0 50-100]  # Binder 50-100 AA

Hotspot Specification

# Residues for interface (chain + resnum, no spaces)
ppi.hotspot_res=[A45,A67,A89]

Common mistakes

Contig Syntax

Correct:

contigmap.contigs=[A1-150/0 70-100]  # Target fixed (/0), binder variable

Wrong:

contigmap.contigs=[A1-150 70-100]    # Missing /0 - target will move!
contigmap.contigs="A1-150/0 70-100"  # Quotes break parsing
contigmap.contigs=[A1-150/0, 70-100] # Comma breaks parsing

Hotspot Residues

Correct:

ppi.hotspot_res=[A45,A67,A89]        # Chain letter + residue number

Wrong:

ppi.hotspot_res=[45,67,89]           # Missing chain letter
ppi.hotspot_res=[A45, A67, A89]      # Spaces break parsing
ppi.hotspot_res="A45,A67,A89"        # Quotes break parsing

Complete Parameter Reference

Core Parameters

ParameterDefaultRangeDescription
inference.num_designs
101-10000Number of designs to generate
inference.input_pdb
-pathTarget structure file
inference.output_prefix
outputstringOutput filename prefix
diffuser.T
5020-200Diffusion timesteps
denoiser.noise_scale_ca
1.00.0-2.0CA atom noise (0.5-0.8 = conservative)
denoiser.noise_scale_frame
1.00.0-2.0Frame noise
inference.ckpt_override_path
-pathModel checkpoint
potentials.guide_scale
1.00.1-10Guidance strength
potentials.guide_decay
constantstringDecay type

Advanced Parameters

ParameterDefaultDescription
diffuser.partial_T
NoneStart diffusion from timestep T (partial diffusion)
contigmap.inpaint_str
NoneSequence positions to inpaint
scaffoldguided.scaffoldguided
falseEnable scaffold-guided generation
scaffoldguided.target_pdb
NoneScaffold template PDB
ppi.binderlen
NoneSpecify exact binder length

Symmetry Parameters

ParameterDefaultDescription
symmetry.symmetry
NoneSymmetry type (C2, C3, C4, D2, etc.)
symmetry.recenter
trueRecenter symmetric assembly
symmetry.radius
NoneRadius constraint for symmetric assembly

Fold Conditioning

ParameterDefaultDescription
contigmap.provide_seq
NoneProvide sequence for fold conditioning
contigmap.inpaint_seq
NonePositions for sequence inpainting

Model Checkpoints

CheckpointUse Case
Complex_base_ckpt.pt
Binder design (default)
Base_ckpt.pt
De novo monomers
ActiveSite_ckpt.pt
Active site scaffolding
InpaintSeq_ckpt.pt
Sequence inpainting

Common workflows

Binder Design

  1. Prepare target PDB (trim to binding region + 10A buffer)
  2. Identify 3-6 hotspot residues (exposed, conserved)
  3. Generate 100-500 backbones
  4. Pass to proteinmpnn for sequence design

Motif Scaffolding

  1. Extract motif coordinates
  2. Use
    /0
    to fix motif in contigmap
  3. Generate surrounding scaffold
  4. Validate motif preservation (RMSD < 1.5A)

Symmetric Oligomers

# C3 symmetric trimer
python run_inference.py \
  symmetry.symmetry=C3 \
  contigmap.contigs=[100-150] \
  inference.num_designs=50

# D2 symmetric tetramer
python run_inference.py \
  symmetry.symmetry=D2 \
  contigmap.contigs=[80-120] \
  symmetry.radius=25

# Supported symmetries: C2, C3, C4, C5, C6, D2, D3, D4, tetrahedral, octahedral

Partial Diffusion (Refinement)

# Start from existing structure, diffuse from timestep 10
python run_inference.py \
  inference.input_pdb=initial.pdb \
  diffuser.partial_T=10 \
  contigmap.contigs=[A1-100]

Output format

output/
├── output_0.pdb       # Generated backbone
├── output_1.pdb
├── ...
└── output_99.pdb

Each PDB contains polyalanine backbone - use proteinmpnn for sequence.

Sample output

Successful run

$ python run_inference.py inference.input_pdb=target.pdb contigmap.contigs=[A1-150/0 70-100] inference.num_designs=100
[INFO] Loading model from Complex_base_ckpt.pt
[INFO] Generating design 1/100...
[INFO] Generating design 50/100...
[INFO] Generating design 100/100...
[INFO] Saved 100 designs to output/

Generated:
output/output_0.pdb (85 residues)
output/output_1.pdb (92 residues)
...

What good output looks like:

  • File size: 3-8 KB per PDB (backbone only)
  • Residue count within specified range
  • Secondary structure visible in PyMOL (helices/sheets, not random coil)

Decision tree

Should I use RFdiffusion?
│
├─ Need to generate protein backbone?
│  ├─ Yes → Continue below
│  └─ No, already have backbone → Use ProteinMPNN
│
├─ What type of design?
│  ├─ Binder for protein target → RFdiffusion ✓
│  ├─ De novo monomer → RFdiffusion ✓
│  ├─ Motif scaffolding → RFdiffusion ✓
│  └─ Symmetric assembly → RFdiffusion ✓
│
└─ Priority?
   ├─ Need highest success rate → Consider BindCraft
   ├─ Need diversity/exploration → RFdiffusion ✓
   └─ Need all-atom precision → Consider BoltzGen

Typical performance

Campaign SizeTime (A10G)Cost (Modal)Notes
100 backbones20-30 min~$3Quick exploration
500 backbones1.5-2h~$12Standard campaign
1000 backbones3-4h~$25Large campaign

Expected downstream yield: ~10-15% of backbones pass full QC after sequence design + validation.


Verify

ls output/*.pdb | wc -l  # Should match num_designs

Troubleshooting

Designs lack secondary structure: Decrease noise_scale to 0.5-0.8 Binder not contacting hotspots: Verify residue numbering, increase num_designs OOM errors: Reduce batch size or use A100 GPU Slow generation: Reduce diffuser.T to 25-35

Error interpretation

ErrorCauseFix
RuntimeError: CUDA out of memory
GPU VRAM exceededUse A100 or reduce designs per batch
KeyError: 'A'
Chain not found in PDBCheck chain IDs with
grep ^ATOM target.pdb | cut -c22 | sort -u
ValueError: invalid contig
Syntax error in contigsCheck for spaces, quotes, commas (see Common Mistakes)
FileNotFoundError: ckpt
Missing model weightsDownload from IPD website

Next:

proteinmpnn
for sequence design → structure prediction for validation →
protein-qc
for filtering.