Medical-research-skills diffdock-molecular-docking

Diffusion-based molecular docking to predict 3D ligand–protein binding poses (blind docking) with confidence scoring; use when you need pose prediction for drug discovery or virtual screening.

install

source · Clone the upstream repo

git clone https://github.com/aipoch/medical-research-skills

Claude Code · Install into ~/.claude/skills/

T=$(mktemp -d) && git clone --depth=1 https://github.com/aipoch/medical-research-skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/scientific-skills/Evidence Insight/diffdock-molecular-docking" ~/.claude/skills/aipoch-medical-research-skills-diffdock-molecular-docking && rm -rf "$T"

manifest: scientific-skills/Evidence Insight/diffdock-molecular-docking/SKILL.md

source content

Source: https://github.com/aipoch/medical-research-skills

DiffDock Molecular Docking

When to Use

Blind docking when you have a protein structure (PDB) and a ligand (SMILES) but no known binding site.
Pose prediction to generate multiple plausible 3D binding conformations and rank them.
Virtual screening support to quickly evaluate candidate ligands by predicted binding poses and confidence.
Drug discovery workflows where you need automated docking outputs (SDF poses + scores) for downstream analysis.
Batch/advanced docking when running many ligand–protein pairs or using alternative inputs (e.g., sequence-based workflows; see
```
references/workflows_examples.md
```
).

Key Features

Diffusion generative sampling to produce diverse ligand binding poses.
Confidence model scoring to rank predicted poses.
Simple CLI inference for single protein–ligand docking.
Batch/advanced workflows documented in
```
references/workflows_examples.md
```
.
Structured outputs including ranked SDF pose files and a confidence score report.

Dependencies

Python (version not specified)
PyTorch (version not specified)
PyTorch Geometric / PyG (version not specified)
RDKit (version not specified)
ESM (version not specified)

Example Usage

1) Verify the Environment

python scripts/setup_check.py

2) Run Standard Inference (Single Docking)

Dock a single ligand (SMILES) to a protein structure (PDB) and write results to an output directory:

python scripts/inference_runner.py \
  --protein ./data/protein.pdb \
  --ligand "CC(=O)Oc1ccccc1C(=O)O" \
  --out_dir ./results

Arguments

```
--protein
```
: Path to the protein PDB file.
```
--ligand
```
: Ligand SMILES string.
```
--out_dir
```
: Output directory (default:
```
results/
```
).

3) Outputs

After inference, the tool produces:

Ranked SDF pose files (e.g.,
```
rank1.sdf
```
,
```
rank2.sdf
```
, ...), each containing a predicted 3D binding pose.
Confidence score report:
```
confidence_scores.txt
```
, listing the score for each ranked pose.

Implementation Details

Pose generation: Uses a diffusion-based generative model to sample multiple candidate ligand poses relative to the protein target.
Ranking: A separate confidence model assigns a score to each sampled pose; poses are sorted by this score and saved as
```
rank*.sdf
```
.
Parameterization:
- For the complete CLI argument list and defaults, see
```
references/parameters_reference.md
```
  .
- For confidence interpretation, known limitations, and expected accuracy/scope, see
```
references/confidence_and_limitations.md
```
  .
Advanced workflows: Batch processing and alternative input configurations are documented in
```
references/workflows_examples.md
```
.