Medical-research-skills diffdock-molecular-docking
Diffusion-based molecular docking to predict 3D ligand–protein binding poses (blind docking) with confidence scoring; use when you need pose prediction for drug discovery or virtual screening.
install
source · Clone the upstream repo
git clone https://github.com/aipoch/medical-research-skills
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/aipoch/medical-research-skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/scientific-skills/Evidence Insight/diffdock-molecular-docking" ~/.claude/skills/aipoch-medical-research-skills-diffdock-molecular-docking && rm -rf "$T"
manifest:
scientific-skills/Evidence Insight/diffdock-molecular-docking/SKILL.mdsource content
DiffDock Molecular Docking
When to Use
- Blind docking when you have a protein structure (PDB) and a ligand (SMILES) but no known binding site.
- Pose prediction to generate multiple plausible 3D binding conformations and rank them.
- Virtual screening support to quickly evaluate candidate ligands by predicted binding poses and confidence.
- Drug discovery workflows where you need automated docking outputs (SDF poses + scores) for downstream analysis.
- Batch/advanced docking when running many ligand–protein pairs or using alternative inputs (e.g., sequence-based workflows; see
).references/workflows_examples.md
Key Features
- Diffusion generative sampling to produce diverse ligand binding poses.
- Confidence model scoring to rank predicted poses.
- Simple CLI inference for single protein–ligand docking.
- Batch/advanced workflows documented in
.references/workflows_examples.md - Structured outputs including ranked SDF pose files and a confidence score report.
Dependencies
- Python (version not specified)
- PyTorch (version not specified)
- PyTorch Geometric / PyG (version not specified)
- RDKit (version not specified)
- ESM (version not specified)
Example Usage
1) Verify the Environment
python scripts/setup_check.py
2) Run Standard Inference (Single Docking)
Dock a single ligand (SMILES) to a protein structure (PDB) and write results to an output directory:
python scripts/inference_runner.py \ --protein ./data/protein.pdb \ --ligand "CC(=O)Oc1ccccc1C(=O)O" \ --out_dir ./results
Arguments
: Path to the protein PDB file.--protein
: Ligand SMILES string.--ligand
: Output directory (default:--out_dir
).results/
3) Outputs
After inference, the tool produces:
- Ranked SDF pose files (e.g.,
,rank1.sdf
, ...), each containing a predicted 3D binding pose.rank2.sdf - Confidence score report:
, listing the score for each ranked pose.confidence_scores.txt
Implementation Details
- Pose generation: Uses a diffusion-based generative model to sample multiple candidate ligand poses relative to the protein target.
- Ranking: A separate confidence model assigns a score to each sampled pose; poses are sorted by this score and saved as
.rank*.sdf - Parameterization:
- For the complete CLI argument list and defaults, see
.references/parameters_reference.md - For confidence interpretation, known limitations, and expected accuracy/scope, see
.references/confidence_and_limitations.md
- For the complete CLI argument list and defaults, see
- Advanced workflows: Batch processing and alternative input configurations are documented in
.references/workflows_examples.md