Claude-skill-registry dna-assembly
Guidance for Golden Gate assembly primer design and DNA assembly tasks. This skill should be used when designing primers for Golden Gate cloning, Type IIS restriction enzyme assembly, or multi-fragment DNA assembly workflows. It covers overhang selection, primer structure, assembly simulation, and verification strategies.
git clone https://github.com/majiayu000/claude-skill-registry
T=$(mktemp -d) && git clone --depth=1 https://github.com/majiayu000/claude-skill-registry "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/data/dna-assembly" ~/.claude/skills/majiayu000-claude-skill-registry-dna-assembly && rm -rf "$T"
skills/data/dna-assembly/SKILL.mdDNA Assembly
Overview
This skill provides procedural knowledge for Golden Gate assembly primer design, a molecular cloning technique that uses Type IIS restriction enzymes (like BsaI) to create scarless multi-fragment assemblies. The skill emphasizes rigorous verification and simulation to ensure designed primers produce the expected assembled product.
Workflow
Phase 1: Understand the Assembly Requirements
Before designing primers, thoroughly analyze:
- Input sequences: Read and parse all input FASTA files to understand the fragments to be assembled
- Output sequence: Understand the expected final assembled product
- Assembly topology: Determine if the assembly is linear or circular (circular plasmids require the last overhang to match the first)
- Fragment order: Identify the correct order of fragments in the final assembly
- Reading frame considerations: Note which fragments require start codons, stop codons, or neither
Phase 2: Overhang Design
Overhang selection is critical for efficient Golden Gate assembly. Follow these principles:
Use established overhang sets: Rather than designing arbitrary overhangs, use validated overhang sets from NEB or published literature. See
references/overhang_design.md for recommended sets.
Overhang requirements:
- 4-nucleotide overhangs for BsaI-based assembly
- Overhangs must be sufficiently different (Hamming distance >= 2 recommended)
- Avoid palindromic overhangs that could self-ligate
- Avoid overhangs with high GC content at ligation junction
- For N fragments in circular assembly, exactly N unique overhangs are needed
Common mistake: Selecting overhangs that differ by only one nucleotide (e.g., AACC, AACG, AACT). These similar overhangs can cause mis-ligation and reduce assembly efficiency.
Phase 3: Primer Structure Design
Golden Gate primers have a specific structure. Understanding orientation is crucial:
Forward primer structure (5' to 3'):
[5' extension (optional)] - [BsaI recognition site: GGTCTC] - [N spacer] - [4nt overhang] - [gene-specific binding region]
Reverse primer structure (5' to 3'):
[5' extension (optional)] - [BsaI recognition site: GAGACC] - [N spacer] - [4nt overhang (reverse complement)] - [gene-specific binding region (reverse complement)]
Critical orientation check: The BsaI site must always be at the 5' end of the primer as written. A common mistake is placing the recognition site at the 3' end, which will not produce the intended cut.
Gene-specific binding region requirements:
- Typically 18-25 nucleotides
- Melting temperature (Tm) between 55-65 degrees C
- GC content of 40-60% preferred
- Avoid runs of >4 identical nucleotides
- Check for secondary structure (hairpins) that could affect PCR
Phase 4: Pre-Assembly Verification
Before finalizing primers, perform these checks:
-
Internal restriction site check: Verify that insert sequences do not contain BsaI recognition sites (GGTCTC or GAGACC). If present, consider silent mutations or alternative enzymes.
-
Backbone check: Also verify the plasmid backbone being amplified does not contain internal BsaI sites.
-
Overhang uniqueness verification: Confirm all overhangs are unique and sufficiently different from each other.
-
Primer quality checks:
- Self-complementarity analysis (avoid hairpins)
- Primer-dimer formation potential
- 3' end GC content (1-2 G/C in last 5 bases ideal for specificity)
- Overall GC content (40-60%)
Phase 5: Assembly Simulation
This is the most critical verification step. Before declaring success:
-
Simulate PCR products: For each primer pair, determine the exact PCR product sequence including the BsaI sites and overhangs.
-
Simulate BsaI digestion: Apply the enzyme cut to each PCR product to determine the digested fragment with overhangs.
-
Simulate ligation: Assemble all digested fragments in silico based on overhang complementarity.
-
Compare to expected output: Perform a nucleotide-by-nucleotide comparison of the simulated assembled product against the expected output sequence.
Common mistake: Claiming assembly will work without actually simulating the complete product and comparing it to the expected output.
Phase 6: Output Generation
Generate primer output in a clear format:
- Primer name (indicating fragment and direction)
- Primer sequence (5' to 3')
- Calculated Tm for binding region
- Overhang produced after digestion
Verification Checklist
Before finalizing any primer design, confirm:
- All input sequences parsed correctly
- Expected output sequence understood
- Overhang set uses established/validated sequences
- All overhangs differ by Hamming distance >= 2
- No internal BsaI sites in inserts or backbone
- BsaI sites positioned at 5' end of all primers
- Primer Tm values within acceptable range (55-65 degrees C)
- No significant primer secondary structures
- Full assembly simulated in silico
- Simulated product matches expected output exactly
- Circular topology handled correctly (if applicable)
- Start/stop codons correctly included/excluded per fragment
Common Pitfalls
-
Inconsistent overhang reporting: Track overhangs carefully throughout the design process. If reported overhangs change between steps, this indicates a bug.
-
Primer orientation confusion: Remember that reverse primers are written 5' to 3' but bind to the opposite strand. The overhang sequence in a reverse primer should be the reverse complement of the desired overhang.
-
Circular assembly errors: For circular plasmids, the overhang connecting the last fragment back to the first must be correctly designed. Verify the plasmid closes in the correct orientation.
-
Incomplete verification: Checking that "the fusion matches at position X" is insufficient. Verify the entire assembled sequence matches the expected output.
-
Tm calculation inconsistencies: If Tm values differ between reports, investigate the calculation method. Use a consistent, reliable Tm calculation approach.
-
Script development approach: Test primer design logic incrementally on simple cases before applying to complex multi-fragment assemblies. Avoid writing large scripts that fail with cryptic errors.
Resources
references/
: Validated overhang sets and selection criteria for Golden Gate assemblyoverhang_design.md