Claude-skill-registry bio-tcr-bcr-analysis-immcantation-analysis
Analyze BCR repertoires for somatic hypermutation, clonal lineages, and B cell phylogenetics using the Immcantation framework. Use when studying B cell affinity maturation, germinal center dynamics, or antibody evolution.
install
source · Clone the upstream repo
git clone https://github.com/majiayu000/claude-skill-registry
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/majiayu000/claude-skill-registry "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/data/immcantation-analysis" ~/.claude/skills/majiayu000-claude-skill-registry-bio-tcr-bcr-analysis-immcantation-analysis && rm -rf "$T"
manifest:
skills/data/immcantation-analysis/SKILL.mdsource content
Immcantation Analysis
Requires Immcantation suite: alakazam 1.3+, shazam 1.2+, scoper 1.3+, dowser 2.0+, tigger 1.1+.
Load and Format Data
library(alakazam) library(shazam) library(dplyr) # Load AIRR-formatted data (from MiXCR, IMGT/HighV-QUEST, etc.) db <- readChangeoDb('clones_airr.tsv') # Required columns: # sequence_id, sequence, v_call, d_call, j_call, junction, junction_aa
Clonal Clustering
library(scoper) # Assign clones based on junction similarity # Threshold typically 0.15-0.2 (15-20% nucleotide distance) db <- hierarchicalClones( db, threshold = 0.15, method = 'nt', linkage = 'single' ) # Count clones clone_sizes <- countClones(db, groups = 'sample_id')
Somatic Hypermutation Analysis
# Calculate mutation frequencies db <- observedMutations( db, sequenceColumn = 'sequence_alignment', germlineColumn = 'germline_alignment_d_mask', regionDefinition = IMGT_V, mutationDefinition = MUTATION_SCHEMES$S5F ) # Mutation frequency columns added: # mu_count_seq_r, mu_count_seq_s (replacement/silent mutations) # mu_freq_seq_r, mu_freq_seq_s (frequencies) # Summarize by clone mutation_summary <- db %>% group_by(clone_id) %>% summarize( mean_mu = mean(mu_freq_seq_r, na.rm = TRUE), n_sequences = n() )
Selection Analysis
library(shazam) # Test for selection pressure # Compares observed R/S ratio to expected under neutrality baseline <- estimateBaseline( db, sequenceColumn = 'sequence_alignment', germlineColumn = 'germline_alignment_d_mask', testStatistic = 'focused', regionDefinition = IMGT_V, nproc = 4 ) # Summarize selection selection <- summarizeBaseline(baseline, returnType = 'df') # Positive sigma = positive selection (beneficial mutations retained) # Negative sigma = negative selection (deleterious mutations removed)
Build Clonal Lineage Trees
library(dowser) # Build lineage trees for each clone # Requires multiple sequences per clone clones_multi <- db %>% group_by(clone_id) %>% filter(n() >= 3) %>% ungroup() # Build trees using maximum parsimony trees <- buildPhylipLineage( clones_multi, phylip_exec = 'dnapars', rm_temp = TRUE ) # Plot a tree plotTrees(trees[[1]])
Germline Inference
library(tigger) # Infer novel V gene alleles novel <- findNovelAlleles( db, germline_db = 'IMGT_Human_IGHV.fasta', nproc = 4 ) # Genotype the individual genotype <- inferGenotype(db, germline_db = 'IMGT_Human_IGHV.fasta') # Correct V gene calls db <- reassignAlleles(db, genotype)
Visualization
# Plot mutation frequency distribution library(ggplot2) ggplot(db, aes(x = mu_freq_seq_r)) + geom_histogram(bins = 50) + facet_wrap(~ sample_id) + labs(x = 'Replacement Mutation Frequency', y = 'Count') # Plot V gene usage v_usage <- countGenes(db, gene = 'v_call', groups = 'sample_id') plotGeneUsage(v_usage, gene = 'v_call')
Related Skills
- mixcr-analysis - Generate input clonotype data
- vdjtools-analysis - Diversity metrics (TCR-focused)
- phylogenetics - General tree concepts