Marketplace cass
Coding Agent Session Search - unified CLI/TUI to index and search local coding agent history from Claude Code, Codex, Gemini, Cursor, Aider, ChatGPT, Pi-Agent, Factory, and more. Purpose-built for AI agent consumption with robot mode.
git clone https://github.com/aiskillstore/marketplace
T=$(mktemp -d) && git clone --depth=1 https://github.com/aiskillstore/marketplace "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/dicklesworthstone/cass" ~/.claude/skills/aiskillstore-marketplace-cass && rm -rf "$T"
skills/dicklesworthstone/cass/SKILL.mdCASS - Coding Agent Session Search
Unified, high-performance CLI/TUI to index and search your local coding agent history. Aggregates sessions from 11 agents: Codex, Claude Code, Gemini CLI, Cline, OpenCode, Amp, Cursor, ChatGPT, Aider, Pi-Agent, and Factory (Droid).
CRITICAL: Robot Mode Required for AI Agents
NEVER run bare
- it launches an interactive TUI that blocks your session!cass
# WRONG - blocks terminal cass # CORRECT - JSON output for agents cass search "query" --robot cass search "query" --json # alias
Always use
or --robot
flags for machine-readable output.--json
Quick Reference for AI Agents
Pre-Flight Check
# Health check (exit 0=healthy, 1=unhealthy, <50ms) cass health # If unhealthy, rebuild index cass index --full
Essential Commands
# Search with JSON output cass search "authentication error" --robot --limit 5 # Search with metadata (elapsed_ms, cache stats, freshness) cass search "error" --robot --robot-meta # Minimal payload (path, line, agent only) cass search "bug" --robot --fields minimal # View source at specific line cass view /path/to/session.jsonl -n 42 --json # Expand context around a line cass expand /path/to/session.jsonl -n 42 -C 5 --json # Capabilities discovery cass capabilities --json # Full API schema cass introspect --json # LLM-optimized documentation cass robot-docs guide cass robot-docs commands cass robot-docs schemas cass robot-docs examples cass robot-docs exit-codes
Why Use CASS
Cross-Agent Knowledge Transfer
Your coding agents create scattered knowledge:
- Claude Code sessions in
~/.claude/projects - Codex sessions in
~/.codex/sessions - Cursor state in SQLite databases
- Aider history in markdown files
CASS unifies all of this into a single searchable index. When you're stuck on a problem, search across ALL your past agent sessions to find relevant solutions.
Use Cases
# "I solved this before..." cass search "TypeError: Cannot read property" --robot --days 30 # Cross-agent learning (what has ANY agent said about X?) cass search "authentication" --robot --workspace /path/to/project # Agent-to-agent handoff cass search "database migration" --robot --fields summary # Daily review cass timeline --today --json
Command Reference
Indexing
# Full rebuild of DB and search index cass index --full # Incremental update (since last scan) cass index # Watch mode: auto-reindex on file changes cass index --watch # Force rebuild even if schema unchanged cass index --full --force-rebuild # Safe retries with idempotency key (24h TTL) cass index --full --idempotency-key "build-$(date +%Y%m%d)" # JSON output with stats cass index --full --json
Search
# Basic search (JSON output required for agents!) cass search "query" --robot # With filters cass search "error" --robot --agent claude --days 7 cass search "bug" --robot --workspace /path/to/project cass search "panic" --robot --today # Time filters cass search "auth" --robot --since 2024-01-01 --until 2024-01-31 cass search "test" --robot --yesterday cass search "fix" --robot --week # Wildcards cass search "auth*" --robot # prefix: authentication, authorize cass search "*tion" --robot # suffix: authentication, exception cass search "*config*" --robot # substring: misconfigured # Token budget management (critical for LLMs!) cass search "error" --robot --fields minimal # path, line, agent only cass search "error" --robot --fields summary # adds title, score cass search "error" --robot --max-content-length 500 # truncate fields cass search "error" --robot --max-tokens 2000 # soft budget (~4 chars/token) cass search "error" --robot --limit 5 # cap results # Pagination (cursor-based) cass search "TODO" --robot --robot-meta --limit 20 # Use _meta.next_cursor from response: cass search "TODO" --robot --robot-meta --limit 20 --cursor "eyJ..." # Match highlighting cass search "authentication error" --robot --highlight # Query analysis/debugging cass search "auth*" --robot --explain # parsed query, cost estimates cass search "auth error" --robot --dry-run # validate without executing # Aggregations (server-side counts) cass search "error" --robot --aggregate agent,workspace,date # Request correlation cass search "bug" --robot --request-id "req-12345" # Source filtering (for multi-machine setups) cass search "auth" --robot --source laptop cass search "error" --robot --source remote # Traceability (for debugging agent pipelines) cass search "error" --robot --trace-file /tmp/cass-trace.json
Session Analysis
# Export conversation to markdown/HTML/JSON cass export /path/to/session.jsonl --format markdown -o conversation.md cass export /path/to/session.jsonl --format html -o conversation.html cass export /path/to/session.jsonl --format json --include-tools # Expand context around a line (from search result) cass expand /path/to/session.jsonl -n 42 -C 5 --json # Shows 5 messages before and after line 42 # View source at line cass view /path/to/session.jsonl -n 42 --json # Activity timeline cass timeline --today --json --group-by hour cass timeline --days 7 --json --agent claude cass timeline --since 7d --json # Find related sessions for a file cass context /path/to/source.ts --json
Status & Diagnostics
# Quick health (<50ms) cass health cass health --json # Full status snapshot cass status --json cass state --json # alias # Statistics cass stats --json cass stats --by-source # for multi-machine # Full diagnostics cass diag --verbose
Aggregation & Analytics
Aggregate search results server-side to get counts and distributions without transferring full result data:
# Count results by agent cass search "error" --robot --aggregate agent # → { "aggregations": { "agent": { "buckets": [{"key": "claude_code", "count": 45}, ...] } } } # Multi-field aggregation cass search "bug" --robot --aggregate agent,workspace,date # Combine with filters cass search "TODO" --agent claude --robot --aggregate workspace
| Aggregation Field | Description |
|---|---|
| Group by agent type (claude_code, codex, cursor, etc.) |
| Group by workspace/project path |
| Group by date (YYYY-MM-DD) |
| Group by match quality (exact, prefix, fuzzy) |
Top 10 buckets returned per field, with
other_count for remaining items.
Remote Sources (Multi-Machine Search)
Search across sessions from multiple machines via SSH/rsync.
Setup Wizard (Recommended)
cass sources setup
The wizard:
- Discovers SSH hosts from
~/.ssh/config - Probes each for agent data and cass installation
- Optionally installs cass on remotes
- Indexes sessions on remotes
- Configures
sources.toml - Syncs data locally
cass sources setup --hosts css,csd,yto # Specific hosts only cass sources setup --dry-run # Preview without changes cass sources setup --resume # Resume interrupted setup
Manual Setup
# Add a remote machine cass sources add user@laptop.local --preset macos-defaults cass sources add dev@workstation --path ~/.claude/projects --path ~/.codex/sessions # List sources cass sources list --json # Sync sessions cass sources sync cass sources sync --source laptop --verbose # Check connectivity cass sources doctor cass sources doctor --source laptop --json # Path mappings (rewrite remote paths to local) cass sources mappings list laptop cass sources mappings add laptop --from /home/user/projects --to /Users/me/projects cass sources mappings test laptop /home/user/projects/myapp/src/main.rs # Remove source cass sources remove laptop --purge -y
Configuration stored in
~/.config/cass/sources.toml (Linux) or ~/Library/Application Support/cass/sources.toml (macOS).
Robot Mode Deep Dive
Self-Documenting API
CASS teaches agents how to use itself:
# Quick capability check cass capabilities --json # Returns: features, connectors, limits # Full API schema cass introspect --json # Returns: all commands, arguments, response shapes # Topic-based docs (LLM-optimized) cass robot-docs commands # all commands and flags cass robot-docs schemas # response JSON schemas cass robot-docs examples # copy-paste invocations cass robot-docs exit-codes # error handling cass robot-docs guide # quick-start walkthrough cass robot-docs contracts # API versioning cass robot-docs sources # remote sources guide
Forgiving Syntax (Agent-Friendly)
CASS auto-corrects common mistakes:
| What you type | What CASS understands |
|---|---|
| (typo corrected) |
| (single-dash fixed) |
| (case normalized) |
| (alias resolved) |
| (Levenshtein <=2) |
Command Aliases:
,find
,query
,q
,lookup
→grepsearch
,ls
,list
,info
→summarystats
,st
→statestatus
,reindex
,idx
→rebuildindex
,show
,get
→readview
,docs
,help-robot
→robotdocsrobot-docs
Output Formats
# Pretty-printed JSON (default) cass search "error" --robot # Streaming JSONL (header + one hit per line) cass search "error" --robot-format jsonl # Compact single-line JSON cass search "error" --robot-format compact # With performance metadata cass search "error" --robot --robot-meta
Design principle: stdout = JSON only; diagnostics go to stderr.
Token Budget Management
LLMs have context limits. Control output size:
| Flag | Effect |
|---|---|
| Only , , |
| Adds , |
| Custom field selection |
| Truncate long fields (UTF-8 safe) |
| Soft budget (~4 chars/token) |
| Cap number of results |
Truncated fields include
*_truncated: true indicator.
Structured Error Handling
Errors are JSON with actionable hints:
{ "error": { "code": 3, "kind": "index_missing", "message": "Search index not found", "hint": "Run 'cass index --full' to build the index", "retryable": false } }
Exit Codes
| Code | Meaning | Action |
|---|---|---|
| 0 | Success | Parse stdout |
| 1 | Health check failed | Run |
| 2 | Usage error | Fix syntax (hint provided) |
| 3 | Index/DB missing | Run |
| 4 | Network error | Check connectivity |
| 5 | Data corruption | Run |
| 6 | Incompatible version | Update cass |
| 7 | Lock/busy | Retry later |
| 8 | Partial result | Increase |
| 9 | Unknown error | Check flag |
Search Modes
Three search modes, selectable with
--mode flag:
| Mode | Algorithm | Best For |
|---|---|---|
| lexical (default) | BM25 full-text | Exact term matching, code searches |
| semantic | Vector similarity | Conceptual queries, "find similar" |
| hybrid | Reciprocal Rank Fusion | Balanced precision and recall |
cass search "authentication" --mode lexical --robot cass search "how to handle user login" --mode semantic --robot cass search "auth error handling" --mode hybrid --robot
Hybrid combines lexical and semantic using RRF:
RRF_score = Σ 1 / (60 + rank_i)
Pipeline Mode (Chained Search)
Chain searches by piping session paths:
# Find sessions mentioning "auth", then search within those for "token" cass search "authentication" --robot-format sessions | \ cass search "refresh token" --sessions-from - --robot # Build a filtered corpus from today's work cass search --today --robot-format sessions > today_sessions.txt cass search "bug fix" --sessions-from today_sessions.txt --robot
Use cases:
- Drill-down: Broad search → narrow within results
- Cross-reference: Find sessions with term A, then find term B within them
- Corpus building: Save session lists for repeated searches
Query Language
Basic Queries
| Query | Matches |
|---|---|
| Messages containing "error" (case-insensitive) |
| Both "python" AND "error" |
| Exact phrase |
Boolean Operators
| Operator | Example | Meaning |
|---|---|---|
| | Both terms required (default) |
| | Either term matches |
| | First term, excluding second |
| | Shorthand for NOT |
# Complex boolean query cass search "authentication AND (error OR failure) NOT test" --robot # Exclude test files cass search "bug fix -test -spec" --robot # Either error type cass search "TypeError OR ValueError" --robot
Wildcard Patterns
| Pattern | Type | Performance |
|---|---|---|
| Prefix | Fast (edge n-grams) |
| Suffix | Slower (regex) |
| Substring | Slowest (regex) |
Match Types
Results include
match_type:
| Type | Meaning | Score Boost |
|---|---|---|
| Verbatim match | Highest |
| Via prefix expansion | High |
| Via suffix pattern | Medium |
| Via substring pattern | Lower |
| Auto-fallback (sparse results) | Lowest |
Auto-Fuzzy Fallback
When exact query returns <3 results, CASS automatically retries with wildcards:
→auth*auth*- Results flagged with
wildcard_fallback: true
Flexible Time Input
CASS accepts a wide variety of time/date formats:
| Format | Examples |
|---|---|
| Relative | , , , |
| Keywords | , , |
| ISO 8601 | , |
| US Dates | , |
| Unix Timestamp | (seconds or milliseconds) |
Ranking Modes
Cycle with
F12 in TUI or use --ranking flag:
| Mode | Formula | Best For |
|---|---|---|
| Recent Heavy | | "What was I working on?" |
| Balanced | | General search |
| Relevance | | "Best explanation of X" |
| Match Quality | Penalizes fuzzy matches | Precise technical searches |
| Date Newest | Pure chronological | Recent activity |
| Date Oldest | Reverse chronological | "When did I first..." |
Score Components
- Text Relevance (BM25): Term frequency, inverse document frequency, length normalization
- Recency: Exponential decay (today ~1.0, last week ~0.7, last month ~0.3)
- Match Exactness: Exact phrase=1.0, Prefix=0.9, Suffix=0.8, Substring=0.6, Fuzzy=0.4
Blended Scoring Formula
Final_Score = BM25_Score × Match_Quality + α × Recency_Factor
| Mode | α Value | Effect |
|---|---|---|
| Recent Heavy | 1.0 | Recency dominates |
| Balanced | 0.4 | Moderate recency boost |
| Relevance Heavy | 0.1 | BM25 dominates |
| Match Quality | 0.0 | Pure text matching |
Supported Agents (11 Connectors)
| Agent | Location | Format |
|---|---|---|
| Claude Code | | JSONL |
| Codex | | JSONL (Rollout) |
| Gemini CLI | | JSON |
| Cline | VS Code global storage | Task directories |
| OpenCode | directories | SQLite |
| Amp | + VS Code | Mixed |
| Cursor | | SQLite (state.vscdb) |
| ChatGPT | | JSON (v1 unencrypted) |
| Aider | + per-project | Markdown |
| Pi-Agent | | JSONL with thinking |
| Factory (Droid) | | JSONL by workspace |
Note: ChatGPT v2/v3 are AES-256-GCM encrypted (keychain access required). Legacy v1 unencrypted conversations are indexed automatically.
TUI Features (for Humans)
Launch with
cass (no flags):
Keyboard Shortcuts
Navigation:
: Move selectionUp/Down
: Switch panesLeft/Right
: Cycle focusTab/Shift+Tab
: Open inEnter$EDITOR
: Full-screen detail viewSpace
: Jump to first/last resultHome/End
: Scroll by pagePageUp/PageDown
Filtering:
: Agent filterF3
: Workspace filterF4
: Time filters (from/to)F5/F6
: Scope to current result's agentShift+F3
: Clear workspace filterShift+F4
: Cycle presets (24h/7d/30d/all)Shift+F5
: Clear all filtersCtrl+Del
Modes:
: Toggle theme (6 presets)F2
: Context window size (S/M/L/XL)F7
: Match mode (prefix/standard)F9
: Ranking modeF12
: Toggle border styleCtrl+B
Selection & Actions:
: Toggle selectionm
: Select allCtrl+A
: Bulk actions menuA
: Add to queueCtrl+Enter
: Open all queuedCtrl+O
: Copy path/contenty
: Copy all selectedCtrl+Y
: Find in detail pane/
: Next/prev matchn/N
Views & Palette:
: Command paletteCtrl+P
: Load saved view1-9
: Save view to slotShift+1-9
Source Filtering (multi-machine):
: Cycle source filter (all/local/remote)F11
: Source selection menuShift+F11
Global:
: QuitCtrl+C
orF1
: Toggle help?
: Force re-indexCtrl+Shift+R
: Reset all TUI stateCtrl+Shift+Del
Detail Pane Tabs
| Tab | Content | Switch With |
|---|---|---|
| Messages | Full conversation with markdown | / |
| Snippets | Keyword-extracted summaries | / |
| Raw | Unformatted JSON/text | / |
Context Window Sizing
| Size | Characters | Use Case |
|---|---|---|
| Small | ~200 | Quick scanning |
| Medium | ~400 | Default balanced view |
| Large | ~800 | Longer passages |
| XLarge | ~1600 | Full context, code review |
Peek Mode (
Ctrl+Space): Temporarily expand to XL without changing default.
Theme Presets
Cycle through 6 built-in themes with
F2:
| Theme | Description | Best For |
|---|---|---|
| Dark | Tokyo Night-inspired deep blues | Low-light environments |
| Light | High-contrast light background | Bright environments |
| Catppuccin | Warm pastels, reduced eye strain | All-day coding |
| Dracula | Purple-accented dark theme | Popular developer theme |
| Nord | Arctic-inspired cool tones | Calm, focused work |
| High Contrast | Maximum readability | Accessibility needs |
All themes validated against WCAG contrast requirements (4.5:1 minimum for text).
Role-Aware Message Styling
| Role | Visual Treatment |
|---|---|
| User | Blue-tinted background, bold |
| Assistant | Green-tinted background |
| System | Gray/muted background |
| Tool | Orange-tinted background |
Saved Views
Save filter configurations to 9 slots for instant recall.
What Gets Saved:
- Active filters (agent, workspace, time range)
- Current ranking mode
- The search query
Keyboard:
throughShift+1
: Save current viewShift+9
through1
: Load view from slot9
Via Command Palette:
Ctrl+P → "Save/Load view"
Views persist in
tui_state.json across sessions.
Density Modes
Control lines per search result. Cycle with
Shift+D:
| Mode | Lines | Best For |
|---|---|---|
| Compact | 3 | Maximum results visible |
| Cozy | 5 | Balanced view (default) |
| Spacious | 8 | Detailed preview |
Bookmark System
Save important results with notes and tags:
In TUI: Press
b to bookmark, add notes and tags.
Bookmark Structure:
: Short descriptiontitle
,source_path
,line_number
,agentworkspace
: Your annotationsnote
: Comma-separated labelstags
: Extracted contentsnippet
Storage:
~/.local/share/coding-agent-search/bookmarks.db (SQLite)
Optional Semantic Search
Local-only semantic search using MiniLM (no cloud):
Required files (place in data directory):
model.onnxtokenizer.jsonconfig.jsonspecial_tokens_map.jsontokenizer_config.json
Vector index stored as
vector_index/index-minilm-384.cvvi.
CASS does NOT auto-download models; you must manually install them.
Hash Embedder Fallback: When MiniLM not installed, CASS uses a hash-based embedder for approximate semantic similarity.
Watch Mode
Real-time index updates:
cass index --watch
- Debounce: 2 seconds (wait for burst to settle)
- Max wait: 5 seconds (force flush during continuous activity)
- Incremental: Only re-scans modified files
TUI automatically starts watch mode in background.
Deduplication Strategy
CASS uses multi-layer deduplication:
- Message Hash: SHA-256 of
- identical messages stored once(role + content + timestamp) - Conversation Fingerprint: Hash of first N message hashes - detects duplicate files
- Search-Time Dedup: Results deduplicated by content similarity
Noise Filtering:
- Empty messages and pure whitespace
- System prompts (unless searching for them)
- Repeated tool acknowledgments
Performance Characteristics
| Operation | Latency |
|---|---|
| Prefix search (cached) | 2-8ms |
| Prefix search (cold) | 40-60ms |
| Substring search | 80-200ms |
| Full reindex | 5-30s |
| Incremental reindex | 50-500ms |
| Health check | <50ms |
Memory: 70-140MB typical (50K messages) Disk: ~600 bytes/message (including n-gram overhead)
Response Shapes
Search Response:
{ "query": "error", "limit": 10, "count": 5, "total_matches": 42, "hits": [ { "source_path": "/path/to/session.jsonl", "line_number": 123, "agent": "claude_code", "workspace": "/projects/myapp", "title": "Authentication debugging", "snippet": "The error occurs when...", "score": 0.85, "match_type": "exact", "created_at": "2024-01-15T10:30:00Z" } ], "_meta": { "elapsed_ms": 12, "cache_hit": true, "wildcard_fallback": false, "next_cursor": "eyJ...", "index_freshness": { "stale": false, "age_seconds": 120 } } }
Aggregation Response:
{ "aggregations": { "agent": { "buckets": [ {"key": "claude_code", "count": 120}, {"key": "codex", "count": 85} ], "other_count": 15 } } }
Environment Variables
| Variable | Purpose |
|---|---|
| Override data directory |
| Base64 key for encrypted ChatGPT |
| Override Pi-Agent sessions path |
| Per-shard cache entries (default 256) |
| Total cached hits (default 2048) |
| Enable cache debug logging |
| Skip update checks |
Shell Completions
cass completions bash > ~/.local/share/bash-completion/completions/cass cass completions zsh > "${fpath[1]}/_cass" cass completions fish > ~/.config/fish/completions/cass.fish cass completions powershell >> $PROFILE
API Contract & Versioning
cass api-version --json # → { "version": "0.4.0", "contract_version": "1", "breaking_changes": [] } cass introspect --json # → Full schema: all commands, arguments, response types
Guaranteed Stable:
- Exit codes and their meanings
- JSON response structure for
output--robot - Flag names and behaviors
block format_meta
Integration with CASS Memory (cm)
CASS provides episodic memory (raw sessions). CM extracts procedural memory (rules and playbooks):
# 1. CASS indexes raw sessions cass index --full # 2. Search for relevant past experience cass search "authentication timeout" --robot --limit 10 # 3. CM reflects on sessions to extract rules cm reflect
Troubleshooting
| Issue | Solution |
|---|---|
| "missing index" | |
| Stale warning | Rerun index or enable watch |
| Empty results | Check , verify connectors detected |
| JSON parsing errors | Use |
| Watch not triggering | Check , verify file event support |
| Reset TUI state | or |
Installation
# One-liner install curl -fsSL https://raw.githubusercontent.com/Dicklesworthstone/coding_agent_session_search/main/install.sh \ | bash -s -- --easy-mode --verify # Windows irm https://raw.githubusercontent.com/Dicklesworthstone/coding_agent_session_search/main/install.ps1 | iex
Integration with Flywheel
| Tool | Integration |
|---|---|
| CM | CASS provides episodic memory, CM extracts procedural memory |
| NTM | Robot mode flags for searching past sessions |
| Agent Mail | Search threads across agent history |
| BV | Cross-reference beads with past solutions |