Claude-skills tree-sitting
AST-powered code navigation via tree-sitter. Auto-scans codebases and provides progressive-disclosure tree views with symbol search, source retrieval, and reference finding. Each invocation is self-contained — no cross-process state. Use when exploring unfamiliar repos, navigating code, or needing fast symbol lookup. Triggers on "map this codebase", "explore repo", "find symbol", "navigate code", "tree-sitter", or when starting work on an unfamiliar repository.
git clone https://github.com/oaustegard/claude-skills
T=$(mktemp -d) && git clone --depth=1 https://github.com/oaustegard/claude-skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/tree-sitting" ~/.claude/skills/oaustegard-claude-skills-tree-sitting && rm -rf "$T"
tree-sitting/SKILL.mdtree-sitting
AST-powered code navigation using tree-sitter. Each invocation auto-scans the codebase (~700ms for 250 files), then runs queries at sub-millisecond speed.
Setup
uv venv /home/claude/.venv 2>/dev/null uv pip install tree-sitter-language-pack --python /home/claude/.venv/bin/python
Total install: <2s cold cache, <400ms warm.
Usage: CLI (treesit.py)
Every call auto-scans, prints a tree overview, then runs any queries. No state to manage between calls.
TREESIT=/mnt/skills/user/tree-sitting/scripts/treesit.py # Orient: root-level overview (default depth=1) /home/claude/.venv/bin/python $TREESIT /path/to/repo # Featuring: complete tree, minimal detail /home/claude/.venv/bin/python $TREESIT /path/to/repo --depth=-1 --detail=sparse # Explore a subdirectory in full detail /home/claude/.venv/bin/python $TREESIT /path/to/repo --path=src/core --detail=full # Run queries (tree overview + query results) /home/claude/.venv/bin/python $TREESIT /path/to/repo 'find:Parser*' 'source:parse_input' # Queries only, no tree /home/claude/.venv/bin/python $TREESIT /path/to/repo --no-tree 'refs:AuthToken'
Options
| Option | Default | Description |
|---|---|---|
| 1 | Directory depth: -1=all, 0=root only, 1=one level |
| normal | Node detail: sparse, normal, full |
| (root) | Scope to subdirectory |
| Extra dirs to skip (comma-separated) | |
| Suppress tree overview, show only queries | |
| Show scan timing and counts |
Detail Levels
| Level | Per-node output | Use case |
|---|---|---|
| | featuring: see the full shape |
| per file (compact) | exploring: quick orientation |
| signature + doc + children + imports | exploring: deep dive into a directory |
Queries
Append after the repo path. Multiple queries per call.
| Query | Example | Description |
|---|---|---|
| | Symbol search (glob/substring) |
| | All symbols in a file |
| | Source code of a symbol |
| | Text references across codebase |
| | Import list for a file |
| | Directory overview (engine format) |
Workflow
1. treesit.py /repo → orient: what dirs, how big 2. treesit.py /repo --path=src/core → drill into interesting directory 3. treesit.py /repo 'find:Parser*' → find specific symbols 4. treesit.py /repo 'source:parse_input' → read implementation 5. treesit.py /repo 'refs:ParseState' → find usage across codebase
Each call is self-contained. No need to "scan first, query later" — scan happens automatically every time (~700ms).
Usage: Direct Python (single invocation)
For custom scripts that need the engine API directly:
import sys; sys.path.insert(0, '/mnt/skills/user/tree-sitting/scripts') from engine import CodeCache cache = CodeCache() cache.scan('/path/to/repo') # All queries in the SAME invocation: print(cache.tree_overview()) print(cache.find_symbol('ClassName')) print(cache.get_source_range('src/core/parser.c', 100, 150))
Important: The cache is in-memory only. All scan + query calls MUST happen in the same Python process. Splitting across separate
python -c
invocations loses the cache — use treesit.py instead.
Supported Languages
Python, JavaScript, TypeScript, TSX, Go, Rust, Ruby, Java, C, C++, C#, Swift, Kotlin, Scala, HTML, CSS, Markdown, JSON, YAML, TOML, Lua, Bash, Elisp, Zig, Elixir.
Three-tier extraction:
- Custom extractors (richest — signatures, hierarchy, docstrings): Python, C, Go, Rust, JavaScript, TypeScript, TSX, Ruby, Markdown (heading outline)
- tags.scm queries (community-maintained — kinds, docs where grammars support it): Java, C++, C#
- Generic heuristic (names + kinds + locations): all others
What It Extracts
- Symbols: functions, classes, structs, enums, methods, constants, defines, types
- Signatures: parameter lists and return types (Python, C; partial for others)
- Doc comments: first-line summaries from docstrings, JSDoc, Doxygen,
,///# - Line ranges: start and end line for every symbol
- Imports: per-file dependency tracking
- Hierarchy: class→methods, struct→fields (Python, C)
Architecture
CodeCache (in-memory, per-invocation) ├── files: {relpath → FileEntry(source, tree, symbols, imports)} ├── _symbol_index: {name → [Symbol, ...]} ← fast lookup └── methods: scan(), find_symbol(), file_symbols(), dir_overview(), ... │ └── treesit.py CLI — auto-scan + progressive-disclosure tree + queries
Parse cost is paid once per invocation. The symbol index enables O(1) exact match and O(n) substring/glob search where n is the number of unique symbol names (not files).