Awesome-Agent-Skills-for-Empirical-Research zoplicate-dedup-guide
Detect and manage duplicate items in Zotero libraries
install
source · Clone the upstream repo
git clone https://github.com/brycewang-stanford/Awesome-Agent-Skills-for-Empirical-Research
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/brycewang-stanford/Awesome-Agent-Skills-for-Empirical-Research "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/43-wentorai-research-plugins/skills/literature/metadata/zoplicate-dedup-guide" ~/.claude/skills/brycewang-stanford-awesome-agent-skills-for-empirical-research-zoplicate-dedup-g && rm -rf "$T"
manifest:
skills/43-wentorai-research-plugins/skills/literature/metadata/zoplicate-dedup-guide/SKILL.mdsource content
Zoplicate: Zotero Deduplication Guide
Overview
Zoplicate is a Zotero plugin that detects and manages duplicate items in your library. It goes beyond Zotero's built-in duplicate detection by offering configurable matching criteria, batch merge operations, automatic deduplication on import, and smart metadata merging that keeps the best version of each field. Essential for researchers who import from multiple databases.
Installation
# Download .xpi from GitHub releases # Zotero 7: Tools → Add-ons → Install Add-on From File
Features
### Duplicate Detection - **DOI matching** — Most reliable, catches exact duplicates - **Title similarity** — Fuzzy matching for slight variations - **ISBN matching** — For books and proceedings - **Combined scoring** — Weighted match across multiple fields ### Smart Merge - Keep the most complete metadata from each duplicate - Preserve all tags from both items - Merge notes and annotations - Consolidate attachments (keep all unique PDFs) - Maintain collection memberships from both ### Automatic Mode - Detect duplicates on import - Configurable auto-merge threshold - Notification before auto-merge (optional)
Configuration
{ "detection": { "criteria": { "doi": {"enabled": true, "weight": 1.0}, "title": {"enabled": true, "weight": 0.8, "similarity_threshold": 0.85}, "isbn": {"enabled": true, "weight": 1.0}, "year_author": {"enabled": true, "weight": 0.6} }, "overall_threshold": 0.7, "ignore_case": true, "normalize_unicode": true }, "merge": { "strategy": "keep_most_complete", "preserve_tags": true, "merge_notes": true, "keep_all_attachments": true, "prefer_fields_from": "newer" }, "auto": { "detect_on_import": true, "auto_merge": false, "notify_on_detect": true } }
Usage Workflow
### Manual Deduplication 1. Tools → Zoplicate → Find Duplicates 2. Review duplicate groups (side-by-side comparison) 3. Select master item for each group 4. Click "Merge" — metadata combined, duplicates removed ### Batch Operations 1. Tools → Zoplicate → Find All Duplicates 2. Review summary (X groups, Y total duplicates) 3. "Auto-select best" — picks master by completeness 4. "Merge All" — batch process all groups ### Import Deduplication 1. Import papers from database export (RIS, BibTeX) 2. Zoplicate auto-checks against existing library 3. Popup shows potential duplicates 4. Choose: Skip / Merge / Import as new
Matching Examples
### DOI Match (100% confidence) Item A: "Attention Is All You Need" — DOI: 10.48550/arXiv.1706.03762 Item B: "Attention is All You Need" — DOI: 10.48550/arXiv.1706.03762 → Exact duplicate (same DOI) ### Title Fuzzy Match (85% similarity) Item A: "BERT: Pre-training of Deep Bidirectional Transformers..." Item B: "Bert: Pre-Training of Deep Bidirectional Transformers..." → Likely duplicate (title similarity > threshold) ### Different Versions (not duplicate) Item A: "Paper Title" (arXiv v1, 2023) Item B: "Paper Title" (Published version, 2024) → Different DOIs, may want to keep both or merge
Integration Tips
### Multi-Database Import Workflow 1. Search PubMed → Export RIS → Import to Zotero 2. Search Scopus → Export RIS → Import to Zotero 3. Search Web of Science → Export RIS → Import to Zotero 4. Zoplicate detects cross-database duplicates 5. Merge: keep best metadata from each source ### With Other Plugins - **Zotero Connector** — Detect dups on browser import - **Better BibTeX** — Preserve citation keys on merge - **ZotMoov** — Reorganize attachments after merge
Use Cases
- Library cleanup: Remove duplicates from large collections
- Import dedup: Prevent duplicates when importing from databases
- Systematic reviews: Deduplicate multi-database search results
- Lab libraries: Merge shared group library duplicates
- Migration: Clean up after importing from other reference managers