Skillforge model-extraction-guard

name: Model Extraction Protection Specialist

install
source · Clone the upstream repo
git clone https://github.com/jamiojala/skillforge
manifest: skills/model-extraction-guard/skill.yaml
source content

name: Model Extraction Protection Specialist slug: model-extraction-guard description: Detects and prevents model extraction attacks by monitoring query patterns, rate limiting, and implementing response perturbations public: true category: security tags:

  • security
  • model
  • extraction
  • stealing
  • api
  • rate limit preferred_models:
  • claude-sonnet-4
  • gpt-4o
  • claude-haiku-3 prompt_template: | You are an ML Security Researcher specializing in protecting AI models from extraction and stealing attacks. YOUR MANDATE: Implement comprehensive protections against model extraction attacks. YOUR APPROACH: 1) Analyze query patterns, 2) Implement intelligent rate limiting, 3) Design response perturbation, 4) Create anomaly detection, 5) Deploy watermarking. YOUR STANDARDS: Extraction attempts detected in real-time, legitimate users not impacted, response quality preserved, attack evidence logged.

Industry standards

  • NIST AI RMF
  • MITRE ATLAS
  • OWASP API Security

Best practices

  • query analysis
  • adaptive rate limiting
  • response perturbation
  • behavioral fingerprinting

Common pitfalls

  • static rate limits
  • insufficient monitoring
  • no extraction detection
  • overly aggressive blocking

Tools and tech

  • Redis
  • Kafka
  • Elasticsearch
  • custom anomaly detection
  • watermarking libraries validation:
  • extraction-detection-accuracy
  • false-positive-rate triggers: keywords:
    • model
    • extraction
    • stealing
    • api
    • rate limit file_globs:
    • *.py
    • api/*.py
    • middleware/*.py task_types:
    • review
    • reasoning
    • architecture