Skills bilibili-transcript

Transcribe Bilibili videos to text with high accuracy using Whisper medium model. Use when the user provides a Bilibili video URL (BVxxxxx) and wants to: (1) Extract the complete audio content as text with high accuracy, (2) Get a detailed summary of the video content, (3) Save the transcript as a formatted TXT file instead of posting long text to Discord. Automatically detects CC subtitles if available, otherwise uses Whisper medium model with GPU acceleration. Output saves to 'Bilibili transcript' folder by default, includes video metadata, summary section, and full transcript in Simplified Chinese.

install
source · Clone the upstream repo
git clone https://github.com/openclaw/skills
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/openclaw/skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/54lynnn/bilibili-transcript" ~/.claude/skills/openclaw-skills-bilibili-transcript && rm -rf "$T"
OpenClaw · Install into ~/.openclaw/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/openclaw/skills "$T" && mkdir -p ~/.openclaw/skills && cp -r "$T/skills/54lynnn/bilibili-transcript" ~/.openclaw/skills/openclaw-skills-bilibili-transcript && rm -rf "$T"
manifest: skills/54lynnn/bilibili-transcript/SKILL.md
source content

Bilibili Transcript v2.2

High-accuracy Bilibili video transcription with multi-language AI subtitle support.

Overview

This skill provides a complete transcription workflow for Bilibili videos:

  1. Extract Video Metadata - Title, author, publish date, duration
  2. Smart Subtitle Detection - Priority: CC subtitles → AI subtitles (multi-language) → Whisper transcription
  3. Multi-language AI Subtitle Support - Auto-detects:
    ai-zh
    ,
    ai-en
    ,
    ai-ja
    ,
    ai-es
    ,
    ai-ar
    ,
    ai-pt
    ,
    ai-ko
    ,
    ai-de
    ,
    ai-fr
  4. Browser Cookie Support - WSL Chromium or Windows Edge for member-only videos
  5. Formatted Output - Saves as structured TXT file with metadata + summary placeholder + full transcript
  6. Simplified Chinese - Automatically converts Traditional to Simplified Chinese

What's New in v2.2

  • Fixed cookie detection - Now uses browser config directory instead of SQLite file (avoids encoding errors)
  • One-stop solution - CC subtitles → AI subtitles → Whisper transcription, all in one script
  • Better WSL support - Automatically detects WSL Chromium and Windows Edge cookies
  • Smart fallback - Seamlessly switches between subtitle sources without user intervention

What's New in v2.1

  • Improved cookie handling - Fixed UTF-8 encoding issues with snap Chromium
  • Three-tier fallback - CC subtitles → AI subtitles → Whisper transcription
  • Better error handling - Gracefully degrades when cookie sources fail

What's New in v2.0

  • Multi-language AI subtitles - Supports 9 languages: Chinese, English, Japanese, Spanish, Arabic, Portuguese, Korean, German, French
  • WSL Chromium support - Better cookie extraction than Windows Edge
  • Correct subtitle download - Uses
    --write-subs --write-auto-subs
    combo
  • Language auto-detection - Automatically finds available AI subtitle language

AI Subtitle Language Codes

Bilibili uses

ai-
prefix for AI-generated subtitles:

CodeLanguage语言
ai-zh
Chinese中文
ai-en
English英文
ai-ja
Japanese日文
ai-es
Spanish西班牙文
ai-ar
Arabic阿拉伯文
ai-pt
Portuguese葡萄牙文
ai-ko
Korean韩文
ai-de
German德文
ai-fr
French法文

Requirements

Hardware (Your Setup)

  • GPU: NVIDIA RTX 4070 Super (12GB VRAM) - ✅ Perfect for medium model
  • WSL Memory: 16GB (configured)
  • WSL CPU: 6 cores (configured)

Software

  • yt-dlp
    - Video/audio download
  • ffmpeg
    - Audio processing
  • whisper
    - Speech-to-text (local, no API key)
  • opencc
    - Traditional to Simplified Chinese conversion (optional)

Browser (for AI subtitles)

  • WSL Chromium (recommended) - Log in to Bilibili in WSL
  • Windows Edge - Alternative option

Workflow

Step 1: Run Transcription Script

./scripts/bilibili_transcript.sh "https://www.bilibili.com/video/BVxxxxx"

Priority order:

  1. CC Subtitles (manual) - Fastest, highest accuracy
  2. AI Subtitles (auto-generated) - Fast, good accuracy, multi-language
  3. Whisper Transcription - Slowest, ~95% accuracy, works for all videos

Step 2: Generate Detailed Summary

After the script completes, read the generated TXT file and:

  1. Read the full transcript (第二部分)
  2. Generate a comprehensive summary (第一部分)
  3. Save the updated file

Step 3: Present to User

In Discord, post:

  • Brief summary in message
  • Attach the TXT file for full content

Setup WSL Chromium Login

For best results with AI subtitles:

  1. Start WSL Chromium:

    chromium-browser &
    
  2. Navigate to bilibili.com

  3. Log in with your Bilibili account

  4. Run the transcription script

The script will automatically use Chromium's cookies to access member-only AI subtitles.

Usage Examples

Example 1: Basic Transcription (Default Output)

./scripts/bilibili_transcript.sh "https://www.bilibili.com/video/BV1Z1wJzgEAj/"
# Output: workspace/Bilibili transcript/[VideoTitle]_BVxxxxx_transcript.txt

Example 2: Custom Output Directory

./scripts/bilibili_transcript.sh "https://www.bilibili.com/video/BV1Z1wJzgEAj/" ~/Documents

Notes

Model Selection

  • Your config: RTX 4070 Super 12GB + 16GB RAM + 6 cores
  • Default:
    medium
    model (~95% accuracy, balanced speed) ✅
  • Fallback: If GPU unavailable, automatically uses CPU (slower)

Accuracy Comparison

SourceAccuracySpeedBest For
CC Subtitles100%⚡ InstantAll videos with manual subtitles
AI Subtitles (ai-zh)~90%⚡ InstantChinese videos
AI Subtitles (ai-en)~85%⚡ InstantEnglish videos
Whisper medium~95%🐢 SlowNo subtitle videos

Default Output Directory

  • Location:
    workspace/Bilibili transcript/
  • Created automatically on first run
  • All transcript files organized in one place

File Naming

Output files are named:

[VideoTitle]_[BVID]_transcript.txt

  • Special characters (including Chinese punctuation) are replaced with underscores
  • Title truncated to 50 characters
  • Example:
    股票分红_是从左口袋掏右口袋吗_BV1ddzUYTE27_transcript.txt

Subtitle Priority

The script tries subtitles in this order:

  1. Manual CC subtitles (zh-CN, zh-TW, en, ja, etc.)
  2. AI subtitles (any available language: ai-zh, ai-en, ai-ja, etc.)
  3. Whisper voice transcription (fallback)

This ensures fastest processing while maintaining high accuracy.