Learn-skills.dev content-source-aggregator

统一信息源热点采集。从 X/Twitter、YouTube、B站、GitHub、Reddit、LinuxDo 六大平台免费获取热门内容,输出标准化热点池供内容创作流水线使用。全部使用免费公开 API,无需付费。

install
source · Clone the upstream repo
git clone https://github.com/NeverSight/learn-skills.dev
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/NeverSight/learn-skills.dev "$T" && mkdir -p ~/.claude/skills && cp -r "$T/data/skills-md/aaaaqwq/claude-code-skills/content-source-aggregator" ~/.claude/skills/neversight-learn-skills-dev-content-source-aggregator && rm -rf "$T"
manifest: data/skills-md/aaaaqwq/claude-code-skills/content-source-aggregator/SKILL.md
source content

信息源热点采集器

从 6 大平台免费采集热门内容,输出标准化 JSON 热点池。

支持平台

平台方式免费额度状态
X/Twittersyndication API (无需认证)无限
YouTubeRSS Feed (频道级) + yt-dlp无限
B站公开 API (ranking/v2)无限
GitHubSearch API (无认证)60次/小时
RedditPullPush API无限
抖音热搜 API (无需认证)无限
LinuxDoDiscourse JSON API无限⏳ 需登录态
小红书Web 端 SSR 解析无限⏳ 需登录态
微信公众号搜狗微信搜索有限⏳ 需登录态
微信视频号无公开 API-⏳ 需 Playwright

使用方法

# 采集所有平台
python3 ~/clawd/skills/content-source-aggregator/scripts/fetch_all.py

# 采集单个平台
python3 ~/clawd/skills/content-source-aggregator/scripts/fetch_all.py --source twitter
python3 ~/clawd/skills/content-source-aggregator/scripts/fetch_all.py --source youtube
python3 ~/clawd/skills/content-source-aggregator/scripts/fetch_all.py --source bilibili
python3 ~/clawd/skills/content-source-aggregator/scripts/fetch_all.py --source github
python3 ~/clawd/skills/content-source-aggregator/scripts/fetch_all.py --source reddit
python3 ~/clawd/skills/content-source-aggregator/scripts/fetch_all.py --source linuxdo

输出

标准化 JSON 写入

~/clawd/workspace/content-pipeline/hotpool/YYYY-MM-DD.json

{
  "date": "2026-02-19",
  "fetched_at": "2026-02-19T10:00:00+08:00",
  "items": [
    {
      "source": "reddit",
      "title": "...",
      "url": "...",
      "summary": "...",
      "heat_score": 85,
      "category": "AI/Tech",
      "engagement": {"upvotes": 1200, "comments": 340},
      "fetched_at": "..."
    }
  ]
}

各平台 API 详情

X/Twitter

  • 端点:
    https://syndication.twitter.com/srv/timeline-profile/screen-name/{username}
  • 方式: 解析返回 HTML 中的推文文本
  • 可配置关注的账号列表

YouTube

  • 端点:
    https://www.youtube.com/feeds/videos.xml?channel_id={id}
  • 方式: RSS XML 解析
  • 可配置关注的频道列表
  • 备选: yt-dlp 获取 trending

B站

  • 热门排行:
    https://api.bilibili.com/x/web-interface/ranking/v2?rid=0&type=all
  • 热搜:
    https://api.bilibili.com/x/web-interface/wbi/search/square?limit=10
  • 需要 Referer: https://www.bilibili.com

GitHub

  • 端点:
    https://api.github.com/search/repositories?q=stars:>100+pushed:>{date}&sort=stars
  • 无认证 60 次/小时,够用

Reddit

  • 端点:
    https://api.pullpush.io/reddit/search/submission/?subreddit={sub}&sort=score&sort_type=desc&size=10
  • 免费无限制

LinuxDo

  • 端点:
    https://linux.do/latest.json?order=default
  • Discourse 标准 API,免费公开

配置

编辑

scripts/config.json
自定义关注的账号/频道/子版块。

与内容流水线集成

本 skill 是内容创作 SOP 的 Phase 1(热点采集),输出供 research agent 做选题评分。