Awesome-Agent-Skills-for-Empirical-Research openalex-api

Query the OpenAlex catalog of scholarly works, authors, and institutions

install
source · Clone the upstream repo
git clone https://github.com/brycewang-stanford/Awesome-Agent-Skills-for-Empirical-Research
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/brycewang-stanford/Awesome-Agent-Skills-for-Empirical-Research "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/43-wentorai-research-plugins/skills/literature/search/openalex-api" ~/.claude/skills/brycewang-stanford-awesome-agent-skills-for-empirical-research-openalex-api && rm -rf "$T"
manifest: skills/43-wentorai-research-plugins/skills/literature/search/openalex-api/SKILL.md
source content

OpenAlex API Guide

Overview

OpenAlex is a free, open catalog of the global research system. It indexes over 250 million scholarly works, 90 million authors, 100,000 institutions, and 65,000 concepts. Created by the nonprofit OurResearch as a replacement for Microsoft Academic Graph, OpenAlex provides comprehensive bibliometric data for academic research analysis.

The API serves as a powerful tool for researchers conducting systematic literature reviews, bibliometric analyses, and research landscape mapping. It covers works across all academic disciplines, linking papers to their authors, institutions, concepts, and citation networks. Each entity in OpenAlex has a persistent identifier (OpenAlex ID) and is enriched with metadata from CrossRef, ORCID, ROR, and other authoritative sources.

OpenAlex is entirely free to use. No API key is required, though providing a contact email in the

mailto
parameter grants access to the polite pool with faster response times and higher rate limits.

Authentication

No authentication required. For better rate limits and access to the polite pool, include your email in requests:

https://api.openalex.org/works?mailto=your@email.com

The polite pool provides significantly faster responses and is recommended for all production usage.

Core Endpoints

Works: Search Scholarly Publications

  • URL:
    GET https://api.openalex.org/works
  • Parameters:
    ParamTypeRequiredDescription
    searchstringNoFull-text search across titles and abstracts
    filterstringNoStructured filters (e.g., publication_year:2024, cited_by_count:>100)
    sortstringNoSort field: cited_by_count, publication_date, relevance_score
    pageintegerNoPage number for pagination (default: 1)
    per_pageintegerNoResults per page (default: 25, max: 200)
    mailtostringNoEmail for polite pool access
  • Example:
    curl "https://api.openalex.org/works?search=large+language+models&filter=publication_year:2024&sort=cited_by_count:desc&per_page=10&mailto=user@example.com"
    
  • Response: JSON with
    results
    array containing
    id
    ,
    title
    ,
    doi
    ,
    publication_date
    ,
    cited_by_count
    ,
    authorships
    ,
    concepts
    ,
    open_access
    status, and
    abstract_inverted_index
    .

Authors: Search Researcher Profiles

  • URL:
    GET https://api.openalex.org/authors
  • Parameters:
    ParamTypeRequiredDescription
    searchstringNoName-based author search
    filterstringNoFilters (e.g., works_count:>50, last_known_institution.id)
    sortstringNoSort field: works_count, cited_by_count, h_index
    per_pageintegerNoResults per page (default: 25, max: 200)
  • Example:
    curl "https://api.openalex.org/authors?search=Geoffrey+Hinton&mailto=user@example.com"
    
  • Response: JSON with
    results
    array containing
    id
    ,
    display_name
    ,
    orcid
    ,
    works_count
    ,
    cited_by_count
    ,
    h_index
    ,
    last_known_institution
    , and
    x_concepts
    .

Institutions: Search Academic Organizations

  • URL:
    GET https://api.openalex.org/institutions
  • Parameters:
    ParamTypeRequiredDescription
    searchstringNoInstitution name search
    filterstringNoFilters (e.g., country_code:US, type:education)
    sortstringNoSort field: works_count, cited_by_count
    per_pageintegerNoResults per page
  • Example:
    curl "https://api.openalex.org/institutions?search=MIT&mailto=user@example.com"
    
  • Response: JSON with institution details including
    id
    ,
    display_name
    ,
    ror
    ,
    country_code
    ,
    type
    ,
    works_count
    , and
    cited_by_count
    .

Concepts: Browse Research Topics

  • URL:
    GET https://api.openalex.org/concepts
  • Parameters:
    ParamTypeRequiredDescription
    searchstringNoConcept name search
    filterstringNoFilters (e.g., level:0 for top-level concepts)
    per_pageintegerNoResults per page
  • Example:
    curl "https://api.openalex.org/concepts?filter=level:0&per_page=50&mailto=user@example.com"
    
  • Response: JSON with concept hierarchy,
    works_count
    , and related concepts.

Rate Limits

Without the

mailto
parameter: 10 requests per second, 100,000 requests per day. With the
mailto
parameter (polite pool): significantly higher throughput. The API uses HTTP 429 responses when limits are exceeded. Implement exponential backoff for production usage.

Common Patterns

Citation Analysis for a Paper

Retrieve all works that cite a specific paper by its DOI:

curl "https://api.openalex.org/works?filter=cites:W2741809807&sort=cited_by_count:desc&per_page=25&mailto=user@example.com"

Institutional Research Output

Analyze publication trends for a specific institution:

curl "https://api.openalex.org/works?filter=institutions.id:I136199984,publication_year:2024&group_by=open_access.is_oa&mailto=user@example.com"

Topic Landscape Mapping

Explore how a concept connects to others in the research landscape:

curl "https://api.openalex.org/concepts/C41008148?mailto=user@example.com"

References