Awesome-Agent-Skills-for-Empirical-Research world-bank-data-guide
Access World Bank development indicators and country statistics
git clone https://github.com/brycewang-stanford/Awesome-Agent-Skills-for-Empirical-Research
T=$(mktemp -d) && git clone --depth=1 https://github.com/brycewang-stanford/Awesome-Agent-Skills-for-Empirical-Research "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/43-wentorai-research-plugins/skills/domains/economics/world-bank-data-guide" ~/.claude/skills/brycewang-stanford-awesome-agent-skills-for-empirical-research-world-bank-data-g && rm -rf "$T"
skills/43-wentorai-research-plugins/skills/domains/economics/world-bank-data-guide/SKILL.mdWorld Bank Data API Guide
Overview
The World Bank Data API provides free access to one of the most comprehensive collections of global development data available. The Indicators API covers over 16,000 development indicators across 217 countries and territories, with time series data spanning several decades. Topics include poverty and inequality, health outcomes, education statistics, infrastructure, trade, environmental metrics, and governance indicators.
This API is a cornerstone resource for researchers in development economics, public health, education policy, environmental science, and political science. The World Bank's data is widely cited in academic publications and policy documents, making it an authoritative source for cross-country comparative research.
The API is entirely free, requires no authentication, and returns data in JSON, XML, or other formats. The v2 API supports pagination, filtering by date ranges and income levels, and provides metadata about indicators and data sources.
Authentication
No authentication is required. The World Bank Data API is free and open.
# No API key needed curl "https://api.worldbank.org/v2/country/US/indicator/NY.GDP.MKTP.CD?format=json&date=2020:2024"
Core Endpoints
Get Indicator Data by Country
GET https://api.worldbank.org/v2/country/{country_code}/indicator/{indicator_code}?format=json
Parameters:
: ISO 2-letter or 3-letter code; usecountry_code
for all countriesall
: World Bank indicator codeindicator_code
:format
orjson
(default xml)xml
: Year range (e.g.,date
)2015:2024
: Results per page (default 50, max 32500)per_page
: Page numberpage
Common Indicators:
: GDP (current US$)NY.GDP.MKTP.CD
: GDP per capita (current US$)NY.GDP.PCAP.CD
: Total populationSP.POP.TOTL
: Poverty headcount ratio at $2.15/daySI.POV.DDAY
: Adult literacy rateSE.ADT.LITR.ZS
: Current health expenditure (% of GDP)SH.XPD.CHEX.GD.ZS
: CO2 emissions (metric tons per capita)EN.ATM.CO2E.PC
Example: GDP per capita for BRICS nations:
curl -s "https://api.worldbank.org/v2/country/BR;RU;IN;CN;ZA/indicator/NY.GDP.PCAP.CD?format=json&date=2019:2023&per_page=100" \ | python3 -m json.tool
List All Indicators
curl -s "https://api.worldbank.org/v2/indicator?format=json&per_page=50" \ | python3 -m json.tool
Search Indicators by Topic
curl -s "https://api.worldbank.org/v2/topic/8/indicator?format=json&per_page=20" \ | python3 -m json.tool
Topics include: 1=Agriculture, 3=Economy, 4=Education, 6=Environment, 8=Health, 11=Poverty, etc.
Get Country Metadata
curl -s "https://api.worldbank.org/v2/country/CN?format=json" \ | python3 -m json.tool
Python Example: Cross-Country Development Analysis
import requests import time BASE_URL = "https://api.worldbank.org/v2" def get_indicator(indicator_code, countries="all", date_range="2015:2023", per_page=500): """Fetch World Bank indicator data.""" url = f"{BASE_URL}/country/{countries}/indicator/{indicator_code}" params = { "format": "json", "date": date_range, "per_page": per_page } resp = requests.get(url, params=params) resp.raise_for_status() data = resp.json() if len(data) < 2: return [] return data[1] # Compare health expenditure vs life expectancy health_exp = get_indicator("SH.XPD.CHEX.GD.ZS", countries="US;GB;DE;JP;BR;IN", date_range="2020") for record in health_exp: if record["value"] is not None: country = record["country"]["value"] year = record["date"] value = record["value"] print(f"{country} ({year}): {value:.1f}% of GDP on health")
Python Example: Build a Panel Dataset
import requests import csv def build_panel(indicators, countries, years): """Build a panel dataset from multiple World Bank indicators.""" panel = {} for ind_code, ind_name in indicators.items(): url = f"https://api.worldbank.org/v2/country/{countries}/indicator/{ind_code}" params = {"format": "json", "date": years, "per_page": 5000} resp = requests.get(url, params=params) data = resp.json() if len(data) < 2: continue for record in data[1]: key = (record["country"]["id"], record["date"]) if key not in panel: panel[key] = {"country": record["country"]["value"], "year": record["date"]} panel[key][ind_name] = record["value"] return list(panel.values()) indicators = { "NY.GDP.PCAP.CD": "gdp_per_capita", "SP.POP.TOTL": "population", "SE.ADT.LITR.ZS": "literacy_rate" } data = build_panel(indicators, "BR;IN;NG;ID", "2018:2023") for row in data[:10]: print(row)
Common Research Patterns
Development Panel Regressions: Combine multiple indicators across countries and years to construct panel datasets for econometric analysis. Study relationships between education spending, health outcomes, and economic growth.
Poverty and Inequality Analysis: Track poverty headcount ratios, Gini coefficients, and income share data across countries and over time to study the effectiveness of development interventions.
Environmental-Economic Nexus: Combine CO2 emissions, renewable energy, and GDP data to study the Environmental Kuznets Curve or the decoupling of economic growth from environmental degradation.
Policy Evaluation: Compare indicator trajectories before and after major policy reforms across treated and comparison countries using difference-in-differences or synthetic control methods.
Rate Limits and Best Practices
- No authentication required and no strict rate limit, but keep requests to 1-2 per second
- Pagination: Default 50 results per page; set
for single-page retrieval of most queriesper_page=32500 - Format: Always specify
as the default is XMLformat=json - Null values: Many indicators have null values for certain country-year combinations; always check for nulls
- Country groups: Use aggregate codes like
(EU),EUU
(World),WLD
(Low Income) for pre-computed aggregatesLIC - MRV parameter: Use
to get the most recent value for each country when the latest year variesMRV=1 - Bulk downloads: For large-scale analysis, consider the bulk download options at https://data.worldbank.org/
References
- World Bank Indicators API Documentation: https://datahelpdesk.worldbank.org/knowledgebase/articles/889392-about-the-indicators-api
- World Bank Data Portal: https://data.worldbank.org/
- World Bank Indicator List: https://data.worldbank.org/indicator
- World Bank API Query Builder: https://datahelpdesk.worldbank.org/knowledgebase/articles/898581-api-basic-call-structures