Awesome-Agent-Skills-for-Empirical-Research gbif-api

Global biodiversity data API for species occurrences and datasets

install
source · Clone the upstream repo
git clone https://github.com/brycewang-stanford/Awesome-Agent-Skills-for-Empirical-Research
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/brycewang-stanford/Awesome-Agent-Skills-for-Empirical-Research "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/43-wentorai-research-plugins/skills/domains/ecology/gbif-api" ~/.claude/skills/brycewang-stanford-awesome-agent-skills-for-empirical-research-gbif-api && rm -rf "$T"
manifest: skills/43-wentorai-research-plugins/skills/domains/ecology/gbif-api/SKILL.md
source content

GBIF API Guide

Overview

The Global Biodiversity Information Facility (GBIF) is an international network and data infrastructure funded by governments worldwide, aimed at providing open access to biodiversity data. GBIF aggregates hundreds of millions of species occurrence records from natural history collections, citizen science platforms, monitoring networks, and published literature across the globe.

The GBIF API provides programmatic access to this vast repository of biodiversity data. Researchers can search for species occurrences by taxonomy, geography, time period, and dataset. The API also supports taxonomic name matching, dataset discovery, and species profile lookups. It serves as a foundational resource for ecological research, conservation planning, biogeography, and environmental impact assessments.

Ecologists, conservation biologists, biogeographers, and environmental scientists rely on the GBIF API to retrieve georeferenced occurrence data for species distribution modeling, climate change impact analysis, invasive species tracking, and biodiversity hotspot identification. The data is freely available under open data licenses.

Authentication

No authentication required for read access. The GBIF API is publicly accessible without any API key or token. All search and retrieval endpoints are open. User authentication is only required for data publishing operations (creating datasets and uploading occurrences), which requires a GBIF account.

Core Endpoints

occurrence/search: Search Species Occurrences

Search for georeferenced biodiversity observation and specimen records across all GBIF-indexed datasets.

  • URL:
    GET https://api.gbif.org/v1/occurrence/search
  • Parameters:
ParameterTypeRequiredDescription
qstringNoFull-text search query
taxonKeyintNoGBIF backbone taxonomy key
scientificNamestringNoScientific name to filter by
countrystringNoISO 3166-1 alpha-2 country code
hasCoordinateboolNoFilter for georeferenced records only
yearstringNoYear or range (e.g.,
2020,2024
)
limitintNoNumber of results (default 20, max 300)
offsetintNoPagination offset
  • Example:
curl "https://api.gbif.org/v1/occurrence/search?scientificName=Panthera+tigris&hasCoordinate=true&limit=10"
  • Response: Returns
    count
    (total matches),
    results
    array with
    key
    ,
    scientificName
    ,
    decimalLatitude
    ,
    decimalLongitude
    ,
    country
    ,
    basisOfRecord
    ,
    eventDate
    ,
    datasetKey
    ,
    publishingOrgKey
    , and
    media
    links.

species/match: Taxonomic Name Matching

Match a species name against the GBIF backbone taxonomy to resolve canonical names and get taxonomy keys.

  • URL:
    GET https://api.gbif.org/v1/species/match
  • Parameters:
ParameterTypeRequiredDescription
namestringYesScientific name to match
kingdomstringNoKingdom filter for disambiguation
strictboolNoIf true, only return exact matches
  • Example:
curl "https://api.gbif.org/v1/species/match?name=Homo+sapiens"
  • Response: Returns
    usageKey
    ,
    scientificName
    ,
    canonicalName
    ,
    rank
    ,
    status
    ,
    kingdom
    ,
    phylum
    ,
    class
    ,
    order
    ,
    family
    ,
    genus
    ,
    species
    ,
    confidence
    score, and
    matchType
    .

dataset: Discover Datasets

Search for and retrieve metadata about GBIF-indexed datasets from publishers worldwide.

  • URL:
    GET https://api.gbif.org/v1/dataset
  • Parameters:
ParameterTypeRequiredDescription
qstringNoFull-text search query
typestringNoDataset type:
OCCURRENCE
,
CHECKLIST
, etc.
publishingOrgstringNoPublishing organization UUID
limitintNoNumber of results (default 20, max 1000)
offsetintNoPagination offset
  • Example:
curl "https://api.gbif.org/v1/dataset?q=bird+monitoring&type=OCCURRENCE&limit=5"
  • Response: Returns
    count
    ,
    results
    array with
    key
    ,
    title
    ,
    description
    ,
    type
    ,
    publishingOrganizationKey
    ,
    license
    ,
    recordCount
    , and
    endpoints
    .

Rate Limits

No formal rate limits are enforced on the GBIF API. However, GBIF recommends responsible use patterns. Large data downloads (millions of records) should use the asynchronous download API at

https://api.gbif.org/v1/occurrence/download/request
rather than paginating through the search endpoint. The search endpoint is limited to 100,000 records maximum per query via pagination.

Common Patterns

Species Distribution Mapping

Retrieve georeferenced occurrence data for species distribution modeling:

import requests

params = {
    "taxonKey": 2480498,  # Panthera tigris
    "hasCoordinate": True,
    "limit": 300
}
resp = requests.get("https://api.gbif.org/v1/occurrence/search", params=params)
data = resp.json()

coordinates = [(r["decimalLongitude"], r["decimalLatitude"])
               for r in data["results"]
               if "decimalLongitude" in r and "decimalLatitude" in r]

print(f"Retrieved {len(coordinates)} georeferenced occurrences of {data['results'][0]['scientificName']}")

Taxonomic Name Resolution Pipeline

Resolve a list of species names against the GBIF backbone taxonomy:

import requests

names = ["Homo sapiens", "Canis lupus", "Quercus robur", "Drosophila melanogaster"]

for name in names:
    resp = requests.get("https://api.gbif.org/v1/species/match", params={"name": name})
    match = resp.json()
    print(f"{name} -> {match['canonicalName']} (key: {match['usageKey']}, confidence: {match['confidence']})")

Bulk Occurrence Download

For large-scale analyses requiring millions of records, use the asynchronous download API:

curl -X POST "https://api.gbif.org/v1/occurrence/download/request" \
  -H "Content-Type: application/json" \
  -u username:password \
  -d '{"creator":"username","predicate":{"type":"equals","key":"TAXON_KEY","value":"2480498"}}'

References