Awesome-Agent-Skills-for-Empirical-Research enrichr-api

Perform gene set enrichment analysis using the Enrichr API

install
source · Clone the upstream repo
git clone https://github.com/brycewang-stanford/Awesome-Agent-Skills-for-Empirical-Research
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/brycewang-stanford/Awesome-Agent-Skills-for-Empirical-Research "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/43-wentorai-research-plugins/skills/domains/biomedical/enrichr-api" ~/.claude/skills/brycewang-stanford-awesome-agent-skills-for-empirical-research-enrichr-api && rm -rf "$T"
manifest: skills/43-wentorai-research-plugins/skills/domains/biomedical/enrichr-api/SKILL.md
source content

Enrichr Gene Set Enrichment Analysis API

Overview

Enrichr is the most widely used gene set enrichment analysis tool, developed by the Ma'ayan Lab at the Icahn School of Medicine at Mount Sinai. It tests whether a user-supplied gene list is statistically over-represented in curated gene set libraries spanning pathways, ontologies, transcription factor targets, disease associations, and cell types. The API provides access to 225 background libraries covering over 500,000 annotated gene sets. Free, no authentication required.

Two-Step Workflow

Enrichr uses a submit-then-query pattern:

  1. POST gene list to
    /addList
    -- returns a
    userListId
    token
  2. GET enrichment from
    /enrich
    using that token and a chosen library

The

userListId
persists on the server, so you can run multiple library queries against the same submission without re-uploading.

Core Endpoints

Base URL

https://maayanlab.cloud/Enrichr

Step 1: Submit Gene List

curl -X POST "https://maayanlab.cloud/Enrichr/addList" \
  -F "list=BRCA1
BRCA2
TP53
EGFR
MYC
PTEN
AKT1
KRAS
PIK3CA
RAF1" \
  -F "description=cancer_genes"

Response:

{
  "shortId": "8619200cc78f1513ff1029a04af90ad7",
  "userListId": 124544426
}

Genes are newline-separated. The request must use

multipart/form-data
(the
-F
flag), not
application/x-www-form-urlencoded
.

Step 2: Retrieve Enrichment Results

curl "https://maayanlab.cloud/Enrichr/enrich?userListId=124544426&backgroundType=KEGG_2021_Human"

Response (first 3 of 143 results):

{
  "KEGG_2021_Human": [
    [1, "Breast cancer", 3.37e-22, 198530.0, 9815800.25,
     ["PIK3CA","MYC","PTEN","AKT1","KRAS","BRCA1","BRCA2","RAF1","TP53","EGFR"],
     4.82e-20, 0, 0],
    [2, "Endometrial cancer", 1.35e-19, 1595.2, 69306.12,
     ["PIK3CA","MYC","PTEN","AKT1","KRAS","RAF1","TP53","EGFR"],
     9.68e-18, 0, 0],
    [3, "Central carbon metabolism in cancer", 6.66e-19, 1285.68, 53809.88,
     ["PIK3CA","MYC","PTEN","AKT1","KRAS","RAF1","TP53","EGFR"],
     3.17e-17, 0, 0]
  ]
}

Each result array contains:

[rank, term_name, p_value, z_score, combined_score, overlapping_genes, adjusted_p_value, old_p_value, old_adjusted_p_value]
.

View Submitted Gene List

curl "https://maayanlab.cloud/Enrichr/view?userListId=124544426"
{
  "genes": ["PIK3CA","MYC","AKT1","PTEN","BRCA1","KRAS","BRCA2","EGFR","TP53","RAF1"],
  "description": "cancer_genes"
}

Export Results as TSV

curl "https://maayanlab.cloud/Enrichr/export?userListId=124544426&backgroundType=KEGG_2021_Human&filename=results" \
  -o enrichr_results.txt

List Available Libraries

curl "https://maayanlab.cloud/Enrichr/datasetStatistics"

Returns metadata for all 225 libraries, each entry containing

libraryName
,
numTerms
,
geneCoverage
, and
genesPerTerm
.

Available Libraries (225 Total)

Pathway Databases

LibraryTermsGenes
KEGG_20263528,110
KEGG_2021_Human3208,078
WikiPathways_2024_Human8298,281
Reactome_Pathways_20242,10511,671
BioCarta_20162371,348

Gene Ontology

LibraryTermsGenes
GO_Biological_Process_20255,34314,674
GO_Molecular_Function_20251,17411,484
GO_Cellular_Component_202546811,501

Disease and Phenotype

LibraryTermsGenes
DisGeNET9,82817,464
GWAS_Catalog_20252,36915,030
ClinVar_20256093,481
OMIM_Disease901,759
Human_Phenotype_Ontology1,7793,096

Transcription Factor and Epigenomics

LibraryTermsGenes
ChEA_202275718,365
ENCODE_TF_ChIP-seq_201581626,382
JASPAR_PWM_Human_202567518,518

Cell Type and Tissue

LibraryTermsGenes
CellMarker_20241,69212,642
ARCHS4_Tissues10821,809
Human_Gene_Atlas8413,373

Cancer and Drug

LibraryTermsGenes
MSigDB_Hallmark_2020504,383
MSigDB_Oncogenic_Signatures18911,250
DGIdb_Drug_Targets_20246592,513

Rate Limits

  • No authentication or API key required
  • No officially published rate limits, but automated queries should include reasonable delays (1-2 seconds between requests)
  • Very large gene lists (>3,000 genes) may time out on some libraries
  • The
    userListId
    persists server-side; avoid re-submitting the same list repeatedly

Academic Use Cases

  • Differential expression follow-up: Submit DEGs from RNA-seq to identify enriched pathways and GO terms
  • GWAS hit annotation: Map GWAS-significant genes to disease phenotypes via DisGeNET or GWAS_Catalog
  • Drug target discovery: Cross-reference gene signatures against DGIdb_Drug_Targets for druggable candidates
  • Transcription factor analysis: Identify upstream regulators via ChEA or ENCODE TF libraries

Python Usage

import requests

ENRICHR_URL = "https://maayanlab.cloud/Enrichr"


def submit_gene_list(genes: list[str], description: str = "") -> int:
    """Submit a gene list to Enrichr, return userListId."""
    payload = {
        "list": (None, "\n".join(genes)),
        "description": (None, description),
    }
    resp = requests.post(f"{ENRICHR_URL}/addList", files=payload)
    resp.raise_for_status()
    return resp.json()["userListId"]


def get_enrichment(user_list_id: int, library: str) -> list[dict]:
    """Retrieve enrichment results for a given library."""
    resp = requests.get(
        f"{ENRICHR_URL}/enrich",
        params={"userListId": user_list_id, "backgroundType": library},
    )
    resp.raise_for_status()
    data = resp.json()

    results = []
    for entry in data.get(library, []):
        results.append({
            "rank": entry[0],
            "term": entry[1],
            "p_value": entry[2],
            "z_score": entry[3],
            "combined_score": entry[4],
            "genes": entry[5],
            "adj_p_value": entry[6],
        })
    return results


def get_libraries() -> list[dict]:
    """List all available Enrichr libraries."""
    resp = requests.get(f"{ENRICHR_URL}/datasetStatistics")
    resp.raise_for_status()
    return resp.json()["statistics"]


# Example: enrichment analysis of cancer-related genes
genes = ["BRCA1", "BRCA2", "TP53", "EGFR", "MYC",
         "PTEN", "AKT1", "KRAS", "PIK3CA", "RAF1"]

list_id = submit_gene_list(genes, "cancer_genes")
print(f"Submitted gene list, ID: {list_id}")

# Query KEGG pathways
kegg = get_enrichment(list_id, "KEGG_2021_Human")
print(f"\nTop 5 KEGG pathways ({len(kegg)} total):")
for r in kegg[:5]:
    print(f"  {r['rank']}. {r['term']}")
    print(f"     p={r['p_value']:.2e}, adj_p={r['adj_p_value']:.2e}, "
          f"genes={','.join(r['genes'][:5])}...")

# Query GO Biological Process
go_bp = get_enrichment(list_id, "GO_Biological_Process_2023")
print(f"\nTop 5 GO Biological Processes ({len(go_bp)} total):")
for r in go_bp[:5]:
    print(f"  {r['rank']}. {r['term']}")
    print(f"     p={r['p_value']:.2e}, genes={','.join(r['genes'])}")

References

  • Enrichr Web App
  • Enrichr API Docs
  • Chen, E.Y. et al. (2013). "Enrichr: interactive and collaborative HTML5 gene list enrichment analysis tool." BMC Bioinformatics 14:128.
  • Kuleshov, M.V. et al. (2016). "Enrichr: a comprehensive gene set enrichment analysis web server 2016 update." Nucleic Acids Res. 44(W1).
  • Xie, Z. et al. (2021). "Gene Set Knowledge Discovery with Enrichr." Current Protocols 1(3):e90.