Awesome-Agent-Skills-for-Empirical-Research preprint-servers-guide

Guide to preprint servers across scientific disciplines

install
source · Clone the upstream repo
git clone https://github.com/brycewang-stanford/Awesome-Agent-Skills-for-Empirical-Research
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/brycewang-stanford/Awesome-Agent-Skills-for-Empirical-Research "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/43-wentorai-research-plugins/skills/literature/fulltext/preprint-servers-guide" ~/.claude/skills/brycewang-stanford-awesome-agent-skills-for-empirical-research-preprint-servers- && rm -rf "$T"
manifest: skills/43-wentorai-research-plugins/skills/literature/fulltext/preprint-servers-guide/SKILL.md
source content

Preprint Servers Guide

A comprehensive guide to preprint servers across all major academic disciplines, covering submission workflows, licensing, and programmatic access.

What Are Preprints?

Preprints are complete research manuscripts shared publicly before formal peer review. They enable rapid dissemination of findings, establish priority of discovery, and invite community feedback before journal submission.

Key characteristics:

  • Not peer reviewed (but increasingly moderated for basic quality)
  • Freely accessible to anyone
  • Citable with a DOI
  • Versioned (authors can post revisions)
  • Compatible with most journal submissions (check journal policy first)

Major Preprint Servers by Discipline

ServerDisciplinesOperatorModerationDOI Prefix
arXivPhysics, Math, CS, Econ, EE, Stats, Q-BioCornellLight screening10.48550
bioRxivBiology (all subfields)Cold Spring HarborBasic screening10.1101
medRxivClinical/health sciencesYale/BMJ/CSHLEnhanced screening10.1101
ChemRxivChemistryACSModerate screening10.26434
EarthArXivEarth/planetary sciencesCommunity-ledLight screening10.31223
PsyArXivPsychologyCOS/OSFLight screening10.31234
SocArXivSocial sciencesCOS/OSFLight screening10.31235
SSRNSocial sciences, law, economicsElsevierMinimalvarious
engrXivEngineeringCOS/OSFLight screening10.31224
EdArXivEducationCOS/OSFLight screening10.35542
Preprints.orgMultidisciplinaryMDPIBasic screening10.20944
Research SquareMultidisciplinarySpringer NatureIn Review service10.21203
TechRxivElectrical eng., CSIEEEModerate10.36227

Submission Workflow

arXiv Submission

  1. Create an account at arxiv.org and get endorsed (new users need endorsement in the relevant category).
  2. Prepare your manuscript:
    • LaTeX source (preferred): upload
      .tex
      + figures +
      .bbl
      as a single archive
    • PDF: accepted but LaTeX is strongly preferred
  3. Select categories: Choose a primary category (e.g.,
    cs.CL
    ) and optional cross-lists.
  4. Submit metadata: Title, abstract, authors, comments, journal-ref (if applicable).
  5. Wait for processing: Papers appear in the next daily posting (submissions before 14:00 ET on weekdays).
  6. Receive arXiv ID: Format
    YYMM.NNNNN
    (e.g.,
    2401.12345
    ).
# Download arXiv paper PDF programmatically
curl -o paper.pdf https://arxiv.org/pdf/2401.12345

# Get metadata via arXiv API
curl "http://export.arxiv.org/api/query?id_list=2401.12345"

bioRxiv/medRxiv Submission

  1. Create an account at biorxiv.org or medrxiv.org.
  2. Upload manuscript as a single Word or PDF file.
  3. Add metadata: Title, authors with ORCIDs, abstract, subject area.
  4. Select license: CC-BY, CC-BY-NC, CC-BY-ND, CC-BY-NC-ND, or CC0.
  5. Screening: bioRxiv screens for plagiarism, dual submission, and non-scientific content. medRxiv applies additional clinical content screening.
  6. Posting: Typically within 24-48 hours after screening.
# bioRxiv API: search recent preprints
import requests

response = requests.get(
    "https://api.biorxiv.org/details/biorxiv/2024-01-01/2024-01-31",
    params={"cursor": 0, "format": "json"}
)
papers = response.json()["collection"]
for p in papers[:5]:
    print(f"[{p['date']}] {p['title']} (doi: {p['doi']})")

Licensing Options

LicenseAllows ReuseAllows CommercialAllows DerivativesRequires Attribution
CC-BYYesYesYesYes
CC-BY-NCYesNoYesYes
CC-BY-NDYesYesNoYes
CC-BY-NC-NDYesNoNoYes
CC0YesYesYesNo

Recommendation: Use CC-BY for maximum openness and compatibility with funder mandates (NIH, Wellcome Trust, ERC). Use CC-BY-NC if you want to restrict commercial reuse.

Journal Policies on Preprints

Most major publishers now accept manuscripts previously posted as preprints:

  • Accepts preprints: Nature, Science, PNAS, Cell, Lancet, BMJ, PLOS, eLife, all IEEE journals, most ACM venues
  • Does not accept preprints: Some society journals in certain fields (check SHERPA/RoMEO at sherpa.ac.uk/romeo for specific policies)

Important considerations:

  • Some journals require you to update the preprint with a link to the published version.
  • Dual posting on multiple preprint servers may violate policies.
  • Embargo periods may apply for some clinical journals (especially medRxiv).

Programmatic Access Comparison

ServerAPI AvailableBulk DownloadOAI-PMHRate Limit
arXivYes (Atom)Yes (S3 bulk)Yes1 req/3 sec
bioRxivYes (REST)NoNoPolite use
SSRNNo public APINoNoN/A
OSF PreprintsYes (SHARE)YesYesPolite use

Best Practices

  1. Post before or at submission: Maximize the time for community feedback.
  2. Use ORCIDs: Link your preprint to your ORCID profile for discoverability.
  3. Update with journal DOI: After acceptance, add a comment or new version linking to the published article.
  4. Choose the right server: Use the discipline-specific server for maximum visibility within your community.
  5. Check funder requirements: Some funders (NIH, Plan S) mandate preprint posting for funded research.