Awesome-Agent-Skills-for-Empirical-Research paper-reading-assistant

AI-assisted paper reading, PDF Q&A, and summarization workflows

install
source · Clone the upstream repo
git clone https://github.com/brycewang-stanford/Awesome-Agent-Skills-for-Empirical-Research
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/brycewang-stanford/Awesome-Agent-Skills-for-Empirical-Research "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/43-wentorai-research-plugins/skills/research/paper-review/paper-reading-assistant" ~/.claude/skills/brycewang-stanford-awesome-agent-skills-for-empirical-research-paper-reading-ass && rm -rf "$T"
manifest: skills/43-wentorai-research-plugins/skills/research/paper-review/paper-reading-assistant/SKILL.md
source content

Paper Reading Assistant

Systematic workflows for reading, annotating, and extracting insights from academic papers, including AI-assisted summarization and critical analysis techniques.

The Three-Pass Reading Method

Srinivasan Keshav's three-pass approach provides a structured way to read papers at increasing depth:

Pass 1: Survey (5-10 minutes)

Read only:

  1. Title, abstract, and keywords
  2. Introduction (first and last paragraphs only)
  3. Section headings (all of them)
  4. Conclusion
  5. Glance at figures and tables (read captions)
  6. Check the reference list for familiar papers

After Pass 1, you should know:

  • Category: Is this an empirical study, a theoretical contribution, a system paper, or a survey?
  • Context: What related work does it build on?
  • Correctness: Do the assumptions and claims seem reasonable?
  • Contributions: What are the main claimed contributions?
  • Clarity: Is the paper well-written?

Decision: Stop here if the paper is not relevant, or continue to Pass 2.

Pass 2: Comprehension (30-60 minutes)

Read the full paper, but skip proofs and complex derivations:

  1. Examine figures and tables carefully
  2. Mark unread references for later
  3. Annotate key claims, methods, and results
  4. Try to summarize each section in one sentence

After Pass 2, you should be able to:

  • Summarize the paper's main contribution to someone else
  • Identify the key evidence supporting the claims
  • List the paper's strengths and weaknesses

Pass 3: Recreation (1-4 hours)

For papers you need to deeply understand:

  1. Try to mentally re-derive the key results
  2. Challenge every assumption
  3. Identify implicit assumptions the authors do not state
  4. Think about how you would improve the work
  5. Compare the approach to alternatives

Structured Note-Taking Template

Use a consistent template for every paper you read:

# Paper Notes: [Short Title]

## Metadata
- **Title**: Full title
- **Authors**: First Author et al. (Year)
- **Venue**: Conference/Journal
- **DOI/URL**: link
- **Date read**: YYYY-MM-DD

## Summary (2-3 sentences)
What does this paper do, and what are the main findings?

## Problem
What problem does this paper address? Why is it important?

## Method
How do they approach the problem? Key technical details.

## Key Results
- Result 1: ...
- Result 2: ...
- Result 3: ...

## Strengths
- Strength 1: ...
- Strength 2: ...

## Weaknesses / Limitations
- Weakness 1: ...
- Weakness 2: ...

## Questions / Things I Don't Understand
- Question 1: ...

## Relevance to My Work
How does this connect to my research? What can I use?

## Key References to Follow Up
- [Author, Year] - Why it seems relevant
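
The template can also be generated programmatically so that every note starts from the same skeleton. The following is a minimal sketch, not part of the original skill: the helper name `new_note` and its parameters are hypothetical, chosen only for illustration.

```python
from datetime import date

# Section headings copied from the note-taking template.
NOTE_SECTIONS = [
    "Summary (2-3 sentences)",
    "Problem",
    "Method",
    "Key Results",
    "Strengths",
    "Weaknesses / Limitations",
    "Questions / Things I Don't Understand",
    "Relevance to My Work",
    "Key References to Follow Up",
]


def new_note(short_title, title, authors, venue, url, date_read=None):
    """Return a Markdown note skeleton (hypothetical helper, for illustration)."""
    # Default to today's date in the YYYY-MM-DD form the template asks for.
    date_read = date_read or date.today().isoformat()
    lines = [
        f"# Paper Notes: {short_title}",
        "",
        "## Metadata",
        f"- **Title**: {title}",
        f"- **Authors**: {authors}",
        f"- **Venue**: {venue}",
        f"- **DOI/URL**: {url}",
        f"- **Date read**: {date_read}",
    ]
    for heading in NOTE_SECTIONS:
        # Each section is left empty so it can be filled in while reading.
        lines += ["", f"## {heading}"]
    return "\n".join(lines) + "\n"
```

Writing the returned string to a file named after the short title keeps one note per paper; the naming scheme is left to the reader.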

AI-Assisted Paper Analysis

Summarization Prompts

Use structured prompts to extract specific information from papers:

# Prompt template for paper summarization
summarize_prompt = """Read the following academic paper and provide:

1. ONE-SENTENCE SUMMARY: The core contribution in a single sentence.

2. KEY FINDINGS (3-5 bullet points):
   - Finding 1 with specific numbers/results
   - Finding 2 ...

3. METHODOLOGY: Describe the approach in 2-3 sentences.

4. LIMITATIONS: List 2-3 limitations and note whether the authors acknowledge each one.

5. RELEVANCE: How does this relate to [your research topic]?

Paper text:
{paper_text}
"""

# Prompt for critical analysis
critique_prompt = """Analyze the following paper critically:

1. VALIDITY: Are the experimental design and statistical analyses sound?
   Identify any threats to internal/external validity.

2. NOVELTY: What is genuinely new? What is incremental?

3. REPRODUCIBILITY: Could you replicate this study from the description given?
   What information is missing?

4. ALTERNATIVE EXPLANATIONS: Are there alternative interpretations
   of the results that the authors do not consider?

5. FOLLOW-UP QUESTIONS: What would you want to investigate next?

Paper text:
{paper_text}
"""
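
Both templates expose a single `{paper_text}` placeholder, so `str.format` can fill them. Long papers may not fit in a model's context window, so the sketch below also truncates the text to a character budget. The function name `build_prompt` and the `max_chars` default are assumptions made for illustration; the budget is not a limit of any particular model.

```python
def build_prompt(template, paper_text, max_chars=60_000):
    """Fill a prompt template, truncating the paper text to a character budget.

    `max_chars` is an illustrative default, not a limit of any specific model.
    """
    if len(paper_text) > max_chars:
        # Keep the start and the end of the paper: the abstract/introduction
        # and the conclusion usually matter most for a summary.
        half = max_chars // 2
        head = paper_text[:half]
        tail = paper_text[len(paper_text) - half:]
        paper_text = head + "\n[... truncated ...]\n" + tail
    return template.format(paper_text=paper_text)


# Stand-in for summarize_prompt so this example is self-contained.
demo_template = "Summarize the paper in one sentence.\n\nPaper text:\n{paper_text}\n"
prompt = build_prompt(demo_template, "A" * 50 + "B" * 50, max_chars=20)
```

A token-based budget would be more precise than a character count, but it depends on the tokenizer of the model being used.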

PDF Processing Pipeline

import fitz  # PyMuPDF (pip install pymupdf)

def extract_paper_text(pdf_path):
    """Extract text from an academic paper PDF, grouped by detected heading."""
    sections = []
    current_section = {"heading": "Preamble", "text": ""}

    # The context manager closes the document even if extraction fails.
    with fitz.open(pdf_path) as doc:
        for page in doc:
            blocks = page.get_text("dict")["blocks"]
            for block in blocks:
                # Image blocks have no "lines" key; skip them.
                if "lines" not in block:
                    continue
                for line in block["lines"]:
                    spans = line["spans"]
                    text = "".join(span["text"] for span in spans).strip()
                    if not text:
                        continue  # Skip empty lines (also avoids max() on no spans)
                    font_size = max(span["size"] for span in spans)
                    is_bold = any("Bold" in span.get("font", "") for span in spans)

                    # Heuristic: a short bold line in a larger font is treated as
                    # a section heading. Tune the thresholds per paper template.
                    if is_bold and font_size > 11 and len(text) < 80:
                        if current_section["text"].strip():
                            sections.append(current_section)
                        current_section = {"heading": text, "text": ""}
                    else:
                        current_section["text"] += text + " "

    if current_section["text"].strip():
        sections.append(current_section)

    return sections

# Extract and display
if __name__ == "__main__":
    sections = extract_paper_text("paper.pdf")
    for s in sections:
        print(f"\n## {s['heading']}")
        print(s["text"][:200] + "...")
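
The list returned by `extract_paper_text` can be joined back into a single string for use as `{paper_text}` in the prompts. The sketch below drops back matter such as the reference list, which adds length without helping a summary. The helper name `sections_to_text` and the set of skipped headings are assumptions for illustration, and the matching only works when the heading heuristic detected those sections in the first place.

```python
def sections_to_text(
    sections,
    skip=("references", "bibliography", "acknowledgments", "acknowledgements"),
):
    """Join extracted sections into one string, dropping back matter.

    `sections` is a list of {"heading": str, "text": str} dicts.
    """
    parts = []
    for s in sections:
        # Strip leading section numbers such as "7." or "7 " before comparing.
        bare_heading = s["heading"].lstrip("0123456789. ").strip().lower()
        if bare_heading in skip:
            continue
        parts.append(f"## {s['heading']}\n{s['text'].strip()}")
    return "\n\n".join(parts)
```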

Batch Paper Processing

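import os
import json

def process_paper_batch(pdf_dir, output_file):
    """Process a batch of papers and save structured notes.

    Relies on extract_paper_text() from the PDF processing pipeline.
    """
    results = []

    # Sort for a deterministic order; os.listdir() returns arbitrary order.
    for filename in sorted(os.listdir(pdf_dir)):
        if not filename.lower().endswith(".pdf"):
            continue

        pdf_path = os.path.join(pdf_dir, filename)
        sections = extract_paper_text(pdf_path)

        # Title heuristic: the first detected heading. Fall back to the filename
        # when nothing was detected or only the default "Preamble" bucket exists.
        title = sections[0]["heading"] if sections else filename
        if title == "Preamble":
            title = filename

        # Find abstract
        abstract = ""
        for s in sections:
            if "abstract" in s["heading"].lower():
                abstract = s["text"].strip()
                break

        results.append({
            "filename": filename,
            "title": title,
            "abstract": abstract,
            "num_sections": len(sections),
            "total_chars": sum(len(s["text"]) for s in sections)
        })

    # Write UTF-8 explicitly so non-ASCII titles and abstracts stay readable.
    with open(output_file, "w", encoding="utf-8") as f:
        json.dump(results, f, indent=2, ensure_ascii=False)

    return results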
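
Because the extraction is heuristic, it helps to check the batch output before relying on it. The sketch below flags entries with no detected abstract or with very little text, which can happen when a PDF is a scan without a text layer. The function name `flag_incomplete` and the `min_chars` threshold are assumptions for illustration.

```python
def flag_incomplete(results, min_chars=2000):
    """Return (filename, reasons) pairs for entries that look unreliable.

    `results` uses the record layout written by process_paper_batch();
    `min_chars` is an arbitrary illustrative threshold.
    """
    flagged = []
    for r in results:
        reasons = []
        if not r["abstract"]:
            reasons.append("no abstract found")
        if r["total_chars"] < min_chars:
            reasons.append("very little text extracted")
        if reasons:
            flagged.append((r["filename"], reasons))
    return flagged
```

Flagged files are candidates for manual inspection or for different heading thresholds.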

Annotation Tools Comparison

| Tool | Platform | Highlights | PDF Annotation | AI Features | Collaboration |
|------|----------|------------|----------------|-------------|---------------|
| Zotero + ZotFile | All | Reference management + PDF | Yes | No (plugins available) | Group libraries |
| Paperpile | Web/Chrome | Google Docs integration | Yes | No | Shared folders |
| ReadCube Papers | All | Smart citations | Yes | Recommendations | Shared libraries |
| Semantic Reader | Web | AI-augmented reading | Yes | Inline explanations, TLDRs | No |
| Elicit | Web | AI paper search | No | Automated extraction | Tables |
| Scholarcy | Web | Flashcard summaries | Yes | Auto-summarization | No |

Reading Strategies by Paper Type

| Paper Type | Focus On | Time Budget |
|------------|----------|-------------|
| Seminal paper | Full three-pass reading, understand every detail | 3-4 hours |
| Survey/review | Section headings, taxonomy, open questions | 1-2 hours |
| Methods paper | Algorithm/procedure sections, pseudocode, evaluation | 1-2 hours |
| Results paper | Figures, tables, statistical tests, effect sizes | 30-60 min |
| Position paper | Arguments, assumptions, counterarguments | 30-60 min |
| Related work (peripheral) | Abstract + conclusion only (Pass 1) | 5-10 min |
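
The time budgets in the table can be summed to estimate how long a reading list will take. This is a small sketch under stated assumptions: the dictionary keys and the function name `estimate_minutes` are hypothetical labels, while the minute ranges are taken from the table.

```python
# (low, high) time budget in minutes per paper type, from the table.
TIME_BUDGET_MIN = {
    "seminal": (180, 240),
    "survey": (60, 120),
    "methods": (60, 120),
    "results": (30, 60),
    "position": (30, 60),
    "peripheral": (5, 10),
}


def estimate_minutes(paper_types):
    """Return the (low, high) total reading time in minutes for a list of types."""
    low = sum(TIME_BUDGET_MIN[t][0] for t in paper_types)
    high = sum(TIME_BUDGET_MIN[t][1] for t in paper_types)
    return low, high


# One seminal paper, one results paper, and one peripheral paper.
print(estimate_minutes(["seminal", "results", "peripheral"]))  # (215, 310)
```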

Building a Paper Reading Habit

  1. Set a regular schedule: Read 2-3 papers per week during dedicated time blocks.
  2. Maintain a reading log: Track papers read with dates, ratings, and one-line takeaways.
  3. Use a reference manager: Add papers to your library as you read them, with tags and notes.
  4. Discuss papers: Join or start a reading group; explaining papers to others deepens understanding.
  5. Connect to your research: End every reading session by writing one sentence about how the paper relates to your own work.
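
The reading log from item 2 can be kept as a plain CSV file that any spreadsheet or reference manager can import. The following is a minimal sketch: the function name `log_paper`, the column names, and the file layout are assumptions for illustration rather than part of the original skill.

```python
import csv
from datetime import date
from pathlib import Path

LOG_FIELDS = ["date", "title", "rating", "takeaway"]


def log_paper(log_path, title, rating, takeaway, date_read=None):
    """Append one entry to a CSV reading log, writing the header if needed."""
    path = Path(log_path)
    # A missing or empty file still needs its header row.
    is_new = not path.exists() or path.stat().st_size == 0
    with path.open("a", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=LOG_FIELDS)
        if is_new:
            writer.writeheader()
        writer.writerow({
            "date": date_read or date.today().isoformat(),
            "title": title,
            "rating": rating,
            "takeaway": takeaway,
        })
```

Calling `log_paper("reading_log.csv", "Short title", 4, "One-line takeaway")` at the end of each session keeps the log current with little effort.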