Awesome-Agent-Skills-for-Empirical-Research learning-science-guide

Evidence-based learning science principles for educational research and practice

install
source · Clone the upstream repo
git clone https://github.com/brycewang-stanford/Awesome-Agent-Skills-for-Empirical-Research
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/brycewang-stanford/Awesome-Agent-Skills-for-Empirical-Research "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/43-wentorai-research-plugins/skills/domains/education/learning-science-guide" ~/.claude/skills/brycewang-stanford-awesome-agent-skills-for-empirical-research-learning-science- && rm -rf "$T"
manifest: skills/43-wentorai-research-plugins/skills/domains/education/learning-science-guide/SKILL.md
source content

Learning Science Guide

A comprehensive skill for applying evidence-based learning science principles to educational research, instructional design, and teaching practice. Grounded in cognitive psychology and educational neuroscience.

Foundational Learning Theories

Cognitive Load Theory (Sweller, 1988)

Working memory has limited capacity. Effective instruction manages three types of cognitive load:

Load TypeDefinitionDesign Strategy
IntrinsicComplexity inherent to the materialSequence from simple to complex; chunk information
ExtraneousLoad from poor instructional designEliminate redundancy; use spatial contiguity
GermaneLoad from schema constructionUse worked examples; encourage self-explanation
# Estimate cognitive load using element interactivity
def estimate_intrinsic_load(elements: list, interactions: list) -> str:
    """
    elements: list of knowledge components
    interactions: list of (element_i, element_j) tuples that must be
                  processed simultaneously
    """
    interactivity = len(interactions) / max(len(elements), 1)
    if interactivity < 0.3:
        return "low intrinsic load - suitable for independent study"
    elif interactivity < 0.7:
        return "moderate intrinsic load - scaffold with worked examples"
    else:
        return "high intrinsic load - use fading strategy and segmenting"

# Example: teaching statistical regression
elements = ['variable', 'coefficient', 'intercept', 'residual', 'R-squared']
interactions = [('coefficient', 'variable'), ('intercept', 'residual'),
                ('coefficient', 'R-squared'), ('residual', 'R-squared')]
print(estimate_intrinsic_load(elements, interactions))

Constructivism and Active Learning

Constructivist approaches emphasize that learners build knowledge through experience. Key active learning strategies with measured effect sizes (Freeman et al., 2014, PNAS):

  • Think-Pair-Share: d = 0.41
  • Problem-Based Learning (PBL): d = 0.68
  • Peer Instruction (Mazur): d = 0.74
  • Inquiry-Based Labs: d = 0.52

Evidence-Based Study Methods

Retrieval Practice

Testing is not just assessment -- it is a powerful learning tool (Roediger & Karpicke, 2006). Implement the testing effect:

Study Session Structure:
  1. Initial encoding (read/watch)          - 15 min
  2. Free recall (close materials, write)   - 10 min
  3. Check accuracy and fill gaps           -  5 min
  4. Spaced retrieval after 1 day           - 10 min
  5. Spaced retrieval after 7 days          - 10 min
  6. Spaced retrieval after 30 days         - 10 min

Spaced Repetition Algorithms

Implement optimal review scheduling:

def next_review_interval(repetition: int, ease_factor: float = 2.5,
                          quality: int = 4) -> float:
    """
    SM-2 inspired algorithm.
    repetition: number of successful reviews
    ease_factor: item difficulty (>= 1.3)
    quality: response quality 0-5
    """
    if quality < 3:
        return 1  # reset to 1 day
    if repetition == 0:
        return 1
    elif repetition == 1:
        return 6
    else:
        interval = 6 * (ease_factor ** (repetition - 1))
        # Adjust ease factor
        new_ef = ease_factor + (0.1 - (5 - quality) * (0.08 + (5 - quality) * 0.02))
        return round(interval, 1)

# Schedule for a moderately difficult concept
for rep in range(6):
    days = next_review_interval(rep)
    print(f"Review {rep + 1}: after {days} days")

Interleaving and Desirable Difficulties

Research shows interleaved practice (mixing problem types) outperforms blocked practice for long-term retention (Rohrer & Taylor, 2007):

  • Blocked: AAABBBCCC -> short-term gains, long-term forgetting
  • Interleaved: ABCBACACB -> harder during practice, better retention

Assessment Design

Bloom's Taxonomy Alignment

Map learning objectives to assessment items across cognitive levels:

remember:
  verbs: [define, list, recall, identify]
  assessment: "Multiple choice, matching"
understand:
  verbs: [explain, summarize, compare, classify]
  assessment: "Short answer, concept maps"
apply:
  verbs: [solve, demonstrate, use, implement]
  assessment: "Problem sets, simulations"
analyze:
  verbs: [differentiate, organize, attribute, deconstruct]
  assessment: "Case studies, data interpretation"
evaluate:
  verbs: [judge, critique, justify, appraise]
  assessment: "Peer review, rubric-based essays"
create:
  verbs: [design, construct, produce, formulate]
  assessment: "Research projects, portfolios"

Item Analysis

After administering assessments, compute item difficulty (p-value) and discrimination index to validate question quality. Target p-values between 0.30 and 0.70 and discrimination indices above 0.30 for optimal measurement.

References

  • Sweller, J. (1988). Cognitive load during problem solving. Cognitive Science, 12(2), 257-285.
  • Freeman, S., et al. (2014). Active learning increases student performance in science. PNAS, 111(23), 8410-8415.
  • Roediger, H. L., & Karpicke, J. D. (2006). Test-enhanced learning. Psychological Science, 17(3), 249-255.