Awesome-Agent-Skills-for-Empirical-Research qualitative-research-guide

Design and conduct qualitative research using grounded theory and case studies

install
source · Clone the upstream repo
git clone https://github.com/brycewang-stanford/Awesome-Agent-Skills-for-Empirical-Research
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/brycewang-stanford/Awesome-Agent-Skills-for-Empirical-Research "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/43-wentorai-research-plugins/skills/research/methodology/qualitative-research-guide" ~/.claude/skills/brycewang-stanford-awesome-agent-skills-for-empirical-research-qualitative-resea && rm -rf "$T"
manifest: skills/43-wentorai-research-plugins/skills/research/methodology/qualitative-research-guide/SKILL.md
source content

Qualitative Research Guide

A skill for designing and conducting rigorous qualitative research. Covers major qualitative traditions, data collection methods, coding and analysis techniques, and quality criteria for trustworthy qualitative findings.

Major Qualitative Traditions

Choosing an Approach

ApproachResearch Question TypeUnit of AnalysisSample SizeOutput
Grounded TheoryHow does a process work?Process/action20-60Theory
PhenomenologyWhat is the lived experience?Experience5-25Essence description
Case StudyHow/why does this case work?Bounded system1-5 casesCase description
EthnographyHow does this culture work?Cultural groupExtended fieldworkCultural portrait
NarrativeWhat is this person's story?Individual life1-5Narrative account
Thematic AnalysisWhat patterns exist in this data?Themes across dataVariableTheme map

Grounded Theory Process

Data Collection (interviews, observations)
       |
       v
Open Coding: Line-by-line coding of raw data
       |
       v
Axial Coding: Grouping codes into categories,
              identifying relationships
       |
       v
Selective Coding: Identifying the core category
                  that integrates all others
       |
       v
Theoretical Saturation: Stop when new data
                        no longer generates new codes
       |
       v
Substantive Theory: A grounded explanation of the phenomenon

Interview Design

Semi-Structured Interview Protocol

def create_interview_protocol(research_questions: list[str],
                                n_questions: int = 10) -> dict:
    """
    Generate a semi-structured interview protocol template.

    Args:
        research_questions: The study's research questions
        n_questions: Target number of interview questions
    """
    protocol = {
        'opening': {
            'rapport_building': [
                "Thank you for participating. Before we begin, could you "
                "tell me a little about yourself and your background?",
                "How did you first become involved in [topic]?"
            ],
            'time_estimate': '60-90 minutes'
        },
        'main_questions': [],
        'closing': {
            'wrap_up': [
                "Is there anything else you would like to share that we "
                "have not covered?",
                "Looking back, what stands out most to you about [topic]?",
                "Do you have any questions for me?"
            ]
        },
        'guidelines': [
            'Ask open-ended questions (how, what, tell me about)',
            'Avoid leading questions',
            'Use probes: "Can you give me an example?"',
            'Use follow-ups: "You mentioned X, tell me more about that"',
            'Allow silences -- do not rush to fill pauses',
            'Record field notes immediately after each interview'
        ]
    }

    # Generate question structure
    for i, rq in enumerate(research_questions):
        protocol['main_questions'].append({
            'research_question': rq,
            'interview_questions': [
                f'Grand tour question for RQ{i+1}',
                f'Follow-up probe for RQ{i+1}',
                f'Example-seeking probe for RQ{i+1}'
            ]
        })

    return protocol

Sampling Strategies

StrategyDescriptionWhen to Use
PurposiveSelect information-rich casesMost qualitative studies
Maximum variationSelect cases that differ on key dimensionsCapture range of experiences
SnowballParticipants refer othersHard-to-reach populations
TheoreticalDriven by emerging theoryGrounded theory studies
Critical caseSelect cases that are pivotalTesting theoretical propositions
ConvenienceReadily available participantsPilot studies only

Coding and Analysis

Thematic Analysis (Braun & Clarke, 2006)

def thematic_analysis_workflow(transcripts: list[str]) -> dict:
    """
    Outline the six phases of reflexive thematic analysis.
    """
    phases = {
        'phase_1_familiarization': {
            'actions': [
                'Read and re-read all transcripts',
                'Note initial impressions in a research journal',
                'Transcribe recordings if not already done'
            ],
            'output': 'Familiarity with data, initial notes'
        },
        'phase_2_coding': {
            'actions': [
                'Code every data segment systematically',
                'Use open coding (inductive) or deductive codes from framework',
                'Code inclusively -- same segment can have multiple codes',
                'Maintain a codebook with definitions and examples'
            ],
            'output': 'Coded dataset, codebook'
        },
        'phase_3_generating_themes': {
            'actions': [
                'Collate codes into potential themes',
                'Create a thematic map showing relationships',
                'Distinguish between semantic and latent themes'
            ],
            'output': 'Candidate themes and sub-themes'
        },
        'phase_4_reviewing_themes': {
            'actions': [
                'Check themes against coded extracts',
                'Check themes against entire dataset',
                'Merge, split, or discard themes as needed'
            ],
            'output': 'Refined thematic map'
        },
        'phase_5_defining_themes': {
            'actions': [
                'Write a detailed description of each theme',
                'Identify the essence of each theme',
                'Name themes concisely and informatively'
            ],
            'output': 'Theme definitions and names'
        },
        'phase_6_writing_up': {
            'actions': [
                'Weave together analytic narrative and data extracts',
                'Select vivid, compelling quotes for each theme',
                'Connect themes to research questions and literature'
            ],
            'output': 'Final analysis write-up'
        }
    }

    return {
        'phases': phases,
        'n_transcripts': len(transcripts),
        'estimated_time': f'{len(transcripts) * 4}-{len(transcripts) * 8} hours'
    }

Codebook Structure

codebook:
  - code: "ADAPT"
    definition: "Participant describes adapting their behavior in response to a challenge"
    inclusion_criteria: "Explicit mention of changing approach or strategy"
    exclusion_criteria: "Passive acceptance without behavioral change"
    example_quote: "I started doing things differently after that..."
    theme: "Resilience Strategies"

  - code: "BARR"
    definition: "Participant identifies a barrier or obstacle"
    inclusion_criteria: "Something that prevented or hindered progress"
    exclusion_criteria: "General complaints without specific barrier"
    example_quote: "The main thing holding me back was..."
    theme: "Challenges"

Quality Criteria

Trustworthiness (Lincoln & Guba, 1985)

CriterionQuantitative EquivalentStrategies
CredibilityInternal validityMember checking, triangulation, prolonged engagement
TransferabilityExternal validityThick description, purposive sampling
DependabilityReliabilityAudit trail, peer debriefing
ConfirmabilityObjectivityReflexivity journal, negative case analysis

Inter-Coder Reliability

For team-based coding, calculate Cohen's kappa or percent agreement on a subset of data (at least 10-20% of the corpus). Aim for kappa > 0.70 before independent coding proceeds.

Software Tools

  • NVivo: Full-featured qualitative analysis (commercial)
  • ATLAS.ti: Comprehensive coding and analysis (commercial)
  • MAXQDA: Mixed-methods capable (commercial)
  • Dedoose: Cloud-based, collaborative (subscription)
  • Taguette: Free, open-source qualitative coding
  • QualCoder: Free, open-source Python-based tool

Reporting Standards

Follow the COREQ (Consolidated Criteria for Reporting Qualitative Research) checklist: report researcher positionality, sampling strategy, data collection methods, analysis approach, and provide sufficient quotations to evidence each theme.