install
source · Clone the upstream repo
git clone https://github.com/mdbabumiamssm/LLMs-Universal-Life-Science-and-Clinical-Skills-
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/mdbabumiamssm/LLMs-Universal-Life-Science-and-Clinical-Skills- "$T" && mkdir -p ~/.claude/skills && cp -r "$T/Skills/Genomics/Single_Cell_RNA_QC" ~/.claude/skills/mdbabumiamssm-llms-universal-life-science-and-clinical-skills-single-cell-rna-qc-d88417 && rm -rf "$T"
manifest:
Skills/Genomics/Single_Cell_RNA_QC/SKILL.mdsource content
<!--
# COPYRIGHT NOTICE
# This file is part of the "Universal Biomedical Skills" project.
# Copyright (c) 2026 MD BABU MIA, PhD <md.babu.mia@mssm.edu>
# All Rights Reserved.
#
# This code is proprietary and confidential.
# Unauthorized copying of this file, via any medium is strictly prohibited.
#
# Provenance: Authenticated by MD BABU MIA
-->
name: scrna-qc description: Execute the MAD-based single-cell RNA-seq QC workflow (scripts + Python API) to filter low-quality cells and emit reports plus filtered AnnData files. measurable_outcome: Produce filtered .h5ad files, before/after plots, and qc_summary.json within 20 minutes per dataset. allowed-tools:
- read_file
- run_shell_command reliability:
- source: https://github.com/scverse/scanpy score: 0.90 rationale: >- Scanpy repository maintained by the scverse core team with actively tested QC utilities (mito/ribo scoring, MAD filtering).
- source: https://github.com/theislab/single-cell-best-practices score: 0.88 rationale: >- Theis Lab best-practices playbook detailing MAD-based QC thresholds, mitochondrial cutoffs, and reproducibility checklists for scRNA-seq.
At-a-Glance
- description (10-20 chars): QC autopilot
- keywords: scRNAseq, MAD, h5ad, QC, plots
Workflow
- Accept
, 10x.h5ad
, or 10x directory inputs; set mitochondrial/ribosomal patterns as needed..h5 - Run
(CLI) or callqc_analysis.py
helpers to compute metrics, apply MAD thresholds, and filter cells/genes.qc_core - Generate standard plots (metrics before/after, threshold overlays) plus filtered data artifacts.
- Document parameters (mad_counts/genes/mt, mt_threshold, min_cells, log1p flag) inside the summary JSON.
- Provide guidance on next steps (doublet detection, downstream analysis).
Guardrails
- Adjust MT% expectations for tissue context; avoid over-filtering rare populations.
- This workflow is QC only—doublet handling and batch correction stay separate.
- Keep reproducibility by storing command invocations and environment info.
References
- See
,README.md
,qc_core.py
, andqc_analysis.py
for API usage and schema details.qc_plotting.py - GitHub provenance: Scanpy QC modules and scverse best-practices notebooks.