LLMs-Universal-Life-Science-and-Clinical-Skills- metabolomics-quantification
install
source · Clone the upstream repo
git clone https://github.com/mdbabumiamssm/LLMs-Universal-Life-Science-and-Clinical-Skills-
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/mdbabumiamssm/LLMs-Universal-Life-Science-and-Clinical-Skills- "$T" && mkdir -p ~/.claude/skills && cp -r "$T/Skills/Metabolomics/metabolomics-quantification" ~/.claude/skills/mdbabumiamssm-llms-universal-life-science-and-clinical-skills-metabolomics-quant && rm -rf "$T"
manifest:
Skills/Metabolomics/metabolomics-quantification/SKILL.mdsource content
📏 Metabolomics Quantification
Feature quantification with missing value imputation (min/median/KNN) and normalization (TIC/median/log).
CLI Reference
python omicsclaw.py run met-quantify --demo python omicsclaw.py run met-quantify --input <features.csv> --output <dir>
Parameters
| Parameter | Default | Description |
|---|---|---|
| | min, median, or knn |
| | tic, median, or log |
Why This Exists
- Without it: Downstream models crash when encountering missing LC/MS peak values
- With it: Recovers matrix completeness via K-Nearest Neighbors (KNN) or Median Imputation
- Why OmicsClaw: Centralized, reproducible preprocessing steps tailored for sparse metabolomic data
Workflow
- Calculate: Assess inherent missing value distributions per feature.
- Execute: Impute empty values using the user-defined algorithm (KNN, Min, Median).
- Assess: Apply normalization logic (TIC, MAD) to align global gradients.
- Generate: Output structural completed data matrices.
- Report: Produce imputation QC boxplots before and after correction.
Example Queries
- "Impute missing values using KNN"
- "Normalize this feature table with TIC"
Output Structure
output_directory/ ├── report.md ├── result.json ├── quantified.csv ├── figures/ │ └── imputation_boxplot.png ├── tables/ │ └── imputed_matrix.csv └── reproducibility/ ├── commands.sh ├── environment.yml └── checksums.sha256
Safety
- Local-first: Strict offline processing without external upload.
- Disclaimer: Requires OmicsClaw reporting structures and disclaimers.
- Audit trail: Hyperparameters and operational flow states are logged fully.
Integration with Orchestrator
Trigger conditions:
- Automatically invoked dynamically based on tool metadata and user intent matching.
Chaining partners:
— Upstream raw data matrix creationpeak-detection
— Downstream univariate/multivariate testingmet-diff
Citations
- NOREVA — normalization evaluation