tooluniverse-metabolomics-analysis by mims-harvard/tooluniverse
npx skills add https://github.com/mims-harvard/tooluniverse --skill tooluniverse-metabolomics-analysis从代谢物鉴定到定量、统计分析、通路解析以及与其他组学层面的整合,提供全面的代谢组学数据分析。
触发条件:
示例问题:
| 能力 | 描述 |
|---|---|
| 数据导入 | LC-MS、GC-MS、NMR、靶向/非靶向平台 |
| 代谢物鉴定 | 匹配 HMDB、KEGG、PubChem、谱库 |
| 质量控制 | 峰质量、空白扣除、内标归一化 |
| 归一化 | 概率商、总离子流、内标 |
广告位招租
在这里展示您的产品或服务
触达数万 AI 开发者,精准高效
| 统计分析 | 单变量和多变量分析(PCA、PLS-DA、OPLS-DA) |
| 差异分析 | 识别显著的代谢物变化 |
| 通路富集 | KEGG、Reactome、BioCyc 代谢通路分析 |
| 代谢物-酶整合 | 与表达数据关联 |
| 通量分析 | 代谢通量平衡分析(FBA) |
| 生物标志物发现 | 多代谢物特征 |
输入:代谢组学数据(峰表或谱图)
|
v
阶段 1:数据导入与代谢物鉴定
|-- 加载峰表或处理原始谱图
|-- 根据精确质量(+/- 5 ppm)将特征匹配到 HMDB、KEGG
|-- 置信度评分(级别 1-4)
|
v
阶段 2:质量控制与过滤
|-- QC 样本中的 CV(<30%)
|-- 空白扣除(样本/空白 > 3)
|-- 去除缺失值 >50% 的特征
|
v
阶段 3:归一化
|-- 样本层面:TIC、PQN 或内标
|-- 转换:log2、Pareto 或自动缩放
|-- 批次效应校正(如果有多批次)
|
v
阶段 4:探索性分析
|-- PCA 用于样本聚类
|-- PLS-DA 用于有监督的分离
|-- 离群值检测
|
v
阶段 5:差异分析
|-- t 检验 / ANOVA / Wilcoxon
|-- 倍数变化 + FDR 校正
|-- 火山图、热图
|
v
阶段 6:通路分析
|-- 代谢物集富集分析(MSEA)
|-- KEGG/Reactome 通路映射
|-- 通路拓扑(枢纽/瓶颈代谢物)
|
v
阶段 7:多组学整合
|-- 代谢物-酶 Spearman 相关性
|-- 通路水平一致性评分
|-- 代谢通量推断
|
v
阶段 8:生成报告
|-- 汇总统计、差异代谢物
|-- 通路图、生物标志物组合
加载峰表(CSV/TSV)或处理原始谱图(mzML)。根据精确质量(+/- 5 ppm)将特征匹配到 HMDB。分配置信度级别:L1(标准品匹配)、L2(MS/MS 匹配)、L3(仅质量匹配)、L4(未知)。
评估 QC 样本中的 CV(拒绝 >30%),计算空白比值(保留 >3 倍空白),过滤缺失值 >50% 的特征。检查内标回收率(95-105% 可接受)。
提供三种方法:TIC(简单,假设总丰度相似)、PQN(对大幅变化稳健,推荐使用)、内标法(使用加标标准品时最准确)。随后进行 log2 转换或 Pareto 缩放。
PCA 揭示样本分组和批次效应。PLS-DA 提供有监督的分离(报告 R2 和 Q2 以评估模型质量)。标记并调查离群值。
Welch's t 检验(两组)或 ANOVA(多组),并进行 Benjamini-Hochberg FDR 校正。显著性阈值:adj. p < 0.05 且 |log2FC| > 1.0。
将差异代谢物映射到 KEGG 化合物 ID。执行 MSEA 进行通路富集。考虑拓扑结构:位于通路枢纽(高度/中介中心性高)的代谢物影响更大。
将代谢物水平与酶表达进行相关性分析(Spearman)。预期:底物-酶负相关(消耗),产物-酶正相关(生成)。使用代谢物+基因的联合证据对通路失调进行评分。
完整示例输出请参见 report_template.md。
| 技能 | 用途 | 阶段 |
|---|---|---|
tooluniverse-gene-enrichment | 通路富集 | 阶段 6 |
tooluniverse-rnaseq-deseq2 | 用于整合的酶表达数据 | 阶段 7 |
tooluniverse-proteomics-analysis | 用于整合的蛋白质水平数据 | 阶段 7 |
tooluniverse-multi-omics-integration | 全面整合 | 阶段 7 |
| 组件 | 要求 |
|---|---|
| 代谢物 | 至少 50 个已鉴定的代谢物 |
| 重复 | 每个条件至少 3 个重复 |
| 质量控制 | QC 样本中 CV < 30%,进行空白扣除 |
| 统计检验 | t 检验或 Wilcoxon 检验并进行 FDR 校正 |
| 通路分析 | 使用 KEGG 或 Reactome 进行 MSEA |
| 报告 | 包含质量控制、差异代谢物、通路、可视化 |
方法:
数据库:
每周安装次数
127
代码仓库
GitHub 星标数
1.2K
首次出现
2026年2月19日
安全审计
安装于
codex124
gemini-cli123
opencode123
github-copilot122
cursor120
kimi-cli119
Comprehensive analysis of metabolomics data from metabolite identification through quantification, statistical analysis, pathway interpretation, and integration with other omics layers.
Triggers :
Example Questions :
| Capability | Description |
|---|---|
| Data Import | LC-MS, GC-MS, NMR, targeted/untargeted platforms |
| Metabolite Identification | Match to HMDB, KEGG, PubChem, spectral libraries |
| Quality Control | Peak quality, blank subtraction, internal standard normalization |
| Normalization | Probabilistic quotient, total ion current, internal standards |
| Statistical Analysis | Univariate and multivariate (PCA, PLS-DA, OPLS-DA) |
| Differential Analysis | Identify significant metabolite changes |
| Pathway Enrichment | KEGG, Reactome, BioCyc metabolic pathway analysis |
| Metabolite-Enzyme Integration | Correlate with expression data |
| Flux Analysis | Metabolic flux balance analysis (FBA) |
| Biomarker Discovery | Multi-metabolite signatures |
Input: Metabolomics Data (Peak Table or Spectra)
|
v
Phase 1: Data Import & Metabolite Identification
|-- Load peak table or process raw spectra
|-- Match features to HMDB, KEGG (accurate mass +/- 5 ppm)
|-- Confidence scoring (Level 1-4)
|
v
Phase 2: Quality Control & Filtering
|-- CV in QC samples (<30%)
|-- Blank subtraction (sample/blank > 3)
|-- Remove features with >50% missing
|
v
Phase 3: Normalization
|-- Sample-wise: TIC, PQN, or internal standards
|-- Transformation: log2, Pareto, or auto-scaling
|-- Batch effect correction (if multi-batch)
|
v
Phase 4: Exploratory Analysis
|-- PCA for sample clustering
|-- PLS-DA for supervised separation
|-- Outlier detection
|
v
Phase 5: Differential Analysis
|-- t-test / ANOVA / Wilcoxon
|-- Fold change + FDR correction
|-- Volcano plots, heatmaps
|
v
Phase 6: Pathway Analysis
|-- Metabolite set enrichment (MSEA)
|-- KEGG/Reactome pathway mapping
|-- Pathway topology (hub/bottleneck metabolites)
|
v
Phase 7: Multi-Omics Integration
|-- Metabolite-enzyme Spearman correlation
|-- Pathway-level concordance scoring
|-- Metabolic flux inference
|
v
Phase 8: Generate Report
|-- Summary statistics, differential metabolites
|-- Pathway diagrams, biomarker panel
Load peak tables (CSV/TSV) or process raw spectra (mzML). Match features to HMDB by accurate mass (+/- 5 ppm). Assign confidence levels: L1 (standard match), L2 (MS/MS), L3 (mass only), L4 (unknown).
Assess CV in QC samples (reject >30%), compute blank ratios (keep >3x blank), filter features with >50% missing values. Check internal standard recovery (95-105% acceptable).
Three methods available: TIC (simple, assumes similar total abundance), PQN (robust to large changes, recommended), Internal Standard (most accurate with spiked standards). Follow with log2 transform or Pareto scaling.
PCA reveals sample grouping and batch effects. PLS-DA provides supervised separation (report R2 and Q2 for model quality). Flag and investigate outliers.
Welch's t-test (two groups) or ANOVA (multiple groups) with Benjamini-Hochberg FDR correction. Significance thresholds: adj. p < 0.05 and |log2FC| > 1.0.
Map differential metabolites to KEGG compound IDs. Perform MSEA for pathway enrichment. Consider topology: metabolites at pathway hubs (high degree/betweenness centrality) have greater impact.
Correlate metabolite levels with enzyme expression (Spearman). Expected: substrate-enzyme negative correlation (consumption), product-enzyme positive correlation (production). Score pathway dysregulation using combined metabolite + gene evidence.
See report_template.md for full example output.
| Skill | Used For | Phase |
|---|---|---|
tooluniverse-gene-enrichment | Pathway enrichment | Phase 6 |
tooluniverse-rnaseq-deseq2 | Enzyme expression for integration | Phase 7 |
tooluniverse-proteomics-analysis | Protein levels for integration | Phase 7 |
tooluniverse-multi-omics-integration | Comprehensive integration | Phase 7 |
| Component | Requirement |
|---|---|
| Metabolites | At least 50 identified metabolites |
| Replicates | At least 3 per condition |
| QC | CV < 30% in QC samples, blank subtraction |
| Statistical test | t-test or Wilcoxon with FDR correction |
| Pathway analysis | MSEA with KEGG or Reactome |
| Report | QC, differential metabolites, pathways, visualizations |
Methods :
Databases :
Weekly Installs
127
Repository
GitHub Stars
1.2K
First Seen
Feb 19, 2026
Security Audits
Gen Agent Trust HubPassSocketPassSnykPass
Installed on
codex124
gemini-cli123
opencode123
github-copilot122
cursor120
kimi-cli119
Excel财务建模规范与xlsx文件处理指南:专业格式、零错误公式与数据分析
46,700 周安装
Rust 测试模式指南:TDD 方法论、单元测试与性能基准测试
1,100 周安装
MCP服务器模式指南:构建AI助手工具与资源的最佳实践
1,100 周安装
智能OCR文字识别工具 - 支持100+语言,高精度提取图片/PDF/手写文本
1,100 周安装
DeerFlow AI智能体平台HTTP API使用指南 - 集成Claude与LangGraph实现多智能体协作
1,100 周安装
知识综合工具:企业搜索最后一公里,多源信息去重与可信答案生成
1,100 周安装
Node.js Express Server 最佳实践指南:构建健壮REST API与数据库集成
1,100 周安装