ToolUniverse工具描述优化指南 - 提升JSON配置文件清晰度与用户体验

devtu-optimize-descriptions by mims-harvard/tooluniverse

166 周安装量

1,200 GitHub Stars

GitHub

安装命令

npx skills add https://github.com/mims-harvard/tooluniverse --skill devtu-optimize-descriptions

软件工程开发技术文档

🇨🇳中文介绍

ToolUniverse 工具描述优化

优化 ToolUniverse JSON 配置文件中的工具描述，确保其清晰、完整且用户友好。

何时应用此技能

在以下情况时使用：

审查新创建的工具描述
用户询问“这些工具容易理解吗？”
改进现有工具文档
向 ToolUniverse 添加新工具
用户提及工具可用性、清晰度或文档

快速优化清单

工具描述审查：
- [ ] 已说明先决条件（软件包、API 密钥、账户）
- [ ] 关键缩写词在首次使用时已展开
- [ ] 必需参数与可选参数清晰
- [ ] 互斥选项已编号/标记
- [ ] 参数指导包含权衡说明
- [ ] 过滤器语法显示了可用字段
- [ ] 在相关处添加了文件大小警告
- [ ] 示例展示了实际用法

关键改进项（立即修复）

1. 明确必需的输入要求

问题：用户不知道是需要提供一种输入还是所有输入。

修复：对于互斥选项，使用“必需：提供一种输入类型”。

// 优化前
"description": "处理 BED 区域、基序或基因列表..."

// 优化后
"description": "处理基因组数据。**必需：提供一种输入类型** - (1) BED 区域, (2) DNA 基序, 或 (3) 基因列表。分析..."

对选项进行编号，并使用粗体突出“必需”。

2. 为首个工具添加先决条件

问题：用户不知道在使用前需要安装/配置什么。

修复：为每个工具系列中的第一个工具添加先决条件说明。

"description": "查询单细胞数据。先决条件：需要 'package-name'（安装命令：pip install tooluniverse[extra]）。返回..."

广告位招租

在这里展示您的产品或服务

触达数万 AI 开发者，精准高效

联系我们

3. 展开关键缩写词

问题：新用户不理解技术术语。

修复：在首次使用时以“缩写词（全称）”的格式展开。

需要展开的常见缩写词：

H5AD → 基于 HDF5 的 AnnData
RPM → 每百万读数
TSS → 转录起始位点
TAD → 拓扑关联域
DRS → 数据存储库服务
API 名称（MACS2、IUPAC 等）

// 优化前 "description": "下载 H5AD 文件..."

// 优化后
"description": "下载 H5AD（基于 HDF5 的 AnnData）文件..."

高优先级改进项

4. 增强过滤器参数描述

问题：用户不知道有哪些可用字段或使用什么语法。

修复：列出操作符、常用字段，并提供多个示例。

"parameter_name": {
  "type": "string",
  "description": "使用类 SQL 语法进行过滤。格式：'字段 == \"值\"'。操作符：==, !=, in, <, >, <=, >=。使用 'and'/'or' 组合。常用字段：tissue, cell_type, disease, assay, sex, ethnicity。示例：'tissue == \"lung\"', 'disease == \"COVID-19\" and tissue == \"lung\"', 'cell_type in [\"T cell\", \"B cell\"]'。"
}

语法格式
可用操作符
列出 5-10 个常用字段
2-3 个不同的示例

5. 改进参数指导

问题：用户不知道选择哪个值或存在哪些权衡。

修复：解释每个值的含义并提供建议。

// 优化前
"threshold": "Q 值阈值 (05=1e-5, 10=1e-10, 20=1e-20)"

// 优化后
"threshold": "峰值检测严格度。'05'=1e-5（宽松，峰值更多，特征更宽），'10'=1e-10（中等，平衡），'20'=1e-20（严格，高置信度，窄峰）。默认值 '05' 适用于大多数分析。值越高 = 峰值越少但置信度越高。"

对于每个参数选项，解释：

实际含义
何时使用
涉及的权衡
推荐的默认值

6. 为互斥选项编号

问题：用户提供了多个选项，但只允许一个。

修复：将选项标记为“选项 1”、“选项 2”等。

"bed_data": {
  "description": "**选项 1**：BED 格式区域（制表符分隔：chr, start, end）。示例：'chr1\\t1000\\t2000'。"
},
"motif": {
  "description": "**选项 2**：IUPAC 符号表示的 DNA 序列基序。使用：A/T/G/C, W=A|T, S=G|C。示例：'CANNTG'。"
},
"gene_list": {
  "description": "**选项 3**：基因符号数组。示例：['TP53', 'MDM2']。"
}

中优先级改进项

7. 添加文件大小警告

对于下载或返回大文件的工具：

"description": "下载接触矩阵。注意：文件可能很大（GB 级别），下载前请检查元数据中的 file_size。返回..."

8. 明确 Web 表单与 API 结果

当工具返回提交 URL 而非直接结果时：

"description": "执行富集分析。注意：返回提交 URL（基于 Web 表单的分析）。分析..."

9. 解释文件类型差异

对于具有多种格式选项的工具：

"file_type": "文件格式。常见类型：'cooler'（多分辨率接触矩阵），'pairs'（比对读取对），'hic'（Juicer 格式），'mcool'（多分辨率 cooler）。"

{
  "name": "Tool_operation_name",
  "type": "ToolClassName",
  "description": "[动作动词] 以 [目的]。[如果是第一个工具，添加先决条件]。[关键数据/特性]。[如果是互斥的，说明必需输入]。[关于限制/要求的说明]。用于：[用例 1]、[用例 2]、[用例 3]。",
  "parameter": {
    "properties": {
      "param_name": {
        "type": "string",
        "description": "[它的作用]。[如果适用，说明格式/语法]。[包含权衡的选项]。[示例]。[如果适用，提供建议]。"
      }
    }
  }
}

描述质量检查清单

第一句话目的明确
技术术语已展开
先决条件已预先说明
示例展示了实际用法
“用于：”部分列出了 3-5 个具体用例

必需输入已明确标记
参数选择已解释
已注明限制（文件大小、Web 表单等）
过滤器可用字段已列出
已推荐默认值

新用户无需外部文档即可理解
用户知道需要提供什么
用户可以做出明智的参数选择
错误预防（互斥选项已标记）

要验证描述质量，请询问：

新用户能理解工具的作用吗？
- 仅阅读描述（无文档）
- 应在 30 秒内清晰明了
用户能第一次就提供正确的输入吗？
- 必需输入显而易见
- 格式/语法清晰
- 互斥选项已标记
用户能选择合适的参数吗？
- 权衡已解释
- 已提供建议
- 默认值有理由
先决条件明显吗？
- 安装说明
- API 密钥/账户
- 文件大小警告

按工具类型划分的常见模式

"description": "从 [来源] 查询 [数据类型]。[先决条件]。按 [标准] 过滤。返回 [输出]。[数据规模]。用于：[发现]、[分析]、[特定研究任务]。"

查询的内容
如何过滤
返回的结果
数据规模
先决条件

"description": "从 [来源] 下载 [文件类型]。[格式详情]。[文件大小警告]。[身份验证要求]。用于：[离线分析]、[自定义处理]、[集成]。"

可用的文件格式
大小警告
身份验证需求
文件内容

"description": "分析 [输入类型] 以找到 [结果]。**必需：提供一种输入类型** - (1) [选项], (2) [选项], (3) [选项]。与 [数据库/背景] 进行比较。[结果格式]。用于：[识别]、[发现]、[预测]。"

输入要求清晰
选项已编号
与什么进行比较
能学到什么

更新描述后，验证 JSON 语法：

# 验证所有工具 JSON
python3 -m json.tool src/tooluniverse/data/your_tools.json > /dev/null && echo "✓ 有效"

# 检查类别中的所有工具
for f in src/tooluniverse/data/*_tools.json; do
  python3 -m json.tool "$f" > /dev/null && echo "✓ $f 有效" || echo "✗ $f 无效"
done

示例：优化前与优化后

优化前（不清晰）：

{
  "name": "Tool_enrichment",
  "description": "使用工具执行富集以查找因子。",
  "parameter": {
    "properties": {
      "bed": {"description": "BED 数据"},
      "motif": {"description": "基序"},
      "genes": {"description": "基因"},
      "threshold": {"description": "阈值"}
    }
  }
}

优化后（清晰）：

{
  "name": "Tool_enrichment_analysis",
  "description": "识别数据中富集的转录因子。**必需：提供一种输入类型** - (1) BED 基因组区域, (2) DNA 序列基序（IUPAC 符号）, 或 (3) 基因符号列表。与 400,000+ 个 ChIP-seq 实验进行比较。返回带有富集分数的排名蛋白质。注意：返回提交 URL（基于 Web 的分析）。用于：识别区域调控因子、查找与基序结合的蛋白质、发现调控基因的转录因子。",
  "parameter": {
    "properties": {
      "bed_data": {
        "description": "**选项 1**：BED 格式区域（制表符分隔：chr, start, end）。用于查找与基因组区域结合的蛋白质。示例：'chr1\\t1000\\t2000'。"
      },
      "motif": {
        "description": "**选项 2**：IUPAC 符号表示的 DNA 基序（A/T/G/C, W=A|T, S=G|C, M=A|C, K=G|T, R=A|G, Y=C|T）。示例：'CANNTG'（E-box）。"
      },
      "gene_list": {
        "description": "**选项 3**：基因符号数组或单个基因。示例：['TP53', 'MDM2', 'CDKN1A']。"
      },
      "threshold": {
        "description": "峰值严格度。'05'=1e-5（宽松，峰值更多），'10'=1e-10（中等），'20'=1e-20（严格，高置信度）。默认值 '05' 适用于大多数分析。"
      }
    }
  }
}

优化优先级顺序：

关键（立即修复）：
- 明确必需输入
- 添加先决条件
- 展开缩写词
高（尽快修复）：
- 增强过滤器描述
- 改进参数指导
- 为互斥选项编号
中（锦上添花）：
- 添加文件大小警告
- 明确 Web 表单与 API
- 解释文件类型差异

预期影响：用户错误减少 50-75%，首次成功使用时间加快 50-67%。

🇺🇸English

ToolUniverse Tool Description Optimization

Optimize tool descriptions in ToolUniverse JSON configuration files to ensure they are clear, complete, and user-friendly.

When to Apply This Skill

Use when:

Reviewing newly created tool descriptions
User asks "are these tools easy to understand?"
Improving existing tool documentation
Adding new tools to ToolUniverse
User mentions tool usability, clarity, or documentation

Quick Optimization Checklist

Tool Description Review:
- [ ] Prerequisites stated (packages, API keys, accounts)
- [ ] Critical abbreviations expanded on first use
- [ ] Required vs optional parameters clear
- [ ] Mutually exclusive options numbered/labeled
- [ ] Parameter guidance includes trade-offs
- [ ] Filter syntax shows available fields
- [ ] File size warnings where relevant
- [ ] Examples show realistic usage

Critical Improvements (Fix Immediately)

1. Clarify Required Input Requirements

Problem : Users don't know if they need ONE input or ALL inputs.

Fix : Use "Required: Provide ONE input type " for mutually exclusive options.

// Before
"description": "Process BED regions, motifs, or gene lists..."

// After
"description": "Process genomic data. **Required: Provide ONE input type** - (1) BED regions, (2) DNA motif, or (3) gene list. Analyzes..."

Number the options and use bold for "Required".

2. Add Prerequisites to First Tool

Problem : Users don't know what to install/configure before use.

Fix : Add prerequisites note to first tool in each family.

"description": "Query single-cell data. Prerequisites: Requires 'package-name' (install: pip install tooluniverse[extra]). Returns..."

Include:

Package installation command
API key requirements
Account creation instructions

3. Expand Critical Abbreviations

Problem : New users don't understand technical terms.

Fix : Expand on first use with format: "Abbreviation (Full Name)".

Common abbreviations to expand:

H5AD → HDF5-based AnnData
RPM → Reads Per Million
TSS → Transcription Start Site
TAD → Topologically Associating Domain
DRS → Data Repository Service
API names (MACS2, IUPAC, etc.)

// Before "description": "Download H5AD files..."

// After
"description": "Download H5AD (HDF5-based AnnData) files..."

High-Priority Improvements

4. Enhance Filter Parameter Descriptions

Problem : Users don't know what fields are available or what syntax to use.

Fix : List operators, common fields, and provide multiple examples.

"parameter_name": {
  "type": "string",
  "description": "Filter using SQL-like syntax. Format: 'field == \"value\"'. Operators: ==, !=, in, <, >, <=, >=. Combine with 'and'/'or'. Common fields: tissue, cell_type, disease, assay, sex, ethnicity. Examples: 'tissue == \"lung\"', 'disease == \"COVID-19\" and tissue == \"lung\"', 'cell_type in [\"T cell\", \"B cell\"]'."
}

Include:

Syntax format
Available operators
List of 5-10 common fields
2-3 diverse examples

5. Improve Parameter Guidance

Problem : Users don't know which value to choose or what trade-offs exist.

Fix : Explain what each value means and provide recommendations.

// Before
"threshold": "Q-value threshold (05=1e-5, 10=1e-10, 20=1e-20)"

// After
"threshold": "Peak calling stringency. '05'=1e-5 (permissive, more peaks, broad features), '10'=1e-10 (moderate, balanced), '20'=1e-20 (strict, high confidence, narrow peaks). Default '05' suitable for most analyses. Higher values = fewer but more confident peaks."

For each parameter option, explain:

What it means practically
When to use it
Trade-offs involved
Recommended default

6. Number Mutually Exclusive Options

Problem : Users provide multiple options when only one is allowed.

Fix : Label options as "Option 1 ", "Option 2 ", etc.

"bed_data": {
  "description": "**Option 1**: BED format regions (tab-separated: chr, start, end). Example: 'chr1\\t1000\\t2000'."
},
"motif": {
  "description": "**Option 2**: DNA sequence motif in IUPAC notation. Use: A/T/G/C, W=A|T, S=G|C. Example: 'CANNTG'."
},
"gene_list": {
  "description": "**Option 3**: Gene symbols as array. Example: ['TP53', 'MDM2']."
}

Medium-Priority Improvements

7. Add File Size Warnings

For tools that download or return large files:

"description": "Download contact matrices. Note: Files can be large (GBs), check file_size in metadata before downloading. Returns..."

8. Clarify Web Form vs API Results

When tool returns submission URL instead of direct results:

"description": "Perform enrichment analysis. Note: Returns submission URL (web form-based analysis). Analyzes..."

9. Explain File Type Differences

For tools with multiple format options:

"file_type": "File format. Common types: 'cooler' (multi-resolution contact matrices), 'pairs' (aligned read pairs), 'hic' (Juicer format), 'mcool' (multi-resolution cooler)."

Description Structure Template

{
  "name": "Tool_operation_name",
  "type": "ToolClassName",
  "description": "[Action verb] to [purpose]. [Prerequisites if first tool]. [Key data/features]. [Required inputs if mutually exclusive]. [Note about limitations/requirements]. Use for: [use case 1], [use case 2], [use case 3].",
  "parameter": {
    "properties": {
      "param_name": {
        "type": "string",
        "description": "[What it does]. [Format/syntax if applicable]. [Options with trade-offs]. [Examples]. [Recommendation if applicable]."
      }
    }
  }
}

Description Quality Checklist

Clarity Checks

Purpose clear in first sentence
Technical terms expanded
Prerequisites stated upfront
Examples show realistic usage
"Use for:" section lists 3-5 concrete use cases

Completeness Checks

Required inputs clearly marked
Parameter choices explained
Limitations noted (file size, web form, etc.)
Available fields listed for filters
Default values recommended

Usability Checks

New users can understand without external docs
Users know what to provide
Users can make informed parameter choices
Error prevention (mutually exclusive options labeled)

Testing Description Quality

To verify description quality, ask:

Can a new user understand what the tool does?
- Read only the description (no docs)
- Should be clear within 30 seconds
Can a user provide correct inputs on first try?
- Required inputs obvious
- Format/syntax clear
- Mutually exclusive options labeled
Can a user choose appropriate parameters?
- Trade-offs explained
- Recommendations provided
- Defaults justified
Are prerequisites obvious?
- Installation instructions
- API keys/accounts
- File size warnings

Common Patterns by Tool Type

API Query Tools

"description": "Query [data type] from [source]. [Prerequisites]. Filter by [criteria]. Returns [output]. [Data scale]. Use for: [discovery], [analysis], [specific research tasks]."

Key elements:

What you're querying
How to filter
What you get back
Scale of data
Prerequisites

Data Download Tools

"description": "Download [file types] from [source]. [Format details]. [File size warning]. [Authentication requirement]. Use for: [offline analysis], [custom processing], [integration]."

Key elements:

File formats available
Size warning
Authentication needs
What's in the files

Enrichment/Analysis Tools

"description": "Analyze [input type] to find [results]. **Required: Provide ONE input type** - (1) [option], (2) [option], (3) [option]. Compares against [database/background]. [Result format]. Use for: [identifying], [discovering], [predicting]."

Key elements:

Input requirements clear
Options numbered
What gets compared
What you learn

Validation Commands

After updating descriptions, validate JSON syntax:

# Validate all tool JSONs
python3 -m json.tool src/tooluniverse/data/your_tools.json > /dev/null && echo "✓ Valid"

# Check all tools in category
for f in src/tooluniverse/data/*_tools.json; do
  python3 -m json.tool "$f" > /dev/null && echo "✓ $f valid" || echo "✗ $f invalid"
done

Example: Before and After

Before (Unclear):

{
  "name": "Tool_enrichment",
  "description": "Perform enrichment with tool to find factors.",
  "parameter": {
    "properties": {
      "bed": {"description": "BED data"},
      "motif": {"description": "Motif"},
      "genes": {"description": "Genes"},
      "threshold": {"description": "Threshold value"}
    }
  }
}

After (Clear):

{
  "name": "Tool_enrichment_analysis",
  "description": "Identify transcription factors enriched in your data. **Required: Provide ONE input type** - (1) BED genomic regions, (2) DNA sequence motif (IUPAC notation), or (3) gene symbol list. Compares against 400,000+ ChIP-seq experiments. Returns ranked proteins with enrichment scores. Note: Returns submission URL (web-based analysis). Use for: identifying regulators of regions, finding proteins bound to motifs, discovering transcription factors regulating genes.",
  "parameter": {
    "properties": {
      "bed_data": {
        "description": "**Option 1**: BED format regions (tab-separated: chr, start, end). For finding proteins bound to genomic regions. Example: 'chr1\\t1000\\t2000'."
      },
      "motif": {
        "description": "**Option 2**: DNA motif in IUPAC notation (A/T/G/C, W=A|T, S=G|C, M=A|C, K=G|T, R=A|G, Y=C|T). Example: 'CANNTG' (E-box)."
      },
      "gene_list": {
        "description": "**Option 3**: Gene symbols as array or single gene. Example: ['TP53', 'MDM2', 'CDKN1A']."
      },
      "threshold": {
        "description": "Peak stringency. '05'=1e-5 (permissive, more peaks), '10'=1e-10 (moderate), '20'=1e-20 (strict, high confidence). Default '05' suitable for most analyses."
      }
    }
  }
}

Summary

Priority order for optimization:

Critical (fix immediately):
- Clarify required inputs
- Add prerequisites
- Expand abbreviations
High (fix soon):
- Enhance filter descriptions
- Improve parameter guidance
- Number mutually exclusive options
Medium (nice to have):
- Add file size warnings
- Clarify web form vs API
- Explain file type differences

Expected impact : 50-75% reduction in user errors, 50-67% faster time to first successful use.

Weekly Installs

148

Repository

mims-harvard/to…universe

GitHub Stars

1.2K

First Seen

Feb 4, 2026

Security Audits

Gen Agent Trust HubPass SocketPass SnykPass

Installed on

codex143

opencode142

gemini-cli138

github-copilot136

amp132

kimi-cli131

React 组合模式指南：Vercel 组件架构最佳实践，提升代码可维护性

118,000 周安装

ToolUniverse工具描述优化指南 - 提升JSON配置文件清晰度与用户体验

🇨🇳中文介绍

ToolUniverse 工具描述优化

何时应用此技能

快速优化清单

关键改进项（立即修复）

1. 明确必需的输入要求

2. 为首个工具添加先决条件

相关 Skills

3. 展开关键缩写词

高优先级改进项

4. 增强过滤器参数描述

5. 改进参数指导

6. 为互斥选项编号

中优先级改进项

7. 添加文件大小警告

8. 明确 Web 表单与 API 结果

9. 解释文件类型差异

描述结构模板

描述质量检查清单

清晰度检查

完整性检查

可用性检查

测试描述质量

按工具类型划分的常见模式

API 查询工具

数据下载工具

富集/分析工具

验证命令

示例：优化前与优化后

总结

🇺🇸English

ToolUniverse Tool Description Optimization

When to Apply This Skill

Quick Optimization Checklist

Critical Improvements (Fix Immediately)

1. Clarify Required Input Requirements

2. Add Prerequisites to First Tool

3. Expand Critical Abbreviations

High-Priority Improvements

4. Enhance Filter Parameter Descriptions

5. Improve Parameter Guidance

6. Number Mutually Exclusive Options

Medium-Priority Improvements

7. Add File Size Warnings

8. Clarify Web Form vs API Results

9. Explain File Type Differences

Description Structure Template

Description Quality Checklist

Clarity Checks

Completeness Checks

Usability Checks

Testing Description Quality

Common Patterns by Tool Type

API Query Tools

Data Download Tools

Enrichment/Analysis Tools

Validation Commands

Example: Before and After

Summary

最新 Skills