tooluniverse-target-research by mims-harvard/tooluniverse
npx skills add https://github.com/mims-harvard/tooluniverse --skill tooluniverse-target-research通过探索9条并行研究路径,收集完整的靶标情报。支持通过基因符号、UniProt登录号、Ensembl ID或基因名称识别靶标。
核心原则:
get_tool_info 验证参数当用户出现以下情况时应用:
何时不使用:简单的蛋白质查找、仅限药物的查询、以疾病为中心的查询、序列检索、结构下载——请改用专门的技能。
广告位招租
在这里展示您的产品或服务
触达数万 AI 开发者,精准高效
在首次调用任何工具之前,验证其参数:
tool_info = tu.tools.get_tool_info(tool_name="Reactome_map_uniprot_to_pathways")
# 显示:使用 `id` 而不是 `uniprot_id`
| 工具 | 错误参数 | 正确参数 |
|---|---|---|
Reactome_map_uniprot_to_pathways | uniprot_id | id |
ensembl_get_xrefs | gene_id | id |
GTEx_get_median_gene_expression | 仅 gencode_id | gencode_id + operation="median" |
OpenTargets_* | ensemblID | ensemblId (驼峰式) |
STRING_get_protein_interactions | 单个 ID | protein_ids (列表), species |
intact_get_interactions | 基因符号 | identifier (UniProt登录号) |
GTEx 通常需要版本化的 Ensembl ID。如果 ENSG00000123456 返回空,请尝试使用 ensembl_lookup_gene 中的 ENSG00000123456.{version}。
不要向用户显示搜索过程或工具输出。而是:
[TARGET]_target_report.md),包含所有章节标题和 [研究中...] 占位符。模板参见 REPORT_FORMAT.md。[TARGET]_methods_appendix.md。使用 T1-T4 等级对每个声明进行证据强度分级。等级定义、必需位置和引用格式参见 EVIDENCE_GRADING.md。
Target Query (e.g., "EGFR" or "P00533")
|
+- IDENTIFIER RESOLUTION (always first)
| +- Check if GPCR -> GPCRdb_get_protein
|
+- PATH 0: Open Targets Foundation (ALWAYS FIRST - fills gaps in all other paths)
|
+- PATH 1: Core Identity (names, IDs, sequence, organism)
| +- InterProScan_scan_sequence for novel domain prediction
+- PATH 2: Structure & Domains (3D structure, domains, binding sites)
| +- If GPCR: GPCRdb_get_structures (active/inactive states)
+- PATH 3: Function & Pathways (GO terms, pathways, biological role)
+- PATH 4: Protein Interactions (PPI network, complexes)
+- PATH 5: Expression Profile (tissue expression, single-cell)
+- PATH 6: Variants & Disease (mutations, clinical significance)
| +- DisGeNET_search_gene for curated gene-disease associations
+- PATH 7: Drug Interactions (known drugs, druggability, safety)
| +- Pharos_get_target for TDL classification (Tclin/Tchem/Tbio/Tdark)
| +- BindingDB_get_ligands_by_uniprot for known ligands
| +- PubChem_search_assays_by_target_gene for HTS data
| +- If GPCR: GPCRdb_get_ligands (curated agonists/antagonists)
| +- DepMap_get_gene_dependencies for target essentiality
+- PATH 8: Literature & Research (publications, trends)
每条路径的详细代码实现,请参见 IMPLEMENTATION.md。
在任何研究路径之前解析所有标识符。必需的 ID:
解析后,通过 GPCRdb_get_protein 检查靶标是否为 GPCR。解析和 GPCR 检测代码参见 IMPLEMENTATION.md。
在专门查询之前,为第 5、6、8、9、10、11 节填充基线数据。
| 端点 | 报告章节 | 数据类型 |
|---|---|---|
OpenTargets_get_diseases_phenotypes_by_target_ensemblId | 8 | 疾病/表型 |
OpenTargets_get_target_tractability_by_ensemblId | 9 | 可成药性评估 |
OpenTargets_get_target_safety_profile_by_ensemblId | 10 | 安全性风险 |
OpenTargets_get_target_interactions_by_ensemblId | 6 | PPI 网络 |
OpenTargets_get_target_gene_ontology_by_ensemblId | 5 | GO 注释 |
OpenTargets_get_publications_by_target_ensemblId | 11 | 文献 |
OpenTargets_get_biological_mouse_models_by_ensemblId | 8/10 | 小鼠 KO 表型 |
OpenTargets_get_chemical_probes_by_target_ensemblId | 9 | 化学探针 |
OpenTargets_get_associated_drugs_by_target_ensemblId | 9 | 已知药物 |
工具:UniProt_get_entry_by_accession, UniProt_get_function_by_accession, UniProt_get_recommended_name_by_accession, UniProt_get_alternative_names_by_accession, UniProt_get_subcellular_location_by_accession, MyGene_get_gene_annotation
填充:第 2 节(标识符),第 3 节(基本信息)
使用 3 步结构搜索链(不要仅依赖 PDB 文本搜索):
工具:UniProt_get_entry_by_accession (PDB xrefs), get_protein_metadata_by_pdb_id, PDB_search_similar_structures, alphafold_get_prediction, InterPro_get_protein_domains, UniProt_get_ptm_processing_by_accession
GPCR 靶标:同时查询 GPCRdb_get_structures 以获取激活/失活状态数据。
填充:第 4 节(结构生物学)
3 步链代码参见 IMPLEMENTATION.md。
工具:GO_get_annotations_for_gene, Reactome_map_uniprot_to_pathways, kegg_get_gene_info, WikiPathways_search, enrichr_gene_enrichment_analysis
填充:第 5 节(功能与通路)
工具:STRING_get_protein_interactions, intact_get_interactions, intact_get_complex_details, BioGRID_get_interactions, HPA_get_protein_interactions_by_gene
最低要求:20 个相互作用因子 或 有记录的说明。
填充:第 6 节(蛋白质-蛋白质相互作用)
GTEx 配合版本化 ID 备用方案 + HPA 作为备份。对于全面的 HPA 数据,同时查询细胞系表达比较。
工具:GTEx_get_median_gene_expression, HPA_get_rna_expression_by_source, HPA_get_comprehensive_gene_details_by_ensembl_id, HPA_get_subcellular_location, HPA_get_cancer_prognostics_by_gene, HPA_get_comparative_expression_by_gene_and_cellline, CELLxGENE_get_expression_data
填充:第 7 节(表达谱)
GTEx 备用方案和 HPA 扩展表达代码参见 IMPLEMENTATION.md。
在 ClinVar 结果中区分 SNV 和 CNV。整合 DisGeNET 以获取经过整理的基因-疾病关联评分。
工具:gnomad_get_gene_constraints, clinvar_search_variants, OpenTargets_get_diseases_phenotypes_by_target_ensembl, DisGeNET_search_gene, civic_get_variants_by_gene, cBioPortal_get_mutations
必需:所有 4 个约束评分 (pLI, LOEUF, 错义 Z, pRec)。
填充:第 8 节(遗传变异与疾病)
全面的可成药性评估,包括 TDL 分类、结合数据、筛选数据和必要性。
工具:OpenTargets_get_target_tractability_by_ensemblID, DGIdb_get_gene_druggability, DGIdb_get_drug_gene_interactions, ChEMBL_search_targets, ChEMBL_get_target_activities, Pharos_get_target, BindingDB_get_ligands_by_uniprot, PubChem_search_assays_by_target_gene, DepMap_get_gene_dependencies, OpenTargets_get_target_safety_profile_by_ensemblID, OpenTargets_get_biological_mouse_models_by_ensemblID
GPCR 靶标:同时查询 GPCRdb_get_ligands。
填充:第 9 节(可成药性),第 10 节(安全性),第 12 节(竞争格局)
| 来源 | 提供内容 |
|---|---|
| Pharos TDL | Tclin/Tchem/Tbio/Tdark 分类 |
| BindingDB | 实验性 Ki/IC50/Kd 值 |
| PubChem BioAssay | HTS 筛选命中点和剂量反应 |
| DepMap | 跨癌细胞系的 CRISPR 必要性 |
| ChEMBL | 生物活性记录和化合物数量 |
详细代码参见 IMPLEMENTATION.md,完整工具参数表参见 REFERENCE.md。
工具:PubMed_search_articles, PubMed_get_related, EuropePMC_search_articles, EuropePMC_get_citations, PubTator3_LiteratureSearch, OpenTargets_get_publications_by_target_ensemblID
填充:第 11 节(文献与研究格局)
注意命名冲突的搜索代码参见 IMPLEMENTATION.md。
| 主要工具 | 备用方案 1 | 备用方案 2 |
|---|---|---|
ChEMBL_get_target_activities | GtoPdb_get_target_ligands | OpenTargets drugs |
intact_get_interactions | STRING_get_protein_interactions | OpenTargets interactions |
GO_get_annotations_for_gene | OpenTargets GO | MyGene GO |
GTEx_get_median_gene_expression | HPA_get_rna_expression | 记录为不可用 |
gnomad_get_gene_constraints | OpenTargets constraint | - |
DGIdb_get_drug_gene_interactions | OpenTargets drugs | GtoPdb |
切勿静默跳过失败的工具。 始终在报告中记录失败和备用方案。
在最终确定任何报告之前,运行 EVIDENCE_GRADING.md 中的检查清单:
创建 [TARGET]_target_report.md,并初始化所有 15 个部分。完整模板(包含章节标题、表格格式和完整性检查清单)参见 REPORT_FORMAT.md。
初始文件结构:
## 1. Executive Summary ## 9. Druggability & Pharmacology
## 2. Target Identifiers ## 10. Safety Profile
## 3. Basic Information ## 11. Literature & Research
## 4. Structural Biology ## 12. Competitive Landscape
## 5. Function & Pathways ## 13. Summary & Recommendations
## 6. Protein-Protein Interactions ## 14. Data Sources & Methodology
## 7. Expression Profile ## 15. Data Gaps & Limitations
## 8. Genetic Variation & Disease
| 工具 | 参数 | 备注 |
|---|---|---|
Reactome_map_uniprot_to_pathways | id | 不是 uniprot_id |
ensembl_get_xrefs | id | 不是 gene_id |
GTEx_get_median_gene_expression | gencode_id, operation | 如果为空,尝试版本化 ID |
OpenTargets_* | ensemblId | 驼峰式,不是 ensemblID |
STRING_get_protein_interactions | protein_ids, species | ID 的列表格式 |
intact_get_interactions | identifier | UniProt 登录号 |
| 文件 | 内容 |
|---|---|
| IMPLEMENTATION.md | 标识符解析、GPCR 检测、每条路径实现、重试逻辑的详细代码 |
| EVIDENCE_GRADING.md | T1-T4 等级定义、引用格式、完整性审核检查清单、数据最低要求 |
| REPORT_FORMAT.md | 包含所有 15 个部分、表格格式、特定章节指导的完整报告模板 |
| REFERENCE.md | 按类别组织的完整工具参考(225+ 工具)及参数 |
| EXAMPLES.md | 工作示例:EGFR 完整分析、KRAS 可成药性、靶标比较、CDK4 验证、阿尔茨海默病靶标 |
每周安装次数
161
代码库
GitHub 星标数
1.2K
首次出现
2026年2月4日
安全审计
安装于
codex151
opencode150
gemini-cli147
github-copilot144
amp139
kimi-cli138
Gather complete target intelligence by exploring 9 parallel research paths. Supports targets identified by gene symbol, UniProt accession, Ensembl ID, or gene name.
KEY PRINCIPLES :
get_tool_info before calling unfamiliar toolsApply when users:
When NOT to use : Simple protein lookup, drug-only queries, disease-centric queries, sequence retrieval, structure download -- use specialized skills instead.
BEFORE calling ANY tool for the first time , verify its parameters:
tool_info = tu.tools.get_tool_info(tool_name="Reactome_map_uniprot_to_pathways")
# Reveals: takes `id` not `uniprot_id`
| Tool | WRONG Parameter | CORRECT Parameter |
|---|---|---|
Reactome_map_uniprot_to_pathways | uniprot_id | id |
ensembl_get_xrefs | gene_id | id |
GTEx_get_median_gene_expression | gencode_id only |
GTEx often requires versioned Ensembl IDs. If ENSG00000123456 returns empty, try ENSG00000123456.{version} from ensembl_lookup_gene.
DO NOT show the search process or tool outputs to the user. Instead:
[TARGET]_target_report.md) with all section headers and [Researching...] placeholders. See REPORT_FORMAT.md for template.[TARGET]_methods_appendix.md if requested.Grade every claim by evidence strength using T1-T4 tiers. See EVIDENCE_GRADING.md for tier definitions, required locations, and citation format.
Target Query (e.g., "EGFR" or "P00533")
|
+- IDENTIFIER RESOLUTION (always first)
| +- Check if GPCR -> GPCRdb_get_protein
|
+- PATH 0: Open Targets Foundation (ALWAYS FIRST - fills gaps in all other paths)
|
+- PATH 1: Core Identity (names, IDs, sequence, organism)
| +- InterProScan_scan_sequence for novel domain prediction
+- PATH 2: Structure & Domains (3D structure, domains, binding sites)
| +- If GPCR: GPCRdb_get_structures (active/inactive states)
+- PATH 3: Function & Pathways (GO terms, pathways, biological role)
+- PATH 4: Protein Interactions (PPI network, complexes)
+- PATH 5: Expression Profile (tissue expression, single-cell)
+- PATH 6: Variants & Disease (mutations, clinical significance)
| +- DisGeNET_search_gene for curated gene-disease associations
+- PATH 7: Drug Interactions (known drugs, druggability, safety)
| +- Pharos_get_target for TDL classification (Tclin/Tchem/Tbio/Tdark)
| +- BindingDB_get_ligands_by_uniprot for known ligands
| +- PubChem_search_assays_by_target_gene for HTS data
| +- If GPCR: GPCRdb_get_ligands (curated agonists/antagonists)
| +- DepMap_get_gene_dependencies for target essentiality
+- PATH 8: Literature & Research (publications, trends)
For detailed code implementations of each path, see IMPLEMENTATION.md.
Resolve ALL identifiers before any research path. Required IDs:
After resolution, check if target is a GPCR via GPCRdb_get_protein. See IMPLEMENTATION.md for resolution and GPCR detection code.
Populates baseline data for Sections 5, 6, 8, 9, 10, 11 before specialized queries.
| Endpoint | Report Section | Data Type |
|---|---|---|
OpenTargets_get_diseases_phenotypes_by_target_ensemblId | 8 | Diseases/phenotypes |
OpenTargets_get_target_tractability_by_ensemblId | 9 | Druggability assessment |
OpenTargets_get_target_safety_profile_by_ensemblId | 10 | Safety liabilities |
OpenTargets_get_target_interactions_by_ensemblId | 6 | PPI network |
OpenTargets_get_target_gene_ontology_by_ensemblId |
Tools : UniProt_get_entry_by_accession, UniProt_get_function_by_accession, UniProt_get_recommended_name_by_accession, UniProt_get_alternative_names_by_accession, UniProt_get_subcellular_location_by_accession, MyGene_get_gene_annotation
Populates : Sections 2 (Identifiers), 3 (Basic Information)
Use 3-step structure search chain (do NOT rely solely on PDB text search):
Tools : UniProt_get_entry_by_accession (PDB xrefs), get_protein_metadata_by_pdb_id, PDB_search_similar_structures, alphafold_get_prediction, InterPro_get_protein_domains, UniProt_get_ptm_processing_by_accession
GPCR targets : Also query GPCRdb_get_structures for active/inactive state data.
Populates : Section 4 (Structural Biology)
See IMPLEMENTATION.md for the 3-step chain code.
Tools : GO_get_annotations_for_gene, Reactome_map_uniprot_to_pathways, kegg_get_gene_info, WikiPathways_search, enrichr_gene_enrichment_analysis
Populates : Section 5 (Function & Pathways)
Tools : STRING_get_protein_interactions, intact_get_interactions, intact_get_complex_details, BioGRID_get_interactions, HPA_get_protein_interactions_by_gene
Minimum : 20 interactors OR documented explanation.
Populates : Section 6 (Protein-Protein Interactions)
GTEx with versioned ID fallback + HPA as backup. For comprehensive HPA data, also query cell line expression comparison.
Tools : GTEx_get_median_gene_expression, HPA_get_rna_expression_by_source, HPA_get_comprehensive_gene_details_by_ensembl_id, HPA_get_subcellular_location, HPA_get_cancer_prognostics_by_gene, HPA_get_comparative_expression_by_gene_and_cellline, CELLxGENE_get_expression_data
Populates : Section 7 (Expression Profile)
See IMPLEMENTATION.md for GTEx fallback and HPA extended expression code.
Separate SNVs from CNVs in ClinVar results. Integrate DisGeNET for curated gene-disease association scores.
Tools : gnomad_get_gene_constraints, clinvar_search_variants, OpenTargets_get_diseases_phenotypes_by_target_ensembl, DisGeNET_search_gene, civic_get_variants_by_gene, cBioPortal_get_mutations
Required : All 4 constraint scores (pLI, LOEUF, missense Z, pRec).
Populates : Section 8 (Genetic Variation & Disease)
Comprehensive druggability assessment including TDL classification, binding data, screening data, and essentiality.
Tools : OpenTargets_get_target_tractability_by_ensemblID, DGIdb_get_gene_druggability, DGIdb_get_drug_gene_interactions, ChEMBL_search_targets, ChEMBL_get_target_activities, Pharos_get_target, BindingDB_get_ligands_by_uniprot, PubChem_search_assays_by_target_gene, DepMap_get_gene_dependencies, OpenTargets_get_target_safety_profile_by_ensemblID,
GPCR targets : Also query GPCRdb_get_ligands.
Populates : Sections 9 (Druggability), 10 (Safety), 12 (Competitive Landscape)
| Source | What It Provides |
|---|---|
| Pharos TDL | Tclin/Tchem/Tbio/Tdark classification |
| BindingDB | Experimental Ki/IC50/Kd values |
| PubChem BioAssay | HTS screening hits and dose-response |
| DepMap | CRISPR essentiality across cancer cell lines |
| ChEMBL | Bioactivity records and compound counts |
See IMPLEMENTATION.md for detailed code and REFERENCE.md for complete tool parameter tables.
Tools : PubMed_search_articles, PubMed_get_related, EuropePMC_search_articles, EuropePMC_get_citations, PubTator3_LiteratureSearch, OpenTargets_get_publications_by_target_ensemblID
Populates : Section 11 (Literature & Research Landscape)
See IMPLEMENTATION.md for collision-aware search code.
| Primary Tool | Fallback 1 | Fallback 2 |
|---|---|---|
ChEMBL_get_target_activities | GtoPdb_get_target_ligands | OpenTargets drugs |
intact_get_interactions | STRING_get_protein_interactions | OpenTargets interactions |
GO_get_annotations_for_gene |
NEVER silently skip failed tools. Always document failures and fallbacks in the report.
Run the checklist in EVIDENCE_GRADING.md before finalizing any report:
Create [TARGET]_target_report.md with all 15 sections initialized. See REPORT_FORMAT.md for the full template with section headers, table formats, and completeness checklist.
Initial file structure:
## 1. Executive Summary ## 9. Druggability & Pharmacology
## 2. Target Identifiers ## 10. Safety Profile
## 3. Basic Information ## 11. Literature & Research
## 4. Structural Biology ## 12. Competitive Landscape
## 5. Function & Pathways ## 13. Summary & Recommendations
## 6. Protein-Protein Interactions ## 14. Data Sources & Methodology
## 7. Expression Profile ## 15. Data Gaps & Limitations
## 8. Genetic Variation & Disease
| Tool | Parameter | Notes |
|---|---|---|
Reactome_map_uniprot_to_pathways | id | NOT uniprot_id |
ensembl_get_xrefs | id | NOT gene_id |
GTEx_get_median_gene_expression | gencode_id, |
| File | Contents |
|---|---|
| IMPLEMENTATION.md | Detailed code for identifier resolution, GPCR detection, each PATH implementation, retry logic |
| EVIDENCE_GRADING.md | T1-T4 tier definitions, citation format, completeness audit checklist, data minimums |
| REPORT_FORMAT.md | Full report template with all 15 sections, table formats, section-specific guidance |
| REFERENCE.md | Complete tool reference (225+ tools) organized by category with parameters |
| EXAMPLES.md | Worked examples: EGFR full profile, KRAS druggability, target comparison, CDK4 validation, Alzheimer's targets |
Weekly Installs
161
Repository
GitHub Stars
1.2K
First Seen
Feb 4, 2026
Security Audits
Gen Agent Trust HubPassSocketPassSnykWarn
Installed on
codex151
opencode150
gemini-cli147
github-copilot144
amp139
kimi-cli138
Excel财务建模规范与xlsx文件处理指南:专业格式、零错误公式与数据分析
43,800 周安装
gencode_id + operation="median" |
OpenTargets_* | ensemblID | ensemblId (camelCase) |
STRING_get_protein_interactions | single ID | protein_ids (list), species |
intact_get_interactions | gene symbol | identifier (UniProt accession) |
| 5 |
| GO annotations |
OpenTargets_get_publications_by_target_ensemblId | 11 | Literature |
OpenTargets_get_biological_mouse_models_by_ensemblId | 8/10 | Mouse KO phenotypes |
OpenTargets_get_chemical_probes_by_target_ensemblId | 9 | Chemical probes |
OpenTargets_get_associated_drugs_by_target_ensemblId | 9 | Known drugs |
OpenTargets_get_biological_mouse_models_by_ensemblIDOpenTargets GO |
MyGene GO |
GTEx_get_median_gene_expression | HPA_get_rna_expression | Document as unavailable |
gnomad_get_gene_constraints | OpenTargets constraint | - |
DGIdb_get_drug_gene_interactions | OpenTargets drugs | GtoPdb |
operation| Try versioned ID if empty |
OpenTargets_* | ensemblId | camelCase, not ensemblID |
STRING_get_protein_interactions | protein_ids, species | List format for IDs |
intact_get_interactions | identifier | UniProt accession |